Python BeautifulSoup can't find a string

I am trying to get the username from an Instagram page. I need to use part of the data I get after data = soup.find_all('script')[3], which looks like this:

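(script type="text/javascript">window._sharedData = {"config":{"csrf_token":"hIuZDxW17bTXz5EDLY25ftqivOOrLEeZ","viewer":null,"viewerId":null},"supports_es6":false,"country_code":"RU","language_code":"en","locale":"en_US","entry_data":{"PostPage":[{"graphql":{"shortcode_media":{"__typename":"GraphImage","id":"1968747493659350883","shortcode":"BtSZWokAZdj","dimensions":{"height":640,"width":640},"gating_info":null,"media_preview":"ACoq5miitSxxIGTHPXPGcd8ZFAGXRXSSWypFsAAZ/lzjpn/Csm5sjAu7Ib8MUAUaKU0lABVq0lMUqsPUA/Q8VVpynBB9CKAOtuOFB9CD+uP5Gq19HuiOPTP5Ul1exhdgy7kdF7fU/wCGatJiRPqv5ZFIZybnP4UynOpUlT1HFNpiClDFeRSUUATLcSJ904+lPF5MvR2H41WooAc7lzuY5J702iigD//Z","display_url":"https://instagram.fhel3-1.fna.fbcdn.net/vp/68311f4b09669fd75609e9fcabbf1ae0/5D0517DE/t51.2885-15/e35/49907137_294327238101721_6745007497573009307_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","display_resources":[{"src":"https://instagram.fhel3-1.fna.fbcdn.net/vp/68311f4b09669fd75609e9fcabbf1ae0/5D0517DE/t51.2885-15/e35/49907137_294327238101721_6745007497573009307_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","config_width":640,"config_height":640},{"src":"https://instagram.fhel3-1.fna.fbcdn.net/vp/68311f4b09669fd75609e9fcabbf1ae0/5D0517DE/t51.2885-15/e35/49907137_294327238101721_6745007497573009307_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","config_width":750,"config_height":750},{"src":"https://instagram.fhel3-1.fna.fbcdn.net/vp/68311f4b09669fd75609e9fcabbf1ae0/5D0517DE/t51.2885-15/e35/49907137_294327238101721_6745007497573009307_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","config_width":1080,"config_height":1080}],"accessibility_caption":"Image may contain: one or more people and closeup","is_video":false,"should_log_client_event":false,"tracking_token":"eyJ2ZXJzaW9uIjo1LCJwYXlsb2FkIjp7ImlzX2FuYWx5dGljc190cmFja2VkIjp0cnVlLCJ1dWlkIjoiN2Q1Yjg2NmY5OGIwNDVhNWIxMmRhNjEwZTA3NDY1MmYxOTY4NzQ3NDkzNjU5MzUwODgzIn0sInNpZ25hdHVyZSI6IiJ9","edge_media_to_tagged_user":{"edges":[]},"edge_media_to_caption":{"edges":[{"node":{"text":"\u2022\nScars show your story. \nYour pain. \nYour hate.\nYour sadness and despair. \nThey make you who you are, and one of a kind with every different mark. \nSome stay, some go.\nSome brighter, some lighter.\nSome bigger, some smaller.\nSome deeper, some one the surface. \nBut they are really all the same, you see?\nThey are all scars, just telling different points of our life, our story. \nOur souvenir throughout our whole life, that shows us how much we've grown. \nHow much we have overcome. How strong we've become.\nHow brave and courageous we've become from the hardest and darkest times of our life. \u2022\n\u2022\n\u2022\n\u2022\n#poem #cuts #selfharm #tatoo #dark #pain #sad #lonely #anxiety #depressed"}}]},"caption_is_edited":true,"has_ranked_comments":false,"edge_media_to_comment":{"count":1,"page_info":{"has_next_page":false,"end_cursor":null},"edges":[]},"comments_disabled":false,"taken_at_timestamp":1548913011,"edge_media_preview_like":{"count":17,"edges":[]},"edge_media_to_sponsor_user":{"edges":[]},"location":null,"viewer_has_liked":false,"viewer_has_saved":false,"viewer_has_saved_to_collection":false,"viewer_in_photo_of_you":false,"viewer_can_reshare":true,"owner":{"id":"10173498181","is_verified":false,"profile_pic_url":"https://instagram.fhel3-1.fna.fbcdn.net/vp/9a17134e8d0a36efec53f1da5cac1f38/5D14BC0F/t51.2885-19/s150x150/47690762_475199173011446_4764198224049209344_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","username":"devils..tea.","blocked_by_viewer":false,"followed_by_viewer":false,"full_name":"depressed\ud83e\udd40","has_blocked_viewer":false,"is_private":false,"is_unpublished":false,"requested_by_viewer":false}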

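There is a "username" part (at the end of the blockquote). I assumed it was a string, but I can't grab it. So if it is not a string, what is it? A class? Which method should I use to retrieve the username from "username":"devils..tea."? Thanks in advance for any help.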

....
req = requests.get(url)
soup = BeautifulSoup(req.text, "lxml")
data = soup.find_all('script')[3]
username = data.find_all_next(string="username")
print(username)

You can use a regular expression:

import re
data = '''
(script type="text/javascript">window._sharedData = {"config":{"csrf_token":"hIuZDxW17bTXz5EDLY25ftqivOOrLEeZ","viewer":null,"viewerId":null},"supports_es6":false,"country_code":"RU","language_code":"en","locale":"en_US","entry_data":{"PostPage":[{"graphql":{"shortcode_media":{"__typename":"GraphImage","id":"1968747493659350883","shortcode":"BtSZWokAZdj","dimensions":{"height":640,"width":640},"gating_info":null,"media_preview":"ACoq5miitSxxIGTHPXPGcd8ZFAGXRXSSWypFsAAZ/lzjpn/Csm5sjAu7Ib8MUAUaKU0lABVq0lMUqsPUA/Q8VVpynBB9CKAOtuOFB9CD+uP5Gq19HuiOPTP5Ul1exhdgy7kdF7fU/wCGatJiRPqv5ZFIZybnP4UynOpUlT1HFNpiClDFeRSUUATLcSJ904+lPF5MvR2H41WooAc7lzuY5J702iigD//Z","display_url":"https://instagram.fhel3-1.fna.fbcdn.net/vp/68311f4b09669fd75609e9fcabbf1ae0/5D0517DE/t51.2885-15/e35/49907137_294327238101721_6745007497573009307_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","display_resources":[{"src":"https://instagram.fhel3-1.fna.fbcdn.net/vp/68311f4b09669fd75609e9fcabbf1ae0/5D0517DE/t51.2885-15/e35/49907137_294327238101721_6745007497573009307_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","config_width":640,"config_height":640},{"src":"https://instagram.fhel3-1.fna.fbcdn.net/vp/68311f4b09669fd75609e9fcabbf1ae0/5D0517DE/t51.2885-15/e35/49907137_294327238101721_6745007497573009307_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","config_width":750,"config_height":750},{"src":"https://instagram.fhel3-1.fna.fbcdn.net/vp/68311f4b09669fd75609e9fcabbf1ae0/5D0517DE/t51.2885-15/e35/49907137_294327238101721_6745007497573009307_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","config_width":1080,"config_height":1080}],"accessibility_caption":"Image may contain: one or more people and 
closeup","is_video":false,"should_log_client_event":false,"tracking_token":"eyJ2ZXJzaW9uIjo1LCJwYXlsb2FkIjp7ImlzX2FuYWx5dGljc190cmFja2VkIjp0cnVlLCJ1dWlkIjoiN2Q1Yjg2NmY5OGIwNDVhNWIxMmRhNjEwZTA3NDY1MmYxOTY4NzQ3NDkzNjU5MzUwODgzIn0sInNpZ25hdHVyZSI6IiJ9","edge_media_to_tagged_user":{"edges":[]},"edge_media_to_caption":{"edges":[{"node":{"text":"\u2022\nScars show your story. \nYour pain. \nYour hate.\nYour sadness and despair. \nThey make you who you are, and one of a kind with every different mark. \nSome stay, some go.\nSome brighter, some lighter.\nSome bigger, some smaller.\nSome deeper, some one the surface. \nBut they are really all the same, you see?\nThey are all scars, just telling different points of our life, our story. \nOur souvenir throughout our whole life, that shows us how much we've grown. \nHow much we have overcome. How strong we've become.\nHow brave and courageous we've become from the hardest and darkest times of our life. \u2022\n\u2022\n\u2022\n\u2022\n#poem #cuts #selfharm #tatoo #dark #pain #sad #lonely #anxiety 
#depressed"}}]},"caption_is_edited":true,"has_ranked_comments":false,"edge_media_to_comment":{"count":1,"page_info":{"has_next_page":false,"end_cursor":null},"edges":[]},"comments_disabled":false,"taken_at_timestamp":1548913011,"edge_media_preview_like":{"count":17,"edges":[]},"edge_media_to_sponsor_user":{"edges":[]},"location":null,"viewer_has_liked":false,"viewer_has_saved":false,"viewer_has_saved_to_collection":false,"viewer_in_photo_of_you":false,"viewer_can_reshare":true,"owner":{"id":"10173498181","is_verified":false,"profile_pic_url":"https://instagram.fhel3-1.fna.fbcdn.net/vp/9a17134e8d0a36efec53f1da5cac1f38/5D14BC0F/t51.2885-19/s150x150/47690762_475199173011446_4764198224049209344_n.jpg?_nc_ht=instagram.fhel3-1.fna.fbcdn.net","username":"devils..tea.","blocked_by_viewer":false,"followed_by_viewer":false,"full_name":"depressed\ud83e\udd40","has_blocked_viewer":false,"is_private":false,"is_unpublished":false,"requested_by_viewer":false}......
'''

r = re.compile(r'username":"(.*)(?=","blocked)')
print(r.findall(data))
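
As for why the original attempt comes back empty: string="username" only matches a text node whose entire content is exactly username, whereas here the word sits inside one long text node holding the whole script. A minimal sketch of a slightly tighter pattern follows. It assumes the script text is already available as a str (for example via data.string, which is an assumption about your soup object) and that usernames never contain a double quote. The names extract_username and sample are hypothetical and only for illustration.

```python
import re

# Capture everything between the quotes that follow "username":
# [^"]* stops at the first closing quote, so the match cannot run on
# into later keys the way a greedy .* could.
USERNAME_RE = re.compile(r'"username":"([^"]*)"')


def extract_username(script_text):
    """Return the first username found in script_text, or None."""
    match = USERNAME_RE.search(script_text)
    return match.group(1) if match else None


# Shortened stand-in for the script text shown in the question.
sample = ('"owner":{"id":"10173498181","is_verified":false,'
          '"username":"devils..tea.","blocked_by_viewer":false}')

print(extract_username(sample))
```

Because the pattern does not depend on the key that happens to follow "username", it should keep working if the order of the keys changes.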

Alternatively, for those of us who don't like regex (nudge, nudge @QHarr :D), you can try the following:

data = [your quote above]  # placeholder: the script text from the question, as a str
data_list = data.split(",")
for i in data_list:
    if 'username' in i:
        print(i)

Output:

"username":"devils..tea."
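
Neither answer above uses it, but since the text after window._sharedData = is JSON, another option is to parse it and walk the keys. This is only a sketch under assumptions: the script text is complete (the excerpt in the question is truncated, so it would not parse as is), it may end with a semicolon, and the key path matches the one visible in the quoted data. The function name username_from_shared_data and the shortened sample are hypothetical.

```python
import json


def username_from_shared_data(script_text):
    """Parse the object assigned to window._sharedData and return the owner's username."""
    # Keep only what follows the first "=", then drop surrounding
    # whitespace and a trailing semicolon if there is one.
    _, _, payload = script_text.partition("=")
    payload = payload.strip().rstrip(";")
    shared = json.loads(payload)
    # Key path taken from the data quoted in the question.
    media = shared["entry_data"]["PostPage"][0]["graphql"]["shortcode_media"]
    return media["owner"]["username"]


# Shortened, complete stand-in for the real script text.
sample = ('window._sharedData = {"entry_data":{"PostPage":[{"graphql":'
          '{"shortcode_media":{"owner":{"id":"10173498181",'
          '"username":"devils..tea."}}}}]}};')

print(username_from_shared_data(sample))
```

Parsing gives a KeyError or a json.JSONDecodeError when the page layout changes, which is usually easier to notice than a pattern that silently stops matching.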