Python 从类中获取字符串的一部分
HTML是:Python 从类中获取字符串的一部分,python,selenium,beautifulsoup,Python,Selenium,Beautifulsoup,HTML是: <div class="_3u1 _gli _uvb" data-bt='{"id":xxxx,"rank":11,"abtest_version":null,"abtest_params":{"abtest_version":null,"origin":"A","ranker":null},"section":"main_column","owner_id":null,"sub_id":null,"browse_location":null,"query_data":[],
<div class="_3u1 _gli _uvb" data-bt='{"id":xxxx,"rank":11,"abtest_version":null,"abtest_params":{"abtest_version":null,"origin":"A","ranker":null},"section":"main_column","owner_id":null,"sub_id":null,"browse_location":null,"query_data":[],"is_headline":false}'>
我得到的回报是:
{"id":xxxxxx,"rank":1,"abtest_version":null,"abtest_params":{"abtest_version":null,"origin":"A","ranker":null},"section":"main_column","owner_id":null,"sub_id":null,"browse_location":null,"query_data":[],"is_headline":false}
我怎么才能把
xxxxx
退回呢?提前感谢。您可以解析数据bt
,因为它包含有效的json
import json
found = soup.find_all("div", "_3u1 _gli _uvb")
for fbid in found:
...
bt_json = json.loads(fbid.attrs['data-bt'])
print(bt_json['id'])
...
回答得好!感谢您使用json而不是将其解析为字符串。
import json
found = soup.find_all("div", "_3u1 _gli _uvb")
for fbid in found:
...
bt_json = json.loads(fbid.attrs['data-bt'])
print(bt_json['id'])
...