在python中删除html标记
所以我正试图从ESPN那里为一场NBA比赛争取一分。我试图先得到名字,但我很难摆脱html标签 我试过使用在python中删除html标记,python,html,Python,Html,所以我正试图从ESPN那里为一场NBA比赛争取一分。我试图先得到名字,但我很难摆脱html标签 我试过使用 get_text(), .text(), .string_strip() 但他们总是给我错误 这是我现在正在使用的代码 from bs4 import BeautifulSoup import requests url= "http://scores.espn.com/nba/boxscore?gameId=400900407" r = requests.get(url) soup =
get_text(), .text(), .string_strip()
但他们总是给我错误
这是我现在正在使用的代码
from bs4 import BeautifulSoup
import requests
url= "http://scores.espn.com/nba/boxscore?gameId=400900407"
r = requests.get(url)
soup = BeautifulSoup(r.text,"html.parser")
name = []
for row in soup.find_all('tr')[1:]:
player_name = row.find('td', attrs={'class': 'name'})
name.append(player_name)
print(name)
使用player_name.text应该可以,但问题是有时row.find'td',attrs={'class':'name'}是空的。试着这样做:
if player_name:
name.append(player_name.text)
使用player_name.text应该可以,但问题是有时row.find'td',attrs={'class':'name'}是空的。试着这样做:
if player_name:
name.append(player_name.text)
我这样解决这个问题:
from bs4 import BeautifulSoup
import requests
url= "http://scores.espn.com/nba/boxscore?gameId=400900407"
r = requests.get(url)
soup = BeautifulSoup(r.text,"html.parser")
name = []
for row in soup.find_all('tr')[1:]:
try:
player_name = row.select('td.name span')[0].text
name.append(player_name)
except:
pass
print(name)
我这样解决这个问题:
from bs4 import BeautifulSoup
import requests
url= "http://scores.espn.com/nba/boxscore?gameId=400900407"
r = requests.get(url)
soup = BeautifulSoup(r.text,"html.parser")
name = []
for row in soup.find_all('tr')[1:]:
try:
player_name = row.select('td.name span')[0].text
name.append(player_name)
except:
pass
print(name)
我的代码供您参考
import requests
from pyquery import PyQuery as pyq
url= "http://scores.espn.com/nba/boxscore?gameId=400900407"
r = requests.get(url)
doc = pyq(r.content)
print([h.text() for h in doc('.abbr').items()])
我的代码供您参考
import requests
from pyquery import PyQuery as pyq
url= "http://scores.espn.com/nba/boxscore?gameId=400900407"
r = requests.get(url)
doc = pyq(r.content)
print([h.text() for h in doc('.abbr').items()])
你说错话。什么错误?你说错误。什么错误?