Python 美化组或请求不阅读网页的某些部分
我不熟悉网页抓取,从网页获取数据时遇到一些问题 我正在尝试阅读此网页: 并试图通过类wstext的div元素获取风速数据,但出于某种原因,请求库通过internet获取的页面不包含该特定类及其一些祖先Python 美化组或请求不阅读网页的某些部分,python,web-scraping,beautifulsoup,python-requests,Python,Web Scraping,Beautifulsoup,Python Requests,我不熟悉网页抓取,从网页获取数据时遇到一些问题 我正在尝试阅读此网页: 并试图通过类wstext的div元素获取风速数据,但出于某种原因,请求库通过internet获取的页面不包含该特定类及其一些祖先 import requests import bs4 as bs import numpy as np wind = np.random.rand(120) dailyWindRecord = np.random.rand(30,4) html = requests.get('https://
import requests
import bs4 as bs
import numpy as np
wind = np.random.rand(120)
dailyWindRecord = np.random.rand(30,4)
html = requests.get('https://www.timeanddate.com/weather/pakistan/lahore/historic?month=7&year=2018')
print(html.text)
soup = bs.BeautifulSoup(html.content, 'html5lib')
print(soup.prettify)
windList = soup.findAll('div')
print(windList)
我曾尝试直接打印读取的html数据请求,并在通过beautifulsoup解析后打印,以查看html数据是否包含该类,但我找不到任何内容。任何帮助都将不胜感激。熊猫可以为您完成工作,而不是使用bs4或请求:
import numpy as np
import pandas as pd
wind = np.random.rand(120)
dailyWindRecord = np.random.rand(30,4)
url = 'https://www.timeanddate.com/weather/pakistan/lahore/historic?month=7&year=2018'
tables = pd.read_html(url)
table = tables[1]
print (table.iloc[:,4])
输出:
print (table.iloc[:,4])
0 3 mph
1 No wind
2 No wind
3 No wind
4 No wind
5 No wind
6 No wind
7 3 mph
8 5 mph
9 6 mph
10 5 mph
11 5 mph
12 6 mph
13 5 mph
14 No wind
15 3 mph
16 No wind
17 No wind
18 No wind
19 No wind
20 5 mph
21 No wind
22 6 mph
23 6 mph
24 5 mph
25 6 mph
26 7 mph
27 7 mph
28 7 mph
29 3 mph
30 3 mph
31 3 mph
32 3 mph
33 No wind
34 3 mph
35 3 mph
36 No wind
37 No wind
38 NaN
Name: (Unnamed: 4_level_0, Wind), dtype: object
Date: Sunday, 1 July 2018, 00:00 — 06:00 Wind: 0.621
Date: Sunday, 1 July 2018, 06:00 — 12:00 Wind: 3.728
Date: Sunday, 1 July 2018, 12:00 — 18:00 Wind: 3.107
Date: Sunday, 1 July 2018, 18:00 — 00:00 Wind: 3.107
Date: Monday, 2 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Monday, 2 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Monday, 2 July 2018, 12:00 — 18:00 Wind: 8.7
Date: Monday, 2 July 2018, 18:00 — 00:00 Wind: 9.943
Date: Tuesday, 3 July 2018, 00:00 — 06:00 Wind: 10.564
Date: Tuesday, 3 July 2018, 06:00 — 12:00 Wind: 11.185
Date: Tuesday, 3 July 2018, 12:00 — 18:00 Wind: 9.943
Date: Tuesday, 3 July 2018, 18:00 — 00:00 Wind: 6.214
Date: Wednesday, 4 July 2018, 00:00 — 06:00 Wind: 6.836
Date: Wednesday, 4 July 2018, 06:00 — 12:00 Wind: 4.971
Date: Wednesday, 4 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Wednesday, 4 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Thursday, 5 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Thursday, 5 July 2018, 06:00 — 12:00 Wind: 1.864
Date: Thursday, 5 July 2018, 12:00 — 18:00 Wind: 3.107
Date: Thursday, 5 July 2018, 18:00 — 00:00 Wind: 3.107
Date: Friday, 6 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Friday, 6 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Friday, 6 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Friday, 6 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Saturday, 7 July 2018, 00:00 — 06:00 Wind: 1.243
Date: Saturday, 7 July 2018, 06:00 — 12:00 Wind: 2.486
Date: Saturday, 7 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Saturday, 7 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Sunday, 8 July 2018, 00:00 — 06:00 Wind: 3.107
Date: Sunday, 8 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Sunday, 8 July 2018, 12:00 — 18:00 Wind: 5.593
Date: Sunday, 8 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Monday, 9 July 2018, 00:00 — 06:00 Wind: 5.593
Date: Monday, 9 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Monday, 9 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Monday, 9 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Tuesday, 10 July 2018, 00:00 — 06:00 Wind: 6.836
Date: Tuesday, 10 July 2018, 06:00 — 12:00 Wind: 8.078
Date: Tuesday, 10 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Tuesday, 10 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Wednesday, 11 July 2018, 00:00 — 06:00 Wind: 6.214
Date: Wednesday, 11 July 2018, 06:00 — 12:00 Wind: 12.428
Date: Wednesday, 11 July 2018, 12:00 — 18:00 Wind: 8.078
Date: Wednesday, 11 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Thursday, 12 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Thursday, 12 July 2018, 06:00 — 12:00 Wind: 8.078
Date: Thursday, 12 July 2018, 12:00 — 18:00 Wind: 7.457
Date: Thursday, 12 July 2018, 18:00 — 00:00 Wind: 6.214
Date: Friday, 13 July 2018, 00:00 — 06:00 Wind: 5.593
Date: Friday, 13 July 2018, 06:00 — 12:00 Wind: 11.807
Date: Friday, 13 July 2018, 12:00 — 18:00 Wind: 9.321
Date: Friday, 13 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Saturday, 14 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Saturday, 14 July 2018, 06:00 — 12:00 Wind: 4.971
Date: Saturday, 14 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Saturday, 14 July 2018, 18:00 — 00:00 Wind: 6.214
Date: Sunday, 15 July 2018, 00:00 — 06:00 Wind: 8.7
Date: Sunday, 15 July 2018, 06:00 — 12:00 Wind: 8.7
Date: Sunday, 15 July 2018, 12:00 — 18:00 Wind: 8.7
Date: Sunday, 15 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Monday, 16 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Monday, 16 July 2018, 06:00 — 12:00 Wind: 11.185
Date: Monday, 16 July 2018, 12:00 — 18:00 Wind: 11.185
Date: Monday, 16 July 2018, 18:00 — 00:00 Wind: 8.7
Date: Tuesday, 17 July 2018, 00:00 — 06:00 Wind: 7.457
Date: Tuesday, 17 July 2018, 06:00 — 12:00 Wind: 8.078
Date: Tuesday, 17 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Tuesday, 17 July 2018, 18:00 — 00:00 Wind: 4.971
Date: Wednesday, 18 July 2018, 00:00 — 06:00 Wind: 3.728
Date: Wednesday, 18 July 2018, 06:00 — 12:00 Wind: 2.486
Date: Wednesday, 18 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Wednesday, 18 July 2018, 18:00 — 00:00 Wind: 4.971
Date: Thursday, 19 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Thursday, 19 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Thursday, 19 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Thursday, 19 July 2018, 18:00 — 00:00 Wind: 1.864
Date: Friday, 20 July 2018, 00:00 — 06:00 Wind: 2.486
Date: Friday, 20 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Friday, 20 July 2018, 12:00 — 18:00 Wind: 8.078
Date: Friday, 20 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Saturday, 21 July 2018, 00:00 — 06:00 Wind: 0.621
Date: Saturday, 21 July 2018, 06:00 — 12:00 Wind: 1.243
Date: Saturday, 21 July 2018, 12:00 — 18:00 Wind: 2.486
Date: Saturday, 21 July 2018, 18:00 — 00:00 Wind: 7.457
Date: Sunday, 22 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Sunday, 22 July 2018, 06:00 — 12:00 Wind: 6.836
Date: Sunday, 22 July 2018, 12:00 — 18:00 Wind: 4.35
Date: Sunday, 22 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Monday, 23 July 2018, 00:00 — 06:00 Wind: 2.486
Date: Monday, 23 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Monday, 23 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Monday, 23 July 2018, 18:00 — 00:00 Wind: 4.971
Date: Tuesday, 24 July 2018, 00:00 — 06:00 Wind: 3.107
Date: Tuesday, 24 July 2018, 06:00 — 12:00 Wind: 7.457
Date: Tuesday, 24 July 2018, 12:00 — 18:00 Wind: 4.35
Date: Tuesday, 24 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Wednesday, 25 July 2018, 00:00 — 06:00 Wind: 1.243
Date: Wednesday, 25 July 2018, 06:00 — 12:00 Wind: 3.728
Date: Wednesday, 25 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Wednesday, 25 July 2018, 18:00 — 00:00 Wind: 7.457
Date: Thursday, 26 July 2018, 00:00 — 06:00 Wind: 7.457
Date: Thursday, 26 July 2018, 06:00 — 12:00 Wind: 9.321
Date: Thursday, 26 July 2018, 12:00 — 18:00 Wind: 11.185
Date: Thursday, 26 July 2018, 18:00 — 00:00 Wind: 7.457
Date: Friday, 27 July 2018, 00:00 — 06:00 Wind: 6.836
Date: Friday, 27 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Friday, 27 July 2018, 12:00 — 18:00 Wind: 4.35
Date: Friday, 27 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Saturday, 28 July 2018, 00:00 — 06:00 Wind: 3.728
Date: Saturday, 28 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Saturday, 28 July 2018, 12:00 — 18:00 Wind: 1.864
Date: Saturday, 28 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Sunday, 29 July 2018, 00:00 — 06:00 Wind: 3.107
Date: Sunday, 29 July 2018, 06:00 — 12:00 Wind: 6.836
Date: Sunday, 29 July 2018, 12:00 — 18:00 Wind: 5.593
Date: Sunday, 29 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Monday, 30 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Monday, 30 July 2018, 06:00 — 12:00 Wind: 3.728
Date: Monday, 30 July 2018, 12:00 — 18:00 Wind: 4.971
Date: Monday, 30 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Tuesday, 31 July 2018, 00:00 — 06:00 Wind: 1.243
Date: Tuesday, 31 July 2018, 06:00 — 12:00 Wind: 6.836
Date: Tuesday, 31 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Tuesday, 31 July 2018, 18:00 — 00:00 Wind: 3.107
选项2:
print (table.iloc[:,4])
0 3 mph
1 No wind
2 No wind
3 No wind
4 No wind
5 No wind
6 No wind
7 3 mph
8 5 mph
9 6 mph
10 5 mph
11 5 mph
12 6 mph
13 5 mph
14 No wind
15 3 mph
16 No wind
17 No wind
18 No wind
19 No wind
20 5 mph
21 No wind
22 6 mph
23 6 mph
24 5 mph
25 6 mph
26 7 mph
27 7 mph
28 7 mph
29 3 mph
30 3 mph
31 3 mph
32 3 mph
33 No wind
34 3 mph
35 3 mph
36 No wind
37 No wind
38 NaN
Name: (Unnamed: 4_level_0, Wind), dtype: object
Date: Sunday, 1 July 2018, 00:00 — 06:00 Wind: 0.621
Date: Sunday, 1 July 2018, 06:00 — 12:00 Wind: 3.728
Date: Sunday, 1 July 2018, 12:00 — 18:00 Wind: 3.107
Date: Sunday, 1 July 2018, 18:00 — 00:00 Wind: 3.107
Date: Monday, 2 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Monday, 2 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Monday, 2 July 2018, 12:00 — 18:00 Wind: 8.7
Date: Monday, 2 July 2018, 18:00 — 00:00 Wind: 9.943
Date: Tuesday, 3 July 2018, 00:00 — 06:00 Wind: 10.564
Date: Tuesday, 3 July 2018, 06:00 — 12:00 Wind: 11.185
Date: Tuesday, 3 July 2018, 12:00 — 18:00 Wind: 9.943
Date: Tuesday, 3 July 2018, 18:00 — 00:00 Wind: 6.214
Date: Wednesday, 4 July 2018, 00:00 — 06:00 Wind: 6.836
Date: Wednesday, 4 July 2018, 06:00 — 12:00 Wind: 4.971
Date: Wednesday, 4 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Wednesday, 4 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Thursday, 5 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Thursday, 5 July 2018, 06:00 — 12:00 Wind: 1.864
Date: Thursday, 5 July 2018, 12:00 — 18:00 Wind: 3.107
Date: Thursday, 5 July 2018, 18:00 — 00:00 Wind: 3.107
Date: Friday, 6 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Friday, 6 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Friday, 6 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Friday, 6 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Saturday, 7 July 2018, 00:00 — 06:00 Wind: 1.243
Date: Saturday, 7 July 2018, 06:00 — 12:00 Wind: 2.486
Date: Saturday, 7 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Saturday, 7 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Sunday, 8 July 2018, 00:00 — 06:00 Wind: 3.107
Date: Sunday, 8 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Sunday, 8 July 2018, 12:00 — 18:00 Wind: 5.593
Date: Sunday, 8 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Monday, 9 July 2018, 00:00 — 06:00 Wind: 5.593
Date: Monday, 9 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Monday, 9 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Monday, 9 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Tuesday, 10 July 2018, 00:00 — 06:00 Wind: 6.836
Date: Tuesday, 10 July 2018, 06:00 — 12:00 Wind: 8.078
Date: Tuesday, 10 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Tuesday, 10 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Wednesday, 11 July 2018, 00:00 — 06:00 Wind: 6.214
Date: Wednesday, 11 July 2018, 06:00 — 12:00 Wind: 12.428
Date: Wednesday, 11 July 2018, 12:00 — 18:00 Wind: 8.078
Date: Wednesday, 11 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Thursday, 12 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Thursday, 12 July 2018, 06:00 — 12:00 Wind: 8.078
Date: Thursday, 12 July 2018, 12:00 — 18:00 Wind: 7.457
Date: Thursday, 12 July 2018, 18:00 — 00:00 Wind: 6.214
Date: Friday, 13 July 2018, 00:00 — 06:00 Wind: 5.593
Date: Friday, 13 July 2018, 06:00 — 12:00 Wind: 11.807
Date: Friday, 13 July 2018, 12:00 — 18:00 Wind: 9.321
Date: Friday, 13 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Saturday, 14 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Saturday, 14 July 2018, 06:00 — 12:00 Wind: 4.971
Date: Saturday, 14 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Saturday, 14 July 2018, 18:00 — 00:00 Wind: 6.214
Date: Sunday, 15 July 2018, 00:00 — 06:00 Wind: 8.7
Date: Sunday, 15 July 2018, 06:00 — 12:00 Wind: 8.7
Date: Sunday, 15 July 2018, 12:00 — 18:00 Wind: 8.7
Date: Sunday, 15 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Monday, 16 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Monday, 16 July 2018, 06:00 — 12:00 Wind: 11.185
Date: Monday, 16 July 2018, 12:00 — 18:00 Wind: 11.185
Date: Monday, 16 July 2018, 18:00 — 00:00 Wind: 8.7
Date: Tuesday, 17 July 2018, 00:00 — 06:00 Wind: 7.457
Date: Tuesday, 17 July 2018, 06:00 — 12:00 Wind: 8.078
Date: Tuesday, 17 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Tuesday, 17 July 2018, 18:00 — 00:00 Wind: 4.971
Date: Wednesday, 18 July 2018, 00:00 — 06:00 Wind: 3.728
Date: Wednesday, 18 July 2018, 06:00 — 12:00 Wind: 2.486
Date: Wednesday, 18 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Wednesday, 18 July 2018, 18:00 — 00:00 Wind: 4.971
Date: Thursday, 19 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Thursday, 19 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Thursday, 19 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Thursday, 19 July 2018, 18:00 — 00:00 Wind: 1.864
Date: Friday, 20 July 2018, 00:00 — 06:00 Wind: 2.486
Date: Friday, 20 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Friday, 20 July 2018, 12:00 — 18:00 Wind: 8.078
Date: Friday, 20 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Saturday, 21 July 2018, 00:00 — 06:00 Wind: 0.621
Date: Saturday, 21 July 2018, 06:00 — 12:00 Wind: 1.243
Date: Saturday, 21 July 2018, 12:00 — 18:00 Wind: 2.486
Date: Saturday, 21 July 2018, 18:00 — 00:00 Wind: 7.457
Date: Sunday, 22 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Sunday, 22 July 2018, 06:00 — 12:00 Wind: 6.836
Date: Sunday, 22 July 2018, 12:00 — 18:00 Wind: 4.35
Date: Sunday, 22 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Monday, 23 July 2018, 00:00 — 06:00 Wind: 2.486
Date: Monday, 23 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Monday, 23 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Monday, 23 July 2018, 18:00 — 00:00 Wind: 4.971
Date: Tuesday, 24 July 2018, 00:00 — 06:00 Wind: 3.107
Date: Tuesday, 24 July 2018, 06:00 — 12:00 Wind: 7.457
Date: Tuesday, 24 July 2018, 12:00 — 18:00 Wind: 4.35
Date: Tuesday, 24 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Wednesday, 25 July 2018, 00:00 — 06:00 Wind: 1.243
Date: Wednesday, 25 July 2018, 06:00 — 12:00 Wind: 3.728
Date: Wednesday, 25 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Wednesday, 25 July 2018, 18:00 — 00:00 Wind: 7.457
Date: Thursday, 26 July 2018, 00:00 — 06:00 Wind: 7.457
Date: Thursday, 26 July 2018, 06:00 — 12:00 Wind: 9.321
Date: Thursday, 26 July 2018, 12:00 — 18:00 Wind: 11.185
Date: Thursday, 26 July 2018, 18:00 — 00:00 Wind: 7.457
Date: Friday, 27 July 2018, 00:00 — 06:00 Wind: 6.836
Date: Friday, 27 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Friday, 27 July 2018, 12:00 — 18:00 Wind: 4.35
Date: Friday, 27 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Saturday, 28 July 2018, 00:00 — 06:00 Wind: 3.728
Date: Saturday, 28 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Saturday, 28 July 2018, 12:00 — 18:00 Wind: 1.864
Date: Saturday, 28 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Sunday, 29 July 2018, 00:00 — 06:00 Wind: 3.107
Date: Sunday, 29 July 2018, 06:00 — 12:00 Wind: 6.836
Date: Sunday, 29 July 2018, 12:00 — 18:00 Wind: 5.593
Date: Sunday, 29 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Monday, 30 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Monday, 30 July 2018, 06:00 — 12:00 Wind: 3.728
Date: Monday, 30 July 2018, 12:00 — 18:00 Wind: 4.971
Date: Monday, 30 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Tuesday, 31 July 2018, 00:00 — 06:00 Wind: 1.243
Date: Tuesday, 31 July 2018, 06:00 — 12:00 Wind: 6.836
Date: Tuesday, 31 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Tuesday, 31 July 2018, 18:00 — 00:00 Wind: 3.107
您可以在html中找到并提取json结构,然后使用它。但是,当我尝试这样做时,它按小时列出了月份的数据,而不是一天的数据:
import numpy as np
import requests
import bs4
import json
wind = np.random.rand(120)
dailyWindRecord = np.random.rand(30,4)
url = 'https://www.timeanddate.com/weather/pakistan/lahore/historic?month=7&year=2018'
response = requests.get(url)
soup = bs4.BeautifulSoup(response.text, 'html.parser')
scripts = soup.find_all('script')
jsonObj = None
for script in scripts:
if 'var data=' in script.text:
jsonStr = script.text.strip()
jsonStr = jsonStr.split('var data=')[1]
jsonStr = jsonStr.split(';')[0]
jsonObj = json.loads(jsonStr)
for item in jsonObj['detail']:
date = item['ds']
wind = item['wind']
print ('Date: %-40s Wind: %s' %(date,wind) )
输出:
print (table.iloc[:,4])
0 3 mph
1 No wind
2 No wind
3 No wind
4 No wind
5 No wind
6 No wind
7 3 mph
8 5 mph
9 6 mph
10 5 mph
11 5 mph
12 6 mph
13 5 mph
14 No wind
15 3 mph
16 No wind
17 No wind
18 No wind
19 No wind
20 5 mph
21 No wind
22 6 mph
23 6 mph
24 5 mph
25 6 mph
26 7 mph
27 7 mph
28 7 mph
29 3 mph
30 3 mph
31 3 mph
32 3 mph
33 No wind
34 3 mph
35 3 mph
36 No wind
37 No wind
38 NaN
Name: (Unnamed: 4_level_0, Wind), dtype: object
Date: Sunday, 1 July 2018, 00:00 — 06:00 Wind: 0.621
Date: Sunday, 1 July 2018, 06:00 — 12:00 Wind: 3.728
Date: Sunday, 1 July 2018, 12:00 — 18:00 Wind: 3.107
Date: Sunday, 1 July 2018, 18:00 — 00:00 Wind: 3.107
Date: Monday, 2 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Monday, 2 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Monday, 2 July 2018, 12:00 — 18:00 Wind: 8.7
Date: Monday, 2 July 2018, 18:00 — 00:00 Wind: 9.943
Date: Tuesday, 3 July 2018, 00:00 — 06:00 Wind: 10.564
Date: Tuesday, 3 July 2018, 06:00 — 12:00 Wind: 11.185
Date: Tuesday, 3 July 2018, 12:00 — 18:00 Wind: 9.943
Date: Tuesday, 3 July 2018, 18:00 — 00:00 Wind: 6.214
Date: Wednesday, 4 July 2018, 00:00 — 06:00 Wind: 6.836
Date: Wednesday, 4 July 2018, 06:00 — 12:00 Wind: 4.971
Date: Wednesday, 4 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Wednesday, 4 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Thursday, 5 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Thursday, 5 July 2018, 06:00 — 12:00 Wind: 1.864
Date: Thursday, 5 July 2018, 12:00 — 18:00 Wind: 3.107
Date: Thursday, 5 July 2018, 18:00 — 00:00 Wind: 3.107
Date: Friday, 6 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Friday, 6 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Friday, 6 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Friday, 6 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Saturday, 7 July 2018, 00:00 — 06:00 Wind: 1.243
Date: Saturday, 7 July 2018, 06:00 — 12:00 Wind: 2.486
Date: Saturday, 7 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Saturday, 7 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Sunday, 8 July 2018, 00:00 — 06:00 Wind: 3.107
Date: Sunday, 8 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Sunday, 8 July 2018, 12:00 — 18:00 Wind: 5.593
Date: Sunday, 8 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Monday, 9 July 2018, 00:00 — 06:00 Wind: 5.593
Date: Monday, 9 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Monday, 9 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Monday, 9 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Tuesday, 10 July 2018, 00:00 — 06:00 Wind: 6.836
Date: Tuesday, 10 July 2018, 06:00 — 12:00 Wind: 8.078
Date: Tuesday, 10 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Tuesday, 10 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Wednesday, 11 July 2018, 00:00 — 06:00 Wind: 6.214
Date: Wednesday, 11 July 2018, 06:00 — 12:00 Wind: 12.428
Date: Wednesday, 11 July 2018, 12:00 — 18:00 Wind: 8.078
Date: Wednesday, 11 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Thursday, 12 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Thursday, 12 July 2018, 06:00 — 12:00 Wind: 8.078
Date: Thursday, 12 July 2018, 12:00 — 18:00 Wind: 7.457
Date: Thursday, 12 July 2018, 18:00 — 00:00 Wind: 6.214
Date: Friday, 13 July 2018, 00:00 — 06:00 Wind: 5.593
Date: Friday, 13 July 2018, 06:00 — 12:00 Wind: 11.807
Date: Friday, 13 July 2018, 12:00 — 18:00 Wind: 9.321
Date: Friday, 13 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Saturday, 14 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Saturday, 14 July 2018, 06:00 — 12:00 Wind: 4.971
Date: Saturday, 14 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Saturday, 14 July 2018, 18:00 — 00:00 Wind: 6.214
Date: Sunday, 15 July 2018, 00:00 — 06:00 Wind: 8.7
Date: Sunday, 15 July 2018, 06:00 — 12:00 Wind: 8.7
Date: Sunday, 15 July 2018, 12:00 — 18:00 Wind: 8.7
Date: Sunday, 15 July 2018, 18:00 — 00:00 Wind: 5.593
Date: Monday, 16 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Monday, 16 July 2018, 06:00 — 12:00 Wind: 11.185
Date: Monday, 16 July 2018, 12:00 — 18:00 Wind: 11.185
Date: Monday, 16 July 2018, 18:00 — 00:00 Wind: 8.7
Date: Tuesday, 17 July 2018, 00:00 — 06:00 Wind: 7.457
Date: Tuesday, 17 July 2018, 06:00 — 12:00 Wind: 8.078
Date: Tuesday, 17 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Tuesday, 17 July 2018, 18:00 — 00:00 Wind: 4.971
Date: Wednesday, 18 July 2018, 00:00 — 06:00 Wind: 3.728
Date: Wednesday, 18 July 2018, 06:00 — 12:00 Wind: 2.486
Date: Wednesday, 18 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Wednesday, 18 July 2018, 18:00 — 00:00 Wind: 4.971
Date: Thursday, 19 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Thursday, 19 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Thursday, 19 July 2018, 12:00 — 18:00 Wind: 6.214
Date: Thursday, 19 July 2018, 18:00 — 00:00 Wind: 1.864
Date: Friday, 20 July 2018, 00:00 — 06:00 Wind: 2.486
Date: Friday, 20 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Friday, 20 July 2018, 12:00 — 18:00 Wind: 8.078
Date: Friday, 20 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Saturday, 21 July 2018, 00:00 — 06:00 Wind: 0.621
Date: Saturday, 21 July 2018, 06:00 — 12:00 Wind: 1.243
Date: Saturday, 21 July 2018, 12:00 — 18:00 Wind: 2.486
Date: Saturday, 21 July 2018, 18:00 — 00:00 Wind: 7.457
Date: Sunday, 22 July 2018, 00:00 — 06:00 Wind: 4.971
Date: Sunday, 22 July 2018, 06:00 — 12:00 Wind: 6.836
Date: Sunday, 22 July 2018, 12:00 — 18:00 Wind: 4.35
Date: Sunday, 22 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Monday, 23 July 2018, 00:00 — 06:00 Wind: 2.486
Date: Monday, 23 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Monday, 23 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Monday, 23 July 2018, 18:00 — 00:00 Wind: 4.971
Date: Tuesday, 24 July 2018, 00:00 — 06:00 Wind: 3.107
Date: Tuesday, 24 July 2018, 06:00 — 12:00 Wind: 7.457
Date: Tuesday, 24 July 2018, 12:00 — 18:00 Wind: 4.35
Date: Tuesday, 24 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Wednesday, 25 July 2018, 00:00 — 06:00 Wind: 1.243
Date: Wednesday, 25 July 2018, 06:00 — 12:00 Wind: 3.728
Date: Wednesday, 25 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Wednesday, 25 July 2018, 18:00 — 00:00 Wind: 7.457
Date: Thursday, 26 July 2018, 00:00 — 06:00 Wind: 7.457
Date: Thursday, 26 July 2018, 06:00 — 12:00 Wind: 9.321
Date: Thursday, 26 July 2018, 12:00 — 18:00 Wind: 11.185
Date: Thursday, 26 July 2018, 18:00 — 00:00 Wind: 7.457
Date: Friday, 27 July 2018, 00:00 — 06:00 Wind: 6.836
Date: Friday, 27 July 2018, 06:00 — 12:00 Wind: 5.593
Date: Friday, 27 July 2018, 12:00 — 18:00 Wind: 4.35
Date: Friday, 27 July 2018, 18:00 — 00:00 Wind: 4.35
Date: Saturday, 28 July 2018, 00:00 — 06:00 Wind: 3.728
Date: Saturday, 28 July 2018, 06:00 — 12:00 Wind: 6.214
Date: Saturday, 28 July 2018, 12:00 — 18:00 Wind: 1.864
Date: Saturday, 28 July 2018, 18:00 — 00:00 Wind: 3.728
Date: Sunday, 29 July 2018, 00:00 — 06:00 Wind: 3.107
Date: Sunday, 29 July 2018, 06:00 — 12:00 Wind: 6.836
Date: Sunday, 29 July 2018, 12:00 — 18:00 Wind: 5.593
Date: Sunday, 29 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Monday, 30 July 2018, 00:00 — 06:00 Wind: 1.864
Date: Monday, 30 July 2018, 06:00 — 12:00 Wind: 3.728
Date: Monday, 30 July 2018, 12:00 — 18:00 Wind: 4.971
Date: Monday, 30 July 2018, 18:00 — 00:00 Wind: 2.486
Date: Tuesday, 31 July 2018, 00:00 — 06:00 Wind: 1.243
Date: Tuesday, 31 July 2018, 06:00 — 12:00 Wind: 6.836
Date: Tuesday, 31 July 2018, 12:00 — 18:00 Wind: 6.836
Date: Tuesday, 31 July 2018, 18:00 — 00:00 Wind: 3.107
下面是json格式的分解,以便访问wind
我的探索和非常非常肮脏的问题“解决方案” 1.漂亮的泡泡糖很好 看,它很好用 看看-我们看到熊猫正在使用
\u BeautifulSoupHtml5LibFrameParser
因此:美丽的乌苏很好
2.带卷曲的“本质脏的解决方案”
让我们试试:
我们在这里看到:
</script><script type="text/javascript">
var data={"copyright":"Contents are strictly for use by
timeanddate.com","units":
{"temp":"°C","prec":"mm","wind":"km\/h","baro":"mbar"},
"temp":
[{"date":15304047E5,"temp":29},{"date":15304065E5,"temp":29},
{"date":15304083E5,"temp":29},{"date":15304101E5,"temp":28},
...
var data={“版权”:“内容仅供用户使用。”
timeanddate.com,“单位”:
{“温度”:“摄氏度”,“气压”:“毫米”,“风”:“千米/小时”,“气压”:“毫巴”},
“临时工”:
[{“日期”:15304047E5,“临时”:29},{“日期”:15304065E5,“临时”:29},
{“日期”:15304083E5,“临时”:29},{“日期”:15304101E5,“临时”:28},
...
我想是OP在寻找数据
3.可能的解决办法
curl
/wget
/请求
-一切都必须正常var数据
.Pythonstr
-方法必须足够json.load
此提取的数据
附笔。
就个人而言,我喜欢熊猫解决方案
因为熊猫本身就是一个伟大的图书馆
但是不需要熊猫来解决这个问题。这个网页可能正在使用一些JavaScript来加载进一步的数据。你是否尝试过在这个页面上说
wget
/curl
然后在文件上运行你的脚本?我对curl没有问题https://www.timeanddate.com/weather/pakistan/lahore/historic\?月\=7年\=2018>结果t、 html
,我看到var data
-包含所有数据的json。@Thierrylahuille,我要查找的div类位于一个div id之下,我可以使用beautifulsoup找到该div id,但由于某种原因,该特定类的后代丢失了。我不知道Javascript可以用于添加或删除html元素,但可以单独读取ud,这是有意义的。所以当检查网页时,你怎么知道html元素是通过JS还是JS加载的呢nto@AlexYu,风速在你的解决方案中。它在键下细节。我将在下面的图片中发布给你看。哇!这是一个意想不到的解决方案!虽然OP问题可以在没有熊猫的情况下解决-这是一个很好的发现实际上做得很好,谢谢。因此,每当我需要查找一些表元素时,最好使用pandas,而不是requests或beautifulsoups。如果我看到
标记,我通常会尝试pandas。read_html()
首先。如果它能得到我所需要的,我就用它,我宁愿做一点工作来操作一个数据帧,也不愿做大量的工作来遍历所有的表、th、tr、td标记。@HamzaWaheed pandas under the hoods很棒!我认为“选项2”是最好的解决方案。啊,好吧。这与我有时做的事情没有太大区别。我有时会拉出json结构。我现在刚刚绑定了它,但似乎它的数据不是每小时一天,而是整个月的跨度(请参见下面的“编辑到我的解决方案”)。你的方法得到每小时一天的数据吗?我只是探索了一下“beautyfulsoap没有阅读页面的某些部分”-我甚至没有仔细查看OP要求的数据。我也喜欢你的“选项2”。我是否可以给你第二个加号。