Python 您好,我想提取一个网站并将数据存储在excel文件中
我想提取一个网站并将数据存储在excel文件中 网站:- 在本网站中存储了一份州级新冠病毒-19数据 我想用python在excel文件中提取数据 请记住,我只希望网站表格格式中的状态数据此代码可能会帮助您:(印度python程序中的Corona病毒(COVID-19)病例): 输出:Python 您好,我想提取一个网站并将数据存储在excel文件中,python,pandas,beautifulsoup,data-mining,Python,Pandas,Beautifulsoup,Data Mining,我想提取一个网站并将数据存储在excel文件中 网站:- 在本网站中存储了一份州级新冠病毒-19数据 我想用python在excel文件中提取数据 请记住,我只希望网站表格格式中的状态数据此代码可能会帮助您:(印度python程序中的Corona病毒(COVID-19)病例): 输出: 您尝试过什么?您需要提供一些代码来获得答案。我们不是来为你编程的。也就是说,从网站上获取数据的最简单方法是curl。如果您需要更多功能,可以尝试scrapy,如果您需要ajax和更高级别的scraping,可以使
您尝试过什么?您需要提供一些代码来获得答案。我们不是来为你编程的。也就是说,从网站上获取数据的最简单方法是curl。如果您需要更多功能,可以尝试scrapy,如果您需要ajax和更高级别的scraping,可以使用selenium,
import urllib.request
import os
import time
while(True):
urllib.request.urlretrieve ("https://www.mohfw.gov.in/", "webpage.html")
o=open('webpage.html',"r")
print("\n***********************************COVID-19 CASES IN INDIA********************************")
print("* STATE\t\t\t\t CONFIRMED CASES\t CURED\t\t\tDEATH *")
states=["Andaman and Nicobar Islands","Andhra Pradesh","Arunachal Pradesh","Assam","Bihar","Chandigarh","Chhattisgarh","Delhi","Goa","Gujarat","Haryana","Himachal Pradesh","Jammu and Kashmir","Jharkhand","Karnataka","Kerala","Ladakh","Madhya Pradesh","Maharashtra","Manipur","Mizoram","Odisha","Puducherry","Punjab","Rajasthan","Tamil Nadu","Telengana","Uttarakhand","Uttar Pradesh","West Bengal","Total"]
j=0
c=0
for i in o:
s=str(i)
if(c!=0):
temp=''
for p in s:
if p in '0123456789':
temp+=p
if(len(temp)==0):
continue
print(temp,end='')
c+=1
if(c==4):
c=0
for space in range(len(temp),9):
print(" ",end='')
print("*")
continue
for space in range(len(temp),23):
print(" ",end='')
continue
if(j>30):
break
if(states[j] in s):
print("*",end=' ')
print(states[j],end="")
for space in range(len(states[j]),32):
print("",end=' ')
c=1
j+=1
continue
print("******************************************************************************************")
o.close()
os.remove("webpage.html")
print("UPDATING IN 6 MINUTES...")
time.sleep(600)