Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/349.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 您好,我想提取一个网站并将数据存储在excel文件中_Python_Pandas_Beautifulsoup_Data Mining - Fatal编程技术网

Python 您好,我想提取一个网站并将数据存储在excel文件中

Python 您好,我想提取一个网站并将数据存储在excel文件中,python,pandas,beautifulsoup,data-mining,Python,Pandas,Beautifulsoup,Data Mining,我想提取一个网站并将数据存储在excel文件中 网站:- 在本网站中存储了一份州级新冠病毒-19数据 我想用python在excel文件中提取数据 请记住,我只希望网站表格格式中的状态数据此代码可能会帮助您:(印度python程序中的Corona病毒(COVID-19)病例): 输出: 您尝试过什么?您需要提供一些代码来获得答案。我们不是来为你编程的。也就是说,从网站上获取数据的最简单方法是curl。如果您需要更多功能,可以尝试scrapy,如果您需要ajax和更高级别的scraping,可以使

我想提取一个网站并将数据存储在excel文件中 网站:-

在本网站中存储了一份州级新冠病毒-19数据 我想用python在excel文件中提取数据

请记住,我只希望网站表格格式中的状态数据

此代码可能会帮助您:(印度python程序中的Corona病毒(COVID-19)病例):

输出:

您尝试过什么?您需要提供一些代码来获得答案。我们不是来为你编程的。也就是说,从网站上获取数据的最简单方法是curl。如果您需要更多功能,可以尝试scrapy,如果您需要ajax和更高级别的scraping,可以使用selenium,
import urllib.request
import os
import time
while(True):
    urllib.request.urlretrieve ("https://www.mohfw.gov.in/", "webpage.html")
    o=open('webpage.html',"r")
    print("\n***********************************COVID-19 CASES IN INDIA********************************")
    print("* STATE\t\t\t\t CONFIRMED CASES\t CURED\t\t\tDEATH    *")
    states=["Andaman and Nicobar Islands","Andhra Pradesh","Arunachal Pradesh","Assam","Bihar","Chandigarh","Chhattisgarh","Delhi","Goa","Gujarat","Haryana","Himachal Pradesh","Jammu and Kashmir","Jharkhand","Karnataka","Kerala","Ladakh","Madhya Pradesh","Maharashtra","Manipur","Mizoram","Odisha","Puducherry","Punjab","Rajasthan","Tamil Nadu","Telengana","Uttarakhand","Uttar Pradesh","West Bengal","Total"]        
    j=0
    c=0
    for i in o:
        s=str(i)
        if(c!=0):
            temp=''
            for p in s:
                if p in '0123456789':
                    temp+=p
            if(len(temp)==0):
                continue
            print(temp,end='')
            c+=1
            if(c==4):
                c=0
                for space in range(len(temp),9):
                    print(" ",end='')
                print("*")
                continue
            for space in range(len(temp),23):
                print(" ",end='')
            continue
        if(j>30):
            break                    
        if(states[j] in s):
            print("*",end=' ')
            print(states[j],end="")
            for space in range(len(states[j]),32):
                print("",end=' ')
            c=1
            j+=1
            continue
    print("******************************************************************************************")
    o.close()
    os.remove("webpage.html")
    print("UPDATING IN 6 MINUTES...")
    time.sleep(600)