Warning: file_get_contents(/data/phpspider/zhask/data//catemap/2/python/297.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python pandas read_html不存储完整数据_Python_Pandas_Web Scraping - Fatal编程技术网

Python pandas read_html不存储完整数据

Python pandas read_html不存储完整数据,python,pandas,web-scraping,Python,Pandas,Web Scraping,我在pandas中使用read_html函数从一些html表中提取数据。但由于某种原因,输出在达到一定大小后会被削减: 例如: 0 RECKITT BENCKISER INDIA PRIVATE LIMITED Vs.ST... 1 SMT. SONY AND ANOTHER Vs. STATE OF UTTARAKHA... 2 BHATIA BHAWAN DHARAMSHALA Vs. STATE OF UTTAR... 3 MOHD. YASEEN

我在pandas中使用read_html函数从一些html表中提取数据。但由于某种原因,输出在达到一定大小后会被削减:

例如:

0     RECKITT BENCKISER INDIA PRIVATE LIMITED  Vs.ST...
1     SMT. SONY AND ANOTHER  Vs.  STATE OF UTTARAKHA...
2     BHATIA BHAWAN DHARAMSHALA  Vs.  STATE OF UTTAR...
3     MOHD. YASEEN AND OTHERS  Vs.  STATE OF UTTARAK...
4     DR. ADITYA PRAKASH SINGH  Vs.  STATE OF UTTARA...
5     DR. MANOJ KUMAR UNIYAL  Vs.  STATE OF UTTARAKH...
6     DR. LALIT MOHAN PANDEY  Vs.  STATE OF UTTARAKH...
7     SUBHAM SAINI AND ANOTHER  Vs.  STATE OF UTTARA...
在这里的每种情况下,表都应该存储UTTARAKHAND的状态(+更多数据)

源代码:

<span class="style2">RECKITT BENCKISER INDIA PRIVATE LIMITED
</span><br><span class="style4"> Vs.</span><br><span  
class="style2">STATE OF UTTARAKHAND AND ANOTHER
</span></td><td width="20%"

它正在获得完整的文本。由于列宽有限,未显示全文

选中此项:

import pandas as pd

pd.set_option('max_colwidth',400)
df=pd.read_html('http://pastebin.com/raw/p7vfb2JG')[0]
df.head()
输出:


包括从中获取表格的url。url是某些响应的一部分:
import pandas as pd

pd.set_option('max_colwidth',400)
df=pd.read_html('http://pastebin.com/raw/p7vfb2JG')[0]
df.head()