使用python解析并另存为csv
我正在尝试用python和beautifulsoup包解析一个网页。我有控制台在保存前打印出解析结果。在cmd控制台中进行解析时,每列中的每个数据都可以很好地打印出来。但当保存到csv时,遇到逗号时,该行跳到下一列。遇到逗号之前的列都很好。我正在使用MS Professional Plus 2010执行csv数据 代码如下:使用python解析并另存为csv,python,excel,csv,parsing,beautifulsoup,Python,Excel,Csv,Parsing,Beautifulsoup,我正在尝试用python和beautifulsoup包解析一个网页。我有控制台在保存前打印出解析结果。在cmd控制台中进行解析时,每列中的每个数据都可以很好地打印出来。但当保存到csv时,遇到逗号时,该行跳到下一列。遇到逗号之前的列都很好。我正在使用MS Professional Plus 2010执行csv数据 代码如下: from urllib.request import urlopen as uReq from bs4 import BeautifulSoup as soup imp
from urllib.request import urlopen as uReq
from bs4 import BeautifulSoup as soup
import requests
from lxml import html
import re
filename = "eng.csv"
f = open(filename, "w")
headers ="abc, def, cdf\n"
f.write(headers)
url = ""
r = requests.get(url, headers = {"User-Agent":"Chrome/56.0.2924.87"})
tree = html.fromstring(r.content)
patternAB= r'ab\s=\s"(.*?)"'
script = tree.xpath('//script[contains(., "ab")]/text()')[0]
eng_name=re.search(patternAB, script).group(1)
script1 = tree.xpath('//script[contains(., "ab")]/text()')[2]
regions=re.search(patternAB, script1).group(1)
......#I use the above code repeatedly with different indexes
f.write(eng_name + ";" + regions + ";" + origins + ";" + "\n")
#I also tried "," as connector but to no avail.
f.close()
知道哪里出了问题吗?
提前感谢。在cmd提示符中添加您收到的样本数据。还可以使用csv模块,因为它会处理很多事情,如转义等。如果您对问题进行编辑以包含
url
,我们将能够重新创建您的问题。