Warning: file_get_contents(/data/phpspider/zhask/data//catemap/7/python-2.7/5.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 2.7 美化组输出未传输到CSV文件_Python 2.7_Web Scraping_Beautifulsoup_Export To Csv - Fatal编程技术网

Python 2.7 美化组输出未传输到CSV文件

Python 2.7 美化组输出未传输到CSV文件,python-2.7,web-scraping,beautifulsoup,export-to-csv,Python 2.7,Web Scraping,Beautifulsoup,Export To Csv,我正在尝试将输出从webscraper导出到CSV文件。当我在终端中运行代码时,代码工作正常,我得到了正确的输出,但它不会传输到CSV文件 问题 当我删除第一个for循环时,它工作得很好,但是我不能确切地指出这部分中的错误是什么 代码 import csv ; import requests from bs4 import BeautifulSoup outfile = open('ImplementTest8.csv','w') writer = csv.writer(outfile) wr

我正在尝试将输出从webscraper导出到CSV文件。当我在终端中运行代码时,代码工作正常,我得到了正确的输出,但它不会传输到CSV文件

问题

当我删除第一个for循环时,它工作得很好,但是我不能确切地指出这部分中的错误是什么

代码

import csv ; import requests
from bs4 import BeautifulSoup

outfile = open('ImplementTest8.csv','w')
writer = csv.writer(outfile)
writer.writerow(["job_link", "job_desc"])

res = requests.get("http://implementconsultinggroup.com/career/#/6257").text
soup = BeautifulSoup(res,"lxml")
links = soup.find_all("a")

for li in soup.find('ul', class_='list-articles list').find_all('li'):
    level = li.find_all('dd', {'class': 'author'})[1].get_text()
    if "Graduate" in level:
        links = li.find_all("href")
        for link in links:
            if "career" in link.get("href") and 'COPENHAGEN' in link.text:
                item_link = link.get("href").strip()
                item_text = link.text.replace("View Position","").encode('utf-8').strip()
                writer.writerow([item_link, item_text])
                print(item_link, item_text)
import csv ; import requests
from bs4 import BeautifulSoup

outfile = open('ImplementTest8.csv','w')
writer = csv.writer(outfile)
writer.writerow(["job_link", "job_desc"])

res = requests.get("http://implementconsultinggroup.com/career/#/6257").text
soup = BeautifulSoup(res,"lxml")
links = soup.find_all("a")

for li in soup.find('ul', class_='list-articles list').find_all('li'):
    level = li.find_all('dd', {'class': 'author'})[1].get_text()
    if "Graduate" in level:
        links = li.find_all(href=True)
        for link in links:
            if "career" in link.get("href") and 'COPENHAGEN' in link.text:
                item_link = link.get("href").strip()
                item_text = link.text.replace("View Position","").encode('utf-8').strip()
                writer.writerow([item_link, item_text])
                print(item_link, item_text)
编辑的代码

import csv ; import requests
from bs4 import BeautifulSoup

outfile = open('ImplementTest8.csv','w')
writer = csv.writer(outfile)
writer.writerow(["job_link", "job_desc"])

res = requests.get("http://implementconsultinggroup.com/career/#/6257").text
soup = BeautifulSoup(res,"lxml")
links = soup.find_all("a")

for li in soup.find('ul', class_='list-articles list').find_all('li'):
    level = li.find_all('dd', {'class': 'author'})[1].get_text()
    if "Graduate" in level:
        links = li.find_all("href")
        for link in links:
            if "career" in link.get("href") and 'COPENHAGEN' in link.text:
                item_link = link.get("href").strip()
                item_text = link.text.replace("View Position","").encode('utf-8').strip()
                writer.writerow([item_link, item_text])
                print(item_link, item_text)
import csv ; import requests
from bs4 import BeautifulSoup

outfile = open('ImplementTest8.csv','w')
writer = csv.writer(outfile)
writer.writerow(["job_link", "job_desc"])

res = requests.get("http://implementconsultinggroup.com/career/#/6257").text
soup = BeautifulSoup(res,"lxml")
links = soup.find_all("a")

for li in soup.find('ul', class_='list-articles list').find_all('li'):
    level = li.find_all('dd', {'class': 'author'})[1].get_text()
    if "Graduate" in level:
        links = li.find_all(href=True)
        for link in links:
            if "career" in link.get("href") and 'COPENHAGEN' in link.text:
                item_link = link.get("href").strip()
                item_text = link.text.replace("View Position","").encode('utf-8').strip()
                writer.writerow([item_link, item_text])
                print(item_link, item_text)

Href是标记属性,而不是标记名称。如果要确保所有链接都具有href属性,可以将其用作,否则请使用标记名

links = li.find_all(href=True)

Href是标记属性,而不是标记名称。如果要确保所有链接都具有href属性,可以将其用作,否则请使用标记名

links = li.find_all(href=True)

感谢T.M-我刚刚尝试将代码从(href)更改为(href=True),但它给了我以下错误:文件“”,第4行语法错误:关键字不能是表达式不要在
href
中使用引号,它是参数名称而不是参数值。该值为
True
已获取-CSV表仍然为空。我已经添加了上面编辑过的代码。对不起-csv文件的创建花了一点时间。谢谢-这解决了我的问题。谢谢T.M-我刚刚尝试将代码从(href)更改为(href=True),但它给了我以下错误:文件“”,第4行语法错误:关键字不能是表达式不要在
href
中使用引号,它是参数名称而不是参数值。该值为
True
已获取-CSV表仍然为空。我已经添加了上面编辑过的代码。对不起-csv文件的创建花了一点时间。谢谢-这解决了我的问题。