Python 3.x: Problem with Bulgarian text in Excel when scraping with bs4


python-3.x web-scraping


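I am trying to scrape a website that contains Bulgarian text. The scraping itself succeeds, but when I store the results in a CSV file and open it in Excel, the text is unreadable. Please see the code below to better understand my problem.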

    import csv

    import bs4
    import requests

    res = requests.get('https://m.mobile.bg/results?pubtype=1&marka=Toyota&currency=%D0%BB%D0%B2.&sort=1&nup=0~1')
    soup = bs4.BeautifulSoup(res.text, 'lxml')
    file = open('cars.csv', 'w')
    writer = csv.writer(file)
    # write title row
    writer.writerow(['Car_Make', 'Price', 'info', 'date'])
    for i in soup.select('.listItem'):
        car_make = i.find('div', attrs = {"class":"title"})
        arr = i.text
        print(arr)
        writer.writerow([arr.encode('utf-8')])
    file.close()
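One likely contributor to the unreadable output is the `arr.encode('utf-8')` call: in Python 3, `csv.writer` converts non-string values with `str()`, so a `bytes` object ends up in the file as its literal representation (`b'\xd0...'`) rather than as Cyrillic text. The sketch below reproduces this offline with an in-memory buffer. The sample string is a hypothetical stand-in for the scraped text, not data from the site.

```python
import csv
import io

# Hypothetical sample value standing in for scraped Bulgarian text.
text = "Тойота"

# Case 1: pass bytes to the writer, as the question's code does.
# csv.writer calls str() on the value, so the cell holds the bytes repr.
buf_bytes = io.StringIO()
csv.writer(buf_bytes).writerow([text.encode("utf-8")])
bytes_cell = buf_bytes.getvalue().strip()

# Case 2: pass the str itself; the Cyrillic characters are preserved.
buf_text = io.StringIO()
csv.writer(buf_text).writerow([text])
text_cell = buf_text.getvalue().strip()

print(bytes_cell)  # a cell starting with b'\xd0...
print(text_cell)   # the original Cyrillic string
```

Writing the `str` directly and choosing the encoding in `open()` instead keeps the text intact in the file.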

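    import requests
    import csv
    from bs4 import BeautifulSoup


    def main(url):
        # Query parameters; requests percent-encodes the Cyrillic value.
        params = {
            "pubtype": "1",
            "marka": "Toyota",
            "currency": "лв.",
            "sort": "1",
            "nup": "0~1"
        }
        r = requests.get(url, params=params)
        soup = BeautifulSoup(r.text, 'lxml')
        # utf-8-sig writes a BOM so that Excel detects the file as UTF-8.
        with open('d.csv', 'w', newline='', encoding='utf-8-sig') as f:
            writer = csv.writer(f)
            # One row per listing, one cell per text fragment.
            writer.writerows([list(x.strings) for x in soup.select('.listItem.TOPitem')])


    main('https://m.mobile.bg/results')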
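To see how the readable `currency` value relates to the `%D0%BB%D0%B2.` fragment in the question's URL, here is a small offline sketch using only the standard library. It assumes `requests` builds the query string from `params` in a comparable way; `urlencode` is used here purely for illustration and is not what the answer's code calls directly.

```python
from urllib.parse import unquote, urlencode

# Decode the percent-encoded value taken from the question's URL.
currency = unquote("%D0%BB%D0%B2.")

# Same parameters as in the answer, with the decoded currency value.
params = {
    "pubtype": "1",
    "marka": "Toyota",
    "currency": currency,
    "sort": "1",
    "nup": "0~1",
}

# Encoding the dict again yields the percent-encoded form seen in the URL.
query = urlencode(params)

print(currency)
print(query)
```

Passing the parameters as a dict keeps the Cyrillic value readable in the source and leaves the percent-encoding to the library.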

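If plain utf-8 does not solve the problem, try utf-8-sig.

Thank you very much, @barny. I did not know the terminology, as this is my first time doing a task like this. Thank you for clarifying the terms.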
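As a rough illustration of what `utf-8-sig` changes: the codec prepends a byte order mark (BOM) to the file, which Excel generally uses as the signal to read a CSV as UTF-8. The sketch below writes a file to a temporary directory and checks for the BOM. The rows and the file name are hypothetical sample data, not scraped results.

```python
import codecs
import csv
import os
import tempfile

# Hypothetical sample rows; real rows would come from the scraper.
rows = [["Car_Make", "Price"], ["Тойота", "10 000 лв."]]

# Write the CSV with a UTF-8 BOM at the start of the file.
path = os.path.join(tempfile.mkdtemp(), "cars.csv")
with open(path, "w", newline="", encoding="utf-8-sig") as f:
    csv.writer(f).writerows(rows)

# Inspect the raw bytes: the file should begin with the BOM.
with open(path, "rb") as f:
    raw = f.read()
has_bom = raw.startswith(codecs.BOM_UTF8)

# Reading with utf-8-sig strips the BOM again, so the data round-trips.
with open(path, newline="", encoding="utf-8-sig") as f:
    round_trip = list(csv.reader(f))

print(has_bom)
print(round_trip == rows)
```

With plain `utf-8` the bytes of the text are identical, but the BOM is missing, so Excel may fall back to a legacy code page and show garbled characters.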
