Python 如何将以下行解析为数据帧_Python_Regex_Python 3.x_Pandas_Dataframe - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/regex/16.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 如何将以下行解析为数据帧_Python_Regex_Python 3.x_Pandas_Dataframe - Fatal编程技术网

Python 如何将以下行解析为数据帧

python regex python-3.x pandas dataframe

Python 如何将以下行解析为数据帧,python,regex,python-3.x,pandas,dataframe,Python,Regex,Python 3.x,Pandas,Dataframe,我已经编写了一个基于Selenium的Python3程序，从一个懒散的滚动网站上抓取一个版主列表我在最后一步上被难倒了。变换线，例如： Eleni Efstathiou (Houston, US) Silke Gillessen (St. Gallen, CH) Susana Banerjee (London, GB) Sandro Pignata (Napoli, IT) Rick L. Haas (Amsterdam, NL) 进入名称、城市和国家变量中，然后将这些变量填充到数据框中。

我已经编写了一个基于Selenium的Python3程序，从一个懒散的滚动网站上抓取一个版主列表

我在最后一步上被难倒了。变换线，例如：

Eleni Efstathiou (Houston, US) Silke Gillessen (St. Gallen, CH)
Susana Banerjee (London, GB) Sandro Pignata (Napoli, IT) 
Rick L. Haas (Amsterdam, NL)

进入名称、城市和国家变量中，然后将这些变量填充到数据框中。上面的数据将生成五行

请提供有关正则表达式或其他更简单方法的帮助？
您可以使用正则表达式提取字段并获得元组列表：

s = '''Eleni Efstathiou (Houston, US) Silke Gillessen (St. Gallen, CH) Susana Banerjee (London, GB) Sandro Pignata (Napoli, IT) Rick L. Haas (Amsterdam, NL)''' import re mods = re.findall('(.+?)\s+\((.+?),\s+(.+?)\)\s*',s)
接下来，将列表转换为数据帧：

pd.DataFrame(mods, columns=('name', 'city', 'nation')) # name city nation #0 Eleni Efstathiou Houston US #1 Silke Gillessen St. Gallen CH #2 Susana Banerjee London GB #3 Sandro Pignata Napoli IT #4 Rick L. Haas Amsterdam NL

对不起，我问得不清楚。我无法定义一个正则表达式模式，只能从出现次数可变的行中提取名称、城市、国家信息。它在我回答的第一行中定义。非常感谢。这给了我一个很好的基础，以适用于我的情况。

[regex]相关文章推荐

随机文章推荐

如何从XSLT创建XSD文档？ xslt

检查起始字符应为'；T'；接下来的3个字符应该是xslt中的数字 xslt tridion

如何使用XSLT检查XML是否有节点或是否为空文件？ xslt

XPath/XSLT嵌套谓词：如何获取外部谓词的上下文？ xslt xpath

Xslt 明钦族 xslt

Xslt 如何有条件地更改属性以检查其同级'；s值多少？ xslt xpath

Xslt <；xsl:select的值="&引用/&燃气轮机；基于其子节点值的节点 xslt

HTML使用xslt转义一些xml标记 xslt

XSLT乘法返回不正确的结果 xslt

如何使用XSLT1.0从XML中提取字段？ xslt

为什么XSLT会意外地更改上下文？ xslt xpath

Xslt Amazon xml响应的类型破坏了php xsl xslt amazon-web-services

Xslt XSL选择第一行纯文本 xslt

xslt将相同的节点分组到父节点下以及子节点中 xslt

XSLT中的数据转换 xslt

Xslt 基于节点数拆分XML文件 xslt

Xslt 使用HTML字符的前XSL子字符串 xslt rss sharepoint-2013

Xslt Schematron使用Include失败，规则匹配不明确 xslt

Xslt 按编辑属性对元素排序 xslt

如何对相邻的“；p”；根据属性输入“；内容类型=”；Sta_index2和#x201C”；-XSLT xslt

[python]相关推荐

Python 如何更好地设置骰子程序以获得更好的输出
Python Python 3.x

Python 类错误缩进？但是在哪里呢？
Python Python 2.7

Python-将结果值导出到文本文件
Python

Python 读取具有字段长度限制的记录
Python String Parsing

Python重新替换添加了额外的反斜杠字符
Python Regex

Python 将混合格式的字符串日期转换为历元
Python Python 2.7

Python 如何用脚本处理参数
Python Python 3.x

Python 字符串正则表达式
Python Regex Python 2.7 Python 3.x

Python 将不同类型的多个变量写入文本文件
Python Text

Python 重塑数据帧（使用R或
Python R Dataframe

Python 没有名为request的模块
Python Django Opencv

Python 如何将文件移动到zip存档的根目录中？
Python

基本Python文件和打印问题
Python Python 3.x

Python dataframe中的值为13，但并不总是可以识别
Python Python 3.x Pandas Dataframe

Python 在txt中查找正好位于其他值之前的值
Python

Python 如何在嵌套列表中打印每个句子而不重复上一个句子
Python

Python 如何读取数组中的每个元素并将其分配给某个对象
Python

将Python列表转换为JSON
Python Json List

Python 跳过int 0也会跳过False
Python

Python py脚本将值写入文件（fobj）永远不会'；不要更新文件
Python

Python3：列表和字典。我如何创建一个字典，告诉我每个项目都来自哪个列表？
Python Python 3.x List Function Dictionary

Python 在元组列表中删除类似的单词
Python String List

Python Regex re.match（）在应该时不返回match
Python Regex Python 3.x String Parsing

Python 如何映射熊猫系列中的字符串
Python Pandas Dataframe

Python不检测模块
Python Python 3.x

当我有多个.py文件一起工作时，如何将python项目转换为可执行文件？
Python Python 3.x

Python 属性错误：'；网格绘图仪'；对象没有属性'；tk'；
Python Tkinter

我用python编写的程序有什么问题？
Python Dictionary

Python 华为的paramiko路由器
Python

Python 为Django Admin中的文件添加导出格式
Python Django

Tags

Seo Build Css Curl Google Cloud Dataflow Numpy Npm Liferay Webgl Enums Discord.py Service Vmware Cocoa Ipad Ignite Assembly Computer Science Next.js Tfs Hash Arrays Scala Angular Material C# Pagination Maps Codeigniter Nginx Mapbox Webview Parallel Processing Spring Mvc Apache Nifi Inno Setup Breeze X86 Orm Xcode Continuous Integration Razor Command Line Joomla Less Mediawiki Generics Google Drive Api Triggers Google Analytics Axapta Maven 2 Clearcase Sqlalchemy Rally Csv Nest Functional Programming Windows 8 Javafx 2 Mdx Nunit Linux Lotus Notes Notifications Phpmyadmin Compiler Construction Concurrency D3.js Yocto Hazelcast Chart.js Struct Typescript Opengl Es Sap Vim Autohotkey Memory Leaks Asp.net Web Api Macros Web Applications Windows Orchardcms Log4j Mapreduce Jdbc Grid Autodesk Forge Https 3d C Activerecord Isabelle Flutter Fiware Vhdl Selenium Karate Quickbooks Loops Couchbase Streaming Sapui5 Xamarin Xslt Glsl Cygwin Directx Vba Blackberry Prestashop Blazor Visual Studio 2013 Python Sphinx Struts2 Meteor Alfresco Dataframe Junit Uitableview Utf 8 Amazon Cloudformation .net 4.0 Internet Explorer Ethereum Gatsby Xaml Zsh Stripe Payments Web Services Gremlin Syntax Io Cassandra Sails.js Tabs Tinymce Timer Compression Docker Compose Pdf Sockets Embedded Angular File Io Url Rewriting Maven Express Scikit Learn Blockchain Sql Server Sed Codenameone Apache Storm Visual Studio 2017 Jenkins Ruby On Rails 3.1 Deep Learning Ibm Mobilefirst Prometheus Antlr Antlr4 Flash Llvm Winforms Java 8 Applescript Math Google Chrome Devtools Jquery Oracle Apex Jupyter Notebook Jms Debugging Sip Proxy Knockout.js Internet Explorer 8 Jar Spring Magento Synchronization Google Bigquery Visual Studio Code Geolocation Composer Php Moodle Transactions Protractor Debian Function Corda Artificial Intelligence Kotlin Data Structures Jira Mobile Stanford Nlp Twilio Firefox Xsd

Copyright © 2024. All Rights Reserved by - Fatal编程技术网