Python 在pandas中解析XML_Python_Xml_Pandas - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/0/xml/14.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/0/mercurial/2.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 在pandas中解析XML_Python_Xml_Pandas - Fatal编程技术网

Python 在pandas中解析XML

python xml pandas

Python 在pandas中解析XML,python,xml,pandas,Python,Xml,Pandas,我有一个xml文件列表，我想在每个文件中获得两个值，以便为数据帧创建索引。我使用for循环来实现这一点，因为我有大约1000个文件，这并没有那么大，我想计算这些文件上的一些特性以存储在数据帧中例如，第一个文件如下所示： <?xml version="1.0" encoding="utf-8"?> <tag1> <tag2> <tag3> <author>The author</au

我有一个xml文件列表，我想在每个文件中获得两个值，以便为数据帧创建索引。我使用for循环来实现这一点，因为我有大约1000个文件，这并没有那么大，我想计算这些文件上的一些特性以存储在数据帧中

例如，第一个文件如下所示：

<?xml version="1.0" encoding="utf-8"?>
<tag1>
    <tag2>
        <tag3>
            <author>The author</author>
            <title> The title </title>
        </tag3>
    </tag2>
</tag1>

我的问题是，由于文件之间的结构始终相同（标记数相同），因此标记的名称可能会从一个文件更改为另一个文件，例如：

<?xml version="1.0" encoding="utf-8"?>
<tag_1>
    <secondtag>
        <tag3>
            <author>The second author</author>
            <title> The second title </title>
        </tag3>
    </secondtag>
</tag_1>


第二作者
第二个标题

如何访问作者和标题而不事先知道标记的名称？

使用查找子节点而不是直接路径

如果它们总是作者标签和标题标签，只需在任何地方搜索即可？例如：

.xpath（'//author'）

？我总是有

author

和title

标记，但是当我尝试你的方法时，它不起作用，因为树（'//author'）
返回了一个空列表，所以列表索引超出了范围。尽管如此，我已经看到，被指控的文件是第一个带有以下标记的文件：。可能是问题吗？啊。。。。因此，您需要更改xpath以包含名称空间，然后。。。
<?xml version="1.0" encoding="utf-8"?>
<tag_1>
    <secondtag>
        <tag3>
            <author>The second author</author>
            <title> The second title </title>
        </tag3>
    </secondtag>
</tag_1>




[xml]相关文章推荐



                                                        
                                       





随机文章推荐


                                        

                                        
                                        


                                                
                                                        [python]相关推荐
                                                        
Python 用scipy.optimize提问
									Python
							 
Python大型应用程序：根据给定的请求，如何使用可能需要或不需要的其他模块？
									Python
							 
如何连接一组独立的Python程序
									Python
							 
Python 隐藏一个面板并显示另一个面板后指向另一个功能的按钮
									Python
							 									Wxpython
							 
Python Django模板重新组合意外结果
									Python
							 									Django
							 
从python中的嵌套dict查找返回默认值
									Python
							 									Dictionary
							 
将非英语操作系统更改为仅在Python中输出英语
									Python
							 									Qt
							 
Python 校验和串行消息
									Python
							 
Python 代码类似于智囊团
									Python
							 									Python 2.7
							 
Python RuntimeWarning:在longlong\u标量中遇到无效值
我想做什么
									Python
							 									Pandas
							 
PythonSignXML"；ValueError：无法取消序列化关键数据；
									Python
							 
Python 类型错误：不可损坏的类型：'；numpy.ndarray和#x27；张量流
									Python
							 									Tensorflow
							 
Python 转置列时按唯一值分组
									Python
							 									Python 2.7
							 									Pandas
							 
VBA Shell函数无法执行Python脚本
									Python
							 									Vba
							 									Excel
							 									Shell
							 
Python 将变量指定为列表，而不在函数中覆盖它
									Python
							 									List
							 									Recursion
							 
Python Keras与Tensorflow后端-在CPU上运行预测，但适合GPU
									Python
							 									Tensorflow
							 									Keras
							 
Python 可以打印或检查打开了多少数据库连接请求，以及是否关闭了多少数据库连接请求？德扬戈
									Python
							 									Django
							 									Database
							 									Mariadb
							 
Python 可扩展项目的graphql代码组织
									Python
							 									Django
							 									Graphql
							 
在python中创建不重复的随机数列表
									Python
							 
Python 为什么requests.get（）不能在for循环中工作？
									Python
							 
Python 解析XML文件以根据子元素检索父元素
									Python
							 
Python 如何在ctypes中将无符号字节数组转换为base64字符串
									Python
							 
Python 为什么在并行分配h5py数据集时没有输出？
									Python
							 									Parallel Processing
							 
Python ERR_中止404-Django-Static文件
									Python
							 									Django
							 									Apache
							 
Python 请求头给出TypeError:'；环境标题'；对象不可调用
									Python
							 									Python 3.x
							 									Api
							 									Flask
							 
Python scikit RandomForestClassifier-实际结果与预测分数不匹配
									Python
							 									Machine Learning
							 									Scikit Learn
							 
Python KNN图像分类器：权限和内存错误
									Python
							 									Image
							 
Python 我可以返回一个操作作为带有if…else条件的lambda函数的输出吗？
									Python
							 									List
							 									If Statement
							 
Python ttk按钮。将参数传递给OnClick
									Python
							 									Tkinter
							 									Lambda
							 
Python 我想打印我在Django中传递到HTML页面的字典的键和值
									Python
							 									Html
							 									Django
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Junit
Editor
Configuration
Streaming
Shopify
Localization
Ios7
Keras
Pandas
Here Api
Biztalk
Prometheus
Sms
Snowflake Cloud Data Platform
Xml
Vbscript
Content Management System
Less
Hibernate
Sencha Touch
Swift
Dojo
Odata
Twitter
Parallel Processing
Sprite Kit
Data Binding
Interface
Artifactory
Elm
Drupal
Keyboard
Cmd
Abap
Unicode
Responsive Design
Flutter
Smtp
Design Patterns
Jvm
Linq To Sql
Nhibernate
Jetty
Java Me
Serial Port
Eclipse Plugin
Robotframework
Date
.htaccess
Graph
Selenium
Db2
Pascal
Algorithm
Dll
Matplotlib
Xpath
Ruby On Rails 4
Blackberry
Spring Boot
Compilation
Dependency Injection
Machine Learning
Clojure
Graphql
Powershell
Project Management
Next.js
Download
Gradle
Prolog
Teradata
Web Crawler
Stanford Nlp
Moodle
Continuous Integration
Ravendb
Titanium
Spring
Jsf 2
Coding Style
Grafana
Bluetooth
Dotnetnuke
Angular
Formatting
Events
Twilio
Knockout.js
Tabs
Sql Server 2012
X86
Model
R
Class
Ldap
Tfs
Pip
Reporting Services
Rss
Jhipster
Firefox Addon
Generics
Grails
Canvas
Jupyter Notebook
Winforms
Teamcity
Automated Tests
Dictionary
Xpages
Meteor
.net
Oracle10g
Visual Studio 2015
Asp.net Mvc
Twitter Bootstrap 3
Snmp
Javafx
Google Chrome Devtools
Breeze
Jar
Activemq
Search
Websocket
Ruby On Rails 3.1
Debian
Pentaho
Api
Pycharm
Omnet++
Composer Php
Swift2
Character Encoding
Uiview
Curl
Kibana
Hyperledger Fabric
Jsp
Oracle11g
.net 4.0
Openshift
3d
Functional Programming
Autocomplete
Paypal
Opengl
Maven
Struts2
Terraform
Nest
Extjs4
Zend Framework
Google Maps Api 3
Plugins
Multithreading
Symfony
Dns
Amazon Dynamodb
Discord.py
Compression
Mpi
Architecture
Asp.net Core
Url
Computer Vision
Silverstripe
Angularjs
Amazon Cloudformation
Rx Java
Process
D
File Upload
Swift3
Doctrine Orm
Azure Functions
Julia
Ecmascript 6
Visual Studio 2013
Hive
Bison
Google Visualization
Activerecord
Iphone
Gwt
Youtube
Function
Oauth 2.0
Web Scraping
Odoo
Ignite
Merge
Gatsby
Node.js
Azure Data Factory
Grep
Oracle Apex
Jquery
Mapping
Push Notification
C++11


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网