跳绳；嵌套标记"；使用Python解析XML时_Python_Xml_Elementtree - Fatal编程技术网

跳绳；嵌套标记"；使用Python解析XML时

python xml

跳绳；嵌套标记"；使用Python解析XML时,python,xml,elementtree,Python,Xml,Elementtree,我目前有一个XML文件，我想用Python解析它。我正在使用Python的元素树，它工作得很好，只是我有一个问题该文件当前看起来像： <Instance> <TextContent> <Sentence>Hello, my name is John and his <Thing>name</Thing> is Tom.</Sentence> </TextContent> <Instance

我目前有一个XML文件，我想用Python解析它。我正在使用Python的元素树，它工作得很好，只是我有一个问题

该文件当前看起来像：

<Instance>
  <TextContent>
    <Sentence>Hello, my name is John and his <Thing>name</Thing> is Tom.</Sentence>
  </TextContent>
<Instance>

如何获取嵌套标记后面的文本部分

更好的是，有没有一种方法可以完全忽略嵌套标记

提前谢谢。

我稍微更改了您的源XML文件，因此句子中包含两个子元素：

<Instance> <TextContent> <Sentence>Hello, my <Thing>name</Thing> is John and his <Thing>name</Thing> is Tom.</Sentence> </TextContent> </Instance>
要查看所有直接子体文本节点的列表，请运行：

lst = list(allTextNodes(st))
结果是：

['Hello, my ', ' is John and his ', ' is Tom.']
但要将连接的文本作为单个变量，请运行：

txt = ''.join(allTextNodes(st))
获取：
您好，我的是约翰，他的是汤姆。
（注意双空格，
“环绕”两个省略的事物元素。
文本部分（“是汤姆”）在
结束标记之后出现的是该元素的
尾部。请参见，谢谢！我没有意识到这一点。不幸的是，有多个嵌套标记，因此如果有办法完全忽略它们，也会很有帮助，但似乎我必须手动编写规则。请尝试itertext（）：您是否可以编辑您的问题以添加一个包含多个嵌套标记的示例，以及所需的输出？ ['Hello, my ', ' is John and his ', ' is Tom.'] txt = ''.join(allTextNodes(st))

[xml]相关文章推荐

Xml 为什么人们还在创建RSS提要？ xml rss

在不循环的情况下将XmlNodeList加载到XmlDocument中？ xml vb.net

合并XML文档 xml

使用actionscript 2.0获取xml属性 xml

Xml 基于子元素XSLT删除元素 xml xslt

Xml Flex+；数据网格&x2B；选择时动态显示 xml apache-flex

如何卸下'；设计'；Eclipse中xml文档的选项卡 xml eclipse

Xml 如何通过XPath对值进行排序 xml xslt xpath

Xml AS3加载多个图像 xml actionscript-3

XML-XSLT到XML的转换 xml xslt

使用XML和jquery用文本填充滑块 xml jquery

Can'；t使用powershell提取XML节点的值 xml powershell

在vb.net中构建XML文档时，为什么会出现此错误 xml vb.net

Xml 使用Wix在安装程序中插入版权/注册符号 xml wix

为什么我会得到一个“；HTTP状态400-非法请求正文“；这个职位？ POSThttp://anyservice.com/my/servlet/interface/v1/book/events 内容类型：application/xml 接受：应用程序/xml 授权：基本cXRE456ggz 基础知识 2 ； xyz xml http

Xml 如何在运行时指定StreamingMarkupBuilder标记名？ xml groovy

将文本文件转换为XML文件 xml

Golang XML解组问题：本地名称冲突失败 xml go

如何完成xsl文件以将XML转换为其他XML xml xslt

xml序列化格式vb.net xml vb.net serialization

随机文章推荐

Stanford nlp 使用API对Stanford tagger进行培训和再培训 stanford-nlp

Stanford nlp 在Stanford CoreNLP中强制使用POS标记 stanford-nlp

Stanford nlp Tokens regex：最后一个标记/无更多标记的表达式 stanford-nlp

Stanford nlp 为什么在使用引理之前先使用引理 stanford-nlp

[python]相关推荐

是什么使Python成为一种好的脚本语言？
Python Scripting

python中的非局部极大值抑制
Python Opencv Numpy Computer Vision

Python 模块pexpect截断输出
Python

使用Python与正在运行的控制台应用程序交互
Python Linux

Python 导入一个不使用文件名的_init__uu.py模块
Python

Python 下载文件后运行命令
Python File

使用python在列表列表中循环
Python Loops Matplotlib

Python If-else-If语句工作不正常
Python If Statement

Python游戏开发-编程时更新Glut窗口-如Livereload
Python

Python 将dict的dict转换为数据帧
Python Dictionary Pandas

是否可以使用Bigcommerce Python API更新Google产品搜索映射？
Python Api

Python 如何向appspot提供密钥
Python Google App Engine

使用python返回原始url源代码
Python Html

Python 按bs4标记拆分/在两个标记之间获取文本
Python Python 3.x

有没有在windows上使用gradle编译python protobuf的方法？
Python Gradle Protocol Buffers

Python，高效地清除大列表
Python List Data Structures

Python 如何在django rest中自动继承foreignkey模型的字段？
Python Django Database Django Models Django Rest Framework

Python 异常值：“非类型”对象没有属性“添加”，类别名称不显示Django
Python Django

Python torch.cat沿负尺寸
Python Pytorch

Python 使用Pandas将Postgres表中的行写入CSV文件
Python Pandas Postgresql

Python nn.Module的Pytorch子类没有属性“parameters”
Python Python 3.x Neural Network Pytorch

Python 如何使用spaCy编写代码来合并标点符号和短语我想做什么
Python Python 3.x Nlp

Python 在tkinter中调整笔记本选项卡的大小
Python Tkinter

Python 词典中的自我参照或替代
Python Python 3.x

Python AttributeError:“TestCase”对象没有属性“lineno”，当我尝试使用datadriver库运行测试时
Python Robotframework

将两列的值作为键值返回到Python、SQLAchemy或SQL中的列表或dict？
Python Sql Pandas Sqlalchemy

Python 如何将包含多个字典的列表展开到一个数据帧中？
Python Pandas List Dictionary

如何检查表是否包含数据？（Python/MYSQL）
Python Mysql Flask

Python 如何计算列表中的整数范围
Python List

Python正则表达式匹配虽然匹配但不匹配
Python Regex

Tags

Webview Sharepoint 2007 Ipython Azure Sql Database Optimization Phpmyadmin Performance Ssrs 2008 Email Discord.js Xquery Memory Management Design Patterns Events Passwords Google Drive Api Signalr Pytorch Security Report Jquery Mobile Node.js Vector Azure Cosmosdb Migration Sql Server 2012 Redis Jenkins Visual Studio Code Tags Xcode For Loop Paypal Extjs Unit Testing Checkbox Excel Formula Angularjs Geometry Ssh Sphinx Tensorflow Ios7 Tree Tomcat Sqlite Flask Nativescript Netlogo Quickbooks Solr Sequelize.js Xpath Dynamic Lotus Notes Jvm Openstack Jqgrid Sip Html5 Canvas Select Resharper Generics Haskell Swing Youtube Api Eclipse Plugin Mobile Proxy Kdb Apache Flink Assembly Batch File Spring Boot Localization Tcp Uitableview Scrapy Mapping Servlets Html Isabelle Command Line Architecture System Verilog Oauth Intellij Idea Web Scraping Sql Server 2008 R2 Geolocation Sparql Acumatica Video Streaming Kotlin Applescript Extjs4 Canvas Aem Angular6 Rx Java Compression Apache Vmware Jasper Reports Botframework Puppet Ubuntu Object Express Usb Jsf 2 Directory Iis 7 Vb6 Formatting Autodesk Forge Sublimetext2 Gstreamer Javascript Rest Boost Electron Computer Vision Random Compiler Construction Macos Sbt Angular Material .htaccess Streaming Bots Swagger Libgdx .net 4.0 .net Airflow File Upload Text Opencart Perl Sorting Wso2 Iis Sdk Logging Kibana Keyboard Office365 Methods Ada Datetime Ruby On Rails 3.1 Oracle Apex Common Lisp Filter Search Artifactory Arangodb Tcl Apache Kafka Jira Browser Ssis Talend Version Control Drupal Import Prolog Inno Setup Django Rest Framework Speech Recognition Virtual Machine Requirejs Excel Blazor Docker Compose Cordova Racket Button Redux Terminal C Functional Programming Asp.net Core Openlayers Soap Reporting Services .net Core Unicode Directx Vbscript Log4j Combobox Vuejs2 Testng Xamarin.android Dll Azure Concurrency Compilation Https

Copyright © 2024. All Rights Reserved by - Fatal编程技术网