Python lxml:如何使用xpath选择器获取XML标记名?
我正在尝试使用Python和Python lxml:如何使用xpath选择器获取XML标记名?,python,xml,xpath,lxml,Python,Xml,Xpath,Lxml,我正在尝试使用Python和lxml解析以下XML: <?xml version="1.0" encoding="UTF-8"?> <?xml-stylesheet type="text/xsl" href="/bind9.xsl"?> <isc version="1.0"> <bind> <statistics version="2.2"> <memory> <summary&
lxml
解析以下XML:
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="/bind9.xsl"?>
<isc version="1.0">
<bind>
<statistics version="2.2">
<memory>
<summary>
<TotalUse>1232952256
</TotalUse>
<InUse>835252452
</InUse>
<BlockSize>598212608
</BlockSize>
<ContextSize>52670016
</ContextSize>
<Lost>0
</Lost>
</summary>
</memory>
</statistics>
</bind>
</isc>
我已经设法提取了元素值,但是我无法找到xpath表达式来获取元素标记名
示例脚本:
from lxml import etree as et
def main():
xmlfile = "bind982.xml"
location = "bind/statistics/memory/summary/*"
label_selector = "??????" ## what to put here...?
value_selector = "text()"
with open(xmlfile, "r") as data:
xmldata = et.parse(data)
etree = xmldata.getroot()
statlist = etree.xpath(location)
for stat in statlist:
label = stat.xpath(label_selector)[0]
value = stat.xpath(value_selector)[0]
print "{0}: {1}".format(label, value)
if __name__ == '__main__':
main()
我知道我可以使用value=stat.tag
而不是stat.xpath()
,但是脚本必须具有足够的通用性,以处理标签选择器不同的其他XML片段
哪个xpath选择器将返回元素的标记名?我认为这两个值不需要xpath,元素节点具有属性
标记
和文本
,因此请使用列表:
[(element.tag, element.text) for element in etree.xpath(location)]
或者如果您真的想使用XPath
result = [(element.xpath('name()'), element.xpath('string()')) for element in etree.xpath(location)]
当然,您也可以构建字典列表:
result = [{ element.tag : element.text } for element in root.xpath(location)]
或
只需使用XPath的
name()
,并删除零索引,因为这将返回字符串而不是列表
from lxml import etree as et
def main():
xmlfile = "ExtractXPathTagName.xml"
location = "bind/statistics/memory/summary/*"
label_selector = "name()" ## what to put here...?
value_selector = "text()"
with open(xmlfile, "r") as data:
xmldata = et.parse(data)
etree = xmldata.getroot()
statlist = etree.xpath(location)
for stat in statlist:
label = stat.xpath(label_selector)
value = stat.xpath(value_selector)[0]
print("{0}: {1}".format(label, value).strip())
if __name__ == '__main__':
main()
输出
TotalUse: 1232952256
InUse: 835252452
BlockSize: 598212608
ContextSize: 52670016
Lost: 0
from lxml import etree as et
def main():
xmlfile = "ExtractXPathTagName.xml"
location = "bind/statistics/memory/summary/*"
label_selector = "name()" ## what to put here...?
value_selector = "text()"
with open(xmlfile, "r") as data:
xmldata = et.parse(data)
etree = xmldata.getroot()
statlist = etree.xpath(location)
for stat in statlist:
label = stat.xpath(label_selector)
value = stat.xpath(value_selector)[0]
print("{0}: {1}".format(label, value).strip())
if __name__ == '__main__':
main()
TotalUse: 1232952256
InUse: 835252452
BlockSize: 598212608
ContextSize: 52670016
Lost: 0