Python LXML杀死我的CDATA分区
我正在批量转换大量XML文件,将其字符编码更改为UTF-8:Python LXML杀死我的CDATA分区,python,xml,lxml,cdata,Python,Xml,Lxml,Cdata,我正在批量转换大量XML文件,将其字符编码更改为UTF-8: with open(source_filename, "rb") as source: tree = etree.parse(source) with open(destination_filename, "wb") as destination: tree.write(destination, encoding="UTF-8", xml_declaration=True) 不幸的是,它正在破坏我的
with open(source_filename, "rb") as source:
tree = etree.parse(source)
with open(destination_filename, "wb") as destination:
tree.write(destination, encoding="UTF-8", xml_declaration=True)
不幸的是,它正在破坏我的CDATA
部分,而只是转义它们
来源:
<d><![CDATA[áÌÀøÅàùÑÄéú ëÌÄé áÈàÅùÑ éäå''ä ðÄùÑÀôÌÈè <small><small>(ùí ëå èæ)</small></small>
<d>בְּרֵאשִׁית כִּי בָאֵשׁ יהו''ה נִשְׁפָּט <small><small>(שם כו טז)</small></small>
(ùíåèèèè)
目的地:
<d><![CDATA[áÌÀøÅàùÑÄéú ëÌÄé áÈàÅùÑ éäå''ä ðÄùÑÀôÌÈè <small><small>(ùí ëå èæ)</small></small>
<d>בְּרֵאשִׁית כִּי בָאֵשׁ יהו''ה נִשְׁפָּט <small><small>(שם כו טז)</small></small>
בְּרֵאשִׁית כִּי בָאֵשׁ יהו''ה נִשְׁפָּטsmallsmall(שםכטזז)/小/小
是否有一个我可以设置的设置,它会告诉我不要使用CDATA分区?我主要使用LXML来更改字符编码并正确写入XML头。使用: