使用Python将XML转换为csv文件
我正在尝试将xml文件转换为csv。该文件如下所示:使用Python将XML转换为csv文件,python,xml,csv,Python,Xml,Csv,我正在尝试将xml文件转换为csv。该文件如下所示: <?xml version="1.0" encoding="UTF-8" standalone="no"?> <tns:FlxPtn xmlns:ts2c="http://interop.covea.fr/Covea-Flx-TypesS2C-009" xmlns:tns="http://interop.covea.fr/Covea-A
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<tns:FlxPtn xmlns:ts2c="http://interop.covea.fr/Covea-Flx-TypesS2C-009" xmlns:tns="http://interop.covea.fr/Covea-App-PolRntVie-024" xmlns:rf2c="http://interop.covea.fr/Covea-Referentiel" xmlns:cov="http://interop.covea.fr/Covea-FlxPtn-002" xmlns:fs2c="http://interop.covea.fr/Covea-Flx-EneFncS2C-007" xsi:schemaLocation="http://interop.covea.fr/Covea-Flx-TypesS2C-009 Covea-Flx-TypesS2C-009.xsd http://interop.covea.fr/Covea-App-PolRntVie-024 S2C_XSD_VIGERIRVES_V10.0_024_MMA_124.xsd http://interop.covea.fr/Covea-App-PolRntVie-024 S2C_XSD_VIGERIRVES_V10.0_024.xsd http://interop.covea.fr/Covea-Referentiel Covea-Referentiel.xsd http://interop.covea.fr/Covea-FlxPtn-002 Covea-FlxPtn-002.xsd http://interop.covea.fr/Covea-Flx-EneFncS2C-007 Covea-Flx-EneFncS2C-007.xsd" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<cov:DonEneTch>
<cov:IdVrsEne>002</cov:IdVrsEne>
<cov:IdFlx>1V400220191231VIGERIRVESMMA1</cov:IdFlx>
<cov:TsCraFlx>2020-01-13 10.02.13.000000</cov:TsCraFlx>
<cov:IdEmtFlx>MMA</cov:IdEmtFlx>
<cov:IdRctFlx>MMA</cov:IdRctFlx>
<cov:TyFlx>NOTIF</cov:TyFlx>
<cov:TyTrtFlx>1</cov:TyTrtFlx>
<cov:AquFlx>0</cov:AquFlx>
<cov:EvrExu>PRODUCTION</cov:EvrExu>
<cov:IdApnEmt>MMA</cov:IdApnEmt>
<cov:AcnApl>VIGERIRVES</cov:AcnApl>
<cov:IdVrsFlx>124</cov:IdVrsFlx>
<cov:IdTrtEmt>Rentes Vie</cov:IdTrtEmt>
<cov:IdUtl>TTBATCH</cov:IdUtl>
<cov:VrsCbl></cov:VrsCbl>
<cov:ChpLbr>1V400220191231VIGERIRVESMMA1377900</cov:ChpLbr>
</cov:DonEneTch>
<tns:DonMet>
<tns:DonEneFnc>
<fs2c:CodSocJur>1V4002</fs2c:CodSocJur>
<fs2c:DatArr>20191231</fs2c:DatArr>
<fs2c:TypFiColl>VIGERIRVES</fs2c:TypFiColl>
<fs2c:TimStmCreFic>2020-01-13 10.02.13.000000</fs2c:TimStmCreFic>
<fs2c:CodEns>MMA</fs2c:CodEns>
</tns:DonEneFnc>
<tns:PolRntVie>
<tns:NumEnr>20191290</tns:NumEnr>
<tns:NumPol>050000111901</tns:NumPol>
<tns:PMVie>997.75</tns:PMVie>
</tns:PolRntVie>
<tns:PolRntVie>
<tns:NumEnr>20191291</tns:NumEnr>
<tns:NumPol>050000112002</tns:NumPol>
<tns:PMVie>4385.15</tns:PMVie>
</tns:PolRntVie>
这给了我一个只有列名称的空DF
最后我想得到的是下表:
Numenr NumPol PMVie
20191290 050000111901 997.75
20191291 050000112002 4385.15
如果您有任何这样做的想法,我将非常感谢您可以使用
beautifulsoup
如下所示:
from bs4 import BeautifulSoup
import csv
with open('rente.xml') as f_input:
soup = BeautifulSoup(f_input, "lxml")
with open('b.csv', 'w', newline='', encoding='utf-8') as f_output:
csv_output = csv.writer(f_output)
csv_output.writerow(['Numenr', 'NumPol', 'PMVie'])
for tns in soup.find_all("tns:polrntvie"):
csv_output.writerow(tns.find(entry).text for entry in ['tns:numenr', 'tns:numpol', 'tns:pmvie'])
这将为您提供一个b.csv
文件,其中包含:
Numenr、NumPol、PMVie
20191290,050000111901,997.75
20191291,050000112002,4385.15
如果项目缺失:
from bs4 import BeautifulSoup
import csv
with open('rente.xml') as f_input:
soup = BeautifulSoup(f_input, "lxml")
with open('b.csv', 'w', newline='', encoding='utf-8') as f_output:
csv_output = csv.writer(f_output)
csv_output.writerow(['Numenr', 'NumPol', 'PMVie'])
for tns in soup.find_all("tns:polrntvie"):
row = []
for entry in ['tns:numenr', 'tns:numpol', 'tns:pmvie']:
try:
row.append(tns.find(entry).text)
except AttributeError:
row.append('')
csv_output.writerow(row)
非常感谢,如果我可以滥用它,它工作得很好…我试图在更大的文件上复制代码,但出现以下错误:AttributeError:'NoneType'对象没有属性'text'。你知道pb背后的含义吗?这可能意味着并非所有条目都包含所有项目。它将需要测试丢失的条目。谢谢,我会搜索这个。我已经添加了一个版本,可以处理丢失的条目为您,希望这应该在您的更大的文件工作。
from bs4 import BeautifulSoup
import csv
with open('rente.xml') as f_input:
soup = BeautifulSoup(f_input, "lxml")
with open('b.csv', 'w', newline='', encoding='utf-8') as f_output:
csv_output = csv.writer(f_output)
csv_output.writerow(['Numenr', 'NumPol', 'PMVie'])
for tns in soup.find_all("tns:polrntvie"):
row = []
for entry in ['tns:numenr', 'tns:numpol', 'tns:pmvie']:
try:
row.append(tns.find(entry).text)
except AttributeError:
row.append('')
csv_output.writerow(row)