Python 如何处理unicode?
我有以下代码从xml文件中剥离白线:Python 如何处理unicode?,python,xml,unicode,Python,Xml,Unicode,我有以下代码从xml文件中剥离白线: #working for all files in dir. from pathlib import Path for path in Path(r'attachments\75090058\Status\XMLsend').glob('*.xml'): with open(path, 'r+') as f: lines = f.readlines() f.seek(0, 0) for line i
#working for all files in dir.
from pathlib import Path
for path in Path(r'attachments\75090058\Status\XMLsend').glob('*.xml'):
with open(path, 'r+') as f:
lines = f.readlines()
f.seek(0, 0)
for line in lines:
if line.strip() != '':
f.write(line)
f.truncate()
它工作得很好,但是现在我发现以下错误:
UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 1291: character maps to <undefined>
UnicodeDecodeError:“charmap”编解码器无法解码位置1291:字符映射到的字节0x9d
怎么办
请帮忙 尝试将文本编码为
UTF-8格式
请尝试以下代码:
from pathlib import Path
for path in Path(r'attachments\75090058\Status\XMLsend').glob('*.xml'):
with open(path, 'r+', encoding='utf-8') as f:
lines = f.readlines()
f.seek(0, 0)
for line in lines:
if line.strip() != '':
f.write(line)
f.truncate()
请澄清标题并显示发生错误的完整回溯。问题字符是在文件名中还是在其内容中?为什么你得出结论说这和Unicode有什么关系?(可能是的,但这里没有告诉我们怎么做。)抱歉@tripleee我对UniCodePerhaps的了解有限请参见