How does Python handle Unicode?


I have the following code that strips blank lines from XML files:

#working for all files in dir.
from pathlib import Path


for path in Path(r'attachments\75090058\Status\XMLsend').glob('*.xml'):
    with open(path, 'r+') as f:
        lines = f.readlines()
        f.seek(0, 0)
        for line in lines:
            if line.strip() != '':
                f.write(line)
        f.truncate()
It worked fine, but now I get the following error:

UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 1291: character maps to <undefined>
What should I do? Please help.

Try reading the file as UTF-8 by passing an explicit encoding to open().
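
For context, here is a minimal sketch of why the error probably appears. It assumes the XML files are actually UTF-8 and that Python fell back to a Windows code page such as cp1252 when no encoding was given (the question does not state the platform, so this is a guess). The sample string and the helper name try_decode are made up for illustration. In UTF-8, a right curly quote ends in the byte 0x9d, which cp1252 leaves undefined.

```python
# Assumption: the file content is UTF-8 and the default encoding is cp1252.
# The sample text is invented; it just contains curly quotes.
text = "He said \u201chello\u201d"
data = text.encode("utf-8")  # the closing quote becomes b'\xe2\x80\x9d'


def try_decode(raw, encoding):
    """Return the decoded text, or a short message if decoding fails."""
    try:
        return raw.decode(encoding)
    except UnicodeDecodeError as exc:
        return f"failed at byte {exc.start}: {exc.reason}"


result_cp1252 = try_decode(data, "cp1252")  # hits the undefined byte 0x9d
result_utf8 = try_decode(data, "utf-8")     # decodes cleanly

print(result_cp1252)
print(result_utf8)
```

If the files turn out not to be UTF-8, the same check can be repeated with whatever encoding they were really written in.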

Try the following code:

from pathlib import Path

for path in Path(r'attachments\75090058\Status\XMLsend').glob('*.xml'):
    with open(path, 'r+', encoding='utf-8') as f:
        lines = f.readlines()
        f.seek(0, 0)
        for line in lines:
            if line.strip() != '':
                f.write(line)
        f.truncate()
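
The same idea can also be sketched as a small reusable function. This is only one possible variant, not the code from the question: strip_blank_lines is a hypothetical helper name, it reads and rewrites the file in two separate steps instead of using 'r+' with truncate(), and it still assumes the files are UTF-8. The demo runs on a temporary file with invented content.

```python
import tempfile
from pathlib import Path


def strip_blank_lines(path, encoding="utf-8"):
    """Rewrite the file without blank or whitespace-only lines.

    Returns the number of lines removed. The encoding default is an
    assumption; pass the real encoding if the files are not UTF-8.
    """
    path = Path(path)
    # newline="" keeps the original line endings untouched.
    with open(path, "r", encoding=encoding, newline="") as f:
        lines = f.readlines()
    kept = [line for line in lines if line.strip()]
    with open(path, "w", encoding=encoding, newline="") as f:
        f.writelines(kept)
    return len(lines) - len(kept)


# Demo on a throwaway file (sample content invented for illustration).
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "sample.xml"
    sample.write_text("<a>\n\n  <b>\u201cx\u201d</b>\n   \n</a>\n", encoding="utf-8")
    removed = strip_blank_lines(sample)
    cleaned = sample.read_text(encoding="utf-8")

print(removed)
print(cleaned)
```

To process a whole directory, the function could be called inside the same Path(...).glob('*.xml') loop used above.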

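Please clarify the title and show the full traceback where the error occurs. Is the problematic character in the file name or in the file contents? Why did you conclude that this has anything to do with Unicode? (It probably does, but nothing here tells us how.)

Sorry @tripleee, my knowledge of Unicode is limited.

Perhaps see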