UnicodeDecodeError with pyrouge in Python 3.x


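I want to evaluate my summaries. pyrouge is a Python wrapper for the ROUGE summarization evaluation package.

The problem appears when I use pyrouge to evaluate my summaries. I wrote the following code: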

from pyrouge import Rouge155

r = Rouge155()
r.system_dir = '/Users/raghad/Desktop/Articles/system_summaries/'
r.model_dir = '/Users/raghad/Desktop/Articles/model_summaries/'
r.system_filename_pattern = r'summary.(\d+).txt'
r.model_filename_pattern = 'summary.[A-Z].#ID#.txt'

output = r.convert_and_evaluate()
print(output)
output_dict = r.output_to_dict(output)
But I got this error:

2018-08-10 19:05:34,194 [MainThread  ] [INFO ]  Writing summaries.
2018-08-10 19:05:34,197 [MainThread  ] [INFO ]  Processing summaries. Saving system files to /var/folders/yq/dcjh2lr14676s_8kl6s6vjgc0000gn/T/tmptyzxws4q/system and model files to /var/folders/yq/dcjh2lr14676s_8kl6s6vjgc0000gn/T/tmptyzxws4q/model.
2018-08-10 19:05:34,197 [MainThread  ] [INFO ]  Processing files in /Users/raghad/Desktop/Articles/system_summaries/.
2018-08-10 19:05:34,197 [MainThread  ] [INFO ]  Processing .DS_Store.
Traceback (most recent call last):
  File "rouge3.py", line 12, in <module>
    output = r.convert_and_evaluate()
  File "/usr/local/lib/python3.6/site-packages/pyrouge/Rouge155.py", line 360, in convert_and_evaluate
    self.__write_summaries()
  File "/usr/local/lib/python3.6/site-packages/pyrouge/Rouge155.py", line 489, in __write_summaries
    self.__process_summaries(self.convert_summaries_to_rouge_format)
  File "/usr/local/lib/python3.6/site-packages/pyrouge/Rouge155.py", line 482, in __process_summaries
    process_func(self._system_dir, new_system_dir)
  File "/usr/local/lib/python3.6/site-packages/pyrouge/Rouge155.py", line 202, in convert_summaries_to_rouge_format
    input_dir, output_dir, Rouge155.convert_text_to_rouge_format)
  File "/usr/local/lib/python3.6/site-packages/pyrouge/utils/file_utils.py", line 29, in process
    input_string = f.read()
  File "/usr/local/Cellar/python/3.6.5/Frameworks/Python.framework/Versions/3.6/lib/python3.6/codecs.py", line 700, in read
    return self.reader.read(size)
  File "/usr/local/Cellar/python/3.6.5/Frameworks/Python.framework/Versions/3.6/lib/python3.6/codecs.py", line 503, in read
    newchars, decodedbytes = self.decode(data, self.errors)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x86 in position 23: invalid start byte
What should I do?
My default encoding is UTF-8.

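If you are running this on macOS, the folder will contain a hidden file named ".DS_Store". Your log shows "Processing .DS_Store." immediately before the traceback: pyrouge tries to read that binary file as UTF-8 text and fails. Remove it.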
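A minimal sketch of that cleanup, in case you prefer to do it from Python rather than by hand. The helper name `remove_ds_store` is hypothetical and not part of pyrouge; it only deletes a file named exactly ".DS_Store" in the directory you pass in.

```python
from pathlib import Path


def remove_ds_store(directory):
    """Delete the .DS_Store file in `directory` if it exists.

    Returns True if a file was removed, False if there was nothing to remove.
    This is an illustrative helper, not a pyrouge function.
    """
    target = Path(directory) / ".DS_Store"
    if target.is_file():
        target.unlink()
        return True
    return False
```

You would call it once for each of the two directories from the question (`system_summaries` and `model_summaries`) before `r.convert_and_evaluate()`. Note that Finder may recreate the file the next time you open the folder.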

If you are on a different system, look for hidden files in the same directory.
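One way to look for such files is sketched below. It assumes that "hidden" means a name starting with a dot (the macOS/Linux convention; Windows marks hidden files differently), and it also flags any file whose bytes are not valid UTF-8, since that is what triggers the error in the traceback. The function name `find_problem_files` is hypothetical.

```python
from pathlib import Path


def find_problem_files(directory):
    """Return the names of files that are dot-hidden or not valid UTF-8.

    Illustrative helper, not part of pyrouge. Subdirectories are skipped.
    """
    problems = []
    for path in sorted(Path(directory).iterdir()):
        if not path.is_file():
            continue
        if path.name.startswith("."):
            # Dot-files such as .DS_Store are hidden on macOS/Linux.
            problems.append(path.name)
            continue
        try:
            # Same decoding step that fails inside pyrouge's file reader.
            path.read_bytes().decode("utf-8")
        except UnicodeDecodeError:
            problems.append(path.name)
    return problems
```

Run it on both the system and model summary directories; anything it reports should be removed, moved elsewhere, or re-saved as UTF-8 before evaluating.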