String NLTK：将原始字符串转换为普通字符串_String_Python 3.x_Nlp_Nltk

String NLTK：将原始字符串转换为普通字符串

string python-3.x nlp

String NLTK：将原始字符串转换为普通字符串,string,python-3.x,nlp,nltk,String,Python 3.x,Nlp,Nltk,我从NLTK的路透社语料库中得到了一篇文章，作为原始字符串 >>>from nltk.corpus import reuters >>>retail_article = reuters.raw('training/8173') >>>print (retail_article) "FRENCH GDP SHOULD RISE 2.3 PCT IN 1988 - MINISTRY\n French gross domestic prod

我从NLTK的路透社语料库中得到了一篇文章，作为原始字符串

>>>from nltk.corpus import reuters
>>>retail_article = reuters.raw('training/8173')
>>>print (retail_article)

"FRENCH GDP SHOULD RISE 2.3 PCT IN 1988 - MINISTRY\n  French gross domestic 
 product should grow\n  by 2.3 pct in 1988 after two pct growth this year 
 and 2.1 pct\n  in 1986, the Finance Ministry said.\n      The latest 
 forecast,...

我希望文本在单独的一行中，而不是

\n

。我该怎么做呢

>>>type(retail_article)
str

decode（）

对字符串不起作用，

encode（）

没有给出所需的结果

TIA

您显示的输出不是通过写入

打印（零售文章）

生成的。很明显，您只是在解释器提示下键入了

retail\u article

。这种差异是显著的，它似乎是你困惑的根源

不需要转换。“原始”字符串是编写文本字符串的一种方式。语料库阅读器的

raw（）

方法不返回“原始字符串”，它只返回一个字符串。（方法名称表示未进行任何处理；返回文件的确切内容。）

您的字符串

retail\u文章

包含实际的换行符，而不是

，

序列。要查看换行符，请使用

打印（零售文章）

。当您在命令提示下编写

retail\u article

时，您会得到字符串的明确表示形式（所谓的“repr”形式）——您正在查看的内容。

您显示的输出不是通过编写

print（retail\u article）