Warning: file_get_contents(/data/phpspider/zhask/data//catemap/7/python-2.7/5.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Python 2.7 如何在python中打印标签_Python 2.7_Nltk - Fatal编程技术网

Python 2.7 如何在python中打印标签

Python 2.7 如何在python中打印标签,python-2.7,nltk,Python 2.7,Nltk,如果我有这样一个字符串: text = "They refuse to permit us." txt = nltk.word_tokenize(text) 如果我打印POS标签,则使用此选项nltk.pos_标记(txt)我得到 [(‘他们’、‘PRP’、(‘拒绝’、‘VBP’)、(‘to’、‘to’、(‘许可’、‘VB’)、(‘我们’、‘PRP’)] 如何仅打印以下内容: ['PRP','VBP','TO','VB','PRP'] 如果你有一个元组列表,你应该遍历它,只得到每个元组的第二

如果我有这样一个字符串:

text = "They refuse to permit us."

txt = nltk.word_tokenize(text)
如果我打印POS标签,则使用此选项<代码>nltk.pos_标记(txt)我得到

[(‘他们’、‘PRP’、(‘拒绝’、‘VBP’)、(‘to’、‘to’、(‘许可’、‘VB’)、(‘我们’、‘PRP’)]

如何仅打印以下内容:

['PRP','VBP','TO','VB','PRP']


如果你有一个元组列表,你应该遍历它,只得到每个元组的第二个元素

>>> tagged = nltk.pos_tag(txt)
>>> tags =  [ e[1] for e in tagged]
>>> tags
['PRP', 'VBP', 'TO', 'VB', 'PRP'] 
看看

可能在某个时候,您会发现POS标记器运行缓慢,您需要执行此操作(请参阅):


您可以像这样迭代

print [x[1] for x in nltk.pos_tag(txt)]
>>> from nltk import pos_tag, word_tokenize
>>> from nltk.tag import PerceptronTagger
>>> tagger = PerceptronTagger()
>>> text = "They refuse to permit us."
>>> tagged_text = tagger.tag(word_tokenize(text))
>>> tokens, pos = zip(*tagged_text)
>>> pos
('PRP', 'VBP', 'TO', 'VB', 'PRP', '.')
print [x[1] for x in nltk.pos_tag(txt)]