Python 在中找不到stanford-postagger.jar文件
我试图在脚本中使用StanfordTokenizerPython 在中找不到stanford-postagger.jar文件,python,nltk,stanford-nlp,Python,Nltk,Stanford Nlp,我试图在脚本中使用StanfordTokenizertokenize(),但它似乎无法在我订购的类路径中找到jar 我试图将\u JAR='stanford postagger.JAR'更改为C:\Program Files\JetBrains\PyCharm 2017.1.2\stanford-postagger-2016-10-31',但似乎不起作用 这是我的剧本: from nltk.tokenize.stanford import StanfordTokenizer def Analyz
tokenize()
,但它似乎无法在我订购的类路径中找到jar
我试图将\u JAR='stanford postagger.JAR'
更改为C:\Program Files\JetBrains\PyCharm 2017.1.2\stanford-postagger-2016-10-31'
,但似乎不起作用
这是我的剧本:
from nltk.tokenize.stanford import StanfordTokenizer
def AnalyzeText(text):
t = StanfordTokenizer(path_to_jar='C:\Program Files\JetBrains\PyCharm 2017.1.2\stanford-postagger-2016-10-31\stanford-postagger.jar')
return t.tokenize(text)
我确实更新了nltk
。如您所见,我还下载了stanford postagger
。我不明白问题出在哪里。在Python3中,执行以下操作一次:
import urllib.request
import zipfile
# Download the file.
urllib.request.urlretrieve(r'http://nlp.stanford.edu/software/stanford-postagger-full-2015-04-20.zip', r'C:\Program Files\JetBrains\PyCharm 2017.1.2\stanford-postagger-full-2015-04-20.zip')
# Initialize a zipfile object.
zfile = zipfile.ZipFile(r'C:\Program Files\JetBrains\PyCharm 2017.1.2\stanford-pos-2015-04-20.zip')
# Unzip the file.
zfile.extractall(r'C:\Program Files\JetBrains\PyCharm 2017.1.2\stanford-pos')
然后:
看起来类路径还不错。NLTK需要JDK1.8版本
import os
java_path = "C:/Program Files/Java/jdk1.8.0_131/bin/java.exe"
os.environ['JAVAHOME'] = java_path
请考虑将回溯在图像中粘贴为文本。这样就更容易找到可能的关键字,也更容易阅读。从回溯来看,问题似乎在于查找java.exe
,而不是jar文件。
import os
java_path = "C:/Program Files/Java/jdk1.8.0_131/bin/java.exe"
os.environ['JAVAHOME'] = java_path