Java: OpenNLP throws an error when using a Thai model


I tried to follow the suggestions from another question, but I got the following error:

C:\OpenNLP_models\tool\apache-opennlp-1.5.3-bin\apache-opennlp-1.5.3\bin>opennlp TokenizerME C:\OpenNLP_models\tool\apache-opennlp-1.5.3-bin\apache-opennlp-1.5.3\bin\thai.tok.bin < test.txt

Loading Tokenizer model ... Exception in thread "main" java.lang.NullPointerException
    at opennlp.tools.util.model.BaseModel.getManifestProperty(BaseModel.java:491)
    at opennlp.tools.util.model.BaseModel.initializeFactory(BaseModel.java:245)
    at opennlp.tools.util.model.BaseModel.loadModel(BaseModel.java:237)
    at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:181)
    at opennlp.tools.tokenize.TokenizerModel.<init>(TokenizerModel.java:125)
    at opennlp.tools.cmdline.tokenizer.TokenizerModelLoader.loadModel(TokenizerModelLoader.java:39)
    at opennlp.tools.cmdline.tokenizer.TokenizerModelLoader.loadModel(TokenizerModelLoader.java:31)
    at opennlp.tools.cmdline.ModelLoader.load(ModelLoader.java:62)
    at opennlp.tools.cmdline.tokenizer.TokenizerMETool.run(TokenizerMETool.java:41)
    at opennlp.tools.cmdline.CLI.main(CLI.java:225)
The test.txt file contains the following sentence: "ผมหิวข้าว".

Can anyone tell me how to fix this? I want to use the POS tagger.
Thanks.

I think you are missing the manifest.properties file. Could you unzip the thai.tok.bin file and check whether it contains the following files:

  • token.model (the binary tokenizer model)
  • manifest.properties (the configuration)

The content of manifest.properties should look like the following, taken from the question you linked to:

    Manifest-Version=1.0
    Language=th
    OpenNLP-Version=1.5.0
    Component-Name=TokenizerME
    useAlphaNumericOptimization=false
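
The check described above can also be done without unzipping by hand. The following is a minimal sketch that uses only the JDK's `java.util.zip` classes, not the OpenNLP API. The class name `ModelArchiveCheck`, the helper `readManifest`, and the default path `thai.tok.bin` are assumptions made for illustration.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Properties;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

public class ModelArchiveCheck {

    // Lists every entry in the zip stream and returns the parsed
    // manifest.properties, or null if no entry has exactly that name.
    static Properties readManifest(InputStream in) throws IOException {
        Properties manifest = null;
        try (ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                System.out.println("entry: " + entry.getName());
                if ("manifest.properties".equals(entry.getName())) {
                    manifest = new Properties();
                    manifest.load(zip); // reads only up to the end of this entry
                }
            }
        }
        return manifest;
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical default path; pass the real model location as the first argument.
        String path = args.length > 0 ? args[0] : "thai.tok.bin";
        try (InputStream in = Files.newInputStream(Paths.get(path))) {
            Properties manifest = readManifest(in);
            if (manifest == null) {
                System.out.println("manifest.properties is missing");
            } else {
                manifest.forEach((k, v) -> System.out.println(k + "=" + v));
            }
        }
    }
}
```

Because the entry name is compared exactly, a manifest saved under a different file extension shows up in the entry listing but is still reported as missing, which makes that kind of mistake easy to spot.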
    

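Please copy and paste your error message instead of posting it as a screenshot…

Sorry about that. I have just edited it. My manifest.properties file is the same as the one you posted, but I still get the same error.

Sorry, then I don't know.

Sorry, my mistake. Your solution works; I had given manifest.properties the wrong file extension. Thank you.

May I ask you one more question? I have run into a new problem: the output does not seem to be encoded as UTF-8: “a¸?a¸¸T T T T™à¸­à¸°à¹„รยังน”. Do you know how I can fix it?

That sounds like a locale problem. If you type locale, what are the values of LANG and LC_ALL? (Also, are you using Cygwin? I don't know much about Windows…)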
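
To illustrate the kind of charset mismatch suspected in the last comment, here is a minimal JDK-only sketch. It prints the JVM's default charset and then decodes the UTF-8 bytes of the test sentence twice: once as UTF-8, and once as ISO-8859-1, which stands in here for whatever legacy single-byte charset the console or JVM may default to. The class name `EncodingCheck` and the helper `decode` are assumptions made for illustration; this does not reproduce the OpenNLP command-line tool itself.

```java
import java.io.PrintStream;
import java.io.UnsupportedEncodingException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingCheck {

    // Decodes bytes with the given charset; if the bytes are UTF-8 and the
    // charset is not, the result is garbled text.
    static String decode(byte[] bytes, Charset charset) {
        return new String(bytes, charset);
    }

    public static void main(String[] args) throws UnsupportedEncodingException {
        // On many Windows setups this is a legacy code page rather than UTF-8.
        System.out.println("JVM default charset: " + Charset.defaultCharset());

        // The sentence from test.txt, written with Unicode escapes so the
        // source compiles the same way regardless of the compiler's encoding.
        String thai = "\u0e1c\u0e21\u0e2b\u0e34\u0e27\u0e02\u0e49\u0e32\u0e27";
        byte[] utf8 = thai.getBytes(StandardCharsets.UTF_8);

        // Write through a stream that encodes as UTF-8 explicitly.
        PrintStream out = new PrintStream(System.out, true, "UTF-8");
        out.println("decoded as UTF-8:      " + decode(utf8, StandardCharsets.UTF_8));
        out.println("decoded as ISO-8859-1: " + decode(utf8, StandardCharsets.ISO_8859_1));
    }
}
```

If the first line reports something other than UTF-8, the default charset is a plausible cause of the garbled output. Whether changing it (for example through the locale settings asked about above) resolves this particular case is not confirmed by the thread.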