Lucene: Using stop words with WhitespaceAnalyzer


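Lucene's StandardAnalyzer strips the dots from strings and acronyms when it indexes them. I want Lucene to keep the dots, so I am using the WhitespaceAnalyzer class instead.

I can pass my stop word list to StandardAnalyzer, but how do I pass it to WhitespaceAnalyzer?

Thanks for reading.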

Create your own analyzer by extending WhitespaceAnalyzer and overriding its tokenStream method, like this:

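@Override
public TokenStream tokenStream(String fieldName, Reader reader) {
    // Let WhitespaceAnalyzer split on whitespace only, so dots are kept.
    TokenStream result = super.tokenStream(fieldName, reader);
    // Drop stop words; the StopFilter constructor arguments vary across Lucene versions.
    result = new StopFilter(result, stopSet);
    return result;
}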

Here stopSet is the set of stop words. You can supply it by adding a constructor to your analyzer that accepts a list of stop words.
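To make the intended effect concrete, here is a minimal plain-Java sketch of what a whitespace tokenizer followed by a stop word filter is meant to produce. It deliberately uses no Lucene classes, so it runs on its own; the class name StopWordDemo and the method analyze are hypothetical names chosen for illustration, and the matching here is exact and case-sensitive, which may differ from how your StopFilter is configured.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration class; it does not use any Lucene API.
class StopWordDemo {

    // Splits on whitespace only (so dots inside tokens survive),
    // then drops every token found in stopSet.
    // Matching is exact and case-sensitive in this sketch.
    static List<String> analyze(String text, Set<String> stopSet) {
        List<String> kept = new ArrayList<>();
        for (String token : text.trim().split("\\s+")) {
            if (!token.isEmpty() && !stopSet.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        Set<String> stopSet = new HashSet<>(Arrays.asList("the", "is", "a"));
        // prints [U.S.A., country]
        System.out.println(analyze("the U.S.A. is a country", stopSet));
    }
}
```

The acronym keeps its dots because the text is split only on whitespace, while "the", "is" and "a" are removed because they appear in the stop set.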

If you plan to reuse the TokenStream, you may also want to override the reusableTokenStream() method in the same way.
