Warning: file_get_contents(/data/phpspider/zhask/data//catemap/8/sorting/2.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Solr Lucene 5.4-得分除以搜索词的数量?_Solr_Lucene_Text Mining_Scoring_Multi Term - Fatal编程技术网

Solr Lucene 5.4-得分除以搜索词的数量?

Solr Lucene 5.4-得分除以搜索词的数量?,solr,lucene,text-mining,scoring,multi-term,Solr,Lucene,Text Mining,Scoring,Multi Term,我使用IndexSearcher、QueryParser和SimpleAnalyzer进行了简单的设置。 运行一些查询时,我发现一个包含多个术语的查询返回的ScoreDoc[I]。分数与解释查询语句中显示的分数不同。显然,这是解释中显示的分数除以搜索词的数量。对这种行为有何解释 Running search(TERM1 TERM2 TERM3) line:term1 line:term2 line:term3 2.167882 = sum of: 0.6812867 = weight(lin

我使用IndexSearcher、QueryParser和SimpleAnalyzer进行了简单的设置。 运行一些查询时,我发现一个包含多个术语的查询返回的ScoreDoc[I]。分数与解释查询语句中显示的分数不同。显然,这是解释中显示的分数除以搜索词的数量。对这种行为有何解释

Running search(TERM1 TERM2 TERM3)
line:term1 line:term2 line:term3
2.167882 = sum of:
  0.6812867 = weight(line:term1 in 6594) [DefaultSimilarity], result of:
    0.6812867 = score(doc=6594,freq=2.0), product of:
      0.5389907 = queryWeigh

totalHits 1
1678413725, TERM1 TERM2 TERM3, score: 0.72262734
我知道coord()语句将用于惩罚仅包含所提供搜索词子集的文档。但是,本文件包括所有条款。有什么建议吗


编辑:似乎只有当查询配置为使用OR语句而不是AND语句时,才会进行除法。因此,使用或查询并匹配所有术语仍然是除以搜索查询中的术语数。我在文档中找不到这一部分,但至少它解释了区别

然而,应用QueryWrapperFilter似乎再次改变了分数。虽然根据文件,它应该只过滤结果而不影响评分


更多细节

这两个分数是相同查询的结果。只有第二个查询被分割

0.114700586 = product of:
  0.34410176 = sum of:
    0.34410176 = weight(line:term1 in 24) [DefaultSimilarity], result of:
      0.34410176 = score(doc=24,freq=1.0), product of:
        0.5389907 = queryWeight, product of:
          8.17176 = idf(docFreq=14, maxDocs=19532)
          0.065957725 = queryNorm
        0.63841873 = fieldWeight in 24, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.17176 = idf(docFreq=14, maxDocs=19532)
          0.078125 = fieldNorm(doc=24)
  0.33333334 = coord(1/3)

item_id: 1495958818, item_name: term 1 dolor sit met, score: 0.114700586


0.18352094 = product of:
  0.5505628 = sum of:
    0.5505628 = weight(line:term 1 in 6112) [DefaultSimilarity], result of:
      0.5505628 = score(doc=6112,freq=1.0), product of:
        0.5389907 = queryWeight, product of:
          8.17176 = idf(docFreq=14, maxDocs=19532)
          0.065957725 = queryNorm
        1.02147 = fieldWeight in 6112, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.17176 = idf(docFreq=14, maxDocs=19532)
          0.125 = fieldNorm(doc=6112)
  0.33333334 = coord(1/3)

item_id: 1677761523, item_name: some text term 1, score: 0.061173648

您是如何生产商品的?\u id:。。调试值?直接来自服务器的响应在管理控制台中看起来如何?索引存储为StringField,内容存储为TextField。使用卢克,我得到了同样的结果。需要哪些调试值?