C# 如何为lucene添加多个和布尔查询

C# 如何为lucene添加多个和布尔查询,c#,lucene,lucene.net,C#,Lucene,Lucene.net,我有一千万个lucene文档,看起来是这样的: { "0": 230, "1": 12, "2": 611, "3": 800 } 我正在尝试查找所有文档,所有字段都小于10。这是我的lucene代码: BooleanQuery bq = new BooleanQuery(); bq.Add(NumericRangeQuery.NewIntRange("0", 1, 10, true, true), Occur.MUST); bq.Add(Numeric

我有一千万个lucene文档,看起来是这样的:

{
     "0": 230,
     "1": 12,
     "2": 611,
     "3": 800
}
我正在尝试查找所有文档,所有字段都小于10。这是我的lucene代码:

BooleanQuery bq = new BooleanQuery();
bq.Add(NumericRangeQuery.NewIntRange("0", 1, 10, true, true), Occur.MUST);
bq.Add(NumericRangeQuery.NewIntRange("1", 1, 10 , true, true), Occur.MUST);
bq.Add(NumericRangeQuery.NewIntRange("2", 1, 10, true, true), Occur.MUST);
//bq.Add(NumericRangeQuery.NewIntRange("3", 1, 1000, true, true), Occur.MUST);

TopDocs hits = searcher.Search(bq, 10);
int counter = 0;
foreach (ScoreDoc scoreDoc in hits.ScoreDocs)
{

   Lucene.Net.Documents.Document doc = searcher.Doc(scoreDoc.Doc);
   Console.WriteLine("3: " + doc.Get("3"));
   counter++;
}
我遇到的问题是,当我检查所有4个属性以查看所有4个属性是否都在1到10之间时,我没有得到任何结果。当我检查前3个属性时,我得到了正确的结果。但是当我加上第四个,我什么也得不到。正如您所看到的,forth boolean子句被注释掉了,因为它不会产生任何结果。我甚至在1到1000之间的整个范围内进行了第四次财产检查,但仍然没有结果。我做错什么了吗?下面是我如何建立索引的

public static void BuildIndex()
{
    Directory directory = FSDirectory.Open(new System.IO.DirectoryInfo("C:\\Users\\Luke\\Desktop\\1"));
    Analyzer analyzer = new Lucene.Net.Analysis.Standard.StandardAnalyzer(Lucene.Net.Util.Version.LUCENE_30);
    IndexWriter writer = new IndexWriter(directory, analyzer, new IndexWriter.MaxFieldLength(100000));


    for (int x = 0; x < 10000000; x++)
    {
        Document doc = new Document();
        doc.Add(new NumericField("id", 100000, Field.Store.YES, true).SetIntValue(x));
        for (int i = 0; i < 5; i++)
        {
            doc.Add(new NumericField(i.ToString(), 100000, Field.Store.YES, true).SetIntValue(rand.Next(1, 1000)));
        }

        writer.AddDocument(doc);
        if (x % 500 == 0)
        {
            Console.WriteLine(x);
        }
    }

    writer.Optimize();
    writer.Flush(true, true, true);
    writer.Dispose();
    directory.Dispose();

    Console.WriteLine("done");
    Console.Read();
}
publicstaticvoidbuildindex()
{
Directory Directory=FSDirectory.Open(new System.IO.DirectoryInfo(“C:\\Users\\Luke\\Desktop\\1”);
Analyzer Analyzer=new Lucene.Net.Analysis.Standard.StandardAnalyzer(Lucene.Net.Util.Version.Lucene_30);
IndexWriter writer=new IndexWriter(目录、分析器、new IndexWriter.MaxFieldLength(100000));
对于(int x=0;x<10000000;x++)
{
单据单据=新单据();
doc.Add(新的数字字段(“id”,100000,Field.Store.YES,true).SetIntValue(x));
对于(int i=0;i<5;i++)
{
doc.Add(新的数值字段(i.ToString(),100000,Field.Store.YES,true).SetIntValue(rand.Next(11000));
}
writer.AddDocument(doc);
如果(x%500==0)
{
控制台写入线(x);
}
}
writer.Optimize();
writer.Flush(真的,真的,真的);
writer.Dispose();
Dispose()目录;
控制台。写入线(“完成”);
Console.Read();
}

我刚刚用Java Lucene(4.4)重新创建了这个程序,在数值范围查询中没有发现任何问题

1) 3文件

field:0 - value:137
field:1 - value:41
field:2 - value:908
field:3 - value:871
field:4 - value:686

field:0 - value:598
field:1 - value:623
field:2 - value:527
field:3 - value:364
field:4 - value:800

field:0 - value:96
field:1 - value:301
field:2 - value:323
field:3 - value:94
field:4 - value:653
2) 索引器

如您所见,我使用的精度步长值为'6'。 我通过Luke验证了文档的索引是否正确,并通过Luke启动了相同的查询

您可以尝试通过Luke接口触发查询吗?根据您的文档更改值


+0:[100至600]+1:[40至700]+2:[500至1000]+3:[300至900]+4:[600至800]

你绝对确定你将
“3”
索引为数字而不是字符串吗?@mindas Hi是的,我确定。我已经更新了我的问题来展示我是如何建立索引的。你检查过luke中字段“3”的lucene索引了吗?它是否显示正确/预期值?此外,我可以看到您在NumericField中使用的“precisionStep”值为100000,不确定,但根据lucene文档,您最好使用4-6-8(字段中有唯一的1000个值)。你玩过这个吗?太好了,它很管用。谢谢你的详细解释。我知道我还有很多关于lucene的事情要学。
package com.numericrange;

import java.io.File;
import java.io.IOException;

import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.document.IntField;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.index.IndexWriterConfig;
import org.apache.lucene.index.IndexWriterConfig.OpenMode;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.FSDirectory;
import org.apache.lucene.util.Version;

public class IndexBuilder
{

    /**
     * @param args
     * @throws IOException 
     */
    public static void main(String[] args) throws IOException
    {
        Directory dir = FSDirectory.open(new File("/Users/Lucene/indexes"));
        IndexWriterConfig iwc = new IndexWriterConfig(Version.LUCENE_44, new StandardAnalyzer(Version.LUCENE_44));
        iwc.setOpenMode(OpenMode.CREATE);
        IndexWriter writer = new IndexWriter(dir, iwc);

        for (int x = 0; x < 3; x++)
        {
            Document doc = new Document();
            IntField iFldOut = new IntField("id", 6, Field.Store.YES);
            iFldOut.setIntValue(x);
            doc.add(iFldOut);
            for (int i = 0; i < 5; i++)
            {
                int randomVal = (int)(Math.random() * 1000) + 1;
                IntField iFld = new IntField(Integer.toString(i), 6, Field.Store.YES);
                iFld.setIntValue(randomVal);
                doc.add(iFld);
                System.out.println("i:" + i + " - Random Value:" + randomVal);
            }

            writer.addDocument(doc);

        }
        int newNumDocs = writer.numDocs();
        System.out.println("************************");
        System.out.println(newNumDocs + " documents added.");
        System.out.println("************************");

        writer.close();
    }

}
package com.numericrange;

import java.io.File;
import java.io.IOException;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.index.DirectoryReader;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.search.BooleanClause.Occur;
import org.apache.lucene.search.BooleanQuery;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.NumericRangeQuery;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.search.TopScoreDocCollector;
import org.apache.lucene.store.FSDirectory;
import org.apache.lucene.util.Version;

public class NumericQueryDemo
{

    public static void main(String[] args) throws IOException, Exception
    {
        // Use Indexes from existing folder
        String dirPath = "/Users/Lucene/indexes";
        IndexReader reader = DirectoryReader.open(FSDirectory.open(new File(dirPath)));
        IndexSearcher searcher = new IndexSearcher(reader);

        Analyzer analyzer = new StandardAnalyzer(Version.LUCENE_44);

        BooleanQuery bq = new BooleanQuery();
        bq.add(NumericRangeQuery.newIntRange("0", 100, 600, true, true), Occur.MUST);
        bq.add(NumericRangeQuery.newIntRange("1", 40, 700, true, true), Occur.MUST);
        bq.add(NumericRangeQuery.newIntRange("2", 500, 1000, true, true), Occur.MUST);
        bq.add(NumericRangeQuery.newIntRange("3", 300, 900, true, true), Occur.MUST);
        bq.add(NumericRangeQuery.newIntRange("4", 600, 800, true, true), Occur.MUST);
        System.out.println("Query Data:" + bq.toString());

        TopScoreDocCollector collector = TopScoreDocCollector.create(500, true);
        long startTime = System.currentTimeMillis();
        searcher.search(bq, collector);
        System.out.println("Search Time: "+(System.currentTimeMillis() - startTime)+"ms");

        // Display Results
        ScoreDoc[] hits = collector.topDocs().scoreDocs;
        System.out.println("Found " + hits.length + " hits.");
        for(int i=0; i < hits.length; ++i) 
        {
            int docId = hits[i].doc;
            Document d = searcher.doc(docId);
            System.out.println((i + 1) + ". " + hits[i].score + " "+ d.get("id") + " ==== " + d.get("0") +
                    " ==== " + d.get("1") + " ==== " + d.get("2") + " ==== " + d.get("3") + " ==== " + d.get("4"));
        }
    }

}
Query Data:+0:[100 TO 600] +1:[40 TO 700] +2:[500 TO 1000] +3:[300 TO 900] +4:[600 TO 800]
Search Time: 27ms
Found 2 hits.
1. 2.236068 0 ==== 137 ==== 41 ==== 908 ==== 871 ==== 686
2. 2.236068 1 ==== 598 ==== 623 ==== 527 ==== 364 ==== 800