elasticsearch Elasticsearch:过滤器聚合的准确性,elasticsearch,filter,aggregation,date-histogram,elasticsearch,Filter,Aggregation,Date Histogram" /> elasticsearch Elasticsearch:过滤器聚合的准确性,elasticsearch,filter,aggregation,date-histogram,elasticsearch,Filter,Aggregation,Date Histogram" />

elasticsearch Elasticsearch:过滤器聚合的准确性

elasticsearch Elasticsearch:过滤器聚合的准确性,elasticsearch,filter,aggregation,date-histogram,elasticsearch,Filter,Aggregation,Date Histogram,我对Elasticsearch(使用2.2版)相当陌生。 为了简化我的问题,我的文档中有一个名为termination的字段,它有时可以进行值转移 我目前请求按月汇总终止的文件数量: { "size": 0, "sort": [{ "@timestamp": { "order": "desc", "unmapped_type": "boolean" } }], "query": { "match_all": {} }, "aggs":

我对Elasticsearch(使用2.2版)相当陌生。 为了简化我的问题,我的文档中有一个名为termination的字段,它有时可以进行值转移

我目前请求按月汇总终止的文件数量:

{
  "size": 0,
  "sort": [{
    "@timestamp": {
      "order": "desc",
      "unmapped_type": "boolean"
    }
  }],
  "query": { "match_all": {} },
  "aggs": {
    "report": {
      "date_histogram": {
        "field": "@timestamp",
        "interval": "month",
        "min_doc_count": 0
      },
      "aggs": {
        "documents_with_termination_transfer": {
          "filter": {
            "term": {
              "termination": "transfer"
            }
          }
        }
      }
    }
  }
}
以下是答复:

{
    "_shards": {
        "failed": 0, 
        "successful": 206, 
        "total": 206
    }, 
    "aggregations": {
        "report": {
            "buckets": [
                {
                    "calls_with_termination_transfer": {
                        "doc_count": 209163
                    }, 
                    "doc_count": 278100, 
                    "key": 1451606400000, 
                    "key_as_string": "2016-01-01T00:00:00.000Z"
                }, 
                {
                    "calls_with_termination_transfer": {
                        "doc_count": 107244
                    }, 
                    "doc_count": 136597, 
                    "key": 1454284800000, 
                    "key_as_string": "2016-02-01T00:00:00.000Z"
                }
            ]
        }
    }, 
    "hits": {
        "hits": [], 
        "max_score": 0.0, 
        "total": 414699
    }, 
    "timed_out": false, 
    "took": 90
}
为什么点击数(414699)大于文档计数总数(278100+136597=414697)?我读过关于精度问题的书,但它似乎不适用于过滤器的情况。。。
如果我对传输终止的文档总数求和,是否也存在准确性问题?

我猜有些文档缺少
@timestamp


您可以通过在该字段上运行来验证这一点。

您是对的!有两个完全不同的文档,似乎由kibana使用:“hits”:[{“\u id”:“4.4.1”,“\u index”:“.kibana”,…},{“\u id”:“logstash-*”,“\u index”:“.kibana”,…}]