<img src="//i.stack.imgur.com/RUiNP.png" height="16" width="18" alt="" class="sponsor tag img">elasticsearch 如何使用多重匹配的ngram分析仪_<img Src="//i.stack.imgur.com/RUiNP.png" Height="16" Width="18" Alt="" Class="sponsor Tag Img">elasticsearch

elasticsearch 如何使用多重匹配的ngram分析仪

elasticsearch 如何使用多重匹配的ngram分析仪,elasticsearch,elasticsearch,我有ngram_分析仪 "analysis": { "tokenizer": { "ngram_tokenizer": { "type": "ngram", "min_gram": 2, "max_gram": 10, "token_chars": [] } }, "analyzer": { "ngram_analyzer": { "type": "cu

我有ngram_分析仪

  "analysis": {
    "tokenizer": {
      "ngram_tokenizer": {
        "type": "ngram",
        "min_gram": 2,
        "max_gram": 10,
        "token_chars": []
      }
    },
    "analyzer": {
      "ngram_analyzer": {
        "type": "custom",
        "tokenizer": "ngram_tokenizer",
        "filter": [
          "lowercase",
        ]
      }
    }
  }

并尝试搜索所有字段：

  "query": {
   "multi_match" : {
      "query":      "jan teach",
      "analyzer": "ngram_analyzer", 
      "operator":   "and",
      "type":       "cross_fields",
      "fields":     [ "name", "occupation", "surname", ... ]
    }
  }

此不幸事件不会返回任何结果

希望此项与name=“Jane”、accountry=“teacher”匹配

还是有更好的方法来实现这一点

首先，您需要的不是ngram标记器（因为它创建了更多的标记，所以索引空间很昂贵），因为您正在对标记进行前缀搜索（Jan in Jane和tech in teacher）

其次，使用搜索时间，您应该使用标准分析器，因为令牌（jan和teacher）已经存在

工作示例：

索引定义

{
    "settings": {
        "index": {
            "analysis": {
                "analyzer": {
                    "edgengram_analyzer": {
                        "type": "custom",
                        "filter": [
                            "lowercase"
                        ],
                        "tokenizer": "edgeNGramTokenizer"
                    }
                },
                "tokenizer": {
                    "edgeNGramTokenizer": {
                        "token_chars": [
                            "letter",
                            "digit"
                        ],
                        "min_gram": "2",
                        "type": "edgeNGram",
                        "max_gram": "10"
                    }
                }
            },
            "max_ngram_diff": "10"
        }
    },
    "mappings": {
        "properties": {
            "name": {
                "type": "text",
                "analyzer" : "edgengram_analyzer",
                "search_analyzer" : "standard"
            },
            "occupation" :{
                "type" : "text",
                "analyzer" : "edgengram_analyzer",
                "search_analyzer" : "standard"
            }
        }
    }
}

索引样本文档

{
    "name" : "Jane",
    "occupation" : "teacher"
}

为
Jane

POST yourindexname/_analyze

{
    "text" : "Jane",
    "analyzer": "edgengram_analyzer"
}

    {
        "tokens": [
            {
                "token": "ja",
                "start_offset": 0,
                "end_offset": 2,
                "type": "word",
                "position": 0
            },
            {
                "token": "jan",
                "start_offset": 0,
                "end_offset": 3,
                "type": "word",
                "position": 1
            },
            {
                "token": "jane",
                "start_offset": 0,
                "end_offset": 4,
                "type": "word",
                "position": 2
            }
        ]
    }

搜索查询与您的查询相同（但不带分析器）

和搜索结果

"hits": [
            {
                "_index": "ngram",
                "_type": "_doc",
                "_id": "1",
                "_score": 0.5753642,
                "_source": {
                    "name": "Jane",
                    "occupation": "teacher"
                }
            }
        ]