Java: no hits on nested analyzed field in Elasticsearch

We want to search the nested field "texts" in multiple languages. However, when we add extra sub-fields that use different analyzers, we never get any hits. Our configuration works for non-nested fields (such as "title" in the example below), so the problem seems to be related to nesting somehow.

Mapping configuration:
{
  "properties": {
    "title": {
      "type": "string",
      "fields": {
        "en": {
          "type": "string",
          "analyzer": "english"
        }
      }
    },
    "texts": {
      "type": "nested",
      "value": {
        "type": "string",
        "fields": {
          "en": {
            "type": "string",
            "analyzer": "english"
          }
        }
      }
    }
  }
}
Test code:
TransportClient testClient = new TransportClient()
        .addTransportAddress(new InetSocketTransportAddress(hostname, port));
String test_index = "test_index";

// Start from a clean index on every test run.
IndicesExistsResponse indicesExistsResponse = testClient.admin().indices()
        .exists(new IndicesExistsRequest(test_index))
        .actionGet();
if (indicesExistsResponse.isExists()) {
    testClient.admin().indices().prepareDelete(test_index).execute().actionGet();
}
testClient.admin().indices().prepareCreate(test_index).execute().actionGet();

// Apply the mapping shown above.
String source = Streams.copyToStringFromClasspath("/index.json");
testClient.admin().indices()
        .preparePutMapping(test_index)
        .setType(ARTICLE_TYPE)
        .setSource(source).execute().actionGet();

// Index one article with a single nested text.
Article article = new Article();
article.title = "Winter is coming";
Text text = new Text();
text.value = "The nicest summer shoes";
text.textId = UUID.randomUUID().toString();
article.texts = Collections.singletonList(text);
testClient.index(new IndexRequest(test_index, ARTICLE_TYPE, article.articleId)
        .source(objectMapper.writeValueAsBytes(article))
        .refresh(true)).actionGet();

SearchHits rawTitleSearchHits = testClient.prepareSearch(test_index)
        .setTypes(ARTICLE_TYPE)
        .setSearchType(SearchType.DFS_QUERY_THEN_FETCH)
        .setQuery(queryStringQuery("title:winter"))
        .execute().get().getHits();
SearchHits enTitleSearchHits = testClient.prepareSearch(test_index)
        .setTypes(ARTICLE_TYPE)
        .setSearchType(SearchType.DFS_QUERY_THEN_FETCH)
        .setQuery(queryStringQuery("title.en:winter"))
        .execute().get().getHits();
SearchHits rawTextSearchHits = testClient.prepareSearch(test_index)
        .setTypes(ARTICLE_TYPE)
        .setSearchType(SearchType.DFS_QUERY_THEN_FETCH)
        .setQuery(nestedQuery("texts", queryStringQuery("value:summer")))
        .execute().get().getHits();
SearchHits enTextSearchHits = testClient.prepareSearch(test_index)
        .setTypes(ARTICLE_TYPE)
        .setSearchType(SearchType.DFS_QUERY_THEN_FETCH)
        .setQuery(nestedQuery("texts", queryStringQuery("value.en:summer")))
        .execute().get().getHits();
SearchHits enTextSearchMatchHits = testClient.prepareSearch(test_index)
        .setTypes(ARTICLE_TYPE)
        .setSearchType(SearchType.DFS_QUERY_THEN_FETCH)
        .setQuery(nestedQuery("texts", matchQuery("value.en", "summer")))
        .execute().get().getHits();

assertThat(rawTextSearchHits.getTotalHits(), is(1L));
assertThat(rawTitleSearchHits.getTotalHits(), is(1L));
assertThat(enTitleSearchHits.getTotalHits(), is(1L));
// Fails. Why??
assertThat(enTextSearchHits.getTotalHits(), is(1L));
// Also fails. Why??
assertThat(enTextSearchMatchHits.getTotalHits(), is(1L));
The mapping of the nested field is missing the "properties" structure:
{
  "properties": {
    "title": {
      "type": "string",
      "fields": {
        "en": {
          "type": "string",
          "analyzer": "english"
        }
      }
    },
    "texts": {
      "type": "nested",
      "properties": {     <--- this structure is missing
        "value": {
          "type": "string",
          "fields": {
            "en": {
              "type": "string",
              "analyzer": "english"
            }
          }
        }
      }
    }
  }
}
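A quick way to catch this kind of mistake before sending a mapping to the cluster is to walk the parsed mapping and flag every "nested" field that has no "properties" entry. The following is only a minimal sketch using plain Java maps; the class name MappingCheck and the method nestedWithoutProperties are hypothetical helpers made up for illustration, not part of the Elasticsearch client API, and the sketch assumes the mapping has already been parsed into nested Map objects.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper (not part of Elasticsearch): finds "nested" fields
// in a parsed mapping that do not declare a "properties" object.
public class MappingCheck {

    /** Returns the dotted paths of all "nested" fields lacking "properties". */
    public static List<String> nestedWithoutProperties(Map<String, Object> mapping) {
        List<String> problems = new ArrayList<>();
        collect("", mapping, problems);
        return problems;
    }

    @SuppressWarnings("unchecked")
    private static void collect(String prefix, Map<String, Object> node, List<String> problems) {
        Object props = node.get("properties");
        if (!(props instanceof Map)) {
            return; // nothing declared below this node
        }
        for (Map.Entry<String, Object> e : ((Map<String, Object>) props).entrySet()) {
            if (!(e.getValue() instanceof Map)) {
                continue;
            }
            Map<String, Object> field = (Map<String, Object>) e.getValue();
            String path = prefix.isEmpty() ? e.getKey() : prefix + "." + e.getKey();
            if ("nested".equals(field.get("type")) && !(field.get("properties") instanceof Map)) {
                problems.add(path);
            }
            collect(path, field, problems); // descend into sub-objects
        }
    }

    public static void main(String[] args) {
        // The mapping from the question: "value" sits directly under "texts".
        Map<String, Object> value = new LinkedHashMap<>();
        value.put("type", "string");
        Map<String, Object> texts = new LinkedHashMap<>();
        texts.put("type", "nested");
        texts.put("value", value);
        Map<String, Object> properties = new LinkedHashMap<>();
        properties.put("texts", texts);
        Map<String, Object> mapping = new LinkedHashMap<>();
        mapping.put("properties", properties);

        System.out.println(nestedWithoutProperties(mapping)); // prints [texts]
    }
}
```

Run against the mapping from the question, the check reports "texts"; with the corrected mapping above it reports nothing.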
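Thanks for your reply. But it still doesn't work: the last assertions give the same result, 0 hits.
Did you clear the index and re-index the data?
Yes. As you can see in the test code, I delete and re-create the index on every test run.
Yes, my bad, I overlooked that. Actually, as far as I know, the problem seems to come from trying to use nested fields in a query string. If you can use a nested match query instead, it should work.
I think you need to specify the full field path, i.e. matchQuery("texts.value.en", "summer"):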
SearchHits enTextSearchHits = testClient.prepareSearch(test_index)
        .setTypes(ARTICLE_TYPE)
        .setSearchType(SearchType.DFS_QUERY_THEN_FETCH)
        .setQuery(nestedQuery("texts", queryStringQuery("texts.value.en:summer")))
        .execute().get().getHits();
SearchHits enTextSearchMatchHits = testClient.prepareSearch(test_index)
        .setTypes(ARTICLE_TYPE)
        .setSearchType(SearchType.DFS_QUERY_THEN_FETCH)
        .setQuery(nestedQuery("texts", matchQuery("texts.value.en", "summer")))
        .execute().get().getHits();