Warning: file_get_contents(/data/phpspider/zhask/data//catemap/9/solr/3.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Solr为术语向量查询返回错误500_Solr - Fatal编程技术网

Solr为术语向量查询返回错误500

Solr为术语向量查询返回错误500,solr,Solr,在solr查询web界面中,我希望获得术语向量值,例如术语频率最高的术语等等 为此,我使用了一个查询http://domain/tvrh?q=text:[*TO*]&wt=json&indent=true&tv.all=true&terms.fl=text 此查询出现以下错误: "termVectors": [ "uniqueKeyFieldName", "_id", "14708d4c-7145-46b7-98d0-727baff35ab9", [ "uniqueKey", "1470

在solr查询web界面中,我希望获得术语向量值,例如术语频率最高的术语等等

为此,我使用了一个查询
http://domain/tvrh?q=text:[*TO*]&wt=json&indent=true&tv.all=true&terms.fl=text

此查询出现以下错误:

"termVectors": [
"uniqueKeyFieldName",
"_id",
"14708d4c-7145-46b7-98d0-727baff35ab9",
[
  "uniqueKey",
  "14708d4c-7145-46b7-98d0-727baff35ab9"
]
],
  "error": {
"trace": "java.lang.NullPointerException
at org.apache.solr.handler.component.TermVectorComponent.process(TermVectorComponent.java:329)
at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:277)
at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:143)
at org.apache.solr.core.SolrCore.execute(SolrCore.java:2068)
at org.apache.solr.servlet.HttpSolrCall.execute(HttpSolrCall.java:669)
at org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:462)
at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:214)
at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:179)
at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1652)
at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)
at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)
at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:577)
at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:223)
at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1127)
at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)
at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)
at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1061)
at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:215)
at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:110)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)
at org.eclipse.jetty.server.Server.handle(Server.java:499)
at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:310)
at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:257)
at org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)
at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:635)
at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:555)
at java.lang.Thread.run(Thread.java:745)
",
            "code": 500
有什么想法吗

编辑:

My schema.xml描述字段:

<field name="description" 
 type="text_general"
 indexed="true" 
 stored="true" 
 multiValued="true" 
 termVectors="true" 
 termPositions="true" 
 termOffsets="true"/>


<fieldType name="text_general" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stop-words-all-sorted.txt" />
    <!-- in this example, we will only use synonyms at query time
    <filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>
    -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stop-words-all-sorted.txt" />
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

根据Karsten的建议,无论查询如何,我都使用/terms来获取术语频率<代码>http://localhost:8983/solr/core/terms?wt=json&indent=true&terms.fl=description


现在,我得到了术语频率,但是没有显示单个术语,而是存储了整个文本。

由于结果为null,您有一个null指针异常,因此很可能您没有使用(
termVectors=“true”
)编制索引

您可以在schema.xml的中添加术语向量。例如:

<field name="includes"
   type="text_general"
   indexed="true"
   stored="true"
   multiValued="true"
   termVectors="true"
   termPositions="true"
   termOffsets="true" />

顺便说一句:

如果希望文档频率不受查询限制,则应使用


如果希望将文档频率限制为查询,可以使用facetting(将“text”作为facet)。

Karsten,对于架构浏览器中的
text
字段,我选择了以下标志:标志:索引标记化存储Multivalued@Jakub,我添加了如何为termVectors插入缺少的标志的示例。