Java Apache Solr: I ran this command in a Cygwin terminal: ./nutch crawl urls -dir newCrawl -solr http://localhost:8939/solr/ -depth 10 -topN 10

SolrIndexer: starting at 2015-10-15 10:13:00
Adding 90 documents
java.io.IOException: Job failed!
SolrDeleteDuplicates: starting at 2015-10-15 10:13:11
SolrDeleteDuplicates: Solr url: http://localhost:8939/solr/
Exception in thread "main" java.io.IOException: org.apache.solr.client.solrj.SolrServerException: java.net.ConnectException: Connection refused: connect
        at org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplits(SolrDeleteDuplicates.java:200)
        at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
        at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1249)
        at org.apache.nutch.indexer.solr.SolrDeleteDuplicates.dedup(SolrDeleteDuplicates.java:373)
        at org.apache.nutch.indexer.solr.SolrDeleteDuplicates.dedup(SolrDeleteDuplicates.java:353)
        at org.apache.nutch.crawl.Crawl.run(Crawl.java:153)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:55)
Caused by: org.apache.solr.client.solrj.SolrServerException: java.net.ConnectException: Connection refused: connect
        at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:478)
        at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
        at org.apache.solr.client.solrj.request.QueryRequest.process(QueryRequest.java:89)
        at org.apache.solr.client.solrj.SolrServer.query(SolrServer.java:118)
        at org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat.getSplits(SolrDeleteDuplicates.java:198)
        ... 9 more
Caused by: java.net.ConnectException: Connection refused: connect
        at java.net.DualStackPlainSocketImpl.connect0(Native Method)
        at java.net.DualStackPlainSocketImpl.socketConnect(DualStackPlainSocketImpl.java:79)
        at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:345)
        at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:206)
        at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:188)
        at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:172)
        at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)
        at java.net.Socket.connect(Socket.java:589)
        at java.net.Socket.connect(Socket.java:538)
        at java.net.Socket.<init>(Socket.java:434)
        at java.net.Socket.<init>(Socket.java:286)
        at org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:80)
        at org.apache.commons.httpclient.protocol.DefaultProtocolSocketFactory.createSocket(DefaultProtocolSocketFactory.java:122)
        at org.apache.commons.httpclient.HttpConnection.open(HttpConnection.java:707)
        at org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:387)
        at org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)
        at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:397)
        at org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:323)
        at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:422)
        ... 13 more

How do I solve this problem?

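Solr does not appear to be running. "Connection refused" means the client found nothing listening at the address it tried to reach, here localhost:8939. Either your configuration is broken, you have not started Solr, or Solr is running on a different port.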
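One quick way to confirm this is to test whether anything accepts TCP connections on the port from the command. The following is a minimal sketch, not part of Nutch or Solr: the class name `PortCheck`, the helper `isListening`, and the 2-second timeout are assumptions made for illustration, and the host and port are taken from the URL in the question.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

public class PortCheck {
    /** Returns true if a TCP connection to host:port succeeds within timeoutMs. */
    static boolean isListening(String host, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMs);
            return true;
        } catch (IOException e) {
            // Covers "Connection refused" as well as timeouts and unknown hosts.
            return false;
        }
    }

    public static void main(String[] args) {
        // Host and port from the -solr argument in the question.
        String host = "localhost";
        int port = 8939;
        boolean up = isListening(host, port, 2000);
        System.out.println(host + ":" + port
                + (up ? " is accepting connections" : " refused the connection or timed out"));
    }
}
```

If this reports that the connection was refused, the problem is on the Solr side (not started, or bound to another port) rather than in the Nutch crawl itself.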

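Your Solr URL is invalid. You also need to include the core name. By default, the core is collection1, so in your case the URL should end with that core name rather than stopping at /solr/. Check it and try again.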
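As a rough illustration of that advice, the sketch below appends a core name to a Solr base URL. It assumes the intended URL is simply the base URL from the question followed by the default core name mentioned in the answer; the class `SolrUrl` and the helper `withCore` are hypothetical names, not part of Solr or Nutch.

```java
public class SolrUrl {
    /** Appends a core name to a Solr base URL, avoiding a doubled slash. */
    static String withCore(String baseUrl, String core) {
        String trimmed = baseUrl.endsWith("/")
                ? baseUrl.substring(0, baseUrl.length() - 1)
                : baseUrl;
        return trimmed + "/" + core;
    }

    public static void main(String[] args) {
        // Base URL from the question; "collection1" is the default core named in the answer.
        System.out.println(withCore("http://localhost:8939/solr/", "collection1"));
    }
}
```

The printed value is what you would pass to the -solr option, provided a core with that name actually exists in your Solr installation.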