Warning: file_get_contents(/data/phpspider/zhask/data//catemap/9/java/354.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Java 将Apache Nutch 2.3与Hbase 0.94.14和Solr 5.2.1集成时出错_Java_Apache_Hadoop_Solr - Fatal编程技术网

Java 将Apache Nutch 2.3与Hbase 0.94.14和Solr 5.2.1集成时出错

Java 将Apache Nutch 2.3与Hbase 0.94.14和Solr 5.2.1集成时出错,java,apache,hadoop,solr,Java,Apache,Hadoop,Solr,我正在将Nutch与Hbase和Solr集成 启动Hadoop和Hbase服务后,我在NutchHome中运行以下命令 sudo-E-bin/crawl url/seed.txt TestCrawlhttp://localhost:8983/solr/ 2 我面临这些错误: Injecting seed URLs /usr/local/apache-nutch-2.3.1/runtime/local/bin/nutch inject urls/seed.txt -crawlId TestCraw

我正在将Nutch与Hbase和Solr集成

启动Hadoop和Hbase服务后,我在NutchHome中运行以下命令

sudo-E-bin/crawl url/seed.txt TestCrawlhttp://localhost:8983/solr/ 2

我面临这些错误:

Injecting seed URLs
/usr/local/apache-nutch-2.3.1/runtime/local/bin/nutch inject urls/seed.txt -crawlId TestCrawl
InjectorJob: starting at 2016-05-26 15:41:14
InjectorJob: Injecting urlDir: urls/seed.txt
Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/hadoop/hbase/HBaseConfiguration
    at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:114)
    at org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.java:102)
    at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:161)
    at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:135)
    at org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:78)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:218)
    at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:252)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:275)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:284)
Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.hbase.HBaseConfiguration
    at java.net.URLClassLoader.findClass(URLClassLoader.java:381)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
    at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:331)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
    ... 10 more
Error running:
  /usr/local/apache-nutch-2.3.1/runtime/local/bin/nutch inject urls/seed.txt -crawlId TestCrawl
Failed with exit value 1.

有人能告诉我它有什么问题吗?

这是一个错误,在执行爬网脚本时,它无法找到可传递的依赖项

更好的配置是nutch-2.3.1和hbase-0.98.8-hadoop2

为了更好地理解,请参考下面的url

这是gora hbase 0.6.1中的一个错误

 <dependency org="org.apache.hbase" name="hbase-common" rev="0.98.8-hadoop2" conf="*->default" />
另外,添加缺少的hbase-common-0.98.8-hadoop2.jar可传递依赖项,这是gora hbase 0.6.1中的一个bug

 <dependency org="org.apache.hbase" name="hbase-common" rev="0.98.8-hadoop2" conf="*->default" />

有了它,我可以成功地爬行。