Hadoop: Getting a NullPointerException when running the Nutch 2.2 crawler with HBase

When I run the Nutch command

    ~/Nutch/runtime/deploy$ bin/nutch crawl url -dir /user/dlequoc/url -depth 2 -topN 5

I get the following exception:

=======================================================

13/08/26 16:30:15 INFO mapred.JobClient:  map 100% reduce 0%
13/08/26 16:30:29 INFO mapred.JobClient: Task Id : attempt_201308261546_0004_r_000000_0, Status : FAILED
java.lang.NullPointerException
    at org.apache.avro.util.Utf8.<init>(Utf8.java:37)
    at org.apache.nutch.crawl.GeneratorReducer.setup(GeneratorReducer.java:100)
    at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
    at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:417)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
    at org.apache.hadoop.mapred.Child.main(Child.java:249)
13/08/26 16:30:32 INFO mapred.JobClient: Task Id : attempt_201308261546_0004_r_000001_0, Status : FAILED
java.lang.NullPointerException
    at org.apache.avro.util.Utf8.<init>(Utf8.java:37)
    at org.apache.nutch.crawl.GeneratorReducer.setup(GeneratorReducer.java:100)
    at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
    at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:417)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
    at org.apache.hadoop.mapred.Child.main(Child.java:249)
13/08/26 16:30:32 INFO mapred.JobClient: Task Id : attempt_201308261546_0004_r_000005_0, Status : FAILED
java.lang.NullPointerException
    at org.apache.avro.util.Utf8.<init>(Utf8.java:37)
    at org.apache.nutch.crawl.GeneratorReducer.setup(GeneratorReducer.java:100)
    at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
    at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:417)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
    at org.apache.hadoop.mapred.Child.main(Child.java:249)
13/08/26 16:30:32 INFO mapred.JobClient: Task Id : attempt_201308261546_0004_r_000004_0, Status : FAILED
java.lang.NullPointerException
    at org.apache.avro.util.Utf8.<init>(Utf8.java:37)
    at org.apache.nutch.crawl.GeneratorReducer.setup(GeneratorReducer.java:100)
    at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
    at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:417)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
    at org.apache.hadoop.mapred.Child.main(Child.java:249)
13/08/26 16:30:32 INFO mapred.JobClient: Task Id : attempt_201308261546_0004_r_000002_0, Status : FAILED
java.lang.NullPointerException
    at org.apache.avro.util.Utf8.<init>(Utf8.java:37)
    at org.apache.nutch.crawl.GeneratorReducer.setup(GeneratorReducer.java:100)
    at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
    at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:417)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
    at org.apache.hadoop.mapred.Child.main(Child.java:249)
13/08/26 16:30:32 INFO mapred.JobClient: Task Id : attempt_201308261546_0004_r_000003_0, Status : FAILED
java.lang.NullPointerException
    at org.apache.avro.util.Utf8.<init>(Utf8.java:37)
    at org.apache.nutch.crawl.GeneratorReducer.setup(GeneratorReducer.java:100)
    at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
    at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:417)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
    at org.apache.hadoop.mapred.Child.main(Child.java:249)
13/08/26 16:30:44 INFO mapred.JobClient: Task Id : attempt_201308261546_0004_r_000001_1, Status : FAILED
java.lang.NullPointerException
    at org.apache.avro.util.Utf8.<init>(Utf8.java:37)
    at org.apache.nutch.crawl.GeneratorReducer.setup(GeneratorReducer.java:100)
    at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
    at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:417)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGrou