Web crawler Nutch未在'中列出任何代理商;http.agent.name';

Web crawler Nutch未在'中列出任何代理商;http.agent.name';,web-crawler,nutch,Web Crawler,Nutch,每次我运行时。/nutch crawl url-dir crawl-depth 3-topN 5。纳奇决定抛出这个错误。我的nutch-site.xml和nutch-default.xml都设置了 Exception in thread "main" java.lang.IllegalArgumentException: Fetcher: No agents listed in 'http.agent.name' property. at org.apache.nutch.fet

每次我运行时。/nutch crawl url-dir crawl-depth 3-topN 5。纳奇决定抛出这个错误。我的nutch-site.xml和nutch-default.xml都设置了

Exception in thread "main" java.lang.IllegalArgumentException: Fetcher: No agents listed in 'http.agent.name' property.
        at org.apache.nutch.fetcher.Fetcher.checkConfiguration(Fetcher.java:1166)
        at org.apache.nutch.fetcher.Fetcher.fetch(Fetcher.java:1068)
        at org.apache.nutch.crawl.Crawl.run(Crawl.java:135)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:54)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:616)
        at org.apache.hadoop.util.RunJar.main(RunJar.java:156)

http.agent.name
废话

为了便于阅读,把描述删掉了。但是我看不出还有什么地方可以指定代理名称。如果有人有任何建议,我将不胜感激

使用1.3?如果是这样,请确保在runtime/local/conf中更改了nutch-site.xml(而不是默认值)
除非使用ant重建,否则在NUTCH_HOME/conf中更改conf不会复制到运行时目录

也尝试为http.robots.agents指定代理名称。这对我有用。从那以后我就没有收到这个消息

邮件列表似乎不起作用。我尝试订阅nutch邮件列表几次,但都失败了
 <property>
  <name>http.agent.name</name>
  <value>blah</value>
  </property>