Apache storm 提交拓扑后出现异常

Apache storm 提交拓扑后出现异常,apache-storm,apache-zookeeper,Apache Storm,Apache Zookeeper,我是风暴中的新手,正在尝试提交拓扑图,发现了这个 内部主管 我在工人的日志文件中找到了这个 [ERROR] Async loop died! java.lang.RuntimeException: org.apache.thrift7.transport.TTransportException: java.net.ConnectException: Connection refused at backtype.storm.drpc.DRPCInvocationsClient.<i

我是风暴中的新手,正在尝试提交拓扑图,发现了这个 内部主管 我在工人的日志文件中找到了这个

 [ERROR] Async loop died!
java.lang.RuntimeException: org.apache.thrift7.transport.TTransportException: java.net.ConnectException: Connection refused
    at backtype.storm.drpc.DRPCInvocationsClient.<init>(DRPCInvocationsClient.java:23)
    at backtype.storm.drpc.DRPCSpout.open(DRPCSpout.java:69)
    at storm.trident.spout.RichSpoutBatchTriggerer.open(RichSpoutBatchTriggerer.java:41)
    at backtype.storm.daemon.executor$fn__3985$fn__3997.invoke(executor.clj:460)
    at backtype.storm.util$async_loop$fn__465.invoke(util.clj:375)
    at clojure.lang.AFn.run(AFn.java:24)
    at java.lang.Thread.run(Unknown Source)
Caused by: org.apache.thrift7.transport.TTransportException: java.net.ConnectException: Connection refused
这也在主管日志文件中

   at java.lang.Thread.run(Unknown Source)
2015-09-15 02:00:54 supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
2015-09-15 02:00:55 ClientCnxn [INFO] Client session timed out, have not heard from server in 20020ms for sessionid 0x14fce3996380015, closing socket connection and attempting reconnect
2015-09-15 02:00:58 ClientCnxn [INFO] Opening socket connection to server localhost/127.0.0.1:2181
2015-09-15 02:00:58 ClientCnxn [INFO] Socket connection established to localhost/127.0.0.1:2181, initiating session
2015-09-15 02:00:59 supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
2015-09-15 02:01:01 supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
2015-09-15 02:00:59 util [INFO] Halting process: ("Error when processing an event")

这个问题有很多可能的原因

  • zookeeper未启动
  • CPU达到峰值一段时间,在超时时没有心跳发送,因此nimbus认为主管已死亡,因此断开了连接
  • 工作者超时时间太短,可能默认值为10秒,您可以将其更改为600秒或更长时间来尝试。几乎像是#2
  • 确保nimbus工作正常
  • worker.childopts不正确,这意味着内存设置不正确,请更改xmx和maxpermsize,然后重试
  • 如果使用winrm或powershell启动风暴,可能默认内存不足,因为默认内存只有1024M,您需要设置更多,例如2048M才能尝试

  • zookeper正在运行吗?感谢您的回复,是的,但我注意到时间结束后。/zkCli.sh已关闭。是否有任何方法可以控制连接时间?我想这就是为什么我在supervisor连接中也有这样的消息警告:文件名与资源不匹配/**!!?你启动Zk服务器了吗?zkCli.sh是客户端。是的,当然我先启动了服务器,然后启动了客户端,但过了一段时间,客户端关闭了,服务器仍在工作!!该网站支持一套丰富的标记工具,用于设置列表格式;你应该使用它们!
       at java.lang.Thread.run(Unknown Source)
    2015-09-15 02:00:54 supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
    2015-09-15 02:00:55 ClientCnxn [INFO] Client session timed out, have not heard from server in 20020ms for sessionid 0x14fce3996380015, closing socket connection and attempting reconnect
    2015-09-15 02:00:58 ClientCnxn [INFO] Opening socket connection to server localhost/127.0.0.1:2181
    2015-09-15 02:00:58 ClientCnxn [INFO] Socket connection established to localhost/127.0.0.1:2181, initiating session
    2015-09-15 02:00:59 supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
    2015-09-15 02:01:01 supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
    2015-09-15 02:00:59 util [INFO] Halting process: ("Error when processing an event")