Hadoop: Cloudera Manager installation failed to receive heartbeat from agent - not able to add new hosts to the cluster

I am trying to install Cloudera Manager (Standard edition) on Ubuntu 12.04.1 LTS. When I try to add a new host, I get the following error:

Installation failed.Failed to receive heartbeat from agent.
Ensure that the host's hostname is configured properly.
Ensure that port 7182 is accesible on the Cloudera Manager server (check firewall rules).
Ensure that ports 9000 an 9001 are free on the host being added.
Check agent logs in /var/log/cloudera-scm-agent/ on the host being added (some of the logs can be found in the installation details).
In the /etc/hosts file, I have configured the following:

127.0.0.1 localhost
127.0.0.1 hadoop-ubuntu
192.168.5.xyz hadoop-ubuntu.dana.local hadoop-ubuntu
192.168.3.xyz ro-m81.dana.local ro-m81
192.168.3.abc ro-m41.dana.local ro-m41
# The following lines are desirable for IPv6 capable hosts

::1 ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters

The /var/log/cloudera-scm-agent/cloudera-scm-agent.log file shows the following error:

[09/Oct/2013 16:04:23 +0000] 4532 MainThread agent ERROR Heartbeating to 192.168.5.xyz:7182 failed.
Traceback (most recent call last):
File "/usr/lib64/cmf/agent/src/cmf/agent.py", line 747, in send_heartbeat
response = self.requestor.request('heartbeat', dict(request=heartbeat))
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 145, in request
return self.issue_request(call_request, message_name, request_datum)
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 256, in issue_request
call_response = self.transceiver.transceive(call_request)
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 485, in transceive
result = self.read_framed_message()
File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 489, in read_framed_message
response = self.conn.getresponse()
File "/usr/lib64/python2.6/httplib.py", line 990, in getresponse
response.begin()
File "/usr/lib64/python2.6/httplib.py", line 391, in begin
version, status, reason = self._read_status()
File "/usr/lib64/python2.6/httplib.py", line 349, in _read_status
line = self.fp.readline()
File "/usr/lib64/python2.6/socket.py", line 433, in readline
data = recv(1)
error: [Errno 104] Connection reset by peer

Please help me figure out why this error occurs or what I am missing.

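I had the same problem. Here is what did the trick for me:

Type ifconfig and find your IP address (not 127.0.0.1).

Type hostname and find your host name.

Edit the /etc/hosts file.

Add an entry there for your IP address, something like:

192.168.8.xxx   hostname.test.com   hostname

Restart the Cloudera services, then go to sonic.test.com:7180 and try again. It should work. Even if it does not, check the status of the host.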


As it turned out, even though I got the heartbeat error, the host was actually up and running.
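
For anyone who wants to automate the /etc/hosts check described above, here is a minimal Python sketch. It only parses hosts-style text and reports names that are mapped to a loopback address, which is the situation in the question, where hadoop-ubuntu appears on a 127.0.0.1 line. The function names and the default ignore list are my own choices for illustration and are not part of any Cloudera tooling.

```python
import ipaddress


def parse_hosts(text):
    """Return a list of (ip, [names]) pairs from /etc/hosts-style text."""
    entries = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        fields = line.split()
        if len(fields) < 2:
            continue
        try:
            ip = ipaddress.ip_address(fields[0])
        except ValueError:
            continue  # skip lines whose first field is not a valid IP
        entries.append((ip, fields[1:]))
    return entries


def loopback_names(text, ignore=("localhost", "ip6-localhost", "ip6-loopback")):
    """Names mapped to a loopback address, excluding the usual local aliases.

    The ignore list is an assumption; adjust it to your distribution.
    """
    found = []
    for ip, names in parse_hosts(text):
        if not ip.is_loopback:
            continue
        for name in names:
            if name not in ignore and name not in found:
                found.append(name)
    return found
```

To use it, pass the contents of the file, for example loopback_names(open("/etc/hosts").read()). Any name it returns is a candidate for the kind of hostname misconfiguration the error message warns about; an empty list does not prove the file is correct.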

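After checking the hosts file on all nodes in the cluster, make sure that ports 7180 and 7182 are open on the installer (server) host and that port 9000 is open on the cluster nodes (all nodes other than the installer).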


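My Cloudera installation kept failing with an "inspector failed. IO Exception thrown" error until I looked at the installer (server) logs and found that the clients could not communicate on port 9000.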
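
A quick way to test the ports mentioned above is to attempt a plain TCP connection. The following is a rough Python sketch, not an official diagnostic: the helper names are made up, and it only tells you whether something accepts connections on the port, not whether it is the Cloudera service.

```python
import socket


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


def check_ports(host, ports, timeout=3.0):
    """Map each port to True/False depending on whether it accepts connections."""
    return {port: port_open(host, port, timeout) for port in ports}
```

Run it from the machine that has to make the connection: for example, check_ports(server_host, [7180, 7182]) from a cluster node and check_ports(node_host, [9000]) from the server, where server_host and node_host are placeholders for your own host names. A False result usually points to a firewall rule or a service that is not listening.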

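I had the same problem and eventually solved it.

My problem was that the version of cloudera-scm-agent on the agent host differed from the version of cloudera-scm-server on the server. You can check this yourself with dpkg or yum.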
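
If you would rather script the version comparison, something like the Python sketch below might help. It assumes that either dpkg-query or rpm is available on the host and simply asks the package manager for the installed version; the function names are hypothetical, and the comparison is a plain string match.

```python
import shutil
import subprocess


def installed_version(package):
    """Return the installed version string of a package, or None if unknown."""
    if shutil.which("dpkg-query"):
        # Debian/Ubuntu: print only the Version field
        cmd = ["dpkg-query", "-W", "-f=${Version}", package]
    elif shutil.which("rpm"):
        # RPM-based systems: print version and release
        cmd = ["rpm", "-q", "--qf", "%{VERSION}-%{RELEASE}", package]
    else:
        return None  # no supported package manager found
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode != 0:
        return None  # package is not installed
    return result.stdout.strip() or None


def versions_match(agent_version, server_version):
    """True only when both versions are known and identical."""
    return agent_version is not None and agent_version == server_version
```

Since the agent and the server normally live on different machines, run installed_version("cloudera-scm-agent") on the host being added and installed_version("cloudera-scm-server") on the server, then compare the two strings with versions_match.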

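I faced the same problem and then found a solution.

I used two machines, one as the master and the other as the slave. The master machine runs cloudera-scm-server.

I configured /etc/hosts on both machines, and the error finally went away.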

Master IP: 192.168.1.10

In the master machine's /etc/hosts:

127.0.0.1       localhost
192.168.1.10     <hostname>

Slave IP: 192.168.1.8

In the slave machine's /etc/hosts:

127.0.0.1       localhost
192.168.1.8     <hostname>

  • First, check whether the Cloudera SCM agent is running with "sudo service cloudera-scm-agent status".
  • Then check the agent log files in the /var/log/cloudera-scm-agent/ directory.
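
To make the log check less tedious, here is a small Python sketch that pulls the ERROR entries out of agent log text. The line format is an assumption based only on the single log line quoted in the question ("[timestamp] pid thread module LEVEL message"), so the pattern may need adjusting for other agent versions.

```python
import re

# Assumed layout, inferred from the log line in the question:
# [09/Oct/2013 16:04:23 +0000] 4532 MainThread agent ERROR <message>
LOG_LINE = re.compile(
    r"^\[(?P<time>[^\]]+)\]\s+(?P<pid>\d+)\s+(?P<thread>\S+)\s+"
    r"(?P<module>\S+)\s+(?P<level>[A-Z]+)\s+(?P<message>.*)$"
)


def error_lines(text):
    """Return (timestamp, message) pairs for every ERROR-level log line."""
    errors = []
    for line in text.splitlines():
        match = LOG_LINE.match(line)
        if match and match.group("level") == "ERROR":
            errors.append((match.group("time"), match.group("message")))
    return errors
```

Feed it the contents of /var/log/cloudera-scm-agent/cloudera-scm-agent.log (reading that file may require root). Traceback lines do not match the pattern and are skipped, so you only get the one-line summaries such as the "Heartbeating to ... failed." message.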


Has anyone found a better solution than editing the configuration file?