Cluster computing: Mesos agent is always in the deactivated state

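I deployed a Mesos cluster on two virtual hosts in VMware Workstation:

  • heron01 (IP 192.168.201.131): runs the Mesos master and ZooKeeper
  • heron02 (IP 192.168.201.128): runs the Mesos slave
However, the slave is always in the deactivated state. The Mesos master error log is as follows: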

Log file created at: 2018/02/18 02:08:35
Running on machine: ubuntu
Log line format: [IWEF]mmdd hh:mm:ss.uuuuuu threadid file:line] msg
W0218 02:08:35.859475  5857 authenticator.cpp:513] No credentials provided, authentication requests will be refused
E0218 02:08:40.518481  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39882: Transport endpoint is not connected
E0218 02:08:40.523883  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39884: Transport endpoint is not connected
W0218 02:08:40.532027  5854 master.cpp:7557] Master returning resources offered because agent eae9d24b-3cf3-4a0b-9546-dfde4288fbc8-S0 at slave(1)@127.0.1.1:5051 (ubuntu) is disconnected
E0218 02:08:41.131724  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39886: Transport endpoint is not connected
W0218 02:08:41.135860  5857 master.cpp:7557] Master returning resources offered because agent eae9d24b-3cf3-4a0b-9546-dfde4288fbc8-S1 at slave(1)@127.0.1.1:5051 (ubuntu) is disconnected
E0218 02:08:41.580379  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39888: Transport endpoint is not connected
E0218 02:08:41.583258  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39890: Transport endpoint is not connected
W0218 02:08:41.585355  5858 master.cpp:7557] Master returning resources offered because agent eae9d24b-3cf3-4a0b-9546-dfde4288fbc8-S2 at slave(1)@127.0.1.1:5051 (ubuntu) is disconnected
E0218 02:08:48.556628  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39892: Transport endpoint is not connected
E0218 02:08:48.562399  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39894: Transport endpoint is not connected
E0218 02:08:48.566049  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39896: Transport endpoint is not connected
W0218 02:08:48.567793  5853 master.cpp:7557] Master returning resources offered because agent eae9d24b-3cf3-4a0b-9546-dfde4288fbc8-S3 at slave(1)@127.0.1.1:5051 (ubuntu) is disconnected
E0218 02:09:00.063712  5859 process.cpp:2577] Failed to shutdown socket with fd 35, address 127.0.0.1:39914: Transport endpoint is not connected
The Mesos slave warning log is as follows:

Log file created at: 2018/02/17 08:25:51
Running on machine: ubuntu
Log line format: [IWEF]mmdd hh:mm:ss.uuuuuu threadid file:line] msg
E0217 08:25:51.034782 48017 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45090: Transport endpoint is not connected
E0217 08:25:51.040766 48017 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45092: Transport endpoint is not connected
W0217 08:25:51.041786 48017 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:25:51.631784 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45094: Transport endpoint is not connected
W0217 08:25:51.632076 48017 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
W0217 08:25:52.095075 48011 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:25:52.095427 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45096: Transport endpoint is not connected
W0217 08:25:59.021628 48012 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:25:59.022001 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45098: Transport endpoint is not connected
W0217 08:26:10.564131 48016 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:26:10.564538 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45100: Transport endpoint is not connected
W0217 08:26:12.141916 48012 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:26:12.142215 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45102: Transport endpoint is not connected
W0217 08:26:39.090140 48018 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:26:39.090345 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45104: Transport endpoint is not connected
E0217 08:27:38.279918 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45106: Transport endpoint is not connected

I configured the cluster environment by editing the configuration files. The configuration is as follows:
1. mesos-master-env.sh

export MESOS_log_dir=/home/yitian/mesosdata/log
export MESOS_work_dir=/home/yitian/mesosdata/data
export MESOS_ZK=zk://heron01:2181/mesos
export MESOS_quorum=1
2. mesos-slave-env.sh and mesos-agent-env.sh

export MESOS_master=heron01:5050
export MESOS_log_dir=/home/yitian/mesosdata/log
export MESOS_work_dir=/home/yitian/mesosdata/run
3. masters

heron01
4. slaves

heron02

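In addition, the hostnames and IPs have been added to /etc/hosts, and both hosts have the same configuration files. How can I fix this? Thanks for your help.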

I believe you have not set the master IP correctly; the correct commands are below. Also, FYI, you cannot use 127.0.0.1 if you are using ZooKeeper.

Master

mesos-master --ip=192.168.201.131 --work_dir=/tmp/mesos
Agent

mesos-agent --ip=192.168.201.128 --master=192.168.201.131:5050 --work_dir=/tmp/mesos
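
For a setup driven by the environment files rather than command-line flags, the same binding could probably be expressed there. Mesos reads a flag from an environment variable named MESOS_ followed by the flag name, so --ip corresponds to MESOS_ip. The fragment below is only a sketch under that assumption; the file names and addresses are the ones from the question, and nobody in this thread confirmed that this variant was tested.

```shell
# mesos-master-env.sh on heron01 (sketch): bind the master to its LAN address.
export MESOS_ip=192.168.201.131

# mesos-slave-env.sh / mesos-agent-env.sh on heron02 (sketch): bind the agent
# to its LAN address so that it does not register as 127.0.1.1.
export MESOS_ip=192.168.201.128
```

Each export belongs in the file on its own host; they are shown together only for comparison. The existing MESOS_master, MESOS_log_dir, and MESOS_work_dir lines would stay as they are.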

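I configured the cluster environment by editing the configuration files. I forgot to explain this earlier and have now edited the question. Thanks for your help.

@YitianZhang From the line "Running on machine: ubuntu" in your logs, it seems both machines are listed as ubuntu. I guess that when you installed the OS, the initial hostname was ubuntu. Something is off with the system here, so Mesos takes the node name as ubuntu, which must then resolve to 127.0.0.1. Can you switch to the IP directly to confirm?

Thank you very much. You are right. I solved the problem by editing /etc/hostname to set a proper hostname on each host. I did not try to fix it by changing the IP address in the Mesos configuration files, but that should work too.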
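
The diagnosis above can be checked on each host with a small script. This is a minimal sketch, not part of the original thread: the helper name is_loopback is made up for illustration, it only looks at IPv4 addresses, and it assumes getent and awk are available (as they are on a stock Ubuntu install). It resolves the machine's own hostname and reports whether the result falls in the 127.0.0.0/8 loopback range, which is what the master log shows for the agent (slave(1)@127.0.1.1:5051).

```shell
#!/bin/sh
# Hypothetical helper: succeeds (exit status 0) when the given IPv4 address
# lies in the 127.0.0.0/8 loopback range. IPv6 addresses are not handled.
is_loopback() {
    case "$1" in
        127.*) return 0 ;;
        *)     return 1 ;;
    esac
}

# Resolve this machine's own hostname through the system resolver
# (/etc/hosts first on a default setup) and keep the first address returned.
name="$(hostname)"
addr="$(getent hosts "$name" | awk '{ print $1; exit }')"

if [ -z "$addr" ]; then
    echo "$name does not resolve to any address"
elif is_loopback "$addr"; then
    echo "$name resolves to loopback address $addr"
else
    echo "$name resolves to $addr"
fi
```

If the script reports a loopback address on heron02, the agent is likely advertising an address the master cannot reach, which fits the "is disconnected" warnings in the master log. Setting a distinct hostname that maps to the LAN address in /etc/hosts, or binding Mesos to the IP directly, should change the result.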