Linux Open MPI Python: running more than 3 copies
I ran a simple MPI Python program with more than 3 processes. For example:
mpiexec -host master,w1,w2,w3 python code.py
It fails with the following errors:
ssh: Could not resolve hostname w3: Name or service not known
ORTE was unable to reliably start one or more daemons.
This usually is caused by:
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base).
Please check with your sys admin to determine the correct location to use.
* compilation of the orted with dynamic libraries when static are required
(e.g., on Cray). Please check your configure cmd line and consider using
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
However, if I run the program with any two of w1, w2, and w3, it works.
This is the code:
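import random
import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.rank  # index of this process within the communicator
size = comm.size  # total number of processes

# Both branches print the same label, as in the original post.
if rank == 0:
    print(rank, 'worker')
else:
    print(rank, 'worker')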
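How can I solve it? Thanks.

According to the output:
ssh: Could not resolve hostname w3: Name or service not known
the problem is clear: the master node (computer) cannot resolve the hostname w3.
You can add a name-to-IP mapping to /etc/hosts on the master node, one entry per line, in the format: ip name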
For example (255.255.255.0 stands in for the real IP address of w3):
255.255.255.0 name
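Before launching mpiexec, it can help to confirm that every host name in the -host list resolves on the master node. The following is a minimal sketch of such a check, not part of the original answer; the helper name unresolved_hosts and its resolve parameter are hypothetical, and only the standard-library socket module is assumed.

```python
import socket


def unresolved_hosts(hosts, resolve=socket.gethostbyname):
    """Return the names in `hosts` that cannot be resolved to an IP address.

    `resolve` is a hypothetical hook (defaulting to socket.gethostbyname)
    so the check can be exercised without touching the real resolver.
    """
    failed = []
    for host in hosts:
        try:
            resolve(host)
        except OSError:  # socket.gaierror is a subclass of OSError
            failed.append(host)
    return failed


# Example call with the host names from the mpiexec command in the question:
#   unresolved_hosts(["master", "w1", "w2", "w3"])
```

Any name the function returns needs an entry in /etc/hosts (or a DNS record) on the node where mpiexec is started.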