Configuration 专用群集的Apache Ray自动群集设置
我成功地手动设置了一个迷你光线群集(1个头+1个工人,每个人有4个CPU核)。但是,我无法使用ApacheRayAutoScaler自动设置它。头节点正确启动,而工作节点从未加入集群。下面是我对自动缩放器的YAML配置。我做错什么了吗Configuration 专用群集的Apache Ray自动群集设置,configuration,installation,cluster-computing,ray,Configuration,Installation,Cluster Computing,Ray,我成功地手动设置了一个迷你光线群集(1个头+1个工人,每个人有4个CPU核)。但是,我无法使用ApacheRayAutoScaler自动设置它。头节点正确启动,而工作节点从未加入集群。下面是我对自动缩放器的YAML配置。我做错什么了吗 cluster_name: my_ray_cluster min_workers: 8 initial_workers: 8 max_workers: 8 provider: type: local head_ip: 10.148.186.17
cluster_name: my_ray_cluster
min_workers: 8
initial_workers: 8
max_workers: 8
provider:
type: local
head_ip: 10.148.186.178
worker_ips: [10.148.186.18]
auth:
ssh_user: USER_NAME
ssh_private_key: ~/.ssh/id_rsa
# Files or directories to copy to the head and worker nodes.
file_mounts: {
# "/path1/on/remote/machine": "/path1/on/local/machine",
# "/path2/on/remote/machine": "/path2/on/local/machine",
}
head_setup_commands:
- pip3 install ray[debug,dashboard]
setup_commands:
- pip3 install ray[debug,dashboard]
# Command to start ray on the head node. You don't need to change this.
head_start_ray_commands:
- ray stop
- ray start --head --redis-port=6379
worker_start_ray_commands:
- ray stop
- ray start --address=10.148.186.178:6379
你能解决它吗?我在私有集群上也面临同样的问题!