Rancher Kubernetes仪表板-服务不可用

Rancher Kubernetes仪表板-服务不可用,kubernetes,rancher,Kubernetes,Rancher,我对牧场主和容器都是新手。在使用Rancher设置Kubernetes集群时,我在访问Kubernetes仪表板时遇到了问题 rancher/server: 1.6.6 Single node Rancher server + External MySQL + 3 agent nodes Infrastructure Stack versions: healthcheck: v0.3.1 ipsec: net:v0.11.5 network-services: metadata:v0.9.2

我对牧场主和容器都是新手。在使用Rancher设置Kubernetes集群时,我在访问Kubernetes仪表板时遇到了问题

rancher/server: 1.6.6

Single node Rancher server + External MySQL + 3 agent nodes

Infrastructure Stack versions:
healthcheck: v0.3.1
ipsec: net:v0.11.5
network-services: metadata:v0.9.2 / network-manager:v0.7.7
scheduler: k8s:v1.7.2-rancher5
kubernetes (if applicable): kubernetes-agent:v0.6.3


# docker info
Containers: 1
Running: 1
Paused: 0
Stopped: 0
Images: 1
Server Version: 17.03.1-ce
Storage Driver: overlay
Backing Filesystem: extfs
Supports d_type: true
Logging Driver: json-file
Cgroup Driver: cgroupfs
Plugins:
Volume: local
Network: bridge host macvlan null overlay
Swarm: inactive
Runtimes: runc
Default Runtime: runc
Init Binary: docker-init
containerd version: 4ab9917febca54791c5f071a9d1f404867857fcc
runc version: 54296cf40ad8143b62dbcaa1d90e520a2136ddfe
init version: 949e6fa
Security Options:
seccomp
Profile: default
Kernel Version: 4.9.34-rancher
Operating System: RancherOS v1.0.3
OSType: linux
Architecture: x86_64
CPUs: 2
Total Memory: 7.798 GiB
Name: ch7radod1
ID: IUNS:4WT2:Y3TV:2RI4:FZQO:4HYD:YSNN:6DPT:HMQ6:S2SI:OPGH:TX4Y
Docker Root Dir: /var/lib/docker
Debug Mode (client): false
Debug Mode (server): false
Http Proxy: http://proxy.ch.abc.net:8080
Https Proxy: http://proxy.ch.abc.net:8080
No Proxy: localhost,.xyz.net,abc.net
Registry: https://index.docker.io/v1/
Insecure Registries:
127.0.0.0/8
Live Restore Enabled: false
访问UI URL时显示“服务不可用”

如果从UI使用CLI部分,则会得到以下结果:

> kubectl get nodes
NAME              STATUS    AGE       VERSION
ch7radod3       Ready     1d        v1.7.2
ch7radod4       Ready     5d        v1.7.2
ch7radod1       Ready     1d        v1.7.2

> kubectl get pods --all-namespaces
NAMESPACE     NAME                                   READY     STATUS              RESTARTS   AGE
kube-system   heapster-4285517626-4njc2              0/1       ContainerCreating   0          5d
kube-system   kube-dns-3942128195-ft56n              0/3       ContainerCreating   0          19d
kube-system   kube-dns-646531078-z5lzs               0/3       ContainerCreating   0          5d
kube-system   kubernetes-dashboard-716739405-lpj38   0/1       ContainerCreating   0          5d
kube-system   monitoring-grafana-3552275057-qn0zf    0/1       ContainerCreating   0          5d
kube-system   monitoring-influxdb-4110454889-79pvk   0/1       ContainerCreating   0          5d
kube-system   tiller-deploy-737598192-f9gcl          0/1       ContainerCreating   0          5d
安装程序使用私有注册表(Artifactory)。我检查了Artifactory,我可以看到一些与Docker相关的图片。我正在浏览这个部分,我还看到了这个文件。如果需要这个文件,我应该把它保存在哪里,以便Rancher可以获取它并配置Kubernetes仪表板

更新:

$ sudo ros engine switch docker-1.12.6
> ERRO[0031] Failed to load https://raw.githubusercontent.com/rancher/os-services/v1.0.3/index.yml: Get https://raw.githubusercontent.com/rancher/os-services/v1.0.3/index.yml: Proxy Authentication Required
> FATA[0031] docker-1.12.6 is not a valid engine
我想这可能是由于NGINX,所以我停止了NGINX容器,但我仍然得到上述错误。早些时候,我在这个Rancher服务器上尝试了相同的命令,它过去工作得很好。它在代理节点上运行良好,尽管它们已经配置了1.12.6

更新2:

> kubectl -n kube-system get po
NAME                                 READY STATUS            RESTARTS AGE
heapster-4285517626-4njc2            1/1   Running           0        12d
kube-dns-2588877561-26993            0/3   ImagePullBackOff  0        5h
kube-dns-646531078-z5lzs             0/3   ContainerCreating 0        12d
kubernetes-dashboard-716739405-zq3s9 0/1   CrashLoopBackOff  67       5h
monitoring-grafana-3552275057-qn0zf  1/1   Running           0        12d
monitoring-influxdb-4110454889-79pvk 1/1   Running           0        12d
tiller-deploy-737598192-f9gcl        0/1   CrashLoopBackOff  72       12d

没有一个播客在运行,你需要先解决这个问题。尝试重新启动整个集群,并查看以上所有POD的运行状态。

基于@ivan.sim,我发布了“更新2”。这让我终于开始朝着正确的方向看。然后,我开始在线查找
CrashLoopBackOff
error,遇到link并尝试了以下命令(使用Rancher控制台中的CLI选项),该命令实际上与@ivan.sim上面建议的非常类似,但这对运行仪表板进程的节点有帮助:

> kubectl get pods -a -o wide --all-namespaces
NAMESPACE     NAME                                   READY  STATUS              RESTARTS   AGE  IP                  NODE
kube-system   heapster-4285517626-4njc2              1/1    Running             0          12d  10.42.224.157       radod4
kube-system   kube-dns-2588877561-26993              0/3    ImagePullBackOff    0          5h   <none>              radod1
kube-system   kube-dns-646531078-z5lzs               0/3    ContainerCreating   0          12d  <none>              radod4
kube-system   kubernetes-dashboard-716739405-zq3s9   0/1    Error               70         5h   10.42.218.11        radod1
kube-system   monitoring-grafana-3552275057-qn0zf    1/1    Running             0          12d  10.42.202.44        radod4
kube-system   monitoring-influxdb-4110454889-79pvk   1/1    Running             0          12d  10.42.111.171       radod4
kube-system   tiller-deploy-737598192-f9gcl          0/1    CrashLoopBackOff    76         12d  10.42.213.24        radod4
在我得到上述错误后,我再次在网上搜索,并尝试了一些事情。最后,林克帮了忙。在所有代理节点上执行了以下命令后,Kubernetes仪表板终于开始工作了

docker volume rm etcd
rm -rf /var/etcd/backups/*

问题似乎在于Docker版本(17.03.1-ce)与Kubernetes的兼容性。我知道这一点,但我在修理其他东西时忘记了。在我做这些的时候,我最近将NGINX用于HTTPS。现在,如果我试图将docker版本更改为1.12.6,则会收到错误消息。请参阅我文章中的更新。rancherOS的哪个版本?rancherOS v1.0.3您是否支持代理?你能到达
https://raw.githubusercontent.com/rancher/os-services/v1.0.3/index.yml
使用
curl
或其他方法?尝试运行
kubectl-n kube system get po
查看是否安装了POD。使用
kubectl-n kube系统日志kubernetes-dashboard-716739405-zq3s9
检查错误日志和事件,以及
kubectl-n kube系统描述kubernetes-dashboard-716739405-zq3s9
docker volume rm etcd
rm -rf /var/etcd/backups/*