Kubernetes 启用stackdriver监控会使元数据代理盒崩溃

Kubernetes 启用stackdriver监控会使元数据代理盒崩溃,kubernetes,google-cloud-platform,google-kubernetes-engine,Kubernetes,Google Cloud Platform,Google Kubernetes Engine,启用监视时创建的POD列表: ➜ kubectl get pods --namespace=kube-system | grep metadata-agent NAME READY STATUS RESTARTS AGE metadata-agent-cluster-level-579ffb7c5f-vm8q8 1/1 Running 908 3d m

启用监视时创建的POD列表:

➜ kubectl get pods --namespace=kube-system | grep metadata-agent
NAME                                                READY   STATUS    RESTARTS   AGE
metadata-agent-cluster-level-579ffb7c5f-vm8q8       1/1     Running   908        3d
metadata-agent-gdnb6                                1/1     Running   908        3d
metadata-agent-q7vct                                1/1     Running   885        3d
metadata-agent-rcfl8                                1/1     Running   907        3d
metadata-agent-vvtss                                1/1     Running   908        3d
metadata-agent-zvz6f                                1/1     Running   816        3d
来自元数据代理的日志:

➜ kubectl logs pods/metadata-agent-gdnb6  --namespace=kube-system
I0130 10:32:38 7eff97c7f740 updater.cc:40 Not starting DockerUpdater
I0130 10:32:38 7eff97c7f740 kubernetes.cc:1324 Watching for node-level metadata
I0130 10:32:38 7eff94e58700 kubernetes.cc:1163 Watch thread (pods) started for node gke-rain-rain-node-pool-16891a38-p99s
I0130 10:32:38 7eff8effd700 kubernetes.cc:1203 Watch thread (node) started for node gke-rain-rain-node-pool-16891a38-p99s
I0130 10:32:38 7eff7ffff700 reporter.cc:46 Metadata reporter started
I0130 10:32:41 7eff7ffff700 environment.cc:270 No credentials found at /etc/google/auth/application_default_credentials.json
I0130 10:32:41 7eff7ffff700 environment.cc:146 Got project id from metadata server: 11111111
I0130 10:32:41 7eff7ffff700 oauth2.cc:283 Getting auth token from metadata server
E0130 10:32:41 7eff7ffff700 reporter.cc:64 Metadata request unsuccessful: Server responded with 'Forbidden' (403): Transport endpoint is not connected
E0130 10:33:41 7eff7ffff700 reporter.cc:64 Metadata request unsuccessful: Server responded with 'Forbidden' (403): Transport endpoint is not connected
E0130 10:34:41 7eff7ffff700 reporter.cc:64 Metadata request unsuccessful: Server responded with 'Forbidden' (403): Transport endpoint is not connected
E0130 10:35:41 7eff7ffff700 reporter.cc:64 Metadata request unsuccessful: Server responded with 'Forbidden' (403): Transport endpoint is not connected
E0130 10:36:41 7eff7ffff700 reporter.cc:64 Metadata request unsuccessful: Server responded with 'Forbidden' (403): Transport endpoint is not connected
E0130 10:37:41 7eff7ffff700 reporter.cc:64 Metadata request unsuccessful: Server responded with 'Forbidden' (403): Transport endpoint is not connected
元数据:

GKE 1.11.6-GKE.3 通过云控制台启用stackdriver监控。 注:

只有在创建集群后启用stackdriver监控时才会发生这种情况,而不是作为集群创建的一部分。
Google Kubernetes引擎默认使用fluentd作为日志代理,在进行研究时,我的想法是您进行了手动安装,根据Kubernetes监控:

警告:不建议在GKE上手动安装。手动安装是为了避免在安装Stackdriver Kubernetes监控的托管支持时出现临时问题。这个问题已经消除。请参阅安装Stackdriver Kubernetes Monitoring以安装或升级到最新版本


我的建议是使用默认代理来避免此类问题

-这篇文章是关于stackdriver监控的-安装不是手动的,而是通过云控制台UI进行的,除非您使用的是测试版v2,否则不能保证它能正常工作,也没有SLA。我的建议是打开一个支持案例。有同样的问题。您找到解决方案了吗?通过云控制台禁用测试版解决了这个问题。