我正在使用 kOps 1.26.3 和 Kubernetes 1.26.5,在 AWS 上运行。使用 升级后kops upgrade cluster
,指标服务器停止工作。
它是使用集群清单安装的:
metricsServer:
enabled: true
已创建一些资源(如服务)。但是没有 metric-server pod,也没有部署:
$ kubectl get service metrics-server -n kube-system
E0523 12:28:33.191542 35151 memcache.go:287] couldn't get resource list for metrics.k8s.io/v1beta1: the server is currently unable to handle the request
E0523 12:28:33.517819 35151 memcache.go:121] couldn't get resource list for metrics.k8s.io/v1beta1: the server is currently unable to handle the request
E0523 12:28:33.679998 35151 memcache.go:121] couldn't get resource list for metrics.k8s.io/v1beta1: the server is currently unable to handle the request
E0523 12:28:33.836750 35151 memcache.go:121] couldn't get resource list for metrics.k8s.io/v1beta1: the server is currently unable to handle the request
NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
metrics-server ClusterIP 100.69.1.148 <none> 443/TCP 213d
$
$ kubectl get pods -A | grep metrics-server
E0523 12:44:00.464132 36275 memcache.go:287] couldn't get resource list for metrics.k8s.io/v1beta1: the server is currently unable to handle the request
E0523 12:44:00.780095 36275 memcache.go:121] couldn't get resource list for metrics.k8s.io/v1beta1: the server is currently unable to handle the request
E0523 12:44:00.942123 36275 memcache.go:121] couldn't get resource list for metrics.k8s.io/v1beta1: the server is currently unable to handle the request
E0523 12:44:01.103146 36275 memcache.go:121] couldn't get resource list for metrics.k8s.io/v1beta1: the server is currently unable to handle the request
由于没有 Pod,我不知道在哪里查找日志。我尝试终止实例并让 kOps 重新创建它们几次,结果都一样。