我正在尝试使用 IPI 安装在 AWS 上部署 OpenShift 集群。我使用了 t3.xlarge 实例并按照以下指南中的步骤进行操作https://docs.openshift.com/container-platform/4.12/installing/installing_aws/installing-aws-vpc.html#installation-configure-proxy_installing-aws-vpc,使用我现有的 VPC。
成功完成所有配置步骤并下载安装文件后,当我运行命令 ./openshift-install create cluster --dir path/todir 时,遇到以下显示的错误。
ERROR Attempted to gather debug logs after installation failure: failed to create SSH client: failed to use the provided keys for authentication: ssh: handshake failed: ssh: unable to authenticate, attempted methods [none publickey], no supported methods remain
ERROR Attempted to gather ClusterOperator status after installation failure: listing ClusterOperator objects: Get "
https://api.openshift-cluster.dummytest.eu:6443/apis/config.openshift.io/v1/clusteroperators":
dial tcp: lookup api.openshift-cluster.dummytest.eu on 10.232.192.2:53: no such host
ERROR Bootstrap failed to complete: Get "
https://api.openshift-cluster.dummytest.eu:6443/version":
dial tcp: lookup api.openshift-cluster.dummytest.eu on 10.232.192.2:53: no such host
ERROR Failed waiting for Kubernetes API. This error usually happens when there is a problem on the bootstrap host that prevents creating a temporary control plane.
我想补充一下,我已经生成了 SSH 密钥,在 Route 53 上创建了一个托管区域,并检查了互联网权限。我没有在 install-config.yml 中指定子网,但只指定了一个可用区域以及机器 CIDR 和集群 CIDR 的默认 IP 地址。但是,我无法继续。
这是我的 install-config.yml
apiVersion: v1
baseDomain: dummytest.eu
credentialsMode: Manual
controlPlane:
hyperthreading: Enabled
name: master
platform:
aws:
iamRole: openshift-ec2-role
zones:
- eu-south-1a
rootVolume:
iops: 4000
size: 500
type: io1
metadataService:
authentication: Optional
type: t3.xlarge
replicas: 1
compute:
- hyperthreading: Enabled
name: worker
platform:
aws:
rootVolume:
iops: 2000
size: 500
type: io1
metadataService:
authentication: Optional
type: c5.4xlarge
zones:
- eu-south-1a
replicas: 0
metadata:
name: openshift-cluster
networking:
clusterNetwork:
- cidr: 10.128.0.0/14
hostPrefix: 23
machineNetwork:
- cidr: 10.0.0.0/16
networkType: OVNKubernetes
serviceNetwork:
- 172.30.0.0/16
platform:
aws:
region: eu-south-1
propagateUserTags: true
userTags:
adminContact: jdoe
costCenter: 7536
subnets:
amiID: ami-01317af8d99bf5105
fips: false
pullSecret: ''
在网上搜索时,一些消息来源表明问题可能在于无法创建 Route 53 托管区域。但是,如上所述,托管区域是公共的并且是之前创建的。此外,似乎 openshift 已经创建了其他资源(例如 EIP、NAT 网关和使用 terraform 管理的私有托管区域、引导程序、工作节点和主节点的角色),但未完成安装有人有什么想法吗?感谢您的帮助