AKS 群集配置错误 Azure 机器学习服务 Python SDK

AKS 群集配置错误 Azure 机器学习服务 Python SDK

我正在尝试按照本教程将 ML 模型部署到 AKShttps://docs.microsoft.com/en-us/azure/machine-learning/service/how-to-deploy-azure-kubernetes-service

使用 Python SDK 创建集群时:

from azureml.core.compute import AksCompute, ComputeTarget

# Use the default configuration (you can also provide parameters to customize this).
# For example, to create a dev/test cluster, use:
# prov_config = AksCompute.provisioning_configuration(cluster_purpose = AksCompute.ClusterPurpose.DEV_TEST)
prov_config = AksCompute.provisioning_configuration()

aks_name = 'myaks'
# Create the cluster
aks_target = ComputeTarget.create(workspace = ws,
                                    name = aks_name,
                                    provisioning_configuration = prov_config)

# Wait for the create process to complete
aks_target.wait_for_completion(show_output = True)

我收到以下错误:

ComputeTargetException: ComputeTargetException:
    Message: Compute object provisioning polling reached non-successful terminal state, current provisioning state: Failed
Provisioning operation error:
StatusCode: 400
Message: The request is invalid
    InnerException None
    ErrorResponse 
{
    "error": {
        "message": "Compute object provisioning polling reached non-successful terminal state, current provisioning state: Failed\nProvisioning operation error:\nStatusCode: 400\nMessage: The request is invalid"
    }
}

在 Azure 门户中,计算目标显示配置状态失败,并显示以下错误:配置错误:InvalidTemplateDeployment:根据验证程序,模板部署“0086633c-90b8-466d-9124-7e2cd78c5abc”无效。跟踪 ID 为“f0ca368a-dd3c-4fa3-9daf-ee49c82f41e4”。有关详细信息,请参阅内部错误。

客户端请求ID:fe9ff62f-666d-4b23-b6ff-33cd38f669ff 服务请求ID:|64f42c9db113451395735386668ae64c。

我尝试了不同大小的 VM,但都遇到了同样的错误。我正在美国东部地区进行部署。

相关内容