我无法使用 datastax opscenter 5.0.1 创建新集群
我可以将 datastax 代理添加到现有集群上的节点,但无法创建一个(端口已打开,ssh 连接正常,sudo 适用于安装用户)
这是我的做法:
- 点击“创建全新集群”
- 在下拉菜单中选择 datastax community 2.0.9
- 添加机器的 IP
- 点击“建立集群”
- 等待
- 得到错误安装错误:安装的代理似乎没有响应。
通过 SSH 连接到机器,我可以看到没有安装/传输任何内容,显然“agent_files.tar”尚未被 scp 处理。没有关于传输过程中可能失败的详细信息。
opscenterd.log 摘录,日志级别 DEBUG(此处自愿仅使用 1 台服务器以避免多个日志条目):
2014-11-04 15:48:11+0000 [] INFO: Testing SSH connectivity to 10.133.243.24
2014-11-04 15:48:11+0000 [] INFO: Testing SSH login to 10.133.243.24
2014-11-04 15:48:11+0000 [] DEBUG: performing ssh: ['/usr/bin/ssh', '-l', u'deploy', '-p', '22', '-o', 'LogLevel=Error', u'10.133.243.24', '/usr/bin/test', '0']
2014-11-04 15:48:11+0000 [] INFO: SSH connectivity/login test succeeded
2014-11-04 15:48:11+0000 [] INFO: agent_config items: {'cassandra_log_location': '/var/log/cassandra/system.log', 'thrift_port': 9160, 'jmx_pass': '*****', 'thrift_ssl_truststore': None, 'rollups86400_ttl': -1, 'api_port': '61621', 'use_ssl': 0, 'rollups7200_ttl': 31536000, 'kerberos_debug': False, 'storage_keyspace': 'OpsCenter', 'thrift_user': '', 'provisioning': 0, 'metrics_ignored_column_families': '', 'metrics_ignored_keyspaces': 'system, system_traces, system_auth, dse_auth, OpsCenter', 'jmx_user': '', 'cassandra_install_location': '', 'kerberos_use_keytab': True, 'rollups300_ttl': 2419200, 'thrift_pass': '*****', 'metrics_ignored_solr_cores': '', 'metrics_enabled': 1, 'kerberos_use_ticket_cache': True, 'thrift_ssl_truststore_type': 'JKS', 'rollups60_ttl': 604800, 'ec2_metadata_api_host': '169.254.169.254', 'kerberos_renew_tgt': True, 'thrift_ssl_truststore_password': '*****'}
2014-11-04 15:48:12+0000 [] INFO: Starting provisioning process
2014-11-04 15:48:12+0000 [] DEBUG: Persisting config file /etc/opscenter/clusters/ChallengerDeep.conf
2014-11-04 15:48:12+0000 [] INFO: Starting installation phase of cluster provisioning
2014-11-04 15:48:12+0000 [] DEBUG: performing ssh: ['/usr/bin/ssh', '-l', u'deploy', '-p', '22', '-o', 'LogLevel=Error', u'10.133.243.24', 'echo', '-n', '.$(which apt-get 2> /dev/null) .$(which yum 2> /dev/null)']
2014-11-04 15:48:12+0000 [] DEBUG: Seeing if ip/hostname 10.133.243.24 is an ipv4 address
2014-11-04 15:48:12+0000 [] DEBUG: 10.133.243.24 is an ipv4 address
2014-11-04 15:48:12+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:12+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:12+0000 [] INFO: Beginning install of OpsCenter agent to 10.133.243.24
2014-11-04 15:48:12+0000 [] DEBUG: Prepping ssh connections
2014-11-04 15:48:12+0000 [] DEBUG: performing scp: ['/usr/bin/scp', '-q', '-P', '22', '/tmp/tmpdJdGZ3', u'[email protected]:/tmp/tmpdJdGZ3']
2014-11-04 15:48:13+0000 [] DEBUG: performing scp: ['/usr/bin/scp', '-q', '-P', '22', './agent_files.tar', u'[email protected]:agent_files.tar']
2014-11-04 15:48:13+0000 [] DEBUG: performing ssh: ['/usr/bin/ssh', '-l', u'deploy', '-p', '22', '-o', 'LogLevel=Error', u'10.133.243.24', 'rm', '-rf', 'datastax-agent-installer', '&&', 'mkdir', 'datastax-agent-installer', '&&', 'cp', 'agent_files.tar', 'datastax-agent-installer/agent_files.tar', '&&', 'cd', 'datastax-agent-installer', '&&', 'tar', 'xvf', 'agent_files.tar', '&&', 'cd', '../', '&&', 'mv', '/tmp/tmpdJdGZ3', 'datastax-agent-installer/pfile', '&&', './datastax-agent-installer/bin/install_agent.sh', '', '10.133.249.88', ';', 'rm', '-rf', 'datastax-agent-installer', 'agent_files.tar']
2014-11-04 15:48:14+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:14+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:14+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:48:19+0000 [] DEBUG: Average opscenterd CPU usage: 2.24%, memory usage: 43 MB
2014-11-04 15:48:19+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:19+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:19+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:48:24+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:24+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:24+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:48:29+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:29+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:29+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:48:34+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:34+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:34+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:48:39+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:39+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:39+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:48:44+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:44+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:44+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:48:49+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:49+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:49+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:48:54+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:54+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:54+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:48:59+0000 [] DEBUG: Performing HTTP request (GET): http://10.133.243.24:61621/alive?, body: None
2014-11-04 15:48:59+0000 [] WARN: HTTP request http://10.133.243.24:61621/alive? failed: Connection was refused by other side: 111: Connection refused.
2014-11-04 15:48:59+0000 [] DEBUG: Agent is still not alive, sleeping 5 seconds...
2014-11-04 15:49:04+0000 [] WARN: Marking request 58fdf092-fc83-4b82-a2be-22b3e63ff795 as failed: The installed agent doesn't seem to be responding.
2014-11-04 15:49:04+0000 [] INFO: Successfully installed agent and dsc on node 10.133.243.24
2014-11-04 15:49:04+0000 [] DEBUG: Subrequests complete for 'install stage' (31cc12a1-5552-443b-8cd5-ec1a91a9191d)
2014-11-04 15:49:04+0000 [] WARN: Marking request 'install stage' (31cc12a1-5552-443b-8cd5-ec1a91a9191d) as failed: The installed agent doesn't seem to be responding.
2014-11-04 15:49:04+0000 [] ERROR: Installation stage failed: The installed agent doesn't seem to be responding.
2014-11-04 15:49:04+0000 [] DEBUG: Subrequest failed (key=install request=RequestCollection[31cc12a1-5552-443b-8cd5-ec1a91a9191d](error, The installed agent doesn't seem to be responding.)): Installation stage failed: The installed agent doesn't seem to be responding.
2014-11-04 15:49:04+0000 [] WARN: Marking request 'provision' (c5243946-3bb6-4eb5-b669-04355c319339) as failed: Installation stage failed: The installed agent doesn't seem to be responding.
2014-11-04 15:49:04+0000 [] ERROR:
2014-11-04 15:49:04+0000 [] ERROR: Cluster provisioning failed: Exception: Installation stage failed: The installed agent doesn't seem to be responding.
2014-11-04 15:49:04+0000 [] DEBUG: Seeing if ip/hostname 10.133.243.24 is an ipv4 address
2014-11-04 15:49:04+0000 [] DEBUG: 10.133.243.24 is an ipv4 address
2014-11-04 15:49:04+0000 [] ERROR: Failed to provision cluster: Cluster provisioning failed: Exception: Installation stage failed: The installed agent doesn't seem to be responding.
答案1
如果用户的主目录(在本例中为“deploy”)位于以 noexec 方式挂载的文件系统上,则安装脚本将失败,因为它将从 deploy 的用户主目录运行。