Replacing the server address with an IP in curl

Answer 1

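The server doesn't "just know" which domain was requested: the client resolves the domain name itself and connects directly to the IP. Serving multiple websites from a single IP turned out to be very handy, so the `Host` header was introduced in a revision of the HTTP standard. A compliant HTTP client extracts the domain from the request URL and sends it in the `Host` header:

_{Example 1}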

$ curl -v superuser.com 
* Rebuilt URL to: superuser.com/
*   Trying 151.101.1.69...
* TCP_NODELAY set
* Connected to superuser.com (151.101.1.69) port 80 (#0)
> GET / HTTP/1.1
> Host: superuser.com
> User-Agent: curl/7.58.0
> Accept: */*
> 
< HTTP/1.1 301 Moved Permanently
< cache-control: no-cache, no-store, must-revalidate
< location: https://superuser.com/
[...]
< 
* Connection #0 to host superuser.com left intact

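The client sends the `Host: superuser.com` header in its request to superuser.com's IP. The server replies with a redirect to the HTTPS version of the site. There is no document body, which makes sense because a browser is expected to follow the redirect. curl won't do that without `-L`.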
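To make the "extract the domain from the URL" step concrete, here is a minimal sketch in POSIX shell. `host_from_url` is a hypothetical helper written for illustration, not something curl provides, and it skips details a real client handles (userinfo, IPv6 literals, and so on).

```shell
# Hypothetical helper: derive the value a simple client would put in the
# Host header from the URL it was asked to fetch.
host_from_url() {
    input=$1
    rest=${input#*://}        # drop the scheme, if there is one
    authority=${rest%%/*}     # keep everything before the first slash
    printf '%s\n' "$authority"
}

host_from_url http://superuser.com/questions   # prints superuser.com
host_from_url 151.101.1.69                     # prints 151.101.1.69
```

The second call mirrors Example 2: when the URL contains only an IP, the IP is all the client has to put in the header.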

Now let's try using the IP directly:

_{Example 2}

$ curl -v 151.101.1.69             
* Rebuilt URL to: 151.101.1.69/
*   Trying 151.101.1.69...
* TCP_NODELAY set
* Connected to 151.101.1.69 (151.101.1.69) port 80 (#0)
> GET / HTTP/1.1
> Host: 151.101.1.69
> User-Agent: curl/7.58.0
> Accept: */*
> 
< HTTP/1.1 500 Domain Not Found
< Server: Varnish
[...]
< 

<html>
<head>
<title>Fastly error: unknown domain 151.101.1.69</title>
</head>
<body>
<p>Fastly error: unknown domain: 151.101.1.69. Please check that this domain has been added to a service.</p>
* Connection #0 to host 151.101.1.69 left intact
<p>Details: cache-ams21021-AMS</p></body></html>

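curl sent the IP in the `Host` header, and the response is a 500 error with a body that spells out the problem: the server doesn't serve the domain given in the `Host` header.

Let's supply the header manually:

_{Example 3}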

$ curl -H 'Host: superuser.com' -v 151.101.1.69
* Rebuilt URL to: 151.101.1.69/
*   Trying 151.101.1.69...
* TCP_NODELAY set
* Connected to 151.101.1.69 (151.101.1.69) port 80 (#0)
> GET / HTTP/1.1
> Host: superuser.com
> User-Agent: curl/7.58.0
> Accept: */*
> 
< HTTP/1.1 301 Moved Permanently
< cache-control: no-cache, no-store, must-revalidate
< location: https://superuser.com/
[...]
< 
* Connection #0 to host 151.101.1.69 left intact

As expected, we get the redirect again. The server doesn't "just know" that the request was made by providing the IP directly, because requests are always made that way: resolving the domain name is the client's job.
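If you want to repeat this experiment against other host/IP pairs, the command from Example 3 can be wrapped in a small function. This is only a sketch: `request_as` is a hypothetical name, and the function prints just the status code instead of the verbose transcript.

```shell
# Hypothetical wrapper: connect to an IP over plain HTTP, but name a
# chosen domain in the Host header, and print only the status code.
request_as() {
    host=$1
    ip=$2
    curl -sS -o /dev/null -w '%{http_code}\n' -H "Host: $host" "http://$ip/"
}

# Usage (makes a real network request, so it is left commented out):
# request_as superuser.com 151.101.1.69
```

In the transcript of Example 3 the equivalent request returned 301, though the live server may behave differently today.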

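Unfortunately, overriding the `Host` header this way doesn't work for HTTPS. HTTPS is basically HTTP wrapped in TLS. Before anything is sent over HTTP, a TLS connection has to be set up. That process involves the server presenting the right certificate for the requested domain, which requires knowing the domain, so we're back to square one. This problem is solved by SNI, a TLS extension that specifies how the client communicates the domain to the server so that the correct certificate can be used.

You can emulate this with curl using `--resolve`:

_{Example 4}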

$ curl -v --resolve superuser.com:443:151.101.65.69 https://superuser.com
* Added superuser.com:443:151.101.65.69 to DNS cache
* Rebuilt URL to: https://superuser.com/
* Hostname superuser.com was found in DNS cache
[...]
* Connected to superuser.com (151.101.65.69) port 443 (#0)
[...]
* SSL connection using TLSv1.2 / ECDHE-RSA-AES128-GCM-SHA256
* ALPN, server accepted to use h2
* Server certificate:
*  subject: CN=*.stackexchange.com
*  start date: Aug  7 13:01:00 2020 GMT
*  expire date: Nov  5 13:01:00 2020 GMT
*  subjectAltName: host "superuser.com" matched cert's "superuser.com"
*  issuer: C=US; O=Let's Encrypt; CN=Let's Encrypt Authority X3
*  SSL certificate verify ok.
[...]
> GET / HTTP/2
> Host: superuser.com
> User-Agent: curl/7.58.0
> Accept: */*
> 
[...]
< HTTP/2 200 
< cache-control: private
< content-type: text/html; charset=utf-8
[...]
<!DOCTYPE html>
[...]

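`--resolve` bypasses DNS resolution for the given host. As the manual puts it, it's "a sort of /etc/hosts alternative". The argument syntax is `<host>:<port>:<ip>`. So this command:

_{Example 5}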

curl -v --resolve superuser.com:443:151.101.65.69 https://superuser.com

means:

`-v`: verbose (print headers and TLS details)
`--resolve superuser.com:443:151.101.65.69`: when connecting to superuser.com on port 443, actually use the IP 151.101.65.69
`https://superuser.com`: make a request to superuser.com over HTTPS
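The `<host>:<port>:<ip>` layout can be pulled apart with plain parameter expansion. The sketch below uses a hypothetical helper, `parse_resolve`, that only mirrors the layout described above. It is not how curl parses the option internally.

```shell
# Hypothetical helper: split a --resolve argument into its three parts.
# Only the first two colons are treated as separators, so an address
# that itself contains colons stays in one piece.
parse_resolve() {
    entry=$1
    host=${entry%%:*}     # text before the first colon
    rest=${entry#*:}      # everything after the first colon
    port=${rest%%:*}      # text between the first and second colon
    addr=${rest#*:}       # everything after the second colon
    printf 'host=%s port=%s addr=%s\n' "$host" "$port" "$addr"
}

parse_resolve superuser.com:443:151.101.65.69
# prints: host=superuser.com port=443 addr=151.101.65.69
```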

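As for why the domain has to be repeated twice: it makes sense when a single curl invocation produces multiple requests, for example because of a redirect when `-L` is given:

_{Example 6}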

$ curl -v --resolve superuser.com:443:151.101.65.69 -L http://superuser.com

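This command first resolves superuser.com using DNS. `--resolve` doesn't apply to this request, because it was specified for port 443 and we're connecting over HTTP, on port 80. The server responds with a 301 redirect to https://superuser.com. We've specified `-L`, so curl makes a second request to that URL. This time it goes over HTTPS on port 443, and we've specified an IP for that host and port with `--resolve`, so the specified IP is used (the earlier DNS lookup is ignored). In both cases the `Host` header is generated as superuser.com, because that's what we're requesting.

Here's the actual curl output. Note that only the second request produces the message "Hostname superuser.com was found in DNS cache"; that's `--resolve` in action. (The first request happens to reach the same address, apparently because DNS returned it for this lookup.)

_{Example 6 (continued)}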

* Added superuser.com:443:151.101.65.69 to DNS cache
* Rebuilt URL to: http://superuser.com/
*   Trying 151.101.65.69...
* TCP_NODELAY set
* Connected to superuser.com (151.101.65.69) port 80 (#0)
> GET / HTTP/1.1
> Host: superuser.com
> User-Agent: curl/7.58.0
> Accept: */*
> 
< HTTP/1.1 301 Moved Permanently
< cache-control: no-cache, no-store, must-revalidate
< location: https://superuser.com/
[...]
* Ignoring the response-body
[...]
* Connection #0 to host superuser.com left intact
* Issue another request to this URL: 'https://superuser.com/'
* Hostname superuser.com was found in DNS cache
*   Trying 151.101.65.69...
* TCP_NODELAY set
* Connected to superuser.com (151.101.65.69) port 443 (#1)
[...]

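Further clarification on using `--resolve` correctly

When using `--resolve`, you must request the domain name, not the IP directly. Requesting the IP would:

generate the `Host` header for the IP instead of the domain name,
indicate during the SNI step that you're accessing the IP directly rather than through the domain name (if using HTTPS),
make `--resolve` not apply at all, because `--resolve` bypasses domain name resolution, and when no domain name is given there is nothing to resolve.

So you want this:

_{Example 7}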

curl --resolve example.com:80:93.184.216.34 http://example.com

and not this:

_{Example 8}

curl --resolve example.com:80:93.184.216.34 http://93.184.216.34

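In Example 7, curl will use the IP address given in `--resolve` instead of the address that DNS returns for example.com.

When `--resolve` applies

Each `--resolve` (multiple are allowed) consists of three components: host, port and IP. A `--resolve` applies to a request if both the host and the port match, in which case DNS resolution is bypassed for that particular request and the IP from the matching `--resolve` is used. In many cases a single curl invocation makes only one request, and then `--resolve` only makes sense if its host and port match those of the request. So this invocation makes no sense, because the `--resolve` will never match due to the port mismatch (HTTPS uses 443 by default):

_{Example 9}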

curl --resolve example.com:80:93.184.216.34 https://example.com
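The matching rule can be written down as a small predicate. This is a simplified model under stated assumptions, not curl's actual implementation: `default_port` and `resolve_applies` are hypothetical helpers, only `http` and `https` URLs are considered, and IPv6 literals in the URL are not handled.

```shell
# Hypothetical helper: default port implied by the URL scheme.
default_port() {
    case $1 in
        https://*) echo 443 ;;
        *)         echo 80 ;;    # plain HTTP, or no scheme given
    esac
}

# Hypothetical helper: succeeds if a host:port:ip entry matches the
# host and port of the URL, i.e. if the entry would be used.
resolve_applies() {
    spec=$1
    target=$2
    rest=${target#*://}
    authority=${rest%%/*}
    target_host=${authority%%:*}
    case $authority in
        *:*) target_port=${authority##*:} ;;          # explicit port in URL
        *)   target_port=$(default_port "$target") ;; # port from the scheme
    esac
    spec_host=${spec%%:*}
    spec_rest=${spec#*:}
    spec_port=${spec_rest%%:*}
    [ "$spec_host" = "$target_host" ] && [ "$spec_port" = "$target_port" ]
}

for url in http://example.com https://example.com http://93.184.216.34; do
    if resolve_applies example.com:80:93.184.216.34 "$url"; then
        echo "$url: pinned address is used"
    else
        echo "$url: entry is ignored"
    fi
done
```

The three URLs correspond to Examples 7, 9 and 8: only the first matches on both host and port.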

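When does curl make multiple requests per invocation? The case I know of is when `-L` is given and the first request results in a 3xx response (the family of redirect responses, see httpstatuses.com). These responses carry a `Location` header that tells the browser to make another request to the address given in that header. Without `-L`, curl just prints the 3xx response. With `-L`, it makes another request, like a browser would. (Note that the second request may also result in a 3xx response, leading to a third request, and so on.)

For example, an HTTP request to superuser.com results in a 301 response redirecting to the HTTPS version; see Example 1, which shows the `Location` header. With `-L` you get the same response as if you had requested the HTTPS version in the first place. HTTP and HTTPS use different ports (80 and 443), so in this case you need two `--resolve`s, one per port. You can also deliberately specify only one `--resolve`, to override name resolution for the HTTP (or HTTPS) request alone and let the other go to the actual IP that DNS returns.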
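A sketch of the two-entry case: the hypothetical wrapper below, `pin_both_ports`, adds one `--resolve` for port 80 and one for port 443, so that both the initial HTTP request and the redirected HTTPS request go to the same address. The wrapper name is an assumption for illustration; the flags it passes are ordinary curl options.

```shell
# Hypothetical wrapper: pin a host to one IP on both the HTTP and the
# HTTPS port, then hand the remaining arguments to curl unchanged.
pin_both_ports() {
    host=$1
    ip=$2
    shift 2
    curl --resolve "$host:80:$ip" --resolve "$host:443:$ip" "$@"
}

# Usage (makes real network requests, so it is left commented out):
# pin_both_ports superuser.com 151.101.65.69 -v -L http://superuser.com
```

Dropping one of the two entries gives the "override only one protocol" variant described above.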
