我正在尝试从最新的帖子中获取源 HTML 代码,但是......
这个网址:
http://www.sportsbookreview.com/forum/search.php?do=getnew
将我重定向到:
http://www.sportsbookreview.com/forum/search.php?searchid=26505884
在哪里可以找到最新的帖子
如何使用 cURL 获取最终 URL?
如果我使用:
curl -s http://www.sportsbookreview.com/forum/search.php?do=getnew > $shDir$urlLatestPosts
然后它会得到我正在寻找的页面之外的其他页面,那么有没有办法获得最终的URL?
答案1
使用curl -L
。从手册中:
-L, --location
(HTTP/HTTPS) If the server reports that the requested page has moved to a different location (indicated with a Location: header and a 3XX response code), this option
will make curl redo the request on the new place. If used together with -i, --include or -I, --head, headers from all requested pages will be shown. When authentica-
tion is used, curl only sends its credentials to the initial host. If a redirect takes curl to a different host, it won't be able to intercept the user+password. See
also --location-trusted on how to change this. You can limit the amount of redirects to follow by using the --max-redirs option.
When curl follows a redirect and the request is not a plain GET (for example POST or PUT), it will do the following request with a GET if the HTTP response was 301,
302, or 303. If the response code was any other 3xx code, curl will re-send the following request using the same unmodified method.