从某个地方下载内容使用 wget

Question

这个问题很困难，因为完整的图片不在父级树下，因此很难将这些路径与站点上的任何其他路径区分开来。此外，指向完整图片的链接实际上是指向嵌入了全分辨率图片的页面的链接。可能有更优雅的解决方案，但这里有一种可行的方法。

#!/bin/bash
wget -np http://www.imagebam.com/gallery/hwtfu6m7es3gun1emmpy2uheohrcckmt/
grep HTML-Code index.html > html_code
grep -E -o 'http://thumbnails[^"]+' html_code > thumb_urls
grep -E -o 'http://www[^"]+' html_code > image_pages
wget -i thumb_urls
wget -P image_pages_dir -i image_pages
for file in image_pages_dir/*
do
    echo $file
    grep -m 1 -o -E 'http://.*jpg' $file >> full_image_urls
done
wget -i full_image_urls

Answer 1

这个问题很困难，因为完整的图片不在父级树下，因此很难将这些路径与站点上的任何其他路径区分开来。此外，指向完整图片的链接实际上是指向嵌入了全分辨率图片的页面的链接。可能有更优雅的解决方案，但这里有一种可行的方法。

#!/bin/bash
wget -np http://www.imagebam.com/gallery/hwtfu6m7es3gun1emmpy2uheohrcckmt/
grep HTML-Code index.html > html_code
grep -E -o 'http://thumbnails[^"]+' html_code > thumb_urls
grep -E -o 'http://www[^"]+' html_code > image_pages
wget -i thumb_urls
wget -P image_pages_dir -i image_pages
for file in image_pages_dir/*
do
    echo $file
    grep -m 1 -o -E 'http://.*jpg' $file >> full_image_urls
done
wget -i full_image_urls

从某个地方下载内容使用 wget

答案1

相关内容