从文件文本中过滤多个 URL

Question

这可能不是最好的，但尝试一下：

创建一个文件urlcheck.sh，然后授予执行权限。或者只需输入以下命令：

touch urlcheck.sh
chmod +x urlcheck.sh

将以下脚本粘贴到urlcheck.sh

#!/bin/bash
TIMEOUT=3

if [ ! -f output404.txt ]; then
    touch output404.txt
fi

while IFS= read -r line; do
    OUT_URL=$(curl -I $line 2>&1 -m $TIMEOUT| awk '/HTTP\// {print $2}')
    if [ "$OUT_URL" == "404" ]; then
        echo $line >> output404.txt
        echo "$line written to output404.txt"
    else
        echo "$line     $OUT_URL"
    fi
done < "$1"

并保存。

运行脚本：

./urlcheck.sh urls.txt

然后，检查output404.txt脚本生成的。

请注意每行中的 url 必须是可读取的 url，curl例如https://unix.stackexchange.com/.

您可以更改第二行的超时时间TIMEOUT=3。

Answer 1