wget: retrieving a list of URLs while modifying the input data file on the fly

Answer 1

The reason it does not work is bad syntax:

wget -nc -i $(cut -f1 '-d ' inp)

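...the problem is that the -i switch expects one of the following:

a local text file containing a list of URLs
a remote text file containing a list of URLs
a remote HTML file containing a list of local files.

But the code above ends up passing -i http://whatever.site/data/samples/hexfilename1.mp3, which is neither a text file nor an HTML file. man wget says: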

COLUMNS=72 man wget | grep -m1 -A 22 '\-i '
   -i file
   --input-file=file
       Read URLs from a local or external file.  If - is specified
       as file, URLs are read from the standard input.  (Use ./-
       to read from a file literally named -.)

       If this function is used, no URLs need be present on the
       command line.  If there are URLs both on the command line
       and in an input file, those on the command lines will be
       the first ones to be retrieved.  If --force-html is not
       specified, then file should consist of a series of URLs,
       one per line.

       However, if you specify --force-html, the document will be
       regarded as html.  In that case you may have problems with
       relative links, which you can solve either by adding "<base
       href="url">" to the documents or by specifying --base=url
       on the command line.

       If the file is an external one, the document will be
       automatically treated as html if the Content-Type matches
       text/html.  Furthermore, the file's location will be
       implicitly used as base href if none was specified.
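
To make the failure concrete, here is a minimal sketch of what the shell hands to wget after command substitution. The sample file contents, the example.com URLs, and the `show_args` helper are assumptions made for illustration; `show_args` merely prints its arguments in place of running wget.

```shell
# Hypothetical sample input: the first field is a URL, the rest is ignored.
inp=$(mktemp)
printf '%s\n' \
  'http://example.com/a.mp3 first track' \
  'http://example.com/b.mp3 second track' > "$inp"

# Stand-in for wget: print every argument on its own line, in brackets.
show_args() { printf '[%s]\n' "$@"; }

# An unquoted $(...) is word-split, so each URL becomes a separate argument.
args=$(show_args -nc -i $(cut -f1 '-d ' "$inp"))
printf '%s\n' "$args"
# [-nc]
# [-i]
# [http://example.com/a.mp3]
# [http://example.com/b.mp3]

rm -f "$inp"
```

The first URL lands in the position where -i expects a file of URLs, and the remaining ones are passed as ordinary command-line URLs.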

Fixes include:

Using standard input as the argument to -i, as suggested in the comment by 加雷思·红:
```
cut -d' ' -f1 inp | wget -nc -i -
```
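
Before letting wget loose, it can help to look at the stream it would read from standard input. The sketch below assumes a small sample file with placeholder example.com URLs and only prints the list; the wget call is left as a comment because it needs network access.

```shell
# Hypothetical sample input: the first field is a URL, the rest is ignored.
inp=$(mktemp)
printf '%s\n' \
  'http://example.com/a.mp3 first track' \
  'http://example.com/b.mp3 second track' > "$inp"

# This is exactly what "wget -i -" would receive: one URL per line.
url_list=$(cut -d' ' -f1 "$inp")
printf '%s\n' "$url_list"
# http://example.com/a.mp3
# http://example.com/b.mp3

# With network access, the same stream could be piped to wget instead:
#   cut -d' ' -f1 "$inp" | wget -nc -i -

rm -f "$inp"
```
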
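Or this bash-centric method, which differs from the original attempt by a single character (< in place of $), as suggested in the comment by 语法错误: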
```
wget -nc -i <(cut -f1 '-d ' inp)
```
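
A rough sketch of why the process-substitution form satisfies -i: bash replaces `<(...)` with a file name, and reading that name yields the command's output. The sample file and example.com URLs are placeholders, and bash is invoked explicitly because `<(...)` is not part of POSIX sh.

```shell
# Hypothetical sample input: the first field is a URL, the rest is ignored.
inp=$(mktemp)
printf '%s\n' \
  'http://example.com/a.mp3 first track' \
  'http://example.com/b.mp3 second track' > "$inp"

# bash substitutes <(...) with a file name connected to the command's output.
fd_path=$(bash -c 'echo <(true)')
printf '%s\n' "$fd_path"    # typically something like /dev/fd/63

# Reading that file name yields the URL list, one per line,
# which is the kind of file -i expects.
urls=$(bash -c 'cat <(cut -f1 "-d " "$1")' _ "$inp")
printf '%s\n' "$urls"

rm -f "$inp"
```

So wget receives a single readable file name after -i rather than a URL pulled out of the list.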
