将链接的 html 文件转换为 pdf 文件？

Question

正如您链接的说明中所述：

默认的全局扩展将页面按字母顺序排列。

索引页链接到九个不同的文档，其名称不按字母顺序排列。当您说时htmldoc ... *.html，工具会按该顺序查看它们，并按字母顺序将页面放入文档中。您需要按照要htmldoc处理的顺序在命令行上列出文件。

在这种特定情况下，您可以生成文件名的有序列表，因为它们在索引中链接为：

awk '/http:|\.\./ {next}; /<a href.*\.html/ { gsub(/.*href="/, "") ; gsub(".html.*", ".html") ; print }' index.html | uniq

所以

htmldoc --webpage -f gdb.pdf index.html $(awk '/http:|\.\./ {next}; /<a href.*\.html/ { gsub(/.*href="/, "") ; gsub(".html.*", ".html") ; print }' index.html | uniq)

就会达到你想要的效果。

Answer 1

正如您链接的说明中所述：

默认的全局扩展将页面按字母顺序排列。

索引页链接到九个不同的文档，其名称不按字母顺序排列。当您说时htmldoc ... *.html，工具会按该顺序查看它们，并按字母顺序将页面放入文档中。您需要按照要htmldoc处理的顺序在命令行上列出文件。

在这种特定情况下，您可以生成文件名的有序列表，因为它们在索引中链接为：

awk '/http:|\.\./ {next}; /<a href.*\.html/ { gsub(/.*href="/, "") ; gsub(".html.*", ".html") ; print }' index.html | uniq

所以

htmldoc --webpage -f gdb.pdf index.html $(awk '/http:|\.\./ {next}; /<a href.*\.html/ { gsub(/.*href="/, "") ; gsub(".html.*", ".html") ; print }' index.html | uniq)

就会达到你想要的效果。

将链接的 html 文件转换为 pdf 文件？

答案1

相关内容