How do I add text to a scanned PDF document (to enable search and copy-paste)?

Answer 1

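You can get the result you want with pdftk and its multistamp command.

First export the M$-Word document to a PDF file, document.pdf, and save the scanned, signed copy as document_signed.pdf. Then merge the two documents as follows: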

pdftk document.pdf multistamp document_signed.pdf output document_signed_searchable.pdf

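This creates document_signed_searchable.pdf, which keeps the text of document.pdf (so it can be searched and copied) with the signed scan stamped on top of it.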
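
To confirm that the merged file really carries a text layer, the merge step above can be wrapped in a small script. This is only a sketch: it assumes pdftotext from poppler-utils is installed alongside pdftk, and has_text and make_searchable are hypothetical helper names, not part of either tool.

```shell
#!/usr/bin/env bash
# Sketch only. has_text and make_searchable are hypothetical helper names.

# Succeeds if stdin contains at least one non-whitespace character.
has_text() {
    grep -q '[^[:space:]]'
}

# Stamp the scanned pages onto the text PDF, then check that text can be
# extracted from the result.
# Usage: make_searchable document.pdf document_signed.pdf document_signed_searchable.pdf
make_searchable() {
    local text_pdf=$1 scan_pdf=$2 out_pdf=$3
    pdftk "$text_pdf" multistamp "$scan_pdf" output "$out_pdf" || return 1
    # "pdftotext FILE -" writes the extracted text to stdout.
    if pdftotext "$out_pdf" - | has_text; then
        echo "text layer found in $out_pdf"
    else
        echo "no extractable text in $out_pdf" >&2
        return 1
    fi
}
```

If the check fails, the most likely cause is that document.pdf itself contained no text, for example because it was also produced from a scan.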

Here is the relevant excerpt from the manual:

background <background PDF filename | - | PROMPT>
    Applies a PDF watermark to the background of a single input PDF.  Pass the background PDF's filename after background like so:
  
    pdftk in.pdf background back.pdf output out.pdf
  
    Pdftk uses only the first page from the background PDF and applies it to every page of the input PDF.  This page is scaled and rotated as needed to fit the input page.  You can use - to pass a background PDF into pdftk via stdin.
  
    If the input PDF does not have a transparent background (such as a PDF created from page scans) then the resulting background won't be visible -- use the stamp operation instead.
  
multibackground <background PDF filename | - | PROMPT>
    Same as the background operation, but applies each page of the background PDF to the corresponding page of the input PDF.  If the input PDF has more pages than the stamp PDF, then the final stamp page is repeated across these remaining pages in the input PDF.
  
stamp <stamp PDF filename | - | PROMPT>
    This behaves just like the background operation except it overlays the stamp PDF page on top of the input PDF document's pages.  This works best if the stamp PDF page has a transparent background.

multistamp <stamp PDF filename | - | PROMPT>
    Same as the stamp operation, but applies each page of the background PDF to the corresponding page of the input PDF.  If the input PDF has more pages than the stamp PDF, then the final stamp page is repeated across these remaining pages in the input PDF.
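
Because multistamp repeats the final stamp page when the input has more pages than the stamp PDF, it may be worth comparing page counts before merging. The sketch below assumes that pdftk's dump_data operation reports a line of the form "NumberOfPages: N"; the function names are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only. All function names are hypothetical helpers.

# Reads "pdftk FILE dump_data" output on stdin and prints the page count.
page_count_from_dump() {
    awk -F': ' '$1 == "NumberOfPages" { print $2; exit }'
}

# Prints the number of pages in a PDF file.
page_count() {
    pdftk "$1" dump_data | page_count_from_dump
}

# Returns non-zero (with a warning) if the two PDFs differ in page count
# or if either count could not be read.
# Usage: check_page_counts document.pdf document_signed.pdf
check_page_counts() {
    local input_pages stamp_pages
    input_pages=$(page_count "$1")
    stamp_pages=$(page_count "$2")
    if [ -z "$input_pages" ] || [ -z "$stamp_pages" ]; then
        echo "could not read page counts" >&2
        return 1
    fi
    if [ "$input_pages" -ne "$stamp_pages" ]; then
        echo "warning: $1 has $input_pages pages, $2 has $stamp_pages" >&2
        return 1
    fi
}
```

A mismatch usually means a page was skipped or duplicated while scanning, in which case the stamped pages would no longer line up with the text underneath.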

Answer 2

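The whole point of the file is that it is a signed copy of the source document (otherwise there would be no point in adding a signature).

So you need to put the signature back where it should have been applied: add it to the source DocX, as if the document had been signed in Word, and from there it can be filed as a true PDF copy. On Linux you will obviously need OpenOffice or LibreOffice for this. Otherwise you would have to add the scan to the DocX media folder and write some highly complex entries into the document XML.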
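
On Linux, the export step can be scripted once the signature image has been placed in the DocX by hand in LibreOffice Writer. The sketch below covers only the export to PDF. It assumes the LibreOffice launcher is available as soffice (some installations call it libreoffice); pdf_name_for and docx_to_pdf are hypothetical helper names.

```shell
#!/usr/bin/env bash
# Sketch only. pdf_name_for and docx_to_pdf are hypothetical helper names.

# Prints the file name LibreOffice gives the converted PDF:
# the same base name with a .pdf extension.
pdf_name_for() {
    local base
    base=$(basename "$1")
    printf '%s.pdf\n' "${base%.*}"
}

# Converts a DocX to PDF without opening the GUI and prints the output path.
# Usage: docx_to_pdf document_signed.docx [output_dir]
docx_to_pdf() {
    local docx=$1 outdir=${2:-.}
    # Send LibreOffice's own progress messages to stderr so that stdout
    # carries only the resulting path.
    soffice --headless --convert-to pdf --outdir "$outdir" "$docx" >&2 || return 1
    printf '%s/%s\n' "$outdir" "$(pdf_name_for "$docx")"
}
```

Because the PDF is generated from the source document rather than from a scan, its text stays searchable without any OCR step.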

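That way there is no question that it is the searchable, signed source document, and you do not have to worry about OCR corruption or degradation.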
