我有几个 Microsoft Word 模板文件。它们有*.dot
扩大:
$ file file.dot file.dot: Composite Document File V2 Document, Little Endian, Os: Windows, Version 6.1, Code page: 1252, Author: user, Template: file.dot, Last Saved By: user, Revision Number: 2, Name of Creating Application: Microsoft Office Word, Total Editing Time: 01:00, Last Printed: Tue Nov 21 14:41:00 1995, Create Time/Date: Fri Dec 20 11:46:00 2019, Last Saved Time/Date: Fri Dec 20 11:46:00 2019, Number of Pages: 3, Number of Words: 300, Number of Characters: 1713, Security: 0
我需要使用一些 CLI 应用程序将它们转换为纯文本。
是否可以?
答案1
答案2
还有antiword
(使用deb 软件包),它以非常高效但并不总是完全正确的方式从旧的(XML 之前的)Word 文档中提取纯文本。