windows 批量比较然后跳过文件第一行的 utf ∩╗┐

Question

您的list.txt已保存在带字节顺序标记的 UTF-8 编码和∩╗┐角色的外观UTF-8 字节顺序标记编码中的字节数CP437（也请参见下面的示例）。

非 UTF-8 软件可能会将 BOM 显示为三个垃圾字符，例如， "ï»¿"在将文档解释为 ISO 8859-1 或 Windows-1252 的软件中，以及"∩╗┐"在解释为代码页 437 时。这是莫吉巴克，当使用非预期的字符编码解码文本时，会输出乱码文本。

我猜你的脚本保存在合称为“ANSI”（CP1252）编码，因此改用ï»¿：

if "!Line[1]:~0,3!" == "ï»¿" set "Line[1]=!Line[1]:~3!"

示例；（添加了代码页 1250 的实例，结果产生了 mojibake ď»ż）：

chcp 1250
type D:\bat\SU\list1545301_UTF8-BOM.txt
chcp 1252
type D:\bat\SU\list1545301_UTF8-BOM.txt
chcp 437
type D:\bat\SU\list1545301_UTF8-BOM.txt

Active code page: 1250
ď»żthis is line1
this is line2

Active code page: 1252
ï»¿this is line1
this is line2

Active code page: 437
∩╗┐this is line1
this is line2

Answer 1

您的list.txt已保存在带字节顺序标记的 UTF-8 编码和∩╗┐角色的外观UTF-8 字节顺序标记编码中的字节数CP437（也请参见下面的示例）。

非 UTF-8 软件可能会将 BOM 显示为三个垃圾字符，例如， "ï»¿"在将文档解释为 ISO 8859-1 或 Windows-1252 的软件中，以及"∩╗┐"在解释为代码页 437 时。这是莫吉巴克，当使用非预期的字符编码解码文本时，会输出乱码文本。

我猜你的脚本保存在合称为“ANSI”（CP1252）编码，因此改用ï»¿：

if "!Line[1]:~0,3!" == "ï»¿" set "Line[1]=!Line[1]:~3!"

示例；（添加了代码页 1250 的实例，结果产生了 mojibake ď»ż）：

chcp 1250
type D:\bat\SU\list1545301_UTF8-BOM.txt
chcp 1252
type D:\bat\SU\list1545301_UTF8-BOM.txt
chcp 437
type D:\bat\SU\list1545301_UTF8-BOM.txt

Active code page: 1250
ď»żthis is line1
this is line2

Active code page: 1252
ï»¿this is line1
this is line2

Active code page: 437
∩╗┐this is line1
this is line2

windows 批量比较然后跳过文件第一行的 utf ∩╗┐

答案1

相关内容