Mysterious space codes for non-ASCII characters

Question

Answer 1

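There is a control character after the colon (apparently U+2028, LINE SEPARATOR). It takes three bytes in UTF-8, but LaTeX is using its default single-byte input encoding, so each byte is printed as a separate character, as shown above. The output is reproduced by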

    \documentclass{article}
    \usepackage[T1]{fontenc}

    \begin{document}

    U+2008 Punctuation space : x

    U+2028 Line separator :     x

    \end{document}
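
To see the byte-by-byte behaviour without relying on invisible characters in the source, the following sketch writes the three UTF-8 bytes of U+2028 (E2 80 A8) in TeX's `^^` notation. It assumes pdfLaTeX. The `\UseRawInputEncoding` line is an addition that is not part of the answer; it should only matter on LaTeX releases from April 2018 onward, where UTF-8 rather than a single-byte encoding is the default.

```latex
% Assumption: on LaTeX 2018-04-01 or later, restore the legacy
% single-byte input handling; older kernels lack the command, so guard it.
\ifdefined\UseRawInputEncoding \UseRawInputEncoding \fi
\documentclass{article}
\usepackage[T1]{fontenc}

\begin{document}

% ^^e2^^80^^a8 is the UTF-8 byte sequence of U+2028 in ^^ notation.
% With a single-byte input encoding each byte is typeset on its own,
% using whatever glyph sits in that slot of the T1 font.
U+2028 as raw bytes : ^^e2^^80^^a8 x

\end{document}
```

The same experiment for U+2008 would use the bytes `^^e2^^80^^88`.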

If you add

    \usepackage[utf8]{inputenc}

you get behaviour that is easier to understand:

    \documentclass{article}
    \usepackage[T1]{fontenc}
    \usepackage[utf8]{inputenc}
    \begin{document}

    U+2008 Punctuation space : x

    U+2028 Line separator :     x

    \end{document}

which produces the terminal output:

    ! Package inputenc Error: Unicode char   (U+2008)
    (inputenc)                not set up for use with LaTeX.

    ..

    ! Package inputenc Error: Unicode char  (U+2028)
    (inputenc)                not set up for use with LaTeX.
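
The error reports that the characters are "not set up", and inputenc provides `\DeclareUnicodeCharacter` for setting them up. The following is a minimal sketch, assuming pdfLaTeX. The two mappings (U+2008 to a thin space, U+2028 to an ordinary interword space) are illustrative choices and not part of the answer above. The body writes both characters as their UTF-8 bytes in `^^` notation so that they stay visible in the source.

```latex
\documentclass{article}
\usepackage[T1]{fontenc}
\usepackage[utf8]{inputenc}

% Assumption: print U+2008 (PUNCTUATION SPACE) as a thin space
% and U+2028 (LINE SEPARATOR) as an ordinary interword space.
\DeclareUnicodeCharacter{2008}{\,}
\DeclareUnicodeCharacter{2028}{\ }

\begin{document}

% E2 80 88 and E2 80 A8 are the UTF-8 bytes of U+2008 and U+2028.
U+2008 Punctuation space :^^e2^^80^^88x

U+2028 Line separator :^^e2^^80^^a8x

\end{document}
```

If the characters got into the file by accident, for example through copy and paste, deleting them from the source is probably the simpler fix.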
