如何逐行读取文件并将每行存储到数组中？

Question 1

使用并不难。我定义了一个接受两个参数的expl3命令\ReadFile

\ReadFile{\myarray}{somefile.dat}

第一个参数是控制序列名称，第二个参数是要读取的文件。这样，文件将被读入，命令\myarray将被定义，以便

\myarray{2}

产生第二项；特殊调用\myarray{*}将返回项数。您还可以调用\myarray{-1}来访问最后的物品。

\documentclass{article}
\usepackage{xparse}

\ExplSyntaxOn
\ior_new:N \g_hringriin_file_stream

\NewDocumentCommand{\ReadFile}{mm}
 {
  \hringriin_read_file:nn { #1 } { #2 }
  \cs_new:Npn #1 ##1
   {
    \str_if_eq:nnTF { ##1 } { * }
      { \seq_count:c { g_hringriin_file_ \cs_to_str:N #1 _seq } }
      { \seq_item:cn { g_hringriin_file_ \cs_to_str:N #1 _seq } { ##1 } }
   }
 }

\cs_new_protected:Nn \hringriin_read_file:nn
 {
  \ior_open:Nn \g_hringriin_file_stream { #2 }
  \seq_gclear_new:c { g_hringriin_file_ \cs_to_str:N #1 _seq }
  \ior_map_inline:Nn \g_hringriin_file_stream
   {
    \seq_gput_right:cx 
     { g_hringriin_file_ \cs_to_str:N #1 _seq }
     { \tl_trim_spaces:n { ##1 } }
   }
  \ior_close:N \g_hringriin_file_stream
 }

\ExplSyntaxOff

\begin{document}

\ReadFile{\myarray}{somearray.dat}

\myarray{*}

\myarray{1}

\myarray{2}

\myarray{3}

\myarray{-1}

\myarray{-2}

\myarray{-3}

\end{document}

如果文件somearray.dat是

And now for something completely different 
1 2 3 4 5 6 7 8
a bc def ghij

（我使用了与 Christian 相同的方法），结果将是

Answer

使用并不难。我定义了一个接受两个参数的expl3命令\ReadFile

\ReadFile{\myarray}{somefile.dat}

第一个参数是控制序列名称，第二个参数是要读取的文件。这样，文件将被读入，命令\myarray将被定义，以便

\myarray{2}

产生第二项；特殊调用\myarray{*}将返回项数。您还可以调用\myarray{-1}来访问最后的物品。

\documentclass{article}
\usepackage{xparse}

\ExplSyntaxOn
\ior_new:N \g_hringriin_file_stream

\NewDocumentCommand{\ReadFile}{mm}
 {
  \hringriin_read_file:nn { #1 } { #2 }
  \cs_new:Npn #1 ##1
   {
    \str_if_eq:nnTF { ##1 } { * }
      { \seq_count:c { g_hringriin_file_ \cs_to_str:N #1 _seq } }
      { \seq_item:cn { g_hringriin_file_ \cs_to_str:N #1 _seq } { ##1 } }
   }
 }

\cs_new_protected:Nn \hringriin_read_file:nn
 {
  \ior_open:Nn \g_hringriin_file_stream { #2 }
  \seq_gclear_new:c { g_hringriin_file_ \cs_to_str:N #1 _seq }
  \ior_map_inline:Nn \g_hringriin_file_stream
   {
    \seq_gput_right:cx 
     { g_hringriin_file_ \cs_to_str:N #1 _seq }
     { \tl_trim_spaces:n { ##1 } }
   }
  \ior_close:N \g_hringriin_file_stream
 }

\ExplSyntaxOff

\begin{document}

\ReadFile{\myarray}{somearray.dat}

\myarray{*}

\myarray{1}

\myarray{2}

\myarray{3}

\myarray{-1}

\myarray{-2}

\myarray{-3}

\end{document}

如果文件somearray.dat是

And now for something completely different 
1 2 3 4 5 6 7 8
a bc def ghij

（我使用了与 Christian 相同的方法），结果将是

Question 2

Latex 没有任何特别好的数组数据类型——特别是在普通语言中，你会期望有一些常数时间操作，一些线性时间。在 Latex 中，在开头或结尾插入和连接可能是常数时间，但所有其他操作都是通过在列表上映射一些函数来完成的，因此是线性时间。

标准 LaTeX 数组看起来像\def\myarray{\\{entry1}\\{entry2}\\{entry3}...}.这样，适合连接和映射。您可以通过以下方式将另一个函数映射到它上面：\def\\#1{do something with #1} \myarray.

如果你想存储这样的行，你会说：

\newtoks\temptoksi
\newtoks\temptoksii
\def\myarray{}

\newread\file
    \openin\file=myfilename.txt
    \loop\unless\ifeof\file
        \read\file to\fileline % Reads a line of the file into \fileline
        \temptoksi\expandafter{\myarray}
        \temptoksii\expandafter{\fileline}
        \edef\myarray{\the\temptoksi\\{\the\temptoksii}}
    \repeat
\closein\file

其工作原理是 \edef 递归扩展其定义主体，然后将 \myarray 设置为等于结果。标记列表有点像没有参数的宏，只不过您通过说 \the\tokenlist 来调用它，并且标记列表的内容不会在 \edef 中递归扩展。因此，我在循环主体中添加的三行表示“设置为\temptoksi等于的内容\myarray，设置\temptoksii为的内容\fileline，然后设置\myarray为"old contents of myarray" \\{"contents of fileline"}。当然，为了便于阅读，您可以进行单独的定义：

\def\addtomacro#1#2{
    \temptoksi\expandafter{#1}\temptoksii\expandafter{#2}
    \edef#1{\the\temptoksi\\{\the\temptoksii}
}

然后您只需说 \addtomacro\myarray\fileline。

编辑：实际上有一种方法可以实现更好的数组，但它更技术性。基本上，这个想法是将数组的每个元素存储在不同的控制序列中，因此第一个元素将存储在名为“myarray1”的控制序列中，第二个元素存储在“myarray2”中，依此类推。这本质上是一个哈希数组，因为 latex 在内部哈希表中查找控制序列的定义。要使用这种数组的极简版本，您可以这样说：

\def\setarrayelt#1#2{\expandafter\xdef\csname array@#1@#2\endcsname}
\def\getarrayelt#1#2{\csname array@#1@#2\endcsname}

然后你可以说：

\newcount\arraylength
\arraylen=0
\newread\file
    \openin\file=myfilename.txt
    \loop\unless\ifeof\file
        \read\file to\fileline % Reads a line of the file into \fileline
        \setarrayelt{myarray}{\the\mycount}{\fileline}
        \advance\arraylength1
    \repeat
\closein\file

当然，您可以做更复杂的事情，例如存储长度\array@myarray@length，然后\getarrayelt如果给它的参数太大（或不是数字），则抛出越界错误。然后您可以添加宏，例如\addelttoendofarray,等。

Answer