从文件中分隔多个部分/行

Question 1

给定

$ cat thisthat 
THIS(
is first
Line);
THAT(
is second line);
THIS(
is third
line);
THAT(
is 
fourth
line);

然后

awk -vRS=';\n' 'BEGIN{ORS=RS} /^THIS/ {print > "these"} /^THAT/ {print > "those"}' thisthat

结果

$ head these those 
==> these <==
THIS(
is first
Line);
THIS(
is third
line);

==> those <==
THAT(
is second line);
THAT(
is 
fourth
line);

Answer

给定

$ cat thisthat 
THIS(
is first
Line);
THAT(
is second line);
THIS(
is third
line);
THAT(
is 
fourth
line);

然后

awk -vRS=';\n' 'BEGIN{ORS=RS} /^THIS/ {print > "these"} /^THAT/ {print > "those"}' thisthat

结果

$ head these those 
==> these <==
THIS(
is first
Line);
THIS(
is third
line);

==> those <==
THAT(
is second line);
THAT(
is 
fourth
line);

Question 2

对于任何 awk：

$ awk -v RS=';' 'NF{sub(/^\n/,""); print > (/^THIS/ ? "first_file" : "second_file")}' file

$ cat first_file
THIS(
is first
Line)
THIS(
is third
line)

$ cat second_file
THAT(
is second line)
THAT(
is
fourth
line)

或使用 GNU awk 实现多字符 RS 和 RT：

$ awk -v RS='(THIS|THAT)[^;]+;\n' -v ORS= '{$0=RT; print > (/^THIS/ ? "first_file" : "second_file")}' file

$ cat first_file
THIS(
is first
Line);
THIS(
is third
line);

$ cat second_file
THAT(
is second line);
THAT(
is
fourth
line);

两种解决方案都假设您的示例是准确的，并且;除了在块的末尾（例如不在部件内(...)）之外，您永远不会有 s 。

Answer

对于任何 awk：

$ awk -v RS=';' 'NF{sub(/^\n/,""); print > (/^THIS/ ? "first_file" : "second_file")}' file

$ cat first_file
THIS(
is first
Line)
THIS(
is third
line)

$ cat second_file
THAT(
is second line)
THAT(
is
fourth
line)

或使用 GNU awk 实现多字符 RS 和 RT：

$ awk -v RS='(THIS|THAT)[^;]+;\n' -v ORS= '{$0=RT; print > (/^THIS/ ? "first_file" : "second_file")}' file

$ cat first_file
THIS(
is first
Line);
THIS(
is third
line);

$ cat second_file
THAT(
is second line);
THAT(
is
fourth
line);

两种解决方案都假设您的示例是准确的，并且;除了在块的末尾（例如不在部件内(...)）之外，您永远不会有 s 。

Question 3

使用ed文件编辑器，但需要事先创建第二个（空）文件。原始文件file1在此示例中命名。

创建第二个文件名为file2

> file2

定理

ed -s file1 << 'EOF'
H
g/^THAT($/;/^.*\;$/d
w
u
g/^THIS($/;/^.*\;$/d
0r file2
w file2
EOF

使用heredocfile1提供的第一行eded -s file1 << 'EOF'

第二行打印对错误有用的更详细消息。H

第三行删除以 THAT( 开头的行，直到以 ; 结尾的行

g/^THAT($/;/^.*\;$/d

第四行将更改写入 file1w

第五行仅撤消缓冲区中的更改，而不是 file1 中的更改。u

第六行删除以 THIS( 开头的行，直到以 ; 结尾的行

g/^THIS($/;/^.*\;$/d

第七行将缓冲区中剩余的文本添加到 file2 的开头

0r file2

八行写入更改文件2。w file2

唯一的班轮。

>file2; printf '%s\n' H 'g/^THAT($/;/^.*\;$/d' w u 'g/^THIS($/;/^.*\;$/d' '0r file2' 'w file2' | ed -s file1

请注意，ed编辑文件in-place意味着文件将被直接编辑，并且不会将输出打印到其他地方，因此ed在生产文件中运行之前首先使用一些示例对其进行测试。

Answer