如何根据行首将文件拆分成两部分？

Question 1

使用grep：

grep -E '^-e' 1.txt >2.txt
grep -E '[^-]' 1.txt >3.txt

@braemar：使用grep -v相同的正则表达式会错误地检测空行、文本行等。这不是我们想要的。

Answer

使用grep：

grep -E '^-e' 1.txt >2.txt
grep -E '[^-]' 1.txt >3.txt

@braemar：使用grep -v相同的正则表达式会错误地检测空行、文本行等。这不是我们想要的。

Question 2

这是awk解决方案：

awk '{ if ( /^-/ ) print > "2.txt"; else if ( NF ) print > "3.txt" }' 1.txt

性能测试：

$ cat 1.txt | wc -l | sed -r -e 's/([0-9]{6}$)/ \1/' -e 's/([0-9]{3}$)/ \1 lines/'
1 144 270 lines
$ TIMEFORMAT=%R

$ time awk '{ if ( /^-/ ) print > "2.txt"; else if ( NF ) print > "3.txt" }' 1.txt
0.372

Answer

这是awk解决方案：

awk '{ if ( /^-/ ) print > "2.txt"; else if ( NF ) print > "3.txt" }' 1.txt

性能测试：

$ cat 1.txt | wc -l | sed -r -e 's/([0-9]{6}$)/ \1/' -e 's/([0-9]{3}$)/ \1 lines/'
1 144 270 lines
$ TIMEFORMAT=%R

$ time awk '{ if ( /^-/ ) print > "2.txt"; else if ( NF ) print > "3.txt" }' 1.txt
0.372

Question 3

保留空行：

$ sed -n -e '/^-e/{w 2.txt' -e 'd}' -e 'w 3.txt' 1.txt

给予

$ head {1,2,3}.txt
==> 1.txt <==
-e a
b
-e c

d
-e e
f

==> 2.txt <==
-e a
-e c
-e e

==> 3.txt <==
b

d
f

如果您希望省略空行，请在最后写入时添加“任何字符”正则表达式：

sed -n -e '/^-e/{w 2.txt' -e 'd}' -e '/./w 3.txt' 1.txt

Answer

保留空行：

$ sed -n -e '/^-e/{w 2.txt' -e 'd}' -e 'w 3.txt' 1.txt

给予

$ head {1,2,3}.txt
==> 1.txt <==
-e a
b
-e c

d
-e e
f

==> 2.txt <==
-e a
-e c
-e e

==> 3.txt <==
b

d
f

如果您希望省略空行，请在最后写入时添加“任何字符”正则表达式：

sed -n -e '/^-e/{w 2.txt' -e 'd}' -e '/./w 3.txt' 1.txt

Question 4

以下是sed使用delete 标志的解决方案：

sed -e '/^-/!d' -e '/^[[:space:]]*$/d' 1.txt > 2.txt

上述命令有两个正则表达式，第一个'/^-/!d'将匹配所有不以开头的行-，并且它们将从输出中删除，第二个'/^[[:space:]]*$/d'将匹配所有仅包含空格的行，并且它们将从输出中删除。

sed -e '/^-/d' -e '/^[[:space:]]*$/d' 1.txt > 3.txt

上述命令也有两个正则表达式，第一个'/^-/d'将匹配以开头的所有行-，并且它们将从输出中删除，第二个与预览情况相同。

另一种方法是保留-n正常输出sed，然后p仅打印匹配的行：

sed -n '/^-/p' 1.txt > 2.txt

sed -n -r '/^(-|[[:space:]]*$)/!p' 1.txt > 3.txt

以下是性能测试：

$ cat 1.txt | wc -l | sed -r -e 's/([0-9]{6}$)/ \1/' -e 's/([0-9]{3}$)/ \1 lines/'
1 144 270 lines
$ TIMEFORMAT=%R

$ time sed -e '/^-/!d' -e '/^[[:space:]]*$/d' 1.txt > 2.txt
0.357
$ time sed -e '/^-/d' -e '/^[[:space:]]*$/d' 1.txt > 3.txt
0.323

$ time sed -n '/^-/p' 1.txt > 2.txt
0.221
$ time sed -n -r '/^(-|[[:space:]]*$)/!p' 1.txt > 3.txt
0.402

Answer