Grepping for a string, but including all non-blank lines following each grep match

Answer 1

Use awk instead of grep:

awk '/FOO/ { if (matching) printf("\n"); matching = 1 }
     /^$/  { if (matching) printf("\n"); matching = 0 }
     matching' file

A version that enumerates the matches:

awk 'function flush_print_maybe() {
         if (matching) printf("Match %d\n%s\n\n", ++n, buf)
         buf = ""
     }
     /FOO/ { flush_print_maybe(); matching = 1 }
     /^$/  { flush_print_maybe(); matching = 0 }
     matching { buf = (buf == "" ? $0 : buf ORS $0) }
     END   { flush_print_maybe() }' file

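Both of these awk programs use a very simple "state machine" to determine whether they are currently matching. A line matching the pattern FOO puts the program into the matching state, and a line matching the pattern ^$ (an empty line) puts it into the non-matching state.

The empty lines between sets of matched data are output at state transitions that start from the matching state (whether the program re-enters matching or goes to non-matching).

The first program prints every line it reads while in the matching state.

The second program collects lines in the buf variable while in the matching state. At each state transition (the points where the first program would output an empty line) it prints buf together with a Match N label if it was in the matching state, and then flushes (empties) buf.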

Output of this last program on the sample data:

Match 1
this line contains FOO
this line is not blank

Match 2
This line also contains FOO

Match 3
This line contains FOO too
Not blank
Also not blank

Match 4
FOO!
Yet more random text

Match 5
FOO!
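
The sample data itself is not reproduced in this answer. As a rough sketch, the input below is an assumption, reconstructed so that it is consistent with the output listed above; the temporary file and the variable names are likewise only illustrative. It runs the first (non-enumerating) program on that input:

```shell
#!/bin/sh
# Hypothetical sample input; the original data file is not shown in this answer.
sample=$(mktemp) || exit 1
trap 'rm -f "$sample"' EXIT

cat >"$sample" <<'EOF'
some random text
this line contains FOO
this line is not blank

this line is not part of any match

This line also contains FOO

Some random text

This line contains FOO too
Not blank
Also not blank

More random text
FOO!
Yet more random text
FOO!
EOF

# First program: print lines while in the "matching" state, and emit one
# empty line whenever a FOO line or an empty line is read in that state.
output=$(awk '/FOO/ { if (matching) printf("\n"); matching = 1 }
              /^$/  { if (matching) printf("\n"); matching = 0 }
              matching' "$sample")

printf '%s\n' "$output"
```

On this input the result should be the same five blocks as in the listing above, each separated by a single empty line and without the Match N labels.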

Answer 2

sed -ne '/FOO/{x;P;x};/FOO/,/^$/p' testfile

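Each block of non-empty lines in the output is a single block of matched data from the input. The number of empty lines between the blocks varies.

This command:

suppresses the default output (-n); then
prints an empty line before each occurrence of "FOO" (/FOO/{x;P;x}, using the empty hold space);
selects the ranges of lines starting at a line containing FOO (/FOO/) and ending at an empty line (/^$/); and finally
prints those lines (p).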

this line contains FOO
this line is not blank


This line also contains FOO


This line contains FOO too
Not blank
Also not blank


FOO!
Yet more random text

FOO!
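
Because the number of empty lines between blocks varies, one possible follow-up is to pipe the sed output through a small awk filter that collapses each run of empty lines into one and drops any empty lines before the first block. This filter is not part of the original answer, and the short input is a hypothetical example, so treat it as a sketch:

```shell
#!/bin/sh
# Hypothetical input, fed on stdin so that no file is needed.
input='intro
FOO one
more

skip me

FOO two
tail
FOO three'

# Run the sed command from the answer, then squeeze runs of empty lines:
# an empty line is only remembered, and a single one is printed before the
# next non-empty line, provided some output has already been produced.
output=$(printf '%s\n' "$input" |
    sed -ne '/FOO/{x;P;x};/FOO/,/^$/p' |
    awk '/^$/ { blank = 1; next }
         { if (blank && seen) print ""; blank = 0; seen = 1; print }')

printf '%s\n' "$output"
```

With this filter in place, every block is separated from the next by exactly one empty line, whether or not the block ended at an empty line in the input.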

Answer 3

I don't think this is doable with grep, but it is with AWK:

#! /usr/bin/awk -f

/FOO/ {
  matched = 1
  if (notfirst) print ""
  notfirst = 1
}

/^$/ {
  matched = 0
}

matched

With match counts:

#! /usr/bin/awk -f

/FOO/ {
  matched = 1
  if (matches) print ""
  printf "Match %d\n", ++matches
}

/^$/ {
  matched = 0
}

matched

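In both cases, the first two blocks determine whether the current record should be copied to the output. When the current record matches "FOO", the first block sets matched to 1 and, if necessary, outputs a blank record (to separate the upcoming output from the previous match); in the second variant, it also increments the matches counter and outputs a header. When the current record is empty, the second block sets matched to 0. The lone matched condition prints the current record if matched is 1.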
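
As a usage sketch, the counting variant can be stored in a file and run with awk -f, which avoids relying on the /usr/bin/awk path in the shebang line. The temporary file and the input lines below are hypothetical and serve only as illustration:

```shell
#!/bin/sh
# Store the counting variant in a temporary script file (hypothetical location).
script=$(mktemp) || exit 1
trap 'rm -f "$script"' EXIT

cat >"$script" <<'EOF'
/FOO/ {
  matched = 1
  if (matches) print ""
  printf "Match %d\n", ++matches
}

/^$/ {
  matched = 0
}

matched
EOF

# Hypothetical input: one block ended by an empty line, one ended by end of input.
output=$(printf '%s\n' 'noise' 'a FOO line' 'kept' '' 'dropped' 'FOO again' |
    awk -f "$script")

printf '%s\n' "$output"
```

Here the lines "noise" and "dropped" are skipped because matched is 0 when they are read, and the empty record before "Match 2" comes from the if (matches) print "" statement.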

Answer 4

awk '/FOO/{print "===match " ++i "==="} /FOO/,/^$/' file

===match 1===
this line contains FOO
this line is not blank

===match 2===
This line also contains FOO

===match 3===
This line contains FOO too
Not blank
Also not blank

===match 4===
FOO!
Yet more random text
===match 5===
FOO!

A similar variant in which FOO can easily be changed to something else:

awk -vpat=FOO '$0~pat{print "===match " ++i "==="} $0~pat,/^$/' file

Omitting the terminating empty line from the default print is left as an exercise for the reader ;-)
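
One possible approach to that exercise, offered only as a sketch and not as the answer's own solution, is to give the range pattern an explicit action that prints non-empty lines only. The input lines here are hypothetical:

```shell
#!/bin/sh
# pat holds the pattern to search for; the input lines are hypothetical.
output=$(printf '%s\n' 'x' 'FOO 1' 'y' '' 'z' 'FOO 2' |
    awk -v pat=FOO '
        $0 ~ pat       { print "===match " (++i) "===" }
        # Inside a range, print only non-empty lines, which skips the
        # empty line that terminates the range.
        $0 ~ pat, /^$/ { if ($0 != "") print }')

printf '%s\n' "$output"
```

Since a range can only end at an empty line, the $0 != "" test removes exactly the terminating line and nothing else; the headers then serve as the only separators between blocks.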
