sed on OS X - extract all text between square brackets

Question 1

awk also works well for this: use [ or ] as the field separator and print every even-numbered field:

awk -F '[][]' '{for (i=2; i<=NF; i+=2) {printf "%s ", $i}; print ""}' file
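To see why the even-numbered fields are the ones you want, it may help to print every field with its index. This is only an illustrative sketch: the sample line and the helper name show_fields are made up, since the asker's input is not shown here.

```shell
# Hypothetical sample line (not taken from the original question).
line='a [foo] b [bar] c'

# Split on either bracket and print each field with its index.
# show_fields is an illustrative helper name, not part of any tool.
show_fields() {
  awk -F '[][]' '{for (i = 1; i <= NF; i++) printf "%d=<%s>\n", i, $i}'
}

printf '%s\n' "$line" | show_fields
# 1=<a >
# 2=<foo>
# 3=< b >
# 4=<bar>
# 5=< c>
```

Text outside the brackets always lands in odd fields and text inside them in even fields, even when the line starts with [ (field 1 is then simply empty).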

With sed, I would write

sed -E 's/(^|\])[^[]*($|\[)/ /g' file
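That sed command replaces everything outside the brackets with a single space, so each output line starts and ends with a space. If that matters, a possible variant (a sketch only, using a made-up sample line and an illustrative function name) adds two cleanup expressions:

```shell
# Hypothetical sample input; the original data is not shown in the thread.
sample='a [foo] b [bar] c'

# Same substitution as above, plus two cleanup expressions that drop the
# single space left at the start and at the end of each line.
# extract_sed_ere is an illustrative name.
extract_sed_ere() {
  sed -E -e 's/(^|\])[^[]*($|\[)/ /g' -e 's/^ //' -e 's/ $//'
}

printf '%s\n' "$sample" | extract_sed_ere
# foo bar
```

Note that a line containing no brackets at all is reduced to an empty line by this variant.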

Question 2

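This matches whatever lies between an opening bracket and the first closing bracket that follows it, and repeats that for every pair on the line.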

$ sed 's/[^[]*\[\([^]]*\)\][^[]*/\1 /g' file
foo bar
gar har
uf gc br

Explanation:

sed '                      # start a sed script
        s/                 # start a substitute command
        [^[]*              # match all leading characters (except [)
        \[                 # match an explicit [
        \([^]]*\)          # capture text inside brackets.
        \]                 # match the closing ]
        [^[]*              # match trailing text (if any).
        /\1 /              # replace the whole match with the captured text plus a space.
        g                  # repeat for every match on the line.
       ' file              # close script. Apply to file.

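This adds a space after each match, so every line ends with a trailing space. If that must go, append a second expression that deletes it: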

sed -e 's/[^[]*\[\([^]]*\)\][^[]*/\1 /g' -e 's/ $//' file
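As a quick check, here is the two-expression command run against some made-up input (the sample text and the function name extract_sed_bre are assumptions for illustration, not the asker's data):

```shell
# Hypothetical sample input with text before, between and after the brackets.
sample='x [foo] y [bar]
[uf] a [gc] b [br] tail'

# The substitution from above followed by the trailing-space cleanup.
# extract_sed_bre is an illustrative name.
extract_sed_bre() {
  sed -e 's/[^[]*\[\([^]]*\)\][^[]*/\1 /g' -e 's/ $//'
}

printf '%s\n' "$sample" | extract_sed_bre
# foo bar
# uf gc br
```

Only basic regular expressions are used here, so no -E flag is needed.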

If you have GNU grep, this may help (it prints each capture on its own line).

grep -Po '\[\K[^]]*(?=])'

And if none of the above works, awk can do it too (gensub is a GNU awk extension, so this needs gawk):

awk '{print gensub(/[^[]*\[([^]]*)\][^[]*/,"\\1 ","g")}' file
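Since gensub is specific to GNU awk, it will typically not be available in the awk that ships with OS X. One possible fallback, sketched here with only POSIX awk functions (match, substr, sub), walks the line one bracket pair at a time. The sample line and the function name extract_posix_awk are made up for illustration.

```shell
# Hypothetical sample line (not from the original question).
sample='a [foo] b [bar] c'

# Collect the text of each [...] pair using only POSIX awk built-ins.
# extract_posix_awk is an illustrative name.
extract_posix_awk() {
  awk '{
    out = ""
    rest = $0
    # match() sets RSTART and RLENGTH to the next "[...]" in rest.
    while (match(rest, /\[[^]]*\]/)) {
      # Take the text between the brackets, without the brackets themselves.
      out = out substr(rest, RSTART + 1, RLENGTH - 2) " "
      # Continue searching after the closing bracket.
      rest = substr(rest, RSTART + RLENGTH)
    }
    sub(/ $/, "", out)   # drop the trailing space
    print out
  }'
}

printf '%s\n' "$sample" | extract_posix_awk
# foo bar
```

This is longer than the gensub one-liner, but it drops the text outside the brackets and leaves no trailing space.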

Question 3

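An idiomatic way to do this is with lookaround assertions (see e.g. https://www.regular-expressions.info/lookaround.html), but sed does not support them; they are only available in PCRE-compatible regex engines.

Since Perl should be available on macOS by default, it may be a viable alternative.

With Perl, you could say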

perl -pe 's/.+?(?<=\[)(.+?)(?=\]).+?/$1 /g'

(Note that this adds a space at the end of the line.)

For an explanation of the pattern, see https://regexr.com/41gi5.
