打印两个记住的模式，并且仅打印它们之间的非字母数字字符

Question 1

这在中很难做到sed（因为您需要对s///每个输入行的三个不同部分做不同的事情 - 什么都不做，用修改，然后什么也不做），但在中很容易做到perl。

$ perl -lne '($first,$middle,$last) = (/({[^}]*})([^@]*)(@.*)/);
             $middle =~ s/[[:alnum:]]+//g;
             print $first, $middle, $last' file 
{string-no1}@string-no2@
{AAAAAAAAAA},.£@ZZZZZZZZZZ@
{GGGGGGGGGG}&:?@@@@@@@@@@@@

首先，它使用正则表达式将输入行的第一部分、中间部分和最后部分提取到适当命名的变量中。然后它从 $middle 中删除所有字母数字字符。然后它打印它们。

Answer

这在中很难做到sed（因为您需要对s///每个输入行的三个不同部分做不同的事情 - 什么都不做，用修改，然后什么也不做），但在中很容易做到perl。

$ perl -lne '($first,$middle,$last) = (/({[^}]*})([^@]*)(@.*)/);
             $middle =~ s/[[:alnum:]]+//g;
             print $first, $middle, $last' file 
{string-no1}@string-no2@
{AAAAAAAAAA},.£@ZZZZZZZZZZ@
{GGGGGGGGGG}&:?@@@@@@@@@@@@

首先，它使用正则表达式将输入行的第一部分、中间部分和最后部分提取到适当命名的变量中。然后它从 $middle 中删除所有字母数字字符。然后它打印它们。

Question 2

您的尝试不起作用，因为中缀字符串（中间位）包含字母数字和非字母数字字符的混合。该中缀必须使用进行处理s/[[:alnum:]]//g，同时避免对前缀和后缀字符串执行相同的操作。

因此，您需要隔离变量中的中缀字符串，或者，在的情况下sed，在编辑缓冲区中，对其应用删除字母数字字符的操作，然后将前缀和后缀字符串重新应用到结果。

使用sed编辑脚本：

h
s/^{[^}]*}//
s/@[^@]*@$//
s/[[:alnum:]]//g
G
s/^\(.*\)\n\({[^}]*}\).*\(@[^@]*@\)$/\2\1\3/

测试：

$ sed -f script file
{string-no1}@string-no2@
{AAAAAAAAAA},.£@ZZZZZZZZZZ@
{GGGGGGGGGG}&:?@@@@@@@@@@@@

请注意，最后一行的中缀字符串实际上是

&:?@@@@@@@@@@

后缀是

@@

带注释的脚本：

# Remember the original line in the hold space.
h

# Remove the prefix and the suffix strings.
# The prefix is "{...}" at the start of the line.
# The suffix is "@...@" at the end of the line.
# The interior of these strings does not contain
# the respective string terminator.
s/^{[^}]*}//
s/@[^@]*@$//

# We are left with the isolated infix portion of the
# original line. Remove the alphanumerical characters
# from this. This creates the final infix string.
s/[[:alnum:]]//g

# Append the original line from the hold space to the end of
# the infix string with a newline (\n) as the delimiter.
G

# Match the modified infix, prefix, and suffix only, and
# substitute the entire buffer with these parts in the
# correct order.
s/^\(.*\)\n\({[^}]*}\).*\(@[^@]*@\)$/\2\1\3/

Answer

您的尝试不起作用，因为中缀字符串（中间位）包含字母数字和非字母数字字符的混合。该中缀必须使用进行处理s/[[:alnum:]]//g，同时避免对前缀和后缀字符串执行相同的操作。

因此，您需要隔离变量中的中缀字符串，或者，在的情况下sed，在编辑缓冲区中，对其应用删除字母数字字符的操作，然后将前缀和后缀字符串重新应用到结果。

使用sed编辑脚本：

h
s/^{[^}]*}//
s/@[^@]*@$//
s/[[:alnum:]]//g
G
s/^\(.*\)\n\({[^}]*}\).*\(@[^@]*@\)$/\2\1\3/

测试：

$ sed -f script file
{string-no1}@string-no2@
{AAAAAAAAAA},.£@ZZZZZZZZZZ@
{GGGGGGGGGG}&:?@@@@@@@@@@@@

请注意，最后一行的中缀字符串实际上是

&:?@@@@@@@@@@

后缀是

@@

带注释的脚本：

# Remember the original line in the hold space.
h

# Remove the prefix and the suffix strings.
# The prefix is "{...}" at the start of the line.
# The suffix is "@...@" at the end of the line.
# The interior of these strings does not contain
# the respective string terminator.
s/^{[^}]*}//
s/@[^@]*@$//

# We are left with the isolated infix portion of the
# original line. Remove the alphanumerical characters
# from this. This creates the final infix string.
s/[[:alnum:]]//g

# Append the original line from the hold space to the end of
# the infix string with a newline (\n) as the delimiter.
G

# Match the modified infix, prefix, and suffix only, and
# substitute the entire buffer with these parts in the
# correct order.
s/^\(.*\)\n\({[^}]*}\).*\(@[^@]*@\)$/\2\1\3/

Question 3

使用perl，您还可以执行以下操作：

perl -lne 'print /^\{.*?\}|@.*|\W/g' < your-file

\W匹配除 alnum 和下划线之外的字符（默认情况下仅匹配 ASCII 字符）。如果您希望包含下划线，则可以替换为[^a-zA-Z0-9]或。[^[:alnum:]]

使用，您可以在循环中sed删除第一个}和之后第一个之间的 alnum 字符：@

sed -e :1 -e 's/^\([^}]*}[^@]*\)[[:alnum:]]/\1/; t1' < your-file

对于sed，[[:alnum:]]是在语言环境中进行分类的，并且文本根据语言环境的字符集进行解码，而perl默认情况下，文本被解释为好像在 iso8859-1 中编码，并且[[:alnum:]]仅与 ASCII 数字匹配（只要您不这样做） t 添加/u标志）。

通过将区域设置固定为( )，您可以获得类似于perls in 的行为，以及通过添加选项来获得类似于s in 的行为，该选项将根据区域设置字符集解码字符并使用 Unicode 属性（而不是区域设置分类）对字符进行分类。sedCLC_ALL=C sed...sedperl-Mopen=locale

Answer