正则表达式匹配单个字符实例

Question 1

$ sed 'h;s/@@[^@ ]*@@//g;/@/!d;g' file
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

该sed命令删除有效的变量占位符并报告仍包含字符的行@。它还会查找包含@两侧都有两个以上占位符的行。

我们可以通过将每行保存到保留空间来报告原始的故障行h。然后运行删除潜在有效占位符的替换，如果之后不包含任何@字符，我们将删除该行。我们从保留空间中获取原始行，g如果存在则打印它。

如果您的变量遵循与大多数编程语言相同的命名规则，则可以将有效占位符的模式@@[^@ ]*@@更改为。@@[[:alpha:]_][[:alnum:]_]*@@

假设您需要能够@在文本本身中包含字符。在这种情况下，您需要@在上面命令中的替换之前删除所有可能出现的不是变量占位符的有效星座。

一种更系统的方法是提取包含占位符的行，@其中一侧或另一侧有太多字符，删除正确的占位符，然后拉出占位符@在变量名称的两侧仅包含一个字符的行。

sed -e '/@\{3,\}[^@ ]*@\{1,\}/b' \
    -e '/@\{1,\}[^@ ]*@\{3,\}/b' \
    -e h \
    -e 's/@@[^@ ]*@@//g' \
    -e '/@[^@ ]*@/!d' \
    -e g file

上面的内容将允许您的文本在其他地方包含@字符，前提是它们不会以看起来像占位符的模式出现。

Answer

$ sed 'h;s/@@[^@ ]*@@//g;/@/!d;g' file
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

该sed命令删除有效的变量占位符并报告仍包含字符的行@。它还会查找包含@两侧都有两个以上占位符的行。

我们可以通过将每行保存到保留空间来报告原始的故障行h。然后运行删除潜在有效占位符的替换，如果之后不包含任何@字符，我们将删除该行。我们从保留空间中获取原始行，g如果存在则打印它。

如果您的变量遵循与大多数编程语言相同的命名规则，则可以将有效占位符的模式@@[^@ ]*@@更改为。@@[[:alpha:]_][[:alnum:]_]*@@

假设您需要能够@在文本本身中包含字符。在这种情况下，您需要@在上面命令中的替换之前删除所有可能出现的不是变量占位符的有效星座。

一种更系统的方法是提取包含占位符的行，@其中一侧或另一侧有太多字符，删除正确的占位符，然后拉出占位符@在变量名称的两侧仅包含一个字符的行。

sed -e '/@\{3,\}[^@ ]*@\{1,\}/b' \
    -e '/@\{1,\}[^@ ]*@\{3,\}/b' \
    -e h \
    -e 's/@@[^@ ]*@@//g' \
    -e '/@[^@ ]*@/!d' \
    -e g file

上面的内容将允许您的文本在其他地方包含@字符，前提是它们不会以看起来像占位符的模式出现。

Question 2

$ grep -E '(^|[^@])@([^@]|$)|@@@' file
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

或者：

$ awk '/(^|[^@])@([^@]|$)|@@@/' file
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

或一次分析一个字段：

$ cat tst.awk
{
    for (i=1; i<=NF; i++) {
        if ( $i ~ /^@[^@]|[^@]@$|@@@/ ) {
            print "Failed line:", NR, $0
            print "\tbecause of field", i, $i
        }
    }
}

$ awk -f tst.awk file
Failed line: 2 @Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
        because of field 1 @Var1@@
Failed line: 3 @@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
        because of field 1 @@Var1@
Failed line: 4 @@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
        because of field 5 @Var2@@
Failed line: 5 @@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.
        because of field 5 @@Var2@

您不需要任何额外的东西来查找@@@案例，上面也包括查找该案例。

Answer

$ grep -E '(^|[^@])@([^@]|$)|@@@' file
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

或者：

$ awk '/(^|[^@])@([^@]|$)|@@@/' file
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

或一次分析一个字段：

$ cat tst.awk
{
    for (i=1; i<=NF; i++) {
        if ( $i ~ /^@[^@]|[^@]@$|@@@/ ) {
            print "Failed line:", NR, $0
            print "\tbecause of field", i, $i
        }
    }
}

$ awk -f tst.awk file
Failed line: 2 @Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
        because of field 1 @Var1@@
Failed line: 3 @@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
        because of field 1 @@Var1@
Failed line: 4 @@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
        because of field 5 @Var2@@
Failed line: 5 @@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.
        because of field 5 @@Var2@

您不需要任何额外的东西来查找@@@案例，上面也包括查找该案例。

Question 3

另一种解决方案是grep -E.正则表达式也适用于 awk

grep -E '[^@]@[^@]|^@[^@]|@[^@]$' tmpfile
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

awk '/[^@]@[^@]|^@[^@]|@[^@]$/' tmpfile
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

Answer

另一种解决方案是grep -E.正则表达式也适用于 awk

grep -E '[^@]@[^@]|^@[^@]|@[^@]$' tmpfile
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

awk '/[^@]@[^@]|^@[^@]|@[^@]$/' tmpfile
@Var1@@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@ words words words @@Var2@@   #This will fail because Var1 is wrong.
@@Var1@@ words words words @Var2@@   #This will fail because Var2 is wrong.
@@Var1@@ words words words @@Var2@   #This will fail because Var2 is wrong.

Question 4

在支持的情况下，您可以使用负环视运算符grep -P来匹配单个s 或不被 s 包围的@3 个或更多 s 的序列：@@

<test.txt grep --color -P '(?<!@)(@|@{3,})(?!@)'

然而，这仍然会标记@@@@in@@var1@@@@var2@@并且无法标记不匹配的@@s，如 in@@var1或@@var1@@var2@@。

另一种方法是：

<test.txt grep --color -P '@@\w+@@(*SKIP)(*FAIL)|@+'

这将标记@不属于@@word@@序列的部分。

$$ <test.txt grep --color -P '@@\w+@@(*SKIP)(*FAIL)|@+' @Var1@@ Words Words @@Var2@@ #这会失败，因为 Var1 是错误的。 @@Var1@wordswords@@Var2@@ #这会失败，因为 Var1 是错误的。 @@Var1@@wordswords@Var2@@#这将失败，因为 Var2 是错误的。 @@Var1@@wordswords@@Var2@#这将失败，因为Var2是错误的。$

Answer