grep 标记周围的单词

Question 1

使用 GNU grep：

start cmd:> echo "This is one word1:word2 of the lines" |
  grep -Eo '[[:alnum:]]+:[[:alnum:]]+'
word1:word2

start cmd:> echo "This is one wordx:wordy of the lines" |
  grep -Eo '[[:alpha:]]*:[[:alpha:]]*'
wordx:wordy

start cmd:> echo "This is one wo_rdx:wo_rdy of the lines" |
  grep -Eo '[[:alpha:]_]*:[[:alpha:]_]*'
wo_rdx:wo_rdy

Answer

使用 GNU grep：

start cmd:> echo "This is one word1:word2 of the lines" |
  grep -Eo '[[:alnum:]]+:[[:alnum:]]+'
word1:word2

start cmd:> echo "This is one wordx:wordy of the lines" |
  grep -Eo '[[:alpha:]]*:[[:alpha:]]*'
wordx:wordy

start cmd:> echo "This is one wo_rdx:wo_rdy of the lines" |
  grep -Eo '[[:alpha:]_]*:[[:alpha:]_]*'
wo_rdx:wo_rdy

Question 2

POSIXly（尽管要注意某些tr实现（例如 GNU 的）不能正确处理多字节字符）。

tr -s '[:space:]_' '[\n*]' << 'EOF' |
  grep -xE '[[:alnum:]_]+:[[:alnum:]_]+'
This is one word1:word2 of the lines and another is word:word   
This is another word3:word4 of the lines  and this is not wordnot::wordnot
Line without a match    
Yet another line word5:word6 for test
This is one wo_rdx:wo_rdy of the lines
This is one wordx:wordy of the lines
not/a:match
EOF

给出：

word1:word2
word:word
word3:word4
word5:word6
rdx:wo
wordx:wordy

Answer

POSIXly（尽管要注意某些tr实现（例如 GNU 的）不能正确处理多字节字符）。

tr -s '[:space:]_' '[\n*]' << 'EOF' |
  grep -xE '[[:alnum:]_]+:[[:alnum:]_]+'
This is one word1:word2 of the lines and another is word:word   
This is another word3:word4 of the lines  and this is not wordnot::wordnot
Line without a match    
Yet another line word5:word6 for test
This is one wo_rdx:wo_rdy of the lines
This is one wordx:wordy of the lines
not/a:match
EOF

给出：

word1:word2
word:word
word3:word4
word5:word6
rdx:wo
wordx:wordy

Question 3

对于您想要的结果的所有情况，您可以使用grep带有 PCRE support( -P) 的 GNU 及其单词正则表达式 ( \w)，如下所示：

grep -oP '\w+:\w+' file

输入文件：

This is one word1:word2 of the lines and another is word:word   
This is another word3:word4 of the lines  and this is not wordnot::wordnot
Line without a match    
Yet another line word5:word6 for test
This is one wo_rdx:wo_rdy of the lines
This is one wordx:wordy of the lines

输出：

word1:word2
word:word
word3:word4
word5:word6
wo_rdx:wo_rdy
wordx:wordy

正如您所看到的，与模式grep不匹配，因为它本身之间wordnot::wordnot有额外的内容。:

Answer

对于您想要的结果的所有情况，您可以使用grep带有 PCRE support( -P) 的 GNU 及其单词正则表达式 ( \w)，如下所示：

grep -oP '\w+:\w+' file

输入文件：

This is one word1:word2 of the lines and another is word:word   
This is another word3:word4 of the lines  and this is not wordnot::wordnot
Line without a match    
Yet another line word5:word6 for test
This is one wo_rdx:wo_rdy of the lines
This is one wordx:wordy of the lines

输出：

word1:word2
word:word
word3:word4
word5:word6
wo_rdx:wo_rdy
wordx:wordy

正如您所看到的，与模式grep不匹配，因为它本身之间wordnot::wordnot有额外的内容。:

Question 4

通过 grep，

grep -oP '[^:\s]+:[^:\s]+' file

或者

grep -oP '\S+?:\S+' file

上面的命令不仅获取字符串foo:bar，而且还获取?foo:bar?

Answer

通过 grep，

grep -oP '[^:\s]+:[^:\s]+' file

或者

grep -oP '\S+?:\S+' file

上面的命令不仅获取字符串foo:bar，而且还获取?foo:bar?

grep 标记周围的单词

答案1

答案2

答案3

答案4

相关内容