Using awk to find matches and extract the characters before each match - help!

Answer 1

Here is a sample awk script.

 awk '/..WAP../{print substr($0, index($0,"WAP") - 2, 7);}' input.csv

Sample input:

junk
line 1 12WAP34 678
another line  abWAPcdefg
WAP123
junk WAP

Output:

12WAP34
abWAPcd

Explanation:

/..WAP../ {                                 # for lines containing WAP with 2 characters on each side
    wapPosition = index($0, "WAP") - 2;     # position of the first WAP, minus 2 characters
    output = substr($0, wapPosition, 7);    # 7 characters starting at wapPosition
    print output;                           # print the result
}
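
One caveat with the script above: index returns the position of the first WAP on the line, which is not necessarily the occurrence that /..WAP../ matched (for example when the line starts with WAP and a second, properly wrapped WAP appears later). A minimal sketch of a variant that lets match locate the wrapped occurrence itself, using only POSIX awk features, might look like this. The function name extract_wap is a hypothetical helper chosen for illustration.

```shell
# extract_wap is a hypothetical helper name, not part of the original answer.
# match() sets RSTART and RLENGTH to the start and length of the leftmost
# substring matching /..WAP../, so substr() returns exactly that 7-char span.
extract_wap() {
    awk 'match($0, /..WAP../) { print substr($0, RSTART, RLENGTH) }' "$@"
}

# Same sample input as above; prints 12WAP34 and abWAPcd.
printf '%s\n' 'junk' 'line 1 12WAP34 678' 'another line  abWAPcdefg' 'WAP123' 'junk WAP' |
    extract_wap
```

Because the extraction uses the position reported by match rather than a separate index lookup, the printed text is always the span the pattern actually matched.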

Answer 2

With GNU Awk, you can use capture groups in the match function and access their contents through its optional array argument:

$ echo ',x,x,x,x,x,xx,Yes,"1 WAP, other stuff, other stuff",no,x' | 
    awk 'match($0,/([0-9]).WAP/,a) {print a[1]}'
1

Alternatively, without relying on GNU extensions, you can combine match with substr:

awk 'match($0,/[0-9].WAP/) {print substr($0,RSTART,1)}'
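
To make the role of RSTART clearer, here is a small sketch that prints the values match sets before extracting the character. It assumes nothing beyond POSIX awk; the function name show_match and the short test string are made up for illustration.

```shell
# show_match is a hypothetical helper name used only for this illustration.
# After a successful match(), RSTART is the 1-based index where the match
# begins and RLENGTH is its length; substr($0, RSTART, 1) is the digit.
show_match() {
    awk 'match($0, /[0-9].WAP/) {
        printf "RSTART=%d RLENGTH=%d char=%s\n", RSTART, RLENGTH, substr($0, RSTART, 1)
    }' "$@"
}

echo 'ab 1 WAP' | show_match    # prints: RSTART=4 RLENGTH=5 char=1
```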

Answer 3

Assuming WAP can occur only once per line, I think this may be what you really want. Given this input file:

$ cat file
,x,x,x,x,x,xx,Yes,7,WAP,no,x
,x,x,x,x,x,xx,Yes,3 WAP,no,x
,x,x,x,x,x,xx,Yes,"1 WAP",no,x

With GNU awk:

$ awk 'match($0,/([0-9])[^,]WAP/,a){print a[1]}' file
3
1

With any awk:

$ awk 'match($0,/[0-9][^,]WAP/){print substr($0,RSTART,1)}' file
3
1
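
If the once-per-line assumption does not hold, one possible approach is to loop over the line, consuming it past each match. This is only a sketch under that different assumption, not something the answer above proposes; the function name wap_digits and the variable rest are hypothetical.

```shell
# wap_digits is a hypothetical helper name; it prints the digit in front of
# every "<digit><non-comma>WAP" occurrence on each input line.
wap_digits() {
    awk '{
        rest = $0
        # Repeatedly match on the unprocessed remainder of the line.
        while (match(rest, /[0-9][^,]WAP/)) {
            print substr(rest, RSTART, 1)          # the digit before WAP
            rest = substr(rest, RSTART + RLENGTH)  # drop text through this match
        }
    }' "$@"
}

# Prints 3 and then 1; "7,WAP" is skipped because a comma precedes WAP.
echo 'a,3 WAP,b,"1 WAP",7,WAP' | wap_digits
```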

