How to print the closest column when searching for a specific string

Question 1

I'm assuming you have a file containing the various strings you are looking for. Something like this:

fck=35 fmd
fck=78 fcv
bnv=12 fcv

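Each line pairs a search pattern with a target field. For each of these, you want to search the file and, if a line matches any of the patterns, you want the first value of the target (fmd, for example) that follows the matched string. If so, I would do something like this in perl: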

#!/usr/bin/env perl
use strict;
use warnings;

## Open the list of search patterns.
## The script expects it to be the 1st argument.
open(my $list, '<', $ARGV[0]) or die "Cannot open $ARGV[0]: $!\n";

## Read the file and save the patterns
## in the %pats hash.
my %pats;
while (<$list>) {
    ## remove trailing newlines
    chomp;
    ## separate the search pattern from the target
    my @fields=split(/\s+/);
    ## skip lines that lack a pattern or a target
    next unless @fields >= 2;

    ## Save the search pattern and its accompanying
    ## target in the hash (%pats).
    $pats{$fields[0]}=$fields[1];
}
close($list);

## Open the file to search.
## The script expects it to be the 2nd argument.
open(my $file, '<', $ARGV[1]) or die "Cannot open $ARGV[1]: $!\n";

## Read the file
while (<$file>) {
    ## remove trailing newlines
    chomp;

    ## This is the string that will be printed for
    ## the current line.
    my $outstring="";
    ## Check each of the search patterns against the
    ## current line. The keys are sorted so that the
    ## output order is the same on every run.
    foreach my $pat (sort keys(%pats)) {
        ## Add the pattern to the outstring
        $outstring.="$pat;";
        ## Save every target value (e.g. fmd=...) that is the
        ## first one to follow an occurrence of this pattern.
        my @matches= ( /$pat.+?($pats{$pat}=[^;]+)/g );
        ## Add this pattern's matches to the output string.
        $outstring.= join(";",@matches) . ";";
    }
    ## Print the output string for this line
    print "$outstring\n";
}
close($file);

For example, if you save the script above as parser.pl in a directory in your $PATH and make it executable (chmod 755 ~/bin/parser.pl), you can run it like this:

$ parser.pl list.txt file.txt 
bnv=12;;fck=35;fmd=1422745568,;fck=78;;
bnv=12;;fck=35;fmd=1421428238,;fck=78;;
bnv=12;;fck=35;fmd=1421687191 fmd=1111111111;fck=78;fcv=de724a544277d79c14d19809fe51ab71;
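
To see what the script's regular expression picks out, the same lookup can be approximated in the shell for a single pattern/target pair. This is only a sketch: first_after is a hypothetical helper, the sample line is made up (the real file.txt is not shown here), and grep -o is assumed to be available (GNU and BSD grep both provide it).

```shell
#!/bin/sh
# Hypothetical helper: print the first "target=value" that follows
# the first occurrence of "pat" on the given line.
first_after() {
    pat=$1
    target=$2
    line=$3
    # Keep the part of the line that starts at the first occurrence of the pattern ...
    printf '%s\n' "$line" | grep -o -e "${pat}.*" |
    # ... list every target=value in that remainder and keep only the first one.
    grep -o -e "${target}=[^;]*" | head -n 1
}

# Made-up sample line; fields are separated by ';'.
line='abc=1;fmd=111;fck=35;xyz=2;fmd=222;fmd=333'

first_after 'fck=35' fmd "$line"   # prints fmd=222
```

The fmd=111 before fck=35 is ignored, and of the two values after it only the nearest one is printed. Unlike the Perl script, this sketch handles one line and one pattern at a time.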

Question 2

If you are looking for the closest column and you know the delimiter, this should be a simple task for grep and sed.

grep -e "fck=35"

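will return every whole line that fck=35 is on (append the file name, or feed the data on standard input). Then pipe that through two seds to get what you want.

grep -e "fck=35" | sed 's/.*fck=35;//g' | sed 's/;.*//g'

The first sed replaces everything before fck=35; and fck=35; itself with nothing (deleting it), and the second sed deletes the next delimiter and everything after it. The sed expressions must be quoted, because an unquoted ; would be treated by the shell as a command separator.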
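
As a quick check of this pipeline, here is a minimal sketch run against a made-up line. The sample data and the variable names are assumptions for illustration only.

```shell
#!/bin/sh
# Made-up sample line using ';' as the delimiter.
line='abc=1;fck=35;fmd=1422745568;xyz=2'

# Step 1: grep keeps only lines containing fck=35.
# Step 2: the first sed deletes everything up to and including "fck=35;".
# Step 3: the second sed deletes the next ';' and everything after it.
next_col=$(printf '%s\n' "$line" |
    grep -e 'fck=35' |
    sed 's/.*fck=35;//' |
    sed 's/;.*//')

printf '%s\n' "$next_col"   # prints fmd=1422745568
```

What is left is the column immediately to the right of fck=35, whatever its name happens to be.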

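But it sounds like you also want to be able to select a specific column (fmd) on the line, in which case you need something more like:

grep -e "fck=35" | sed 's/.*fmd=/fmd=/g' | sed 's/[;,].*//g'

This removes everything before "fmd=", then removes the next delimiter (or comma, which it seems you need to handle) and everything after it.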
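
The sketch below runs the fmd variant on two made-up lines. The data and variable names are assumptions. The second line illustrates one caveat: .* in sed is greedy, so when a line holds several fmd= fields this pipeline keeps the last one, not the one nearest to fck=35.

```shell
#!/bin/sh
# Made-up line with one fmd field whose value is followed by a stray comma.
line='abc=1;fck=35;xyz=2;fmd=1422745568,;fcv=9'
value=$(printf '%s\n' "$line" |
    grep -e 'fck=35' |
    sed 's/.*fmd=/fmd=/' |
    sed 's/[;,].*//')
printf '%s\n' "$value"   # prints fmd=1422745568

# Made-up line with several fmd fields: the greedy .* skips to the last one.
multi='fmd=111;fck=35;fmd=222;fmd=333'
last=$(printf '%s\n' "$multi" |
    grep -e 'fck=35' |
    sed 's/.*fmd=/fmd=/' |
    sed 's/[;,].*//')
printf '%s\n' "$last"    # prints fmd=333
```

If the lines can contain more than one fmd field, first cut away everything up to fck=35 (as in the earlier pipeline) and only then extract the fmd value.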

Answer