逐行合并两个文件中的信息

Question 1

作为一个难以辨认的 awk 语句

$ awk 'NR>1{a[0]=$3;a[1]=$4;getline<f;for(i=1;i<=NF;i++)$i=a[$i];print}' f=file2 file1
A C A C A A A C
C C T T C T T C
G A A A G G G A

更具可读性：

awk '
    # skip the header in file1
    NR == 1 {next}
    {
        # read the values from the file1 line
        a[0] = $3
        a[1] = $4

        # replace the current record with the corresponding line from the map file
        getline < map_file

        # and now substitute the 0/1 with the values
        for (i=1; i<=NF; i++)
            $i = a[$i]
        print
    }
' map_file=file2  file1

Answer

作为一个难以辨认的 awk 语句

$ awk 'NR>1{a[0]=$3;a[1]=$4;getline<f;for(i=1;i<=NF;i++)$i=a[$i];print}' f=file2 file1
A C A C A A A C
C C T T C T T C
G A A A G G G A

更具可读性：

awk '
    # skip the header in file1
    NR == 1 {next}
    {
        # read the values from the file1 line
        a[0] = $3
        a[1] = $4

        # replace the current record with the corresponding line from the map file
        getline < map_file

        # and now substitute the 0/1 with the values
        for (i=1; i<=NF; i++)
            $i = a[$i]
        print
    }
' map_file=file2  file1

Question 2

您可以完全执行此操作，awk但作为变体，这里有一个awk+paste解决方案。您需要bash或另一个支持进程替换的 shell

paste <(tail -n +2 file1) file2 | 
awk '{a["0"]=$3; a["1"]=$4; for (i=5; i<=NF; ++i) printf "%s%s", a[$i], i==NF?"\n": " "}'

需要tail -n +2跳过标题行file1。

Answer

您可以完全执行此操作，awk但作为变体，这里有一个awk+paste解决方案。您需要bash或另一个支持进程替换的 shell

paste <(tail -n +2 file1) file2 | 
awk '{a["0"]=$3; a["1"]=$4; for (i=5; i<=NF; ++i) printf "%s%s", a[$i], i==NF?"\n": " "}'

需要tail -n +2跳过标题行file1。

Question 3

#!/usr/bin/env perl
# TODO docs on usage here, or write perldocs below, etc.
use strict;
use warnings;

die "Usage: $0 headerfile datafile\n" if @ARGV != 2;

my ($headerfile, $datafile) = @ARGV;

open(my $hfh, '<', $headerfile) or die "could not open '$headerfile': $!\n";
open(my $dfh, '<', $datafile) or die "could not open '$datafile': $!\n";

readline $hfh; # skip the header line

my $lineno = 1;
while (!eof($hfh) and !eof($dfh)) {
  my $convert_to = join '', (split ' ', scalar readline $hfh)[-2,-1];
  die sprintf "no conversion at $headerfile:%d\n", $lineno+1
    if !defined $convert_to;

  $_ = readline $dfh;
  die "no data to convert at $datafile:$lineno\n" if !defined;

  eval "tr/01/$convert_to/, 1" or die $@;
  print;

  $lineno++;
}

Answer

#!/usr/bin/env perl
# TODO docs on usage here, or write perldocs below, etc.
use strict;
use warnings;

die "Usage: $0 headerfile datafile\n" if @ARGV != 2;

my ($headerfile, $datafile) = @ARGV;

open(my $hfh, '<', $headerfile) or die "could not open '$headerfile': $!\n";
open(my $dfh, '<', $datafile) or die "could not open '$datafile': $!\n";

readline $hfh; # skip the header line

my $lineno = 1;
while (!eof($hfh) and !eof($dfh)) {
  my $convert_to = join '', (split ' ', scalar readline $hfh)[-2,-1];
  die sprintf "no conversion at $headerfile:%d\n", $lineno+1
    if !defined $convert_to;

  $_ = readline $dfh;
  die "no data to convert at $datafile:$lineno\n" if !defined;

  eval "tr/01/$convert_to/, 1" or die $@;
  print;

  $lineno++;
}

逐行合并两个文件中的信息

答案1

答案2

答案3

相关内容