Removing values at different positions from a text file

Answer 1

If infile is:

0 -1 0.000532 -0.00026 0.000465 etc...
0 0.000294 1 -0.000102 -0.1146 etc...
0 -0.000134 0.0000967 1 -0.9972 etc...

delete holds the column numbers you want to remove from infile:

2 4 6

With awk, you can do this:

awk '
    # first file (delete): split its line on the default FS (whitespace) into to_delete
    NR==FNR { split($0, to_delete); next }
    # second file (infile): blank every field whose number is stored in to_delete
    { for (col in to_delete) $to_delete[col]=""; print }
' delete infile

This blanks columns 2, 4 and 6 of the file. The fields are emptied rather than removed, so the separators around them stay in the output:

0  0.000532  0.000465
0  1  -0.1146
0  0.0000967  -0.9972
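
Because the command above only empties the fields, the output keeps doubled separators. A minimal sketch of one way to squeeze them out, assuming whitespace-separated input with no fields that are meant to stay empty; the function name drop_blanked is hypothetical and only wraps the awk call:

```shell
# drop_blanked DELETE_FILE DATA_FILE  (hypothetical helper name)
drop_blanked() {
    awk '
        NR==FNR { split($0, to_delete); next }
        {
            for (col in to_delete) $to_delete[col] = ""
            $0 = $0    # re-split the record, so the blanked fields disappear
            $1 = $1    # rebuild the record with a single OFS between fields
            print
        }
    ' "$1" "$2"
}

# Usage: drop_blanked delete infile
```

With the sample delete and infile shown earlier, each output line then carries exactly one space between the remaining columns.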

Answer 2

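It sounds like this is what you're looking for:

awk '
NR==FNR {
    # first file: turn the listed column numbers into array keys,
    # because "i in del" tests indices, not stored values
    n = split($0, tmp)
    for (j=1; j<=n; j++) del[tmp[j]]
    next
}
{
    out = sep = ""
    for (i=1; i<=NF; i++) {
        if ( !(i in del) ) {
            out = out sep $i
            sep = OFS
        }
    }
    print out
}
' delete.txt mat.txt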
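
One detail worth checking in any script of this shape: split() stores the column numbers as values under the indices 1, 2, 3, ..., while awk's in operator tests indices. A membership test on column numbers therefore needs an array keyed by those numbers. The sketch below illustrates the difference; the names in_demo, arr and set are made up for the illustration:

```shell
# in_demo (hypothetical name): contrast index tests with value-keyed tests
in_demo() {
    awk 'BEGIN {
        n = split("2 4 6", arr)                    # arr[1]=2, arr[2]=4, arr[3]=6
        printf "%d %d\n", (2 in arr), (4 in arr)   # tests indices 1..3
        for (j = 1; j <= n; j++) set[arr[j]]       # referencing an element creates its key
        printf "%d %d\n", (4 in set), (3 in set)   # tests the column numbers themselves
    }'
}

in_demo
```

The first line printed is 1 0 (index 2 exists in arr, index 4 does not) and the second is 1 0 (4 is a listed column, 3 is not).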

Answer 3

Assuming delete.txt contains only one line, we can get the desired columns with the following code:

$ perl -psale '$. == 1 and 
   @indices2P = grep { my $c=$_+1; $d !~ /\b$c\b/ } 0 .. $#F;
   $_ = "@F[@indices2P]";
' -- -d="$(< delete.txt)" mat.txt

Result:

0 0.000532 0.000465
0 1 -0.1146
0 0.0000967 -0.9972

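Explanation:

The columns to delete are stored in the scalar variable $d, and while the first line of mat.txt is being read we compute the indices of the columns to print.

Those indices are then used to slice the array @F when printing each line.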

Answer 4

$ < delete.txt \
       tr -s ' \t' '\n\n' | sort -nru |
       sed -e 's|.*|s/\\s*\\S+//&|' |
       sed -Ef - mat.txt

Result:

0 0.000532 0.000465
0 1 -0.1146
0 0.0000967 -0.9972

Explanation:

Using GNU sed with extended regular expressions enabled, we first generate a sed script which, when applied to mat.txt, produces the desired output.

Assumptions:

o The file delete.txt comprises only positive integers and max value < 512
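
To see what the generated sed code looks like, the first three stages of the pipeline can be run on their own. This is only a sketch: the wrapper name gen_script is hypothetical. The column numbers are sorted in descending order so that removing a later column does not renumber the earlier ones.

```shell
# gen_script FILE  (hypothetical helper): print the sed script built from FILE
gen_script() {
    tr -s ' \t' '\n\n' < "$1" |        # one column number per line
        sort -nru |                    # highest column first, duplicates dropped
        sed -e 's|.*|s/\\s*\\S+//&|'   # turn N into: s/\s*\S+//N
}

# Usage: gen_script delete.txt
```

For a delete.txt containing 2 4 6, this prints the three commands s/\s*\S+//6, s/\s*\S+//4 and s/\s*\S+//2, one per line. Each removes the N-th run of optional whitespace followed by non-whitespace from the line.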
