根据列比较两个文件并打印

Question 1

join file1 file2

默认情况下，它将为每个文件使用第 1 列，并忽略其中任何一个文件中缺少的行，这就是您想要的。另外，文件需要排序，情况已经如此。

Answer

join file1 file2

默认情况下，它将为每个文件使用第 1 列，并忽略其中任何一个文件中缺少的行，这就是您想要的。另外，文件需要排序，情况已经如此。

Question 2

如果数量独特的中的元素file2不太大，那么可行的解决方案可能是使用处理两个文件的经典方法awk，首先创建的第 1 列中的唯一元素的数组file2，然后测试的第 1 列file1是否为数组中的成员资格，即

awk 'FNR==NR {a[$1]++}; FNR!=NR && a[$1]' file2 file1

使用关联数组的等效方法bash 4+可能类似于

#!/bin/bash

declare -A a

while read col1 _ ; do
  ((a[$col1]++))
done < file2

while IFS= read -r line; do
  # compare only with 1st column of second file
  read -r col1 _ <<< "$line"
  [[ -n "${a[$col1]}" ]] && printf "$line\n"
done < file1

Answer

如果数量独特的中的元素file2不太大，那么可行的解决方案可能是使用处理两个文件的经典方法awk，首先创建的第 1 列中的唯一元素的数组file2，然后测试的第 1 列file1是否为数组中的成员资格，即

awk 'FNR==NR {a[$1]++}; FNR!=NR && a[$1]' file2 file1

使用关联数组的等效方法bash 4+可能类似于

#!/bin/bash

declare -A a

while read col1 _ ; do
  ((a[$col1]++))
done < file2

while IFS= read -r line; do
  # compare only with 1st column of second file
  read -r col1 _ <<< "$line"
  [[ -n "${a[$col1]}" ]] && printf "$line\n"
done < file1

Question 3

这是您正在寻找的东西吗？我习惯cut将列表拆分为数组，每个数组包含一列。这假设列由制表符分隔。您可以通过指定选项来更改分隔符剪切的使用-d。在下划线处分割：cut -d '_'.

    #!/bin/bash

    FILE1='somefile'
    FILE2='someotherfile'

    # File 1, column 1
    f1c1=($(cut -f1 -s $FILE1))
    # File 1, column 2
    #f1c2=($(cut -f2 -s $FILE1))

    # File 2, column 1
    f2c1=($(cut -f1 -s $FILE2))
    # File 2, column 2
    #f2c2=($(cut -f2 -s $FILE2))

    # Looping through all items in file 1 column 1
    for x in "${f1c1[@]}"
    do
        # For each item in f1c1, check all items in f2c1 for a match
        for y in "${f2c1[@]}"
        do
            if [[ $x == $y ]]
            then
                # The items matched!
                echo $x
                # Breaking out of the loop (no need to check for more than one
                # match, right?)
                break
            fi
        done
    done

Answer

这是您正在寻找的东西吗？我习惯cut将列表拆分为数组，每个数组包含一列。这假设列由制表符分隔。您可以通过指定选项来更改分隔符剪切的使用-d。在下划线处分割：cut -d '_'.

    #!/bin/bash

    FILE1='somefile'
    FILE2='someotherfile'

    # File 1, column 1
    f1c1=($(cut -f1 -s $FILE1))
    # File 1, column 2
    #f1c2=($(cut -f2 -s $FILE1))

    # File 2, column 1
    f2c1=($(cut -f1 -s $FILE2))
    # File 2, column 2
    #f2c2=($(cut -f2 -s $FILE2))

    # Looping through all items in file 1 column 1
    for x in "${f1c1[@]}"
    do
        # For each item in f1c1, check all items in f2c1 for a match
        for y in "${f2c1[@]}"
        do
            if [[ $x == $y ]]
            then
                # The items matched!
                echo $x
                # Breaking out of the loop (no need to check for more than one
                # match, right?)
                break
            fi
        done
    done

根据列比较两个文件并打印

答案1

答案2

答案3

相关内容