Look up strings from one file in another file and, if not present, delete them from the original file

Answer 1

If you have GNU grep, you can run:

grep -oFf file1 file2 | sort | uniq | grep -Ff - file1

Remove the last grep if you don't need to preserve the order of the lines in file1.
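To see what each stage of the pipeline contributes, it can be run step by step on a small sample. This is only an illustrative sketch: the file contents, the temporary directory and the variable names are made up for the example, and GNU grep is assumed, as in the answer.

```shell
# Illustrative sample data (not from the original question).
dir=$(mktemp -d)
printf 'foo\nbar\nbaz\n' > "$dir/file1"
printf 'xx foo yy\nbaz\nfoo again\n' > "$dir/file2"

# Stage 1: print only the matched parts of file2, one per line
# (the same string can show up several times).
matched=$(grep -oFf "$dir/file1" "$dir/file2")

# Stage 2: sort and de-duplicate the matched strings.
unique=$(printf '%s\n' "$matched" | sort | uniq)

# Stage 3: use them as fixed-string patterns against file1,
# which prints the surviving lines in file1's own order.
kept=$(printf '%s\n' "$unique" | grep -Ff - "$dir/file1")

printf '%s\n' "$kept"
```

With this sample, bar is dropped because it never occurs in file2, while foo and baz come out in the order they have in file1.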
If you don't have access to GNU grep, use awk instead:

awk 'NR==FNR{z[$0]++;next};{for (l in z){if (index($0, l)) y[l]++}}
END{for (i in y) print i}' file1 file2
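Both commands only print the surviving lines; neither modifies file1. One possible way to write the result back is sketched here. The function name prune_missing and the temporary-file approach are assumptions for illustration, not part of the original answer. Unlike the awk one-liner, this variant remembers line numbers, so it keeps file1's order and any duplicate lines.

```shell
# Hypothetical helper: keep only those lines of "$1" that occur as a
# substring of some line in "$2", then overwrite "$1" with the result.
prune_missing() {
    tmp=$(mktemp) || return 1
    # First file (NR==FNR): remember every line by its line number.
    # Second file: mark each remembered line that occurs in the current line.
    # END: print the marked lines in their original order.
    awk 'NR==FNR { line[FNR] = $0; n = FNR; next }
         { for (i = 1; i <= n; i++) if (index($0, line[i])) seen[i] = 1 }
         END { for (i = 1; i <= n; i++) if (seen[i]) print line[i] }' \
        "$1" "$2" > "$tmp" && cat "$tmp" > "$1"
    status=$?
    rm -f "$tmp"
    return "$status"
}
```

It would be called as prune_missing file1 file2. Writing through cat rather than mv is a deliberate choice in this sketch so that file1 keeps its permissions.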

Answer 2

If you have GNU grep, go with don_crissti's (accepted) answer. In case you don't (e.g. on a standard Mac OS X, where it doesn't work), you can save this snippet as a bash script, e.g. myconvert.sh:

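#!/bin/bash
# Usage: myconvert.sh file1 file2
# Deletes from file1 every line whose text does not occur in file2.
while IFS='' read -r line || [[ -n "$line" ]]; do
    # -F: fixed string, -q: quiet, -e: safe for lines starting with "-"
    if ! grep -Fq -e "$line" "$2"
    then
        # Escape regex metacharacters, then delete matching lines (BSD sed -i '')
        sed -i '' "/$(printf '%s\n' "$line" | sed -e 's/[]\/$*.^|[]/\\&/g')/d" "$1"
    fi
done < "$1"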

Call it with the two files as arguments:

./myconvert.sh file1 file2

However, do note don_crissti's expert comments below about the use of while/read and the obvious performance drawbacks of calling sed.
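One way to reduce the cost of the sed calls is to collect the lines to keep and rewrite file1 once at the end. The following is a minimal sketch under that assumption; the function name filter_lines and the temporary file are hypothetical and not part of the original answer.

```shell
# Hypothetical rewrite of the loop: write the lines to keep into a
# temporary file and replace file1 once, so sed is never invoked.
filter_lines() {
    tmp=$(mktemp) || return 1
    while IFS='' read -r line || [ -n "$line" ]; do
        # Keep the line only if it occurs as a fixed string somewhere in file2.
        if grep -Fq -e "$line" "$2"; then
            printf '%s\n' "$line"
        fi
    done < "$1" > "$tmp"
    cat "$tmp" > "$1"
    rm -f "$tmp"
}
```

It would be called as filter_lines file1 file2. This still runs one grep per line, so the grep and awk approaches are likely to remain faster on large files. It also differs slightly from the script: the sed command deletes every line of file1 that contains the unmatched text, whereas this sketch drops only the unmatched line itself.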
