Using sort -u to append words to a word list while avoiding duplicates

Answer 1

If you are happy for total.txt to be in sorted order (with mike and paul at the start), you can do either of the following:

sort -u one.txt two.txt > total.txt
or
sort -u total.txt two.txt -o total.txt
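As a rough sketch of how the two forms compare, the script below runs both on made-up sample data (the file contents and the scratch directory are assumptions for illustration only). The second form is written with -o ahead of the file names, which should behave the same as the command above; POSIX allows the -o output file to be one of the inputs, whereas a plain "> total.txt" redirect on the second form would truncate total.txt before sort had read it. LC_ALL=C is only there to make the ordering predictable.

```shell
# Work in a scratch directory so no real files are touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical sample data.
printf '%s\n' paul mike > one.txt
printf '%s\n' zed mike anna > two.txt

# Form 1: build total.txt from both inputs in one go.
LC_ALL=C sort -u one.txt two.txt > total.txt
cp total.txt form1.txt

# Form 2: start from an existing total.txt and merge two.txt into it in place.
LC_ALL=C sort -u one.txt > total.txt
LC_ALL=C sort -u -o total.txt total.txt two.txt

# Both forms should leave the same sorted, de-duplicated list.
cmp -s form1.txt total.txt && echo "both forms give the same list"
cat total.txt
```

With this sample data, total.txt ends up containing anna, mike, paul and zed, one per line.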

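If you need to preserve the order (the sorted contents of one.txt first, followed by the sorted contents of two.txt except for lines already in one.txt), with total.txt already holding the sorted contents of one.txt, then do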

sort -u two.txt | awk '!seen[$0]++' total.txt - > temp.txt; mv temp.txt total.txt

This is equivalent to

(cat total.txt; sort -u two.txt) | awk '!seen[$0]++' > temp.txt; mv temp.txt total.txt

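That is, take the contents of total.txt (already sorted and de-duplicated), follow them with the sorted, de-duplicated contents of two.txt, and pass the result through the awk command for de-duplicating unsorted input. The expression seen[$0]++ is 0 the first time a line appears, so !seen[$0]++ is true and the line is printed; every later occurrence is skipped.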
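To see the ordering in action, here is a small sketch on made-up data (the file contents are assumptions, and I've used && instead of ; so that temp.txt only replaces total.txt if awk succeeded):

```shell
# Work in a scratch directory so no real files are touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical data: total.txt is assumed to be sorted and de-duplicated already.
printf '%s\n' mike paul > total.txt
printf '%s\n' zed paul anna zed > two.txt

# awk reads total.txt first, then "-" (the sorted, de-duplicated two.txt
# arriving on the pipe), and prints each line only the first time it is seen.
LC_ALL=C sort -u two.txt | awk '!seen[$0]++' total.txt - > temp.txt && mv temp.txt total.txt

cat total.txt
```

The existing words mike and paul stay at the top, and only the new words from two.txt (anna and zed, in sorted order) are appended after them.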


Answer 2

You can use sed plus sponge to safely overwrite an input file. This allows you to use total as one of the input files. sponge is available in the moreutils package (Ubuntu).

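sponge reads standard input and writes it out to the specified file. Unlike a shell redirect, sponge soaks up all its input before opening the output file. This allows constructing pipelines that read from and write to the same file.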
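A quick sketch of the problem sponge solves, using a made-up file called words (the name and contents are assumptions). The sponge step only runs if sponge is actually installed:

```shell
# Work in a scratch directory so no real files are touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical sample file, with a backup so it can be restored.
printf '%s\n' paul mike paul > words
cp words backup

# Plain redirect: the shell truncates "words" before sort starts reading it,
# so sort sees an empty file and the data is lost.
sort -u words > words
emptied=no
if [ ! -s words ]; then
    emptied=yes
    echo "words is now empty"
fi

# With sponge, the output file is only written once all input has been read.
cp backup words
if command -v sponge >/dev/null 2>&1; then
    LC_ALL=C sort -u words | sponge words
    cat words
fi
```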

file[0]=total; [[ -f "$file" ]] || touch "$file"
file[1]=any
file[2]=number 
file[3]=of
file[4]=files
sed 's/[[:space:]]\+$//' "${file[@]}" | sort -u | sponge "$file"

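Note that the first item in a bash array, ${file[0]}, can be referenced and assigned without using the index, i.e. as $file (as I've done above; it's just easier to type). [[ -f total ]] || touch total creates total if it doesn't already exist. You can use any number of files: just increase the index number accordingly. You can rerun the command on the same set of files, and the contents of total will stay the same as after the first run (for that set of files).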

Instead of sponge, you can just output to a temporary file and then replace total with that temporary file (but I like sponge).
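A minimal sketch of that temporary-file variant, assuming made-up input files and mktemp for the temporary name. To stay within plain POSIX sh it lists the files directly instead of using a bash array, and the sed pattern uses * rather than \+, which strips trailing whitespace just the same:

```shell
# Work in a scratch directory so no real files are touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical inputs; "more" has trailing whitespace on both lines.
printf '%s\n' paul mike > total
printf 'anna  \nmike\t\n' > more

# Write the result to a temporary file in the same directory,
# then rename it over total only if the pipeline succeeded.
tmp=$(mktemp ./total.XXXXXX)
sed 's/[[:space:]]*$//' total more | LC_ALL=C sort -u > "$tmp" && mv "$tmp" total

cat total
```

One thing to be aware of: after the mv, total carries the permissions of the temporary file (mktemp typically creates it readable by the owner only), so a chmod may be needed if other users read the list.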

