合并多个子文件夹中同名的文件

Question 1

export dir='/path/to/folder'

find "$dir" -mindepth 2 -type f -name 'EAF*.txt' \
  -exec sh -c 'for f; do
                 bn=$(basename "$f" .txt);
                 cat "$f" >> "$dir/$bn.merged.txt";
               done' sh {} +

该-mindepth 2选项排除 /path/to/folder 目录本身中的文件进行处理（即，它只查找子目录中的文件），这样如果输出文件已经存在，它就不会将输出文件连接到自身上。

无论是否存在重复的文件名，这都会将文件附加到“merged.txt”输出文件中。

如果您只想合并重复的文件名：

typeset -Ax counts # declare $counts to be an exported associative array
export dir='/path/to/folder'

# find out how many there are of each filename
while read -d '' -r f; do
  let counts[$f]++;
done < <(find "$dir" -mindepth 2 -type f -name 'EAF*.txt' -print0)

# concatenate only the duplicates
find "$dir" -mindepth 2 -type f -name 'EAF*.txt' \
  -exec bash -c 'for f; do
                   if [ "${counts[$f]}" -gt 1 ]; then
                     bn=$(basename "$f" .txt);
                     cat "$f" >> "$dir/$bn.merged.txt";
                   fi
                 done' sh {} +

这需要bash或一些其他支持关联数组的 shell（即不是 POSIX sh）。

Answer

export dir='/path/to/folder'

find "$dir" -mindepth 2 -type f -name 'EAF*.txt' \
  -exec sh -c 'for f; do
                 bn=$(basename "$f" .txt);
                 cat "$f" >> "$dir/$bn.merged.txt";
               done' sh {} +

该-mindepth 2选项排除 /path/to/folder 目录本身中的文件进行处理（即，它只查找子目录中的文件），这样如果输出文件已经存在，它就不会将输出文件连接到自身上。

无论是否存在重复的文件名，这都会将文件附加到“merged.txt”输出文件中。

如果您只想合并重复的文件名：

typeset -Ax counts # declare $counts to be an exported associative array
export dir='/path/to/folder'

# find out how many there are of each filename
while read -d '' -r f; do
  let counts[$f]++;
done < <(find "$dir" -mindepth 2 -type f -name 'EAF*.txt' -print0)

# concatenate only the duplicates
find "$dir" -mindepth 2 -type f -name 'EAF*.txt' \
  -exec bash -c 'for f; do
                   if [ "${counts[$f]}" -gt 1 ]; then
                     bn=$(basename "$f" .txt);
                     cat "$f" >> "$dir/$bn.merged.txt";
                   fi
                 done' sh {} +

这需要bash或一些其他支持关联数组的 shell（即不是 POSIX sh）。

Question 2

find您可以循环遍历 txt 文件并使用和计算重复名称wc。如果重复名称的计数大于 1，则将其附加到 merge.txt 文件。

#!/bin/bash

output_dir="output"
rm -rf "$output_dir"
mkdir "$output_dir"

for file in */*.txt; do
  file_name=$(basename "$file" .txt)
  duplicate_names_count=$(find . -type f -name "$file_name.txt" |  wc -l)
  if [ "$duplicate_names_count" -gt 1 ]; then
    cat "$file" >> "$output_dir/${file_name}.merge.txt"
  fi
done

Answer

find您可以循环遍历 txt 文件并使用和计算重复名称wc。如果重复名称的计数大于 1，则将其附加到 merge.txt 文件。

#!/bin/bash

output_dir="output"
rm -rf "$output_dir"
mkdir "$output_dir"

for file in */*.txt; do
  file_name=$(basename "$file" .txt)
  duplicate_names_count=$(find . -type f -name "$file_name.txt" |  wc -l)
  if [ "$duplicate_names_count" -gt 1 ]; then
    cat "$file" >> "$output_dir/${file_name}.merge.txt"
  fi
done

合并多个子文件夹中同名的文件

答案1

答案2

相关内容