如何使用 for 循环将脚本的输出保存到单个文件中

Question 1

这应该做你想做的。我已在脚本本身中包含了使用信息。注意：我无法可靠地测试此脚本的输出，因为我无权访问源文件，但它应该工作。

这将简单地循环执行脚本的目录中的“.fa”文件，并在每个文件上运行您提供的脚本，并在名为“output”的子目录中为每个文件创建一个新文件。

#!/bin/bash
# Usage:
# Run this script from within the same directory as the .fa files.
# A subdirectory named 'output' will be created, in which every
# input file will have a corresponding output file, prefixed with 'seq.'
mkdir -p ./output
shopt -s nullglob
for f in *.fa
do
    nf="./output/seq.$f"
    echo "Copying sequence from '$f' to '$nf'"
    
    < $f tail -n+2 | tr -d '\n' > $nf

    n="$(stat -c "%s" $nf)"

    r="$(shuf -i1-"$((n-200+1))" -n1)"

    < $nf tail -c+"$r" | head -c200
done

Answer

这应该做你想做的。我已在脚本本身中包含了使用信息。注意：我无法可靠地测试此脚本的输出，因为我无权访问源文件，但它应该工作。

这将简单地循环执行脚本的目录中的“.fa”文件，并在每个文件上运行您提供的脚本，并在名为“output”的子目录中为每个文件创建一个新文件。

#!/bin/bash
# Usage:
# Run this script from within the same directory as the .fa files.
# A subdirectory named 'output' will be created, in which every
# input file will have a corresponding output file, prefixed with 'seq.'
mkdir -p ./output
shopt -s nullglob
for f in *.fa
do
    nf="./output/seq.$f"
    echo "Copying sequence from '$f' to '$nf'"
    
    < $f tail -n+2 | tr -d '\n' > $nf

    n="$(stat -c "%s" $nf)"

    r="$(shuf -i1-"$((n-200+1))" -n1)"

    < $nf tail -c+"$r" | head -c200
done

Question 2

将现有的单文件代码放入一个函数中：

random_sample() {
    local fasta_file=$1
    local n r tmp sample
    tmp=$(mktemp)
    < "$fasta_file" tail -n+2 | tr -d '\n' > "$tmp"
    n=$(stat -c "%s" "$tmp")
    r=$(shuf -i1-"$((n-200+1))" -n1)
    sample=$(tail -c+"$r" < "$tmp" | head -c200)
    rm "$tmp"
    printf "%s\n" "$sample"
}

然后你可以做

for file in *.fa; do
    random_sample "$file" > "${file%.fa}_200_substring.fa"
done

如果 fasta 文件不是很大，我不会使用 tmp 文件：

random_sample() {
    local fasta_file=$1
    local data n r
    data=$(tail -n+2 < "$fasta_file" | tr -d '\n')
    n=${#data}
    r=$(shuf -i1-"$((n-200+1))" -n1)
    tail -c+"$r" <<< "$data" | head -c200
}

如果文件 < 32767 字节

random_sample() {
    local fasta_file=$1
    local data
    data=$(tail -n+2 < "$fasta_file" | tr -d '\n')
    echo "${data:($RANDOM % ${#data}):200}"
}

Answer

将现有的单文件代码放入一个函数中：

random_sample() {
    local fasta_file=$1
    local n r tmp sample
    tmp=$(mktemp)
    < "$fasta_file" tail -n+2 | tr -d '\n' > "$tmp"
    n=$(stat -c "%s" "$tmp")
    r=$(shuf -i1-"$((n-200+1))" -n1)
    sample=$(tail -c+"$r" < "$tmp" | head -c200)
    rm "$tmp"
    printf "%s\n" "$sample"
}

然后你可以做

for file in *.fa; do
    random_sample "$file" > "${file%.fa}_200_substring.fa"
done

如果 fasta 文件不是很大，我不会使用 tmp 文件：

random_sample() {
    local fasta_file=$1
    local data n r
    data=$(tail -n+2 < "$fasta_file" | tr -d '\n')
    n=${#data}
    r=$(shuf -i1-"$((n-200+1))" -n1)
    tail -c+"$r" <<< "$data" | head -c200
}

如果文件 < 32767 字节

random_sample() {
    local fasta_file=$1
    local data
    data=$(tail -n+2 < "$fasta_file" | tr -d '\n')
    echo "${data:($RANDOM % ${#data}):200}"
}

如何使用 for 循环将脚本的输出保存到单个文件中

答案1

答案2

相关内容