通过bash脚本获取S3 Bucket子文件夹的大小

Question 1

我不会发出如此多的请求，而是递归地列出存储桶中的所有对象，然后从输出本地添加所有大小。

开始：aws s3 ls --recursive s3://path1/ > all-files.log

然后all-files.log在本地进行处理。容易多了:)

Answer

我不会发出如此多的请求，而是递归地列出存储桶中的所有对象，然后从输出本地添加所有大小。

开始：aws s3 ls --recursive s3://path1/ > all-files.log

然后all-files.log在本地进行处理。容易多了:)

Question 2

在第一步的原始脚本中，您使用$FILES存储 S3 文件名的临时文件名。但在最后一步中，您希望文件列表位于数组中$FILES。

我们可以修复这个错误，但我建议重写脚本，以便它只处理ls结果而不使用临时文件。这让事情变得简单很多。

这是工作脚本，您甚至可以将其添加为函数~/.bashrc：

function s3du {
    readonly folder_to_scan=${1:?"The argument 's3://bucket/folder_to_scan/' must be specified."}

     for subfolder in $(aws s3 ls "${folder_to_scan}" | grep PRE | awk '{print $2}'); do 
        echo "${folder_to_scan}${subfolder}:" 
        aws s3 ls "${folder_to_scan}${subfolder}" --recursive \
            --human-readable \
            --summarize \ 
            | tail -n2 
    done
}

像这样使用它s3du s3://my-bucket/my-folder/

Answer

在第一步的原始脚本中，您使用$FILES存储 S3 文件名的临时文件名。但在最后一步中，您希望文件列表位于数组中$FILES。

我们可以修复这个错误，但我建议重写脚本，以便它只处理ls结果而不使用临时文件。这让事情变得简单很多。

这是工作脚本，您甚至可以将其添加为函数~/.bashrc：

function s3du {
    readonly folder_to_scan=${1:?"The argument 's3://bucket/folder_to_scan/' must be specified."}

     for subfolder in $(aws s3 ls "${folder_to_scan}" | grep PRE | awk '{print $2}'); do 
        echo "${folder_to_scan}${subfolder}:" 
        aws s3 ls "${folder_to_scan}${subfolder}" --recursive \
            --human-readable \
            --summarize \ 
            | tail -n2 
    done
}

像这样使用它s3du s3://my-bucket/my-folder/

通过bash脚本获取S3 Bucket子文件夹的大小

答案1

答案2

相关内容