我想找到所有包含具有相同名称(和扩展名.md
)的 markdown 文件的子文件夹。
例如:我想查找以下子文件夹:
Apple/Banana/Orange #Apple/Banana/Orange/Orange.md exists
Apple/Banana #Apple/Banana/Banana.md exists
Apple/Banana/Papaya #Apple/Banana/Papaya/Papaya.md exists
- 注意:目录下可以有其他文件或子目录。
有什么建议么?
可以使用以下代码测试问题的解决方案:
#!/usr/bin/env bash
# - goal: "Test"
# - author: Nikhil Agarwal
# - date: Wednesday, August 07, 2019
# - status: P T' (P: Prototyping, T: Tested)
# - usage: ./Test.sh
# - include:
# 1.
# - refer:
# 1. [directory - Find only those folders that contain a File with the same name as the Folder - Unix & Linux Stack Exchange](https://unix.stackexchange.com/questions/534190/find-only-those-folders-that-contain-a-file-with-the-same-name-as-the-folder)
# - formatting:
# shellcheck disable=
#clear
main() {
TestData
ExpectedOutput
TestFunction "${1:?"Please enter a test number, as the first argument, to be executed!"}"
}
TestFunction() {
echo "Test Function"
echo "============="
"Test${1}"
echo ""
}
Test1() {
echo "Description: Thor"
find . -type f -regextype egrep -regex '.*/([^/]+)/\1\.md$' | sort
echo "Observation: ${Green:=}Pass, but shows filepath instead of directory path${Normal:=}"
}
Test2() {
echo "Description: Kusalananda1"
find . -type d -exec sh -c '
dirpath=$1
set -- "$dirpath"/*.md
[ -f "$dirpath/${dirpath##*/}.md" ] && [ "$#" -eq 1 ]' sh {} \; -print | sort
echo "Observation: ${Red:=}Fails as it ignores B.md${Normal:=}"
}
Test3() {
echo "Description: Kusalananda2"
find . -type d -exec sh -c '
for dirpath do
set -- "$dirpath"/*.md
if [ -f "$dirpath/${dirpath##*/}.md" ] && [ "$#" -eq 1 ]
then
printf "%s\n" "$dirpath"
fi
done' sh {} + | sort
echo "Observation: ${Red:=}Fails as it ignores B.md${Normal:=}"
}
Test4() {
echo "Description: steeldriver1"
find . -type d -exec sh -c '[ -f "$1/${1##*/}.md" ]' find-sh {} \; -print | sort
echo "Observation: ${Green:=}Pass${Normal:=}"
}
Test5() {
echo "Description: steeldriver2"
find . -type d -exec sh -c '
for d do
[ -f "$d/${d##*/}.md" ] && printf "%s\n" "$d"
done' find-sh {} + | sort
echo "Observation: ${Green:=}Pass${Normal:=}"
}
Test6() {
echo "Description: Stéphane Chazelas"
find . -name '*.md' -print0 \
| gawk -v RS='\0' -F/ -v OFS=/ '
{filename = $NF; NF--
if ($(NF)".md" == filename) include[$0]
else exclude[$0]
}
END {for (i in include) if (!(i in exclude)) print i}'
echo "Observation: ${Red:=}Fails as it ignores B.md${Normal:=}"
}
Test7() {
echo "Description: Zach"
#shellcheck disable=2044
for fd in $(find . -type d); do
dir=${fd##*/}
if [ -f "${fd}/${dir}.md" ]; then
ls "${fd}/${dir}.md"
fi
done
echo "Observation: ${Green:=}Pass but shows filepath instead of directory${Normal:=}"
}
ExpectedOutput() {
echo "Expected Output"
echo "==============="
cat << EOT
./GeneratedTest/A
./GeneratedTest/A/AA
./GeneratedTest/B
./GeneratedTest/C/CC1
./GeneratedTest/C/CC2
EOT
}
TestData() {
rm -rf GeneratedTest
mkdir -p GeneratedTest/A/AA
touch GeneratedTest/index.md
touch GeneratedTest/A/A.md
touch GeneratedTest/A/AA/AA.md
mkdir -p GeneratedTest/B
touch GeneratedTest/B/B.md
touch GeneratedTest/B/index.md
mkdir -p GeneratedTest/C/CC1
touch GeneratedTest/C/index.md
touch GeneratedTest/C/CC1/CC1.md
mkdir -p GeneratedTest/C/CC2
touch GeneratedTest/C/CC2/CC2.md
mkdir -p GeneratedTest/C/CC3
touch GeneratedTest/C/CC3/CC.md
mkdir -p GeneratedTest/C/CC4
}
main "$@"
答案1
假设你的文件被合理命名,即不需要-print0
等等。你可以使用 GNU find 来做到这一点,如下所示:
find . -type f -regextype egrep -regex '.*/([^/]+)/\1\.md$'
输出:
./Apple/Banana/Orange/Orange.md
./Apple/Banana/Papaya/Papaya.md
./Apple/Banana/Banana.md
如果您只需要目录名称,请添加一个-printf
参数:
find . -type f -regextype egrep -regex '.*/([^/]+)/\1\.md$' -printf '%h\n'
在更新的测试数据上运行时的输出:
GeneratedTest/A/AA
GeneratedTest/A
GeneratedTest/C/CC2
GeneratedTest/C/CC1
GeneratedTest/B
答案2
在 GNU 系统上,您可以执行以下操作:
find . -name '*.md' -print0 |
gawk -v RS='\0' -F/ -v OFS=/ '
{filename = $NF; NF--
if ($(NF)".md" == filename) include[$0]
else exclude[$0]
}
END {for (i in include) if (!(i in exclude)) print i}'
答案3
find . -type d -exec sh -c '
dirpath=$1
set -- "$dirpath"/*.md
[ -f "$dirpath/${dirpath##*/}.md" ] && [ "$#" -eq 1 ]' sh {} \; -print
上面的代码将查找当前目录(包括当前目录)下的所有目录,并为每个目录执行一个简短的 shell 脚本。
shell代码将测试目录中是否存在与目录同名的markdown文件,以及这是否是*.md
该目录中的唯一名称。如果这样的文件存在并且它是唯一的*.md
名称,则内联 shell 脚本将以零退出状态退出。否则,它将以非零退出状态退出(信令失败)。
该set -- "$dirpath"/*.md
位将位置参数设置为与模式匹配的路径名列表(与.md
目录中带有后缀的任何名称匹配)。然后我们可以$#
稍后查看我们从中获得了多少匹配项。
如果 shell 脚本成功退出,-print
将打印找到的目录的路径。
速度稍快的版本,使用更少的内联脚本调用,但这并不能让您对找到的路径名本身执行更多操作find
(但内联脚本可能会进一步扩展):
find . -type d -exec sh -c '
for dirpath do
set -- "$dirpath"/*.md
[ -f "$dirpath/${dirpath##*/}.md" ] &&
[ "$#" -eq 1 ] &&
printf "%s\n" "$dirpath"
done' sh {} +
相同的命令,但不关心.md
目录中是否还有其他文件:
find . -type d -exec sh -c '
dirpath=$1
[ -f "$dirpath/${dirpath##*/}.md" ]' sh {} \; -print
find . -type d -exec sh -c '
for dirpath do
[ -f "$dirpath/${dirpath##*/}.md" ] &&
printf "%s\n" "$dirpath"
done' sh {} +
也可以看看:
答案4
这是我的。我添加了更多目录和文件来验证。我也无聊,所以加上了最后修改时间和MD5。也许您正在寻找重复项。
GREEN='\033[0;32m'
RED='\033[0;31m'
NC='\033[0m'
mkdir -pv {Pear,Grape,Raisin,Plaintain}/{DragonFruit,Nababa,Strawberry,Grape,Raisin}
touch {Pear,Grape,Raisin,Plaintain}/{DragonFruit,Nababa,Strawberry,Grape,Raisin}/{Strawberry,Grape,Raisin}.md
for dir in $(find ./ -type d)
do
dirname="${dir##*/}"
fname="${dirname}.md"
if [ -f "${dir}/${fname}" ]
then
STAT=$(stat --printf="%y %s" "${dir}/${fname}")
STAT="${STAT:0:19}"
MD5=$(md5sum "${dir}/${fname}")
MD5="${MD5:0:32}"
printf "${GREEN}%-60s${NC}%-40s%-40s\n" "'${dir}/${fname}' exists" "$STAT" "$MD5"
else
echo -e "${RED}'${dir}/${fname}' doesn't exist${NC}"
fi
done
'.//.md' doesn't exist
'./Raisin/Raisin.md' doesn't exist
'./Raisin/Raisin/Raisin.md' exists 2019-08-07 19:54:09 a3085274bf23c52c58dd063faba0c36a
'./Raisin/Nababa/Nababa.md' doesn't exist
'./Raisin/Strawberry/Strawberry.md' exists 2019-08-07 19:54:09 3d2eca1d4a3c539527cb956affa8b807
'./Raisin/Grape/Grape.md' exists 2019-08-07 19:54:09 f577b20f93a51286423c1d8973973f01
'./Raisin/DragonFruit/DragonFruit.md' doesn't exist
'./Pear/Pear.md' doesn't exist
'./Pear/Raisin/Raisin.md' exists 2019-08-07 19:54:09 61387f5d87f125923c2962b389b0dd67
'./Pear/Nababa/Nababa.md' doesn't exist
'./Pear/Strawberry/Strawberry.md' exists 2019-08-07 19:54:09 02c9e39ba5b77954082a61236f786d34
'./Pear/Grape/Grape.md' exists 2019-08-07 19:54:09 43e85d5651cac069bba8ba36e754079d
'./Pear/DragonFruit/DragonFruit.md' doesn't exist
'./Apple/Apple.md' doesn't exist
'./Apple/Banana/Banana.md' exists 2019-08-07 19:54:09 a605268f3314411ec360d7e0dd234960
'./Apple/Banana/Papaya/Papaya.md' exists 2019-08-07 19:54:09 e759a879942fe986397e52b7ba21a9ff
'./Apple/Banana/Orange/Orange.md' exists 2019-08-07 19:54:09 127618fe9ab73937836b809fa0593572
'./Plaintain/Plaintain.md' doesn't exist
'./Plaintain/Raisin/Raisin.md' exists 2019-08-07 19:54:09 13ed6460f658ca9f7d222ad3d07212a2
'./Plaintain/Nababa/Nababa.md' doesn't exist
'./Plaintain/Strawberry/Strawberry.md' exists 2019-08-07 19:54:09 721d7a5a32f3eacf4b199b74d78b91f0
'./Plaintain/Grape/Grape.md' exists 2019-08-07 19:54:09 0bdaff592bbd9e2ed5fac5a992bb3566
'./Plaintain/DragonFruit/DragonFruit.md' doesn't exist
'./Grape/Grape.md' doesn't exist
'./Grape/Raisin/Raisin.md' exists 2019-08-07 19:54:09 aa5d4c970e7b4b6dc35cd16d1863b5bb
'./Grape/Nababa/Nababa.md' doesn't exist
'./Grape/Strawberry/Strawberry.md' exists 2019-08-07 19:54:09 8b02f8273bbff1bb3162cb088813e0c9
'./Grape/Grape/Grape.md' exists 2019-08-07 19:54:09 5593d7d6fdcbb48ab5901ba30469bbe8