如果 sed 中包含正则表达式，如何跳过文件？

Question 1

如果您相信的git关于什么是二进制文件的观点，您可以使用git grep来获取非二进制文件的列表。假设t.cpp是一个文本文件，并且ls是一个二进制文件，两者都已签入：

$ ls
t.cpp ls
$ git grep -I --name-only -e ''
t.cpp

该-I选项的含义是：

-I
不匹配二进制文件中的模式。

将其与您的sed表达式结合起来：

$ git grep -I --name-only -z -e '' | \
       xargs -0 sed -i.bk -e 's/[ \t]\+\(\r\?\)$/\1/;$a\'

（-z/xargs -0帮助处理奇怪的文件名。）

查看git grep手册页以获取其他有用的选项 ---no-index或者--cached可能会有所帮助，具体取决于您想要操作的文件集。

Answer

如果您相信的git关于什么是二进制文件的观点，您可以使用git grep来获取非二进制文件的列表。假设t.cpp是一个文本文件，并且ls是一个二进制文件，两者都已签入：

$ ls
t.cpp ls
$ git grep -I --name-only -e ''
t.cpp

该-I选项的含义是：

-I
不匹配二进制文件中的模式。

将其与您的sed表达式结合起来：

$ git grep -I --name-only -z -e '' | \
       xargs -0 sed -i.bk -e 's/[ \t]\+\(\r\?\)$/\1/;$a\'

（-z/xargs -0帮助处理奇怪的文件名。）

查看git grep手册页以获取其他有用的选项 ---no-index或者--cached可能会有所帮助，具体取决于您想要操作的文件集。

Question 2

如果任何行与 sed 中的正则表达式匹配，有没有办法跳过整个文件？

就在这里。

# test case for skipping file if a sed regex match succeeds

echo 'Hello, world!' > hello_world.txt
cat hello_world.txt
ls -li hello_world.txt

sed -i -e '/.*Hello.*/{q;}; s/world/WORLD/g' hello_world.txt # skips file
sed -i -e '/.*HeLLo.*/{q;}; s/world/WORLD/g' hello_world.txt

Answer

如果任何行与 sed 中的正则表达式匹配，有没有办法跳过整个文件？

就在这里。

# test case for skipping file if a sed regex match succeeds

echo 'Hello, world!' > hello_world.txt
cat hello_world.txt
ls -li hello_world.txt

sed -i -e '/.*Hello.*/{q;}; s/world/WORLD/g' hello_world.txt # skips file
sed -i -e '/.*HeLLo.*/{q;}; s/world/WORLD/g' hello_world.txt

Question 3

下面是一个 Perl 脚本，它迭代其参数（必须是文件名）并向每个不以换行符结尾的文件附加换行符。包含空字节的文件将被跳过。已经以换行符结尾的文件不会被修改。包含 CR 的文件会附加 CRLF，其他文件则仅附加 LF。未经测试。

#!/usr/bin/env perl
foreach my $f (@ARGV) {
    open F, "<", $f or die;
    my $last = undef;
    my $cr = 0;
    while (<>) {if (/\0/) {undef $last; break} $last = $_; ++$cr if /\r$/}
    close F;
    if (defined $last && $last !~ /\n\Z/) {
        open F, ">>", $f or die;
        print($cr ? "\r\n" : "\n");
        close F or die;
    }
}

Answer

下面是一个 Perl 脚本，它迭代其参数（必须是文件名）并向每个不以换行符结尾的文件附加换行符。包含空字节的文件将被跳过。已经以换行符结尾的文件不会被修改。包含 CR 的文件会附加 CRLF，其他文件则仅附加 LF。未经测试。

#!/usr/bin/env perl
foreach my $f (@ARGV) {
    open F, "<", $f or die;
    my $last = undef;
    my $cr = 0;
    while (<>) {if (/\0/) {undef $last; break} $last = $_; ++$cr if /\r$/}
    close F;
    if (defined $last && $last !~ /\n\Z/) {
        open F, ">>", $f or die;
        print($cr ? "\r\n" : "\n");
        close F or die;
    }
}

如果 sed 中包含正则表达式，如何跳过文件？

答案1

答案2

答案3

相关内容