Pattern replacement with case matching using sed

Answer 1

A portable solution using sed:

sed '
:1
/[aA][bB][cC][dD][eE][fF]/!b
s//\
&\
pqrstu\
PQRSTU\
/;:2
s/\n[[:lower:]]\(.*\n\)\(.\)\(.*\n\).\(.*\n\)/\2\
\1\3\4/;s/\n[^[:lower:]]\(.*\n\).\(.*\n\)\(.\)\(.*\n\)/\3\
\1\2\4/;t2
s/\n.*\n//;b1'
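As a quick way to try the portable script, it can be wrapped in a shell function and fed a couple of test lines. The function name match_case_sed is made up for this sketch; the sed program itself is unchanged. The inserted pqrstu and PQRSTU lines act as lowercase and uppercase templates, and the :2 loop takes one character from the appropriate template for each character of the matched text.

```shell
#!/bin/sh
# match_case_sed is a hypothetical wrapper name; the sed program is the
# portable one shown above, reading from standard input.
match_case_sed() {
  sed '
:1
/[aA][bB][cC][dD][eE][fF]/!b
s//\
&\
pqrstu\
PQRSTU\
/;:2
s/\n[[:lower:]]\(.*\n\)\(.\)\(.*\n\).\(.*\n\)/\2\
\1\3\4/;s/\n[^[:lower:]]\(.*\n\).\(.*\n\)\(.\)\(.*\n\)/\3\
\1\2\4/;t2
s/\n.*\n//;b1'
}

printf '%s\n' 'other abcdef other' 'other AbCdEf other ABCdef' | match_case_sed
```

This should print "other pqrstu other" and "other PqRsTu other PQRstu"; the b1 at the end sends sed back to look for further matches on the same line.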

It's a bit easier with GNU sed:

search=abcdef replace=pqrstuvwx
sed -r ":1;/$search/I!b;s//\n&&&\n$replace\n/;:2
    s/\n[[:lower:]](.*\n)(.)(.*\n)/\l\2\n\1\3/
    s/\n[^[:lower:]](.*\n)(.)(.*\n)/\u\2\n\1\3/;t2
    s/\n.*\n(.*)\n/\1/g;b1"

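By using &&& above, we reuse the case pattern of the matched string for the rest of the replacement, so ABcdef is changed to PQrstuVWx and AbCdEf to PqRsTuVwX. Change it to & to affect only the case of the first 6 characters.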
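To compare the two variants side by side, here is a small sketch that assumes GNU sed is installed as sed. match_case_gnu is a hypothetical wrapper whose first argument is the text placed between the newline markers (&&& or &); search and replace are the values from the snippet above.

```shell
#!/bin/sh
# Assumption: sed is GNU sed (-r, the I flag, \l, \u and \n in the
# replacement are GNU extensions). match_case_gnu is a made-up name.
match_case_gnu() {
  search=abcdef replace=pqrstuvwx
  sed -r ":1;/$search/I!b;s//\n$1\n$replace\n/;:2
    s/\n[[:lower:]](.*\n)(.)(.*\n)/\l\2\n\1\3/
    s/\n[^[:lower:]](.*\n)(.)(.*\n)/\u\2\n\1\3/;t2
    s/\n.*\n(.*)\n/\1/g;b1"
}

echo 'ABcdef AbCdEf' | match_case_gnu '&&&'   # case pattern repeated three times
echo 'ABcdef AbCdEf' | match_case_gnu '&'     # case pattern used once
```

With &&& the output should be "PQrstuVWx PqRsTuVwX"; with & the trailing vwx is copied from replace unchanged, giving "PQrstuvwx PqRsTuvwx".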

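(Note that it may not do what you want, or may run into an infinite loop, if the replacement may itself be subject to substitution, for instance when substituting foo for foo, or bcd for abcd.)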

Answer 2

A portable solution using awk:

awk -v find=abcdef -v rep=pqrstu '{
  lwr=tolower($0)
  offset=index(lwr, tolower(find))   # 1-based position of the match, 0 if none

  # Lines without a match are not printed.
  if( offset > 0 ) {
    # Text before the match.
    printf "%s", substr($0, 1, offset-1)
    len=length(find)

    for( i=0; i<len; i++ ) {
      out=substr(rep, i+1, 1)

      # A character equal to its lowercase form counts as lowercase.
      if( substr($0, offset+i, 1) == substr(lwr, offset+i, 1) )
        printf "%s", tolower(out)
      else
        printf "%s", toupper(out)
    }

    printf "%s\n", substr($0, offset+len)
  }
}'

Sample input:

other abcdef other
other Abcdef other
other AbCdEf other

Sample output:

other pqrstu other
other Pqrstu other
other PqRsTu other

Update

As pointed out in the comments, the above replaces only the first instance of find on each line. To replace all instances:

awk -v find=abcdef -v rep=pqrstu '{
  input=$0
  lwr=tolower(input)
  offset=index(lwr, tolower(find))

  # Lines without a match are not printed.
  if( offset > 0 ) {
    while( offset > 0 ) {

      # Text before this match.
      printf "%s", substr(input, 1, offset-1)
      len=length(find)

      for( i=0; i<len; i++ ) {
        out=substr(rep, i+1, 1)

        if( substr(input, offset+i, 1) == substr(lwr, offset+i, 1) )
          printf "%s", tolower(out)
        else
          printf "%s", toupper(out)
      }

      # Continue searching in the text after this match.
      input=substr(input, offset+len)
      lwr=substr(lwr, offset+len)
      offset=index(lwr, tolower(find))
    }

    # Whatever follows the last match.
    print input
  }
}'

Sample input:

other abcdef other ABCdef other
other Abcdef other abcDEF
other AbCdEf other aBCdEf other

Sample output:

other pqrstu other PQRstu other
other Pqrstu other pqrSTU
other PqRsTu other pQRsTu other
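Both awk scripts print only the lines that contain find. If lines without a match should pass through unchanged, one possible variant (a sketch, not part of the original answer) collects the result in a variable and prints it for every line. The names recase_all and recase are made up here, and the length guard against an empty find is an added assumption.

```shell
#!/bin/sh
# recase_all FIND REP: hypothetical wrapper; reads standard input.
recase_all() {
  awk -v find="$1" -v rep="$2" '
  # Return word recased letter by letter to follow the matched text m.
  function recase(m, word,    i, c, r, out) {
    out = ""
    for (i = 1; i <= length(m); i++) {
      c = substr(m, i, 1)
      r = substr(word, i, 1)
      out = out (c == tolower(c) ? tolower(r) : toupper(r))
    }
    return out
  }
  {
    input = $0; result = ""; len = length(find)
    # Consume the line match by match; stop when find no longer occurs.
    while (len > 0 && (offset = index(tolower(input), tolower(find))) > 0) {
      result = result substr(input, 1, offset - 1) recase(substr(input, offset, len), rep)
      input = substr(input, offset + len)
    }
    print result input   # printed whether or not anything matched
  }'
}

printf '%s\n' 'no match here' 'other AbCdEf other aBCdEf other' | recase_all abcdef pqrstu
```

This should print "no match here" followed by "other PqRsTu other pQRsTu other".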

Answer 3

You could use perl. Straight from the FAQ, quoting from perldoc perlfaq6:

How do I substitute case-insensitively on the LHS while preserving case on the RHS?

Here's a lovely Perlish solution by Larry Rosler. It exploits properties of bitwise xor on ASCII strings.

   $_= "this is a TEsT case";

   $old = 'test';
   $new = 'success';

   s{(\Q$old\E)}
   { uc $new | (uc $1 ^ $1) .
           (uc(substr $1, -1) ^ substr $1, -1) x
           (length($new) - length $1)
   }egi;

   print;

And here it is as a subroutine, modeled after the above:

       sub preserve_case($$) {
               my ($old, $new) = @_;
               my $mask = uc $old ^ $old;

               uc $new | $mask .
                       substr($mask, -1) x (length($new) - length($old))
   }

       $string = "this is a TEsT case";
       $string =~ s/(test)/preserve_case($1, "success")/egi;
       print "$string\n";

This prints:

           this is a SUcCESS case
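The bit trick can be looked at in isolation. In ASCII, a lowercase letter differs from its uppercase form only in bit 0x20, so XORing a word with its uppercased copy leaves 0x20 where the word was lowercase and 0x00 elsewhere, and ORing that mask into an uppercased replacement lowers exactly those positions. A minimal sketch, assuming perl is installed; show_mask is a hypothetical helper name.

```shell
#!/bin/sh
# show_mask OLD NEW: print the XOR mask of OLD as hex bytes, then NEW with
# the mask applied. Hypothetical helper for illustration only.
show_mask() {
  perl -e '
    my ($old, $new) = @ARGV;
    my $mask = uc($old) ^ $old;    # bytewise XOR of two strings
    # One hex byte per character: 20 where $old was lowercase, 00 elsewhere.
    print join(" ", map { sprintf("%02x", ord($_)) } split(//, $mask)), "\n";
    # OR the mask into the uppercased replacement; positions beyond the end
    # of the mask are treated as zero bits and so stay uppercase.
    print uc($new) | $mask, "\n";
  ' "$1" "$2"
}

show_mask TEsT success
```

For TEsT and success this should print "00 00 20 00" and then "SUcCESS".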

As an alternative, to keep the case of the replacement word if it is longer than the original, you can use this code, by Jeff Pinyan:

   sub preserve_case {
           my ($from, $to) = @_;
           my ($lf, $lt) = map length, @_;

           if ($lt < $lf) { $from = substr $from, 0, $lt }
           else { $from .= substr $to, $lf }

           return uc $to | ($from ^ uc $from);
           }

This changes the sentence to "this is a SUcCess case".
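To try this version, the subroutine can be dropped into the same substitution used earlier. The sketch below assumes perl is installed and runs it as a one-off script from the shell; pinyan_demo is a made-up wrapper name, and the subroutine body is the one quoted above.

```shell
#!/bin/sh
# pinyan_demo: hypothetical wrapper that runs the subroutine above on the
# sample sentence from the FAQ.
pinyan_demo() {
  perl -e '
    sub preserve_case {
        my ($from, $to) = @_;
        my ($lf, $lt) = map length, @_;

        # Make $from as long as $to: truncate it, or borrow the tail of $to
        # so that the extra characters keep their own case.
        if ($lt < $lf) { $from = substr $from, 0, $lt }
        else { $from .= substr $to, $lf }

        return uc $to | ($from ^ uc $from);
    }

    my $string = "this is a TEsT case";
    $string =~ s/(test)/preserve_case($1, "success")/egi;
    print "$string\n";
  '
}

pinyan_demo
```

The first four letters follow the case of TEsT, and the remaining "ess" keeps the case it has in the replacement word.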

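Just to show that C programmers can write C in any programming language, if you prefer a more C-like solution, the following script makes the substitution have the same case, letter by letter, as the original. (It also happens to run about 240% slower than the Perlish solution runs.) If the substitution has more characters than the string being substituted, the case of the last character is used for the rest of the substitution.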

   # Original by Nathan Torkington, massaged by Jeffrey Friedl
   #
   sub preserve_case($$)
   {
           my ($old, $new) = @_;
           my ($state) = 0; # 0 = no change; 1 = lc; 2 = uc
           my ($i, $oldlen, $newlen, $c) = (0, length($old), length($new));
           my ($len) = $oldlen < $newlen ? $oldlen : $newlen;

           for ($i = 0; $i < $len; $i++) {
                   if ($c = substr($old, $i, 1), $c =~ /[\W\d_]/) {
                           $state = 0;
                   } elsif (lc $c eq $c) {
                           substr($new, $i, 1) = lc(substr($new, $i, 1));
                           $state = 1;
                   } else {
                           substr($new, $i, 1) = uc(substr($new, $i, 1));
                           $state = 2;
                   }
           }
           # finish up with any remaining new (for when new is longer than old)
           if ($newlen > $oldlen) {
                   if ($state == 1) {
                           substr($new, $oldlen) = lc(substr($new, $oldlen));
                   } elsif ($state == 2) {
                           substr($new, $oldlen) = uc(substr($new, $oldlen));
                   }
           }
           return $new;
   }

Answer 4

Something like this would do what you describe.

sed -i.bak -e "s/abcdef/pqrstuvxyz/g" \
 -e "s/AbCdEf/PqRsTuVxYz/g" \
 -e "s/Abcdef/Pqrstuvxyz/g" files/src
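Only the three spellings listed are handled; any other mix of case is left alone. A quick check on standard input (no -i and no files/src), with fixed_case_sub as a made-up function name:

```shell
#!/bin/sh
# fixed_case_sub: hypothetical wrapper around the three expressions above,
# reading standard input instead of editing files in place.
fixed_case_sub() {
  sed -e "s/abcdef/pqrstuvxyz/g" \
      -e "s/AbCdEf/PqRsTuVxYz/g" \
      -e "s/Abcdef/Pqrstuvxyz/g"
}

printf '%s\n' 'abcdef Abcdef AbCdEf ABCDEF' | fixed_case_sub
```

This should print "pqrstuvxyz Pqrstuvxyz PqRsTuVxYz ABCDEF": the all-uppercase spelling is untouched because no expression covers it.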
