Bash: split a string on a delimiter, subject to a character limit

Question

sed -r 's/([^ .]+ [^ .]+) /\1\n/g' <<< "The quick fox jumped over the lazy dog"

The quick
fox jumped
over the
lazy dog

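The bracket expression [^ .]+ matches one or more (+) characters that are not (^) a space or a dot; inside brackets the dot is literal, not a wildcard. The capture group ([^ .]+ [^ .]+) therefore matches the pattern string string, that is, two words separated by a single space. The regular expression ends with one more space after the capture group; that space is consumed by the match but not captured (it could be moved inside the group to preserve it).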

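With sed's substitute command s, we replace the matched pattern with the content of the first capture group, \1, followed by a newline, \n, in place of the space. The g flag repeats the substitution for every match on the line. The -r option enables extended regular expressions.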
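To make the effect of the literal dot visible, here is a minimal sketch that wraps the command in two helper functions. The names pair_split and pair_split_nodot are hypothetical, the input is piped in with printf instead of a here-string so the functions also run under plain sh, and GNU sed is assumed (both -r and \n in the replacement are GNU features).

```shell
#!/bin/sh
# Hypothetical helpers, assuming GNU sed (-r, and \n in the replacement).

# Same expression as above: the dot inside the brackets is a literal dot,
# so a "word" may contain neither a space nor a dot.
pair_split() {
    printf '%s\n' "$1" | sed -r 's/([^ .]+ [^ .]+) /\1\n/g'
}

# Variant without the dot: a word is any run of non-space characters.
pair_split_nodot() {
    printf '%s\n' "$1" | sed -r 's/([^ ]+ [^ ]+) /\1\n/g'
}

pair_split "The quick fox jumped over the lazy dog"
pair_split "one two. three four five"
pair_split_nodot "one two. three four five"
```

With the input "one two. three four five", the first function cannot match "one two" because that pair is followed by a dot rather than a space, so the first output line carries four words; the second function breaks after "two." as one would probably expect for ordinary sentences.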

Update: here is the actual answer:

sed -r 's/(.{8}) /\1\n/g' <<< "How do we know it is going to match the pre-defined number of characters?"

How do we
know it is
going to
match the
pre-defined
number of
characters?

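In this example the group captures exactly 8 characters (spaces included) that are followed by a space. Because a match can begin in the middle of a word and the unmatched text before it stays on the same line, every output line except possibly the last is at least 8 characters long, and may be longer. We can check the actual length of the output lines as follows: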

sed -r 's/(.{8}) /\1\n/g' <<< "How do we know it is going to match the pre-defined number of characters?" \
    | awk '{print length}'

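And with the help of the answers to the question "How to use printf to print a character multiple times? [awk]", we can achieve the desired result.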

sed -r 's/(.{8}) /\1\n/g' <<< "How do we know it is going to match the pre-defined number of characters?" \
    | awk '{rest=(12 - length); printf "%s%s|\n", $0, substr(".........", 1, rest)}'

How do we...|
know it is..|
going to....|
match the...|
pre-defined.|
number of...|
characters?.|
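The pipeline above hard-codes the minimum length (8), the padded width (12), and a filler string of nine dots, so it can pad by at most nine characters. A minimal sketch of a parameterized version follows; the function name wrap_pad and its argument order are hypothetical, and GNU sed is assumed as before.

```shell
#!/bin/sh
# Hypothetical helper, assuming GNU sed.
# $1 = text, $2 = characters captured before a break, $3 = padded width
wrap_pad() {
    printf '%s\n' "$1" \
        | sed -r "s/(.{$2}) /\1\n/g" \
        | awk -v w="$3" '{
              # Build the filler in a loop so it is not capped by a fixed string.
              pad = ""
              for (i = length($0); i < w; i++) pad = pad "."
              printf "%s%s|\n", $0, pad
          }'
}

wrap_pad "How do we know it is going to match the pre-defined number of characters?" 8 12
```

Called with 8 and 12, this should reproduce the listing above. A line longer than the padded width is printed unpadded, since the loop body never runs for it.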

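If you do want to split words, remove the trailing space from the regular expression above: /(.{8})/. Here is an example in which each line is at most 10 characters long; the second sed command trims a leading or trailing space from each new line.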

sed -r 's/(.{10})/\1\n/g' <<< "How do we know it is going to match the pre-defined number of characters?" \
    | sed -r 's/(^ | $)//g' \
    | awk '{rest=(10 - length); printf "%s%s|\n", $0, substr(".........", 1, rest)}'

How do we.|
know it is|
going to..|
match the.|
pre-define|
d number o|
f characte|
rs?.......|
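The two sed variants give either a minimum line length with whole words, or a hard maximum with split words. If the goal is a hard maximum that still breaks at spaces where possible, the standard fold utility is an alternative; this is a different technique from the sed approach above, shown only as a sketch. The function name wrap_max is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper using fold instead of sed for the line breaking.
# $1 = text, $2 = maximum line width
wrap_max() {
    # -s breaks after the last blank that fits within the width;
    # -w sets the width. The second step drops the blank fold leaves
    # at the end of each broken line.
    printf '%s\n' "$1" | fold -s -w "$2" | sed 's/ $//'
}

wrap_max "How do we know it is going to match the pre-defined number of characters?" 12 \
    | awk '{print length ": " $0}'
```

A word longer than the width is still cut in the middle, because fold has no other place to break it.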

Answer 1

sed -r 's/([^ .]+ [^ .]+) /\1\n/g' <<< "The quick fox jumped over the lazy dog"

The quick
fox jumped
over the
lazy dog