如何使用 Awk 修改/组织文件上的数据

Question 1

我只是把我的解决方案放在一起，sed即使它特别要求 AWK，我发现这个解决方案更紧凑和直接：

GNU Sed（在 CentOS 下测试）：

sed -n '1!p' addresses.csv | sed -r 's!^([0-9]*(\sbis|\ster)?),?(.*)$!\1,\3!g;s!(.*)([^,])(,[0-9]*)$!\1\2,\3!g'

OS-X / BSD Sed

sed -n '1!p' addresses.csv | sed -E 's!^([0-9]*( bis| ter)?),?(.*)$!\1,\3!g;s!(.*)([^,])(,[0-9]*)$!\1\2,\3!g'

第一个 sed 命令是获取除第一行（标题）之外的所有行。

对于第二个sed我使用替换：

^                : Starting text.
[0-9]*           : all numbers (0, 1, ... 99, 999, 99999999 and so on) 
( bis| ter)?     : optionally followed by " bis" or " ter" (notice the space before); group 2
,?           : optionally followed by a comma
(.*)$            : the rest of the string until the end ($) (group 3)

!\1,\3           : replaced by first group (number + extension) - comma - third group

注意第二组是“bis”和“ter”的括号，第一组是这个([0-9]*( bis| ter){0,1})

第二个替换是标准化逗号（如果没有完成，,,\d我们添加一个额外的逗号。

Answer

我只是把我的解决方案放在一起，sed即使它特别要求 AWK，我发现这个解决方案更紧凑和直接：