AWK - Print a range of columns

Answer 1

The utility cut has a compact notation for this:

cut -d, -f2-7 <input-file>

producing:

column2,column3,column4,column5,column6,column7
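The same field notation also accepts open-ended ranges and comma-separated lists. A minimal sketch, assuming a one-line sample shaped like the data used in this answer (the variable name line is only for illustration):

```shell
line='column1,column2,column3,column4,column5,column6,column7,column8'

# Closed range: fields 2 through 7.
printf '%s\n' "$line" | cut -d, -f2-7

# Open-ended ranges: field 6 to the end, and the start through field 3.
printf '%s\n' "$line" | cut -d, -f6-
printf '%s\n' "$line" | cut -d, -f-3

# A list may mix single fields and ranges.
printf '%s\n' "$line" | cut -d, -f1,3-5

# cut always emits fields in input order, so this prints the same as -f2-7.
printf '%s\n' "$line" | cut -d, -f7,6,2-5
```

The last call is why cut alone is not enough when fields need to be reordered or repeated.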

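In response to the comment by @PlasmaBinturong: my intent was to address the issue of a short calling sequence: "... my awk command would get horribly long ...". However, one can also find codes that arrange the fields as one might desire. As much as I like awk, perl, and python, I have often found it useful to build a specific utility to extend the capabilities of standard *nix. So here is an excerpt from a test script, s2, showing the utilities recut and arrange. Both allow re-arrangement and duplication of fields, and arrange also allows decreasing field ranges: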

FILE=${1-data1}

# Utility functions: print-as-echo, print-line-with-visual-space.
pe() { for _i;do printf "%s" "$_i";done; printf "\n"; }
pl() { pe;pe "-----" ;pe "$*"; }

pl " Input data file $FILE:"
head "$FILE"

pl " Results, cut:"
cut -d, -f2-7 "$FILE"

pl " Results, recut (modified as my-recut):"
my-recut -d "," 7,6,2-5 < "$FILE"

pl " Results, arrange:"
arrange -s "," -f 5,3-1,7,5,3-4,5 "$FILE"

producing results from these versions:

OS, ker|rel, machine: Linux, 3.16.0-10-amd64, x86_64
Distribution        : Debian 8.11 (jessie) 
bash GNU bash 4.3.30
cut (GNU coreutils) 8.23
recut - ( local: RepRev 1.1, ~/bin/recut, 2010-06-10 )
arrange (local) 1.15

-----
 Input data file data1:
column1,column2,column3,column4,column5,column6,column7,column8

-----
 Results, cut:
column2,column3,column4,column5,column6,column7

-----
 Results, recut (modified as my-recut):
column7,column6,column2,column3,column4,column5

-----
 Results, arrange:
column5,column3,column2,column1,column7,column5,column3,column4,column5
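For readers without recut or arrange installed, the reordering, duplication, and decreasing ranges shown above can be approximated in plain awk. This is only a sketch under assumptions: arrange_fields is a hypothetical helper name, it is not the arrange utility itself, and it handles only simple comma-separated input with a spec made of field numbers and N-M ranges.

```shell
# arrange_fields SPEC [FILE...]: hypothetical helper, not the arrange utility.
# SPEC is a comma-separated list of field numbers and ranges; a range such
# as 3-1 counts down, and a field may appear more than once.
arrange_fields() {
  spec=$1
  shift
  awk -v spec="$spec" '
    BEGIN {
      FS = OFS = ","
      # Expand the spec once into order[1..n], a flat list of field numbers.
      m = split(spec, parts, ",")
      for (p = 1; p <= m; p++) {
        k = split(parts[p], r, "-")
        lo = r[1] + 0
        hi = (k == 2 ? r[2] + 0 : lo)
        step = (lo <= hi ? 1 : -1)
        for (f = lo; f != hi + step; f += step) order[++n] = f
      }
    }
    {
      # Print the requested fields in the requested order for every record.
      for (j = 1; j <= n; j++) printf "%s%s", $(order[j]), (j < n ? OFS : ORS)
    }
  ' "$@"
}

# Same spec as the arrange call in the test script.
printf '%s\n' 'column1,column2,column3,column4,column5,column6,column7,column8' |
  arrange_fields 5,3-1,7,5,3-4,5
# prints column5,column3,column2,column1,column7,column5,column3,column4,column5
```

Expanding the spec once in BEGIN keeps the per-record work down to a single loop over the precomputed field list.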

my-recut is a slight modification of the textutils code recut, and arrange is our version of an extended cut. More information:

recut   Process fields like cut, allow repetitions and re-ordering. (what)
Path    : ~/bin/recut
Version : - ( local: RepRev 1.1, ~/bin/recut, 2010-06-10 )
Length  : 56 lines
Type    : Perl script, ASCII text executable
Shebang : #!/usr/bin/perl
Home    : http://www1.cuni.cz/~obo/textutils/ (doc)
Modules : (for perl codes)
 Getopt::Long   2.42

arrange Arrange fields, like cut, but in user-specified order. (what)
Path    : ~/bin/arrange
Version : 1.15
Length  : 355 lines
Type    : Perl script, ASCII text executable
Shebang : #!/usr/bin/perl
Modules : (for perl codes)
 warnings       1.23
 strict 1.08
 Carp   1.3301
 Getopt::Euclid 0.4.5

Best wishes ... cheers, drl

Answer 2

$ awk -v b=2 -v e=7 'BEGIN{FS=OFS=","} {for (i=b;i<=e;i++) printf "%s%s", $i, (i<e ? OFS : ORS)}' file
column2,column3,column4,column5,column6,column7

b = beginning field number, e = end field number. If you need to handle CSVs with quoted fields, embedded commas, newlines, etc., then see https://stackoverflow.com/q/45420535/1745001.
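With the loop as written, a record that has fewer than e fields is padded with empty fields up to e. If that is not wanted, the end of the range can be clamped to NF. A minimal sketch, assuming a hypothetical wrapper function named print_range that takes b and e as its first two arguments:

```shell
# print_range B E [FILE...]: hypothetical wrapper around the awk loop above.
print_range() {
  b=$1 e=$2
  shift 2
  awk -v b="$b" -v e="$e" '
    BEGIN { FS = OFS = ","; b += 0; e += 0 }   # force numeric comparison
    {
      last = (e > NF ? NF : e)                 # clamp the range to this record
      if (b > last) { print ""; next }         # nothing in range: empty line
      for (i = b; i <= last; i++) printf "%s%s", $i, (i < last ? OFS : ORS)
    }
  ' "$@"
}

printf '%s\n' 'column1,column2,column3,column4,column5,column6,column7,column8' |
  print_range 2 7
# prints column2,column3,column4,column5,column6,column7

printf '%s\n' 'a,b,c' | print_range 2 7
# prints b,c
```

Like the original one-liner, this splits on every comma, so it is not suitable for quoted CSV fields.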

Answer 3

sed -e '
  s/,/\n/7        ;# tag the end of col7
  s/^/,/          ;# add a comma
  s/,/\n/2        ;# tag beginning of col2
  s/.*\n\(.*\)\n.*/\1/ ;# perform surgery 
' file.csv

Results:

column2,column3,column4,column5,column6,column7

Answer 4

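This worked for me:

awk '{ for (i=2; i<=7; i++) { printf "%s", $i; if (i != 7) { printf "," } } print "" }'

For a small number of columns this does not make much sense (in terms of command complexity). But if your input really has a lot of columns, the approach is practical.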
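A sketch of the same loop on a wide, comma-separated record. Two things here are assumptions rather than part of the answer: the input is generated on the spot with field names c1 to c100, and -F, is added so that awk splits on commas instead of whitespace. Each field is passed to printf as a "%s" argument, so a literal % in the data is not read as a format directive.

```shell
# Build one record with 100 comma-separated fields: c1,c2,...,c100.
wide=$(awk 'BEGIN { for (i = 1; i <= 100; i++) printf "c%d%s", i, (i < 100 ? "," : "\n") }')

# Print fields 40 through 45, joined by commas.
printf '%s\n' "$wide" |
  awk -F, '{ for (i = 40; i <= 45; i++) { printf "%s", $i; if (i != 45) printf "," } print "" }'
# prints c40,c41,c42,c43,c44,c45
```

Listing the same six fields by hand as $40","$41"," and so on is where the loop starts to pay off.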
