在一个脚本中加入多个 sed 命令来处理 CSV 文件

Question 1

首先，正如 Michael 所展示的，您可以将所有这些组合到一个命令中：

sed '/^FOOTER/d; s/^\"//; s/\"$//; s/\"|\"/|/g' csv > csv1

我认为某些sed实现无法应对这一点，可能需要：

  sed -e '/^FOOTER/d' -e 's/^\"//' -e 's/\"$//' -e 's/\"|\"/|/g' csv > csv1

也就是说，看起来您的字段是由定义的|，您只想删除"整个字段，留下字段内的字段。在这种情况下，你可以这样做：

$ sed '/FOOTER/d; s/\(^\||\)"/\1/g; s/"\($\||\)/\1/g' csv 
HEADER
first, column|second "some random quotes" column|third ol' column

或者，使用 GNU sed：

sed -r '/FOOTER/d; s/(^|\|)"/\1/g; s/"($|\|)/\1/g' csv

您还可以使用 Perl：

$ perl -F"|" -lane 'next if /FOOTER/; s/^"|"$// for @F; print @F' csv 
HEADER
first, column|second some random quotes column|third ol' column

Answer

首先，正如 Michael 所展示的，您可以将所有这些组合到一个命令中：

sed '/^FOOTER/d; s/^\"//; s/\"$//; s/\"|\"/|/g' csv > csv1

我认为某些sed实现无法应对这一点，可能需要：

  sed -e '/^FOOTER/d' -e 's/^\"//' -e 's/\"$//' -e 's/\"|\"/|/g' csv > csv1

也就是说，看起来您的字段是由定义的|，您只想删除"整个字段，留下字段内的字段。在这种情况下，你可以这样做：

$ sed '/FOOTER/d; s/\(^\||\)"/\1/g; s/"\($\||\)/\1/g' csv 
HEADER
first, column|second "some random quotes" column|third ol' column

或者，使用 GNU sed：

sed -r '/FOOTER/d; s/(^|\|)"/\1/g; s/"($|\|)/\1/g' csv

您还可以使用 Perl：

$ perl -F"|" -lane 'next if /FOOTER/; s/^"|"$// for @F; print @F' csv 
HEADER
first, column|second some random quotes column|third ol' column

Question 2

这也可以工作：

sed's/^"//;s/"|"/|/g;s/""$/"/'

例子：

$ echo '"this"|" and "ths""|" and "|" this 2"|" also "this", "thi", "and th""' | 
sed 's/^"//; s/"|"/|/g; s/""$/"/'
this| and "ths"| and | this 2| also "this", "thi", "and th"

漂亮的版本

sed '
s/^"//
s/"|"/|/g
s/""$/"/
$d
'

Answer

这也可以工作：

sed's/^"//;s/"|"/|/g;s/""$/"/'

例子：

$ echo '"this"|" and "ths""|" and "|" this 2"|" also "this", "thi", "and th""' | 
sed 's/^"//; s/"|"/|/g; s/""$/"/'
this| and "ths"| and | this 2| also "this", "thi", "and th"

漂亮的版本

sed '
s/^"//
s/"|"/|/g
s/""$/"/
$d
'

Question 3

sed对我有用的命令是：

sed 's/ALA/A/g;s/CYS/C/g;s/ASP/D/g;s/GLU/E/g;s/PHE/F/g;s/GLY/G/g;s/HIS/H/g;s/HID/H/g;s/HIE/H/g;s/ILE/I/g;s/LYS/K/g;s/LEU/L/g;s/MET/M/g;s/ASN/N/g;s/PRO/P/g;s/GLN/Q/g;s/ARG/R/g;s/SER/S/g;s/THR/T/g;s/VAL/V/g;s/TRP/W/g;s/TYR/Y/g;s/MSE/X/g;s/ //g'  < old.txt > new.fasta

sed 命令无法通过管道传输。它必须作为单个命令给出。

Answer

sed对我有用的命令是：

sed 's/ALA/A/g;s/CYS/C/g;s/ASP/D/g;s/GLU/E/g;s/PHE/F/g;s/GLY/G/g;s/HIS/H/g;s/HID/H/g;s/HIE/H/g;s/ILE/I/g;s/LYS/K/g;s/LEU/L/g;s/MET/M/g;s/ASN/N/g;s/PRO/P/g;s/GLN/Q/g;s/ARG/R/g;s/SER/S/g;s/THR/T/g;s/VAL/V/g;s/TRP/W/g;s/TYR/Y/g;s/MSE/X/g;s/ //g'  < old.txt > new.fasta

sed 命令无法通过管道传输。它必须作为单个命令给出。

在一个脚本中加入多个 sed 命令来处理 CSV 文件

答案1

答案2

sed's/^"//;s/"|"/|/g;s/""$/"/'

答案3

相关内容