删除空行后的换行符

Question 1

我会使用 perl 或 awk 一次读取一段数据，并删除除第一个换行符之外的所有内容：

perl -00 -pe '$\="\n\n"; s/\n/\0/; s/\n//g; s/\0/\n/' file

评论过

perl -00 -pe '   # each record is separated by blank lines (-00)
                 # read the file a record at a time and auto-print (-p)
    $\="\n\n";   # auto-append 2 newlines to each record
    s/\n/\0/;    # turn the first newline into a null byte
    s/\n//g;     # remove all other newlines
    s/\0/\n/     # restore the first newline
' file

相似地

awk -v RS= -F'\n' '{print $1; for (i=2; i<=NF; i++) printf "%s", $i; print ""; print ""}' file

Answer

我会使用 perl 或 awk 一次读取一段数据，并删除除第一个换行符之外的所有内容：

perl -00 -pe '$\="\n\n"; s/\n/\0/; s/\n//g; s/\0/\n/' file

评论过

perl -00 -pe '   # each record is separated by blank lines (-00)
                 # read the file a record at a time and auto-print (-p)
    $\="\n\n";   # auto-append 2 newlines to each record
    s/\n/\0/;    # turn the first newline into a null byte
    s/\n//g;     # remove all other newlines
    s/\0/\n/     # restore the first newline
' file

相似地

awk -v RS= -F'\n' '{print $1; for (i=2; i<=NF; i++) printf "%s", $i; print ""; print ""}' file

Question 2

您可以使用：

sed '/[0-9]\./{n;:l;N;/\n$/!s/\n/ /;t l}' file

这将输出：

4. Alendronic acid
A. Antiosteoporotic agent.  B. Inhibit osteoclast formation and function by inhibiting FPPS enzyme, so increase bone mass.  C. Osteoporosis in combination with vitamin D. 

5. Aminophylline
A. Methylxanthine. Less potent and shorter-acting bronchodilator than Theophylline.  B. Phosphodiesterase (PDE) inhibitor, so increase cAMP so affecting calcium so relaxes respiratory SM and dilates bronchi/bronchioles.  C. Last option of asthma attack, COPD, Reversible airways obstruction.

解释

我们将行与数字相匹配，将句点与相匹配/[0-9]\./。然后我们输入一个代码块，该代码块转到下一行n。它以开始一个循环:l，用附加下一行N，并用替换换行符s/\n/ /。当循环到达空行时终止，该空行由条件选取/\n$/!。

Answer

您可以使用：

sed '/[0-9]\./{n;:l;N;/\n$/!s/\n/ /;t l}' file

这将输出：

4. Alendronic acid
A. Antiosteoporotic agent.  B. Inhibit osteoclast formation and function by inhibiting FPPS enzyme, so increase bone mass.  C. Osteoporosis in combination with vitamin D. 

5. Aminophylline
A. Methylxanthine. Less potent and shorter-acting bronchodilator than Theophylline.  B. Phosphodiesterase (PDE) inhibitor, so increase cAMP so affecting calcium so relaxes respiratory SM and dilates bronchi/bronchioles.  C. Last option of asthma attack, COPD, Reversible airways obstruction.

解释

我们将行与数字相匹配，将句点与相匹配/[0-9]\./。然后我们输入一个代码块，该代码块转到下一行n。它以开始一个循环:l，用附加下一行N，并用替换换行符s/\n/ /。当循环到达空行时终止，该空行由条件选取/\n$/!。

Question 3

这是一个awk解决方案，通过适当定义输入和输出的字段和记录分隔符来解决该问题；因此有效的命令 ( $1=$1 FS) 非常简单：

awk '
  BEGIN { RS="" ; FS="\n" ; OFS="" ; ORS="\n\n" }
  $1=$1 FS
'

解释：

RS=""- 将空行分隔数据块作为一条记录处理

FS="\n"- 将块的每一行定义为自己的可寻址字段

OFS=""- 由于空白终止数据，无需输出字段分隔符

ORS="\n\n"- 用空行分隔新块（作为输入数据）

$1=$1 FS- 第一个字段（即第一行）将通过换行符与块中的其余行分隔开；因为该分配是awk修改记录（块）中的真实条件，因此将被打印

Answer