使用 Bash 从文本文件中提取标记的字符串

Question

作为第一个想法，awk：

awk -vRS='#[^#]+#' 'RT{gsub(/#/,"",RT);p[RT]=1}END{for(i in p)print i}' the_file

但此决定可能取决于您必须执行的其他操作。

说明按照评论中的要求。

awk -vRS='#[^#]+#' '   # use /#[^#]+#/ as record separator
RT {   # record terminator not empty?
  gsub(/#/,"",RT)    # remove the # parameter delimiter markup
  p[RT]=1   # store it as key in array p
}
END {   # end of input?
  for (i in p) print i   # loop through array p and print each key
}' the_file

重要的部分是使用RT（记录终止符）内置变量：

   RT          The record terminator.  Gawk sets RT to the input text that
               matched the character or regular expression specified by
               RS.

Answer 1

作为第一个想法，awk：

awk -vRS='#[^#]+#' 'RT{gsub(/#/,"",RT);p[RT]=1}END{for(i in p)print i}' the_file

但此决定可能取决于您必须执行的其他操作。

说明按照评论中的要求。

awk -vRS='#[^#]+#' '   # use /#[^#]+#/ as record separator
RT {   # record terminator not empty?
  gsub(/#/,"",RT)    # remove the # parameter delimiter markup
  p[RT]=1   # store it as key in array p
}
END {   # end of input?
  for (i in p) print i   # loop through array p and print each key
}' the_file

重要的部分是使用RT（记录终止符）内置变量：

   RT          The record terminator.  Gawk sets RT to the input text that
               matched the character or regular expression specified by
               RS.

使用 Bash 从文本文件中提取标记的字符串

答案1

相关内容