Extracting data from a text file

Answer 1

A simple pipeline of standard text tools will do the job:

walt@bat:~(0)$ grep -E -o 'Mapping filepath: [^*]+' Data.file | cut "-d " -f3
map_leaf_M_BAN.AC.txt
             # Note the following regexp is fixed below - user's file had a TAB
walt@bat:~(0)$ grep -E -o 'Total number seqs written +[0-9]+' Data.file | awk '{print $5}'
32310

Since the file contains a TAB character (as pointed out in the comments),

$ grep "Total number seqs written" split_library_log.txt | cat -t
Total number seqs written^I32992
Total number seqs written^I38519

the second grep command should be

 grep -E -o 'Total number seqs written[[:space:]]+[0-9]+' Data.file | awk '{print $5}'
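
To see why the separator matters, here is a minimal sketch that runs both patterns against a throwaway file. The temporary file and its two lines are made up for illustration, modelled on the output quoted above. The POSIX character class is written [[:space:]], with a colon on both sides of the name.

```shell
# Build a small sample log; the separator before the number is a TAB,
# as in the user's file. (File name and contents are illustrative only.)
sample=$(mktemp)
printf 'Mapping filepath: map_leaf_M_BAN.AC.txt\n' > "$sample"
printf 'Total number seqs written\t32992\n' >> "$sample"

# A literal space in the pattern does not match the TAB, so grep finds nothing.
# "|| true" keeps the script going, since grep exits non-zero on no match.
space_only=$(grep -E -o 'Total number seqs written +[0-9]+' "$sample" | awk '{print $5}' || true)

# [[:space:]] matches spaces and TABs alike, so the count is extracted.
any_space=$(grep -E -o 'Total number seqs written[[:space:]]+[0-9]+' "$sample" | awk '{print $5}')

echo "space only: '$space_only'"
echo "any space:  '$any_space'"
rm -f "$sample"
```

The awk stage needs no change either way, because awk's default field splitting treats spaces and TABs the same.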

And of course, read man grep; man cut; man awk; man 7 regex.
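
As an alternative, both values can probably be pulled out in a single awk pass, since awk already splits fields on any run of spaces or TABs. This is only a sketch, not part of the original answer: the sample file is hypothetical, and it assumes each label appears on its own line in the layout quoted above.

```shell
# Hypothetical sample input modelled on the lines quoted in the answer.
sample=$(mktemp)
printf 'Mapping filepath: map_leaf_M_BAN.AC.txt\n' > "$sample"
printf 'Total number seqs written\t32992\n' >> "$sample"

# One awk pass: print field 3 of the mapping line and field 5 of the totals line.
# Default field splitting treats runs of spaces and TABs as one separator.
extracted=$(awk '/Mapping filepath:/ {print $3}
                 /Total number seqs written/ {print $5}' "$sample")

echo "$extracted"
rm -f "$sample"
```

If the real file has several such blocks, each matching line produces one output line, in file order.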
