I have a table like the following (.tsv):
s__Methanobrevibacter_smithii k__Archaea p__Euryarchaeota c__Methanobacteria o__Methanobacteriales f__Methanobacteriaceae g__Methanobrevibacter s__Methanobrevibacter_smithii
s__Methanosphaera_stadtmanae k__Archaea p__Euryarchaeota c__Methanobacteria o__Methanobacteriales f__Methanobacteriaceae g__Methanosphaera s__Methanosphaera_stadtmanae
s__Candidatus_Methanomassiliicoccus_intestinalis k__Archaea p__Euryarchaeota c__Thermoplasmata o__Methanomassiliicoccales f__Methanomassiliicoccaceae g__Methanomassiliicoccus s__Candidatus_Methanomassiliicoccus_intestinalis
s__Actinobaculum_sp_oral_taxon_183 k__Bacteria p__Actinobacteria c__Actinobacteria o__Actinomycetales f__Actinomycetaceae g__Actinobaculum s__Actinobaculum_sp_oral_taxon_183
s__Actinomyces_graevenitzii k__Bacteria p__Actinobacteria c__Actinobacteria o__Actinomycetales f__Actinomycetaceae g__Actinomyces s__Actinomyces_graevenitzii
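In the last column, I want to keep only the word after the third underscore (along with the s__ prefix) and delete everything else in that column. In addition, I want to remove the fourth underscore and everything after it in the first column, leaving the other columns unchanged. I would like to get the following output: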
s__Methanobrevibacter_smithii k__Archaea p__Euryarchaeota c__Methanobacteria o__Methanobacteriales f__Methanobacteriaceae g__Methanobrevibacter s__smithii
s__Methanosphaera_stadtmanae k__Archaea p__Euryarchaeota c__Methanobacteria o__Methanobacteriales f__Methanobacteriaceae g__Methanosphaera s__stadtmanae
s__Candidatus_Methanomassiliicoccus k__Archaea p__Euryarchaeota c__Thermoplasmata o__Methanomassiliicoccales f__Methanomassiliicoccaceae g__Methanomassiliicoccus s__intestinalis
s__Actinobaculum_sp k__Bacteria p__Actinobacteria c__Actinobacteria o__Actinomycetales f__Actinomycetaceae g__Actinobaculum s__sp
s__Actinomyces_graevenitzii k__Bacteria p__Actinobacteria c__Actinobacteria o__Actinomycetales f__Actinomycetaceae g__Actinomyces s__graevenitzii
Could someone help me with this?
Thank you very much.
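
One possible way to get the output shown above is sketched below in Python. It rests on a few assumptions that are not stated in the post: the file is tab-separated, every row has eight columns, and the genus sits in column 7 with a g__ prefix. Note that for the Candidatus row, the word after the third underscore would literally be Methanomassiliicoccus, while the desired output shows s__intestinalis. To match the desired output, this sketch takes the word that follows the genus name and only falls back to "the word after the third underscore" when the genus is not found. The names transform_row, strip_prefix and main are made up for illustration.

```python
import sys


def strip_prefix(value, prefix):
    """Return value without a leading prefix, if present."""
    return value[len(prefix):] if value.startswith(prefix) else value


def transform_row(fields):
    """Return a copy of fields with the first and last columns rewritten."""
    out = list(fields)

    # First column: drop the fourth underscore and everything after it.
    # "s__A_B_C" splits into ["s", "", "A", "B", "C"], so keep 4 pieces.
    out[0] = "_".join(fields[0].split("_")[:4])

    # Last column: keep "s__" plus the word that follows the genus name.
    # Assumption: the genus is in column 7 (index 6) with a "g__" prefix.
    genus = strip_prefix(fields[6], "g__")
    words = strip_prefix(fields[-1], "s__").split("_")
    if genus in words[:-1]:
        epithet = words[words.index(genus) + 1]
    elif len(words) > 1:
        # Fallback: the word after the third underscore.
        epithet = words[1]
    else:
        epithet = words[0]
    out[-1] = "s__" + epithet
    return out


def main(in_path, out_path):
    # Assumption: the input is tab-separated with no header line.
    with open(in_path, encoding="utf-8") as src, \
            open(out_path, "w", encoding="utf-8") as dst:
        for line in src:
            line = line.rstrip("\n")
            if not line:
                continue
            dst.write("\t".join(transform_row(line.split("\t"))) + "\n")


if __name__ == "__main__":
    if len(sys.argv) == 3:
        main(sys.argv[1], sys.argv[2])
```

Saved under a hypothetical name such as fix_taxa.py, it could be run as `python fix_taxa.py input.tsv output.tsv`. If the literal "third underscore" rule is what is actually wanted, the genus lookup can be dropped and only the fallback branch kept.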