How can I remove special characters from a file?

Answer

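Assuming there is only one class block and each tag is on its own line, this will work for you in GNU awk. It replaces every non-alphanumeric character on the lines between the two tags with a space: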

awk '/<\/class>/{p=0};p{gsub(/[^A-Za-z0-9]/," ")};/<class>/{p=1};1' file.txt
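
To check the behaviour before running it on real data, here is a minimal sketch that applies the same awk program to a small test file. The file name sample.txt and its contents are made up for illustration and are not from the question.

```shell
# Hypothetical sample input; the file name and contents are assumptions.
workdir=$(mktemp -d)
cat > "$workdir/sample.txt" <<'EOF'
keep: this & that
<class>
a@b $c
</class>
EOF

# p is switched on after the <class> line and off at the </class> line,
# so only the lines strictly between the tags are rewritten.
# The trailing 1 prints every line, changed or not.
result=$(awk '/<\/class>/{p=0};p{gsub(/[^A-Za-z0-9]/," ")};/<class>/{p=1};1' "$workdir/sample.txt")
printf '%s\n' "$result"
rm -rf "$workdir"
```

The first line and both tag lines come through unchanged, while the line inside the block becomes "a b  c", with a space wherever a special character was.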

Answer

I tested the sed command below and it works fine. With it I removed all of the special characters [<>&$@/'"] between <class> and </class>.

input.txt

<class>
these are special @ $ characters / < > & " '
</class>

Command

sed -n '/<class>/,/<\/class>/p' input.txt | sed '/^[a-z]/s/[<>&$@/]//g' | sed "s/'//g" | sed 's/"//g'

Output

<class>
these are special   characters
</class>
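
The four-stage pipeline can also be folded into a single sed process. The following is only a sketch of that idea, not the answer's tested command: it reuses the sample input.txt from above, written to a temporary directory, and uses | as the substitution delimiter so the / inside the brackets needs no escaping.

```shell
# Sample input copied from the answer, written to a temporary file.
workdir=$(mktemp -d)
cat > "$workdir/input.txt" <<'EOF'
<class>
these are special @ $ characters / < > & " '
</class>
EOF

# One sed process: inside the <class>...</class> range, strip the listed
# characters from lines that start with a lowercase letter (so the tag
# lines keep their < > and /), remove single quotes, then print.
result=$(sed -n \
  -e '/<class>/,/<\/class>/{' \
  -e '/^[a-z]/s|[<>&$@/"]||g' \
  -e "s|'||g" \
  -e 'p' \
  -e '}' "$workdir/input.txt")
printf '%s\n' "$result"
rm -rf "$workdir"
```

This prints the same three lines as the output shown above, apart from the trailing spaces left behind where the characters at the end of the line were removed.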
