Delete only customers with a given domain extension (in a .csv file)

Answer 1

Use Miller (mlr) to read the file as a header-less CSV file, then filter it so that only records whose second field does not end in .br are kept:

mlr --csv -N filter '$2 !=~ "\.br$"' file

To quote all fields in the output, add --quote-all after -N. If the file has a header, drop -N and use the header name instead of $2, e.g. $email !=~ "\.br$".

Testing:

$ cat file
"Phone Number","[email protected]","NAME"
"Phone Number2","[email protected]","NAME2"
"Phone Number3","[email protected]","NAME3"
"Phone Number","[email protected]","NAME"
"Phone Number","[email protected]","NAME.br"

$ mlr --csv -N filter '$2 !=~ "\.br$"' file
Phone Number,[email protected],NAME
Phone Number3,[email protected],NAME3
Phone Number,[email protected],NAME
Phone Number,[email protected],NAME.br
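
The header variant mentioned above can be sketched as follows. This is only an illustration under assumptions: the file name customers_with_header.csv, the column names id, email and name, and the addresses are all made up, and the mlr call is skipped if Miller is not installed.

```shell
# Hypothetical sample data with a header row; the addresses are invented.
cat > customers_with_header.csv <<'EOF'
id,email,name
c1,alice@example.com,Alice
c2,bob@example.com.br,Bob
c3,carol@example.org,Carol.br
EOF

# No -N here, so the first line is treated as the header and $email
# refers to the column by name. Only the email column is tested, so the
# ".br" at the end of the name column does not affect the result.
if command -v mlr >/dev/null 2>&1; then
  filtered=$(mlr --csv filter '$email !=~ "\.br$"' customers_with_header.csv)
  printf '%s\n' "$filtered"
else
  echo "mlr (Miller) is not installed" >&2
fi
```

With this data, only the c2 record should be dropped; the header line and the c1 and c3 records should remain.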

Answer 2

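You need to escape the . so that it does not match any character; this makes sure the pattern does not match something like "[email protected]", for example. You can also require that the .br occurs after an @.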

Try

 sed -i '/".*\@[^"]*\.br"/d' customer.csv

Here is a sample run:

~$ echo '"Phone Number","[email protected]","NAME"
> "Phone Number2","[email protected]","NAME2"
> "Phone Number3","[email protected]","NAME3"
> "Phone Number","[email protected]","NAME"
> "Phone Number","[email protected]","NAME.br"' > customers.csv

~$ cat customers.csv
"Phone Number","[email protected]","NAME" 
"Phone Number2","[email protected]","NAME2"  <-- should get deleted
"Phone Number3","[email protected]","NAME3"
"Phone Number","[email protected]","NAME"
"Phone Number","[email protected]","NAME.br"

~$ sed -i '/".*\@[^"]*\.br"/d' customers.csv

~$ cat customers.csv 
"Phone Number","[email protected]","NAME"
"Phone Number3","[email protected]","NAME3"
"Phone Number","[email protected]","NAME"
"Phone Number","[email protected]","NAME.br"

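To see why the escaped dot and the [^"]* matter, here is a small sketch that runs three variants of the pattern, without -i, on a hypothetical file. The file name sample.csv and the addresses in it (including the invented ".xbr" ending) are assumptions for illustration, not data from the question.

```shell
# Hypothetical sample data; the addresses are invented for illustration.
cat > sample.csv <<'EOF'
"Phone Number","alice@example.com","NAME"
"Phone Number2","bob@example.com.br","NAME2"
"Phone Number3","carol@example.xbr","NAME3"
"Phone Number4","dave@example.org","NAME.br"
EOF

# Unescaped dot: "." matches any character, so the ".xbr" address
# is deleted along with the ".br" one.
loose_dot=$(sed '/".*@[^"]*.br"/d' sample.csv)

# ".*" after the @ can run past the closing quote into the next field,
# so the record whose NAME ends in ".br" is deleted as well.
greedy=$(sed '/".*@.*\.br"/d' sample.csv)

# Escaped dot plus [^"]*: only the address that ends in a literal ".br"
# inside its own quoted field is deleted. A plain @ needs no escaping.
strict=$(sed '/".*@[^"]*\.br"/d' sample.csv)

printf 'unescaped dot:\n%s\n\n' "$loose_dot"
printf 'greedy .*:\n%s\n\n' "$greedy"
printf 'escaped dot with [^"]*:\n%s\n' "$strict"
```

Only the last variant keeps both the ".xbr" address and the record whose name ends in ".br" while still removing the ".br" address.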