How do you separate fields with multiple spaces and store them in an array?

Answer 1

Use colrm to remove columns from the file.

#!/bin/bash

shopt -s extglob

a=()
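while IFS= read -r; do            # -r keeps backslashes literal
   a+=("${REPLY%%*( )}")          # strip trailing spaces (needs extglob)
done < <(colrm 26 < text.txt)     # colrm 26 drops column 26 onwards

printf '%s\n' "${a[@]:2:3}"       # print up to three elements from index 2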

(Bash built-in version):

#!/bin/bash

shopt -s extglob

a=()
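while IFS= read -r; do            # -r keeps backslashes literal
    b="${REPLY::26}"              # keep only the first 26 characters
    a+=("${b%%*( )}")             # strip trailing spaces (needs extglob)
done < text.txt

printf '%s\n' "${a[@]:2:3}"       # print up to three elements from index 2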
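
As a quick way to try the built-in variant without a text.txt on disk, the sketch below generates a sample table with printf and also keeps the second column. The sample rows, the width of 26 and the array names col1 and col2 are assumptions for illustration, not part of the answer above.

```bash
#!/bin/bash
shopt -s extglob                 # enables the *( ) pattern used for trimming

width=26                         # assumed offset where the second column starts
col1=() col2=()
while IFS= read -r line; do
    left="${line::width}"        # fixed-width slice holding column 1
    col1+=("${left%%*( )}")      # drop the padding spaces at the end
    col2+=("${line:width}")      # everything from the offset onwards
done < <(printf '%-26s%s\n' 'field1' 'field2' '------' '-------' \
                            'this are numbers' '12345' \
                            'this letters' 'abc def ghi')

# Indexes 0 and 1 hold the two header lines, so data starts at index 2.
printf '%s | %s\n' "${col1[2]}" "${col2[2]}"
printf '%s | %s\n' "${col1[3]}" "${col2[3]}"
```

This only works when the column offset is known in advance and every row is padded to it.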

Answer 2

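Moving my comment here, based on this source: to display only a specific column of a table whose columns are separated by multiple spaces: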

awk -F '  +' '{print $2}' mytxt.txt  # Or with -F ' {2,}'

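Note that it does not work with double quotes: inside double quotes the shell expands $2 itself before awk ever sees the program.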

I find this particularly useful for finding duplicates, with something like:

somelist... | sort | uniq -c | sort -rn | grep -vP "^ +1 " | awk -F '  +' '{print $3}'
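
To get from that awk output back to the array the question asks for, one option is mapfile with process substitution. This is a minimal sketch, assuming a two-line header to skip; the temporary sample file and the array name col2 are hypothetical stand-ins for mytxt.txt and whatever name you prefer.

```bash
#!/bin/bash
# Hypothetical sample file standing in for mytxt.txt.
tmp=$(mktemp)
printf '%-26s%s\n' 'field1' 'field2' '------' '-------' \
    'this are numbers' '12345' 'this letters' 'abc def ghi' > "$tmp"

# Split on runs of two or more spaces; NR > 2 skips the two header lines.
# -t strips the trailing newline from each array element.
mapfile -t col2 < <(awk -F '  +' 'NR > 2 {print $2}' "$tmp")
rm -f "$tmp"

printf '%s\n' "${col2[@]}"
```

Single spaces inside a field (as in "abc def ghi") survive because the separator needs at least two spaces.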

Answer 3

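You can use the bash built-in mapfile (also known as readarray) with a callback that uses parameter expansion to trim the longest trailing substring that begins with two spaces: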

mapfile -c 1 -C 'f() { field_one[$1]="${2%%  *}"; }; f' < mytxt

For example, given

$ cat mytxt
field1                    field2
------                    -------
this are numbers          12345
this letters              abc def ghi

then

$ mapfile -c 1 -C 'f() { field_one[$1]="${2%%  *}"; }; f' < mytxt
$
$ printf '%s\n' "${field_one[@]}" | cat -A
field1$
------$
this are numbers$
this letters$
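
The same parameter expansion also works in a plain read loop, which makes it easy to keep the second field as well. The following is only a sketch: the array name field_two, the helper variables first and rest, and the inline sample rows are assumptions added for illustration.

```bash
#!/bin/bash
field_one=() field_two=()
while IFS= read -r line; do
    first="${line%%  *}"                      # text before the first run of 2+ spaces
    rest="${line#"$first"}"                   # separator run plus the second field
    field_one+=("$first")
    field_two+=("${rest#"${rest%%[! ]*}"}")   # strip the leading spaces from rest
done < <(printf '%-26s%s\n' 'field1' 'field2' \
                            'this are numbers' '12345' \
                            'this letters' 'abc def ghi')

printf '[%s] [%s]\n' "${field_one[1]}" "${field_two[1]}"
printf '[%s] [%s]\n' "${field_one[2]}" "${field_two[2]}"
```

A line that contains no double space ends up whole in field_one with an empty entry in field_two.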

Answer 4

This answer focuses on dropping the two header lines from the array to meet the output requirements.

$ cat fieldone.txt
field1                    field2
------                    -------
this are numbers          12345
this letters              abc def ghi 

$ fieldone
this are numbers         
this letters

Here is the script:

#!/bin/bash

# NAME: fieldone
# PATH: $HOME/askubuntu/
# DESC: Answer for: https://askubuntu.com/questions/1194620/
# how-would-you-separate-fields-with-multiple-spaces-and-store-them-in-an-array

# DATE: December 8, 2019.

i=0                                     # Current 0-based array index number
LineNo=0                                # Explicit start for the line counter
while IFS= read -r line; do             # Read all lines from input file, verbatim
    ((LineNo++))                        # Current line number of input file
    [[ $LineNo -eq 1 ]] && continue     # "Field 1     Field 2" skip first line
    if [[ $LineNo -eq 2 ]] ; then       # Is this line 2?
        # Grab the second column position explained in:
        # https://unix.stackexchange.com/questions/153339/
        # how-to-find-a-position-of-a-character-using-grep
        Len="$(grep -aob ' -' <<< "$line" | \grep -oE '[0-9]+')"
        continue                        # Loop back for first field
    fi

    field_one[$i]="${line:0:$Len}"      # Extract line position 0 for Len
    echo "${field_one[i]}"              # Display array index just added
    ((i++))                             # Increment for next array element

done < fieldone.txt                     # Input filename fed into read loop

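Hopefully the code and comments are self-explanatory. If not, feel free to post a comment.

If there is only a single space between the two columns, the script still works, whereas some of the other answers would break: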

field1         field2
------         ------
this is letter abcdef
this is number 123456
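
The column offset can also be derived from the dashed line with parameter expansion alone, without calling grep. This is a sketch under the assumption that the table has exactly two columns and that the second line is made of dashes and spaces only; the names n, last, len and field_two are hypothetical, and the sample rows are generated inline rather than read from fieldone.txt.

```bash
#!/bin/bash
shopt -s extglob                          # for the *( ) trailing-space pattern

field_one=() field_two=()
n=0
while IFS= read -r line; do
    n=$((n + 1))
    [[ $n -eq 1 ]] && continue            # skip the column titles
    if [[ $n -eq 2 ]]; then
        last="${line##* }"                # last group of dashes on the line
        len=$(( ${#line} - ${#last} ))    # offset where column 2 starts
        continue
    fi
    left="${line:0:len}"                  # fixed-width slice for column 1
    field_one+=("${left%%*( )}")          # trim the padding spaces
    field_two+=("${line:len}")            # remainder is column 2
done < <(printf '%-15s%s\n' 'field1' 'field2' '------' '------' \
                            'this is letter' 'abcdef' \
                            'this is number' '123456')

printf '%s / %s\n' "${field_one[0]}" "${field_two[0]}"
printf '%s / %s\n' "${field_one[1]}" "${field_two[1]}"
```

Because the split is positional, it still separates the columns when only one space lies between them.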
