如何在脚本中保存变量，以便在两次 awk 运行之间针对脚本中的同一输入文件共享变量？

Question 1

根据@KM的要求，这是我的答案。

#! /bin/sh

# this script pulls all rows from a log that are directly or 
# indirectly related to a given session id.  Session IDs are stored
# in $2 of each row.  This field may be null.  Directly related
# rows are those with $2 matching the supplied parameter.  Indirectly
# related rows are those with $3 (aka xid) matching $3 in some other 
# row where $2 matches the supplied parameter.
# It may be assumed that for any rows with the same $3, 
# the $2 field will be identical or null.


SESS_SRCH="$1"
if [ -z $2 ]
then 
 LOGFILE=/path/to/default/log
else 
 LOGFILE=$2
fi

# pass 1:
# read the logfile once to find all unique XIDs associated
# with the supplied session ID ($SESS_SRCH)

XIDS=$(awk -F\| -v sessid="$1" '$2 ~ sessid { xids[$3]=0 } 
END{ 
    for (xid in xids) { 
        print xid 
    } 
}' < ${LOGFILE}
)

XID_SRCH=""

#build a search string from these xids to form a new search string.
for XID in $XIDS
do
 XID_SRCH="${XID_SRCH}|${XID}" 
done

#strip off the leading "|"
XID_SRCH=${XID_SRCH:1}

# pass 2
# read the logfile again, this time seaching on $3, for any of the
# xids found in pass 1.
awk -F\| -v search="$XID_SRCH" '$3 ~ search { print }' < ${LOGFILE}

Answer

根据@KM的要求，这是我的答案。

#! /bin/sh

# this script pulls all rows from a log that are directly or 
# indirectly related to a given session id.  Session IDs are stored
# in $2 of each row.  This field may be null.  Directly related
# rows are those with $2 matching the supplied parameter.  Indirectly
# related rows are those with $3 (aka xid) matching $3 in some other 
# row where $2 matches the supplied parameter.
# It may be assumed that for any rows with the same $3, 
# the $2 field will be identical or null.


SESS_SRCH="$1"
if [ -z $2 ]
then 
 LOGFILE=/path/to/default/log
else 
 LOGFILE=$2
fi

# pass 1:
# read the logfile once to find all unique XIDs associated
# with the supplied session ID ($SESS_SRCH)

XIDS=$(awk -F\| -v sessid="$1" '$2 ~ sessid { xids[$3]=0 } 
END{ 
    for (xid in xids) { 
        print xid 
    } 
}' < ${LOGFILE}
)

XID_SRCH=""

#build a search string from these xids to form a new search string.
for XID in $XIDS
do
 XID_SRCH="${XID_SRCH}|${XID}" 
done

#strip off the leading "|"
XID_SRCH=${XID_SRCH:1}

# pass 2
# read the logfile again, this time seaching on $3, for any of the
# xids found in pass 1.
awk -F\| -v search="$XID_SRCH" '$3 ~ search { print }' < ${LOGFILE}

Question 2

这是一些代码片段，应该满足您的要求，尽管在我看来，问题是一个逻辑问题，因为无论是否进行第二次测试，该循环的输出都是相同的，因为无论何时都会发生匹配。我猜您需要在第二次运行 awk 时进行比您描述的更复杂的测试。

此代码片段的作用是首先提取数据文件中与字段 2 匹配的所有行，并提取字段 3，然后通过使用排序和 uniq 消除字段 3 的重复项。然后对每个 uniq 字段 3 值运行 while 循环（4792761 或 4792964），但这次针对 PATTERN 测试字段 2，针对循环值测试字段 3。

PATTERN="05478900172"

awk -F\| -v matchpat="$PATTERN" '$2 ~ matchpat {print $3}' | sort | uniq | while read field 
do 
   awk -F\| -v matchpat=$PATTERN -v secondpat="$field" '$2 ~ matchpat { if ( $3 ~ secondpat ) {print $0}}' datafile
done

现在我猜你确实想做一些比你描述的更复杂的事情，因为你可以通过使用字段 3 作为排序键对第一个 awk 命令的输出进行排序来简化它并消除 while 循环。

Answer

这是一些代码片段，应该满足您的要求，尽管在我看来，问题是一个逻辑问题，因为无论是否进行第二次测试，该循环的输出都是相同的，因为无论何时都会发生匹配。我猜您需要在第二次运行 awk 时进行比您描述的更复杂的测试。

此代码片段的作用是首先提取数据文件中与字段 2 匹配的所有行，并提取字段 3，然后通过使用排序和 uniq 消除字段 3 的重复项。然后对每个 uniq 字段 3 值运行 while 循环（4792761 或 4792964），但这次针对 PATTERN 测试字段 2，针对循环值测试字段 3。

PATTERN="05478900172"

awk -F\| -v matchpat="$PATTERN" '$2 ~ matchpat {print $3}' | sort | uniq | while read field 
do 
   awk -F\| -v matchpat=$PATTERN -v secondpat="$field" '$2 ~ matchpat { if ( $3 ~ secondpat ) {print $0}}' datafile
done

现在我猜你确实想做一些比你描述的更复杂的事情，因为你可以通过使用字段 3 作为排序键对第一个 awk 命令的输出进行排序来简化它并消除 while 循环。

如何在脚本中保存变量，以便在两次 awk 运行之间针对脚本中的同一输入文件共享变量？

答案1

答案2

相关内容