逐行读取文件并记住文件中的最后位置

Question 1

如果你想获得匹配，那么你根本不需要使用循环。仅使用单个命令会快得多grep：

grep -Ff input_strings service.log > results.txt

也就是说，如果您想按字面意思执行问题中所述的操作，那么您可以使用变量来跟踪找到最后一个匹配项的行：

LINE_NUMBER=0
while read LINE; do

    # Search for the next match starting at the line number of the previous match
    MATCH="$(tail -n+${LINE_NUMBER} "service.log" | grep -n "${LINE}" | head -n1)";

    # Extract the line number from the match result
    LINE_NUMBER="${MATCH/:*/}";

    # Extract the matching string from the match result
    STRING="${x#*:}";

    # Output the matching string
    echo "${STRING}";

done < input_strings.txt > result.txt

Answer

如果你想获得匹配，那么你根本不需要使用循环。仅使用单个命令会快得多grep：

grep -Ff input_strings service.log > results.txt

也就是说，如果您想按字面意思执行问题中所述的操作，那么您可以使用变量来跟踪找到最后一个匹配项的行：

LINE_NUMBER=0
while read LINE; do

    # Search for the next match starting at the line number of the previous match
    MATCH="$(tail -n+${LINE_NUMBER} "service.log" | grep -n "${LINE}" | head -n1)";

    # Extract the line number from the match result
    LINE_NUMBER="${MATCH/:*/}";

    # Extract the matching string from the match result
    STRING="${x#*:}";

    # Output the matching string
    echo "${STRING}";

done < input_strings.txt > result.txt

Question 2

我猜你想搜索第一个关键字，然后在该匹配之后继续搜索下一个关键字等，并打印匹配的内容。

鉴于keywords：

foo
bar

和data：

bar 0
foo 1
bar 1
foo 2

这里的脚本awk应该做到这一点（使用 GNU awk 测试）：

$ awk 'BEGIN {i = j = 0} NR==FNR { k[i++] = $0; next} 
       $0 ~ k[j] {j++; print $0} j >= i {exit}' keywords data 
foo 1
bar 1

i从 0开始j，在第一个文件期间（将NR==FNR当前文件的记录/行号与所看到的总行数进行比较），我们将关键字收集到一个数组中。之后，尝试匹配j:th 关键字，并j在匹配时打印并增加。找到所有关键字后退出。

与一样grep，这里的关键字实际上是正则表达式模式，尽管awk这里显然是正则表达式。如果您想搜索固定字符串，请使用index($0, key)代替$0 ~ key。

或者，在开始时不加载关键字：

$ awk -vkeyfile=keywords 'BEGIN {getline key < keyfile } 
      $0 ~ key {print $0; if (!getline key < keyfile) exit;}' data
foo 1 
bar 1

这应该很简单。

Answer