Get the complement of an array?

Question 1

Following on from my suggestion in the other thread:

awk '
  BEGIN { srand(); do a[int(100*rand()+1)]; while (length(a)<10) }
  NR in a
' ~/orig.txt > ~/short.txt

this can be changed to create two files:

awk -v range=100 -v offset=1 -v amount=10 '
  BEGIN { srand(); do a[int(range*rand()+offset)]; while (length(a)<amount) }
  NR in a    { print > "short.txt" }
  !(NR in a) { print > "rest.txt" }
' ~/orig.txt

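(Note that ~ cannot be used inside awk. You can, however, read HOME through ENVIRON[], as in print > (ENVIRON["HOME"] "/short.txt") and, respectively, print > (ENVIRON["HOME"] "/rest.txt"). The parentheses keep the concatenated file name from being parsed ambiguously after the > redirection.)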
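As a minimal sketch of the ENVIRON[] variant, the commands below write both files into a scratch directory and check that they partition the input. OUTDIR is a hypothetical variable standing in for HOME so that nothing in the home directory is touched; it is not part of the original commands. The sketch assumes an awk whose length() accepts an array (gawk does).

```shell
# OUTDIR is an illustrative stand-in for HOME; it must be exported
# so that awk can see it through ENVIRON[].
OUTDIR=$(mktemp -d)
export OUTDIR
seq 100 > "$OUTDIR/orig.txt"

awk -v range=100 -v offset=1 -v amount=10 '
  BEGIN { srand(); do a[int(range*rand()+offset)]; while (length(a)<amount) }
  NR in a    { print > (ENVIRON["OUTDIR"] "/short.txt") }  # the random picks
  !(NR in a) { print > (ENVIRON["OUTDIR"] "/rest.txt") }   # the complement
' "$OUTDIR/orig.txt"

# Together the two outputs should reproduce the input exactly once.
sort -n "$OUTDIR/short.txt" "$OUTDIR/rest.txt" |
cmp -s - "$OUTDIR/orig.txt" && echo "short.txt and rest.txt partition orig.txt"
```

Merging the two outputs and comparing them with the input is a cheap way to confirm that no line was lost or written twice.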

Question 2

OK, on second thought, I was working way too hard at that. You only need this:

shuf -i 1-100 -n10 |
sed 's/$/{p;b\n}/' |
sed -nf - -e 'w separate_file' infile >outfile

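Though you may need a literal newline in place of the \n in the sed replacement. In any case, it does the same as the version below; it just doesn't bother generating commands for the other 90 lines. They fall into place simply because they are in the file, so they need no special handling.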
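To make the generated script concrete, here is a small sketch in which two fixed line numbers stand in for the shuf output (an assumption made only so the result is reproducible), and a literal backslash-newline replaces the \n in the replacement. It assumes a sed that accepts -f - for a script on standard input, as GNU sed does.

```shell
cd "$(mktemp -d)"
seq 10 > infile

# 3 and 7 stand in for the shuf output. Each number N is turned into
# the two script lines "N{p;b" and "}": print the line, then skip the w.
printf '%s\n' 3 7 |
sed 's/$/{p;b\
}/' |
sed -nf - -e 'w separate_file' infile > outfile

echo "outfile:";       cat outfile         # the picked lines
echo "separate_file:"; cat separate_file   # every line that was not picked
```

With these inputs, outfile holds lines 3 and 7 and separate_file holds the other eight lines.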

Here is the whole deal:

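set  " $(shuf -i 1-100 -n 10) "
i=0
while [ "$((i+=1))" -le 100 ]
do    # exit status 0 if $i is among the picks, 1 otherwise
      [ -z "${1##*[!0-9]$i[!0-9]*}" ]
      # emit "<i>p" for a pick and "<i>H" for any other line
      printf "$i%.$((!$?))s%.$?s\n" p H
done| sed -nf - -e '$!d;x;s/.//p' infile >outfile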

There, we are basically just writing a sed script that looks like this:

1H
2H
3H
4p
5H
...
90p
91H
...

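And so on up to 100. On the last line, after all of the randomly selected lines have already been printed with p, we x (exchange) into the hold space that H has been filling, use s/// to strip the leading newline that the first H inserted, and p (print) the rest.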
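The hold-space mechanics are easier to follow on a tiny input. In this sketch a hand-written five-line script stands in for the generated one, with lines 2 and 4 treated as the picks (both the script and the file size are assumptions chosen for illustration). It again assumes a sed that reads a script from standard input with -f -.

```shell
cd "$(mktemp -d)"
seq 5 > infile

# Hand-written stand-in for the generated script: 2 and 4 are "picked",
# so they are printed at once; 1, 3 and 5 are appended to the hold space.
printf '%s\n' 1H 2p 3H 4p 5H |
sed -nf - -e '$!d;x;s/.//p' infile > outfile

cat outfile
# prints 2, 4, 1, 3, 5 (one per line): picks first, then the held lines
```

The held lines come out last because nothing from the hold space is printed until the final input line has been read.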

To do this without the shell loop, you could do:

set  "$(shuf -i 1-100 -n 10)"
# the 90 unpicked numbers come first, then the 10 picks; tag them H and p
{ seq 100 | grep -Fxv "$1"; echo "$1"; } |
sed '1,90s/$/H/;91,$s/$/p/' |
sed -nf - -e '$!d;x;s/.//p' infile >outfile

But I'm not sure that would be of any benefit at this scale.

Anyway, I used the output of seq 100 as a test file, and after running it the result printed...

...for all of the lines not included in the initial random selection, all the way up to 100.
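The sample output is not reproduced here, but a check along the same lines could be scripted as follows. This is only a sketch: the picks file and the scratch directory are hypothetical helpers added for the check, and GNU shuf and a sed accepting -f - are assumed.

```shell
cd "$(mktemp -d)"
seq 100 > infile

set -- "$(shuf -i 1-100 -n 10)"
{ seq 100 | grep -Fxv "$1"; echo "$1"; } |
sed '1,90s/$/H/;91,$s/$/p/' |
sed -nf - -e '$!d;x;s/.//p' infile > outfile

# The picks are printed as sed passes them, so they appear in ascending order.
printf '%s\n' "$1" | sort -n > picks
head -n 10 outfile | cmp -s - picks  && echo "the 10 picks come first"
sort -n outfile    | cmp -s - infile && echo "all 100 lines are present exactly once"
```

Because the picks are random, the check compares against the saved picks rather than against fixed line numbers.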
