如何为管道计时？

Question 1

它是在职的。

管道的不同部分是同时执行的。唯一同步/序列化管道中进程的是 IO，即一个进程写入管道中的下一个进程，下一个进程读取第一个进程写入的内容。除此之外，他们正在执行独立地彼此的。

由于管道中的进程之间没有发生读取或写入操作，因此执行管道所需的时间是最长sleep调用的时间。

你也可能写过

time ( foo.sh & bar.sh &; wait )

特登发布聊天中的几个稍微修改过的示例脚本：

#!/bin/sh
# This is "foo.sh"
echo 1; sleep 1
echo 2; sleep 1
echo 3; sleep 1
echo 4

和

#!/bin/sh
# This is "bar.sh"
sleep 2
while read line; do
  echo "LL $line"
done
sleep 1

问题是“为什么time ( sh foo.sh | sh bar.sh )返回 4 秒而不是 3+3 = 6 秒？”

要查看发生了什么，包括执行每个命令的大致时间，可以这样做（输出包含我的注释）：

$ time ( env PS4='$SECONDS foo: ' sh -x foo.sh | PS4='$SECONDS bar: ' sh -x bar.sh )
0 bar: sleep 2
0 foo: echo 1     ; The output is buffered
0 foo: sleep 1
1 foo: echo 2     ; The output is buffered
1 foo: sleep 1
2 bar: read line  ; "bar" wakes up and reads the two first echoes
2 bar: echo LL 1
LL 1
2 bar: read line
2 bar: echo LL 2
LL 2
2 bar: read line  ; "bar" waits for more
2 foo: echo 3     ; "foo" wakes up from its second sleep
2 bar: echo LL 3
LL 3
2 bar: read line
2 foo: sleep 1
3 foo: echo 4     ; "foo" does the last echo and exits
3 bar: echo LL 4
LL 4
3 bar: read line  ; "bar" fails to read more
3 bar: sleep 1    ; ... and goes to sleep for one second

real    0m4.14s
user    0m0.00s
sys     0m0.10s

因此，总而言之，由于对echoin的前两次调用的输出进行了缓冲，管道需要 4 秒而不是 6 秒foo.sh。

Answer

它是在职的。

管道的不同部分是同时执行的。唯一同步/序列化管道中进程的是 IO，即一个进程写入管道中的下一个进程，下一个进程读取第一个进程写入的内容。除此之外，他们正在执行独立地彼此的。

由于管道中的进程之间没有发生读取或写入操作，因此执行管道所需的时间是最长sleep调用的时间。

你也可能写过

time ( foo.sh & bar.sh &; wait )

特登发布聊天中的几个稍微修改过的示例脚本：

#!/bin/sh
# This is "foo.sh"
echo 1; sleep 1
echo 2; sleep 1
echo 3; sleep 1
echo 4

和

#!/bin/sh
# This is "bar.sh"
sleep 2
while read line; do
  echo "LL $line"
done
sleep 1

问题是“为什么time ( sh foo.sh | sh bar.sh )返回 4 秒而不是 3+3 = 6 秒？”

要查看发生了什么，包括执行每个命令的大致时间，可以这样做（输出包含我的注释）：

$ time ( env PS4='$SECONDS foo: ' sh -x foo.sh | PS4='$SECONDS bar: ' sh -x bar.sh )
0 bar: sleep 2
0 foo: echo 1     ; The output is buffered
0 foo: sleep 1
1 foo: echo 2     ; The output is buffered
1 foo: sleep 1
2 bar: read line  ; "bar" wakes up and reads the two first echoes
2 bar: echo LL 1
LL 1
2 bar: read line
2 bar: echo LL 2
LL 2
2 bar: read line  ; "bar" waits for more
2 foo: echo 3     ; "foo" wakes up from its second sleep
2 bar: echo LL 3
LL 3
2 bar: read line
2 foo: sleep 1
3 foo: echo 4     ; "foo" does the last echo and exits
3 bar: echo LL 4
LL 4
3 bar: read line  ; "bar" fails to read more
3 bar: sleep 1    ; ... and goes to sleep for one second

real    0m4.14s
user    0m0.00s
sys     0m0.10s

因此，总而言之，由于对echoin的前两次调用的输出进行了缓冲，管道需要 4 秒而不是 6 秒foo.sh。

Question 2

这是一个更好的例子吗？

$ time perl -e 'alarm(3); 1 while 1;' | perl -e 'alarm(4); 1 while 1;'
Alarm clock

real    0m4.004s
user    0m6.992s
sys     0m0.004s

脚本 busyloop 分别持续 3 秒和 4 秒，由于并行执行，总共花费了 4 秒的实时时间和 7 秒的 CPU 时间。（至少大约。）

或这个：

$ time ( sleep 2; echo) | ( read x; sleep 3 )

real    0m5.004s
user    0m0.000s
sys     0m0.000s

它们并不并行运行，因此总时间为 5 秒。全部都花在睡眠上，所以没有使用 CPU 时间。

Answer

这是一个更好的例子吗？

$ time perl -e 'alarm(3); 1 while 1;' | perl -e 'alarm(4); 1 while 1;'
Alarm clock

real    0m4.004s
user    0m6.992s
sys     0m0.004s

脚本 busyloop 分别持续 3 秒和 4 秒，由于并行执行，总共花费了 4 秒的实时时间和 7 秒的 CPU 时间。（至少大约。）

或这个：

$ time ( sleep 2; echo) | ( read x; sleep 3 )

real    0m5.004s
user    0m0.000s
sys     0m0.000s

它们并不并行运行，因此总时间为 5 秒。全部都花在睡眠上，所以没有使用 CPU 时间。

Question 3

如果有的话sysdig可以插入示踪剂在任意点，假设您可以修改代码以添加必要的写入/dev/null

echo '>::blah::' >/dev/null
foo.sh | bar.sh
echo '<::blah::' >/dev/null

（但这没有满足您的“单一操作”要求）然后通过记录事情

$ sudo sysdig -w blalog "span.tags contains blah"

然后你可能需要一个 sysdig chisel 来导出持续时间

description = "Exports sysdig span tag durations";
short_description = "Export span tag durations.";
category = "Tracers";

args = {}

function on_init()
    ftags = chisel.request_field("span.tags")
    flatency = chisel.request_field("span.duration")
    chisel.set_filter("evt.type=tracer and evt.dir=<")
    return true
end

function on_event()
    local tags = evt.field(ftags)
    local latency = evt.field(flatency)
    if latency then
        print(tostring(tags) .. "\t" .. tonumber(latency) / 1e9)
    end
    return true
end

一旦保存到您的sysdig/chisels目录中，该文件 spantagduration.lua就可以用作

$ sysdig -r blalog -c spantagduration
...

或者您可以使用csysdigJSON 输出。

Answer

如果有的话sysdig可以插入示踪剂在任意点，假设您可以修改代码以添加必要的写入/dev/null

echo '>::blah::' >/dev/null
foo.sh | bar.sh
echo '<::blah::' >/dev/null

（但这没有满足您的“单一操作”要求）然后通过记录事情

$ sudo sysdig -w blalog "span.tags contains blah"

然后你可能需要一个 sysdig chisel 来导出持续时间

description = "Exports sysdig span tag durations";
short_description = "Export span tag durations.";
category = "Tracers";

args = {}

function on_init()
    ftags = chisel.request_field("span.tags")
    flatency = chisel.request_field("span.duration")
    chisel.set_filter("evt.type=tracer and evt.dir=<")
    return true
end

function on_event()
    local tags = evt.field(ftags)
    local latency = evt.field(flatency)
    if latency then
        print(tostring(tags) .. "\t" .. tonumber(latency) / 1e9)
    end
    return true
end

一旦保存到您的sysdig/chisels目录中，该文件 spantagduration.lua就可以用作

$ sysdig -r blalog -c spantagduration
...

或者您可以使用csysdigJSON 输出。

Question 4

太有趣了，我知道这是一个旧线程，但我发现自己处于同样的情况。我发现别名是一个简单的解决方法。至少这适用于bash和fish。不确定所有贝壳。

代替：time ( foo.sh | bar.sh )

尝试：

alias foobar="foo.sh | bar.sh" time foobar

Answer

太有趣了，我知道这是一个旧线程，但我发现自己处于同样的情况。我发现别名是一个简单的解决方法。至少这适用于bash和fish。不确定所有贝壳。

代替：time ( foo.sh | bar.sh )

尝试：

alias foobar="foo.sh | bar.sh" time foobar

如何为管道计时？

答案1

答案2

答案3

答案4

相关内容