lsof：测量套接字 fd 的 I/O 速率

Question

考虑使用系统点击。它是 DTrace 克隆，但针对 Linux - 它编译动态修补内核的内核模块，并具有对其数据的完全访问权限（因此lsof在这种情况下可能不需要）。

然而，您询问的信息越多，脚本就会变得越棘手并且特定于内核版本。

例如，用于套接字的简单的类似于统计的实用程序将如下所示：

global stats;

probe begin {
    printf("%14s %6s %12s %5s %5s %8s\n", "NAME", "PID", "EXECNAME",
                "INO", "OPS/S", "BYTES");
}

function file_ino:long (file:long)
{
    if(file == 0) return -1;
    d_inode = @cast(file, "file", "kernel")->f_inode;
    if (d_inode == 0) return -1;
    return @cast(d_inode, "inode", "kernel")->i_ino;
}

probe socket.send, socket.receive {
    if(success == 0) next;

    /* Get inode number for a socket. Depending on 
       operation, struct file is contained in different fields. 
       Determine that field and get inode number */
    ino = -1;
    if(@defined($sock)) {
        ino = file_ino($sock->file);
    }
    else if(@defined($iocb)) {
        ino = file_ino($iocb->ki_filp);
    }

    stats[pid(), execname(), ino, name] <<< size;
}

probe timer.s(1) {
    /* Every 1 second print statistics */
    foreach([pid+, ename, ino, name] in stats) {
        printf("%14s %6d %12s %5d %5d %8d\n", name, pid, ename, ino, 
                    @count(stats[pid, ename, ino, name]), 
                    @sum(stats[pid, ename, ino, name]));
    }
    delete stats;
}

我在 vanilla Linux 3.12 上测试了它，但是正如你所看到的，获取 inode 编号的逻辑依赖于内部内核结构。

正如您所看到的，大多数时候，它会跟踪自己写入 SSH 会话：

       NAME    PID     EXECNAME   INO OPS/S    BYTES
socket.send   2655         sshd  7480     1       96
socket.send   2655         sshd  7480     1       96
socket.send   2655         sshd  7480     1       96
...

示例中有更复杂的脚本：https://sourceware.org/systemtap/examples/network/socktop

警告

SystemTap 是正在开发的内核级软件，因此存在内核崩溃或冻结的可能性。不过，这种情况很少见，但要小心。

参考

https://sourceware.org/systemtap/- 项目主页
https://sourceware.org/systemtap/wiki- 维基百科
https://sourceware.org/systemtap/tapsets/socket.stp.html- 演示脚本中使用的套接字 Tapset
https://sourceware.org/systemtap/langref/- SystemTap 语言参考

Answer 1