我们在接口中配置了 2x10G NIC bond
,我们随机看到:
RX errors 28751549 dropped 46541 overruns 28751549 frame 0
我们确实有多个 VLAN 在此 NIC 上运行,因此它是一个中继端口。
网卡是一个Broadcom Corporation NetXtreme II BCM57810 10 Gigabit Ethernet (rev 10)
我有以下 RX 环形缓冲区大小。不确定它来自哪里。
# ethtool -g eno49
Ring parameters for eno49:
Pre-set maximums:
RX: 4078
RX Mini: 0
RX Jumbo: 0
TX: 4078
Current hardware settings:
RX: 407
RX Mini: 0
RX Jumbo: 0
TX: 4078
一些统计数据:
# ethtool -S eno49 | grep rx
[0]: rx_bytes: 35806944147
[0]: rx_ucast_packets: 254721712
[0]: rx_mcast_packets: 109111
[0]: rx_bcast_packets: 26
[0]: rx_discards: 3599652
[0]: rx_phy_ip_err_discards: 0
[0]: rx_skb_alloc_discard: 0
[0]: rx_csum_offload_errors: 341
[1]: rx_bytes: 35586576857
[1]: rx_ucast_packets: 251529817
[1]: rx_mcast_packets: 0
[1]: rx_bcast_packets: 0
[1]: rx_discards: 3586872
[1]: rx_phy_ip_err_discards: 0
[1]: rx_skb_alloc_discard: 0
[1]: rx_csum_offload_errors: 127
[2]: rx_bytes: 35580155284
[2]: rx_ucast_packets: 251785183
[2]: rx_mcast_packets: 0
[2]: rx_bcast_packets: 0
[2]: rx_discards: 3584128
[2]: rx_phy_ip_err_discards: 0
[2]: rx_skb_alloc_discard: 0
[2]: rx_csum_offload_errors: 125
[3]: rx_bytes: 35614282304
[3]: rx_ucast_packets: 251757796
[3]: rx_mcast_packets: 0
[3]: rx_bcast_packets: 0
[3]: rx_discards: 3587512
[3]: rx_phy_ip_err_discards: 0
[3]: rx_skb_alloc_discard: 0
[3]: rx_csum_offload_errors: 161
[4]: rx_bytes: 35547320461
[4]: rx_ucast_packets: 251679615
[4]: rx_mcast_packets: 0
[4]: rx_bcast_packets: 0
[4]: rx_discards: 3596817
[4]: rx_phy_ip_err_discards: 0
[4]: rx_skb_alloc_discard: 0
[4]: rx_csum_offload_errors: 1171
[5]: rx_bytes: 36582099166
[5]: rx_ucast_packets: 253331335
[5]: rx_mcast_packets: 0
[5]: rx_bcast_packets: 0
[5]: rx_discards: 3590358
[5]: rx_phy_ip_err_discards: 0
[5]: rx_skb_alloc_discard: 0
[5]: rx_csum_offload_errors: 982
[6]: rx_bytes: 35710085729
[6]: rx_ucast_packets: 252379427
[6]: rx_mcast_packets: 0
[6]: rx_bcast_packets: 0
[6]: rx_discards: 3589068
[6]: rx_phy_ip_err_discards: 0
[6]: rx_skb_alloc_discard: 0
[6]: rx_csum_offload_errors: 745
[7]: rx_bytes: 166064885960
[7]: rx_ucast_packets: 418819356
[7]: rx_mcast_packets: 0
[7]: rx_bcast_packets: 0
[7]: rx_discards: 3617142
[7]: rx_phy_ip_err_discards: 0
[7]: rx_skb_alloc_discard: 0
[7]: rx_csum_offload_errors: 161
rx_bytes: 416492349908
rx_error_bytes: 0
rx_ucast_packets: 2186004241
rx_mcast_packets: 109111
rx_bcast_packets: 26
rx_crc_errors: 0
rx_align_errors: 0
rx_undersize_packets: 0
rx_oversize_packets: 0
rx_fragments: 0
rx_jabbers: 0
rx_discards: 28751549
rx_filtered_packets: 7716855
rx_mf_tag_discard: 0
rx_brb_discard: 0
rx_brb_truncate: 0
rx_pause_frames: 0
rx_mac_ctrl_frames: 0
rx_constant_pause_events: 0
rx_phy_ip_err_discards: 0
rx_skb_alloc_discard: 0
rx_csum_offload_errors: 3813
我如何追踪这个问题?它发生在多台服务器上,而不是在一台机器上。我们也没有那么多流量来填充链接。
答案1
尝试将“当前硬件设置”更改为最大允许值。
ethtool -G eno49 rx 4078