BUG: unable to handle kernel paging request at 0000000081b66158 (on 14.04, kernel 4.4.0-31)


I got the following output from the "log" command at the "crash" utility command line:

[9176.888659] perf interrupt took too long (5011 > 5000), lowering kernel.perf_event_max_sample_rate to 25000
[92733.329430] BUG: unable to handle kernel paging request at 0000000081b66158
[92733.329496] IP: [<0000000081b66158>] 0x81b66158
[92733.329533] PGD 1057674067 PUD 0
[92733.329561] Oops: 0010 [#1] SMP
[92733.329589] Modules linked in: ipt_REJECT nf_reject_ipv4 nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack iptable_filter xt_tcpudp xt_CT nf_conntrack iptable_raw ip_tables x_tables intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul aesni_intel aes_x86_64 lrw gf128mul glue_helper ipmi_devintf ipmi_ssif bonding ablk_helper cryptd dm_multipath sb_edac edac_core 8250_fintek shpchp lpc_ich mac_hid acpi_power_meter ipmi_si ipmi_msghandler iTCO_wdt iTCO_vendor_support raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid0 multipath linear raid1 ast ttm drm_kms_helper mxm_wmi syscopyarea ixgbe sysfillrect igb vxlan sysimgblt ip6_udp_tunnel fb_sys_fops dca udp_tunnel ptp drm ahci pps_core i2c_algo_bit
[92733.330174]  libahci mdio fjes wmi
[92733.330198] CPU: 24 PID: 24797 Comm: java Not tainted 4.4.0-31-generic #50~14.04.1-Ubuntu
[92733.330250] Hardware name: Quanta Computer Inc. P-100.Scaler.01/S2PC-MB(Dual 1G LOM), BIOS S2P_3B04.HGT04 04/12/2017
[92733.330316] task: ffff88104aa7ee00 ti: ffff88104c940000 task.ti: ffff88104c940000
[92733.330364] RIP: 0010:[<0000000081b66158>]  [<0000000081b66158>] 0x81b66158
[92733.330411] RSP: 0018:ffff88104c943d50  EFLAGS: 00010246
[92733.330445] RAX: 0000000000000000 RBX: ffff88104c943dc0 RCX: 0000000000000013
[92733.330490] RDX: 0000000000000001 RSI: 0000000000000000 RDI: 000000007a3e56da
[92733.330536] RBP: ffffffff8137c2b8 R08: 0000000000000003 R09: ffffc90007b4d6e0
[92733.330581] R10: 000000002b300067 R11: 0000000000000067 R12: ffffffff8173f1cb
[92733.330626] R13: 000000000000fe88 R14: ffff88104c943e30 R15: 0000000000000025
[92733.330671] FS:  00007f93e36f6700(0000) GS:ffff88105f400000(0000) knlGS:0000000000000000
[92733.330723] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[92733.330760] CR2: 0000000081b66158 CR3: 000000104f5d4000 CR4: 00000000003406e0
[92733.330806] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[92733.330851] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[92733.330895] Stack:
[92733.330911]  ffff880707f74600 ffff88104aa7ee00 ff01ffff00000000 0000000000000025
[92733.330965]  ffff880b00000000 000005a800000000 0000000000000025 ffffffff0000fe88
[92733.331019]  7fffffffffffffff ffff880b7dd618c0 ffff880707f74600 0000000000000000
[92733.331073] Call Trace:
[92733.331097]  [<ffffffff81769937>] ? inet_sendmsg+0x67/0xa0
[92733.331137]  [<ffffffff816d6d58>] ? sock_sendmsg+0x38/0x50
[92733.331174]  [<ffffffff816d71f2>] ? SYSC_sendto+0x102/0x190
[92733.331213]  [<ffffffff8137ccb4>] ? aa_sock_opt_perm+0x54/0x110
[92733.331254]  [<ffffffff8173d00c>] ? tcp_getsockopt+0x2c/0x30
[92733.331293]  [<ffffffff816d7d3e>] ? SyS_sendto+0xe/0x10
[92733.331330]  [<ffffffff817f6f36>] ? entry_SYSCALL_64_fastpath+0x16/0x75
[92733.331372] Code:  Bad RIP value.
[92733.331397] RIP  [<0000000081b66158>] 0x81b66158
[92733.331429]  RSP <ffff88104c943d50>
[92733.331453] CR2: 0000000081b66158
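
To make the "Oops: 0010" line easier to read, here is a minimal sketch that decodes the x86 page-fault error code bit by bit. The bit meanings follow the x86 architecture definition of the page-fault error code; the table layout and the function name `decode_oops_code` are my own and purely illustrative.

```python
# Bits of the x86 page-fault error code, lowest bit first.
# Each entry: (mask, meaning if the bit is set, meaning if it is clear).
# A clear-meaning of None means the bit is only reported when set.
PF_BITS = [
    (0x01, "protection violation", "page not present"),
    (0x02, "write access", "read access"),
    (0x04, "user mode", "kernel mode"),
    (0x08, "reserved bit set in a page-table entry", None),
    (0x10, "instruction fetch", None),
]


def decode_oops_code(code_hex):
    """Return a list of human-readable facts for an Oops error code string."""
    code = int(code_hex, 16)
    facts = []
    for mask, if_set, if_clear in PF_BITS:
        if code & mask:
            facts.append(if_set)
        elif if_clear is not None:
            facts.append(if_clear)
    return facts


if __name__ == "__main__":
    # "0010" is the value from the "Oops: 0010 [#1] SMP" line above.
    for fact in decode_oops_code("0010"):
        print(fact)
```

For the value 0010 this reports a not-present page, a read access, kernel mode, and an instruction fetch. That seems consistent with the rest of the log, where RIP and CR2 hold the same address and the "Code:" line says "Bad RIP value".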

Section 4.6 of the kernel documentation at "kernel.org/doc/gorman/html/understand/understand007.html" says the following:

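Fault occurred in the userspace region while in kernel mode - Error - If a fault occurs, it means a kernel system did not copy from userspace properly and caused a page fault. This is a kernel bug which is treated quite severely.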
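
The reason I think this rule applies: the faulting address 0000000081b66158 lies in the lower (user-space) half of the x86-64 address space, while every return address in the call trace lies in the kernel half. Below is a minimal sketch of that check, assuming the usual 48-bit (4-level paging) split; the boundary constants and the `classify` helper are my own illustration, not anything taken from the kernel source.

```python
# Assumed canonical-address boundaries for x86-64 with 4-level paging.
USER_MAX = 0x00007FFFFFFFFFFF    # highest address in the lower (user) half
KERNEL_MIN = 0xFFFF800000000000  # lowest address in the upper (kernel) half


def classify(addr):
    """Say which half of the 64-bit address space an address falls in."""
    if addr <= USER_MAX:
        return "user half"
    if addr >= KERNEL_MIN:
        return "kernel half"
    return "non-canonical"


if __name__ == "__main__":
    fault = 0x0000000081B66158         # CR2 / RIP from the log
    trace_entry = 0xFFFFFFFF81769937   # inet_sendmsg+0x67 from the call trace

    print(hex(fault), "->", classify(fault))
    print(hex(trace_entry), "->", classify(trace_entry))

    # Observation only, not a diagnosis: with the upper 32 bits set,
    # the faulting address has the same prefix as the call-trace entries.
    print(hex(fault | 0xFFFFFFFF00000000))
```

The last line prints 0xffffffff81b66158. I do not know whether that resemblance means anything (for example a truncated pointer); I mention it only because it might help someone recognise the pattern.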

So I am raising this question to find out whether I have missed a suggested fix, or whether this issue is already being worked on.

Hardware:

  1. CPU details (the last processor listed by cat /proc/cpuinfo).

    processor   : 31
    vendor_id   : GenuineIntel
    cpu family  : 6
    model       : 79
    model name  : Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
    stepping    : 1
    microcode   : 0xb000021
    
  2. Memory

    Memory Device
    Array Handle: 0x0037
    Error Information Handle: Not Provided
    Total Width: 72 bits
    Data Width: 64 bits
    Size: 16384 MB
    Form Factor: DIMM
    Set: None
    Locator: DIMM_A0
    Bank Locator: _Node0_Channel0_Dimm0
    Type: <OUT OF SPEC>
    Type Detail: Synchronous
    Speed: 2133 MHz
    Manufacturer: Samsung
    Serial Number: 4156465A
    Asset Tag: DIMM_A0_AssetTag
    Part Number: M393A2K40BB0-CPB
    Rank: 1
    Configured Clock Speed: 2133 MHz
    Minimum voltage:  1.200 V
    Maximum voltage:  1.200 V
    Configured voltage:  1.200 V
    
