DB2 crash on a production Redhat server

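We have a Redhat virtual machine (VMware vSphere) that hosts the database for our application. The application ran fine until at some point it crashed, and we saw the following error in db2diag: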

2015-03-05-14.35.13.380939+000 E1406160203E1391      LEVEL: Error (OS)
PID     : 19969                TID : 47269155956480  PROC : db2sysc 0
INSTANCE: db2inst1             NODE : 000            DB   : SIMTVAIS
APPHDL  : 0-15775              APPID: ::ffff:10.86.22.164.51178.150305082202
AUTHID  : ESERVIS              HOSTNAME: DbSimpl
EDUID   : 120                  EDUNAME: db2agent (SIMTVAIS) 0
FUNCTION: DB2 UDB, oper system services, sqloseekwrite64, probe:40
MESSAGE : ZRC=0x840F0001=-2079391743=SQLO_ACCD "Access Denied"
DIA8701C Access denied for resource "", operating system return code was "".
CALLED  : OS, -, pwrite                           OSERR: EROFS (30)
DATA #1 : File handle, PD_TYPE_SQO_FILE_HDL, 8 bytes
  File Handle              = 751
  File System Block Size   = 4096 bytes
  File System Type         = ext4
  File Handle Flags :
  Require Sector Align   = No
  DIO/CIO Mode           = No
  Raw Block Device       = No
  Reserved Handle        = No
  Flush On Close         = No
  Thread-Level Lock      = No
  Write-through Mode     = Yes
  File Not Tracked       = Yes
DATA #2 : unsigned integer, 8 bytes
4096
DATA #3 : signed integer, 8 bytes
1172877312
DATA #4 : signed integer, 8 bytes
-1
DATA #5 : String, 105 bytes
Search for ossError*Analysis probe point after this log entry for
further self-diagnosis of this problem.

When we check the operating system's syslog, we see:

Mar  5 14:34:53 DbSimpl kernel: sd 2:0:1:0: timing out command, waited 180s
Mar  5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] Unhandled error code
Mar  5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] Result: hostbyte=DID_OK driverbyte=DRIVER_OK
Mar  5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] CDB: Write(10): 2a00
Mar  5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] Unhandled error code
Mar  5 14:34:53 DbSimpl kernel: 08 82 44 b9 00 00 08 00
Mar  5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] Result: hostbyte=DID_OK driverbyte=DRIVER_OK
Mar  5 14:34:53 DbSimpl kernel: Buffer I/O error on device dm-4, logical block 17844119
Mar  5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] CDB: Write(10): 2a 00 09 1c 4b 41 00 00 08 00
Mar  5 14:34:53 DbSimpl kernel: lost page write due to I/O error on dm-4

Can anyone help me find the root cause of the problem?

Answer 1

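The I/O subsystem backing /dev/sdb did not respond in time; your DB2 guest server tried to write something to that disk, but gave up after waiting three minutes (the "timing out command, waited 180s" line in syslog). The EROFS (30) that DB2 reports about 20 seconds later is the "read-only file system" errno, which is consistent with the ext4 filesystem having been switched to read-only after the failed write, although the syslog excerpt does not show that step. Investigate the problem on the storage side.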
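To make that check repeatable, here is a minimal Python sketch, not a definitive tool. It assumes the syslog text and the contents of /proc/mounts have already been read into strings; the function names `scsi_timeouts` and `read_only_mounts`, and the sample mount line with its device and mount point, are hypothetical and only for illustration.

```python
import re

# Matches kernel lines such as:
#   sd 2:0:1:0: timing out command, waited 180s
TIMEOUT_RE = re.compile(r"sd (\S+): timing out command, waited (\d+)s")


def scsi_timeouts(syslog_text):
    """Return (scsi_address, seconds_waited) for each SCSI timeout line."""
    return [(m.group(1), int(m.group(2)))
            for m in TIMEOUT_RE.finditer(syslog_text)]


def read_only_mounts(mounts_text, fs_types=("ext4",)):
    """Return (device, mount_point) for mounts whose options include 'ro'.

    mounts_text is expected to be in /proc/mounts format:
    device mount_point fs_type options dump pass
    """
    found = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        device, mount_point, fs_type, options = fields[:4]
        if fs_type in fs_types and "ro" in options.split(","):
            found.append((device, mount_point))
    return found


# The syslog line is taken from the question; the mounts text is a
# hypothetical sample, not output from the affected server.
sample_syslog = (
    "Mar  5 14:34:53 DbSimpl kernel: sd 2:0:1:0: "
    "timing out command, waited 180s\n"
)
sample_mounts = (
    "/dev/mapper/vg-db2data /db2data ext4 ro,relatime 0 0\n"
    "/dev/sda1 / ext4 rw,relatime 0 0\n"
)

print(scsi_timeouts(sample_syslog))
print(read_only_mounts(sample_mounts))
```

On the real server you would pass in the text of /proc/mounts and of the syslog file instead of the sample strings. A timeout on the SCSI address of sdb together with a read-only ext4 mount would support the explanation above, but the underlying cause still has to be found in the storage layer beneath the virtual machine.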
