我们有一个 Redhat 虚拟机 (vmware vsphere),用于托管我们应用程序的数据库。该应用程序运行正常,直到某个时候崩溃,并在 db2diag 上看到以下错误:
2015-03-05-14.35.13.380939+000 E1406160203E1391 LEVEL: Error (OS)
PID : 19969 TID : 47269155956480 PROC : db2sysc 0
INSTANCE: db2inst1 NODE : 000 DB : SIMTVAIS
APPHDL : 0-15775 APPID: ::ffff:10.86.22.164.51178.150305082202
AUTHID : ESERVIS HOSTNAME: DbSimpl
EDUID : 120 EDUNAME: db2agent (SIMTVAIS) 0
FUNCTION: DB2 UDB, oper system services, sqloseekwrite64, probe:40
MESSAGE : ZRC=0x840F0001=-2079391743=SQLO_ACCD "Access Denied"
DIA8701C Access denied for resource "", operating system return code was "".
CALLED : OS, -, pwrite OSERR: EROFS (30)
DATA #1 : File handle, PD_TYPE_SQO_FILE_HDL, 8 bytes
File Handle = 751
File System Block Size = 4096 bytes
File System Type = ext4
File Handle Flags :
Require Sector Align = No
DIO/CIO Mode = No
Raw Block Device = No
Reserved Handle = No
Flush On Close = No
Thread-Level Lock = No
Write-through Mode = Yes
File Not Tracked = Yes
DATA #2 : unsigned integer, 8 bytes
4096
DATA #3 : signed integer, 8 bytes
1172877312
DATA #4 : signed integer, 8 bytes
-1
DATA #5 : String, 105 bytes
Search for ossError*Analysis probe point after this log entry for
further self-diagnosis of this problem.
当我们检查操作系统系统日志时,我们会看到:
Mar 5 14:34:53 DbSimpl kernel: sd 2:0:1:0: timing out command, waited 180s
Mar 5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] Unhandled error code
Mar 5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] Result: hostbyte=DID_OK driverbyte=DRIVER_OK
Mar 5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] CDB: Write(10): 2a00
Mar 5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] Unhandled error code
Mar 5 14:34:53 DbSimpl kernel: 08 82 44 b9 00 00 08 00
Mar 5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] Result: hostbyte=DID_OK driverbyte=DRIVER_OK
Mar 5 14:34:53 DbSimpl kernel: Buffer I/O error on device dm-4, logical block 17844119
Mar 5 14:34:53 DbSimpl kernel: sd 2:0:1:0: [sdb] CDB: Write(10): 2a 00 09 1c 4b 41 00 00 08 00
Mar 5 14:34:53 DbSimpl kernel: lost page write due to I/O error on dm-4
谁能帮助我找出问题的根本原因?
答案1
为您提供服务的 I/O 子系统/dev/sdb
未及时响应;您的 DB2 客户服务器尝试将某些内容写入该磁盘,但等待三分钟后放弃。调查这方面的问题。