Hi all,

I have run into a strange problem over the last couple of days.

Hardware: Sun Fire V250 with 4 SCSI disks (73 GB each). The root file system is mirrored across two of them. OS: Solaris 9 with Solaris Volume Manager.

The messages log shows read/write errors on disk 0 (excerpt only):

Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:04:01 sun2 Error for Command: write(10) Error Level: Retryable
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Requested Block: 139704496 Error Block: 139704496
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Vendor: SEAGATE Serial Number: 0402B6RQM8
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Sense Key: Unit Attention
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] ASC: 0x29 (<vendor unique code 0x29>), ASCQ: 0x3, FRU: 0x4
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:04:01 sun2 Error for Command: write(10) Error Level: Informational
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Requested Block: 142820212 Error Block: 142820212
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Vendor: SEAGATE Serial Number: 0402B6RQM8
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Sense Key: Soft Error
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] ASC: 0x5d (drive operation marginal, service immediately (failure prediction threshold exceeded)), ASCQ: 0x0, FRU: 0x5
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:06:41 sun2 Error for Command: write(10) Error Level: Retryable
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] Requested Block: 142820212 Error Block: 142820212
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] Vendor: SEAGATE Serial Number: 0402B6RQM8
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] Sense Key: Hardware Error
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] ASC: 0x32 (no defect spare location available), ASCQ: 0x0, FRU: 0x4
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:06:42 sun2 Error for Command: write(10) Error Level: Retryable
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.notice] Requested Block: 142820212 Error Block: 142820212
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.notice] Vendor: SEAGATE Serial Number: 0402B6RQM8
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.notice] Sense Key: Hardware Error
Jan 1 11:06:42 sun2 scsi: [ID 107833 kern.notice] ASC: 0x32 (no defect spare location available), ASCQ: 0x0, FRU: 0x4

iostat -En shows the same problem. However, metastat reports every slice as OK, and the users do not recall hitting any read/write errors. The Oracle database is running normally (some of its data files live on the root file system).

I tried booting from the CD-ROM and running fsck on the disk's slices. It found bad data blocks (only one or two) plus minor issues such as incorrect reference counts, and I answered "Y" to fix them. After that, the repaired disk could not be used with metareplace, which reported that some blocks could not be read. So I took the original disk 0 (the "bad" disk I had intended to swap out) and mirrored it again with a new disk. To my surprise there were no problems at all, and the whole system recovered completely.

Now I am confused: is disk 0 really faulty or not?

Thanks in advance!
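For anyone who would rather tally these messages than read them by eye, here is a rough Python sketch. It assumes the log format is exactly as in the excerpt above (a WARNING line naming the device, followed by Error Block, Sense Key and ASC lines). The function name summarize and the regular expressions are my own illustration and not part of any Solaris tool, and the sample is only a subset of the posted lines.

```python
import re
from collections import Counter

# A few lines copied from the excerpt above (not the full log).
SAMPLE = """\
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:04:01 sun2 Error for Command: write(10) Error Level: Retryable
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Requested Block: 139704496 Error Block: 139704496
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] Sense Key: Unit Attention
Jan 1 11:04:01 sun2 scsi: [ID 107833 kern.notice] ASC: 0x29 (<vendor unique code 0x29>), ASCQ: 0x3, FRU: 0x4
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.warning] WARNING: /pci@1d,700000/scsi@4/sd@0,0 (sd0):
Jan 1 11:06:41 sun2 Error for Command: write(10) Error Level: Retryable
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] Requested Block: 142820212 Error Block: 142820212
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] Sense Key: Hardware Error
Jan 1 11:06:41 sun2 scsi: [ID 107833 kern.notice] ASC: 0x32 (no defect spare location available), ASCQ: 0x0, FRU: 0x4
"""

# Patterns are assumptions based on the posted excerpt only.
DEVICE_RE = re.compile(r"WARNING: \S+ \((\w+)\):")
BLOCK_RE = re.compile(r"Error Block: (\d+)")
SENSE_RE = re.compile(r"Sense Key: (.+?)\s*$")
ASC_RE = re.compile(r"ASC: (0x[0-9a-fA-F]+)")


def summarize(lines):
    """Count (device, sense key, ASC) triples and collect error blocks per device."""
    counts = Counter()
    blocks = {}
    device = sense = None
    for line in lines:
        m = DEVICE_RE.search(line)
        if m:
            # A WARNING line starts a new error record for this device.
            device, sense = m.group(1), None
            continue
        m = BLOCK_RE.search(line)
        if m and device:
            blocks.setdefault(device, set()).add(int(m.group(1)))
            continue
        m = SENSE_RE.search(line)
        if m:
            sense = m.group(1)
            continue
        m = ASC_RE.search(line)
        if m and device and sense:
            counts[(device, sense, m.group(1))] += 1
    return counts, blocks


if __name__ == "__main__":
    counts, blocks = summarize(SAMPLE.splitlines())
    for (device, sense, asc), n in sorted(counts.items()):
        print(f"{device}: {sense} (ASC {asc}) x{n}")
    for device, blks in sorted(blocks.items()):
        print(f"{device}: error blocks {sorted(blks)}")
```

Run against the full messages file, the same function would show whether the errors cluster on one or two block numbers or are spread across the disk, which may help when comparing against what metastat and iostat -En report.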