- 论坛徽章:
- 0
|
本帖最后由 kingdisc666 于 2010-07-08 23:28 编辑
故障描述:SUN F4800,突然发现无法PING通主机了,通过控制器进入控制台查看,所有的目录的二级目录均为空
例如/etc无下级目录了,判断系统HUNG住了。于是重启主机,板卡自检后无法进入系统,具体提示当时没保存,大概
是说磁盘存在坏块之类的。
于是单用户fsck -y /dev/rdsk/c0t0d0s2 进行修复,重新引导可以进系统了,并且双机都上线了
次日系统日志系统如下,至今再无新日志产生,双机还是正常运行
Jul 3 02:55:31 dbserver1 scsi: [ID 107833 kern.warning] WARNING: /ssm@0,0/pci@18,700000/pci@1/SUNW,isptwo@4 (isp0):
Jul 3 02:55:31 dbserver1 Target 0 reducing transfer rate
Jul 3 02:55:31 dbserver1 scsi: [ID 107833 kern.warning] WARNING: /ssm@0,0/pci@18,700000/pci@1/SUNW,isptwo@4 (isp0):
Jul 3 02:55:31 dbserver1 Parity Error
Jul 3 02:55:31 dbserver1 scsi: [ID 107833 kern.warning] WARNING: /ssm@0,0/pci@18,700000/pci@1/SUNW,isptwo@4/sd@0,0 (sd0):
Jul 3 02:55:31 dbserver1 Error for Command: read(10) Error Level: Retryable
Jul 3 02:55:31 dbserver1 scsi: [ID 107833 kern.notice] Requested Block: 8967696 Error Block: 8967696
Jul 3 02:55:31 dbserver1 scsi: [ID 107833 kern.notice] Vendor: FUJITSU Serial Number: 0229X80455
Jul 3 02:55:31 dbserver1 scsi: [ID 107833 kern.notice] Sense Key: Aborted Command
Jul 3 02:55:31 dbserver1 scsi: [ID 107833 kern.notice] ASC: 0x48 (initiator detected error message received), ASCQ: 0x0, FR
U: 0x0
Jul 3 10:06:08 dbserver1 scsi: [ID 107833 kern.warning] WARNING: /ssm@0,0/pci@18,700000/pci@1/SUNW,isptwo@4 (isp0):
Jul 3 10:06:08 dbserver1 Target 0 reducing transfer rate
Jul 3 10:06:08 dbserver1 scsi: [ID 107833 kern.warning] WARNING: /ssm@0,0/pci@18,700000/pci@1/SUNW,isptwo@4 (isp0):
Jul 3 10:06:08 dbserver1 Parity Error
Jul 3 10:06:08 dbserver1 scsi: [ID 107833 kern.warning] WARNING: /ssm@0,0/pci@18,700000/pci@1/SUNW,isptwo@4 (isp0):
Jul 3 10:06:08 dbserver1 Target 0 disabled wide SCSI mode
Jul 3 10:06:08 dbserver1 scsi: [ID 107833 kern.warning] WARNING: /ssm@0,0/pci@18,700000/pci@1/SUNW,isptwo@4/sd@0,0 (sd0):
Jul 3 10:06:08 dbserver1 Error for Command: read Error Level: Retryable
Jul 3 10:06:08 dbserver1 scsi: [ID 107833 kern.notice] Requested Block: 740544 Error Block: 740544
Jul 3 10:06:08 dbserver1 scsi: [ID 107833 kern.notice] Vendor: FUJITSU Serial Number: 0229X80455
Jul 3 10:06:08 dbserver1 scsi: [ID 107833 kern.notice] Sense Key: Aborted Command
Jul 3 10:06:08 dbserver1 scsi: [ID 107833 kern.notice] ASC: 0x48 (initiator detected error message received), ASCQ: 0x0, FR
U: 0x0
我想问的是(1)偶的磁盘是否还有修复的可能,如何修复,还是需要要更换硬盘。
(2)如果更换硬盘,考虑到时双机环境,没有条件重做双机,讲故障盘DD一份到新盘可以正常使用吗,我担心故障盘的数据已经不完整,是否存在此问题
如果有修复的可能还是希望兄弟姐妹提供下安全的尝试方式,因为是生产环境 |
|