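I have a DELL 6850 server with a PV220S disk array attached directly over a SCSI cable (direct-attached storage).
The DELL 6850 runs RHEL3.
This morning I found that nobody could log in to the operating system, and its service ports could not be reached either. At the login prompt, a wrong password produced the usual password error, but a correct password never reached a shell: the session just hung there, and the server had frozen.
After force-rebooting the server I checked the messages log and found the following errors: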
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638029 cmd=2a <c=0 t=0 l=0>
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638386 cmd=2a <c=0 t=0 l=0>
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638196 cmd=2a <c=0 t=0 l=0>
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638247 cmd=2a <c=0 t=0 l=0>
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638184 cmd=2a <c=0 t=0 l=0>
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638209 cmd=2a <c=0 t=0 l=0>
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638218 cmd=2a <c=0 t=0 l=0>
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638039 cmd=2a <c=0 t=0 l=0>
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638293 cmd=2a <c=0 t=0 l=0>
Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638106 cmd=2a <c=0 t=0 l=0>
......
Mar 14 09:13:18 crbt-web2 kernel: megaraid: aborted cmd 8437413[2] complete.
Mar 14 09:13:19 crbt-web2 kernel: megaraid: pending 7; remaining 155 seconds
Mar 14 09:13:19 crbt-web2 kernel: megaraid: aborted cmd 843741b[43] complete.
Mar 14 09:13:21 crbt-web2 kernel: megaraid: aborted cmd 843742b[14] complete.
Mar 14 09:13:23 crbt-web2 kernel: megaraid: aborted cmd 8437432[1d] complete.
Mar 14 09:13:24 crbt-web2 kernel: megaraid: pending 4; remaining 150 seconds
Mar 14 09:13:24 crbt-web2 kernel: megaraid: aborted cmd 8437439[5a] complete.
......
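To get a quick picture of what the controller was aborting, the `aborting-` lines above can be tallied by command and target. This is only a rough sketch: it assumes the `cmd=` field is the SCSI opcode in hex (which would make `2a` a WRITE(10)) and that `<c= t= l=>` means channel, target and LUN. The function name `summarize_aborts` and the small opcode table are made up for illustration.

```python
import re
from collections import Counter

# A few common SCSI opcodes; deliberately not exhaustive.
SCSI_OPCODES = {
    0x00: "TEST UNIT READY",
    0x08: "READ(6)",
    0x0A: "WRITE(6)",
    0x28: "READ(10)",
    0x2A: "WRITE(10)",
}

# Matches lines such as:
#   megaraid: aborting-138638029 cmd=2a <c=0 t=0 l=0>
ABORT_RE = re.compile(
    r"megaraid: aborting-(\S+) cmd=([0-9a-fA-F]+) <c=(\d+) t=(\d+) l=(\d+)>"
)

def summarize_aborts(lines):
    """Count aborted commands per (opcode name, channel, target, lun)."""
    counts = Counter()
    for line in lines:
        m = ABORT_RE.search(line)
        if not m:
            continue
        # Assumption: cmd= is the SCSI opcode printed in hex.
        opcode = int(m.group(2), 16)
        name = SCSI_OPCODES.get(opcode, hex(opcode))
        key = (name, int(m.group(3)), int(m.group(4)), int(m.group(5)))
        counts[key] += 1
    return counts

sample = [
    "Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638029 cmd=2a <c=0 t=0 l=0>",
    "Mar 14 09:12:55 crbt-web2 kernel: megaraid: aborting-138638386 cmd=2a <c=0 t=0 l=0>",
]
for key, n in summarize_aborts(sample).items():
    print(key, n)
```

If that reading of the fields is right, every abort shown here is a write to channel 0, target 0, LUN 0, the same address as the device that goes offline later in the log.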
Mar 14 09:13:35 crbt-web2 kernel: scsi: device set offline - not ready or command retry failed after bus reset: host 4 channel 0 id 0 lun 0
Mar 14 09:13:35 crbt-web2 last message repeated 62 times
Mar 14 09:13:35 crbt-web2 kernel: SCSI disk error : host 4 channel 0 id 0 lun 0 return code = 70018
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804193616
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804193984
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804193992
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194000
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194016
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194024
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194032
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194040
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194048
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194064
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194072
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194080
Mar 14 09:13:35 crbt-web2 kernel: I/O error: dev 08:21, sector 804194088
......
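The `return code = 70018` in the SCSI disk error line can be split into its fields, which may help narrow things down. A minimal sketch, assuming the value is printed in hex and uses the usual Linux SCSI result layout (status byte in bits 0-7, message byte in bits 8-15, host byte in bits 16-23, driver byte in bits 24-31). The helper name `decode_scsi_result` is hypothetical and the lookup tables are partial.

```python
# Host byte values as defined by the Linux SCSI layer (partial list).
HOST_BYTES = {
    0x00: "DID_OK",
    0x01: "DID_NO_CONNECT",
    0x02: "DID_BUS_BUSY",
    0x03: "DID_TIME_OUT",
    0x04: "DID_BAD_TARGET",
    0x05: "DID_ABORT",
    0x06: "DID_PARITY",
    0x07: "DID_ERROR",
    0x08: "DID_RESET",
    0x09: "DID_BAD_INTR",
}

# SCSI status codes (partial list).
STATUS_BYTES = {
    0x00: "GOOD",
    0x02: "CHECK CONDITION",
    0x08: "BUSY",
    0x18: "RESERVATION CONFLICT",
    0x28: "TASK SET FULL",
}

def decode_scsi_result(code_hex):
    """Split a hex SCSI result code into its four byte fields."""
    result = int(code_hex, 16)
    status = result & 0xFF
    msg = (result >> 8) & 0xFF
    host = (result >> 16) & 0xFF
    driver = (result >> 24) & 0xFF
    return {
        "status": STATUS_BYTES.get(status, hex(status)),
        "msg": msg,
        "host": HOST_BYTES.get(host, hex(host)),
        "driver": driver,
    }

print(decode_scsi_result("70018"))
```

Under those assumptions, `70018` decodes to host byte `DID_ERROR` with status `RESERVATION CONFLICT`, which matches the `scsi4 (0,0,0) : RESERVATION CONFLICT` line in the dmesg output and might point at something other than media damage.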
A large number of I/O errors were reported. From the dmesg output, host 4 channel 0 id 0 lun 0 is sdc:
Vendor: DELL Model: PERC 4/DC Rev: 351X
Type: Processor ANSI SCSI revision: 02
blk: queue f2ff5218, I/O limit 4294967295Mb (mask 0xffffffffffffffff)
Vendor: DELL Model: PV22XS Rev: E.18
Type: Processor ANSI SCSI revision: 03
blk: queue f3874018, I/O limit 4294967295Mb (mask 0xffffffffffffffff)
Attached scsi disk sdc at scsi4, channel 0, id 0, lun 0
Attached scsi disk sdd at scsi4, channel 0, id 1, lun 0
Attached scsi generic sg5 at scsi4, channel 5, id 6, lun 0, type 3
Attached scsi generic sg6 at scsi4, channel 5, id 15, lun 0, type 3
scsi4 (0,0,0) : RESERVATION CONFLICT
SCSI device sdc: 1146060800 512-byte hdwr sectors (586783 MB)
sdc: sdc1
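sdc is the volume presented by the PV220S. The PV220S holds two RAID arrays, which the operating system sees as sdc and sdd.
sdc is now reporting a large number of I/O errors, so I suspect it has bad sectors.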
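As a cross-check that the `I/O error: dev 08:21` lines really belong to sdc, the device number can be decoded by hand. The sketch below assumes the log prints major:minor in hex and relies on the standard Linux numbering for SCSI disks (major 8, 16 minors per disk). `decode_sd_dev` is a made-up helper name.

```python
import string

def decode_sd_dev(dev):
    """Map a hex 'major:minor' pair with major 8 to an sd device name."""
    major_s, minor_s = dev.split(":")
    major = int(major_s, 16)
    minor = int(minor_s, 16)
    if major != 8:
        raise ValueError("only SCSI disk major 8 is handled in this sketch")
    # Each disk owns 16 minors: 0 is the whole disk, 1-15 are partitions.
    disk_index, partition = divmod(minor, 16)
    name = "sd" + string.ascii_lowercase[disk_index]
    return name + (str(partition) if partition else "")

print(decode_sd_dev("08:21"))  # minor 0x21 = 33 -> third disk, partition 1
```

With minor 0x21 = 33 this gives sdc1, which agrees with the `sdc: sdc1` line in dmesg.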
How should I go about handling this problem?