Last edited by lp8653 on 2012-04-20 14:07
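Hi everyone. Yesterday I ran into a problem: an HP RX 1640 host (the EMS log below reports the model as rx1620) suddenly stopped responding, although it still answered ping.
I fetched a console and connected it directly to the serial console port, but there was still no output.
I powered the machine off with the power button, disconnected the power, and restarted it. The system booted normally, but during boot dmesg reported a SCSI bus reset.
syslog.log recorded the following: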
Apr 18 14:27:13 HNDNS1 vmunix: SCSI Ultra320 0/1/1/0 instance 2: Driver initiating SCSI bus reset. Condition cleared, no intervention required.
Apr 18 14:27:21 HNDNS1 EMS [3559]: ------ EMS Event Notification ------ Value: "MAJORWARNING (3)" for Resource: "/storage/events/disks/default/0_1_1_0.0.0" (Threshold: >= " 3") Execute the following command to obtain event details: /opt/resmon/bin/resdata -R 233242626 -r /storage/events/disks/default/0_1_1_0.0.0 -n 233242625 -a
Apr 18 14:27:21 HNDNS1 EMS [3559]: ------ EMS Event Notification ------ Value: "CRITICAL (5)" for Resource: "/storage/events/disks/default/0_1_1_0.0.0" (Threshold: >= " 3") Execute the following command to obtain event details: /opt/resmon/bin/resdata -R 233242626 -r /storage/events/disks/default/0_1_1_0.0.0 -n 233242626 -a
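To pull the severity, the resource, and the suggested resdata command out of notification lines like the two above, here is a minimal Python sketch. The function name `parse_ems_notification` and the regular expression are my own illustration, not part of EMS, and the pattern has only been written against the line layout shown in this post.

```python
import re

# Pattern written against the EMS lines in syslog.log above; other EMS
# message layouts are not covered (assumption).
EMS_LINE = re.compile(
    r'Value: "(?P<severity>[A-Z]+) \((?P<level>\d+)\)" '
    r'for Resource: "(?P<resource>[^"]+)".*'
    r'obtain event details: (?P<command>.+)$'
)


def parse_ems_notification(line):
    """Return severity, level, resource and detail command, or None."""
    match = EMS_LINE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["level"] = int(fields["level"])
    return fields


# The CRITICAL line from syslog.log, copied from above.
line = (
    'Apr 18 14:27:21 HNDNS1 EMS [3559]: ------ EMS Event Notification ------ '
    'Value: "CRITICAL (5)" for Resource: '
    '"/storage/events/disks/default/0_1_1_0.0.0" (Threshold: >= " 3") '
    'Execute the following command to obtain event details: '
    '/opt/resmon/bin/resdata -R 233242626 '
    '-r /storage/events/disks/default/0_1_1_0.0.0 -n 233242626 -a'
)

info = parse_ems_notification(line)
print(info["severity"], info["level"], info["resource"])
print(info["command"])
```

The extracted command is only printed, never executed, so the sketch can be run on any machine that has a copy of the log.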
event.log
>------------ Event Monitoring Service Event Notification ------------<
Notification Time: Wed Apr 18 14:27:21 2012
HNDNS1 sent Event Monitor notification information:
/storage/events/disks/default/0_1_1_0.0.0 is >= 1.
Its current value is MAJORWARNING(3).
Event data from monitor:
Event Time..........: Wed Apr 18 14:27:21 2012
Severity............: MAJORWARNING
Monitor.............: disk_em
Event #.............: 100091
System..............: HNDNS1.mnc001.mcc460.gprs
Summary:
Disk at hardware path 0/1/1/0.0.0 : Software configuration error
Description of Error:
The device is in a condition where it requires action on the part of the
device driver or a human operator.
Probable Cause / Recommended Action:
The device has been reset by a Bus Device Reset message, a hard reset
condition, or a power-on reset.
If this is the case, no action is necessary.
Alternatively, a removable medium has been loaded or replaced.
If this is the case, no action is necessary.
Alternatively, the mode parameters, microcode, or inquiry data for the
device have been changed.
If this is the case, no action is necessary.
Alternatively, the installed version of the device driver does not match
that of the installed version of HP-UX. Install the correct version of the
driver.
Additional Event Data:
System IP Address...: 220.206.140.1
Event Id............: 0x4f8e5ec900000000
Monitor Version.....: B.01.01
Event Class.........: I/O
Client Configuration File...........:
/var/stm/config/tools/monitor/default_disk_em.clcfg
Client Configuration File Version...: A.01.00
Qualification criteria met.
Number of events..: 1
Associated OS error log entry id(s):
0x4f8e5cb300000000
Additional System Data:
System Model Number.............: ia64 hp server rx1620
OS Version......................: B.11.23
STM Version.....................: C.51.00
EMS Version.....................: A.04.20
Latest information on this event:
http://docs.hp.com/hpux/content/hardware/ems/scsi.htm#100091
v-v-v-v-v-v-v-v-v-v-v-v-v D E T A I L S v-v-v-v-v-v-v-v-v-v-v-v-v
Component Data:
Physical Device Path...: 0/1/1/0.0.0
Device Class...........: Disk
Inquiry Vendor ID......: HP 36.4G
Inquiry Product ID.....: MAX3036NC
Firmware Version.......: HPC1
Serial Number..........: K1011411 0610
Product/Device Identification Information:
Logger ID.........: sdisk
Product Identifier: SCSI Disk
Product Qualifier.: HP36.4GMAX3036NC
SCSI Target ID....: 0x00
SCSI LUN..........: 0x00
I/O Log Event Data:
Driver Status Code..................: 0x0000000B
Length of Logged Hardware Status....: 36 bytes.
Offset to Logged Manager Information: 40 bytes.
Length of Logged Manager Information: 34 bytes.
Hardware Status:
Raw H/W Status:
0x0000: 00 00 00 02 70 00 06 00 00 00 00 28 00 00 00 00
0x0010: 29 02 00 00 00 00 00 2A 00 67 01 01 00 00 00 00
0x0020: 00 00 00 00
SCSI Status...: CHECK CONDITION (0x02)
Indicates that a contingent allegiance condition has occurred. Any
error, exception, or abnormal condition that causes sense data to be
set will produce the CHECK CONDITION status.
SCSI Sense Data:
Undecoded Sense Data:
0x0000: 70 00 06 00 00 00 00 28 00 00 00 00 29 02 00 00
0x0010: 00 00 00 2A 00 67 01 01 00 00 00 00 00 00 00 00
SCSI Sense Data Fields:
Error Code : 0x70
Segment Number : 0x00
Bit Fields:
Filemark : 0
End-of-Medium : 0
Incorrect Length Indicator : 0
Sense Key : 0x06
Information Field Valid : FALSE
Information Field : 0x00000000
Additional Sense Length : 40
Command Specific : 0x00000000
Additional Sense Code : 0x29
Additional Sense Qualifier : 0x02
Field Replaceable Unit : 0x00
Sense Key Specific Data Valid : FALSE
Sense Key Specific Data : 0x00 0x00 0x00
Sense Key 0x06, UNIT ATTENTION, indicates that the target has been
reset by a BUS DEVICE RESET message, a hard reset condition, or by a
power-on reset. If not a reset, then one of the following may have
occurred.
1. A removable medium may have been changed.
2. The mode parameters in effect for this initiator have been
changed by another initiator.
3. The version or level of microcode has been changed.
4. Tagged commands queued for this initiator were cleared by
another initiator.
5. INQUIRY data has been changed.
6. The mode parameters in effect for this initiator have been
restored from non-volatile memory.
7. A change in the condition of a synchronized spindle.
8. Any other event that requires the attention of the initiator.
The combination of Additional Sense Code and Sense Qualifier (0x2902)
indicates: SCSI bus reset occurred.
SCSI Command Data Block:
Command Data Block Contents:
0x0000: 2A 00 00 5C 71 82 00 00 10 00
Command Data Block Fields (10-byte fmt):
Command Operation Code...(0x2A)..: WRITE
Logical Unit Number..............: 0
DPO Bit..........................: 0
FUA Bit..........................: 0
Relative Address Bit.............: 0
Logical Block Address............: 6058370 (0x005C7182)
Transfer Length..................: 16 (0x0010)
Manager-Specific Data Fields:
Request ID.............: 0x02000EDF
Data Residue...........: 0x00002000
CDB status.............: 0x00000002
Sense Status...........: 0x00000000
Bus ID.................: 0x02
Target ID..............: 0x00
LUN ID.................: 0x00
Sense Data Length......: 0x20
Q Tag..................: 0xF9
Retry Count............: 0
>---------- End Event Monitoring Service Event Notification ----------<
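As a cross-check on the decoded sense fields in the record above, the following Python sketch reads the same values straight from the "Undecoded Sense Data" bytes. It assumes the standard fixed-format sense layout (response code 0x70/0x71: sense key in the low nibble of byte 2, additional sense length in byte 7, ASC in byte 12, ASCQ in byte 13). The function name `decode_fixed_sense` is hypothetical.

```python
def decode_fixed_sense(hex_dump):
    """Decode the main fields of fixed-format SCSI sense data."""
    data = bytes.fromhex(hex_dump)
    if len(data) < 14:
        raise ValueError("need at least 14 bytes of sense data")
    response_code = data[0] & 0x7F
    if response_code not in (0x70, 0x71):
        raise ValueError("not fixed-format sense data")
    return {
        "response_code": response_code,
        "information_valid": bool(data[0] & 0x80),
        "filemark": bool(data[2] & 0x80),
        "end_of_medium": bool(data[2] & 0x40),
        "incorrect_length": bool(data[2] & 0x20),
        "sense_key": data[2] & 0x0F,
        "information": int.from_bytes(data[3:7], "big"),
        "additional_sense_length": data[7],
        "asc": data[12],
        "ascq": data[13],
    }


# Bytes copied from the "Undecoded Sense Data" dump in the event above.
SENSE_HEX = (
    "70 00 06 00 00 00 00 28 00 00 00 00 29 02 00 00 "
    "00 00 00 2A 00 67 01 01 00 00 00 00 00 00 00 00"
)

sense = decode_fixed_sense(SENSE_HEX)
print("sense key 0x%02X, ASC/ASCQ 0x%02X%02X"
      % (sense["sense_key"], sense["asc"], sense["ascq"]))
```

The result agrees with the monitor's own decoding: sense key 0x06 (UNIT ATTENTION) with ASC/ASCQ 0x2902, which the log describes as "SCSI bus reset occurred".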
>------------ Event Monitoring Service Event Notification ------------<
Notification Time: Wed Apr 18 14:27:21 2012
HNDNS1 sent Event Monitor notification information:
/storage/events/disks/default/0_1_1_0.0.0 is >= 1.
Its current value is CRITICAL(5).
Event data from monitor:
Event Time..........: Wed Apr 18 14:27:21 2012
Severity............: CRITICAL
Monitor.............: disk_em
Event #.............: 17
System..............: HNDNS1.mnc001.mcc460.gprs
Summary:
Disk at hardware path 0/1/1/0.0.0 : I/O request failed.
Description of Error:
The hardware responded to the initial query, but then it stopped
responding to more requests by the driver. The I/O request was not
completed.
Probable Cause / Recommended Action:
The bus device reset may have occurred or the device has failed. The bus
device reset could have occurred because the power was cycled or that the
device is a part of an enclosure and a device in that enclosure was pulled
out or put in, or that the interface card had a problem and it reset the
bus. If these errors continue, there may be a problem with the device or
the card.
Additional Event Data:
System IP Address...: 220.206.140.1
Event Id............: 0x4f8e5ec900000002
Monitor Version.....: B.01.01
Event Class.........: I/O
Client Configuration File...........:
/var/stm/config/tools/monitor/default_disk_em.clcfg
Client Configuration File Version...: A.01.00
Qualification criteria met.
Number of events..: 1
Associated OS error log entry id(s):
0x4f8e5cb300000002
Additional System Data:
System Model Number.............: ia64 hp server rx1620
OS Version......................: B.11.23
STM Version.....................: C.51.00
EMS Version.....................: A.04.20
Latest information on this event:
http://docs.hp.com/hpux/content/hardware/ems/disk_em.htm#17
v-v-v-v-v-v-v-v-v-v-v-v-v D E T A I L S v-v-v-v-v-v-v-v-v-v-v-v-v
Component Data:
Physical Device Path...: 0/1/1/0.0.0
Device Class...........: Disk
Inquiry Vendor ID......: HP 36.4G
Inquiry Product ID.....: MAX3036NC
Firmware Version.......: HPC1
Serial Number..........: K1011411 0610
Product/Device Identification Information:
Logger ID.........: sdisk
Product Identifier: SCSI Disk
Product Qualifier.: HP36.4GMAX3036NC
SCSI Target ID....: 0x00
SCSI LUN..........: 0x00
I/O Log Event Data:
Driver Status Code..................: 0x00000005
Length of Logged Hardware Status....: 4 bytes.
Offset to Logged Manager Information: 8 bytes.
Length of Logged Manager Information: 34 bytes.
SCSI Command Data Block:
Command Data Block Contents:
0x0000: 2A 00 00 5C 71 82 00 00 10 00
Command Data Block Fields (10-byte fmt):
Command Operation Code...(0x2A)..: WRITE
Logical Unit Number..............: 0
DPO Bit..........................: 0
FUA Bit..........................: 0
Relative Address Bit.............: 0
Logical Block Address............: 6058370 (0x005C7182)
Transfer Length..................: 16 (0x0010)
Manager-Specific Data Fields:
Request ID.............: 0x02000EDF
Data Residue...........: 0x00002000
CDB status.............: 0x00000400
Sense Status...........: 0x00000000
Bus ID.................: 0x02
Target ID..............: 0x00
LUN ID.................: 0x00
Sense Data Length......: 0x00
Q Tag..................: 0xF9
Retry Count............: 1
>---------- End Event Monitoring Service Event Notification ----------<
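The Command Data Block in the record above can be decoded by hand in the same way. The sketch below assumes the 10-byte READ/WRITE layout that the log itself decodes (opcode in byte 0, flag bits in byte 1, big-endian LBA in bytes 2-5, transfer length in bytes 7-8). The function name `decode_rw10_cdb` is hypothetical, and the 512-byte block size is my assumption, since the log does not state it.

```python
def decode_rw10_cdb(hex_dump):
    """Decode a 10-byte READ/WRITE CDB into its main fields."""
    cdb = bytes.fromhex(hex_dump)
    if len(cdb) != 10:
        raise ValueError("expected a 10-byte CDB")
    return {
        "opcode": cdb[0],
        "lun": cdb[1] >> 5,            # older layout: LUN in bits 7-5
        "dpo": bool(cdb[1] & 0x10),
        "fua": bool(cdb[1] & 0x08),
        "reladr": bool(cdb[1] & 0x01),
        "lba": int.from_bytes(cdb[2:6], "big"),
        "transfer_length": int.from_bytes(cdb[7:9], "big"),
    }


# CDB bytes and data residue copied from the event above.
CDB_HEX = "2A 00 00 5C 71 82 00 00 10 00"
DATA_RESIDUE = 0x00002000
BLOCK_SIZE = 512  # assumption: not stated anywhere in the log

cdb = decode_rw10_cdb(CDB_HEX)
requested_bytes = cdb["transfer_length"] * BLOCK_SIZE
print("opcode 0x%02X, LBA %d, %d blocks"
      % (cdb["opcode"], cdb["lba"], cdb["transfer_length"]))
print("residue equals full request:", DATA_RESIDUE == requested_bytes)
```

Under the 512-byte assumption, the residue of 0x2000 bytes equals the whole 16-block request, which would mean none of the data for this WRITE was transferred on that attempt.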
>------------ Event Monitoring Service Event Notification ------------<
Notification Time: Wed Apr 18 14:27:21 2012
HNDNS1 sent Event Monitor notification information:
/storage/events/disks/default/0_1_1_0.0.0 is >= 1.
Its current value is INFORMATION(1).
Event data from monitor:
Event Time..........: Wed Apr 18 14:27:21 2012
Severity............: INFORMATION
Monitor.............: disk_em
Event #.............: 100401
System..............: HNDNS1.mnc001.mcc460.gprs
Summary:
Disk at hardware path 0/1/1/0.0.0 : Successful completion of operation
Description of Error:
The device driver has successfully completed an I/O request.
Probable Cause / Recommended Action:
No action is necessary.
Additional Event Data:
System IP Address...: 220.206.140.1
Event Id............: 0x4f8e5ec900000004
Monitor Version.....: B.01.01
Event Class.........: I/O
Client Configuration File...........:
/var/stm/config/tools/monitor/default_disk_em.clcfg
Client Configuration File Version...: A.01.00
Qualification criteria met.
Number of events..: 1
Associated OS error log entry id(s):
0x4f8e5cb700000001
Additional System Data:
System Model Number.............: ia64 hp server rx1620
OS Version......................: B.11.23
STM Version.....................: C.51.00
EMS Version.....................: A.04.20
Latest information on this event:
http://docs.hp.com/hpux/content/hardware/ems/scsi.htm#100401
v-v-v-v-v-v-v-v-v-v-v-v-v D E T A I L S v-v-v-v-v-v-v-v-v-v-v-v-v
Component Data:
Physical Device Path...: 0/1/1/0.0.0
Device Class...........: Disk
Inquiry Vendor ID......: HP 36.4G
Inquiry Product ID.....: MAX3036NC
Firmware Version.......: HPC1
Serial Number..........: K1011411 0610
Product/Device Identification Information:
Logger ID.........: sdisk
Product Identifier: SCSI Disk
Product Qualifier.: HP36.4GMAX3036NC
SCSI Target ID....: 0x00
SCSI LUN..........: 0x00
I/O Log Event Data:
Driver Status Code..................: 0x00000000
Length of Logged Hardware Status....: 4 bytes.
Offset to Logged Manager Information: 8 bytes.
Length of Logged Manager Information: 34 bytes.
Hardware Status:
Raw H/W Status:
0x0000: 00 00 00 00
SCSI Status...: GOOD (0x00)
Indicates that the target has successfully completed the command.
SCSI Sense Data: (not present in log record)
SCSI Command Data Block:
Command Data Block Contents:
0x0000: 2A 00 00 5C 71 82 00 00 10 00
Command Data Block Fields (10-byte fmt):
Command Operation Code...(0x2A)..: WRITE
Logical Unit Number..............: 0
DPO Bit..........................: 0
FUA Bit..........................: 0
Relative Address Bit.............: 0
Logical Block Address............: 6058370 (0x005C7182)
Transfer Length..................: 16 (0x0010)
Manager-Specific Data Fields:
Request ID.............: 0x02000EDF
Data Residue...........: 0x00000000
CDB status.............: 0x00000000
Sense Status...........: 0x00000000
Bus ID.................: 0x02
Target ID..............: 0x00
LUN ID.................: 0x00
Sense Data Length......: 0x00
Q Tag..................: 0xF5
Retry Count............: 2
>---------- End Event Monitoring Service Event Notification ----------<
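To see the three records side by side, here is a rough Python sketch that splits event.log text on the notification header and pulls out a few fields per record. The names `summarize_events` and `SAMPLE` are hypothetical, and `SAMPLE` is a trimmed excerpt of the records above (only the four fields of interest), not the full file.

```python
import re

# Matches the opening banner only; the closing banner contains "End".
HEADER = re.compile(
    r"^>-+ Event Monitoring Service Event Notification -+<[ \t]*$",
    re.MULTILINE,
)

FIELDS = {
    "severity": r"^[ \t]*Severity\.+: (\S+)",
    "event": r"^[ \t]*Event #\.+: (\d+)",
    "request_id": r"^[ \t]*Request ID\.+: (\S+)",
    "retry_count": r"^[ \t]*Retry Count\.+: (\d+)",
}


def summarize_events(log_text):
    """Return one dict of selected fields per EMS notification."""
    rows = []
    for record in HEADER.split(log_text)[1:]:
        row = {}
        for name, pattern in FIELDS.items():
            match = re.search(pattern, record, re.MULTILINE)
            row[name] = match.group(1) if match else None
        rows.append(row)
    return rows


# Trimmed excerpt of the three records in this post.
SAMPLE = """\
>------------ Event Monitoring Service Event Notification ------------<
Severity............: MAJORWARNING
Event #.............: 100091
Request ID.............: 0x02000EDF
Retry Count............: 0
>---------- End Event Monitoring Service Event Notification ----------<
>------------ Event Monitoring Service Event Notification ------------<
Severity............: CRITICAL
Event #.............: 17
Request ID.............: 0x02000EDF
Retry Count............: 1
>---------- End Event Monitoring Service Event Notification ----------<
>------------ Event Monitoring Service Event Notification ------------<
Severity............: INFORMATION
Event #.............: 100401
Request ID.............: 0x02000EDF
Retry Count............: 2
>---------- End Event Monitoring Service Event Notification ----------<
"""

for row in summarize_events(SAMPLE):
    print(row["severity"], row["event"], row["request_id"], row["retry_count"])
```

All three records carry the same Request ID (0x02000EDF) with retry counts 0, 1, and 2, which is consistent with a single WRITE being retried after the bus reset until it completed. That by itself does not identify what triggered the reset.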
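After I rebooted the server once more, the SCSI bus reset error no longer appeared in dmesg, and syslog.log and event.log no longer contained any SCSI bus reset errors either.
Could someone help me work out where the problem is?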