Please help: HP-UX reports SCSI BUS RESET (waiting online for replies)

Post 1, posted 2012-04-20 13:52
Last edited by lp8653 on 2012-04-20 14:07

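Hi all. Yesterday I ran into a problem: an HP RX 1640 host (the event log below reports the model as rx1620) suddenly stopped responding, although it could still be pinged.
I connected a console directly to the serial console port, but there was still no output.
I shut it down with the power button, disconnected power, and restarted. The system booted normally, but dmesg reported a SCSI bus reset during startup.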

syslog.log contains the following entries:
Apr 18 14:27:13 HNDNS1 vmunix: SCSI Ultra320 0/1/1/0 instance 2:       Driver initiating SCSI bus reset.       Condition cleared, no intervention required.
Apr 18 14:27:21 HNDNS1 EMS [3559]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)" for Resource: "/storage/events/disks/default/0_1_1_0.0.0"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 233242626 -r /storage/events/disks/default/0_1_1_0.0.0 -n 233242625 -a
Apr 18 14:27:21 HNDNS1 EMS [3559]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/storage/events/disks/default/0_1_1_0.0.0"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 233242626 -r /storage/events/disks/default/0_1_1_0.0.0 -n 233242626 -a
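For anyone who wants to pull the severity and disk out of EMS lines like the ones above, here is a minimal Python sketch. The regular expression, the function name `parse_ems_line`, and the rule for turning the resource name into a hardware path (underscores become slashes) are assumptions inferred only from this log, not a documented EMS format.

```python
import re

# Pattern inferred from the EMS syslog lines in this thread (an assumption,
# not a documented format).
EMS_RE = re.compile(
    r'Value: "(?P<severity>\w+) \((?P<level>\d+)\)" '
    r'for Resource: "(?P<resource>[^"]+)"'
)


def parse_ems_line(line):
    """Return severity, level, resource and hardware path, or None."""
    match = EMS_RE.search(line)
    if match is None:
        return None
    resource = match.group("resource")
    # In this log, resource ".../0_1_1_0.0.0" corresponds to the
    # hardware path 0/1/1/0.0.0 shown in event.log.
    hardware_path = resource.rsplit("/", 1)[-1].replace("_", "/")
    return {
        "severity": match.group("severity"),
        "level": int(match.group("level")),
        "resource": resource,
        "hardware_path": hardware_path,
    }
```

Lines that are not EMS notifications (for example the vmunix line) simply return `None`.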

event.log


>------------ Event Monitoring Service Event Notification ------------<
Notification Time: Wed Apr 18 14:27:21 2012
HNDNS1 sent Event Monitor notification information:
/storage/events/disks/default/0_1_1_0.0.0 is >= 1.
Its current value is MAJORWARNING(3).

Event data from monitor:
Event Time..........: Wed Apr 18 14:27:21 2012
Severity............: MAJORWARNING
Monitor.............: disk_em
Event #.............: 100091              
System..............: HNDNS1.mnc001.mcc460.gprs
Summary:
     Disk at hardware path 0/1/1/0.0.0 : Software configuration error

Description of Error:
     The device is in a condition where it requires action on the part of the
     device driver or a human operator.
Probable Cause / Recommended Action:
     The device has been reset by a Bus Device Reset message, a hard reset
     condition, or a power-on reset.
     If this is the case, no action is necessary.
     Alternatively, a removable medium has been loaded or replaced.
     If this is the case, no action is necessary.
     Alternatively, the mode parameters, microcode, or inquiry data for the
     device have been changed.
     If this is the case, no action is necessary.
     Alternatively, the installed version of the device driver does not match
     that of the installed version of HP-UX. Install the correct version of the
     driver.
Additional Event Data:
     System IP Address...: 220.206.140.1
     Event Id............: 0x4f8e5ec900000000
     Monitor Version.....: B.01.01
     Event Class.........: I/O
     Client Configuration File...........:
     /var/stm/config/tools/monitor/default_disk_em.clcfg
     Client Configuration File Version...: A.01.00
          Qualification criteria met.
               Number of events..: 1
     Associated OS error log entry id(s):
          0x4f8e5cb300000000
     Additional System Data:
          System Model Number.............: ia64 hp server rx1620
          OS Version......................: B.11.23
          STM Version.....................: C.51.00
          EMS Version.....................: A.04.20
     Latest information on this event:
          http://docs.hp.com/hpux/content/hardware/ems/scsi.htm#100091
v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S    v-v-v-v-v-v-v-v-v-v-v-v-v

Component Data:
     Physical Device Path...: 0/1/1/0.0.0
     Device Class...........: Disk
     Inquiry Vendor ID......: HP 36.4G
     Inquiry Product ID.....: MAX3036NC      
     Firmware Version.......: HPC1
     Serial Number..........: K1011411    0610
Product/Device Identification Information:
     Logger ID.........: sdisk
     Product Identifier: SCSI Disk
     Product Qualifier.: HP36.4GMAX3036NC
     SCSI Target ID....: 0x00
     SCSI LUN..........: 0x00
I/O Log Event Data:
     Driver Status Code..................: 0x0000000B
     Length of Logged Hardware Status....: 36 bytes.
     Offset to Logged Manager Information: 40 bytes.
     Length of Logged Manager Information: 34 bytes.
Hardware Status:
     Raw H/W Status:
          0x0000: 00 00 00 02   70 00 06 00   00 00 00 28   00 00 00 00
          0x0010: 29 02 00 00   00 00 00 2A   00 67 01 01   00 00 00 00
          0x0020: 00 00 00 00  
     SCSI Status...: CHECK CONDITION (0x02)
          Indicates that a contingent allegiance condition has occurred.  Any
          error, exception, or abnormal condition that causes sense data to be
          set will produce the CHECK CONDITION status.
     
SCSI Sense Data:
     Undecoded Sense Data:
          0x0000: 70 00 06 00   00 00 00 28   00 00 00 00   29 02 00 00
          0x0010: 00 00 00 2A   00 67 01 01   00 00 00 00   00 00 00 00
     
     SCSI Sense Data Fields:
          Error Code                      : 0x70
          Segment Number                  : 0x00
          Bit Fields:      
               Filemark                   : 0
               End-of-Medium              : 0
               Incorrect Length Indicator : 0
          Sense Key                       : 0x06
          Information Field Valid         : FALSE               
          Information Field               : 0x00000000
          Additional Sense Length         : 40
          Command Specific                : 0x00000000
          Additional Sense Code           : 0x29
          Additional Sense Qualifier      : 0x02
          Field Replaceable Unit          : 0x00
          Sense Key Specific Data Valid   : FALSE               
          Sense Key Specific Data         : 0x00 0x00 0x00
                       
          Sense Key 0x06, UNIT ATTENTION, indicates that the target has been
          reset by a BUS DEVICE RESET message, a hard reset condition, or by a
          power-on reset. If not a reset, then one of the following may have
          occurred.
             1. A removable medium may have been changed.
             2. The mode parameters in effect for this initiator have been
             changed by another initiator.
             3. The version or level of microcode has been changed.
             4. Tagged commands queued for this initiator were cleared by
             another initiator.
             5. INQUIRY data has been changed.
             6. The mode parameters in effect for this initiator have been
             restored from non-volatile memory.
             7. A change in the condition of a synchronized spindle.
             8. Any other event that requires the attention of the initiator.
                       
          The combination of Additional Sense Code and Sense Qualifier (0x2902)
          indicates: SCSI bus reset occurred.
SCSI Command Data Block:
     Command Data Block Contents:
          0x0000: 2A 00 00 5C   71 82 00 00   10 00
     
     Command Data Block Fields (10-byte fmt):
          Command Operation Code...(0x2A)..: WRITE
          Logical Unit Number..............: 0
          DPO Bit..........................: 0
          FUA Bit..........................: 0
          Relative Address Bit.............: 0
          Logical Block Address............: 6058370 (0x005C7182)
          Transfer Length..................: 16 (0x0010)
Manager-Specific Data Fields:
     Request ID.............: 0x02000EDF
     Data Residue...........: 0x00002000
     CDB status.............: 0x00000002
     Sense Status...........: 0x00000000
     Bus ID.................: 0x02
     Target ID..............: 0x00
     LUN ID.................: 0x00
     Sense Data Length......: 0x20
     Q Tag..................: 0xF9
     Retry Count............: 0

>---------- End Event Monitoring Service Event Notification ----------<
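The sense data in the notification above can be checked by hand. The following Python sketch decodes fixed-format SCSI sense data (response code 0x70 or 0x71) from a hex dump in the layout the log uses. The helper names `parse_hex_dump` and `decode_fixed_sense` are made up for illustration, and the lookup tables list only a few well-known codes.

```python
# A few sense keys and ASC/ASCQ pairs; deliberately incomplete.
SENSE_KEYS = {
    0x00: "NO SENSE", 0x01: "RECOVERED ERROR", 0x02: "NOT READY",
    0x03: "MEDIUM ERROR", 0x04: "HARDWARE ERROR", 0x05: "ILLEGAL REQUEST",
    0x06: "UNIT ATTENTION", 0x0B: "ABORTED COMMAND",
}
ASC_ASCQ = {
    (0x29, 0x00): "POWER ON, RESET, OR BUS DEVICE RESET OCCURRED",
    (0x29, 0x02): "SCSI BUS RESET OCCURRED",
}


def parse_hex_dump(text):
    """Turn lines like '0x0000: 70 00 06 00 ...' into a bytes object."""
    data = bytearray()
    for line in text.strip().splitlines():
        _offset, _sep, payload = line.partition(":")
        data.extend(int(token, 16) for token in payload.split())
    return bytes(data)


def decode_fixed_sense(data):
    """Decode the main fields of fixed-format sense data."""
    if len(data) < 14:
        raise ValueError("need at least 14 bytes of sense data")
    response_code = data[0] & 0x7F
    if response_code not in (0x70, 0x71):
        raise ValueError("not fixed-format sense data")
    key, asc, ascq = data[2] & 0x0F, data[12], data[13]
    return {
        "info_valid": bool(data[0] & 0x80),
        "sense_key": key,
        "sense_key_name": SENSE_KEYS.get(key, "UNKNOWN"),
        "additional_length": data[7],
        "asc": asc,
        "ascq": ascq,
        "meaning": ASC_ASCQ.get((asc, ascq), "UNKNOWN"),
    }


SENSE_DUMP = """
0x0000: 70 00 06 00   00 00 00 28   00 00 00 00   29 02 00 00
0x0010: 00 00 00 2A   00 67 01 01   00 00 00 00   00 00 00 00
"""

if __name__ == "__main__":
    print(decode_fixed_sense(parse_hex_dump(SENSE_DUMP)))
```

For the dump from this event the sketch gives sense key 0x06 (UNIT ATTENTION), ASC 0x29 and ASCQ 0x02, which agrees with the decoded fields in the log: the disk is reporting that a bus reset happened, not that the medium is bad.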
>------------ Event Monitoring Service Event Notification ------------<
Notification Time: Wed Apr 18 14:27:21 2012
HNDNS1 sent Event Monitor notification information:
/storage/events/disks/default/0_1_1_0.0.0 is >= 1.
Its current value is CRITICAL(5).

Event data from monitor:
Event Time..........: Wed Apr 18 14:27:21 2012
Severity............: CRITICAL
Monitor.............: disk_em
Event #.............: 17                  
System..............: HNDNS1.mnc001.mcc460.gprs
Summary:
     Disk at hardware path 0/1/1/0.0.0 : I/O request failed.         

Description of Error:
     The hardware responded to the initial query, but then it stopped
     responding to more requests by the driver. The I/O request was not
     completed.
Probable Cause / Recommended Action:
     The bus device reset may have occurred or the device has failed. The bus
     device reset could have occurred because the power was cycled or that the
     device is a part of an enclosure and a device in that enclosure was pulled
     out or put in, or that the interface card had a problem and it reset the
     bus. If these errors continue, there may be a problem with the device or
     the card.   
Additional Event Data:
     System IP Address...: 220.206.140.1
     Event Id............: 0x4f8e5ec900000002
     Monitor Version.....: B.01.01
     Event Class.........: I/O
     Client Configuration File...........:
     /var/stm/config/tools/monitor/default_disk_em.clcfg
     Client Configuration File Version...: A.01.00
          Qualification criteria met.
               Number of events..: 1
     Associated OS error log entry id(s):
          0x4f8e5cb300000002
     Additional System Data:
          System Model Number.............: ia64 hp server rx1620
          OS Version......................: B.11.23
          STM Version.....................: C.51.00
          EMS Version.....................: A.04.20
     Latest information on this event:
          http://docs.hp.com/hpux/content/hardware/ems/disk_em.htm#17
v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S    v-v-v-v-v-v-v-v-v-v-v-v-v

Component Data:
     Physical Device Path...: 0/1/1/0.0.0
     Device Class...........: Disk
     Inquiry Vendor ID......: HP 36.4G
     Inquiry Product ID.....: MAX3036NC      
     Firmware Version.......: HPC1
     Serial Number..........: K1011411    0610
Product/Device Identification Information:
     Logger ID.........: sdisk
     Product Identifier: SCSI Disk
     Product Qualifier.: HP36.4GMAX3036NC
     SCSI Target ID....: 0x00
     SCSI LUN..........: 0x00
I/O Log Event Data:
     Driver Status Code..................: 0x00000005
     Length of Logged Hardware Status....: 4 bytes.
     Offset to Logged Manager Information: 8 bytes.
     Length of Logged Manager Information: 34 bytes.
SCSI Command Data Block:
     Command Data Block Contents:
          0x0000: 2A 00 00 5C   71 82 00 00   10 00
     
     Command Data Block Fields (10-byte fmt):
          Command Operation Code...(0x2A)..: WRITE
          Logical Unit Number..............: 0
          DPO Bit..........................: 0
          FUA Bit..........................: 0
          Relative Address Bit.............: 0
          Logical Block Address............: 6058370 (0x005C7182)
          Transfer Length..................: 16 (0x0010)
Manager-Specific Data Fields:
     Request ID.............: 0x02000EDF
     Data Residue...........: 0x00002000
     CDB status.............: 0x00000400
     Sense Status...........: 0x00000000
     Bus ID.................: 0x02
     Target ID..............: 0x00
     LUN ID.................: 0x00
     Sense Data Length......: 0x00
     Q Tag..................: 0xF9
     Retry Count............: 1

>---------- End Event Monitoring Service Event Notification ----------<
>------------ Event Monitoring Service Event Notification ------------<
Notification Time: Wed Apr 18 14:27:21 2012
HNDNS1 sent Event Monitor notification information:
/storage/events/disks/default/0_1_1_0.0.0 is >= 1.
Its current value is INFORMATION(1).

Event data from monitor:
Event Time..........: Wed Apr 18 14:27:21 2012
Severity............: INFORMATION
Monitor.............: disk_em
Event #.............: 100401              
System..............: HNDNS1.mnc001.mcc460.gprs
Summary:
     Disk at hardware path 0/1/1/0.0.0 : Successful completion of operation

Description of Error:
     The device driver has successfully completed an I/O request.
Probable Cause / Recommended Action:
     No action is necessary.
Additional Event Data:
     System IP Address...: 220.206.140.1
     Event Id............: 0x4f8e5ec900000004
     Monitor Version.....: B.01.01
     Event Class.........: I/O
     Client Configuration File...........:
     /var/stm/config/tools/monitor/default_disk_em.clcfg
     Client Configuration File Version...: A.01.00
          Qualification criteria met.
               Number of events..: 1
     Associated OS error log entry id(s):
          0x4f8e5cb700000001
     Additional System Data:
          System Model Number.............: ia64 hp server rx1620
          OS Version......................: B.11.23
          STM Version.....................: C.51.00
          EMS Version.....................: A.04.20
     Latest information on this event:
          http://docs.hp.com/hpux/content/hardware/ems/scsi.htm#100401
v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S    v-v-v-v-v-v-v-v-v-v-v-v-v

Component Data:
     Physical Device Path...: 0/1/1/0.0.0
     Device Class...........: Disk
     Inquiry Vendor ID......: HP 36.4G
     Inquiry Product ID.....: MAX3036NC      
     Firmware Version.......: HPC1
     Serial Number..........: K1011411    0610
Product/Device Identification Information:
     Logger ID.........: sdisk
     Product Identifier: SCSI Disk
     Product Qualifier.: HP36.4GMAX3036NC
     SCSI Target ID....: 0x00
     SCSI LUN..........: 0x00
I/O Log Event Data:
     Driver Status Code..................: 0x00000000
     Length of Logged Hardware Status....: 4 bytes.
     Offset to Logged Manager Information: 8 bytes.
     Length of Logged Manager Information: 34 bytes.
Hardware Status:
     Raw H/W Status:
          0x0000: 00 00 00 00  
     SCSI Status...: GOOD (0x00)
          Indicates that the target has successfully completed the command.
SCSI Sense Data: (not present in log record)
SCSI Command Data Block:
     Command Data Block Contents:
          0x0000: 2A 00 00 5C   71 82 00 00   10 00
     
     Command Data Block Fields (10-byte fmt):
          Command Operation Code...(0x2A)..: WRITE
          Logical Unit Number..............: 0
          DPO Bit..........................: 0
          FUA Bit..........................: 0
          Relative Address Bit.............: 0
          Logical Block Address............: 6058370 (0x005C7182)
          Transfer Length..................: 16 (0x0010)
Manager-Specific Data Fields:
     Request ID.............: 0x02000EDF
     Data Residue...........: 0x00000000
     CDB status.............: 0x00000000
     Sense Status...........: 0x00000000
     Bus ID.................: 0x02
     Target ID..............: 0x00
     LUN ID.................: 0x00
     Sense Data Length......: 0x00
     Q Tag..................: 0xF5
     Retry Count............: 2

>---------- End Event Monitoring Service Event Notification ----------<
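All three notifications above carry the same command data block, so it may help to decode it once. This is a minimal Python sketch for 10-byte READ/WRITE CDBs. It uses the SCSI-2 layout of byte 1 (LUN, DPO, FUA, RelAdr), which is what the log's own field list suggests, and the function name `decode_rw10` is made up for illustration.

```python
import struct

OPCODES = {0x28: "READ(10)", 0x2A: "WRITE(10)"}


def decode_rw10(cdb):
    """Decode a 10-byte READ/WRITE CDB (SCSI-2 layout of byte 1)."""
    if len(cdb) != 10:
        raise ValueError("expected a 10-byte CDB")
    # Big-endian: opcode, flags, 32-bit LBA, reserved, 16-bit length, control
    opcode, flags, lba, _reserved, blocks, control = struct.unpack(
        ">BBIBHB", cdb
    )
    return {
        "opcode": OPCODES.get(opcode, f"0x{opcode:02X}"),
        "lun": flags >> 5,
        "dpo": (flags >> 4) & 1,
        "fua": (flags >> 3) & 1,
        "reladr": flags & 1,
        "lba": lba,
        "blocks": blocks,
        "control": control,
    }


if __name__ == "__main__":
    # CDB bytes copied from the event log in this thread.
    print(decode_rw10(bytes.fromhex("2A 00 00 5C 71 82 00 00 10 00")))
```

The result is a WRITE of 16 blocks at LBA 6058370, as the log says. The three records also share Request ID 0x02000EDF and show Retry Count 0, 1 and 2, which is consistent with one write being retried until it completed. If the disk uses 512-byte blocks (an assumption, the log does not state the block size), the Data Residue of 0x2000 in the first two records equals the full 16-block transfer.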

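After rebooting the server once more, the SCSI bus reset error no longer appeared in dmesg, and neither syslog.log nor event.log contained any further SCSI bus reset errors.

Could someone help me work out where the problem is?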

Post 2, posted 2012-04-20 16:50
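This SCSI bus reset was recovered from. Check whether the LV on the disk at 0/1/1/0.0.0 has any problems.

Is the disk at 0/1/1/0.0.0 the root disk? You could try reading the whole disk with dd to see whether it passes. If it does not pass, try replacing the disk.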
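On HP-UX the usual way to do this is a plain read with dd from the raw device to /dev/null, for example `dd if=/dev/rdsk/cXtYdZ of=/dev/null bs=1024k`, where `cXtYdZ` is a placeholder for the real device file (`ioscan -fnC disk` lists it). As a rough illustration of the same idea, here is a Python sketch that reads a device or file sequentially and stops at the first I/O error. The function name `read_scan` is made up, and whether Python is available on the host is an assumption.

```python
def read_scan(path, block_size=1024 * 1024):
    """Read `path` from start to end.

    Returns (bytes_read, error): error is None if the whole device or
    file was readable, otherwise the OSError raised by the failing read.
    """
    total = 0
    # buffering=0 gives unbuffered reads, so each read maps to one request
    with open(path, "rb", buffering=0) as dev:
        while True:
            try:
                chunk = dev.read(block_size)
            except OSError as exc:
                return total, exc
            if not chunk:  # end of device or file
                return total, None
            total += len(chunk)


if __name__ == "__main__":
    import sys

    # Usage: python read_scan.py <device-or-file>
    done, err = read_scan(sys.argv[1])
    if err is None:
        print("pass: read", done, "bytes")
    else:
        print("FAILED after", done, "bytes:", err)
```

Like dd without `conv=noerror`, this stops at the first unreadable block. A clean pass only shows that every block was readable at that moment, so it does not rule out an intermittent fault.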

Post 3, posted 2012-04-24 13:02
The root disk has failed! Replace the disk and reinstall the OS!