免费注册 查看新帖 |

Chinaunix

  平台 论坛 博客 文库
123下一页
最近访问板块 发新帖
查看: 8875 | 回复: 24
打印 上一主题 下一主题

HP R2660 经常死机,面板显示红灯,查系统日志如下: [复制链接]

论坛徽章:
0
跳转到指定楼层
1 [收藏(0)] [报告]
发表于 2010-06-23 11:03 |只看该作者 |正序浏览
一、查看var/adm/syslog/syslog.log文件提示:
Jun 23 11:36:49 rx2600b /usr/sbin/envd[2321]: ***** FANFAIL_CRIT WARNING *****
Jun 23 11:36:49 rx2600b /usr/sbin/envd[2321]: Chassis fan failed.  Replace the fan.
Jun 23 11:36:49 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for
Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the following c
ommand to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_corehw
/core_hw -n 231342082 -a
Jun 23 11:37:52 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for
Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the following c
ommand to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_corehw
/core_hw -n 231342083 -a
Jun 23 11:37:52 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)"
for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the followi
ng command to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_co
rehw/core_hw -n 231342084 -a
Jun 23 11:39:45 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)"
for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the followi
ng command to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_co
rehw/core_hw -n 231342085 -a
Jun 23 11:39:47 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)"
for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the followi
ng command to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_co
rehw/core_hw -n 231342086 -a
Jun 23 11:39:51 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)"
for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the followi
ng command to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_co
rehw/core_hw -n 231342087 -a
Jun 23 11:39:58 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)"
for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the followi
ng command to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_co
rehw/core_hw -n 231342088 -a
Jun 23 11:40:33 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)"
for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the followi
ng command to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_co
rehw/core_hw -n 231342089 -a
Jun 23 11:41:10 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)"
for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the followi
ng command to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_co
rehw/core_hw -n 231342090 -a
Jun 23 11:42:20 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)"
for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the followi
ng command to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_co
rehw/core_hw -n 231342091 -a
Jun 23 11:42:22 rx2600b EMS [3530]: ------ EMS Event Notification ------   Value: "MAJORWARNING (3)"
for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the followi
ng command to obtain event details:   /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_co
rehw/core_hw -n 231342092 -a

================================================================

二、运行"/opt/resmon/bin/resdata -R 232062978 -r /system/events/ia64_corehw/core_hw -n 232063368 -a"内容提示:

# /opt/resmon/bin/resdata -R 231342082 -r /system/events/ia64_corehw/core_hw -n 231342092 -a

CURRENT MONITOR DATA:

Event Time..........: Wed Jun 23 11:42:22 2010
Severity............: MAJORWARNING
Monitor.............: ia64_corehw
Event #.............: 115002
System..............: rx2600b

Summary:
     Cooling Unit : Performance is degrading.


Description of Error:

     The cooling unit sensor has detected that one or more zones of chassis is
     not being cooled enough.

Probable Cause / Recommended Action:

     One or more fans of the cooling unit may not be functioning properly. If
     that is the case, please contact your HP support engineer to have the
     cooling unit checked. If problem persists and worsens, there may be
     hardware failure and the system may be shutdown automatically by the
     firmware.

     For information on the sensor that generated this event, refer to FRU ID
     in Event Details section.

Additional Event Data:
     System IP Address...: 10.37.136.249
     Event Id............: 0x4c22477e00000000
     Monitor Version.....: B.01.00
     Event Class.........: System
     Client Configuration File...........:
     /var/stm/config/tools/monitor/default_ia64_corehw.clcfg
     Client Configuration File Version...: A.01.00
          Qualification criteria met.
               Number of events..: 1
     Associated OS error log entry id(s):
          None
     Additional System Data:
          System Model Number.............: ia64 hp server rx2600
          EMS Version.....................: A.04.20
          STM Version.....................: C.56.00
          System Serial Number............: SG44016437
     Latest information on this event:
          http://docs.hp.com/hpux/content/ ... 4_corehw.htm#115002

v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S    v-v-v-v-v-v-v-v-v-v-v-v-v


Event Details :

     Event Date .............: Wed Jun 23 11:42:17 2010
     Sensor Number ..........: 0x11
     Sensor Type ............: Unknown
     Sensor Class ...........: Discrete severe
     Sensor Reading/Offset...: 0x01 (Offset)
     Event  Type.............: Assertion
     Entity ID ..............: 30
     Generic Message.........:
       Cooling Device :  Transition to non-critical
     Entity FRU Id Info......:
       cooling unit (Sensor ID: Cooling 1 (Sys))
不知是哪个部件出问题了?多谢!~

论坛徽章:
0
25 [报告]
发表于 2010-07-27 15:51 |只看该作者
给个例子:
MP:CM> ps


PS
System Power state: Off           
Temperature       : Low OverTemp  


Power supplies                 State                        
-----------------------------------------------------------
Power Supply 1                Normal                          
Power Supply 2                A/C Disconnected or Out of Range  


Fans                           State                        
-----------------------------------------------------------
Fan1A (CPU)                   Normal                        
Fan1B (CPU)                   Normal                        
Fan2 (Memory)                 Normal                        
Fan3 (I/O)                    Normal                        
CPU0 Fan                      Normal                        
CPU1 Fan                      Normal

论坛徽章:
10
处女座
日期:2015-01-22 16:08:50技术图书徽章
日期:2018-09-13 11:25:52技术图书徽章
日期:2018-09-13 11:25:45技术图书徽章
日期:2018-09-13 11:25:37技术图书徽章
日期:2018-09-13 11:25:29黑曼巴
日期:2018-06-04 09:03:192017金鸡报晓
日期:2017-01-10 15:19:56极客徽章
日期:2016-12-07 14:03:402015年迎新春徽章
日期:2015-03-04 09:50:28技术图书徽章
日期:2018-09-13 11:26:01
24 [报告]
发表于 2010-07-04 22:10 |只看该作者
22楼兄弟厉害

论坛徽章:
0
23 [报告]
发表于 2010-07-04 11:52 |只看该作者
楼上说的很对

论坛徽章:
48
15-16赛季CBA联赛之青岛
日期:2021-01-07 13:41:2315-16赛季CBA联赛之上海
日期:2020-12-01 18:02:0720周年集字徽章-20	
日期:2020-10-28 14:14:2620周年集字徽章-20	
日期:2020-10-28 14:04:3015-16赛季CBA联赛之天津
日期:2020-10-18 22:51:412016猴年福章徽章
日期:2016-02-18 15:30:3415-16赛季CBA联赛之北控
日期:2015-12-22 13:30:48操作系统版块每日发帖之星
日期:2015-12-07 06:20:00操作系统版块每日发帖之星
日期:2015-09-04 06:20:002015亚冠之德黑兰石油
日期:2015-08-05 18:46:082015年亚洲杯之巴勒斯坦
日期:2015-04-19 10:42:502015年亚洲杯之巴林
日期:2015-04-09 08:03:23
22 [报告]
发表于 2010-07-03 21:12 |只看该作者
一、查看var/adm/syslog/syslog.log文件提示:
...
Additional System Data:
          System Model Number.............: ia64 hp server rx2600
          EMS Version.....................: A.04.20
          STM Version.....................: C.56.00
  ......

v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S    v-v-v-v-v-v-v-v-v-v-v-v-v
Event Details :

     Event Date .............: Wed Jun 23 11:42:17 2010
     Sensor Number ..........: 0x11
     Sensor Type ............: Unknown
     Sensor Class ...........: Discrete severe
     Sensor Reading/Offset...: 0x01 (Offset)
     Event  Type.............: Assertion
     Entity ID ..............: 30
     Generic Message.........:
       Cooling Device :  Transition to non-critical
     Entity FRU Id Info......:
       cooling unit (Sensor ID: Cooling 1 (Sys))
太极球 发表于 2010-06-23 11:03

首先这个是rx2600,不是rx2660。
第二,event.log里面有说明了是Cooling 1(Sys)的问题,那么对应的风扇无非就是Fan 1A或Fan 1B(吹向CPU位置的那2个风扇)。

如果不进入MP卡查看具体信息的话,可以观察rx2600前面板状态灯来确定风扇位置:
如果System灯闪黄色的话,左边LED1常亮绿色并LED4常亮红色,那么对应的风扇是Fan 1A;
如果System灯闪黄色的话,左边LED2常亮绿色并LED4常亮红色,那么对应的风扇是Fan 1B;

论坛徽章:
0
21 [报告]
发表于 2010-06-30 15:17 |只看该作者
风扇坏了

论坛徽章:
0
20 [报告]
发表于 2010-06-29 10:30 |只看该作者
Chassis fan failed.  Replace the fan.

风扇故障,更换吧

论坛徽章:
48
15-16赛季CBA联赛之青岛
日期:2021-01-07 13:41:2315-16赛季CBA联赛之上海
日期:2020-12-01 18:02:0720周年集字徽章-20	
日期:2020-10-28 14:14:2620周年集字徽章-20	
日期:2020-10-28 14:04:3015-16赛季CBA联赛之天津
日期:2020-10-18 22:51:412016猴年福章徽章
日期:2016-02-18 15:30:3415-16赛季CBA联赛之北控
日期:2015-12-22 13:30:48操作系统版块每日发帖之星
日期:2015-12-07 06:20:00操作系统版块每日发帖之星
日期:2015-09-04 06:20:002015亚冠之德黑兰石油
日期:2015-08-05 18:46:082015年亚洲杯之巴勒斯坦
日期:2015-04-19 10:42:502015年亚洲杯之巴林
日期:2015-04-09 08:03:23
19 [报告]
发表于 2010-06-29 07:40 |只看该作者
rx2660是没有默认MP LAN的IP地址的~

论坛徽章:
10
处女座
日期:2015-01-22 16:08:50技术图书徽章
日期:2018-09-13 11:25:52技术图书徽章
日期:2018-09-13 11:25:45技术图书徽章
日期:2018-09-13 11:25:37技术图书徽章
日期:2018-09-13 11:25:29黑曼巴
日期:2018-06-04 09:03:192017金鸡报晓
日期:2017-01-10 15:19:56极客徽章
日期:2016-12-07 14:03:402015年迎新春徽章
日期:2015-03-04 09:50:28技术图书徽章
日期:2018-09-13 11:26:01
18 [报告]
发表于 2010-06-26 18:03 |只看该作者
mp卡ip默认是192.168.1.1
主机后面有一个类似网口,上面标注的有MP的,网线直连,然后telnet上去
默认用户名密码是Admin/ Admin
然后进入cm,敲ps看具体是哪个风扇的问题

论坛徽章:
0
17 [报告]
发表于 2010-06-25 08:23 |只看该作者
回复 16# mac2008


    我试一下,多谢了!~
  

北京盛拓优讯信息技术有限公司. 版权所有 京ICP备16024965号-6 北京市公安局海淀分局网监中心备案编号:11010802020122 niuxiaotong@pcpop.com 17352615567
未成年举报专区
中国互联网协会会员  联系我们:huangweiwei@itpub.net
感谢所有关心和支持过ChinaUnix的朋友们 转载本站内容请注明原作者名及出处

清除 Cookies - ChinaUnix - Archiver - WAP - TOP