
[Help] RP5470 shuts itself down. Experts, what should I do? Please help!

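Posted 2013-09-04 08:35
The server shut itself down for no apparent reason. After a manual restart it has run for 3 days without any problem, and I cannot tell what caused the shutdown. Could someone experienced please take a look?
The collected logs are attached.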

MP_10.178.151.62_20130902_080639.rar

10.99 KB, downloads: 29

sysinfo_yfjc11_201309020751.tgz.gz

502.1 KB, downloads: 20

Posted 2013-09-04 09:13
An automatic shutdown is most likely caused by overheating; the hardware has a self-protection function.

Posted 2013-09-04 09:20
Reply to #2 Purple_Grape
The machine room is temperature-controlled, so overheating should not be possible, and I did not see any temperature alarms either.

   

Posted 2013-09-04 09:28
Last edited by Purple_Grape on 2013-09-04 09:34

Based on the extracted logs, this may be a hardware power supply problem.

ALERT LEVEL: 13 = System hang detected via timer popping
ALERT LEVEL: 14 = Fatal power or environmental problem prevents operation
CALLER SUBACTIVITY: 04 = low voltage power supply
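
The lines quoted above can also be pulled out of a saved GSP session log with a short script. What follows is only a sketch: the field names come from the log posted in this thread, while the function name `parse_entries` and the regular expressions are illustrative assumptions, not part of any HP tool.

```python
import re

# Excerpt copied from the GSP error log posted in this thread.
SAMPLE = """\
Log Entry #   0 :
DATE: 08/29/2013 TIME: 16:53:15
ALERT LEVEL: 13 = System hang detected via timer popping
SOURCE: 1 = processor
PROBLEM DETAIL: 4 = timeout
Log Entry #   1 :
DATE: 08/29/2013 TIME: 16:53:22
ALERT LEVEL: 14 = Fatal power or environmental problem prevents operation
SOURCE: 4 = power
PROBLEM DETAIL: 0 = no problem detail
Log Entry #   2 :
DATE: 08/29/2013 TIME: 16:53:22
ALERT LEVEL: 14 = Fatal power or environmental problem prevents operation
SOURCE: 4 = power
PROBLEM DETAIL: 4 = output undervoltage
"""

# Hypothetical patterns, written against the layout seen in this log only.
ENTRY_RE = re.compile(r"^Log Entry #\s*(\d+)\s*:", re.MULTILINE)
WHEN_RE = re.compile(r"DATE:\s*(\S+)\s+TIME:\s*(\S+)")
FIELD_RE = re.compile(
    r"^[ \t]*(ALERT LEVEL|SOURCE|PROBLEM DETAIL):\s*(\w+)\s*=\s*(.+?)\s*$",
    re.MULTILINE,
)


def parse_entries(text):
    """Split a GSP log dump into one dict per 'Log Entry' block."""
    entries = []
    # Splitting on a pattern with one group yields
    # [preamble, number, body, number, body, ...].
    parts = ENTRY_RE.split(text)
    for num, body in zip(parts[1::2], parts[2::2]):
        entry = {"entry": int(num)}
        when = WHEN_RE.search(body)
        if when:
            entry["date"], entry["time"] = when.groups()
        for name, code, desc in FIELD_RE.findall(body):
            # Keep the code as text; the script does not interpret it.
            entry[name.lower()] = (code, desc)
        entries.append(entry)
    return entries


if __name__ == "__main__":
    for e in parse_entries(SAMPLE):
        if e["source"][1] == "power":
            print(e["entry"], e["date"], e["time"], e["problem detail"][1])
```

Filtering on the SOURCE description rather than on the numeric alert level avoids having to assume how the levels are ordered.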

Posted 2013-09-04 09:30
=~=~=~=~=~=~=~=~=~=~=~= PuTTY log 2013.09.02 08:06:39 =~=~=~=~=~=~=~=~=~=~=~=

Service Processor login: Admin

Service Processor password:




                  Hewlett-Packard Guardian Service Processor


    (c) Copyright Hewlett-Packard Company 1999-2002.  All Rights Reserved.

                          System Name: yfjc11MP



[bumped user -  ]


Leaving Console Mode - you may lose write access.
When Console Mode returns, type ^Ecf to get console write access.


GSP Host Name:  yfjc11MP

GSP> sl



SL


Select Chassis Code Buffer to be displayed:

Incoming, Activity, Error, Current boot or Last boot? (I/A/E/C/L) e

e


Set up filter options on this buffer? (Y/[N]) n

n


The first entry is the most recent Chassis Code

Type + CR and CR to go up (back in time),

Type - CR and CR to go down (forward in time),

Type Q/q CR to quit.



Log Entry #   0 :

SYSTEM NAME: yfjc11MP

DATE: 08/29/2013 TIME: 16:53:15

ALERT LEVEL: 13 = System hang detected via timer popping


SOURCE: 1 = processor

SOURCE DETAIL: 1 = processor general   SOURCE ID: 0

PROBLEM DETAIL: 4 = timeout


CALLER ACTIVITY: F = display_activity() update   STATUS: 0

CALLER SUBACTIVITY: 00 = implementation dependent

REPORTING ENTITY TYPE: E = HP-UX   REPORTING ENTITY ID: 00


0x78E000D41100F000 00000003 00000000 type 15 = Activity Level/Timeout

0x58E008D41100F000 00007107 1D10350F type 11 = Timestamp 08/29/2013 16:53:15

Type CR for next entry, Q CR to quit.




Log Entry #   1 :

SYSTEM NAME: yfjc11MP

DATE: 08/29/2013 TIME: 16:53:22

ALERT LEVEL: 14 = Fatal power or environmental problem prevents operation


SOURCE: 4 = power

SOURCE DETAIL: 4 = high voltage DC power   SOURCE ID: FF

PROBLEM DETAIL: 0 = no problem detail


CALLER ACTIVITY: 4 = monitor   STATUS: F

CALLER SUBACTIVITY: 04 = low voltage power supply

REPORTING ENTITY TYPE: 2 = power monitor   REPORTING ENTITY ID: 00


0x002000E044FF404F 00000000 00000000 type  0 = Data Field Unused

0x582008E044FF404F 00007107 1D103516 type 11 = Timestamp 08/29/2013 16:53:22

Type CR for next entry, - CR for previous entry, Q CR to quit.




Log Entry #   2 :

SYSTEM NAME: yfjc11MP

DATE: 08/29/2013 TIME: 16:53:22

ALERT LEVEL: 14 = Fatal power or environmental problem prevents operation


SOURCE: 4 = power

SOURCE DETAIL: 4 = high voltage DC power   SOURCE ID: FF

PROBLEM DETAIL: 4 = output undervoltage


CALLER ACTIVITY: 4 = monitor   STATUS: F

CALLER SUBACTIVITY: 04 = low voltage power supply

REPORTING ENTITY TYPE: 2 = power monitor   REPORTING ENTITY ID: 00


0x002000E444FF404F 00000000 00000000 type  0 = Data Field Unused

0x582008E444FF404F 00007107 1D103516 type 11 = Timestamp 08/29/2013 16:53:22

Type CR for next entry, - CR for previous entry, Q CR to quit.

Posted 2013-09-04 09:30
Last edited by gzy512 on 2013-09-04 09:31

The above is what the MP shows.
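
The raw words on each "type 11 = Timestamp" line in the MP log appear to encode the date printed next to them. Judging only from the two distinct timestamps in that log (00007107 1D10350F for 08/29/2013 16:53:15 and 00007107 1D103516 for 08/29/2013 16:53:22), the first word seems to hold the year as an offset from 1900 and a zero-based month, and the second word holds day, hour, minute and second, one byte each. This layout is an inference from this log, not something taken from HP documentation, and the function name below is hypothetical.

```python
from datetime import datetime


def decode_chassis_timestamp(word1: str, word2: str) -> datetime:
    """Decode the two data words of a 'type 11 = Timestamp' chassis code.

    Assumed layout (inferred from the log in this thread):
      word1: 0x0000YYMM  YY = years since 1900, MM = month counted from 0
      word2: 0xDDHHMMSS  day, hour, minute, second
    """
    w1 = int(word1, 16)
    w2 = int(word2, 16)
    year = 1900 + ((w1 >> 8) & 0xFF)
    month = (w1 & 0xFF) + 1  # stored zero-based, so August is 7
    day = (w2 >> 24) & 0xFF
    hour = (w2 >> 16) & 0xFF
    minute = (w2 >> 8) & 0xFF
    second = w2 & 0xFF
    return datetime(year, month, day, hour, minute, second)


if __name__ == "__main__":
    # Words copied from Log Entry #0 and Log Entry #1 above.
    print(decode_chassis_timestamp("00007107", "1D10350F"))
    print(decode_chassis_timestamp("00007107", "1D103516"))
```

Both decoded values agree with the DATE and TIME fields of their entries, which is the only evidence for the assumed layout.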

Posted 2013-09-04 09:36
The ts99 information is all zeros.
HP-UX yfjc11 B.11.11 U 9000/800 1158424656

CPU-ID( Model ) = 0x13

-------  Processor 0 HPMC Information - PDC Version: 42.19  ------

   * * * No valid timestamp * * *


       No HPMC chassis codes logged

General Registers 0 - 31
00-03  0000000000000000  0000000000000000  0000000000000000  0000000000000000
04-07  0000000000000000  0000000000000000  0000000000000000  0000000000000000
08-11  0000000000000000  0000000000000000  0000000000000000  0000000000000000
12-15  0000000000000000  0000000000000000  0000000000000000  0000000000000000
16-19  0000000000000000  0000000000000000  0000000000000000  0000000000000000
20-23  0000000000000000  0000000000000000  0000000000000000  0000000000000000
24-27  0000000000000000  0000000000000000  0000000000000000  0000000000000000
28-31  0000000000000000  0000000000000000  0000000000000000  0000000000000000


Control Registers 0 - 31
00-03  0000000000000000  0000000000000000  0000000000000000  0000000000000000
04-07  0000000000000000  0000000000000000  0000000000000000  0000000000000000
08-11  0000000000000000  0000000000000000  0000000000000000  0000000000000000
12-15  0000000000000000  0000000000000000  0000000000000000  0000000000000000
16-19  0000000000000000  0000000000000000  0000000000000000  0000000000000000
20-23  0000000000000000  0000000000000000  0000000000000000  0000000000000000
24-27  0000000000000000  0000000000000000  0000000000000000  0000000000000000
28-31  0000000000000000  0000000000000000  0000000000000000  0000000000000000

Space Registers 0 - 7
00-03  00000000          00000000          00000000          00000000
04-07  00000000          00000000          00000000          00000000


IIA Space (back entry)       = 0x0000000000000000
IIA Offset (back entry)      = 0x0000000000000000
Check Type                   = 0x00000000
CPU State                    = 0x00000000
Cache Check                  = 0x00000000
TLB Check                    = 0x00000000
Bus Check                    = 0x00000000
Assists Check                = 0x00000000
Assist State                 = 0x00000000
Path Info                    = 0x00000000
System Responder Address     = 0x0000000000000000
System Requestor Address     = 0x0000000000000000


Floating Point Registers 0 - 31
00-03  0000000000000000  0000000000000000  0000000000000000  0000000000000000
04-07  0000000000000000  0000000000000000  0000000000000000  0000000000000000
08-11  0000000000000000  0000000000000000  0000000000000000  0000000000000000
12-15  0000000000000000  0000000000000000  0000000000000000  0000000000000000
16-19  0000000000000000  0000000000000000  0000000000000000  0000000000000000
20-23  0000000000000000  0000000000000000  0000000000000000  0000000000000000
24-27  0000000000000000  0000000000000000  0000000000000000  0000000000000000
28-31  0000000000000000  0000000000000000  0000000000000000  0000000000000000


Check Summary                = 0x0000000000000000
Available Memory             = 0x0000000000000000
CPU Diagnose Register 2      = 0x0000000000000000
CPU Status Register 0        = 0x0000000000000000
CPU Status Register 1        = 0x0000000000000000
SADD LOG                     = 0x0000000000000000
Read Short LOG               = 0x0000000000000000



-----------------  DEW 0 HPMC Information -  ------

   No DEW errors logged


--------------  Memory Error Log Information  --------------

Bus 0 Log Information


   No errors logged for this bus


Bus 1 Log Information


   No errors logged for this bus


------------  I/O Module Error Log Information  ------------



   No I/O module errors logged



-------  Processor 2 HPMC Information - PDC Version: 42.19  ------

   * * * No valid timestamp * * *


       No HPMC chassis codes logged

General Registers 0 - 31
00-03  0000000000000000  0000000000000000  0000000000000000  0000000000000000
04-07  0000000000000000  0000000000000000  0000000000000000  0000000000000000
08-11  0000000000000000  0000000000000000  0000000000000000  0000000000000000
12-15  0000000000000000  0000000000000000  0000000000000000  0000000000000000
16-19  0000000000000000  0000000000000000  0000000000000000  0000000000000000
20-23  0000000000000000  0000000000000000  0000000000000000  0000000000000000
24-27  0000000000000000  0000000000000000  0000000000000000  0000000000000000
28-31  0000000000000000  0000000000000000  0000000000000000  0000000000000000


Control Registers 0 - 31
00-03  0000000000000000  0000000000000000  0000000000000000  0000000000000000
04-07  0000000000000000  0000000000000000  0000000000000000  0000000000000000
08-11  0000000000000000  0000000000000000  0000000000000000  0000000000000000
12-15  0000000000000000  0000000000000000  0000000000000000  0000000000000000
16-19  0000000000000000  0000000000000000  0000000000000000  0000000000000000
20-23  0000000000000000  0000000000000000  0000000000000000  0000000000000000
24-27  0000000000000000  0000000000000000  0000000000000000  0000000000000000
28-31  0000000000000000  0000000000000000  0000000000000000  0000000000000000

Space Registers 0 - 7
00-03  00000000          00000000          00000000          00000000
04-07  00000000          00000000          00000000          00000000


IIA Space (back entry)       = 0x0000000000000000
IIA Offset (back entry)      = 0x0000000000000000
Check Type                   = 0x00000000
CPU State                    = 0x00000000
Cache Check                  = 0x00000000
TLB Check                    = 0x00000000
Bus Check                    = 0x00000000
Assists Check                = 0x00000000
Assist State                 = 0x00000000
Path Info                    = 0x00000000
System Responder Address     = 0x0000000000000000
System Requestor Address     = 0x0000000000000000


Floating Point Registers 0 - 31
00-03  0000000000000000  0000000000000000  0000000000000000  0000000000000000
04-07  0000000000000000  0000000000000000  0000000000000000  0000000000000000
08-11  0000000000000000  0000000000000000  0000000000000000  0000000000000000
12-15  0000000000000000  0000000000000000  0000000000000000  0000000000000000
16-19  0000000000000000  0000000000000000  0000000000000000  0000000000000000
20-23  0000000000000000  0000000000000000  0000000000000000  0000000000000000
24-27  0000000000000000  0000000000000000  0000000000000000  0000000000000000
28-31  0000000000000000  0000000000000000  0000000000000000  0000000000000000


Check Summary                = 0x0000000000000000
Available Memory             = 0x0000000000000000
CPU Diagnose Register 2      = 0x0000000000000000
CPU Status Register 0        = 0x0000000000000000
CPU Status Register 1        = 0x0000000000000000
SADD LOG                     = 0x0000000000000000
Read Short LOG               = 0x0000000000000000



-----------------  DEW 2 HPMC Information -  ------

   No DEW errors logged


--------------  Memory Error Log Information  --------------

Bus 0 Log Information


   No errors logged for this bus


Bus 1 Log Information


   No errors logged for this bus


------------  I/O Module Error Log Information  ------------



   No I/O module errors logged



WARNING:  The non-destructive test bit was set, so memory was not tested
            destructively.  Information only, no action required.

        Module              Revision

        ------              --------

        System Board        A14219

        PA 8700 CPU Module  3.1

        PA 8700 CPU Module  3.1
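
Rather than reading every register row by eye, the "all zeros" observation can be checked with a small script. This is a sketch that assumes the dump has been saved as text in the layout shown above; the function name `nonzero_values` and both patterns are illustrative, not part of any HP utility.

```python
import re

# A few lines copied from the dump above, as a stand-in for the whole file.
SAMPLE = """\
CPU-ID( Model ) = 0x13
General Registers 0 - 31
00-03  0000000000000000  0000000000000000  0000000000000000  0000000000000000
Space Registers 0 - 7
00-03  00000000          00000000          00000000          00000000
Check Type                   = 0x00000000
CPU State                    = 0x00000000
Check Summary                = 0x0000000000000000
"""

# Register rows look like "00-03  <hex> <hex> <hex> <hex>".
REG_ROW_RE = re.compile(r"^\s*(\d{2}-\d{2})((?:\s+[0-9A-Fa-f]{8,16})+)\s*$")
# Named values look like "Check Type   = 0x00000000".
NAMED_RE = re.compile(r"^\s*(.+?)\s*=\s*0x([0-9A-Fa-f]+)\s*$")


def nonzero_values(dump):
    """Return (label, value) pairs for every hex value that is not zero."""
    hits = []
    for line in dump.splitlines():
        row = REG_ROW_RE.match(line)
        if row:
            for word in row.group(2).split():
                if int(word, 16) != 0:
                    hits.append((row.group(1), word))
            continue
        named = NAMED_RE.match(line)
        if named and int(named.group(2), 16) != 0:
            hits.append((named.group(1), "0x" + named.group(2)))
    return hits


if __name__ == "__main__":
    for label, value in nonzero_values(SAMPLE):
        print(label, value)
```

On the sample, the only value reported is the CPU-ID line, which is an identifier rather than an error register, so it is consistent with the poster's remark that no HPMC data was recorded.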


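Posted 2013-09-04 09:37
Reply to #4 Purple_Grape
Shouldn't the power supplies be redundant? This machine has dual power supplies. Or do both of them have a problem? Is there any way to test the power supplies?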

   

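Posted 2013-09-04 09:39
Last edited by Purple_Grape on 2013-09-04 10:00

According to your shutdown logs, the machine has been in service since April 2004, so it is already 9 years old. Combined with the earlier log analysis, the power supply is definitely at fault.

As for which power supply has failed, that takes hands-on hardware checks in the machine room: repeatedly pull and reseat one of them, or simply replace both.

It is also quite possible that one part of a power supply has failed, for example a fan that no longer spins, which leads to overheating and then the machine stops.

See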
http://bbs.chinaunix.net/thread-1449209-1-1.html

Posted 2013-09-04 10:03
Over time the fan fills with dust and can no longer spin, the power supply overheats, and the machine stops.