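weixiaoma0 posted on 2013-03-29 11:24

RX6600 fault

The machine is an RX6600. After logging in through the MP card I found this fault in the event log; the node had gone down.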


Log Entry 187: 27 Mar 2013 05:57:57

Alert Level 2: Informational
Keyword: HP-UX_BOOT_COMPLETE
HP-UX OS early boot is complete.
Logged by: HP-UX Kernel 0
Data: Major change in system state - Boot Complete
0x54801C2F00E010B0 0000000000001001


Log Entry 186: 27 Mar 2013 05:56:41
Alert Level 2: Informational
Keyword: BOOT_SWITCH_INSECURE_MODE
System has been switched to insecure mode
Logged by: System Firmware0
Data: Data field unused
0x40801CBB00E01090 0000000000000000


MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >

** Invalid input. **


Event Log Navigation Help:

   +       View next block   (forward in time, e.g. from 3 to 4)
   -       View previous block (backward in time, e.g. from 3 to 2)
   <CR>    Continue to the next or previous block
   D       Dump the entire log
   F       First entry
   L       Last entry
   J       Jump to entry number
   H       View mode configuration - Hex
   K       View mode configuration - Keyword
   T       View mode configuration - Text
   A       Alert Level Filter options
   U       Alert Level Unfiltered
   ?       Display this Help menu
   Q       Quit and return to the Event Log Viewer Menu
   Ctrl-B  Exit command, and return to the MP Main Menu


MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >



Log Entry 185: 27 Mar 2013 05:56:01
Alert Level 2: Informational
Keyword: BOOT_START
CPU starting boot
Logged by: System Firmware0
Data: Major change in system state
0x5480006300E01070 0000000000000000


Log Entry 184: 27 Mar 2013 05:56:01
Alert Level 2: Informational
Keyword: CPU_START_BOOT
CPU starting boot
Logged by: Redundant w/ an E0 code;
Sensor: System Boot Initiated
Data1: transition to Running
0xC1515289F1021060 FFFF000A001D0300


MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >



Log Entry 183: 27 Mar 2013 05:55:51
Alert Level 2: Informational
Keyword: SOFT_RESET
Soft Reset
Logged by: Baseboard Management Controller;
Sensor: System Event
0x20515289E7021050 FFFF027000120300


Log Entry 182: 27 Mar 2013 05:54:11
Alert Level 7: Fatal
Keyword: HP-UX_OS_CRITICAL_SHUTDOWN
HP-UX OS shutdown due to an MCA or INIT
Logged by: HP-UX Kernel 0
Data: Major change in system state - State Change
0xF4801C3100E01030 000000000019100C


MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >




Log Entry 181: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INITIATED
INIT initiated
Logged by: System Firmware6
Data: Major change in system state - BCH or EFI
0xF480007906E01010 0000000000000006


Log Entry 180: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INTERRUPT_INITIATED
INIT Initiated
Logged by: Redundant w/ an E0 code;
Sensor: Critical Interrupt
Data1: Software NMI
Data2: OEM Code1: 0x3F OEM Code2: 0x00
0xC151528982021000 003FA36F00130300


MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >



Log Entry 179: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INITIATED
INIT initiated
Logged by: System Firmware4
Data: Major change in system state - Housekeeping On
0xF480007904E00FE0 0000000000000004


Log Entry 178: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INTERRUPT_INITIATED
INIT Initiated
Logged by: Redundant w/ an E0 code;
Sensor: Critical Interrupt
Data1: Software NMI
Data2: OEM Code1: 0x3F OEM Code2: 0x00
0xC151528982020FD0 003FA36F00130300

HYMC posted on 2013-03-29 14:03

Open a repair case with HP; call their 400 hotline.

东方蜘蛛 posted on 2013-03-29 17:16

Probably out of warranty :-L If it were still covered, they wouldn't need to ask here...

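yesday007 posted on 2013-03-29 22:30

Those are all level-7 alerts, and by themselves they don't tell you much; the host rebooted automatically. Check whether the event log shows anything, and use CSTM to check whether the memory or other components have problems...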
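
To see the order of events at a glance, the pasted MP log can be reduced to entry number, time, alert level and keyword. The following is only a rough sketch in Python, assuming the text layout shown in the first post ("Log Entry N: date time", "Alert Level N: name", "Keyword: NAME"). The function names parse_mp_log and at_least are made up for illustration, and SAMPLE is a shortened copy of entries 181 to 183 from above.

```python
import re
from datetime import datetime

# Patterns for the three header lines of each entry, as they appear in the
# pasted MP:SL output above. Other firmware versions may format them differently.
ENTRY_RE = re.compile(r"^Log Entry (\d+): (\d{1,2} \w{3} \d{4} \d{2}:\d{2}:\d{2})$")
LEVEL_RE = re.compile(r"^Alert Level (\d+): (.+)$")
KEYWORD_RE = re.compile(r"^Keyword: (\S+)$")


def parse_mp_log(text):
    """Parse pasted MP event-log text into dicts, sorted by entry number (oldest first)."""
    entries = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        m = ENTRY_RE.match(line)
        if m:
            current = {
                "entry": int(m.group(1)),
                # %b expects English month names, i.e. the C/English locale.
                "time": datetime.strptime(m.group(2), "%d %b %Y %H:%M:%S"),
                "level": None,
                "keyword": None,
            }
            entries.append(current)
            continue
        if current is None:
            continue  # prompts, help text, etc. before the first entry
        m = LEVEL_RE.match(line)
        if m:
            current["level"] = int(m.group(1))
            continue
        m = KEYWORD_RE.match(line)
        if m:
            current["keyword"] = m.group(1)
    return sorted(entries, key=lambda e: e["entry"])


def at_least(entries, level):
    """Keep only entries whose alert level is known and >= level."""
    return [e for e in entries if e["level"] is not None and e["level"] >= level]


# Shortened copy of entries 181-183 from the first post.
SAMPLE = """\
Log Entry 183: 27 Mar 2013 05:55:51
Alert Level 2: Informational
Keyword: SOFT_RESET

Log Entry 182: 27 Mar 2013 05:54:11
Alert Level 7: Fatal
Keyword: HP-UX_OS_CRITICAL_SHUTDOWN

Log Entry 181: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INITIATED
"""

if __name__ == "__main__":
    for e in at_least(parse_mp_log(SAMPLE), 7):
        print(e["entry"], e["time"].isoformat(), e["keyword"])
```

Sorting by entry number rather than by timestamp keeps entries that share the same second (such as 178 to 181) in the order the MP logged them.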

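lbseraph posted on 2013-03-30 07:07

Judging from the above, a dump should have been generated, and the reboot (INIT) was quite possibly triggered by the cluster. Check whether a new crash.x has been created under /var/adm/crash; if so, ask HP to analyze the dump to find the cause.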
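
The check for a new crash.x can be scripted. This is a minimal sketch in Python, assuming Python is available on the host (it may not be on a stock HP-UX install) and that dump directories are named crash.N directly under /var/adm/crash as described above. The function name crash_dirs_since is hypothetical, and the cut-off time is taken from the INIT entries in the log (27 Mar 2013 05:54).

```python
import os
import re
from datetime import datetime

# Matches crash.0, crash.1, ... (the "crash.x" naming mentioned above).
CRASH_DIR_RE = re.compile(r"^crash\.(\d+)$")


def crash_dirs_since(root, since):
    """Return (name, mtime) pairs for crash.N entries under root that were
    modified at or after `since`, newest first. A missing or unreadable
    root yields an empty list."""
    found = []
    try:
        names = os.listdir(root)
    except OSError:
        return found
    for name in names:
        if not CRASH_DIR_RE.match(name):
            continue
        mtime = datetime.fromtimestamp(os.path.getmtime(os.path.join(root, name)))
        if mtime >= since:
            found.append((name, mtime))
    return sorted(found, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Cut-off just before the INIT events in the MP log (local time).
    for name, mtime in crash_dirs_since("/var/adm/crash", datetime(2013, 3, 27, 5, 54, 0)):
        print(name, mtime.isoformat())
```

An empty result only means no crash.N entry is newer than the cut-off; it does not by itself prove that no dump was attempted.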

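cjhvslhb posted on 2013-03-30 10:40

1. Check whether there is an MCA file;
2. Check /var/opt/resmon/log/event.log;
3. Run crashinfo -v
That is roughly the approach for analyzing a machine that reboots for no apparent reason.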
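
The three checks above can be collected into one quick report before calling support. This is a rough sketch in Python, not an HP tool. It assumes MCA files land under /var/tombstones (the location a later reply in this thread mentions), and it only tests whether crashinfo is on the PATH instead of running it. The function name triage_report and the report keys are made up for illustration.

```python
import glob
import os
import shutil


def triage_report(root="/"):
    """Report which pieces of evidence from the checklist are present.

    `root` lets the check run against a copied file tree; "/" means the
    live system. Nothing is executed and nothing is modified.
    """
    mca_dir = os.path.join(root, "var/tombstones")  # assumed MCA location
    event_log = os.path.join(root, "var/opt/resmon/log/event.log")
    return {
        # 1. Any files in the assumed MCA directory.
        "mca_files": sorted(glob.glob(os.path.join(mca_dir, "*"))),
        # 2. Whether the event log from the checklist exists.
        "event_log_present": os.path.isfile(event_log),
        # 3. Whether the crashinfo tool can be found on the PATH.
        "crashinfo_on_path": shutil.which("crashinfo") is not None,
    }


if __name__ == "__main__":
    for key, value in triage_report().items():
        print(key, "=", value)
```

The report only says what exists; interpreting the MCA files and the dump is still a job for HP Support.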

weixiaoma0 posted on 2013-04-01 10:43

I checked those earlier; the hardware and memory are fine.

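uriyliu posted on 2013-04-01 11:09

For an MCA, you can check whether any new files have been generated under /var/tombstones, but those files need to be analyzed by HP Support with their tools.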

weixiaoma0 posted on 2013-04-01 13:36

OK, got it! Thanks.