RX6600 fault
The machine is an RX6600. After logging in to the MP card I found the following fault; the node had gone down.
Log Entry 187: 27 Mar 2013 05:57:57
Alert Level 2: Informational
Keyword: HP-UX_BOOT_COMPLETE
HP-UX OS early boot is complete.
Logged by: HP-UX Kernel 0
Data: Major change in system state - Boot Complete
0x54801C2F00E010B0 0000000000001001
Log Entry 186: 27 Mar 2013 05:56:41
Alert Level 2: Informational
Keyword: BOOT_SWITCH_INSECURE_MODE
System has been switched to insecure mode
Logged by: System Firmware0
Data: Data field unused
0x40801CBB00E01090 0000000000000000
MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >
** Invalid input. **
Event Log Navigation Help:
+ View next block (forward in time,e.g. from 3 to 4)
- View previous block (backward in time, e.g. from 3 to 2)
<CR> Continue to the next or previous block
D Dump the entire log
F First entry
L Last entry
J Jump to entry number
H View mode configuration - Hex
K View mode configuration - Keyword
T View mode configuration - Text
A Alert Level Filter options
U Alert Level Unfiltered
? Display this Help menu
Q Quit and return to the Event Log Viewer Menu
Ctrl-B Exit command, and return to the MP Main Menu
MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >
Log Entry 185: 27 Mar 2013 05:56:01
Alert Level 2: Informational
Keyword: BOOT_START
CPU starting boot
Logged by: System Firmware0
Data: Major change in system state
0x5480006300E01070 0000000000000000
Log Entry 184: 27 Mar 2013 05:56:01
Alert Level 2: Informational
Keyword: CPU_START_BOOT
CPU starting boot
Logged by: Redundant w/ an E0 code;
Sensor: System Boot Initiated
Data1: transition to Running
0xC1515289F1021060 FFFF000A001D0300
MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >
Log Entry 183: 27 Mar 2013 05:55:51
Alert Level 2: Informational
Keyword: SOFT_RESET
Soft Reset
Logged by: Baseboard Management Controller;
Sensor: System Event
0x20515289E7021050 FFFF027000120300
Log Entry 182: 27 Mar 2013 05:54:11
Alert Level 7: Fatal
Keyword: HP-UX_OS_CRITICAL_SHUTDOWN
HP-UX OS shutdown due to an MCA or INIT
Logged by: HP-UX Kernel 0
Data: Major change in system state - State Change
0xF4801C3100E01030 000000000019100C
MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >
Log Entry 181: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INITIATED
INIT initiated
Logged by: System Firmware6
Data: Major change in system state - BCH or EFI
0xF480007906E01010 0000000000000006
Log Entry 180: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INTERRUPT_INITIATED
INIT Initiated
Logged by: Redundant w/ an E0 code;
Sensor: Critical Interrupt
Data1: Software NMI
Data2: OEM Code1: 0x3F OEM Code2: 0x00
0xC151528982021000 003FA36F00130300
MP:SL (+,-,<CR>,D, F, L, J, H, K, T, A, U, ? for Help, Q or Ctrl-B to Quit) >
Log Entry 179: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INITIATED
INIT initiated
Logged by: System Firmware4
Data: Major change in system state - Housekeeping On
0xF480007904E00FE0 0000000000000004
Log Entry 178: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INTERRUPT_INITIATED
INIT Initiated
Logged by: Redundant w/ an E0 code;
Sensor: Critical Interrupt
Data1: Software NMI
Data2: OEM Code1: 0x3F OEM Code2: 0x00
0xC151528982020FD0 003FA36F00130300
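The entries above are listed newest first, which makes the sequence (INIT, critical shutdown, soft reset, boot) harder to follow. Below is a minimal sketch, assuming the log has been saved as plain text in the same "Log Entry / Alert Level / Keyword" layout shown above, that re-sorts the entries oldest first and filters by alert level. The function names `parse_sel` and `timeline` are made up for illustration, and date parsing assumes an English/C locale for month names.

```python
import re
from datetime import datetime

# Patterns follow the text layout of the MP event log shown in this thread.
ENTRY_RE = re.compile(r"Log Entry (\d+): (\d{1,2} \w{3} \d{4} \d{2}:\d{2}:\d{2})")
LEVEL_RE = re.compile(r"Alert Level (\d+): (.+)")
KEYWORD_RE = re.compile(r"Keyword: (\S+)")


def parse_sel(text):
    """Return one dict per 'Log Entry' block found in the pasted log text."""
    entries = []
    current = None
    for line in text.splitlines():
        m = ENTRY_RE.search(line)
        if m:
            current = {
                "entry": int(m.group(1)),
                "time": datetime.strptime(m.group(2), "%d %b %Y %H:%M:%S"),
                "level": None,
                "keyword": None,
            }
            entries.append(current)
            continue
        if current is None:
            continue  # ignore prompts or text before the first entry
        m = LEVEL_RE.search(line)
        if m:
            current["level"] = int(m.group(1))
            continue
        m = KEYWORD_RE.search(line)
        if m:
            current["keyword"] = m.group(1)
    return entries


def timeline(text, min_level=0):
    """Entries at or above min_level, sorted oldest first by entry number."""
    kept = [e for e in parse_sel(text) if (e["level"] or 0) >= min_level]
    return sorted(kept, key=lambda e: e["entry"])


# A few entries copied from the log above, trimmed to the parsed fields.
SAMPLE = """\
Log Entry 183: 27 Mar 2013 05:55:51
Alert Level 2: Informational
Keyword: SOFT_RESET
Log Entry 182: 27 Mar 2013 05:54:11
Alert Level 7: Fatal
Keyword: HP-UX_OS_CRITICAL_SHUTDOWN
Log Entry 181: 27 Mar 2013 05:54:10
Alert Level 7: Fatal
Keyword: INIT_INITIATED
"""

if __name__ == "__main__":
    for e in timeline(SAMPLE, min_level=7):
        print(e["entry"], e["time"].isoformat(sep=" "), e["keyword"])
```

With `min_level=7` only the Fatal entries remain, so the INIT events and the critical shutdown that followed them can be read in order.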
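Open a service call with HP; call their 400 hotline.

Probably out of warranty :-L If it were still covered, there would be no need to ask here...

These are all level-7 alerts, which do not tell you much by themselves. The host rebooted on its own. Check whether the event log shows anything, and use CSTM to check whether memory or other components have problems.

From the log above, a dump should have been generated, and the reboot (INIT) was very likely triggered by the cluster. Check whether a new crash.x has appeared under /var/adm/crash; if so, have HP analyze the dump to find the cause.

1. Check whether there is an MCA file;
2. /var/opt/resmon/log/event.log;
3. crashinfo -v
That is roughly the approach for analyzing a machine that reboots for no apparent reason.

I checked earlier; the hardware and memory are fine.

For an MCA, check whether new files have been generated under /var/tombstones, though HP Support needs its own tools to analyze such a file.

OK, got it. Thanks!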
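The checks suggested in the replies mostly come down to "did anything new appear in these locations around the time of the reboot?". Here is a rough sketch of that check, assuming a Python interpreter is available on the host (not a given on HP-UX) and that the MP timestamp is in the same time zone as the OS clock. The helper name `recent_files` is invented for this example; the paths are the ones named in the thread. Running `crashinfo -v` and analyzing any dump remain manual steps.

```python
import os
import time

# Locations mentioned in the replies.
CANDIDATES = [
    "/var/adm/crash",
    "/var/tombstones",
    "/var/opt/resmon/log/event.log",
]


def recent_files(path, since_epoch):
    """Return sorted (mtime, path) tuples for files modified at or after since_epoch.

    `path` may be a single file or a directory (walked recursively).
    A missing path yields an empty list.
    """
    if os.path.isfile(path):
        paths = [path]
    elif os.path.isdir(path):
        paths = [
            os.path.join(root, name)
            for root, _dirs, files in os.walk(path)
            for name in files
        ]
    else:
        return []
    hits = []
    for p in paths:
        try:
            mtime = os.path.getmtime(p)
        except OSError:
            continue  # unreadable or vanished file: skip it
        if mtime >= since_epoch:
            hits.append((mtime, p))
    return sorted(hits)


if __name__ == "__main__":
    # Time of the first INIT entry in the MP log (assumed to be local time).
    since = time.mktime(time.strptime("27 Mar 2013 05:54:10", "%d %b %Y %H:%M:%S"))
    for path in CANDIDATES:
        for mtime, p in recent_files(path, since):
            stamp = time.strftime("%Y-%m-%d %H:%M:%S", time.localtime(mtime))
            print(stamp, p)
```

If nothing is printed, no file in those locations has changed since the INIT, which would suggest that no dump or tombstone was written for this event.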