- 论坛徽章:
- 0
|
ES40 Tru64 4.0F TruCluster环境
(是否有代码手册之类的文档帮助分析)
使用wsea tra命令分析binary.errlog文件,wccat gui 分析crash文件有如下信息:
wsea tra:
Event: 430
Description: Tru64 UNIX Panic ASCII Message at Mon 13 Dec 2004 12:05:07 GMT+08:00 from xxx2
File: /var/adm/binary.errlog
================================================================================
COMMON EVENT HEADER (CEH) V2.0
Event_Leader xFFFF FFFE
Header_Length 252
Event_Length 312
Header_Rev_Major 2
Header_Rev_Minor 0
OS_Type 1 -- Tru64 UNIX
Hardware_Arch 4 -- Alpha
CEH_Vendor_ID 3,564 -- Hewlett-Packard Company
Hdwr_Sys_Type 34 -- Tsunami/Typhoon Corelogic
Logging_CPU 3 -- CPU Logging this Event
CPUs_In_Active_Set 4
Major_Class 302
Minor_Class 255
Entry_Type 302 -- Tru64 UNIX Panic ASCII Message
DSR_Msg_Num 1,820 -- AlphaServer ES40
.... CPU Slots: 4 (667Mhz)
.... PCI Slots: 10
.... MMB Slots: 8 (DIMMs)
Chip_Type 11 -- EV67 - 21264A
CEH_Device 255
CEH_Device_ID_0 x0000 03FF
CEH_Device_ID_1 x0000 0007
CEH_Device_ID_2 x0000 0007
Unique_ID_Count 211
Unique_ID_Prefix 49,228
Num_Strings 5
TLV Section of CEH
TLV_DSR_String AlphaServer ES40
TLV_OS_Version Digital UNIX V4.0F (Rev. 1229)
TLV_Sys_Serial_Num S31xxxxxxx
TLV_Time_as_Local Mon 13 Dec 2004 12:05:07 GMT+08:00
TLV_Computer_Name xxx2
Entry_Type 302
Tru64 UNIX Panic ASCII Message
Panic_Message_Header **** START Panic ASCII Message of Length: 36 Bytes ****
Panic_ASCII_Message panic (cpu 3): kernel memory fault
Event: 431
Description: Console Data Log Event at Tue 14 Dec 2004 09:55:38 GMT+08:00 from xxx2
File: /var/adm/binary.errlog
================================================================================
COMMON EVENT HEADER (CEH) V2.0
Event_Leader xFFFF FFFE
Header_Length 252
Event_Length 432
Header_Rev_Major 2
Header_Rev_Minor 0
OS_Type 1 -- Tru64 UNIX
Hardware_Arch 4 -- Alpha
CEH_Vendor_ID 3,564 -- Hewlett-Packard Company
Hdwr_Sys_Type 34 -- Tsunami/Typhoon Corelogic
Logging_CPU 0 -- CPU Logging this Event
CPUs_In_Active_Set 1
Major_Class 113
Minor_Class 0
Entry_Type 113 -- Console Data Log Event
DSR_Msg_Num 1,820 -- AlphaServer ES40
.... CPU Slots: 4 (667Mhz)
.... PCI Slots: 10
.... MMB Slots: 8 (DIMMs)
Chip_Type 11 -- EV67 - 21264A
CEH_Device 255
CEH_Device_ID_0 x0000 03FF
CEH_Device_ID_1 x0000 0007
CEH_Device_ID_2 x0000 0007
Unique_ID_Count 0
Unique_ID_Prefix 49,228
Num_Strings 5
TLV Section of CEH
TLV_DSR_String AlphaServer ES40
TLV_OS_Version Digital UNIX V4.0F (Rev. 1229)
TLV_Sys_Serial_Num S31xxxxxxx
TLV_Time_as_Local Tue 14 Dec 2004 09:55:38 GMT+08:00
TLV_Computer_Name xxx2
Entry_Type 113
Console_Data_log
START OF SUBPACKETS IN THIS EVENT
System Event Frame Header Subpacket - V1.0
Time_Stamp x0000 340C 0E01 3422 Time Stamp
Seconds[7] 34 Seconds
Minutes[15] 52 Minutes
Hours[23] 1 Hours Unix = GMT Ovms = Local
Day[31] 14 Day
Month[39] 12 December
Year[47] 52 2004
Fatal Environmental Error Frame Subpacket, Version 1
Cpu_Whami x0000 0000 0000 0000 CPU 0
Environmental Logout Frame, Version 1
Frame_Size x0000 0070
Frame_Flags x0000 0000
CPU_Area_Offset x0000 0018
System_Area_Offset x0000 0018
Mchk_Error_Code x0000 0206 Machine Check Logout Frame Error Code
Value[31] x206 Environmental Fatal or Non-Fatal
Frame_Rev x0000 0001
SW_Sum_Flags x0000 0000 0000 0000 Software Summary Flags Register
Cchip_DIR x0084 0000 0000 0000 Cchip Device Interrupt Request Register
Env_Cor_Err[50] x1 Environmental Error Detected
ES4X_Logout_Frame_System_Section
Environ_QW_1_ES40 x0000 0000 0000 0008 TIG SMIR Register
RMC_Cor_Evn[3] x1 Environmental FAIL/WARNING DETECTED
Environ_QW_2_ES40 x0000 0000 0000 000F TIG CPUIR Register
CPU0_Reg_Enabled[0]x1 CPU0 Regulator Enabled
CPU1_Reg_Enabled[1] x1 CPU1 Regulator Enabled
CPU2_Reg_Enabled[2] x1 CPU2 Regulator Enabled
CPU3_Reg_Enabled[3] x1 CPU3 Regulator Enabled
Environ_QW_3_ES40 x0000 0000 0000 0007 TIG PSIR Register
PS0_Enabled[0] x1 Power Supply 0 Enabled
PS1_Enabled[1] x1 Power Supply 1 Enabled
PS2_Enabled[2] x1 Power Supply 2 Enabled
Environ_QW_4_ES40 x0000 0000 0000 0000 No Non-Fatal Errors Detected
PS_Causng_Warng[41]x0 Bulk PS0 (See [47] for applicable detail)
PS_Temp_Warng[45] x0 Internal Temperature Normal
PS_AC_Low_Warng[46] x0 AC Input Low Limit Normal
PS_AC_high_Warng[47]x0 AC Input High Limit Normal
Environ_QW_5_ES40 x0000 0000 0000 0000 System Doors Activation Register
Environ_QW_6_ES40 x0000 0000 0000 0000 No System Temperature Warnings Detected
Environ_QW_7_ES40 x0000 0000 0000 0100 System Cooling Environmental Register
Fan5_6_Speed_Max[8] x1 Fan 5 Speed Detected at Maximum
Environ_QW_8_ES40 x0000 0000 0000 0000 No Fatal Errors Detected
wccat gui :
---------- - kernel memory fault Digital UNIX V4.0F Node: xxx2 ----------
Full Description:
---- Number of Rules Matching this Case ----
Rule Match Count: 0
---- Source Rule Info. ----
Source Rule Set: Tru64_Unix_RULES_Generic: 5/16/2003
Tru64_Unix_RULES_V40F: 5/16/2003
---- Rule Match Results ---
Status: UNIDENTIFIED
Evidence:
Tru64_Unix_Main Tru64_Unix_Generic Tru64_Unix_V4.0F
PHYSICAL_MEMORY: 4095
STACK_TRACE: stop_secondary_cpu panic event_timeout xcpu_puts printf panic trap _XentMM
RETURN_ADDR_I_MODULE:
CRASH_TIME: 12/14/2004 09:55:48
PANIC_STRING: kernel memory fault
PC_I_MODULE:
AVAILABLE_CPUS: 4
SAVED_EXCEPT_FRAME_PTR:
ARCHITECTURE: axp
UPTIME: 2.77 hours
HOSTNAME: xxx2
CRASH_ANALYSIS: gui
KMF_FAULTING_PC: 0xfffffb00003fef00
PANIC_CPU: 3
FAULT_VIRT_ADDRESS: fffffb00003fef00
FIRMWARE_REV:
OS_VERSION: V4.0F
SYSTEM_STRING: ES40
NUMBER_OF_CPUS: 4
OPERATING_SYSTEM: Tru64 Unix
************ End of Message ************ |
|