X4600服务器panic
X4600M2服务器出现panicexplorer里什么都没留下,通过把ILOM改成 命令行输出得到了下面的错误日志
ILOM里面也没有相关的任何信息留下
SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major
EVENT-TIME: 0x4e26ebf4.0x2695a142 (0x43e176a719ec)
PLATFORM: i86pc, CSN: -, HOSTNAME:
SOURCE: SunOS, REV: 5.10 Generic_127128-11
DESC: Errors have been detected that require a reboot to ensure system
integrity.See http://www.sun.com/msg/SUNOS-8000-0G for more information.
AUTO-RESPONSE: Solaris will attempt to save and diagnose the error telemetry
IMPACT: The system will sync files, save a crash dump if needed, and reboot
REC-ACTION: Save the error summary below in case telemetry cannot be saved
panic/thread=fffffe80000b9c80: Unrecoverable Machine-Check Exception
fffffe80000b9950 unix:cmi_mca_panic+1e ()
fffffe80000b9980 unix:cmi_mca_trap+13d ()
fffffe80000b9990 unix:mcetrap+17b ()
fffffe80000b9ab0 unix:setbackdq+1 ()
fffffe80000b9ae0 genunix:cv_unsleep+6e ()
fffffe80000b9b00 genunix:setrun_locked+7a ()
fffffe80000b9b30 genunix:eat_signal+c5 ()
fffffe80000b9b70 genunix:sigtoproc+417 ()
fffffe80000b9ba0 genunix:realitexpire+3d ()
fffffe80000b9be0 genunix:callout_execute+d6 ()
fffffe80000b9c10 unix:softint+108 ()
fffffe80000b9c20 unix:softlevel1+9 ()
fffffe80000b9c60 unix:av_dispatch_softvect+62 ()
fffffe80000b9c70 unix:intr_thread+b4 ()
syncing file systems...
panic/thread=fffffe80000b9c80: panic sync timeout
ereport.cpu.generic-x86.bus_interconnect_uc ena=3e176a6295700001 detector=[
version=0 scheme="hc" hc-list=[...] ] compound_errorname=
"BUSLG_SRC_ERR__NOTIMEOUT_ERR" disp=
"processor_context_corrupt,return_ip_invalid,unconstrained" IA32_MCG_STATUS=4
machine_check_in_progress=1 privileged=0 bank_number=4 bank_msr_offset=410
IA32_MCi_STATUS=b200000000070f0f overflow=0 error_uncorrected=1 error_enabled=
1 processor_context_corrupt=1 error_code=f0f model_specific_error_code=7
dumping to /dev/md/dsk/d60, offset 630063104, content: kernel
WARNING: /pci@0,0/pci1022,7458@11/pci1000,3060@4 (mpt0):
mpt_send_handshake_msg task 4 failed
panic/thread=fffffe80000b9c80: Unrecoverable Machine-Check Exception
dump aborted: please record the above information!
rebooting...
panic/thread=fffffe80000b9c80: Unrecoverable Machine-Check Exception
dump aborted: please record the above information!
rebooting... CPU0坏了。。。。。。。。。 CPU0挂了 不知道不要乱说,与CPU0无关。得收集coredump分析。 这个机器8CPU,之前两个风扇不转了,换了风扇还是不转 ,oracle给发了主板已经换上了 风扇正常了 嘎嘎 panic依旧很执着,我把1和8cpu调换了 firmware升级到了3.0.3 结果还是 cpu0panic
现在等背板...... 回复 4# lqiao
哈哈,dumpadm下面的路径里面 毛都没留下 ,dumping就被aborted 了 光盘启动 回复 7# easybegin
系统能正常启动就是不定时的发作,5-20小时不等就宕机一次,之前SVM两边都出了问题用ufsdump做了几次才搞起来的 进了系统,马上explorer 非常有可能是OS的bug,建议升级OS patch。