NT5220升完固件报错。
机器是这样的,客户的软件升级里要升机器的固件,他们做完升级以后机器就报下面的错误,然后我们就去换了主板,换完主板后机器好了没问题了,然后他们又进行升级,升完又成这个鸟样了,大神们看看怎么办。Password:
Waiting for daemons to initialize...
Daemons ready
Sun(TM) Integrated Lights Out Manager
Version 3.0.6.1.d r48331
Copyright 2009 Sun Microsystems, Inc. All rights reserved.
Use is subject to license terms.
sc> showplatform
SUNW,Netra-T5220
Chassis Serial Number: 1039FM900Y
Domain Status
------ ------
S0 Powered off
sc> poweron
sc> Chassis | major: Host has been powered on
sc> console -f
Enter #. to return to ALOM.
0:0:0>
0:0:0>Netra T5220 POST 4.30.4 2009/08/19 07:55
/export/delivery/delivery/4.30/4.30.4/post4.30.4-micro/Niagara/turgo/inte
grated(root)
0:0:0>Copyright 2009 Sun Microsystems, Inc. All rights reserved
0:0:0>POST enabling CMP 0 threads: ffffffff.ffffffff
0:0:0>VBSC mode is: 00000000.00000001
0:0:0>VBSC level is: 00000000.00000001
0:0:0>VBSC selecting Normal mode, MAX Testing.
0:0:0>VBSC setting verbosity level 2
0:0:0>Basic Memory Tests....Done
0:0:0>Test Memory....Done
0:0:0>Setup POST Mailbox ....Done
0:0:0>Master CPU Tests Basic....Done
0:0:0>Init MMU.....
0:0:0>NCU Setup and PIU link train....Done
0:0:0>L2 Tests....Done
0:0:0>Extended CPU Tests....Done
0:0:0>Scrub Memory....Done
0:0:0>SPU CWQ Tests...Done
0:0:0>MAU Tests...Done
0:0:0>Network Interface Unit Tests....Done
0:0:0>Functional CPU Tests....Done
0:0:0>Extended Memory Tests....Done
0:0:0>FATAL ERROR 0!!
0:0:0>ERROR: Illegal Instruction!
0:0:0>CPU 0 trap trace.
0:0:0>tl tt tstate hpstate tpc
00 0010 0000000040001605 0000000000000000 FFFFFFFF
F0208050
01 0180 0000010040001006 0000000000000004 000000FF
F022C80C
02 00C0 0000024440001200 0000000000000004 FFFFFFFF
F02291C0
03 0000 0000000000000000 0000000000000000 00000000
00000000
04 0000 0000000000000000 0000000000000000 00000000
00000000
05 0000 0000000000000000 0000000000000000 00000000
00000000
0:0:0>Decode of Disrupting Error Status Reg (DESR HW Corrected)bits c9000000.0
0000000
0:0:0> 1 DESR_F:Full, register contains valid data
0:0:0> 1 DESR_ME: Multiple errors detected
0:0:0> 9 DESR_L2C: L2 cache correctable
0:0:0>Decode of Disrupting Error Status Reg (DESR SW Recoverable)bits a4000000
.00000000
0:0:0> 1 DESR_F:Full, register contains valid data
0:0:0> 1 DESR_S: Set to 1 if SW recoverable error was logged. Set
to 0 when HW recoverable error was logged.
0:0:0> 4 DESR_DCL2C: dc L2 correctable
0:0:0>Decode of Dram Error Log Reg Branch 1 bits 20000000.00000004
0:0:0> 1 DAC 61 R/W1C Set to 1 if the error was a DRAM acce
ss CE
0:0:0> 4 SYND 15:0RW ECC syndrome.
0:0:0> L2 AFAR Branch 1 bank 3 = 00000002.5ccfa2c0
0:0:0> Dram Error Location Reg Branch 1 = 00000001.00000000
0:0:0> DRAM Retry Reg for Branch 1 = 00000010.00480012
0:0:0>Decode of L2 Error Log Reg Bank 3 bits 40000410.00000000
0:0:0> 1 MEC 62W1C Multiple corrected errors, one or more cor
rectederrors were not logged.
0:0:0> 1 DAC 42Set to 1 if the error was a DRAM accesscorre
ctable error.
0:0:0> 1 VEC 36Set to 1 if the register contains a valid corr
ectableerror.
0:0:0> 0 SYND 27:0 Preserved R/W Parity or ECC syndrome.
0:0:0> L2 AFAR Branch 1 bank 3 = 00000002.5c5fa2c0
0:0:0>ERROR:
0:0:0>POST toplevel status has the following failures:
0:0:0> MB/CMP0/L2_BANK0
0:0:0> MB/CMP0/L2_BANK1
0:0:0> MB/CMP0/L2_BANK2
0:0:0> MB/CMP0/L2_BANK3
0:0:0> MB/CMP0/L2_BANK4
0:0:0> MB/CMP0/L2_BANK5
0:0:0> MB/CMP0/L2_BANK6
0:0:0> MB/CMP0/L2_BANK7
0:0:0>END_ERROR
Fault | critical: SP detected fault at time Fri Apr4 11:48:17 2014. /SYS/MB/CM
P0/L2_BANK0 Forced fail (POST)
Fault | critical: SP detected fault at time Fri Apr4 11:48:19 2014. Apr4 11:
48:19 ERROR: Unsupported memory configuration
Chassis | major: Apr4 11:48:19 ERROR: Unsupported memory configuration
Chassis | critical: Apr4 11:48:19 FATAL: No memory available
Chassis | critical: Apr4 11:48:19 FATAL: The HOST Processor has a configuratio
n error, forcing a power-down
Chassis | critical: CRITICAL ALARM is set
Chassis | critical: Host has been powered off
Fault | critical: SP detected fault at time Fri Apr4 11:48:20 2014. /SYS/MB/CM
P0/L2_BANK1 Forced fail (POST)
Fault | critical: SP detected fault at time Fri Apr4 11:48:21 2014. /SYS/MB/CM
P0/L2_BANK2 Forced fail (POST)
Fault | critical: SP detected fault at time Fri Apr4 11:48:23 2014. /SYS/MB/CM
P0/L2_BANK3 Forced fail (POST)
Fault | critical: SP detected fault at time Fri Apr4 11:48:24 2014. /SYS/MB/CM
P0/L2_BANK4 Forced fail (POST)
Fault | critical: SP detected fault at time Fri Apr4 11:48:26 2014. /SYS/MB/CM
P0/L2_BANK5 Forced fail (POST)
Fault | critical: SP detected fault at time Fri Apr4 11:48:27 2014. /SYS/MB/CM
P0/L2_BANK6 Forced fail (POST)
Fault | critical: SP detected fault at time Fri Apr4 11:48:29 2014. /SYS/MB/CM
P0/L2_BANK7 Forced fail (POST)
Serial console stopped.
sc> showfaults
Last POST Run: Fri Apr4 11:48:16 2014
Post Status: Failed devices: MB/CMP0/L2_BANK0 MB/CMP0/L2_BANK1 MB/CMP0/L2_BANK2
MB/CMP0/L2_BANK3 MB/CMP0/L2_BANK4 MB/CMP0/L2_BANK5 MB/CMP0/L2_BANK6 MB/CMP0/L2_B
ANK7
ID FRU Fault
1 /SYS/MB SP detected fault: /SYS/MB/CMP0/L2_BANK7 Forced fail (POS
T)
2 /SYS/MB SP detected fault: /SYS/MB/CMP0/L2_BANK6 Forced fail (POS
T)
3 /SYS/MB SP detected fault: /SYS/MB/CMP0/L2_BANK5 Forced fail (POS
T)
4 /SYS/MB SP detected fault: /SYS/MB/CMP0/L2_BANK4 Forced fail (POS
T)
5 /SYS/MB SP detected fault: /SYS/MB/CMP0/L2_BANK3 Forced fail (POS
T)
6 /SYS/MB SP detected fault: /SYS/MB/CMP0/L2_BANK2 Forced fail (POS
T)
7 /SYS/MB SP detected fault: /SYS/MB/CMP0/L2_BANK1 Forced fail (POS
T)
8 /SYS/MB SP detected fault: /SYS/MB/CMP0/L2_BANK0 Forced fail (POS
T)
9 /SYS SP detected fault: Apr4 11:48:19 ERROR: Unsupported mem
ory configuration
sc> showcomponent
Keys:
/SYS/MB/TTYA
/SYS/MB/PCI_MEZZ/PCIX3
/SYS/MB/PCI_MEZZ/PCIX4
/SYS/MB/PCI_MEZZ/PCIE5
/SYS/MB/PCI_MEZZ/PCIE-IO
/SYS/MB/RISER0/XAUI0
/SYS/MB/RISER0/PCIE0
/SYS/MB/RISER1/XAUI1
/SYS/MB/RISER1/PCIE1
/SYS/MB/RISER2/PCIE2
/SYS/MB/GBE0
/SYS/MB/GBE1
/SYS/MB/PCIE
/SYS/MB/PCIE-IO/USB
/SYS/MB/SASHBA
/SYS/MB/CMP0/NIU0
/SYS/MB/CMP0/NIU1
/SYS/MB/CMP0/MCU0
/SYS/MB/CMP0/MCU1
/SYS/MB/CMP0/MCU2
/SYS/MB/CMP0/MCU3
/SYS/MB/CMP0/L2_BANK0
/SYS/MB/CMP0/L2_BANK1
/SYS/MB/CMP0/L2_BANK2
/SYS/MB/CMP0/L2_BANK3
/SYS/MB/CMP0/L2_BANK4
/SYS/MB/CMP0/L2_BANK5
/SYS/MB/CMP0/L2_BANK6
/SYS/MB/CMP0/L2_BANK7
/SYS/MB/CMP0/BR0/CH0/D0
/SYS/MB/CMP0/BR0/CH0/D1
/SYS/MB/CMP0/BR0/CH1/D0
/SYS/MB/CMP0/BR0/CH1/D1
/SYS/MB/CMP0/BR1/CH0/D0
/SYS/MB/CMP0/BR1/CH0/D1
/SYS/MB/CMP0/BR1/CH1/D0
/SYS/MB/CMP0/BR1/CH1/D1
/SYS/MB/CMP0/BR2/CH0/D0
/SYS/MB/CMP0/BR2/CH0/D1
/SYS/MB/CMP0/BR2/CH1/D0
/SYS/MB/CMP0/BR2/CH1/D1
/SYS/MB/CMP0/BR3/CH0/D0
/SYS/MB/CMP0/BR3/CH0/D1
/SYS/MB/CMP0/BR3/CH1/D0
/SYS/MB/CMP0/BR3/CH1/D1
/SYS/MB/CMP0/P0
/SYS/MB/CMP0/P1
/SYS/MB/CMP0/P2
/SYS/MB/CMP0/P3
/SYS/MB/CMP0/P4
/SYS/MB/CMP0/P5
/SYS/MB/CMP0/P6
/SYS/MB/CMP0/P7
/SYS/MB/CMP0/P8
/SYS/MB/CMP0/P9
/SYS/MB/CMP0/P10
/SYS/MB/CMP0/P11
/SYS/MB/CMP0/P12
/SYS/MB/CMP0/P13
/SYS/MB/CMP0/P14
/SYS/MB/CMP0/P15
/SYS/MB/CMP0/P16
/SYS/MB/CMP0/P17
/SYS/MB/CMP0/P18
/SYS/MB/CMP0/P19
/SYS/MB/CMP0/P20
/SYS/MB/CMP0/P21
/SYS/MB/CMP0/P22
/SYS/MB/CMP0/P23
/SYS/MB/CMP0/P24
/SYS/MB/CMP0/P25
/SYS/MB/CMP0/P26
/SYS/MB/CMP0/P27
/SYS/MB/CMP0/P28
/SYS/MB/CMP0/P29
/SYS/MB/CMP0/P30
/SYS/MB/CMP0/P31
/SYS/MB/CMP0/P32
/SYS/MB/CMP0/P33
/SYS/MB/CMP0/P34
/SYS/MB/CMP0/P35
/SYS/MB/CMP0/P36
/SYS/MB/CMP0/P37
/SYS/MB/CMP0/P38
/SYS/MB/CMP0/P39
/SYS/MB/CMP0/P40
/SYS/MB/CMP0/P41
/SYS/MB/CMP0/P42
/SYS/MB/CMP0/P43
/SYS/MB/CMP0/P44
/SYS/MB/CMP0/P45
/SYS/MB/CMP0/P46
/SYS/MB/CMP0/P47
/SYS/MB/CMP0/P48
/SYS/MB/CMP0/P49
/SYS/MB/CMP0/P50
/SYS/MB/CMP0/P51
/SYS/MB/CMP0/P52
/SYS/MB/CMP0/P53
/SYS/MB/CMP0/P54
/SYS/MB/CMP0/P55
/SYS/MB/CMP0/P56
/SYS/MB/CMP0/P57
/SYS/MB/CMP0/P58
/SYS/MB/CMP0/P59
/SYS/MB/CMP0/P60
/SYS/MB/CMP0/P61
/SYS/MB/CMP0/P62
/SYS/MB/CMP0/P63
Disabled Devices
/SYS/MB/CMP0/L2_BANK0 Forced fail (POST)
/SYS/MB/CMP0/L2_BANK1 Forced fail (POST)
/SYS/MB/CMP0/L2_BANK2 Forced fail (POST)
/SYS/MB/CMP0/L2_BANK3 Forced fail (POST)
/SYS/MB/CMP0/L2_BANK4 Forced fail (POST)
/SYS/MB/CMP0/L2_BANK5 Forced fail (POST)
/SYS/MB/CMP0/L2_BANK6 Forced fail (POST)
/SYS/MB/CMP0/L2_BANK7 Forced fail (POST)
sc> logout
SC下enablecomponent /SYS/MB/CMP0/L2_BANK0-7过这几个部件,然后poweron加电就又成这样了。
另外他们有很多同样的机器,升完都没事,就这一台报这个错。
是升级完后内存不支持了呢?还是这个L2是CPU的2级缓存? 同志们不要只看不回,是内存问题还是CPU的L2缓存? 看内存和其他机器上的有啥区别。 不会是用T5220的固件来升级NT5220了吧? 这个应该不会吧,有好几台都没有问题,就这一台有,以经让客户从好的机器上拔内存插过来试了,等试完就知道是不是内存的问题了。回复 5# znnnz
1、看看内存的型号?
2、原来的固件版本?
3、现在的固件版本? 内存型号暂时不知道,他们需要的版本firmware version 3.0.10.2.a,原来的是3.0.6.1.d,他们升到3.0.10.2.a就报错,没升时正常,然后在刷回3.0.6.1.d也不管用还是报错。回复 7# DC_楚楚
看着像CPU cache
L2 AFAR Branch 1 bank 3 = 00000002.5c5fa2c0
仔细看看吧 回复 8# kwtip
可以参考一下文档看看符不符合,Solution1573160.1 : IBIST Memory Failures on T5xx0 Servers
页:
[1]
2