免费注册 查看新帖 |

Chinaunix

  平台 论坛 博客 文库
12下一页
最近访问板块 发新帖
查看: 5600 | 回复: 18
打印 上一主题 下一主题

请教SUN F6800 C域无法启动问题? [复制链接]

论坛徽章:
0
跳转到指定楼层
1 [收藏(0)] [报告]
发表于 2007-07-04 17:57 |只看该作者 |倒序浏览
50可用积分
有一SUN F6800服务器C域有时setkey on时无法启动,有时反复几次setkey off/on后系统能正常启动,报错信息和启动过程在下面:
另:硬件未报黄灯
sc0:C> setkeyswitch on
Powering boards on ...
Testing CPU Boards ...
{/N0/SB3/P2} Running CPU POR and Set Clocks
{/N0/SB3/P3} Running CPU POR and Set Clocks
{/N0/SB3/P2} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P3} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P0} Running CPU POR and Set Clocks
{/N0/SB3/P1} Running CPU POR and Set Clocks
{/N0/SB3/P0} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P1} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P0} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P0} Use is subject to license terms.
{/N0/SB3/P1} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P1} Use is subject to license terms.
{/N0/SB3/P0} Subtest: Setting Fireplane Config Registers for aid 0xc
{/N0/SB3/P1} Subtest: Setting Fireplane Config Registers for aid 0xd
{/N0/SB3/P2} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P3} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P2} Use is subject to license terms.
{/N0/SB3/P2} Subtest: Setting Fireplane Config Registers for aid 0xe
{/N0/SB3/P0} Subtest: Display CPU Version, frequency
{/N0/SB3/P2} Subtest: Display CPU Version, frequency
{/N0/SB3/P2} Version register = 003e0015.b0000507
{/N0/SB3/P2} Cpu/System ratio = 8, cpu actual frequency = 1200
{/N0/SB3/P1} Subtest: Display CPU Version, frequency
{/N0/SB3/P3} Use is subject to license terms.
{/N0/SB3/P3} Subtest: Setting Fireplane Config Registers for aid 0xf
{/N0/SB3/P3} Subtest: Display CPU Version, frequency
{/N0/SB3/P3} Version register = 003e0015.b0000507
{/N0/SB3/P3} Cpu/System ratio = 8, cpu actual frequency = 1200
{/N0/SB3/P0} Version register = 003e0015.b0000507
{/N0/SB3/P0} Cpu/System ratio = 8, cpu actual frequency = 1200
{/N0/SB3/P1} Version register = 003e0015.b0000507
{/N0/SB3/P1} Cpu/System ratio = 8, cpu actual frequency = 1200
{/N0/SB3/P2} Running Basic CPU
{/N0/SB3/P0} Running Basic CPU
{/N0/SB3/P3} Running Basic CPU
{/N0/SB3/P1} Running Basic CPU
{/N0/SB3/P2} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P0} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P3} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P1} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P2} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P2} Use is subject to license terms.
{/N0/SB3/P2} Subtest: I-Cache Initialization
{/N0/SB3/P3} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P3} Use is subject to license terms.
{/N0/SB3/P3} Subtest: I-Cache Initialization
{/N0/SB3/P3} Subtest: D-Cache Initialization
{/N0/SB3/P2} Subtest: D-Cache Initialization
{/N0/SB3/P0} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P3} Subtest: W-Cache Initialization
{/N0/SB3/P0} Use is subject to license terms.
{/N0/SB3/P0} Subtest: I-Cache Initialization
{/N0/SB3/P0} Subtest: D-Cache Initialization
{/N0/SB3/P1} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P1} Use is subject to license terms.
{/N0/SB3/P1} Subtest: I-Cache Initialization
{/N0/SB3/P1} Subtest: D-Cache Initialization
{/N0/SB3/P1} Subtest: W-Cache Initialization
{/N0/SB3/P1} Subtest: P-Cache Initialization
{/N0/SB3/P1} Subtest: Branch Prediction Initialization
{/N0/SB3/P1} Subtest: E-Cache Global Variables Initialization
{/N0/SB3/P1} Subtest: Fast Init. Verification Test
{/N0/SB3/P0} Subtest: W-Cache Initialization
{/N0/SB3/P2} Subtest: W-Cache Initialization
{/N0/SB3/P3} Subtest: P-Cache Initialization
{/N0/SB3/P0} Subtest: P-Cache Initialization
{/N0/SB3/P2} Subtest: P-Cache Initialization
{/N0/SB3/P3} Subtest: Branch Prediction Initialization
{/N0/SB3/P0} Subtest: Branch Prediction Initialization
{/N0/SB3/P2} Subtest: Branch Prediction Initialization
{/N0/SB3/P3} Subtest: E-Cache Global Variables Initialization
{/N0/SB3/P0} Subtest: E-Cache Global Variables Initialization
{/N0/SB3/P2} Subtest: E-Cache Global Variables Initialization
{/N0/SB3/P3} Subtest: Fast Init. Verification Test
{/N0/SB3/P0} Subtest: Fast Init. Verification Test
{/N0/SB3/P2} Subtest: Fast Init. Verification Test
{/N0/SB3/P0} Running Enable MMU
{/N0/SB3/P1} Running Enable MMU
{/N0/SB3/P2} Running Enable MMU
{/N0/SB3/P3} Running Enable MMU
{/N0/SB3/P0} Subtest: IMMU Initialization
{/N0/SB3/P1} Subtest: IMMU Initialization
{/N0/SB3/P2} Subtest: IMMU Initialization
{/N0/SB3/P3} Subtest: IMMU Initialization
{/N0/SB3/P0} Subtest: DMMU Initialization
{/N0/SB3/P1} Subtest: DMMU Initialization
{/N0/SB3/P0} Subtest: Map LPOST to local space
{/N0/SB3/P1} Subtest: Map LPOST to local space
{/N0/SB3/P2} Subtest: DMMU Initialization
{/N0/SB3/P3} Subtest: DMMU Initialization
{/N0/SB3/P2} Subtest: Map LPOST to local space
{/N0/SB3/P3} Subtest: Map LPOST to local space
{/N0/SB3/P2} Running Basic Ecache
{/N0/SB3/P3} Running Basic Ecache
{/N0/SB3/P0} Running Basic Ecache
{/N0/SB3/P1} Running Basic Ecache
{/N0/SB3/P2} Subtest: E-Cache Initialization of first 1K
{/N0/SB3/P3} Subtest: E-Cache Initialization of first 1K
{/N0/SB3/P2} Subtest: E-Cache Initialization
{/N0/SB3/P3} Subtest: E-Cache Initialization
{/N0/SB3/P0} Subtest: E-Cache Initialization of first 1K
{/N0/SB3/P1} Subtest: E-Cache Initialization of first 1K
{/N0/SB3/P0} Subtest: E-Cache Initialization
{/N0/SB3/P1} Subtest: E-Cache Initialization
{/N0/SB3/P2} Running Memory Registers Tests
{/N0/SB3/P3} Running Memory Registers Tests
{/N0/SB3/P0} Running Memory Registers Tests
{/N0/SB3/P1} Running Memory Registers Tests
{/N0/SB3/P2} Subtest: Disable Memory Controllers
{/N0/SB3/P0} Subtest: Disable Memory Controllers
{/N0/SB3/P3} Subtest: Disable Memory Controllers
{/N0/SB3/P1} Subtest: Disable Memory Controllers
{/N0/SB3/P0} Running Memory Configuration Tests
{/N0/SB3/P2} Running Memory Configuration Tests
{/N0/SB3/P1} Running Memory Configuration Tests
{/N0/SB3/P3} Running Memory Configuration Tests
{/N0/SB3/P0} Subtest: Memory Controller Configuration
{/N0/SB3/P2} Subtest: Memory Controller Configuration
{/N0/SB3/P1} Subtest: Memory Controller Configuration
{/N0/SB3/P3} Subtest: Memory Controller Configuration
{/N0/SB3/P0} Subtest: UP Memory Clear
{/N0/SB3/P1} Subtest: UP Memory Clear
{/N0/SB3/P2} Subtest: UP Memory Clear
{/N0/SB3/P3} Subtest: UP Memory Clear
{/N0/SB3/P0} Running Board Memory Interleave
{/N0/SB3/P1} Running Board Memory Interleave
{/N0/SB3/P2} Running Board Memory Interleave
{/N0/SB3/P3} Running Board Memory Interleave
{/N0/SB3/P0} Subtest: Board Memory Interleave Configuration
{/N0/SB3/P1} Subtest: Board Memory Interleave Configuration
{/N0/SB3/P2} Subtest: Board Memory Interleave Configuration
{/N0/SB3/P3} Subtest: Board Memory Interleave Configuration
{/N0/SB3/P0} Passed
{/N0/SB3/P1} Passed
{/N0/SB3/P2} Passed
{/N0/SB3/P3} Passed
/N0/IB9 : Failed AR interconnect test. Status = 00080004
IB9/ar0 Bit in error P3_ADDR [6]  
Jul 03 23:01:35 cwsc0 Domain-C.SC: AR Interconnect test: System board IB9/ar0 address repeater connections to system board RP2/ar0 failed
Testing IO Boards ...
Copying IO prom to Cpu dram
.Jul 03 23:01:45 cwsc0 Domain-C.SC: ErrorMonitor: Domain C has a SYSTEM ERROR
Jul 03 23:01:45 cwsc0 Domain-C.SC: /N0/IB9 encountered the first error
Jul 03 23:01:45 cwsc0 Domain-C.SC: RP2 encountered the first error
Jul 03 23:01:45 cwsc0 Domain-C.SC: ArAsic reported first error on /N0/IB9
Jul 03 23:01:45 cwsc0 Domain-C.SC:
/partition1/domain0/IB9/ar0:
>>> SafariPortError6[0x260] : 0x00008001
                 AdrPErr [00:00] : 0x1 Address parity error
                      FE [15:15] : 0x1

Jul 03 23:01:45 cwsc0 Domain-C.SC:
Jul 03 23:01:45 cwsc0 Domain-C.SC:
/partition1/RP2/ar0:
>>> SafariPortError9[0x290] : 0x00008001
                 AdrPErr [00:00] : 0x1 Address parity error
                      FE [15:15] : 0x1

Jul 03 23:01:56 cwsc0 Domain-C.SC: [AD] Event: SF6800.ASIC.AR.ADR_PERR.10442006
     CSN: 0350HH225B DomainID: C ADInfo: 1.SCAPP.15.3
     Time: Tue Jul 03 23:01:46 GMT+08:00 2007
     FRU-List-Count: 2; FRU-PN: 5014404; FRU-SN: 046949; FRU-LOC: /N0/IB9
                        FRU-PN: 5016418; FRU-SN: 002590; FRU-LOC: RP2
     Recommended-Action: Service action required

Jul 03 23:01:56 cwsc0 Domain-C.SC: Domain C is currently paused due to an error.  This domain must be turned off via "setkeyswitch off" to recover
..Jul 03 23:02:00 cwsc0 Domain-C.POST: {/N0/SB3/P0} not responding
................................
{/N0/SB3/P0} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P0} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P0} Use is subject to license terms.
{/N0/IB7/P0} Unknown
{/N0/IB7/P1} Unknown
Jul 03 23:02:06 cwsc0 Domain-C.SC: Excluded unusable, unlicensed, failed or disabled board: /N0/IB7
Copying IO prom to Cpu dram
....Jul 03 23:02:23 cwsc0 Domain-C.POST: {/N0/SB3/P1} not responding
...............................
{/N0/SB3/P1} @(#) lpost         5.15.3  2003/09/30 23:01
{/N0/SB3/P1} Copyright 2001-2003 Sun Microsystems, Inc.  All rights reserved.
{/N0/SB3/P1} Use is subject to license terms.
{/N0/IB9/P0} Unknown
{/N0/IB9/P1} Unknown
Jul 03 23:02:28 cwsc0 Domain-C.SC: Excluded unusable, unlicensed, failed or disabled board: /N0/IB9
Jul 03 23:02:28 cwsc0 Domain-C.SC: No usable Io board in domain.
setkeyswitch operation did not complete

论坛徽章:
0
2 [报告]
发表于 2007-07-04 17:58 |只看该作者

补充showlog信息

sc0:C> showlogs

Jul 03 22:56:36 cwsc0 Domain-C.SC: [ID 564364 local0.error] ArAsic reported first error on /N0/IB9
Jul 03 22:56:36 cwsc0 Domain-C.SC: [ID 305790 local0.error]
/partition1/domain0/IB9/ar0:
>>> SafariPortError6[0x260] : 0x00008001
                 AdrPErr [00:00] : 0x1 Address parity error
                      FE [15:15] : 0x1

Jul 03 22:56:36 cwsc0 Domain-C.SC: [ID 648255 local0.error]
Jul 03 22:56:36 cwsc0 Domain-C.SC: [ID 718700 local0.error]
/partition1/RP2/ar0:
>>> SafariPortError9[0x290] : 0x00008001
                 AdrPErr [00:00] : 0x1 Address parity error
                      FE [15:15] : 0x1

Jul 03 22:56:47 cwsc0 Domain-C.SC: [ID 166449 local0.error] [AD] Event: SF6800.ASIC.AR.ADR_PERR.10442006
     CSN: 0350HH225B DomainID: C ADInfo: 1.SCAPP.15.3
     Time: Tue Jul 03 22:56:37 GMT+08:00 2007
     FRU-List-Count: 2; FRU-PN: 5014404; FRU-SN: 046949; FRU-LOC: /N0/IB9
                        FRU-PN: 5016418; FRU-SN: 002590; FRU-LOC: RP2
     Recommended-Action: Service action required

Jul 03 22:56:47 cwsc0 Domain-C.SC: [ID 317625 local0.crit] Domain C is currently paused due to an error.  This domain must be turned off via "setkeyswitch off" to recover
Jul 03 22:56:51 cwsc0 Domain-C.POST: [ID 455119 local0.error] {/N0/SB3/P0} not responding
Jul 03 22:56:57 cwsc0 Domain-C.SC: [ID 970614 local0.warning] Excluded unusable, unlicensed, failed or disabled board: /N0/IB7
Jul 03 22:57:13 cwsc0 Domain-C.POST: [ID 520655 local0.error] {/N0/SB3/P1} not responding
Jul 03 22:57:20 cwsc0 Domain-C.SC: [ID 970616 local0.warning] Excluded unusable, unlicensed, failed or disabled board: /N0/IB9
Jul 03 22:57:20 cwsc0 Domain-C.SC: [ID 969319 local0.error] No usable Io board in domain.
Jul 03 23:01:35 cwsc0 Domain-C.SC: [ID 861998 local0.error] /N0/IB9 : Failed AR interconnect test. Status = 00080004
Jul 03 23:01:35 cwsc0 Domain-C.SC: [ID 882583 local0.error] AR Interconnect test: System board IB9/ar0 address repeater connections to system board RP2/ar0 failed
Jul 03 23:01:35 cwsc0 Domain-C.SC: [ID 127933 local0.error] IB9/ar0 Bit in error P3_ADDR [6]  
Jul 03 23:01:45 cwsc0 Domain-C.SC: [ID 427810 local0.crit] ErrorMonitor: Domain C has a SYSTEM ERROR
Jul 03 23:01:45 cwsc0 Domain-C.SC: [ID 219031 local0.error] /N0/IB9 encountered the first error
Jul 03 23:01:45 cwsc0 Domain-C.SC: [ID 346375 local0.error] RP2 encountered the first error
Jul 03 23:01:45 cwsc0 Domain-C.SC: [ID 564364 local0.error] ArAsic reported first error on /N0/IB9
Jul 03 23:01:45 cwsc0 Domain-C.SC: [ID 305790 local0.error]
/partition1/domain0/IB9/ar0:
>>> SafariPortError6[0x260] : 0x00008001
                 AdrPErr [00:00] : 0x1 Address parity error
                      FE [15:15] : 0x1

Jul 03 23:01:45 cwsc0 Domain-C.SC: [ID 648255 local0.error]
Jul 03 23:01:45 cwsc0 Domain-C.SC: [ID 718700 local0.error]
/partition1/RP2/ar0:
>>> SafariPortError9[0x290] : 0x00008001
                 AdrPErr [00:00] : 0x1 Address parity error
                      FE [15:15] : 0x1

Jul 03 23:01:56 cwsc0 Domain-C.SC: [ID 467782 local0.error] [AD] Event: SF6800.ASIC.AR.ADR_PERR.10442006
     CSN: 0350HH225B DomainID: C ADInfo: 1.SCAPP.15.3
     Time: Tue Jul 03 23:01:46 GMT+08:00 2007
     FRU-List-Count: 2; FRU-PN: 5014404; FRU-SN: 046949; FRU-LOC: /N0/IB9
                        FRU-PN: 5016418; FRU-SN: 002590; FRU-LOC: RP2
     Recommended-Action: Service action required

Jul 03 23:01:56 cwsc0 Domain-C.SC: [ID 317625 local0.crit] Domain C is currently paused due to an error.  This domain must be turned off via "setkeyswitch off" to recover
Jul 03 23:02:00 cwsc0 Domain-C.POST: [ID 455119 local0.error] {/N0/SB3/P0} not responding
Jul 03 23:02:06 cwsc0 Domain-C.SC: [ID 970614 local0.warning] Excluded unusable, unlicensed, failed or disabled board: /N0/IB7
Jul 03 23:02:23 cwsc0 Domain-C.POST: [ID 520655 local0.error] {/N0/SB3/P1} not responding
Jul 03 23:02:28 cwsc0 Domain-C.SC: [ID 970616 local0.warning] Excluded unusable, unlicensed, failed or disabled board: /N0/IB9
Jul 03 23:02:28 cwsc0 Domain-C.SC: [ID 969319 local0.error] No usable Io board in domain.

论坛徽章:
0
3 [报告]
发表于 2007-07-04 18:34 |只看该作者
没有买服务?

/N0/IB9或者RP2坏了,需要更换

论坛徽章:
0
4 [报告]
发表于 2007-07-04 19:48 |只看该作者
原帖由 nus 于 2007-7-4 18:34 发表
没有买服务?

/N0/IB9或者RP2坏了,需要更换



感谢回复!
已经过保修期了,我觉得也是硬件问题,但刚才启动了一次又可以起来了,这我就不明白了!

论坛徽章:
0
5 [报告]
发表于 2007-07-04 20:03 |只看该作者
FRU-List-Count: 2; FRU-PN: 5014404; FRU-SN: 046949; FRU-LOC: /N0/IB9
                        FRU-PN: 5016418; FRU-SN: 002590; FRU-LOC: RP2
     Recommended-Action: Service action required

论坛徽章:
0
6 [报告]
发表于 2007-07-04 20:14 |只看该作者
把  /N0/IB9 从 domainc 中剔除掉...然后在


keyswitch on

论坛徽章:
0
7 [报告]
发表于 2007-07-04 20:26 |只看该作者
原帖由 alex_linux 于 2007-7-4 20:14 发表
把  /N0/IB9 从 domainc 中剔除掉...然后在


keyswitch on

关键是我刚刚重复启了几次domainc ,都能正常起来,现在把/N0/IB9 替换掉没什么意义啊!

论坛徽章:
0
8 [报告]
发表于 2007-07-04 20:51 |只看该作者
原帖由 lyh303 于 2007-7-4 20:26 发表

关键是我刚刚重复启了几次domainc ,都能正常起来,现在把/N0/IB9 替换掉没什么意义啊!

那不能说明一的 ib9没问题啊

论坛徽章:
0
9 [报告]
发表于 2007-07-04 21:04 |只看该作者
有没有可能跟Firmware有关呢?

论坛徽章:
0
10 [报告]
发表于 2007-07-04 22:05 |只看该作者
原帖由 lyh303 于 2007-7-4 21:04 发表
有没有可能跟Firmware有关呢?

这个很难说的.

做post了吗?
您需要登录后才可以回帖 登录 | 注册

本版积分规则 发表回复

  

北京盛拓优讯信息技术有限公司. 版权所有 京ICP备16024965号-6 北京市公安局海淀分局网监中心备案编号:11010802020122 niuxiaotong@pcpop.com 17352615567
未成年举报专区
中国互联网协会会员  联系我们:huangweiwei@itpub.net
感谢所有关心和支持过ChinaUnix的朋友们 转载本站内容请注明原作者名及出处

清除 Cookies - ChinaUnix - Archiver - WAP - TOP