
Help: V440 won't boot and can't reach the ok prompt

#1 | Posted 2009-09-03 14:52
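I have a V440 that won't boot, and I want to run diagnostics on it.
I connected it to another machine with a serial cable. On that machine I ran tip hardwire, which reported "connected".
After that the screen just loops through some baffling output, and the ok prompt never appears.
The help documents I found all seem to say to run such-and-such command at the ok prompt.
What should I do?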

Best answer: test by substitution.

#2 | Posted 2009-09-03 14:52
Test by substitution (swap suspect parts for known-good ones).

#3 | Posted 2009-09-03 14:56
What exactly is it printing?

#4 | Posted 2009-09-03 15:07
RED State Exception
Error enable reg: 0000.0000.0000.0000
CPU 0000.0000.0000.0003 AFSR 0030.06be.0080.00e9  Multiple PRIV UCC UCU CPU WDU EDC EDU UE CE
AFAR 0000.0032.3ff0.34f0
CPU 0000.0000.0000.0002 AFSR 0030.0001.0300.0000  Multiple PRIV RUE
AFAR 0000.0032.3ff0.3540
CPU 0000.0000.0000.0001 AFSR 0030.0001.0300.0000  Multiple PRIV RUE
AFAR 0000.0032.3ff0.3540
CPU 0000.0000.0000.0000 AFSR 0030.0001.0300.0000  Multiple PRIV RUE
AFAR 0000.0032.3ff0.3540
CPU: 0000.0000.0000.0003
TL=0000.0000.0000.0005  TT=0000.0000.0000.0064
   TPC=0000.07ff.f7e0.4910  TnPC=0000.07ff.881d.a55c  TSTATE=0000.0099.1500.1501
TL=0000.0000.0000.0004  TT=0000.0000.0000.0010
   TPC=0000.07ff.f004.4c80  TnPC=0000.07ff.f004.4c84  TSTATE=0000.0099.1504.1401
TL=0000.0000.0000.0003  TT=0000.0000.0000.0064
   TPC=0000.07ff.f7e0.4910  TnPC=0000.07ff.881d.a55c  TSTATE=0000.0099.1500.1501
TL=0000.0000.0000.0002  TT=0000.0000.0000.0010
   TPC=0000.07ff.f004.0200  TnPC=0000.07ff.f004.0204  TSTATE=0000.0099.1500.1501
TL=0000.0000.0000.0001  TT=0000.0000.0000.0010
   TPC=0000.0000.0000.0284  TnPC=0000.0000.0000.0288  TSTATE=0000.0099.1500.1501

CPU 0000.0000.0000.0003 AFSR 0020.0080.0000.0203  Multiple CPU
AFAR 0000.0032.3ff0.34f0
CPU 0000.0000.0000.0002 AFSR 0030.0001.0300.0000  Multiple PRIV RUE
AFAR 0000.0032.3ff0.3540
CPU 0000.0000.0000.0001 AFSR 0030.0001.0300.0000  Multiple PRIV RUE
AFAR 0000.0032.3ff0.3540
CPU 0000.0000.0000.0000 AFSR 0030.0001.0300.0000  Multiple PRIV RUE
AFAR 0000.0032.3ff0.3540
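
Editorial aside: when a dump like the one above scrolls by in a loop, it can help to capture the tip session to a file and tally which CPUs report which AFSR flag names. The following is a minimal Python sketch under that assumption. The names `summarize_afsr` and `dotted_hex_to_int` are hypothetical, and the code only collects the flag names the PROM already printed; it does not decode AFSR or AFAR bits, which the Sun document quoted later in this thread says requires Sun's decoder tools.

```python
import re
from collections import defaultdict

# Matches lines such as:
#   CPU 0000.0000.0000.0002 AFSR 0030.0001.0300.0000  Multiple PRIV RUE
# It deliberately does not match "CPU: 0000.0000.0000.0003" (no AFSR field).
AFSR_LINE = re.compile(
    r"^CPU\s+([0-9a-fA-F.]+)\s+AFSR\s+([0-9a-fA-F.]+)\s*(.*)$"
)


def dotted_hex_to_int(value):
    """Convert a dotted hex string like '0000.0000.0000.0003' to an int."""
    return int(value.replace(".", ""), 16)


def summarize_afsr(console_text):
    """Return {cpu_id: set of flag-name tuples} found in a console capture."""
    seen = defaultdict(set)
    for line in console_text.splitlines():
        match = AFSR_LINE.match(line.strip())
        if match:
            cpu = dotted_hex_to_int(match.group(1))
            flags = tuple(match.group(3).split())
            seen[cpu].add(flags)
    return dict(seen)
```

Run over the output in this post, a summary like this makes it easier to see that CPU 3 reports a different, longer set of flags than CPUs 0 to 2, which is the kind of detail worth passing on to a support engineer.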

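#5 | Posted 2009-09-03 15:12
Open the chassis and check whether a data cable has been left unplugged.
Check the CPU board, and reseat the memory.
Personally, I think this is a fault caused by a bad contact.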

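#6 | Posted 2009-09-03 15:20
This machine has never had its cover opened, and the machine room is fairly far away. I'll open it up for inspection only if nothing else works.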

#7 | Posted 2009-09-03 16:01
Sun System Handbook - CD 2.1.17 September 2008 Internal/Partner Edition

Asset ID:        1-61-211473-1
Update Date:        Tue Aug 12 00:00:00 MDT 2008
Keywords:       

Solution Type: Technical Instruction

Solution 211473: How to Verify whether a System Reboot is Caused by a Fatal Reset or a Red State Exception

Description
This document will help identify if the reason for an unexpected or unexplained system reboot is due to a Fatal reset error or a Red State Exception (RSE) condition.


Please note that the purpose of this document is to help you identify the root cause. If the symptoms described here are indeed what your system is experiencing, you will need to contact qualified engineers at Sun Support. Please reference this document ID number when you contact Sun Support for assistance.

Steps to Follow
The unexpected reboots are most often caused by hardware faults and reported by the system as a fatal reset or a red state exception.


When errors like these occur, the OS is abruptly interrupted and cannot log error messages to /var/adm/messages or generate a core file. The system reboots, and the error messages and all related output appear only on the system console (that is, in the console logs). For further troubleshooting, it is therefore very important to capture the complete console logs from the time of the error (reboot).

1. The system reboot could be due to fatal reset errors. The fatal errors are most often caused by hardware (bad CPU, MB switches, I/O bridge, etc.) and are the result of an 'illegal' hardware state that is detected by the system. The Fatal Reset error and all output are only logged to the system console (ttya or RSC). Here are examples of fatal errors caused by CPU and motherboard switch ASICs (the full fatal reset output is too long and is not included):
ERROR: System Hardware FATAL RESET from CPU0
System State (CPU3 reporting)
ERROR: System "FATAL RESET" from DAR/DCS/CDX
System State (CPU2 reporting)

For systems using the ALOM serial console, the fatal error would be reported as:
Fatal Error Reset
SC Alert: Host System has Reset

When your system reboots after a fatal error, you may see only a notice in the /var/adm/messages file like this one:
[ID 796976 kern.notice] System booting after fatal error FATAL Sys Hardware

Also, the output of prtconf -vp may show a FATAL Sys Hardware message under "reset-reason:":
# prtconf -vp
System Configuration: Sun Microsystems sun4u
Memory size: 8192 Megabytes
System Peripherals (PROM Nodes):
.....................
banner-name: 'Sun Fire 880'
watchdog-enable:
reset-reason: 'FATAL Sys Hardware' <<<<<<<
model: 'SUNW,501-6323'
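
Editorial aside, not part of the Sun document: the reset-reason check above can be scripted. The following is a minimal Python sketch, assuming Python 3.7 or later is available on the host and that prtconf is on the PATH. The function names are hypothetical, and the pattern is based only on the output format shown in this document.

```python
import re
import subprocess

# Matches a PROM property line such as:
#   reset-reason: 'FATAL Sys Hardware' <<<<<<<
RESET_REASON = re.compile(r"reset-reason:\s*'([^']*)'")


def reset_reason_from_text(prtconf_output):
    """Return the quoted reset-reason value, or None if the property is absent."""
    match = RESET_REASON.search(prtconf_output)
    return match.group(1) if match else None


def reset_reason_live():
    """Run `prtconf -vp` on the local host and extract reset-reason."""
    completed = subprocess.run(
        ["prtconf", "-vp"], capture_output=True, text=True, check=True
    )
    return reset_reason_from_text(completed.stdout)
```

Keeping the parsing separate from the subprocess call means the same function can be applied to prtconf output saved from a machine that can no longer boot.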

If the console logs show fatal errors, please contact a qualified engineer at Sun Support for assistance.

1.a) For the UltraSPARC III/IV platforms (280R, V480/V880, V490/V890) and UltraSPARC IIIi platforms (V210/V240, V440) a trained Sun Support Engineer has access to important information along with an AFAR decoder tool and will carefully guide you through the steps to resolution.

Sun Support can also assist you if you are experiencing V480 Fatal Resets with specific network and I/O configurations.

2. The unexpected reboot could also be due to Red State Exception (RSE) errors. Check whether the console output contains any RSE errors. An RSE can be triggered by software or hardware, but it is most commonly due to a hardware fault (bad DIMM or bad CPU/L2SRAM). The RSE error and all output are logged only to the system console (ttya or RSC), and the error is usually reported by one of the CPUs:
ERROR: CPU3 RED State Exception
System State (CPU3 reporting)
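
Editorial aside, not part of the Sun document: since both error types show up only on the console, a saved console log can be scanned for the signature strings quoted in this document. The following is a minimal Python sketch; `classify_console_log` and the `SIGNATURES` table are hypothetical names, and the patterns cover only the example messages shown here, so real logs may contain variants that this misses.

```python
import re

# Signature strings taken from the example console messages in this document.
SIGNATURES = [
    ("red_state_exception", re.compile(r"RED State Exception", re.IGNORECASE)),
    ("fatal_reset", re.compile(r"FATAL\W*RESET|Fatal Error Reset", re.IGNORECASE)),
]


def classify_console_log(text):
    """Return {kind: [matching lines]} for each signature found in the log."""
    hits = {}
    for line in text.splitlines():
        for kind, pattern in SIGNATURES:
            if pattern.search(line):
                hits.setdefault(kind, []).append(line.strip())
    return hits
```

An empty result does not prove the reboot had another cause; it only means none of these particular strings were captured.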

If your system does reboot after an RSE, you may see only a notice in the /var/adm/messages file like this one:
[ID 993603 kern.notice] System booting after RED CPU RED-State
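
Editorial aside, not part of the Sun document: the two boot notices quoted in this document can be searched for in /var/adm/messages. The following is a minimal Python sketch; `boot_notices` is a hypothetical name, and the pattern matches only the two exact notice texts shown here.

```python
import re

# The two notice texts quoted in this document.
BOOT_NOTICE = re.compile(
    r"System booting after (fatal error FATAL Sys Hardware|RED CPU RED-State)"
)


def boot_notices(lines):
    """Return a list of (kind, line) pairs for matching notices, in order."""
    found = []
    for line in lines:
        match = BOOT_NOTICE.search(line)
        if match:
            if match.group(1).startswith("fatal"):
                kind = "fatal_reset"
            else:
                kind = "red_state_exception"
            found.append((kind, line.rstrip("\n")))
    return found
```

The function accepts any iterable of lines, so it can be called on an open file object, for example `boot_notices(open("/var/adm/messages", errors="replace"))`, or on lines copied from another host.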
The output of prtconf -vp may also show a RED CPU RED-State message under "reset-reason:":

# prtconf -vp
System Configuration: Sun Microsystems sun4u
Memory size: 32768 Megabytes
System Peripherals (PROM Nodes):
banner-name: 'Sun Fire 880'
watchdog-enable:
reset-reason: 'RED CPU RED-State' <--- reset-reason

If the console logs show RSE errors, this is again a critical issue that requires a qualified Sun Support Engineer, so please contact Sun Support for assistance.

2.a) For the UltraSPARC III/IV platforms (280R, V480/V880, V490/V890) and UltraSPARC IIIi platforms (V210/V240, V440), please contact Sun Support for assistance.

Internal Comments


This document contains normalized content and is managed by the Domain Lead(s) of the respective domains. To notify content owners of a knowledge gap in this document, or before updating it, please contact the domain engineers who manage this document via the "Document Feedback" alias(es) listed below:

Normalization Lead: Jim Robbins
Domain Engineer/Lead: Josh Freeman

VSP-SPARC-Normalization@sun.com

REFERENCES:

In case the console logs have fatal errors, reference the following docs:

1.a) for the UltraSPARC III/IV platforms (280R, V480/V880, V490/V890) refer to: Troubleshooting < Solution: 209123 > : Sun Fire V880 FATAL Resets.

< Solution: 205066 > : V480 Fatal Resets with specific network and I/O configurations.

Note: The procedures in < Solution: 209123 > apply to all V4x0/V8x0 platforms, since they use the same CPU/memory board.

1.b) For the UltraSPARC IIIi platforms (V210/V240, V440) you may use the US3i AFAR decoder tool in conjunction with < Solution: 206870 > : Event Messages for UltraSPARC-III[R], UltraSPARC-III+[R], UltraSPARC-IIIi[R], UltraSPARC-IV[R] and UltraSPARC-IV+[R] CPU Modules.

In case the console logs have RSE errors, reference the following docs:

2.a) for the UltraSPARC III/IV platforms (280R, V480/V880, V490/V890) refer to:

  < Solution: 209130 > :  Troubleshooting Sun Fire V880 RED STATE EXCEPTION .

  < Solution: 216842 > : Troubleshooting Red State Exception Memory Errors .

2.b) For the UltraSPARC IIIi platforms (V210/V240, V440) you may use the US3i AFAR decoder tool in conjunction with < Solution: 206870 > : Event Messages for UltraSPARC-III[R], UltraSPARC-III+[R], UltraSPARC-IIIi[R], UltraSPARC-IV[R] and UltraSPARC-IV+[R] CPU Modules.

More Reference Material:

Internal Tool: Fatal Reset Decoder

Internal Tool: RED State Exception Decoder

Internal Tool: US3iAFAR Decoder

Sun Alert < Solution: 200502 > Sun Systems Equipped ASICs Version 2.3 or Higher May Experience Either Domain Stop (Dstop), Domain Pause or FATAL RESET Under Heavy I/O

FCO AO226-1: V480 Fatal Resets with specific network and I/O configurations

Sun Alert < Solution: 201170 > Sun Fire V440 and Netra 440 Systems Using a Specific Networking Configuration may Unexpectedly Reset

Troubleshooting < Solution: 216842 >  Troubleshooting Red State Exception Memory Errors

Troubleshooting < Solution: 209123 > Sun Fire V880 FATAL Resets

Troubleshooting < Solution: 209130 >  Troubleshooting Sun Fire V880 RED STATE EXCEPTION

< Solution: 206870 > :  Event Messages for UltraSPARC-III[R], UltraSPARC-III+[R], UltraSPARC-IIIi[R], UltraSPARC-IV[R] and UltraSPARC-IV+[R] CPU Modules .

Product
Sun Fire V890 Server
Sun Fire V880z Visualization Server
Sun Fire V880 Server
Sun Fire V490 Server
Sun Fire V480 Server
Sun Fire V445 Server
Sun Fire V440 Server
Sun Fire V240 Server
Sun Fire V210 Server


Keywords
normalized, unexplained reboot, console logs, red state, fatal reset, Problem Solved = Identify Fatal Reset or Red State


Previously Published As
91380


Change History
emailed Author Dencho Kojucharov 8/11/08

I was working to publish this article but ran across 3 links in your Internal Comments Statement that were not found.  They are listed below:
Sun Alert 101456
FCO AO226-1
Sun Alert 101548
If you could fix these links as soon as possible so that we may publish, it would be greatly appreciated. If you have any questions, please let me know.
Thank you

Date: 2008-01-08
User Name: 7058
Action: Update Started
Comment: Updating doc per Jim Koontz and Dencho's approval to make it more suitable for customer viewing.
Version: 0

Attachments
This solution has no attachment
                       
Copyright 1994-2008 Sun Microsystems, Inc.    All rights reserved.

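#8 | Posted 2009-09-04 10:00
Now I'd like to get into ALOM to check what has failed, but I've forgotten the admin password and can't get into the OS either. Is there any other way to get into ALOM?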

#9 | Posted 2009-09-04 11:23
I think that password can only be cleared by pulling the battery...

#10 | Posted 2009-09-04 16:07
Following along to learn.