netra t2000 故障现象
sc> showfaults -vLast POST run: TUE AUG 14 16:01:13 2012
POST status: Passed all devices
ID Time FRU Fault
0 JAN 11 16:20:48 IOBD Host detected fault, MSGID: PCIEX-8000-DJUUID: ec25257b-019b-6c0c-de6f-94792c409a98
1 JAN 11 16:20:48 MB Host detected fault, MSGID: PCIEX-8000-DJUUID: ec25257b-019b-6c0c-de6f-94792c409a98
sc> showenvironment
=============== Environmental Status ===============
--------------------------------------------------------------------------------
System Temperatures (Temperatures in Celsius):
--------------------------------------------------------------------------------
Sensor StatusTemp LowHard LowSoft LowWarn HighWarn HighSoft HighHard
--------------------------------------------------------------------------------
PDB/T_AMB OK 23 -10 -5 0 58 60 62
PDB/ADT7462_AMBOK 25 -10 -5 0 58 60 62
MB/T_AMB OK 30 -- -- -- -- -- --
MB/CMP0/T_TCOREOK 60 -10 -5 0 95 101 105
MB/CMP0/T_BCOREOK 60 -10 -5 0 95 101 105
IOBD/IOB/T_COREOK 59 -10 -5 0 95 100 105
IOBD/T_AMB OK 38 -- -- -- -- -- --
--------------------------------------------------------
System Indicator Status:
--------------------------------------------------------
SYS/LOCATE SYS/SERVICE SYS/ACT
--------------------------------------------------------
OFF ON ON
--------------------------------------------
System Disks:
--------------------------------------------
Disk Status ServiceOK2RM
--------------------------------------------
HDD0 OK OFF OFF
HDD1 OK OFF OFF
----------------------------------------------------------
Fans (Speeds Revolution Per Minute):
----------------------------------------------------------
Sensor Status Speed Warn Low
----------------------------------------------------------
FT0/F0/TACH OK 5137 -- 2500
FT0/F1/TACH OK 4995 -- 2500
FT0/F2/TACH OK 5192 -- 2500
FT1/F0/TACH OK 7747 -- 4000
FT1/F1/TACH OK 7747 -- 4000
PS0/F0 OK 9540 -- 2000
PS1/F0 OK 9574 -- 2000
--------------------------------------------------------------------------------
Voltage sensors (in Volts):
--------------------------------------------------------------------------------
Sensor Status Voltage LowSoft LowWarn HighWarn HighSoft
--------------------------------------------------------------------------------
MB/V_+1V5 OK 1.48 1.36 1.39 1.60 1.63
MB/V_VMEML OK 1.79 1.63 1.67 1.92 1.98
MB/V_VMEMR OK 1.79 1.63 1.67 1.92 1.98
MB/V_VTTL OK 0.89 0.81 0.83 0.96 0.99
MB/V_VTTR OK 0.87 0.81 0.83 0.96 0.99
MB/V_+3V3STBY OK 3.31 3.13 3.16 3.53 3.59
MB/V_VCORE OK 1.32 1.20 1.24 1.36 1.39
IOBD/V_+1V5 OK 1.48 1.36 1.39 1.60 1.63
IOBD/V_+1V8 OK 1.78 1.63 1.67 1.92 1.96
IOBD/V_+3V3MAIN OK 3.39 3.06 3.10 3.49 3.53
IOBD/V_+3V3STBY OK 3.33 3.13 3.16 3.53 3.59
IOBD/V_+1V OK 1.18 1.09 1.11 1.28 1.30
IOBD/V_+1V2 OK 1.16 1.09 1.11 1.28 1.30
IOBD/V_+5V OK 5.15 4.55 4.75 5.35 5.45
IOBD/V_-12V OK -12.04-13.08-12.84-11.16 -10.92
IOBD/V_+12V OK 12.00 10.92 11.16 12.84 13.08
SC/BAT/V_BAT OK 2.74 -- 2.25 -- --
-----------------------------------------------------------
System Load (in amps):
-----------------------------------------------------------
Sensor Status Load Warn Shutdown
-----------------------------------------------------------
MB/I_VCORE OK 36.480 80.000 88.000
MB/I_VMEML OK 5.820 60.000 66.000
MB/I_VMEMR OK 6.420 60.000 66.000
-----------------------------------------------------------
----------------------
Current sensors:
----------------------
Sensor Status
----------------------
IOBD/I_USB0 OK
IOBD/I_USB1 OK
------------------------------------------------------------------------------
Power Supplies:
------------------------------------------------------------------------------
SupplyStatus UnderspeedOvertempOvervoltUndervoltOvercurrent
------------------------------------------------------------------------------
PS0 OK OFF OFF OFF OFF OFF
PS1 OK OFF OFF OFF OFF OFF
--------------------------------------------
System Alarms:
--------------------------------------------
Alarm Relay LED
--------------------------------------------
ALARM/CRITICAL OFF OFF
ALARM/MAJOR ON ON
ALARM/MINOR ON ON
ALARM/USER OFF OFF
主板有问题吗!!!!!!!!!!!!!1 clearfault UUID
power on
MAX POST看看~~~ PCI。。。
最大化自检一下。起来再看看。 ID Time FRU Fault
0 JAN 11 16:20:48 IOBD Host detected fault, MSGID: PCIEX-8000-DJUUID: ec25257b-019b-6c0c-de6f-94792c409a98
1 JAN 11 16:20:48 MB Host detected fault, MSGID: PCIEX-8000-DJUUID: ec25257b-019b-6c0c-de6f-94792c409a98
這些Fault是從OS的 fmadm 傳回 SC的, 在Solaris下輸入 as root:
fmadm faulty
你會看到相同的 UUID. 90%機率是使用hard-raid設定變更造成, 如果沒有修正設定每次reboot相同問題會繼續發生.
LZ近日是否有換過HDD?
硬件故障的機率很小...
回复 5# watchsat
应该没换过硬盘,系统下有相同的uuid 貼一下 echo | format 的output 看看
页:
[1]