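SUN T2000 fan question, help wanted
Solaris 10 on T2000 machines. Two T2000s both show the behavior described below. Over the last few months they occasionally go down and have to be brought back up by hand. The system log showed fan errors, but three fans were reported at once, so without thinking much about it I replaced all 3 fans. After the replacement the errors still appear and the machines still go down now and then, so I'm sure the fans themselves are not the problem.
Relevant command output follows (quite detailed, but not the full output of every command, to spare your eyes):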
prtdiag -v
============================ Environmental Status ============================
Fan sensors:
------------------------------------------------------------
Location Sensor Status
------------------------------------------------------------
0850NNN00A:CH/FT0/FM0 RS ok
0850NNN00A:CH/FT0/FM1 RS ok
0850NNN00A:CH/FT0/FM2 RS ok
0850NNN00A:CH/FT2 RS ok
Temperature sensors:
------------------------------------------------------------
Location Sensor Status
------------------------------------------------------------
0850NNN00A:CH/IOBD/IOB T_CORE ok
0850NNN00A:CH/IOBD T_AMB ok
0850NNN00A:CH/MB/CMP0 T_TCORE ok
0850NNN00A:CH/MB/CMP0 T_BCORE ok
0850NNN00A:CH/MB T_AMB ok
0850NNN00A:CH/PDB T_AMB ok
Current sensors:
------------------------------------------------------------
Location Sensor Status
------------------------------------------------------------
0850NNN00A:CH/MB I_VCORE ok
0850NNN00A:CH/MB I_VMEML ok
0850NNN00A:CH/MB I_VMEMR ok
Current indicators:
------------------------------------------------------------
Location Indicator Condition
------------------------------------------------------------
0850NNN00A:CH/IOBD I_USB0 ok
0850NNN00A:CH/IOBD I_USB1 ok
0850NNN00A:CH/FIOBD I_USB ok
Voltage sensors:
------------------------------------------------------------
Location Sensor Status
------------------------------------------------------------
0850NNN00A:CH/SC/BAT V_BAT ok
0850NNN00A:CH/IOBD V_+1V5 ok
0850NNN00A:CH/IOBD V_+1V8 ok
0850NNN00A:CH/IOBD V_+3V3MAIN ok
0850NNN00A:CH/IOBD V_+3V3STBY ok
0850NNN00A:CH/IOBD V_+1V ok
0850NNN00A:CH/IOBD V_+1V2 ok
0850NNN00A:CH/IOBD V_+5V ok
0850NNN00A:CH/IOBD V_-12V ok
0850NNN00A:CH/IOBD V_+12V ok
0850NNN00A:CH/MB V_+1V5 ok
0850NNN00A:CH/MB V_VMEML ok
0850NNN00A:CH/MB V_VMEMR ok
0850NNN00A:CH/MB V_VTTL ok
0850NNN00A:CH/MB V_VTTR ok
0850NNN00A:CH/MB V_+3V3STBY ok
0850NNN00A:CH/MB V_VCORE ok
LEDs:
------------------------------------------------------------
Location LED State
------------------------------------------------------------
0850NNN00A:CH/FT0/FM0 SERVICE off
0850NNN00A:CH/FT0/FM1 SERVICE off
0850NNN00A:CH/FT0/FM2 SERVICE off
0850NNN00A:CH/FT2 SERVICE off
0850NNN00A:CH/SYS ACT steady
0850NNN00A:CH/SYS LOCATE off
0850NNN00A:CH/SYS SERVICE off
0850NNN00A:CH/SYS REAR_FAULT off
0850NNN00A:CH/SYS TEMP_FAULT off
0850NNN00A:CH/SYS TOP_FAN_FAULT off
0850NNN00A:CH/HDD0 SERVICE off
0850NNN00A:CH/HDD0 OK2RM off
0850NNN00A:CH/HDD1 SERVICE off
0850NNN00A:CH/HDD1 OK2RM off
0850NNN00A:CH/HDD2 SERVICE off
0850NNN00A:CH/HDD2 OK2RM off
0850NNN00A:CH/HDD3 SERVICE off
0850NNN00A:CH/HDD3 OK2RM off
============================ FRU Status ============================
Location Name Status
------------------------------------------------------
0850NNN00A:CH/FT0/FM0 FAN enabled
0850NNN00A:CH/FT0/FM1 FAN enabled
0850NNN00A:CH/FT0/FM2 FAN enabled
0850NNN00A:CH/FT2 FAN enabled
0850NNN00A:CH SC enabled
0850NNN00A:CH IOBD enabled
0850NNN00A:CH MB enabled
0850NNN00A:CH/MB/CMP0/CH0/R0/D0 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH0/R0/D1 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH0/R1/D0 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH0/R1/D1 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH1/R0/D0 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH1/R0/D1 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH1/R1/D0 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH1/R1/D1 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH2/R0/D0 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH2/R0/D1 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH2/R1/D0 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH2/R1/D1 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH3/R0/D0 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH3/R0/D1 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH3/R1/D0 DIMM enabled
0850NNN00A:CH/MB/CMP0/CH3/R1/D1 DIMM enabled
0850NNN00A:CH PDB enabled
0850NNN00A:CH FIOBD enabled
0850NNN00A:CH SASBP enabled
0850NNN00A:CH/PS0 PS enabled
0850NNN00A:CH/PS1 PS enabled
============================ FW Version ============================
Version
------------------------------------------------------------
System Firmware 6.7.4 2009/06/10 13:32
====================== System PROM revisions =======================
Version
------------------------------------------------------------
OBP 4.30.3 2009/06/08 13:29
There is nothing worth looking at in messages: just "SYS_FAN at FT0/FM2 (also FM0, FM1) has FAILED." and the boot-time messages.
sc> showenvironment
=============== Environmental Status ===============
--------------------------------------------------------------------------------
System Temperatures (Temperatures in Celsius):
--------------------------------------------------------------------------------
Sensor Status Temp LowHard LowSoft LowWarn HighWarn HighSoft HighHard
--------------------------------------------------------------------------------
PDB/T_AMB OK 24 -10 -5 0 45 50 55
MB/T_AMB OK 26 -10 -5 0 50 55 60
MB/CMP0/T_TCORE OK 43 -10 -5 0 85 90 95
MB/CMP0/T_BCORE OK 44 -10 -5 0 85 90 95
IOBD/IOB/T_CORE OK 45 -10 -5 0 95 100 105
IOBD/T_AMB OK 27 -10 -5 0 52 57 62
--------------------------------------------------------
System Indicator Status:
--------------------------------------------------------
SYS/LOCATE SYS/SERVICE SYS/ACT
OFF OFF ON
--------------------------------------------------------
SYS/REAR_FAULT SYS/TEMP_FAULT SYS/TOP_FAN_FAULT
OFF OFF OFF
--------------------------------------------------------
--------------------------------------------
System Disks:
--------------------------------------------
Disk Status Service OK2RM
--------------------------------------------
HDD0 OK OFF OFF
HDD1 OK OFF OFF
HDD2 OK OFF OFF
HDD3 OK OFF OFF
---------------------------------------------------
Fans Status:
---------------------------------------------------
Fans (Speeds Revolution Per Minute):
Sensor Status Speed Warn Low
----------------------------------------------------------
FT0/FM0 OK 3683 -- 1920
FT0/FM1 OK 3525 -- 1920
FT0/FM2 OK 3618 -- 1920
FT2 OK 2713 -- 1900
----------------------------------------------------------
--------------------------------------------------------------------------------
Voltage sensors (in Volts):
--------------------------------------------------------------------------------
Sensor Status Voltage LowSoft LowWarn HighWarn HighSoft
--------------------------------------------------------------------------------
MB/V_+1V5 OK 1.48 1.36 1.39 1.60 1.63
MB/V_VMEML OK 1.78 1.63 1.67 1.92 1.98
MB/V_VMEMR OK 1.79 1.63 1.67 1.92 1.98
MB/V_VTTL OK 0.87 0.81 0.83 0.96 0.99
MB/V_VTTR OK 0.87 0.81 0.83 0.96 0.99
MB/V_+3V3STBY OK 3.31 3.13 3.16 3.53 3.59
MB/V_VCORE OK 1.27 1.17 1.21 1.32 1.34
IOBD/V_+1V5 OK 1.48 1.36 1.39 1.60 1.63
IOBD/V_+1V8 OK 1.78 1.63 1.67 1.92 1.96
IOBD/V_+3V3MAIN OK 3.34 3.06 3.10 3.49 3.53
IOBD/V_+3V3STBY OK 3.31 3.13 3.16 3.53 3.59
IOBD/V_+1V OK 1.18 1.09 1.11 1.28 1.30
IOBD/V_+1V2 OK 1.16 1.09 1.11 1.28 1.30
IOBD/V_+5V OK 5.12 4.55 4.75 5.35 5.45
IOBD/V_-12V OK -12.04 -13.08 -12.84 -11.16 -10.92
IOBD/V_+12V OK 12.00 10.92 11.16 12.84 13.08
SC/BAT/V_BAT OK 2.85 -- 2.25 -- --
-----------------------------------------------------------
System Load (in amps):
-----------------------------------------------------------
Sensor Status Load Warn Shutdown
-----------------------------------------------------------
MB/I_VCORE OK 29.040 80.000 88.000
MB/I_VMEML OK 6.420 60.000 66.000
MB/I_VMEMR OK 5.820 60.000 66.000
-----------------------------------------------------------
----------------------
Current sensors:
----------------------
Sensor Status
----------------------
IOBD/I_USB0 OK
IOBD/I_USB1 OK
NULL_HANDLE OK
------------------------------------------------------------------------------
Power Supplies:
------------------------------------------------------------------------------
Supply Status Underspeed Overtemp Overvolt Undervolt Overcurrent
------------------------------------------------------------------------------
PS0 OK OFF OFF OFF OFF OFF
PS1 OK OFF OFF OFF OFF OFF
showfaults -v reports no faults.
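To put a number on "the fans look healthy", here is a minimal Python sketch that reads the four fan rows from the showenvironment output above and computes how far each reading sits above its Low threshold. The column order (Sensor, Status, Speed, Warn, Low) is taken from the table header; the name fan_margins is made up for illustration. It only looks at this one snapshot, so it says nothing about what the fans were doing at the moments the errors were logged.

```python
import re

# Fan rows copied from the showenvironment output above.
FAN_TABLE = """\
FT0/FM0 OK 3683 -- 1920
FT0/FM1 OK 3525 -- 1920
FT0/FM2 OK 3618 -- 1920
FT2 OK 2713 -- 1900
"""

# Columns: Sensor, Status, Speed (RPM), Warn, Low (as in the table header).
ROW = re.compile(r"^(\S+)\s+(\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")


def fan_margins(text):
    """Return {sensor: (rpm, low_limit, rpm - low_limit)} for each fan row."""
    margins = {}
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if not match:
            continue  # skip separators and anything that is not a fan row
        sensor, _status, rpm, _warn, low = match.groups()
        rpm, low = int(rpm), int(low)
        margins[sensor] = (rpm, low, rpm - low)
    return margins


if __name__ == "__main__":
    for sensor, (rpm, low, margin) in fan_margins(FAN_TABLE).items():
        print(f"{sensor:8s} rpm={rpm} low={low} margin={margin}")
```

In this snapshot every fan is well above its Low value (the smallest margin is FT2 at 813 RPM), which matches the "OK" status column.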
showlogs -v
... Everything above is the same fan errors; only this last bit at the very bottom mentions anything useful.
DEC 07 03:06:24: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 03:13:34: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 03:20:08: 00040066: "SYS_FAN at FT0/FM1 has FAILED."
DEC 07 03:21:42: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 03:44:43: 00040066: "SYS_FAN at FT0/FM1 has FAILED."
DEC 07 03:45:33: 00040066: "SYS_FAN at FT0/FM0 has FAILED."
DEC 07 03:53:18: 00040066: "SYS_FAN at FT0/FM1 has FAILED."
DEC 07 04:12:08: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 04:21:57: 00040066: "SYS_FAN at FT0/FM1 has FAILED."
DEC 07 04:21:57: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 04:21:57: 00040088: "SC initiating soft host system shutdown due to insufficient fan cooling."
DEC 07 04:21:58: 00040000: "SC Request to Power Off Host."
DEC 07 04:23:04: 00040029: "Host system has shut down."
DEC 07 06:08:29: 00040002: "Host System has Reset"
DEC 07 06:11:13: 00040029: "Host system has shut down."
DEC 07 06:11:18: 00040002: "Host System has Reset"
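Since the log is mostly repeats, a small tally makes the pattern easier to see. The following is a minimal Python sketch, assuming the line layout shown in the excerpt above (timestamp, event code, quoted message); the function name summarize is invented for illustration. It counts the failure entries per fan, lists timestamps where more than one fan was flagged, and records when the "insufficient fan cooling" shutdown was logged.

```python
import re
from collections import Counter, defaultdict

# Log lines copied from the showlogs -v excerpt above.
LOG = '''\
DEC 07 03:06:24: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 03:13:34: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 03:20:08: 00040066: "SYS_FAN at FT0/FM1 has FAILED."
DEC 07 03:21:42: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 03:44:43: 00040066: "SYS_FAN at FT0/FM1 has FAILED."
DEC 07 03:45:33: 00040066: "SYS_FAN at FT0/FM0 has FAILED."
DEC 07 03:53:18: 00040066: "SYS_FAN at FT0/FM1 has FAILED."
DEC 07 04:12:08: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 04:21:57: 00040066: "SYS_FAN at FT0/FM1 has FAILED."
DEC 07 04:21:57: 00040066: "SYS_FAN at FT0/FM2 has FAILED."
DEC 07 04:21:57: 00040088: "SC initiating soft host system shutdown due to insufficient fan cooling."
DEC 07 04:21:58: 00040000: "SC Request to Power Off Host."
'''

# Assumed layout: <MON DD HH:MM:SS>: <8-character code>: "<message>"
ENTRY = re.compile(r'^(\w{3} \d{2} \d{2}:\d{2}:\d{2}): (\w{8}): "(.*)"$')
FAN_FAIL = re.compile(r"SYS_FAN at (\S+) has FAILED\.")


def summarize(log_text):
    """Return (failures per fan, timestamps with several fans, shutdown timestamps)."""
    per_fan = Counter()
    by_time = defaultdict(list)
    shutdowns = []
    for line in log_text.splitlines():
        entry = ENTRY.match(line.strip())
        if not entry:
            continue  # ignore anything that is not a log entry
        stamp, _code, message = entry.groups()
        fan = FAN_FAIL.search(message)
        if fan:
            per_fan[fan.group(1)] += 1
            by_time[stamp].append(fan.group(1))
        elif "insufficient fan cooling" in message:
            shutdowns.append(stamp)
    multi = {t: fans for t, fans in by_time.items() if len(fans) > 1}
    return per_fan, multi, shutdowns


if __name__ == "__main__":
    per_fan, multi, shutdowns = summarize(LOG)
    print("failures per fan:", dict(per_fan))
    print("several fans flagged at once:", multi)
    print("cooling shutdowns:", shutdowns)
```

On this excerpt, FT0/FM2 is flagged five times, FT0/FM1 four times and FT0/FM0 once, and the only timestamp with two fans flagged together (04:21:57) is also the timestamp of the shutdown entry. Whether the SC actually uses "two fans failed at once" as its shutdown rule is an inference from these few lines, not something the log states.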
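That should be about enough command output. Any suggestions, folks? The fans have clearly been replaced, yet the problem remains, and it isn't just one machine behaving this strangely but two. On site all the LEDs are green, and with the top fan cover open the three fans are spinning, so the chance that the fan connectors are at fault seems close to zero. What other causes could there be? Do the fans really stop spinning when "SYS_FAN at FT0/FM2 has FAILED." is reported? Why is this happening? Please enlighten me. (Just say which other command output you need.)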
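It is normal for the fans to report this error at power-on. From the information you've collected, the fans and the rest of the hardware are fine.
It may be a bug; I'd suggest upgrading the firmware or applying system patches.
In practice, power-on or not, it keeps reporting this error while the system is running.
If it's a bug, it would be at the OS level at most, so how could an OS bug make the physical fans stop spinning? This setup consists of several T2000s, and right now two of them report this. All of them were installed at the same time, so the OS should be identical on each. Is there some element of chance here?
If all the fans have problems, then the fan power board has failed: the board that supplies power to the fans has gone bad. It has chips on it. Replace it and you're done.
OP, are you sure you haven't mixed things up? Hopefully not.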
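SYS_FAN is the fan at the rear.
The TOP fan you mention is the one at the front that you can see by opening the top cover.
Reply to wenmaming_cu:
The module you mean is the small (long, strip-shaped) board the three fans plug into, right? You're saying that little thing may have failed. If nothing else works, I'll give it a try.
Reply to wait空白: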
You may not be familiar with the T2000. The log says "SYS_FAN at FT0/FM2 has FAILED", and FT0 is those three fans, the three top fans, the ones I can look at whenever I open the cover. By the way, does the T2000 have a dedicated cooling fan at the back as well? Maybe I never paid attention to it; I thought there were only the power supply fans.
The rear fan is FT2.
It is very likely a fan power board failure; I've run into this situation before!