Chinaunix

标题: 谁知道SUNV4-8001-N4这个solaris的sp诊断信息代表什么意思吗? [打印本页]

作者: wto4883204    时间: 2014-07-21 16:02
标题: 谁知道SUNV4-8001-N4这个solaris的sp诊断信息代表什么意思吗?
服务器自动关机了,SP只诊断到上面的这条信息。已经有两次关机的历史了。
作者: 东方蜘蛛    时间: 2014-07-21 16:09
-> show /SP/logs/event/list
-> show -d properties -level all /SP/faultmgmt
贴出来看看~
作者: wto4883204    时间: 2014-07-21 18:31
-> show -d properties -level all /SP/faultmgmt
  /SP/faultmgmt
    Properties:

  /SP/faultmgmt/0
    Properties:
        fru = /SYS/MB

  /SP/faultmgmt/0/faults
    Properties:

  /SP/faultmgmt/0/faults/0
    Properties:
        sunw-msg-id = SUN4V-8001-N4
        uuid = b3ff888f-e088-6d7a-edc6-965c7573876f
        timestamp = Jul 21 15:11:15

  /SP/faultmgmt/0/faults/1
    Properties:
        sunw-msg-id = PCIEX-8000-3S
        uuid = e8f92a4a-915b-c980-991f-cf8a09954e63
        timestamp = Jul 21 15:11:15

另外event的以下(数量太多了,我只截了一小部分):

ID     Date/Time                 Class     Type      Severity
-----  ------------------------  --------  --------  --------
434    Mon Jul 21 16:32:09 2014  Audit     Log       minor
       root : Open Session : object = /session/type : value = shell : success
433    Mon Jul 21 15:28:27 2014  Audit     Log       minor
       root : Close Session : object = /session/type : value = www : success
432    Mon Jul 21 15:12:19 2014  Audit     Log       minor
       root : Open Session : object = /session/type : value = www : success
431    Mon Jul 21 15:11:27 2014  Audit     Log       minor
       root : Open Session : object = /session/type : value = shell : success
430    Mon Jul 21 15:11:16 2014  Chassis   Log       major
       Host detected fault, MSGID: SUN4V-8001-N4
429    Mon Jul 21 15:11:15 2014  Chassis   Log       major
       Host detected fault, MSGID: PCIEX-8000-3S
428    Mon Jul 21 15:08:56 2014  Audit     Log       minor
       root : Open Session : object = /session/type : value = www : success
427    Mon Jul 21 15:06:54 2014  Audit     Log       minor
       root : Set : object = /SYS/MB/clear_fault_action : value = true : success
426    Mon Jul 21 14:59:21 2014  Chassis   Log       major
       Host is running
425    Mon Jul 21 14:58:13 2014  Audit     Log       minor
       root : Open Session : object = /session/type : value = www : success

Paused: press any key to continue, or 'q' to quit
424    Mon Jul 21 14:55:06 2014  Chassis   Log       major

       Host has been powered on
423    Mon Jul 21 14:55:04 2014  Audit     Log       minor
       root : Set : object = /SYS/power_state : value = on : success
422    Mon Jul 21 14:54:52 2014  Audit     Log       minor
       root : Open Session : object = /session/type : value = shell : success
421    Mon Jul 21 14:54:41 2014  Chassis   Log       major
       Host detected fault, MSGID: SUN4V-8001-N4
420    Mon Jul 21 14:54:40 2014  Chassis   Log       major
       Host detected fault, MSGID: PCIEX-8000-3S
419    Mon Jul 21 14:52:33 2014  Reset     Log       major
       Reset of /SP initiated by root.  Success unless failure noted.
418    Mon Jul 21 14:52:10 2014  Audit     Log       minor
       root : Set : object = /SYS/MB/clear_fault_action : value = true : success
417    Mon Jul 21 14:34:00 2014  Audit     Log       minor
       root : Set : object = /clock/datetime : value = 072114342014 : success


回复 2# 东方蜘蛛


   
作者: wto4883204    时间: 2014-07-21 19:06
两次都是发生了这件事后才关机的,似乎有关联:

回复 2# 东方蜘蛛


   

微信图片_20140721185105139.png (3.67 MB, 下载次数: 232)

微信图片_20140721185105139.png

作者: watchsat    时间: 2014-07-22 00:31
The Message ID:  SUN4V-8001-N4indicates a problem in the interconnect between the UltraSPARC T2 Plus processors as a result of an uncorrectable link error. There will have been a system reset. The nature of the fault does not allow an automated attempt to disable the affected resource.

Use the command  fmdump -v -u   with the    from the PSH console message to locate the suspected faulty components. For this fault, the fault manager identifies each thread in the processor as faulty as shown...
作者: watchsat    时间: 2014-07-22 00:33
簡單的說, CPU掛了...
作者: z191837954    时间: 2014-08-14 11:16
回复 2# 东方蜘蛛

-> show faulty
Target              | Property               | Value                           
--------------------+------------------------+---------------------------------
/SP/faultmgmt/0     | fru                    | /SYS/MB                         
/SP/faultmgmt/0     | timestamp              | Aug 13 05:07:39                 
/SP/faultmgmt/0/    | sunw-msg-id            | SUN4-8000-0Y                    
 faults/0           |                        |                                 
/SP/faultmgmt/0/    | uuid                   | 3ee32d84-b8ae-ec68-f79b-97ca643d
 faults/0           |                        | 7f6a                            
/SP/faultmgmt/0/    | timestamp              | Aug 13 05:07:39                 
 faults/0           |                        |        



    /SP/faultmgmt
    Properties:

  /SP/faultmgmt/0
    Properties:
        fru = /SYS/MB
        timestamp = Aug 13 05:07:39

  /SP/faultmgmt/0/faults
    Properties:

  /SP/faultmgmt/0/faults/0
    Properties:
        sunw-msg-id = SUN4-8000-0Y
        uuid = 3ee32d84-b8ae-ec68-f79b-97ca643d7f6a
        timestamp = Aug 13 05:07:39

求教

作者: z191837954    时间: 2014-08-14 11:30
回复 2# 东方蜘蛛


   
-> ls

/SP/faultmgmt
    Targets:
        0 (/SYS/MB)

    Properties:

    Commands:
        cd
        show

-> start /SP/faultmgmt/shell
start: Invalid target /SP/faultmgmt/shell

-> cd /
/

-> start /SP/faultmgmt/shell
start: Invalid target /SP/faultmgmt/shell

-> fmadm faulty
Invalid command 'fmadm' - type help for a list of commands.

-> fmadm repair
Invalid command 'fmadm' - type help for a list of commands.

-> cd /
/

-> show -d properties -level all /SP/faultmgmt
  /SP/faultmgmt
    Properties:

  /SP/faultmgmt/0
    Properties:
        fru = /SYS/MB
        timestamp = Aug 13 05:07:39

  /SP/faultmgmt/0/faults
    Properties:

  /SP/faultmgmt/0/faults/0
    Properties:
        sunw-msg-id = SUN4-8000-0Y
        uuid = 3ee32d84-b8ae-ec68-f79b-97ca643d7f6a
        timestamp = Aug 13 05:07:39


-> show

/
    Targets:
        HOST
        SYS
        SP

    Properties:

    Commands:
        cd
        show

->  show faultmgmt
show: No such target /faultmgmt

->  show faultmgmt
show: No such target /faultmgmt

-> show /SP/faultmgmt

/SP/faultmgmt
    Targets:
        0 (/SYS/MB)

    Properties:

    Commands:
        cd
        show

-> show /SP/faultmgmt

/SP/faultmgmt
    Targets:
        0 (/SYS/MB)

    Properties:

    Commands:
        cd
        show

-> set /SP/faultmgmt/0 component_stat=enabled
set: No such property /SP/faultmgmt/0/component_stat

-> set /SP/faultmgmt/0 clear_fault_action=true
set: No such property /SP/faultmgmt/0/clear_fault_action

-> show /SP/faultmgmt

/SP/faultmgmt
    Targets:
        0 (/SYS/MB)

    Properties:

    Commands:
        cd
        show

-> cd  /SP/faultmgmt/0
/SP/faultmgmt/0

-> ls

/SP/faultmgmt/0
    Targets:
        faults

    Properties:
        fru = /SYS/MB
        timestamp = Aug 13 05:07:39

    Commands:
        cd
        show

->  /SP/faultmgmt/0/faults
Invalid command '/SP/faultmgmt/0/faults' - type help for a list of commands.

-> cd  /SP/faultmgmt/0/faults
/SP/faultmgmt/0/faults

-> ls

/SP/faultmgmt/0/faults
    Targets:
        0 ()

    Properties:

    Commands:
        cd
        show

-> cd /SP/faultmgmt/0/faults/0
/SP/faultmgmt/0/faults/0

-> ls

/SP/faultmgmt/0/faults/0
    Targets:

    Properties:
        sunw-msg-id = SUN4-8000-0Y
        uuid = 3ee32d84-b8ae-ec68-f79b-97ca643d7f6a
        timestamp = Aug 13 05:07:39

    Commands:
        cd
        show

->
作者: z191837954    时间: 2014-08-14 11:47
回复 2# 东方蜘蛛


     show /SP/faultmgmt

/SP/faultmgmt
    Targets:
        0 (/SYS/MB)

    Properties:

    Commands:
        cd
        show

-> show /SP/faultmgmt

/SP/faultmgmt
    Targets:
        0 (/SYS/MB)

    Properties:

    Commands:
        cd
        show

-> set /SP/faultmgmt/0 component_stat=enabled
set: No such property /SP/faultmgmt/0/component_stat

-> set /SP/faultmgmt/0 clear_fault_action=true
set: No such property /SP/faultmgmt/0/clear_fault_action

-> show /SP/faultmgmt

/SP/faultmgmt
    Targets:
        0 (/SYS/MB)

    Properties:

    Commands:
        cd
        show

-> cd  /SP/faultmgmt/0
/SP/faultmgmt/0

-> ls

/SP/faultmgmt/0
    Targets:
        faults

    Properties:
        fru = /SYS/MB
        timestamp = Aug 13 05:07:39

    Commands:
        cd
        show

->  /SP/faultmgmt/0/faults
Invalid command '/SP/faultmgmt/0/faults' - type help for a list of commands.

-> cd  /SP/faultmgmt/0/faults
/SP/faultmgmt/0/faults

-> ls

/SP/faultmgmt/0/faults
    Targets:
        0 ()

    Properties:

    Commands:
        cd
        show

-> cd /SP/faultmgmt/0/faults/0
/SP/faultmgmt/0/faults/0

-> ls

/SP/faultmgmt/0/faults/0
    Targets:

    Properties:
        sunw-msg-id = SUN4-8000-0Y
        uuid = 3ee32d84-b8ae-ec68-f79b-97ca643d7f6a
        timestamp = Aug 13 05:07:39

    Commands:
        cd
        show
作者: DC_楚楚    时间: 2014-08-14 12:32
回复 9# z191837954


    0、有啥问题自己开贴

    1、什么机器
   
    2、微码版本

    3、-> show fualty

    4、-> show /SP/logs/event/list

    5、-> show -d properties -level all /SP/faultmgmt



作者: znnnz    时间: 2014-08-14 13:11
10楼处处好NB啊
作者: linyunxingxi    时间: 2019-04-12 15:06
我们机房的机器也是这样了  是CPU坏了 主板的




欢迎光临 Chinaunix (http://bbs.chinaunix.net/) Powered by Discuz! X3.2