免费注册 查看新帖 |

Chinaunix

  平台 论坛 博客 文库
最近访问板块 发新帖
查看: 7299 | 回复: 9
打印 上一主题 下一主题

[小机硬件] P5系列硬件PD练习第一题 [复制链接]

论坛徽章:
0
跳转到指定楼层
1 [收藏(0)] [报告]
发表于 2007-11-19 17:20 |只看该作者 |倒序浏览
P561新装机,加电报错,代码为B1551380,ASMI中有如下log纪录
Error/Event Logs
Platform Event Log - 5000241C  
Created at : 11/17/2007 17:15:10  
Driver Name : fips240/b0413a_0715.240  
Subsystem : CEC Hardware: VPD Interface  
Event Severity : Unrecoverable Error  
Action Flags : Report to Operating System  
Service Action Required  
HMC Call Home  
Service Processor Call Home Required  
Action Status : Processed  
Primary System Reference Code  
Reference Code : B1551380  
Hex Words 2 - 5 : 020000F0 28DA4410 C1009023 200000FF  
Hex Words 6 - 9 : 00000006 00060052 000F00C0 00000000  
Maintenance Procedure Required  
Priority : Medium Priority  
Procedure Number : FSPSP41  
Maintenance Procedure Required  
Priority : Lowest priority replacement  
Procedure Number : FSPSP04  
Maintenance Procedure Required  
Priority : Lowest priority replacement  
Procedure Number : FSPSP06  
Log Hex Dump  
  00000000        50480030  01002104  20071117  17151056         PH.0..!. ......V  
  00000010        20071117  17151059  4500010D  00000000          ......YE.......  
  00000020        00000000  00000000  5000241C  5000241C         ........P.$.P.$.  
  00000030        55480018  01004F30  55034000  0000F962         UH....O0U.@....b  
  00000040        0000A902  01005000  50530084  01012104         ......P.PS....!.  
  00000050        02010009  0000007C  020000F0  28DA4410         .......|....(.D.  
  00000060        C1009023  200000FF  00000006  00060052         ...# ..........R  
  00000070        000F00C0  00000000  42313535  31333830         ........B1551380  
  00000080        20202020  20202020  20202020  20202020                           
  00000090        20202020  20202020  C000000D  10284D00                 .....(M.  
  000000A0        49440C42  46535053  50343100  10284C00         ID.BFSPSP41..(L.  
  000000B0        49440C42  46535053  50303400  10284C00         ID.BFSPSP04..(L.  
  000000C0        49440C42  46535053  50303600  55440094         ID.BFSPSP06.UD..  
  000000D0        02044400  00000286  2F6F7074  2F666970         ..D...../opt/fip  
  000000E0        732F6269  6E2F6365  63736572  76657200         s/bin/cecserver.  
  000000F0        00000000  00000000  00000000  00000000         ................  
  00000100        00000000  00000000  00000000  00000000         ................  
  00000110        00000000  00000000  00000000  00000000         ................  
  00000120        00000000  00000000  00000000  66697073         ............fips  
  00000130        3234302F  62303431  33615F30  3731352E         240/b0413a_0715.  
  00000140        32343000  00000000  00000000  00000001         240.............  
  00000150        00000002  00000801  0000010D  00000006         ................  
  00000160        4D54001C  01004400  39313136  2D353631         MT....D.9116-561  
  00000170        36353032  43454500  00000000  5544003C         6502CEE.....UD.<
  00000180        01012104  00830006  02EF6001  600003F8         ..!.......`.`...  
  00000190        00040800  00005200  0402C001  0301C109         ......R.........  
  000001A0        00015200  0402C000  0301C1FF  00015200         ..R...........R.  
  000001B0        FF025200  0F020000  5544007C  01022104         ..R.....UD.|..!.  
  000001C0        706F7274  3D307836  2C206164  64726573         port=0x6, addres  
  000001D0        733D3078  63302C20  6875623D  30783532         s=0xc0, hub=0x52  
  000001E0        2C206875  62636861  6E6E656C  3D307866         , hubchannel=0xf  
  000001F0        2C206572  726E6F3D  362C206D  61736B31         , errno=6, mask1  
  00000200        3D307831  30302C20  6D61736B  323D3078         =0x100, mask2=0x  
  00000210        302C2077  72697465  666C6167  3D312C20         0, writeflag=1,   
  00000220        696E6465  783D3235  352C2068  75627479         index=255, hubty  
  00000230        70653D31  53570014  02017300  0000F7A0         pe=1SW....s.....  
  00000240        000B0008  73000B01  53570014  02016700         ....s...SW....g.  
  00000250        0000F503  0003012E  00000000  53570014         ............SW..  
  00000260        02017600  0000F962  00030003  00000000         ..v....b........  
  00000270        53570014  02017600  0000F962  00050001         SW....v....b....  
  00000280        00000000  53570014  02017600  0000F95D         ....SW....v....]  
  00000290        00060012  00000000  55440218  00006700         ........UD....g.  
  000002A0        FED00005  00066368  69636D64  20636869         ......chicmd chi  
  000002B0        63627569  6C64686F  6D636172  64732068         cbuildhomcards h  
  000002C0        6F6D5F73  79737465  6D202D74  202D6320         om_system -t -c   
  000002D0        63686963  6275696C  64686F6D  70726F63         chicbuildhomproc  
  000002E0        7320686F  6D5F7379  7374656D  202D7420         s hom_system -t   
  000002F0        2D632063  68696363  6C6F636B  5F746573         -c chicclock_tes  
  00000300        7420686F  6D5F7379  7374656D  202D7420         t hom_system -t   
  00000310        2D632063  68696342  75696C64  486F6D43         -c chicBuildHomC  
  00000320        68697073  20484F4D  5F535953  54454D20         hips HOM_SYSTEM   
  00000330        2D74202D  63206368  6963496E  6974486F         -t -c chicInitHo  
  00000340        6D537973  74656D20  484F4D5F  53595354         mSystem HOM_SYST  
  00000350        454D202D  74202D63  20636869  6356616C         EM -t -c chicVal  
  00000360        69646174  65486F6D  20484F4D  5F535953         idateHom HOM_SYS  
  00000370        54454D20  2D74202D  63206368  6963446F         TEM -t -c chicDo  
  00000380        47617264  20484F4D  5F535953  54454D20         Gard HOM_SYSTEM   
  00000390        2D74202D  63206368  69634672  65717565         -t -c chicFreque  
  000003A0        6E637943  6F6E7472  6F6C2048  4F4D5F53         ncyControl HOM_S  
  000003B0        59535445  4D202D74  202D6320  63686963         YSTEM -t -c chic  
  000003C0        496E6974  4A746167  20484F4D  5F535953         InitJtag HOM_SYS  
  000003D0        54454D20  2D74202D  63206368  69634D61         TEM -t -c chicMa  
  000003E0        736B4174  74656E74  696F6E73  20484F4D         skAttentions HOM  
  000003F0        5F535953  54454D20  2D74202D  63206368         _SYSTEM -t -c ch  
  00000400        69634365  63466C75  73682048  4F4D5F53         icCecFlush HOM_S  
  00000410        59535445  4D202D74  202D6320  63686963         YSTEM -t -c chic  
  00000420        53696261  74734465  7020484F  4D5F5359         SibatsDep HOM_SY  
  00000430        5354454D  202D7420  2D632063  6869634C         STEM -t -c chicL  
  00000440        62697374  20484F4D  5F535953  54454D20         bist HOM_SYSTEM   
  00000450        2D74202D  63206368  69634162  69737420         -t -c chicAbist   
  00000460        484F4D5F  53595354  454D202D  74202D63         HOM_SYSTEM -t -c  
  00000470        20636869  63417373  65745072  6F746563          chicAssetProtec  
  00000480        74696F6E  20484F4D  5F535953  54454D20         tion HOM_SYSTEM   
  00000490        2D74202D  63206368  69634D65  6D53697A         -t -c chicMemSiz  
  000004A0        6520484F  4D5F5359  5354454D  202D7400         e HOM_SYSTEM -t.  

问题:什么地方有问题,应该更换什么!?

论坛徽章:
0
2 [报告]
发表于 2007-11-19 17:21 |只看该作者
希望LZ多教教我们,硬件太不熟悉了

论坛徽章:
0
3 [报告]
发表于 2007-11-19 17:33 |只看该作者
别灌水,技术上想到什么说什么,就是别灌水别拍马屁别吹牛,下次见了直接删掉了。
如果通篇都是得到的CU想避免的这些东西,也不用费心开个什么讨论话题了,大家安心做好自己事情赚自己的一份钱就完事了。

论坛徽章:
0
4 [报告]
发表于 2007-11-19 19:17 |只看该作者
VPD卡?

论坛徽章:
0
5 [报告]
发表于 2007-11-19 19:40 |只看该作者
别只说个答案,可以聊聊你的分析过程,不管对错,至少是说明了可以采用的思路。

PD重要的是依据得到的信息逐步分析获得最终的结果,绝对不是靠猜,PD的功力更加不是看谁猜得准。

刚才还有人误以为我在利用这种方式变相的想解决我自己碰到的问题,呵呵,我只能说这绝对是个误会。

这个P5 PD的专题是我昨天提到的系列活动之一。

你可以选择无视,也可以选择旁观,没关系的,但是可以明确的是,大家在这个话题中的发挥和投入绝对会在其他的活动中获得优先权。

包括楼上兄弟的回复,我同样不会说对错。

我们的想法是把CU/AIX版面打造成为真正能够互相学习,互相帮助,一起提高的园地。希望这个目标没错!

论坛徽章:
1
荣誉版主
日期:2011-11-23 16:44:17
6 [报告]
发表于 2007-11-19 23:51 |只看该作者
先拿Medium Priority 的FSP41开刀,在信息中心搜一下

FSPSP41: Model 570

Shows how to isolate to a failing FRU.
To isolate to a failing FRU do the following:

1. Record words 3, 7, and 8 from the primary system reference code (SRC) of the error log.
28DA4410 00060052 000F00C0
2. Determine the enclosure that contains the service processor that started the I2C operation.
Note: The remainder of this procedure refers to this enclosure as the FSP enclosure.
Refer to the last byte in word 3 of the primary SRC to find the service processor enclosure:
         1. If the last byte is a 10, then the primary unit service processor started the I2C.
         2. If the last byte is a 20, then the secondary unit 1 service processor started the I2C.
         3. If the last byte is a 30, use one of the following methods to determine the enclosure:

If you have access to the Advanced System Management Interface (ASMI), log on and display the details of the service processor error log. Using the
Platform Event Log id (shown in the first table of each detail of the log), look at the first byte.
5000241C  
* If the first byte is a 50, then the primary unit service processor started the I2C.
* If the first byte is a 51, then the secondary unit 1 service processor started the I2C.

If you have access to an Hardware Management Console (HMC), log in as PE user and bring up Manage Serviceable Events under the Service Focal Point screen. Display the events for the corresponding service processor system, then double-click the system to see the details. Look at the Field Platform log ID, which contains a decimal value that you need to convert to a hexadecimal value.
* If the first byte of the hexadecimal value is a 50, then the primary unit service processor started the I2C.
* If the first byte of the hexadecimal value is a 51, then the secondary unit 1 service processor started the I2C.

3. Determine which enclosure contains the target I2C device.
      Note: The remainder of this procedure refers to the enclosure that contains the I2C device as the target enclosure.
      In the I2C Target enclosure table, find word 7 from the primary SRC. The enclosure listed beside word 7 is the target enclosures. Click the following link to display the I2C Target enclosure table: I2C Target enclosures for model 570 (4-way or 8-way through 16-way)
Word 7: Leftmost 2 bytes          Enclosure
0006                                          Secondary unit 1


Word 7                               Word 8                           Word 8                                          I2C FRU list
Rightmost 2 bytes         Leftmost 2 bytes       Rightmost 2 bytes          
0052                                000F                              00C0                                              1. Service processor card (if present in target enclosure)
                                                                                                                                       2. I/O backplane


俺在这里琢磨了很久,高人一语点破,second unit了都,那就是io backplane,惭愧惭愧

4. Does word 7 have a match in the I2C Target enclosure table?
          * Yes: Continue with the step 6.
          * No: Continue with the next step.

5. The right most 2 bytes of word 7 does not have a match in the I2C Target enclosure table. Use the following statements to determine which enclosure contains the target I2C device:
         1. If the right most 2 bytes of word 7 is either 0002 or 0003, then the FSP enclosure and the target enclosure are the same.
         2. If the right most 2 bytes of word 7 is either 0000 or 0001, then refer to the following conditions
               1. i. If the FSP enclosure is the primary unit, then the target enclosure is the secondary unit
               2. If the FSP enclosure is secondary unit 1, then the target enclosure is the primary unit.

6. Determine which parts you need to replace on the target enclosure.
         1. Using the following I2C FRU table for model 570, refer to words 7 and 8 of the primary SRC to locate the I2C FRU list. The I2C FRU list identifies the parts that you need to replace. Click the following link to display the I2C FRU table: I2C FRU table for model 570 (4-way or 8-way through 16-way)
         2. Did you find the I2C FRU list?
           * Yes: Continue with the next step.
                * No: Contact your next level of support. This ends the procedure.

7. Starting with the first FRU in the list that you located the I2C FRU table, do the following:.
         1. Power off the system.
         2. Replace the FRU on the target enclosure. Go to Finding part locations for location, part number, and exchange information.
         3. Go to Perform a slow boot, then continue with the next step.
         4. Is the problem resolved?
                * Yes: This ends the procedure.
                * No: Continue with the next step.

8. Have you replaced all of the FRUs in the list?
          * No: Repeat step 7 for the next FRU in the list.
          * Yes: Continue with the next step.

9. Replace the following FRUs, one at a time and in the order listed. After you replace each FRU, perform a slow boot until the problem is resolved. For more information, see Perform a slow boot.
         1. System/service processor cable connected from the service processor enclosure to the target enclosure
         2. Service processor on the service processor enclosure
         3. I/O backplane on the target enclosure
         4. I/O backplane on the service processor enclosure

10. Is the problem resolved?
          * Yes: This ends the procedure.
          * No: Contact your next level of support. This ends the procedure.

[ 本帖最后由 yddll 于 2007-11-20 01:39 编辑 ]

论坛徽章:
0
7 [报告]
发表于 2007-11-20 01:15 |只看该作者
1. 先从information center 中错误代码 B1551380 .
   " B1551380 Error trying to write register to device "

2. 从题目给的信息类型来看 , 直接go到 Service processor firmware reference code callouts .
    大致如下 :
     B1xx reference code indicates that an error occurred in a service processor firmware-detected operation. To resolve any B1xx reference code, use th  e following procedure:

Does the reference code have a format of B151 72xx or B181 72xx that displays during the power-on process after power was connected to the server but before the server reaches the Power On Standby state?

Yes: See the description in Table 1. (B1xx) Service processor early termination reference codes. Perform Failing Item FSPSP02.
No: Continue with the next step.

a.   Perform the following:
Using the callout list provided with the error replace the first or next FRU or perform the first or next isolation procedure in the list starting with the highest priority. For more information, see the following:
Finding part locations for links to removal and replacement information
(查找位置,有图 . 不好截图)

Service processor isolation procedures for links to service processor isolation procedures
从上面红色的信息来看 , 先从最高优先级来做 . 从给出的信息提示来看 ,是这个
Priority : Medium Priority  
Procedure Number : FSPSP41   

Service processor isolation procedures
These procedures help you to isolate problems with the service processor.

DANGER
When working on or around the system, observe the following precautions:
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard:
Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Do not open or service any power supply assembly.
Do not connect or disconnect any cables or perform installation, maintenance, or reconfiguration of this product during an electrical storm.
The product might be equipped with multiple power cords. To remove all hazardous voltages, disconnect all power cords.
Connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate.
Connect any equipment that will be attached to this product to properly wired outlets.
When possible, use one hand only to connect or disconnect signal cables.
Never turn on any equipment when there is evidence of fire, water, or structural damage.
Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures.
Connect and disconnect cables as described in the following procedures when installing, moving, or opening covers on this product or attached devices.
To Disconnect:
Turn off everything (unless instructed otherwise).
Remove the power cords from the outlets.
Remove the signal cables from the connectors.
Remove all cables from the devices
To Connect:
Turn off everything (unless instructed otherwise).
Attach all cables to the devices.
Attach the signal cables to the connectors.
Attach the power cords to the outlets.
Turn on the devices.

根据FSPSP41 这个型号 找到以下信息 (同样由于没有561的型号信息 , 我查看的是570的信息)
FSPSP41: Model 570
Use this procedure to help you isolate a problem that occurred when trying to access a chip on the I2C bus on model 570 (4-way or 8-way through 16-way).

1. Record words 3, 7, and 8 from the primary system reference code (SRC) of the error log.
   28DA4410 00060052 000F00C0

下面的分析 ,yddll已经分析的很细致了 .

2.  Determine the enclosure that contains the service processor that started the I2C operation.
Note: The remainder of this procedure refers to this enclosure as the FSP enclosure.
Refer to the last byte in word 3 of the primary SRC to find the service processor enclosure:
If the last byte is a 10, then the primary unit service processor started the I2C.
If the last byte is a 20, then the secondary unit 1 service processor started the I2C.
If the last byte is a 30, use one of the following methods to determine the enclosure:

If you have access to the Advanced System Management Interface (ASMI), log on and display the details of the service processor error log. Using the Platform Event Log id (shown in the first table of each detail of the log), look at the first byte.

If the first byte is a 50, then the primary unit service processor started the I2C.
If the first byte is a 51, then the secondary unit 1 service processor started the I2C.

If you have access to an Hardware Management Console (HMC), log in as PE user and bring up Manage Serviceable Events under the Service Focal Point screen. Display the events for the corresponding service processor system, then double-click the system to see the details. Look at the Field Platform log ID, which contains a decimal value that you need to convert to a hexadecimal value.

If the first byte of the hexadecimal value is a 50, then the primary unit service processor started the I2C.
If the first byte of the hexadecimal value is a 51, then the secondary unit 1 service processor started the I2C.

3. Determine which enclosure contains the target I2C device.
Note: The remainder of this procedure refers to the enclosure that contains the I2C device as the target enclosure.
In the I2C Target enclosure table, find word 7 from the primary SRC. The enclosure listed beside word 7 is the target enclosures. Click the following link to display the I2C Target enclosure table: I2C Target enclosures for model 570 (4-way or 8-way through 16-way)
Does word 7 have a match in the I2C Target enclosure table?
Yes: Continue with the step 6.
No: Continue with the next step.
The upper word of word 7 does not have a match in the I2C Target enclosure table. Use the following statements to determine which enclosure contains the target I2C device:
If the upper word of word 7 is either 0002 or 0003, then the FSP enclosure and the target enclosure are the same.
If the upper word of word 7 is either 0000 or 0001, then refer to the following conditions
i. If the FSP enclosure is the primary unit, then the target enclosure is the secondary unit
If the FSP enclosure is secondary unit 1, then the target enclosure is the primary unit.
Determine which parts you need to replace on the target enclosure.
Using the following I2C FRU table for model 570, refer to words 7 and 8 of the primary SRC to locate the I2C FRU list. The I2C FRU list identifies the parts that you need to replace. Click the following link to display the I2C FRU table: I2C FRU table for model 570 (4-way or 8-way through 16-way)
Did you find the I2C FRU list?
Yes: Continue with the next step.
No: Contact your next level of support. This ends the procedure.
Starting with the first FRU in the list that you located the I2C FRU table, do the following:.
Power off the system.
Replace the FRU on the target enclosure. Go to Finding part locations for location, part number, and exchange information.
Go to Perform a slow boot, then continue with the next step.
Is the problem resolved?
Yes: This ends the procedure.
No: Continue with the next step.
Have you replaced all of the FRUs in the list?
No: Repeat step 7 for the next FRU in the list.
Yes: Continue with the next step.
Replace the following FRUs, one at a time and in the order listed. After you replace each FRU, perform a slow boot until the problem is resolved. For more information, see Perform a slow boot.
System/service processor cable connected from the service processor enclosure to the target enclosure
Service processor on the service processor enclosure
I/O backplane on the target enclosure
I/O backplane on the service processor enclosure
Is the problem resolved?
Yes: This ends the procedure.
No: Contact your next level of support. This ends the procedure.

b .  Perform a slow-boot IPL.

   通过ASMI来设置启动 .

last :    Did the same error code appear?
No: Go to Verifying the repair. This ends the procedure.
Yes: Continue with the next step.

Is the new error code the same as the one that brought you to this procedure?
Yes: Replace the next FRU or perform the next procedure in the list.
No: Go to the Start of call procedure to service this new error code. This ends the procedure.

如果还是出现同样的错误代码 . 做另外个vpd的程序和上面一样  .
如果出现新的代码 , 要开个IBM call

Start of call procedure
Provides a starting point for analyzing problems. You should begin all service actions with this procedure.

This is the starting point for diagnosing and repairing servers. From this point, you will be guided to the appropriate information to help you diagnose server problems, determine the appropriate repair action, and then perform the necessary steps to repair the server. A system attention light indicates there is a serviceable event (an SRC in the control panel or in one of the serviceable event views) on the system. This procedure will guide you through finding the serviceable event.
Note: In this topic, control panel and operator panel are synonymous.
Before beginning, perform as many of the following tasks as possible:
Note: Ask the customer for the Using the problem reporting forms (available in the Customer service, support, and troubleshooting topic) that they should have completed when working with the support center.
Verify the symptoms and service call data, including:
The server machine type, model number, and serial number.
The customer problem number.
A reference code (SRC, SRN or progress code). If you do not have a reference code, ask the customer for the date and time of the problem.
Any available location code information
Any data stored in:
the service action event log in Service Focal Point
the Service Action Log (i5/OS&reg; servers)
other operating system-specific logs as directed
Symptoms reported to you by the customer.
Symptoms reported to you by the service support center.
Symptoms reported to you by an attached input/output (I/O) device, for example, a disk drive enclosure expansion drawer.
Record information to help you return the server to the same state that the customer typically uses, such as:
The IPL type that the customer typically uses for the server. (See Function 01: Display selected IPL type, system operating mode, and IPL speed for more information.)
The IPL mode that is used by the customer on this server.
The way in which the server is configured or partitioned.
Ensure that the customer has put the server into a state in which you can perform service tasks.
Is the failing component a monitor (display) or keyboard?
No:
Continue with the next step.
Yes:
Is the monitor or keyboard attached to a keyboard, video, mouse (KVM) switch?
No:
Continue with the next step.
Yes:
Go to Troubleshooting the keyboard, video, and mouse (KVM) switch for the 1x8 and 2x8 console manager.
Is the failing unit a 7037-A50 or a 7047-185?
No: Continue with the next step.
Yes: Go to Diagnosing a problem on a 7037-A50 or a 7047-185.
Is the failing server an xSeries or a BladeCenter blade server attached to an iSeries™ server through an iSCSI Host Bus Adapter (HBA)?
No: Continue with the next step.
Yes: Go to the iSeries integrated xSeries troubleshooting procedure.
Is there a Hardware Management Console (HMC) attached to the failing unit?
No: Continue with the next step.
Yes: Continue with step 7.
Is this an HMC-managed system?
Notes:
Look for HMC=x displayed in the control panel. The x represents a 0, 1, or 2, and depends on the number of HMCs attached to your server.
Ask the customer.
If the server does not have a control panel, then it should have an HMC.
No: Go to step 9.
Yes: Continue with the next step.
Inform the customer that an HMC is required to continue servicing the system, and ask the customer to reattach the HMC to the server. When the HMC is reattached to the managed system, continue with the next step.
Is the HMC functional?
No: Go to HMC problem isolation procedures. Once the HMC is functional, return here and continue with the next step.
Yes: Continue with the next step.
Perform the following steps from the HMC that is used to manage the server. During these steps, refer to the service data that was gathered earlier:
In the Navigation Area, open Service Applications.
Select Service Focal Point.
Select Repair Serviceable Event.
On the Select Failing System window, select the managed system that has the problem, and click OK.
Scroll through the log and verify that there is a problem with the status of Open to correspond with the customer's reported problem.
Note: If you are unable to locate the reported problem, and there is more than one open problem near the time of the reported failure, use the earliest problem in the log.
Do you find the reported problem, or an open problem near the time of the reported problem?
No: Continue with the next step.
Yes: Select the serviceable event you want to repair, and select Repair from the Selected menu.
This launches a series of windows that guides you through the steps to repair the serviceable event. The system guides you through one of the two following methods of repair, depending on the type of FRU you need to exchange:
An interactive step-by-step process that provides illustrations and video presentations to help you exchange the FRU.
A link to the appropriate information center topic that provides instructions to help you exchange the FRU.
After you complete the repair procedure, the system automatically closes the serviceable event. This ends the procedure.

Note: If the Repair procedures are not available, continue with the next step.
Is there an eight-digit reference code (except Cxxxxxxx) displayed in function 11 on the control panel, or if applicable on the HMC?
No: Continue with the next step.
Yes: Record all reference code data, including the values for functions 11 through 20 (see Collecting reference codes and system information), and note the failing partition. Then, go to the Reference codes topic to further isolate the problem. This ends the procedure.
Notes: If you are working at an HMC, check the HMC for operator panel values by performing the following steps:
In the Navigation Area, expand Server and Partition > Server Management.
Choose from the following options:
To view managed system reference codes:
In the right pane, right-click the system and select Properties.
Select the Reference Code tab to view the codes.
When finished, click Cancel.
To view logical partition reference codes:
In the right pane expand the system that contains the partition.
Open Partitions.
Right-click the logical partition and select Properties.
Select the Reference Code tab to view the codes.
When finished, click Cancel.
Is the system managed by the Integrated Virtualization Manager (IVM)?
Note: For information about the IVM, refer to Managing the Integrated Virtualization Manager
Yes: Go to step 13.
No: Continue with the next step.
Is an operating system console session available and can you log into the session?
Notes:
For servers with logical partitions, the console session must be available for the failing partition.
If you suspect a problem is on a device (a disk drive enclosure expansion drawer for example) that is attached to a server or a partition, the console session must be available for the partition to which the device is attached.
See 5250 Console in the Managing your server topic for details on accessing a 5250 console session on the HMC.
Yes: Go to step 14.
No: Continue with the next step.
Work with the customer to open an operating system console session. Can you successfully open an operating system console session?
Yes: Go to step 14.
No: Go to step 16.
Work with the customer to access the IVM web interface if it is not already running. Refer to Connecting to Integrated Virtualization Manager. Can you successfully open the console session for IVM?
Yes: Continue with the next step.
No: Go to step 16.
Use the operating system-specific, the IVM-specific, or the Virtual I/O server (VIOS)-specific service tools and system logs to locate a "serviceable event" in the logs, or if there is no serviceable event, use the customer-reported symptom in step 14.b.
Ask the customer for the date and time of the problem.
Note: If you are unable to locate the reported problem and there is more than one open problem near the time of the reported failure, use the earliest failure.
Go to the appropriate procedure depending on the operating system, IVM, or VIOS that is reporting a problem.
If your server is managed by IVM, go to Collect serviceable events in IVM, and then return here and continue on step 15.
If your server or partition is running AIX, go to AIX fast-path problem isolation.
If your server or partition is running Linux&reg;, go to Linux fast-path problem isolation.
If your server or partition is running i5/OS, go to Using the Service Action Log.
If your server or partition is running a Virtual I/O Server partition, go to Virtual I/O Server troubleshooting.
Did you find a reference code associated with this failure near the time of the reported problem?
No: Continue with the next step.
Yes: Collect all reference code data and note the failing partition. Go to the Reference codes topic to further isolate the problem. This ends the procedure.
Use the HMC or a web browser to access the Advanced System Management Interface (ASMI). See Accessing the Advanced System Management Interface for details. Then, perform the following steps:
Log into the ASMI.
Click the plus sign next to System Service Aids.
Select Error/Event Logs to view the service processor error log.
Note: For more information on using the ASMI, see Managing your server using the Advanced System Management Interface.
Were you able to access ASMI and is there a reference code in the service processor error log that requires service?

No: Continue with the next step.
Yes: Collect all reference code data and note the failing partition. Go to the Reference codes topic to further isolate the problem. This ends the procedure.
The following examples show the possible formats of reference codes that display during the power-on process:
C1xx xxxx codes are displayed during the time after power is connected to the server until the service processor initial power-on process completes (indicated by "01" in the upper-left corner of the physical or logical control panel).
C2xx xxxx and C7xx xxxx codes are displayed after the power-on sequence is initiated.
CAxx xxxx codes are displayed while an AIX or Linux partition is initializing its resources.
xxx, 0xxx, 2xxxx, or C6xx xxxx, C9xx xxxx codes are displayed during the time that the operating system boots and configures resources.
888-xxx code
Does the power-on process for the server appear to be stopped, and does function 11 display a reference code of a format shown in the list above?

No: Go to the Symptom index. This ends the procedure.
Yes: Collect all reference code data and note the failing partition. Go to the List of progress codes topic to further isolate the problem. This ends the procedure.
Diagnosing a problem on a 7037-A50 or a 7047-185
This is the entry procedure for diagnosing problems on a 7037-A50 or a 7047-185


PS : yddll , 看到你的更改了 .  自己还是没算到这一步 . 谢谢
       以后不能偷懒了 , 得检讨下自己了

[ 本帖最后由 金牌小卧底 于 2007-11-20 02:16 编辑 ]

论坛徽章:
0
8 [报告]
发表于 2007-11-20 01:24 |只看该作者
楼上两位亲爱的兄弟,咋又不说结论了呢!呵呵

论坛徽章:
0
9 [报告]
发表于 2007-11-25 22:15 |只看该作者
到第一题summary的时候了。

其实这个case是很明确的case,所需要的只是好好的走map,就能够找到问题的原因。

给出的信息已经足够分析判断了,只不过流程稍长且容易有误判断的岔路口。

需要明确的是map一定要follow information,而不是靠猜或者是估计、也许之类的。

避免了这个不好的习惯,PD过程会少走很多弯路。

论坛徽章:
0
10 [报告]
发表于 2008-06-24 01:08 |只看该作者
这种帖子值得顶,期望lz多搞些这样的case。

PS:6F,7F思路很清晰,过程详细,学习了。。。
您需要登录后才可以回帖 登录 | 注册

本版积分规则 发表回复

  

北京盛拓优讯信息技术有限公司. 版权所有 京ICP备16024965号-6 北京市公安局海淀分局网监中心备案编号:11010802020122 niuxiaotong@pcpop.com 17352615567
未成年举报专区
中国互联网协会会员  联系我们:huangweiwei@itpub.net
感谢所有关心和支持过ChinaUnix的朋友们 转载本站内容请注明原作者名及出处

清除 Cookies - ChinaUnix - Archiver - WAP - TOP