[Storage & Backup] Replacing a disk in a 7133 D40 (procedure + hands-on walkthrough)

Posted 2009-09-18 15:34
IBM 7133 disk array: disk replacement procedure

A. On every RS6000 host in the SSA loop that the 7133 is attached to, check errpt and mail for error reports, and record the failing pdisk, the SRN, and related details. Once every host has been checked, carry out the steps below on just one of them, for example the host that is actually using the array.

B. Run smitty ssaraid to enter the SSA menus and check the current array state: is any array degraded, and has a hot spare disk been taken into use?

C. Find the hdisk that corresponds to the array containing the failing disk, and determine that array's RAID type.

D. Use lspv to find the VG that the affected hdisk belongs to, then run lsvg -l *vg (substituting the VG name) to check that every LV in the VG is syncd.

E. Once the array state and the error reports have identified the faulty pdisk, run lscfg -vl pdisk* (substituting the pdisk name) to read its VPD, and note the FRU number, serial number, and so on, so that the IBM engineer can order the replacement part.

F. If the faulty pdisk has already been marked failed in the array, skip to H. If the pdisk reporting errors is still in use, continue with the next step.

G. Run smitty ssaraid and select
   Change Member Disks in an SSA RAID Array -> Remove a Disk From an SSA RAID Array
to remove the failing pdisk from the array. The array state then changes to degraded.

H. Run diag and select
   Task selection->SSA->set service mode
then pick the failing pdisk and put it into service mode.

I. Locate the failed disk by the yellow indicator light that is now lit on the 7133, pull the disk out steadily, wait 5 s, and insert the new disk steadily.

J. On every IBM host in this 7133 SSA link, run rmdev -dl pdisk* to delete the definition of the failed pdisk (note that the pdisk numbering is not the same on every host), then run cfgmgr to rediscover devices. Check that the new disk has been found, and run lscfg -vl pdisk* to confirm that its details look normal.

K. In the smitty ssaraid menus, select
   Change/Show Use of an SSA Physical Disk
If the hot spare is in use, set the new pdisk to hot spare and skip to step M. If there is no hot spare, set the new pdisk to array candidate and continue with the next step.

L. With no hot spare, the new pdisk has to be added to the degraded array by hand. In the SSA menus select
   Change Member Disks in an SSA RAID Array -> Swap Members of an SSA RAID Array
and choose, in order: the array with the failure, the blank entry left behind by the removed pdisk, and the new pdisk. After this runs, the new pdisk joins the array and the rebuild starts.

M. Once the rebuild has started, go back to the SSA menus on every IBM host in the SSA link and check
   List All Defined SSA RAID Arrays
   List Status Of All Defined SSA RAID Arrays
to confirm that the array is in the rebuilding state and to see how far it has progressed.

N. Re-check errpt, mail, and the ssa link status in diag on every host for anything abnormal. Errors such as ssa open link that were logged in errpt while the disk was being swapped are normal.

O. Wait patiently until the rebuild completes and the array shows good. That completes the work.

Hands-on walkthrough:

(1) Fault information. The 7133 holds twelve 73.4 GB disks, and every disk status light was green. Both attached hosts had logged errors. The initial diagnosis was a problem with either a disk or the SSA adapter.

IBM,7028-6C4
---------------------------------------------------------------------------
LABEL:           SSA_DEVICE_ERROR
IDENTIFIER:      FE9E9357

Date/Time:       Fri Sep 18 00:00:00 beij
Sequence Number: 43354
Machine Id:      0001C79E4C00
Node Id:         BJBJ_BS_USSD_SV02
Class:           H
Type:            PERM
Resource Name:   ssa0
Resource Class:  adapter
Resource Type:   ssa160
Location:        1H-08
VPD:
        Part Number................. 27H1204
        FRU Number.................. 34L5388
        Serial Number...............S3352180
        EC Level.................... E28793
        Manufacturer................IBM053
        ROS Level and ID............BD00 0000
        Loadable Microcode Level....05
        Device Driver Level.........00
        Displayable Message.........SSA-ADAPTER
        Device Specific.(Z0)........SDRAM=128
        Device Specific.(Z1)........CACHE=32
        Device Specific.(Z2)........UID=00B006B80000C018

Description
DISK OPERATION ERROR

Probable Causes
DASD DEVICE

Failure Causes
DISK DRIVE

        Recommended Actions
        PERFORM PROBLEM DETERMINATION PROCEDURES

Detail Data
ERROR CODE
0440 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
---------------------------------------------------------------------------
LABEL:           SSA_ARRAY_ERROR
IDENTIFIER:      B4C00618

Date/Time:       Fri Sep 18 00:00:00 beij
Sequence Number: 43353
Machine Id:      0001C79E4C00
Node Id:         BJBJ_BS_USSD_SV02
Class:           H
Type:            PERM
Resource Name:   ssa0
Resource Class:  adapter
Resource Type:   ssa160
Location:        1H-08
VPD:
        Part Number................. 27H1204
        FRU Number.................. 34L5388
        Serial Number...............S3352180
        EC Level.................... E28793
        Manufacturer................IBM053
        ROS Level and ID............BD00 0000
        Loadable Microcode Level....05
        Device Driver Level.........00
        Displayable Message.........SSA-ADAPTER
        Device Specific.(Z0)........SDRAM=128
        Device Specific.(Z1)........CACHE=32
        Device Specific.(Z2)........UID=00B006B80000C018

Description
RESOURCE UNAVAILABLE

Probable Causes
DASD DEVICE

Failure Causes
DISK DRIVE

        Recommended Actions
        PERFORM PROBLEM DETERMINATION PROCEDURES

Detail Data
SENSE DATA
0490 0001 4337 3545 3431 3945 4346 3731 3443 4F00 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
---------------------------------------------------------------------------

(2) Log in to the host and check the SSA array status.

Components
Primary          Secondary  Status
hdisk2                      degraded
pdisk0           pdisk6     good
pdisk1           pdisk7     good
pdisk2           pdisk8     good
pdisk3           pdisk9     good
pdisk4           pdisk10    good
BlankReservedAZ  pdisk11    degraded

hdisk2           C75E419ECF714CO  system  degraded  437.4GB  raid_10
pdisk0           00B006227F6500D  member  present   72.9GB   Physical disk
pdisk6           00B006A2779900D  member  present   72.9GB   Physical disk
pdisk1           00B00622800900D  member  present   72.9GB   Physical disk
pdisk7           00B006A278A600D  member  present   72.9GB   Physical disk
pdisk2           00B00642EE7200D  member  present   72.9GB   Physical disk
pdisk8           00B006BD12C200D  member  present   72.9GB   Physical disk
pdisk3           00B00642EE7600D  member  present   72.9GB   Physical disk
pdisk9           00B006BD19A200D  member  present   72.9GB   Physical disk
pdisk4           00B00642EE8000D  member  present   72.9GB   Physical disk
pdisk10          00B006BD19BF00D  member  present   72.9GB   Physical disk
BlankReservedAZ  BlankReservedAZ  Missing member disk
pdisk11          00B006BD19D200D  member  present   72.9GB   Physical disk

List Rejected Array Disks shows nothing.
List Array Candidate Disks shows:

pdisk5           00B00642F29700D  free    good      72.9GB

# Secondary Disks
pdisk6           06A27799  1H-08-630E-02-P  present      72.9GB
pdisk7           06A278A6  1H-08-630E-01-P  present      72.9GB
pdisk8           06BD12C2  1H-08-630E-06-P  present      72.9GB
pdisk9           06BD19A2  1H-08-630E-12-P  present      72.9GB
pdisk10          06BD19BF  1H-08-630E-13-P  present      72.9GB
pdisk11          06BD19D2  1H-08-630E-16-P  present      72.9GB
# Primary Disks
pdisk0           06227F65  1H-08-630E-08-P  present      72.9GB
pdisk1           06228009  1H-08-630E-04-P  present      72.9GB
pdisk2           0642EE72  1H-08-630E-07-P  present      72.9GB
pdisk3           0642EE76  1H-08-630E-03-P  present      72.9GB
pdisk4           0642EE80  1H-08-630E-05-P  present      72.9GB
BlankReservedAZ                             not_present

Under Change Use of Multiple SSA Physical Disks, the Swap Members of an SSA RAID Array panel shows:

SSA RAID Manager                    ssa0
SSA RAID Array                      hdisk2
RAID Array Type                     raid_10
Connection Address / Array Name     C75E419ECF714CO
* Disk to Remove                  + (shows BlankReservedAZ)
* Disk to Add                       (shows nothing)

After confirming that pdisk5 was the failed disk, I ran Set Service Mode on it and found the physical position of pdisk5 by its lit yellow light. I pulled the disk, waited 5 seconds, and physically swapped in the new one. I then exited service mode and, on both hosts, ran:

# rmdev -dl pdisk5
pdisk5 deleted

From here, carry on with the remaining steps of the standard procedure above.
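The mirrored-pair listing in step (2) is the quickest way to see which RAID-10 pair has lost a member. As a rough illustration only, here is a minimal Python sketch that scans such a listing and returns the pairs whose status is not good. The function name degraded_pairs and the constant SAMPLE_LISTING are hypothetical, the sample text is copied from the output in this post, and the parser assumes whitespace-separated Primary / Secondary / Status columns, which may not hold for other adapter or AIX levels.

```python
# Minimal sketch: find RAID-10 pairs that are not "good" in a pasted
# SSA array component listing. Names here are hypothetical helpers,
# not part of any IBM tool.

SAMPLE_LISTING = """\
Primary Secondary Status
hdisk2 degraded
pdisk0 pdisk6 good
pdisk1 pdisk7 good
pdisk2 pdisk8 good
pdisk3 pdisk9 good
pdisk4 pdisk10 good
BlankReservedAZ pdisk11 degraded
"""


def degraded_pairs(listing):
    """Return (primary, secondary) tuples for every pair row whose status is not 'good'."""
    pairs = []
    for line in listing.splitlines():
        fields = line.split()
        # Pair rows have exactly three columns; the array summary row
        # ("hdisk2 degraded") has two and is skipped.
        if len(fields) != 3:
            continue
        primary, secondary, status = fields
        # Skip the column header row.
        if status.lower() == "status":
            continue
        if status != "good":
            pairs.append((primary, secondary))
    return pairs


if __name__ == "__main__":
    for primary, secondary in degraded_pairs(SAMPLE_LISTING):
        print("degraded pair:", primary, secondary)
```

On the listing from this post, the only pair reported is the one whose primary slot reads BlankReservedAZ, which matches the missing member that the Swap Members panel offers as the disk to remove.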


This article comes from a ChinaUnix blog. To read the original, see: http://blog.chinaunix.net/u3/102136/showart_2055386.html