- 论坛徽章:
- 0
|
在一个SAN网络中挂载有2个SUN的3510磁盘阵列,使用正常。
后来又接入了一个SUN的6140磁盘阵列,问题就来了:
1、最初是在应用中发现的,业务处理程序出现异常,磁盘IO非常高。
iostat -xtn 1 10
显示应用所在磁盘(是3510阵列中的一个逻辑盘)IO非常高。%b的值近乎100%
2、dmesg信息:
Apr 8 11:37:53 ZZ-K5-SMMC1-PUSH scsi: [ID 799468 kern.info] ssd32 at scsi_vhci0: name g600a0b800048213a0000041749dbbc54, bus address g600a0b800048213a0000041749dbbc54
Apr 8 11:37:53 ZZ-K5-SMMC1-PUSH genunix: [ID 936769 kern.info] ssd32 is /scsi_vhci/ssd@g600a0b800048213a0000041749dbbc54
Apr 8 11:37:54 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g600c0ff000000000099bd9626d053400 (ssd27):
Apr 8 11:37:54 ZZ-K5-SMMC1-PUSH Error for Command: write(10) Error Level: Retryable
Apr 8 11:37:54 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.notice] Requested Block: 46962192 Error Block: 46962192
Apr 8 11:37:54 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.notice] Vendor: SUN Serial Number: 626D0534-00
Apr 8 11:37:54 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.notice] Sense Key: Unit Attention
Apr 8 11:37:54 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.notice] ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0
Apr 8 11:37:54 ZZ-K5-SMMC1-PUSH genunix: [ID 408114 kern.info] /scsi_vhci/ssd@g600a0b800048213a0000041749dbbc54 (ssd32) online
Apr 8 11:37:54 ZZ-K5-SMMC1-PUSH genunix: [ID 834635 kern.info] /scsi_vhci/ssd@g600a0b800048213a0000041749dbbc54 (ssd32) multipath status: degraded, path /pci@7c0/pci@0/pci@1/pci@0,2/SUNW,qlc@1/fp@0,0 (fp0) to target address: w201300a0b848213a,0 is standby Load balancing: round-robin
Apr 8 11:37:55 ZZ-K5-SMMC1-PUSH fctl: [ID 517869 kern.warning] WARNING: fp(1)::N_x Port with D_ID=10700, PWWN=210000e08b85a411 reappeared in fabric
Apr 8 11:37:55 ZZ-K5-SMMC1-PUSH genunix: [ID 834635 kern.info] /scsi_vhci/ssd@g600a0b8000485f500000040349dbbc98 (ssd31) multipath status: optimal, path /pci@7c0/pci@0/pci@1/pci@0,2/SUNW,qlc@2/fp@0,0 (fp1) to target address: w201200a0b848213a,1 is standby Load balancing: round-robin
Apr 8 11:37:55 ZZ-K5-SMMC1-PUSH genunix: [ID 834635 kern.info] /scsi_vhci/ssd@g600a0b800048213a0000041749dbbc54 (ssd32) multipath status: optimal, path /pci@7c0/pci@0/pci@1/pci@0,2/SUNW,qlc@2/fp@0,0 (fp1) to target address: w201200a0b848213a,0 is online Load balancing: round-robin
Apr 8 11:37:56 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.warning] WARNING: /scsi_vhci/ssd@g600c0ff000000000099bd9626d053400 (ssd27):
Apr 8 11:37:56 ZZ-K5-SMMC1-PUSH Error for Command: write(10) Error Level: Retryable
Apr 8 11:37:56 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.notice] Requested Block: 46899968 Error Block: 46899968
Apr 8 11:37:56 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.notice] Vendor: SUN Serial Number: 626D0534-00
Apr 8 11:37:56 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.notice] Sense Key: Unit Attention
Apr 8 11:37:56 ZZ-K5-SMMC1-PUSH scsi: [ID 107833 kern.notice] ASC: 0x29 (power on, reset, or bus reset occurred), ASCQ: 0x0, FRU: 0x0
Apr 8 12:17:01 ZZ-K5-SMMC1-PUSH fctl: [ID 517869 kern.warning] WARNING: fp(1)::GPN_ID for D_ID=10d00 failed
Apr 8 12:17:01 ZZ-K5-SMMC1-PUSH fctl: [ID 517869 kern.warning] WARNING: fp(1)::N_x Port with D_ID=10d00, PWWN=201200a0b848213a disappeared from fabric
Apr 8 12:17:04 ZZ-K5-SMMC1-PUSH fctl: [ID 517869 kern.warning] WARNING: fp(0)::GPN_ID for D_ID=10d00 failed
Apr 8 12:17:04 ZZ-K5-SMMC1-PUSH fctl: [ID 517869 kern.warning] WARNING: fp(0)::N_x Port with D_ID=10d00, PWWN=201300a0b848213a disappeared from fabric
Apr 8 12:17:20 ZZ-K5-SMMC1-PUSH scsi: [ID 243001 kern.info] /pci@7c0/pci@0/pci@1/pci@0,2/SUNW,qlc@2/fp@0,0 (fcp1):
Apr 8 12:17:20 ZZ-K5-SMMC1-PUSH offlining lun=1 (trace=0), target=10d00 (trace=2800004)
Apr 8 12:17:20 ZZ-K5-SMMC1-PUSH scsi: [ID 243001 kern.info] /pci@7c0/pci@0/pci@1/pci@0,2/SUNW,qlc@2/fp@0,0 (fcp1):
Apr 8 12:17:20 ZZ-K5-SMMC1-PUSH offlining lun=1f (trace=0), target=10d00 (trace=2800004)
Apr 8 12:17:20 ZZ-K5-SMMC1-PUSH scsi: [ID 243001 kern.info] /pci@7c0/pci@0/pci@1/pci@0,2/SUNW,qlc@2/fp@0,0 (fcp1):
Apr 8 12:17:20 ZZ-K5-SMMC1-PUSH offlining lun=0 (trace=0), target=10d00 (trace=2800004)
后来把6140从SAN中拔掉,重启了3510阵列。恢复正常。
请高手帮忙分析一下,到底是什么原因导致了上述的异常?
谢谢。 |
|