免费注册 查看新帖 |

Chinaunix

  平台 论坛 博客 文库
123下一页
最近访问板块 发新帖
查看: 15919 | 回复: 26
打印 上一主题 下一主题

系统中in.mpathd错误,这主要是什么问题,如何解决,谢谢 [复制链接]

论坛徽章:
0
跳转到指定楼层
1 [收藏(0)] [报告]
发表于 2009-03-31 12:10 |只看该作者 |倒序浏览
系统中in.mpathd错误,这主要是什么问题,如何解决,谢谢
产生频率较高


Mar 30 12:02:42 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 12:02:42 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 12:03:09 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 12:03:09 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 12:04:02 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 12:04:02 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 12:05:28 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 12:05:28 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 12:05:57 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 12:05:57 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 12:10:37 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 12:10:37 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 12:11:16 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 12:11:16 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 12:15:41 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 12:15:41 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 12:19:27 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 12:19:27 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 12:20:23 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 12:20:23 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 12:23:27 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 12:23:27 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 12:23:42 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 12:23:42 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 12:27:29 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 12:27:29 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 12:31:18 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 12:31:18 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 13:06:22 sunserver in.mpathd[157]: [ID 585766 daemon.error] Cannot meet requested failure detection time of 10000 ms on (inet ce2) new failure detection time for group "ipmp0" is 23786 ms
Mar 30 13:07:23 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 11893 ms on (inet ce2) for group "ipmp0"
Mar 30 13:07:23 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 10000 ms on (inet ce0) for group "ipmp0"
Mar 30 13:54:39 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 13:54:39 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 13:59:33 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 13:59:33 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 14:00:02 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 14:00:02 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 14:04:10 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 14:04:10 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 14:05:12 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 14:05:12 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 14:08:10 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 14:08:10 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 14:08:12 sunserver in.mpathd[157]: [ID 585766 daemon.error] Cannot meet requested failure detection time of 10000 ms on (inet ce0) new failure detection time for group "ipmp0" is 86786 ms
Mar 30 14:09:12 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 43393 ms on (inet ce2) for group "ipmp0"
Mar 30 14:09:14 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 21696 ms on (inet ce2) for group "ipmp0"
Mar 30 14:09:14 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 10848 ms on (inet ce0) for group "ipmp0"
Mar 30 14:09:15 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 10000 ms on (inet ce2) for group "ipmp0"
Mar 30 14:09:48 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 14:09:48 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 14:38:01 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 14:38:01 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 14:46:57 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 14:46:57 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 14:53:44 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 14:53:44 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 19:54:23 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 19:54:23 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 02:55:37 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 02:55:37 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 08:24:20 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 08:24:20 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 09:12:10 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 09:12:10 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 09:13:17 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 09:13:17 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 10:24:45 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 10:24:45 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 10:24:49 sunserver in.mpathd[157]: [ID 585766 daemon.error] Cannot meet requested failure detection time of 10000 ms on (inet ce0) new failure detection time for group "ipmp0" is 156296 ms
Mar 31 10:26:45 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 78148 ms on (inet ce0) for group "ipmp0"
Mar 31 10:27:17 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 39074 ms on (inet ce0) for group "ipmp0"
Mar 31 10:27:45 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 19537 ms on (inet ce0) for group "ipmp0"
Mar 31 10:27:52 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 10000 ms on (inet ce0) for group "ipmp0"
Mar 31 10:28:14 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 10:28:14 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 10:29:59 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 10:29:59 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 10:30:35 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 10:30:35 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 10:39:28 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 10:39:28 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 10:41:49 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 10:41:49 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2

论坛徽章:
0
2 [报告]
发表于 2009-03-31 12:11 |只看该作者
其它的一些信息

root@sunserver # netstat -in
Name  Mtu  Net/Dest      Address        Ipkts  Ierrs Opkts  Oerrs Collis Queue
lo0   8232 127.0.0.0     127.0.0.1      135764928 0     135764928 0     0      0     
ce0   1500 192.2.1.0     192.2.1.42     1196263598 0     2580404654 0     0      0     
ce2   1500 192.2.1.0     192.2.1.10     2203615821 0     4085668554 0     0      0     


root@sunserver # ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu 8232 index 1
        inet 127.0.0.1 netmask ff000000
ce0: flags=19040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER,FAILED> mtu 1500 index 2
        inet 192.2.1.42 netmask ffffff00 broadcast 192.2.1.255
        groupname ipmp0
        ether 0:14:4f:47:97:28
ce2: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 3
        inet 192.2.1.10 netmask ffffff00 broadcast 192.2.1.255
        groupname ipmp0
        ether 0:14:4f:47:97:28
ce2:1: flags=9040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER> mtu 1500 index 3
        inet 192.2.1.41 netmask ffffff00 broadcast 192.2.1.255
root@sunserver #



root@sunserver # more hostname.ce0
sun1-test-ce1 deprecated -failover netmask + broadcast + group ipmp0 up
root@sunserver # more hostname.ce2
sunserver netmask + broadcast + group ipmp0 up \
addif sun1-test-ce0 deprecated -failover netmask + broadcast + up



root@sunserver # more /etc/hosts
#
# Internet host table
#
127.0.0.1       localhost      
192.2.1.10      sunserver       sunserver.com   loghost
192.2.1.41      sun1-test-ce0
192.2.1.42      sun1-test-ce1
root@sunserver #

论坛徽章:
0
3 [报告]
发表于 2009-03-31 12:12 |只看该作者
是物理网卡有问题吗?
处理的步骤是什么?谢谢

论坛徽章:
0
4 [报告]
发表于 2009-03-31 12:24 |只看该作者
ping 192.2.1.10 有丢包

论坛徽章:
0
5 [报告]
发表于 2009-03-31 12:33 |只看该作者
我查了一下
在配置了IPMP的系统,原因是网络负载过大,ICMP包不能在指定的时间内返回,建议增加IPMP的缺省延时配置后重

启IPMP进程即可减少报错的次数。
可以修改/etc/default/mpathd 文件
将变量FAILURE_DETECTION_TIME 的值增加就行了

/mpathd 文件已经修改了,
如何重启IPMP进程

论坛徽章:
0
6 [报告]
发表于 2009-03-31 14:04 |只看该作者
系统日志
1. Mar 31 10:24:45 NIC repair detected on ce0 of group ipmp0
2. Mar 31 10:24:45 Successfully failed back to NIC ce0
3.Mar 31 10:24:49 Cannot meet requested failure detection time of 10000 ms on (inet ce0) new failure detection time for group "ipmp0" is 156296 ms
4. Mar 31 10:26:45 Improved failure detection time 78148 ms on (inet ce0) for group "ipmp0"
Mar 31 10:27:17 Improved failure detection time 39074 ms on (inet ce0) for group "ipmp0"
5.Mar 31 10:28:14 NIC failure detected on ce0 of group ipmp0
6.Mar 31 10:28:14 Successfully failed over from NIC ce0 to NIC ce2
7.Mar 31 10:29:59 NIC repair detected on ce0 of group ipmp0
8.Mar 31 10:29:59 Successfully failed back to NIC ce0

1.ipmp0中的ce0修复了
2.fail back 到 ce0
3.超过IPMP的缺省延时10000ms
4. Improved failure detection time on (inet ce0) for group "ipmp0"
5. ipmp0中的ce0错误被探测到
6.网卡从ce0 fail over 到ce2
循环又开始了
7. .ipmp0中的ce0修复了
8.fail back 到 ce0

论坛徽章:
0
7 [报告]
发表于 2009-03-31 15:53 |只看该作者

回复 #5 道可道非常道 的帖子

看上去你已经找到答案了,只是不明白如何重启生效?好像没有单独重启IPMP服务的,你只能重启一下系统。

论坛徽章:
0
8 [报告]
发表于 2009-03-31 16:05 |只看该作者
原帖由 li_hunter 于 2009-3-31 15:53 发表
看上去你已经找到答案了,只是不明白如何重启生效?好像没有单独重启IPMP服务的,你只能重启一下系统。


没有啊,您有什么高见啊,谢谢

论坛徽章:
0
9 [报告]
发表于 2009-03-31 16:06 |只看该作者
重新启动 in.mpathd 守护进程。



# pkill -HUP in.mpathd

论坛徽章:
0
10 [报告]
发表于 2009-04-01 09:09 |只看该作者
没有解决啊,就算调整到30000ms,也是会报错的啊
根源是什么,是网卡问题,还是网络问题
您需要登录后才可以回帖 登录 | 注册

本版积分规则 发表回复

  

北京盛拓优讯信息技术有限公司. 版权所有 京ICP备16024965号-6 北京市公安局海淀分局网监中心备案编号:11010802020122 niuxiaotong@pcpop.com 17352615567
未成年举报专区
中国互联网协会会员  联系我们:huangweiwei@itpub.net
感谢所有关心和支持过ChinaUnix的朋友们 转载本站内容请注明原作者名及出处

清除 Cookies - ChinaUnix - Archiver - WAP - TOP