免费注册 查看新帖 |

Chinaunix

  平台 论坛 博客 文库
最近访问板块 发新帖
查看: 6778 | 回复: 7
打印 上一主题 下一主题

系统中in.mpathd错误,这主要是什么问题,如何解决,谢谢 [复制链接]

论坛徽章:
0
跳转到指定楼层
1 [收藏(0)] [报告]
发表于 2009-03-31 12:22 |只看该作者 |倒序浏览
系统中in.mpathd错误,这主要是什么问题,如何解决,谢谢
产生频率较高

Mar 30 14:09:12 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 43393 ms on (inet ce2) for group \"ipmp0\"
Mar 30 14:09:14 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 21696 ms on (inet ce2) for group \"ipmp0\"
Mar 30 14:09:14 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 10848 ms on (inet ce0) for group \"ipmp0\"
Mar 30 14:09:15 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 10000 ms on (inet ce2) for group \"ipmp0\"
Mar 30 14:09:48 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 14:09:48 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 14:38:01 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 14:38:01 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 14:46:57 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 14:46:57 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 30 14:53:44 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 30 14:53:44 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 30 19:54:23 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 30 19:54:23 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 02:55:37 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 02:55:37 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 08:24:20 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 08:24:20 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 09:12:10 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 09:12:10 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 09:13:17 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 09:13:17 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 10:24:45 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 10:24:45 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 10:24:49 sunserver in.mpathd[157]: [ID 585766 daemon.error] Cannot meet requested failure detection time of 10000 ms on (inet ce0) new failure detection time for group \"ipmp0\" is 156296 ms
Mar 31 10:26:45 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 78148 ms on (inet ce0) for group \"ipmp0\"
Mar 31 10:27:17 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 39074 ms on (inet ce0) for group \"ipmp0\"
Mar 31 10:27:45 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 19537 ms on (inet ce0) for group \"ipmp0\"
Mar 31 10:27:52 sunserver in.mpathd[157]: [ID 302819 daemon.error] Improved failure detection time 10000 ms on (inet ce0) for group \"ipmp0\"
Mar 31 10:28:14 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 10:28:14 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 10:29:59 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 10:29:59 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 10:30:35 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 10:30:35 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2
Mar 31 10:39:28 sunserver in.mpathd[157]: [ID 299542 daemon.error] NIC repair detected on ce0 of group ipmp0
Mar 31 10:39:28 sunserver in.mpathd[157]: [ID 620804 daemon.error] Successfully failed back to NIC ce0
Mar 31 10:41:49 sunserver in.mpathd[157]: [ID 594170 daemon.error] NIC failure detected on ce0 of group ipmp0
Mar 31 10:41:49 sunserver in.mpathd[157]: [ID 832587 daemon.error] Successfully failed over from NIC ce0 to NIC ce2

论坛徽章:
0
2 [报告]
发表于 2009-03-31 12:22 |只看该作者
其它的一些信息

root@sunserver # netstat -in
Name  Mtu  Net/Dest      Address        Ipkts  Ierrs Opkts  Oerrs Collis Queue
lo0   8232 127.0.0.0     127.0.0.1      135764928 0     135764928 0     0      0     
ce0   1500 192.2.1.0     192.2.1.42     1196263598 0     2580404654 0     0      0     
ce2   1500 192.2.1.0     192.2.1.10     2203615821 0     4085668554 0     0      0     


root@sunserver # ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu 8232 index 1
        inet 127.0.0.1 netmask ff000000
ce0: flags=19040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER,FAILED> mtu 1500 index 2
        inet 192.2.1.42 netmask ffffff00 broadcast 192.2.1.255
        groupname ipmp0
        ether 0:14:4f:47:97:28
ce2: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 3
        inet 192.2.1.10 netmask ffffff00 broadcast 192.2.1.255
        groupname ipmp0
        ether 0:14:4f:47:97:28
ce2:1: flags=9040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER> mtu 1500 index 3
        inet 192.2.1.41 netmask ffffff00 broadcast 192.2.1.255
root@sunserver #



root@sunserver # more hostname.ce0
sun1-test-ce1 deprecated -failover netmask + broadcast + group ipmp0 up
root@sunserver # more hostname.ce2
sunserver netmask + broadcast + group ipmp0 up \\
addif sun1-test-ce0 deprecated -failover netmask + broadcast + up



root@sunserver # more /etc/hosts
#
# Internet host table
#
127.0.0.1       localhost      
192.2.1.10      sunserver       sunserver.com   loghost
192.2.1.41      sun1-test-ce0
192.2.1.42      sun1-test-ce1
root@sunserver #

论坛徽章:
0
3 [报告]
发表于 2009-03-31 12:23 |只看该作者
是物理网卡有问题吗?或者是网络有问题吗?
处理的步骤是什么?谢谢
ping 192.2.1.10 有丢包

论坛徽章:
0
4 [报告]
发表于 2009-03-31 12:33 |只看该作者
我查了一下
在配置了IPMP的系统,原因是网络负载过大,ICMP包不能在指定的时间内返回,建议增加IPMP的缺省延时配置后重

启IPMP进程即可减少报错的次数。
可以修改/etc/default/mpathd 文件
将变量FAILURE_DETECTION_TIME 的值增加就行了

/mpathd 文件已经修改了,
如何重启IPMP进程

论坛徽章:
0
5 [报告]
发表于 2009-03-31 14:03 |只看该作者
4# 道可道非常道



系统日志
1. Mar 31 10:24:45 NIC repair detected on ce0 of group ipmp0
2. Mar 31 10:24:45 Successfully failed back to NIC ce0
3.Mar 31 10:24:49 Cannot meet requested failure detection time of 10000 ms on (inet ce0) new failure detection time for group \"ipmp0\" is 156296 ms
4. Mar 31 10:26:45 Improved failure detection time 78148 ms on (inet ce0) for group \"ipmp0\"
Mar 31 10:27:17 Improved failure detection time 39074 ms on (inet ce0) for group \"ipmp0\"
5.Mar 31 10:28:14 NIC failure detected on ce0 of group ipmp0
6.Mar 31 10:28:14 Successfully failed over from NIC ce0 to NIC ce2
7.Mar 31 10:29:59 NIC repair detected on ce0 of group ipmp0
8.Mar 31 10:29:59 Successfully failed back to NIC ce0

1.ipmp0中的ce0修复了
2.fail back 到 ce0
3.超过IPMP的缺省延时10000ms
4. Improved failure detection time on (inet ce0) for group \"ipmp0\"
5. ipmp0中的ce0错误被探测到
6.网卡从ce0 fail over 到ce2
循环又开始了
7. .ipmp0中的ce0修复了
8.fail back 到 ce0

论坛徽章:
0
6 [报告]
发表于 2009-04-01 09:08 |只看该作者
没有解决啊,就算调整到30000ms,也是会报错的啊
根源是什么,是网卡问题,还是网络问题

论坛徽章:
0
7 [报告]
发表于 2010-09-08 10:43 |只看该作者
有可能是网络的问题。

Oracle的官方诊断描述(http://docs.sun.com/app/docs/doc/816-0211/6m6nc66sm?a=view):
Cannot meet requested failure detection time of time ms on (inet[6] interface_name) new failure detection is time ms.

The round trip time for ICMP probes is higher than the specified failure detection time. The network is probably congested or the probe targets are loaded. in.mpathd automatically increases the failure detection time to whatever it can achieve under these conditions.

Improved failure detection time time ms on (inet[6] interface_name).
The round trip time for ICMP probes has now decreased and in.mpathd has lowered the failure detection time correspondingly.

故障解释和处理办法:该报错是指,从报文发送到接收返回的时间比指定的检测时间要长。当时的网络很有可能存在堵塞的情况。在这种情况下,进程in.mpathd会自动增加故障检测时间。建议继续观察网络的通信情况,看看是否存在堵塞。

论坛徽章:
0
8 [报告]
发表于 2010-10-09 21:38 |只看该作者
配了路由,但主机无法ping通,会自己down掉
您需要登录后才可以回帖 登录 | 注册

本版积分规则 发表回复

  

北京盛拓优讯信息技术有限公司. 版权所有 京ICP备16024965号-6 北京市公安局海淀分局网监中心备案编号:11010802020122 niuxiaotong@pcpop.com 17352615567
未成年举报专区
中国互联网协会会员  联系我们:huangweiwei@itpub.net
感谢所有关心和支持过ChinaUnix的朋友们 转载本站内容请注明原作者名及出处

清除 Cookies - ChinaUnix - Archiver - WAP - TOP