Chinaunix

Title: Question about a heartbeat error! (Solved it myself)

Author: yakamoz1121    Time: 2009-04-23 11:07
/etc/ha.d/ha.cf
logfile /var/log/ha-log

debugfile /var/log/ha-debug

logfacility local0

#use_logd on

keepalive 2

deadtime 30

warntime 10

initdead 120

#baud 19200

#serial /dev/ttyS0

udpport 694

#bcast eth1

ucast eth0 172.16.0.160

#auto_failback on

node  node1
node  node2

ping 172.16.39.254

respawn root /usr/lib/heartbeat/pingd -m 100 -d 5s

crm yes

The situation: node1 runs normally, but node2 reports errors and cannot communicate.

Error messages and logs
On node2:
tail /var/log/ha-log
heartbeat[3910]: 2009/04/23_11:04:30 info: glib: ucast: bound receive socket to device: eth0
heartbeat[3910]: 2009/04/23_11:04:30 info: glib: ucast: started on port 694 interface eth0 to 172.16.0.160
heartbeat[3910]: 2009/04/23_11:04:30 info: glib: ping heartbeat started.
heartbeat[3910]: 2009/04/23_11:04:30 info: G_main_add_SignalHandler: Added signal handler for signal 17
heartbeat[3910]: 2009/04/23_11:04:30 info: Local status now set to: 'up'
heartbeat[3910]: 2009/04/23_11:04:31 info: Link 172.16.39.254:172.16.39.254 up.
heartbeat[3910]: 2009/04/23_11:04:31 info: Status update for node 172.16.39.254: status ping
heartbeat[3910]: 2009/04/23_11:04:31 info: Link drm_scheduler_active:eth0 up.
heartbeat[3910]: 2009/04/23_11:04:31 info: Status update for node drm_scheduler_active: status active
heartbeat[3910]: 2009/04/23_11:05:35 info: all clients are now paused
heartbeat[3910]: 2009/04/23_11:06:41 ERROR: Message hist queue is filling up (200 messages in queue)
heartbeat[3910]: 2009/04/23_11:06:42 ERROR: Message hist queue is filling up (200 messages in queue)
heartbeat[3910]: 2009/04/23_11:06:42 ERROR: Message hist queue is filling up (200 messages in queue)
heartbeat[3910]: 2009/04/23_11:06:43 ERROR: Message hist queue is filling up (200 messages in queue)
heartbeat[3910]: 2009/04/23_11:06:44 ERROR: Message hist queue is filling up (200 messages in queue)
heartbeat[3910]: 2009/04/23_11:06:44 ERROR: Message hist queue is filling up (200 messages in queue)

ha-debug shows:
heartbeat[3910]: 2009/04/23_11:13:30 debug:  
heartbeat[3910]: 2009/04/23_11:13:30 debug: hist->ackseq =0
heartbeat[3910]: 2009/04/23_11:13:30 debug: hist->lowseq =613, hist->hiseq=813
heartbeat[3910]: 2009/04/23_11:13:30 debug: expecting from node1
heartbeat[3910]: 2009/04/23_11:13:30 debug: it's ackseq=0

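
One possible reading of these debug lines, offered as an assumption and not confirmed against the heartbeat source: the gap between hist->hiseq and hist->lowseq is 813 - 613 = 200, which matches the "200 messages in queue" in the ERROR lines, and ackseq=0 suggests node1 has never acknowledged anything node2 sent. A small sketch that does the subtraction on a ha-debug line (hist_span is a hypothetical helper name, not part of heartbeat):

```shell
#!/bin/sh
# hist_span: read ha-debug lines on stdin and print hiseq - lowseq
# for every line that carries both values.
hist_span() {
    sed -n 's/.*hist->lowseq =\([0-9][0-9]*\), hist->hiseq=\([0-9][0-9]*\).*/\1 \2/p' |
    while read -r low high; do
        echo $((high - low))
    done
}

# The line from the log above; prints 200.
echo "heartbeat[3910]: 2009/04/23_11:13:30 debug: hist->lowseq =613, hist->hiseq=813" | hist_span
```

If the printed span stays at 200 while ackseq stays at 0, that would be consistent with packets never reaching the peer, or its replies never coming back.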

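In the end I checked iptables and found that UDP port 694 was not allowed through (on neither machine, how embarrassing). Clearly I need to be more careful in my work!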
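
For anyone hitting the same symptom, here is a minimal sketch of rules that would let heartbeat traffic through. It only prints the iptables commands so they can be reviewed first. The defaults (peer 172.16.0.160, port 694, interface eth0) are taken from the ucast and udpport lines in this thread; the variable names HB_PEER, HB_PORT, HB_IFACE and the function name are my own assumptions, and the existing ruleset on your machines may need the rules in a different position.

```shell
#!/bin/sh
# Dry run: prints iptables commands instead of executing them.
HB_PEER="${HB_PEER:-172.16.0.160}"   # address this node sends heartbeats to (ucast line)
HB_PORT="${HB_PORT:-694}"            # udpport from the config
HB_IFACE="${HB_IFACE:-eth0}"         # interface from the ucast line

heartbeat_rules() {
    # Accept heartbeat packets arriving from the peer.
    echo "iptables -I INPUT -i $HB_IFACE -p udp -s $HB_PEER --dport $HB_PORT -j ACCEPT"
    # Allow heartbeat packets leaving for the peer.
    echo "iptables -I OUTPUT -o $HB_IFACE -p udp -d $HB_PEER --dport $HB_PORT -j ACCEPT"
}

heartbeat_rules
```

Both nodes need the equivalent rules, each with HB_PEER set to the other node's address. After checking the output, the printed commands can be run as root. Rules added this way do not survive a reboot unless they are saved with whatever mechanism the distribution provides.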

[ Last edited by yakamoz1121 on 2009-4-23 13:48 ]
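Author: kns1024wh    Time: 2009-04-23 21:59
Title: Reply to #1, yakamoz1121's post
The first thing to do before testing is to clear the iptables rules, so that you are working in an unrestricted environment.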
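
A rough sketch of what clearing the rules could look like on a test machine. It prints the commands and does not run them. The backup path and the names HB_BACKUP and open_firewall_cmds are hypothetical; this assumes only the default filter table is in use.

```shell
#!/bin/sh
# Dry run: prints the commands instead of executing them.
HB_BACKUP="${HB_BACKUP:-/root/iptables.backup}"   # hypothetical backup location

open_firewall_cmds() {
    # Keep a copy of the current rules so they can be restored after testing.
    echo "iptables-save > $HB_BACKUP"
    # Set the default policies to ACCEPT first; flushing under a DROP policy
    # would block all traffic, including a remote SSH session.
    for chain in INPUT OUTPUT FORWARD; do
        echo "iptables -P $chain ACCEPT"
    done
    # Remove all rules from the filter table.
    echo "iptables -F"
}

open_firewall_cmds
```

Once testing is done, the saved rules can be brought back with iptables-restore reading from the backup file, and then only the ports the cluster actually needs should be opened.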



