Chinaunix
Original poster: gutou888

RHEL 6 two-node cluster: both hosts shut down during fencing

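#11
Posted 2012-11-05 09:53
sleepcat wrote on 2012-11-02 23:20:
Is there only one heartbeat link?

Or is it bonded, and you unplugged both cables?


Only one heartbeat link, no bonding. The heartbeat is a direct connection between the two nodes.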

#12
Posted 2012-11-05 10:05
PinkOrient wrote on 2012-11-04 21:46:
Reply to #1 gutou888


The heartbeat is a direct connection. Also, do fence messages go out through the heartbeat interface?

#13
Posted 2012-11-05 10:06
yjs_sh wrote on 2012-11-02 15:59:
Try configuring a quorum disk.


There is no shared disk, though...

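#14
Posted 2012-11-05 10:08
q1208c wrote on 2012-11-04 23:15:
Did you unplug the heartbeat cable while the cluster was running normally?
What you describe looks a bit like the case where both machines are slave and there is no primary ...


clustat -l shows a normal status, including the active/standby roles, and stopping a resource triggers a normal failover. Only fencing misbehaves. Also, how do I configure fencing to reboot rather than shut down?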

#15
Posted 2012-11-05 10:58
Last edited by PinkOrient on 2012-11-06 09:48

Reply to #11 gutou888


http://infrastructureadventures.com/tag/fencing/

Heartbeat

Each node in the cluster sends out a multicast heartbeat that tells the other members of the cluster that it is alive and healthy. By default, a cluster node will consider another node dead if it misses the heartbeat from that node for 10 seconds.
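
As a rough illustration of the timeout rule just described, here is a minimal Python sketch of a heartbeat monitor. It is not how the Red Hat cluster stack implements membership. The class name `HeartbeatMonitor`, its methods, and the node names are hypothetical, and only the 10-second default comes from the text above.

```python
class HeartbeatMonitor:
    """Toy failure detector: a peer counts as dead once no heartbeat
    has arrived from it for more than `timeout` seconds."""

    def __init__(self, timeout=10.0):
        self.timeout = timeout   # 10 s default, as stated in the quoted text
        self.last_seen = {}      # node name -> time of its last heartbeat

    def record_heartbeat(self, node, now):
        self.last_seen[node] = now

    def is_dead(self, node, now):
        # Assumption of this sketch: a node never heard from is treated as dead.
        last = self.last_seen.get(node)
        return last is None or (now - last) > self.timeout


monitor = HeartbeatMonitor()
monitor.record_heartbeat("node2", now=0.0)
alive_at_5 = not monitor.is_dead("node2", now=5.0)   # within the timeout
dead_at_11 = monitor.is_dead("node2", now=11.0)      # timeout exceeded
```

Timestamps are passed in explicitly so the behaviour can be checked without waiting on a real clock.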

The interface used for heartbeats is configured in the cluster.conf file (see the configuration section for more details). When discussing cluster configuration with Red Hat support, they strongly recommended that a cross-connect between the two nodes not be used, and that an interface connected to a switch on a private VLAN be used for heartbeats instead. They also recommended that this be the same interface used to initiate fencing (see below).

Fencing

One of the strategies used by Red Hat clusters to prevent split brain is a concept called fencing.

While there are several different types of fencing, fencing via the HP iLO devices (or similar) built into the servers is the recommended method. With this type of fencing, when the passive node stops receiving heartbeats from the active node, it connects to the iLO of the active node and reboots it. Once the passive node has rebooted (i.e. fenced) the active node, it starts the cluster services.

By rebooting the active node, the passive node can be sure that the active node is no longer running the cluster services and that it is safe to start them.

From a design point of view the NIC used to connect to the iLO of a node’s partner server is the NIC that should also be used for heartbeat. This ensures that the node that lost its connection to the heartbeat network cannot fence its partner server.


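Your fence device IPs are probably in the same VLAN as the service IPs. During a split brain the heartbeat is down, but both sides can still fence each other through the service VLAN. Pay attention to the recommendation highlighted in red in the quote above.
Do not use a direct connection for the heartbeat. Put the heartbeat and fencing in the same private VLAN and it should be fine. As for why you get a shutdown instead of a reboot, log in to your fence device and check its log to see what fencing request the other side sent.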
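
To make the reasoning concrete, here is a small Python model of who gets fenced in a two-node cluster. It is only a sketch under simplifying assumptions: it ignores timing, fence delays, and real fence agents, and the function `fenced_nodes` and the names `node1`/`node2` are made up for illustration.

```python
def fenced_nodes(heartbeat_up, can_reach_fence):
    """Return the set of nodes that get fenced in a two-node cluster.

    heartbeat_up: True if the nodes still see each other's heartbeats.
    can_reach_fence: dict mapping each node to True if it can reach its
        partner's fence device (for example an iLO).
    """
    if heartbeat_up:
        return set()  # nobody suspects anybody, so nobody fences
    partner = {"node1": "node2", "node2": "node1"}
    # Every node that lost the heartbeat tries to fence its partner,
    # and succeeds only if it can reach the partner's fence device.
    return {partner[n] for n, ok in can_reach_fence.items() if ok}


# Direct-connect heartbeat cable pulled, fence devices still reachable
# over the service VLAN: each node fences the other.
separate_paths = fenced_nodes(False, {"node1": True, "node2": True})

# Heartbeat and fencing share one NIC and node1's link goes down:
# node1 cannot reach node2's fence device, so only node1 is fenced.
shared_path = fenced_nodes(False, {"node1": False, "node2": True})
```

In the first case both nodes end up fenced, which matches the symptom reported in this thread. In the second case only the node that lost its link is fenced.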

#16
Posted 2012-11-06 09:59
Solved. Just as the previous poster said, the heartbeat must not be a direct connection.