
Chinaunix

Original poster: lky

Red Hat cluster question

#11
Posted 2005-08-29 18:34


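Originally posted by "nntp":


The machine is hung and the OS isn't even working, so what could be holding the resources?

That's why you need heartbeat + quorum. Otherwise, what is the lock manager for?


It's not necessarily the OS that has stopped working.

So rebooting the node is the best option.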

#12
Posted 2005-08-29 20:02


As far as I know, Cluster Suite did not reboot nodes in the past. Could the original poster's quorum be misconfigured?

#13
Posted 2005-08-30 17:28


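Originally posted by "cnadl":
As far as I know, Cluster Suite did not reboot nodes in the past. Could the original poster's quorum be misconfigured?


I have tried this many times. The node reboots every time, both on physical machines and on VMware.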

#14
Posted 2005-08-30 17:30


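Originally posted by "cnadl":
As far as I know, Cluster Suite did not reboot nodes in the past. Could the original poster's quorum be misconfigured?


It does reboot.
When hosta cannot reach the quorum, or something like the tieroute, the watchdog makes it reboot itself. The other node then takes over, of course.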
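
To make the watchdog behaviour described above concrete, here is a minimal Python sketch. It is only a simulation under stated assumptions, not Red Hat Cluster Suite code: the `Watchdog` class, the `run_node` helper, the 3-second timeout, and the 1-second tick are all hypothetical. The idea it illustrates is that a node keeps "petting" a timer only while it can reach the quorum; once it loses access for longer than the timeout, the timer expires and the node resets itself.

```python
class Watchdog:
    """Simulated watchdog timer: it expires unless it is petted in time."""

    def __init__(self, timeout):
        self.timeout = timeout  # seconds allowed between pets (hypothetical value)
        self.last_pet = 0.0     # simulated time of the most recent pet

    def pet(self, now):
        """Record that the node proved it is healthy at time `now`."""
        self.last_pet = now

    def expired(self, now):
        """True once more than `timeout` seconds have passed since the last pet."""
        return now - self.last_pet > self.timeout


def run_node(quorum_reachable, timeout=3.0, tick=1.0):
    """Step through simulated time, one entry of `quorum_reachable` per tick.

    Returns the simulated time at which the node would reset itself,
    or None if the watchdog never expires during the trace.
    """
    dog = Watchdog(timeout)
    now = 0.0
    for reachable in quorum_reachable:
        now += tick
        if reachable:
            dog.pet(now)      # node is healthy: keep the watchdog quiet
        if dog.expired(now):
            return now        # watchdog fires: the node reboots itself
    return None


# hosta reaches the quorum for two ticks, then its cable is pulled.
print(run_node([True, True, False, False, False, False]))
# A healthy node is never reset.
print(run_node([True, True, True, True, True]))
```

In this sketch the reset happens with no warning from the node's point of view, which matches the "reboots with no message" symptom reported in this thread.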

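#15
Posted 2005-11-29 15:56
Originally posted by lky on 2005-8-29 15:51:
Failover now works when the network cable is unplugged, but the system immediately reboots the machine whose cable was pulled. It is strange: the machine is force-rebooted with no message at all, just as if someone had pressed the restart button on the host.



I am seeing the same thing. When I unplug the cable between the primary node and the switch, the service does move to the backup node, but the primary node reboots immediately. Is this abnormal, and how can it be fixed?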

#16
Posted 2005-12-01 00:57
Originally posted by q1208c on 2005-8-29 18:34:


It's not necessarily the OS that has stopped working.

So rebooting the node is the best option.



LOL... if rebooting were the silver bullet, why would RAC ever have made it to market?

For most mission-critical systems, rebooting means your boss is going to kick your ass.

Rebooting is the most stupid way to solve an HA problem.

[ Last edited by nntp on 2005-12-1 00:58 ]

#17
Posted 2005-12-01 01:06
Originally posted by q1208c on 2005-8-30 17:30:
Originally posted by "cnadl":
As far as I know, Cluster Suite did not reboot nodes in the past. Could the original poster's quorum be misconfigured?


It does reboot.
When hosta cannot reach the quorum, or something like the tieroute, the watchdog makes it ...



This is because Red Hat Cluster is a silly HA cluster system that was developed on the basis of Kimberlite.
If you are really familiar with HA systems, you will know that Kimberlite and its commercial edition, Convolo Data Guard, are the prototype of RHCS.

RHCS uses SCSI reservation to raise a lock conflict when the nodes are in a split-brain state. To avoid a 50%-50% election in a "dual-node HA cluster", triggering the watchdog to reboot the system is the easiest way for the RHCS developers to roll the system back.

Actually, advanced HA software never uses such a mechanism for a real "High Availability Production System".
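
The 50%-50% election problem mentioned above can be shown with a small Python sketch. This is an illustration of strict-majority voting in general, not of how RHCS implements it; `has_quorum`, `partition_outcome`, and the `tiebreaker_owner` parameter (standing in for something like a quorum disk that grants one extra vote) are hypothetical names.

```python
def has_quorum(visible_votes, total_votes):
    """A partition has quorum only with a strict majority of all votes."""
    return 2 * visible_votes > total_votes


def partition_outcome(partition_sizes, tiebreaker_owner=None):
    """Decide, for each network partition, whether it may keep running services.

    partition_sizes: number of nodes (one vote each) in each partition.
    tiebreaker_owner: index of the partition holding one extra vote
                      (hypothetical, e.g. a quorum disk), or None.
    """
    extra = 1 if tiebreaker_owner is not None else 0
    total = sum(partition_sizes) + extra
    outcome = []
    for index, size in enumerate(partition_sizes):
        votes = size + (1 if index == tiebreaker_owner else 0)
        outcome.append(has_quorum(votes, total))
    return outcome


# Two nodes split 1/1: neither side has a majority, so nobody may continue.
print(partition_outcome([1, 1]))
# The same split with a tiebreaker vote held by the first partition.
print(partition_outcome([1, 1], tiebreaker_owner=0))
# Three nodes split 2/1: the larger side wins without any tiebreaker.
print(partition_outcome([2, 1]))
```

With only two votes, a clean split can never produce a majority, which is why a dual-node cluster needs some extra arbiter or has to force one node out.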

#18
Posted 2005-12-04 17:14
I know little about HA software. Red Hat Cluster Suite is the first one I have learned.


Thanks for the information about HA software.

Could you tell me more about RAC? I only know of Oracle RAC.

As I understand it, if hosta hangs, hostb takes over the service. How should hosta be dealt with then? Just leave it hung?

If the OS hangs, the administrator simply powers off the machine and restarts it. We have done that many times (on Windows machines).


Waiting for your reply.

Thanks again!

#19
Posted 2005-12-06 05:01
Morning dude,

I was shocked by TruCluster when I started helping manufacturing customers set up high-availability environments with Tru64 UNIX on Alpha systems.

I said, wow, this is the strongest HA cluster I've seen: it supports real-time data sync between nodes via a special technology (we called it Memory Channel, a kind of HBA adapter installed on all nodes). Of course, the whole system successfully implemented SSI (single system image). Nowadays people know OCFS from Oracle, but a few years ago Alpha already had its VERY mature cluster-wide filesystem in TruCluster.

Let's get back to Oracle RAC:

- A not-so-mature OCFS, which is standalone and cluster-aware (not even cluster-wide).
- A replacement solution that is tied to Oracle data management: ASM, which supports cluster-aware storage too.
- A Cache Fusion core that syncs data between nodes, but is limited to Oracle.

Anyway, Oracle learns fast. They made it possible to deploy a real-time SSI database cluster with complete high availability on the Linux platform.

Go to otn.oracle.com and check out the Linux section; there are 9i/10g RAC step-by-step guides for you. You can also search for a guru who wrote a series of articles that walk you step by step through installing Oracle RAC on Linux with IEEE 1394. Very interesting? Yep, that kept me happy for several weeks. (But buddy, you have to buy a Maxtor OneTouch IEEE 1394 hard disk and enclosure to support concurrent block I/O access from two nodes, supposing they are two standalone PCs.)

A Metalink account (metalink.oracle.com) will get you straight to everything you need to know about RAC.

I don't want to explain more about how these modern HA clusters handle the node dead/rebooting/hung situation, because you can easily obtain that information from their official websites. Some of them even provide evaluation copies of the cluster software.

As for me, I'm an experienced HP MC/SG cluster guy, so I can answer further questions on that HA product if you really want to give it a shot.

docs.hp.com --> high availability --> MC/SG for Linux: the admin guide + toolkit guide will be a good starting point.

www.steeleye.com provides LifeKeeper for Linux, another pretty nice, well-designed HA cluster for people who like that kind of thing.

[ Last edited by nntp on 2005-12-6 05:12 ]

#20
Posted 2005-12-06 22:36
Thanks for the information.

I will read the basic documents first.