免费注册 查看新帖 |

Chinaunix

  平台 论坛 博客 文库
123下一页
最近访问板块 发新帖
查看: 9317 | 回复: 23
打印 上一主题 下一主题

使用oracle CRS的两个v490节点频繁重启 [复制链接]

论坛徽章:
0
跳转到指定楼层
1 [收藏(0)] [报告]
发表于 2007-08-13 19:45 |只看该作者 |倒序浏览
10可用积分
操作系统为10,oracle也是10g。网卡ce0和ce1配置为ipmp,ce3做为crs的心跳。ce0和ce1分别连接到两个独立的交换机,然后交换机级联,避免网卡或交换机单点故障。之前交换机没有级联,系统不停重启,级联后还是重启。节点机的defaultrouter使用的是盘柜控制器的地址,都能ping通。附件为last和messages输出

messages.rar

52.11 KB, 下载次数: 117

最佳答案

查看完整内容

建议你看看oracle的日志还有你把rac停一下看看有没有问题建议分段一步一步的测试搅在一起谁也说不好哪里有问题

论坛徽章:
0
2 [报告]
发表于 2007-08-13 19:45 |只看该作者
建议你看看oracle的日志
还有你把rac停一下看看有没有问题
建议分段一步一步的测试
搅在一起谁也说不好哪里有问题

论坛徽章:
0
3 [报告]
发表于 2007-08-13 23:20 |只看该作者
把你的ifconfig的结果,还有hosts的内容帖一下看看。

论坛徽章:
0
4 [报告]
发表于 2007-08-14 02:01 |只看该作者
比较难讲是哪个地方的问题。安装crs之前 os是正常的?

论坛徽章:
0
5 [报告]
发表于 2007-08-14 14:44 |只看该作者
已经把a机 init.crs disable了,等待结果,下面是馒头要的输出

ifconfig:
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu 8232 index 1
        inet 127.0.0.1 netmask ff000000
ce0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
        inet 192.168.1.101 netmask ffffff00 broadcast 192.168.1.255
        groupname orapub
        ether 0:14:4f:23:2c:2f
ce0:1: flags=9040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER> mtu 1500 index 2
        inet 192.168.1.105 netmask ffffff00 broadcast 192.168.1.255
ce0:2: flags=1040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4> mtu 1500 index 2
        inet 192.168.1.111 netmask ffffff00 broadcast 192.168.1.255
ce1: flags=69040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER,STANDBY,INACTIVE> mtu 1500 index 3
        inet 192.168.1.103 netmask ffffff00 broadcast 192.168.1.255
        groupname orapub
        ether 0:14:4f:23:2d:34
ce3: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 4
        inet 10.1.1.1 netmask ffffff00 broadcast 10.1.1.255
        ether 0:14:4f:4b:c:be


hosts:

#
# Internet host table
#
127.0.0.1        localhost       
192.168.1.102   ora2            #data address for ce0,A mumber of IPMP
192.168.1.104   ora2-ce1        #data address for ce1,A mumber of IPMP      
192.168.1.106   ora2-test       #test address for ce0,used for ip multipathing
192.168.1.108   ora2-ce1-test   #test address for ce1,used for ip multipathing  
192.168.1.112   ora2-vip        #address for oracle failover,not a physical NIC
10.1.1.2        ora2-priv       #address for interconnection,applying on ce3

192.168.1.101   ora1        loghost        #data address for ce0,A mumber of IPMP
192.168.1.103   ora1-ce1        #data address for ce1,A mumber of IPMP      
192.168.1.105   ora1-test       #test address for ce0,used for ip multipathing
192.168.1.107   ora1-ce1-test   #test address for ce1,used for ip multipathing  
192.168.1.111   ora1-vip        #address for oracle failover,not a physical NIC
10.1.1.1        ora1-priv       #address for interconnection,applying on ce3

论坛徽章:
0
6 [报告]
发表于 2007-08-14 15:03 |只看该作者
取消IPMP后试试

论坛徽章:
0
7 [报告]
发表于 2007-08-14 15:17 |只看该作者
原帖由 dftbj 于 2007-8-14 15:03 发表
取消IPMP后试试

我也认为是IPMP的设置问题,再贴一下hostname.xxx的内容。

论坛徽章:
0
8 [报告]
发表于 2007-08-14 15:29 |只看该作者
Aug  7 10:15:04 ora1 in.mpathd[168]: [ID 302819 daemon.error] Improved failure detection time 92687 ms on (inet ce0) for group "orapub"
Aug  7 10:15:05 ora1 in.mpathd[168]: [ID 302819 daemon.error] Improved failure detection time 46343 ms on (inet ce0) for group "orapub"
Aug  7 10:15:07 ora1 in.mpathd[168]: [ID 302819 daemon.error] Improved failure detection time 23171 ms on (inet ce0) for group "orapub"
Aug  7 10:15:08 ora1 in.mpathd[168]: [ID 302819 daemon.error] Improved failure detection time 11585 ms on (inet ce0) for group "orapub"
…………………………………………………………………………………………………………………………………………………………
Aug  7 10:28:14 ora1 genunix: [ID 408789 kern.warning] WARNING: ce3: fault detected external to device; service degraded
Aug  7 10:28:14 ora1 genunix: [ID 451854 kern.warning] WARNING: ce3: xcvr addr:0x01 - link down
Aug  7 10:28:16 ora1 genunix: [ID 408789 kern.notice] NOTICE: ce3: fault cleared external to device; service available
Aug  7 10:28:16 ora1 genunix: [ID 451854 kern.notice] NOTICE: ce3: xcvr addr:0x01 - link up 1000 Mbps full duplex
Aug  7 10:29:11 ora1 genunix: [ID 408789 kern.warning] WARNING: ce3: fault detected external to device; service degraded
Aug  7 10:29:11 ora1 genunix: [ID 451854 kern.warning] WARNING: ce3: xcvr addr:0x01 - link down
Aug  7 10:29:14 ora1 genunix: [ID 408789 kern.notice] NOTICE: ce3: fault cleared external to device; service available
Aug  7 10:29:14 ora1 genunix: [ID 451854 kern.notice] NOTICE: ce3: xcvr addr:0x01 - link up 1000 Mbps full duplex
Aug  7 11:02:25 ora1 genunix: [ID 408789 kern.warning] WARNING: ce3: fault detected external to device; service degraded
Aug  7 11:02:25 ora1 genunix: [ID 451854 kern.warning] WARNING: ce3: xcvr addr:0x01 - link down
Aug  7 11:02:28 ora1 genunix: [ID 408789 kern.notice] NOTICE: ce3: fault cleared external to device; service available
Aug  7 11:02:28 ora1 genunix: [ID 451854 kern.notice] NOTICE: ce3: xcvr addr:0x01 - link up 1000 Mbps full duplex
Aug  7 11:03:24 ora1 genunix: [ID 408789 kern.warning] WARNING: ce3: fault detected external to device; service degraded
Aug  7 11:03:24 ora1 genunix: [ID 451854 kern.warning] WARNING: ce3: xcvr addr:0x01 - link down
Aug  7 11:03:27 ora1 genunix: [ID 408789 kern.notice] NOTICE: ce3: fault cleared external to device; service available
Aug  7 11:03:27 ora1 genunix: [ID 451854 kern.notice] NOTICE: ce3: xcvr addr:0x01 - link up 1000 Mbps full duplex
Aug  7 11:03:59 ora1 genunix: [ID 408789 kern.warning] WARNING: ce3: fault detected external to device; service degraded
Aug  7 11:03:59 ora1 genunix: [ID 451854 kern.warning] WARNING: ce3: xcvr addr:0x01 - link down
Aug  7 11:04:08 ora1 genunix: [ID 408789 kern.notice] NOTICE: ce3: fault cleared external to device; service available
Aug  7 11:04:08 ora1 genunix: [ID 451854 kern.notice] NOTICE: ce3: xcvr addr:0x01 - link up 1000 Mbps full duplex
Aug  7 11:04:15 ora1 genunix: [ID 408789 kern.warning] WARNING: ce3: fault detected external to device; service degraded
Aug  7 11:04:15 ora1 genunix: [ID 451854 kern.warning] WARNING: ce3: xcvr addr:0x01 - link down
Aug  7 11:04:19 ora1 genunix: [ID 408789 kern.notice] NOTICE: ce3: fault cleared external to device; service available
Aug  7 11:04:19 ora1 genunix: [ID 451854 kern.notice] NOTICE: ce3: xcvr addr:0x01 - link up 1000 Mbps full duplex
Aug  7 11:04:54 ora1 genunix: [ID 408789 kern.warning] WARNING: ce3: fault detected external to device; service degraded
Aug  7 11:04:54 ora1 genunix: [ID 451854 kern.warning] WARNING: ce3: xcvr addr:0x01 - link down
Aug  7 11:04:58 ora1 genunix: [ID 408789 kern.notice] NOTICE: ce3: fault cleared external to device; service available
Aug  7 11:04:58 ora1 genunix: [ID 451854 kern.notice] NOTICE: ce3: xcvr addr:0x01 - link up 1000 Mbps full duplex
Aug  7 11:05:54 ora1 genunix: [ID 408789 kern.warning] WARNING: ce3: fault detected external to device; service degraded
Aug  7 11:05:54 ora1 genunix: [ID 451854 kern.warning] WARNING: ce3: xcvr addr:0x01 - link down
Aug  7 11:05:56 ora1 genunix: [ID 408789 kern.notice] NOTICE: ce3: fault cleared external to device; service available
Aug  7 11:05:56 ora1 genunix: [ID 451854 kern.notice] NOTICE: ce3: xcvr addr:0x01 - link up 1000 Mbps full duplex

[ 本帖最后由 metor78 于 2007-8-14 15:40 编辑 ]

论坛徽章:
0
9 [报告]
发表于 2007-08-14 15:48 |只看该作者
因为开始心跳是交叉线直连两个机器网卡的,两个机器频繁重启,所以另外一台就老显示网卡down。现在已经都连到一个交换机了。
Improved failure detection time 92687 ms on (inet ce0) for group "orapub"
这个提示我猜是因为原先只有一个业务网卡在交换机上,IPMP的另外一个网卡没有接交换机,所以
in.mpathd发出的ping包另外一个网卡无法回应。现在IPMP的两个网卡都结到交换机了

论坛徽章:
0
10 [报告]
发表于 2007-08-14 16:19 |只看该作者
hostname.ce0:
ora1 netmask + broadcast + group orapub up addif ora1-test deprecated -failover netmask + broadcast + up
hostname.ce1:
ora1-ce1 netmask + broadcast + deprecated group orapub -failover standby up
hostname.ce3:
ora1-priv
您需要登录后才可以回帖 登录 | 注册

本版积分规则 发表回复

  

北京盛拓优讯信息技术有限公司. 版权所有 京ICP备16024965号-6 北京市公安局海淀分局网监中心备案编号:11010802020122 niuxiaotong@pcpop.com 17352615567
未成年举报专区
中国互联网协会会员  联系我们:huangweiwei@itpub.net
感谢所有关心和支持过ChinaUnix的朋友们 转载本站内容请注明原作者名及出处

清除 Cookies - ChinaUnix - Archiver - WAP - TOP