- 论坛徽章:
- 0
|
M80的奇怪问题,请各位兄弟帮忙分析解决一下\r\n\r\n\r\n环境:两台M80+7133、AIX5300-03+HA5.1\r\n\r\n故障现象:作为HA环境的A机不定期自动关机\r\n\r\n在cluster.log中有如下记录\r\n\r\nNov 30 08:20:32 Test_A daemon:err|error topsvcs[516136]: (Recorded using libct_ffdc.a cv 2):::Error ID: 6o2Oo9/EDWP3/n51/P3.bW0...................:::Reference ID: :::Template ID: 4bd1a134::etails File: : cation: rsct,Hb_Rsock.C,1.57,1493 :::TS_CPU_USE_ER Using too much CPU: exiting CPU usage in milliseconds -1273798618 Interval in milliseconds where CPU usage was measured 32003\r\nNov 30 08:20:34 Test_A daemon:err|error grpsvcs[598080]: (Recorded using libct_ffdc.a cv 2):::Error ID: 62IcBY/GDWP3/5N7/P3.bW0...................:::Reference ID: :::Template ID: 64368504::etails File: : cation: RSCT,PMClient.C,1.72,1049 :::GS_TS_RETCODE_ER Connection failure between Group Services and Topology Services DIAGNOSTIC EXPLANATION topsvcs subsystem died with hb_errno = 16, grpsvcs will also exit.\r\nNov 30 08:20:34 Test_A local0:crit clstrmgrES[610358]: Thu Nov 30 08:20:34 announcementCb: Called, state=ST_STABLE\r\nNov 30 08:20:34 Test_A local0:crit clstrmgrES[610358]: Thu Nov 30 08:20:34 announcementCb: GRPSVCS announcment code=512; exiting\r\nNov 30 08:20:34 Test_A local0:crit clstrmgrES[610358]: Thu Nov 30 08:20:34 CHECK FOR FAILURE OF RSCT SUBSYSTEMS (topsvcs or grpsvcs)\r\nNov 30 08:20:34 Test_A local0:crit clstrmgrES[610358]: Thu Nov 30 08:20:34 clstrmgr on node 1 is exiting with code 4\r\nNov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX packet from (127.0.0.1+32771+2)\r\nNov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX trap: (6 10) (127.0.0.1+32771+2)\r\nNov 30 08:20:34 Test_A daemon:err|error haemd[606256]: LPP=PSSP,Fn=emd_gsi.c,SID=1.4.1.36,L#=1361, haemd: 2521-032 Cannot dispatch group services (1).\r\nNov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX packet from (127.0.0.1+32771+2)\r\nNov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX trap: (6 11) (127.0.0.1+32771+2)\r\nNov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX packet from (127.0.0.1+32771+2)\r\nNov 30 08:20:34 Test_A daemon:notice snmpd[614488]: NOTICE: SMUX trap: (6 15) (127.0.0.1+32771+2)\r\nNov 30 08:20:34 Test_A user:notice HACMP for AIX: clexit.rc : Unexpected termination of clstrmgrES. \r\nNov 30 08:20:35 Test_A user:notice HACMP for AIX: clexit.rc : Halting system immediately!!! \r\n\r\nerrpt中无CPU方面的错误\r\n\r\n请教各位兄弟帮忙啦 |
|