Chinaunix

Title: kernel panic not syncing non maskable interrupt

Author: fflixiang    Time: 2013-10-14 11:07
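An IBM PC server running RHEL 6.0 hangs at irregular intervals. After a reboot, the messages log shows nothing abnormal. The attached image shows what was on the screen during one of the hangs. Any advice would be appreciated!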


Author: itfly3    Time: 2013-10-14 11:29
Search Google for:    kernel panic not syncing non maskable interrupt
The first result suggests this may be a bug:
Due to the way ftrace works when modifying the code during startup, the NMI watchdog causes too much noise and ftrace can not find a quiet period to instrument the code. Consequently, machines with more than 512 cpus will encounter issues with the NMI watchdog. Such issues will return error messages similar to "BUG: NMI Watchdog detected LOCKUP" and have either 'ftrace_modify_code' or 'ipi_handler' in the backtrace. To work around this issue, disable nmi_watchdog using the command:
nmi_watchdog=0
Check whether this matches your situation, and try disabling nmi_watchdog.
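
A minimal sketch for checking whether the setting is already in effect, assuming nmi_watchdog=0 is passed as a kernel boot parameter and can therefore be read back from /proc/cmdline. The helper name nmi_watchdog_setting is hypothetical, made up for this illustration:

```python
from typing import Optional


def nmi_watchdog_setting(cmdline: str) -> Optional[str]:
    """Return the value given to nmi_watchdog= on a kernel command line, or None."""
    value = None
    for token in cmdline.split():
        key, sep, val = token.partition("=")
        if key == "nmi_watchdog" and sep:
            # Assumption: if the parameter appears more than once,
            # the last occurrence is the one that matters.
            value = val
    return value


if __name__ == "__main__":
    # /proc/cmdline holds the parameters the running kernel was booted with.
    try:
        with open("/proc/cmdline") as f:
            current = nmi_watchdog_setting(f.read())
    except OSError:
        current = None
    print("nmi_watchdog boot setting:", current if current is not None else "(not set)")
```

If it reports "(not set)", the parameter was not given at boot and the kernel is using its default behaviour.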
Author: humjb_1983    Time: 2013-10-14 12:46
This post was last edited by humjb_1983 on 2013-10-14 12:53

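Reply to 1# fflixiang
The nmi_watchdog was triggered. The watchdog works like this: the kernel generates NMI interrupts at regular intervals to check whether the kernel is still running normally, and if it finds the kernel is not, it triggers a panic.
Triggering nmi_watchdog means the system is dead and the kernel is locked up. You need to set up kdump, reproduce the problem, and then analyze the resulting vmcore file in detail.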


   
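Author: humjb_1983    Time: 2013-10-14 12:50
One more point: triggering nmi_watchdog means the kernel is deadlocked with interrupts disabled. That is, a deadlock occurred in kernel mode after interrupts had been turned off, so the kernel can no longer respond to ordinary interrupts, but it can still respond to NMIs (which cannot be masked).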
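
The mechanism can be illustrated with a toy simulation. This is only a sketch of the idea, not the kernel's actual code: the class ToyCpu, the function run, and the threshold of 5 checks are all made up for illustration. An ordinary timer interrupt bumps a counter only while interrupts are enabled; the NMI check always runs and compares the counter with the value it saw last time.

```python
class ToyCpu:
    """Toy model of one CPU: a maskable timer tick plus a non-maskable check."""

    def __init__(self, threshold: int = 5):
        self.irqs_enabled = True
        self.timer_ticks = 0        # incremented by the ordinary timer interrupt
        self.last_seen_ticks = 0    # tick count observed by the previous NMI
        self.stalled_checks = 0     # consecutive NMIs that saw no progress
        self.threshold = threshold  # hypothetical: stalled NMIs before declaring a lockup

    def timer_interrupt(self) -> None:
        # A maskable interrupt is not delivered while interrupts are off.
        if self.irqs_enabled:
            self.timer_ticks += 1

    def nmi_check(self) -> bool:
        """Runs even with interrupts off. Returns True once a lockup is declared."""
        if self.timer_ticks == self.last_seen_ticks:
            self.stalled_checks += 1
        else:
            self.stalled_checks = 0
            self.last_seen_ticks = self.timer_ticks
        return self.stalled_checks >= self.threshold


def run(cpu: ToyCpu, steps: int, lock_up_at: int) -> int:
    """Simulate `steps` periods; return the step where a lockup is detected, or -1."""
    for step in range(steps):
        if step == lock_up_at:
            cpu.irqs_enabled = False  # deadlock with interrupts disabled
        cpu.timer_interrupt()
        if cpu.nmi_check():
            return step
    return -1
```

For example, run(ToyCpu(), 20, lock_up_at=3) returns 7: from step 3 on the tick counter is frozen, the NMI check still fires every step, and on the fifth stalled check it reports the lockup. In the real kernel this is the point where the panic is raised.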

Author: fflixiang    Time: 2013-10-16 10:10
I have 256 GB of RAM. Will the vmcore from a kernel dump be close to that size? Most of the memory is free, though.



