- 论坛徽章:
- 0
|
求救,还是sun e450 不定期重启的问题?
我刚刚到sunsolve上找到了一个与我的错误信息一样的提问和回答,不过全是英文,还请老大们帮着看一看,能不能找到问题的所在,其中有sun公司bill先生的回答。\r\n\r\nE450 RAM/CPU problems Thu, 24 April 2003 10:00 \r\n \r\nHi,\r\n\r\nI was wondering if anyone can help me with this. I am experiencing problems\r\nwith our 4 CPU Sun Enterprise 450. We\'ve had four hangs/reboots in one month\r\ntime. All seem to be related to CPU cache or main ram:\r\n\r\npr 22 10:16:30 ultra SUNW,UltraSPARC-II: [ID 125214 kern.warning] WARNING: [AFT1] WP event on CPU3, errID 0x0001406d.1a26892e\r\nApr 22 10:16:30 ultra AFSR 0x00000000.00800002 AFAR 0x000001ff.f1500000\r\nApr 22 10:16:30 ultra AFSR.PSYND 0x0002(Score 95) AFSR.ETS 0x00 Fault_PC 0x114fc9c\r\nApr 22 10:16:30 ultra UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0000 UDBL.ESYND 0x00\r\nApr 22 10:16:30 ultra SUNW,UltraSPARC-II: [ID 460393 kern.warning] WARNING: [AFT1] Uncorrectable Memory Error on CPU3 Data access at TL=0, errID 0x0001406d.1b803fd7\r\nApr 22 10:16:30 ultra AFSR 0x00000000.00200000 AFAR 0x00000000.2284edf8\r\nApr 22 10:16:30 ultra AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0xff3d6830\r\nApr 22 10:16:30 ultra UDBH 0x0000 UDBH.ESYND 0x00 UDBL 0x0203 UDBL.ESYND 0x03\r\nApr 22 10:16:30 ultra UDBL Syndrome 0x3 Memory Module 180x\r\nApr 22 10:16:31 ultra SUNW,UltraSPARC-II: [ID 337151 kern.warning] WARNING: [AFT1] errID 0x0001406d.1b803fd7 Syndrome 0x3 indicates that this may not be a memory module problem\r\nApr 22 10:16:31 ultra SUNW,UltraSPARC-II: [ID 132011 kern.info] [AFT2] errID 0x0001406d.1b803fd7 PA=0x00000000.2284edf8\r\n0x0d :31 ultra E$tag 0x00000000.1ac00450 E$State: Exclusive E$parity --More--(2%)\r\nApr 22 10:16:31 ultra SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x00): 0x0000001d.000007eb\r\nApr 22 10:16:31 ultra SUNW,UltraSPARC-II: [ID 359263 kern.info] [AFT2] E$Data (0x0: 0x0000000f.000007eb\r\n....\r\n\r\n3 times on CPU3 and once CPU2. I\'ve read that defective 400MHz UltraSparcII modules can lead to these symptoms.\r\nI am now trying to determine where the actual problem lies. This part taken from prtdiag also puzzles me:\r\n\r\n========================= Memory =========================\r\n\r\nMemory Interleave Factor = 2-way\r\n\r\nInterlv. Socket Size\r\nBank Group Name (MB) Status\r\n---- ----- ------ ---- ------\r\n0 0 1901 256 OK\r\n0 0 1902 256 OK\r\n0 0 1903 256 OK\r\n0 0 1904 256 OK\r\n0 0 1801 256 OK\r\n0 0 1802 256 OK\r\n0 0 1803 256 OK\r\n0 0 1804 256 OK\r\n1 0 1801 256 OK\r\n1 0 1802 256 OK\r\n1 0 1803 256 OK\r\n1 0 1804 256 OK\r\n\r\nI have 2GB of memory installed, but 4 simms are counted twice. The system\r\ndoes report 2GB installed, however. Could this be a clue that the problem\r\nlies in the simm rather than in the L2 cache, which is a well known problem?\r\n\r\nDoes Sun replace defective UltraSparc modules for free, given that this is a\r\nsecond hand machine?\r\n\r\nThanks in advance,\r\n\r\nMaarten Huizinga \r\n[Report message to a moderator] \r\n \r\n \r\n \r\n News Poster\r\n \r\n\r\n\r\n Re: E450 RAM/CPU problems Thu, 24 April 2003 12:01 \r\n \r\nIf you had a maintenance contract it would be taken care of. It being a second hand machine would probably negate any type of warranty the system had. You can call Sun and see what they say, or you could contact whom you bought it from. Hopefully you bought it from a reseller of some sorts and they offer some type of warranty. If they are authorized by Sun to sell used equipment you would be in a much better situation. If they are not, you will be on your own. \r\n[Report message to a moderator] \r\n \r\n \r\n \r\n News Poster\r\n \r\n\r\n\r\n Re: E450 RAM/CPU problems Sat, 26 April 2003 23:22 \r\n \r\nHi Maarten,\r\n\r\nLooking at the error messages. The CPU3 is a suspect one. You can use the command psradm -f CPU3 to disable it until you have a chance to replace that cpu3. If you have any problem, write me an email I will show you more how to do it.\r\n\r\nTravis Bui\r\n\r\nSunSolarisMaster@yahoo.com \r\n[Report message to a moderator] |
|