[Search for users]
[Overall Top Noters]
[List of all Conferences]
[Download this site]
Title: | DIGITAL UNIX (FORMERLY KNOWN AS DEC OSF/1) |
Notice: | Welcome to the Digital UNIX Conference |
Moderator: | SMURF::DENHAM |
|
Created: | Thu Mar 16 1995 |
Last Modified: | Fri Jun 06 1997 |
Last Successful Update: | Fri Jun 06 1997 |
Number of topics: | 10068 |
Total number of notes: | 35879 |
8686.0. "4.0a hang on 8400 - bad memory?" by CSC32::TRENTA () Mon Feb 03 1997 16:36
Hi,
I have a customer that experienced a hang on a AlphaServer 8400 Model 5/300
running V4.0a Digital UNIX. They sent me the forced crash, messages,
binary.errlog, etc. and in looking at it the problem appears to be a
bad memory module. However, I am not sure and would appreciate anyones
feedback on this.
Below is some of the output. Notice that most of the kernel threads are
in a U state. In binary.errlog only one entry that was a '620 System
Correctable Error' with a 'Write Bank Unlock Failing Bank=0' around the
time of the hang. In the message buffer there were "too many system corrected errors on cpu0,
2, 3, 4, and 5" with nothing making it to messages file or syslog.dated
files.
Any ideas as to what is wrong? If it is memory, which memory module?
Thanks in advance.
Debbie Trenta
Lucent/AT&T UNIX Support
*******************************************************************
cda> kps -cmopstuwMSTW
PID PROC TASK UTASK MAP
PMAP THREAD STATE WAIT_EVENT WAIT_MESG WAIT_TIME CPU SWAP_ALLOC
SWAP_INUSE COMMAND
0 0xfffffc017fdceca0 0xfffffc017fdcea80 0xfffffc017fdcee58 0xfffffc017fdbc000 0xfffffc0
00066bce0 0xfffffc017fdd22c0 I R 0 - 0 0 0.00K
0.00K kernel idle
0xfffffc017fdd2580 WU 0 malloc_thread 0x31f5905
0xfffffc017fdd2840 R U 0 - 0
0xfffffc017fdd2b00 WU 0xfffffc00004be748 (null) 0x3315125
0xfffffc017fdd2dc0 WU 0xfffffc0000484cb0 (null) 0x31fc746
0xfffffc017fdd3080 WU 0xfffffc00005ad668 Zzzzzz 0xf4c062c
0xfffffc017fdd3340 WU 0xfffffc00004c871c (null) 0xf4c062c
0xfffffc017fdd3600 WU 0xfffffc017fe11680 (null) 0xf4c062c
0xfffffc017fdd38c0 WU 0xfffffc017fe11b00 (null) 0xf4bf527
0xfffffc017fdd3b80 WU 0xfffffc0169b22000 (null) 0xf4be227
0xfffffc015a936000 WU 0xfffffc00005a00b0 (null) 0xf4bc27e
0xfffffc015a9362c0 WU 0xfffffc00005a0348 (null) 0x31f54f6
0xfffffc015a936580 R 0 - 0 0
0xfffffc015a936840 WU 0xfffffc0169b22480 (null) 0xf4bc27e
0xfffffc015a936b00 WU 0xfffffc0169b22900 (null) 0xf4bae17
0xfffffc015a936dc0 WU 0xfffffc0169b22d80 (null) 0xf4b99af
0xfffffc015a937080 WU 0xfffffc0169b23200 (null) 0xf4b8814
0xfffffc015a937340 WU 0xfffffc0169b23680 (null) 0xf4b73ab
0xfffffc015a937600 R 0 - 0 5
0xfffffc015a9378c0 W 0xfffffc00005abdc8 netisr 0xf4b6168
0xfffffc015a937b80 W 0xfffffc00005abdc8 netisr 0x3264d4b
0xfffffc015ab6a000 W 0xfffffc00005abdc8 netisr 0x69a75c7
0xfffffc015ab6a2c0 R 0 - 0
0xfffffc015ab6a580 W 0xfffffc00005abdc8 netisr 0xae5f178
0xfffffc015ab6a840 R 0 - 0 3
0xfffffc015ab6ab00 WU 0xfffffc000049c278 (null) 0xf4b6168
0xfffffc015ab6adc0 WU 0xfffffc00005ad990 kio_thread 0xf4b60e1
0xfffffc015ab6b080 WU 0xfffffc015ab0ae40 (null) 0xf4b60e1
0xfffffc015ab6b340 WU 0xfffffc015ab0b600 (null) 0xf4b60e1
0xfffffc015ab6b600 I R 0 - 0 1
0xfffffc015ab6b8c0 I R 0 - 0 2
0xfffffc015ab6bb80 I R 0 - 0 3
0xfffffc015aa72000 I R 0 - 0 4
0xfffffc015aa722c0 I R 0 - 0 5
0xfffffc015aa72580 R U 0 - 0
0xfffffc015aa72840 WU 0xfffffc000064aad0 (null) 0xf4b6098
0xfffffc015aa72dc0 WU 0xfffffc000064bd70 Tswpin 0xf4b5ff9
0xfffffc015aa73080 WU 0xfffffc000064bd80 Tswpout 0xf4b5ff9
0xfffffc015aa73340 WU 0xfffffc00005ace78 pageout 0x31f54f9
0xfffffc015aa73600 WU 0xfffffc000064bda0 reaper 0xef1dd9b
0xfffffc015aa738c0 WU 0xfffffc000064bdb0 tswpin 0xf4b5ff8
0xfffffc015aa73b80 WU 0xfffffc00005a2508 tswpout 0xf4b5ff8
0xfffffc015a43c000 WU 0xfffffc000064bd40 actque 0xf4b5ff8
0xfffffc015a43c2c0 WU 0xfffffc00002c4420 acctwtch 0x31f5dae
0xfffffc015a43cb00 WU 0xfffffc00006b9568 (null) 0xf4abcb9
0xfffffc015a43cdc0 WU 0xfffffc00006b9568 (null) 0xf4abcb9
1 0xfffffc017fdcf720 0xfffffc017fdcf500 0xfffffc017fdcf8d8 0xfffffc015aa78240 0xfffffc0
15ab2bd00 0xfffffc015aa72b00 W 0xfffffc015aa72b00 pause 0x320636c 0.00K
0.00K init
40 0xfffffc015a89e220 0xfffffc015a89e000 0xfffffc015a89e3d8 0xfffffc015aa78540 0xfffffc0
0530a4100 0xfffffc015a43d080 W 0xfffffc015a43d298 event 0x31fa61a 0.00K
0.00K vold
164 0xfffffc015a89eca0 0xfffffc015a89ea80 0xfffffc015a89ee58 0xfffffc015aa78300 0xfffffc0
15a432100 0xfffffc017fdd2000 W 0xfffffc015a4db060 sv_msg_rcv 0xf49928c 0.00K
0.00K kloadsrv
178 0xfffffc015a431720 0xfffffc015a431500 0xfffffc015a4318d8 0xfffffc015aa786c0 0xfffffc0
15d54b200 0xfffffc015a43c840 W 0xfffffc015a43c840 pause 0x31f8842 0.00K
0.00K update
309 0xfffffc0052175720 0xfffffc0052175500 0xfffffc00521758d8 0xfffffc015aa78900 0xfffffc0
15d937200 0xfffffc015a43d600 R 0 - 0 2 0.00K
0.00K syslogd
311 0xfffffc0052957720 0xfffffc0052957500 0xfffffc00529578d8 0xfffffc015aa78a80 0xfffffc0
15d937400 0xfffffc0052550000 WU 0xfffffc0000ab9ad0 event 0x31f5472 0.00K
0.00K binlogd
321 0xfffffc0052956220 0xfffffc0052956000 0xfffffc00529563d8 0xfffffc015aa78cc0 0xfffffc0
15d937c00 0xfffffc00525502c0 W 0xfffffc00525504d8 event 0x32347f2 0.00K
0.00K auditd
358 0xfffffc0052174220 0xfffffc0052174000 0xfffffc00521743d8 0xfffffc015aa78e40 0xfffffc0
17f80bf00 0xfffffc0052550b00 W 0xfffffc00005a0968 Zzzzzz 0xf498bdc 0.00K
0.00K epcadl
370 0xfffffc015b580ca0 0xfffffc015b580a80 0xfffffc015b580e58 0xfffffc015aa78fc0 0xfffffc0
17f80c100 0xfffffc0052550580 W 0xfffffc0052550798 event 0x3274c3b 0.00K
0.00K inetd
373 0xfffffc0052174ca0 0xfffffc0052174a80 0xfffffc0052174e58 0xfffffc015aa79980 0xfffffc0
17f808a00 0xfffffc015a43c580 W 0xfffffc0052174ca0 wait 0xf499972 0.00K
0.00K volwatch
377 0xfffffc015b580220 0xfffffc015b580000 0xfffffc015b5803d8 0xfffffc015aa79b00 0xfffffc0
17f808300 0xfffffc0052550840 W 0xfffffc015a4e1494 (null) 0xf495bfa 0.00K
0.00K volwatch
378 0xfffffc0052154220 0xfffffc0052154000 0xfffffc00521543d8 0xfffffc015aa79c80 0xfffffc0
17f808600 0xfffffc0052551080 W 0xfffffc0052551298 event 0xf495d47 0.00K
0.00K volnotify
16809 0xfffffc0082aa2220 0xfffffc0082aa2000 0xfffffc0082aa23d8 0xfffffc00520a7b00 0xfffffc0
052475400 0xfffffc017f804580 W 0xfffffc017f804580 pause 0x31f5f6c 0.00K
0.00K update
439 0xfffffc0052154ca0 0xfffffc0052154a80 0xfffffc0052154e58 0xfffffc015aa79d40 0xfffffc0
052878400 0xfffffc0052551340 W 0xfffffc0052551558 event 0xb04980b 0.00K
0.00K portmap
21067 0xfffffc0052c3aca0 0xfffffc0052c3aa80 0xfffffc0052c3ae58 0xfffffc017f7e3440 0xfffffc0
170e0b300 0xfffffc005ef08b00 W 0xfffffc005ef08d18 event 0x40011ea 0.00K
0.00K rlogind
21072 0xfffffc017fe3f720 0xfffffc017fe3f500 0xfffffc017fe3f8d8 0xfffffc017f7e2540 0xfffffc0
170e0a100 0xfffffc005ef09080 W 0xfffffc017fe3f720 wait 0xef48056 0.00K
0.00K sh
21091 0xfffffc017fe3e220 0xfffffc017fe3e000 0xfffffc017fe3e3d8 0xfffffc00520a7e00 0xfffffc0
078670300 0xfffffc005ef08dc0 W 0xfffffc017fe3e220 wait 0xef28c59 0.00K
0.00K sh
21231 0xfffffc015b4ca220 0xfffffc015b4ca000 0xfffffc015b4ca3d8 0xfffffc005f70a840 0xfffffc0
15e55de00 0xfffffc017f7f6840 W 0xfffffc017f7f6840 pause 0x31fc865 0.00K
0.00K update
21276 0xfffffc0052956ca0 0xfffffc0052956a80 0xfffffc0052956e58 0xfffffc005f70b5c0 0xfffffc0
052474700 0xfffffc017f7f62c0 W 0xfffffc015d9b3c68 tty 0x40011e9 0.00K
0.00K ksh
21372 0xfffffc015a430ca0 0xfffffc015a430a80 0xfffffc015a430e58 0xfffffc015aa78600 0xfffffc0
06c5fe300 0xfffffc00525518c0 W 0xfffffc00525518c0 pause 0x31f653e 0.00K
0.00K scm
21509 0xfffffc015b4cb720 0xfffffc015b4cb500 0xfffffc015b4cb8d8 0xfffffc005f70a3c0 0xfffffc0
15b8ef700 0xfffffc017f7f6b00 W 0xfffffc00008e30d8 sv_msg_rcv 0x32f7a6b 0.00K
0.00K csop
21513 0xfffffc0052c41720 0xfffffc0052c41500 0xfffffc0052c418d8 0xfffffc005f70a240 0xfffffc0
052474100 0xfffffc017f7f7b80 W 0xfffffc017f7f7b80 pause 0x31f5474 0.00K
0.00K lan_mgr
21514 0xfffffc0052c40220 0xfffffc0052c40000 0xfffffc0052c403d8 0xfffffc005f70bb00 0xfffffc0
06eee6d00 0xfffffc017f7f6580 W 0xfffffc015c5fa540 sv_msg_rcv 0x31f98d1 0.00K
0.00K dbmaint
21518 0xfffffc017f800220 0xfffffc017f800000 0xfffffc017f8003d8 0xfffffc005f70aa80 0xfffffc0
06eee7b00 0xfffffc017f7f78c0 W 0xfffffc00008e2f38 sv_msg_rcv 0x31f5473 0.00K
0.00K rip
21520 0xfffffc0052155720 0xfffffc0052155500 0xfffffc00521558d8 0xfffffc005f70a600 0xfffffc0
06eee7600 0xfffffc017f7f7600 R 0 - 0 4 0.00K
0.00K lan_mgr_srv
21526 0xfffffc017f802ca0 0xfffffc017f802a80 0xfffffc017f802e58 0xfffffc005f70afc0 0xfffffc0
0685ea100 0xfffffc017fe41b80 W 0xfffffc017fe41b80 pause 0x31f664c 0.00K
0.00K hcm
13339 0xfffffc0052448220 0xfffffc0052448000 0xfffffc00524483d8 0xfffffc00520a7140 0xfffffc0
08e6bb700 0xfffffc0052551600 W 0xfffffc0052551600 pause 0x31fb1b2 0.00K
0.00K update
21534 0xfffffc0052448ca0 0xfffffc0052448a80 0xfffffc0052448e58 0xfffffc005f70b440 0xfffffc0
078276000 0xfffffc017fe41340 W 0xfffffc017fe41340 pause 0x31f580d 0.00K
0.00K sched
21537 0xfffffc017f7fcca0 0xfffffc017f7fca80 0xfffffc017f7fce58 0xfffffc005f70a780 0xfffffc0
078276900 0xfffffc017fe41600 W 0xfffffc017fe41600 pause 0x31f55fe 0.00K
0.00K ss7omap
21539 0xfffffc0052449720 0xfffffc0052449500 0xfffffc00524498d8 0xfffffc005f70bbc0 0xfffffc0
078277400 0xfffffc017fe402c0 WU 0xfffffc00976c620c wlock 0x31f557d 0.00K
0.00K ss7ioc
21541 0xfffffc017f7fc220 0xfffffc017f7fc000 0xfffffc017f7fc3d8 0xfffffc005f70a0c0 0xfffffc0
0685ea700 0xfffffc017fe40580 W 0xfffffc017fe40580 pause 0x31f5611 0.00K
0.00K ss7trap
17447 0xfffffc017f7e8220 0xfffffc017f7e8000 0xfffffc017f7e83d8 0xfffffc005f70b800 0xfffffc0
07ddc9f00 0xfffffc015a43db80 W 0xfffffc015a43db80 pause 0x31f73c1 0.00K
0.00K update
17518 0xfffffc017f7e8ca0 0xfffffc017f7e8a80 0xfffffc017f7e8e58 0xfffffc005f70b740 0xfffffc0
03d977400 0xfffffc006faeab00 W 0xfffffc006faeab00 pause 0x31f9013 0.00K
0.00K update
21743 0xfffffc017f7e4220 0xfffffc017f7e4000 0xfffffc017f7e43d8 0xfffffc005f70a180 0xfffffc0
05f5aa200 0xfffffc005ef09600 W 0xfffffc005ef09600 pause 0xef1fbdb 0.00K
0.00K ca
21912 0xfffffc005ef01720 0xfffffc005ef01500 0xfffffc005ef018d8 0xfffffc005f70b080 0xfffffc0
17fe45f00 0xfffffc0052457600 W 0xfffffc0052457600 pause 0x31f55b7 0.00K
0.00K dbmaint
21941 0xfffffc017f7e4ca0 0xfffffc017f7e4a80 0xfffffc017f7e4e58 0xfffffc005f70b680 0xfffffc0
05f5ab900 0xfffffc005ef09b80 W 0xfffffc005ef09b80 pause 0xef1fb34 0.00K
0.00K ca
21948 0xfffffc017fe42220 0xfffffc017fe42000 0xfffffc017fe423d8 0xfffffc017f7e2e40 0xfffffc0
15ed87400 0xfffffc0052457080 W 0xfffffc0052457080 pause 0xef1faf6 0.00K
0.00K emd
21950 0xfffffc005ef00ca0 0xfffffc005ef00a80 0xfffffc005ef00e58 0xfffffc005f70ac00 0xfffffc0
15cdd8700 0xfffffc005ef09340 W 0xfffffc005ef09340 pause 0x83a360d 0.00K
0.00K bdsop
21954 0xfffffc017f7e5720 0xfffffc017f7e5500 0xfffffc017f7e58d8 0xfffffc005f70b500 0xfffffc0
052164a00 0xfffffc005ef08000 W 0xfffffc015a421894 (null) 0xef1e790 0.00K
0.00K emi
21955 0xfffffc005ef00220 0xfffffc005ef00000 0xfffffc005ef003d8 0xfffffc005f70a6c0 0xfffffc0
042fdf500 0xfffffc0052457340 W 0xfffffc0052457340 pause 0x32348aa 0.00K
0.00K cyclic
21956 0xfffffc017f7f9720 0xfffffc017f7f9500 0xfffffc017f7f98d8 0xfffffc017f7e29c0 0xfffffc0
15d5da300 0xfffffc005ef08580 W 0xfffffc005ef08580 pause 0x31f5cfb 0.00K
0.00K oscm
21957 0xfffffc017f7f8220 0xfffffc017f7f8000 0xfffffc017f7f83d8 0xfffffc017f7e2b40 0xfffffc0
052165600 0xfffffc00524578c0 W 0xfffffc00524578c0 pause 0x83aa501 0.00K
0.00K sdval
21958 0xfffffc017f7f8ca0 0xfffffc017f7f8a80 0xfffffc017f7f8e58 0xfffffc005f70ab40 0xfffffc0
052164d00 0xfffffc0052457b80 W 0xfffffc0052457b80 pause 0x83a3575 0.00K
0.00K mcd_maint
21960 0xfffffc017fe43720 0xfffffc017fe43500 0xfffffc017fe438d8 0xfffffc017f7e3200 0xfffffc0
15ed87500 0xfffffc0052456b00 W 0xfffffc0052456b00 pause 0x31f555d 0.00K
0.00K snmpd
21961 0xfffffc0052c3b720 0xfffffc0052c3b500 0xfffffc0052c3b8d8 0xfffffc005f70ba40 0xfffffc0
15ed86d00 0xfffffc0052456840 W 0xfffffc0052456840 pause 0x34bfbb0 0.00K
0.00K sda
21964 0xfffffc0052c3a220 0xfffffc0052c3a000 0xfffffc0052c3a3d8 0xfffffc017f7e3a40 0xfffffc0
060210e00 0xfffffc005ef098c0 W 0xfffffc00008e3278 sv_msg_rcv 0x32f7a8d 0.00K
0.00K sop
21965 0xfffffc015ace3720 0xfffffc015ace3500 0xfffffc015ace38d8 0xfffffc017f7e2fc0 0xfffffc0
15d5db200 0xfffffc00524562c0 W 0xfffffc00008e32e0 sv_msg_rcv 0xef1f2a1 0.00K
0.00K sop
21974 0xfffffc017f802220 0xfffffc017f802000 0xfffffc017f8023d8 0xfffffc017f7e3bc0 0xfffffc0
06eee7a00 0xfffffc017fe40dc0 W 0xfffffc017fe40dc0 pause 0x31f7f92 0.00K
0.00K irs
21992 0xfffffc017fe42ca0 0xfffffc017fe42a80 0xfffffc017fe42e58 0xfffffc005f70b8c0 0xfffffc0
078229600 0xfffffc0052456000 W 0 usleep 0x31f576f 0.00K
0.00K tail
13975 0xfffffc015b581720 0xfffffc015b581500 0xfffffc015b5818d8 0xfffffc015aa78480 0xfffffc0
078671300 0xfffffc015a43d8c0 W 0xfffffc015b581720 wait 0xf2ed714 0.00K
0.00K sh
22170 0xfffffc017fe54220 0xfffffc017fe54000 0xfffffc017fe543d8 0xfffffc00520a7ec0 0xfffffc0
03d977600 0xfffffc0052456dc0 W 0xfffffc0052456dc0 pause 0x31f54d7 0.00K
0.00K qp
22171 0xfffffc017fe55720 0xfffffc017fe55500 0xfffffc017fe558d8 0xfffffc00520a7740 0xfffffc0
0685eaa00 0xfffffc005ef08840 W 0xfffffc005ef08840 pause 0x31f54f4 0.00K
0.00K qp
14050 0xfffffc0052c40ca0 0xfffffc0052c40a80 0xfffffc0052c40e58 0xfffffc005f70af00 0xfffffc0
078276e00 0xfffffc0052550dc0 W 0xfffffc0052c40ca0 wait 0xee2a5ae 0.00K
0.00K ksh
22578 0xfffffc017f801720 0xfffffc017f801500 0xfffffc017f8018d8 0xfffffc00520a6a80 0xfffffc0
17f80ec00 0xfffffc017fe40b00 W 0xfffffc017fe40b00 pause 0x31f5472 0.00K
0.00K qp
22584 0xfffffc017f800ca0 0xfffffc017f800a80 0xfffffc017f800e58 0xfffffc00520a75c0 0xfffffc0
17f80ed00 0xfffffc005ef082c0 R 0 - 0 1 0.00K
0.00K qp
22614 0xfffffc017fe54ca0 0xfffffc017fe54a80 0xfffffc017fe54e58 0xfffffc00520a72c0 0xfffffc0
15d54b400 0xfffffc0052456580 W 0xfffffc0052456580 pause 0x87053b5 0.00K
0.00K ami
22628 0xfffffc00528a9720 0xfffffc00528a9500 0xfffffc00528a98d8 0xfffffc00520a66c0 0xfffffc0
17f80fa00 0xfffffc017fe40000 W 0xfffffc017fe40000 pause 0x32f7a84 0.00K
0.00K sap
22636 0xfffffc015ace2220 0xfffffc015ace2000 0xfffffc015ace23d8 0xfffffc005f70acc0 0xfffffc0
15d54b700 0xfffffc017f7f7340 W 0xfffffc017f7f7340 pause 0x31f5472 0.00K
0.00K cra
22642 0xfffffc0082aa3720 0xfffffc0082aa3500 0xfffffc0082aa38d8 0xfffffc005f70bd40 0xfffffc0
03f17e800 0xfffffc017f7f6000 W 0xfffffc017f7f6000 pause 0xee4caa2 0.00K
0.00K vanish
22650 0xfffffc00528a8220 0xfffffc00528a8000 0xfffffc00528a83d8 0xfffffc005196acc0 0xfffffc0
08e6ba000 0xfffffc017fe41080 W 0xfffffc017fe41080 pause 0x31f5471 0.00K
0.00K apa
22651 0xfffffc0082aa2ca0 0xfffffc0082aa2a80 0xfffffc0082aa2e58 0xfffffc005f70b200 0xfffffc0
03f17f400 0xfffffc017f7f6dc0 W 0xfffffc017f7f6dc0 pause 0x31f5472 0.00K
0.00K nm_ctrl
10509 0xfffffc015a89f720 0xfffffc015a89f500 0xfffffc015a89f8d8 0xfffffc015aa78840 0xfffffc0
15ab2bf00 0xfffffc015a43d340 W 0 usleep 0x3220ef3 0.00K
0.00K dm
10533 0xfffffc015b8ce220 0xfffffc015b8ce000 0xfffffc015b8ce3d8 0xfffffc00520a6600 0xfffffc0
078671f00 0xfffffc00524bcb00 W 0xfffffc015bec4294 (null) 0x323484a 0.00K
0.00K cron
10545 0xfffffc00528a8ca0 0xfffffc00528a8a80 0xfffffc00528a8e58 0xfffffc00520a6cc0 0xfffffc0
08e6baf00 0xfffffc00524bc840 W 0xfffffc00528a8ca0 wait 0x4b9dc47 0.00K
0.00K ndbsard
10596 0xfffffc015a430220 0xfffffc015a430000 0xfffffc015a4303d8 0xfffffc00520a6fc0 0xfffffc0
08e6ba700 0xfffffc00524bd600 W 0xfffffc006dae9a68 tty 0xf447d64 0.00K
0.00K getty
22997 0xfffffc015b4caca0 0xfffffc015b4caa80 0xfffffc015b4cae58 0xfffffc00a099bbc0 0xfffffc0
06c5ffd00 0xfffffc0052551b80 W 0 usleep 0x3203c86 0.00K
0.00K sadc
23520 0xfffffc0052458ca0 0xfffffc0052458a80 0xfffffc0052458e58 0xfffffc005196a6c0 0xfffffc0
078277800 0xfffffc017fe418c0 R 0 - 0 0.00K
0.00K rlogin
23522 0xfffffc0052458220 0xfffffc0052458000 0xfffffc00524583d8 0xfffffc005196a600 0xfffffc0
078670e00 0xfffffc00524bd080 W 0xfffffc015bd495a0 socket 0xeb2b83a 0.00K
0.00K rlogin
20475 0xfffffc017f7e9720 0xfffffc017f7e9500 0xfffffc017f7e98d8 0xfffffc005f70bc80 0xfffffc0
03dced600 0xfffffc006faea000 W 0xfffffc006faea000 pause 0x31f8131 0.00K
0.00K update
cda> quit
******************** From dia *************************
******************************** ENTRY 44 ********************************
** Error during ETC processing of GEN seg
- Canonical buffer dump follows
Entry# (record in file) 0.
Canonical buff size 25662.
Canonical event size 0.
Canonical Event-Buffer:
15--<-12 11--<-08 07--<-04 03--<-00 :Byte Order
******************************** ENTRY 45 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 5.
Timestamp of occurrence 29-JAN-1997 17:17:19
Host name DFS00-01
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000006
CPU logging event (mperr) x00000001
Event validity 1. O/S claims event is valid
Event severity 5. Low Priority
Entry type 100. CPU Machine Check Errors
CPU Minor class 4. 620 System Correctable Error
--TLaser 620 Corr Error--
Software Flags x00000001 TLSB Error Log Snapshot Packet Present
Active CPUs x0000003F
Hardware Rev x00000000
System Serial Number NI550R9310
Module Serial Number AY63007173
System Revision x00000000
MCHK Reason Mask x00000086
MCHK Frame Rev x00000001
EI STAT xFFFFFFF0C4FFFFFF
DATA SOURCE IS MEMORY OR SYSTEM
CORRECTABLE ECC ERROR
D-ref fill
EV5 Chip Rev 4
EI ADDRESS xFFFFFF00055A0D6F
FILL SYNDROME x0000000000009D00
Data Bit = 119
ISR x0000000100000000
Correctable ECC errors (IPL31)
AST requests 3 - 0 x0000000000000000
WHAMI x01 TLSB NODE ID 0.
CPU1
MISCR x55 B-Cache Size 4 Mbyte Bcache
Two Processors
TLSB RUN Signal
CPU0 Running console
TLDEV x51008014 -- Device Type: Dual EV5 Proc, 300Mhz,
4meg Bcache
TLBER x00240000 CORRECTABLE READ DATA ERROR
DATA SYNDROME 1
TLESR0 x00400303
TLESR1 x00A09D00 ECC Syndrome 0 x00000000
ECC Syndrome 1 x0000009D
CORRECTABLE READ ECC ERROR
Error Syndrome 0 x00 No Error
Error Syndrome 1 x9D Data Bit = 119
TLESR2 x00406060
TLESR3 x00409090
Palcode Revision x0000000400000400
Palcode Rev: 4.0-1
*TLaser CPU Registers*
TLSB Node Number 0.
TLDEV x51008014 -- Device Type: Dual EV5 Proc, 300Mhz,
4meg Bcache
TLBER x00240000 CORRECTABLE READ DATA ERROR
DATA SYNDROME 1
TLCNR x00000200
TLVID x00000010
TLESR0 x00400303
TLESR1 x00A09D00 ECC Syndrome 0 x00000000
ECC Syndrome 1 x0000009D
CORRECTABLE READ ECC ERROR
TLESR2 x00406060
TLESR3 x00409090
TLEPAERR x00040000 First ADG Design: Rev F
MODCONFIG x00098AD4 Lockout Enable
Command Piping To EV5 Disabled
Bcache Size: 4 MB
Bcache Idle Cycles Before 11.
Max Command Queue Entries 2.
Max Bus Queue Entries 4.
TLEPMERR x00000000
TLEPDERR x00000000
TLEP Interrupt Mask 0 x000000FE IPL 14 Interrupt Enable
IPL 15 Interrupt Enable
IPL 16 Interrupt Enable
IPL 17 Interrupt Enable
Interprocessor Interrupt Enable
Interval Timer Interrupt Enable
CPU Halt Enable
TLEP Interrupt Summary 0 x00000000
TLEP Interrupt Mask 1 x00000000
TLEP Interrupt Summary 1 x00000000
*TLaser CPU Registers*
TLSB Node Number 1.
TLDEV x51008014 -- Device Type: Dual EV5 Proc, 300Mhz,
4meg Bcache
TLBER x00800000
TLCNR x00000210
TLVID x00000032
TLESR0 x00000303
TLESR1 x00000303
TLESR2 x00000303
TLESR3 x00000303
TLEPAERR x00040000 First ADG Design: Rev F
MODCONFIG x00098AD4 Lockout Enable
Command Piping To EV5 Disabled
Bcache Size: 4 MB
Bcache Idle Cycles Before 11.
Max Command Queue Entries 2.
Max Bus Queue Entries 4.
TLEPMERR x00000000
TLEPDERR x00000000
TLEP Interrupt Mask 0 x000000FE IPL 14 Interrupt Enable
IPL 15 Interrupt Enable
IPL 16 Interrupt Enable
IPL 17 Interrupt Enable
Interprocessor Interrupt Enable
Interval Timer Interrupt Enable
CPU Halt Enable
TLEP Interrupt Summary 0 x00000000
TLEP Interrupt Mask 1 x00000000
TLEP Interrupt Summary 1 x00000000
*TLaser CPU Registers*
TLSB Node Number 2.
TLDEV x51008014 -- Device Type: Dual EV5 Proc, 300Mhz,
4meg Bcache
TLBER x00800000
TLCNR x00000220
TLVID x00000054
TLESR0 x00000303
TLESR1 x00000303
TLESR2 x00000303
TLESR3 x00000303
TLEPAERR x00040000 First ADG Design: Rev F
MODCONFIG x00098AD4 Lockout Enable
Command Piping To EV5 Disabled
Bcache Size: 4 MB
Bcache Idle Cycles Before 11.
Max Command Queue Entries 2.
Max Bus Queue Entries 4.
TLEPMERR x00000000
TLEPDERR x00000000
TLEP Interrupt Mask 0 x000000FE IPL 14 Interrupt Enable
IPL 15 Interrupt Enable
IPL 16 Interrupt Enable
IPL 17 Interrupt Enable
Interprocessor Interrupt Enable
Interval Timer Interrupt Enable
CPU Halt Enable
TLEP Interrupt Summary 0 x00000000
TLEP Interrupt Mask 1 x00000000
TLEP Interrupt Summary 1 x00000000
* TLaser Memory Regs *
TLSB Node Number 5.
TLDEV x00005000 -- Device Type: Memory Module
TLBER x01100000 DATA TRANSMITTER DURING ERROR
TLCNR x000FC250
TLVID x00000080
FADR x070500000011D700
FADR 1 x07050000 Failing Command: Write Bank Unlock
Failing Bank = Bank 0
TLESR0 x00000303
TLESR1 x00000C0C
TLESR2 x00006060
TLESR3 x00009090
TMIR x80000001 Interleave x00000001
TMCR x0000023D 2GB Module (E2036-AA)
16 MB
70ns DRAM
Strings Installed = 8
DRAM timing: Bus Spd = 13.0-15.0;
Refresh Cnt = 1008
TMER x00000004 Failing String = x00000004
TMDRA x00000000 Refresh Rate 1X
TDDR0 x00000000
TDDR1 x00000000
TDDR2 x00000000
TDDR3 x00000000
* TLaser Memory Regs *
TLSB Node Number 6.
TLDEV x00005000 -- Device Type: Memory Module
TLBER x00800000
TLCNR x000FC260
TLVID x00000091
FADR 0 x0012000000300000
FADR 1 x00120000
TLESR0 x00000303
TLESR1 x00000303
TLESR2 x00000303
TLESR3 x00000303
TMIR x80000001 Interleave x00000001
TMCR x0000023D 2GB Module (E2036-AA)
16 MB
70ns DRAM
Strings Installed = 8
DRAM timing: Bus Spd = 13.0-15.0;
Refresh Cnt = 1008
TMER x00000000 Failing String = x00000000
TMDRA x00000000 Refresh Rate 1X
TDDR0 x00000000
TDDR1 x00000000
TDDR2 x00000000
TDDR3 x00000000
* TLaser Memory Regs *
TLSB Node Number 7.
TLDEV x00005000 -- Device Type: Memory Module
TLBER x00800000
TLCNR x000FC270
TLVID x000000A2
FADR 0 x0022000000300000
FADR 1 x00220000
TLESR0 x00000303
TLESR1 x00000303
TLESR2 x00000303
TLESR3 x00000303
TMIR x80000001 Interleave x00000001
TMCR x0000023D 2GB Module (E2036-AA)
16 MB
70ns DRAM
Strings Installed = 8
DRAM timing: Bus Spd = 13.0-15.0;
Refresh Cnt = 1008
TMER x00000000 Failing String = x00000000
TMDRA x00000000 Refresh Rate 1X
TDDR0 x00000000
TDDR1 x00000000
TDDR2 x00000000
TDDR3 x00000000
* TLaser I/O Registers *
TLSB Node Number 8.
TLDEV x00002000 -- Device Type: I/O Module
TLBER x00000000
FADR 0 x0000000000000000
FADR 1 x00000000
TLESR0 x00000000
TLESR1 x00000000
TLESR2 x00000000
TLESR3 x00000000
CPU Interrupt Mask x00000001 Cpu Interrupt Mask = x00000001
ICCMSR x00000000 Arbitration Control Minimum Latency Mode
Supress Control Suppress after 16
Transations
ICCNSE x80000000 Interrupt Enable on NSES Set
ICCMTR x00000000
IDPNSE-0 x00000006 Hose Power OK
Hose Cable OK
IDPNSE-1 x00000006 Hose Power OK
Hose Cable OK
IDPNSE-2 x00000000
IDPNSE-3 x00000000
IDPVR x00000800
ICCWTR x00000000
TLMBPR x0000000000000000
IDPDR0 x20000000
IDPDR1 x20000000
IDPDR2 x00000000
IDPDR3 x00000000
***************** From message buffer in forced crash *****************
(kdbx) p *pmsgbuf
struct {
msg_magic = 405601
msg_bufx = 127
msg_bufr = 3704
msg_bufc = "rors detected on cpu 0. Reporting suspended.
WARNING: too many System corrected errors detected on cpu 0. Reporting suspended.
al memory = 6144.00 megabytes.
available memory = 6039.08 megabytes.
using 7861 buffers containing 61.41 megabytes of memory
.
.
start up message stuff then
.
.
WARNING: too many System corrected errors detected on cpu 3. Reporting suspended.
WARNING: too many System corrected errors detected on cpu 4. Reporting suspended.
WARNING: too many Processor corrected errors detected on cpu 4. Reporting suspended.
WARNING: too many System corrected errors detected on cpu 2. Reporting suspended.
WARNING: too many System corrected errors detected on cpu 5. Reporting suspended.
WARNING: too many System corrected er"
}
T.R | Title | User | Personal Name | Date | Lines |
---|
8686.1 | | NETRIX::"[email protected]" | Shashi Mangalat | Mon Feb 03 1997 20:22 | 13 |
| >Notice that most of the kernel threads are in a U state.
This, by itself, is not indicative of any problems. Most kernel
threads wait uninterruptibly. Any thread that has a "(null)" wait
message does not tell us anything where it is waiting. So you need to
do a stack trace on these to figure out if they are okay. Other wait
messages in the output are normal. You may want to look at the running
threads to see what they are upto. The only suspicious looking one is
the thread waiting on a "wlock".
--shashi
[Posted by WWW Notes gateway]
|