[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference turris::digital_unix

Title:DIGITAL UNIX(FORMERLY KNOWN AS DEC OSF/1)
Notice:Welcome to the Digital UNIX Conference
Moderator:SMURF::DENHAM
Created:Thu Mar 16 1995
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:10068
Total number of notes:35879

8686.0. "4.0a hang on 8400 - bad memory?" by CSC32::TRENTA () Mon Feb 03 1997 16:36

Hi,

I have a customer that experienced a hang on a AlphaServer 8400 Model 5/300
running V4.0a Digital UNIX.  They sent me the forced crash, messages, 
binary.errlog, etc. and in looking at it the problem appears to be a
bad memory module.  However, I am not sure and would appreciate anyones
feedback on this.

Below is some of the output.  Notice that most of the kernel threads are
in a U state.  In binary.errlog only one entry that was a '620 System
Correctable Error' with a 'Write Bank Unlock  Failing Bank=0' around the
time of the hang.  In the message buffer there were "too many system corrected errors on cpu0,
2, 3, 4, and 5"  with nothing making it to messages file or syslog.dated
files.

Any ideas as to what is wrong?  If it is memory, which memory module?

Thanks in advance.

Debbie Trenta
Lucent/AT&T UNIX Support

*******************************************************************  
cda>  kps -cmopstuwMSTW
  PID               PROC               TASK              UTASK                MAP          
     PMAP             THREAD  STATE         WAIT_EVENT  WAIT_MESG  WAIT_TIME CPU SWAP_ALLOC
 SWAP_INUSE COMMAND
    0 0xfffffc017fdceca0 0xfffffc017fdcea80 0xfffffc017fdcee58 0xfffffc017fdbc000 0xfffffc0
00066bce0 0xfffffc017fdd22c0 I R                     0          -          0   0      0.00K
      0.00K kernel idle
                                                                                           
          0xfffffc017fdd2580     WU                  0 malloc_thread  0x31f5905    
                                                                                           
          0xfffffc017fdd2840   R  U                  0          -          0    
                                                                                           
          0xfffffc017fdd2b00     WU 0xfffffc00004be748     (null)  0x3315125    
                                                                                           
          0xfffffc017fdd2dc0     WU 0xfffffc0000484cb0     (null)  0x31fc746    
                                                                                           
          0xfffffc017fdd3080     WU 0xfffffc00005ad668     Zzzzzz  0xf4c062c    
                                                                                           
          0xfffffc017fdd3340     WU 0xfffffc00004c871c     (null)  0xf4c062c    
                                                                                           
          0xfffffc017fdd3600     WU 0xfffffc017fe11680     (null)  0xf4c062c    
                                                                                           
          0xfffffc017fdd38c0     WU 0xfffffc017fe11b00     (null)  0xf4bf527    
                                                                                           
          0xfffffc017fdd3b80     WU 0xfffffc0169b22000     (null)  0xf4be227    
                                                                                           
          0xfffffc015a936000     WU 0xfffffc00005a00b0     (null)  0xf4bc27e    
                                                                                           
          0xfffffc015a9362c0     WU 0xfffffc00005a0348     (null)  0x31f54f6    
                                                                                           
          0xfffffc015a936580   R                     0          -          0   0
                                                                                           
          0xfffffc015a936840     WU 0xfffffc0169b22480     (null)  0xf4bc27e    
                                                                                           
          0xfffffc015a936b00     WU 0xfffffc0169b22900     (null)  0xf4bae17    
                                                                                           
          0xfffffc015a936dc0     WU 0xfffffc0169b22d80     (null)  0xf4b99af    
                                                                                           
          0xfffffc015a937080     WU 0xfffffc0169b23200     (null)  0xf4b8814    
                                                                                           
          0xfffffc015a937340     WU 0xfffffc0169b23680     (null)  0xf4b73ab    
                                                                                           
          0xfffffc015a937600   R                     0          -          0   5
                                                                                           
          0xfffffc015a9378c0     W  0xfffffc00005abdc8     netisr  0xf4b6168    
                                                                                           
          0xfffffc015a937b80     W  0xfffffc00005abdc8     netisr  0x3264d4b    
                                                                                           
          0xfffffc015ab6a000     W  0xfffffc00005abdc8     netisr  0x69a75c7    
                                                                                           
          0xfffffc015ab6a2c0   R                     0          -          0    
                                                                                           
          0xfffffc015ab6a580     W  0xfffffc00005abdc8     netisr  0xae5f178    
                                                                                           
          0xfffffc015ab6a840   R                     0          -          0   3
                                                                                           
          0xfffffc015ab6ab00     WU 0xfffffc000049c278     (null)  0xf4b6168    
                                                                                           
          0xfffffc015ab6adc0     WU 0xfffffc00005ad990 kio_thread  0xf4b60e1    
                                                                                           
          0xfffffc015ab6b080     WU 0xfffffc015ab0ae40     (null)  0xf4b60e1    
                                                                                           
          0xfffffc015ab6b340     WU 0xfffffc015ab0b600     (null)  0xf4b60e1    
                                                                                           
          0xfffffc015ab6b600 I R                     0          -          0   1
                                                                                           
          0xfffffc015ab6b8c0 I R                     0          -          0   2
                                                                                           
          0xfffffc015ab6bb80 I R                     0          -          0   3
                                                                                           
          0xfffffc015aa72000 I R                     0          -          0   4
                                                                                           
          0xfffffc015aa722c0 I R                     0          -          0   5
                                                                                           
          0xfffffc015aa72580   R  U                  0          -          0    
                                                                                           
          0xfffffc015aa72840     WU 0xfffffc000064aad0     (null)  0xf4b6098    
                                                                                           
          0xfffffc015aa72dc0     WU 0xfffffc000064bd70     Tswpin  0xf4b5ff9    
                                                                                           
          0xfffffc015aa73080     WU 0xfffffc000064bd80    Tswpout  0xf4b5ff9    
                                                                                           
          0xfffffc015aa73340     WU 0xfffffc00005ace78    pageout  0x31f54f9    
                                                                                           
          0xfffffc015aa73600     WU 0xfffffc000064bda0     reaper  0xef1dd9b    
                                                                                           
          0xfffffc015aa738c0     WU 0xfffffc000064bdb0     tswpin  0xf4b5ff8    
                                                                                           
          0xfffffc015aa73b80     WU 0xfffffc00005a2508    tswpout  0xf4b5ff8    
                                                                                           
          0xfffffc015a43c000     WU 0xfffffc000064bd40     actque  0xf4b5ff8    
                                                                                           
          0xfffffc015a43c2c0     WU 0xfffffc00002c4420   acctwtch  0x31f5dae    
                                                                                           
          0xfffffc015a43cb00     WU 0xfffffc00006b9568     (null)  0xf4abcb9    
                                                                                           
          0xfffffc015a43cdc0     WU 0xfffffc00006b9568     (null)  0xf4abcb9    
    1 0xfffffc017fdcf720 0xfffffc017fdcf500 0xfffffc017fdcf8d8 0xfffffc015aa78240 0xfffffc0
15ab2bd00 0xfffffc015aa72b00     W  0xfffffc015aa72b00      pause  0x320636c          0.00K
      0.00K init
   40 0xfffffc015a89e220 0xfffffc015a89e000 0xfffffc015a89e3d8 0xfffffc015aa78540 0xfffffc0
0530a4100 0xfffffc015a43d080     W  0xfffffc015a43d298      event  0x31fa61a          0.00K
      0.00K vold
  164 0xfffffc015a89eca0 0xfffffc015a89ea80 0xfffffc015a89ee58 0xfffffc015aa78300 0xfffffc0
15a432100 0xfffffc017fdd2000     W  0xfffffc015a4db060 sv_msg_rcv  0xf49928c          0.00K
      0.00K kloadsrv
  178 0xfffffc015a431720 0xfffffc015a431500 0xfffffc015a4318d8 0xfffffc015aa786c0 0xfffffc0
15d54b200 0xfffffc015a43c840     W  0xfffffc015a43c840      pause  0x31f8842          0.00K
      0.00K update
  309 0xfffffc0052175720 0xfffffc0052175500 0xfffffc00521758d8 0xfffffc015aa78900 0xfffffc0
15d937200 0xfffffc015a43d600   R                     0          -          0   2      0.00K
      0.00K syslogd
  311 0xfffffc0052957720 0xfffffc0052957500 0xfffffc00529578d8 0xfffffc015aa78a80 0xfffffc0
15d937400 0xfffffc0052550000     WU 0xfffffc0000ab9ad0      event  0x31f5472          0.00K
      0.00K binlogd
  321 0xfffffc0052956220 0xfffffc0052956000 0xfffffc00529563d8 0xfffffc015aa78cc0 0xfffffc0
15d937c00 0xfffffc00525502c0     W  0xfffffc00525504d8      event  0x32347f2          0.00K
      0.00K auditd
  358 0xfffffc0052174220 0xfffffc0052174000 0xfffffc00521743d8 0xfffffc015aa78e40 0xfffffc0
17f80bf00 0xfffffc0052550b00     W  0xfffffc00005a0968     Zzzzzz  0xf498bdc          0.00K
      0.00K epcadl
  370 0xfffffc015b580ca0 0xfffffc015b580a80 0xfffffc015b580e58 0xfffffc015aa78fc0 0xfffffc0
17f80c100 0xfffffc0052550580     W  0xfffffc0052550798      event  0x3274c3b          0.00K
      0.00K inetd
  373 0xfffffc0052174ca0 0xfffffc0052174a80 0xfffffc0052174e58 0xfffffc015aa79980 0xfffffc0
17f808a00 0xfffffc015a43c580     W  0xfffffc0052174ca0       wait  0xf499972          0.00K
      0.00K volwatch
  377 0xfffffc015b580220 0xfffffc015b580000 0xfffffc015b5803d8 0xfffffc015aa79b00 0xfffffc0
17f808300 0xfffffc0052550840     W  0xfffffc015a4e1494     (null)  0xf495bfa          0.00K
      0.00K volwatch
  378 0xfffffc0052154220 0xfffffc0052154000 0xfffffc00521543d8 0xfffffc015aa79c80 0xfffffc0
17f808600 0xfffffc0052551080     W  0xfffffc0052551298      event  0xf495d47          0.00K
      0.00K volnotify
16809 0xfffffc0082aa2220 0xfffffc0082aa2000 0xfffffc0082aa23d8 0xfffffc00520a7b00 0xfffffc0
052475400 0xfffffc017f804580     W  0xfffffc017f804580      pause  0x31f5f6c          0.00K
      0.00K update
  439 0xfffffc0052154ca0 0xfffffc0052154a80 0xfffffc0052154e58 0xfffffc015aa79d40 0xfffffc0
052878400 0xfffffc0052551340     W  0xfffffc0052551558      event  0xb04980b          0.00K
      0.00K portmap
21067 0xfffffc0052c3aca0 0xfffffc0052c3aa80 0xfffffc0052c3ae58 0xfffffc017f7e3440 0xfffffc0
170e0b300 0xfffffc005ef08b00     W  0xfffffc005ef08d18      event  0x40011ea          0.00K
      0.00K rlogind
21072 0xfffffc017fe3f720 0xfffffc017fe3f500 0xfffffc017fe3f8d8 0xfffffc017f7e2540 0xfffffc0
170e0a100 0xfffffc005ef09080     W  0xfffffc017fe3f720       wait  0xef48056          0.00K
      0.00K sh
21091 0xfffffc017fe3e220 0xfffffc017fe3e000 0xfffffc017fe3e3d8 0xfffffc00520a7e00 0xfffffc0
078670300 0xfffffc005ef08dc0     W  0xfffffc017fe3e220       wait  0xef28c59          0.00K
      0.00K sh
21231 0xfffffc015b4ca220 0xfffffc015b4ca000 0xfffffc015b4ca3d8 0xfffffc005f70a840 0xfffffc0
15e55de00 0xfffffc017f7f6840     W  0xfffffc017f7f6840      pause  0x31fc865          0.00K
      0.00K update
21276 0xfffffc0052956ca0 0xfffffc0052956a80 0xfffffc0052956e58 0xfffffc005f70b5c0 0xfffffc0
052474700 0xfffffc017f7f62c0     W  0xfffffc015d9b3c68        tty  0x40011e9          0.00K
      0.00K ksh
21372 0xfffffc015a430ca0 0xfffffc015a430a80 0xfffffc015a430e58 0xfffffc015aa78600 0xfffffc0
06c5fe300 0xfffffc00525518c0     W  0xfffffc00525518c0      pause  0x31f653e          0.00K
      0.00K scm
21509 0xfffffc015b4cb720 0xfffffc015b4cb500 0xfffffc015b4cb8d8 0xfffffc005f70a3c0 0xfffffc0
15b8ef700 0xfffffc017f7f6b00     W  0xfffffc00008e30d8 sv_msg_rcv  0x32f7a6b          0.00K
      0.00K csop
21513 0xfffffc0052c41720 0xfffffc0052c41500 0xfffffc0052c418d8 0xfffffc005f70a240 0xfffffc0
052474100 0xfffffc017f7f7b80     W  0xfffffc017f7f7b80      pause  0x31f5474          0.00K
      0.00K lan_mgr
21514 0xfffffc0052c40220 0xfffffc0052c40000 0xfffffc0052c403d8 0xfffffc005f70bb00 0xfffffc0
06eee6d00 0xfffffc017f7f6580     W  0xfffffc015c5fa540 sv_msg_rcv  0x31f98d1          0.00K
      0.00K dbmaint
21518 0xfffffc017f800220 0xfffffc017f800000 0xfffffc017f8003d8 0xfffffc005f70aa80 0xfffffc0
06eee7b00 0xfffffc017f7f78c0     W  0xfffffc00008e2f38 sv_msg_rcv  0x31f5473          0.00K
      0.00K rip
21520 0xfffffc0052155720 0xfffffc0052155500 0xfffffc00521558d8 0xfffffc005f70a600 0xfffffc0
06eee7600 0xfffffc017f7f7600   R                     0          -          0   4      0.00K
      0.00K lan_mgr_srv
21526 0xfffffc017f802ca0 0xfffffc017f802a80 0xfffffc017f802e58 0xfffffc005f70afc0 0xfffffc0
0685ea100 0xfffffc017fe41b80     W  0xfffffc017fe41b80      pause  0x31f664c          0.00K
      0.00K hcm
13339 0xfffffc0052448220 0xfffffc0052448000 0xfffffc00524483d8 0xfffffc00520a7140 0xfffffc0
08e6bb700 0xfffffc0052551600     W  0xfffffc0052551600      pause  0x31fb1b2          0.00K
      0.00K update
21534 0xfffffc0052448ca0 0xfffffc0052448a80 0xfffffc0052448e58 0xfffffc005f70b440 0xfffffc0
078276000 0xfffffc017fe41340     W  0xfffffc017fe41340      pause  0x31f580d          0.00K
      0.00K sched
21537 0xfffffc017f7fcca0 0xfffffc017f7fca80 0xfffffc017f7fce58 0xfffffc005f70a780 0xfffffc0
078276900 0xfffffc017fe41600     W  0xfffffc017fe41600      pause  0x31f55fe          0.00K
      0.00K ss7omap
21539 0xfffffc0052449720 0xfffffc0052449500 0xfffffc00524498d8 0xfffffc005f70bbc0 0xfffffc0
078277400 0xfffffc017fe402c0     WU 0xfffffc00976c620c      wlock  0x31f557d          0.00K
      0.00K ss7ioc
21541 0xfffffc017f7fc220 0xfffffc017f7fc000 0xfffffc017f7fc3d8 0xfffffc005f70a0c0 0xfffffc0
0685ea700 0xfffffc017fe40580     W  0xfffffc017fe40580      pause  0x31f5611          0.00K
      0.00K ss7trap
17447 0xfffffc017f7e8220 0xfffffc017f7e8000 0xfffffc017f7e83d8 0xfffffc005f70b800 0xfffffc0
07ddc9f00 0xfffffc015a43db80     W  0xfffffc015a43db80      pause  0x31f73c1          0.00K
      0.00K update
17518 0xfffffc017f7e8ca0 0xfffffc017f7e8a80 0xfffffc017f7e8e58 0xfffffc005f70b740 0xfffffc0
03d977400 0xfffffc006faeab00     W  0xfffffc006faeab00      pause  0x31f9013          0.00K
      0.00K update
21743 0xfffffc017f7e4220 0xfffffc017f7e4000 0xfffffc017f7e43d8 0xfffffc005f70a180 0xfffffc0
05f5aa200 0xfffffc005ef09600     W  0xfffffc005ef09600      pause  0xef1fbdb          0.00K
      0.00K ca
21912 0xfffffc005ef01720 0xfffffc005ef01500 0xfffffc005ef018d8 0xfffffc005f70b080 0xfffffc0
17fe45f00 0xfffffc0052457600     W  0xfffffc0052457600      pause  0x31f55b7          0.00K
      0.00K dbmaint
21941 0xfffffc017f7e4ca0 0xfffffc017f7e4a80 0xfffffc017f7e4e58 0xfffffc005f70b680 0xfffffc0
05f5ab900 0xfffffc005ef09b80     W  0xfffffc005ef09b80      pause  0xef1fb34          0.00K
      0.00K ca
21948 0xfffffc017fe42220 0xfffffc017fe42000 0xfffffc017fe423d8 0xfffffc017f7e2e40 0xfffffc0
15ed87400 0xfffffc0052457080     W  0xfffffc0052457080      pause  0xef1faf6          0.00K
      0.00K emd
21950 0xfffffc005ef00ca0 0xfffffc005ef00a80 0xfffffc005ef00e58 0xfffffc005f70ac00 0xfffffc0
15cdd8700 0xfffffc005ef09340     W  0xfffffc005ef09340      pause  0x83a360d          0.00K
      0.00K bdsop
21954 0xfffffc017f7e5720 0xfffffc017f7e5500 0xfffffc017f7e58d8 0xfffffc005f70b500 0xfffffc0
052164a00 0xfffffc005ef08000     W  0xfffffc015a421894     (null)  0xef1e790          0.00K
      0.00K emi
21955 0xfffffc005ef00220 0xfffffc005ef00000 0xfffffc005ef003d8 0xfffffc005f70a6c0 0xfffffc0
042fdf500 0xfffffc0052457340     W  0xfffffc0052457340      pause  0x32348aa          0.00K
      0.00K cyclic
21956 0xfffffc017f7f9720 0xfffffc017f7f9500 0xfffffc017f7f98d8 0xfffffc017f7e29c0 0xfffffc0
15d5da300 0xfffffc005ef08580     W  0xfffffc005ef08580      pause  0x31f5cfb          0.00K
      0.00K oscm
21957 0xfffffc017f7f8220 0xfffffc017f7f8000 0xfffffc017f7f83d8 0xfffffc017f7e2b40 0xfffffc0
052165600 0xfffffc00524578c0     W  0xfffffc00524578c0      pause  0x83aa501          0.00K
      0.00K sdval
21958 0xfffffc017f7f8ca0 0xfffffc017f7f8a80 0xfffffc017f7f8e58 0xfffffc005f70ab40 0xfffffc0
052164d00 0xfffffc0052457b80     W  0xfffffc0052457b80      pause  0x83a3575          0.00K
      0.00K mcd_maint
21960 0xfffffc017fe43720 0xfffffc017fe43500 0xfffffc017fe438d8 0xfffffc017f7e3200 0xfffffc0
15ed87500 0xfffffc0052456b00     W  0xfffffc0052456b00      pause  0x31f555d          0.00K
      0.00K snmpd
21961 0xfffffc0052c3b720 0xfffffc0052c3b500 0xfffffc0052c3b8d8 0xfffffc005f70ba40 0xfffffc0
15ed86d00 0xfffffc0052456840     W  0xfffffc0052456840      pause  0x34bfbb0          0.00K
      0.00K sda
21964 0xfffffc0052c3a220 0xfffffc0052c3a000 0xfffffc0052c3a3d8 0xfffffc017f7e3a40 0xfffffc0
060210e00 0xfffffc005ef098c0     W  0xfffffc00008e3278 sv_msg_rcv  0x32f7a8d          0.00K
      0.00K sop
21965 0xfffffc015ace3720 0xfffffc015ace3500 0xfffffc015ace38d8 0xfffffc017f7e2fc0 0xfffffc0
15d5db200 0xfffffc00524562c0     W  0xfffffc00008e32e0 sv_msg_rcv  0xef1f2a1          0.00K
      0.00K sop
21974 0xfffffc017f802220 0xfffffc017f802000 0xfffffc017f8023d8 0xfffffc017f7e3bc0 0xfffffc0
06eee7a00 0xfffffc017fe40dc0     W  0xfffffc017fe40dc0      pause  0x31f7f92          0.00K
      0.00K irs
21992 0xfffffc017fe42ca0 0xfffffc017fe42a80 0xfffffc017fe42e58 0xfffffc005f70b8c0 0xfffffc0
078229600 0xfffffc0052456000     W                   0     usleep  0x31f576f          0.00K
      0.00K tail
13975 0xfffffc015b581720 0xfffffc015b581500 0xfffffc015b5818d8 0xfffffc015aa78480 0xfffffc0
078671300 0xfffffc015a43d8c0     W  0xfffffc015b581720       wait  0xf2ed714          0.00K
      0.00K sh
22170 0xfffffc017fe54220 0xfffffc017fe54000 0xfffffc017fe543d8 0xfffffc00520a7ec0 0xfffffc0
03d977600 0xfffffc0052456dc0     W  0xfffffc0052456dc0      pause  0x31f54d7          0.00K
      0.00K qp
22171 0xfffffc017fe55720 0xfffffc017fe55500 0xfffffc017fe558d8 0xfffffc00520a7740 0xfffffc0
0685eaa00 0xfffffc005ef08840     W  0xfffffc005ef08840      pause  0x31f54f4          0.00K
      0.00K qp
14050 0xfffffc0052c40ca0 0xfffffc0052c40a80 0xfffffc0052c40e58 0xfffffc005f70af00 0xfffffc0
078276e00 0xfffffc0052550dc0     W  0xfffffc0052c40ca0       wait  0xee2a5ae          0.00K
      0.00K ksh
22578 0xfffffc017f801720 0xfffffc017f801500 0xfffffc017f8018d8 0xfffffc00520a6a80 0xfffffc0
17f80ec00 0xfffffc017fe40b00     W  0xfffffc017fe40b00      pause  0x31f5472          0.00K
      0.00K qp
22584 0xfffffc017f800ca0 0xfffffc017f800a80 0xfffffc017f800e58 0xfffffc00520a75c0 0xfffffc0
17f80ed00 0xfffffc005ef082c0   R                     0          -          0   1      0.00K
      0.00K qp
22614 0xfffffc017fe54ca0 0xfffffc017fe54a80 0xfffffc017fe54e58 0xfffffc00520a72c0 0xfffffc0
15d54b400 0xfffffc0052456580     W  0xfffffc0052456580      pause  0x87053b5          0.00K
      0.00K ami
22628 0xfffffc00528a9720 0xfffffc00528a9500 0xfffffc00528a98d8 0xfffffc00520a66c0 0xfffffc0
17f80fa00 0xfffffc017fe40000     W  0xfffffc017fe40000      pause  0x32f7a84          0.00K
      0.00K sap
22636 0xfffffc015ace2220 0xfffffc015ace2000 0xfffffc015ace23d8 0xfffffc005f70acc0 0xfffffc0
15d54b700 0xfffffc017f7f7340     W  0xfffffc017f7f7340      pause  0x31f5472          0.00K
      0.00K cra
22642 0xfffffc0082aa3720 0xfffffc0082aa3500 0xfffffc0082aa38d8 0xfffffc005f70bd40 0xfffffc0
03f17e800 0xfffffc017f7f6000     W  0xfffffc017f7f6000      pause  0xee4caa2          0.00K
      0.00K vanish
22650 0xfffffc00528a8220 0xfffffc00528a8000 0xfffffc00528a83d8 0xfffffc005196acc0 0xfffffc0
08e6ba000 0xfffffc017fe41080     W  0xfffffc017fe41080      pause  0x31f5471          0.00K
      0.00K apa
22651 0xfffffc0082aa2ca0 0xfffffc0082aa2a80 0xfffffc0082aa2e58 0xfffffc005f70b200 0xfffffc0
03f17f400 0xfffffc017f7f6dc0     W  0xfffffc017f7f6dc0      pause  0x31f5472          0.00K
      0.00K nm_ctrl
10509 0xfffffc015a89f720 0xfffffc015a89f500 0xfffffc015a89f8d8 0xfffffc015aa78840 0xfffffc0
15ab2bf00 0xfffffc015a43d340     W                   0     usleep  0x3220ef3          0.00K
      0.00K dm
10533 0xfffffc015b8ce220 0xfffffc015b8ce000 0xfffffc015b8ce3d8 0xfffffc00520a6600 0xfffffc0
078671f00 0xfffffc00524bcb00     W  0xfffffc015bec4294     (null)  0x323484a          0.00K
      0.00K cron
10545 0xfffffc00528a8ca0 0xfffffc00528a8a80 0xfffffc00528a8e58 0xfffffc00520a6cc0 0xfffffc0
08e6baf00 0xfffffc00524bc840     W  0xfffffc00528a8ca0       wait  0x4b9dc47          0.00K
      0.00K ndbsard
10596 0xfffffc015a430220 0xfffffc015a430000 0xfffffc015a4303d8 0xfffffc00520a6fc0 0xfffffc0
08e6ba700 0xfffffc00524bd600     W  0xfffffc006dae9a68        tty  0xf447d64          0.00K
      0.00K getty
22997 0xfffffc015b4caca0 0xfffffc015b4caa80 0xfffffc015b4cae58 0xfffffc00a099bbc0 0xfffffc0
06c5ffd00 0xfffffc0052551b80     W                   0     usleep  0x3203c86          0.00K
      0.00K sadc
23520 0xfffffc0052458ca0 0xfffffc0052458a80 0xfffffc0052458e58 0xfffffc005196a6c0 0xfffffc0
078277800 0xfffffc017fe418c0   R                     0          -          0          0.00K
      0.00K rlogin
23522 0xfffffc0052458220 0xfffffc0052458000 0xfffffc00524583d8 0xfffffc005196a600 0xfffffc0
078670e00 0xfffffc00524bd080     W  0xfffffc015bd495a0     socket  0xeb2b83a          0.00K
      0.00K rlogin
20475 0xfffffc017f7e9720 0xfffffc017f7e9500 0xfffffc017f7e98d8 0xfffffc005f70bc80 0xfffffc0
03dced600 0xfffffc006faea000     W  0xfffffc006faea000      pause  0x31f8131          0.00K
      0.00K update
cda> quit


******************** From dia *************************

******************************** ENTRY   44 ******************************** 


                                     ** Error during ETC processing of GEN seg 
                                     -  Canonical buffer dump follows 

Entry# (record in file)           0. 
Canonical buff size           25662. 
Canonical event size              0. 
Canonical Event-Buffer: 

          15--<-12  11--<-08  07--<-04  03--<-00   :Byte Order 



******************************** ENTRY   45 ******************************** 


Logging OS                        2. Digital UNIX 
System Architecture               2. Alpha 
Event sequence number             5. 
Timestamp of occurrence              29-JAN-1997 17:17:19   
Host name                            DFS00-01 

System type register      x0000000C  AlphaServer 8x00 
Number of CPUs (mpnum)    x00000006 
CPU logging event (mperr) x00000001 

Event validity                    1. O/S claims event is valid 
Event severity                    5. Low Priority 
Entry type                      100. CPU Machine Check Errors 

CPU Minor class                   4. 620 System Correctable Error 

--TLaser 620 Corr Error--              
Software Flags            x00000001  TLSB Error Log Snapshot Packet Present 
Active CPUs               x0000003F 
Hardware Rev              x00000000 
System Serial Number                 NI550R9310 
Module Serial Number                 AY63007173 
System Revision           x00000000 
MCHK Reason Mask          x00000086 
MCHK Frame Rev            x00000001 
EI STAT                   xFFFFFFF0C4FFFFFF 
                                     DATA SOURCE IS MEMORY OR SYSTEM 
                                     CORRECTABLE ECC ERROR 
                                     D-ref fill 
                                     EV5 Chip Rev 4 
EI ADDRESS                xFFFFFF00055A0D6F 
FILL SYNDROME             x0000000000009D00 
                                     Data Bit = 119 
ISR                       x0000000100000000 
                                     Correctable ECC errors (IPL31) 
                                     AST requests 3 - 0  x0000000000000000 
WHAMI                           x01  TLSB NODE ID  0. 
                                     CPU1 
MISCR                           x55  B-Cache Size  4 Mbyte Bcache 
                                     Two Processors 
                                     TLSB RUN Signal 
                                     CPU0 Running console 
TLDEV                     x51008014    -- Device Type:  Dual EV5 Proc, 300Mhz, 
                                                        4meg Bcache 
TLBER                     x00240000  CORRECTABLE READ DATA ERROR 
                                     DATA SYNDROME 1 
TLESR0                    x00400303 
TLESR1                    x00A09D00  ECC Syndrome 0  x00000000 
                                     ECC Syndrome 1  x0000009D 
                                     CORRECTABLE READ ECC ERROR 

  Error Syndrome 0              x00  No Error 
  Error Syndrome 1              x9D  Data Bit = 119 

TLESR2                    x00406060 
TLESR3                    x00409090 
Palcode Revision          x0000000400000400 
                                     Palcode Rev: 4.0-1 


*TLaser CPU Registers*                 
TLSB Node Number                  0. 
TLDEV                     x51008014    -- Device Type:  Dual EV5 Proc, 300Mhz, 
                                                        4meg Bcache 

TLBER                     x00240000  CORRECTABLE READ DATA ERROR 
                                     DATA SYNDROME 1 
TLCNR                     x00000200 
TLVID                     x00000010 
TLESR0                    x00400303 
TLESR1                    x00A09D00  ECC Syndrome 0  x00000000 
                                     ECC Syndrome 1  x0000009D 
                                     CORRECTABLE READ ECC ERROR 
TLESR2                    x00406060 
TLESR3                    x00409090 
TLEPAERR                  x00040000  First ADG Design:  Rev F 
MODCONFIG                 x00098AD4  Lockout Enable 
                                     Command Piping To EV5 Disabled 
                                     Bcache Size:   4 MB 
                                     Bcache Idle Cycles Before 11. 
                                     Max Command Queue Entries 2. 
                                     Max Bus Queue Entries   4. 
TLEPMERR                  x00000000 
TLEPDERR                  x00000000 
TLEP Interrupt Mask 0     x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TLEP Interrupt Summary 0  x00000000 
TLEP Interrupt Mask 1     x00000000 
TLEP Interrupt Summary 1  x00000000 


*TLaser CPU Registers*                 
TLSB Node Number                  1. 
TLDEV                     x51008014    -- Device Type:  Dual EV5 Proc, 300Mhz, 
                                                        4meg Bcache 

TLBER                     x00800000 
TLCNR                     x00000210 
TLVID                     x00000032 
TLESR0                    x00000303 
TLESR1                    x00000303 
TLESR2                    x00000303 
TLESR3                    x00000303 
TLEPAERR                  x00040000  First ADG Design:  Rev F 
MODCONFIG                 x00098AD4  Lockout Enable 
                                     Command Piping To EV5 Disabled 
                                     Bcache Size:   4 MB 
                                     Bcache Idle Cycles Before 11. 
                                     Max Command Queue Entries 2. 
                                     Max Bus Queue Entries   4. 
TLEPMERR                  x00000000 
TLEPDERR                  x00000000 
TLEP Interrupt Mask 0     x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TLEP Interrupt Summary 0  x00000000 
TLEP Interrupt Mask 1     x00000000 
TLEP Interrupt Summary 1  x00000000 


*TLaser CPU Registers*                 
TLSB Node Number                  2. 
TLDEV                     x51008014    -- Device Type:  Dual EV5 Proc, 300Mhz, 
                                                        4meg Bcache 

TLBER                     x00800000 
TLCNR                     x00000220 
TLVID                     x00000054 
TLESR0                    x00000303 
TLESR1                    x00000303 
TLESR2                    x00000303 
TLESR3                    x00000303 
TLEPAERR                  x00040000  First ADG Design:  Rev F 
MODCONFIG                 x00098AD4  Lockout Enable 
                                     Command Piping To EV5 Disabled 
                                     Bcache Size:   4 MB 
                                     Bcache Idle Cycles Before 11. 
                                     Max Command Queue Entries 2. 
                                     Max Bus Queue Entries   4. 
TLEPMERR                  x00000000 
TLEPDERR                  x00000000 
TLEP Interrupt Mask 0     x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TLEP Interrupt Summary 0  x00000000 
TLEP Interrupt Mask 1     x00000000 
TLEP Interrupt Summary 1  x00000000 


* TLaser Memory Regs *                 
TLSB Node Number                  5. 
TLDEV                     x00005000    -- Device Type:  Memory Module 

TLBER                     x01100000  DATA TRANSMITTER DURING ERROR 
TLCNR                     x000FC250 
TLVID                     x00000080 
FADR                      x070500000011D700 
FADR 1                    x07050000  Failing Command:    Write Bank Unlock 
                                     Failing Bank =   Bank 0 
TLESR0                    x00000303 
TLESR1                    x00000C0C 
TLESR2                    x00006060 
TLESR3                    x00009090 
TMIR                      x80000001  Interleave  x00000001 
TMCR                      x0000023D  2GB Module (E2036-AA) 
                                     16 MB 
                                     70ns DRAM 
                                     Strings Installed =   8 
                                     DRAM timing:   Bus Spd = 13.0-15.0; 
                                                    Refresh Cnt = 1008 
TMER                      x00000004  Failing String =   x00000004 
TMDRA                     x00000000  Refresh Rate   1X 
TDDR0                     x00000000 
TDDR1                     x00000000 
TDDR2                     x00000000 
TDDR3                     x00000000 


* TLaser Memory Regs *                 
TLSB Node Number                  6. 
TLDEV                     x00005000    -- Device Type:  Memory Module 

TLBER                     x00800000 
TLCNR                     x000FC260 
TLVID                     x00000091 
FADR 0                    x0012000000300000 
FADR 1                    x00120000 
TLESR0                    x00000303 
TLESR1                    x00000303 
TLESR2                    x00000303 
TLESR3                    x00000303 
TMIR                      x80000001  Interleave  x00000001 
TMCR                      x0000023D  2GB Module (E2036-AA) 
                                     16 MB 
                                     70ns DRAM 
                                     Strings Installed =   8 
                                     DRAM timing:   Bus Spd = 13.0-15.0; 
                                                    Refresh Cnt = 1008 
TMER                      x00000000  Failing String =   x00000000 
TMDRA                     x00000000  Refresh Rate   1X 
TDDR0                     x00000000 
TDDR1                     x00000000 
TDDR2                     x00000000 
TDDR3                     x00000000 


* TLaser Memory Regs *                 
TLSB Node Number                  7. 
TLDEV                     x00005000    -- Device Type:  Memory Module 

TLBER                     x00800000 
TLCNR                     x000FC270 
TLVID                     x000000A2 
FADR 0                    x0022000000300000 
FADR 1                    x00220000 
TLESR0                    x00000303 
TLESR1                    x00000303 
TLESR2                    x00000303 
TLESR3                    x00000303 
TMIR                      x80000001  Interleave  x00000001 
TMCR                      x0000023D  2GB Module (E2036-AA) 
                                     16 MB 
                                     70ns DRAM 
                                     Strings Installed =   8 
                                     DRAM timing:   Bus Spd = 13.0-15.0; 
                                                    Refresh Cnt = 1008 
TMER                      x00000000  Failing String =   x00000000 
TMDRA                     x00000000  Refresh Rate   1X 
TDDR0                     x00000000 
TDDR1                     x00000000 
TDDR2                     x00000000 
TDDR3                     x00000000 


* TLaser I/O Registers *               
TLSB Node Number                  8. 
TLDEV                     x00002000    -- Device Type:  I/O Module 

TLBER                     x00000000 
FADR 0                    x0000000000000000 
FADR 1                    x00000000 
TLESR0                    x00000000 
TLESR1                    x00000000 
TLESR2                    x00000000 
TLESR3                    x00000000 
CPU Interrupt Mask        x00000001  Cpu Interrupt Mask =   x00000001 
ICCMSR                    x00000000  Arbitration Control  Minimum Latency Mode 
                                     Supress Control  Suppress after 16 
                                                      Transations 
ICCNSE                    x80000000  Interrupt Enable on NSES Set 
ICCMTR                    x00000000 
IDPNSE-0                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPNSE-1                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPNSE-2                  x00000000 
IDPNSE-3                  x00000000 
IDPVR                     x00000800 
ICCWTR                    x00000000 
TLMBPR                    x0000000000000000 
IDPDR0                    x20000000 
IDPDR1                    x20000000 
IDPDR2                    x00000000 
IDPDR3                    x00000000 


***************** From message buffer in forced crash *****************

(kdbx) p *pmsgbuf
struct {
    msg_magic = 405601
    msg_bufx = 127
    msg_bufr = 3704
    msg_bufc = "rors detected on cpu 0. Reporting suspended.
WARNING: too many System corrected errors detected on cpu 0. Reporting suspended.
al memory = 6144.00 megabytes.
available memory = 6039.08 megabytes.
using 7861 buffers containing 61.41 megabytes of memory
.
.
start up message stuff then
.
.
WARNING: too many System corrected errors detected on cpu 3. Reporting suspended.
WARNING: too many System corrected errors detected on cpu 4. Reporting suspended.
WARNING: too many Processor corrected errors detected on cpu 4. Reporting suspended.
WARNING: too many System corrected errors detected on cpu 2. Reporting suspended.
WARNING: too many System corrected errors detected on cpu 5. Reporting suspended.
WARNING: too many System corrected er"
}
T.RTitleUserPersonal
Name
DateLines
8686.1NETRIX::&quot;[email protected]&quot;Shashi MangalatMon Feb 03 1997 20:2213
>Notice that most of the kernel threads are in a U state.

This, by itself, is not indicative of any problems.  Most kernel
threads wait uninterruptibly.  Any thread that has a "(null)" wait
message does not tell us anything where it is waiting.  So you need to
do a stack trace on these to figure out if they are okay.  Other wait
messages in the output are normal.  You may want to look at the running
threads to see what they are upto.  The only suspicious looking one is
the thread waiting on a "wlock".

--shashi

[Posted by WWW Notes gateway]