
Conference wonder::turbolaser

Title:TurboLaser Notesfile - AlphaServer 8200 and 8400 systems
Notice:Welcome to WONDER::TURBOLASER in its new home shortly
Moderator:LANDO::DROBNER
Created:Tue Dec 20 1994
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:1218
Total number of notes:4645

1109.0. "U:U:U: A8400 CONSOLE crash (need INFOS)" by COLES1::LONZECK () Fri Feb 14 1997 15:21

    Hello,
    
    I have serious trouble with one A8400 5/440.
    The console crashes shown below occur twice a day.
    After a crash the console goes into a crash loop.
    ONLY powering off for 2 minutes clears the condition; after that the
    self-test runs without any problem.
    All internal diagnostics, including the memory test and SIMMCALLOUT,
    run without any problem. Nothing is reported under DECevent V2.3;
    INFO xyz and show eeprom halt report no problem either!
    
    Any ideas?
    
    Info: Console V4.1-6
          DIGITAL UNIX V4.0B
    
    Normal printout after power-up:
    

F   E   D   C   B   A   9   8   7   6   5   4   3   2   1   0   NODE #
                            A   M   .   .   .   .   .   P   P   TYP
                            o   +   .   .   .   .   .  ++  ++   ST1
                            .   .   .   .   .   .   .  EE  EB   BPD
                            o   +   .   .   .   .   .  ++  ++   ST2
                            .   .   .   .   .   .   .  EE  EB   BPD
                            +   +   .   .   .   .   .  ++  ++   ST3
                            .   .   .   .   .   .   .  EE  EB   BPD

                +   +   +   .   +   +   +   .   +   .   +   +   C0 PCI +
                +   .   +   .   +   +   +   .   +   +   .   .   C1 PCI +
                .   +   +   .   +   +   +   .   +   .   .   .   C2 PCI +
                .   .   +   .   +   +   +   .   +   +   +   .   C3 PCI +

                            .  A0   .   .   .   .   .   .   .   ILV
                            . 2GB   .   .   .   .   .   .   .   2GB
AlphaServer 8400 Console V4.1-6, 15-NOV-1996 10:47:57, SROM V3.1
Configuring I/O adapters...
kzpsa0, slot 3, bus 0, hose0
kzpsa1, slot 5, bus 0, hose0
kzpsa2, slot 6, bus 0, hose0
kzpsa3, slot 7, bus 0, hose0
kzpsa4, slot 9, bus 0, hose0
kzpsa5, slot 10, bus 0, hose0
tulip0, slot 11, bus 0, hose0
kzpsa6, slot 2, bus 0, hose1
kzpsa7, slot 3, bus 0, hose1
kzpsa8, slot 5, bus 0, hose1
kzpsa9, slot 6, bus 0, hose1
kzpsa10, slot 7, bus 0, hose1
kzpsa11, slot 9, bus 0, hose1
tulip1, slot 11, bus 0, hose1
kzpsa12, slot 3, bus 0, hose2
kzpsa13, slot 5, bus 0, hose2
kzpsa14, slot 6, bus 0, hose2
kzpsa15, slot 7, bus 0, hose2
kzpsa16, slot 9, bus 0, hose2
kzpsa17, slot 10, bus 0, hose2
kzpaa0, slot 1, bus 0, hose3
kzpsa18, slot 2, bus 0, hose3
kzpsa19, slot 3, bus 0, hose3
kzpsa20, slot 5, bus 0, hose3
kzpsa21, slot 6, bus 0, hose3
kzpsa22, slot 7, bus 0, hose3
kzpsa23, slot 9, bus 0, hose3
P00>>>
P00>>>
P00>>>
P00>>>sho config

        Name                  Type   Rev  Mnemonic  
  TLSB
  0++   KN7CE-AB              8014  0000  kn7ce-ab0   
  1++   KN7CE-AB              8014  0000  kn7ce-ab1   
  7+    MS7CC                 5000  0000  ms7cc0      
  8+    KFTHA                 2000  0D03  kftha0      

  C0 PCI connected to kftha0              pci0    
  0+    DEC PCI MC          181011  000E  mc0         
  1+    DEC PCI MC          181011  000E  mc1         
  3+    KZPSA                81011  0000  kzpsa0      
  5+    KZPSA                81011  0000  kzpsa1      
  6+    KZPSA                81011  0000  kzpsa2      
  7+    KZPSA                81011  0000  kzpsa3      
  9+    KZPSA                81011  0000  kzpsa4      
  A+    KZPSA                81011  0000  kzpsa5      
  B+    DECchip 21140-AA     91011  0012  tulip0      

  C1 PCI connected to kftha0              pci1    
  2+    KZPSA                81011  0000  kzpsa6      
  3+    KZPSA                81011  0000  kzpsa7      
  5+    KZPSA                81011  0000  kzpsa8      
  6+    KZPSA                81011  0000  kzpsa9      
  7+    KZPSA                81011  0000  kzpsa10     
  9+    KZPSA                81011  0000  kzpsa11     
  B+    DECchip 21140-AA     91011  0012  tulip1      

  C2 PCI connected to kftha0              pci2    
  3+    KZPSA                81011  0000  kzpsa12     
  5+    KZPSA                81011  0000  kzpsa13     
  6+    KZPSA                81011  0000  kzpsa14     
  7+    KZPSA                81011  0000  kzpsa15     
  9+    KZPSA                81011  0000  kzpsa16     
  A+    KZPSA                81011  0000  kzpsa17     

  C3 PCI connected to kftha0              pci3    
  1+    KZPAA                11000  0002  kzpaa0      
  2+    KZPSA                81011  0000  kzpsa18     
  3+    KZPSA                81011  0000  kzpsa19     
  5+    KZPSA                81011  0000  kzpsa20     
  6+    KZPSA                81011  0000  kzpsa21     
  7+    KZPSA                81011  0000  kzpsa22     
  9+    KZPSA                81011  0000  kzpsa23     
P00>>>show mem
Set   Node   Size        Base Address         Intlv   Position
---   ----   ----      -------- --------      -----   --------
 A      7    2048 Mb   00000000 00000000      2-Way      0
P00>>>
P00>>>
P00>>>
P00>>>! selftest ok
P00>>>
    
    
    Crash information:
    ==================
root@ernie:/root# 
root@ernie:/root# 
root@ernie:/root# 
root@ernie:/root# 
root@ernie:/root# Load POWERUP failed - mem e05a0, off 282a0, buf 17fba0, status 1
Load diagsupport failed - mem e05a0, off cb660, buf 17fba0, status 1
Load POWERUP failed - mem e05a0, off 282a0, buf 181dc0, status 1
Load diagsupport failed - mem e05a0, off cb660, buf 181dc0, status 1
Load POWERUP failed - mem e05a0, off 282a0, buf 183fe0, status 1
Load diagsupport failed - mem e05a0, off cb660, buf 183fe0, status 1
Load POWERUP failed - mem e05a0, off 282a0, buf 186200, status 1
Load diagsupport failed - mem e05a0, off cb660, buf 186200, status 1
Unable to malloc memory for Decompression for POWERUP
Unable to malloc memory for Decompression for diagsupport
Unable to malloc memory for Decompression for POWERUP
Unable to malloc memory for Decompression for diagsupport
Unable to malloc memory for Decompression for POWERUP
Unable to malloc memory for Decompression for diagsupport
Unable to malloc memory for Decompression for POWERUP
Unable to malloc memory for Decompression for diagsupport
Unable to malloc memory for Decompression for POWERUP
Unable to malloc memory for Decompression for diagsupport
Unable to malloc memory for Decompression for POWERUP
Unable to malloc memory for Decompression for diagsupport
CPU 3 unable to complete console mode transition 1
CPU 3: begin = 524581437, end = 13625018200, delta = 13100436763
unexpected exception/interrupt through vector 430
Illegal Operand Trap
process powerup, pcb = 00111680

 pc: 00000000 00014200  ps: 00000000 00001F04
 r2: 00000000 00068348  r5: 431C041C 478710A8
 r3: 00000000 00105668  r6: 00000000 00000000
 r4: 96810028 C3E0000A  r7: 4B21D69C C3E00000

Overlay name                       memadr   topadr    size   ref
turbo                               20000    7ce00  380416  0
xdelta                              7ce20    8b220   58368  1
powerup                             8b240    99440   57856  1
diagsupport                         99460    9f060   23552  1

exception context saved starting at 00113100

GPRs:
  0: 00000000 0000001F  16: 00000000 00054A78
  1: 00000000 00000008  17: 00000000 00004A5C
  2: 00000000 0006AC48  18: FFFFFFFF FFFFFFFF
  3: 00000000 00054A78  19: 00000000 00000000
  4: 96810028 C3E0000A  20: 00000000 00020178
  5: 431C041C 478710A8  21: 00000000 001132E8
  6: 00000000 00000000  22: 00000000 00054A78
  7: 4B21D69C C3E00000  23: 00000000 00005000
  8: 00000000 00000000  24: 00000000 00000007
  9: 00000000 00000000  25: 00000000 00000001
 10: 00000000 00000000  26: 00000000 00036A24
 11: 00000000 00000000  27: 00000000 0006AC48
 12: 00000000 00000000  28: 00000000 00115D68
 13: 00000000 00000000  29: 00000000 00113240
 14: 00000000 00000000  30: 00000000 00113240
 15: 00000000 00000000

dump of active call frames:

PC  =  000141FC
PD  =  0006AC48
FP  =  00113240
SP  =  00113240
bad PD; KIND =  2

Brk 0 at 00067724
[unreadable binary console output, and so on ......]

R2  =  00000000
R3  =  00000000
R4  =  00000000
R29 =  00000000


Brk 0 at 00067724

00067724 ! BPT          <timeout>;P 
<timeout>;P 
<timeout>;P 

F   E   D   C   B   A   9   8   7   6   5   4   3   2   1   0   NODE #
                            A   M   M   .   .   .   .   P   P   TYP
                            o   +   +   .   .   .   .  ++  ++   ST1
                            .   .   .   .   .   .   .  EE  EB   BPD
<timeout>;P 

***CPU 00: Unexpected Machine Check through vector 0670
Processor machine check

EV5 IPRs:
  exc_addr:  00000000 000c5c20  exc_sum:     00000000 00000000
  exc_mask:  00000000 00000000  isr:         00000000 40000000
  icsr:      00000041 44020300  icpe_stat:   00000000 00002000
  dcpe_stat: 00000000 00000000  va:          ffffffff 89c00040
  mm_stat:   00000000 00016051  sc_addr:     ffffff00 0001363f
  sc_stat:   00000000 00000000  bc_tag_addr: ffffff80 000f7fff
  ei_addr:   ffffff00 0020084f  ei_stat:     fffffff0 01ffffff
  fill_syn:  00000000 00000000

TLEP CSRs:
  tlber:    00200000  tlepaerr:   00600000  tlepmerr:  00000000
  tlepderr: 00000000  tlintrmask0: 0000007f  tlintrsum0: 00000000
  tlep_vmg: 00000000  tlintrmask1: 0000007e  tlintrsum1: 00000040
FRIGN asserted - cannot access TLFADR
  tlesr0: 8000c8c8  tlesr1: 80000101  tlesr2: 8000c8c8  tlesr3: 8000c8c8
  tlepwerr0: 00000000  tlepwerr1: 00000000  tlepwerr2: 00000000  tlepwerr3: 00000000

Console Crash... Type ;P to view stack contents

Brk 0 at 00067724

00067724 ! BPT          <timeout>;P 

Process cpu_mem, pcb = 00119E40
 pc: 00000000 000C5C20  ps: 00000000 00000000
 r2: 00000000 00068348  r5: 00000000 0011C9A0
 r3: 00000000 001059C8  r6: 00000000 0011C9A8
 r4: 00000000 00000058  r7: 00000000 0011C9E0

exception context saved starting at 0011BBC0

GPRs:
  0: 00000000 00000001  16: 00000000 00000009
  1: FFFFFFFF FFFFFFFF  17: 00000000 00000000
  2: 00000000 000CD188  18: FFFFFFFF 89C00000
  3: 00000000 00000002  19: 00000000 00000000
  4: 00000000 00000002  20: 00000000 00000001
  5: 00000000 0011C9A0  21: 00000000 0011C9E4
  6: 00000000 0011C9A8  22: FFFFFFFF FFFFFFFF
  7: 00000000 0011C9E0  23: 00000000 00000000
  8: 00000000 0011BE08  24: 00000000 002EEF73
  9: 00000000 00115F80  25: 00000000 00000000
 10: 00000000 00119FD4  26: 00000000 00000000
 11: 00000000 0011C480  27: 00000000 000664C0
 12: 00000000 00000001  28: 00000000 0006A0E0
 13: 00000000 0009D8C0  29: 00000000 0011BD00
 14: 00000000 00000001  30: 00000000 0011BD00
 15: 00000000 0006AC80

dump of active call frames:

PC  =  000C5C1C
PD  =  000CD188 (CLEAR_TLBERS)
FP  =  0011BD00
SP  =  0011BD00

R2 R3 R4 R5 R6 R7 R29 saved starting at 0011BD08

R2  =  000CD1F8
R3  =  00000002
R4  =  0011C864
R5  =  00000002
R6  =  0011BE28
R7  =  00000001
R29 =  0011BD50

PC  =  000C45A4
PD  =  000CD1F8 (PHASE_1)
FP  =  0011BD50
SP  =  0011BD50

R2 R3 R4 R5 R6 R7 R8 R9 R10 R11 R12 R13 R14 R15 R29 saved starting at 0011BD78

R2  =  000CD2C0
R3  =  0011C740
R4  =  00000000
R5  =  00119E40
R6  =  00000001
R7  =  0011C480
R8  =  00000000
R9  =  88000000
R10 =  000233F0
R11 =  00000004
R12 =  0000000F
R13 =  00000002
R14 =  00000001
R15 =  00020340
R29 =  0011BE00

PC  =  000C4078
PD  =  000CD2C0 (CPU_MEM)
FP  =  0011BE00
SP  =  0011BE00

R2 R3 R4 R5 R6 R7 R8 R9 R10 R11 R12 R13 R14 R15 R29 saved starting at 0011BF70

R2  =  0005C910
R3  =  00119E40
R4  =  0011A010
R5  =  00000000
R6  =  00000000
R7  =  00000000
R8  =  00000000
R9  =  00000000
R10 =  00000000
R11 =  00000000
R12 =  00000000
R13 =  00000000
R14 =  00000000
R15 =  00000000
R29 =  0011BFF0

PC  =  0003C694
PD  =  0005C910 (KRN$_PROCESS)
FP  =  0011BFF0
SP  =  0011BFF0

R2 R3 R4 R29 saved starting at 0011BFF8

R2  =  00000000
R3  =  00000000
R4  =  00000000
R29 =  00000000


Brk 0 at 00067724

00067724 ! BPT          <timeout>;P 





F   E   D   C   B   A   9   8   7   6   5   4   3   2   1   0   NODE #
                            A   M   .   .   .   .   .   P   P   TYP
                            o   +   .   .   .   .   .  ++  ++   ST1
                            .   .   .   .   .   .   .  EE  EB   BPD
                            o   +   .   .   .   .   .  ++  ++   ST2
                            .   .   .   .   .   .   .  EE  EB   BPD
                            +   +   .   .   .   .   .  ++  ++   ST3
                            .   .   .   .   .   .   .  EE  EB   BPD

***CPU 01: Unexpected Machine Check through vector 0670
Processor machine check

EV5 IPRs:
  exc_addr:  00000000 00042e00  exc_sum:     00000000 00000000
  exc_mask:  00000000 00000000  isr:         00000000 00000000
  icsr:      00000041 44020300  icpe_stat:   00000000 00002000
  mm_stat:   00000000 00016ed1  sc_addr:     ffffff00 00005d2f
  sc_stat:   00000000 00000000  bc_tag_addr: ffffff80 000f7fff
  ei_addr:   ffffff00 0020084f  ei_stat:     fffffff0 01ffffff
  fill_syn:  00000000 00000000

TLEP CSRs:
  tlber:    00800000  tlepaerr:   00600000  tlepmerr:  00000000
  tlepderr: 00000000  tlintrmask0: 0000007f  tlintrsum0: 00000000
  tlep_vmg: 00000000  tlintrmask1: 0000007e  tlintrsum1: 00000000
  tlfadr0 = 00000000  tlfadr1 = 00000000
  tlesr0: 00400303  tlesr1: 00400c0c  tlesr2: 00406060  tlesr3: 00409090
  tlepwerr0: 00080598  tlepwerr1: 00043000  tlepwerr2: 00000000  tlepwerr3: 00000000

Console Crash... Type ;P to view stack contents

Brk 0 at 00067724

00067724 ! BPT          <timeout>;P 

  dcpe_statProcess idle, pcb = 0006BFD0
 pc: 00000000 00042E00  ps: 20000000 00000000
 r2: 00000000 00068348  r5: 00000000 000230C0
 r3: 00000000 001059C8  r6: 00000000 00014081
 r4: 00000000 00000058  r7: 00000000 0000000A

exception context saved starting at 0006CF40

GPRs:
:  0: 0000000A 1487C312  16: 00000000 0006D0A8
  1: 00000000 00000001  17: 00000000 002D27A6
  2: 00000000 0005DA10  18: 00000000 00000000
  3: 00000000 00000001  19: 00000000 00000000
  4: 00000000 00023098  20: 00000000 00000001
  5: 00000000 000230C0  21: 00000000 00000002
  6: 00000000 00014081  22: 00000000 0006D0A8
  7: 00000000 0000000A  23: 00000000 000204F0
  8: 00000000 000671A0  24: 00000000 00000491
  9: 00000000 000671A8  25: 00000000 00000001
 10: 00000000 00054980  26: 00000000 00042E00
 11: 00000000 0005A590  27: 00000000 00068A00
 12: 00000000 0005AA60  28: 00000000 00000000
 13: 00000000 0000F000  29: 00000000 0006D0A0
 14: 00000000 00000000  30: 00000000 0006D0A0
 15: 00000000 00000000

dump of active call frames:

PC  =  00042DFC
PD  =  0005DA10Initializing...

F   E   D   C   B   A   9   8   7   6   5   4   3   2   1   0   NODE #
                            A   M   .   .   .   .   .   P   P   TYP
                            o   +   .   .   .   .   .  ++  ++   ST1
                            .   .   .   .   .   .   .  EE  EB   BPD
                            o   +   .   .   .   .   .  ++  ++   ST2
                            .   .   .   .   .   .   .  EE  EB   BPD
                            +   +   .   .   .   .   .  ++  ++   ST3
                            .   .   .   .   .   .   .  EE  EB   BPD
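
A small arithmetic cross-check of the first crash dump above, as a minimal sketch only. The overlay rows, the faulting pc (00014200) and the CPU 3 timing values are copied from the dump; the helper name `overlay_for` and the treatment of topadr as an exclusive upper bound are assumptions made for illustration. The check shows that the printed size column equals topadr minus memadr, and that the faulting pc of the Illegal Operand Trap lies below the lowest listed overlay base (20000). Whether that is significant for the fault is not established here.

```python
# Overlay table copied from the first crash dump (addresses hex, sizes decimal).
OVERLAYS = [
    ("turbo",       0x20000, 0x7CE00, 380416),
    ("xdelta",      0x7CE20, 0x8B220,  58368),
    ("powerup",     0x8B240, 0x99440,  57856),
    ("diagsupport", 0x99460, 0x9F060,  23552),
]


def overlay_for(addr):
    """Return the overlay whose [memadr, topadr) range holds addr, or None.

    Treating topadr as exclusive is an assumption of this sketch.
    """
    for name, memadr, topadr, _size in OVERLAYS:
        if memadr <= addr < topadr:
            return name
    return None


# The printed size column is consistent with topadr - memadr for every row.
sizes_ok = all(top - base == size for _name, base, top, size in OVERLAYS)

# CPU 3 timing line from the dump: delta is simply end - begin.
delta_ok = 13625018200 - 524581437 == 13100436763

print(sizes_ok, delta_ok)
print(overlay_for(0x14200))  # faulting pc of the Illegal Operand Trap
print(overlay_for(0x8B300))  # an address inside powerup, for comparison
```
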


    
T.R     Title   User    Personal Name   Date    Lines
1109.1  power?  AFW4::MAZUR  Mon Feb 17 1997 08:26  355
Not sure, but the TLEP in slot 1 looks like it incurred a power cycle.
If that is happening then it is some hardware problem.  Do you have enough
CEAGs in that 24 KZPSA system?   

Things to try.  

   o Run with just the TLEP in slot 0 and see if you stay up all day.

   o Swap slot 0 and slot 1 TLEPs and see if that results in a different
     problem.

   o Pull some hoses and run for a day with only half the KZPSAs.  I am not
     sure what to do with results either way of this though.
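
The "24 KZPSA" figure can be cross-checked against the "Configuring I/O adapters..." listing in 1109.0. The following is a minimal sketch: the listing lines are copied from that printout, while the names `LISTING`, `LINE` and `adapters_per_hose` are made up for illustration. It only counts adapters per hose; it says nothing about how many CEAGs such a load needs.

```python
import re
from collections import Counter

# Lines copied from the "Configuring I/O adapters..." printout in 1109.0.
LISTING = """\
kzpsa0, slot 3, bus 0, hose0
kzpsa1, slot 5, bus 0, hose0
kzpsa2, slot 6, bus 0, hose0
kzpsa3, slot 7, bus 0, hose0
kzpsa4, slot 9, bus 0, hose0
kzpsa5, slot 10, bus 0, hose0
tulip0, slot 11, bus 0, hose0
kzpsa6, slot 2, bus 0, hose1
kzpsa7, slot 3, bus 0, hose1
kzpsa8, slot 5, bus 0, hose1
kzpsa9, slot 6, bus 0, hose1
kzpsa10, slot 7, bus 0, hose1
kzpsa11, slot 9, bus 0, hose1
tulip1, slot 11, bus 0, hose1
kzpsa12, slot 3, bus 0, hose2
kzpsa13, slot 5, bus 0, hose2
kzpsa14, slot 6, bus 0, hose2
kzpsa15, slot 7, bus 0, hose2
kzpsa16, slot 9, bus 0, hose2
kzpsa17, slot 10, bus 0, hose2
kzpaa0, slot 1, bus 0, hose3
kzpsa18, slot 2, bus 0, hose3
kzpsa19, slot 3, bus 0, hose3
kzpsa20, slot 5, bus 0, hose3
kzpsa21, slot 6, bus 0, hose3
kzpsa22, slot 7, bus 0, hose3
kzpsa23, slot 9, bus 0, hose3
"""

# device name, unit number, slot, bus, hose
LINE = re.compile(r"^([a-z]+)(\d+), slot (\d+), bus (\d+), hose(\d+)$")


def adapters_per_hose(listing, kind="kzpsa"):
    """Count adapters of one kind on each hose of the console listing."""
    counts = Counter()
    for line in listing.splitlines():
        m = LINE.match(line.strip())
        if m and m.group(1) == kind:
            counts[int(m.group(5))] += 1
    return dict(counts)


per_hose = adapters_per_hose(LISTING)
total_kzpsa = sum(per_hose.values())
print(per_hose, total_kzpsa)
```
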



1109.2  Not Node 0 or 1 !!  COLES1::LONZECK  Thu Feb 20 1997 02:43  8
    	I have 3 * 48V DC power regulators installed.
    	The system contains 4 DWLPA-xx, and each DWLPA-xx contains
    	6 KZPSA-BB.
    	
    	The problem is not caused by the TLEPs in nodes 0 and 1!!!
    
    	
    	
1109.3  AFW3::MAZUR  Thu Feb 20 1997 07:44  22
>    	
>    	The problem is not caused by the TLEPs in nodes 0 and 1!!!
>

I would agree that the TLEP in slot 1 (CPU 2 & 3) is the best candidate at this
time to be the cause of the problem.

If you are trying to verify this problem more with the hardware you have
on hand, then you could try swapping TLEP slot 0 and TLEP slot 1;  or
running without TLEP slot 1. 

If your one TLEP system runs fine, you can think that the other TLEP
is broken, or its slot is bad (bent pins).  You could then try running
the 2nd TLEP in a different slot.

If you have another TLEP available to you, remove the TLEP in slot 1, and
put the new one in TLEP slot 2 (in case there is a bent pin in slot 1).

Good luck,
Dennis    	
    	

1109.4  >>>Problem generated by MS7CC-FA <<<  COLES1::LONZECK  Wed Feb 26 1997 05:20  824
    Hello,
    
    the problem is caused by node ID #6 (MS7CC-FA).
    
    At some point the primary CPU loads the microcode into low memory
    and starts the internal diagnostics.
    The data stored in that memory is bad >>> system crash and
    console loop <<<.
    
    I swapped TLEP module 6 with 7 (slot change only).
    After 1 hour to 3 days of runtime the system crashed with some information:
    "unexpected exception/interrupt through vector 660 ...". I analysed the
    crash information with dia V2.2.
    TLEP 0, 1 and 8 count SEQUENCE errors, and I got the information
    that TLEP node 7 has a problem with bank 1.
    The TLEP slot 7 is OK. After some 'online' tests (necessary because after a
    power reset all internal diagnostics run without any problem), slot changes
    and so on, I found that the problem is caused by the MS7CC-FA.
    The system now runs with 2 GB and without any problem.
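
    The console node maps in 1109.0 can be compared mechanically. This is a minimal sketch: the NODE # and TYP rows are copied from the printouts (leading spaces dropped), while `node_types` and the right-alignment of the cells to the lowest node numbers are assumptions made for illustration. It shows that the crash-time map lists a memory module (M) in node 6 which the "normal printout" map does not, which would be consistent with the node 6 finding above.

```python
# Rows copied from the console maps in 1109.0 (leading spaces dropped).
HEADER = "F   E   D   C   B   A   9   8   7   6   5   4   3   2   1   0   NODE #"
TYP_NORMAL = "A   M   .   .   .   .   .   P   P   TYP"  # "normal printout" map
TYP_CRASH = "A   M   M   .   .   .   .   P   P   TYP"   # map printed during the crash loop


def node_types(header, row):
    """Map node number -> module type letter for every populated node.

    Assumes the cells of a row line up with the lowest-numbered nodes,
    i.e. they are right-aligned under the NODE # header.
    """
    nodes = [int(tok, 16) for tok in header.split()[:-2]]  # drop "NODE", "#"
    cells = row.split()[:-1]                               # drop the row label
    aligned = nodes[-len(cells):]
    return {n: c for n, c in zip(aligned, cells) if c != "."}


normal = node_types(HEADER, TYP_NORMAL)
crash = node_types(HEADER, TYP_CRASH)
only_in_crash = sorted(set(crash) - set(normal))
print(normal)
print(crash)
print(only_in_crash)
```
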
    
    Error log printout follows:
    

******************************** ENTRY   26 ******************************** 


Logging OS                        2. Digital UNIX 
System Architecture               2. Alpha 
Event sequence number             6. 
Timestamp of occurrence              19-FEB-1997 09:52:25   
Host name                            ernie 

System type register      x0000000C  AlphaServer 8x00 
Number of CPUs (mpnum)    x00000004 
CPU logging event (mperr) x00000002 
                      
Event validity                    1. O/S claims event is valid 
Event severity                    1. Severe Priority 
Entry type                      100. CPU Machine Check Errors 

CPU Minor class                   2. 660 Entry 

---TurboLaser 660---                   
Software Flags            x00000001  TLSB Error Log Snapshot Packet Present 
Active CPUs               x0000000F 
Hardware Rev              x00000000 
System Serial Number                 ay65115768 
Module Serial Number                 AY64903517 
System Revision           x00000000 
MCHK Reason Mask          x0000FFF0 
MCHK Frame Rev            x00000001 
PAL SHADOW REG 0          x0000000000000000 
PAL SHADOW REG 1          x0000000000000000 
PAL SHADOW REG 2          x0000000000000000 
PAL SHADOW REG 3          x0000000000000000 
PAL SHADOW REG 4          x0000000000000000 
PAL SHADOW REG 5          x0000000000000000 
PAL SHADOW REG 6          x0000000000000000 
PAL SHADOW REG 7          x0000000000000000 
PALTEMP0                  xFFFFFC0032CC5E00 
PALTEMP1                  x0000040000000000 
PALTEMP2                  xFFFFFC000047FE80 
PALTEMP3                  x0000000000005F20 
PALTEMP4                  x0000000000000001 
PALTEMP5                  x0000000000000000 
PALTEMP6                  x000000000000019D 
PALTEMP7                  xFFFFFC000047F8C0 
PALTEMP8                  x1F1E161514020100 
PALTEMP9                  xFFFFFC000047FBF0 
PALTEMP10                 xFFFFFC00004A40F8 
PALTEMP11                 xFFFFFC000047FA50 
PALTEMP12                 xFFFFFC000047FDF0 
PALTEMP13                 x0000005555400000 
PALTEMP14                 x0000000000000000 
PALTEMP15                 x00000002040585D9 
PALTEMP16                 x8000009806700201 
PALTEMP17                 x00000002088F8345 
PALTEMP18                 x0000000000000000 
PALTEMP19                 xFFFFFFFE8E5779A8 
PALTEMP20                 x0000000001024000 
PALTEMP21                 xFFFFFC000047FE20 
PALTEMP22                 xFFFFFC00005DF5B0 
PALTEMP23                 x00000000E736BA38 
EXC_ADDR                  xFFFFFC00004A40F8 
                                     Native-mode instruction 
                                     Exception PC  x3FFFFF000012903E 
EXC_SUM                   x0000000000000000 
EXC_MSK                   x0000000000000000 
PAL_BASE                  x0000000000018000 
                                     Base address for palcode  x0000000000000006 
ISR                       x0000000000000000 
                                     AST requests 3 - 0  x0000000000000000 
ICSR                      x0000004160020100 
                                     Timeout Bit Not Set 
                                     PAL Shadow Registers Enabled 
                                     Correctable Err Intrpts Enabled 
                                     MBOX packet selected 
                                     ICACHE BIST Successful 
IC PERR STAT              x0000000000002000 
                                     TIMEOUT RESET ERROR 
DC PERR STAT              x0000000000000000 
Virtual Address           xFFFFFFFE00945508 
MM STAT                   x0000000000014990 
                                     Ref resulted in DTB miss 
                                     Ra Field  x0000000000000006 

                                     Opcode Field   x0000000000000029 
SC ADDR                   xFFFFFF000001D24F 
SC STAT                   x0000000000000000 
BC TAG ADDRESS            xFFFFFF80354D6FFF 
                                     External cache hit 
                                     Parity for ds and v bits 
                                     Cache block dirty 
                                     Cache block shared 
                                     Cache block valid 
                                     Ext cache tag addr parity bit 
                                     Tag address is   x0000000000006B7F 
EI ADDRESS                xFFFFFF000020084F 
FILL SYNDROME             x0000000000000000 
EI STAT                   xFFFFFFF001FFFFFF 
                                     EV56 Chip Rev 1 
LD LOCK                   xFFFFFF000442F90F 
WHAMI                           x02  TLSB NODE ID  1. 
                                     CPU0 
MISCR                           x15  B-Cache Size  4 Mbyte Bcache 
                                     Two Processors 
                                     TLSB RUN Signal 
TLDEV                     x73008014    -- Device Type:  Dual EV56 Proc, 440Mhz, 
                                                        4meg Bcache 
TLBER                     x20800000  SEQUENCE ERROR 
TLCNR                     x00000210 
TLVID                     x00000032 
TLESR0                    x00000303 
TLESR1                    x00000303 
TLESR2                    x00000303 
TLESR3                    x00000303 
TLEPAERR                  x00600100  TLSB_FAULT ASSERTED IN SYSTEM 
                                     Second ADG Design:  Rev A 
MODCONFIG                 x00E08A84  Bcache Size:   4 MB 
                                     Bcache Idle Cycles Before 10. 
                                     Max Command Queue Entries 2. 
                                     Max Bus Queue Entries   4. 
TLEPMERR                  x00000000 
TLEPDERR                  x00000000 
TL INTR MASK 0            x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TL INTR MASK 1            x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TL INTR SUM 0             x00000000 
TL INTR SUM 1             x00000000 
TLEP VMG                  x00000000 
TLEPWERR0                 x000FFD80 
TLEPWERR1                 x00043810 
TLEPWERR2                 x00002D80 
TLEPWERR3                 x00047811 
                                       
  CPU0 Last Win Sp Access x000000C3810FFD80 
                                     Pending Bit=0, Address NOT VALID 
  CPU1 Last Win Sp Access x000000C781102D80 
                                     Pending Bit=0, Address NOT VALID 
                                       
Palcode Revision          x0000000600000401 
                                     Palcode Rev: 4.1-1 


*TLaser CPU Registers*                 
TLSB Node Number                  0. 
TLDEV                     x73008014    -- Device Type:  Dual EV56 Proc, 440Mhz, 
                                                        4meg Bcache 

TLBER                     x20800000  SEQUENCE ERROR 
TLCNR                     x00000200 
TLVID                     x00000010 
TLESR0                    x00400303 
TLESR1                    x00400C0C 
TLESR2                    x00406060 
TLESR3                    x00409090 
TLEPAERR                  x00600100  TLSB_FAULT ASSERTED IN SYSTEM 
                                     Second ADG Design:  Rev A 
MODCONFIG                 x00E08A84  Bcache Size:   4 MB 
                                     Bcache Idle Cycles Before 10. 
                                     Max Command Queue Entries 2. 
                                     Max Bus Queue Entries   4. 
TLEPMERR                  x00000000 
TLEPDERR                  x00000000 
TLEP Interrupt Mask 0     x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TLEP Interrupt Summary 0  x00000001  UART 0 Interrupt Outstanding 
TLEP Interrupt Mask 1     x00000000 
TLEP Interrupt Summary 1  x00000000 


*TLaser CPU Registers*                 
TLSB Node Number                  1. 
TLDEV                     x73008014    -- Device Type:  Dual EV56 Proc, 440Mhz, 
                                                        4meg Bcache 

TLBER                     x20800000  SEQUENCE ERROR 
TLCNR                     x00000210 
TLVID                     x00000032 
TLESR0                    x00000303 
TLESR1                    x00000303 
TLESR2                    x00000303 
TLESR3                    x00000303 
TLEPAERR                  x00600100  TLSB_FAULT ASSERTED IN SYSTEM 
                                     Second ADG Design:  Rev A 
MODCONFIG                 x00E08A84  Bcache Size:   4 MB 
                                     Bcache Idle Cycles Before 10. 
                                     Max Command Queue Entries 2. 
                                     Max Bus Queue Entries   4. 
TLEPMERR                  x00000000 
TLEPDERR                  x00000000 
TLEP Interrupt Mask 0     x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TLEP Interrupt Summary 0  x00000000 
TLEP Interrupt Mask 1     x00000000 
TLEP Interrupt Summary 1  x00000000 


* TLaser Memory Regs *                 
TLSB Node Number                  6. 
TLDEV                     x00005000    -- Device Type:  Memory Module 

TLBER                     x00800000 
TLCNR                     x000FC260 
TLVID                     x00000080 
FADR 0                    x0002000000300180 
FADR 1                    x00020000 
TLESR0                    x00000303 
TLESR1                    x00000303 
TLESR2                    x00000303 
TLESR3                    x00000303 
TMIR                      x80000002  Interleave  x00000002 
TMCR                      x0000022D  2GB Module (E2036-AA) 
                                     16 MB 
                                     70ns DRAM 
                                     Strings Installed =   8 
                                     DRAM timing:   Bus Spd = 11.3-12.9; 
                                                    Refresh Cnt = 1088 
TMER                      x00000006  Failing String =   x00000006 
TMDRA                     x00000000  Refresh Rate   1X 
TDDR0                     x00000000 
TDDR1                     x00000000 
TDDR2                     x00000000 
TDDR3                     x00000000 


* TLaser Memory Regs *                 
TLSB Node Number                  7. 
TLDEV                     x00005000    -- Device Type:  Memory Module 

TLBER                     x00100000 
TLCNR                     x000FC270 
TLVID                     x00000091 
FADR                      x071500000011D840 
FADR 1                    x07150000  Failing Command:    Write Bank Unlock 
                                     Failing Bank =   Bank 1 
TLESR0                    x00000303 
TLESR1                    x00000C0C 
TLESR2                    x00006060 
TLESR3                    x00009090 
TMIR                      x80000002  Interleave  x00000002 
TMCR                      x0000022D  2GB Module (E2036-AA) 
                                     16 MB 
                                     70ns DRAM 
                                     Strings Installed =   8 
                                     DRAM timing:   Bus Spd = 11.3-12.9; 
                                                    Refresh Cnt = 1088 
TMER                      x00000000  Failing String =   x00000000 
TMDRA                     x00000000  Refresh Rate   1X 
TDDR0                     x00000000 
TDDR1                     x00000000 
TDDR2                     x00000000 
TDDR3                     x00000000 


* TLaser I/O Registers *               
TLSB Node Number                  8. 
TLDEV                     x00002000    -- Device Type:  I/O Module 

TLBER                     x20000000  SEQUENCE ERROR 
FADR 0                    x0000000000000000 
FADR 1                    x00000000 
TLESR0                    x00000000 
TLESR1                    x00000000 
TLESR2                    x00000000 
TLESR3                    x00000000 
CPU Interrupt Mask        x00000001  Cpu Interrupt Mask =   x00000001 
ICCMSR                    x00000000  Arbitration Control  Minimum Latency Mode 
                                     Supress Control  Suppress after 16 
                                                      Transations 
ICCNSE                    x80000000  Interrupt Enable on NSES Set 
ICCMTR                    x00000000 
IDPNSE-0                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPNSE-1                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPNSE-2                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPNSE-3                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPVR                     x00000800 
ICCWTR                    x00000000 
TLMBPR                    x0000000000000000 
IDPDR0                    x20000000 
IDPDR1                    x20000000 
IDPDR2                    x00000000 
IDPDR3                    x00000000 
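
    The MM STAT value in the 660 frame above (x0000000000014990) can be
    split into the fields that the report prints next to it. The sketch
    below is illustrative only: the bit positions (write flag in bit 0,
    DTB miss in bit 4, Ra in bits 10:6, opcode in bits 16:11) are an
    assumption inferred from the decoded lines in this log, not copied
    from a hardware manual, and decode_mm_stat is a made-up name.

```python
def decode_mm_stat(value: int) -> dict:
    """Split an MM STAT value into the fields shown in the report.

    Bit positions are assumptions inferred from the log's own decode.
    """
    return {
        "write": bool(value & 0x1),            # bit 0: reference was a write
        "dtb_miss": bool((value >> 4) & 0x1),  # bit 4: reference missed in the DTB
        "ra": (value >> 6) & 0x1F,             # bits 10:6: Ra field
        "opcode": (value >> 11) & 0x3F,        # bits 16:11: opcode field
    }

fields = decode_mm_stat(0x14990)
print(fields["dtb_miss"], hex(fields["ra"]), hex(fields["opcode"]))
```

    With this layout the value gives a DTB miss, Ra 0x6 and opcode 0x29,
    which agrees with the "Ra Field" and "Opcode Field" lines printed in
    the entry.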



******************************** ENTRY   27 ******************************** 


Logging OS                        2. Digital UNIX 
System Architecture               2. Alpha 
Event sequence number             5. 
Timestamp of occurrence              19-FEB-1997 09:44:37   
Host name                            ernie 

System type register      x0000000C  AlphaServer 8x00 
Number of CPUs (mpnum)    x00000004 
CPU logging event (mperr) x00000001 

Event validity                    1. O/S claims event is valid 
Event severity                    5. Low Priority 
Entry type                      203. Undefined Entry Type 

                                     ** Error during CTR processing of EVT seg 
                                     - Canonical buffer dump follows 

Entry# (record in file)           0. 
Canonical buff size            1022. 
Canonical event size            252. 
Canonical Event-Buffer: 

          15--<-12  11--<-08  07--<-04  03--<-00   :Byte Order 
 0000:    0000001B  00000000  00000000  00000003   *................* 
 0010:    00000202  4E454720  33317646  534F0001   *..OSFv13 GEN....* 
 0020:    00000000  00000000  00000000  00000000   *................* 
 0030:    00050000  00000000  00000000  00000000   *................* 
 0040:    30303733  34343930  39313230  37393931   *1997021909443700* 
 0050:    00000000  00000000  00000020  20202020   *     ...........* 
 0060:    00000000  00000000  0065696E  72650000   *..ernie.........* 
 0070:    00000000  00000000  00000000  00000000   *................* 
 0080:    33317646  534F0001  00000000  00000000   *..........OSFv13* 
 0090:    000000FF  0000000C  00000000  55504320   * CPU............* 
 00A0:    00000000  00000000  00000001  00000004   *................* 
 00B0:    00000000  00000000  00000000  00000000   *................* 
 00C0:    00000000  00000000  00000000  00000000   *................* 
 00D0:    00000000  00000000  00000000  00000000   *................* 
 00E0:    00000000  00000000  00000000  00000000   *................* 
 00F0:              00000000  00000000  00000700   *    ............* 
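
    The canonical buffer dump prints each 16-byte row from byte 15 on
    the left down to byte 0 on the right, so the hex has to be read
    right to left to get the text in the ASCII column. A small sketch
    of that decoding (decode_row is a made-up helper name, and the row
    is copied from offset 0040 of the dump above):

```python
def decode_row(words: str) -> str:
    """Decode one dump row whose hex words are printed from byte 15 down to byte 0."""
    raw = bytes.fromhex("".join(words.split()))  # bytes in printed order (15 .. 0)
    mem = raw[::-1]                              # memory order (0 .. 15)
    # Show printable ASCII as-is and everything else as '.', like the dump does.
    return "".join(chr(b) if 32 <= b < 127 else "." for b in mem)

row = "30303733  34343930  39313230  37393931"
text = decode_row(row)
print(text)
```

    This prints 1997021909443700. The first 14 characters read as
    YYYYMMDDhhmmss, which is consistent with the timestamp in the entry
    header (19-FEB-1997 09:44:37).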



******************************** ENTRY   28 ******************************** 


Logging OS                        2. Digital UNIX 
System Architecture               2. Alpha 
Event sequence number             4. 
Timestamp of occurrence              19-FEB-1997 09:42:01   
Host name                            ernie 

System type register      x0000000C  AlphaServer 8x00 
Number of CPUs (mpnum)    x00000004 
CPU logging event (mperr) x00000003 

Event validity                    1. O/S claims event is valid 
Event severity                    5. Low Priority 
Entry type                      203. Undefined Entry Type 

                                     ** Error during CTR processing of EVT seg 
                                     - Canonical buffer dump follows 

Entry# (record in file)           0. 
Canonical buff size            1022. 
Canonical event size            252. 
Canonical Event-Buffer: 

          15--<-12  11--<-08  07--<-04  03--<-00   :Byte Order 
 0000:    0000001C  00000000  00000000  00000003   *................* 
 0010:    00000202  4E454720  33317646  534F0001   *..OSFv13 GEN....* 
 0020:    00000000  00000000  00000000  00000000   *................* 
 0030:    00040000  00000000  00000000  00000000   *................* 
 0040:    30303130  32343930  39313230  37393931   *1997021909420100* 
 0050:    00000000  00000000  00000020  20202020   *     ...........* 
 0060:    00000000  00000000  0065696E  72650000   *..ernie.........* 
 0070:    00000000  00000000  00000000  00000000   *................* 
 0080:    33317646  534F0001  00000000  00000000   *..........OSFv13* 
 0090:    000000FF  0000000C  00000000  55504320   * CPU............* 
 00A0:    00000000  00000000  00000003  00000004   *................* 
 00B0:    00000000  00000000  00000000  00000000   *................* 
 00C0:    00000000  00000000  00000000  00000000   *................* 
 00D0:    00000000  00000000  00000000  00000000   *................* 
 00E0:    00000000  00000000  00000000  00000000   *................* 
 00F0:              00000000  00000000  00000700   *    ............* 



******************************** ENTRY   29 ******************************** 


Logging OS                        2. Digital UNIX 
System Architecture               2. Alpha 
Event sequence number             3. 
Timestamp of occurrence              19-FEB-1997 09:35:11   
Host name                            ernie 

System type register      x0000000C  AlphaServer 8x00 
Number of CPUs (mpnum)    x00000004 
CPU logging event (mperr) x00000002 

Event validity                    1. O/S claims event is valid 
Event severity                    5. Low Priority 
Entry type                      203. Undefined Entry Type 

                                     ** Error during CTR processing of EVT seg 
                                     - Canonical buffer dump follows 

Entry# (record in file)           0. 
Canonical buff size             966. 
Canonical event size            252. 
Canonical Event-Buffer: 

          15--<-12  11--<-08  07--<-04  03--<-00   :Byte Order 
 0000:    0000001D  00000000  00000000  00000003   *................* 
 0010:    00000202  4E454720  33317646  534F0001   *..OSFv13 GEN....* 
 0020:    00000000  00000000  00000000  00000000   *................* 
 0030:    00030000  00000000  00000000  00000000   *................* 
 0040:    30303131  35333930  39313230  37393931   *1997021909351100* 
 0050:    00000000  00000000  00000020  20202020   *     ...........* 
 0060:    00000000  00000000  0065696E  72650000   *..ernie.........* 
 0070:    00000000  00000000  00000000  00000000   *................* 
 0080:    33317646  534F0001  00000000  00000000   *..........OSFv13* 
 0090:    000000FF  0000000C  00000000  55504320   * CPU............* 
 00A0:    00000000  00000000  00000002  00000004   *................* 
 00B0:    00000000  00000000  00000000  00000000   *................* 
 00C0:    00000000  00000000  00000000  00000000   *................* 
 00D0:    00000000  00000000  00000000  00000000   *................* 
 00E0:    00000000  00000000  00000000  00000000   *................* 
 00F0:              00000000  00000000  00000700   *    ............* 



******************************** ENTRY   30 ******************************** 


Logging OS                        2. Digital UNIX 
System Architecture               2. Alpha 
Event sequence number             2. 
Timestamp of occurrence              19-FEB-1997 09:35:11   
Host name                            ernie 

System type register      x0000000C  AlphaServer 8x00 
Number of CPUs (mpnum)    x00000004 
CPU logging event (mperr) x00000002 

Event validity                    1. O/S claims event is valid 
Event severity                    5. Low Priority 
Entry type                      203. Undefined Entry Type 

                                     ** Error during CTR processing of EVT seg 
                                     - Canonical buffer dump follows 

Entry# (record in file)           0. 
Canonical buff size             966. 
Canonical event size            252. 
Canonical Event-Buffer: 

          15--<-12  11--<-08  07--<-04  03--<-00   :Byte Order 
 0000:    0000001E  00000000  00000000  00000003   *................* 
 0010:    00000202  4E454720  33317646  534F0001   *..OSFv13 GEN....* 
 0020:    00000000  00000000  00000000  00000000   *................* 
 0030:    00020000  00000000  00000000  00000000   *................* 
 0040:    30303131  35333930  39313230  37393931   *1997021909351100* 
 0050:    00000000  00000000  00000020  20202020   *     ...........* 
 0060:    00000000  00000000  0065696E  72650000   *..ernie.........* 
 0070:    00000000  00000000  00000000  00000000   *................* 
 0080:    33317646  534F0001  00000000  00000000   *..........OSFv13* 
 0090:    000000FF  0000000C  00000000  55504320   * CPU............* 
 00A0:    00000000  00000000  00000002  00000004   *................* 
 00B0:    00000000  00000000  00000000  00000000   *................* 
 00C0:    00000000  00000000  00000000  00000000   *................* 
 00D0:    00000000  00000000  00000000  00000000   *................* 
 00E0:    00000000  00000000  00000000  00000000   *................* 
 00F0:              00000000  00000000  00000700   *    ............* 



******************************** ENTRY   33 ******************************** 


Logging OS                        2. Digital UNIX 
System Architecture               2. Alpha 
Event sequence number             5. 
Timestamp of occurrence              19-FEB-1997 09:12:55   
Host name                            ernie 

System type register      x0000000C  AlphaServer 8x00 
Number of CPUs (mpnum)    x00000004 
CPU logging event (mperr) x00000000 

Event validity                    1. O/S claims event is valid 
Event severity                    1. Severe Priority 
Entry type                      100. CPU Machine Check Errors 

CPU Minor class                   2. 660 Entry 

---TurboLaser 660---                   
Software Flags            x00000001  TLSB Error Log Snapshot Packet Present 
Active CPUs               x0000000F 
Hardware Rev              x00000000 
System Serial Number                 ay65115768 
Module Serial Number                 AY65011831 
System Revision           x00000000 
MCHK Reason Mask          x0000FFFA 
MCHK Frame Rev            x00000001 
PAL SHADOW REG 0          x0000000000000000 
PAL SHADOW REG 1          x0000000000000000 
PAL SHADOW REG 2          x0000000000000000 
PAL SHADOW REG 3          x0000000000000000 
PAL SHADOW REG 4          x0000000000000000 
PAL SHADOW REG 5          x0000000000000000 
PAL SHADOW REG 6          x0000000000000000 
PAL SHADOW REG 7          x0000000000000000 
PALTEMP0                  xFFFFFC00ECB25E80 
PALTEMP1                  x0000040000000000 
PALTEMP2                  xFFFFFC000047FE80 
PALTEMP3                  x0000000000005200 
PALTEMP4                  x0000000000000001 
PALTEMP5                  x0000000000000000 
PALTEMP6                  x00000000000001A8 
PALTEMP7                  xFFFFFC000047F8C0 
PALTEMP8                  x1F1E161514020100 
PALTEMP9                  xFFFFFC000047FBF0 
PALTEMP10                 xFFFFFC00004A4120 
PALTEMP11                 xFFFFFC000047FA50 
PALTEMP12                 xFFFFFC000047FDF0 
PALTEMP13                 x0000005555400000 
PALTEMP14                 x0000000000000000 
PALTEMP15                 x00000002040585D9 
PALTEMP16                 x0000009806700001 
PALTEMP17                 x0000000000000000 
PALTEMP18                 x0000000000000000 
PALTEMP19                 xFFFFFFFE8E3F39A8 
PALTEMP20                 x0000000001024000 
PALTEMP21                 xFFFFFC000047FE20 
PALTEMP22                 xFFFFFC00005DF5B0 
PALTEMP23                 x00000000FF75BA38 
EXC_ADDR                  xFFFFFC00004A4120 
                                     Native-mode instruction 
                                     Exception PC  x3FFFFF0000129048 
EXC_SUM                   x0000000000000000 
EXC_MSK                   x0000000000000000 
PAL_BASE                  x0000000000018000 
                                     Base address for palcode  x0000000000000006 
ISR                       x0000000000000000 
                                     AST requests 3 - 0  x0000000000000000 
ICSR                      x0000006160020000 
                                     Timeout Bit Not Set 
                                     PAL Shadow Registers Enabled 
                                     Correctable Err Intrpts Enabled 
                                     Debug Port Sees Bits <11:5> of Siloed PC 
                                     ICACHE BIST Successful 
IC PERR STAT              x0000000000002000 
                                     TIMEOUT RESET ERROR 
DC PERR STAT              x0000000000000000 
Virtual Address           xFFFFFFFE009D6008 
MM STAT                   x0000000000016391 
                                     Ref which caused err was a write 
                                     Ref resulted in DTB miss 
                                     Ra Field  x000000000000000E 

                                     Opcode Field   x000000000000002C 
SC ADDR                   xFFFFFF000001D24F 
SC STAT                   x0000000000000000 
BC TAG ADDRESS            xFFFFFF8035CF6FFF 
                                     External cache hit 
                                     Parity for ds and v bits 
                                     Cache block dirty 
                                     Cache block shared 
                                     Cache block valid 
                                     Ext cache tag addr parity bit 
                                     Tag address is   x0000000000007B7F 
EI ADDRESS                xFFFFFF000011D85F 
FILL SYNDROME             x0000000000009000 
EI STAT                   xFFFFFFF001FFFFFF 
                                     EV56 Chip Rev 1 
LD LOCK                   xFFFFFF0004CE658F 
WHAMI                           x00  TLSB NODE ID  0. 
                                     CPU0 
MISCR                           x55  B-Cache Size  4 Mbyte Bcache 
                                     Two Processors 
                                     TLSB RUN Signal 
                                     CPU0 Running console 
TLDEV                     x73008014    -- Device Type:  Dual EV56 Proc, 440Mhz, 
                                                        4meg Bcache 
TLBER                     x00800000 
TLCNR                     x00000200 
TLVID                     x00000010 
TLESR0                    x00400303 
TLESR1                    x00400C0C 
TLESR2                    x00406060 
TLESR3                    x00409090 
TLEPAERR                  x00600100  TLSB_FAULT ASSERTED IN SYSTEM 
                                     Second ADG Design:  Rev A 
MODCONFIG                 x00E08A84  Bcache Size:   4 MB 
                                     Bcache Idle Cycles Before 10. 
                                     Max Command Queue Entries 2. 
                                     Max Bus Queue Entries   4. 
TLEPMERR                  x00000000 
TLEPDERR                  x00000000 
TL INTR MASK 0            x000001FF  UART 0 Interrupt Enable 
                                     IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
                                     Control/P Halt Enable 
TL INTR MASK 1            x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TL INTR SUM 0             x00000000 
TL INTR SUM 1             x00000000 
TLEP VMG                  x00000000 
TLEPWERR0                 x00002D80 
TLEPWERR1                 x00047811 
TLEPWERR2                 x00002D80 
TLEPWERR3                 x00047811 
                                       
  CPU0 Last Win Sp Access x000000C781102D80 
                                     Pending Bit=0, Address NOT VALID 
  CPU1 Last Win Sp Access x000000C781102D80 
                                     Pending Bit=0, Address NOT VALID 
                                       
Palcode Revision          x0000000600000401 
                                     Palcode Rev: 4.1-1 


*TLaser CPU Registers*                 
TLSB Node Number                  0. 
TLDEV                     x73008014    -- Device Type:  Dual EV56 Proc, 440Mhz, 
                                                        4meg Bcache 

TLBER                     x00800000 
TLCNR                     x00000200 
TLVID                     x00000010 
TLESR0                    x00400303 
TLESR1                    x00400C0C 
TLESR2                    x00406060 
TLESR3                    x00409090 
TLEPAERR                  x00600100  TLSB_FAULT ASSERTED IN SYSTEM 
                                     Second ADG Design:  Rev A 
MODCONFIG                 x00E08A84  Bcache Size:   4 MB 
                                     Bcache Idle Cycles Before 10. 
                                     Max Command Queue Entries 2. 
                                     Max Bus Queue Entries   4. 
TLEPMERR                  x00000000 
TLEPDERR                  x00000000 
TLEP Interrupt Mask 0     x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TLEP Interrupt Summary 0  x00000000 
TLEP Interrupt Mask 1     x00000000 
TLEP Interrupt Summary 1  x00000000 


*TLaser CPU Registers*                 
TLSB Node Number                  1. 
TLDEV                     x73008014    -- Device Type:  Dual EV56 Proc, 440Mhz, 
                                                        4meg Bcache 

TLBER                     x20800000  SEQUENCE ERROR 
TLCNR                     x00000210 
TLVID                     x00000032 
TLESR0                    x00000303 
TLESR1                    x00000303 
TLESR2                    x00000303 
TLESR3                    x00000303 
TLEPAERR                  x00600000  Second ADG Design:  Rev A 
MODCONFIG                 x00E08A84  Bcache Size:   4 MB 
                                     Bcache Idle Cycles Before 10. 
                                     Max Command Queue Entries 2. 
                                     Max Bus Queue Entries   4. 
TLEPMERR                  x00000000 
TLEPDERR                  x00000000 
TLEP Interrupt Mask 0     x000000FE  IPL 14 Interrupt Enable 
                                     IPL 15 Interrupt Enable 
                                     IPL 16 Interrupt Enable 
                                     IPL 17 Interrupt Enable 
                                     Interprocessor Interrupt Enable 
                                     Interval Timer Interrupt Enable 
                                     CPU Halt Enable 
TLEP Interrupt Summary 0  x00000000 
TLEP Interrupt Mask 1     x00000000 
TLEP Interrupt Summary 1  x00000000 


* TLaser Memory Regs *                 
TLSB Node Number                  6. 
TLDEV                     x00005000    -- Device Type:  Memory Module 

TLBER                     x00800000 
TLCNR                     x000FC260 
TLVID                     x00000080 
FADR 0                    x0002000000300180 
FADR 1                    x00020000 
TLESR0                    x00000303 
TLESR1                    x00000303 
TLESR2                    x00000303 
TLESR3                    x00000303 
TMIR                      x80000002  Interleave  x00000002 
TMCR                      x0000022D  2GB Module (E2036-AA) 
                                     16 MB 
                                     70ns DRAM 
                                     Strings Installed =   8 
                                     DRAM timing:   Bus Spd = 11.3-12.9; 
                                                    Refresh Cnt = 1088 
TMER                      x00000006  Failing String =   x00000006 
TMDRA                     x00000000  Refresh Rate   1X 
TDDR0                     x00000000 
TDDR1                     x00000000 
TDDR2                     x00000000 
TDDR3                     x00000000 


* TLaser Memory Regs *                 
TLSB Node Number                  7. 
TLDEV                     x00005000    -- Device Type:  Memory Module 

TLBER                     x00100000 
TLCNR                     x000FC270 
TLVID                     x00000091 
FADR                      x071500000011D840 
FADR 1                    x07150000  Failing Command:    Write Bank Unlock 
                                     Failing Bank =   Bank 1 
TLESR0                    x00000303 
TLESR1                    x00000C0C 
TLESR2                    x00006060 
TLESR3                    x00009090 
TMIR                      x80000002  Interleave  x00000002 
TMCR                      x0000022D  2GB Module (E2036-AA) 
                                     16 MB 
                                     70ns DRAM 
                                     Strings Installed =   8 
                                     DRAM timing:   Bus Spd = 11.3-12.9; 
                                                    Refresh Cnt = 1088 
TMER                      x00000000  Failing String =   x00000000 
TMDRA                     x00000000  Refresh Rate   1X 
TDDR0                     x00000000 
TDDR1                     x00000000 
TDDR2                     x00000000 
TDDR3                     x00000000 
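
The two memory-module dumps above (node 6 and node 7) are easier to compare mechanically than by eye. The following is a minimal sketch, assuming register lines have the form "NAME  xHEX  annotation" as in this printout; the function names `parse_registers` and `diff_registers` are hypothetical, and only a few registers are copied in to keep the example short.

```python
import re

# A register line starts in column 0 with a name, then 2+ spaces, then xHEX.
# Indented continuation lines (annotations) are skipped because of ^\S.
REG_LINE = re.compile(r"^(?P<name>\S(?:.*?\S)?)\s{2,}x(?P<value>[0-9A-Fa-f]+)\b")

def parse_registers(dump):
    """Map register name -> integer value for every register line in dump."""
    regs = {}
    for line in dump.splitlines():
        match = REG_LINE.match(line)
        if match:
            regs[match.group("name")] = int(match.group("value"), 16)
    return regs

def diff_registers(a, b):
    """Return {name: (value_a, value_b)} for registers present in both but unequal."""
    return {n: (a[n], b[n]) for n in a if n in b and a[n] != b[n]}

# Excerpts copied from the dumps above.
NODE6 = """\
TLBER                     x00800000
TLESR0                    x00000303
TLESR1                    x00000303
TLESR2                    x00000303
TLESR3                    x00000303
TMER                      x00000006  Failing String =   x00000006
"""

NODE7 = """\
TLBER                     x00100000
TLESR0                    x00000303
TLESR1                    x00000C0C
TLESR2                    x00006060
TLESR3                    x00009090
TMER                      x00000000  Failing String =   x00000000
"""

diff = diff_registers(parse_registers(NODE6), parse_registers(NODE7))
for name, (v6, v7) in diff.items():
    print(f"{name:8} node6=x{v6:08X}  node7=x{v7:08X}")
```

The sketch only reports which values differ between the two nodes; what each differing bit means has to come from the hardware documentation.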


* TLaser I/O Registers *               
TLSB Node Number                  8. 
TLDEV                     x00002000    -- Device Type:  I/O Module 

TLBER                     x00000000 
FADR 0                    x0000000000000000 
FADR 1                    x00000000 
TLESR0                    x00000000 
TLESR1                    x00000000 
TLESR2                    x00000000 
TLESR3                    x00000000 
CPU Interrupt Mask        x00000001  Cpu Interrupt Mask =   x00000001 
ICCMSR                    x00000000  Arbitration Control  Minimum Latency Mode 
                                     Supress Control  Suppress after 16 
                                                      Transations 
ICCNSE                    x80000000  Interrupt Enable on NSES Set 
ICCMTR                    x00000000 
IDPNSE-0                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPNSE-1                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPNSE-2                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPNSE-3                  x00000006  Hose Power OK 
                                     Hose Cable OK 
IDPVR                     x00000800 
ICCWTR                    x00000000 
TLMBPR                    x0000000000000000 
IDPDR0                    x20000000 
IDPDR1                    x20000000 
IDPDR2                    x00000000 
IDPDR3                    x00000000 


    regards 
    
    Uwe.
1109.5. by AFW3::MAZUR () Wed Feb 26 1997 10:01 (8 lines)
Your system is running fine now without any hardware replaced?  Do you
then think it was a module seating problem?

note: TLEP refers to a CPU.  Maybe that was just a name used in engineering and
      I confused people with my earlier reply.  In your last reply I think
      you wanted to say "TLSB slot" wherever "TLEP slot" appears.


1109.6. ">>> No HW replaced, but removed !! <<<" by COLES1::LONZECK () Wed Feb 26 1997 13:07 (15 lines)
    No, there is a mistake.
    
    I removed the bad 2 GB Memory Module from the System,
    and the system works fine.
    
    I will get a new MS7CC-FA in approximately one week.
    
    If I install only this (bad) memory module in any free TLSB slot,
    the system sometimes crashes with the same symptoms.
    When I have the new memory and any new information, I will post
    it under this entry.
    (Please also read/answer entry 1117 in this conference if you have
    the right information for me.)
    
    regards, uwe
1109.7. by DANGER::HARTWELL () Wed Feb 26 1997 16:06 (5 lines)
    Did you replace the bad memory with a terminator card? (E2034)
    
    
    				/Dave
    
1109.8. "System up & run" by COLES1::LONZECK () Mon Mar 03 1997 14:22 (7 lines)
    Hello Dave,
    
    I installed a NEW MS7CC-FA in NodeID 7.
    The system is now running with two MS7CC-FA memory modules,
    located at NodeID 7 and 6, without any problem.
    
    /Uwe