| Title: | SABLE SYSTEM PUBLIC DISCUSSION |
| Moderator: | COSMIC::PETERSON |
| Created: | Mon Jan 11 1993 |
| Last Modified: | Fri Jun 06 1997 |
| Last Successful Update: | Fri Jun 06 1997 |
| Number of topics: | 2614 |
| Total number of notes: | 10244 |
Is there anybody who could help me interpret this errlog file?
The computer is an Alpha 2100a 4/275 with 2 CPUs running Digital Unix V3.2G.
Uerf reports lots of CPU exceptions. The log shows only "CPU EXCEPTION" entries.
I can't find any reference manual or other information for analyzing CPU exceptions.
Is the machine checks & Sable PFMS (Product Fault Management Spec)
for the 2100 also valid for the 2100a? Does DECEVENT 2.3 know the 2100a?
Thanks very much in advance.
********************************* ENTRY 5. *********************************
----- EVENT INFORMATION -----
EVENT CLASS ERROR EVENT
OS EVENT TYPE 100. CPU EXCEPTION
SEQUENCE NUMBER 19.
OPERATING SYSTEM DEC OSF/1
OCCURRED/LOGGED ON Fri Mar 28 04:28:11 1997
OCCURRED ON SYSTEM EGISAP
SYSTEM ID x00060018
SYSTYPE x00000000
PROCESSOR COUNT 2.
PROCESSOR WHO LOGGED x00000000
----- UNIT INFORMATION -----
UNIT CLASS CPU
----- LEP MACHINE CHECK STACK FRAME -----
PROCESSOR OFFSET x00000110
SYSTEM OFFSET x000001A0
PALTEMP1 x000000010000008A
PALTEMP2 x0000000000000058
PALTEMP3 x001E02F800000004
PALTEMP4 x3334373330303030
PALTEMP5 x0000000000028728
PALTEMP6 xFFFFFC0015A6B000
PALTEMP7 x0000000000000240
PALTEMP8 x0000000000004200
PALTEMP9 x0000000000000400
PALTEMP10 x0000000000000000
PALTEMP11 xFFFFFC00004E4750
PALTEMP12 x0000000000000000
PALTEMP13 xFFFFFC00004E4AF0
PALTEMP14 xFFFFFC00004E4B20
PALTEMP15 xFFFFFC00004E4B80
PALTEMP16 xFFFFFC00004E48F0
PALTEMP17 xFFFFFC00004E4600
PALTEMP18 x0000000000019330
PALTEMP19 x000000011FFFFB90
PALTEMP20 xFFFFFFFFA148F2E0
PALTEMP21 xFFFFFC000068F2B0
PALTEMP22 x0000000000000000
PALTEMP23 x40424272727E7E7E
PALTEMP24 x85CD1FDFB7F5AFF5
PALTEMP25 x0000000000000000
PALTEMP26 x0000000000010000
PALTEMP27 x0000000000000E00
PALTEMP28 x0000000000000000
PALTEMP29 x000000001797C000
PALTEMP30 xFFFFFFFC00000000
PALTEMP31 x0000000000000001
EXC_ADDR x00000000017F5A58
EXCEPTING OR EXECUTING INSTRUCTION DID NOT COMPLETE PC
IS x5FD696
EXC_SUM x00000000004E39E8
INVALID OPERATION
FLOATING POINT OVERFLOW
FLOATING POINT UNDERFLOW
FLOATING INEXACT ERROR
FBOX CONVERT TO INTEGER OVERFLOW OR
_INTEGER ARITHMETIC OVERFLOW
EXC_MSK x0000000000000000
ICCSR x0000000000000000
PC0 INT ENABLED AFTER 2**16 EVENTS
_BY 2
PC1 COUNTER INPUT DCACHE MISSES
FP INSTRUCTIONS CAUSE FEN EXCEPTIONS
ADDRESS SPACE NUMBER = x0
PAL_BASE x0000000000000004
BASE ADDRESS FOR PALCODE = x0
HIER x0000000000014000
PC1 INTERRUPT DISABLED
PC0 INTERRUPT DISABLED
SOFTWARE INTERRUPT ENABLED ON LEVEL 1
SOFTWARE INTERRUPT ENABLED ON LEVEL 3
HIRR x0000000000001CF0
CRD - CORRECTABLE READ ERROR INT REQ
_SET
CPU HARDWARE INTERRUPT REQUESTED ON PIN
_3
CPU HARDWARE INTERRUPT REQUESTED ON PIN
_4
CPU HARDWARE INTERRUPT REQUESTED ON PIN
_5
CPU HARDWARE INTERRUPT REQUESTED ON PIN
_0
CPU HARDWARE INTERRUPT REQUESTED ON PIN
_1
CPU HARDWARE INTERRUPT REQUESTED ON PIN
_2
MM_CSR x0000000000000000
INTEGER REGISTER USED IS R 0.
DC_STAT x0000000000001E31
DC_HIT LAST LOAD OR STORE MISSED
_DCACHE
OPCODE RA FIELD - INTEGER REGISTER IS R 3.
INT INTEGER LOAD OR STORE OPERATION
LW DATA LENGTH OF LOAD OR STORE WAS
_LONGWORD
VAX_FP VAX FP DISPLAY LOAD OR STORE
_CAUSED ERROR
LOCK LDLL LDQL STLC STQC INSTRUCTION
_CAUSED ERROR
DC_ADDR x0000000000000003
ABOX_CTL x00000000FFFFFFFF FUNCTIONS ENABLED - WRITE BUFFER
_PREVENTED FROM SENDING DATA TO BIU
FUNCTIONS ENABLED - MCHECK ENABLED FOR
_UNCORRECTABLE ERRORS
FUNCTIONS ENABLED - CRD CORRECTED READ
_DATA INTERRUPT ENABLED
FUNCTIONS ENABLED - SINGLE ENTRY ICACHE
_STREAM BUFFER ENABLED
FUNCTIONS ENABLED - DCACHE ENABLED
FUNCTIONS ENABLED - FORCES D-STREAM
_REF'S TO HIT IN DCACHE
BIU_STAT x000000000000142E
BIU_SERR EXT. CY. TERM W/SOFT ERROR
BC_TPERR EXT. CACHE TAG PROBE HAD BAD
_PAR. IN TAG ADD RAM
BC_TCPERR EXT. CACHE TAG PROBE HAD
_BAD PAR. IN TAG CRTL RAM
BIU_CMD CYCLE CLASS IS FETCH
FILL_DPERR PRI. CACHE FILL FROM EXT.
_CACHE HAD PARITY ERROR
BIU_ADDR x0000000000000240
PHYSICAL ADDRESS OF CACHE BLOCK WITH ERROR IS x12
BIU_CTL x0000000000019330
EXTERNAL CACHE DISABLED
EXTERNAL CACHE PARITY ENABLED
EXTERNAL CACHE FORCE HIT FOR
_READ_BLOCK AND WRITE_BLOCK
_TRANSACTIONS
EXTERNAL CACHE READ/WRITE SPEED IN CPU CYCLES IS
_10
EXTERNAL CACHE WRITE ENABLE TIMING BIT FIELD IS x6
FILL_SYNDROME x000000005000E567 SINGLE BIT ERROR IS DATA BIT 26
SINGLE BIT ERROR IS DATA BIT 01
FILL_ADDR x0000000000000000
PHYSICAL ADDRESS OF QUADWORD WITH ERROR x0
VA x0000000000006100 D-STREAM FAULT OR DTB MISS - VIRTUAL ADDRESS IS x6100
BC_TAG x0000000000006170
V BIT - CACHE BLOCK VALID
TAG ADDRESS IS x30B
----- DIGITAL 2100 A500 CPU SPECIFIC FRAME -----
BCC_CSR0 x00000000BC148244
ENB ENB TAG AND DUP TAGE PAR CHK
ENB COR ERR INTERRUPT
ENB B-CACHE COND I/O UPDATES
ENB EDC CHK H
ENB BLOCK WRITE AROUND H
BCCE_CSR1 xC00001C5C00001C5
BCCEA_CSR2 x0000000000000000
BCUE_CSR3 x0000000000000000
EDC SYNDROME 0 x0
EDC SYNDROME 2 x0
EDC SYNDROME 1 x0
EDC SYNDROME 3 x0
BCUEA_CSR4 x0000000000000900 B-CACHE MAP OFFSET x900
TAG VALUE x0
B-CACHE MAP OFFSET H x900
TAG VALUE H x0
DTER_CSR5 x000000000D400CB5 MISSED ERROR OCCURRED
DUP TAG STORE OFFSET x2D
DUP TAG x35003
MISSED ERROR OCCURRED H
DUP TAG STORE OFFSET H x2D
DUP TAG H x35003
CBCTL_CSR6 x000000001C17EEDC
C/A WRONG PARITY x2
ENABLE PARITY CHECK
FORCE SHARED
COMMANDER ID x6
ARB CONTROL MASK x6
ENB C-BUS ERROR INTERRUPT
C/A WRONGE PARITY H x2
ENABLE PARITY CHECK H
FORCE SHARED H
COMMANDER ID x6
ARB CONTROL MASK x6
ENB C-BUS ERROR INTERRUPT
C/A WRONGE PARITY H x2
ENABLE PARITY CHECK H
FORCE SHARED H
COMMANDER ID H x6
ARB CONTROL MASK H x6
ENB C-BUS ERROR INTERRUPT H
CBE_CSR7 x0000000000000828
MISSED C/A ERROR
MISSED ERROR ON WRITE DATA
DATA PARITY ERROR LW2 ERROR
MISS COUNT x0
MISSED C/A ERROR H
MISSED ERROR ON WRITE DATA - RESP H
DATA PARITY ERROR LW3 ERROR
MISS COUNT H x0
CBEAL_CSR8 x00000000EE000000
ADDRESS x3B800000
ADDRESS H x3C800E00
CBEAH_CSR9 xE0000043E0000043
PMBX_CSR10 x0F201D832F201D83
IPIR_CSR11 x0000000000000000
SIC_CSR12 x0000000000000000
ADLK_CSR13 x0000000000000000
MADRL_CSR14 x001BE800001BE800
CRREV4 x051E0FF900094651
----- DIGITAL 2100 A500P T2 SPECIFIC FRAME -----
IOCSR x0000000000000000
CERR1 xE3800010E3800010
CERR2 x00201D8320201D83
CERR3 x0000000000080000
PERR1 x0000000781C40300
PERR2 x0000000000000010
HAE0_1 x0000000000000000
HAE0_2 x000000000010603F
HBASE x00000000400807FF
WBASE1 x000000003FF00000
WMASK1 x0000000000000000
TBASE1 x00000000000C00FF
WBASE2 x000000000FF00000
WMASK2 x0000000000480000
TBASE2 x0000072300120806
TDR0 x0000072100120805
TDR1 x0001E42C0012080B
TDR2 x00014F1A0012080C
TDR3 x0000982C0012080F
TDR4 x00006F0600120810
TDR5 x0000071D00120803
TDR6 x0000071F00120804
TDR7 x0000005800000008
----- DIGITAL 2100 A500 MEMORY SPECIFIC FRAME -----
MODULE NUMBER x0000000000000000
MERR x051E0FE0E2000008
MCMD1 x20200BFB20201D83
MCMD2 x8001506880015068
MCONF x0833006300180DE8
MEDC1 x000004110000000D
MEDC2 x2000000020000000
MEDCC x0000080000000800
MSCTL x000001D8000001D8
MREF x0000000000000000
FILTER x0000005800000008
----- DIGITAL 2100 A500 MEMORY SPECIFIC FRAME -----
MODULE NUMBER x0000000000000000
MERR x0000000000000000
MCMD1 x0000000000000000
MCMD2 x0000000000000000
MCONF x0000000000000000
MEDC1 x0000000000000000
MEDC2 x0000000000000000
MEDCC x0000000000000000
MSCTL x0000000000000000
MREF x0000000000000000
FILTER x0000005800000008
----- DIGITAL 2100 A500 MEMORY SPECIFIC FRAME -----
MODULE NUMBER x0000000000000000
MERR x0000000000000000
MCMD1 x0000000000000000
MCMD2 x0000000000000000
MCONF x0000000000000000
MEDC1 x0000000000000000
MEDC2 x0000000000000000
MEDCC x0000000000000000
MSCTL x0000000000000000
MREF x0000000000000000
FILTER x0000005800000008
----- DIGITAL 2100 A500 MEMORY SPECIFIC FRAME -----
MODULE NUMBER x0000000000000000
MERR x0000000000000000
MCMD1 x0000000000000000
MCMD2 x0000000000000000
MCONF x0000000000000000
MEDC1 x0000000000000000
MEDC2 x0000000000000000
MEDCC x0000000000000000
MSCTL x0000000000000000
MREF x0000000000000000
FILTER x0000000000000000
RECORD ENTRY DUMP:
RECORD HEADER
0000: 001304B8 00060018 00060101 330A734B *............Ks.3*
0010: 53494745 00005041 00000000 00000000 *EGISAP..........*
0020: 00000002 00000000 10010064 00000000 *........d.......*
0030: 00000000 00000000 *........ *
RECORD BODY
0038: 00000002 00000228 00000220 80000000 *....(... .......*
0048: 00000110 000001A0 0000008A 00000001 *................*
0058: 0000008A 00000001 00000058 00000000 *........X.......*
0068: 00000004 001E02F8 30303030 33343733 *........00003743*
0078: 00028728 00000000 15A6B000 FFFFFC00 *(...............*
0088: 00000240 00000000 00004200 00000000 *@........B......*
0098: 00000400 00000000 00000000 00000000 *................*
00A8: 004E4750 FFFFFC00 00000000 00000000 *PGN.............*
00B8: 004E4AF0 FFFFFC00 004E4B20 FFFFFC00 *.JN..... KN.....*
00C8: 004E4B80 FFFFFC00 004E48F0 FFFFFC00 *.KN......HN.....*
00D8: 004E4600 FFFFFC00 00019330 00000000 *.FN.....0.......*
00E8: 1FFFFB90 00000001 A148F2E0 FFFFFFFF *..........H.....*
00F8: 0068F2B0 FFFFFC00 00000000 00000000 *..h.............*
0108: 727E7E7E 40424272 B7F5AFF5 85CD1FDF *~~~rrBB@........*
0118: 00000000 00000000 00010000 00000000 *................*
0128: 00000E00 00000000 00000000 00000000 *................*
0138: 1797C000 00000000 00000000 FFFFFFFC *................*
0148: 00000001 00000000 017F5A58 00000000 *........XZ......*
0158: 004E39E8 FFFFFC00 00000000 00000000 *.9N.............*
0168: 00000000 00000000 00000004 001E02F8 *................*
0178: 00014000 00000000 00001CF0 00000000 *.@..............*
0188: 00000000 00000000 00001E31 00000000 *........1.......*
0198: 00000003 00000000 FFFFFFFF 00000007 *................*
01A8: 0000142E 00000000 00000240 00000000 *........@.......*
01B8: 00019330 00000000 5000E567 0000000E *0.......g..P....*
01C8: 00000000 00000000 00006100 00000000 *.........a......*
01D8: 00006170 00000000 BC148244 78290500 *pa......D.....)x*
01E8: C00001C5 C00001C5 00000000 00000000 *................*
01F8: 00000000 00000000 00000900 00000900 *................*
0208: 0D400CB5 0D400CB5 1C17EEDC 1C17EEDC *..@...@.........*
0218: 00000828 00000828 EE000000 F2003800 *(...(........8..*
0228: E0000043 E0000043 2F201D83 0F201D83 *C...C..... /.. .*
0238: 00000000 00000000 00000000 00000000 *................*
0248: 00000000 00000000 001BE800 001BE800 *................*
0258: 00094651 051E0FF9 0002001A 0002001A *QF..............*
0268: 00000011 000000B8 27020B80 FE030E0A *...........'....*
0278: 00000000 00000000 E3800010 E3800010 *................*
0288: 20201D83 00201D83 00080000 00000000 *.. .. .........*
0298: 81C40300 00000007 00000010 00000000 *................*
02A8: 00000000 00000000 0010603F 00000000 *........?`......*
02B8: 400807FF 00000000 3FF00000 00000000 *...@.......?....*
02C8: 00000000 00000000 000C00FF 00000000 *................*
02D8: 0FF00000 00000000 00480000 00000000 *..........H.....*
02E8: 00120806 00000723 00120805 00000721 *....#.......!...*
02F8: 0012080B 0001E42C 0012080C 00014F1A *....,........O..*
0308: 0012080F 0000982C 00120810 00006F06 *....,........o..*
0318: 00120803 0000071D 00120804 0000071F *................*
0328: 00000008 00000058 00000000 00000000 *....X...........*
0338: 00000000 00040001 E2000008 051E0FE0 *................*
0348: 20201D83 20200BFB 80015068 80015068 *.. .. hP..hP..*
0358: 00180DE8 08330063 0000000D 00000411 *....c.3.........*
0368: 20000000 20000000 00000800 00000800 *... ... ........*
0378: 000001D8 000001D8 00000000 00000000 *................*
0388: 00000008 00000058 00000001 00000000 *....X...........*
0398: 00000000 00000000 00000000 00000000 *................*
03A8: 00000000 00000000 00000000 00000000 *................*
03B8: 00000000 00000000 00000000 00000000 *................*
03C8: 00000000 00000000 00000000 00000000 *................*
03D8: 00000000 00000000 00000000 00000000 *................*
03E8: 00000008 00000058 00000002 00000000 *....X...........*
03F8: 00000000 00000000 00000000 00000000 *................*
0408: 00000000 00000000 00000000 00000000 *................*
0418: 00000000 00000000 00000000 00000000 *................*
0428: 00000000 00000000 00000000 00000000 *................*
0438: 00000000 00000000 00000000 00000000 *................*
0448: 00000008 00000058 00000003 00000000 *....X...........*
0458: 00000000 00000000 00000000 00000000 *................*
0468: 00000000 00000000 00000000 00000000 *................*
0478: 00000000 00000000 00000000 00000000 *................*
0488: 00000000 00000000 00000000 00000000 *................*
0498: 00000000 00000000 00000000 00000000 *................*
04A8: 00000000 00000000 00000000 5E3C7E25 *............%~<^*
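A note on reading the dump above: judging by the header row at offset 0010, each column appears to be a 32-bit longword printed as a number, with its bytes stored little-endian and the columns running left to right in increasing address order. The sketch below checks that reading against the system name. The helper names are made up for illustration and are not part of uerf.

```python
import struct

def longwords_to_bytes(hex_words):
    """Pack each 32-bit longword little-endian, in the order listed."""
    return b"".join(struct.pack("<I", int(w, 16)) for w in hex_words)

def printable(data):
    """Mimic the dump's right-hand column: non-printable bytes become '.'."""
    return "".join(chr(b) if 0x20 <= b < 0x7F else "." for b in data)

# Row 0010 of the RECORD HEADER, copied from the dump.
row = "53494745 00005041 00000000 00000000".split()
raw = longwords_to_bytes(row)

print(printable(raw))                        # same text as the *...* column
print(raw.rstrip(b"\x00").decode("ascii"))   # the system name field
```

If the assumption holds, the first line printed matches the ASCII column of row 0010 and the second matches the OCCURRED ON SYSTEM field.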
********************************* ENTRY 6. *********************************
| T.R | Title | User | Personal Name | Date | Lines |
|---|---|---|---|---|---|
| 2570.1 | cpu exceptions memory module0 errors | CSC32::HUTMACHER | | Wed Apr 02 1997 11:42 | 34 |
Hi, there is no version of DECevent that understands 2100A machines.
The last thing I heard is that there are no plans to support it. How could
this be, on the only currently shipping version of the 2100 product
line? Jim Hutmacher, MVHS Colorado CSC, 800-354-9000 ext 25561
It looks like your machine is getting memory errors on memory module 0.
The uerf -r 100 -Z printout shows a memory error on memory module 0:
0328: 00000008 00000058 00000000 00000000 memory module 0 frame
0338: 00000000 00040001 E2000008 051E0FE0 <--- the 40001 is error
0348: 20201D83 20200BFB 80015068 80015068 memory module 0
0358: 00180DE8 08330063 0000000D 00000411 *....c.3.........*
0368: 20000000 20000000 00000800 00000800 *... ...........*
0378: 000001D8 000001D8 00000000 00000000 *................*
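For anyone who wants to search a long dump for a word like the 00040001 above, here is a minimal sketch. It assumes the four hex digits before the colon are a byte offset and that each column is one 4-byte longword, which fits the rows advancing by x10. The function names are hypothetical, not a uerf or DECevent interface.

```python
import re

# Rows copied from the record dump for the memory module 0 frame.
DUMP = """\
0328: 00000008 00000058 00000000 00000000
0338: 00000000 00040001 E2000008 051E0FE0
0348: 20201D83 20200BFB 80015068 80015068
0358: 00180DE8 08330063 0000000D 00000411
0368: 20000000 20000000 00000800 00000800
0378: 000001D8 000001D8 00000000 00000000
"""

# Offset, colon, then one to four 8-digit hex longwords; anything after
# that (ASCII column, hand-written notes) is ignored.
LINE_RE = re.compile(r"^([0-9A-Fa-f]{4}):((?:\s+[0-9A-Fa-f]{8}){1,4})")

def parse_dump(text):
    """Return {byte_offset: longword} for every longword in the dump."""
    words = {}
    for line in text.splitlines():
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # skip headings such as "RECORD BODY"
        base = int(m.group(1), 16)
        for i, tok in enumerate(m.group(2).split()):
            words[base + 4 * i] = int(tok, 16)
    return words

def find_longword(words, value):
    """Return the sorted byte offsets whose longword equals value."""
    return sorted(off for off, w in words.items() if w == value)

words = parse_dump(DUMP)
hits = find_longword(words, 0x00040001)
print([f"{off:04X}" for off in hits])
```

The same parser can be fed the whole RECORD BODY, since lines that do not start with an offset are skipped.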
Your uerf entry is misaligned by one line, yet it still does not show the
error bits for memory module 0:
----- DIGITAL 2100 A500 MEMORY SPECIFIC FRAME -----
MODULE NUMBER x0000000000000000 < this is really MERR but no
MERR x051E0FE0E2000008 40001 error bit showing?
MCMD1 x20200BFB20201D83 |
MCMD2 x8001506880015068 | all these registers need to be
MCONF x0833006300180DE8 | shifted down by one
MEDC1 x000004110000000D |
MEDC2 x2000000020000000 |
MEDCC x0000080000000800 |
MSCTL x000001D8000001D8 V
MREF x0000000000000000
FILTER x0000005800000008
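The mismatch between the raw dump and the formatted frame can be checked mechanically. The sketch below pairs the raw longwords into 64-bit quadwords, low half first, and lists any raw quadword that uerf did not print for this frame. The low-half-first pairing is an assumption, inferred from the fact that it reproduces the printed MERR and MCMD1 values. The field-to-offset layout of the frame is not known here, so the code only compares values.

```python
# Longwords of the module 0 frame, in dump order starting at offset 0x0328.
RAW = """
00000008 00000058 00000000 00000000
00000000 00040001 E2000008 051E0FE0
20201D83 20200BFB 80015068 80015068
00180DE8 08330063 0000000D 00000411
20000000 20000000 00000800 00000800
000001D8 000001D8 00000000 00000000
""".split()

# Values uerf printed for the first memory frame (copied from the output).
FORMATTED = {
    "MODULE NUMBER": 0x0000000000000000,
    "MERR": 0x051E0FE0E2000008,
    "MCMD1": 0x20200BFB20201D83,
    "MCMD2": 0x8001506880015068,
    "MCONF": 0x0833006300180DE8,
    "MEDC1": 0x000004110000000D,
    "MEDC2": 0x2000000020000000,
    "MEDCC": 0x0000080000000800,
    "MSCTL": 0x000001D8000001D8,
    "MREF": 0x0000000000000000,
    "FILTER": 0x0000005800000008,
}

def quadwords(longwords, base):
    """Pair longwords (low half first) into (offset, 64-bit value) tuples."""
    vals = [int(w, 16) for w in longwords]
    return [(base + 4 * i, (vals[i + 1] << 32) | vals[i])
            for i in range(0, len(vals) - 1, 2)]

quads = quadwords(RAW, 0x0328)
printed = set(FORMATTED.values())
missing = [(off, q) for off, q in quads if q not in printed]
for off, q in missing:
    print(f"{off:04X}: x{q:016X} does not appear in the formatted frame")
```

With the data above, the only quadword left over is the one at offset 0338, which is the one carrying the 40001 word.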
| 2570.2 | One more question | BPSOF::SIPOS | | Thu Apr 03 1997 03:45 | 7 |
Thanks for looking at my note so quickly. I will try replacing memory module 0. Is there any guide or other information on how to interpret this log information? Is note 1001 also applicable to the 2100A? Thanks for your advice.