
Conference mvblab::alphaserver_4100

Title: AlphaServer 4100
Moderator: MOVMON::DAVISS
Created: Tue Apr 16 1996
Last Modified: Fri Jun 06 1997
Last Successful Update: Fri Jun 06 1997
Number of topics: 648
Total number of notes: 3158

470.0. "Unix Pal code 1.21 ?" by ONOIS1::CAVAILHE () Thu Feb 06 1997 11:41

    
    
    Hello all,
    
    The system is an AlphaServer 4100 (B3004-AA, 256MB) running UNIX V4.0A.
    
    At the start of my call, it had SRM V2.0 and UNIX PALcode V1.21.
    
    The customer's problem: in one case, when he runs a mathematical
    application, the system takes a machine check type 670 with machine
    check reason 8E = PAL bugcheck internal error.
    
    With DECevent V2.2, I see SRM V2.0-3 and UNIX PALcode V1.21-3.
    
    I asked the customer to upgrade SRM and UNIX PALcode from the V3.8
    CD-ROM. At the console prompt, SRM is now V3.0 and UNIX PALcode is V1.21.
    When the customer runs the program, we again get the same machine
    check type 670 with PAL bugcheck.
    
    With DECevent, I now see SRM V3.0 and UNIX PALcode V1.21-3.
    
    The customer told me that during the upgrade he saw SRM V3.0-6 and
    saw no errors for the UNIX PALcode.
    
    My questions are:
    
    1- Is UNIX PALcode V1.21-3 the real revision, or is it a decoding
       error in DECevent V2.2?
    2- If that UNIX PALcode revision is correct, how do I upgrade to
       V1.21-14?
    
    Please, could you give me a quick response.
    
    Thank you very much in advance.
    
    Best regards,
    
    Olivier Cavailhe,CSC_France.
    
T.R  Title  User  Personal Name  Date  Lines
470.1  more detail?  POBOXB::STEINMAN  Thu Feb 06 1997 13:48  6
    
    I'm not sure I understand the situation fully.
    
    Does the machine still fail with a 670 MCheck?
    
    mo
470.2  LANDO::CUMMINS  Thu Feb 06 1997 14:10  57
    PALcode is *always* updated at the same time the console is updated when
    LFU is used to do the update. The LFU and console are indelibly linked.
    If you are running V3.0-10 console, you are definitely running V1.21-14
    UNIX PALcode.
    
    Your note mentioned V3.0-6 console. This must be a typo? The released
    version is V3.0-10. I've included more info on passing PAL revision to
    UNIX and DECevent below..
    
    You should update your DECevent to V2.3 with the appropriate KNL files
    for the 4100/4000. Many improvements have been made to this version.
    V2.3 DECevent should be able to decode the bits correctly and provide
    better diagnosis capability.
    
    Let me know what you find..
    
    BC
    
    More on PAL revisions..
    
    At the SRM console prompt, you can type the following to look at where
    console stores console and PAL revision data.. Force a boot halt so the
    HWRPB data structure will be built by console.
    
      P00>>>b -h <device> -flags <flags> -filename <filename>
      .
      .
      P00>>>e -n -q 10 2350
      pmem:             2350 0000000400000408 
      pmem:             2358 0004000500010113 
      pmem:             2360 0004001200020115 
      P00>>>continue                   # continue boot sequence
    
    The three quadwords correspond to console revision, VMS PAL revision,
    and UNIX PAL revision. These fields are Alpha SRM, HWRPB-defined
    fields passed to UNIX and VMS.
    
    As a for instance, on a V4.8-4 console (soon to be released), location
    2360 translates to:
    
       0004 : version of PAL supports up to four CPUs
       0012 : hex 12 = dec 18 = PAL dash variation
       0002 : UNIX PALcode rev quadword identifier is 2 (1 is VMS)
       0115 : Major version 1; minor version hex 15 = dec 21.
    
    The UNIX PAL revision for V4.8-4 console is V1.21-18. This would be the
    same revision you'd see if you typed SHOW PAL at the SRM console prompt
    on a V4.8 SRM console..
    
    P00>>>show pal
    pal              OpenVMS PALcode V1.19-5, Digital UNIX PALcode V1.21-18
    P00>>>
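    
    The field breakdown above can be sketched in a few lines of Python.
    This is only an illustration: the function name and the exact bit
    widths are assumptions inferred from the worked example (four 16-bit
    fields, with the low field split into major and minor bytes), not
    taken from the HWRPB specification.

```python
def decode_pal_revision(quadword: int) -> dict:
    """Split a PAL revision quadword into the fields described in the note.

    Assumed layout, inferred from the worked example (high to low bits):
      63..48  number of CPUs this PAL supports
      47..32  dash variation
      31..16  PALcode type identifier (1 = VMS, 2 = UNIX)
      15..8   major version
       7..0   minor version
    """
    max_cpus = (quadword >> 48) & 0xFFFF
    variation = (quadword >> 32) & 0xFFFF
    pal_type = (quadword >> 16) & 0xFFFF
    major = (quadword >> 8) & 0xFF
    minor = quadword & 0xFF
    return {
        "max_cpus": max_cpus,
        "pal_type": {1: "VMS", 2: "UNIX"}.get(pal_type, "unknown"),
        "version": f"V{major}.{minor}-{variation}",
    }


# Quadwords at 2358 and 2360 from the console dump above.
vms = decode_pal_revision(0x0004000500010113)
unix = decode_pal_revision(0x0004001200020115)
print(vms["pal_type"], vms["version"])    # VMS V1.19-5
print(unix["pal_type"], unix["version"])  # UNIX V1.21-18
```

    Both results match the SHOW PAL output above, which is a quick way to
    cross-check what an error-log decoder reports against what the console
    actually passed in the HWRPB.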
    
    Digital UNIX gets its PAL from the console, so if you see the correct
    version of PAL via the console SHOW PAL command, then the UNIX machine
    is definitely running the correct version of UNIX PALcode.
    
470.3  response .2  ONOIS1::CAVAILHE  Fri Feb 07 1997 11:52  13
    
    Hello
    
    	Thank you for your good response in .2.
    
    	I have asked the FSE to upgrade to DECevent V2.3 + KNL; if the
    	problem occurs again, Michel Semedo will inform you.
    
    	Note: in week 7 I am in training in the Netherlands.
                          
    	Best regards
    
    	Olivier Cavailhe.
470.4  feedback with the last SRM console  ONOIS1::CAVAILHE  Fri Mar 21 1997 09:37  18
    
    Hello
    
    On 21 March, the customer upgraded to the latest SRM console,
    V4.8-6, with UNIX PALcode V1.21-18.
    
    When the user gives a special parameter to his mathematical application,
    the system crashes with machine check type 670 and machine check reason
    8E = PAL bugcheck.
    
    Note: DECevent V2.3 with the latest KNL still reports PALcode revision
    V1.21-3.
    
    I am passing this call to the central support center.
    
    Best regards,
    
    Olivier Cavailhe,CSC_France.
470.5  HARMNY::CUMMINS  Fri Mar 21 1997 10:17  8
    I've forwarded your note off to the UNIX and DECevent folks. Will let
    you know what I find out. As for the 670 MCHK halt, are you aware that
    you can gather info about what caused the error halt by reading through
    and using the methodology described in notes 413.*? If dropped back to
    the console prompt, you can use the INFO (4,5,8) commands directly. If
    dropped into XDELTA, you can use a few XDELTA commands to
    examine and record the contents of memory (PALcode's MCHK logout and
    IMPURE areas).
470.6  Feedback from UNIX/VMS PAL owner re: MCHK code 8E  HARMNY::CUMMINS  Fri Mar 21 1997 11:23  13
Bill -

Well, as noted, the mchk code of 8E is used when PAL hits a 'bugcheck'
condition.  There are a number of such conditions, so you'd want to 
find out which one.  Examples are unaligned references made from PALcode,
OPCDEC traps while in PAL, Arithmetic traps while in PAL, etc.  I believe
the PC of the machine check should be a PALcode address, which would 
point to the specific condition encountered.  Hey, here's a case where
the PALtemps in the logout frame might be useful, too.

I've never seen one of these before...

/dave
470.7  DECevent decoding PALcode incorrectly  POBOXA::SHEPARD  Fri Mar 28 1997 07:47  6
The PALcode field is not being decoded correctly.  An IPMT case is being
entered to correct this problem.

Thanks

Gary
470.8  DECevent fix release date  POBOXA::SHEPARD  Wed May 07 1997 15:25  4
    Just an update.  The fix for this will be in the DECevent release
    scheduled for 15-May-1997.
    
    Gary