Conference azur::mcc

Title: DECmcc user notes file. Does not replace IPMT.
Notice: Use IPMT for problems. Newsletter location in note 6187
Moderator: TAEC::BEROUD
Created: Mon Aug 21 1989
Last Modified: Wed Jun 04 1997
Last Successful Update: Fri Jun 06 1997
Number of topics: 6497
Total number of notes: 27359

4158.0. "DECMCC Crashing - KRNLSTAKNV, Kernel stack not valid" by PAKORA::BFJONES () Fri Nov 27 1992 10:21

    
    
    	Hi,
    		This continues my earlier entry in this conference,
    	note 4150. Following on from 4150, I tried a new installation of
    	BMS on my system, using the kit from note 3.169. To the
    	question "Upgrade Kit", I answered No. My VAX 4000 Model 60
    	has a private DNS namespace. Every time I run the installation,
    	the system crashes:
    
    
    	Dump:
    
    
    	System crash information
    ------------------------
    Time of system crash: 27-NOV-1992 09:13:34.25
    
    
    Version of system: VAX/VMS VERSION V5.5
    
    System Version Major ID/Minor ID: 1/0
    
    
    System type: VAXstation 4000-60
    
    Crash CPU ID/Primary CPU ID:  00/00
    
    
    Bitmask of CPUs active/available:  00000001/00000001
    
    
    CPU bugcheck codes:
            CPU 00 -- KRNLSTAKNV, Kernel stack not valid
    CPU 00 Processor crash information
    ----------------------------------
    
    
    CPU 00 reason for Bugcheck: KRNLSTAKNV, Kernel stack not valid
    
    
    Process currently executing on this CPU: System Manager
    
    
    Current image file:
    SQFNET$DKA100:[SYS0.SYSCOMMON.][SYSEXE]MCC_MAIN.EXE;8
    
    
    Current IPL: 31  (decimal)
    
    CPU database address:  81018000
    
    CPU 00 Processor crash information
    ----------------------------------
    
    General registers:
    
            R0  = 80002000   R1  = 00000000   R2  = 80E3A0E4   R3  = 80E3A0B0
            R4  = 80A9E1E0   R5  = 806D88F0   R6  = 7FFCB440   R7  = 00000032
            R8  = 80A94A30   R9  = 00000032   R10 = 00000050   R11 = 7FFE71CC
            AP  = 7FFE7360   FP  = 7FFE72EC   SP  = 810191F8   PC  = 80439544
            PSL = 041F0000
    
    CPU 00 Processor crash information
    ----------------------------------
    Processor registers:
    
    
            P0BR   = 81C6A200     SBR    = 02F48600     ASTLVL = 00000004
            P0LR   = 000023BD     SLR    = 0001D080     SISR   = 00000000
            P1BR   = 814B8400     PCBB   = 01E97C20     ICCS   = 00000040
            P1LR   = 001FF649     SCBB   = 02F42200     SID    = 12000003
    
            TODR    = 00000000    PCSTS   = 00000000
            SCCR    = 00000000    PAR_CTL   = 00000000
            MEMERR   = BAA87585
    
            ISP    = 810191F8
            KSP    = 7FFE71C8
            ESP    = 7FFE9800
            SSP    = 7FFECA48
            USP    = 7FECA1FC
    
    
     CPU 00 Processor crash information
    ----------------------------------
    
                    No spinlocks currently owned by CPU 00
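
    For reference, the PSL value in the dump (041F0000) can be decoded by
    hand. The following is only a minimal sketch in Python, assuming the
    standard VAX PSL layout (IPL in bits 20:16, interrupt-stack flag in
    bit 26, current mode in bits 25:24, previous mode in bits 23:22). The
    function name decode_psl is illustrative and is not part of SDA or any
    DEC tool.

```python
# Hypothetical helper: decode a few fields of a VAX PSL value.
# Assumed bit layout: IPL <20:16>, IS <26>, CUR_MOD <25:24>, PRV_MOD <23:22>.
MODES = ("kernel", "executive", "supervisor", "user")


def decode_psl(psl):
    """Return selected PSL fields as a dictionary."""
    return {
        "ipl": (psl >> 16) & 0x1F,
        "interrupt_stack": bool((psl >> 26) & 1),
        "current_mode": MODES[(psl >> 24) & 0x3],
        "previous_mode": MODES[(psl >> 22) & 0x3],
    }


# PSL taken from the crash dump above.
fields = decode_psl(0x041F0000)
print(fields)
```

    Under these assumptions the result is IPL 31, kernel mode, running on
    the interrupt stack, which agrees with the "Current IPL: 31" line in
    the dump.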
    
    
    
    	Does anyone have any ideas on fixing this one? If anyone does and needs
    	more information from my crash dump, let me know. My original
    	setup of MCC had the field test version 1.2.7 on my system,
    	which had been O.K. up to last Monday.
    
    	Once this installation is complete I plan to restore my old
    	MIRs to MCC_COMMON.
    
    
    
    	/Brian
    
    
    	Brian F. Jones
    	Networks Support South Queensferry
    	THERAJ::BFJONES
    	Bri Jones @SQF
    	DTN 789 8399
     
T.R  Title  User  Personal Name  Date  Lines
4158.1  same pb  TAV02::KALAI  Sun Nov 29 1992 08:28  8
I had the same problem two days before you, with a similar configuration.
The cause was SET AUDIT: disabling it and freeing more space on the system
disk resolved my problem, and the crashes stopped.
   


Regards
Yael Kalai
4158.2  Looks like that could be it !!  PAKORA::BFJONES  Mon Nov 30 1992 11:22  21
    
    Yael,
    	Thanks for the reply ... that seems to have done the trick. The
    	question is why MCC_AUDIT doesn't check these things out in the
    	first place! ... a few days wasted ...
    
    
    
    	Thanks
    
    
    
    	/Brian
    
    
    
    	Brian F. Jones
    	South Queensferry Telecoms
    	DTN 789 8399
    	THERAJ::BFJONES
    	Bri Jones @SQF
4158.3  cannot live without auditing...  BACHUS::DEWILDE  Patrick  Tue Dec 08 1992 06:36  11
    
    	Yes indeed, it's resetting auditing that did the trick, but this is
    not a real solution, we need auditing. My nonpaged pool is OK; is there
    anything else I can check to avoid system crashes ANYTIME someone
    starts MCC with auditing enabled?
    
    	Thanks for any hint,
    
    	Patrick.
4158.4  DECMCC CRASH  PAKORA::BFJONES  Tue Dec 08 1992 06:42  8
    Patrick,
    		Erik Mintz will be passing on my crash dump to DECmcc
    Engineering to have a look at. Indeed I must agree that we need
    auditing enabled, but until someone comes up with a solution on this one
    we're stuck! ... all my system parameters look OK.
    
    
    /Brian
4158.5  Chasing into DNS  TOOK::MINTZ  Erik Mintz  Tue Dec 15 1992 17:20  5
We believe we have traced this problem into DECdns code, and
the discussion is continuing in NOTED::DNS note 795.

-- Erik

4158.6  May have a fix in DECmcc V1.3 update  TOOK::T_HUPPER  The rest, as they say, is history.  Mon Feb 15 1993 09:43  11
    Just to clean this up a bit, we have located a kernel-mode routine to
    extend the kernel stack.  If this routine is fully supported,
    well-behaved, and exists for all versions of VMS that DECmcc ships on,
    we will fix this problem in an update of DECmcc V1.3.  Given that we
    don't know much about this routine yet, this fix involves too much risk
    for squeezing into the imminent release of DECmcc V1.3.
    
    This problem is being tracked as QAR 355 in the MCC013_INT QAR
    database.
    
       Ted
4158.7  TOOK::SWIST  Jim Swist LKG2-2/T2 DTN 226-7102  Mon Feb 15 1993 10:22  2
    Why in the world is the kernel stack size not a sysgen parameter?
    
4158.8  Another one crashed, any solution yet ?  ZPOVC::SINSPS  Thu Mar 04 1993 16:30  11
    Hi, is there any solution yet (I mean a patch or something)? My
    customer just called to complain about a similar crash. He has set up
    several operator accounts in order to manage his DECMCC station. Whenever
    any of these accounts invoked any MCC command, his system would crash with
    KRNLSTAKNV, Kernel stack not valid. However, when he used the system
    account, there was no problem. His DECMCC-EMS is version 2.2 and, being
    in a banking environment, AUDITING is required.
    
    Thanks,
    
    - LEH
4158.9  STAR::DANIELE  Fri Mar 12 1993 12:42  8
      <<< Note 4158.7 by TOOK::SWIST "Jim Swist LKG2-2/T2 DTN 226-7102" >>>

>    Why in the world is the kernel stack size not a sysgen parameter?
 
	Hi Jim,
	
	   Go to Alpha!   

4158.10  any solution?  TROOA::GREENALL  Tue Aug 17 1993 12:17  14
    
    Ok folks:
    
    	Again, is there a solution for the above-mentioned problem? I am
    seeing this daily on an MCC station we use to manage the internal
    network in Canada.  
    
    	Turning off auditing is not an option, with the Inspect
    Compliance *&* going around.   MCC Version 1.3, VMS 5.5-2.
    
    Rich
    
    
    
4158.11  mcc1.3 update available ?  SWTHOM::NOBRE  Jocelyne Nobre - CST France  Tue Oct 26 1993 10:12  238
	Hi,

	Is there now a solution for this kind of crash?

	My customer uses VMS 5.5-2, DECmcc 1.3, DECnet/OSI 5.6.
	The MCC station is a DNS server.
	MCC_AUDIT gives correct feedback.
	There is no disk space problem (see reply .1), but the
	audit server is indeed running.

	What about the DECmcc V1.3 update?
	Thanks for any news,
	Regards,
	Jocelyne.

	Crash dump and analysis follow :
	_______________________________


Selective Dump:    Dump Flags    = 020001C0
Current Process Name             = VUE$TOURNELLE_4
Current IPL                      = 31
Dump file version                = 0530
# memory pages in dump           = 78273  Physical memory = 80 MB
SID                              = 13000202
Crash time                       = 25-OCT-1993 11:01:55.78
Processor was operating on the Interrupt Stack
Current Access mode              = KERNEL
Number of processes in dump      = 7
Node Name                        = RM40     Clustered
Crash CPU/Primary CPU            = 0 / 0
CPU  0 Crash Code 20C,Crash Type = KRNLSTAKNV
Bitmask of CPUs available        = 1
Bitmask of CPUs active           = 1
CPU Bitmask completing bugcheck  = 1
Current image:                   = RM40$DKA300:[SYS0.SYSCOMMON.][SYSEXE]MCC_MAIN.EXE;6
CPU Type                         = VAXstation 4000-90
VMS Version                      = V5.5-2

                         Symbol     Value       Contents

                 EXE$GL_MCHKERRS -> 800044F0  = 00000000
                  EXE$GL_MEMERRS -> 800044F4  = 00000000
                    EXE$GL_STATE -> 80008490  = 000007FE
                 EXE$GQ_BOOTTIME -> 80004460  = 13-OCT-1993 07:55:43.93
                    EXE$GL_FLAGS -> 800042F0  = 02412875
                  IO$GL_UBA_INT0 -> 800044F8  = 00000000
              PMS$GL_NPAGDYNEXPF -> 80004700  = 00000000
                 PMS$GL_NPAGDYNF -> 8000491C  = 00000000
            PMS$GL_NPAGDYNFPAGES -> 80004920  = 00000000
              PMS$GL_NPAGDYNREQF -> 8000492C  = 00000000
                  PMS$GL_PAGDYNF -> 80004704  = 00000000
             PMS$GL_PAGDYNFPAGES -> 80004924  = 00000000
               PMS$GL_PAGDYNREQF -> 80004934  = 00000000
                  SCH$GL_FREECNT -> 80004018  = 00006511
                   SCH$GL_PFRATL -> 800080C4  = 00000000

  SBR 04E8F400  SLR 00049280

Processor Information for Crash CPU  0 (Hex)

   R0 8148BBF0   R1 813B15F0   R2 80E8235E   R3 80E8234E   R4 00000000
   R5 80E8234E   R6 8148C965   R7 81457A37   R8 00000001   R9 8148C955
  R10 80E82639  R11 00000002   AP 7FFE7260   FP 7FFE722C   SP 828CB1F8
   PC 80888744  PSL 041F0000  KSP 7FFE7200  ESP 7FFE9800  SSP 7FFECA48
  USP 01004C10  ISP 828CB1F8 P0BR 84829600 P1BR 840ECC00 P0LR 000082CE
 P1LR 001FF38E PCBB 0350C820 SCBB 04E84600 SISR 00000000

ASTLVL  = 00000004
ICCS    = 00000041

CPU Dependent Registers:

# Regs = 1E (Hex)
A92FE0A9
ECC80001
00000000
848EC8E0
00004004
01000200
00000001
0000008E
800001D0


 PCB    State CPU Process Name     Username     EPID    Pri PHD        Wkset

808E3F98   HIB    SWAPPER                       20200081 16 808E3E00      0
813729F0   LEF    DFG$RM40         DFG          20200103  6 8608BA00    863
813B15E0   CUR  0 VUE$TOURNELLE_4  TOURNELLEC   20200A85  6 847FAE00  23262
RM40$DKA300:[SYS0.SYSCOMMON.][SYSEXE]MCC_MAIN.EXE;6
811F2330   HIB    CONFIGURE        SYSTEM       20200086 10 828CD000    196
RM40$DKA300:[SYS0.SYSCOMMON.][SYSEXE]CONFIGURE.EXE;3
811FD1D0   HIB    IPCACP           SYSTEM       20200087 10 82BA2A00    101
8142CEF0   LEF    DECW$MWM         TOURNELLEC   20200688  6 87FB9800   1475
811FDD20   HIB    ERRFMT           SYSTEM       20200089  8 82D86600    124
811DCFF0   HIB    CACHE_SERVER     SYSTEM       2020008A 16 83515600    145
811DD290   HIB    CLUSTER_SERVER   SYSTEM       2020008B  8 82F6A200    285
811DD510   COM    OPCOM            SYSTEM       2020008C  6 8305C000    239
811DDBE0   HIB    AUDIT_SERVER     AUDIT$SERVER 2020008D 10 8314DE00     99
811FD9C0   HIB    JOB_CONTROL      SYSTEM       2020008E 10 8323FC00    159
811FE4C0   HIB    QUEUE_MANAGER    SYSTEM       2020008F  8 83331A00    394
8120B630   LEF    DNS$ADVER        SYSTEM       20200090  4 829BEE00   2048
8121A800   HIB    LES$ACP_V30      SYSTEM       20200091  8 84341800    467
8124B500   HIB    REMACP           SYSTEM       20200093  8 84617200     88
RM40$DKA300:[SYS0.SYSCOMMON.][SYSEXE]REMACP.EXE;2
81256BE0   HIB    NET$ACP          DNA$SessCtrl 20200094  4 83607400    649
8125F080   LEF    DTSS$SERVER      SYSTEM       20200096 11 837EB000    468
81265C50   HIB    NET$MOP          SYSTEM       20200097  4 838DCE00    873
812688C0   HIB    SMISERVER        SYSTEM       20200098  9 839CEC00     90
8126B870   HIB    TP_SERVER        SYSTEM       20200099 10 83AC0A00    155
8126C320   HIB    OSAK$SERVER_V3   SYSTEM       2020009A 12 83BB2800     90
81272C60   LEF    OSAK$NETMAN      SYSTEM       2020009B 12 83CA4600     93
81274640   HIB    DXD$DSA_SERVER   SYSTEM       2020009C  6 83D96400    149
811EEB90   COM    DECW$SERVER_0    SYSTEM       2020011D  6 858FCA00   9585
812850D0   HIB    LATACP           SYSTEM       202000A0 14 8406BE00    242
812BA220   LEF    ATK DAL Server   SYSTEM       20200121  7 84DA6200    388
813E5590   LEF    UCX$FTPC_5       TOURNELLEC   202005A2  7 87B00200    427
812F3CA0   HIB    BATCH_927        TOURNELLEC   20200DA3  5 86636E00    363
81365BB0   LEF    DECW$SESSION     TOURNELLEC   20200125  6 85718E00  11996
81372120   HIB    STOCK_DEFAULT    SYSTEM       20200126  4 86453200    102
81275C20   LEF    DQS$NOTIFIER     SYSTEM       202000A8  8 8415DC00    105
8128C220   LEF    NSCHED           SYSTEM       202000A9  8 8424FA00    149
81373420   HIB    SYMBIONT_6       SYSTEM       2020012A  4 86545000    340
8128D500   LEF    SCHED_REMOTE     SYSTEM       202000AB  8 84433600    174
81375B10   HIB    SYMBIONT_7       SYSTEM       2020012C  4 82AB0C00    102
RM40$DKA300:[SYS0.SYSCOMMON.][SYSEXE]DQS$SMB.EXE
8136C1B0   HIB    BATCH_579        TOURNELLEC   20200132  5 8617D800    377
81296140   HIB    AppleTalk ACP    SYSTEM       202000B4  8 84F89E00    338
81246DF0   HIB    SNS$WATCHDOG     DCM          20200137  3 85CC4200    378
812F19F0   LEF    UCX$FTPC_10      TOURNELLEC   202005B9  7 8727F400    427
812B18D0   HIB    ATKGW$ACP        ATKGW$USER   202000BB  8 8507BC00    212
813DD950   LEF    DECW$TE_113C     SYSTEM       2020113C  7 8681AA00    337
81244250   LEF    SYSTEM           SYSTEM       202012BD  5 86BE2200    204
81380B80   LEF    DNS$Server       DNS$Server   20200740  8 84709000   5453
813C4AA0   LEF    _FTA29:          TOURNELLEC   202003C1  5 86728C00    515
81393600   LEF    VUE$TOURNELLE_3  TOURNELLEC   20200142  5 86361400   1166
81395A30   LEF    DECW$TE_0144     TOURNELLEC   20200144  5 83F7A000   2415
8139A710   LEF    TOURNELLEC       TOURNELLEC   20200145  5 8690C800    310
81385210   LEF    VUE$TOURNELLE_5  TOURNELLEC   20200146  6 869FE600   3504
812B5EC0   HIB    MSAF$SERVER0     SYSTEM       202000C7  7 848ECC00    121
812955D0   HIB    SYMBIONT_1       SYSTEM       202000CB  5 87646C00    107
812B69D0   HIB    SYMBIONT_2       SYSTEM       202000CC  5 84525400    110
812B7CC0   HIB    SYMBIONT_3       SYSTEM       202000CE  5 84AD0800    110
8138E300   LEF    UCX$FTPC_12      PERROT       202007D0  7 8626F600    427
81392E80   COM    NET$EVD          SYSTEM       202006D2  4 82E78400    517
813B12A0   LEF    VUE$TOURNELL_10  TOURNELLEC   202014D4  4 86CD4000    469
813B5820   LEF    VUE$TOURNELLE_8  TOURNELLEC   202017D5  4 86FA9A00   4535
81262960   LEF    UCX$FTPD         UCX$FTP      20200559 10 8782A800    112
812B9850   HIB    NETBIOS          SYSTEM       202000DA  9 84CB4400     99
813ACE30   LEF    UCX$FTPC_1       TOURNELLEC   2020055B  7 8791C600    427
8139F5A0   HIB    BATCH_597        TOURNELLEC   2020015C  5 8718D600   2101
813BDB40   HIB    _FTA20:          TOURNELLEC   2020055D  7 87A0E400   5901
812B7B10   HIB    DECPS_DC         VPA          202000DE 15 849DEA00   1224
813EB9D0   LEF    DECW$TE_0B5F     PERROT       20200B5F 10 83423800   4078
813BD850   LEF    PERROT           PERROT       20200DE0  6 85F99C00    618
812B8340   HIB    PCFS_SERVER      SYSTEM       202000E1 11 83E88200   2152
81341B40   HIB    LAD$KERNEL       SYSTEM       202000E2  9 836F9200    203
81341E30   HIB    INET_ACP         INTERnet     202000E3 10 84BC2600    230
81355670   HIB    SMTP_RM40_01     SYSTEM       202000E4  5 84E98000   4104
8135FD80   LEF    VT_RESPONDER     SYSTEM       202000E5  4 8516DA00    483
8135FF30   LEF    VT_LAT_GTWY      SYSTEM       202000E6  4 8525F800    391
81361880   LEF    LAT_VT_GTWY      SYSTEM       202000E7  4 85BD2400    298
81362100   LEF    VT_TELNET_GTWY   SYSTEM       202000E8  4 85EA7E00    408
81363030   LEF    TELNET_VT_GTWY   SYSTEM       202000E9  4 85351600    784
81363A80   HIB    ELMS$NIMUX       SYSTEM       202000EA  8 85443400    109
813677E0   LEF    ULN_WATCHDOG     ULNET        202000EC  7 85627000    530
812F20E0   LEF    LERAY            LERAY        20200BED  8 82C94800   2905
81367AD0   LEF    MCC_TS_AM_SRV    SYSTEM       202000EE  4 85535200    512


Kernel Stack pointer 7FFE7200 outside stack limits 7FFE7200 : 7FFE7800

Contents of INTERRUPT Stack:

828CB1D8 8148C955 NONPAGED_POOL+30E955
828CB1DC 80E82639 PAGED_POOL+D4E39
828CB1E0 00000002
828CB1E4 7FFE7260 CTL$GL_KSTKBAS+60
828CB1E8 7FFE722C CTL$GL_KSTKBAS+2C
828CB1EC 828CB1F0
828CB1F0 80888744 <= PC EXCEPTION+144 EXE$MCHECK
828CB1F4 041F0000

828CB1F8 808887FD <= SP <= PC EXCEPTION+1FD EXE$EXCEPTION+2
828CB1FC 00000001



Instruction at PC 808887FD

EXE$EXCEPTION+2          808887FD PUSHR   #03

Instructions around PC 808887FD

EXE$EMULAT_REFLECT+E     808887D5 BGEQ    EXE$EMULAT_REFLECT+12
Invalid opcode 50 @808887D7
EXE$EMULAT_REFLECT+11    808887D8 CMPF    #12,#1A
EXE$EMULAT_REFLECT+14    808887DB MFPR    #12,R1
EXE$EMULAT_REFLECT+17    808887DE CMPL    #02,R1
EXE$EMULAT_REFLECT+1A    808887E1 BLSS    EXE$EMULAT_REFLECT+20
EXE$EMULAT_REFLECT+1C    808887E3 BBC     #1A,R0,EXE$EMULAT_REFLECT+24
EXE$EMULAT_REFLECT+20    808887E7 BUGW    #01CC
EXE$EMULAT_REFLECT+24    808887EB PROBER  #00,#04,@#CTL$AL_STACK
EXE$EMULAT_REFLECT+2C    808887F3 BEQL    EXE$EMULAT_REFLECT+20
EXE$EMULAT_REFLECT+2E    808887F5 JMP     EXE$REFLECT+14C
EXE$EXCEPTION            808887FB PUSHL   #01
EXE$EXCEPTION+2       PC>808887FD PUSHR   #03
EXE$EXCEPTION+4          808887FF MNEGL   #03,-(SP)
EXE$EXCEPTION+7          80888802 PUSHL   FP
EXE$EXCEPTION+9          80888804 PUSHL   #04
EXE$EXCEPTION+B          80888806 ADDL3   #06,18(SP),R0
EXE$EXCEPTION+10         8088880B TSTL    (SP)[R0]
EXE$EXCEPTION+13         8088880E BGEQ    EXE$EXCEPTION+35
EXE$EXCEPTION+15         80888810 MOVAL   @#CTL$AL_CMCNTX,R1


Failing PC is 808887FD EXCEPTION+1FD  EXE$EXCEPTION+2



VMS Version     : 5.5-2
Crash Type      : KRNLSTAKNV
Current Process : VUE$TOURNELLE_4
Current Image   : MCC_MAIN
CPU Type        : 4000-90
SID             : 13000202
Signal Array cnt: 0
Exception par #1: FFFFFFFF
Exception par #2: FFFFFFFF
Exception par #3: FFFFFFFF
Exception PC    : 808887FD
Exception PSL   : 00000001
Failing Inst    : PUSHR
Code Module     : EXE$EXCEPTION
Offset          : 2
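
The "Kernel Stack pointer 7FFE7200 outside stack limits 7FFE7200 : 7FFE7800"
line in the analysis can be worked through numerically. This is a rough
sketch in Python, assuming the 512-byte VAX page size; the helper name
kernel_stack_summary is hypothetical. Applying the same limits to the dump
in 4158.0 (KSP = 7FFE71C8) is also an assumption, since that dump does not
print its stack limits.

```python
# Hypothetical helper: summarise kernel stack usage from hex values
# as printed by the crash dump analysis.
VAX_PAGE_BYTES = 512  # assumed VAX page size in bytes


def kernel_stack_summary(ksp_hex, low_hex, high_hex):
    """Return stack size and remaining room; the stack grows downward."""
    ksp, low, high = (int(v, 16) for v in (ksp_hex, low_hex, high_hex))
    size = high - low
    return {
        "size_bytes": size,
        "size_pages": size // VAX_PAGE_BYTES,
        # Zero or negative means there is no room left for another push.
        "bytes_left": ksp - low,
    }


# Values from this dump (4158.11).
second = kernel_stack_summary("7FFE7200", "7FFE7200", "7FFE7800")
# KSP from the dump in 4158.0, with the same limits ASSUMED.
first = kernel_stack_summary("7FFE71C8", "7FFE7200", "7FFE7800")
print(second)
print(first)
```

Under these assumptions the kernel stack is 1536 bytes (3 pages). In this
dump the KSP sits exactly at the low limit, so no room is left, and in
4158.0 the KSP would already be 56 bytes below it. Both are consistent with
an exhausted kernel stack, which fits the idea in reply .6 of extending it.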

4158.12  Same error + NETDLLERR, Any Actions?  BERN01::GMUER  Mon Dec 20 1993 12:03  19
During the last two months we have installed 3 MCC stations on DECnet 
Phase V nodes (VMS 5.5-2, DECnet/OSI 5.6B, DNS V2.0, DECmcc V1.3).

All systems crash regularly:

   - Two stations with KRNLSTAKNV, Kernel stack not valid

   - One station with NETDLLERR, DECnet Datalink Layer detected a fatal error

Disabling the audit process did not help.

We will escalate the problem through the appropriate channels. I would like
to know whether any action has already been taken. Has anybody found a solution?

Thanks for any help or comment!

Edgar Gmuer
Network Integration Services, Berne, Switzerland