[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference vaxaxp::vmsnotes

Title:VAX and Alpha VMS
Notice:This is a new VMSnotes, please read note 2.1
Moderator:VAXAXP::BERNARDO
Created:Wed Jan 22 1997
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:703
Total number of notes:3722

329.0. "qman dump" by MDR01::NIEVES (MCS Madrid) Fri Mar 14 1997 06:32

    Hi
    
    this customer has a cluster with two 4300, OFIMA1 and OFIMA2.
    VMS 6.2.
    
    He tells me that the queue_manager went down, and although they have
    the start/queue/manager/on the two nodes, the queue_manager didn't
    start, they had to start it manually. This happened two times, while
    SLS was doing an incremental backup.
    ********************************************************************
    
    $ show queue/manager/full
    Master file:  SYS$SYSROOT:[SYSEXE]QMAN$MASTER.DAT;
    
    Queue manager SYS$QUEUE_MANAGER, running, on OFIMA2::
      /ON=(*)
      Database location:  SYS$COMMON:[SYSEXE]
    
    ********************************************************************
    
    This is the operator.log where it mentions the error:
    
    
      System MIRLO1 is a AlphaServer 8400 Model 5/350 running VMS V6.2-1H3
    in a VAXc
    luster.
      (HW Ver=05130000000000000000000C, SID=80000000, XSID=00000000,
    DSN=V1.2)
    
    ^G%%%%%%%%%%%  OPCOM  11-MAR-1997 22:47:21.70  %%%%%%%%%%%
     Message from user SLS on OFIMA1
     BATCH_206, Commencing OFI_M_SBK backup of $1$DIA0:[000000...]*.*;*
      
    ^G%%%%%%%%%%%  OPCOM  11-MAR-1997 23:14:26.35  %%%%%%%%%%%    (from
    node OFIMA
     Message from user JROD on OFIMA2
        Defragmentation process normal completion
             Process ID: 23A02657
             Device: _$1$DIA2:
             Time: 11-MAR-1997 23:14:15.32
    
     ^G%%%%%%%%%%%  OPCOM  11-MAR-1997 23:26:23.09  %%%%%%%%%%%    (from
    node OFIMA
     Message from user QUEUE_MANAGE on OFIMA2
     %QMAN-E-WRITEERR, error writing
    $1$DIA0:[SYS2.SYSCOMMON.][SYSEXE]SYS$QUEUE_MAN
    
     ^G%%%%%%%%%%%  OPCOM  11-MAR-1997 23:26:23.10  %%%%%%%%%%%    (from
    node OFIMA
     Message from user QUEUE_MANAGE on OFIMA2
     -RMS-F-RMV, ACP remove function failed
       
    
     ^G%%%%%%%%%%%  OPCOM  11-MAR-1997 23:26:23.10  %%%%%%%%%%%    (from
    node OFIMA
     Message from user QUEUE_MANAGE on OFIMA2
     -SYSTEM-W-ACCONFLICT, file access conflict
                                              
    *******************************************************************
    
    $analy/full/process QMAN$QUEUE_MANAGER.DMP
    
    
     R0 = 000010F8  R1 = 00001064   R2 = 0001828A   R3 = 00000800
     R4 = 00000000  R5 = 00000000   R6 = 00000000   R7 = 000022F8
     R8 = 000110F8  R9 = 00027EF8   R10 = 00027EF0  R11 = 00027DB0
     SP = 7FF4AF9C  AP = 7FF4AFA0   FP  = 7FF4B030
    
     FREE_P0_VA  0016EE00           FREE_P1_VA  7FF48E00
     Active ASTs  00                Enabled ASTs 07
     Current Privileges  FFFFFFFF  FFFFFFFF
     Event Flags  00000000  E0000000
     Buffered I/O count/limit 200/200
     Direct I/O count/limit   200/200
     File count/limit         194/200
     Process count/limit      0/0
     Timer queue count/limit  299/300
     AST count/limit          299/300
     Enqueue count/limit      5964/6000
     Buffered I/O total 75          Direct I/O total 3367
                                                        
    Link Date  22-APR-1995 00:51:16.83
    
     Kernel stack 00000008 pages at 7FFE6A00 moved to 0016EE00
     Exec stack 00000011 pages at 7FFE7800 moved to 0016FE00
     Vector page 00000001 page at 7FFEFE00 moved to 00172000
     PIO (RMS) area 00000007 pages at 7FFDFE00 moved to 00172200
     Image activator context 00000001 page at 7FFE2600 moved to 00173000
     User writeable context 0000000A pages at 7FFE0C00 moved to 00173200
                                        
    
    
    
    
    Creating a subprocess
    Condition signalled to take dump:
    %QMAN-F-OPENERR, error opening
    $1$DIA0:[SYS0.SYSCOMMON.][SYSEXE]SYS$QUEUE_MANAGE
    R.QMAN$JOURNAL;1
    %QMAN-F-OPENERR, error opening
    $1$DIA0:[SYS0.SYSCOMMON.][SYSEXE]SYS$QUEUE_MANAGE
    R.QMAN$JOURNAL;1
    -RMS-E-FLK, file currently locked by another user
    -RMS-E-FLK, file currently locked by another user
    -SYSTEM-W-ACCONFLICT, file access conflict
    -SYSTEM-W-ACCONFLICT, file access conflict
    -RMS-S-NOJOB, job !AS not found in !AS queue
    %DEBUG-I-CANTCREATEMAIN, could not create the debugger subprocess
    %DEBUG-I-CANTCREATEMAIN, could not create the debugger subprocess
    -LIB-F-NOCLI, no CLI present to perform function
    -LIB-F-NOCLI, no CLI present to perform function
    %DEBUG-I-SHRPRC, debugger will share user process
    %DEBUG-I-SHRPRC, debugger will share user process
    
             OpenVMS VAX DEBUG Version V6.2-000
    
    %DEBUG-I-NOLOCALS, image does not contain local symbols
               
    
    
    Any help will be appreciated,
    
    Regards,
    
    Nieves
T.RTitleUserPersonal
Name
DateLines
329.1wild guess...HAN::HALLEVolker Halle MCS @HAO DTN 863-5216Fri Mar 14 1997 06:567
    Nieves,
    
    they seem to be backing up the system disk (DIA0) at that time ! Maybe
    BACKUP is just accessing the QMAN journal file and the queue manager
    tried to create a new version  and delete the old one (RMS-F-RMV) ?
    
    Volker.
329.2more infoMDR01::NIEVESMCS MadridFri Mar 14 1997 08:358
    Hi,
    
    that's the same I was thinking about, but I need to give the customer
    the correct reason, that's why I'm asking for more info.
    
    Thanks,
    
    Nieves
329.3no newsMDR01::NIEVESMCS MadridTue Mar 18 1997 05:403
    anything about this matter???
    
    Nieves