
Conference abbott::mailworks-vms

Title:MailWorks for OpenVMS
Notice:kit info notes 3-6; policies note 2; reporting bugs note 7
Moderator:KOALA::LAVASH
Created:Wed Jul 28 1993
Last Modified:Mon Jun 02 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:1583
Total number of notes:6814

1548.0. "MUAS$SERVER ACCESS VIOLATION " by EVTAI1::GODARD () Tue Feb 25 1997 08:56

    Hi,
    
    Since yesterday we have had a BIG problem with a customer configuration
    (important customer: Conseil de l'Europe):
    
    About 15 nodes with MailWorks V1.3A-3 and Remote Message Router.
    One node with Message Router V3.3-313
                                       
    Problem: on some nodes, we can't start MUAS$SERVER.
    MUAS$SERVER_OUTPUT.LOG shows:
    SYSTEM-F-ACCVIO, access violation, reason mask=04, virtual address=7F8E9D80, P
    
      Improperly handled condition, image exit forced.
        Signal arguments:   Number = 00000005
                            Name   = 0000000C
                                     00000004
                                     7F8E9D80
                                     005CA0F4
                                     0000001B
    
        Register dump:
        R0  = 0000000000000001  R1  = 000000007F8EA6B0  R2  = 0000000000588E48
        R3  = 000000007F8FBC80  R4  = 000000007F8FBC80  R5  = 00000000006DC000
        R6  = 0000000000002C27  R7  = 0000000000000037  R8  = 00000000000000AA
        R9  = 0000000000000041  R10 = 0000000000000041  R11 = 000000000000008D
        R12 = 000000007F8EB138  R13 = 000000000000006F  R14 = 00000000000000F1
        R15 = 0000000000000078  R16 = 000000007F8FBC80  R17 = 000000007F8EA738
        R18 = 000000000058910A  R19 = 0000000000000000  R20 = 0000000000002C27
        R21 = 0000000000000037  R22 = FFFFFFFF80000031  R23 = FFFFFFFF80000000
        R24 = 0000000000000031  R25 = 0000000000000006  R26 = 00000000005CA07C
        R27 = 0000000000588EF0  R28 = 0000000000000000  R29 = FFFFFFFF8003947C
        SP  = 000000007F964000  PC  = 00000000005CA0F4  PS  = 000000000000001B
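
A register dump like the one above is easier to compare across nodes once it is turned into name/value pairs. The following is only a minimal sketch, not part of MailWorks or OpenVMS: the function name `parse_register_dump` and the sample text are illustrative assumptions. It relies on nothing more than the `NAME = 16 hex digits` pattern visible in the log, and it tolerates a value that was wrapped onto the next line.

```python
import re

# Matches "R12 = 000000007F8EB138", "SP  = ...", "PC  = ...", "PS  = ...".
# \s* also spans a line break, so a value wrapped onto the next line is found.
REGISTER_RE = re.compile(r"\b(R\d{1,2}|SP|PC|PS)\s*=\s*([0-9A-Fa-f]{16})\b")


def parse_register_dump(text):
    """Return {register name: integer value} for every register in text."""
    return {name: int(value, 16) for name, value in REGISTER_RE.findall(text)}


# Two rows copied from the dump above, left wrapped on purpose.
sample = """
    R0  = 0000000000000001  R1  = 000000007F8EA6B0  R2  =
0000000000588E48
    SP  = 000000007F964000  PC  = 00000000005CA0F4  PS  =
000000000000001B
"""

regs = parse_register_dump(sample)
for name, value in regs.items():
    print(f"{name:<3} = {value:016X}")
```

Feeding the dumps from two failing nodes through the same function makes it straightforward to check whether they stop at the same PC.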
                                                                              
    
    If we change the configuration so that it does not use MR, MUAS$SERVER is REAL
    
    All nodes were rebooted this morning and all messages from Message Router
    have been delivered, but after some time, some MUAS$SERVER processes
    disappeared and now we can't start them.
    
    
    Do you have any ideas?
    Thanks
    Monique
     
    
    
T.R     Title               User            Personal Name  Date                  Lines
1548.1  recreate server     EVTAI1::GODARD                 Tue Feb 25 1997 10:50 6
    Delete the server muas$server_000, then re-create the server
    muas$server_000 on each node.
    
    ==> ALL IS WORKING
    
 	Monique
1548.2  Same Problem        WOODY::FABBRI                  Wed Feb 26 1997 04:39 14
Hi,

This morning we had the problem again on two nodes where we had not deleted and
re-created the muas$server_000.
After re-creating the muas$server_000, everything works well. So I have
recommended that the customer do the same on all the systems, but I'm not able
to explain why.
I have seen this with many customers.

So does anybody have an idea?

Thank you

Michel
1548.3                      KOALA::LAVASH                  Thu Feb 27 1997 07:53 19
    There is not enough information to completely understand the
    problem.  To analyze the access violation in the MUAS Server
    the information from ANALYZE needs to be provided.  Also, were
    any errors logged in any of the MUAS log files?  Did anything
    change recently on the system?  How long was this configuration
    working before the problem occurred?
    
    If this is an issue for your customer, I would recommend the
    following actions to be taken the next time the problem occurs:
    - restart MUAS Servers
    - run DMW$COLLECT_INFO.COM
    - create a Severity 3 level IPMT and include the saveset as
      part of the supporting data.  Also include information such
      as: How often does the problem occur?
          When did this problem start?
          What changed on the system between the time the product
            did not see this problem and when the problem started?
    
    -diana
1548.4  DMW$COLLECT_ERROR   EVTAI1::GODARD                 Fri Feb 28 1997 09:45 55
    
    Even DMW$COLLECT_INFO gives an ACCESS VIOLATION!
    
    I'm connected at the customer site:
    
    @DMW$COLLECT_INFO creates 2 files : 
    	DMW$COLLECT_INFO.DAT and
    	DMW$COLLECT_INFO.ERROR
    but not the save-set.
    
    This is the  DMW$COLLECT_INFO.ERROR
    
    %A1MAIL-I-CREATED, []MUAS$CWF_JNL_SNAPSHOT.DAT created
    %SYSTEM-F-ACCVIO, access violation, reason mask=00, virtual address=06FFFFFC, PC=8041A004, PS=0000001B
    
      Improperly handled condition, image exit forced.
        Signal arguments:   Number = 00000005
                            Name   = 0000000C
                                     00000000
                                     06FFFFFC
                                     8041A004
                                     0000001B
    
        Register dump:
        R0  = 000000007FBC8000  R1  = 0000000007000000  R2  = 000000007FEBC250
        R3  = 0000000007000000  R4  = 0000000000000003  R5  = 0000000000074078
        R6  = 0000000000000000  R7  = 0000000000000001  R8  = 000000007FF9C1F8
        R9  = 000000007F963660  R10 = 000000007FF9D228  R11 = 000000007FFBE3E0
        R12 = 0000000000000000  R13 = 000000007F9F56C8  R14 = 0000000000000000
        R15 = 0000000007000000  R16 = 0000000007000000  R17 = 000000007FBC8008
        R18 = 0000000000CE8673  R19 = 000000007FBC8818  R20 = 0000000077770000
        R21 = 0000000000070000  R22 = FFFFFFFF84180000  R23 = 000000007FBC80E4
        R24 = 0000000000000001  R25 = 0000000000000001  R26 = FFFFFFFF805B8054
        R27 = 000000007FB8A8C0  R28 = 0000000000000030  R29 = 000000007F963000
        SP  = 000000007F962FE0  PC  = FFFFFFFF8041A004  PS  = 200000000000001B
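
The six longwords printed under "Signal arguments" in this note and in 1548.0 appear to follow the usual OpenVMS layout for an access violation: argument count, condition value (0000000C), reason mask, faulting virtual address, PC and PS. The sketch below splits such an array into named fields so the two failures can be compared side by side. The helper name `decode_accvio` is hypothetical, and reading bit 2 of the reason mask as "the access was a write" is an assumption taken from the commonly documented mask layout, so it should be checked against the system documentation before anyone relies on it.

```python
SS_ACCVIO = 0x0000000C  # the "Name" value printed in both dumps


def decode_accvio(signal_args):
    """Split an ACCVIO signal array into named fields.

    signal_args holds the longwords printed under "Signal arguments":
    [count, condition, reason mask, virtual address, PC, PS].
    """
    count, condition, *rest = signal_args
    # The count covers every longword that follows it.
    if count != len(signal_args) - 1:
        raise ValueError("argument count does not match the array length")
    if condition != SS_ACCVIO:
        raise ValueError("not an access violation")
    reason_mask, virtual_address, pc, ps = rest
    return {
        "reason_mask": reason_mask,
        # Assumption: bit 2 set means the faulting access was a write.
        "write_attempt": bool(reason_mask & 0x04),
        "virtual_address": virtual_address,
        "pc": pc,
        "ps": ps,
    }


# Values copied from the MUAS$SERVER dump (1548.0) and the
# DMW$COLLECT_INFO dump (1548.4).
server = decode_accvio([0x5, 0xC, 0x4, 0x7F8E9D80, 0x005CA0F4, 0x1B])
collect = decode_accvio([0x5, 0xC, 0x0, 0x06FFFFFC, 0x8041A004, 0x1B])

for label, fault in (("MUAS$SERVER", server), ("DMW$COLLECT_INFO", collect)):
    print(label, f"VA={fault['virtual_address']:08X}",
          f"PC={fault['pc']:08X}", "write" if fault["write_attempt"] else "read")
```

Under that assumption, the two failures differ: the server fault has the write bit set in its reason mask, while the DMW$COLLECT_INFO fault does not.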
    
    I created the IPMT and attached the 2 files.
    
    
    Thanks
    Monique
    
1548.5                      KOALA::LAVASH                  Mon Mar 03 1997 07:11 22
    Comment out the lines that are causing the problem.
    
    In this case, it looks like the following group of commands
    are causing the problem:
    
        MANAGE WORK_FILE
        SNAPSHOT
        START []MUAS$CWF_JNL_SNAPSHOT.DAT /NOLOCK
        LIST
        CANCEL
        SPAWN DELETE/NOCONFIRM/NOLOG []MUAS$CWF_JNL_SNAPSHOT.DAT;
    
    Try just commenting out the lines:
    	SNAPSHOT
    and
    	SPAWN DELETE/NOCONFIRM/NOLOG []MUAS$CWF_JNL_SNAPSHOT.DAT;
    
    If there is still an access violation, comment out the
    other four lines.  Make a note of this when submitting the
    IPMT case.
    
    -diana
1548.6  same problem again  EVTAI1::GODARD                 Fri Mar 21 1997 11:08 11
    The same problem has reappeared; the customer had to re-create the server
    on 5 nodes.
    The ECSO is logged as no. 7381.
    
    When the problem appears on one node, the situation seems to
    deteriorate on several nodes.
    The customer has noticed that the problem appears on ALPHA.
    
    thanks for help
    Monique