
Conference iosg::all-in-1_v30

Title:*OLD* ALL-IN-1 (tm) Support Conference
Notice:Closed - See Note 4331.l to move to IOSG::ALL-IN-1
Moderator:IOSG::PYE
Created:Thu Jan 30 1992
Last Modified:Tue Jan 23 1996
Last Successful Update:Fri Jun 06 1997
Number of topics:4343
Total number of notes:18308

1730.0. "RSF Housekeeping Procedure Questions" by EVOIS7::DEC_HELLAS () Fri Nov 06 1992 08:14

    Hi,
    
    Below is a log file from a run of the RSF Housekeeping procedure. It
    should check the files listed in the "ALL-IN-1 Mgmt Guide", pp 10-15, 10-16.
    
    However, because a couple of files are open at that time, the procedure
    fails and also sends a mail to Manager. Of all these files, the most
    interesting for saving disk space are the OA$SHARx:OA$DAF_x.DAT files.
    However, it seems that my RSF has not touched these files at all. Why?
    Also, what can be done to prevent these failures?
    
    Regards,
    Nicholas
    
    The log follows:
    
    ===========================================================================
    $! SMJACKET.COM         V3.0 - 004
    $!
    $! Details to the end of the file
    $!---------------------------------------------------------------------------
    $       save_ver = f$verify(0)
    
    Ensure the form libraries are open
    Preset status values
    See if we need to shut down - and store the result
    Get the name of the submitter from the schedule file
    Check that the submitter in the schedule file still
    exists in the profile
    Submitter is a valid account entry
    Verify that the submitter has a VMS account entry in the profile
    Verify that the submitter's VMS account is still valid
    See if the submitter holds VMS identifier OA$MANAGER
    Passed security checks - submitter has appropriate identifier(s)
    Autorescheduling - set next run date for repeated run
    Now reschedule the job - calling queue_job.scp
    Set the job parameters
    Procedure completed - exiting SMJACKET.SCP
    %OA-I-LASTLINE, Auto-rescheduling procedure for 19-Nov-1992 00:05am Please wa
    
    OA-I-LASTLINE, Auto-rescheduling procedure for 19-Nov-1992 00:05am . Please wa
    %OA-I-LASTLINE, Scheduling operation completed successfully
    %SMJACKET-I Commencing batch execution of utility type "RSF" at 5-NOV-1992 00:
    %SMJACKET-I Invoking common START_BATCH procedure
    %START_BATCH-I Performing Start batch job procedures for "RSF"
    %START_BATCH-I SYSPRV privilege enabled
    %START_BATCH-I  Attempt to take out a lock on the sender and fetcher
    files
    %START_BATCH-I SENDLOCK.DAT and FETCHLOCK.DAT locked
    %START_BATCH-I running ALLIN1 under account "MANAGER" (/NOINIT/REENTER)
    
    Stopping server ATH651::"73="
    %OA-I-SHUT_NOTIFIED, All users have been notified of the shutdown
    %START_BATCH-I performing %START_BATCH exit and cleanup processing
    %START_BATCH-I SENDLOCK.DAT and FETCHLOCK.DAT unlocked
    %START_BATCH-I processing completed ok for %START_BATCH
    %SMJACKET-I Invoking Utility command file "OA$LIB:SMSREORG.COM"
    
    Shared file reorganization starting
    Converting "OA$DATA_LLV:FORMAT.DAT" using FDL file "OA$LIB:FORMAT.FDL"
    "OA$DATA_LLV:FORMAT.DAT" converted - was 36 blocks, now 29 blocks
    Converting "OA$DATA_SHARE:ARCHIVE_DOCS_PENDING_RECOVERY.DAT" using FDL
    file "OA
    "OA$DATA_SHARE:ARCHIVE_DOCS_PENDING_RECOVERY.DAT" converted - was 3 blocks, now
    Converting "OA$DATA_SHARE:ARCHIVE_SETS_DATA.DAT" using FDL file
    "OA$LIB:ARCHIVE
    "OA$DATA_SHARE:ARCHIVE_SETS_DATA.DAT" converted - was 20 blocks, now 20 blocks
    Converting "OA$DATA_SHARE:ATTENDEE.DAT" using FDL file
    "OA$LIB:ATTENDEE.FDL"
    "OA$DATA_SHARE:ATTENDEE.DAT" converted - was 726 blocks, now 627 blocks
    Converting "OA$DATA_SHARE:ATTENDEE_OVERFLOW.DAT" using FDL file
    "OA$LIB:ATTENDE
    "OA$DATA_SHARE:ATTENDEE_OVERFLOW.DAT" converted - was 12 blocks, now 2 blocks
    Converting "OA$DATA_SHARE:MEETING.DAT" using FDL file
    "OA$LIB:MEETING.FDL"
    "OA$DATA_SHARE:MEETING.DAT" converted - was 114 blocks, now 50 blocks
    
    Converting "OA$DATA_SHARE:OA$SHARED_DAF_MASTER.DAT" using FDL file
    "OA$LIB:OA$S
    "OA$DATA_SHARE:OA$SHARE_DDAF_MASTER.DAT" converted - was 8 blocks, now 8 blocks
    Converting "OA$DATA_SHARE:OA$SHARED_DIRECTORY_MASTER.DAT" using FDL file "OA$LI
    "OA$DATA_SHARE:OA$SHARED_DIRECTORY_MASTER.DAT" converted - was 11 blocks, now 1
    Converting "OA$DATA_SHARE:OA$SM_UTILITY_MASTER.DAT" using FDL file "OA$LIB:OA$S
    "OA$DATA_SHARE:OA$SM_UTILITY_MASTER.DAT" converted - was 14 blocks, now 14 bloc
    Converting "OA$DATA_SHARE:OA$SM_UTIL_SCHEDULE.DAT" using FDL file "OA$LIB:OA$SM
    "OA$DATA_SHARE:OA$SM_UTIL_SCHEDULE.DAT" converted - was 180 blocks, now 77 bloc
    Converting "OA$DATA_SHARE:PENDING.DAT" using FDL file "OA$LIB:PENDING.FDL"
    %CONV-F-OPENIN, error opening
    A1$SYSTEM:[ALLIN1.DATA_SHARE]PENDING.DAT;7 as input
    -RMS-E-FLK, file currently locked by another user
    Convert failed for "OA$DATA_SHARE:PENDING.DAT"
    Converting "OA$DATA_SHARE:PARTITION.DAT" using FDL file
    "OA$LIB:PARTITION.FDL"
    %CONV-F-OPENIN, error opening
    A1$SYSTEM:[ALLIN1.DATA_SHARE]PARTITION.DAT;1 as i
    -RMS-E-FLK, file currently locked by another user
    Convert failed for "OA$DATA_SHARE:PARTITION.DAT"
    %SMJACKET-I Invoking common END_BATCH procedure
    %END_BATCH-I Performing End batch job procedures for "RSF"
    
    OA-I-LASTLINE,
    %OA-I-LASTLINE,
    %END_BATCH-I Deleting old log files
    "OA$LOG:REORGANIZE_SYS_FILES_SM.LOG*;*" bef
    %END_BATCH-I Purging old log files
    "OA$LOG:REORGANIZE_SYS_FILES_SM.LOG*;*" to "
    %END_BATCH-I Total of 1 logfiles, purge to 01, means delete 0
    %END_BATCH-I Deleting old log files
    "OA$LOG:REORGANIZE_SYS_FILES_SA.LOG*;*" bef
    %END_BATCH-I Purging old log files
    "OA$LOG:REORGANIZE_SYS_FILES_SA.LOG*;*" to "
    %END_BATCH-I Total of 0 logfiles, purge to 01, means delete -1
    %END_BATCH-I performing %END_BATCH exit and cleanup processing
    %END_BATCH-I processing completed ok for %END_BATCH
    
    %END_BATCH-I processing completed ok for %END_BATCH
    %SMJACKET-E- Internal error in housekeeping procedure
    %SMJACKET-I performing %SMJACKET exit and cleanup processing
    %SMJACKET-I close lockfiles
    
    %OA-I-LASTLINE, Submitting server ATH651::"73=" startup to batch...
    %OA-I-LASTLINE, Working...
    %OA-I-LASTLINE,
    %SMJACKET-I %SMJACKET facility exiting due to error
    %SMJACKET-E processing completed with an error for %SMJACKET
    
    %EMD-I-USESEND, Send this message when you are ready
    %EMD-I-MESSENT, The message has been sent
    %EMD-I-PUTINWB, 1 message placed in your Wastebasket
      ALLIN1       job terminated at  5-NOV-1992 00:12:49.46
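A log like the one above is easier to judge once the conversions are tallied. Below is a minimal Python sketch that does this, assuming only the two message shapes visible in the log ("... converted - was N blocks, now M blocks" and "Convert failed for ..."). The function name `summarize` is made up for illustration and is not part of ALL-IN-1. Lines that the 80-column log has truncated are skipped rather than guessed at.

```python
import re

# Message shapes copied from the RSF log above. A truncated line
# (e.g. one ending in "now 1" or "now 14 bloc") will not match.
CONVERTED = re.compile(
    r'^"(?P<file>[^"]+)" converted - was (?P<was>\d+) blocks, now (?P<now>\d+) blocks'
)
FAILED = re.compile(r'^Convert failed for "(?P<file>[^"]+)"')


def summarize(log_text):
    """Return ({file: (was, now)}, [failed files]) for an RSF log."""
    converted = {}
    failed = []
    for raw in log_text.splitlines():
        line = raw.strip()
        match = CONVERTED.match(line)
        if match:
            converted[match["file"]] = (int(match["was"]), int(match["now"]))
            continue
        match = FAILED.match(line)
        if match:
            failed.append(match["file"])
    return converted, failed


sample = '''
"OA$DATA_LLV:FORMAT.DAT" converted - was 36 blocks, now 29 blocks
"OA$DATA_SHARE:MEETING.DAT" converted - was 114 blocks, now 50 blocks
Convert failed for "OA$DATA_SHARE:PENDING.DAT"
Convert failed for "OA$DATA_SHARE:PARTITION.DAT"
'''
converted, failed = summarize(sample)
saved = sum(was - now for was, now in converted.values())
print("failed:", failed)
print("blocks saved:", saved)
```

On the real log, any OA$DAF_x.DAT file that appears in neither result was never attempted, which is the symptom described above.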
    
    
    
T.R  Title  User  Personal Name  Date  Lines
1730.1. "SMREORG.COM exits after 2nd error" by CESARE::EIJS (All in 1 Piece) Fri Nov 06 1992 09:50, 22 lines
    
    Nicholas,
    
    > However, it seems that my RSF has not touched these files at all.
    > Why?
    
    Well, there are a number of errors and SMREORG.COM checks for the
    errors. I always thought that 'On error then Continue' allowed the code
    to continue after the first hit, but wouldn't after a second. After the
    second error, SMREORG.COM just stops, which happens to be before the
    reorganization of SDAF, so it is quite understandable.
    
    > Also, what can be done to prevent these failures?
    
    Apparently someone was logged in. I don't think it was the POSTMASTER
    causing the problems, as the Sender and Fetcher are locked. Perhaps
    someone who is allowed to do $ALLIN1/OVERWRITE=SHUTDOWN. The MANAGER?
    
    Ciao,
    
    	Simon
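The "continues after the first error, stops after the second" behaviour matches how DCL documents ON: the action given in an ON command is performed once, after which the default (ON ERROR THEN EXIT) is back in force. Whether SMREORG.COM really relies on a single 'On error then Continue' is an assumption here, not something checked against the source. The Python sketch below only models that one-shot rule. `run_steps` and the step names are illustrative.

```python
def run_steps(steps):
    """Return the names of the steps that were attempted.

    steps is a list of (name, succeeded) pairs in execution order.
    """
    continue_armed = True  # models a single ON ERROR THEN CONTINUE
    attempted = []
    for name, succeeded in steps:
        attempted.append(name)
        if succeeded:
            continue
        if not continue_armed:
            break  # default action is back in force: exit the procedure
        continue_armed = False  # the ON action has been used up
    return attempted


# Hypothetical step order, loosely following the log in .0
steps = [
    ("FORMAT.DAT", True),
    ("PENDING.DAT", False),    # first error: procedure continues
    ("PARTITION.DAT", False),  # second error: procedure exits
    ("OA$DAF_x.DAT", True),    # never reached
]
print(run_steps(steps))
```

Under this model the shared DAF files are never reached once two earlier conversions fail, which fits the log in .0.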
     
1730.2. "Cannot find out who else was using the files..." by EVOIS8::DEC_HELLAS () Fri Nov 06 1992 12:46, 11 lines
    Simon,
    
    I don't think there was anyone with the option /over=shutdown logged
    into the system. The only one who uses and controls the VMS account
    "ALLIN1" and the ALL-IN-1 account "Manager" is me.
    And on that specific night there were no other H-K procedures scheduled!
    What else could it be?
    
    rgrds
    
    Nicholas

1730.3. "not the RSF developer ..." by IOSG::TYLDESLEY () Fri Nov 06 1992 14:38, 17 lines
    I would agree with Simon's analysis that some process is holding onto
    pending and partition files, and then smreorg.com bombs out. The 
    immediate thought goes to the Sender and Fetcher, but as far as I can 
    see these have been correctly closed down, and your job has moved on
    into start_batch.scp. Here we get the actual shutdown. Now it's just
    possible, that a) the shutdown did not kill all processes (some spawned
    ones still hanging around?) or b) the sm_fc_server_stop.scp did not do
    the business, leaving some locks open on the two files mentioned (have
    you other FC servers on the system?). 
    
    To proceed, you might want to try a show dev/files on those data files,
    or possibly set trace and verify in some of the chain of procedures
    involved e.g. smreorg.com, smjacket.com, smjacket.scp, start_batch.com,
    start_batch.scp and sm_fc_server_stop.scp (all OA$LIB).
    
    Cheers,
    DaveT                          

1730.4. "ident" by IOSG::TYLDESLEY () Fri Nov 06 1992 14:45, 5 lines
    ... oh and nearly forgot, check that your manager account has the 
    identifier OAFC$SYSMAN in order to manage servers (long shot!).
    
    Cheers,
    DaveT

1730.5. "HouseKeeping Procs. still does not work!" by EVOIS7::DEC_HELLAS () Mon Nov 09 1992 10:25, 39 lines
    Dave,
    
    The identifier OAFC$SYSMAN does exist in the ALLIN1 VMS user profile.
    However, in the night from Saturday to Sunday, at 02:30 am, I had
    scheduled the "test and repair mail areas" H-K procedure. The
    procedure exited again because the file OA$DATA_SHARE:PARTITION.DAT
    was locked!
    
    Here are a few lines from the log file
    OA$SM_FCVR_MAIL_AREA.LOG1992110802300000;1:
    
    
    OA-I-LASTLINE, 02:36am
    %OA-I-LASTLINE, 02:36am
    <CR><LF><CR><LF><CR><LF><CR><LF><CR><LF>        MANAGER finished using
    ALL-IN-1
    ALL-IN-1 File Cabinet Verification and Repair Program
    =====================================================
    
    Version 3.0-13
    
    Could not open partition file OA$DATA_SHARE:PARTITION.DAT
            ?File is locked
    Most likely cause is that the File Cabinet Server has not stopped.
    This error can also occur if accounts are still logged in to ALL-IN-1.
    Terminating program
    %SMJACKET-I Invoking common END_BATCH procedure
    %END_BATCH-I Performing End batch job procedures for "TRM"
    
    
    1. In the log file I can see that the FC Server has been stopped.
    2. When ALL-IN-1 is shutting down it kicks out all users.
    
    Then, what is wrong?
    
    rgrds
    
    Nicholas
    
1730.6. "look for the processes" by IOSG::TYLDESLEY () Mon Nov 09 1992 11:03, 14 lines
    Hello Nicholas,
    
    The log file does imply, as I thought, that the FCS is not being 
    closed down. Please check with $sho device/files oa$data:  to see 
    which processes have the partition file open. Perhaps you might want
    to kill them off and try the housekeeping again. 
                                      
    Also, check SM MFC MS to see the status of FC servers on your system.
    Lastly, please set trace to A1TRACE.LOG in
    oa$lib:sm_fc_server_stop.scp, in order to see what is going wrong with
    the FC server shutdown.
    
    Cheers,
    DaveT

1730.7. "" by ANGLIN::HARRISA (user vicious) Tue Jun 01 1993 22:26, 34 lines
    i'm seeing a similar problem on 1 of my systems also.
    
    ALL-IN-1 3.0-1, VMS 5.5
    
    When RSF ran (1st time since upgrade), the following error appeared in
    the log file...
    
    Converting "OA$DATA_SHARE:PENDING.DAT" using FDL file
    "OA$LIB:PENDING.FDL"
    %CONV-F-OPENIN, error opening
    DATA$DISK:[ALLIN1.DATA_SHARE]PENDING.DAT;18 as in
    -RMS-E-FLK, file currently locked by another user
    Convert failed for "OA$DATA_SHARE:PENDING.DAT"
    .
    .
    Shared area reorganization starting
    Converting "DATA$DISK:[ALLIN1.SHARED_E]OA$DAF_E.DAT" using FDL file
    "OA$LIB:SDA
    %CONV-F-OPENIN, error opening
    DATA$DISK:[ALLIN1.SHARED_E]OA$DAF_E.DAT;18 as inp
    -RMS-E-FLK, file currently locked by another user
    Convert failed for "DATA$DISK:[ALLIN1.SHARED_E]OA$DAF_E.DAT"
    .
    .
    
    Looks like all the other .DAT files converted.  Since everything else
    looks ok, I'm assuming (know I shouldn't do that) that there must have
    been a sub-process going somewhere that didn't get shut down.  What can
    be done to ensure that subprocesses get logged out when ALL-IN-1 shuts
    down?
    
    	ann
    
    
1730.8. "Rounding up the usual suspect!" by AIMTEC::WICKS_A (June 7-13 Real Football in the U.S) Wed Jun 02 1993 02:53, 9 lines
    Ann,
    
    Are you sure it wasn't the FCS that had those files locked?
    
    P.S thanks for the "secret" package (:==:)
    
    Regards,
    
    Andrew.D.Wicks

1730.9. "postits R us" by ANGLIN::HARRISA (user vicious) Wed Jun 02 1993 16:28, 6 lines
    well... in the top part of the log file, the server was stopped.
    in the bottom of the file, the server was started.
    i looked in the log file for the server startup and it looked ok (no
    errors).
    
    	ann

1730.10. "different system, different problem" by ANGLIN::HARRISA (hooked on DAVE) Sun Oct 24 1993 22:50, 34 lines
    ok, another RSF question:
    
    this is what the log showed from the run last night:
    
    Converting "DATA$DISK:[OA$SHARA]OA$DAF_A.DAT" using FDL file
    "OA$LIB:SDAF.FDL"
    %SYSTEM-W-NOSUCHFILE, no such file
     \DATA$DISK:[OA$SHARA]OA$DAF_A.DAT;-1\
    %SYSTEM-W-NOSUCHFILE, no such file
     \DATA$DISK:[OA$SHARA]OA$DAF_A.DAT;-1\
    %RMS-E-FNF, file not found
    Convert failed for "DATA$DISK:[OA$SHARA]OA$DAF_A.DAT"
    Converting "DATA$DISK:[A1DATA.DATA_SHARE]OA$DAF_E.DAT" using FDL file
    "OA$LIB:S
    "DATA$DISK:[A1DATA.DATA_SHARE]OA$DAF_E.DAT" converted - was 272541
    blocks, now 
    Converting "DATA$DISK:[OA$SHARD]OA$DAF_D.DAT" using FDL file
    "OA$LIB:SDAF.FDL"
    %SYSTEM-W-NOSUCHFILE, no such file
     \DATA$DISK:[OA$SHARD]OA$DAF_D.DAT;-1\
    %SYSTEM-W-NOSUCHFILE, no such file
     \DATA$DISK:[OA$SHARD]OA$DAF_D.DAT;-1\
    %RMS-E-FNF, file not found
    Convert failed for "DATA$DISK:[OA$SHARD]OA$DAF_D.DAT"
    
    
    I have customized SMREORG on this system to do the DAFs in a certain
    order (because of size), and this run did not even try to convert DAF_C.
    DAF_C is on a totally different disk.  SMREORG has also been customized
    to do PURGE/KEEP=1; we just don't have the space for 2 versions of all
    these files.
    
    	ann