[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference iosg::all-in-1_v30

Title:*OLD* ALL-IN-1 (tm) Support Conference
Notice:Closed - See Note 4331.l to move to IOSG::ALL-IN-1
Moderator:IOSG::PYE
Created:Thu Jan 30 1992
Last Modified:Tue Jan 23 1996
Last Successful Update:Fri Jun 06 1997
Number of topics:4343
Total number of notes:18308

2891.0. "EW/MJU Infinite loop" by GANTRY::HULL (Digital Services Delivery - Motown) Mon Jun 21 1993 16:09

We are running A1V3.0-1 under VMS V5.5-2 and have the Mail Janitor V3.0 kit
installed.

I ran EW (MJU) beginning Friday night, and it racked up over 7 hours of CPU
time and only made it into the F's in the usernames. It got hung up on 1
user acct and remained there for over 2 days.  This is the 3rd month in a
row that something like this has hit with EW/MJU.

I looked in the user's acct with NEWDIR.  This user had never logged into
A1 since our V3.0 upgrade, because his Nicknames file was converted when I
entered the acct.  He only has 496 docs in the file cabinet, and the
cabinet in not corrupted in any way.  I ran TRU against his acct and it
came up clean. He did have 5 Inbox messages waiting, but that shouldn't be
any fatal condition.

There is nothing in the log file to indicate that anything is wrong.  It
just kept working on that acct forever.

I really can't see trying to run the MJU with Tracing enabled, as the
resulting huge logfile would eat our diskspace up in no time.  We need help
to resolve this!

Last entry in logfile:

Drawer: [FEDAK]MAIN is being processed at 08:51pm
Drawer owned by VMS account: FEDAK Last log-in at: 02-Mar-1993 09:30pm
User last logged in more than 4 weeks before last EW run
Drawer: [FEDAK]MAIN emptying wastebasket
Drawer: [FEDAK]MAIN Owner assumed to be inactive.
[EOB]


TRU Summary:

ALL-IN-1 File Cabinet Verification and Repair Program
=====================================================
Version 3.1-5

Beginning Phase 1 at 08:52 AM -- scanning users' personal File Cabinets
=======================================================================

A summary of the repairs made to personal File Cabinets is given
at the end of this phase

Scanning Drawer [FEDAK]MAIN at 08:52 AM
------------------------------------------

Number of DOCDB records read:    496  - 08:52 AM

All requested users processed

No repairs made to personal File Cabinets
-----------------------------------------


Regards,

	Al
T.RTitleUserPersonal
Name
DateLines
2891.1a real mysteryGANTRY::HULLDigital Services Delivery - MotownMon Jun 21 1993 17:0136
I also reset the Dept field for this user and ran EW on just that lone
acct, and it finished in a couple of minutes with no errors. ???  So why
was it hanging up the system?

Log:

Calculated 4 weeks from last clean EW: 18-May-1993
Only the following departments will be processed:

  EWTEST

Drawer: [FEDAK]MAIN is being processed at 10:50am
Drawer owned by VMS account: FEDAK Last log-in at: 02-Mar-1993 09:30pm
User last logged in more than 4 weeks before last EW run
Drawer: [FEDAK]MAIN emptying wastebasket

Drawer: [FEDAK]MAIN Owner assumed to be inactive.
Drawer: [FEDAK]MAIN reorganized
Last clean run date now set to:  21-Jun-1993

        MANAGER finished using ALL-IN-1 at 21-Jun-1993 10:52am

****************************
Log of reorganized calendars
****************************

Converting "FEDAK_DEV:[FEDAK.OA]FEDAK.A1CAL" using FDL file
"OA$LIB:CALENDAR.FDL
"
"FEDAK_DEV:[FEDAK.OA]FEDAK.A1CAL" converted - was 2 blocks, now 2 blocks
Converting "FEDAK_DEV:[FEDAK.OA]ACTITEM.DAT" using FDL file
"OA$LIB:ACTITEM.FDL"
"FEDAK_DEV:[FEDAK.OA]ACTITEM.DAT" converted - was 3 blocks, now 3 blocks
%SM_EW-I-DONE processing complete for %SM_EW
%SMJACKET-I Invoking common END_BATCH procedure
%END_BATCH-I Performing End batch job procedures for "EW"
2891.2Disk space?IOSG::MAURICENight rolls in, my dark companionMon Jun 21 1993 18:2821
    Hi,
    
    Once again it looks like the problem is with the reorganize. Here are
    some questions:
    
    1. Is free disk space low on the Manager's account?
    
    2. Is free disk space low on the FEDAK account?
    
    3. Does the Manager account have *default* privilege EXQUOTA? 
    
    If the answer to (3) is no, then:
    
    4. Does the Manager account have low quota.
    
    5. Does the Manager or FEDAK account account have low quota on the disk
       used by the FEDAK account?
    
    Cheers
    
    Stuart
2891.3TONS of roomGANTRY::HULLDigital Services Delivery - MotownMon Jun 21 1993 19:3960
Re:    <<< Note 2891.2 by IOSG::MAURICE "Night rolls in, my dark companion" >>>
                                -< Disk space? >-
Stuart - 

>    Once again it looks like the problem is with the reorganize. Here are
>    some questions:
    
>    1. Is free disk space low on the Manager's account?

No quotas enabled on the ALL-IN-1 disk. 1.9Million blocks free on device
    
>    2. Is free disk space low on the FEDAK account?

No.  19K/75K
    
>    3. Does the Manager account have *default* privilege EXQUOTA? 
    
Yes.

Username: ALLIN1                           Owner:  Allin1 manager
Account:                                   UIC:    [2,4] ([ALLIN1])
CLI:      DCL                              Tables: DCLTABLES
Default:  USER0:[ALLIN1]
LGICMD:
Flags:  DisPwdDic
Primary days:   Mon Tue Wed Thu Fri
Secondary days:                     Sat Sun
No access restrictions
Expiration:            (none)    Pwdminimum: 15   Login Fails:     0
Pwdlifetime:           (none)    Pwdchange:  17-APR-1993 16:31
Last Login: 20-JUN-1993 18:32 (interactive), 21-JUN-1993 13:31
(non-interactive)
Maxjobs:         0  Fillm:       100  Bytlm:        51200
Maxacctjobs:     0  Shrfillm:      0  Pbytlm:           0
Maxdetach:       0  BIOlm:        50  JTquota:       3072
Prclm:          10  DIOlm:        50  WSdef:          512
Prio:            4  ASTlm:       100  WSquo:         2048
Queprio:         0  TQElm:        50  WSextent:      9999
CPU:        (none)  Enqlm:       350  Pgflquo:      40960
Authorized Privileges:
  CMKRNL SYSNAM GRPNAM DETACH PRMMBX ALTPRI SETPRV TMPMBX WORLD
  OPER EXQUOTA NETMBX VOLPRO PHY_IO PRMGBL SYSGBL SYSPRV SYSLCK
  READALL
Default Privileges:
  CMKRNL SYSNAM GRPNAM DETACH PRMMBX ALTPRI SETPRV TMPMBX WORLD
  OPER EXQUOTA NETMBX VOLPRO PHY_IO PRMGBL SYSGBL SYSPRV SYSLCK
  READALL
Identifier                         Value           Attributes
  OA$ADMIN                         %X8001100E
  OA$PRVAPP                        %X8001100F      RESOURCE
  OA$MANAPP                        %X8001009F      RESOURCE
  OA$MANAGER                       %X800100A0
  OA$USER_QM                       %X800100A2
  OAFC$SYSMAN                      %X800100A3
UAF> ex


I just don't see disk quota as being part of this particular problem.

	Al
2891.4Need more info.IOSG::CHINNICKgone walkaboutTue Jun 22 1993 10:4020
    
    We really need to know what it's doing at the time that it hangs...
    
    You could try just looking at the open files at the time using either
    SDA SHOW PROCESS/CHAN or $ SHOW DEVICE/FILES. This might give a hint as
    to what it's doing when it hangs.
    
    The other thing to look at is a $ SHOW PROCESS/CONTINUOUS and see what
    it is doing when it hangs - lots of direct I/O or just CPU time? What
    processor mode is it spending its time in?
    
    Finally, it might just be worth running a $ ANALYZE/RMS/CHECK on the
    data files for that account.
    
    If I had to guess at the moment - I'd say you have a quota problem. Try
    pushing up:
    	 	ENQLM:		1200
    		FILLM:		200
    
    Paul.
2891.5It looks like DOCDB reorganisation looping !BUSHIE::SETHIIt&#039;s not wise to have wisdom teethMon Jul 26 1993 06:3735
    Hi Paul and Stuart,
    
    I have a customer with ALL-IN-1 IOS 3.0-1 installed and I have checked
    all of the above.  On the one account the Empty Wastebasket HK goes
    into a loop when run from the ALL-IN-1 managers account, however from
    the administrators account everythings runs okay so the customer tells
    me.
    
    Having logged in I saw lot's of DOCDB.TEMP files and it looks like it's
    the reorganisation of the DOCDB that's causing a problem.  The file is
    120/660 blocks, I did a convert/fdl and the file size decreased 66/120.
    ANALYZE/RMS/CHECK showed no error on both the DAF and DOCDB (by the way
    I also did a convert/fdl on the daf.dat).
    
    Having looked at the process via the SDA, I saw that the process was
    executing the script SMREORG_JAN.SCP and the image FDLSHR.EXE was
    active, there were several others but these are the most important ones
    I guess.  I didn't see the i/o rate as being abnormal nor the cpu
    usage. Show device/files/out did not show any files being accessed on
    the user disk.
    
    The customer is going to up the limit of the ENQULM and FILLM as
    suggested by Paul.  The customer has said that the Admin. account had
    the same quotas as the Managers account. The default limits are Fillm
    100 and Enqlm 350.
    
    Should you wish to examin the logfile of my login session it can be
    found on RIPPER::USER$TSC:[SETHI]Q71724.LOG.  I will know in the
    tomorrow if the EW finnished, we are running it for this account only.
    
    Regards,
    
    Sunil
    
    PS - Stuart nice to see you again
2891.6Increased FILLM and ENQLM and it worked !!! BUSHIE::SETHIIt&#039;s not wise to have wisdom teethWed Jul 28 1993 01:3312
    Hi All,
    
    We increased the FILLM and ENQLM as suggested by Paul and the HK ran to
    completion.  This leaves the question,  why is it that the Adim.
    account had quotas that are exactly the same as the ALL-IN-1 managers
    accounts (before they were increased) and the job ran to completion ?
    I would have expected the same result in both the cases.  Is there a
    reason behind all of this ?
    
    Regards,
    
    Sunil
2891.7One sample does not a statistic makeIOSG::CHINNICKgone walkaboutThu Jul 29 1993 11:5113
    
    Well,
    
    I think it's a little early to conclude that the quota changes have
    fixed it. Let's reserve judgement for the present.
    
    If quotas are related, then the consumption of resources will depend
    upon the number of files accessed and processed and number of users and
    and and...
    
    So, it may work some runs and fail on others.
    
    Paul.