[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference iosg::all-in-1_v30

Title:*OLD* ALL-IN-1 (tm) Support Conference
Notice:Closed - See Note 4331.l to move to IOSG::ALL-IN-1
Moderator:IOSG::PYE
Created:Thu Jan 30 1992
Last Modified:Tue Jan 23 1996
Last Successful Update:Fri Jun 06 1997
Number of topics:4343
Total number of notes:18308

4174.0. "TRU/TRM keep running and running and running..." by ANGLIN::HARRISA (Non practising good person) Tue May 17 1994 22:06

    ALl-IN-1 3.0 OVMS 5.5-2
    
    for 2 trys now, I have a customer site that the TRU and TRM jobs run
    for days!  TRU will start around 1am Sunday morning. It has taken about
    2 hours (if that long!) in the past. TRM is set to start them abut
    3:30AM on sunday morning. 
    
    when I came in today (tuesday) BOTH jobs were still executing! this is
    a 2 node cluster and both nodes have been up for 22+ days.
    
    I've looked at the a1sub.log's and smtmp1,2,3.log's and they don't show
    anything relevent (to me anyway). emtmp4,5.log's had not been written
    to yet (so i know TRM didn't get that far).
    
    TRM ran successfully on May 1 as did TRU.  
    
    Even though TRM was "executing" users were still able to log inot
    ALL-IN-1.
    
    has anyone ever seen this before? any ideas?
    
    	thanks - ann
    
T.RTitleUserPersonal
Name
DateLines
4174.1MON SYS?IOSG::PYEGraham - ALL-IN-1 Sorcerer's ApprenticeWed May 18 1994 19:496
    Was anything else doing I/O running, like backups or defrag?
    
    Has someone given the ALLIN1 (sic) account crummy SYSUAF params? Or
    done the same to the queue FCVR's running on?
    
    Graham
4174.2anything here? thanks!ANGLIN::HARRISANon practising good personWed May 18 1994 20:4716
    the SYSUAF paramenters for ALLIN1 account:
    
    Maxjobs:         0  Fillm:       100  Bytlm:        36000
    Maxacctjobs:     0  Shrfillm:      0  Pbytlm:           0
    Maxdetach:       0  BIOlm:        50  JTquota:       2048
    Prclm:          10  DIOlm:        50  WSdef:          600
    Prio:            4  ASTlm:       100  WSquo:         1500
    Queprio:         0  TQElm:        50  WSextent:      3000
    CPU:        (none)  Enqlm:       350  Pgflquo:      20000
    
    the que that TRU/TRM runs on:
    
    Batch queue COSMV7$MAINT, idle, on COSMV7::
      /BASE_PRIORITY=4 /JOB_LIMIT=50 /OWNER=[SYSTEM] /PROTECTION=(S,O,G,W)
      /WSDEFAULT=1024 /WSEXTENT=65535 /WSQUOTA=1024
    
4174.3ANGLIN::HARRISANon practising good personWed May 18 1994 21:2611
    a bit more history - both jobs ran from Sunday early morning til monday
    afternoon the weekend of April 17. 
    
    Both jobs ran fine weekend of may 1. Both jobs ran early sunday til
    late Tuesday afternoon on May 15. 
    
    All 3 times the jobs ran on the same que.  No one (at least no one has
    admitted to) modifying the UAF for ALLIN1 or the queue parameters.
    
    	ann
    
4174.4go figure!?ANGLIN::HARRISANon practising good personMon May 23 1994 20:224
    i had both TRM and TRU run this past weekend. i set them up as ALL-IN-1
    housekeeping jbs running just once. of cource they ran fine. 
    
    	ann