
Conference csc32::consolemanager

Title:POLYCENTER Console Manager
Notice:Kits, Scans, Docs on CSC32:: as PCM$KITS:,PCM$DOCS:, PCM$SCANS:
Moderator:CSC32::BUTTERWORTH
Created:Thu Aug 06 1992
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:1541
Total number of notes:6564

572.0. "RWMBX, PCM Hangs & Reboots Must Be Resolved" by BSS::LIND (Jim Lind; 592-4099 CX03-1/N14 CNMC-West) Sat Jan 21 1995 21:49

    SUBJECT:	RWMBX, PCM Hangs, Reboots - an ongoing issue
    		NEED to get PCM STABLE and RELIABLE

    The CXO3 production center uses PCM to control all production systems
    in use in the Western territory at CXO3, some 190+ nodes in all.

    We continue to suffer RWMBX problems ...

    	THIS PROBLEM IS URGENT, AND NEEDS RESOLUTION ASAP, as I am sure
    	is the case with many of our customers.

    	We have had to reboot the PCM system on the average of one to two
    	times per week, less if you don't use it, and this is an
    	unacceptable level of performance by any measure.

    ENVIRONMENT:

    	VMS V6.1 w/PCM V1.005 (w/MUP)
    	VAX 6620 w/128 meg, DEMNA, KDM70, KDB50
    	FOUR RA71 (two system, two user/page, shadowed) 
    	FOUR RA72 (stripe for logging)

    Since the CX3PCM system is for Digital PRODUCTION use, it is essential
    that this system remain up and available at all times.  PCM has FAR too
    many points of failure to be production ready on the VMS/VAX
    platform, and we need immediate assistance in correcting this problem.

    The CX3PCM system is available for monitoring by Engineering and 
    CSC support personnel in an effort to help identify and fix the 
    problems that many PCM users are experiencing.

    If Engineering is having difficulties demonstrating the problems, it
    is suggested that they work with CXO3 personnel to help resolve issues
    and get this thing stable for the next release by using, changing 
    and monitoring the CXO3 platform.

    The following CNS folks are available to help in this regard:

    	Jim Lind		Ron van Zuylen
    	BSS::LIND		MISERY::RON
    	DTN 592-4099		DTN 521-6667

    	Eric Leader		Judy Aguilar
    	OUTSRC::LEADER		OUTSRC::AGUILAR
    	DTN 566-5205		DTN 566-5208

    Jim Lind
T.R  Title  User  Personal Name  Date  Lines
572.1. by OPG::PHILIP (And through the square window...) Sun Jan 22 1995 11:50 (26 lines)
Jim,

  I don't want to sound unhelpful, but if you really need an urgent
  fix for this problem, you will have to escalate it through the IPMT
  system. We don't have the time required to fix this at the moment;
  an IPMT, however, will force us to make time. It will also
  highlight to our management that this "design deficiency" is quite
  important and requires time to work on.

  As you say, we are having difficulties reproducing this problem.
  However, we have seen enough systems where this has occurred to
  understand the problem, and we have formulated a strategy to fix it;
  it's just a matter of scheduling it in with our other work. As
  I have said, if you IPMT it, it will move to the top of our work
  queue (or extremely close to it).

  This goes for anyone else who is experiencing this problem: IPMT it
  and we will work on it. Until that is done, unfortunately, we can't do
  anything.

  This all boils down to the fact that this NOTES file IS NOT an official
  problem reporting mechanism.

Cheers,
Phil

572.2. "Answers to RWMBX, ACCVIO" by JDARC::GUIBOILEAU Tue Jan 24 1995 08:26 (15 lines)
    I had these problems and I resolved them.
    
    PCM is used here to manage 12 VMS nodes, but I currently need to open
    24 windows with the action console$eventlist.
    
    I always had processes in RWMBX or RWAST, ACCVIO ...
    The problems were resolved by setting:
    
    	- defmbxbufquo	: 1054*4
    	- defmbxmxmsg	: 256*4
    
    	- bytlm		: 800000	! Proc console$startup modified
    	- diolm		: 256		! Proc console$startup modified
    
    I hope this helps somebody.
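
The arithmetic behind the two mailbox settings above can be sketched as follows. This is only an illustration: the base values and the multiplier come from the note, but the `NAME = value` output format (as used in MODPARAMS.DAT) is an assumption about how the values might be applied, and the helper names `tuned_parameters` and `as_parameter_lines` are hypothetical.

```python
# Base values and multiplier are taken from the note above. The
# "NAME = value" rendering is an assumption, not something the note states.
BASES = {
    "DEFMBXBUFQUO": 1054,  # default mailbox buffer quota, base from the note
    "DEFMBXMXMSG": 256,    # default mailbox maximum message size, base from the note
}
MULTIPLIER = 4


def tuned_parameters(bases=BASES, multiplier=MULTIPLIER):
    """Return each parameter scaled by the multiplier used in the note."""
    return {name: base * multiplier for name, base in bases.items()}


def as_parameter_lines(params):
    """Render the values as 'NAME = value' lines (assumed format)."""
    return [f"{name} = {value}" for name, value in params.items()]


if __name__ == "__main__":
    for line in as_parameter_lines(tuned_parameters()):
        print(line)
```

Keeping the base and the multiplier separate makes it easy to try a different scaling factor without retyping the products.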
572.3. by CSC32::BUTTERWORTH (Gun Control is a steady hand.) Tue Jan 24 1995 20:13 (13 lines)
    Re. -1.
    
    Your input is appreciated but it's still possible to cause an
    "RWMBX hang". Your tuning makes it much less likely and is quite
    appropriate for a lot of environments. The problem is a design flaw - 
    I've spent many hours in the code looking for a simple fix and I can
    tell you that  it doesn't exist - and it will have to be corrected by
    some rather major code changes. The code that needs changing is very
    basic to the product and it will require some exhaustive testing
    to make sure we don't break anything else.
    
    Regards,
       Dan
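
Why larger quotas make the hang less likely without removing it can be illustrated with a toy model. The sketch below is not PCM code. It assumes, as a simplification, that a mailbox behaves like a fixed-capacity queue of equal-sized messages and that the reader has stopped draining it; the helper name `messages_before_stall` is hypothetical. RWMBX is the state a VMS process enters while it waits on a full mailbox.

```python
import queue


def messages_before_stall(buffer_quota: int, message_size: int) -> int:
    """Count how many messages a writer can post before it would block.

    Toy model only: the mailbox is a bounded queue whose capacity is
    buffer_quota // message_size, and nothing ever reads from it.
    """
    if message_size <= 0 or buffer_quota < message_size:
        raise ValueError("quota must hold at least one message")
    mailbox = queue.Queue(maxsize=buffer_quota // message_size)
    sent = 0
    while True:
        try:
            # put_nowait raises queue.Full where a real writer would wait.
            mailbox.put_nowait(b"x" * message_size)
        except queue.Full:
            return sent
        sent += 1


if __name__ == "__main__":
    for quota in (1054, 1054 * 4):
        print(quota, messages_before_stall(quota, 256))
```

In this model, quadrupling the quota raises the headroom from 4 to 16 messages of 256 bytes, but a writer facing a stalled reader still blocks eventually, which matches the point that tuning alone cannot rule out the hang.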
572.4. "RWMBX, PCM Hangs & Reboots Must Be Resolved" by KETJE::STEUKERSP Wed Jan 25 1995 07:45 (13 lines)
		Re. -2.


	You say you modified the BYTLM and DIOLM values in
	CONSOLE$STARTUP.COM. With the CONSOLEMUP015 version, however,
	I cannot find these parameters in the procedure.

	Could it be that you mean adding/modifying /IO_BUFFERED and /BUFFER_LIMIT
	on the RUN command instead of using the PQL/other values?


	Thanks in advance,
	Erik.
572.5. by CSC32::BUTTERWORTH (Gun Control is a steady hand.) Wed Jan 25 1995 18:58 (21 lines)
    > Could it be You mean to add/modify /IO_BUFFERED and /BUFFER_LIMIT
    
    /BUFFER_LIMIT is indeed equivalent to BYTLM in the PQLs and UAF,
    and /IO_DIRECT is equivalent to DIOLM.
    
    /IO_DIRECT is not specified in the command file anywhere so we are
    picking up the PQL sysgen value here. Note also that the buffer limit
    value is passed as an argument to the CONSOLE$IMAGE_STARTUP DCL
    subroutine in the CONSOLE$STARTUP procedure. The MUP specifies
    1024000 for the ENS daemon and 655530 for the parent controller daemon.
    Note that the values that you see in CONSOLE$STARTUP *do not* affect
    the child controller daemons. The quotas for these processes are
    either defined by a logical ( described in another recent note) or
    a default value contained within the image.
    
    Regs,
      Dan
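
The quota-to-qualifier correspondence described above can be written down as a small lookup. This is a sketch under stated assumptions: the helper `run_qualifiers` and the image name `EXAMPLE_IMAGE.EXE` are hypothetical, and the code does not reproduce what CONSOLE$STARTUP actually executes. The BIOLM pairing with /IO_BUFFERED is the standard DCL RUN mapping and is included for completeness.

```python
# Process quota name -> DCL RUN qualifier.
QUOTA_QUALIFIERS = {
    "BYTLM": "/BUFFER_LIMIT",
    "BIOLM": "/IO_BUFFERED",
    "DIOLM": "/IO_DIRECT",
}


def run_qualifiers(quotas):
    """Build the qualifier string for the quotas that are set explicitly.

    Quotas that are left out are omitted from the string, in which case
    the new process falls back to the PQL system parameter values.
    """
    unknown = set(quotas) - set(QUOTA_QUALIFIERS)
    if unknown:
        raise ValueError(f"no RUN qualifier known for: {sorted(unknown)}")
    return "".join(
        f"{qualifier}={quotas[name]}"
        for name, qualifier in QUOTA_QUALIFIERS.items()
        if name in quotas
    )


if __name__ == "__main__":
    # 1024000 is the ENS daemon buffer limit quoted above; the image
    # name is a placeholder, not a real PCM file.
    print("$ RUN" + run_qualifiers({"BYTLM": 1024000}) + " EXAMPLE_IMAGE.EXE")
```

Leaving DIOLM out of the dictionary mirrors the situation described in the reply, where /IO_DIRECT is not specified and the PQL value applies.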
    
    
    
    
572.6. "When fixed?" by JGODCL::SYS_MJANSSEN (Marc Janssen (889-9594)) Thu Jan 26 1995 12:44 (8 lines)
    Have you got an idea when this "design flaw" will be fixed?
    
    We have scheduled a major upgrade weekend in a couple of weeks, where
    these RWMBX problems are absolutely not acceptable.
    
    Is this issue "high" on the priority list?
    
    Marc
572.7. by OPG::PHILIP (And through the square window...) Thu Jan 26 1995 16:53 (18 lines)
Marc,

>>    Have you got an idea when this "design flaw" will be fixed.
  
 When we get the time.
  
>>    We have scheduled a major upgrade weekend in a couple of weeks, where
>>    these RWMBX problems are absolutely not acceptable.

 Yes, we realize these problems are not acceptable and they will be fixed.
    
>>    Is this issue "high" on the priority list?

 Yup, it's number 1, but as I said, it's just a matter of finding the time to schedule it in.
    
Cheers,
Phil

572.8. "Is a patch release scheduled?" by JGODCL::SYS_MJANSSEN (Marc Janssen (889-9594)) Thu Feb 09 1995 15:14 (12 lines)
    Hi,
    
    Is there any news regarding the RWMBX problems? We will be using
    PCM extensively soon during the scheduled upgrade weekend.
    
    Could you give us an idea when this issue will be resolved via
    patch/MUP?                                    
    
    Thanks
    
    Marc
    
572.9. by OPG::PHILIP (And through the square window...) Thu Feb 09 1995 15:53 (9 lines)
Marc,

  Nobody has IPMT'd the problem yet. Until it is officially elevated, we
  can't work on it before we have fixed all the problems that have been
  officially elevated.

  So, don't hold your breath.

Cheers,
Phil
572.10. "IPMT is on its way." by JGODCL::SYS_MJANSSEN (Marc Janssen (889-9594)) Fri Feb 10 1995 13:30 (6 lines)
    Ok,
    
    I will stick with the rules. I have logged a new call at our CS support
    center, to IPMT the issue officially.
    
    Marc
572.11. "Update on Problem?" by 36418::SOJDA Fri Mar 10 1995 15:27 (11 lines)
    Just to revisit this issue....
    
    I am now working with a large customer who is nearly ready to go live
    with PCM and who has experienced this problem regularly during testing.
    
    Since it has been about a month since the IPMT was issued, is there any
    additional information on when a fix might be available?
    
    Thanks in advance,
    
    Larry
572.12. by OPG::PHILIP (And through the square window...) Fri Mar 10 1995 18:14 (26 lines)
Larry,

  I am busy trying to get the software working correctly again after 
  putting the fix in (we have a few minor teething problems). We will be
  issuing IFT test kits in about two weeks' time if we can get all the present
  problems fixed.

  We will not be releasing a MUP or ECO for this fix; it is too large a 
  change in the code for that. Instead, we will be doing a small field test with 
  those people who have experienced this problem, and then we will be releasing 
  V1.6 of PCM.

  A warning to everybody who is reading this...

  The underlying Interprocess Communication mechanism has completely changed. 
  Any user-written action routines using the API MUST be recompiled and relinked
  in order for them to work with this new version. This applies ONLY to OpenVMS 
  (VAX and Alpha).

  Finally, we will be taking this opportunity (V1.6) to release a brand new
  C3, which is hierarchical in nature. Also, we WILL NOT be releasing
  V1.6 on ULTRIX; any customers wishing to get the new C3 MUST upgrade to 
  either Digital UNIX or OpenVMS.

Cheers,
Phil
572.13. "Volunteering" by VELI::KORKKO (Veli Körkkö, FNO/OMS Delivery, DTN 879-5512) Sat Mar 11 1995 17:20 (3 lines)
        We would be very interested in field testing your upcoming V1.6 kit. 
        
        veli
572.14. "Me too" by STKEIS::MALM (My God! It's full of stars...) Tue Mar 14 1995 12:20 (5 lines)
    
    
    We are interested in participating in the FT.
    
    /Staffan
572.15. "us too" by 47182::PIOVANO (Don't try so hard ...) Thu Mar 16 1995 13:04 (6 lines)
    
    We are also interested!
    
    
    /Gianmario
    /Paolo
572.16. "EFT ?????" by 45211::CLIFFE_S Wed Mar 22 1995 13:17 (3 lines)
     If this is going into EFT, then my customer would like to participate.
    
    				Steve C
572.17. "When ?" by BACHUS::WILLEMS (Johan Willems @BRO DTN 856-8739) Tue Mar 28 1995 12:01 (3 lines)
Do you have any idea when the EFT version will be available?

Johan
572.18. by OPG::PHILIP (And through the square window...) Tue Mar 28 1995 13:42 (6 lines)
Johan,

  When it works!!! Hopefully next week sometime, but no promises.

Cheers,
Phil
572.19. "tick-tock" by 42178::HARRIS (It's got to be turn-onable & turn-offable) Tue Apr 04 1995 15:01 (6 lines)
    
    ...so you'll let us know then...
    
:-)
    
Nige
572.20. by OPG::PHILIP (And through the square window...) Tue Apr 04 1995 17:07 (5 lines)
  Of course.

Cheers,
Phil
572.21. "CERN would like to FT as well" by CERN::HOBBS (Dial "M" for dyslexia) Tue Apr 18 1995 11:34 (6 lines)
The computer center operations manager at CERN has requested to field test
the next version.

How can I apply for a formal or informal FT?

-cw
572.22. by YOSSAM::PHILIP (And through the square window...) Tue Apr 18 1995 11:59 (7 lines)
cw,

  We are only going to be doing this informally, so just keep an eye open
  here and grab the kit when it becomes available.

Cheers,
Phil
572.23. "Any more news ???" by SEDOAS::CLIFFE_S Wed Apr 26 1995 11:27 (4 lines)
    Any progress on the 1.6 EFT or SSB kit ?????
    
    
    		Steve C
572.24. by YOSSAM::PHILIP (And through the square window...) Wed Apr 26 1995 11:48 (6 lines)
Steve,

  Any day now, we are ironing out the final few obvious problems.

Cheers,
Phil
572.25. "Reverse LAT seems ok for us non-power users" by MOVIES::HARRIS (It's got to be turn-onable & turn-offable) Thu Apr 27 1995 01:35 (7 lines)
    
    Well we hung it out for a while but our use probably doesn't
    need the might of a console manager so I'll take my leave now as
    reverse LAT's probably all we really need. All the best for the new
    version, I may come back to you if we can't do all we want.
    
Nige