[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference bulova::decw_jan-89_to_nov-90

Title:DECWINDOWS 26-JAN-89 to 29-NOV-90
Notice:See 1639.0 for VMS V5.3 kit; 2043.0 for 5.4 IFT kit
Moderator:STAR::VATNE
Created:Mon Oct 30 1989
Last Modified:Mon Dec 31 1990
Last Successful Update:Fri Jun 06 1997
Number of topics:3726
Total number of notes:19516

33.0. "DECW$SERVER_x loops after X$Store_color" by STKHLM::GOLSSON (Gert Olsson, Malmoe (UGO)) Thu Jan 26 1989 18:08

    !! This is basicly a repost of note 1777 wich dissapeared in the
    !! disk crash, though with some added info.
    
    I have a customer who has ported a large UIS based application to
    DECWindows.
    
    Everything is ok except for a "blink" function, wich is
    implemented by changing one of the allocated color map entries at a
    rate of 0.25 hz. Color maps are handled by X$Default_colormap,
    X$Alloc_color and a number of X$Store_color.
    
    What happens is that the server goes into a compute loop at priority 6
    (PC loops between %X00124EF and %X0012566D). It makes no paging, no I/O
    and uses no eventflags as far as SHOW/PROC/CONT DECW$SERVER_0 can tell.
    
    The Xlib call that seems to trigger the loop is a X$Store_color, but we 
    are a not sure that it causes it.
    
    A simple test with the "blink" routine doesn't have the same problem
    as the whole application. 
    We have tried to reduce the application to a level where it will run
    but without success. Lowering the blink rate didn't give any improvment, 
    but turning it of completly helps.
    Running the application with the client on another node didn't help
    either.
    
    We tapped the protocol up to the loop with help of XLIDDY, but as far
    as I can see there is nothing wierd in the protocol. We tapped both the
    original and the minimized application. The results of XLIDDY, and the 
    part of the application that handles Xlib could be found at 
    STKTSC""::
    	GRAPH.PAS;197           859/861   INPUTDRIVERS.PAS;66     135/135   
	MAIN.XL;1               183/183   MAIN_MIN.XL;1           211/213   
        (Then MAIN_MIN.XL is bigger beacuse it runs for a longer time)
    
    Config:
    	VSII/GPX, 8 planes, 9Mb, VMS/DECWindows 5.1 SDC.
    
    So, the questionsmarks.
    What are the server doing in the area of the loop?
    Can anyone knowledgable take a look at the XLIDDY files and see if
    something stiks out?
    Any suggestion on what to do next?
    (SPR and/or QAR of course, but there is problems to reproduce the failure
    without the complete application software, and that's a lot of code).
    
    
    Regards,
    Gert.

T.RTitleUserPersonal
Name
DateLines
33.1I'll look at the PCDECWIN::FISHERBurns Fisher 381-1466, ZKO3-4/W23Fri Jan 27 1989 15:407
I'll try to figure out what is at those PC values.  BTW, is the application
doing an XSynch or something to force the buffer to be flushed before each
�-second wait?  If not, try it.  There is still a bug if the server loops, but
it might do something.

Burns

33.2We will try X$SynchSTKHLM::GOLSSONGert Olsson, Malmoe (UGO)Sun Jan 29 1989 09:059
    RE: 1.
    Thank you for trying to help me out.
    In the standard application we don't synch, but in the minimized
    version we tried it without any success. I will try to put it in the
    standard appliction and post the result. 
    By the way, the wait is not �-second, it's 4 seconds.
    
    Gert.

33.3Where to find the filesSTKHLM::GOLSSONGert Olsson, Malmoe (UGO)Sun Jan 29 1989 10:3415
    My excuses to anybody who have tried to reach the files mentioned in
    the base note. Someone of our system managers made a too good job.
    The mentioned files is available as follows:
    
        Directory STKTSC::SWS4:[GOLSSON]   
        
        MAIN.XL;1               183/183    
        MAIN_MIN.XL;1           211/213    
	GRAPH.PAS;197           859/861    
        INPUTDRIVERS.PAS;66                
                                135/135    
    
    
    Gert.

33.4X$Synch/X$Flush without improvment!!STKHLM::GOLSSONGert Olsson, Malmoe (UGO)Sun Feb 05 1989 06:2110
    RE: 1
    We have tried to put in a lot of X$Synch and/or X$Flush but the problem
    remains the same as described in the basenote.
    
    We are working on tracing all XLib calls and generate a smaller
    program out of that. We will post it here ifit fails the same way as
    the "big" application is doing.
    
    Gert.

33.5A way to reproduce the loop!!!STKHLM::GOLSSONGert Olsson, Malmoe (UGO)Mon Feb 06 1989 10:4814
    We have managed to reduce the code that generates the problem.
    
    If anyone want to try it or wants to give som suggestions for 
    workarounds, are welcome and can get it from the following:

	Directory STKTSC::SWS4:[GOLSSON]
	DECW_KILLER_MINI.PAS;2   25
	DECW_KILLER_MINI.XL;1     7
    
    The code isn't beatiful, as most of it is generated by tracing Xlib calls,
    so please don't shout at me for that.
    
    Gert.

33.6QAR 2353CSSE32::MERMELLWindow PainMon Feb 06 1989 15:4812
It would have saved me several hours of work if this bug
had been QAR'ed.  I came across another program that
appears to cause the server to loop in the same way.
I checked the QAR files for reported server bugs, didn't see any, 
so I then spent several hours generating crash dumps and investigating
the problem.

Once a problem is understood to be a bug in a DECwindows component,
it needs to be QAR'ed.

grrrr.

33.7Thanks and moreSTKHLM::GOLSSONGert Olsson, Malmoe (UGO)Tue Feb 07 1989 16:0918
    RE: .6
    Excuse me, but I hasn't been sure that this is a server bug (and
    still I'm not 100% sure about it, probably due to my limited knowledge
    of the VMS X-Server).
    
    We have also spent time trying to reduce the code that exhibits the
    "loop problem" to a more manageble amount to ease the SPR/QAR
    process.
    
    I have use this note as a way to help me with this, and I hope that
    that isn't a missuse of this conference. If so, I apologize.
    
    But of course you are right about the QAR system, I should have
    checked it for similar bugs very carefully. The main obstacle on
    the way is that I dont know where it is! Could anyone tell me?
    
    Gert.

33.8I should have known!!!STKHLM::GOLSSONGert Olsson, Malmoe (UGO)Tue Feb 07 1989 16:3511
    Cont from .7:
    
    Excuse me again, I should of course know that the QAR system lives on
    TRIFID.
    
    It's not easy to use from this side of the Atlantic, but anyway, I'll
    try to check out QAR 2353 and see if it fits, else I think we have a
    well defined problem to put in.
    
    Gert.

33.9QAR:ed as #2357STKHLM::GOLSSONGert Olsson, Malmoe (UGO)Wed Feb 08 1989 04:188
    The problem is QAR:ed as #2357 in DECWINDOWS_IFT at TRIFID.
    
    I had some problem entering the QAR, partly due to not being used
    to the QAR system, partly due to the long response times across
    the Atlantic.
    
    Gert.

33.10hint for using QAR from afarCSSE32::MERMELLWindow PainWed Feb 08 1989 11:3010
One trick to help with using TRIFID from far away is to
create the QAR text file on your local system.  Make the
file world readable, then set host to TRIFID.  Ask for
the editor to create the QAR, then just type at the editor prompt
* INCLUDE MYNODE::MYDISK:[MYDIR]MYFILE.TXT.
This step may take a few minutes, but you don't have to waste
your time while this is happening.

Thanks for entering the QAR.

33.11Any feedback on QAR #2357????STKHLM::GOLSSONGert Olsson, Malmoe (UGO)Thu Feb 16 1989 12:3710
    This may be impropriate, but I ask anyway.
    
    Can anyone from DECwindows group give us or me any feedback on the
    QAR I entered for this problem? (QAR #2357, DECWINDOWS-IFT)
    
    The QAR is open and has gotten it's component and abstract fields
    filled in.
    
    Gert.

33.12Looking...DECWIN::FISHERBurns Fisher 381-1466, ZKO3-4/W23Thu Feb 16 1989 17:266
I'm looking at it at this very moment.  I have gotten the "minimized" version,
but I can't reproduce the problem.  It seems to run fine.  I'm getting the
liddy log now to see if I can tell anything.

Burns

33.13No joyDECWIN::FISHERBurns Fisher 381-1466, ZKO3-4/W23Fri Feb 17 1989 09:579
I'm not getting much of anywhere with this.  Everything seems to work fine
for me.

In the liddy log, did you ^Y the client after the hang started?  That is,
is it reasonable to assume that the last thing in the liddy log is the
last instruction sent to the server?

Burns

33.14Still no fun hereSTKHLM::GOLSSONGert Olsson, Malmoe (UGO)Sat Feb 18 1989 11:4814
    Yes, we interrupted the client with ^Y, stopped it and then stopped
    the XLIDDY process.
    
    To bad that you can't get it to fail. I tried today again with same
    result. As you probably have read before we have the problem on both
    a VS2000, 4 plane color, 6Mb and a VSII GPX 8 planes, 9 Mb. The
    problems shows up with both EFT-2 and 5.1 SDC (Easynet kit) software.
    
    Are you trying it out in that environment or something newer?
    If you have newer software we are of course willing to try it too see
    if it helps!
    
    Gert.

33.15No different configurationDECWIN::FISHERBurns Fisher 381-1466, ZKO3-4/W23Sat Feb 18 1989 22:5117
    I'm running regular old SDC V5.1 on a 5(?)-meg VSII/GPX with 8-planes.
    
    I also tried it on a 24-plane Firefox, but that didn't work for beans.
    You should probably search for a PseudoColor visual at the beginning
    rather than assuming that the default visual is PseudoColor.
    
    But I honestly don't know what is going on.  Next thing I will do is
    to try to figure out what the addresses you mention earlier are.  Just
    to double check, could you enter here the addresses in your server log?
    It should say something like "Device Independent Loaded at <hex address>"
    and then "GPX Color Support Loaded at <hex address>".  That's not quite
    the right words, but you can find it pretty easily, and that is what I
    would like, please.
    
    Burns
    

33.16Logs, but not from failuresSTKHLM::GOLSSONGert Olsson, Malmoe (UGO)Mon Feb 20 1989 06:3526
    After a failing run both DECW$SERVER_0_ERROR.LOG and OUTPUT.LOG is
    empty.
    
    I have included a DECW$SERVER_0_ERROR.LOG from a normal run.
    
    Gert.
    
Hello, this is the X server
Dixmain address=22dfa
Now attach all known txport images
in SetFontPath
out SetFontPath
GPX color/monochrome support loaded
gpx$InitOutput address=1138d4
Connection Prefix: len == 42
Now I call scheduler/dispatcher
Connection aa578 is closed by Txport
Connection aa578 is closed by Txport
Connection ac490 is closed by Txport
Connection ac4c8 is closed by Txport
Connection ac4c8 is closed by Txport
Connection ac4c8 is closed by Txport
Connection ac4c8 is closed by Txport
Connection
    

33.17<An observation>STKHLM::GOLSSONGert Olsson, Malmoe (UGO)Mon Feb 20 1989 07:539
    A little observation:
    
    I've tried the "mini_killer" several times today with failures.
    One thing though, if i compile, link and run it with debugger,
    it works most of the time. So to be sure of failure run it without
    any debugging.
    
    Gert.

33.18Workaround and thanks!!!STKHLM::GOLSSONGert Olsson, Malmoe (UGO)Tue Feb 21 1989 15:2215
    There is a workaround to solve this problem. (See QAR #2357)
    There will also be a fix to the server in the feature.
    
    Basically the problem is that the server doesn't mask out the
    appropriate bits in the X$Color.X$Colr_Flags structure when defining
    a color. Especially if the sign bit in this byte is set the server
    goes into a compute loop.
    
    The workaround is to clear this byte before setting any of the flags.
    
    Thanks to everybody who helped me to get this sorted out.
    
    Regards,
    Gert.

33.19DECWIN::FISHERBurns Fisher 381-1466, ZKO3-4/W23Wed Feb 22 1989 17:428
Sorry I did not post the fix here too, but thanks to Gert for relaying the
info from the QAR system.  I presume that this fixed your problem, Gert?

Thanks to everyone for the help...it was easy to find once I could reproduce
it, but reproducing it was a bear.  Andy came up with the program that did it.

Burns

33.20STKHLM::GOLSSONGert Olsson, Malmoe (UGO)Thu Feb 23 1989 09:306
    RE: .19
    
    Sure it fixed my problem, thanks for the help!
    
    Gert.