[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference orarep::nomahs::rdb_60

Title:Oracle Rdb - Still a strategic database for DEC on Alpha AXP!
Notice:RDB_60 is archived, please use RDB_70..
Moderator:NOVA::SMITHISON
Created:Fri Mar 18 1994
Last Modified:Fri May 30 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:5118
Total number of notes:28246

5080.0. "writing ROOT file TSNBLK VBN" by m5.us.oracle.com::JAKUHN ([email protected]) Wed Feb 26 1997 16:12

    Rdb 7.0
    
    I have a customer that has been running V7.0 since it first came out,
    but last night he started seeing this in RMU/SHOW STATs:
    
    - 
    Process.ID Since......   Stall.reason.............................
    Lock.ID. 
    20C57D6A:2               writing ROOT file (TSNBLK VBN 249) 
    20C56998:4               writing ROOT file (TSNBLK VBN 249) 
    20C39D78:1               writing ROOT file (TSNBLK VBN 249) 
    20C5217E:1               writing ROOT file (TSNBLK VBN 249) 
    20C539F0:1               writing ROOT file (TSNBLK VBN 249) 
    20C3F949:1               writing ROOT file (TSNBLK VBN 249) 
    20C4E597:1               writing ROOT file (TSNBLK VBN 249) 
    20C3F949:2               writing ROOT file (TSNBLK VBN 249) 
    ...
    ...
    
    The system seems to be running fine. I had him do a SHOW
    LOCKS/MODE=BLOCKING but nothing comes up.
    
    He has several databases open on his system. when the problem first
    started, it was on one database only, now it's on everyone. 
    The disks are shadowed at the controller level.
    
    thanks
    jay
    
T.RTitleUserPersonal
Name
DateLines
5080.1NOVA::SMITHIDon't understate or underestimate Rdb!Wed Feb 26 1997 16:273
Which screen is this?  Looks like this is history (since the Since is blank).

Ian
5080.2More questions...bouvs.us.oracle.com::OAKEYI'll take Clueless for $500, AlexWed Feb 26 1997 17:1317
>>  <<< Note 5080.1 by NOVA::SMITHI "Don't understate or underestimate Rdb!" >>>

>>Which screen is this?  Looks like this is history (since the Since is blank).

The 'Since' being blank happens on the Active User Stall Messages screen.  
(The Stall Messages screen shows only active stalls).

This might happen if the disk the root file was on was extremely busy at 
the time all these users went to update TSNBLK information.  The fact that 
the since is blank is really okay since that means the stall is no longer 
stalled.  You're simply seeing the text from the last stall that was active 
at the time the stats displayed.

How often does this happen?  When everyone attaches?  Some other time?  All 
the time?  Are all the root files on the same shadow set?  Is the shadowed 
controller new (and whose is it)?

5080.3*ACTIVE* USER STALL != STALL m5.us.oracle.com::JAKUHN[email protected]Wed Feb 26 1997 18:0314
    /This might happen if the disk the root file was on was extremely busy
    /at the time all these users went to update TSNBLK information.  The fact
    /thatthe since is blank is really okay since that means the stall is no
    /longer stalled.  You're simply seeing the text from the last stall that was
    /active at the time the stats displayed.
     
    Ahh! forgot about that screen. He was having some locking problems 
    (caused his code) so I got confused. Now if he saw thoses messages
    (with timestamp) on the "STALL MESSAGES" screen, then, we would have
    a problem. ?
    
    thanks
    jay
    
5080.46 of one, half dozen of another :)bouvs.us.oracle.com::OAKEYI&#039;ll take Clueless for $500, AlexWed Feb 26 1997 18:2620
>>     <<< Note 5080.3 by m5.us.oracle.com::JAKUHN "[email protected]" >>>
>>                       -< *ACTIVE* USER STALL != STALL  >-

>>    (caused his code) so I got confused. Now if he saw thoses messages
>>    (with timestamp) on the "STALL MESSAGES" screen, then, we would have
>>    a problem. ?
    
First, the Active User Stall Messages shows all users, stalled or not; the 
Stall Messages shows only active stalls.  So basically, the Stall Messages 
is a subset of the Active User Stall Messages screen.

The fact that you saw them at all does indicate a potential problem since 
the user did have to 'stall' or wait.  How long the stall is would give 
some indication of the level of problem.  Also remember, for the text to
even show up (even if not stalled) means that at one point during the 
refresh, the user was actively stalled and there was a time filled in (and 
just subsequently cleared on a later refresh).  So, some function of how 
significant the problem is will be based on the refresh rate.  The top of 
the screen should show that information.

5080.5m5.us.oracle.com::JAKUHN[email protected]Wed Feb 26 1997 19:443
    yes, and for some reason the messages cropped up in all his databases
    all at once, even though only one database was having problems with
    hangs. Maybe a controller problem or something.
5080.6more infom5.us.oracle.com::JAKUHN[email protected]Thu Feb 27 1997 15:2117
    Their entire system is 4 months old. 
    They use HSD50s and HSD30s in a dual redundant configuration. 
    ie every HSD cabinet has 2 controllers in case one fails. 
    The databases are spread out across both types of contollers and I see 
    the "writing ROOT file (TSNBLK VBN xxx)" on all database regardless of 
    controller or shadow set.  They use HSD based shadow/mirroring. 
     
    Additionally, even databases that were not open at the time now show 
    this message.  He also checked unopened and opened databases from my 
    workstation(the only other member in the cluster with the production
    Alpha) 
    and his workstation also shows the "writing ROOT file (TSNBLK VBN xxx)" 
    message. The message is cluster wide regardless if a
    database was opened the other night when they had the problem. 
    ????
    thanks
    jay
5080.7HOTRDB::PMEADPaul, [email protected], 719-577-8032Thu Feb 27 1997 16:378
    Gee Jay, try attaching to a db on your system.  Start/commit a txn, and
    see what the "Active User Stall Messages" screen says.
    
    In V7.0 all root "objects" are written to disk asynchronously. 
    Sometimes we don't do asynch because we want to make sure the updates
    get to disk before we proceed.  Thus we enter a stall msg before
    writing the object.  Since writing the TSNBLK turns out to be the last
    thing done in a txn then that is what you see in stats.