[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference orarep::nomahs::rdb_60

Title:Oracle Rdb - Still a strategic database for DEC on Alpha AXP!
Notice:RDB_60 is archived, please use RDB_70..
Moderator:NOVA::SMITHISON
Created:Fri Mar 18 1994
Last Modified:Fri May 30 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:5118
Total number of notes:28246

4968.0. "RMU-F-FILACCERR,SYSTEM-F-IVCHAN From RMU/RESTORE" by M5::BLEHLBAC (Toto, It Ain't Kansas) Mon Jan 27 1997 14:51

rdb 6.0-05
vax/vms 6.1

  I have a customer that is getting the following when doing an rmu/restore:

        %RMU-F-FILACCERR, error reading disk file
        -SYSTEM-F-IVCHAN, invalid I/O channel


  This is a partial log:

SIVS01$ RMU/RESTORE/NORECOVER/LOG SACSS_MAIN_DB_IMAGE.RBF;2
%RMU-I-RESTXT_04, Thread 1 uses devices DKA400:            
%RMU-I-AIJRSTBEG, restoring after-image journal "state" information
:                                                                  
:                                                                  
:                                                                  
%RMU-I-RESTXT_08,     restored 24804 data pages                    
%RMU-I-LOGRESSST, restored storage area   
DISK$USER3:[SACSS_ACMS.V10.DB]XSA_PRR_0101.RDA;1
%RMU-I-RESTXT_05,  rebuilt 15 space management pages
%RMU-I-RESTXT_06,     restored 0 inventory pages    
%RMU-I-RESTXT_07,     rebuilt 264 logical area bitmap pages
%RMU-I-RESTXT_08,     restored 24738 data pages  
%RMU-I-LOGRESSST, restored storage area   
DISK$USER3:[SACSS_ACMS.V10.DB]SA_FIN_0101.RDA;1
%RMU-I-RESTXT_05,     rebuilt 27 space management pages 
%RMU-I-RESTXT_06,     restored 0 inventory pages        
%RMU-I-RESTXT_07,     rebuilt 306 logical area bitmap pages
%RMU-I-RESTXT_08,     restored 44694 data pages            
%RMU-F-FILACCERR, error reading disk file                  
-SYSTEM-F-IVCHAN, invalid I/O channel                      

------------------------------------------------------------------

  No bugcheck dumps.  ACMS is being used.

  There are about 30 storage areas, fillm was boosted to 600 but it still
failed.

  A workaround for the customer is to do an rmu/restore/noafter.

  Any ideas??       Thanks,  Barry
T.RTitleUserPersonal
Name
DateLines
4968.1WAGsvrav1.au.oracle.com::MBRADLEYI was dropped on my head as a baby. What's your excuse?Mon Jan 27 1997 20:165
What is the value for CHANNELCOUNT?

G'day,

Mark.
4968.2Rocky Mountain HIGH .... ;^)M5::BLEHLBACRDB: 34% better than real lifeMon Jan 27 1997 21:048
<<< Note 4968.1 by svrav1.au.oracle.com::MBRADLEY "I was dropped on my head as a baby. What's your excuse?" >>>
                                    -< WAG >-

>>What is the value for CHANNELCOUNT?

  CHANNELCOUNT is set to 1200.

      Thanks,  Barry
4968.3ORAREP::HERON::VIGIERPhilippe\Rdb EngineeringTue Jan 28 1997 07:1912
    From VMS doco :
    IVCHAN,  invalid I/O channel
    Facility:   SYSTEM, System ServicesExplanation:  The channel number
    		specified in an input or output request
                is not a valid channel number; the I/O operation cannot be
                performed.
    User Action:  Check for a programming error. Verify that the request to
                  assign the I/O channel completed successfully and returned a
                  valid channel number.
    
    What about some tricky memory corruption ???
    \Philippe.
4968.4will take a lookVGER::BLEHLBACToto, It Ain&#039;t KansasTue Jan 28 1997 08:047
re .3

  I saw that.  The problem is happening on 2 unrelated systems so I haven't
pursued a "system specific" problem.  However, I will review process quotas.
It appears that the aij is involved somehow.  And, perhaps, acms.

                                      Thanks,  Barry
4968.5M5::LWILCOXChocolate in January!!Tue Jan 28 1997 08:279
     <<< Note 4968.3 by ORAREP::HERON::VIGIER "Philippe\Rdb Engineering" >>>

>>    User Action:  Check for a programming error. 

Philippe, I thought for sure this mean that there was a programming
error in the Rdb product!  Silly me!

Liz :-).

4968.6HOTRDB::PMEADPaul, [email protected], 719-577-8032Tue Jan 28 1997 09:494
    Before expending too much effort chasing down quotas and such why not
    have them upgrade to the most recent kit?  Since this is likely to be
    caused by memory corruption (like my good buddy Dr. Philippe suggests)
    then just about anything could fix it.
4968.7its not CHANNELCNT.svrav1.au.oracle.com::MCHANTue Jan 28 1997 17:4119
    
    I dont believe its shortage of channelcnt. 
    Customer have checked it using sh proc/chan in SDA and 
    even reboot the machine before try RMU/RESTORE again. 
    I would expect "system-f-noiochan no i/o channel available"
    error message if channelcnt exceed its limit.
    
    Did the dump of AIJ file showing any sign of corruption? 
    Customer should verify their database before backup it up.
    
    
    
    Regards,
    Michael.
    
    
        
    
    
4968.8M5::LWILCOXChocolate in January!!Fri Jan 31 1997 17:5823
FWIW, I talked to this customer some more and said I would attempt to
see if there might be something she could do to try to verify that indeed,
the problem is memory corruption.  Is there some way to look at the process
buffers, or whatever other structures would be involved, during the restore
operation to attempt to verify this theory?

She stated that they tried to restore from numerous RMU/BACKUP files on the
one system and ended up going back to one from some time in December.  That
did not exhibit the error on restore, and she did not specify /NOAFTER.
On at least one that did exhibit the error on restore they found that using
/NOAFTER meant a successful restore (as Barry mentioned).

They could successfully RMU/DUMP/BACKUP even those that errored on the
restore.  She will try to take one of the backups that errored and restore
it on a 6.0 system with a more current ECO in order to see if that makes
a difference.

I realize there is little interest in this due to the age of the version
she is running.

Thanks,

Liz
4968.9I think they can use their time more constructively :)BOUVS::OAKEYI&#039;ll take Clueless for $500, AlexSun Feb 02 1997 18:1721
~~           <<< Note 4968.8 by M5::LWILCOX "Chocolate in January!!" >>>

~~FWIW, I talked to this customer some more and said I would attempt to
~~see if there might be something she could do to try to verify that indeed,
~~the problem is memory corruption.  Is there some way to look at the process
~~buffers, or whatever other structures would be involved, during the restore
~~operation to attempt to verify this theory?

Not easily.  We ship the product in such a state as to eliminate as much 
information as possible about what structures someone might be looking at.  
(ie, no debug information is available).  As most memory corruption errors 
take experienced engineers with access to sources quite a while to track 
down, I doubt the customer would be able to easily determine anything.

~~They could successfully RMU/DUMP/BACKUP even those that errored on the
~~restore.  She will try to take one of the backups that errored and restore

I believe the DUMP/BACKUP has been discussed before.  It verifies that you 
can read the backup file but doesn't really say anything about your ability 
to restore a useful database.

4968.10Check also BYTLMORAREP::HERON::GODFRINDOracle Rdb EngineeringWed Feb 05 1997 05:008
I had a customer with a similar problem some time ago. I also started chasing
channelcount et al. 

At the end someone (don't remember who) mentionned BYTLM as a possible cause.

Increasing this made the problem go away.

/albert
4968.11M5::LWILCOXChocolate in January!!Thu Feb 06 1997 09:463
Thanks, both of you, that all helps.

Liz