[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference vaxaxp::vmsnotes

Title:VAX and Alpha VMS
Notice:This is a new VMSnotes, please read note 2.1
Moderator:VAXAXP::BERNARDO
Created:Wed Jan 22 1997
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:703
Total number of notes:3722

129.0. "%ANALDISK-W-BAD_NAMEORDER, bad advice" by POMPY::LESLIE ([email protected] as of Feb 14) Mon Feb 03 1997 06:58

    This morning I did a directory of pcsi-related files in sys$library.
    
    POMPY::leslie � dir sys$library:pcsi$*
    
    Directory SYS$COMMON:[SYSLIB]
    
    PCSI$MOTIFSHR.EXE;2
                                  172/174           12-NOV-1996 09:15:04.00
    PCSI$MOTIFSHR.EXE;1
                                  173/174           10-DEC-1996 14:20:23.48
    PCSI$SHR.EXE;2                916/918           12-NOV-1996 09:14:45.00
    PCSI$SHR.EXE;1                914/915           10-DEC-1996 14:20:24.31
    PCSI$MOTIFSHR.EXE;1
                                  173/174           10-DEC-1996 14:20:23.48
    PCSI$SHR.EXE;1                914/915           10-DEC-1996 14:20:24.31
                   
    Total of 6 files, 3262/3270 blocks.
    
    Now the above is visibly up the spout, because PCSI$SHR.EXE;1 appears
    twice, so I did an $ana/disk/repair dsa0:
    
    which gave me the message
    
    %ANALDISK-W-BAD_NAMEORDER, filename ordering incorrect in VBN 82
            of directory SYS0.SYSCOMMON.SYSLIB (874,48,1)
            Filenames are PCSI$SHR.EXE
                      and MTHDEF.FOR
    
    
    Hmm. I thought...
    
    
    
    POMPY::leslie � Help/message  BAD_NAMEORDER

BAD_NAMEORDER,  filename ordering incorrect in VBN 'virtual-block- number' 
of directory 'file-id' 

Filenames are 'file-id' and 'file-id'
    
Facility:     ANALDISK, Analyze/Disk_Structure Utility
    
Explanation:  The Analyze/Disk_Structure utility detected two files that
are out of alphabetical order.
    
User Action:  Do a SET FILE/NODIRECTORY on the directory file in which the
files are located; then delete the directory file. Last, run
ANALYZE/DISK/REPAIR to rename all the files from the bad directory into the
SYSLOST.DIR;1 directory.
    
    
    This is plainly a bit dangerous as advice to the inexperienced system
    manager. If I had followed these instructions for sys$library i may
    have never moved again. Worse would have been sysexe.dir!
    
    I would suggest that this 'help' text be revisited and thus this note
    will be posted in DOCNOTES as well.
    
    In the meantime, this is VAX OpenVMS 7.1 and I'd love to know what
    caused it. (I fixed it with judicious rename/copying.)
    
    /andy
    
    
    

    
    
T.RTitleUserPersonal
Name
DateLines
129.1AUSS::GARSONDECcharity Program OfficeMon Feb 03 1997 17:1510
re .0
    
>User Action:  Do a SET FILE/NODIRECTORY on the directory file in which the
>files are located; then delete the directory file. Last, run
>ANALYZE/DISK/REPAIR to rename all the files from the bad directory into the
>SYSLOST.DIR;1 directory.
    
    Ouch. Good pick up.
    
    I think you should QAR this to ensure it receives adequate attention.
129.2not so fast...60600::GILLINGSa crucible of informative mistakesMon Feb 03 1997 17:4242
  Hold on a minute. Do you want the help text to list *all* the exceptions?

  The stated user action is correct for 99.9999% of cases. Yes, there are a
  few situations under which following that advice might cause some grief,
  but is it really worth filling up HELP/MESSAGE with complicated and
  potentially confusing information? My only criticism of the text is that
  it leaves out the final step (RENAME SYSLOST.DIR back to where it came
  from). If you really think it's that dangerous, the text should simply
  be replaced with "log a call with your local CSC". If you make the text
  sound dangerous, that's what people will do anyway, but it's a rather 
  expensive way to deal with the majority of simple instances.

>    In the meantime, this is VAX OpenVMS 7.1 and I'd love to know what
>    caused it. (I fixed it with judicious rename/copying.)

  There were some F11XQP bugs which could result in the use of MOVEFILE (by
  defraggers) munging directory order in clusters:

"Problems Addressed in the VAXF11X02_061 Kit for OpenVMS VAX V6.1:

  o  As of OpenVMS VAX V6.0, movement of directory files is allowed
     as long as no node in the cluster has data from these files
     cached.

     The cluster-wide check for this condition does not work
     properly.  This results in cache incoherency and, as a result,
     directory file corruption.  Common symptoms of this problem are:

       +  out-of-order directory listings
       +  duplicate directory entries for the same file
       +  "lost" files
       +  inconsistent directory listings on different nodes
       +  files appearing in directory listings which are
          not "accessible" from the RMS level (i.e., they
          cannot be "typed" out).
"

  Latest version of this patch is VAXF11X03_070, but I'd have thought this
  problem was fixed long before V7.1. Could they have been there for a long
  time?

						John Gillings, Sydney CSC
129.3POMPY::LESLIE[email protected] as of Feb 14Tue Feb 04 1997 02:4430
    John
    
    	I'll take your points in reverse order. 
    
    Since the system is standalone and I don't run a defrag, then I don't
    believe the problem comes from those sources. The upgradde from 6.2 to
    7.1 went without a hitch 10 days ago. The system disk had an analysis
    performed about a week ago.
    
    You asked if I want the help text to list all the exceptions, or
    alternatively to just tell the user to contact the CSC. Not at all.
    HOWEVER, what could preface the advice would be, imho, a section that
    says:
    
    "If this problem occurs on your system disk and appears to be in an
    OpenVMS-critical directory, please contact your local CSC before doing
    anything else following the advice below. Otherwise...<rest of text>".
    
    Anyone blindly following the advice given will probably *royally* screw
    up a system disk.
    
    Funnily enough, I never have seen this error before in 14+ years of
    VMS'ing. Makes you wonder if VAX OpenVMS 7.1 has had this reported as a
    bug^H^H^Hfeature?
    
    BTW: Removing SYSLOST.DIR wins nothing, I never bother.
    
    /a
    
                    
129.4AUSS::GARSONDECcharity Program OfficeTue Feb 04 1997 16:4619
re .3
    
>I never have seen this error before in 14+ years of VMS'ing.
    
    I believe that it is only fairly recently that ANAL/DISK has included a
    check of the ordering of directory entries.
    
>    BTW: Removing SYSLOST.DIR wins nothing, I never bother.
    
    I think John meant that if you follow the advice that you quoted from
    the help text then you end up with all the files in SYSLOST.DIR. One
    possible final step is just to rename SYSLOST.DIR to be the directory
    that you deleted however I wouldn't do that for two reasons viz. other
    unrelated orphaned files may have been picked up, and this will not
    ensure the correct security attributes that were on the original
    directory.
    
    The last time I had this problem it was SYS$MANAGER. I'm sure glad I
    didn't follow the suggested remedy! (not that I read it)
129.5MOVIES::WIDDOWSONRodFri Feb 07 1997 02:3314
>>I never have seen this error before in 14+ years of VMS'ing.
>    
>    I believe that it is only fairly recently that ANAL/DISK has included a
>    check of the ordering of directory entries.
    
    Absolutely, the `bug' has been there since V1 and previous.  Without 
    going into the sums, I would wagerthat the re-ordering came on a block
    boundary. 
    
    What has happened is that the XQP got brutally inetrrupted in the
    middle of a `shuffledir', as it sqeezes the files up into the previous
    block.   This sort of situation is bound to occur with a non-logged
    filesystem, we are taking action in 7.1 to reduce the possibility of
    you seeing this (narrowing the window), but this will never be fixed.
129.6POMPY::LESLIEAndy, DEC man walking...Fri Feb 07 1997 04:2813
    
    In the middle of this shuffle, one file actually got completely
    trashed from SYS$LIBRARY, whilst two files, as per the listing,
    showed up twice.
    
    I discovered the missing file (UCX$ACP_SHR or somesuch) when I rebooted
    and noticed that UCX failed to start up with a meaningful message.
    Installing 4.1 and ECO 4 fixed that.
    
    Interestingly, ANA/DISK didn't find this file and place it in syslost,
    so I wonder where it was sent by the XQP!
    
    /a