[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference iosg::all-in-1_v30

Title:*OLD* ALL-IN-1 (tm) Support Conference
Notice:Closed - See Note 4331.l to move to IOSG::ALL-IN-1
Moderator:IOSG::PYE
Created:Thu Jan 30 1992
Last Modified:Tue Jan 23 1996
Last Successful Update:Fri Jun 06 1997
Number of topics:4343
Total number of notes:18308

3638.0. "Distributed Sharing Option problems" by UTRTSC::SMEETS (Look at the ALL-IN-1 side of life) Tue Dec 07 1993 13:24

Hello,

Customer has "installed" DSO (Distributed Sharing Option) on an ALL-IN-1 V3.0-1
system and has the following problem.

Cluster ADOORN					Cluster APCL02
--------------					--------------
node AP0101					node AP0201
node AP0103					node AP0202

ALL-IN-1 user BOSMANX				ALL-IN-1 user BOSMAN2
drawers: A		                        Drawers: C
	 B	                                         D

PROXIES:   					PROXIES:

*::BOSMAN2      				*::BOSMANX
BOSMANX (D)                                     BOSMAN2 (D)

Following tests:

1. User BOSMANX on cluster ADOORN requests Index of Available Drawers (System = 
   APCL02, user=BOSMAN) result: drawers A, B, C and D. However if user BOSMANX
   tries to access drawer C or D; error message "Drawer not available".

2. User MANAGER on cluster ADOORN requests Index of Available Drawers (System = 
   APCL02, user=BOSMAN) result: drawers A, B, C and D. No access problems

3. User MANAGER on cluster ADOORN requests Index of Available Drawers (System = 
   APCL02, user=BOSMAN2) result: drawers C and D. No access problems

4. User MANAGER on cluster APCL02 requests Index of Available Drawers (System = 
   ADOORN, user=BOSMANX) result: drawer A and B. No access problems

5. User MANAGER on cluster APCL02 requests Index of Available Drawers (System = 
   ADOORN, user=BOSMAN) result: drawer A, B, C and D. No access problems

6. User BOSMAN2 on cluster APCL02 requests Index of Available Drawers (System = 
   ADOORN, user=BOSMAN) result: error message: "File cabinetserver no available
   on ADOORN"

   However $ open/read/share svr ADOORN::"73=" doesn't show any errors

Any ideas what could be wrong ?

Thanks in advance,

Martin

p.s. Cluster ADOORN + cluster APCL02 = Heterogeneous cluster (2 system disks
and 1 ALL-IN-1 disk)
T.RTitleUserPersonal
Name
DateLines
3638.1couple of quick questionsCHRLIE::HUSTONTue Dec 07 1993 17:1711
    
    Couple of questions:
    
    1) Do both clusters have DSO installed?
    2) Have FCS's been re-started since DSO installed?
    3) Are there any proxies other than those mentioned with the users in
    question?
    4) Is the FCS started on both nodes of both clusters?
    
    --Bob
    
3638.2some quick answersUTRTSC::SMEETSLook at the ALL-IN-1 side of lifeWed Dec 08 1993 08:0015
Hi Bob,

1) Do both clusters have DSO installed?
   Yes

2) Have FCS's been re-started since DSO installed?
   Yes

3) Are there any proxies other than those mentioned with the users in question?
   No

4) Is the FCS started on both nodes of both clusters?
   Yes

Is there some DSO trouble shooting guide available
3638.5support for DSO in heterogeneous clusterUTRTSC::SMEETSLook at the ALL-IN-1 side of lifeMon Dec 13 1993 08:446
Hi,

Is DSO at all supported in a heterogeneous cluster; two system disks and one 
ALL-IN-1 disk ?

Martin
3638.6Long shot, but worth a tryCHRLIE::HUSTONMon Dec 13 1993 20:1322
    
    I, like Kevin, can't come up with much in ways of an explanation for
    this. 
    
    one thought, is that the FCS basically uses the VMS security principles
    to detect whether you have access to a document, it has nothing to do
    with IOS itself. 
    
    Can you do the following: from the remote side, see if you have access, 
    via DECnet, to one of the documents you want to see. For instance, on
    the local side, get a documetn into the context block and do an
    <get oa$cur_docfilename, then from the remote side, from DCL, do:
    
    $ copy remote_node::file from above []
    
    See if this copies the file over, this is basically what the FCS is
    trying to see if you can do.
    
    The multiple system disks might be confusing things.
    
    --Bob
    
3638.7Thanks for thinking with meUTRTSC::SMEETSLook at the ALL-IN-1 side of lifeTue Dec 14 1993 07:3312
Hi Bob,

>>    $ copy remote_node::file from above []

    I've done this already, and wasn't a problem at all ...
    
>>    The multiple system disks might be confusing things.
    
    Coming weekend the customer will untie the heterogeneous cluster into two
    seperate clusters. I hope that will fix his problem.

Martin
3638.8???????????????????UTRTSC::SCHOLLAERTHolland goes USAThu Dec 16 1993 15:4416
    Hello all,
    
    I wonder how this configuration can work.
    
    ONE ALL-IN-1 system running one a FOUR node heterogeneous cluster
    of which two couples form TWO homogeneous sub-clusters.
    
    Manual says that a local file cab server is to be considered
    as part of a cluster. All four servers are local on all nodes.
    So what happens when you do an Index of drawers of a node that
    is not part of the sub-cluster ? I can imaging things mess up.
    
    Regards,
    
    Jan
    
3638.9H E L P, P L E A S EUTRTSC::SMEETSLook at the ALL-IN-1 side of lifeFri Dec 17 1993 14:0621
Hi to all DSO experts,

The customer is really pushing me on this issue, now they want to be certain 
that untying the heterogeneous cluster will solve the problem, even Simon Eijs
is involved in trying to solve this nasty problem.

Customer has also thought about what could possibly be wrong. Here's his theory:

1. User BOSMAN2 on system APCL02 requests IAD (sytem=ADOORN, user=BOSMAN)
2. IS ADOORN$SRV73 (=AP0101$SRV73 or AP0103$SRV73) a local FCS
3. Yes, it's a local server, so look up VMSUSR (BOSMAN2) in SYSAUF on ADOORN
4. BOSMAN2 doesn't exist on ADOORN, so end of the story. Message "FileCabinet
   server not available" is kind of misleading.
5. When there's also a BOSMAN2 SYSUAF entry on ADOORN, then there are no 
   problems.

So once again the question will untie the cluster solve the problem ?

Thank you,

Martin
3638.10Nobody can say for sure, just educated guessCHRLIE::HUSTONFri Dec 17 1993 14:4436
    
    There is no way to know for sure if untying the cluster will solve the
    problem, becuase we don't yet know what the problem is for sure. The
    multiple system disks is a high candidate, but without watching a
    debug server run so that we can see what is really happening, there
    is no way to be sure.
    
>1. User BOSMAN2 on system APCL02 requests IAD (sytem=ADOORN, user=BOSMAN)
>2. IS ADOORN$SRV73 (=AP0101$SRV73 or AP0103$SRV73) a local FCS
>3. Yes, it's a local server, so look up VMSUSR (BOSMAN2) in SYSAUF on ADOORN
>4. BOSMAN2 doesn't exist on ADOORN, so end of the story. Message "FileCabinet
>   server not available" is kind of misleading.
>5. When there's also a BOSMAN2 SYSUAF entry on ADOORN, then there are no 
>   problems.
    
    
    The biggest hole I see in this is step 3. If you are already talking
    to a server, and you then ask if that server is local, then if the
    answer is "yes", there will be no lookup to the UAF file. it will
    simply run the partition.dat looking for drawers owned by
    BOSMAN2. If it doesn't find any, it would return success, with no
    drawer information.
    
    This really looks like the FCSs are tyring to broker, to make sure, 
    try disabling the DSO license and restart the FCS, try the operation 
    again, if you get errors about the DSO not being installed then
    you are brokering and I would GUESS that untying the cluster would
    solve the problem, if you don't get dso errors, then I would GUESS that
    you are not brokering and the DSO is not the problem, but untying the
    cluster MIGHT still solve the problem.
    
    Basically, I don't know of this setup ever being tested, I know I never
    imagined it (so never tested it) while writing the DSO handling code.
    
    --Bob
    
3638.12IM(H)O....IOSG::PYEGraham - ALL-IN-1 Sorcerer&#039;s ApprenticeTue Dec 21 1993 12:0518
    The SPD doesn't explicitly say we support mixed environment (aka
    heterogenous) clusters or not.
    
    The Management Guide says (paraphrasing) that we only support them as
    long as each bit of the cluster has its own SYSUAF and all its own data
    files. Also, you need to be careful how all the queues used for the
    Sender/Fetcher and starting the FCS run on the right nodes. Then you
    treat the two halves of the cluster as completely different systems.
    
    My view (guess :-) ) is that this would work if the cluster alias
    wasn't defined, and the two ALL-IN-1 systems were completely separate.
    Instead the OA$CLUSTER_NODE (name???) logical should be defined on each
    set of nodes.
    
    In any case, probably the only way to find out is to set it up on the
    customer's system and try it!
    
    Graham
3638.13not quite clear what you meanUTRTSC::SMEETSLook at the ALL-IN-1 side of lifeWed Dec 22 1993 08:026
Hi Graham,

What are you exactly trying to say Do you say that the mixed environment
should work or should the cluster be split up ?

Martin
3638.14NOTE 402UTRTSC::SMEETSLook at the ALL-IN-1 side of lifeWed Dec 22 1993 12:325
Hi,

Jan Schollaert pointed me to note 402. How about that.,,,

Martin<