[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference iosg::all-in-1_v30

Title:	OLD ALL-IN-1 (tm) Support Conference
Notice:	Closed - See Note 4331.l to move to IOSG::ALL-IN-1
Moderator:	IOSG::PYE

Created:	Thu Jan 30 1992
Last Modified:	Tue Jan 23 1996
Last Successful Update:	Fri Jun 06 1997
Number of topics:	4343
Total number of notes:	18308

3638.0. "Distributed Sharing Option problems" by UTRTSC::SMEETS (Look at the ALL-IN-1 side of life) Tue Dec 07 1993 13:24

Hello,

Customer has "installed" DSO (Distributed Sharing Option) on an ALL-IN-1 V3.0-1
system and has the following problem.

Cluster ADOORN					Cluster APCL02
--------------					--------------
node AP0101					node AP0201
node AP0103					node AP0202

ALL-IN-1 user BOSMANX				ALL-IN-1 user BOSMAN2
drawers: A		                        Drawers: C
	 B	                                         D

PROXIES:   					PROXIES:

*::BOSMAN2      				*::BOSMANX
BOSMANX (D)                                     BOSMAN2 (D)

Following tests:

1. User BOSMANX on cluster ADOORN requests Index of Available Drawers (System = 
   APCL02, user=BOSMAN) result: drawers A, B, C and D. However if user BOSMANX
   tries to access drawer C or D; error message "Drawer not available".

2. User MANAGER on cluster ADOORN requests Index of Available Drawers (System = 
   APCL02, user=BOSMAN) result: drawers A, B, C and D. No access problems

3. User MANAGER on cluster ADOORN requests Index of Available Drawers (System = 
   APCL02, user=BOSMAN2) result: drawers C and D. No access problems

4. User MANAGER on cluster APCL02 requests Index of Available Drawers (System = 
   ADOORN, user=BOSMANX) result: drawer A and B. No access problems

5. User MANAGER on cluster APCL02 requests Index of Available Drawers (System = 
   ADOORN, user=BOSMAN) result: drawer A, B, C and D. No access problems

6. User BOSMAN2 on cluster APCL02 requests Index of Available Drawers (System = 
   ADOORN, user=BOSMAN) result: error message: "File cabinetserver no available
   on ADOORN"

   However $ open/read/share svr ADOORN::"73=" doesn't show any errors

Any ideas what could be wrong ?

Thanks in advance,

Martin

p.s. Cluster ADOORN + cluster APCL02 = Heterogeneous cluster (2 system disks
and 1 ALL-IN-1 disk)

T.R	Title	User	Personal Name	Date	Lines
3638.1	couple of quick questions	CHRLIE::HUSTON		`Tue Dec 07 1993 17:17`	11
	Couple of questions: 1) Do both clusters have DSO installed? 2) Have FCS's been re-started since DSO installed? 3) Are there any proxies other than those mentioned with the users in question? 4) Is the FCS started on both nodes of both clusters? --Bob
3638.2	some quick answers	UTRTSC::SMEETS	Look at the ALL-IN-1 side of life	`Wed Dec 08 1993 08:00`	15
	Hi Bob, 1) Do both clusters have DSO installed? Yes 2) Have FCS's been re-started since DSO installed? Yes 3) Are there any proxies other than those mentioned with the users in question? No 4) Is the FCS started on both nodes of both clusters? Yes Is there some DSO trouble shooting guide available
3638.5	support for DSO in heterogeneous cluster	UTRTSC::SMEETS	Look at the ALL-IN-1 side of life	`Mon Dec 13 1993 08:44`	6
	Hi, Is DSO at all supported in a heterogeneous cluster; two system disks and one ALL-IN-1 disk ? Martin
3638.6	Long shot, but worth a try	CHRLIE::HUSTON		`Mon Dec 13 1993 20:13`	22
	I, like Kevin, can't come up with much in ways of an explanation for this. one thought, is that the FCS basically uses the VMS security principles to detect whether you have access to a document, it has nothing to do with IOS itself. Can you do the following: from the remote side, see if you have access, via DECnet, to one of the documents you want to see. For instance, on the local side, get a documetn into the context block and do an <get oa$cur_docfilename, then from the remote side, from DCL, do: $ copy remote_node::file from above [] See if this copies the file over, this is basically what the FCS is trying to see if you can do. The multiple system disks might be confusing things. --Bob
3638.7	Thanks for thinking with me	UTRTSC::SMEETS	Look at the ALL-IN-1 side of life	`Tue Dec 14 1993 07:33`	12
	Hi Bob, >> $ copy remote_node::file from above [] I've done this already, and wasn't a problem at all ... >> The multiple system disks might be confusing things. Coming weekend the customer will untie the heterogeneous cluster into two seperate clusters. I hope that will fix his problem. Martin
3638.8	???????????????????	UTRTSC::SCHOLLAERT	Holland goes USA	`Thu Dec 16 1993 15:44`	16
	Hello all, I wonder how this configuration can work. ONE ALL-IN-1 system running one a FOUR node heterogeneous cluster of which two couples form TWO homogeneous sub-clusters. Manual says that a local file cab server is to be considered as part of a cluster. All four servers are local on all nodes. So what happens when you do an Index of drawers of a node that is not part of the sub-cluster ? I can imaging things mess up. Regards, Jan
3638.9	H E L P, P L E A S E	UTRTSC::SMEETS	Look at the ALL-IN-1 side of life	`Fri Dec 17 1993 14:06`	21
	Hi to all DSO experts, The customer is really pushing me on this issue, now they want to be certain that untying the heterogeneous cluster will solve the problem, even Simon Eijs is involved in trying to solve this nasty problem. Customer has also thought about what could possibly be wrong. Here's his theory: 1. User BOSMAN2 on system APCL02 requests IAD (sytem=ADOORN, user=BOSMAN) 2. IS ADOORN$SRV73 (=AP0101$SRV73 or AP0103$SRV73) a local FCS 3. Yes, it's a local server, so look up VMSUSR (BOSMAN2) in SYSAUF on ADOORN 4. BOSMAN2 doesn't exist on ADOORN, so end of the story. Message "FileCabinet server not available" is kind of misleading. 5. When there's also a BOSMAN2 SYSUAF entry on ADOORN, then there are no problems. So once again the question will untie the cluster solve the problem ? Thank you, Martin
3638.10	Nobody can say for sure, just educated guess	CHRLIE::HUSTON		`Fri Dec 17 1993 14:44`	36
	There is no way to know for sure if untying the cluster will solve the problem, becuase we don't yet know what the problem is for sure. The multiple system disks is a high candidate, but without watching a debug server run so that we can see what is really happening, there is no way to be sure. >1. User BOSMAN2 on system APCL02 requests IAD (sytem=ADOORN, user=BOSMAN) >2. IS ADOORN$SRV73 (=AP0101$SRV73 or AP0103$SRV73) a local FCS >3. Yes, it's a local server, so look up VMSUSR (BOSMAN2) in SYSAUF on ADOORN >4. BOSMAN2 doesn't exist on ADOORN, so end of the story. Message "FileCabinet > server not available" is kind of misleading. >5. When there's also a BOSMAN2 SYSUAF entry on ADOORN, then there are no > problems. The biggest hole I see in this is step 3. If you are already talking to a server, and you then ask if that server is local, then if the answer is "yes", there will be no lookup to the UAF file. it will simply run the partition.dat looking for drawers owned by BOSMAN2. If it doesn't find any, it would return success, with no drawer information. This really looks like the FCSs are tyring to broker, to make sure, try disabling the DSO license and restart the FCS, try the operation again, if you get errors about the DSO not being installed then you are brokering and I would GUESS that untying the cluster would solve the problem, if you don't get dso errors, then I would GUESS that you are not brokering and the DSO is not the problem, but untying the cluster MIGHT still solve the problem. Basically, I don't know of this setup ever being tested, I know I never imagined it (so never tested it) while writing the DSO handling code. --Bob
3638.12	IM(H)O....	IOSG::PYE	Graham - ALL-IN-1 Sorcerer's Apprentice	`Tue Dec 21 1993 12:05`	18
	The SPD doesn't explicitly say we support mixed environment (aka heterogenous) clusters or not. The Management Guide says (paraphrasing) that we only support them as long as each bit of the cluster has its own SYSUAF and all its own data files. Also, you need to be careful how all the queues used for the Sender/Fetcher and starting the FCS run on the right nodes. Then you treat the two halves of the cluster as completely different systems. My view (guess :-) ) is that this would work if the cluster alias wasn't defined, and the two ALL-IN-1 systems were completely separate. Instead the OA$CLUSTER_NODE (name???) logical should be defined on each set of nodes. In any case, probably the only way to find out is to set it up on the customer's system and try it! Graham
3638.13	not quite clear what you mean	UTRTSC::SMEETS	Look at the ALL-IN-1 side of life	`Wed Dec 22 1993 08:02`	6
	Hi Graham, What are you exactly trying to say Do you say that the mixed environment should work or should the cluster be split up ? Martin
3638.14	NOTE 402	UTRTSC::SMEETS	Look at the ALL-IN-1 side of life	`Wed Dec 22 1993 12:32`	5
	Hi, Jan Schollaert pointed me to note 402. How about that.,,, Martin<