[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference iosg::all-in-1

Title:	ALL-IN-1 (tm) Support Conference
Notice:	Please spell ALL-IN-1 correctly - all CAPITALS!
Moderator:	IOSG::PYECE

Created:	Fri Jul 01 1994
Last Modified:	Fri Jun 06 1997
Last Successful Update:	Fri Jun 06 1997
Number of topics:	2716
Total number of notes:	12169

2653.0. "NODE1::"73=" Stopped" by BARNA::DSMAIL () Wed Apr 30 1997 14:45

Hi,

A customer has a VMS cluster with NODE1 and NODE2,
he ran ALL-IN-1V3.0 from NODE1 without problems
Now, he runs ALL-IN-1 from NODE2, but he has File Cabinet problems.
    
SM/MFC/MS shows		             Server Name:  NODE1::"73="
                                          Status:  Stopped
        	            DECnet object number:  73
        		      TCP/IP port number:  7373
        
also:

                               Index of Servers
 (Selections: 0  )                                          (New messages: 0  )

   No.   Node   Num TCP/IP Type   Process Name       Startup Queue

 > 1     NODE1   73  7373  LOCAL  NODE1$SRV73        A1V30$BATCH


OAFC$SERVER_STARTUP.LOG;79
                     30-APR-1997 08:07:27.05

    No reference to NODE2!!!
    
search OAFC$SERVER_STARTUP.LOG NODE1

$       @sys$startup:oafc$startup SYS$SYSDEVICE:[ALLIN1.DATA_SHARE]
NODE1$SERVER73.DAT;1 NODE1$SRV73
            /PROCESS_NAME        = "NODE1$SRV73" -
$       WRITE/ERROR=write_mbx_error write_mbx "SYS$SYSDEVICE:[ALLIN1.
DATA_SHARE]NODE1$SERVER73.DAT;1"

                configuration_file: SYS$SYSDEVICE:[ALLIN1.DATA_SHARE]
NODE1$SERVER73.DAT;1
                process_name:       NODE1$SRV73


SH SYS 
VAX/VMS V5.5-2  on node NODE2  30-APR-1997 12:49:28.77   Uptime  11 04:21:23
  Pid    Process Name    State  Pri      I/O       CPU       Page flts Ph.Mem

20800219 SYMBIONT_0001   HIB      6  1486754   0 00:55:21.42     38565    293
2080221C NODE1$SRV73     HIB      6      332   0 00:00:05.91      1758   1988
20800257 OA$SYMBIONT_008 HIB      6       21   0 00:00:00.77       200    108
20800258 SYMBIONT_0009   HIB      5    14801   0 00:17:32.01      2142   1354
20802CA2 ALLIN1          CUR  1   4      713   0 00:00:07.33      3778    509
20802AF6 OA$FCV          SUSP     3        5   0 00:00:00.07        66    135
20801EF7 TM_SERVER       LEF      3       26   0 00:00:00.65       381    801




Starting server with the STA option
	NODE1::"73=" submitted to A1V30$BATCH for startup
Nothing happens
	Batch queue A1V30$BATCH, idle, on NODE2::
  /BASE_PRIORITY=4 /JOB_LIMIT=1 /OWNER=[SISTEMAS,SYSTEM] /PROTECTION=(S:E,O:D,
  G:R,W:W)


OAFC$SERVER_STARTUP.LOG;80
                     30-APR-1997 12:08:40.18
Contains as last lines:


$ start_server:
$       @sys$startup:oafc$startup SYS$SYSDEVICE:[ALLIN1.DATA_SHARE]
NODE1$SERVER73.DAT;1 NODE1$SRV73
$!
$!      Copyright information and description at end.
$!
$! Default any parameters not passed to us.
$!
$       IF P1 .EQS. "" THEN P1 = "OA$DATA_SHARE:OAFC$SERVER_CONFIG.DAT"
$       IF P2 .EQS. "" THEN P2 = "A1FS"
$!
$! Check that the configuration file exists before trying to use it for the
$! server.
$!
$       if f$search( P1 ) .eqs. "" then goto No_config_file
$!
$! We'll be running under a different UIC, we'll need CMKRNL for that, but the
$! subprocess will also need a number of other privileges, and hence we need to
$! make sure that we've already got them, since we can't pass them to the
$! subprocess otherwise. Set a symbol to the list so it's easy to change here,
$! in the error message, and in the RUN command.
$!
$       Required_privs = -
          "CMKRNL,SYSLCK,SYSNAM,SYSPRV,PRMMBX,TMPMBX,EXQUOTA,NETMBX,DETACH"
$       IF .not. F$PRIVILEGE( Required_privs ) THEN GOTO priv_server
$!
$! Check a process is not already running with the requested process name
$!
$       if f$type( Ctx ) .eqs. "PROCESS_CONTEXT" then -
            Temp = f$context( "PROCESS",Ctx,"CANCEL")
$       Ctx = ""
$!
$! Select processes that have the name the caller chose (or we defaulted to)
$! for the subprocess, are detached, are owned by OAFC$SERVER and are on this
$! node (the latter is the default).
$!
$       Temp = f$context( "PROCESS",Ctx,"PRCNAM",P2,"EQL" )
$       Temp = f$context( "PROCESS",Ctx,"JOBTYPE","DETACHED","EQL" )
$       Temp = f$context( "PROCESS",Ctx,"UIC","[OAFC$SERVER]","EQL" )
$!
$! Pick up the process ID, which clears the context if the process doesn't
$! exist, otherwise we know a process of the specified name already exists.
$!
$       Temp = f$pid( Ctx )
$       if f$type( Ctx ) .eqs. "PROCESS_CONTEXT"
$       then
$           Temp = f$context( "PROCESS",Ctx,"CANCEL")
$           goto proc_name_error
$proc_name_error:
$       WRITE SYS$OUTPUT -
          "A server process already exists with the name NODE1$SRV73"
A server process already exists with the name NODE1$SRV73
$       EXIT 0
%NONAME-W-NOMSG, Message number 00000000
$
$       write sys$output "Server is being started:"
Server is being started:
$       write sys$output "              configuration_file: ",config_file
                configuration_file: SYS$SYSDEVICE:[ALLIN1.DATA_SHARE]
NODE1$SERVER73.DAT;1
$       write sys$output "              process_name:       ",process_name
                process_name:       NODE1$SRV73

$ exit:
$       write sys$output ""

$       write sys$output ""

$       write sys$output "******************************************", -
                                                "******************************"
************************************************************************
$       write sys$output ""

$       write sys$output ""

$
$       exit
  ALLIN1       job terminated at 30-APR-1997 12:08:42.40

  Accounting information:
  Buffered I/O count:              45         Peak working set size:     433
  Direct I/O count:                57         Peak page file size:      3012
  Page faults:                    493         Mounted volumes:             0
  Charged CPU time:           0 00:00:01.20   Elapsed time:     0 00:00:03.42





Anybody could help us?

Regards,

		Josep

T.R	Title	User	Personal Name	Date	Lines
2653.1	Stop/Start and then upgrade !	IOSG::STANDAGE	My hovercraft is full of eels	`Wed Apr 30 1997 16:00`	30
	JoseP, It would appear as though the FCS is running, but the Management UI is showing it to be Stopped. When you go to SM MFC MFS does it take a long time before saying "Stopped" (~30 seconds?). If this is the case then your server is in trouble and is not responding to connection requests. The quickest solution is to STOP/ID the NODE1::"73=" process and restart it from the SM MFC MFS subsystem. I don't think having the extra node running ALL-IN-1 effected this. In the longterm, there are a few things you should consider. Firstly, ALL-IN-1 V3.0 is very old so you should consider upgrading to V3.1 or (better still) V3.2. There are also a number of ICF patches for V3.1 which address FCS problems which you should install if you cannot upgrade to V3.2. If you really have to stay on ALL-IN-1 V3.0, then at least install the TLC - TeamLinks Connection Package (to V2.1 and then V2.5), as this will also update your servers to newer, better versions. Finally, be aware of VMS patches which exist to resolve problems that appear to be with the servers. There are already many notes in this conference about them... -Kevin.
2653.2	cluster alias?	VELI::KORKKO	Veli K�rkk� @FNO, 879-5512	`Wed Apr 30 1997 17:30`	6
	Do you have DECnet/OSI there by any chance? Do you have cluster alias enabled on NODE2 by any chance? There used to be case that having alias outgoing enabled on the FCS object (or session control application) caused this kind of problem. _veli
2653.3	I'm losing it, time to retire gracefully	IOSG::STANDAGE	My hovercraft is full of eels	`Wed Apr 30 1997 17:57`	10
	Ah..yes! Veli is indeed correct. I didn't realise that it's from NODE2 that you can't see the server running on NODE1 - my apologies, I was reading your note too quickly. The alias enabled could potentially be the problem... Kevin.
2653.4	Only DECnet	BARNA::DSMAIL		`Mon May 05 1997 16:16`	7
	The customer has only DECnet Phase IV. Any verification could be done in that? Best Regards, Josep
2653.5		IOSG::STANDAGE	My hovercraft is full of eels	`Mon May 05 1997 16:27`	12
	Josep, What does : NCP> SHOW EXEC CHAR show ? Kevin.
2653.6		AIMTEC::ZANIEWSKI_D	Add Jean to the list of deserters!	`Mon May 05 1997 16:43`	5
	$SHOW SYSTEM is all that you can do. The server indicating the wrong status on the management menu is a known problem for version 3.0 and 3.1, and fixed in 3.2 according to my IPMT. Dave Zaniewski
2653.7	I always did like history lessons...	IOSG::STANDAGE	My hovercraft is full of eels	`Mon May 05 1997 17:10`	11
	Dave, Which problem is that ? I know there's a problem when you have lots of servers running (Max DASL sessions), but in this case they only have one or two servers running. Just curious, Kevin.
2653.8		AIMTEC::ZANIEWSKI_D	Add Jean to the list of deserters!	`Mon May 05 1997 18:46`	7
	Hi Kevin, HPXQB2DDE C961101-5797 And Martin Cook references FC_SERVER_ICOS #189.0 Dave Zaniewski
2653.9	Not sure this is the same problem	IOSG::STANDAGE	My hovercraft is full of eels	`Tue May 06 1997 08:14`	13
	Yes, this is the client exceeding the maximum number of DASL connections in one session (or to that effect), which is usually reproducable only when you have multiple FCS and AIDAs and you elect to view the status on each one (I think 12 is the critical number). So, this might not be the same problem, but I would certainly suggest they upgrade first and take it from there... -Kevin.
2653.10	A relink could help?	BARNA::DSMAIL		`Tue May 06 1997 15:08`	5
	There is not cluster alias. Before upgrade, could a relink be done to check if helps? Best Regards, Josep