T.R | Title | User | Personal Name | Date | Lines |
---|
294.1 | Get latest patches? Autogen? get a dump | VMSSPT::JENKINS | Kevin M Jenkins VMS Support Engineering | Wed Apr 09 1997 06:35 | 21 |
|
Are you running with latest patch kits. There were some things fixed
around this area... I'm not sure if they were VAX of ALPHA, but you
should at least have VAXSHAD09, not for shadowing but for the
cluster code. Then check for kits for SYS$SCS, like VAXSCSxxx.
Just for starters... When the port reset there should have been some
registers printed out. Get them decoded to find out what error
interrupt was set. Another possibility could be not enough Nonpaged
Pool... perhaps when they try to cluster the systme runs out of pool
this could cause a hang...
If all else fails you'll need to crash both systems at the same time..
Hope you have seperate crash dumps... You should halt them both at the
same time then force a crash. This would be needed for someone to
try and figure out what is going on. It's important to halt them both
as crashing one while the other is still running will change the
state of things.
Kevin
|
294.2 | patches installed | TOSSUB::BRUSA | | Wed Apr 09 1997 06:52 | 7 |
| I have forgot that I have installed the following patches:
VAXSHAD09_U2055 & VAXDRIV04_070
without success.
Livio Brusa
|
294.3 | DSSI Node IDs? | BRUNEL::KIRBY | | Thu Apr 10 1997 07:10 | 14 |
| I hope you have set the 4000-100 systems to different DSSI Node IDs. Typically
I would set one of them to 7, the other to 6, and the HSD to 0. No duplicates
allowed on the bus. Don't forget to check any internal RFxx drives also.
On the later 4100s I think it is a console command (type "help" and look for
"show" and "set" DSSI something) ... the earlier units I suspect there was a
jumper.
However your drawing implies a 4000-100A, so as long as the system firmware
is reasonably up-to-date it should be done from the console.
Steve.
|
294.4 | Are the machines 4108s ? | KERNEL::MEGARITY | I remember when Rock was young | Thu Apr 10 1997 12:45 | 163 |
| Author : MARCI R POTTER
User type : DBA
Location : USTIMA
Vaxmail address : CSC32::POTTER
Copyright (c) Digital Equipment Corporation 1997. All rights reserved.
+---------------------------+TM
| | | | | | | |
| d | i | g | i | t | a | l | TIME DEPENDENT BLITZ
| | | | | | | |
+---------------------------+
BLITZ TITLE: Firmware Update for Vax 4000-108
PRIORITY LEVEL:
DATE:March 26,1997
TD #: 2269
AUTHOR:Heather Kane
DTN:223-4712
EMAIL:[email protected]
DEPARTMENT:RSE
=================================================================
PRODUCT NAMES:Vax 4000 108
PRODUCT FAMILY:
Storage ___
Systems/OS _X_
Networks ___
PC/Peripherals ___
Software Apps. ___
BLITZ TYPE:
Maintenance Tip TIMA::INFO_X_
Service Action Requested ___
IF SERVICE ACTION IS REQUESTED: (Check all that apply.)
Labor Support Required _X_
Material Support Required ___
Estimated time to complete activity (in hours):
Will this require a change in the field's inventory: Yes __X_ No ___
Will an FCO be associated with this advisory? Yes ___ No _X_
DESCRIPTION OF SERVICE ACTIVITY REQUESTED (if applicable):
Firmware needs to be updated from V1.0 to V2.0 on VAX 4000-108
*******************************************************************
PROBLEM STATEMENT:
When trying to install VAX 4000-108's in a DSSI cluster with other
Vax4000-108's or any other clusterable sytem the cluster may not
configure properly and may cause hangs when a second system is booted.
SYMPTOM:
No matter what DSSI ID is set at console, when OVMS boots the ID is
always 7 so there is a conflict.
SOLUTION:
This is caused by OVMS not using the ID configured by the console with
the SET_DSSI ID command. This is caused by a legacy code issue,
whereby OVMS will only use the DSSI ID value if Console firmware
Version is above 2.X.
The firmware needs to be updated to V2.0. The version is available
by copying from :
may21::WRK:[MOPLOAD]kacat_v20_1.sys
UPDATING PROCESS:
The new firmware file needs to be copied to a mop$load area and then
perform the update. Most systems have the firmware enable jumper
(W3) on the CPU modules installed, which allows Firmware updates. If
there is any problem with updating refer to the Vax4000-108 On-line
Service Guide as a reference.
***** On Server System *****
$ MCR NCP
NCP>SET CIRCUIT ISA-0 STATE OFF
NCP>SET CIRCUIT ISA-0 SERVICE ENABLED
NCP>SET CIRCUIT ISA-0 STATE ON
NCP>EXIT
$
$ COPY kacat_v20_1.sys MOM$LOAD:*.*
$
***** On Client System *****
>>>b/100 eza0
(BOOT/R5:100 EZA0)
2..
Bootfile: kacat_v20_1
-EZA0
1..0..
FEPROM update program
---CAUTION---
--- Executing this program will change your current FEPROM ---
Do you want to continue [Y/N] ? : y
Blasting in V2.0-1. The program will take at most several minutes.
DO NOT ATTEMPT TO INTERRUPT PROGRAM EXECUTION
Doing so may result in loss of operable state !!!
+----------------------------------------+
10...9...8...7...6...5...4...3...2...1...0
FEPROM Programming successful
?06 HLT INST
PC = 00008E24
>>>
cycle power
VERIFICATION:
Set DSSI ID on multi-node cluster as required and verify there are
no conflicts.
LARS INFORMATION: (Supplied by MCS)
Attention Service Personnel: Begin the comment field of your LARS
with the word "BLITZ" when you perform an activity associated with a
BLITZ Type "Service Action Requested".
*** DIGITAL INTERNAL USE ONLY ***
\\ GRP=TIME_DEPENDENT CAT=HARDWARE DB=CSSE_TIME_CRITICAL
\\ TYPE=KNOWN_PROBLEM TYPE=BLITZ STATUS=CURRENT PROD=DEC4000-XXX PROD=4XXX
|
294.5 | | TOSSUB::BRUSA | | Fri Apr 11 1997 07:13 | 14 |
| ***** Answer .3
The dssi id are different ,on the first cpu dssi 0 have id 7 and dssi 1
have id 5, on the second cpu the dssi id are 6 and 4 ,the HSD30 have dssi id 1.
No internal or external RFxx drive are connected .
The firmware version of the ka52a is 2.4 with vmb version 2.15.
***** Answer .4
The two system are 4000 model 100a.
Livio Brusa
|
294.6 | Later VMS required. | BRUNEL::KIRBY | | Fri Apr 11 1997 12:20 | 21 |
| I still suspect a DSSI ID problem. From memory, when the 4100 firmware was
updated to include the setable DSSI IDs there was a blitz of some sort
(that I can't find) that said that VMS was not yet coded to use the new method.
Note .4 is the same issue in reverse .... VMS is now looking for the V2 console.
Perhaps it is VMS V5.5-2 that doesn't support the new method???
So I've just dug out of the cupboard and read the "BA42B Based system DSSI
Upgrade Manual", EK-500AA-UP. This says that VMS 5.5-2H4 is needed, so I
would suggest you give that a try.
Also there is a STARS article that says the same, minimum of V5.5-2H4 or V6.1
to support the KFDDA-A (the dual-DSSI module).
Steve.
|
294.7 | upgraded to vms 5.5.2H4 | TOSSUB::BRUSA | | Wed May 14 1997 10:41 | 9 |
| I'm sorry for the late response but only in the last weekend the
customer have upgraded to Vms 5.5-2H4 but the problem is always
present.
Please someone have some suggestion.
Thank you
Livio Brusa
|
294.8 | Same root ? | STOWKS::SLUIS | Hans van Sluis - StorageWorks Engineering Support Europe- DTN 889 9526 | Fri May 16 1997 10:04 | 10 |
| Livio,
This may sound stupid, but we're not booting from the same system
root, are we.
Did you setup your bootflags on node A as 0,1 and on node B as 1,1
(or as 0,0 and 1,0). This will cause node A to boot from SYS0 and node B
from SYS1.
Hans
|
294.9 | different root | TOSUP1::BRUSA | | Mon May 19 1997 02:17 | 7 |
|
Hy Hans
The two systems boot from two different roots , the R5 registers are set
at 00000000 and 10000000.
Livio Brusa
|
294.10 | help | SSDEVO::ASTOR | Subsystems Engineering Support | Thu May 22 1997 15:29 | 37 |
| Hi Livio,
I'll just cover the stuff from the controller/DSSI side and leave the
VMS stuff to Kevin and others.
First please see blitz I wrote in note 114. I know that you have
drawn that you are using DSSI bus 0, but it might help anyway.
To make sure you dont have a DSSI conflict, bring up one system,
do a $ show cluster/cont, the add rport. Shut this system down and
bring up the other one and do the same thing. Then, you'll know for
sure that console/openvms/hsd30 isnt tricking you. Notice I didnt
single anyone out here :).
While the system is up and running, run vtdpy on the HSD30. You
should get a few NOR's (no response) per second, because the controller
is polling for DSSI ID's that dont exist on the bus. NAK's should be 0.
If NAKS are not 0, then there is a possible bus integrity problem.
I would recommend putting the HSD30 on the end of the bus with an
external terminator. Also, in that configuration, you could try DSSI
bus 1 (see note 114). The best part is you get to see the term pwr
light on the terminator to make sure you have termination power.
I know this sounds strange, but I have in my possesion a terminator
that is bad and causes very similar symptoms. I keep it around for
training purposes :). Could also be a bad cable or no termination.
This DSSI is robust stuff, things can be marginal and if the legnth is
short, it will work anyway.
If you could provide an errorlog it would be helpful, as well as a
SHOW THIS command from the controller.
Good luck,
Kurt
|
294.11 | Problem solved | TOSUP1::BRUSA | | Tue Jun 03 1997 02:49 | 18 |
| Hi
I'm sorry for the delay in the updating ,but I'have been out of office for
some weeks .
For some reason that I don't known the customer after the update
from version 5.5-2 to version 5.5-2h4, don't have check if the configuration
work correctly.
I'm very furious with the customer because when I have ask to him if the
problem has been corrected the answer has been "the problem still exist".
I was gone in the last week to the customer site and I have boot the two
system in cluster without any problem.
I apologize for the mistake and thank everybody .
Regards to all
Livio Brusa
|