| Title: | HSZ40 Product Conference |
| Moderator: | SSDEVO::EDMONDS |
| Created: | Mon Apr 11 1994 |
| Last Modified: | Fri Jun 06 1997 |
| Last Successful Update: | Fri Jun 06 1997 |
| Number of topics: | 902 |
| Total number of notes: | 3319 |
I've been through a lot of notes in several conferences, and can't see
any similar problem to this and can't find out where to go next.
We just got a new HSZ40 delivered. At the moment, I'm using a single
cable to connect it to one KZPSA on one Alphaserver 1000. When
everything comes up, the HSZ40 is seen by the Alpha but doesn't respond
properly. Once the system boots, the disks on the HSZ40 can be seen
but can't be accessed. Any attempt to access the disks hangs the
system.
I've checked at the HSZ40 console, self test runs without errors, the
configuration looks o.k., I can test / exercise the disks without
errors, etc. There is no SCSI ID conflict that I can see (HSZ40 is at
ID 6, KZPSA is at ID 2 or ID 7). There is a "Y" adapter with a
terminator at the HSZ40 end, the cable runs to one KZPSA adaptor on one
CPU, and should have an internal terminator. I've tried this system on
two different 1000s and one one 2100 and get the same results. I can
plug a StorageWorks box into the same systems and they work, so the
controller at the CPU end is o.k.
I suppose it's possible that there is something mis-configured in the
HSZ40, but I can't imagine what: I've checked everything I can find in
the manuals, I've tried re-configuring disk drives manually and using
CFMENU, I've tried different disks, and so on, but nothing changes. My
only other thought is that the unit was bad on delivery, or that the
cable is somehow bad in a way that allows it to partially function.
Below is a log of the console from one system showing what happens.
You can see some error messages come out when the system polls the
controllers, so the problem apparently is at a low level.
Any suggestions on where to go next?
SYSTEM SHUTDOWN COMPLETE
halted CPU 0
halt code = 5
HALT instruction executed
PC = ffffffff8006df28
>>>init
ff.fe.fd.fc.fb.fa.f9.f8.f7.f6.f5.ef.df.ee.f4.
probing hose 0, PCI
probing PCI-to-EISA bridge, bus 1
probing PCI-to-PCI bridge, bus 2
bus 2, slot 0 -- pka -- QLogic ISP1020
bus 2, slot 1 -- ewa -- DECchip 21140-AA
bus 0, slot 11 -- pkb -- DEC KZPSA
bus 0, slot 12 -- pkc -- DEC KZPSA
bus 0, slot 13 -- fwa -- DEC PCI FDDI
ed.ec.eb.....ea.e9.e8.e7.e6.e5.e4.e3.e2.e1.e0.
V4.7-179, built on Dec 17 1996 at 14:26:45
>>>sho devi
waiting for pkc0.7.0.12.0 to poll... <-
amcsr_lo = 8 <- THESE MESSAGES OCCUR ONLY WHEN
abbrr_lo = 200a40f <- THE SYSTEM IS CONNECTED TO
dafqir_lo = 80052b1 <- THE HSZ40
dacqir_lo = 8005459 <-
asr_lo = 10 <- MORE MESSAGES COME OUT DURING
afar_lo = 0 <- THE BOOT PROCESS, BELOW
afpr_lo = 30a <-
waiting for pkc0.7.0.12.0 to poll... <-
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
dka0.0.0.2000.0 DKA0 RZ26N 0616
dka400.4.0.2000.0 DKA400 RRD45 0436
dva0.0.0.1000.0 DVA0
ewa0.0.0.2001.0 EWA0 00-00-F8-03-E6-74
fwa0.0.0.13.0 FWA0 00-00-F8-4A-A0-04
pka0.7.0.2000.0 PKA0 SCSI Bus ID 7 2.10
pkb0.7.0.11.0 PKB0 SCSI Bus ID 7 P01 A10
pkc0.7.0.12.0 PKC0 SCSI Bus ID 7 P01 A10
>>>sho conf
Digital Equipment Corporation
AlphaServer 1000A 4/***
Firmware
SRM Console: V4.7-179
ARC Console: 4.49
PALcode: VMS PALcode V5.56-6, OSF PALcode X1.45-12
Serial Rom: V2.8
Processor
DECchip (tm) 21064A-2 233MHz
Memory
64 Meg of System Memory
Bank 0 = 64 Mbytes(16 MB Per Simm) Starting at 0x00000000
Bank 1 = No Memory Detected
Bank 2 = No Memory Detected
Bank 3 = No Memory Detected
Slot Option Hose 0, Bus 0, PCI
7 Intel 82375EB Bridge to Bus 1, EISA
8 DECchip 21050-AA Bridge to Bus 2, PCI
11 DEC KZPSA pkb0.7.0.11.0 SCSI Bus ID 7
12 DEC KZPSA pkc0.7.0.12.0 SCSI Bus ID 7
13 DEC PCI FDDI fwa0.0.0.13.0 00-00-F8-4A-A0-04
Slot Option Hose 0, Bus 1, EISA
Slot Option Hose 0, Bus 2, PCI
0 QLogic ISP1020 pka0.7.0.2000.0 SCSI Bus ID 7
dka0.0.0.2000.0 RZ26N
dka400.4.0.2000.0 RRD45
1 DECchip 21140-AA ewa0.0.0.2001.0 00-00-F8-03-E6-74
>>>b
ff.fe.fd.fc.fb.fa.f9.f8.f7.f6.f5.ef.df.ee.f4.
probing hose 0, PCI
probing PCI-to-EISA bridge, bus 1
probing PCI-to-PCI bridge, bus 2
bus 2, slot 0 -- pka -- QLogic ISP1020
bus 2, slot 1 -- ewa -- DECchip 21140-AA
bus 0, slot 11 -- pkb -- DEC KZPSA
bus 0, slot 12 -- pkc -- DEC KZPSA
bus 0, slot 13 -- fwa -- DEC PCI FDDI
ed.ec.eb.....ea.e9.e8.e7.e6.e5.e4.e3.e2.e1.e0.
V4.7-179, built on Dec 17 1996 at 14:26:45
CPU 0 booting
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
error on pkc0.6.0.12.0, cmd = 12, sts = 48, camh->status = 19
amcsr_lo = 8
abbrr_lo = 200a40f
dafqir_lo = 802b0f9
dacqir_lo = 80290a5
asr_lo = 10
afar_lo = 0
afpr_lo = 30a
SIMport Adapter error: asr = 10, afpr = 30a
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
CAM command EXECUTE_SCSI_IO timed out
(boot dka0.0.0.2000.0 -flags 0,0)
FRU table creation disabled
block 0 of dka0.0.0.2000.0 is a valid boot block
reading 904 blocks from dka0.0.0.2000.0
bootstrap code read in
base = 1c2000, image_start = 0, image_bytes = 71000
initializing HWRPB at 2000
initializing page table at 3ff0000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code
error on pkc0.6.0.12.0, cmd = 12, sts = 48, camh->status = 19
OpenVMS (TM) Alpha Operating System, Version V7.1
[remainder of system boot looks normal.]
$ sho dev
Device Device Error Volume Free Trans Mnt
Name Status Count Label Blocks Count Cnt
DAD0: Online 0
PRFMK1$DKB601: Offline 1
PRFMK1$DKC0: Mounted 0 AXPVMSSYS 1008792 308 1
PRFMK1$DKC400: Online wrtlck 0
PRFMK1$DVA0: Online 0
Device Device Error
Name Status Count
FTA0: Offline 0
LTA0: Offline mounted 0
OPA0: Online 0
RTA0: Offline 0
RTB0: Offline 0
TTA0: Online 0
Device Device Error
Name Status Count
LRA0: Online 0
Device Device Error
Name Status Count
EWA0: Online 0
EWA2: Online 0
EWA3: Online 0
FWA0: Online 0
FWA2: Online 0
FWA4: Online 0
FWA5: Online 0
GQA0: Online 0
IKA0: Offline 0
IMA0: Offline 0
INA0: Offline 0
LAST0: Online 0
MPA0: Online 0
OPA2: Online 0
OPA3: Online 0
PKA0: Online 0
PKB0: Online 1
PKC0: Online 0
WSA0: Offline 0
WSA1: Online 0
$ sho dev/fu dkb601
Disk PRFMK1$DKB601:, device type unknown, is offline, file-oriented device,
shareable, available to cluster, error logging is enabled.
Error count 1 Operations completed 0
Owner process "" Owner UIC [SYSTEM]
Owner process ID 00000000 Dev Prot S:RWPL,O:RWPL,G:R,W
Reference count 0 Default buffer size 512
| T.R | Title | User | Personal Name | Date | Lines |
|---|---|---|---|---|---|
| 760.1 | SSDEVO::T_GONZALES | Fri Feb 07 1997 17:47 | 9 | ||
When you said that you tried the system on different cpu's does that
mean that you tried different kzpsa's or the same kzpsa, the error
entry seems to indicate a an unrecoverable adapter error, Have you
viewed the kszpa paramters with the arc console and the kzpsa
utility floppy. Try doing a show pk* from the console and see what you
get. Could you provide a drawing showing the exact scsi config
including hsz's. What type of cable are you using and what kind of
terminators?
| |||||
| 760.2 | SSDEVO::T_GONZALES | Fri Feb 07 1997 17:48 | 2 | ||
ONe additional note, when you try to access the hsz when the sytem is
up, what are you getting in the error log entry?
| |||||
| 760.3 | Error Log | EVMS::PIRULO::LEDERMAN | B. Z. Lederman | Mon Feb 10 1997 07:08 | 197 |
The adapter has been tried on four different KZPSAs on three different
host systems. The KZPSAs were NOT moved between systems: each system
has it's own adaptors. Two of the systems are brand new 1000As, each
with two KZPSAs installed an configured by Digital before delivery.
The third is a 2100: I don't know who installed the KZPSA but it's been
in use for more than a year directly connected to a BA356 and DWZZA.
If I shut down the system, remove the cable to the BA356 and connect
the cable to the HSZ40 and restart, I get the symptoms: I can see the
HSZ40 and disk units but can't access any of them.
I tried moving the BA356 to the 1000A (which is where the HSZ40 is
normally connected) and it works there, so the 1000A and controller are
apparently o.k.
For the moment, the normal configuration is for one cable to go to one
1000A. Eventually there will be a cluster but not until I get this
setup working with one system. So there is really nothing to draw.
There is a trilink connector at the HSZ40 with a terminator in one
socket and a cable in the other: the cable goes directly to one KZPSA
socket in one system.
The following are the only error log entries I have (other than normal
system startup and shutdown). They do appear to occur each time the
system starts up and attempts to access the HSZ.
%ERF-I-UNKENTRY, unknown entry type, 37
******************************* ENTRY 63. *******************************
ERROR SEQUENCE 867. LOGGED ON: CPU_TYPE 00000006
DATE/TIME 5-FEB-1997 13:18:15.60 SYS_TYPE 0000001B
SYSTEM UPTIME: 0 DAYS 00:00:45
SCS NODE: PRFMK1 OpenVMS AXP V7.1
HW_MODEL: FFFFFFFF Hardware Model = 4294967295.
DEVICE ATTENTION AlphaServer 1000A 4/***
GENERIC DK SUB-SYSTEM, UNIT _PRFMK1$DKB601:
HW REVISION 00000000
HW REVISION = ....
ERROR TYPE 03
COMMAND TRANSMISSION FAILURE
SCSI ID 06
SCSI ID = 6.
SCSI LUN 00
SCSI LUN = 0.
SCSI SUBLUN 01
SCSI SUBLUN = 1.
PORT STATUS 00000054
%SYSTEM-F-CTRLERR, FATAL CONTROLLER
ERROR
SCSI CMD 00000012
0080
INQUIRY
SCSI STATUS 02
CHECK CONDITION
UCB$L_ERTCNT 224F4706
6. RETRIES REMAINING
UCB$L_ERTMAX 000000AA
170. RETRIES ALLOWABLE
ORB$L_OWNER 00010004
OWNER UIC [001,004]
UCB$L_CHAR 1C454008
DIRECTORY STRUCTURED
FILE ORIENTED
SHARABLE
AVAILABLE
ERROR LOGGING
CAPABLE OF INPUT
CAPABLE OF OUTPUT
RANDOM ACCESS
UCB$L_STS 08000110
ONLINE
BUSY
UCB$L_OPCNT 00000000
0. QIO'S THIS UNIT
UCB$L_ERRCNT 00000001
1. ERRORS THIS UNIT
IRP$L_BCNT 00000000
TRANSFER SIZE 0. BYTE(S)
IRP$L_BOFF 00000000
TRANSFER PAGE ALIGNED
IRP$L_PID 00000000
REQUESTOR "PID"
IRP$Q_IOSB 00000000
00000000 IOSB, 0. BYTE(S) TRANSFERRED
******************************* ENTRY 64. *******************************
ERROR SEQUENCE 868. LOGGED ON: CPU_TYPE 00000006
DATE/TIME 5-FEB-1997 13:18:15.98 SYS_TYPE 0000001B
SYSTEM UPTIME: 0 DAYS 00:00:46
SCS NODE: PRFMK1 OpenVMS AXP V7.1
HW_MODEL: FFFFFFFF Hardware Model = 4294967295.
"UNKNOWN DEVICE" ENTRY AlphaServer 1000A 4/***
ERROR LOG RECORD
ERF$L_SID FFFFFFFF
SYSTEM ID REGISTER
ERL$W_ENTRY 0062
ERROR ENTRY TYPE
EXE$GQ_SYSTIME 8276F391
009AF6EF 64 BIT TIME WHEN ERROR LOGGED
ERL$GL_SEQUENCE 0364
UNIQUE ERROR SEQUENCE = 868.
UCB$L_STS 00000010
DEVICE STATUS
UCB$B_DEVCLASS 80
DEVICE CLASS = 128.
UCB$B_DEVTYPE 2D
DEVICE TYPE = 45.
UCB$W_UNIT 0000
PHYSICAL UNIT NUMBER = 0.
UCB$L_ERRCNT 00000001
UNIT ERROR COUNT = 1.
UCB$L_OPCNT 0000001F
UNIT OPERATION COUNT = 31.
ORB$L_OWNER 00010004
OWNER UIC = [001,004]
UCB$L_DEVCHAR 0C440000
DEVICE CHARACTERISTICS
UCB$B_SLAVE 00
DEVICE SLAVE CONTROLLER = 0.
DDB$T_NAME 4652500A
24314B4D
00424B50
00000000
/PRFMK1$PKB/
LONGWORD 1. 0000003D
LONGWORD 2. 07000B02
LONGWORD 3. 47FF0000
LONGWORD 4. 00008032
LONGWORD 5. 00000000
LONGWORD 6. 00000000
LONGWORD 7. 00000000
LONGWORD 8. 00000000
LONGWORD 9. 11000000
LONGWORD 10. 00000000
LONGWORD 11. 00000007
LONGWORD 12. 00000000
LONGWORD 13. 00000000
LONGWORD 14. 00000000
LONGWORD 15. 44000000
LONGWORD 16. 20204345
LONGWORD 17. 20313050
LONGWORD 18. 30314120
LONGWORD 19. 00202020
LONGWORD 20. 1C000000
LONGWORD 21. 00000000
LONGWORD 22. 00000000
LONGWORD 23. 00000000
LONGWORD 24. 00000000
LONGWORD 25. 00000000
LONGWORD 26. 00000000
LONGWORD 27. 00000000
LONGWORD 28. 00000000
LONGWORD 29. 00000000
LONGWORD 30. 00000000
LONGWORD 31. 00000000
LONGWORD 32. 00000000
LONGWORD 33. 00000000
LONGWORD 34. 00000000
LONGWORD 35. 00000000
LONGWORD 36. 00000000
LONGWORD 37. 00000000
LONGWORD 38. 00000000
LONGWORD 39. 00000000
LONGWORD 40. 00000000
LONGWORD 41. 00000000
LONGWORD 42. 00000000
LONGWORD 43. 00000000
LONGWORD 44. 00000000
LONGWORD 45. 00000000
LONGWORD 46. 00000000
LONGWORD 47. 00000000
LONGWORD 48. 00000000
LONGWORD 49. 00000010
LONGWORD 50. 00000000
LONGWORD 51. 0000030A
LONGWORD 52. 00000016
LONGWORD 53. 0000001A
LONGWORD 54. 00000001
LONGWORD 55. 00000019
LONGWORD 56. 00000006
LONGWORD 57. 00000001
LONGWORD 58. 0000001D
LONGWORD 59. 00000000
LONGWORD 60. 20005BF0
LONGWORD 61. 00000000
LONGWORD 62. 00000000
| |||||
| 760.4 | More information (console variables) | EVMS::PIRULO::LEDERMAN | B. Z. Lederman | Mon Feb 10 1997 10:23 | 336 |
Here is more information (probably more than you asked for).
>>>sho p*
pal VMS PALcode V5.56-6, OSF PALcode X1.45-12
pci_parity off
pka0_host_id 7
pka0_soft_term on
pkb0_fast 1
pkb0_host_id 1
pkb0_termpwr 1
pkc0_fast 1
pkc0_host_id 1
pkc0_termpwr 1
>>>sho dev
waiting for pkc0.1.0.12.0 to poll...
amcsr_lo = 8
abbrr_lo = 200a40f
dafqir_lo = 80052c1
dacqir_lo = 80054b1
asr_lo = 10
afar_lo = 0
afpr_lo = 30a
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
dka0.0.0.2000.0 DKA0 RZ26N 0616
dka400.4.0.2000.0 DKA400 RRD45 0436
dva0.0.0.1000.0 DVA0
ewa0.0.0.2001.0 EWA0 00-00-F8-03-E6-74
fwa0.0.0.13.0 FWA0 00-00-F8-4A-A0-04
pka0.7.0.2000.0 PKA0 SCSI Bus ID 7 2.10
pkb0.1.0.11.0 PKB0 SCSI Bus ID 1 P01 A10
pkc0.1.0.12.0 PKC0 SCSI Bus ID 1 P01 A10
>>>b
ff.fe.fd.fc.fb.fa.f9.f8.f7.f6.f5.ef.df.ee.f4.
probing hose 0, PCI
probing PCI-to-EISA bridge, bus 1
probing PCI-to-PCI bridge, bus 2
bus 2, slot 0 -- pka -- QLogic ISP1020
bus 2, slot 1 -- ewa -- DECchip 21140-AA
bus 0, slot 11 -- pkb -- DEC KZPSA
bus 0, slot 12 -- pkc -- DEC KZPSA
bus 0, slot 13 -- fwa -- DEC PCI FDDI
ed.ec.eb.....ea.e9.e8.e7.e6.e5.e4.e3.e2.e1.e0.
V4.7-179, built on Dec 17 1996 at 14:26:45
CPU 0 booting
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
error on pkc0.6.0.12.0, cmd = 12, sts = a8, camh->status = 19
amcsr_lo = 8
abbrr_lo = 200a40f
dafqir_lo = 802c835
dacqir_lo = 8029031
asr_lo = 10
afar_lo = 0
afpr_lo = 30a
SIMport Adapter error: asr = 10, afpr = 30a
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
CAM command EXECUTE_SCSI_IO timed out
(boot dka0.0.0.2000.0 -flags 0,0)
FRU table creation disabled
block 0 of dka0.0.0.2000.0 is a valid boot block
reading 904 blocks from dka0.0.0.2000.0
bootstrap code read in
base = 1c2000, image_start = 0, image_bytes = 71000
initializing HWRPB at 2000
initializing page table at 3ff0000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code
error on pkc0.6.0.12.0, cmd = 12, sts = 48, camh->status = 19
OpenVMS (TM) Alpha Operating System, Version V7.1
[Remainder of boot normal]
$ sho dev d/fu
Disk DAD0:, device type unknown, is online, file-oriented device, shareable,
error logging is enabled, device is a template only.
Error count 0 Operations completed 0
Owner process "" Owner UIC [SYSTEM]
Owner process ID 00000000 Dev Prot S:RWPL,O:RWPL,G:R,W
Reference count 0 Default buffer size 512
Disk PRFMK1$DKB601:, device type unknown, is offline, file-oriented device,
shareable, available to cluster, error logging is enabled.
Error count 1 Operations completed 0
Owner process "" Owner UIC [SYSTEM]
Owner process ID 00000000 Dev Prot S:RWPL,O:RWPL,G:R,W
Reference count 0 Default buffer size 512
Disk PRFMK1$DKC0:, device type DEC RZ26N, is online, mounted, file-oriented
device, shareable, available to cluster, error logging is enabled.
Error count 0 Operations completed 4789
Owner process "" Owner UIC [SYSTEM]
Owner process ID 00000000 Dev Prot S:RWPL,O:RWPL,G:R,W
Reference count 143 Default buffer size 512
Total blocks 2050860 Sectors per track 83
Total cylinders 3089 Tracks per cylinder 8
Volume label "AXPVMSSYS" Relative volume number 0
Cluster size 3 Transaction count 308
Free blocks 986739 Maximum files allowed 256357
Extend quantity 5 Mount count 1
Mount status System Cache name "_PRFMK1$DKC0:XQPCACHE"
Extent cache size 64 Maximum blocks in extent cache 98673
File ID cache size 64 Blocks currently in extent cache 10158
Quota cache size 0 Maximum buffers in FCP cache 354
Volume owner UIC [1,1] Vol Prot S:RWCD,O:RWCD,G:RWCD,W:RWCD
Volume Status: subject to mount verification, protected subsystems enabled,
file high-water marking, write-through caching enabled.
Disk PRFMK1$DKC400:, device type DEC RRD45, is online, file-oriented device,
shareable, available to cluster, error logging is enabled.
Error count 0 Operations completed 0
Owner process "" Owner UIC [SYSTEM]
Owner process ID 00000000 Dev Prot S:RWPL,O:RWPL,G:R,W
Reference count 0 Default buffer size 512
Disk PRFMK1$DVA0:, device type RX26, is online, file-oriented device, shareable,
error logging is enabled.
Error count 0 Operations completed 0
Owner process "" Owner UIC [SYSTEM]
Owner process ID 00000000 Dev Prot S:RWPL,O:RWPL,G:R,W
Reference count 0 Default buffer size 512
Total blocks 2880 Sectors per track 18
Total cylinders 80 Tracks per cylinder 2
$ anal/err/since=10-feb-1997:12:14
Error Log Report Generator Version V6.1
******************************* ENTRY 86. *******************************
ERROR SEQUENCE 1591. LOGGED ON: CPU_TYPE 00000006
DATE/TIME 10-FEB-1997 12:18:19.14 SYS_TYPE 0000001B
SYSTEM UPTIME: 0 DAYS 00:00:04
SCS NODE: PRFMK1 OpenVMS AXP V7.1
HW_MODEL: FFFFFFFF Hardware Model = 4294967295.
SYSTEM START-UP AlphaServer 1000A 4/***
TIME OF DAY CLOCK 24DD085A
******************************* ENTRY 87. *******************************
ERROR SEQUENCE 1592. LOGGED ON: CPU_TYPE 00000006
DATE/TIME 10-FEB-1997 12:18:19.74 SYS_TYPE 0000001B
SYSTEM UPTIME: 0 DAYS 00:00:04
SCS NODE: PRFMK1 OpenVMS AXP V7.1
HW_MODEL: FFFFFFFF Hardware Model = 4294967295.
MOUNT VOLUME AlphaServer 1000A 4/***
UNIT _PRFMK1$DKC0:, VOLUME LABEL "AXPVMSSYS"
210. QIO OPERATIONS THIS UNIT, 0. ERRORS THIS UNIT
******************************* ENTRY 88. *******************************
ERROR SEQUENCE 1593. LOGGED ON: CPU_TYPE 00000006
DATE/TIME 10-FEB-1997 12:19:00.59 SYS_TYPE 0000001B
SYSTEM UPTIME: 0 DAYS 00:00:46
SCS NODE: PRFMK1 OpenVMS AXP V7.1
HW_MODEL: FFFFFFFF Hardware Model = 4294967295.
DEVICE ATTENTION AlphaServer 1000A 4/***
GENERIC DK SUB-SYSTEM, UNIT _PRFMK1$DKB601:
HW REVISION 00000000
HW REVISION = ....
ERROR TYPE 03
COMMAND TRANSMISSION FAILURE
SCSI ID 06
SCSI ID = 6.
SCSI LUN 00
SCSI LUN = 0.
SCSI SUBLUN 01
SCSI SUBLUN = 1.
PORT STATUS 00000054
%SYSTEM-F-CTRLERR, FATAL CONTROLLER
ERROR
SCSI CMD 00000012
0080
INQUIRY
SCSI STATUS 02
CHECK CONDITION
UCB$L_ERTCNT 24DD085A
90. RETRIES REMAINING
UCB$L_ERTMAX 000000AA
170. RETRIES ALLOWABLE
ORB$L_OWNER 00010004
OWNER UIC [001,004]
UCB$L_CHAR 1C454008
DIRECTORY STRUCTURED
FILE ORIENTED
SHARABLE
AVAILABLE
ERROR LOGGING
CAPABLE OF INPUT
CAPABLE OF OUTPUT
RANDOM ACCESS
UCB$L_STS 08000110
ONLINE
BUSY
UCB$L_OPCNT 00000000
0. QIO'S THIS UNIT
UCB$L_ERRCNT 00000001
1. ERRORS THIS UNIT
IRP$L_BCNT 00000000
TRANSFER SIZE 0. BYTE(S)
IRP$L_BOFF 00000000
TRANSFER PAGE ALIGNED
IRP$L_PID 00000000
REQUESTOR "PID"
IRP$Q_IOSB 00000000
00000000 IOSB, 0. BYTE(S) TRANSFERRED
******************************* ENTRY 89. *******************************
ERROR SEQUENCE 1594. LOGGED ON: CPU_TYPE 00000006
DATE/TIME 10-FEB-1997 12:19:00.92 SYS_TYPE 0000001B
SYSTEM UPTIME: 0 DAYS 00:00:47
SCS NODE: PRFMK1 OpenVMS AXP V7.1
HW_MODEL: FFFFFFFF Hardware Model = 4294967295.
"UNKNOWN DEVICE" ENTRY AlphaServer 1000A 4/***
ERROR LOG RECORD
ERF$L_SID FFFFFFFF
SYSTEM ID REGISTER
ERL$W_ENTRY 0062
ERROR ENTRY TYPE
EXE$GQ_SYSTIME 0F8C5A76
009AFAD5 64 BIT TIME WHEN ERROR LOGGED
ERL$GL_SEQUENCE 063A
UNIQUE ERROR SEQUENCE = 1594.
UCB$L_STS 00000010
DEVICE STATUS
UCB$B_DEVCLASS 80
DEVICE CLASS = 128.
UCB$B_DEVTYPE 2D
DEVICE TYPE = 45.
UCB$W_UNIT 0000
PHYSICAL UNIT NUMBER = 0.
UCB$L_ERRCNT 00000001
UNIT ERROR COUNT = 1.
UCB$L_OPCNT 0000001F
UNIT OPERATION COUNT = 31.
ORB$L_OWNER 00010004
OWNER UIC = [001,004]
UCB$L_DEVCHAR 0C440000
DEVICE CHARACTERISTICS
UCB$B_SLAVE 00
DEVICE SLAVE CONTROLLER = 0.
DDB$T_NAME 4652500A
24314B4D
00424B50
00000000
/PRFMK1$PKB/
LONGWORD 1. 0000003D
LONGWORD 2. 01000B02
LONGWORD 3. 47FF0000
LONGWORD 4. 00008032
LONGWORD 5. 00000000
LONGWORD 6. 00000000
LONGWORD 7. 00000000
LONGWORD 8. 00000000
LONGWORD 9. 11000000
LONGWORD 10. 00000000
LONGWORD 11. 00000001
LONGWORD 12. 00000000
LONGWORD 13. 00000000
LONGWORD 14. 00000000
LONGWORD 15. 44000000
LONGWORD 16. 20204345
LONGWORD 17. 20313050
LONGWORD 18. 30314120
LONGWORD 19. 00202020
LONGWORD 20. 1C000000
LONGWORD 21. 00000000
LONGWORD 22. 00000000
LONGWORD 23. 00000000
LONGWORD 24. 00000000
LONGWORD 25. 00000000
LONGWORD 26. 00000000
LONGWORD 27. 00000000
LONGWORD 28. 00000000
LONGWORD 29. 00000000
LONGWORD 30. 00000000
LONGWORD 31. 00000000
LONGWORD 32. 00000000
LONGWORD 33. 00000000
LONGWORD 34. 00000000
LONGWORD 35. 00000000
LONGWORD 36. 00000000
LONGWORD 37. 00000000
LONGWORD 38. 00000000
LONGWORD 39. 00000000
LONGWORD 40. 00000000
LONGWORD 41. 00000000
LONGWORD 42. 00000000
LONGWORD 43. 00000000
LONGWORD 44. 00000000
LONGWORD 45. 00000000
LONGWORD 46. 00000000
LONGWORD 47. 00000000
LONGWORD 48. 00000000
LONGWORD 49. 00000010
LONGWORD 50. 00000000
LONGWORD 51. 0000030A
LONGWORD 52. 00000016
LONGWORD 53. 0000001A
LONGWORD 54. 00000001
LONGWORD 55. 00000019
LONGWORD 56. 00000006
LONGWORD 57. 00000001
LONGWORD 58. 0000001D
LONGWORD 59. 00000000
LONGWORD 60. 20005BF0
LONGWORD 61. 00000000
LONGWORD 62. 00000000
| |||||
| 760.5 | Seen similar: Cable! | UTRTSC::VISSER | Tue Feb 11 1997 14:49 | 22 | |
Yesterday evening we were trying (3th try...) to build a SCSI-cluster
(VMS) with two AS2100's and a dual-redundant HSZ40.
(cpu's at the end, HSZ's in the middle of the SCSI-bus).
Booting anyone node: OK. Trying to boot the other: died in errors.
(Aborted Commands were logged on the HSZ console port; SCSI resets
detected (ASC/ASCQ=29/00) seen in the errorlog).
We re-routed the SCSI-cable: cpu -> cpu -> HSZ's.
Now one of the CPU's failed it's power-up tests with
"Waiting for pkb0.... (KZPSA conencted to HSZ) to poll..."
Also some similar register info was dumped out (as in .0)
SOLUTION: One of the SCSI-Y-cables was bad!
So my advice: Exclude any SCSI-cabling, including any trilinks and
terminators.
DO NOT FORGET: Using Y-cables with external terminators REQUIRE KZPSA's
five[5] terminator SIP to be removed.
Jan
| |||||
| 760.6 | Cable was too long or defective. | EVMS::PIRULO::LEDERMAN | B. Z. Lederman | Wed Feb 12 1997 10:23 | 13 |
Thanks to some off-line help from M. Difabio, my system is now working.
It turned out to be the cable.
The HSZ40 was supplied with a BN21F-10 cable. This cable was either
defective, or is too long even for differential SCSI. I replaced it
with a BN21F-05 and everything appears to be working normally.
Thanks for all of the suggestions.
Bart.
| |||||