[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference ssdevo::hsz40_product

Title:HSZ40 Product Conference
Moderator:SSDEVO::EDMONDS
Created:Mon Apr 11 1994
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:902
Total number of notes:3319

760.0. "Can see HSZ40 but can't access any drives" by EVMS::PIRULO::LEDERMAN (B. Z. Lederman) Wed Feb 05 1997 12:17

    I've been through a lot of notes in several conferences, and can't see
    any similar problem to this and can't find out where to go next.
    
    We just got a new HSZ40 delivered.  At the moment, I'm using a single
    cable to connect it to one KZPSA on one Alphaserver 1000.  When
    everything comes up, the HSZ40 is seen by the Alpha but doesn't respond
    properly.  Once the system boots, the disks on the HSZ40 can be seen
    but can't be accessed.  Any attempt to access the disks hangs the
    system.
    
    I've checked at the HSZ40 console, self test runs without errors, the
    configuration looks o.k., I can test / exercise the disks without
    errors, etc.  There is no SCSI ID conflict that I can see (HSZ40 is at
    ID 6, KZPSA is at ID 2 or ID 7).  There is a "Y" adapter with a
    terminator at the HSZ40 end, the cable runs to one KZPSA adaptor on one
    CPU, and should have an internal terminator.  I've tried this system on
    two different 1000s and one one 2100 and get the same results.  I can
    plug a StorageWorks box into the same systems and they work, so the
    controller at the CPU end is o.k.
    
    I suppose it's possible that there is something mis-configured in the
    HSZ40, but I can't imagine what: I've checked everything I can find in
    the manuals, I've tried re-configuring disk drives manually and using
    CFMENU, I've tried different disks, and so on, but nothing changes.  My
    only other thought is that the unit was bad on delivery, or that the
    cable is somehow bad in a way that allows it to partially function.
    
    Below is a log of the console from one system showing what happens. 
    You can see some error messages come out when the system polls the
    controllers, so the problem apparently is at a low level.
    
    Any suggestions on where to go next?
    
    
	SYSTEM SHUTDOWN COMPLETE

halted CPU 0

halt code = 5
HALT instruction executed
PC = ffffffff8006df28
>>>init
ff.fe.fd.fc.fb.fa.f9.f8.f7.f6.f5.ef.df.ee.f4.
probing hose 0, PCI
probing PCI-to-EISA bridge, bus 1
probing PCI-to-PCI bridge, bus 2
bus 2, slot  0 -- pka -- QLogic ISP1020
bus 2, slot  1 -- ewa -- DECchip 21140-AA
bus 0, slot 11 -- pkb -- DEC KZPSA
bus 0, slot 12 -- pkc -- DEC KZPSA
bus 0, slot 13 -- fwa -- DEC PCI FDDI
ed.ec.eb.....ea.e9.e8.e7.e6.e5.e4.e3.e2.e1.e0.
V4.7-179, built on Dec 17 1996 at 14:26:45
>>>sho devi
waiting for pkc0.7.0.12.0 to poll...	<-
amcsr_lo = 8				<-  THESE MESSAGES OCCUR ONLY WHEN
abbrr_lo = 200a40f			<-  THE SYSTEM IS CONNECTED TO
dafqir_lo = 80052b1			<-  THE HSZ40
dacqir_lo = 8005459			<-
asr_lo = 10				<-  MORE MESSAGES COME OUT DURING
afar_lo = 0				<-  THE BOOT PROCESS, BELOW
afpr_lo = 30a				<-
waiting for pkc0.7.0.12.0 to poll...	<-
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
dka0.0.0.2000.0            DKA0                          RZ26N  0616
dka400.4.0.2000.0          DKA400                        RRD45  0436
dva0.0.0.1000.0            DVA0                               
ewa0.0.0.2001.0            EWA0              00-00-F8-03-E6-74
fwa0.0.0.13.0              FWA0              00-00-F8-4A-A0-04
pka0.7.0.2000.0            PKA0                  SCSI Bus ID 7  2.10
pkb0.7.0.11.0              PKB0                  SCSI Bus ID 7   P01  A10    
pkc0.7.0.12.0              PKC0                  SCSI Bus ID 7   P01  A10    
>>>sho conf
                        Digital Equipment Corporation
                           AlphaServer 1000A 4/***

Firmware
SRM Console:	V4.7-179
ARC Console:	4.49
PALcode:	VMS PALcode V5.56-6, OSF PALcode X1.45-12
Serial Rom:	V2.8

Processor
DECchip (tm) 21064A-2	233MHz

Memory
     64 Meg of System Memory
     Bank 0 = 64 Mbytes(16 MB Per Simm) Starting at 0x00000000
     Bank 1 = No Memory Detected 
     Bank 2 = No Memory Detected 
     Bank 3 = No Memory Detected 


 Slot	Option			Hose 0, Bus 0, PCI
   7	Intel 82375EB       	                    	Bridge to Bus 1, EISA
   8	DECchip 21050-AA    	                    	Bridge to Bus 2, PCI
  11	DEC KZPSA           	pkb0.7.0.11.0       	SCSI Bus ID 7
  12	DEC KZPSA           	pkc0.7.0.12.0       	SCSI Bus ID 7
  13	DEC PCI FDDI        	fwa0.0.0.13.0       	00-00-F8-4A-A0-04

 Slot	Option			Hose 0, Bus 1, EISA

 Slot	Option			Hose 0, Bus 2, PCI
   0	QLogic ISP1020      	pka0.7.0.2000.0     	SCSI Bus ID 7
				dka0.0.0.2000.0     	RZ26N
				dka400.4.0.2000.0   	RRD45
   1	DECchip 21140-AA    	ewa0.0.0.2001.0     	00-00-F8-03-E6-74
>>>b
ff.fe.fd.fc.fb.fa.f9.f8.f7.f6.f5.ef.df.ee.f4.
probing hose 0, PCI
probing PCI-to-EISA bridge, bus 1
probing PCI-to-PCI bridge, bus 2
bus 2, slot  0 -- pka -- QLogic ISP1020
bus 2, slot  1 -- ewa -- DECchip 21140-AA
bus 0, slot 11 -- pkb -- DEC KZPSA
bus 0, slot 12 -- pkc -- DEC KZPSA
bus 0, slot 13 -- fwa -- DEC PCI FDDI
ed.ec.eb.....ea.e9.e8.e7.e6.e5.e4.e3.e2.e1.e0.
V4.7-179, built on Dec 17 1996 at 14:26:45

CPU 0 booting

waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
error on pkc0.6.0.12.0, cmd = 12, sts = 48, camh->status = 19
amcsr_lo = 8
abbrr_lo = 200a40f
dafqir_lo = 802b0f9
dacqir_lo = 80290a5
asr_lo = 10
afar_lo = 0
afpr_lo = 30a
SIMport Adapter error: asr = 10, afpr = 30a
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
waiting for pkc0.7.0.12.0 to poll...
CAM command EXECUTE_SCSI_IO timed out
(boot dka0.0.0.2000.0 -flags 0,0)
FRU table creation disabled
block 0 of dka0.0.0.2000.0 is a valid boot block
reading 904 blocks from dka0.0.0.2000.0
bootstrap code read in
base = 1c2000, image_start = 0, image_bytes = 71000
initializing HWRPB at 2000
initializing page table at 3ff0000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code
error on pkc0.6.0.12.0, cmd = 12, sts = 48, camh->status = 19


    OpenVMS (TM) Alpha Operating System, Version V7.1    

[remainder of system boot looks normal.]


$ sho dev

Device                  Device           Error    Volume         Free  Trans Mnt
 Name                   Status           Count     Label        Blocks Count Cnt
DAD0:                   Online               0
PRFMK1$DKB601:          Offline              1
PRFMK1$DKC0:            Mounted              0  AXPVMSSYS      1008792   308   1
PRFMK1$DKC400:          Online wrtlck        0
PRFMK1$DVA0:            Online               0

Device                  Device           Error
 Name                   Status           Count
FTA0:                   Offline              0
LTA0:                   Offline mounted      0
OPA0:                   Online               0
RTA0:                   Offline              0
RTB0:                   Offline              0
TTA0:                   Online               0

Device                  Device           Error
 Name                   Status           Count
LRA0:                   Online               0

Device                  Device           Error
 Name                   Status           Count
EWA0:                   Online               0
EWA2:                   Online               0
EWA3:                   Online               0
FWA0:                   Online               0
FWA2:                   Online               0
FWA4:                   Online               0
FWA5:                   Online               0
GQA0:                   Online               0
IKA0:                   Offline              0
IMA0:                   Offline              0
INA0:                   Offline              0
LAST0:                  Online               0
MPA0:                   Online               0
OPA2:                   Online               0
OPA3:                   Online               0
PKA0:                   Online               0
PKB0:                   Online               1
PKC0:                   Online               0
WSA0:                   Offline              0
WSA1:                   Online               0
$ sho dev/fu dkb601

Disk PRFMK1$DKB601:, device type unknown, is offline, file-oriented device,
    shareable, available to cluster, error logging is enabled.

    Error count                    1    Operations completed                  0
    Owner process                 ""    Owner UIC                      [SYSTEM]
    Owner process ID        00000000    Dev Prot            S:RWPL,O:RWPL,G:R,W
    Reference count                0    Default buffer size                 512

T.RTitleUserPersonal
Name
DateLines
760.1SSDEVO::T_GONZALESFri Feb 07 1997 17:479
    When you said that you tried the system on different cpu's does that
    mean that you tried different kzpsa's or the same kzpsa,  the error
    entry seems to indicate a an unrecoverable adapter error,  Have you
    viewed the kszpa paramters with the arc console and the kzpsa
    utility floppy.  Try doing a show pk* from the console and see what you
    get.  Could you provide a drawing showing the exact scsi config
    including hsz's.  What type of cable are you using and what kind of
    terminators?
    
760.2SSDEVO::T_GONZALESFri Feb 07 1997 17:482
    ONe additional note, when you try to access the hsz when the sytem is
    up, what are you getting in the error log entry?
760.3Error LogEVMS::PIRULO::LEDERMANB. Z. LedermanMon Feb 10 1997 07:08197
    The adapter has been tried on four different KZPSAs on three different
    host systems.  The KZPSAs were NOT moved between systems: each system
    has it's own adaptors.  Two of the systems are brand new 1000As, each
    with two KZPSAs installed an configured by Digital before delivery. 
    The third is a 2100: I don't know who installed the KZPSA but it's been
    in use for more than a year directly connected to a BA356 and DWZZA. 
    If I shut down the system, remove the cable to the BA356 and connect
    the cable to the HSZ40 and restart, I get the symptoms: I can see the
    HSZ40 and disk units but can't access any of them.
    
    I tried moving the BA356 to the 1000A (which is where the HSZ40 is
    normally connected) and it works there, so the 1000A and controller are
    apparently o.k.
    
    For the moment, the normal configuration is for one cable to go to one
    1000A.  Eventually there will be a cluster but not until I get this
    setup working with one system. So there is really nothing to draw. 
    There is a trilink connector at the HSZ40 with a terminator in one
    socket and a cable in the other: the cable goes directly to one KZPSA
    socket in one system.
    
    The following are the only error log entries I have (other than normal
    system startup and shutdown).  They do appear to occur each time the
    system starts up and attempts to access the HSZ.
    

%ERF-I-UNKENTRY, unknown entry type, 37

 ******************************* ENTRY      63. *******************************
 ERROR SEQUENCE 867.                             LOGGED ON:  CPU_TYPE 00000006
 DATE/TIME  5-FEB-1997 13:18:15.60                            SYS_TYPE 0000001B
 SYSTEM UPTIME: 0 DAYS 00:00:45
 SCS NODE: PRFMK1                                           OpenVMS AXP V7.1

 HW_MODEL: FFFFFFFF Hardware Model = 4294967295.

 DEVICE ATTENTION AlphaServer 1000A 4/***

 GENERIC DK SUB-SYSTEM, UNIT _PRFMK1$DKB601:

       HW REVISION     00000000
                                       HW REVISION = ....
       ERROR TYPE            03
                                       COMMAND TRANSMISSION FAILURE
       SCSI ID               06
                                       SCSI ID = 6.
       SCSI LUN              00
                                       SCSI LUN = 0.
       SCSI SUBLUN           01
                                       SCSI SUBLUN = 1.
       PORT STATUS     00000054
                                       %SYSTEM-F-CTRLERR, FATAL CONTROLLER
                                        ERROR
       SCSI CMD        00000012
                           0080
                                       INQUIRY
       SCSI STATUS           02
                                       CHECK CONDITION
       UCB$L_ERTCNT    224F4706
                                       6. RETRIES REMAINING
       UCB$L_ERTMAX    000000AA
                                       170. RETRIES ALLOWABLE
       ORB$L_OWNER     00010004
                                       OWNER UIC [001,004]
       UCB$L_CHAR      1C454008
                                       DIRECTORY STRUCTURED
                                       FILE ORIENTED
                                       SHARABLE
                                       AVAILABLE
                                       ERROR LOGGING
                                       CAPABLE OF INPUT
                                       CAPABLE OF OUTPUT
                                       RANDOM ACCESS
       UCB$L_STS       08000110
                                       ONLINE
                                       BUSY
       UCB$L_OPCNT     00000000
                                       0. QIO'S THIS UNIT
       UCB$L_ERRCNT    00000001
                                       1. ERRORS THIS UNIT
       IRP$L_BCNT      00000000
                                       TRANSFER SIZE 0. BYTE(S)

       IRP$L_BOFF      00000000
                                       TRANSFER PAGE ALIGNED
       IRP$L_PID       00000000
                                       REQUESTOR "PID"
       IRP$Q_IOSB      00000000
                       00000000        IOSB, 0. BYTE(S) TRANSFERRED

 ******************************* ENTRY      64. *******************************
 ERROR SEQUENCE 868.                             LOGGED ON:  CPU_TYPE 00000006
 DATE/TIME  5-FEB-1997 13:18:15.98                            SYS_TYPE 0000001B
 SYSTEM UPTIME: 0 DAYS 00:00:46
 SCS NODE: PRFMK1                                           OpenVMS AXP V7.1

 HW_MODEL: FFFFFFFF Hardware Model = 4294967295.

 "UNKNOWN DEVICE" ENTRY AlphaServer 1000A 4/***

 ERROR LOG RECORD

       ERF$L_SID       FFFFFFFF
                                       SYSTEM ID REGISTER
       ERL$W_ENTRY         0062
                                       ERROR ENTRY TYPE
       EXE$GQ_SYSTIME  8276F391
                       009AF6EF        64 BIT TIME WHEN ERROR LOGGED
       ERL$GL_SEQUENCE     0364
                                       UNIQUE ERROR SEQUENCE = 868.
       UCB$L_STS       00000010
                                       DEVICE STATUS
       UCB$B_DEVCLASS        80
                                       DEVICE CLASS = 128.
       UCB$B_DEVTYPE         2D
                                       DEVICE TYPE = 45.
       UCB$W_UNIT          0000
                                       PHYSICAL UNIT NUMBER = 0.
       UCB$L_ERRCNT    00000001
                                       UNIT ERROR COUNT = 1.
       UCB$L_OPCNT     0000001F
                                       UNIT OPERATION COUNT = 31.
       ORB$L_OWNER     00010004
                                       OWNER UIC = [001,004]
       UCB$L_DEVCHAR   0C440000
                                       DEVICE CHARACTERISTICS
       UCB$B_SLAVE           00
                                       DEVICE SLAVE CONTROLLER = 0.
       DDB$T_NAME      4652500A
                       24314B4D
                       00424B50
                       00000000
                                       /PRFMK1$PKB/
       LONGWORD 1.     0000003D
       LONGWORD 2.     07000B02
       LONGWORD 3.     47FF0000
       LONGWORD 4.     00008032
       LONGWORD 5.     00000000
       LONGWORD 6.     00000000
       LONGWORD 7.     00000000
       LONGWORD 8.     00000000
       LONGWORD 9.     11000000
       LONGWORD 10.    00000000
       LONGWORD 11.    00000007
       LONGWORD 12.    00000000
       LONGWORD 13.    00000000
       LONGWORD 14.    00000000
       LONGWORD 15.    44000000
       LONGWORD 16.    20204345
       LONGWORD 17.    20313050
       LONGWORD 18.    30314120
       LONGWORD 19.    00202020
       LONGWORD 20.    1C000000
       LONGWORD 21.    00000000
       LONGWORD 22.    00000000
       LONGWORD 23.    00000000
       LONGWORD 24.    00000000
       LONGWORD 25.    00000000
       LONGWORD 26.    00000000
       LONGWORD 27.    00000000
       LONGWORD 28.    00000000
       LONGWORD 29.    00000000
       LONGWORD 30.    00000000
       LONGWORD 31.    00000000
       LONGWORD 32.    00000000
       LONGWORD 33.    00000000
       LONGWORD 34.    00000000
       LONGWORD 35.    00000000
       LONGWORD 36.    00000000
       LONGWORD 37.    00000000
       LONGWORD 38.    00000000
       LONGWORD 39.    00000000
       LONGWORD 40.    00000000
       LONGWORD 41.    00000000
       LONGWORD 42.    00000000
       LONGWORD 43.    00000000
       LONGWORD 44.    00000000
       LONGWORD 45.    00000000
       LONGWORD 46.    00000000
       LONGWORD 47.    00000000
       LONGWORD 48.    00000000
       LONGWORD 49.    00000010
       LONGWORD 50.    00000000
       LONGWORD 51.    0000030A
       LONGWORD 52.    00000016
       LONGWORD 53.    0000001A
       LONGWORD 54.    00000001
       LONGWORD 55.    00000019
       LONGWORD 56.    00000006
       LONGWORD 57.    00000001
       LONGWORD 58.    0000001D
       LONGWORD 59.    00000000
       LONGWORD 60.    20005BF0
       LONGWORD 61.    00000000
       LONGWORD 62.    00000000
    
    
760.4More information (console variables)EVMS::PIRULO::LEDERMANB. Z. LedermanMon Feb 10 1997 10:23336
    Here is more information (probably more than you asked for).
    
>>>sho p*
pal                 	VMS PALcode V5.56-6, OSF PALcode X1.45-12
pci_parity          	off             
pka0_host_id        	7
pka0_soft_term      	on              
pkb0_fast           	1
pkb0_host_id        	1
pkb0_termpwr        	1
pkc0_fast           	1
pkc0_host_id        	1
pkc0_termpwr        	1
>>>sho dev
waiting for pkc0.1.0.12.0 to poll...
amcsr_lo = 8
abbrr_lo = 200a40f
dafqir_lo = 80052c1
dacqir_lo = 80054b1
asr_lo = 10
afar_lo = 0
afpr_lo = 30a
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
dka0.0.0.2000.0            DKA0                          RZ26N  0616
dka400.4.0.2000.0          DKA400                        RRD45  0436
dva0.0.0.1000.0            DVA0                               
ewa0.0.0.2001.0            EWA0              00-00-F8-03-E6-74
fwa0.0.0.13.0              FWA0              00-00-F8-4A-A0-04
pka0.7.0.2000.0            PKA0                  SCSI Bus ID 7  2.10
pkb0.1.0.11.0              PKB0                  SCSI Bus ID 1   P01  A10    
pkc0.1.0.12.0              PKC0                  SCSI Bus ID 1   P01  A10    
>>>b
ff.fe.fd.fc.fb.fa.f9.f8.f7.f6.f5.ef.df.ee.f4.
probing hose 0, PCI
probing PCI-to-EISA bridge, bus 1
probing PCI-to-PCI bridge, bus 2
bus 2, slot  0 -- pka -- QLogic ISP1020
bus 2, slot  1 -- ewa -- DECchip 21140-AA
bus 0, slot 11 -- pkb -- DEC KZPSA
bus 0, slot 12 -- pkc -- DEC KZPSA
bus 0, slot 13 -- fwa -- DEC PCI FDDI
ed.ec.eb.....ea.e9.e8.e7.e6.e5.e4.e3.e2.e1.e0.
V4.7-179, built on Dec 17 1996 at 14:26:45

CPU 0 booting

waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
error on pkc0.6.0.12.0, cmd = 12, sts = a8, camh->status = 19
amcsr_lo = 8
abbrr_lo = 200a40f
dafqir_lo = 802c835
dacqir_lo = 8029031
asr_lo = 10
afar_lo = 0
afpr_lo = 30a
SIMport Adapter error: asr = 10, afpr = 30a
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
waiting for pkc0.1.0.12.0 to poll...
CAM command EXECUTE_SCSI_IO timed out
(boot dka0.0.0.2000.0 -flags 0,0)
FRU table creation disabled
block 0 of dka0.0.0.2000.0 is a valid boot block
reading 904 blocks from dka0.0.0.2000.0
bootstrap code read in
base = 1c2000, image_start = 0, image_bytes = 71000
initializing HWRPB at 2000
initializing page table at 3ff0000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code
error on pkc0.6.0.12.0, cmd = 12, sts = 48, camh->status = 19

    OpenVMS (TM) Alpha Operating System, Version V7.1    

[Remainder of boot normal]

$ sho dev d/fu

Disk DAD0:, device type unknown, is online, file-oriented device, shareable,
    error logging is enabled, device is a template only.

    Error count                    0    Operations completed                  0
    Owner process                 ""    Owner UIC                      [SYSTEM]
    Owner process ID        00000000    Dev Prot            S:RWPL,O:RWPL,G:R,W
    Reference count                0    Default buffer size                 512

Disk PRFMK1$DKB601:, device type unknown, is offline, file-oriented device,
    shareable, available to cluster, error logging is enabled.

    Error count                    1    Operations completed                  0
    Owner process                 ""    Owner UIC                      [SYSTEM]
    Owner process ID        00000000    Dev Prot            S:RWPL,O:RWPL,G:R,W
    Reference count                0    Default buffer size                 512

Disk PRFMK1$DKC0:, device type DEC RZ26N, is online, mounted, file-oriented
    device, shareable, available to cluster, error logging is enabled.

    Error count                    0    Operations completed               4789
    Owner process                 ""    Owner UIC                      [SYSTEM]
    Owner process ID        00000000    Dev Prot            S:RWPL,O:RWPL,G:R,W
    Reference count              143    Default buffer size                 512
    Total blocks             2050860    Sectors per track                    83
    Total cylinders             3089    Tracks per cylinder                   8

    Volume label         "AXPVMSSYS"    Relative volume number                0
    Cluster size                   3    Transaction count                   308
    Free blocks               986739    Maximum files allowed            256357
    Extend quantity                5    Mount count                           1
    Mount status              System    Cache name      "_PRFMK1$DKC0:XQPCACHE"
    Extent cache size             64    Maximum blocks in extent cache    98673
    File ID cache size            64    Blocks currently in extent cache  10158
    Quota cache size               0    Maximum buffers in FCP cache        354
    Volume owner UIC           [1,1]    Vol Prot    S:RWCD,O:RWCD,G:RWCD,W:RWCD

  Volume Status:  subject to mount verification, protected subsystems enabled,
      file high-water marking, write-through caching enabled.

Disk PRFMK1$DKC400:, device type DEC RRD45, is online, file-oriented device,
    shareable, available to cluster, error logging is enabled.

    Error count                    0    Operations completed                  0
    Owner process                 ""    Owner UIC                      [SYSTEM]
    Owner process ID        00000000    Dev Prot            S:RWPL,O:RWPL,G:R,W
    Reference count                0    Default buffer size                 512

Disk PRFMK1$DVA0:, device type RX26, is online, file-oriented device, shareable,
    error logging is enabled.

    Error count                    0    Operations completed                  0
    Owner process                 ""    Owner UIC                      [SYSTEM]
    Owner process ID        00000000    Dev Prot            S:RWPL,O:RWPL,G:R,W
    Reference count                0    Default buffer size                 512
    Total blocks                2880    Sectors per track                    18
    Total cylinders               80    Tracks per cylinder                   2

$ anal/err/since=10-feb-1997:12:14
Error Log Report Generator					Version V6.1    
  
 ******************************* ENTRY      86. *******************************
 ERROR SEQUENCE 1591.                            LOGGED ON:  CPU_TYPE 00000006
 DATE/TIME 10-FEB-1997 12:18:19.14                            SYS_TYPE 0000001B
 SYSTEM UPTIME: 0 DAYS 00:00:04
 SCS NODE: PRFMK1                                           OpenVMS AXP V7.1

 HW_MODEL: FFFFFFFF Hardware Model = 4294967295.

 SYSTEM START-UP AlphaServer 1000A 4/***

 TIME OF DAY CLOCK     24DD085A
 ******************************* ENTRY      87. *******************************
 ERROR SEQUENCE 1592.                            LOGGED ON:  CPU_TYPE 00000006
 DATE/TIME 10-FEB-1997 12:18:19.74                            SYS_TYPE 0000001B
 SYSTEM UPTIME: 0 DAYS 00:00:04
 SCS NODE: PRFMK1                                           OpenVMS AXP V7.1

 HW_MODEL: FFFFFFFF Hardware Model = 4294967295.

 MOUNT VOLUME AlphaServer 1000A 4/***

       UNIT _PRFMK1$DKC0:, VOLUME LABEL "AXPVMSSYS"

       210. QIO OPERATIONS THIS UNIT, 0. ERRORS THIS UNIT
 ******************************* ENTRY      88. *******************************
 ERROR SEQUENCE 1593.                            LOGGED ON:  CPU_TYPE 00000006
 DATE/TIME 10-FEB-1997 12:19:00.59                            SYS_TYPE 0000001B
 SYSTEM UPTIME: 0 DAYS 00:00:46
 SCS NODE: PRFMK1                                           OpenVMS AXP V7.1

 HW_MODEL: FFFFFFFF Hardware Model = 4294967295.

 DEVICE ATTENTION AlphaServer 1000A 4/***

 GENERIC DK SUB-SYSTEM, UNIT _PRFMK1$DKB601:
 


       HW REVISION     00000000
                                       HW REVISION = ....
       ERROR TYPE            03
                                       COMMAND TRANSMISSION FAILURE
       SCSI ID               06
                                       SCSI ID = 6.
       SCSI LUN              00
                                       SCSI LUN = 0.
       SCSI SUBLUN           01
                                       SCSI SUBLUN = 1.
       PORT STATUS     00000054
                                       %SYSTEM-F-CTRLERR, FATAL CONTROLLER
                                        ERROR
       SCSI CMD        00000012
                           0080
                                       INQUIRY
       SCSI STATUS           02
                                       CHECK CONDITION
       UCB$L_ERTCNT    24DD085A
                                       90. RETRIES REMAINING
       UCB$L_ERTMAX    000000AA
                                       170. RETRIES ALLOWABLE
       ORB$L_OWNER     00010004
                                       OWNER UIC [001,004]
       UCB$L_CHAR      1C454008
                                       DIRECTORY STRUCTURED
                                       FILE ORIENTED
                                       SHARABLE
                                       AVAILABLE
                                       ERROR LOGGING
                                       CAPABLE OF INPUT
                                       CAPABLE OF OUTPUT
                                       RANDOM ACCESS
       UCB$L_STS       08000110
                                       ONLINE
                                       BUSY
       UCB$L_OPCNT     00000000
                                       0. QIO'S THIS UNIT
       UCB$L_ERRCNT    00000001
                                       1. ERRORS THIS UNIT
       IRP$L_BCNT      00000000
                                       TRANSFER SIZE 0. BYTE(S)
       IRP$L_BOFF      00000000
                                       TRANSFER PAGE ALIGNED
       IRP$L_PID       00000000
                                       REQUESTOR "PID"
       IRP$Q_IOSB      00000000
                       00000000        IOSB, 0. BYTE(S) TRANSFERRED
 ******************************* ENTRY      89. *******************************
 ERROR SEQUENCE 1594.                            LOGGED ON:  CPU_TYPE 00000006
 DATE/TIME 10-FEB-1997 12:19:00.92                            SYS_TYPE 0000001B
 SYSTEM UPTIME: 0 DAYS 00:00:47
 SCS NODE: PRFMK1                                           OpenVMS AXP V7.1

 HW_MODEL: FFFFFFFF Hardware Model = 4294967295.

 "UNKNOWN DEVICE" ENTRY AlphaServer 1000A 4/***

 ERROR LOG RECORD

       ERF$L_SID       FFFFFFFF
                                       SYSTEM ID REGISTER
       ERL$W_ENTRY         0062
                                       ERROR ENTRY TYPE
       EXE$GQ_SYSTIME  0F8C5A76
                       009AFAD5        64 BIT TIME WHEN ERROR LOGGED
       ERL$GL_SEQUENCE     063A
                                       UNIQUE ERROR SEQUENCE = 1594.
       UCB$L_STS       00000010
                                       DEVICE STATUS
       UCB$B_DEVCLASS        80
                                       DEVICE CLASS = 128.
       UCB$B_DEVTYPE         2D
                                       DEVICE TYPE = 45.
       UCB$W_UNIT          0000
                                       PHYSICAL UNIT NUMBER = 0.
       UCB$L_ERRCNT    00000001
                                       UNIT ERROR COUNT = 1.
       UCB$L_OPCNT     0000001F
                                       UNIT OPERATION COUNT = 31.
       ORB$L_OWNER     00010004
                                       OWNER UIC = [001,004]
       UCB$L_DEVCHAR   0C440000
                                       DEVICE CHARACTERISTICS
       UCB$B_SLAVE           00
                                       DEVICE SLAVE CONTROLLER = 0.
       DDB$T_NAME      4652500A
                       24314B4D
                       00424B50
                       00000000
                                       /PRFMK1$PKB/
       LONGWORD 1.     0000003D
       LONGWORD 2.     01000B02
       LONGWORD 3.     47FF0000
       LONGWORD 4.     00008032
       LONGWORD 5.     00000000
       LONGWORD 6.     00000000
       LONGWORD 7.     00000000
       LONGWORD 8.     00000000
       LONGWORD 9.     11000000
       LONGWORD 10.    00000000
       LONGWORD 11.    00000001
       LONGWORD 12.    00000000
       LONGWORD 13.    00000000
       LONGWORD 14.    00000000
       LONGWORD 15.    44000000
       LONGWORD 16.    20204345
       LONGWORD 17.    20313050
       LONGWORD 18.    30314120
       LONGWORD 19.    00202020
       LONGWORD 20.    1C000000
       LONGWORD 21.    00000000
       LONGWORD 22.    00000000
       LONGWORD 23.    00000000
       LONGWORD 24.    00000000
       LONGWORD 25.    00000000
       LONGWORD 26.    00000000
       LONGWORD 27.    00000000
       LONGWORD 28.    00000000
       LONGWORD 29.    00000000
       LONGWORD 30.    00000000
       LONGWORD 31.    00000000
       LONGWORD 32.    00000000
       LONGWORD 33.    00000000
       LONGWORD 34.    00000000
       LONGWORD 35.    00000000
       LONGWORD 36.    00000000
       LONGWORD 37.    00000000
       LONGWORD 38.    00000000
       LONGWORD 39.    00000000
       LONGWORD 40.    00000000
       LONGWORD 41.    00000000
       LONGWORD 42.    00000000
       LONGWORD 43.    00000000
       LONGWORD 44.    00000000
       LONGWORD 45.    00000000
       LONGWORD 46.    00000000
       LONGWORD 47.    00000000
       LONGWORD 48.    00000000
       LONGWORD 49.    00000010
       LONGWORD 50.    00000000
       LONGWORD 51.    0000030A
       LONGWORD 52.    00000016
       LONGWORD 53.    0000001A
       LONGWORD 54.    00000001
       LONGWORD 55.    00000019
       LONGWORD 56.    00000006
       LONGWORD 57.    00000001
       LONGWORD 58.    0000001D
       LONGWORD 59.    00000000
       LONGWORD 60.    20005BF0
       LONGWORD 61.    00000000
       LONGWORD 62.    00000000

760.5Seen similar: Cable!UTRTSC::VISSERTue Feb 11 1997 14:4922
    
    Yesterday evening we were trying (3th try...) to build a SCSI-cluster
    (VMS) with two AS2100's and a dual-redundant HSZ40.
    (cpu's at the end, HSZ's in the middle of the SCSI-bus).
    Booting anyone node: OK. Trying to boot the other: died in errors.
    (Aborted Commands were logged on the HSZ console port; SCSI resets
    detected (ASC/ASCQ=29/00) seen in the errorlog).
    
    We re-routed the SCSI-cable: cpu -> cpu -> HSZ's.
    Now one of the CPU's failed it's power-up tests with 
    	"Waiting for pkb0.... (KZPSA conencted to HSZ) to poll..."
    Also some similar register info was dumped out (as in .0)
    
    SOLUTION: One of the SCSI-Y-cables was bad!
    
    So my advice: Exclude any SCSI-cabling, including any trilinks and
    terminators.
    DO NOT FORGET: Using Y-cables with external terminators REQUIRE KZPSA's
    five[5] terminator SIP to be removed.
    
    				Jan
    
760.6Cable was too long or defective.EVMS::PIRULO::LEDERMANB. Z. LedermanWed Feb 12 1997 10:2313
    
    Thanks to some off-line help from M. Difabio, my system is now working.
    
    It turned out to be the cable.
    
    The HSZ40 was supplied with a BN21F-10 cable.  This cable was either
    defective, or is too long even for differential SCSI.  I replaced it
    with a BN21F-05 and everything appears to be working normally.
    
    Thanks for all of the suggestions.
    
    Bart.