| re: 5456.0 by LYOV00::MASSET
RE: .0 (I am working with Bernard Masset, the .0 author).
Hello, to share what I've found:
The .0 INCONSTATE crash at UCX$BGDRIVER+0002AE54, is triggered
by a DECC application that does a "PING" (ICMP ECHO Request)
every 30 seconds towards another VMS system.
FREQUENCE: The crash is not systematic, and appears to only
occur when the 'pinging' system is heavily loaded.
Another customer reports a similar crash (same offset),
SYSTEMATIC, from an Non-Privileged account (as say the
customer), with the undocumented UCX tool UCX$TRACEROUTE.
SOLUTION: It appears that the (pre-ECO 5) UCX V4.1 PAT 8 kit
CORRECTS this "traceroute INCONSTATE" problem:
I have tested that a VAX VMS 6.1 UCX 4.1 ECO 3 => INCONSTATE
checked that a VAX VMS 6.1 UCX 4.1 PAT 8 => OK
I have checked that an AXP VMS 6.2 UCX 4.1 PAT 8 => OK
Also: CFS.46521 says it's fixed by UCX 4.1 PAT 5 => OK
ACTION PLAN for ".0" (not systematic Ping INCONSTATE):
Have the customer install PAT8 on one of his system, then
check whether the 'ping' application works better.
Best regards,
Pierre-Etienne
1) CUSTOMER's REPORT: (with UCX$TRACEROUTE)
========================================
Versions: UCX V4.1 ECO 4 on AXP VMS V6.2
Problem: INCONSTATE at UCX$BGDRIVER+0002AE54
Process leading to the symptom:
$ traceroute :== $sys$examples:ucx$traceroute
$ traceroute -l <host_name> ! type here a remote host name
==> *** INCONSTATE UCX$BGDRIVER+0002AE54 ***
See the Crash Footprint in the Annex below.
2) DIGITAL's TEST: on a VAX VMS 6.1 UCX 4.1 ECO 3
==============================================
Digital Answer (tested on a VAX VMS V6.1):
2.1 Steps used to duplicate the problem:
On a VAX UCX V4.1 ECO 3 V6.1 system, the following log has
been catched:
RIVOLI::ped$ traceroute -l prssos
traceroute to PRSSOS (16.189.128.61), 30 hops max, 40 byte packets
(loose-source-routed to 16.189.144.121)
1 PRSSOS (16.189.128.61) 10 ms 10 ms 10 ms
==> *** INCONSTATE CRASH ***
! $ CLUE /DISPLAY
! CLUE> SHOW ALL 2
! #2 INCONSTATE crash on RIVOLI at 14-MAY-1997 18:40:44.40
!
! CANASTA Parameter Value
!
! VMS VERSION 6.1
! BUGCHECK TYPE INCONSTATE
! PROCESS NAME DIETZ
! IMAGE NAME TRACEROUTE
! CPU TYPE 3100
! SID 0A000006
! SIGNAL ARRAY COUNT 00000000
! EXCEPTION REG 1 FFFFFFFF
! EXCEPTION REG 2 FFFFFFFF
! EXCEPTION REG 3 FFFFFFFF
! EXCEPTION PC 81144503
! EXCEPTION PSL 00C80000
! FAILING INSTRUCTION MOVZBL
! FAILING MODULE UCX$INTERNET_SERVICES
! OFFSET A703
2.2 Then PAT8 has been installed, the system rebooted:
RIVOLI::ped$ traceroute -l prssos
traceroute to PRSSOS (16.189.128.61), 30 hops max, 40 byte packets
(loose-source-routed to 16.189.144.121)
1 PRSSOS (16.189.128.61) 30 ms 10 ms 10 ms
RIVOLI::ped$
==> *** OK, not crashing ***
ANNEX
=====
+---------------------------------------------------------------+
! UCX V4.1 ECO 4 on AXP VMS V6.2, INCONSTATE UCX$BGDRIVER+2AE54
! triggered by a " $TRACEROUTE -l <host_name> " command
! *** CLUE CRASH footprint ***
+---------------------------------------------------------------+
SDA> CLUE CRASH
Crashdump Summary Information:
------------------------------
Crash Time: 14-MAY-1997 16:26:25.17
Bugcheck Type: INCONSTATE, Inconsistent I/O data base
Node: IRENE (Clustered)
CPU Type: AlphaServer 2000 4/233
VMS Version: V6.2
Current Process: _FTA994:
Current Image: $4$DKB501:[SYS36.SYSCOMMON.]
[SYSHLP.EXAMPLES.UCX]TRACEROUTE.EXE;1
Failing PC: FFFFFFFF 8A202E54
Failing PS: 18000000 00000803
Module: UCX$BGDRIVER
Offset: 0002AE54
Boot Time: 5-MAY-1997 12:52:56.00
System Uptime: 9 03:33:29.17
Crash/Primary CPU: 01/00
Saved Processes: 5
Pagesize: 8 KByte (8192 bytes)
Physical Memory: 256 MByte (32768 PFNs)
Dumpfile Pagelets: 121069 blocks
Dump Flags: olddump,writecomp,errlogcomp,dump_style
EXE$GL_FLAGS: poolpging,init,bugdump
Stack Pointers:
KSP = 00000000 7FF91C58 ESP = 00000000 7FF96000 SSP = 00000000 7FF9C100
USP = 00000000 7EE35700
General Registers:
R0 = 00000000 00000000 R1 = 00000000 00000000 R2 = FFFFFFFF 8A2267D0
R3 = FFFFFFFF 81153B60 R4 = FFFFFFFF 81153FD4 R5 = 00000000 00000000
R6 = FFFFFFFF 81153FA0 R7 = 00000000 00000000 R8 = FFFFFFFF 8A220210
R9 = 00000000 00000000 R10 = FFFFFFFF 81216858 R11 = 00000000 00000001
R12 = 00000000 00008001 R13 = FFFFFFFF 8A221368 R14 = FFFFFFFF 825B95D8
R15 = 00000000 0A88ADA3 R16 = 00000000 0000019C R17 = 00000000 00000000
R18 = 00000000 00000000 R19 = FFFFFFFF 81153FEE R20 = 00000000 00000000
R21 = 00000000 00000000 R22 = FFFFFFFF 8A228B20 R23 = 00000000 00000001
R24 = FFFFFFFF 8A211290 AI = 00000000 00000000 RA = FFFFFFFF 8A21B254
PV = FFFFFFFF 8A228368 R28 = FFFFFFFF 8A2119FC FP = 00000000 7FF91C60
PC = FFFFFFFF 8A202E58 PS = 18000000 00000803
System Registers:
Page Table Base Register (PTBR) 00000000 0000578C
Processor Base Register (PRBR) FFFFFFFF 80D6FC80
Privileged Context Block Base (PCBB) 00000000 0CA8C080
System Control Block Base (SCBB) 00000000 000001AA
Software Interrupt Summary Register (SISR) 00000000 00000000
Address Space Number (ASN) 00000000 00000006
AST Summary / AST Enable (ASTSR_ASTEN) 00000000 0000000F
Floating-Point Enable (FEN) 00000000 00000001
Interrupt Priority Level (IPL) 00000000 00000008
Machine Check Error Summary (MCES) 00000000 00000008
Virtual Page Table Base Register (VPTB) 00000002 00000000
Failing Instruction:
UCX$BGDRIVER+2AE54: BUGCHK
Instruction Stream (last 20 instructions):
UCX$BGDRIVER+2AE04: LDQ R2,#X0020(SP)
UCX$BGDRIVER+2AE08: LDQ FP,#X0028(SP)
UCX$BGDRIVER+2AE0C: ADDQ SP,#X30,SP
UCX$BGDRIVER+2AE10: RET R31,(R28)
UCX$BGDRIVER+2AE14: BIS R31,R31,R31
UCX$BGDRIVER+2AE18: BIS R31,R31,R31
UCX$BGDRIVER+2AE1C: BIS R31,R31,R31
UCX$BGDRIVER+2AE20: SUBQ SP,#X20,SP
UCX$BGDRIVER+2AE24: STQ R27,(SP)
UCX$BGDRIVER+2AE28: STQ R26,#X0010(SP)
UCX$BGDRIVER+2AE2C: STQ FP,#X0018(SP)
UCX$BGDRIVER+2AE30: BIS R31,SP,FP
UCX$BGDRIVER+2AE34: SUBQ SP,#X10,SP
UCX$BGDRIVER+2AE38: STQ R16,#X0008(SP)
UCX$BGDRIVER+2AE3C: STQ R17,(SP)
UCX$BGDRIVER+2AE40: LDQ R17,#X0028(R27)
UCX$BGDRIVER+2AE44: BIS R17,#X04,R17
UCX$BGDRIVER+2AE48: BIS R31,R17,R16
UCX$BGDRIVER+2AE4C: LDQ R17,(SP)
UCX$BGDRIVER+2AE50: ADDQ SP,#X08,SP
UCX$BGDRIVER+2AE54: BUGCHK
UCX$BGDRIVER+2AE58: HALT
UCX$BGDRIVER+2AE5C: BIS R31,R31,R31
UCX$BGDRIVER+2AE60: SUBQ SP,#X20,SP
UCX$BGDRIVER+2AE64: STQ R27,(SP)
SDA> ex
SDA> exit
|