[Search for users]
[Overall Top Noters]
[List of all Conferences]
[Download this site]
Title: | DIGITAL UNIX (FORMERLY KNOWN AS DEC OSF/1) |
Notice: | Welcome to the Digital UNIX Conference |
Moderator: | SMURF::DENHAM |
|
Created: | Thu Mar 16 1995 |
Last Modified: | Fri Jun 06 1997 |
Last Successful Update: | Fri Jun 06 1997 |
Number of topics: | 10068 |
Total number of notes: | 35879 |
9359.0. "AS8200, V3.2G, crash - ADVFS EXCEPTION" by BRADEC::PODOLINSKY (Peter Podolinsky - MCS Slovakia) Wed Apr 02 1997 08:14
Hello,
I am looking for the advise/solution regarding a crash we had at our customer.
According to the crash-data file it looks like a HW related crash caused by timeouts
on the rz24 and rz27 disks.
rz24 is a system disk, rz27 contains a secondary swap partition, both they are on the internal
SCSI bus (narrow,fast SCSI - KFTIA)
If the timeout was just on one drive, I would suggest to swap it, but in this case both
drives have timed-out.
After the crash thy system booted fine, I have not seen any timeout on any disk since the crash.
Before the crash there was nobody touching the server cabinet, so I do not think it was a loose cable/terminator.
Binary log contains a burst of SCSI CAM errors on both rz24 and rz27 drives just before the crash.
Could you have a look what was going on ?
Thanks,
Peter
********************************************************************************************
Crash-data file and an extract from binary log follows.
********************************************************************************************
#
# Crash Data Collection (Version 1.4)
#
_crash_data_collection_time: Mon Mar 31 13:06:25 MET DST 1997
_current_directory: /
_crash_kernel: /home6/vmunix.0
_crash_core: /home6/vmcore.0
_crash_arch: alpha
_crash_os: Digital UNIX
_host_version: Digital UNIX V3.2G (Rev. 62); Sun Dec 29 19:18:06 MET 1996
_crash_version: Digital UNIX V3.2G (Rev. 62); Sun Dec 29 19:18:06 MET 1996
_crashtime: struct {
tv_sec = 859754615
tv_usec = 696864
}
_boottime: struct {
tv_sec = 859640305
tv_usec = 631472
}
_config: struct {
sysname = "OSF1"
nodename = "posta3"
release = "V3.2"
version = "62"
machine = "alpha"
}
_cpu: 39
_system_string: 0xffffffffff802eb8 = ""
_ncpus: 12
_avail_cpus: 2
_partial_dump: 1
_physmem(MBytes): 767
_panic_string: 0xffffffffa482b708 = ""
_paniccpu: 11
_panic_thread: 0xfffffc002fe00800
_preserved_message_buffer_begin:
struct {
msg_magic = 0x63061
msg_bufx = 0x402
msg_bufr = 0xe93
msg_bufc = "orted by the host
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
cdisk_complete
Retries Exhausted
Hard Error Detected
DEC RZ28M
Active CCB at time of error
CCB request aborted by the host
vm_swap I/O error during pageout
vm_swap I/O error during pageout
vm_swap I/O error during pageout
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
ss_abort_done
SCSI abort tag has been performed
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
cdisk_complete
Retries Exhausted
Hard Error Detected
DEC RZ28M
Active CCB at time of error
CCB request aborted by the host
advfs I/O error: setId 0x2ffdcdab.000d1730.fffffffe.0000 tag 0xfffffff7.0000u page 413
vd 1 blk 6752 blkCnt 16
write error = 5
ADVFS EXCEPTION
Module = 41, Line = 614
panic (cpu 11):
syncing disks... DUMP.prom: dev SCSI 0 5 0 0 0 0 0, block 200000
DUMP.prom: dev SCSI 0 5 0 0 0 0 0, block 200000
ected
cam_logger: CAM_ERROR packet
DEC RZ28
cam_logger: bus 3 target 0 lun 0
ss_abort_done
Active CCB at time of error
SCSI abort tag has been performed
CCB request aborted by the host
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
cdisk_complete
Retries Exhausted
Hard Error Detected
DEC RZ28M
Active CCB at time of error
CCB request aborted by the host
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
cdisk_complete
Retries Exhausted
Hard Error Detected
DEC RZ28M
Active CCB at time of error
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 3 CCB request aborted by the host
lun 0
ss_perform_timeout
vm_swap I/O error during pageout
vm_swap I/O error during pagein
timeout on disconnected request
vm_swap I/O error during pageout
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 3 lun 0
ss_perform_timeout
timeout on disconnected request
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 3 lun 0
ss_abort_done
SCSI abort tag has been performed
cam_logger: CAM_ERROR packet
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 3 lun 0
cdisk_complete
cam_logger: bus 3 target 3 lun 0
ss_abort_done
Retries Exhausted
SCSI abort tag has been performed
Hard Error Detected
cam_logger: CAM_ERROR packet
DEC RZ28
cam_logger: bus 3 target 0 lun 0
ss_abort_done
Active CCB at time of error
SCSI abort tag has been performed
CCB request aborted by the host
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 3 lun 0
cdisk_complete
Retries Exhausted
Hard Error Detected
DEC RZ28
Active CCB at time of error
CCB request aborted by the host
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
cdisk_complete
Retries Exhausted
Hard Error Detected
DEC RZ28M
Active CCB at time of error
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 3 CCB request aborted by the host
lun 0
ss_perform_timeout
vm_swap I/O error during pageout
timeout on disconnected request
vm_swap I/O error during pageout
vm_swap I/O error during pageout
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 3 lun 0
ss_abort_done
SCSI abort tag has been performed
cam_logger: CAM_ERROR packet
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 3 lun 0
cdisk_complete
cam_logger: bus 3 target 0 lun 0
ss_abort_done
Retries Exhausted
SCSI abort tag has been performed
Hard Error Detected
cam_logger: CAM_ERROR packet
DEC RZ28
cam_logger: bus 3 target 0 lun 0
ss_abort_done
Active CCB at time of error
SCSI abort tag has been performed
CCB request aborted by the host
cam_logger: CAM_ERROR packet
cam_logger: bus 3 target 0 lun 0
cdisk_complete
Retries Exhausted
Hard Error Detected
DEC RZ28M
Active CCB at time of error
CCB request ab"
}
_preserved_message_buffer_end:
_kernel_process_status_begin:
PID COMM
00000 kernel idle
00001 init
00008 kloadsrv
00024 vold
00047 update
00157 syslogd
00159 binlogd
00278 portmap
00280 mountd
00282 nfsd
00284 nfsiod
00287 rpc.statd
00289 rpc.lockd
08514 getty
00334 progrd
00338 progtd
00348 knblink
00350 sendmail
00356 dllink
00408 os_mibs
00412 snmpd
00445 inetd
00450 cron
00509 psdc
00515 pwlic.reg
00521 lpd
00524 lpd
00541 xdm
00552 nbelink
00563 pwalrtr
00566 lmx.ctrl
00606 lmx.srv
00617 lmx.dmn
00726 _mprosrv
00730 _progres
05663 _mprosrv
05667 _mprshut
05669 _mprshut
05671 _mprshut
05675 _mprshut
05676 _mprosrv
05678 _mprshut
05683 _mprshut
05686 _mprshut
05689 _mprshut
05693 _mprshut
05695 _mprshut
05697 _mprosrv
05700 _mprshut
05704 _mprshut
05706 _mprshut
05708 _mprshut
05712 _mprshut
05714 _progres
09970 _progres
06722 _mprosrv
06728 _mprosrv
06729 _mprosrv
06742 ksh
06743 telnetd
06751 _progres
10936 _progres
11480 _progres
_kernel_process_status_end:
_current_pid: 5669
_current_tid: 0xfffffc001c25eb80
_proc_thread_list_begin:
thread 0xfffffc001c25eb80 stopped at [boot:1760 ,0xfffffc00004ef9ec] Source not available
_proc_thread_list_end:
_dump_begin:
> 0 boot(0x0, 0x4, 0xfffffc0000204170, 0xfffffc002f4fb0b8, 0x0)
["../../../../src/kernel/arch/alpha/machdep.c":1760, 0xfffffc00004ef9ec]
1 panic(s = 0xfffffc0000639ac8 = "event_timeout: panic request") ["../../../../src/kernel/bsd/subr_prf.c":673,
0xfffffc000044c8d8]
pcpu = 0xfffffc000aae35c0
i = 3418288
bootopt = 6355720
mycpu = 3540
spl = 4
prevcc = 18446744072191307952
nextcc = 64
timer = -3300145495936
limit = -4398042108344
2 event_timeout(func = 0xfffffc00004eeac0, arg = 0xffffffffa5817048, timeout = 3513128)
["../../../../src/kernel/arch/alpha/cpu.c":863, 0xfffffc00004e9f08]
prevcc = 9792
nextcc = 21
timer = 9223372036854775807
limit = 18446744073709551542
3 simple_lock_miss(0xfffffc0000359b28, 0xfffffc00006c8580, 0x359b25, 0x485869, 0xfffffc000cd1b240)
["../../../../src/kernel/arch/alpha/lockprim.s":1096, 0xfffffc00004eea90]
4 qenable(q = 0xfffffc00090294e8) ["../../../../src/kernel/streams/str_runq.c":167, 0xfffffc0000359b24]
sq = 0xfffffc000aae35c0
gotit = 1
_ssavpri = 1
_s = 1
netisr = 0xfffffc00002d778c
5 lats_timeout(0xfffffc0000342e1c, 0xfffffc0000204170, 0xfffffc0000204668, 0xffffffffa5814000,
0xfffffc000034ee70) ["../../../../src/kernel/lat/streams/lattimer.c":397, 0xfffffc0000342e88]
6 str_timeout(0xfffffc000cd1b240, 0x0, 0xfffffc002ca89c00, 0xfffffc00006cd1f0, 0x61a8)
["../../../../src/kernel/streams/str_env.c":387, 0xfffffc000034ee6c]
7 softclock_scan(usermode = 1) ["../../../../src/kernel/bsd/kern_clock.c":1081, 0xfffffc0000433504]
p1 = 0xfffffc002b653300
arg = 0xfffffc000062a380 = ""
func = 0x200
a = 7012040
myprocessor = 0xffffffffa5817d3c
8 lwc_schedule(0x0, 0xfffffc002b653300, 0x2fe26a00, 0xfffffc000045f3e4, 0xfffffc000045f480)
["../../../../src/kernel/bsd/lwc.c":209, 0xfffffc000024cc28]
9 thread_preempt(thread = 0xfffffc001c25eb80, processor = 0xfffffc0000204170)
["../../../../src/kernel/kern/sched_prim.c":3487, 0xfffffc0000481680]
s = 2
pri = 5175456
pset = 0xfffffc00006b6368
10 boot(0x0, 0x0, 0x1, 0xa, 0x2) ["../../../../src/kernel/arch/alpha/machdep.c":1704, 0xfffffc00004ef8c4]
11 panic(s = 0xfffffc0000639c28 = "cpu_ip_intr: panic request") ["../../../../src/kernel/bsd/subr_prf.c":757,
0xfffffc000044ca94]
pcpu = 0xd
i = 1073751896
bootopt = 1073758432
mycpu = 65560
spl = 5
prevcc = 18446739676135287680
nextcc = 18446739676135285264
timer = -1518256128
limit = -4398044081912
12 cpu_ip_intr() ["../../../../src/kernel/arch/alpha/cpu.c":513, 0xfffffc00004e9468]
mycpu = 10
ipi_mask = 1
ipi_maskp = 0xfffffc00002045c4
percpu = 0x8
13 _XentInt(0x4, 0xfffffc000047e758, 0xfffffc000067fa90, 0x4, 0x47e74c)
["../../../../src/kernel/arch/alpha/locore.s":961, 0xfffffc00004ec7cc]
14 thread_wakeup_prim(event = 18446739675670150488, one_thread = 5, result = 7124152)
["../../../../src/kernel/kern/sched_prim.c":1455, 0xfffffc000047e754]
thread = 0x8
qe = 0xfffffc00002045c4
wq = 0xa
s = 4
state = 4427396
need_swapin_wakeup = 472247168
15 netisr_input(num = 2, m = 0xfffffc0023a67400, header = 0xffffffffa5817870 = "\377\377\377\377\377\377",
hdrlen = 14) ["../../../../src/kernel/net/netisr.c":717, 0xfffffc00004858f4]
netisr = 0xfffffc00006c7e50
s = 4
err = 0
ifq = 0xfffffc00006bbef8
ifp = (nil)
netisr = 0x20ab7a
16 ether_input(0xfffffc0000709f60, 0xffffffffa5817870, 0xfffffc0023a67400, 0x1, 0x4)
["../../../../src/kernel/net/if_ethersubr.c":1187, 0xfffffc00004843e8]
17 tu_receive_int(0x2000ffffffffffff, 0x8065c5288af, 0xfffffc000044caf0, 0xfffffc00002375d1, 0xfffffc0000583c74)
["../../../../src/kernel/io/dec/netif/if_tu.c":3403, 0xfffffc0000584940]
18 tuintr(0xfffffc001c25e210, 0xfffffc000070a460, 0xfffffc0000709f60, 0x3, 0xfffffc00004ec790)
["../../../../src/kernel/io/dec/netif/if_tu.c":2948, 0xfffffc0000583cc8]
19 intr_dispatch_post(0xfffffc002b568c40, 0x180000, 0x0, 0x0, 0x0)
["../../../../src/kernel/arch/alpha/hal/shared_intr.c":238, 0xfffffc0000516a28]
20 _XentInt(0x8, 0x12005a1e0, 0x14001c850, 0x1, 0x18) ["../../../../src/kernel/arch/alpha/locore.s":934,
0xfffffc00004ec78c]
_dump_end:
warning: Files compiled -g3: parameter values probably wrong
_kernel_thread_list_begin:
thread 0xfffffc002fdce000 stopped at [thread_run:2302 ,0xfffffc000047f8b4] Source not available
thread 0xfffffc002fdce400 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fe00800 stopped at [stop_secondary_cpu:404 ,0xfffffc00004e915c] Source not available
thread 0xfffffc002fe00c00 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fe01000 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fe01400 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fe01800 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fe01c00 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b2be000 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b2be400 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b2be800 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b2bec00 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b2bf000 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b2bf400 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b2bf800 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b2bfc00 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b63a000 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b63a400 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b63a800 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b63ac00 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b63b000 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b63b400 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b63b800 stopped at [thread_run:2287 +0x2c,0xfffffc000047f848] Source not available
thread 0xfffffc002b63bc00 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b03c000 stopped at [thread_block:1934 ,0xfffffc000047f0c8] Source not available
thread 0xfffffc002b03c400 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b03c800 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b03d000 stopped at [thread_block:1934 ,0xfffffc000047f0c8] Source not available
thread 0xfffffc002b03d400 stopped at [thread_block:1934 ,0xfffffc000047f0c8] Source not available
thread 0xfffffc002b03d800 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002b03dc00 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fcb2000 stopped at [thread_block:1934 ,0xfffffc000047f0c8] Source not available
thread 0xfffffc002fcb2400 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fcb2800 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fcb2c00 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fcb3000 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fcb3400 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
thread 0xfffffc002fcb3800 stopped at [thread_block:1919 +0x28,0xfffffc000047f058] Source not available
_kernel_thread_list_end:
_savedefp: (nil)
_kernel_memory_fault_data_begin:
struct {
fault_va = 0x0
fault_pc = 0x0
fault_ra = 0x0
fault_sp = 0x0
access = 0x0
status = 0x0
cpunum = 0x0
count = 0x0
pcb = (nil)
thread = (nil)
task = (nil)
proc = (nil)
}
_kernel_memory_fault_data_end:
Invalid character in input
_uptime: 31.75 hours
paniccpu: 0xb
machine_slot[paniccpu]: struct {
is_cpu = 0x1
cpu_type = 0xf
cpu_subtype = 0x0
running = 0x1
cpu_ticks = {
[0] 0x23c1757
[1] 0x0
[2] 0x39cd70
[3] 0x3e91992
[4] 0x9b253b
}
clock_freq = 0x400
error_restart = 0x0
cpu_panicstr = 0xffffffffa482b708 = ""
cpu_panic_thread = 0xfffffc002fe00800
}
tset machine_slot[paniccpu].cpu_panic_thread:
Begin Trace for machine_slot[paniccpu].cpu_panic_thread:
warning: Files compiled -g3: parameter values probably wrong
> 0 stop_secondary_cpu() ["../../../../src/kernel/arch/alpha/cpu.c":403, 0xfffffc00004e9158]
1 panic(s = 0xfffffc0000639ac8 = "event_timeout: panic request") ["../../../../src/kernel/bsd/subr_prf.c":669,
0xfffffc000044c8cc]
2 event_timeout(func = 0xfffffc000044cb20, arg = 0xfffffc00006ccbd0, timeout = 0x0)
["../../../../src/kernel/arch/alpha/cpu.c":863, 0xfffffc00004e9f08]
3 xcpu_puts(s = 0xffffffffa482b4e8, prfbufp = 0xfffffc00006ccbd0) ["../../../../src/kernel/bsd/subr_prf.c":810,
0xfffffc000044cb84]
4 printf(va_alist = 0xfffffc0000629aa0) ["../../../../src/kernel/bsd/subr_prf.c":355, 0xfffffc000044bed4]
5 panic(s = 0xffffffffa482b708 = "") ["../../../../src/kernel/bsd/subr_prf.c":719, 0xfffffc000044ca3c]
6 advfs_sad(0x29, 0x266, 0xfffffc000067a208, 0x0, 0x0) ["../../../../src/kernel/msfs/bs/bs_misc.c":322,
0xfffffc00003e49ac]
7 bs_osf_complete(bp = 0xfffffc0023a67600) ["../../../../src/kernel/msfs/osf/msfs_io.c":613, 0xfffffc000040d458]
8 msfs_async_iodone_lwc() ["../../../../src/kernel/msfs/osf/msfs_io.c":770, 0xfffffc000040d894]
9 lwc_schedule(0xfffffc000067b6a0, 0xfffffc000055df10, 0xfffffc000047f058, 0xfffffc002b63b800,
0xfffffc0000706f80) ["../../../../src/kernel/bsd/lwc.c":238, 0xfffffc000024ccdc]
10 thread_block() ["../../../../src/kernel/kern/sched_prim.c":1773, 0xfffffc000047edd0]
11 xpt_callback_thread() ["../../../../src/kernel/io/cam/xpt.c":2262, 0xfffffc000055e0ac]
End Trace for machine_slot[paniccpu].cpu_panic_thread:
"cpu_data" is not an array
warning: cannot get register (number = 64)
warning: cannot get register (number = 64)
warning: cannot get register (number = 64)
warning: PC value 0x0 not valid, trying RA
warning: cannot get register (number = 26)
warning: RA value 0x0 not valid, trying text start
>
warning: cannot get register (number = 64)
[alpha_bootstrap:607, 0xfffffc0000230000] lda sp, -208(sp)
_stack_trace[0]_begin:
warning: cannot get register (number = 64)
warning: cannot get register (number = 26)
warning: cannot get register (number = 30)
0 alpha_bootstrap(ffpfn = l3 address 0xffffffffffffffd0 not mapped, pte 0x0
[bad address (0xffffffffffffffd0)], ptbr = l3 address 0xffffffffffffffd8 not mapped, pte 0x0
[bad address (0xffffffffffffffd8)], argc = l3 address 0xffffffffffffffe0 not mapped, pte 0x0
[bad address (0xffffffffffffffe0)], argv = l3 address 0xffffffffffffffe8 not mapped, pte 0x0
[bad address (0xffffffffffffffe8)], sysconfigtab = l3 address 0xfffffffffffffff0 not mapped, pte 0x0
[bad address (0xfffffffffffffff0)]) ["../../../../src/kernel/arch/alpha/alpha_init.c":607, 0xfffffc0000230000]
_stack_trace[0]_end:
"cpu_data" is not an array
warning: cannot get register (number = 64)
warning: cannot get register (number = 64)
warning: cannot get register (number = 64)
warning: PC value 0x0 not valid, trying RA
warning: cannot get register (number = 26)
warning: RA value 0x0 not valid, trying text start
>
warning: cannot get register (number = 64)
[alpha_bootstrap:607, 0xfffffc0000230000] lda sp, -208(sp)
_stack_trace[1]_begin:
warning: cannot get register (number = 64)
warning: cannot get register (number = 26)
warning: cannot get register (number = 30)
0 alpha_bootstrap(ffpfn = l3 address 0xffffffffffffffd0 not mapped, pte 0x0
[bad address (0xffffffffffffffd0)], ptbr = l3 address 0xffffffffffffffd8 not mapped, pte 0x0
[bad address (0xffffffffffffffd8)], argc = l3 address 0xffffffffffffffe0 not mapped, pte 0x0
[bad address (0xffffffffffffffe0)], argv = l3 address 0xffffffffffffffe8 not mapped, pte 0x0
[bad address (0xffffffffffffffe8)], sysconfigtab = l3 address 0xfffffffffffffff0 not mapped, pte 0x0
[bad address (0xfffffffffffffff0)]) ["../../../../src/kernel/arch/alpha/alpha_init.c":607, 0xfffffc0000230000]
_stack_trace[1]_end:
_kdbx_sum_start:
Hostname : posta3
cpu: avail: 12
Boot-time: Sat Mar 29 13:58:25 1997
Time: Sun Mar 30 22:43:35 1997
Kernel : OSF1 release V3.2 version 62 (alpha)
_kdbx_sum_end:
_kdbx_swap_start:
Swap device name Size In Use Free
-------------------------------- ---------- ---------- ----------
/dev/rz24b 500000k 243312k 256688k
62500p 30414p 32086p
/dev/rz27b 500000k 241256k 258744k
62500p 30157p 32343p
-------------------------------- ---------- ---------- ----------
Total swap partitions: 2 1000000k 484568k 515432k
125000p 60571p 64429p
_kdbx_swap_end:
_kdbx_proc_start:
Addr PID PPID PGRP UID NICE SIGCATCH P_SIG Event Flags
=========== ===== ===== ===== ===== ==== ======== ======== =========== ============
k0x2fdc7210 0 0 0 0 0 00000000 00000000 NULL in sys
k0x2f4be210 1 0 1 0 0 307a7eff 00000000 NULL in pagv exec
k0x0ac94210 8 1 7 0 0 00004006 00000000 NULL in pagv exec
k0x0ac75210 24 1 24 0 0 20400000 00000000 NULL in pagv
k0x0ac74210 47 1 47 0 0 00002000 00002000 NULL in pagv
k0x0ab32210 157 1 157 0 0 00086001 00002000 NULL in pagv
k0x0ab33210 159 1 159 0 0 00004001 00000000 NULL in pagv
k0x09f12210 278 1 278 0 0 00080628 00000000 NULL in pagv
k0x09f13210 280 1 280 0 0 66006001 00000000 NULL in pagv
k0x0ab58210 282 1 282 0 0 00000000 00000000 NULL in pagv
k0x0ab59210 284 1 284 0 0 00000000 00000000 NULL in pagv
k0x0c670210 287 1 0 0 0 00002000 00000000 NULL in pagv ctty
k0x0c671210 289 1 289 0 0 00002000 00000000 NULL in pagv
k0x2f4bf210 8514 1 8514 0 0 00000000 00000000 NULL in pagv ctty exec
k0x09bf9210 334 1 333 0 0 00004000 00000000 NULL in pagv
k0x0abb8210 338 1 337 0 0 00004000 00000000 NULL in pagv
k0x0abb9210 348 1 95 0 0 00000000 00000000 NULL in pagv ctty
k0x0ac95210 350 1 0 0 0 00086000 00000000 NULL in pagv
k0x09c00210 356 1 95 0 0 00000000 00000000 NULL in pagv ctty
k0x077fa210 408 1 408 0 0 00004002 00000000 NULL in pagv
k0x0981d210 412 1 412 0 0 20004002 00000000 NULL in pagv
k0x0981c210 445 1 445 0 0 00086001 00000000 NULL in pagv
k0x09bf8210 450 1 450 0 0 00002000 00000000 v0xa4997fd4 in pagv
k0x0983a210 509 1 0 0 0 20084402 00000000 NULL in pagv
k0x08435210 515 1 515 0 0 40006007 00000000 NULL in pagv
k0x08434210 521 1 521 0 0 00084007 00000000 NULL in pagv
k0x09823210 524 521 524 0 0 00084007 00000000 NULL in pagv
k0x09822210 541 1 0 0 0 00084003 00000000 NULL in pagv
k0x0842c210 552 1 95 0 0 00000000 00000000 NULL in pagv ctty
k0x0983b210 563 1 563 0 0 20000004 00000000 NULL in pagv
k0x2b43f210 566 1 566 0 0 20004ef8 00000000 NULL in chldstop pagv
k0x077fb210 606 566 566 0 0 00000000 00000000 NULL in chldstop pagv exec
k0x0647e210 617 1 566 0 0 20004000 00000000 NULL in chldstop pagv
k0x1c25f210 726 1 726 31 0 00807ed8 00000000 NULL in pagv ctty
k0x14e0e210 730 1 654 31 0 66d0fef8 00000000 NULL in pagv ctty exec
k0x1d5eb210 5663 1 5663 31 0 00807ed8 00000000 NULL in pagv
k0x09dd3210 5667 1 5667 31 0 66d0fef8 00000000 NULL in pagv
k0x1c25e210 5669 1 5669 31 0 66d0fef8 00000000 NULL in pagv
k0x2fc5a210 5671 1 5671 31 0 66d0fef8 00000000 NULL in pagv
k0x09dd2210 5675 1 5675 31 0 66d0fef8 00000000 NULL in pagv
k0x1a226210 5676 1 5676 31 0 00807ed8 00000000 NULL in pagv
k0x2fc5b210 5678 1 5678 31 0 66d0fef8 00000000 NULL in pagv
k0x15223210 5683 1 5683 31 0 66d0fef8 00000000 NULL in pagv
k0x15222210 5686 1 5686 31 0 66d0fef8 00000000 NULL in pagv
k0x1fde5210 5689 1 5689 31 0 66d0fef8 00000000 NULL in pagv
k0x26744210 5693 1 5693 31 0 66d0fef8 00000000 NULL in pagv
k0x1fde4210 5695 1 5695 31 0 66d0fef8 00002000 NULL in pagv
k0x039bc210 5697 1 5697 31 0 00807ed8 00000000 NULL in pagv
k0x03dc9210 5700 1 5700 31 0 66d0fef8 00000000 NULL in pagv
k0x281d2210 5704 1 5704 31 0 66d0fef8 00000000 NULL in pagv
k0x281d3210 5706 1 5706 31 0 66d0fef8 00000000 NULL in pagv
k0x1b9ed210 5708 1 5708 31 0 66d0fef8 00000000 NULL in pagv
k0x1b9ec210 5712 1 5712 31 0 66d0fef8 00000000 NULL in pagv
k0x1c229210 5714 1 450 31 0 66d0fef8 00000000 NULL in pagv exec
k0x0f06e210 9970 1 6742 32 0 66d0fef8 00000000 NULL in pagv ctty exec
k0x26745210 6722 1 6722 31 0 00807ed8 00000000 NULL in pagv
k0x270ec210 6728 1 6728 31 0 00807ed8 00000000 NULL in pagv
k0x09c01210 6729 1 6729 31 0 00807ed8 00000000 NULL in pagv
k0x1d5ea210 6742 6743 6742 32 0 60007aff 00000000 NULL in pagv ctty exec
k0x0647f210 6743 445 6743 0 0 00084025 00000000 NULL in pagv exec
k0x13e11210 6751 6742 6742 32 0 66d0feff 00000000 NULL in pagv ctty exec
k0x14e0f210 10936 1 6742 32 0 66d0fef8 00000000 NULL in pagv ctty exec
k0x0901f210 11480 1 6742 32 0 66d0fef8 00000000 NULL in pagv ctty exec
_kdbx_proc_end:
Audit subsystem disabled
No audit data to be saved
#
_crash_data_collection_finished:
*********************************************************************************************
Extract fro binary.errlog: (filtered dia -R)
*********************************************************************************************
DECevent V2.1
******************************** ENTRY 1 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 1.
Timestamp of occurrence 31-MAR-1997 13:05:33
Host name posta3
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000001
CPU logging event (mperr) x0000000A
Event validity 1. O/S claims event is valid
Event severity 5. Low Priority
Entry type 300. Start-Up ASCII Message Type
SWI Minor class 9. ASCII Message
SWI Minor sub class 3. Startup
ASCII Message
Alpha boot: available memory from 0x1440000 to 0x2ffbe000
Digital UNIX V3.2G (Rev. 62); Sun Dec 29 19:18:06 MET 1996
physical memory = 768.00 megabytes.
available memory = 747.49 megabytes.
using 2940 buffers containing 22.96 megabytes of memory
Firmware revision: 0.0
PALcode: VMS version 0.00
AlphaServer 8400 Model 5/350
Master cpu at slot 10.
Created FRU table configuration errorlog packet
tiop0 at tlsb0 node 8
tiop0: cpu interrupt mask being set as 400.
pci0 at tiop0 slot 0
isp0 at pci0 slot 0
isp0: QLOGIC ISP1020 - Differential Mode
isp0: Firmware revision 2.10 (loaded by console)
scsi0 at isp0 slot 0
tz5 at scsi0 bus 0 target 5 lun 0 (DEC TZ877 (C) DEC 971B)
isp1 at pci0 slot 1
isp1: QLOGIC ISP1020 - Differential Mode
isp1: Firmware revision 2.10 (loaded by console)
scsi1 at isp1 slot 0
tu0: DECchip 21040-AA: Revision: 2.3
tu0 at pci0 slot 2
tu0: DEC TULIP Ethernet Interface, hardware address: 08-00-2B-E6-20-F1
tu0: console mode: selecting 10BaseT (UTP) port: half duplex: no link
isp2 at pci0 slot 4
isp2: QLOGIC ISP1020 - Differential Mode
isp2: Firmware revision 2.10 (loaded by console)
scsi2 at isp2 slot 0
isp3 at pci0 slot 5
isp3: QLOGIC ISP1020
isp3: Firmware revision 2.10 (loaded by console)
scsi3 at isp3 slot 0
rz24 at scsi3 bus 3 target 0 lun 0 (DEC RZ28M (C) DEC 0466)
rz27 at scsi3 bus 3 target 3 lun 0 (DEC RZ28 (C) DEC 442D)
rz28 at scsi3 bus 3 target 4 lun 0 (DEC RRD43 (C) DEC 1084)
tu1: DECchip 21040-AA: Revision: 2.3
tu1 at pci0 slot 6
tu1: DEC TULIP Ethernet Interface, hardware address: 08-00-2B-E6-1F-53
tu1: console mode: selecting 10BaseT (UTP) port: half duplex: no link
pci1 at tiop0 slot 1
pza0 at pci1 slot 1
pza0 firmware version: DEC N01 A10
scsi4 at pza0 slot 0
rz33 at scsi4 bus 4 target 1 lun 0 (DEC HSZ40 V25Z)
rz34 at scsi4 bus 4 target 2 lun 0 (DEC HSZ40 V25Z)
rz35 at scsi4 bus 4 target 3 lun 0 (DEC HSZ40 V25Z)
rz36 at scsi4 bus 4 target 4 lun 0 (DEC HSZ40 V25Z)
pza1 at pci1 slot 3
pza1 firmware version: DEC L01 A10
scsi5 at pza1 slot 0
rz41 at scsi5 bus 5 target 1 lun 0 (DEC HSZ40 V25Z)
rz42 at scsi5 bus 5 target 2 lun 0 (DEC HSZ40 V25Z)
rz43 at scsi5 bus 5 target 3 lun 0 (DEC HSZ40 V25Z)
rz44 at scsi5 bus 5 target 4 lun 0 (DEC HSZ40 V25Z)
tu2: DECchip 21140-AA: Revision: 2.0
tu2: auto negotiation capable device: National DP83840
tu2 at pci1 slot 7
tu2: DEC Fast Ethernet Interface, hardware address: 00-00-F8-02-34-39
tu2: auto negotiation off: selecting 100BaseTX (UTP) port: full duplex
tu3: DECchip 21140-AA: Revision: 2.0
tu3: auto negotiation capable device: National DP83840
tu3 at pci1 slot 8
tu3: DEC Fast Ethernet Interface, hardware address: 00-00-F8-02-34-26
tu3: auto negotiation off: selecting 100BaseTX (UTP) port: full duplex: no
link
psiop0 at pci1 slot 9
Loading SIOP: script f0001900, reg 42a2100, data 40715900
scsi6 at psiop0 slot 0
tu4: DECchip 21040-AA: Revision: 2.3
tu4 at pci1 slot 11
tu4: DEC TULIP Ethernet Interface, hardware address: 08-00-2B-E5-82-28
tu4: console mode: selecting 10Base5 (AUI) port: no carrier
TLMEM at node 7
TLMEM at node 6
Dual TLEP at node 5
lvm0: configured.
lvm1: configured.
dli: configured
SuperLAT. Copyright 1993 Meridian Technology Corp. All rights reserved.
datalink: links=64, macs=6
knbinit: sessions=256, names=64
knbtcp: configured
knbtcpd: configured
knbadm configured
nbeadmin_configure
netbeuid_configure
netbeui_configure
******************************** ENTRY 2 ********************************
******************************** ENTRY 3 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 46.
Timestamp of occurrence 30-MAR-1997 22:38:52
Host name posta3
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000002
CPU logging event (mperr) x0000000A
Event validity 1. O/S claims event is valid
Event severity 1. Severe Priority
Entry type 199. CAM SCSI Event Type
------- Unit Info -------
Bus Number 3.
Unit Number x00D8 Target = 3.
LUN = 0.
------- CAM Data -------
Class x22 DEC SIM - SCSI Interface Module
Subsystem x22 DEC SIM - SCSI Interface Module
Number of Packets 2.
------ Packet Type ------ 258. Module Name String
Routine Name ss_abort_done
------ Packet Type ------ 256. Generic String
SCSI abort tag has been performed
******************************** ENTRY 4 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 43.
Timestamp of occurrence 30-MAR-1997 22:38:30
Host name posta3
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000002
CPU logging event (mperr) x0000000A
Event validity 1. O/S claims event is valid
Event severity 1. Severe Priority
Entry type 199. CAM SCSI Event Type
------- Unit Info -------
Bus Number 3.
Unit Number x00C0 Target = 0.
LUN = 0.
------- CAM Data -------
Class x22 DEC SIM - SCSI Interface Module
Subsystem x22 DEC SIM - SCSI Interface Module
Number of Packets 2.
------ Packet Type ------ 258. Module Name String
Routine Name ss_abort_done
------ Packet Type ------ 256. Generic String
SCSI abort tag has been performed
******************************** ENTRY 5 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 40.
Timestamp of occurrence 30-MAR-1997 22:38:25
Host name posta3
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000002
CPU logging event (mperr) x0000000A
Event validity 1. O/S claims event is valid
Event severity 1. Severe Priority
Entry type 199. CAM SCSI Event Type
------- Unit Info -------
Bus Number 3.
Unit Number x00C0 Target = 0.
LUN = 0.
------- CAM Data -------
Class x22 DEC SIM - SCSI Interface Module
Subsystem x22 DEC SIM - SCSI Interface Module
Number of Packets 2.
------ Packet Type ------ 258. Module Name String
Routine Name ss_abort_done
------ Packet Type ------ 256. Generic String
SCSI abort tag has been performed
******************************** ENTRY 6 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 39.
Timestamp of occurrence 30-MAR-1997 22:38:25
Host name posta3
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000002
CPU logging event (mperr) x0000000A
Event validity 1. O/S claims event is valid
Event severity 5. Low Priority
Entry type 199. CAM SCSI Event Type
------- Unit Info -------
Bus Number 3.
Unit Number x00C0 Target = 0.
LUN = 0.
------- CAM Data -------
Class x00 Disk
Subsystem x00 Disk
Number of Packets 3.
------ Packet Type ------ 258. Module Name String
Routine Name isp_termio_abort_bdr
------ Packet Type ------ 256. Generic String
Specified IO transaction aborted
------ Packet Type ------ 1038. SIM Working Set(SIM_WS)
Packet Revision 2.
*flink xFFFFFC002D780100
*blink xFFFFFFFF804C3168
Controller # for HBA 3.
Target ID 0.
LUN 0.
Cam Status x0B Command Timeout
TAG x000000E6
Sequence Number 64376.
Time Stamp x00000000
*nexus xFFFFFFFF804C3168
*it_nexus xFFFFFFFF804C4D68
*sim_sc xFFFFFFFF804C3000
*ccb xFFFFFC0021925328
Phase Bits x00000000
Misc Flags x000E0440 This request is tagged
Command has completed
SIM expects bus free phase
Abort tag initiated on this request
Timeout
Cam Flags x00000482 SIM Queue Actions are Enabled
Data Direction (10: DATA OUT)
Disable the SIM Queue Frozen State
Error Recovery x00000080 SIM_WS in process of being timed out
Recovery Status x00000000
(*as_callback)() x0000000000000000
*as_ccb x0000000000000000
(*tmo_fn)() xFFFFFC000054FC10
*tmo_arg xFFFFFC0021925100
Rest of SIM_WS ** Not Printed **
******************************** ENTRY 7 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 38.
Timestamp of occurrence 30-MAR-1997 22:38:25
Host name posta3
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000002
CPU logging event (mperr) x0000000A
Event validity 1. O/S claims event is valid
Event severity 1. Severe Priority
Entry type 199. CAM SCSI Event Type
------- Unit Info -------
Bus Number 3.
Unit Number x00C0 Target = 0.
LUN = 0.
------- CAM Data -------
Class x22 DEC SIM - SCSI Interface Module
Subsystem x22 DEC SIM - SCSI Interface Module
Number of Packets 3.
------ Packet Type ------ 258. Module Name String
Routine Name ss_perform_timeout
------ Packet Type ------ 256. Generic String
timeout on disconnected request
------ Packet Type ------ 1038. SIM Working Set(SIM_WS)
Packet Revision 2.
*flink xFFFFFC002D780100
*blink xFFFFFFFF804C3168
Controller # for HBA 3.
Target ID 0.
LUN 0.
Cam Status x00 CCB Request In Progress
TAG x000000E6
Sequence Number 64376.
Time Stamp x00000000
*nexus xFFFFFFFF804C3168
*it_nexus xFFFFFFFF804C4D68
*sim_sc xFFFFFFFF804C3000
*ccb xFFFFFC0021925328
Phase Bits x00000000
Misc Flags x00080040 This request is tagged
Timeout
Cam Flags x00000482 SIM Queue Actions are Enabled
Data Direction (10: DATA OUT)
Disable the SIM Queue Frozen State
Error Recovery x00000080 SIM_WS in process of being timed out
Recovery Status x00000000
(*as_callback)() x0000000000000000
*as_ccb x0000000000000000
(*tmo_fn)() xFFFFFC000054FC10
*tmo_arg xFFFFFC0021925100
Rest of SIM_WS ** Not Printed **
******************************** ENTRY 8 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 37.
Timestamp of occurrence 30-MAR-1997 22:38:11
Host name posta3
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000002
CPU logging event (mperr) x0000000A
Event validity 1. O/S claims event is valid
Event severity 1. Severe Priority
Entry type 199. CAM SCSI Event Type
------- Unit Info -------
Bus Number 3.
Unit Number x00C0 Target = 0.
LUN = 0.
------- CAM Data -------
Class x22 DEC SIM - SCSI Interface Module
Subsystem x22 DEC SIM - SCSI Interface Module
Number of Packets 2.
------ Packet Type ------ 258. Module Name String
Routine Name ss_abort_done
------ Packet Type ------ 256. Generic String
SCSI abort tag has been performed
******************************** ENTRY 9 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 36.
Timestamp of occurrence 30-MAR-1997 22:38:11
Host name posta3
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000002
CPU logging event (mperr) x0000000A
Event validity 1. O/S claims event is valid
Event severity 5. Low Priority
Entry type 199. CAM SCSI Event Type
------- Unit Info -------
Bus Number 3.
Unit Number x00C0 Target = 0.
LUN = 0.
------- CAM Data -------
Class x00 Disk
Subsystem x00 Disk
Number of Packets 3.
------ Packet Type ------ 258. Module Name String
Routine Name isp_termio_abort_bdr
------ Packet Type ------ 256. Generic String
Specified IO transaction aborted
------ Packet Type ------ 1038. SIM Working Set(SIM_WS)
Packet Revision 2.
*flink xFFFFFC0021925100
*blink xFFFFFFFF804C3168
Controller # for HBA 3.
Target ID 0.
LUN 0.
Cam Status x0B Command Timeout
TAG x0000007B
Sequence Number 64375.
Time Stamp x00000000
*nexus xFFFFFFFF804C3168
*it_nexus xFFFFFFFF804C4D68
*sim_sc xFFFFFFFF804C3000
*ccb xFFFFFC002FE63328
Phase Bits x00000000
Misc Flags x000E0440 This request is tagged
Command has completed
SIM expects bus free phase
Abort tag initiated on this request
Timeout
Cam Flags x00000482 SIM Queue Actions are Enabled
Data Direction (10: DATA OUT)
Disable the SIM Queue Frozen State
Error Recovery x00000080 SIM_WS in process of being timed out
Recovery Status x00000000
(*as_callback)() x0000000000000000
*as_ccb x0000000000000000
(*tmo_fn)() xFFFFFC000054FC10
*tmo_arg xFFFFFC002FE63100
Rest of SIM_WS ** Not Printed **
******************************** ENTRY 10 ********************************
T.R | Title | User | Personal Name | Date | Lines |
---|
9359.1 | please try CANASTA ... | HAN::HALLE | Volker Halle MCS @HAO DTN 863-5216 | Wed Apr 02 1997 11:07 | 6 |
| Peter,
did you try the CANASTA Mail Server ? If not, please read note #8919
and send your crash-data file to CANASTA.
Volker.
|
9359.2 | Hard error on AdvFS metadata | NETRIX::"[email protected]" | Tim Mark | Wed Apr 02 1997 13:58 | 3 |
| The panic was caused because a write to an AdvFS metadata file failed. This
was probably due to the hard errors seen on the disks.
[Posted by WWW Notes gateway]
|
9359.3 | | BRADEC::PODOLINSKY | Peter Podolinsky - MCS Slovakia | Wed Apr 02 1997 15:05 | 7 |
| Thanks,
you and CANASTA confirmed, that it was a HW related crash.
What remains opened is the cause of I/O errors. I have checked the system again. There are
no errors logged since the boot.
If the problem reoccurs we will start swapping HW. What would you suggest to start with ?
Regards,
Peter
|