[Search for users]
[Overall Top Noters]
[List of all Conferences]
[Download this site]
Title: | Alpha Support Conference |
Notice: | This is a new Alphanotes, please read note 2.2 |
Moderator: | VAXAXP::BERNARDO |
|
Created: | Thu Jan 02 1997 |
Last Modified: | Fri Jun 06 1997 |
Last Successful Update: | Fri Jun 06 1997 |
Number of topics: | 128 |
Total number of notes: | 617 |
63.0. "AlphaServer 1000 machine check" by 50239::HOLZGREVE () Tue Mar 18 1997 03:12
AlphaServer 1000 4/233 machine check 20c - nonexistent memory error.
The mchk_osf.v35 program tells me:
The Error code entered above has the following meaning
Non-existent Memory Error
Indicates that a read or write occurred to an invalid address
which does not map to any memory bank, CSR or I/O quadrant.
Most likley Broken = CPU CARD
The CPU requested transaction caused the error.
The CPU, system board and memory are replaced, but I still have the same
problem.
Help needed !!!
Uwe Holzgreve
MCS Cologne, Germany
-------------------------------------------------------------------------------
#
# Crash Data Collection (Version 1.4)
#
_crash_data_collection_time: Mon Mar 17 15:15:41 MET 1997
_current_directory: /
_crash_kernel: /var/adm/crash/vmunix.24
_crash_core: /var/adm/crash/vmcore.24
_crash_arch: alpha
_crash_os: Digital UNIX
_host_version: Digital UNIX V4.0B (Rev. 564); Fri Feb 14 16:19:27 MET 1997
_crash_version: Digital UNIX V4.0B (Rev. 564); Fri Feb 14 16:19:27 MET 1997
_crashtime: struct {
tv_sec = 858607849
tv_usec = 857821
}
_boottime: struct {
tv_sec = 858345679
tv_usec = 766025
}
_config: struct {
sysname = "OSF1"
nodename = "iraalph"
release = "V4.0"
version = "564"
machine = "alpha"
}
_cpu: 43
_system_string: 0xffffffffff800a58 = "AlphaServer 1000 4/233"
_ncpus: 1
_avail_cpus: 1
_partial_dump: 1
_physmem(MBytes): 191
_panic_string: 0xfffffc000067c090 = "Machine check - Hardware error"
_paniccpu: 0
_panic_thread: 0xfffffc00063662c0
_preserved_message_buffer_begin:
struct {
msg_magic = 0x63061
msg_bufx = 0x78
msg_bufr = 0x582
msg_bufc = " 0 0 0 0 0, block 253956
device string for dump = RAID 0 13 0 0 0 0 0.
DUMP.prom: dev RAID 0 13 0 0 0 0 0, block 253956
hysical memory = 192.00 megabytes.
available memory = 174.73 megabytes.
using 730 buffers containing 5.70 megabytes of memory
AlphaServer 1000 4/233
Firmware revision: 4.7
PALcode: OSF version 1.45
pci0 at nexus
psiop0 at pci0 slot 6
Loading SIOP: script 801500, reg 82004000, data 406c74b0
scsi0 at psiop0 slot 0
rz4 at scsi0 target 4 lun 0 (LID=0) (DEC RRD43 (C) DEC 1084)
eisa0 at pci0
ace0 at eisa0
ace1 at eisa0
lp0 at eisa0
fdi0 at eisa0
fd0 at fdi0 unit 0
qvision0 at eisa0
qvision0: CMPQ Qvision 1024/E SVGA
tu0: DECchip 21040-AA: Revision: 2.4
tu0 at pci0 slot 12
tu0: DEC TULIP Ethernet Interface, hardware address: 00-00-F8-21-EC-2A
tu0: console mode: selecting 10Base2 (BNC) port: no carrier
Initializing xcr0. Please wait....
xcr0 at pci0 slot 13
re0 at xcr0 unit 0 (unit status = ONLINE, raid level = 1)
re1 at xcr0 unit 1 (unit status = ONLINE, raid level = 5)
gpc0 at eisa0
lvm0: configured.
lvm1: configured.
kernel console: qvision0
dli: configured
ATM Subsystem configured with 1 restart threads
ATM UNI 3.x signalling: configured
ATM IP interface: configured
ADVFS: using 1738 buffers containing 13.57 megabytes of memory
Environmental Monitoring Subsystem Configured.
SuperLAT. Copyright 1994 Meridian Technology Corp. All rights reserved.
lp0: printer offline
AlphaServer 1000 4/233 machine check type 0x660.
retry = 0xffffffff
mchk_code = 0x20c
paltemp[1] = 0x7
paltemp[2] = 0x4
paltemp[3] = 0x0
paltemp[4] = 0x6000
paltemp[5] = 0x0
paltemp[6] = 0x2340
paltemp[7] = 0x4200
paltemp[8] = 0x400
paltemp[9] = 0x7
paltemp[10] = 0x4fd930
paltemp[11] = 0x0
paltemp[12] = 0x4fdcd0
paltemp[13] = 0x4fdd00
paltemp[14] = 0x4fdd60
paltemp[15] = 0x4fdad0
paltemp[16] = 0x4fd7a0
paltemp[17] = 0x19310
paltemp[18] = 0x1ffff610
paltemp[19] = 0x8c577a38
paltemp[20] = 0x698260
paltemp[21] = 0x0
paltemp[22] = 0x626e6e6e
paltemp[23] = 0x0
paltemp[24] = 0x0
paltemp[25] = 0x50000
paltemp[26] = 0x1071c80
paltemp[27] = 0x0
paltemp[28] = 0xae4c000
paltemp[29] = 0x0
paltemp[30] = 0x1
paltemp[31] = 0x8d2da38
exc_addr = 0x4fcd3a
exc_sum = 0x0
msk = 0x0
iccsr = 0x4
pal_base = 0x14000
hier = 0x0
hirr = 0x0
mm_csr = 0x3640
dc_stat = 0x3
dc_addr = 0xffffffff
abox_ctl = 0x942e
biu_stat = 0x241
biu_addr = 0x6390
biu_ctl = 0x10002227
fill_syndrome = 0x0
fill_adr = 0x6100
va = 0x6170
bc_tag = 0x42492448
coma_gcr = 0x7fb20034
coma_edsr = 0xbf32108
coma_ter = 0x6fb17fe0
coma_elar = 0x6fb1031c
coma_ehar = 0x6fb10800
coma_ldlr = 0x6fb126fb
coma_ldhr = 0x6fb10057
coma_base0 = 0x6fb10200
coma_base1 = 0x6fb10000
coma_base2 = 0x22310000
coma_base3 = 0xbf30000
coma_cnfg0 = 0x22310049
coma_cnfg1 = 0x22310067
coma_cnfg2 = 0x22310000
coma_cnfg3 = 0x7fb20000
epic_dcsr = 0x801e0019
epic_pear = 0x807a40
epic_sear = 0x2fe2570
epic_tbr1 = 0x8a0000
epic_tbr2 = 0x0
epic_pbr1 = 0x8c0000
epic_pbr2 = 0x40080000
epic_pmr1 = 0x700000
epic_pmr2 = 0x3ff00000
epic_harx1 = 0x80000000
epic_harx2 = 0x0
epic_pmlt = 0xff
epic_tag0 = 0x802000
epic_tag1 = 0x800000
epic_tag2 = 0x806000
epic_tag3 = 0x803000
epic_tag4 = 0x801000
epic_tag5 = 0x807000
epic_tag6 = 0x802000
epic_tag7 = 0x800000
epic_data0 = 0x6c0
epic_data1 = 0x6be
epic_data2 = 0x6c4
epic_data3 = 0x6c0
epic_data4 = 0x6be
epic_data5 = 0x6c4
epic_data6 = 0x6c0
epic_data7 = 0x6be
pceb_vid = 0x8086
pceb_did = 0x482
pceb_revision = 0x4
pceb_command = 0x7
pceb_status = 0x200
pceb_latency = 0xf8
pceb_control = 0x60
pceb_arbcon = 0x9d
pceb_arbpri = 0x4
esc_id = 0xf
esc_revision = 0x3
esc_int0 = 0xa1
esc_int1 = 0xef
esc_elcr0 = 0x0
esc_elcr1 = 0x0
esc_last_eisa = 0xff
esc_nmi_stat = 0x30
pci_ir = 0xff
pci_imr = 0x10
svr_mgr = 0xd4
panic (cpu 0): Machine check - Hardware error
syncing disks... device string for dump = RAID 0 13 0 0 0 0 0.
DUMP.prom: dev RAID 0 13"
}
_preserved_message_buffer_end:
_kernel_process_status_begin:
PID COMM
00000 kernel idle
00001 init
00003 kloadsrv
24586 smbd
00029 update
24650 smbd
00102 syslogd
00104 binlogd
21613 smbd
03259 smbd
08442 smbd
23833 sh
00287 portmap
00289 mountd
00291 nfsd
00293 nfsiod
00295 rpc.pcnfsd
00298 rpc.statd
00300 rpc.lockd
00306 automount
23877 mwm
23885 dxterm
00364 sendmail
23919 ksh
23930 dxsysinfo
00380 xntpd
23938 smbd
23940 dxsession
00409 pmgrd
02467 xdm
30119 smbd
00425 snmpd
00427 os_mibs
00445 svrSystem_mib
00450 envmond
10704 smbd
00469 advfsd
12766 smbd
00480 inetd
00513 cron
00551 lpd
27184 smbd
00568 smbd
00570 nmbd
00584 xdm
00588 Xdec
00602 swxcrmon
00603 getty
10849 smbd
12904 smbd
12923 smbd
23200 smbd
24280 smbd
24314 smbd
13067 smbd
29470 smbd
13093 lpd
13094 comsat
23348 smbd
13109 hplaserof
23369 dxconsole
26457 smbd
11112 smbd
23411 dxsysinfo
_kernel_process_status_end:
_current_pid: 10704
_current_tid: 0xfffffc00063662c0
_proc_thread_list_begin:
thread 0xfffffc00063662c0 stopped at [boot:2466 ,0xfffffc0000501da8] Source not available
_proc_thread_list_end:
_dump_begin:
> 0 boot(0x400000000, 0x0, 0xfffffc000027c7d4, 0xfffffc0000627830, 0xfffffc0000627830) ["../../../../src/kernel/arch/alpha/machdep.c":2466, 0xfffffc0000501da8]
1 panic(s = 0xfffffc0000629a20 = "thread_block: interrupt level call") ["../../../../src/kernel/bsd/subr_prf.c":707, 0xfffffc000027ae9c]
pcpu = 0xfffffc00006ca8a0
i = 6461984
mycpu = 0
spl = 5
2 thread_block() ["../../../../src/kernel/kern/sched_prim.c":1925, 0xfffffc00002a8410]
thread = 0xfffffc00063662c0
new_thread = 0xfffffc00063662c0
mycpu = 0
myprocessor = 0xfffffc00001c2100
s = 5
pset = 0x1000
3 thread_preempt(thread = 0x26, processor = 0xfffffc00001c2100) ["../../../../src/kernel/kern/sched_prim.c":3820, 0xfffffc00002aafa4]
s = 2
pset = 0x1
4 boot(0x0, 0xfffffc00063662c0, 0x2c0000002c, 0x35, 0x1) ["../../../../src/kernel/arch/alpha/machdep.c":2410, 0xfffffc0000501c90]
5 panic(s = 0xfffffc000067c090 = "Machine check - Hardware error") ["../../../../src/kernel/bsd/subr_prf.c":791, 0xfffffc000027b03c]
pcpu = 0xfffffc00006ca8a0
i = 6980192
mycpu = 0
spl = 7
6 machcheck(0xfffffc0000006398, 0x630, 0x800100000001, 0x100000014, 0xffffffff8c577430) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3030, 0xfffffc000052e420]
7 mach_error(0x800100000001, 0x100000014, 0xffffffff8c577430, 0xffffffff8c577500, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":808, 0xfffffc0000512c88]
8 _XentInt(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
9 bcopy(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/fastcopy.s":324, 0xfffffc00004fcd34]
10 harderr_intr(0x1, 0xfffffc0000692bc8, 0xfffffc0000690548, 0xfffffc00091c5720, 0xfffffc0000000000) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3367, 0xfffffc000052f520]
11 softerr_intr(0xfffffc0000000000, 0xfffffc00003bb3b8, 0xfffffc0000512c94, 0xfffffc0000006398, 0x630) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":4338, 0xfffffc00005302c4]
12 mach_error(0xfffffc0000512c94, 0xfffffc0000006398, 0x630, 0x29, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":802, 0xfffffc0000512c90]
13 _XentInt(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
14 bcopy(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/fastcopy.s":278, 0xfffffc00004fccd4]
15 pmap_create(size = 18446744065111220072) ["../../../../src/kernel/arch/alpha/pmap.c":1904, 0xfffffc000051de30]
map = 0xfffffc00099bde00
stats = (nil)
scratch = union {
quadword = 18446739675685418240
PTE_BITFIELD = struct {
_v = 0
_for = 0
_fow = 0
_foe = 0
_asm = 0
_gh = 0
0
_prot = 117
_exec = 1
_wire = 0
_seg = 1
_ssm = 0
_gh_shared = 1
_soft = 2
_lw_wire = 1
_pfn = 4294966272
}
}
16 vm_map_fork(0xfffffc00002ad1e8, 0xfffffc0001557500, 0xfffffc00091c5500, 0x1, 0x1) ["../../../../src/kernel/vm/vm_map.c":1723, 0xfffffc00004dd924]
17 task_create(parent_task = 0xfffffc0001557500, new_task = 0xfffffc00091c5500) ["../../../../src/kernel/kern/task.c":458, 0xfffffc00002ad1e4]
pset = 0x1
tasks = -4398041577748
threads = 0
kr = 6892488
18 procdup(0xfffffc00091c5500, 0xfffffc00091c5720, 0xfffffc000025ef94, 0xfffffc00091c5500, 0x0) ["../../../../src/kernel/vm/vm_unix.c":582, 0xfffffc00004e9ee4]
19 newproc(0xffffffff8c5778e0, 0x0, 0xfffffc00001c92b0, 0xfffffc0000692bc8, 0xfffffc00091c5500) ["../../../../src/kernel/bsd/kern_fork.c":717, 0xfffffc000025efb0]
20 fork1(0xfffffc0001557720, 0xfffffc00063662c0, 0xfffffc0000690930, 0xfffffc0001557500, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":565, 0xfffffc000025ec28]
21 fork(0x0, 0xfffffc0001557720, 0xfffffc0000507a84, 0x0, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":486, 0xfffffc000025e9d4]
22 syscall(0x1, 0xffffffff8c577ce0, 0x41a9a00000000, 0xfffffc0001a42c00, 0x2) ["../../../../src/kernel/arch/alpha/syscall_trap.c":540, 0xfffffc0000507a80]
23 _Xsyscall(0x8, 0x3ff80180ac0, 0x3ffc00931b0, 0xd, 0x0) ["../../../../src/kernel/arch/alpha/locore.s":1209, 0xfffffc00004fdb44]
_dump_end:
warning: Files compiled -g3: parameter values probably wrong
_kernel_thread_list_begin:
thread 0xfffffc000bf402c0 stopped at [thread_run:2469 ,0xfffffc00002a8f08] Source not available
thread 0xfffffc000bf40580 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bf40840 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bf40b00 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bf40dc0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bf41080 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bf41340 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bf41600 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bf418c0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bf41b80 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea2000 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea22c0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea2580 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea2840 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea2b00 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea2dc0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea3080 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea3340 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea3600 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea38c0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000bea3b80 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000be8a000 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000be8a580 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000be8a840 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000be8ab00 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000be8adc0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000be8b080 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000be8b340 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000be8b600 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc000be8b8c0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc00035fd600 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc00035fd8c0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc00035fdb80 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002adc000 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002adc2c0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002adc580 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002adc840 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002adcb00 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002adcdc0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002add080 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002add340 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002add600 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002add8c0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002addb80 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002ad4000 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002ad42c0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002ad4840 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002ad4b00 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002ad4dc0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002ad5080 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002ad5340 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002ad5600 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
thread 0xfffffc0002ad58c0 stopped at [thread_block:2097 ,0xfffffc00002a8740] Source not available
_kernel_thread_list_end:
_savedefp: (nil)
_kernel_memory_fault_data_begin:
struct {
fault_va = 0x0
fault_pc = 0x0
fault_ra = 0x0
fault_sp = 0x0
access = 0x0
status = 0x0
cpunum = 0x0
count = 0x0
pcb = (nil)
thread = (nil)
task = (nil)
proc = (nil)
}
_kernel_memory_fault_data_end:
_uptime: 72.82 hours
paniccpu: 0x0
machine_slot[paniccpu]: struct {
is_cpu = 0x1
cpu_type = 0xf
cpu_subtype = 0x11
running = 0x1
cpu_ticks = {
[0] 0x134927c
[1] 0x0
[2] 0x763b62
[3] 0xe55bda5
[4] 0x67ec
}
clock_freq = 0x400
error_restart = 0x0
cpu_panicstr = 0xfffffc000067c090 = "Machine check - Hardware error"
cpu_panic_thread = 0xfffffc00063662c0
}
tset machine_slot[paniccpu].cpu_panic_thread:
Begin Trace for machine_slot[paniccpu].cpu_panic_thread:
> 0 boot(0x400000000, 0x0, 0xfffffc000027c7d4, 0xfffffc0000627830, 0xfffffc0000627830) ["../../../../src/kernel/arch/alpha/machdep.c":2466, 0xfffffc0000501da8]
1 panic(s = 0xfffffc0000629a20 = "thread_block: interrupt level call") ["../../../../src/kernel/bsd/subr_prf.c":707, 0xfffffc000027ae9c]
2 thread_block() ["../../../../src/kernel/kern/sched_prim.c":1925, 0xfffffc00002a8410]
3 thread_preempt(thread = 0x26, processor = 0xfffffc00001c2100) ["../../../../src/kernel/kern/sched_prim.c":3820, 0xfffffc00002aafa4]
4 boot(0x0, 0xfffffc00063662c0, 0x2c0000002c, 0x35, 0x1) ["../../../../src/kernel/arch/alpha/machdep.c":2410, 0xfffffc0000501c90]
5 panic(s = 0xfffffc000067c090 = "Machine check - Hardware error") ["../../../../src/kernel/bsd/subr_prf.c":791, 0xfffffc000027b03c]
6 machcheck(0xfffffc0000006398, 0x630, 0x800100000001, 0x100000014, 0xffffffff8c577430) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3030, 0xfffffc000052e420]
7 mach_error(0x800100000001, 0x100000014, 0xffffffff8c577430, 0xffffffff8c577500, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":808, 0xfffffc0000512c88]
8 _XentInt(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
9 bcopy(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/fastcopy.s":324, 0xfffffc00004fcd34]
10 harderr_intr(0x1, 0xfffffc0000692bc8, 0xfffffc0000690548, 0xfffffc00091c5720, 0xfffffc0000000000) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3367, 0xfffffc000052f520]
11 softerr_intr(0xfffffc0000000000, 0xfffffc00003bb3b8, 0xfffffc0000512c94, 0xfffffc0000006398, 0x630) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":4338, 0xfffffc00005302c4]
12 mach_error(0xfffffc0000512c94, 0xfffffc0000006398, 0x630, 0x29, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":802, 0xfffffc0000512c90]
13 _XentInt(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
14 bcopy(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/fastcopy.s":278, 0xfffffc00004fccd4]
15 pmap_create(size = 0xfffffffdff7fdf68) ["../../../../src/kernel/arch/alpha/pmap.c":1904, 0xfffffc000051de30]
16 vm_map_fork(0xfffffc00002ad1e8, 0xfffffc0001557500, 0xfffffc00091c5500, 0x1, 0x1) ["../../../../src/kernel/vm/vm_map.c":1723, 0xfffffc00004dd924]
17 task_create(parent_task = 0xfffffc0001557500, new_task = 0xfffffc00091c5500) ["../../../../src/kernel/kern/task.c":458, 0xfffffc00002ad1e4]
18 procdup(0xfffffc00091c5500, 0xfffffc00091c5720, 0xfffffc000025ef94, 0xfffffc00091c5500, 0x0) ["../../../../src/kernel/vm/vm_unix.c":582, 0xfffffc00004e9ee4]
19 newproc(0xffffffff8c5778e0, 0x0, 0xfffffc00001c92b0, 0xfffffc0000692bc8, 0xfffffc00091c5500) ["../../../../src/kernel/bsd/kern_fork.c":717, 0xfffffc000025efb0]
20 fork1(0xfffffc0001557720, 0xfffffc00063662c0, 0xfffffc0000690930, 0xfffffc0001557500, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":565, 0xfffffc000025ec28]
21 fork(0x0, 0xfffffc0001557720, 0xfffffc0000507a84, 0x0, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":486, 0xfffffc000025e9d4]
22 syscall(0x1, 0xffffffff8c577ce0, 0x41a9a00000000, 0xfffffc0001a42c00, 0x2) ["../../../../src/kernel/arch/alpha/syscall_trap.c":540, 0xfffffc0000507a80]
23 _Xsyscall(0x8, 0x3ff80180ac0, 0x3ffc00931b0, 0xd, 0x0) ["../../../../src/kernel/arch/alpha/locore.s":1209, 0xfffffc00004fdb44]
End Trace for machine_slot[paniccpu].cpu_panic_thread:
"cpu_data" is not an array
_stack_trace[0]_begin:
> 0 boot(0x400000000, 0x0, 0xfffffc000027c7d4, 0xfffffc0000627830, 0xfffffc0000627830) ["../../../../src/kernel/arch/alpha/machdep.c":2466, 0xfffffc0000501da8]
1 panic(s = 0xfffffc0000629a20 = "thread_block: interrupt level call") ["../../../../src/kernel/bsd/subr_prf.c":707, 0xfffffc000027ae9c]
2 thread_block() ["../../../../src/kernel/kern/sched_prim.c":1925, 0xfffffc00002a8410]
3 thread_preempt(thread = 0x26, processor = 0xfffffc00001c2100) ["../../../../src/kernel/kern/sched_prim.c":3820, 0xfffffc00002aafa4]
4 boot(0x0, 0xfffffc00063662c0, 0x2c0000002c, 0x35, 0x1) ["../../../../src/kernel/arch/alpha/machdep.c":2410, 0xfffffc0000501c90]
5 panic(s = 0xfffffc000067c090 = "Machine check - Hardware error") ["../../../../src/kernel/bsd/subr_prf.c":791, 0xfffffc000027b03c]
6 machcheck(0xfffffc0000006398, 0x630, 0x800100000001, 0x100000014, 0xffffffff8c577430) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3030, 0xfffffc000052e420]
7 mach_error(0x800100000001, 0x100000014, 0xffffffff8c577430, 0xffffffff8c577500, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":808, 0xfffffc0000512c88]
8 _XentInt(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
9 bcopy(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/fastcopy.s":324, 0xfffffc00004fcd34]
10 harderr_intr(0x1, 0xfffffc0000692bc8, 0xfffffc0000690548, 0xfffffc00091c5720, 0xfffffc0000000000) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3367, 0xfffffc000052f520]
11 softerr_intr(0xfffffc0000000000, 0xfffffc00003bb3b8, 0xfffffc0000512c94, 0xfffffc0000006398, 0x630) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":4338, 0xfffffc00005302c4]
12 mach_error(0xfffffc0000512c94, 0xfffffc0000006398, 0x630, 0x29, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":802, 0xfffffc0000512c90]
13 _XentInt(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
14 bcopy(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/fastcopy.s":278, 0xfffffc00004fccd4]
15 pmap_create(size = 18446744065111220072) ["../../../../src/kernel/arch/alpha/pmap.c":1904, 0xfffffc000051de30]
16 vm_map_fork(0xfffffc00002ad1e8, 0xfffffc0001557500, 0xfffffc00091c5500, 0x1, 0x1) ["../../../../src/kernel/vm/vm_map.c":1723, 0xfffffc00004dd924]
17 task_create(parent_task = 0xfffffc0001557500, new_task = 0xfffffc00091c5500) ["../../../../src/kernel/kern/task.c":458, 0xfffffc00002ad1e4]
18 procdup(0xfffffc00091c5500, 0xfffffc00091c5720, 0xfffffc000025ef94, 0xfffffc00091c5500, 0x0) ["../../../../src/kernel/vm/vm_unix.c":582, 0xfffffc00004e9ee4]
19 newproc(0xffffffff8c5778e0, 0x0, 0xfffffc00001c92b0, 0xfffffc0000692bc8, 0xfffffc00091c5500) ["../../../../src/kernel/bsd/kern_fork.c":717, 0xfffffc000025efb0]
20 fork1(0xfffffc0001557720, 0xfffffc00063662c0, 0xfffffc0000690930, 0xfffffc0001557500, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":565, 0xfffffc000025ec28]
21 fork(0x0, 0xfffffc0001557720, 0xfffffc0000507a84, 0x0, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":486, 0xfffffc000025e9d4]
22 syscall(0x1, 0xffffffff8c577ce0, 0x41a9a00000000, 0xfffffc0001a42c00, 0x2) ["../../../../src/kernel/arch/alpha/syscall_trap.c":540, 0xfffffc0000507a80]
23 _Xsyscall(0x8, 0x3ff80180ac0, 0x3ffc00931b0, 0xd, 0x0) ["../../../../src/kernel/arch/alpha/locore.s":1209, 0xfffffc00004fdb44]
_stack_trace[0]_end:
_kdbx_sum_start:
Hostname : iraalph
cpu: AlphaServer 1000 4/233 avail: 1
Boot-time: Fri Mar 14 14:21:19 1997
Time: Mon Mar 17 15:10:49 1997
Kernel : OSF1 release V4.0 version 564 (alpha)
_kdbx_sum_end:
_kdbx_swap_start:
Swap device name Size In Use Free
-------------------------------- ---------- ---------- ----------
/dev/re0b 500096k 10848k 489248k
62512p 1356p 61156p
-------------------------------- ---------- ---------- ----------
Total swap partitions: 1 500096k 10848k 489248k
62512p 1356p 61156p
_kdbx_swap_end:
_kdbx_proc_start:
Addr PID PPID PGRP UID NICE SIGCATCH P_SIG Event Flags
=========== ===== ===== ===== ===== ==== ======== ======== =========== ============
k0x0bf3cca0 0 0 0 0 0 00000000 00000000 NULL in sys
k0x0beaaca0 1 0 1 0 0 307a62ff 00000000 NULL in contign pagv
k0x03602220 3 1 3 0 0 00004000 00000000 NULL in pagv
k0x07c50ca0 24586 568 0 0 0 00001601 00000000 NULL in pagv
k0x03602ca0 29 1 29 0 0 00002000 00000000 NULL in pagv
k0x01556ca0 24650 568 0 0 0 00001601 00000000 NULL in pagv
k0x035fa220 102 1 102 0 0 60086001 00000000 NULL in pagv
k0x035fb720 104 1 104 0 0 00004001 00000000 NULL in pagv
k0x08f73720 21613 568 0 0 0 00001601 00000000 NULL in pagv
k0x091c4220 3259 568 0 0 0 00001601 00000000 NULL in pagv
k0x0463a220 8442 568 0 0 0 00001601 00000000 NULL in pagv
k0x01354220 23833 23940 23940 0 0 00000000 00000000 NULL in pagv
k0x034a8ca0 287 1 287 0 0 00080628 00000000 NULL in pagv
k0x034a8220 289 1 289 0 0 66006001 00000000 NULL in pagv
k0x034a9720 291 1 291 0 0 00000000 00000000 NULL in pagv
k0x0349c220 293 1 293 0 0 00000000 00000000 NULL in pagv
k0x0349cca0 295 1 0 0 0 00000001 00000000 NULL in pagv
k0x0349d720 298 1 0 0 0 00002000 00000000 NULL in pagv ctty
k0x0290c220 300 1 300 0 0 00002000 00000000 NULL in pagv
k0x0290cca0 306 1 0 0 0 00084001 00000000 NULL in pagv
k0x01354ca0 23877 23940 23940 0 0 00004007 00000000 NULL in pagv
k0x0760e220 23885 23940 23940 0 0 00080000 00000000 NULL in pagv
k0x02a8c220 364 1 0 0 0 00086000 00000000 NULL in pagv
k0x01556220 23919 23885 23919 0 0 60083aff 00000000 NULL in pagv ctty
k0x0a9a7720 23930 23411 23940 0 0 00004007 00000000 NULL in pagv
k0x0290d720 380 1 380 0 -12 60486007 00002000 NULL in pagv
k0x07e55720 23938 568 0 0 0 00001601 00000000 NULL in pagv
k0x0760f720 23940 2467 23940 0 0 00080000 00000000 NULL in pagv
k0x035faca0 409 1 409 0 -1 00004002 00000000 NULL in pagv
k0x019ec220 2467 584 2467 0 0 20004002 00000000 NULL in pagv
k0x07c50220 30119 568 0 0 0 00001601 00000000 NULL in pagv
k0x028ed720 425 1 425 0 0 20004002 00000000 NULL in pagv
k0x02a8cca0 427 1 427 0 0 00004002 00000000 NULL in pagv
k0x028ecca0 445 1 445 0 0 00004002 00000000 NULL in pagv
k0x02a8d720 450 1 450 0 0 00004003 00000000 NULL in pagv
k0x01557720 10704 568 0 0 0 00001601 00000000 NULL in pagv
k0x019ecca0 469 1 77 0 -1 60027eff 00000000 NULL in pagv ctty
k0x07c51720 12766 568 0 0 0 00001601 00000000 NULL in pagv
k0x028ec220 480 1 480 0 0 40082001 00000000 NULL in pagv
k0x030e2220 513 1 513 0 0 00002000 00000000 NULL in pagv
k0x030e2ca0 551 1 551 0 0 00084007 00000000 NULL in pagv
k0x087d2ca0 27184 568 0 0 0 00001601 00000000 NULL in pagv
k0x02eeb720 568 1 0 0 0 00080601 00000000 NULL in pagv
k0x02eeaca0 570 1 0 0 0 00005601 00000000 NULL in pagv
k0x02eea220 584 1 0 0 0 20084003 00000000 NULL in pagv
k0x047d2220 588 584 588 0 0 00004003 00000000 NULL in pagv
k0x03603720 602 1 602 0 0 20004002 00000000 NULL in pagv
k0x0beab720 603 1 603 0 0 00000000 00000000 NULL in pagv ctty
k0x08f72220 10849 568 0 0 0 00001601 00000000 NULL in pagv
k0x08b64ca0 12904 568 0 0 0 00001601 00000000 NULL in pagv
k0x01355720 12923 568 0 0 0 00001601 00000000 NULL in pagv
k0x08b64220 23200 568 0 0 0 00001601 00000000 NULL in pagv
k0x07e54ca0 24280 568 0 0 0 00001601 00000000 NULL in pagv
k0x047d3720 24314 568 0 0 0 00001601 00000000 NULL in pagv
k0x030e3720 13067 568 0 0 0 00001601 00000000 NULL in pagv
k0x091c4ca0 13093 551 13093 0 0 00084007 00000000 NULL in pagv
k0x07e54220 13094 480 13094 0 0 60082000 00000000 NULL in pagv
k0x08b65720 23348 568 0 0 0 00001601 00000000 NULL in pagv
k0x08f72ca0 13109 13093 13093 1 0 00000000 00000000 NULL in pagv
k0x0a9a6220 23369 23833 23940 0 0 00000000 00000000 NULL in pagv
k0x047d2ca0 26457 568 0 0 0 00001601 00000000 NULL in pagv
k0x0760eca0 11112 568 0 0 0 00001601 00000000 NULL in pagv
k0x0a9a6ca0 23411 1 23940 0 0 00004007 00000000 NULL in pagv
_kdbx_proc_end:
Audit subsystem disabled
No audit data to be saved
#
_crash_data_collection_finished:
T.R | Title | User | Personal Name | Date | Lines |
---|
63.1 | Cross-Posted; Tried CANASTA? | XDELTA::HOFFMAN | Steve, OpenVMS Engineering | Tue Mar 18 1997 12:39 | 7 |
|
.0 is already posted in WRKSYS::MIKASA 857.*, and (probably) over in
TURRIS::DIGITAL_UNIX.
You might want to try running the crashdump through CANASTA (See the
conference at SPECXN::CANASTA) and see if it recognizes this crash.
|
63.2 | CANASTA points to mchk program | COL01::HOLZGREVE | | Wed Mar 19 1997 03:47 | 3 |
| CANASTA points me to the mchk program I have used.
Uwe
|
63.3 | IPMT | XDELTA::HOFFMAN | Steve, OpenVMS Engineering | Wed Mar 19 1997 08:50 | 5 |
| : CANASTA points me to the mchk program I have used.
And if it did that, it also means you should then elevate via formal
channels, as it means CANASTA hasn't seen this crash before...
|
63.4 | think about a possible SW problem ... | HAN::HALLE | Volker Halle MCS @HAO DTN 863-5216 | Thu Mar 20 1997 02:00 | 19 |
| Uwe,
are you able to find out the ADDRESS, which caused the non-existant
memory error to happen ? The program might be trying to access an
non-existant address, so it might well be a software error ! This also
comes to mind AFTER all the hardware has been replaced !!!
Just as an example, some of the 'ping-of-death' footprints also showed
up as machine check crashes and those were CLEARLY a SOFTWARE problem !
The ROUTINE which incurred this fault is bcopy (called from pmap_create)
If you look in the CAN_OSF1_CASES.SEQ file, you'll find a lot of
machine check crashes happening in bcopy. See note SPECXN::CANASTA #239
and use the following search command:
$ search file "mach_error _XentInt bcopy"
Volker.
|
63.5 | HOW TO ? | COL01::HOLZGREVE | | Thu Mar 20 1997 09:49 | 7 |
| Volker,
how to find out the ADDRESS, which caused the non-existant memory error
to happen ?!
Uwe
|
63.6 | just guessing ... | HAN::HALLE | Volker Halle MCS @HAO DTN 863-5216 | Thu Mar 20 1997 12:55 | 53 |
| Uwe,
you know, that I'm not a UNIX expert, but I'm just guessing. Maybe you
should get a better response in the UNIX notes conference.
On OpenVMS, you can find the instruction, that causes the NMX error.
It should be possible on UNIX as well.
I would expect one of the parameters to _XentINT to be the 'invalid
addr' ?! Maybe another one the actual PC of the problem. Can you try
to check with dbx ?
Volker.
...
12 mach_error(0xfffffc0000512c94, 0xfffffc0000006398, 0x630, 0x29,
0xfffffc00004fda40)
["../../../../src/kernel/arch/alpha/hal/cpusw.c":802, 0xfffffc0000512c90]
13 _XentInt(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68,
0xfffffc000873bf68)
["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
14 bcopy(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68,
0xfffffc000873bf68)
["../../../../src/kernel/arch/alpha/fastcopy.s":278, 0xfffffc00004fccd4]
15 pmap_create(size = 18446744065111220072)
^^^^^^^^^^^^^^^^^^^^<<< does THIS look sane ????
["../../../../src/kernel/arch/alpha/pmap.c":1904, 0xfffffc000051de30]
map = 0xfffffc00099bde00
stats = (nil)
scratch = union {
quadword = 18446739675685418240
PTE_BITFIELD = struct {
_v = 0
_for = 0
_fow = 0
_foe = 0
_asm = 0
_gh = 0
0
_prot = 117
_exec = 1
_wire = 0
_seg = 1
_ssm = 0
_gh_shared = 1
_soft = 2
_lw_wire = 1
_pfn = 4294966272 <<< ????????????
}
}
|
63.7 | try exc_addr ? | HAN::HALLE | Volker Halle MCS @HAO DTN 863-5216 | Thu Mar 20 1997 13:11 | 12 |
|
From an MCHECK error DECevent posting, I think EXC_ADDR is the address
of the failing instruction (this is just an example):
Exception Address Reg xFFFFFC0000303500
Native-mode Instruction
Exception PC x3FFFFF00000C0D40
Try examining the instruction pointed to by exc_addr (you'll find this
in the mcheck frame in the messages buffer).
Volker.
|