Conference vaxaxp::alphanotes

Title: Alpha Support Conference
Notice: This is a new Alphanotes, please read note 2.2
Moderator: VAXAXP::BERNARDO
Created: Thu Jan 02 1997
Last Modified: Fri Jun 06 1997
Last Successful Update: Fri Jun 06 1997
Number of topics: 128
Total number of notes: 617

63.0. "AlphaServer 1000 machine check" by 50239::HOLZGREVE () Tue Mar 18 1997 03:12

AlphaServer 1000 4/233 machine check 20c - nonexistent memory error.
The mchk_osf.v35 program tells me:

The Error code entered above has the following meaning

Non-existent Memory Error
Indicates that a read or write occurred to an invalid address
which does not map to any memory bank, CSR or I/O quadrant.
Most likley Broken = CPU CARD

The CPU requested transaction caused the error.


The CPU, system board, and memory have all been replaced, but I still have
the same problem.

Help needed!

Uwe Holzgreve
MCS Cologne, Germany

-------------------------------------------------------------------------------

#
# Crash Data Collection (Version 1.4)
#
_crash_data_collection_time: Mon Mar 17 15:15:41 MET 1997
_current_directory: /
_crash_kernel: /var/adm/crash/vmunix.24
_crash_core: /var/adm/crash/vmcore.24
_crash_arch: alpha
_crash_os: Digital UNIX
_host_version: Digital UNIX V4.0B  (Rev. 564); Fri Feb 14 16:19:27 MET 1997 
_crash_version: Digital UNIX V4.0B  (Rev. 564); Fri Feb 14 16:19:27 MET 1997 

_crashtime:  struct {
    tv_sec = 858607849
    tv_usec = 857821
} 
_boottime:  struct {
    tv_sec = 858345679
    tv_usec = 766025
} 
_config:  struct {
    sysname = "OSF1"
    nodename = "iraalph"
    release = "V4.0"
    version = "564"
    machine = "alpha"
} 
_cpu:  43 
_system_string:  0xffffffffff800a58 = "AlphaServer 1000 4/233" 
_ncpus:  1 
_avail_cpus:  1 
_partial_dump:  1 
_physmem(MBytes):  191 
_panic_string:  0xfffffc000067c090 = "Machine check - Hardware error" 
_paniccpu:  0 
_panic_thread:  0xfffffc00063662c0 
_preserved_message_buffer_begin: 
struct {
    msg_magic = 0x63061
    msg_bufx = 0x78
    msg_bufr = 0x582
    msg_bufc = " 0 0 0 0 0, block 253956
device string for dump = RAID 0 13 0 0 0 0 0.
DUMP.prom: dev RAID 0 13 0 0 0 0 0, block 253956
hysical memory = 192.00 megabytes.
available memory = 174.73 megabytes.
using 730 buffers containing 5.70 megabytes of memory
AlphaServer 1000 4/233
Firmware revision: 4.7
PALcode: OSF version 1.45
pci0 at nexus
psiop0 at pci0 slot 6
Loading SIOP: script 801500, reg 82004000, data 406c74b0
scsi0 at psiop0 slot 0
rz4 at scsi0 target 4 lun 0 (LID=0) (DEC     RRD43   (C) DEC  1084)
eisa0 at pci0
ace0 at eisa0
ace1 at eisa0
lp0 at eisa0
fdi0 at eisa0
fd0 at fdi0 unit 0
qvision0 at eisa0
qvision0: CMPQ Qvision 1024/E SVGA
tu0: DECchip 21040-AA: Revision: 2.4
tu0 at pci0 slot 12
tu0: DEC TULIP Ethernet Interface, hardware address: 00-00-F8-21-EC-2A
tu0: console mode: selecting 10Base2 (BNC) port: no carrier
Initializing xcr0.  Please wait....
xcr0 at pci0 slot 13
re0 at xcr0 unit 0 (unit status = ONLINE, raid level = 1)
re1 at xcr0 unit 1 (unit status = ONLINE, raid level = 5)
gpc0 at eisa0
lvm0: configured.
lvm1: configured.
kernel console: qvision0
dli: configured
ATM Subsystem configured with 1 restart threads
ATM UNI 3.x signalling: configured
ATM IP interface: configured
ADVFS: using 1738 buffers containing 13.57 megabytes of memory
Environmental Monitoring Subsystem Configured.
SuperLAT. Copyright 1994 Meridian Technology Corp. All rights reserved.
lp0: printer offline
AlphaServer 1000 4/233 machine check type 0x660.
  retry		= 0xffffffff
  mchk_code	= 0x20c
  paltemp[1]	= 0x7
  paltemp[2]	= 0x4
  paltemp[3]	= 0x0
  paltemp[4]	= 0x6000
  paltemp[5]	= 0x0
  paltemp[6]	= 0x2340
  paltemp[7]	= 0x4200
  paltemp[8]	= 0x400
  paltemp[9]	= 0x7
  paltemp[10]	= 0x4fd930
  paltemp[11]	= 0x0
  paltemp[12]	= 0x4fdcd0
  paltemp[13]	= 0x4fdd00
  paltemp[14]	= 0x4fdd60
  paltemp[15]	= 0x4fdad0
  paltemp[16]	= 0x4fd7a0
  paltemp[17]	= 0x19310
  paltemp[18]	= 0x1ffff610
  paltemp[19]	= 0x8c577a38
  paltemp[20]	= 0x698260
  paltemp[21]	= 0x0
  paltemp[22]	= 0x626e6e6e
  paltemp[23]	= 0x0
  paltemp[24]	= 0x0
  paltemp[25]	= 0x50000
  paltemp[26]	= 0x1071c80
  paltemp[27]	= 0x0
  paltemp[28]	= 0xae4c000
  paltemp[29]	= 0x0
  paltemp[30]	= 0x1
  paltemp[31]	= 0x8d2da38
  exc_addr	= 0x4fcd3a
  exc_sum	= 0x0
  msk		= 0x0
  iccsr		= 0x4
  pal_base	= 0x14000
  hier		= 0x0
  hirr		= 0x0
  mm_csr	= 0x3640
  dc_stat	= 0x3
  dc_addr	= 0xffffffff
  abox_ctl	= 0x942e
  biu_stat	= 0x241
  biu_addr	= 0x6390
  biu_ctl	= 0x10002227
  fill_syndrome	= 0x0
  fill_adr	= 0x6100
  va		= 0x6170
  bc_tag	= 0x42492448

  coma_gcr	= 0x7fb20034
  coma_edsr	= 0xbf32108
  coma_ter	= 0x6fb17fe0
  coma_elar	= 0x6fb1031c
  coma_ehar	= 0x6fb10800
  coma_ldlr	= 0x6fb126fb
  coma_ldhr	= 0x6fb10057
  coma_base0	= 0x6fb10200
  coma_base1	= 0x6fb10000
  coma_base2	= 0x22310000
  coma_base3	= 0xbf30000
  coma_cnfg0	= 0x22310049
  coma_cnfg1	= 0x22310067
  coma_cnfg2	= 0x22310000
  coma_cnfg3	= 0x7fb20000

  epic_dcsr	= 0x801e0019
  epic_pear	= 0x807a40
  epic_sear	= 0x2fe2570
  epic_tbr1	= 0x8a0000
  epic_tbr2	= 0x0
  epic_pbr1	= 0x8c0000
  epic_pbr2	= 0x40080000
  epic_pmr1	= 0x700000
  epic_pmr2	= 0x3ff00000
  epic_harx1	= 0x80000000
  epic_harx2	= 0x0
  epic_pmlt	= 0xff
  epic_tag0	= 0x802000
  epic_tag1	= 0x800000
  epic_tag2	= 0x806000
  epic_tag3	= 0x803000
  epic_tag4	= 0x801000
  epic_tag5	= 0x807000
  epic_tag6	= 0x802000
  epic_tag7	= 0x800000
  epic_data0	= 0x6c0
  epic_data1	= 0x6be
  epic_data2	= 0x6c4
  epic_data3	= 0x6c0
  epic_data4	= 0x6be
  epic_data5	= 0x6c4
  epic_data6	= 0x6c0
  epic_data7	= 0x6be

  pceb_vid	= 0x8086
  pceb_did	= 0x482
  pceb_revision	= 0x4
  pceb_command	= 0x7
  pceb_status	= 0x200
  pceb_latency	= 0xf8
  pceb_control	= 0x60
  pceb_arbcon	= 0x9d
  pceb_arbpri	= 0x4

  esc_id	= 0xf
  esc_revision	= 0x3
  esc_int0	= 0xa1
  esc_int1	= 0xef
  esc_elcr0	= 0x0
  esc_elcr1	= 0x0
  esc_last_eisa	= 0xff
  esc_nmi_stat	= 0x30

  pci_ir	= 0xff
  pci_imr	= 0x10
  svr_mgr	= 0xd4
panic (cpu 0): Machine check - Hardware error
syncing disks... device string for dump = RAID 0 13 0 0 0 0 0.
DUMP.prom: dev RAID 0 13"
} 
_preserved_message_buffer_end: 
_kernel_process_status_begin: 
  PID	COMM
00000	kernel idle
00001	init
00003	kloadsrv
24586	smbd
00029	update
24650	smbd
00102	syslogd
00104	binlogd
21613	smbd
03259	smbd
08442	smbd
23833	sh
00287	portmap
00289	mountd
00291	nfsd
00293	nfsiod
00295	rpc.pcnfsd
00298	rpc.statd
00300	rpc.lockd
00306	automount
23877	mwm
23885	dxterm
00364	sendmail
23919	ksh
23930	dxsysinfo
00380	xntpd
23938	smbd
23940	dxsession
00409	pmgrd
02467	xdm
30119	smbd
00425	snmpd
00427	os_mibs
00445	svrSystem_mib
00450	envmond
10704	smbd
00469	advfsd
12766	smbd
00480	inetd
00513	cron
00551	lpd
27184	smbd
00568	smbd
00570	nmbd
00584	xdm
00588	Xdec
00602	swxcrmon
00603	getty
10849	smbd
12904	smbd
12923	smbd
23200	smbd
24280	smbd
24314	smbd
13067	smbd
29470	smbd
13093	lpd
13094	comsat
23348	smbd
13109	hplaserof
23369	dxconsole
26457	smbd
11112	smbd
23411	dxsysinfo
_kernel_process_status_end: 
_current_pid:  10704 
_current_tid:  0xfffffc00063662c0 
_proc_thread_list_begin: 
thread 0xfffffc00063662c0 stopped at  [boot:2466 ,0xfffffc0000501da8]	 Source not available
_proc_thread_list_end: 
_dump_begin: 
>  0 boot(0x400000000, 0x0, 0xfffffc000027c7d4, 0xfffffc0000627830, 0xfffffc0000627830) ["../../../../src/kernel/arch/alpha/machdep.c":2466, 0xfffffc0000501da8]

   1 panic(s = 0xfffffc0000629a20 = "thread_block: interrupt level call") ["../../../../src/kernel/bsd/subr_prf.c":707, 0xfffffc000027ae9c]
pcpu = 0xfffffc00006ca8a0
i = 6461984
mycpu = 0
spl = 5

   2 thread_block() ["../../../../src/kernel/kern/sched_prim.c":1925, 0xfffffc00002a8410]
thread = 0xfffffc00063662c0
new_thread = 0xfffffc00063662c0
mycpu = 0
myprocessor = 0xfffffc00001c2100
s = 5
pset = 0x1000

   3 thread_preempt(thread = 0x26, processor = 0xfffffc00001c2100) ["../../../../src/kernel/kern/sched_prim.c":3820, 0xfffffc00002aafa4]
s = 2
pset = 0x1

   4 boot(0x0, 0xfffffc00063662c0, 0x2c0000002c, 0x35, 0x1) ["../../../../src/kernel/arch/alpha/machdep.c":2410, 0xfffffc0000501c90]

   5 panic(s = 0xfffffc000067c090 = "Machine check - Hardware error") ["../../../../src/kernel/bsd/subr_prf.c":791, 0xfffffc000027b03c]
pcpu = 0xfffffc00006ca8a0
i = 6980192
mycpu = 0
spl = 7

   6 machcheck(0xfffffc0000006398, 0x630, 0x800100000001, 0x100000014, 0xffffffff8c577430) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3030, 0xfffffc000052e420]

   7 mach_error(0x800100000001, 0x100000014, 0xffffffff8c577430, 0xffffffff8c577500, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":808, 0xfffffc0000512c88]

   8 _XentInt(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]

   9 bcopy(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/fastcopy.s":324, 0xfffffc00004fcd34]

  10 harderr_intr(0x1, 0xfffffc0000692bc8, 0xfffffc0000690548, 0xfffffc00091c5720, 0xfffffc0000000000) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3367, 0xfffffc000052f520]

  11 softerr_intr(0xfffffc0000000000, 0xfffffc00003bb3b8, 0xfffffc0000512c94, 0xfffffc0000006398, 0x630) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":4338, 0xfffffc00005302c4]

  12 mach_error(0xfffffc0000512c94, 0xfffffc0000006398, 0x630, 0x29, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":802, 0xfffffc0000512c90]

  13 _XentInt(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]

  14 bcopy(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/fastcopy.s":278, 0xfffffc00004fccd4]

  15 pmap_create(size = 18446744065111220072) ["../../../../src/kernel/arch/alpha/pmap.c":1904, 0xfffffc000051de30]
map = 0xfffffc00099bde00
stats = (nil)
scratch = union {
    quadword = 18446739675685418240
    PTE_BITFIELD = struct {
        _v = 0
        _for = 0
        _fow = 0
        _foe = 0
        _asm = 0
        _gh = 0
        0
        _prot = 117
        _exec = 1
        _wire = 0
        _seg = 1
        _ssm = 0
        _gh_shared = 1
        _soft = 2
        _lw_wire = 1
        _pfn = 4294966272
    }
}

  16 vm_map_fork(0xfffffc00002ad1e8, 0xfffffc0001557500, 0xfffffc00091c5500, 0x1, 0x1) ["../../../../src/kernel/vm/vm_map.c":1723, 0xfffffc00004dd924]

  17 task_create(parent_task = 0xfffffc0001557500, new_task = 0xfffffc00091c5500) ["../../../../src/kernel/kern/task.c":458, 0xfffffc00002ad1e4]
pset = 0x1
tasks = -4398041577748
threads = 0
kr = 6892488

  18 procdup(0xfffffc00091c5500, 0xfffffc00091c5720, 0xfffffc000025ef94, 0xfffffc00091c5500, 0x0) ["../../../../src/kernel/vm/vm_unix.c":582, 0xfffffc00004e9ee4]

  19 newproc(0xffffffff8c5778e0, 0x0, 0xfffffc00001c92b0, 0xfffffc0000692bc8, 0xfffffc00091c5500) ["../../../../src/kernel/bsd/kern_fork.c":717, 0xfffffc000025efb0]

  20 fork1(0xfffffc0001557720, 0xfffffc00063662c0, 0xfffffc0000690930, 0xfffffc0001557500, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":565, 0xfffffc000025ec28]

  21 fork(0x0, 0xfffffc0001557720, 0xfffffc0000507a84, 0x0, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":486, 0xfffffc000025e9d4]

  22 syscall(0x1, 0xffffffff8c577ce0, 0x41a9a00000000, 0xfffffc0001a42c00, 0x2) ["../../../../src/kernel/arch/alpha/syscall_trap.c":540, 0xfffffc0000507a80]

  23 _Xsyscall(0x8, 0x3ff80180ac0, 0x3ffc00931b0, 0xd, 0x0) ["../../../../src/kernel/arch/alpha/locore.s":1209, 0xfffffc00004fdb44]

_dump_end: 

warning: Files compiled -g3: parameter values probably wrong
_kernel_thread_list_begin: 
thread 0xfffffc000bf402c0 stopped at   [thread_run:2469 ,0xfffffc00002a8f08]	 Source not available
thread 0xfffffc000bf40580 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bf40840 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bf40b00 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bf40dc0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bf41080 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bf41340 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bf41600 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bf418c0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bf41b80 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea2000 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea22c0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea2580 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea2840 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea2b00 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea2dc0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea3080 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea3340 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea3600 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea38c0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000bea3b80 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000be8a000 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000be8a580 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000be8a840 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000be8ab00 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000be8adc0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000be8b080 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000be8b340 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000be8b600 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc000be8b8c0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc00035fd600 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc00035fd8c0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc00035fdb80 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002adc000 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002adc2c0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002adc580 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002adc840 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002adcb00 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002adcdc0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002add080 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002add340 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002add600 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002add8c0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002addb80 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002ad4000 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002ad42c0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002ad4840 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002ad4b00 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002ad4dc0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002ad5080 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002ad5340 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002ad5600 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
thread 0xfffffc0002ad58c0 stopped at   [thread_block:2097 ,0xfffffc00002a8740]	 Source not available
_kernel_thread_list_end: 
_savedefp:  (nil) 
_kernel_memory_fault_data_begin:  
struct {
    fault_va = 0x0
    fault_pc = 0x0
    fault_ra = 0x0
    fault_sp = 0x0
    access = 0x0
    status = 0x0
    cpunum = 0x0
    count = 0x0
    pcb = (nil)
    thread = (nil)
    task = (nil)
    proc = (nil)
} 
_kernel_memory_fault_data_end:  
_uptime: 72.82 hours

paniccpu: 0x0 
machine_slot[paniccpu]: struct {
    is_cpu = 0x1
    cpu_type = 0xf
    cpu_subtype = 0x11
    running = 0x1
    cpu_ticks = {
        [0] 0x134927c
        [1] 0x0
        [2] 0x763b62
        [3] 0xe55bda5
        [4] 0x67ec
    }
    clock_freq = 0x400
    error_restart = 0x0
    cpu_panicstr = 0xfffffc000067c090 = "Machine check - Hardware error"
    cpu_panic_thread = 0xfffffc00063662c0
} 
tset machine_slot[paniccpu].cpu_panic_thread: 
Begin Trace for machine_slot[paniccpu].cpu_panic_thread: 
>  0 boot(0x400000000, 0x0, 0xfffffc000027c7d4, 0xfffffc0000627830, 0xfffffc0000627830) ["../../../../src/kernel/arch/alpha/machdep.c":2466, 0xfffffc0000501da8]
   1 panic(s = 0xfffffc0000629a20 = "thread_block: interrupt level call") ["../../../../src/kernel/bsd/subr_prf.c":707, 0xfffffc000027ae9c]
   2 thread_block() ["../../../../src/kernel/kern/sched_prim.c":1925, 0xfffffc00002a8410]
   3 thread_preempt(thread = 0x26, processor = 0xfffffc00001c2100) ["../../../../src/kernel/kern/sched_prim.c":3820, 0xfffffc00002aafa4]
   4 boot(0x0, 0xfffffc00063662c0, 0x2c0000002c, 0x35, 0x1) ["../../../../src/kernel/arch/alpha/machdep.c":2410, 0xfffffc0000501c90]
   5 panic(s = 0xfffffc000067c090 = "Machine check - Hardware error") ["../../../../src/kernel/bsd/subr_prf.c":791, 0xfffffc000027b03c]
   6 machcheck(0xfffffc0000006398, 0x630, 0x800100000001, 0x100000014, 0xffffffff8c577430) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3030, 0xfffffc000052e420]
   7 mach_error(0x800100000001, 0x100000014, 0xffffffff8c577430, 0xffffffff8c577500, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":808, 0xfffffc0000512c88]
   8 _XentInt(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
   9 bcopy(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/fastcopy.s":324, 0xfffffc00004fcd34]
  10 harderr_intr(0x1, 0xfffffc0000692bc8, 0xfffffc0000690548, 0xfffffc00091c5720, 0xfffffc0000000000) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3367, 0xfffffc000052f520]
  11 softerr_intr(0xfffffc0000000000, 0xfffffc00003bb3b8, 0xfffffc0000512c94, 0xfffffc0000006398, 0x630) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":4338, 0xfffffc00005302c4]
  12 mach_error(0xfffffc0000512c94, 0xfffffc0000006398, 0x630, 0x29, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":802, 0xfffffc0000512c90]
  13 _XentInt(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
  14 bcopy(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/fastcopy.s":278, 0xfffffc00004fccd4]
  15 pmap_create(size = 0xfffffffdff7fdf68) ["../../../../src/kernel/arch/alpha/pmap.c":1904, 0xfffffc000051de30]
  16 vm_map_fork(0xfffffc00002ad1e8, 0xfffffc0001557500, 0xfffffc00091c5500, 0x1, 0x1) ["../../../../src/kernel/vm/vm_map.c":1723, 0xfffffc00004dd924]
  17 task_create(parent_task = 0xfffffc0001557500, new_task = 0xfffffc00091c5500) ["../../../../src/kernel/kern/task.c":458, 0xfffffc00002ad1e4]
  18 procdup(0xfffffc00091c5500, 0xfffffc00091c5720, 0xfffffc000025ef94, 0xfffffc00091c5500, 0x0) ["../../../../src/kernel/vm/vm_unix.c":582, 0xfffffc00004e9ee4]
  19 newproc(0xffffffff8c5778e0, 0x0, 0xfffffc00001c92b0, 0xfffffc0000692bc8, 0xfffffc00091c5500) ["../../../../src/kernel/bsd/kern_fork.c":717, 0xfffffc000025efb0]
  20 fork1(0xfffffc0001557720, 0xfffffc00063662c0, 0xfffffc0000690930, 0xfffffc0001557500, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":565, 0xfffffc000025ec28]
  21 fork(0x0, 0xfffffc0001557720, 0xfffffc0000507a84, 0x0, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":486, 0xfffffc000025e9d4]
  22 syscall(0x1, 0xffffffff8c577ce0, 0x41a9a00000000, 0xfffffc0001a42c00, 0x2) ["../../../../src/kernel/arch/alpha/syscall_trap.c":540, 0xfffffc0000507a80]
  23 _Xsyscall(0x8, 0x3ff80180ac0, 0x3ffc00931b0, 0xd, 0x0) ["../../../../src/kernel/arch/alpha/locore.s":1209, 0xfffffc00004fdb44]
End Trace for machine_slot[paniccpu].cpu_panic_thread: 

"cpu_data" is not an array
_stack_trace[0]_begin: 
>  0 boot(0x400000000, 0x0, 0xfffffc000027c7d4, 0xfffffc0000627830, 0xfffffc0000627830) ["../../../../src/kernel/arch/alpha/machdep.c":2466, 0xfffffc0000501da8]
   1 panic(s = 0xfffffc0000629a20 = "thread_block: interrupt level call") ["../../../../src/kernel/bsd/subr_prf.c":707, 0xfffffc000027ae9c]
   2 thread_block() ["../../../../src/kernel/kern/sched_prim.c":1925, 0xfffffc00002a8410]
   3 thread_preempt(thread = 0x26, processor = 0xfffffc00001c2100) ["../../../../src/kernel/kern/sched_prim.c":3820, 0xfffffc00002aafa4]
   4 boot(0x0, 0xfffffc00063662c0, 0x2c0000002c, 0x35, 0x1) ["../../../../src/kernel/arch/alpha/machdep.c":2410, 0xfffffc0000501c90]
   5 panic(s = 0xfffffc000067c090 = "Machine check - Hardware error") ["../../../../src/kernel/bsd/subr_prf.c":791, 0xfffffc000027b03c]
   6 machcheck(0xfffffc0000006398, 0x630, 0x800100000001, 0x100000014, 0xffffffff8c577430) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3030, 0xfffffc000052e420]
   7 mach_error(0x800100000001, 0x100000014, 0xffffffff8c577430, 0xffffffff8c577500, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":808, 0xfffffc0000512c88]
   8 _XentInt(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
   9 bcopy(0x7, 0xfffffc00004fcd38, 0xfffffc0000698260, 0xfffffc0100006397, 0xfffffc0000722810) ["../../../../src/kernel/arch/alpha/fastcopy.s":324, 0xfffffc00004fcd34]
  10 harderr_intr(0x1, 0xfffffc0000692bc8, 0xfffffc0000690548, 0xfffffc00091c5720, 0xfffffc0000000000) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":3367, 0xfffffc000052f520]
  11 softerr_intr(0xfffffc0000000000, 0xfffffc00003bb3b8, 0xfffffc0000512c94, 0xfffffc0000006398, 0x630) ["../../../../src/kernel/arch/alpha/hal/kn22a.c":4338, 0xfffffc00005302c4]
  12 mach_error(0xfffffc0000512c94, 0xfffffc0000006398, 0x630, 0x29, 0xfffffc00004fda40) ["../../../../src/kernel/arch/alpha/hal/cpusw.c":802, 0xfffffc0000512c90]
  13 _XentInt(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]
  14 bcopy(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68) ["../../../../src/kernel/arch/alpha/fastcopy.s":278, 0xfffffc00004fccd4]
  15 pmap_create(size = 18446744065111220072) ["../../../../src/kernel/arch/alpha/pmap.c":1904, 0xfffffc000051de30]
  16 vm_map_fork(0xfffffc00002ad1e8, 0xfffffc0001557500, 0xfffffc00091c5500, 0x1, 0x1) ["../../../../src/kernel/vm/vm_map.c":1723, 0xfffffc00004dd924]
  17 task_create(parent_task = 0xfffffc0001557500, new_task = 0xfffffc00091c5500) ["../../../../src/kernel/kern/task.c":458, 0xfffffc00002ad1e4]
  18 procdup(0xfffffc00091c5500, 0xfffffc00091c5720, 0xfffffc000025ef94, 0xfffffc00091c5500, 0x0) ["../../../../src/kernel/vm/vm_unix.c":582, 0xfffffc00004e9ee4]
  19 newproc(0xffffffff8c5778e0, 0x0, 0xfffffc00001c92b0, 0xfffffc0000692bc8, 0xfffffc00091c5500) ["../../../../src/kernel/bsd/kern_fork.c":717, 0xfffffc000025efb0]
  20 fork1(0xfffffc0001557720, 0xfffffc00063662c0, 0xfffffc0000690930, 0xfffffc0001557500, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":565, 0xfffffc000025ec28]
  21 fork(0x0, 0xfffffc0001557720, 0xfffffc0000507a84, 0x0, 0x0) ["../../../../src/kernel/bsd/kern_fork.c":486, 0xfffffc000025e9d4]
  22 syscall(0x1, 0xffffffff8c577ce0, 0x41a9a00000000, 0xfffffc0001a42c00, 0x2) ["../../../../src/kernel/arch/alpha/syscall_trap.c":540, 0xfffffc0000507a80]
  23 _Xsyscall(0x8, 0x3ff80180ac0, 0x3ffc00931b0, 0xd, 0x0) ["../../../../src/kernel/arch/alpha/locore.s":1209, 0xfffffc00004fdb44]
_stack_trace[0]_end: 

_kdbx_sum_start:
Hostname : iraalph
cpu: AlphaServer 1000 4/233	avail: 1
Boot-time:	Fri Mar 14 14:21:19 1997
Time:	Mon Mar 17 15:10:49 1997
Kernel : OSF1 release V4.0 version 564 (alpha)
_kdbx_sum_end:
_kdbx_swap_start:

       Swap device name              Size       In Use       Free
--------------------------------  ----------  ----------  ----------
/dev/re0b                            500096k      10848k     489248k
                                      62512p       1356p      61156p
--------------------------------  ----------  ----------  ----------
Total swap partitions:    1          500096k      10848k     489248k
                                      62512p       1356p      61156p
_kdbx_swap_end:
_kdbx_proc_start:
Addr        PID   PPID  PGRP  UID   NICE SIGCATCH P_SIG    Event       Flags
=========== ===== ===== ===== ===== ==== ======== ======== =========== ============
k0x0bf3cca0     0     0     0     0    0 00000000 00000000        NULL in sys
k0x0beaaca0     1     0     1     0    0 307a62ff 00000000        NULL in contign pagv
k0x03602220     3     1     3     0    0 00004000 00000000        NULL in pagv
k0x07c50ca0 24586   568     0     0    0 00001601 00000000        NULL in pagv
k0x03602ca0    29     1    29     0    0 00002000 00000000        NULL in pagv
k0x01556ca0 24650   568     0     0    0 00001601 00000000        NULL in pagv
k0x035fa220   102     1   102     0    0 60086001 00000000        NULL in pagv
k0x035fb720   104     1   104     0    0 00004001 00000000        NULL in pagv
k0x08f73720 21613   568     0     0    0 00001601 00000000        NULL in pagv
k0x091c4220  3259   568     0     0    0 00001601 00000000        NULL in pagv
k0x0463a220  8442   568     0     0    0 00001601 00000000        NULL in pagv
k0x01354220 23833 23940 23940     0    0 00000000 00000000        NULL in pagv
k0x034a8ca0   287     1   287     0    0 00080628 00000000        NULL in pagv
k0x034a8220   289     1   289     0    0 66006001 00000000        NULL in pagv
k0x034a9720   291     1   291     0    0 00000000 00000000        NULL in pagv
k0x0349c220   293     1   293     0    0 00000000 00000000        NULL in pagv
k0x0349cca0   295     1     0     0    0 00000001 00000000        NULL in pagv
k0x0349d720   298     1     0     0    0 00002000 00000000        NULL in pagv ctty
k0x0290c220   300     1   300     0    0 00002000 00000000        NULL in pagv
k0x0290cca0   306     1     0     0    0 00084001 00000000        NULL in pagv
k0x01354ca0 23877 23940 23940     0    0 00004007 00000000        NULL in pagv
k0x0760e220 23885 23940 23940     0    0 00080000 00000000        NULL in pagv
k0x02a8c220   364     1     0     0    0 00086000 00000000        NULL in pagv
k0x01556220 23919 23885 23919     0    0 60083aff 00000000        NULL in pagv ctty
k0x0a9a7720 23930 23411 23940     0    0 00004007 00000000        NULL in pagv
k0x0290d720   380     1   380     0  -12 60486007 00002000        NULL in pagv
k0x07e55720 23938   568     0     0    0 00001601 00000000        NULL in pagv
k0x0760f720 23940  2467 23940     0    0 00080000 00000000        NULL in pagv
k0x035faca0   409     1   409     0   -1 00004002 00000000        NULL in pagv
k0x019ec220  2467   584  2467     0    0 20004002 00000000        NULL in pagv
k0x07c50220 30119   568     0     0    0 00001601 00000000        NULL in pagv
k0x028ed720   425     1   425     0    0 20004002 00000000        NULL in pagv
k0x02a8cca0   427     1   427     0    0 00004002 00000000        NULL in pagv
k0x028ecca0   445     1   445     0    0 00004002 00000000        NULL in pagv
k0x02a8d720   450     1   450     0    0 00004003 00000000        NULL in pagv
k0x01557720 10704   568     0     0    0 00001601 00000000        NULL in pagv
k0x019ecca0   469     1    77     0   -1 60027eff 00000000        NULL in pagv ctty
k0x07c51720 12766   568     0     0    0 00001601 00000000        NULL in pagv
k0x028ec220   480     1   480     0    0 40082001 00000000        NULL in pagv
k0x030e2220   513     1   513     0    0 00002000 00000000        NULL in pagv
k0x030e2ca0   551     1   551     0    0 00084007 00000000        NULL in pagv
k0x087d2ca0 27184   568     0     0    0 00001601 00000000        NULL in pagv
k0x02eeb720   568     1     0     0    0 00080601 00000000        NULL in pagv
k0x02eeaca0   570     1     0     0    0 00005601 00000000        NULL in pagv
k0x02eea220   584     1     0     0    0 20084003 00000000        NULL in pagv
k0x047d2220   588   584   588     0    0 00004003 00000000        NULL in pagv
k0x03603720   602     1   602     0    0 20004002 00000000        NULL in pagv
k0x0beab720   603     1   603     0    0 00000000 00000000        NULL in pagv ctty
k0x08f72220 10849   568     0     0    0 00001601 00000000        NULL in pagv
k0x08b64ca0 12904   568     0     0    0 00001601 00000000        NULL in pagv
k0x01355720 12923   568     0     0    0 00001601 00000000        NULL in pagv
k0x08b64220 23200   568     0     0    0 00001601 00000000        NULL in pagv
k0x07e54ca0 24280   568     0     0    0 00001601 00000000        NULL in pagv
k0x047d3720 24314   568     0     0    0 00001601 00000000        NULL in pagv
k0x030e3720 13067   568     0     0    0 00001601 00000000        NULL in pagv
k0x091c4ca0 13093   551 13093     0    0 00084007 00000000        NULL in pagv
k0x07e54220 13094   480 13094     0    0 60082000 00000000        NULL in pagv
k0x08b65720 23348   568     0     0    0 00001601 00000000        NULL in pagv
k0x08f72ca0 13109 13093 13093     1    0 00000000 00000000        NULL in pagv
k0x0a9a6220 23369 23833 23940     0    0 00000000 00000000        NULL in pagv
k0x047d2ca0 26457   568     0     0    0 00001601 00000000        NULL in pagv
k0x0760eca0 11112   568     0     0    0 00001601 00000000        NULL in pagv
k0x0a9a6ca0 23411     1 23940     0    0 00004007 00000000        NULL in pagv
_kdbx_proc_end:

Audit subsystem disabled

No audit data to be saved
#
_crash_data_collection_finished:
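
The timestamps in the dump can be cross-checked against each other. The sketch below converts the tv_sec values from _boottime and _crashtime and compares them with the Boot-time, Time, and _uptime lines. It assumes MET can be treated as a fixed UTC+1 offset; the names MET and to_met are made up for this illustration.

```python
from datetime import datetime, timedelta, timezone

# Values copied from the _boottime and _crashtime structs in the dump.
BOOT_TV_SEC = 858345679
CRASH_TV_SEC = 858607849

# Assumption: MET is a fixed UTC+1 offset (no daylight saving in mid-March).
MET = timezone(timedelta(hours=1), "MET")


def to_met(tv_sec: int) -> str:
    """Format a Unix timestamp in the layout used by the kdbx summary."""
    return datetime.fromtimestamp(tv_sec, tz=MET).strftime("%a %b %d %H:%M:%S %Y")


uptime_seconds = CRASH_TV_SEC - BOOT_TV_SEC
uptime_hours = uptime_seconds / 3600

print(to_met(BOOT_TV_SEC))    # compare with "Boot-time:" in _kdbx_sum
print(to_met(CRASH_TV_SEC))   # compare with "Time:" in _kdbx_sum
print(f"{uptime_hours:.3f} hours")
```

If the three values agree with the summary, the dump is at least internally consistent about when the system booted and when it crashed.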

63.1. "Cross-Posted; Tried CANASTA?" by XDELTA::HOFFMAN (Steve, OpenVMS Engineering) Tue Mar 18 1997 12:39

   .0 is already posted in WRKSYS::MIKASA 857.*, and (probably) over in
   TURRIS::DIGITAL_UNIX.

   You might want to try running the crashdump through CANASTA (See the
   conference at SPECXN::CANASTA) and see if it recognizes this crash.

63.2. "CANASTA points to mchk program" by COL01::HOLZGREVE () Wed Mar 19 1997 03:47

    CANASTA points me to the mchk program I have used.
    
    Uwe

63.3. "IPMT" by XDELTA::HOFFMAN (Steve, OpenVMS Engineering) Wed Mar 19 1997 08:50

:    CANASTA points me to the mchk program I have used.
 
   If that is all it did, then CANASTA hasn't seen this crash before, and
   you should escalate via formal channels.

63.4. "think about a possible SW problem ..." by HAN::HALLE (Volker Halle MCS @HAO DTN 863-5216) Thu Mar 20 1997 02:00

    Uwe,
    
    are you able to find out the ADDRESS that caused the nonexistent
    memory error? The program might be trying to access a nonexistent
    address, so this could well be a software error! That possibility
    comes to mind especially now that all the hardware has been replaced.
    
    As an example, some of the 'ping-of-death' footprints also showed
    up as machine check crashes, and those were CLEARLY a SOFTWARE problem!
    
    The ROUTINE that incurred this fault is bcopy (called from pmap_create).
    
    If you look in the CAN_OSF1_CASES.SEQ file, you'll find a lot of
    machine check crashes happening in bcopy. See note SPECXN::CANASTA #239
    and use the following search command:
    
    	$ search file "mach_error _XentInt bcopy"
    
    Volker.
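
The same footprint check can be run directly against a dbx trace. This is only a sketch: the layout of CAN_OSF1_CASES.SEQ is not shown in this topic, so the code works on trace lines like those in .0 instead, and the helper names routine_names and has_footprint are invented for the example.

```python
import re

# Matches dbx-style frame lines such as
#   "  13 _XentInt(0x0, ...)"   or   ">  0 boot(0x400000000, ...)"
FRAME_RE = re.compile(r"^\s*>?\s*(\d+)\s+([A-Za-z_]\w*)\(")


def routine_names(trace_text):
    """Return the routine names of a dbx stack trace, innermost frame first."""
    names = []
    for line in trace_text.splitlines():
        match = FRAME_RE.match(line)
        if match:
            names.append(match.group(2))
    return names


def has_footprint(names, footprint):
    """True if `footprint` occurs as a run of consecutive frames in `names`."""
    n = len(footprint)
    return any(names[i:i + n] == footprint for i in range(len(names) - n + 1))


# Frames 12-15 of the trace in .0, shortened to the part the parser needs.
sample = """\
  12 mach_error(0xfffffc0000512c94, 0xfffffc0000006398, 0x630, 0x29, 0xfffffc00004fda40)
  13 _XentInt(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68)
  14 bcopy(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68, 0xfffffc000873bf68)
  15 pmap_create(size = 18446744065111220072)
"""

names = routine_names(sample)
print(names)
print(has_footprint(names, ["mach_error", "_XentInt", "bcopy"]))
```

Variable lines between frames (pcpu = ..., spl = ...) do not match the pattern, so the full trace from .0 can be pasted in unchanged.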

63.5. "HOW TO ?" by COL01::HOLZGREVE () Thu Mar 20 1997 09:49

    Volker,
    
    how can I find out the ADDRESS that caused the nonexistent memory
    error?
    
    Uwe
    
63.6. "just guessing ..." by HAN::HALLE (Volker Halle MCS @HAO DTN 863-5216) Thu Mar 20 1997 12:55

    Uwe,
    
    you know that I'm not a UNIX expert, so I'm just guessing. You might
    get a better response in the UNIX notes conference.
    
    On OpenVMS, you can find the instruction that causes the NXM error.
    It should be possible on UNIX as well.
    
    I would expect one of the parameters to _XentInt to be the 'invalid
    addr', and maybe another one to be the actual PC of the problem. Can
    you try to check with dbx?
    
    Volker.
    
    ...
  12 mach_error(0xfffffc0000512c94, 0xfffffc0000006398, 0x630, 0x29,
    0xfffffc00004fda40)
    ["../../../../src/kernel/arch/alpha/hal/cpusw.c":802, 0xfffffc0000512c90]

  13 _XentInt(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68,
    0xfffffc000873bf68)
    ["../../../../src/kernel/arch/alpha/locore.s":1112, 0xfffffc00004fda3c]

  14 bcopy(0x0, 0xfffffc00004fccd8, 0xfffffc0000698260, 0xfffffffdff7fdf68,
    0xfffffc000873bf68)
    ["../../../../src/kernel/arch/alpha/fastcopy.s":278, 0xfffffc00004fccd4]

  15 pmap_create(size = 18446744065111220072)
    		        ^^^^^^^^^^^^^^^^^^^^<<< does THIS look sane ????
    ["../../../../src/kernel/arch/alpha/pmap.c":1904, 0xfffffc000051de30]
map = 0xfffffc00099bde00
stats = (nil)
scratch = union {
    quadword = 18446739675685418240
    PTE_BITFIELD = struct {
        _v = 0
        _for = 0
        _fow = 0
        _foe = 0
        _asm = 0
        _gh = 0
        0
        _prot = 117
        _exec = 1
        _wire = 0
        _seg = 1
        _ssm = 0
        _gh_shared = 1
        _soft = 2
        _lw_wire = 1
        _pfn = 4294966272	<<< ????????????
    }
}
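
One way to judge whether the marked values are sane is to look at them in hex. The sketch below, with helper names invented for the illustration, converts the decimal values from frame 15. Going by the trace in .0, the size comes out as the same quadword that dbx shows as the fourth argument of bcopy and _XentInt in frames 13 and 14, and the scratch quadword comes out equal to the parent_task address in frame 17. That suggests dbx is displaying leftover register or stack contents rather than real arguments, which fits the dump's own warning "Files compiled -g3: parameter values probably wrong". This is an interpretation, not a confirmed diagnosis.

```python
MASK64 = (1 << 64) - 1


def as_hex64(value):
    """Render a value as a 16-digit hex quadword, like the addresses in the trace."""
    return f"0x{value & MASK64:016x}"


def as_signed64(value):
    """Reinterpret an unsigned 64-bit value as a signed one."""
    value &= MASK64
    return value - (1 << 64) if value >> 63 else value


# Decimal values copied from frame 15 (pmap_create) of the trace.
size = 18446744065111220072
scratch_quadword = 18446739675685418240
pfn = 4294966272

print(as_hex64(size))              # compare with arg 4 of frames 13 and 14
print(as_signed64(size))           # a negative "size" if read as signed
print(as_hex64(scratch_quadword))  # compare with parent_task in frame 17
print(hex(pfn))
print(scratch_quadword >> 32 == pfn)  # _pfn is just the top half of the quadword
```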

63.7. "try exc_addr ?" by HAN::HALLE (Volker Halle MCS @HAO DTN 863-5216) Thu Mar 20 1997 13:11
    
    Judging from a DECevent report of an MCHECK error, I think EXC_ADDR is
    the address of the failing instruction (this is just an example):
    
    Exception Address Reg     xFFFFFC0000303500
                                         Native-mode Instruction
                                         Exception PC  x3FFFFF00000C0D40
    
    Try examining the instruction pointed to by exc_addr (you'll find this
    in the mcheck frame in the messages buffer).
    
    Volker.
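
As a rough sketch of that suggestion, the code below pulls exc_addr out of a few lines of the mcheck frame from .0 and compares it with the trace. Two assumptions are made and marked in the comments: that the low two bits of exc_addr are not part of the instruction address, and that the message buffer shows only the low 32 bits of the kernel address. The helper parse_frame is invented for the example.

```python
import re

# A few lines of the machine check frame as printed in the message buffer of .0.
mcheck_frame = """\
  mchk_code\t= 0x20c
  exc_addr\t= 0x4fcd3a
  va\t\t= 0x6170
"""


def parse_frame(text):
    """Collect 'name = 0xvalue' lines into a dict of integers."""
    fields = {}
    for line in text.splitlines():
        match = re.match(r"\s*([\w\[\]]+)\s*=\s*(0x[0-9a-fA-F]+)\s*$", line)
        if match:
            fields[match.group(1)] = int(match.group(2), 16)
    return fields


fields = parse_frame(mcheck_frame)

# Assumption: the low two bits of exc_addr are not address bits
# (Alpha instructions are 4 bytes long and longword aligned).
pc_low = fields["exc_addr"] & ~0x3

# Second argument shown for _XentInt and bcopy in frames 8 and 9 of the trace.
frame8_arg = 0xfffffc00004fcd38

# Assumption: the message buffer prints only the low 32 bits of the address.
print(hex(pc_low))
print(pc_low == frame8_arg & 0xFFFFFFFF)
```

Under those assumptions the masked exc_addr lines up with the address passed to _XentInt in frame 8, one instruction after the bcopy PC in frame 9 (fastcopy.s line 324). That would make the logged frame belong to the second, nested machine check rather than the first one in frames 13 and 14.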