[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference turris::digital_unix

Title:DIGITAL UNIX(FORMERLY KNOWN AS DEC OSF/1)
Notice:Welcome to the Digital UNIX Conference
Moderator:SMURF::DENHAM
Created:Thu Mar 16 1995
Last Modified:Fri Jun 06 1997
Last Successful Update:Fri Jun 06 1997
Number of topics:10068
Total number of notes:35879

8690.0. "out of memory trying to allocate exception system resources" by RDGENG::READINGS_R (Richard Readings) Tue Feb 04 1997 04:27

I have an executable (C and Fortran code) which fails to execute, with the
following message:

exception system: exiting due to internal error: out of memory trying to
allocate exception system resources

I had to use unlimit datasize to avoid a heap allocation failure in order to
get this far. I have tried increasing maxdsiz (initially 1073741824) but it
made no difference.

Running from dbx reveals that the problem is apparently in the initialisation
code (it doesn't reach main):

ddors.reo.dec.com> dbx meshv
dbx version 3.11.10
Type 'help' for help.

(dbx) stop in main
[2] stop in main
(dbx) run
exception system: exiting due to internal error: out of memory trying to
allocate exception system resources

Program terminated normally


I'm using DUNIX V4.0A.

Any suggestions?
T.RTitleUserPersonal
Name
DateLines
8690.1QUARRY::nethCraig NethTue Feb 04 1997 08:555
Can you point us at the object?    Looking at the code, it looks like it
might be a bad/corrupt object - the exception handling system is trying
to malloc some memory for some internal data structures it uses.    If
the exception handling sections in the object were all screwed up one 
could imagine it doing something like this.   
8690.2How did you build it?WTFN::SCALESDespair is appropriate and inevitable.Tue Feb 04 1997 10:204
Giving us your full compile and link lines might be illuminating, too.


				Webb
8690.3apc013.reo.dec.com:/pub/meshvRDGENG::READINGS_RRichard ReadingsWed Feb 05 1997 03:0814
Re .1

>Can you point us at the object?    

It's available by anonymous ftp from apc013.reo.dec.com:/pub/meshv

Re .2

I'll request the compile and link lines from the customer. I only have the 
executable here.

Thanks for the help

Richard
8690.4Makefile with -tasoRDGENG::READINGS_RRichard ReadingsMon Feb 10 1997 03:4369
Re .2

The application is being linked with -taso. Could that be the problem?

---------------------------------------------------------------
   This is the Makefile of the fortran sources
---------------------------------------------------------------

F77 = f77 -O -convert big_endian $(DEBUG) -c
OBJ_F = `cat ../../files/FortranAxp`
all:
        make `cat ../../files/FortranAxp`
.f.o:
        @echo ""
        @echo "Compile $< with $(F77)"
        @echo ""
        @$(F77) $< 






-----------------------------------------------------
   This is the Makefile of the c sources
-----------------------------------------------------

OBJ_F=`cat ../../files/FortranMeshvAxp`
OBJ_C=bibX11c.o cb.o cb_visu.o cio.o control_mesh.o dialogue.o fsb.o image.o
info.o iso.o meshv.o pref.o region.o rot.o utils.o

ROOT        = meshv
BINDIR      = ../../../bin_axp/

SRCDIR      = ../../src/$(ROOT)/
INCDIR      = ../../src/include/

LIBX        = -lMrm -lXm -lXt -lX11 -lm -lots -lfor -lUfor

CFLAG       = $(DEBUG) -D_NO_PROTO -DNO_REGEX -DSTRINGS_ALIGNED -DSUN
$(DEBUGMODE)
CC          = cc $(CFLAG) -I$(INCDIR) -O
F77         = f77  $(DEBUG)
LD          = $(PURE) cc $(DEBUG)

UID         = $(BINDIR)fm_$(ROOT).uid

OBJ         = $(OBJ_F) $(OBJ_C)

BIN         = $(BINDIR)$(ROOT)
BINS        = $(BIN) $(UID)

all:$(BINS)
        @echo ">>> All binaries ok"

$(OBJ_C):$(SRCDIR)$(*)
        @echo ">>> cc [$(SRCDIR)$(@F:.o=.c) -> $@ ] $(DEBUG)
        @$(CC) -c $(SRCDIR)$(@F:.o=.c)

$(BIN):$(OBJ_C)
        @echo ">>> ld [ $(@F) -> $@ ]"
        @$(LD) -taso  -o $(@) $(OBJ) $(LIBX)

$(UID):$(SRCDIR)fm_$(ROOT).uis
        @echo ">>> uil [fm_$(ROOT).uis -> $(UID) ]"
        @/usr/bin/X11/uil $(SRCDIR)/fm_$(ROOT).uis -o $(UID) 



8690.5V4.0B, tooSAPEC3::WALLMEROTHMon Feb 10 1997 04:1470
    Sorry, bad news.   The "not wired" panic still occurs in V4.0B.
    This time in happened during the database build phase, so we
    don't reach the point, where we can tell whether database
    informations got corrupted without leading to a panic. The
    database is Adabas again. No news about Informix.
    
    The worst thing in .0 is that you not always (should I say seldom)
    get a panic, but the application continues with corrupted data !
    Shouldn't we warn our customers ?
    
    A few lines out of crash-data follow:
    
    pmap_lw_unwire_new: pmap 0xfffffc000ea68600 pte 0xa5e200083309
    panic (cpu 2): not wired
    
    Begin Trace for machine_slot[paniccpu].cpu_panic_thread:
    thread 0xfffffc003de61080 stopped at   [stop_secondary_cpu:513 ,0xffff
    fc00004f8988]    Source not available
    
    warning: Files compiled -g3: parameter values probably wrong
    >  0 stop_secondary_cpu(do_lwc = 0x0) ["../../../../src/kernel/arch/al
    pha/cpu.c":507, 0xfffffc00004f8984]
       1 panic(s = 0xfffffc00006b4c88 = "event_timeout: panic request") ["
    ../../../../src/kernel/bsd/subr_prf.c":703, 0xfffffc000027b190]
       2 event_timeout(func = 0xfffffc000027b3c0, arg = 0xfffffc0000738040
    , timeout = 0x5) ["../../../../src/kernel/arch/alpha/cpu.c":1005, 0xff
    fffc00004f9780]
       3 xcpu_puts(s = 0xfffffc00006ef790, prfbufp = 0xfffffc0000738040) [
    "../../../../src/kernel/bsd/subr_prf.c":844, 0xfffffc000027b424]
       4 printf(va_alist = 0xfffffc000066a9d8) ["../../../../src/kernel/bs
    d/subr_prf.c":379, 0xfffffc000027a820]
       5 panic(s = 0xfffffc00006bbb90 = "not wired") ["../../../../src/ker
    nel/bsd/subr_prf.c":753, 0xfffffc000027b2e8]
       6 pmap_lw_unwire_new(map = 0xfffffc000ea68600, start = 0x1cc2000, b
    itmap = 0xffffffffa905f590) ["../../../../src/kernel/arch/alpha/pmap.c
    ":6398, 0xfffffc000052619c]
       7 lw_unwire_new(0xffffffffa905f848, 0xffffffffa905f838, 0x1cc4000,
    0x1000, 0xfffffc0000494480) ["../../../../src/kernel/bsd/kern_mman.c":
    3441, 0xfffffc00002665b8]
       8 vm_map_pageable(0x1cc4000, 0x1000, 0xfffffc0000494480, 0x1, 0xfff
    ffc00004944ac) ["../../../../src/kernel/vm/vm_map.c":1269, 0xfffffc000
    04dd58c]
       9 physio(0xfffffc0000654260, 0x0, 0xfffffc0000537310, 0xfffffc00005
    42620, 0x10080d004) ["../../../../src/kernel/ufs/ufs_physio.c":327, 0x
    fffffc00004944a8]
      10 cdisk_read(uio = 0xffffffffa905f848) ["../../../../src/kernel/io/
    cam/cam_disk.c":7916, 0xfffffc0000537fd0]
      11 spec_read(0x11ffffbc0, 0x0, 0x0, 0xfffffc003cd97f00, 0xfffffc0000
    80d004) ["../../../../src/kernel/vfs/spec_vnops.c":1870, 0xfffffc00004
    a3c98]
      12 msfsspec_read(vp = 0xfffffc003b65a200, uio = 0xffffffffa905f848,
    ioflag = 0x5, cred = 0xfffffc003cd97f00) ["../../../../src/kernel/msfs
    /osf/msfs_vnops.c":4422, 0xfffffc00003eb02c]
      13 vn_read(0xfffffc0000289ee4, 0xffffffffa905f8e0, 0xffffffffa905f84
    8, 0x0, 0xfffffc003d5ed980) ["../../../../src/kernel/vfs/vfs_vnops.c":
    877, 0xfffffc00004c4d8c]
      14 rwuio(0xfffffc00138ccca0, 0xfffffc003de61080, 0xfffffc00006d7f30,
     0xfffffc00138cca80, 0xfffffc003d5ed980) ["../../../../src/kernel/bsd/
    sys_generic.c":1217, 0xfffffc0000289f38]
      15 read(0xfffffc0000001000, 0xffffffffa905f838, 0x573000, 0x10000000
    0001, 0x0) ["../../../../src/kernel/bsd/sys_generic.c":1167, 0xfffffc0
    000289e48]
      16 syscall(0x1000, 0xfffffc00004fee44, 0x9e2b432f10bf1, 0x1205d099c,
     0x3) ["../../../../src/kernel/arch/alpha/syscall_trap.c":540, 0xfffff
    c0000507d80]
      17 _Xsyscall(0x8, 0x3ff800d1818, 0x14008ada0, 0x3, 0x1cc2000) ["../.
    ./../../src/kernel/arch/alpha/locore.s":1209, 0xfffffc00004fde44]
    End Trace for machine_slot[paniccpu].cpu_panic_thread:
    
    
8690.6QUARRY::nethCraig NethMon Feb 10 1997 10:306
Ummm, .5 looks like a reply to the wrong note...

As far as .4 goes, mail was sent offline.   The program has a huge
data section and is running out of space for the heap.   Some suggestions
were made on how to rearrange the memory map to make things fit.

8690.7linked with -tasoRDGENG::READINGS_RRichard ReadingsTue Feb 11 1997 02:5511
Re .6

>As far as .4 goes, mail was sent offline.   The program has a huge
>data section and is running out of space for the heap.   Some suggestions
>were made on how to rearrange the memory map to make things fit.

Thanks to Craig for the help. I should point out that the program was linked 
with -taso, which seems to be the cause of the problem, in case anyone else 
stumbles across this.

Richard