[Search for users] [Overall Top Noters] [List of all Conferences] [Download this site]

Conference turris::languages

Title:	Languages
Notice:	Speaking In Tongues
Moderator:	TLE::TOKLAS::FELDMAN

Created:	Sat Jan 25 1986
Last Modified:	Wed May 21 1997
Last Successful Update:	Fri Jun 06 1997
Number of topics:	394
Total number of notes:	2683

342.0. "gabrbage collection test applications" by CIRCUS::DOSSER () Thu Sep 03 1992 19:32

  We are searching for test applications to use in an advanced
  development effort to produce an efficient and practical garbage
  collector for C and C++ applications.

  The fundamental goal of this project is to improve programmer
  productivity by greatly reducing storage-management bugs, with
  minimal impact on CPU and memory usage. The development of a
  garbage collector for use in C and C++ environments is an important
  element of this project.

  Over the next year we plan to implement, test, and characterize the
  performance of this garbage collector. By integrating the collector
  with large "real-world" applications it will be possible to properly
  characterize the effectiveness and cost of garbage collection in
  production use.

  We are searching for test applications which are large memory
  consumers. Interactive applications and long running servers are both
  good candidates. Applications with known (but perhaps poorly
  characterized) memory allocation problems would be of particular
  interest.

  At this stage, project work is limited to MIPS ULTRIX and Alpha OSF/1
  based applications. In the near future work will be extended to VMS,
  OpenVMS, and NT, but at this time only Ultrix and OSF/1 applications
  are solicited.

  Development organizations willing to participate in this effort would
  be requested to provide source kits for the contributed application
  and a minimal amount of consultation on building and running the
  application.

  Project members in Palo Alto and Seattle will integrate the collector
  with the application and measure changes in product performance and
  memory requirements.

  Benefits of this effort for developers may include characterization
  of the memory usage of the contributed application, as well as the
  identification of memory leak and dangling pointer bugs.

  Further, all results and code modifications will be returned to the
  developers, allowing quick incorporation of the garbage collector
  into the application, if this proves desirable.

  If you have an application which fits these specifications and would
  care to partcipate in this effort please contact Al Dosser at this
  address.

  Digital Equipment Corporation
  Systems Research Center
  130  Lytton Ave.
  Palo Alto, CA  94301

  DTN 543-2100    dosser::decsrc
  415-853-2100    [email protected]

  This project is a cooperative effort between DEC SRC in Palo Alto and
  DECwest in Seattle, and is funded by CRA. It is led by John Ellis of
  SRC and Joel Bartlett of WRL in consultation with Dave Detlefs and
  John Detreville (SRC), and Ben Zorn (University of Colorado).
  Engineering support is provided by David Dunkle and Al Dosser of
  DECwest.

T.R	Title	User	Personal Name	Date	Lines
342.1	Yes please!	TAEC::HARPER	John Harper, DTN 828 5351	`Tue Oct 27 1992 08:34`	36
	This isn't really an answer, because I don't personally have a test application. (I can think of one though... DECmcc meets all your requirements!) But I wanted to reply to say what a good idea this project is. To me memory allocation (etc) is "the next frontier" for CASE. I reckon that something like 25% of the total effort in a large system software project goes into dealing with memory allocation, or more accurately memory DEallocation. Tracking down memory leaks is incredibly time-consuming, but that's only the tip of the iceberg. Much more significant is all the thought that has to go in at the design stage to avoid memory leaks, which would be completely unnecessary with garbage collection. Although it's 20 years since I used a langauge that had it (POP-2... how many people remember that one?) I can still remember how incredibly liberating it is. There are all sorts of things that are very natural to do, that you don't because of the memory allocation implications. Having functions return results as structures is one example. I was wondering, in a philosophical moment recently, what would be an acceptable price to pay for garbage collection? My figure is 30%, in terms of execution time, space, etc. After all, we are easily paying a lot more than that for using high-level langauges, but nobody in their right mind would write in assembler without some very good reason. Similarly, general-purpose hardware-hiding GUI systems such as Motif and Windows are vast CPU hogs, but that doesn't stop their popularity, because there are so many applications that just wouldn't get written otherwise. Surely though there are language implications? For garbage collection to work you need very strong typing. Things like undifferentiated unions and variable-size (C-style) arrays are out! Casting becomes an even more dubious practice, and the language would need to provide mechanisms for the (fairly rare) cases where it is really necessary. In summary, the sooner the better... John
342.2	I get more cynical in the evening	MU::PORTER	meetings - the alternative to work	`Mon Nov 02 1992 23:47`	22
	I've heard of POP-2, do I win a prize? (I've never programmed in it, however). >I reckon that something like 25% of the total effort in a large >system software project goes into dealing with memory allocation, or >more accurately memory DEallocation. I don't believe this is true. There are numerous products around which have clearly put zero effort into dealing with memory deallocation -- this, alas, appears to be the norm. > Similarly, general-purpose hardware-hiding GUI systems such as Motif > and Windows are vast CPU hogs, but that doesn't stop their popularity, I saw Motif running on a Sandpiper this morning -- it really is pretty decent. How long do you think it'll be before they manage to slow it down to normal again? (Sigh, when I were a lad if we had a whole APR it were luxury, tell that to the youth of today, etc etc.)
342.3	More about GC	HERON::FRON		`Wed Dec 23 1992 04:22`	15
	I'm sorry I don't have test applications, but would like to share some Smalltalk experiments regarding GC. Smalltalk of course has it and it's great. The time spent is estimated to less than 10%. Also, new features include monitoring of the GC strategy (periodic with user-define period, when-needed, etc...) and having GC as a true object (one in its class) in order to be able to specialize strategy. This can be useful for tuning critical applications. Moreover, good monitoring and observability of GC allows to optimize only where is needed, though may be more expensive on some tests, trading predicatability versus performance. Last but not least, GC strategy is closely related to persistence schemes, and it could be a nice trick to accomodate both.
342.4		MLTVAX::PUDER	Karl Puder	`Thu Jan 14 1993 13:00`	3
	You don't need "strong typing" to have decent GC. Look at LISP and APL. :Karl, the weird languages guy.
342.5	Well, yes	TAEC::HARPER	John Harper, DTN 828 5351	`Fri Jan 15 1993 18:14`	20
	I tried looking at APL, but it made my eyes go funny. As for LISP, it has the strongest typing you can imagine: there is only one type. (Actually that isn't quite true, because atoms have multiple types). In general, you don't need any compile-time typing if everything carries type information at runtime (which is the case for APL, and to the extent that there are types, for Lisp). Also for Pop-2. (Ever come across that?) Languages like C++ are already a compromise in the direction of efficiency, in that they try to do type checking a compile time rather than runtime. I guess moving all type checking to runtime removes further constraints from the programmer, at a further cost in cycles to execute. Of course, this is the delight of using "untyped" languages; you can produce things much more quickly, if you don't mind the extra execution time. (I still remember how easy it was to "knock things up" in Pop-2). John
342.6	Some cases mandate compile-time reporting	EVMS::KILGALLEN		`Sat Jan 16 1993 10:21`	9
	> <<< Note 342.5 by TAEC::HARPER "John Harper, DTN 828 5351" >>> > > In general, you don't need any compile-time typing if everything > carries type information at runtime (which is the case for APL, > and to the extent that there are types, for Lisp). Also for Pop-2. That really depends on whether you have any preference between getting your errors at compile-time and getting them at run-time. For software being delivered to customers, compile-time is often preferable.
342.7		11SRUS::PUDER	Karl Puder	`Mon Jan 18 1993 14:31`	19
	The important thing is that the programmer should not normally deal with pointers and addresses and such. If the language lets the programmer think about data and objects and functions instead of addresses, then the language implementor can make the compiler and RTL keep track of what memory is used and free. As soon as the programmer can ask for the address of something, there's no easy guarantee that memory won't be lost. What I'm trying to say is that one is taking the hard road if one takes a language like C or BLISS and hopes to patch on a memory-manager to fix these problems. The language must make it easy for the programmer to get the job done without needing to work at the address level in order for the compiler to successfully hide that level and provide automatic storage allocation and reclamation. I mentioned APL and LISP as well-known examples of this, but I agree they are by no means the only possible solutions. :Karl.
342.8	More...	TAEC::HARPER	John Harper, DTN 828 5351	`Mon Jan 18 1993 16:27`	24
	re .-1: Of course what you say is true: as soon as the programmer can ask for the address opf something (and do anything with it) you can't guarantee that GC can work. (Actually in itself this doesn't permit memory loss; it just allows the programmer to trip overhis/her shoelaces when GC deletes things he/she had remembered the address of). C++, used properly, works with references rather than addresses. You don't ask for the address of something but rather a reference to it. As long as you don't violate the language's type safety by using explicit casts and one or two other things, GC can be made to work. But it would need a few minor tweaks to be able to do all the things you might reasonably want to be able to do and still be GC-ok. (For example, there are cases where you need to cast to a more refined type, but I don't believe you can ever need to do this in a way that couldn't be checked to be OK at run-time, unless you are a real hacker). As for whether dynamically typed languages like LISP and Pop-2 are better than compile-time checked languages like C++ and Ada, I guess that's a religious question and therefore best avoided. John
342.9		MU::PORTER	savage pencil	`Mon Jan 18 1993 20:43`	5
	re .-1 > unless you are a real hacker. I'm usually an integer hacker, but I can be coerced.
342.10	I got a compiler error...	TAEC::HARPER	John Harper, DTN 828 5351	`Wed Jan 20 1993 16:41`	7
	undeclared identifier: integer unexpected token: hacker Maybe you meant an int hacker? John
342.11		SUBWAY::BRIGGS	Have datascope, will travel.	`Mon Feb 01 1993 11:51`	21
	Both C and C++ sometimes lie about return values. For example, suppose I write a function to return a pointer to a char: char * myfun(void) and I execute it: > myfun(void) % core Here, the function didn't return a char *, but a core dump. This is merely an example: it is also easy have a memory access violation returned. This is clearly a deficiency in the language. (only kidding ;-)