T.R | Title | User | Personal Name | Date | Lines |
---|
8552.1 | | VAXCPU::michaud | Jeff Michaud - ObjectBroker | Thu Jan 23 1997 11:53 | 17 |
8552.2 | You could QAR this. | NETRIX::"[email protected]" | Jarkko Hietaniemi | Mon Jan 27 1997 04:47 | 10 |
| I heard that there is, in fact, such a feature in the free BSDs.
Something along the lines core.`base-name-of-the-executable`.$$
(in shell notation here but handled by the kernel). If you feel
strongly enough for this feature I guess you could QAR it?
--
Jarkko Hietaniemi Digital/Finland MCS/OSS DTN 879-4738
[email protected] +358-(0)9-434 4738
[Posted by WWW Notes gateway]
|
8552.3 | warning (fwiw) | VAXCPU::michaud | Jeff Michaud - ObjectBroker | Mon Jan 27 1997 10:33 | 8 |
| > I heard that there is, in fact, such a feature in the free BSDs.
> Something along the lines core.`base-name-of-the-executable`.$$
> (in shell notation here but handled by the kernel). If you feel
> strongly enough for this feature I guess you could QAR it?
Of course the obvious problem with turning on such a
feature system-wide is that all your disc space can
fill up with core files if you're not careful.
|
8552.4 | | SMURF::DENHAM | Digital UNIX Kernel | Mon Jan 27 1997 12:05 | 2 |
| Yep, we'd probably want to rig it up to work something like
the uac command works for unaligned access fixup/reporting.
|
8552.5 | | TLE::REAGAN | All of this chaos makes perfect sense | Mon Jan 27 1997 12:52 | 9 |
| RE: .3
> Of course the obvious problem with turning on such a
> feature system-wide is that all your disc space can
> fill up with core files if you're not careful.
You mean like version numbers? :-) :-)
-John
|
8552.6 | | SMURF::PBECK | Paul Beck | Mon Jan 27 1997 13:24 | 1 |
| set file core /version_limit=5
|
8552.7 | | SMURF::DENHAM | Digital UNIX Kernel | Mon Jan 27 1997 14:28 | 4 |
| % set file core /version_limit=5
set: Variable name must begin with a letter.
Rats!
|
8552.8 | Minor oversight... | SMURF::PBECK | Paul Beck | Mon Jan 27 1997 14:38 | 2 |
| I forgot to mention steps 1 and 2 (1: add version numbers to file
system, and 2: write DCL shell).
|
8552.9 | Real UNIX folks wouldn't get caught dead w/a DCL shell :-) | VAXCPU::michaud | Jeff Michaud - ObjectBroker | Mon Jan 27 1997 16:07 | 6 |
| > I forgot to mention steps 1 and 2 (1: add version numbers to file
> system, and 2: write DCL shell).
Well I could see adding version numbers, but screw DCL (btw,
there are DCL shell's by 3rd parties that at least used to
be available for ULTRIX)
|
8552.10 | QAR entered. | GALVIA::STONES | Tom Stones | Fri Jan 31 1997 11:00 | 22 |
| I've entered QAR 51279 :
There doesn't seem to be a simple way of causing core files to be
created with unique names.
I'm running parallel applications on a cluster. A parallel application
(PVM, MPI or HPF) typically consists of multiple copies of the same code
running on one or more nodes of the cluster. The multiple processes
often expect to be running in the same working directory. When they
crash, they all create a file called "core". By Sods Law, the core file
with the info you're looking for is never the one that remains on the disk
after they've all finished overwriting each other.
I'd like a mechanism, such as setting an environment variable, to indicate
that I'd like unique names for any core files produced. I'd be satisfied
with appending the pid to the name - better yet would be appending
nodename and pid - better yet would be appending program name,host and pid.
The "High Performance Computing" and "Scientific and Technical" markets
are making more and more use of parallel codes like this, so a need for
such a mechanism is becoming more urgent.
|
8552.11 | Would a last chance handler help? | HYDRA::NEWMAN | Chuck Newman, 508/467-5499 (DTN 297), MRO1-3/F26 | Fri Jan 31 1997 12:34 | 11 |
| As a workaround, how about implementing a last chance
handler that cooperates w/ other copies of the same
program such that only one process can die at a time,
and they rename old core files before they exit
You'd probably need to find the last-chance handler
first, and call that from your last-chance handler
(or whatever it takes to make the core dump be
written).
I don't know if this would work -- just a thought...
|
8552.12 | | VAXCPU::michaud | Jeff Michaud - ObjectBroker | Fri Jan 31 1997 14:23 | 9 |
| Re: .11
if by last chance, you mean a signal handler, yes, that could
work as well. I've used such signal handlers in the past
to syslog info from the signal context about the pc and address
for segv and bus violations. Such a handler could also rename
an existing "core" file before re-asserting the signal (and make
sure to restore the signal's behaviour to SIG_DFL before re-
asserting).
|