| Greetings
A curious set of circumstances, indeed. It would be of interest to
find out more. It *appears* that the system time goes through the roof
but it is not clear to me whether this is because of a lack of physical
memory or a lack of virtual memory.
If the customer is willing, please ask him/her to do the following:
1) ON BOTH MACHINES:
Become superuser
more /var/adm/messages
go down to bottom (just press G) then scroll backwards looking
for most recent boot record.
What does the boot section say about exact unix version, exact
hardware model number, physical memory, and firmware? E.g. -
Jun 6 17:07:44 dstant vmunix: DEC OSF/1 V3.2 (Rev. 214); Tue Jun 6 17:06:33 EDT 1995
Jun 6 17:07:44 dstant vmunix: physical memory = 128.00 megabytes.
Jun 6 17:07:44 dstant vmunix: available memory = 115.97 megabytes.
Jun 6 17:07:44 dstant vmunix: using 483 buffers containing 3.77 megabytes of memory
Jun 6 17:07:44 dstant vmunix: AlphaStation 200 4/233 system
Jun 6 17:07:44 dstant vmunix: Apecs pass II Sio rev II
Jun 6 17:07:44 dstant vmunix: Firmware revision: 4.2
Jun 6 17:07:44 dstant vmunix: PALcode: OSF version 1.35
2) Give the images some sort of mnemonic names, like "myexe.f38" and
"myexe.f40" and maybe "myexe.f40_Optimize0". Tell us what the names
mean. Add a little C shell wrapper that includes the results of
the vmstat, w, and limit commands and which uses the time command,
looking something like this:
cat > runem.csh
set verbose
vmstat 2 3
limit
uptime
time ./myexe.f38
time ./myexe.f40
time ./myexe.f38
time ./myexe.f40
unlimit
limit
uptime
time ./myexe.f38
time ./myexe.f40
time ./myexe.f38
time ./myexe.f40
uptime
The C-shell is picked here because of its informative variant of
the time command. The programs are run twice just to get a
primitive reading on run-to-run variation. They are run both before
and after an "unlimit" command to get some sensitivity to resource
limits. The uptime command summarizes CPU activity on the system.
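For anyone not used to the C shell's time summary, here is a minimal sketch that splits one such line into its leading fields. The function name parse_csh_time and the regular expression are illustrative assumptions, only the first four fields (user seconds, system seconds, elapsed time, CPU percentage) are handled, and the sample line is hypothetical rather than taken from a real run.

```python
import re

# Assumed layout of the default csh "time" summary:
# user seconds, system seconds, elapsed time, CPU percentage, then other fields.
_TIME_RE = re.compile(r"^\s*([\d.]+)u\s+([\d.]+)s\s+([\d:.]+)\s+([\d.]+)%")


def parse_csh_time(line):
    """Return (user_s, system_s, elapsed_s, cpu_pct), or None if the line does not match."""
    m = _TIME_RE.match(line)
    if m is None:
        return None
    # Elapsed time is printed as m:ss or h:mm:ss; fold it into seconds.
    elapsed = 0.0
    for part in m.group(3).split(":"):
        elapsed = elapsed * 60 + float(part)
    return float(m.group(1)), float(m.group(2)), elapsed, float(m.group(4))


# Hypothetical sample line, for illustration only.
print(parse_csh_time("1.25u 0.50s 0:02 87% 0+15k 0+0io 0pf+0w"))
```

Lines that are not time summaries (program output, prompts) simply return None, so a whole typescript can be fed through the function line by line.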
Run this little script *BOTH* as an ordinary user and as superuser,
on *BOTH* machines, with "script" watching, and post the results.
I.e. do something like this:
ftp> put both exes to both machines
ftp> put the cshell script to both machines
script
csh runem.csh
su
csh runem.csh
rlogin other_sys
csh runem.csh
su
csh runem.csh
exit
repeat "exit" until it says "Script done, file is typescript"
Email typescript to Digital
Feel free to forward this note to the customer, including my email
address, if you wish.
/John Henning
CSD Performance Group
Digital Equipment Corporation
[email protected]
Speaking for myself, not Digital
|
| I'm lost. I suppose I should be used to that by now, but no.
Here's an output of what you requested in .4
I hope that it makes some sense to you. I'm in over my head.
Neil.
> 1) ON BOTH MACHINES:
> Become superuser
> more /var/adm/messages
> go down to bottom (just press G) then scroll backwards looking
> for most recent boot record.
> What does the boot section say about exact unix version, exact
> hardware model number, physical memory, and firmware? E.g. -
On the Alphaserver 8400:
Feb 12 09:57:32 columbus vmunix: Digital UNIX V3.2C (Rev. 148); Wed Nov 13 12:06:06 GMT 1996
Feb 12 09:57:32 columbus vmunix: physical memory = 2048.00 megabytes.
Feb 12 09:57:32 columbus vmunix: available memory = 2007.95 megabytes.
Feb 12 09:57:32 columbus vmunix: using 7855 buffers containing 61.36 megabytes of memory
Feb 12 09:57:32 columbus vmunix: Firmware revision: 4.1
Feb 12 09:57:32 columbus vmunix: PALcode: OSF version 1.21
Feb 12 09:57:32 columbus vmunix: AlphaServer 8400 Model 5/300
Feb 12 09:57:32 columbus vmunix: Master cpu at slot 0.
On the DEC 7000:
Jan 29 15:11:23 magellan vmunix: Digital UNIX V3.2C (Rev. 148); Thu Oct 24 16:14:14 BST 1996
Jan 29 15:11:23 magellan vmunix: physical memory = 512.00 megabytes.
Jan 29 15:11:23 magellan vmunix: available memory = 496.53 megabytes.
Jan 29 15:11:23 magellan vmunix: using 1958 buffers containing 15.29 megabytes of memory
Jan 29 15:11:23 magellan vmunix: Firmware revision: 4.3
Jan 29 15:11:23 magellan vmunix: PALcode: OSF version 1.35
Jan 29 15:11:23 magellan vmunix: DEC 7000 system
Jan 29 15:11:23 magellan vmunix: cpu at node 0
> 2) Give the images some sort of mnemonic names, like "myexe.f38" and
> "myexe.f40" and maybe "myexe.f40_Optimize0". Tell us what the names
> mean. Add a little C shell wrapper that includes the results of
> the vmstat, w, and limit commands and which uses the time command,
> looking something like this:
Images are:
shell.128.shared - size parameter set to 128, optimization is -O, shared libraries
shell.128.static - size parameter set to 128, optimization is -O, static libraries (-non_shared)
shell.250.shared - size parameter set to 250, optimization is -O, shared libraries
shell.250.static - size parameter set to 250, optimization is -O, static libraries (-non_shared)
shell.250.shared_o1 - size parameter set to 250, optimization is -O1, shared libraries
On the 8400:
Script started on Wed Mar 05 10:18:45 1997
% csh runem.csh
vmstat 2 3
Virtual Memory Statistics: (pagesize = 8192)
procs memory pages intr cpu
r w u act free wire fault cow zero react pin pout in sy cs us sy id
11150 23 63K 160K 32K 5664M 21M 251M 190M 26M 240K 160 2K 778 81 5 15
13155 22 71K 151K 33K 4525 465 3338 16 514 0 45 1K 589 77 8 16
14154 22 74K 149K 33K 8413 2 8409 21 4 0 8 664 353 82 3 15
limit
cputime unlimited
filesize unlimited
datasize 131072 kbytes
stacksize 2048 kbytes
coredumpsize unlimited
memoryuse 2056144 kbytes
descriptors 4096 files
addressspace 2097152 kbytes
uptime
10:18 up 21 days, 26 mins, 22 users, load average: 5.20, 5.23, 5.16
time ./shell.128.shared < test.dat
PROGRAM SHELL 4.1
8.02u 0.46s 0:08 94% 0+15k 0+0io 0pf+0w
time ./shell.128.static < test.dat
PROGRAM SHELL 4.1
7.94u 0.41s 0:09 90% 0+14k 0+0io 0pf+0w
time ./shell.250.shared < test.dat
PROGRAM SHELL 4.1
8.48u 71.50s 1:26 92% 0+24k 0+0io 0pf+0w
time ./shell.250.static < test.dat
Insufficent memory to open Fortran RTL message catalog, message #41.
0.00u 0.05s 0:00 12% 0+0k 0+0io 0pf+0w
time ./shell.250.shared_o1 < test.dat
PROGRAM SHELL 4.1
12.70u 0.69s 0:15 88% 0+26k 0+0io 0pf+0w
unlimit
limit
cputime unlimited
filesize unlimited
datasize 2097152 kbytes
stacksize 32768 kbytes
coredumpsize unlimited
memoryuse 2056144 kbytes
descriptors 4096 files
addressspace 2097152 kbytes
uptime
10:20 up 21 days, 28 mins, 22 users, load average: 6.17, 5.99, 5.99
time ./shell.128.shared < test.dat
PROGRAM SHELL 4.1
8.15u 0.50s 0:08 97% 0+15k 0+0io 0pf+0w
time ./shell.128.static < test.dat
PROGRAM SHELL 4.1
7.96u 0.41s 0:12 68% 0+13k 0+0io 0pf+0w
time ./shell.250.shared < test.dat
PROGRAM SHELL 4.1
8.74u 73.43s 1:25 95% 0+24k 0+0io 0pf+0w
time ./shell.250.static < test.dat
PROGRAM SHELL 4.1
8.09u 0.50s 0:12 70% 0+24k 0+0io 0pf+0w
time ./shell.250.shared_o1 < test.dat
PROGRAM SHELL 4.1
12.50u 0.50s 0:13 96% 0+26k 0+0io 0pf+0w
uptime
10:23 up 21 days, 31 mins, 23 users, load average: 5.58, 6.12, 6.14
% su
Password:
# csh runem.csh
vmstat 2 3
Virtual Memory Statistics: (pagesize = 8192)
procs memory pages intr cpu
r w u act free wire fault cow zero react pin pout in sy cs us sy id
14150 22 73K 147K 35K 5665M 21M 252M 190M 26M 240K 160 2K 778 81 5 15
11152 23 73K 147K 35K 484 35 387 23 26 0 29 1K 400 83 2 15
13151 22 73K 147K 35K 148 0 146 16 0 0 10 610 352 83 1 16
limit
cputime unlimited
filesize unlimited
datasize 131072 kbytes
stacksize 2048 kbytes
coredumpsize unlimited
memoryuse 2056144 kbytes
descriptors 4096 files
addressspace 2097152 kbytes
uptime
10:30 up 21 days, 38 mins, 22 users, load average: 5.11, 5.17, 5.09
time ./shell.128.shared < test.dat
PROGRAM SHELL 4.1
8.22u 0.45s 0:09 94% 0+15k 0+0io 0pf+0w
time ./shell.128.static < test.dat
PROGRAM SHELL 4.1
7.79u 0.42s 0:09 89% 0+13k 0+0io 0pf+0w
time ./shell.250.shared < test.dat
PROGRAM SHELL 4.1
8.58u 70.72s 1:25 93% 0+24k 0+0io 0pf+0w
time ./shell.250.static < test.dat
Insufficent memory to open Fortran RTL message catalog, message #41.
0.00u 0.03s 0:00 8% 0+0k 0+0io 0pf+0w
time ./shell.250.shared_o1 < test.dat
PROGRAM SHELL 4.1
12.55u 0.52s 0:15 86% 0+26k 0+0io 0pf+0w
unlimit
limit
cputime unlimited
filesize unlimited
datasize 2097152 kbytes
stacksize 32768 kbytes
coredumpsize unlimited
memoryuse 2056144 kbytes
descriptors 4096 files
addressspace 2097152 kbytes
uptime
10:32 up 21 days, 40 mins, 22 users, load average: 4.78, 5.62, 5.91
time ./shell.128.shared < test.dat
PROGRAM SHELL 4.1
8.17u 0.43s 0:08 98% 0+15k 0+0io 0pf+0w
time ./shell.128.static < test.dat
PROGRAM SHELL 4.1
7.79u 0.39s 0:08 98% 0+13k 0+0io 0pf+0w
time ./shell.250.shared < test.dat
PROGRAM SHELL 4.1
8.77u 70.95s 1:25 93% 0+24k 0+0io 0pf+0w
time ./shell.250.static < test.dat
PROGRAM SHELL 4.1
7.88u 0.49s 0:11 70% 0+24k 0+0io 0pf+0w
time ./shell.250.shared_o1 < test.dat
PROGRAM SHELL 4.1
12.61u 0.51s 0:13 99% 0+26k 0+0io 0pf+0w
uptime
10:35 up 21 days, 42 mins, 23 users, load average: 5.58, 5.76, 5.81
# exit
% exit
%
script done on Wed Mar 05 10:35:11 1997
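One way to read the timings above is to compare system time with user time for each image. The sketch below does that for the first 8400 pass (ordinary user, before "unlimit"), with the numbers copied from the typescript. The helper name system_heavy and the threshold of 1.0 are arbitrary assumptions chosen for illustration, and the failed shell.250.static run is left out because it did no work.

```python
# (image, user seconds, system seconds) copied from the first 8400 pass above,
# run as an ordinary user before "unlimit".
RUNS = [
    ("shell.128.shared", 8.02, 0.46),
    ("shell.128.static", 7.94, 0.41),
    ("shell.250.shared", 8.48, 71.50),
    ("shell.250.shared_o1", 12.70, 0.69),
]


def system_heavy(runs, threshold=1.0):
    """Names of runs whose system time exceeds threshold times their user time."""
    return [name for name, user, system in runs if system > threshold * user]


for name, user, system in RUNS:
    print(f"{name:22s} user={user:6.2f}s system={system:6.2f}s")
print("system-heavy:", system_heavy(RUNS))
```

This only flags which runs stand out; it says nothing about why the system time is high.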
On the 7000:
Script started on Wed Mar 05 11:07:19 1997
% csh runem.csh
vmstat 2 3
Virtual Memory Statistics: (pagesize = 8192)
procs memory pages intr cpu
r w u act free wire fault cow zero react pin pout in sy cs us sy id
2 85 17 10K 46K 6367 53M 181K 52M 1M 207K 211 7 161 237 0 1 98
2 85 17 11K 46K 6367 245 31 170 0 26 0 8 44 251 0 2 97
2 85 17 11K 46K 6367 150 0 150 0 0 0 7 304 248 0 4 96
limit
cputime unlimited
filesize unlimited
datasize 131072 kbytes
stacksize 2048 kbytes
coredumpsize unlimited
memoryuse 508448 kbytes
descriptors 4096 files
addressspace 1048576 kbytes
uptime
11:07 up 34 days, 19:57, 4 users, load average: 0.19, 0.23, 0.24
time ./shell.128.shared < test.dat
PROGRAM SHELL 4.1
19.78u 0.76s 0:20 97% 0+15k 0+0io 0pf+0w
time ./shell.128.static < test.dat
PROGRAM SHELL 4.1
18.89u 0.78s 0:20 96% 0+14k 0+0io 0pf+0w
time ./shell.250.shared < test.dat
PROGRAM SHELL 4.1
18.04u 57.81s 1:22 91% 0+24k 0+0io 0pf+0w
time ./shell.250.static < test.dat
Insufficent memory to open Fortran RTL message catalog, message #41.
0.00u 0.02s 0:00 66% 0+0k 0+0io 0pf+0w
time ./shell.250.shared_o1 < test.dat
PROGRAM SHELL 4.1
27.58u 0.75s 0:29 96% 0+26k 0+0io 0pf+0w
unlimit
limit
cputime unlimited
filesize unlimited
datasize 1048576 kbytes
stacksize 32768 kbytes
coredumpsize unlimited
memoryuse 508448 kbytes
descriptors 4096 files
addressspace 1048576 kbytes
uptime
11:10 up 34 days, 19:59, 4 users, load average: 1.00, 1.31, 1.43
time ./shell.128.shared < test.dat
PROGRAM SHELL 4.1
19.81u 0.72s 0:20 98% 0+15k 0+0io 0pf+0w
time ./shell.128.static < test.dat
PROGRAM SHELL 4.1
18.85u 0.74s 0:20 97% 0+14k 0+0io 0pf+0w
time ./shell.250.shared < test.dat
PROGRAM SHELL 4.1
18.09u 57.69s 1:23 91% 0+24k 0+0io 0pf+0w
time ./shell.250.static < test.dat
PROGRAM SHELL 4.1
21.55u 0.79s 0:22 98% 0+24k 0+0io 0pf+0w
time ./shell.250.shared_o1 < test.dat
PROGRAM SHELL 4.1
27.62u 0.72s 0:28 98% 0+26k 0+0io 0pf+0w
uptime
11:13 up 34 days, 20:02, 4 users, load average: 1.10, 1.24, 1.38
% exit su
Password:
# csh runem.csh
vmstat 2 3
Virtual Memory Statistics: (pagesize = 8192)
procs memory pages intr cpu
r w u act free wire fault cow zero react pin pout in sy cs us sy id
2 86 17 11K 46K 5141 53M 182K 52M 1M 207K 211 7 161 237 0 1 98
2 86 17 11K 46K 5141 322 35 243 0 26 0 9 60 250 0 2 97
2 86 17 11K 46K 5141 76 0 76 0 0 0 4 297 241 0 4 96
limit
cputime unlimited
filesize unlimited
datasize 131072 kbytes
stacksize 2048 kbytes
coredumpsize unlimited
memoryuse 508448 kbytes
descriptors 4096 files
addressspace 1048576 kbytes
uptime
11:34 up 34 days, 20:24, 4 users, load average: 0.15, 0.07, 0.03
time ./shell.128.shared < test.dat
PROGRAM SHELL 4.1
19.96u 0.81s 0:21 97% 0+15k 0+0io 0pf+0w
time ./shell.128.static < test.dat
PROGRAM SHELL 4.1
18.33u 0.78s 0:19 98% 0+14k 0+0io 0pf+0w
time ./shell.250.shared < test.dat
PROGRAM SHELL 4.1
18.36u 57.76s 1:23 91% 0+24k 0+0io 0pf+0w
time ./shell.250.static < test.dat
Insufficent memory to open Fortran RTL message catalog, message #41.
0.00u 0.03s 0:00 75% 0+0k 0+0io 0pf+0w
time ./shell.250.shared_o1 < test.dat
PROGRAM SHELL 4.1
28.05u 0.76s 0:29 97% 0+26k 0+0io 0pf+0w
unlimit
limit
cputime unlimited
filesize unlimited
datasize 1048576 kbytes
stacksize 32768 kbytes
coredumpsize unlimited
memoryuse 508448 kbytes
descriptors 4096 files
addressspace 1048576 kbytes
uptime
11:37 up 34 days, 20:27, 4 users, load average: 1.27, 1.39, 1.47
time ./shell.128.shared < test.dat
PROGRAM SHELL 4.1
20.01u 0.73s 0:21 97% 0+15k 0+0io 0pf+0w
time ./shell.128.static < test.dat
PROGRAM SHELL 4.1
18.37u 0.76s 0:19 96% 0+14k 0+0io 0pf+0w
time ./shell.250.shared < test.dat
PROGRAM SHELL 4.1
18.33u 57.76s 1:23 91% 0+24k 0+0io 0pf+0w
time ./shell.250.static < test.dat
PROGRAM SHELL 4.1
18.78u 0.76s 0:19 97% 0+24k 0+0io 0pf+0w
time ./shell.250.shared_o1 < test.dat
PROGRAM SHELL 4.1
28.01u 0.79s 0:29 97% 0+26k 0+0io 0pf+0w
uptime
11:40 up 34 days, 20:30, 4 users, load average: 1.01, 1.17, 1.37
# exit
% exit
%
script done on Wed Mar 05 13:26:28 1997
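The "limit" listings are easier to compare once the kbytes values are converted to megabytes. The sketch below does the arithmetic for a few lines copied from the 8400 listings above; the function name limits_in_mb is an illustrative assumption, and 1 megabyte is taken as 1024 kbytes. In every run above, shell.250.static fails only while datasize is at its pre-"unlimit" value, which may or may not be related to the system-time question.

```python
def limits_in_mb(limit_output):
    """Map each kbytes-valued limit name to its size in megabytes (1 MB = 1024 kbytes)."""
    result = {}
    for line in limit_output.splitlines():
        fields = line.split()
        # Skip "unlimited" entries and limits counted in other units (e.g. files).
        if len(fields) == 3 and fields[2] == "kbytes":
            result[fields[0]] = int(fields[1]) / 1024
    return result


# Lines copied from the 8400 "limit" output before and after "unlimit".
BEFORE = """\
cputime unlimited
datasize 131072 kbytes
stacksize 2048 kbytes
"""
AFTER = """\
datasize 2097152 kbytes
stacksize 32768 kbytes
"""

print("before unlimit:", limits_in_mb(BEFORE))
print("after unlimit: ", limits_in_mb(AFTER))
```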
|