T.R | Title | User | Personal Name | Date | Lines |
---|
9972.1 | Some info | 28964::PARKER | | Wed May 28 1997 18:09 | 31 |
|
Hi, Ken.
>> How often and under what cercumstances is the UBC flushed?
Every 30 seconds the update daemon sync's the disks.
>> For how long will this simple lock be taken out?
It depends on several factors but it could be as long as it takes
to flush the UBC. 800 ms is not unreasonable.
>>The problem that my customer is seeing is that a realtime process
>>will have an 800 millisecond delay when a process (timeshare priority)
>>does a "dd if=core of=new_core conv=sparse". The system itself
>>is a Turbolaser with 4 processors and 4GB of memory. The core files
>>themselves are small (10 to 15MB).
Well, if it's using timeshare scheduling (i.e. non-fixed priority)
then I would not consider the application "realtime". Have they tried
using round-robin or fixed priority policies? Might be a good idea.
I would be happy to discuss the issue with you. They could also lock
their app into memory or try other things to help reduce the latency.
Best Regards,
Lee Parker [email protected]
Digital UNIX Device Driver & Realtime Support
Realtime Expertise Center 1-800-354-9000
|
9972.2 | | HELIX::SONTAKKE | | Thu May 29 1997 12:33 | 11 |
| I thought he said the process doing "dd if=core of=new_core
conv=sparse" was using timeshare priority.
Which version of the operating system?
Few weeks ago, somebody mentioned that doing large file copy was
causing the system to become unresponsive during the time it took to
copy the file. I have not seen any update to that topic. I wonder if
the root cause is similar to Ken's problem.
- Vikas
|
9972.3 | More info | RHETT::PARKER | | Thu May 29 1997 13:18 | 19 |
|
Hi, Vikas.
I just talked to Ken about it - yes, the dd(1) is timeshare and
the realtime app is SCHED_FIFO. It's sending data over the net
using sockets (I think) so maybe that's one reason it's being
blocked? Ken is going to try to get more info using ps(1) and
any other tools he can find - do you all have any tools that
will show more ? Their ubc-maxpercent is something like 3% so
it should not take that long to sync the disks - maybe it's too
low? Maybe Ken can either mail you more info or post it here.
I recall the other note too - I'm not sure if/how it was resolved.
I can't seem to find it right now.
Thanks,
Lee
|
9972.4 | | CSC64::BLAYLOCK | If at first you doubt,doubt again. | Thu May 29 1997 14:09 | 31 |
|
The version of the OS is V4.0A with BL4 of USEG's patches
applied (the January kit).
The only realtime process is reading and writing over the
network (FORE ATM) via the BSD socket interface (UDP). The
process has never lost any packets, it is only the delay
that is noticed (the packet is echoed with a time stamp).
Changing the network interface to FDDI or ethernet does
not change the delay.
Changing the process priority from 33 to 63 has no effect.
Changing the priority of the netisr thread(s) has no effect.
There are actually no real users on the system (except the
developer) and so an 800 ms delay is not noticable except to
this one peice of the application.
Even with ps(1), getting a picture of what the RT priority
process is being blocked by is not going to be easy. The
only time that the delay is seen is when the dd(1) occurs
(the script involved is looking for and copying core files
so the 'conv=sparse' is necessary.)
We have been looking for tools to find out what the schedualar
has been doing, but the POLYCENTER tools are not up to V4.0 yet
and none of the others (Parasite) seem to show what the schedular
is actually doing.
Thanks to all for the response.
|
9972.5 | | HELIX::SONTAKKE | | Thu May 29 1997 17:18 | 1 |
| I just found the note; it is 9075 and it talks about mv performance.
|
9972.6 | | KITCHE::schott | Eric R. Schott USG Product Management | Thu May 29 1997 21:42 | 4 |
| Setting ubc-max to 3% is a really BAD idea....
I don't know the answer, but this one at 3% can't help...
|