T.R | Title | User | Personal Name | Date | Lines |
---|
976.1 | | DWOVAX::YOUNG | Note early. Note Often. | Tue Nov 15 1988 11:54 | 8 |
| > It can be done if I have the distribution and deviation of the
> MTBF and MTTR figures, but that is unfortunately not the case.
MTBF of most computer components is usually considered to be a Poisson
distribution. MTTR is trickier, it depends on what kind of service
contracts, guarantees, and reliability exists. I would advise trying
to come up with some function that approximates what you (or others)
think will be the distribution of MTTR.
|
976.2 | Insufficient information. | ERLTC::COOPER | Topher Cooper | Tue Nov 15 1988 13:03 | 39 |
| There is not enough information here to calculate what you want.
Indeed there is not enough information here to calculate what you
claim to have calculated. You cannot calculate the mean time
between failures for the system as a whole from the mean time
between failures for its components unless you make some additional
assumptions.
Specifically, you have probably assumed that the failures of
the components are random in time, and independent of each other
both within and between components.
If you make those assumptions than each failure for the system
as a whole is also independent of all the others and independent
of the time. That is, the probability of the system going down
during an interval of any given length (say one second) is the
same whether or not the system has been repaired recently and whether
it is day night or weekend. This is unlikely to be true but may
be a reasonable approximation.
In this case the distribution of inter-failure times follows the
exponential distribution:
p(t) = (1/T)*exp(-t/T)
where T is the mean time between failures (if I haven't misremembered
the formula). The Poisson distribution is related in that it tells
you how many failures will occur in a given sized interval.
It does not seem to me that these assumptions are very well justified
for repair time, and without some knowledge of the distribution
of the repair time beyond its mean, you cannot make your calculations.
Off the top of my head, I would say that the best model might be
a constant time interval plus a normal distribution. That requires
an estimate the constant interval plus the variance of the variable
part. You might get away with a simple normal approximation.
What are you willing to assume about repair times?
Topher
|
976.3 | Thinking..... | UTRTSC::LUBBERS | Jan Lubbers, Software Support. | Thu Nov 17 1988 02:38 | 8 |
| Thanks,
I have to think now I think ...
I'll come back to this problem later when I have defined the
assumptions better.
Regards.
|
976.4 | a few comments | PULSAR::WALLY | Wally Neilsen-Steinhardt | Fri Nov 25 1988 12:48 | 21 |
| First off, this is a well-studied problem and there is no reason
for you to thrash around in the dark. You might start with
_Maintainability and Maintenance Mangement_, J. D. Patton, Instrument
Society of America, Research Triangle Park, North Carolina, 1980.
And a lot of books and Proceedings with "Reliability" in the title
discuss this problem.
Second, MTTR is defined as "the average length of working clock
time required to complete a corrective service call..." and excludes
response time and travel time. Depending on the repair strategy
these may be large, so MTTR is not really what you want to combine
with MTBF to get availability.
Third, availability depends strongly on the redundancy built into
the hardware and the planned repair strategy. You have to define
these first.
Fourth, if this is Digital business, you should contact the CSSE
experts on availability, who have some computer models for making
these calculations, lots of experience in applying them, and some
business judgement which is often more relevant than the math.
|
976.5 | formulas? | UTRTSC::LUBBERS | Jan Lubbers, Software Support. | Mon Nov 28 1988 02:47 | 20 |
| Hi Wally,
Thanks for your input, yes I understand the problems you mentioned.
What I'm really after is a way to determine the probability of a
certain availability figure...
I do have a simulation model and a program that calculates everything,
but I want to add some additional functionality.
To do this I need to understand how a "correct" formula is made.
I also know there are a lot of assumpitions involved, my major
problem is how the different figures are distributed.
eg. I don't know if the "mean" in MTBF and MTTR is the mathematical
mean, the modus or the median of those figures. Also Im looking
for the (standard) deviation. In the programs I have they are assumed
to be lognormal for repair time and exponential for uptime.
Is this a correct assumption?
Regards, Jan
|
976.6 | New conference on Availability | UTRTSC::LUBBERS | Jan Lubbers, Software Support. | Mon Nov 28 1988 11:26 | 14 |
| I have created a new conference on this subject.
The conference is located at UTRROM::UPTIME
Anyone who is interested in the subject and knowledgeable persons
on mathematics and statistics are free to contribute...
I have the intention to improve the existing tool to incorporate
the new developments like volume shadowing and VAXsimPLUS that
influence availability.
Hit <select> or <KP7> to add it to your notebook.
Regards, Jan.
|