| Title: | DECmcc user notes file. Does not replace IPMT. |
| Notice: | Use IPMT for problems. Newsletter location in note 6187 |
| Moderator: | TAEC::BEROUD |
| Created: | Mon Aug 21 1989 |
| Last Modified: | Wed Jun 04 1997 |
| Last Successful Update: | Fri Jun 06 1997 |
| Number of topics: | 6497 |
| Total number of notes: | 27359 |
ALARMS LOG FILE - NEED A DOCTOR
===============================
I have this problem to report. I have three alarms log files;
BRIDGE_ALARMS.LOG for all the bridges,
NODE_ALARMS.LOG for all the Phase IV nodes,
TS-ALARMS.LOG for all terminal servers.
SAMPLE OF THE ALARM RULE PROCEDURE
----------------------------------
CREATE MCC 0 ALARMS RULE SIN999_NODE_UNREACHABLE -
EXPRESSION = (NODE4 .MCC.SRPC.SIN999 STATE <> ON ,-
AT EVERY 00:30:00) ,-
PROCEDURE = MCC_COMMON:MCC_ALARMS_LOG_ALARM.COM ,-
EXCEPTION HANDLER = MCC_COMMON:MCC_ALARMS_LOG_EXCEPTION.COM ,-
PARAMETER = "NODE_ALARMS.LOG" ,-
CATEGORY = "Node unreachable" ,-
DESCRIPTION = "SIN999 NODE UNREACHABLE" ,-
QUEUE = "ALARMS$BATCH" ,-
PERCEIVED SEVERITY= MAJOR ,-
IN DOMAIN = .MCC.SRPC_DOMAIN
The following procedures was carried out to activate the various alarms
rules for bridges, Node4 and terminal servers.
$MANAGE/ENTERPRISE
MCC>DO ENABLE_DECBRIDGE.COM
MCC>DO ENABLE_NODE4.COM
MCC>DO ENABLE_TS.COM
I also activate the DECmcc Iconic Map on a separate instance beforehand.
The version of DECmcc in used is V1.1
Assuming that two of the Node4 entity in the Iconic Map then turns red
due to exception critical conditions. Only one of the alarms are saved
onto the NODE_ALARMS.LOG file. I did various tests and confirm that
if both alarms rule were fired 30 seconds or more apart then both
alarms were saved onto the NODE_ALARMS.LOG file whereas if both alarms
rules were to fired immediately one after another then only one of
the alarms were saved onto the NODE_ALARMS.LOG file.
The same principle applies to both bridges as well as terminal servers.
I also did another test, ie. fired two alarms rule simultaneously but
saved onto different log file. It work fine as both alarms are captured
onto both log files respectively.
I can temporary solve this problems by using different log files for
each alarms rule created. The consequences of this is that there will
be too many alarms log files created.
Has anyone experienced similar problems before?
Are there any better solutions or other alternative plan of actions?
All feedbacks are most welcome.
THANKS IN ADVANCE
| T.R | Title | User | Personal Name | Date | Lines |
|---|---|---|---|---|---|
| 1203.1 | Yes, it's is a problem | TOOK::ORENSTEIN | Mon Jul 01 1991 14:41 | 20 | |
Yes, this is a known problem.
If rules fire at the same time, and use the same logfile,
the logfile is opened on a first-come-only-served basis.
There is no retry builtin if successive rules can not open
the file.
I don't think that this will be fixed in V1.2, and with
the Map being so snazzy, the ALARMS command procedures are
becoming a smaller focus.
As for things to do, you could edit the sample command
procedure to make it retry, or write the message to a unique
temporary file and then append it to the logfile.
PS. I'll make sure this reaches the next round of release
notes.
aud...
| |||||
| 1203.2 | Use a single job batch queue | NSSG::R_SPENCE | Nets don't fail me now... | Mon Jul 01 1991 15:46 | 15 |
I believe that the reccomended workaround is to use a batch queue for
the alarms proceedures that will write to a log file that has a limit
of aonly one job at a time. That way if two alarms fire at one time,
each proceedure will run and close the log file before another
proceedure will open it.
To the developers:
I understand why the focus is on the map. However, it might serve to
simply document the above reccomendation (I got it from engineering
myself) in both the proceedures, and perhaps in the manual.
As far as the broadcast and mail proceedures go, they seem to be happy
with multiple copies running at the same time.
s/rob
| |||||
| 1203.3 | Thanks for the memory ... | TOOK::ORENSTEIN | Tue Jul 02 1991 09:42 | 8 | |
Thanks for the suggestion. I will see that it is clearly documented
in both the procedure and the manual.
And yesm the broadcast and mail have no problems because the data is
written to unique temporary files before being mailed or sent.
aud...
| |||||
| 1203.4 | It is already in the Alarms manual | DFLAT::PLOUFFE | Jerry | Tue Jul 02 1991 14:53 | 12 |
re: -.1
> Thanks for the suggestion. I will see that it is clearly documented
> in both the procedure and the manual.
I'm not so sure how clear it is, ;) but...
This point is already documented in the manual. See page 4-2 (Log
Notification for Alarm Conditions) and 4-4 (Log Notification for Error
Conditions).
- Jerry
| |||||