| Title: | SNA GATEWAY NOTEFILE |
| Notice: | Note 1.* -> kits and doc, 288.* -> obtaining product support |
| Moderator: | EDSCLU::GARROD |
| Created: | Fri Feb 07 1986 |
| Last Modified: | Fri Jun 06 1997 |
| Last Successful Update: | Fri Jun 06 1997 |
| Number of topics: | 7116 |
| Total number of notes: | 28576 |
My customer is using the SNA Peer Server on Digital UNIX 3.2G with SNA Terminal and Print Services. Intermittently we seem to be losing our printer sessions to the mainframe. I found this happens by two means, one is we lose the *.pid file for the individual printer, second is the PID listed in the *.pid file is not running (cannot be found with a ps command). To remedy this I wrote a 'watchdog' program to verify that a *.pid file exists for each printer and that the PID is actually running. If either of these conditions are not met the printer is restarted. I set this up as a cron job to run every hour. My first question is are there any know reasons why we would be losing these print sessions? Is this a common or known problem with Terminal and Print Services? In addition occasionaly when the customer attempts to start a 3270 session they get a message that that TPS is not running or that the 3270 session cannot be started. The customer then runs a exstop -n TPS and exstart - n TPS. After which the 3270 session starts and runs fine. This seems similar to the problem with the 3287 print sessions. Is this a known problem? The next reply is the watchdog program I wrote for the printer sessions. To help the customer I have increased the frequency of this to run every 15 minutes. Any information or suggestions would be appreciated. Thanks, Rich Rose Sales Support - Albany, New York [email protected]
| T.R | Title | User | Personal Name | Date | Lines |
|---|---|---|---|---|---|
| 7004.1 | Watchdog Program | NQOS01::nqsrv225.nqo.dec.com::Rich Rose | Thu Feb 06 1997 10:55 | 108 | |
#
# Program Name: sna_ck_print
# Function: Verify that printers configured to start via the /sbin/rc3.d/
# S99snaps program have a valid session with the SNA Peer
Server.
#
# Author: R.Rose Digital Equipment Corporation
# Date: 11/1/96
#
#
#---------------------------------------------------------------------------
# Set environment parameters
#ksh
COMMLINK=/usr/commlink
export COMMLINK
. $COMMLINK/bin/utl/commpath
# Check the size of the log file and make sure it is not getting too big.
# If /var/sna/printer.log is 500 lines or more copy it to printerlog.old and
# start a new log file.
echo "Starting Printer check..."
wc /var/sna/printer.log |cut -c8-10>/var/sna/logsize
read num </var/sna/logsize
if [ "$num" -ge "500" ]
then cp /var/sna/printer.log /var/sna/printerlog.old
echo "New Printer Log started on \c" >/var/sna/printer.log
date >>/var/sna/printer.log
fi
# Now - make a list of all printers configured in S99snaps
# Put this list into file: /var/sna/snatmp
cat /sbin/rc3.d/S99snaps|grep start3287 |cut -c33-38 >/var/sna/snatmp
# Now read each entry in the file and verify that pid is running.
cat /var/sna/snatmp |while read line
do
#First check to see that *.pid file exists if not stop then start printer.
ls -l /usr/commlink/adm/3270/$line.pid > /dev/null 2>/dev/null
if [ "$?" -ne 0 ]
then
echo "Printer $line had no $line.pid file as of \c" >>/var/sna/printer.log
date >>/var/sna/printer.log
#Stop the printer cleanly:
/usr/commlink/bin/3270/stop3287 $line >/dev/null 2>/dev/null
#Now restart the printer:
/usr/commlink/bin/3270/start3287 $line -p /usr/commlink/adm/3270/$line.pc
>/dev/
null 2>/dev/null
wait
# Send message to log file that attempt was made to restart printer.
echo "Restarting Printer $line on \c" >>/var/sna/printer.log
date >>/var/sna/printer.log
# Check to see if pid is valid:
else
# echo $line
read x </usr/commlink/adm/3270/$line.pid
# echo $x
ps -p $x >/dev/null
# echo $?
if [ "$?" -ne 0 ]
then
#echo "Printer $line is offline."
#Print Session is not running send message to log file.
echo "Printer $line had no active pid as of \c" >>/var/sna/printer.log
date >>/var/sna/printer.log
# Remove the old *.pid file
rm /usr/commlink/adm/3270/$line.pid >/dev/null 2>/dev/null
#Stop the printer cleanly:
/usr/commlink/bin/3270/stop3287 $line >/dev/null 2>/dev/null
#Now restart the printer:
/usr/commlink/bin/3270/start3287 $line -p /usr/commlink/adm/3270/$line.pc
>/dev/
null 2>/dev/null
# Send message to log file that attempt was made to restart printer.
echo "Restarting Printer $line on \c" >>/var/sna/printer.log
date >>/var/sna/printer.log
else
echo "Printer $line is on line." >/dev/null
fi
fi
done
echo "Printer Check completed at \c" >>/var/sna/printer.log
date >>/var/sna/printer.log
| |||||
| 7004.2 | Well, I try another way! | NQOS01::nqsrv203.nqo.dec.com::Rich Rose | Thu Feb 13 1997 08:16 | 6 | |
Based on the response (actually lack of) I guess I should start the problem escalation process to get to someone to pay attention to this problem. Thanks anyway, Rich | |||||
| 7004.3 | Problems need problem reports | EDSCLU::GARROD | IBM Interconnect Engineering | Thu Feb 13 1997 10:03 | 7 |
Re .-1
You've got it. As has been said many times before this notesfile is not
a problem reporting or escalation mechanism. It is purely for
INFORMATION sharing.
Dave
| |||||
| 7004.4 | Some traces will have to go to support. | DELNI::BARBER_MINGO | Let me DANCE for you | Tue Feb 18 1997 17:37 | 26 |
Hi,
You should use support channels, but when you do, they might wish
to know:
1- Is your engine TPS.gwy failing?
2- What type of gateway is your TPS engine communicating with?
3- Do you have sna traces of the gateway? Is your IBM bouncing
the line from time to time?
4- Do you have Update 5 of the TPS software?
5- Do you have TPS traces of the failures?
I haven't heard of processes comming down for no reason.
(I once heard of a watchdog problem that killed TPS engines
by accident. It was an "old process" watchdog that the system
programmer had implemented, but not shared with the customer."
Besides that, it has usually been the IBM's that brought down
communications.
Anything else, you'd have to get to the CSC. Even if someone
here knew about it, they couldn't get you a "fix". The engineers
for this product aren't on this conference.
Regards,
Cindi
| |||||