Title: | ase |
Moderator: | SMURF::GROSSO |
Created: | Thu Jul 29 1993 |
Last Modified: | Fri Jun 06 1997 |
Last Successful Update: | Fri Jun 06 1997 |
Number of topics: | 2114 |
Total number of notes: | 7347 |
Hello, We have a DECsafe environment v1.3 with 2 network interfaces: FDDI Ethernet interface. Primary network for ASE is the FDDI, Ethernet has been configured as backup. 2 * 4100 with DU 3.2F. Our problem was that the other day, nobody could access to ASE service defined on one system. The daemon.log in that moment gave next messages: Apr 1 10:08:18 rtrprd2 DECsafe: local HSM Warning: Can't ping rtrprd1-f over th e network Apr 1 10:08:20 rtrprd2 DECsafe: local HSM Notice: /var/ase/sbin/ase_run_sh: cha nge host 10.10.47.234: gateway 10.10.33.220 Apr 1 10:08:20 rtrprd2 DECsafe: local HSM Notice: /var/ase/sbin/ase_run_sh: cha nge host 10.10.33.220: gateway 10.10.33.220 Apr 1 10:08:20 rtrprd2 DECsafe: local HSM ***ALERT: HSM_PATH_STATUS:10.10.33.22 0:UP:10.10.47.234:DOWN Apr 1 10:54:48 rtrprd2 DECsafe: rtrprd2 Agent Notice: stopping service rtrprd Apr 1 10:56:27 rtrprd2 DECsafe: rtrprd2 Agent Notice: /var/ase/sbin/ase_run_sh rtrprd: rtrprd: stop at Tue Apr 1 10:54:49 MET DST 1997 Apr 1 10:56:28 rtrprd2 DECsafe: rtrprd2 Agent Notice: /var/ase/sbin/ase_run_sh rtrprd: RTRPRD: Parando la aplicacion Apr 1 10:56:28 rtrprd2 DECsafe: rtrprd2 Agent Notice: /var/ase/sbin/ase_run_sh rtrprd: stty: tcgetattr: Not a typewriter Apr 1 10:56:28 rtrprd2 DECsafe: rtrprd2 Agent Notice: /var/ase/sbin/ase_run_sh rtrprd: ping: sendto: No buffer space available Apr 1 10:56:28 rtrprd2 DECsafe: rtrprd2 Agent Notice: /var/ase/sbin/ase_run_sh rtrprd: PING rtrp-f (10.10.47.244): 56 data bytes Apr 1 10:56:28 rtrprd2 DECsafe: rtrprd2 Agent Notice: /var/ase/sbin/ase_run_sh rtrprd: ping: wrote rtrp-f 64 chars, ret=-1 At that moment, you could do a ping at FDDI computer address, but you couldn't access to any FDDI ASE service. When you try to do a ping to an ASE FDDI service it said: "No buffer space available". Ethernet works OK. We didn't see any messages saying that "fta0: Link unavailable". We did: # ifconfig fta0 down # ifconfig fta0 up Doing that, the problem was solved, but this is a temporary solution, and i would know which is the real problem customer has. In order to solve the problem i don't know if this could be a system parameter or perhaps a configuration problem from dec safe. If anybody knows which parameter, please let me know. I'm going to analyze with "netstat -m" and "vmstat -M" the status when there is no problem. I'm going to suggest customer to do a procedure that do a ping to other computer thru Ethernet/FDDI and each other thru both interfaces and to ASE services address, in order to isolate the problem. Any other suggestion would be appreciatte. Thanks in advance, Carmen Arranz. MCS. Madrid. Spain. ************** Cross posted in ASE/DIGITAL UNIX **************
T.R | Title | User | Personal Name | Date | Lines |
---|---|---|---|---|---|
1988.1 | Any idea about mbuf error? | MDR01::CARRANZ | MCS Madrid | Tue Apr 08 1997 08:23 | 9 |
Please, has anybody out here any idea about this problem? Any help will be very appreciate. Thanks in advance, Carmen Arranz | |||||
1988.2 | usr406.zko.dec.com::Marshall | Rob Marshall | Fri Apr 11 1997 13:53 | 20 | |
Hello Carmen, I don't know which interface is your FDDI, but according to this one of the interfaces (10.10.47.234) was seen as DOWN: Apr 1 10:08:20 rtrprd2 DECsafe: local HSM ***ALERT: HSM_PATH_STATUS:10.10.33.22 0:UP:10.10.47.234:DOWN If the FDDI goes down, it is possible to see the error message "no buffer space available". This is because even an unconnected interface will queue up packets. Eventually the send queue for the interface will fill up, and you will get this error. There is a patch coming that will help in this case. If you need it sooner, you can open a CLD and I will get you a test patch. But, what you really need to figure out is why you lost your FDDI connection. Fix the problem with the FDDI, and you won't have the "no buffer space..." problem. Rob Marshall USEG | |||||
1988.3 | One or two problem? | MDR01::CARRANZ | MCS Madrid | Tue Apr 15 1997 14:14 | 56 |
Thanks for your reply, Sorry, 10.10.47.234 is the FDDI network address, as you have guessed. We have had some ASE messages about that FDDI is down, and after some seconds it's up. Today we have had a big customer problem, i suppossed related with this one: Customer couldn't connect to the ase alias services, it wasn't responding, but you could do a ping to that physical interface "10.10.47.234": # ping 10.10.47.234 was responding And customer could do ping to any FDDI network node without any problem. May be that possible? Few seconds before that, customer had the same message about that ASE couldn't do a ping thru FDDI, FDDI was down (same as .0). I have read that when you have primary network saturated, and if you out of mbufs, you should adjust the "ubcmaxpercent" and "ubcminpercent". My question now is that i'm not sure if here we are having one problem or two different problems. Could anybody help me in that question?. Now customer will safe into a file then "netstst -in", "netstat -m" and "vmstat -M" outputs daily and will send them to me. We have raised every ASE processes and we have increased too the timing parameters into "/etc/hsm.conf". We have increased next parameters: HSM_PNET_TBS=100 HSM_PNET_TBF=120 HSM_PNET_MR=4 rest parameters have default values. How could i delimited the FDDI ASE problem. Any help would be very appreciate. Thanks in advance, Carmen Arranz |