T.R | Title | User | Personal Name | Date | Lines |
---|
2047.1 | | BACHUS::DEVOS | Manu Devos NSIS Brussels 856-7539 | Tue May 06 1997 17:39 | 7 |
| Hi,
You got a timeout on the nfs_mountd completion. Are you sure that the
system you tried to relocate to is NFS configured, that /etc/hosts is
still identical to the first member?
Manu.
|
2047.2 | | WRKSYS::ALONGI | | Wed May 07 1997 09:16 | 7 |
| yup.. the hosts files are identical. And the system is definitely nfs
configured exactly the same way as the other one
Anything else I can check?
Thanks,
Doreen
|
2047.3 | HELP !! | WRKSYS::ALONGI | | Wed May 07 1997 14:45 | 101 |
| HELP!!
I made things much worse now. I tried to delete the node that I couldn't
relocate to and it won't delete. I have been sitting here for about 15 minutes.
Member to delete: zoe
Is this correct (y/n) [y]:
Deleting member 'zoe'...
................................................................................
...................................................
The log file says
May 7 13:20:33 bigbird ASE: zoe Director Warning: Director exiting...
May 7 13:20:33 bigbird ASE: bigbird Agent Notice: starting a new director
May 7 13:21:50 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May 7 13:21:51 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/aseusers/lmnt/graphx-users: not currently mounted
May 7 13:21:51 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/aseusers/lmnt/graphx-users already unmounted
May 7 13:22:54 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May 7 13:22:54 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/asearchive/lmnt/archive: not currently mounted
May 7 13:22:54 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/asearchive/lmnt/archive already unmounted
May 7 13:23:48 bigbird ASE: bigbird AseMgr Warning: timeout waiting on Reply to
ASE_DELETE_MEMBER
May 7 13:23:48 bigbird ASE: bigbird AseMgr Notice: director request timed out,
retrying...
May 7 13:23:56 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May 7 13:23:57 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/asebigbird/lmnt/bigbird: not currently mounted
May 7 13:23:57 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/asebigbird/lmnt/bigbird already unmounted
May 7 13:24:59 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May 7 13:24:59 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/aseneon/lmnt/neon: not currently mounted
May 7 13:24:59 bigbird ASE: zoe Agent Notice: /var/ase/sbin/ase_mount_action: /
var/ase/mnt/aseneon/lmnt/neon already unmounted
May 7 13:25:51 bigbird ASE: zoe Director Warning: timeout waiting on Reply to A
SE_DELETE_MEMBER
May 7 13:25:51 bigbird ASE: zoe Director Notice: deleted member zoe
May 7 13:25:53 bigbird ASE: bigbird AseMgr Warning: msgsvc: discarding unclaime
d reply. seq: 11
May 7 13:26:00 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May 7 13:27:01 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May 7 13:28:02 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May 7 13:28:53 bigbird ASE: bigbird AseMgr Warning: timeout waiting on Reply to
ASE_DELETE_MEMBER
May 7 13:28:54 bigbird ASE: bigbird AseMgr Notice: director request timed out,
retrying...
May 7 13:29:03 bigbird ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/n
fs_mountd action completion
May 7 13:29:04 bigbird ASE: zoe Agent Notice: restarting Agent!
May 7 13:29:04 bigbird ASE: zoe Director Warning: msgsvc: discarding unclaimed
reply. seq: 12
May 7 13:30:57 bigbird ASE: zoe Director Notice: msgSvc: unclaimed timeout
When I try to run asemgr and it hangs on the node I am already running the
delete on
# asemgr
....
And if I try it on the node that I tried to delete I get
# asemgr
Enter a comma-separated list of all hostnames you want as ASE servers.
Enter Members:
# tail -f daemon.log
May 7 13:29:03 zoe ASE: zoe Agent Error: timeout waiting on /var/ase/sbin/nfs_m
ountd action completion
May 7 13:29:04 zoe ASE: zoe Agent Notice: restarting Agent!
May 7 13:29:04 zoe ASE: zoe Director Warning: msgsvc: discarding unclaimed repl
y. seq: 12
May 7 13:29:04 zoe ASE: local AseMgr Notice:
May 7 13:29:08 zoe ASE: zoe Agent Notice: in install state
May 7 13:29:09 zoe ASE: local HSM Notice: Network interface fta0 16.122.176.221
UP
May 7 13:29:10 zoe ASE: local HSM ***ALERT: HSM_NI_STATUS:16.122.176.221:UP
May 7 13:29:10 zoe ASE: local Simulator Notice: snd: exiting...
May 7 13:30:57 zoe ASE: zoe Director Notice: msgSvc: unclaimed timeout
May 7 13:32:22 zoe ASE: local AseMgr Notice: Agent is in INSTALL STATE
HELP.. what should I do. Fortunately the services are still available.
Doreen
|
2047.4 | a little more info | WRKSYS::ALONGI | | Wed May 07 1997 15:11 | 26 |
| I finally got
May 7 13:55:59 zoe ASE: bigbird AseMgr Error: Unable to get DB.
May 7 14:00:12 zoe ASE: zoe AseMgr Error: Member change failed
May 7 14:00:12 zoe ASE: zoe AseMgr Error: Delete member failed. Check syslog's
daemon log for reason.
but then I tried to do a status of the services and it is not giving me back
anything but dots.
Enter your choice: s
Obtaining ASE Status
m) Display the status of the members
s) Display the status of a service
l) Display the location of logger(s)
v) Display the level of logging
x) Exit to the Main Menu ?) Help
Enter your choice [x]: s
........................................................
|
2047.5 | | BACHUS::DEVOS | Manu Devos NSIS Brussels 856-7539 | Wed May 07 1997 18:26 | 11 |
| Doreen,
You definitely have a problem with nfs_mountd. I am at home and have
not access to the script, but maybe you can check the contents of
/etc/exports file for a .INCLUDE line and the subsequent files. The
/var/ase/sbin/nfs_mountd is a script, so maybe you can place a "set -x"
at the beginning to find what is happening by looking in the
daemon.log?
Manu.
|
2047.6 | corrupt database | WRKSYS::ALONGI | | Thu May 08 1997 08:06 | 8 |
| panic over
The database files had different sizes and dates so I copied the good one
over to the bad one and rebooted and everything is working now. I can
relocate services too.
thanks
doreen
|