| Joel,
A bit of apples and oranges.
ASE is to make sure that the CPU and network resources are not single
points of failure.
Replication, in your instance, is to ensure that the DB Engine and site
is not a single point of failure.
For a single site it is probably sufficient to use disk mirroring and
ASE.
For multiple sites, replication can provide a good means of (almost)
synchronizing the data base. And if one site dies the other can take
over.
For your case it sounds like TruCluster with OPS is the best. That way
both boxes can access the data base, and if either side goes down, the
other side can take over.
A common scenario is to have two ASE's, one at each site. Then the
sites replicate between themselves. If you lose a box, the ASE pair
takes over, if you lose a site, the other site takes over.
A good place to start with this is at
http://www-unix.zk3.dec.com/www/dt.html
/Another Joel
|
| Joel,
But the customer is paranoid after the AdvFS corrupted the Oracle files
when a KZPSA went bad. Telco customer wrote off as bad debts the
billing files that got lost and were no longer reconstructable. You
should have been in the customer meeting after that happen and feel
the heat of the implications of the incident. Imagine that a hw
failure creeping up to AdvFS level!
So, he is no longer confident with disk mirroring over HSZs and KZPSAs.
What with cpio, tar, and vdump not working with his GBs of Oracle files
to backup.
What Oracle did is to ship archive logs across the other system. Only
one ASE service is online - the one that holds the production data.
One other storage holds a copy and receives the archive log from the
other system. This is not a service. It's just ordinary mounted AdvFS.
They haven't opted for full Replication it turns out on the ASE site.
They are in fact doing this for the other sites.
The conflict is in that the ASE service on the active system and the
ordinary AdvFS mounted fs from the idle server both uses a common
mount point. So, when a failover happens, the ASE fails to do its
job precisely because the idle server is using the mount point.
I need help on constructing a script to manage this transition
in different failover scenarios - dismount the idle server's mount
point, start service on that freed up mount point; when the other
server comes back, mount the non-ASE service AdvFS on it - basically
exchange roles when a failover happens.
/joeljosol
|