[Linux-cluster] cluster crash problem

Andrea Laack alaack at ustrap.com
Tue Feb 12 18:53:53 UTC 2008


We are running RHEL 3.0 with version 1.0.3 of RedHat cluster suite.  We are
utilizing a Promise Vtrak 15200 for shared storage and an Adaptec ASA-7211C
iSCSI initiator.
Having problems with any process that uses high I/O across the iSCSI link.

Last night dba attempted to create an Oracle instance on the shared storage
device.  The cluster crashed and failed over the the backup node.  Nothing
in the logs.  Log level set at 6.  Only indication I have that something
happened is from the graphs of the disk I/O (HotSanic).  This shows 69.42
Pentabytes (yes, it shows pentabytes).

We are using a watchdog timer.

This has happened before when copying *very* large amounts of data that
includes *very* large files.  Many small files does not cause the cluster to
crash.

Has anyone seen this type of problem?  Any help will be sincerely
appreciated.  Adaptec will only talk to me if I pay them $199/phone call.

Thanks
Andrea

Andrea Laack
Network Administrator
Universal Strap 
W209N17500 Industrial Drive
Jackson, WI  53037
262-677-3641 Ext 5220




More information about the Linux-cluster mailing list