LinuxQuestions.org
Visit Jeremy's Blog.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Red Hat
User Name
Password
Red Hat This forum is for the discussion of Red Hat Linux.

Notices


Reply
  Search this Thread
Old 09-28-2011, 03:38 AM   #1
ineedtosolvetheproblem
LQ Newbie
 
Registered: Sep 2011
Posts: 1

Rep: Reputation: Disabled
Need Help - Two node cluster, RHEL 6 High Availability Add on , with Oracle over NFS


we have set up two node cluster, with Oracle datafiles running in the NFS mounted /data

fail over is working with DB crash, power failure

However, a loss of connectivity on eth0 causes the following problems:

1. The /data mount is not detected to have failed or hung. The netfs.sh script which we are using on the cluster.conf doesnt detect this and try to unount it.

2. The cluster doesn't know eth0 is dead.

clustat reports everything as normal throughout, so nothing happens. Additionally, because /data is essentially hung, manual failover via clusvcadm is also failing.

Here is our cluster.conf file. can anyone contribute please

<?xml version="1.0"?>
<cluster config_version="35" name="cluster1">
<fence_daemon post_fail_delay="0"/>
<clusternodes>
<clusternode name="test1.private" nodeid="1">
<fence>
<method name="manual">
<device name="manual"/>
</method>
</fence>
</clusternode>
<clusternode name="test2.private" nodeid="2">
<fence>
<method name="manual">
<device name="manual"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_manual" name="manual"/>
</fencedevices>
<rm>
<failoverdomains/>
<service autostart="1" name="oracle" recovery="relocate">
<netfs ref="data_mount"/>
<script ref="oracle_resource"/>
<ip address="192.168.1.86" monitor_link="eth0"/>
</service>
<resources>
<script file="/usr/local/bin/test.sh" name="oracle_resource"/>
<netfs export="/data/dbcluster" force_unmount="1" fstype="nfs" host="test.main.example.com" mountpoint="/data" name="data_mount" opt ions="rw,bg,hard,nointr,tcp,nfsvers=3,timeo=600,rsize=32768,wsize=32768,actimeo=0"/>
</resources>
</rm>
</cluster>"
 
Old 09-28-2011, 11:21 PM   #2
kbp
Senior Member
 
Registered: Aug 2009
Posts: 3,790

Rep: Reputation: 653Reputation: 653Reputation: 653Reputation: 653Reputation: 653Reputation: 653
Sounds like you only have one nic .. is this correct? Most people building a HA cluster would have separate nics for presentation, heartbeat and storage. If you only have one nic I'm guessing that you're not using RAC ?
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Internet switch redundancy for high availability cluster charrois Linux - Networking 6 06-17-2008 03:06 PM
LXer: How to setup apache high availability cluster? LXer Syndicated Linux News 0 09-01-2007 03:10 PM
High availability Samba cluster DRBD + Heartbeat djalex Linux - Networking 3 09-05-2006 01:14 PM
High Availability Cluster IP... CRCool75 Linux - Networking 2 06-26-2006 09:44 PM
LXer: Penguin Computing's Scyld ClusterWare(TM) HPC Offers Enhanced High Availability and Most Advanced Linux Cluster Virtualization, Enabling Broader Cluster Use LXer Syndicated Linux News 0 04-06-2006 06:03 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Distributions > Red Hat

All times are GMT -5. The time now is 03:48 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration