On 18/02/11 20:52, Jonathan Gowar wrote:
On 18/02/11 13:30, Jonathan Gowar wrote:
I've been following the Clusters from Scratch guide, by Beekhof. I'm
using Debian 6, so I don't know how much that might confuse things; I
appreciate there are a few Debian specifics.
Before adding the DRBD Pacemaker resource, crm status looked fine. After
configuring the resource I get the following error from crm_mon:-
WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not configured
Here is the crm configuration, and monitor:-
root@squeeze:~# crm configure show
node sleeze
node sneeze
node squeeze
primitive ClusterIP ocf:heartbeat:IPaddr2 \
params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \
op monitor interval="30s"
primitive WebData ocf:linbit:drbd \
params drbd_resource="wwwdata" \
op monitor interval="60s"
primitive WebSite ocf:heartbeat:apache \
params configfile="/etc/apache2/apache2.conf" \
op monitor interval="1m"
ms WebDataClone WebData \
meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
colocation website-with-ip inf: WebSite ClusterIP
order apache-after-ip inf: ClusterIP WebSite
property $id="cib-bootstrap-options" \
dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
cluster-infrastructure="openais" \
expected-quorum-votes="3" \
stonith-enabled="false"
rsc_defaults $id="rsc-options" \
resource-stickiness="100"
root@squeeze:~# crm status
============
Last updated: Fri Feb 18 13:15:53 2011
Stack: openais
Current DC: sneeze - partition with quorum
Version: 1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b
3 Nodes configured, 3 expected votes
3 Resources configured.
============
Online: [ squeeze sneeze sleeze ]
ClusterIP (ocf::heartbeat:IPaddr2): Started sneeze
WebSite (ocf::heartbeat:apache): Started sneeze
Master/Slave Set: WebDataClone
Masters: [ squeeze ]
Slaves: [ sneeze ]
Failed actions:
WebData_monitor_0 (node=sleeze, call=4, rc=6, status=complete): not configured
WebData_monitor_0 (node=sneeze, call=9, rc=6, status=complete): not configured
WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not configured
Does anyone have any ideas as to how I might investigate where the
problem is?
Kind regards,
Jon
Hi,
Found out how to debug failing resources:-
http://www.clusterlabs.org/wiki/Debugging_Resource_Failures
I managed to clear one problem: fuser was not installed, which means
installing the psmisc package for Debian users.
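A quick way to check each node for this kind of missing dependency is sketched below. The helper name missing_tools is hypothetical and not part of any cluster tooling; it only reports which of the named commands are absent from PATH.

```shell
# Print the name of every listed command that cannot be found on PATH.
# (missing_tools is a made-up helper name, used here for illustration only.)
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# fuser is the tool that was missing in this thread.
missing_tools fuser
```

On Debian, fuser is shipped in the psmisc package, so a line of output here suggests running "apt-get install psmisc" on that node.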
root@squeeze:~# crm configure show
node sleeze
node sneeze
node squeeze
primitive ClusterIP ocf:heartbeat:IPaddr2 \
params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \
op monitor interval="30s"
primitive WebData ocf:linbit:drbd \
params drbd_resource="wwwdata" \
op monitor interval="60s"
primitive WebFS ocf:heartbeat:Filesystem \
params device="/dev/drbd/by-res/wwwdata" directory="/var/www/drbd" fstype="ext4" \
meta is-managed="true"
primitive WebSite ocf:heartbeat:apache \
params configfile="/etc/apache2/apache2.conf" \
op monitor interval="1m"
ms WebDataClone WebData \
meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true" is-managed="false"
location cli-prefer-WebSite WebSite \
rule $id="cli-prefer-rule-WebSite" inf: #uname eq sleeze
colocation WebSite-with-WebFS inf: WebSite WebFS
colocation fs_on_drbd inf: WebFS WebDataClone:Master
colocation website-with-ip inf: WebSite ClusterIP
order WebFS-after-WebData inf: WebDataClone:promote WebFS:start
order WebSite-after-WebFS inf: WebFS WebSite
order apache-after-ip inf: ClusterIP WebSite
property $id="cib-bootstrap-options" \
dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
cluster-infrastructure="openais" \
expected-quorum-votes="3" \
stonith-enabled="false" \
last-lrm-refresh="1298043091"
rsc_defaults $id="rsc-options" \
resource-stickiness="100"
Here are a couple of bad-looking lines from the debug output:-
/usr/lib/ocf/resource.d/linbit/drbd: 1: [[: not found
/usr/lib/ocf/resource.d/linbit/drbd: 1: 0x080307: not found
/usr/lib/ocf/resource.d/linbit/drbd: 1: Bad substitution
n.b. See full debug report at http://pastebin.com/pjKxBu8K
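Messages such as "[[: not found" and "Bad substitution" are what a plain POSIX shell like dash prints when it is handed bash-only syntax, and /bin/sh is dash on a default Debian 6 install. Assuming that is what happened here (for instance, the agent being run as "sh -x <agent>" while debugging, which bypasses its "#!" line), a sketch like the following shows which interpreter the agent asks for and what /bin/sh resolves to. shebang_of is a hypothetical helper name.

```shell
# Print the interpreter named on a script's "#!" line (prints nothing if absent).
# (shebang_of is a made-up helper name, used here for illustration only.)
shebang_of() {
    head -n 1 "$1" | sed -n 's/^#! *//p'
}

# Agent path taken from the error messages in this thread.
agent=/usr/lib/ocf/resource.d/linbit/drbd
if [ -r "$agent" ]; then
    shebang_of "$agent"
fi

# Show which shell /bin/sh actually points at on this node.
readlink -f /bin/sh
```

If the first line names bash while /bin/sh points at dash, re-running the debug step with "bash -x" instead of "sh -x" should make these three messages go away and leave only the agent's real complaint.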
OCF Return Code: 2
OCF Alias: OCF_ERR_ARGS
Description: "The resource's configuration is not valid on this machine.
Eg. Refers to a location/tool not found on the node."
Recovery Type: hard
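Note that crm_mon reported rc=6 for the failed probes, while the description quoted above is for rc=2. For reference, here is a small lookup of the standard OCF return codes, written as a hypothetical shell helper (ocf_rc_name is not part of Pacemaker):

```shell
# Map a numeric OCF return code to its symbolic name.
# (ocf_rc_name is a made-up helper name, used here for illustration only.)
ocf_rc_name() {
    case "$1" in
        0) echo OCF_SUCCESS ;;
        1) echo OCF_ERR_GENERIC ;;
        2) echo OCF_ERR_ARGS ;;
        3) echo OCF_ERR_UNIMPLEMENTED ;;
        4) echo OCF_ERR_PERM ;;
        5) echo OCF_ERR_INSTALLED ;;
        6) echo OCF_ERR_CONFIGURED ;;
        7) echo OCF_NOT_RUNNING ;;
        8) echo OCF_RUNNING_MASTER ;;
        9) echo OCF_FAILED_MASTER ;;
        *) echo UNKNOWN ;;
    esac
}

ocf_rc_name 6   # prints OCF_ERR_CONFIGURED, the "not configured" seen in crm_mon
```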
Let me know if there's anything else I need to post.
Kind regards,
Jon
Hi,
This appeared to be a problem with running 3 nodes. Stopping corosync on
one of the nodes alleviated the problem.
Is it possible to have a 3-node cluster, with 3 nodes running Apache and 2
running DRBD? If so, can someone point me in the direction of how to do it?
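One possible approach, offered as an untested sketch rather than a confirmed answer: keep all three nodes in the cluster and add a location constraint that bans the DRBD master/slave set from the node that has no DRBD device. The constraint id below is made up, and sleeze is assumed to be the node without DRBD, going by the crm status output earlier in the thread.

```
location WebData-not-on-sleeze WebDataClone -inf: sleeze
```

Pacemaker still probes every node for every resource (the monitor_0 operations listed under failed actions), so the drbd agent and the tools it calls would still need to be present on the third node. This only covers DRBD placement; running Apache on all three nodes would need a separate clone and is not shown here.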
Kind regards,
Jon
--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster