On Fri, Feb 18, 2011 at 10:51 PM, Jonathan Gowar <jon@xxxxxxxxxxxxxxxx> wrote: > On 18/02/11 20:52, Jonathan Gowar wrote: >> >> On 18/02/11 13:30, Jonathan Gowar wrote: >>> >>> I've been following the cluster from scratch guide, by Beekhof. I'm >>> using Debian 6, so I don't know how much that might confuse things; I >>> appreciate there are a few debian-specifics. >>> >>> Before adding the drbd pacemaker resource crm status looked fine. After >>> configuring the resource I get the following error from crm_mon:- >>> >>> WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not >>> configured >>> >>> Here is the crm configuration, and monitor:- >>> >>> root@squeeze:~# crm configure show >>> node sleeze >>> node sneeze >>> node squeeze >>> primitive ClusterIP ocf:heartbeat:IPaddr2 \ >>> params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \ >>> op monitor interval="30s" >>> primitive WebData ocf:linbit:drbd \ >>> params drbd_resource="wwwdata" \ >>> op monitor interval="60s" >>> primitive WebSite ocf:heartbeat:apache \ >>> params configfile="/etc/apache2/apache2.conf" \ >>> op monitor interval="1m" >>> ms WebDataClone WebData \ >>> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" >>> notify="true" >>> colocation website-with-ip inf: WebSite ClusterIP >>> order apache-after-ip inf: ClusterIP WebSite >>> property $id="cib-bootstrap-options" \ >>> dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \ >>> cluster-infrastructure="openais" \ >>> expected-quorum-votes="3" \ >>> stonith-enabled="false" >>> rsc_defaults $id="rsc-options" \ >>> resource-stickiness="100" >>> root@squeeze:~# crm status >>> ============ >>> Last updated: Fri Feb 18 13:15:53 2011 >>> Stack: openais >>> Current DC: sneeze - partition with quorum >>> Version: 1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b >>> 3 Nodes configured, 3 expected votes >>> 3 Resources configured. >>> ============ >>> >>> Online: [ squeeze sneeze sleeze ] >>> >>> ClusterIP (ocf::heartbeat:IPaddr2): Started sneeze >>> WebSite (ocf::heartbeat:apache): Started sneeze >>> Master/Slave Set: WebDataClone >>> Masters: [ squeeze ] >>> Slaves: [ sneeze ] >>> >>> Failed actions: >>> WebData_monitor_0 (node=sleeze, call=4, rc=6, status=complete): not >>> configured >>> WebData_monitor_0 (node=sneeze, call=9, rc=6, status=complete): not >>> configured >>> WebData_monitor_0 (node=squeeze, call=11, rc=6, status=complete): not >>> configured >>> >>> Does anyone have any ideas as to how I might investigate where the >>> problem is. >>> >>> Kind regards, >>> Jon >> >> Hi, >> >> Found out how to debug failing resources:- >> >> http://www.clusterlabs.org/wiki/Debugging_Resource_Failures >> >> I managed to clear 1 problem, fuser was not installed; that means psmisc >> for Debian users. >> >> >> root@squeeze:~# crm configure show >> node sleeze >> node sneeze >> node squeeze >> primitive ClusterIP ocf:heartbeat:IPaddr2 \ >> params ip="xxx.xxx.xxx.xxx" cidr_netmask="32" \ >> op monitor interval="30s" >> primitive WebData ocf:linbit:drbd \ >> params drbd_resource="wwwdata" \ >> op monitor interval="60s" >> primitive WebFS ocf:heartbeat:Filesystem \ >> params device="/dev/drbd/by-res/wwwdata" directory="/var/www/drbd" >> fstype="ext4" \ >> meta is-managed="true" >> primitive WebSite ocf:heartbeat:apache \ >> params configfile="/etc/apache2/apache2.conf" \ >> op monitor interval="1m" >> ms WebDataClone WebData \ >> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" >> notify="true" is-managed="false" >> location cli-prefer-WebSite WebSite \ >> rule $id="cli-prefer-rule-WebSite" inf: #uname eq sleeze >> colocation WebSite-with-WebFS inf: WebSite WebFS >> colocation fs_on_drbd inf: WebFS WebDataClone:Master >> colocation website-with-ip inf: WebSite ClusterIP >> order WebFS-after-WebData inf: WebDataClone:promote WebFS:start >> order WebSite-after-WebFS inf: WebFS WebSite >> order apache-after-ip inf: ClusterIP WebSite >> property $id="cib-bootstrap-options" \ >> dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \ >> cluster-infrastructure="openais" \ >> expected-quorum-votes="3" \ >> stonith-enabled="false" \ >> last-lrm-refresh="1298043091" >> rsc_defaults $id="rsc-options" \ >> resource-stickiness="100" >> >> >> Here are a couple of bad looking lines from the debug output:- >> >> >> /usr/lib/ocf/resource.d/linbit/drbd: 1: [[: not found >> /usr/lib/ocf/resource.d/linbit/drbd: 1: 0x080307: not found >> /usr/lib/ocf/resource.d/linbit/drbd: 1: Bad substitution >> >> >> n.b. See full debug report at http://pastebin.com/pjKxBu8K >> >> OCF Return Code: 2 >> OCF Alias: OCF_ERR_ARGS >> Description: "The resource's configuration is not valid on this machine. >> Eg. Refers to a location/tool not found on the node." >> Recovery Type: hard >> >> Let me know if there's anything else I need to post. >> >> Kind regards, >> Jon > > Hi, > > This appeared to be a problem running 3 nodes. Stopping corosync on one of > the nodes levitated the problem. > > Is it possible to have a 3 node cluster, 3 running apache, 2 running DRBD? Should be possible > If so, can someone point me in the direction of how to. Depends on what errors are being thrown -- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster