After I started the pcsd service manually,
nfs-ganesha could be enabled successfully, but there was
no virtual IP present on the interfaces and looking at
the system log, I noticed corosync failed to start:
- on the host where I issued the gluster nfs-ganesha
enable command:
Sep 21 17:07:18 iron systemd: Starting NFS-Ganesha
file server...
Sep 21 17:07:19 iron systemd: Started NFS-Ganesha
file server.
Sep 21 17:07:19 iron rpc.statd[2409]: Received
SM_UNMON_ALL request from
iron.int.rdmedia.com
while not monitoring any hosts
Sep 21 17:07:20 iron systemd: Starting Corosync
Cluster Engine...
Sep 21 17:07:20 iron corosync[3426]: [MAIN ]
Corosync Cluster Engine ('2.3.4'): started and ready
to provide service.
Sep 21 17:07:20 iron corosync[3426]: [MAIN ]
Corosync built-in features: dbus systemd xmlconf snmp
pie relro bindnow
Sep 21 17:07:20 iron corosync[3427]: [TOTEM ]
Initializing transport (UDP/IP Unicast).
Sep 21 17:07:20 iron corosync[3427]: [TOTEM ]
Initializing transmit/receive security (NSS) crypto:
none hash: none
Sep 21 17:07:20 iron corosync[3427]: [TOTEM ] The
network interface [10.100.30.38] is now up.
Sep 21 17:07:20 iron corosync[3427]: [SERV ]
Service engine loaded: corosync configuration map
access [0]
Sep 21 17:07:20 iron corosync[3427]: [QB ]
server name: cmap
Sep 21 17:07:20 iron corosync[3427]: [SERV ]
Service engine loaded: corosync configuration service
[1]
Sep 21 17:07:20 iron corosync[3427]: [QB ]
server name: cfg
Sep 21 17:07:20 iron corosync[3427]: [SERV ]
Service engine loaded: corosync cluster closed process
group service v1.01 [2]
Sep 21 17:07:20 iron corosync[3427]: [QB ]
server name: cpg
Sep 21 17:07:20 iron corosync[3427]: [SERV ]
Service engine loaded: corosync profile loading
service [4]
Sep 21 17:07:20 iron corosync[3427]: [QUORUM] Using
quorum provider corosync_votequorum
Sep 21 17:07:20 iron corosync[3427]: [VOTEQ ]
Waiting for all cluster members. Current votes: 1
expected_votes: 2
Sep 21 17:07:20 iron corosync[3427]: [SERV ]
Service engine loaded: corosync vote quorum service
v1.0 [5]
Sep 21 17:07:20 iron corosync[3427]: [QB ]
server name: votequorum
Sep 21 17:07:20 iron corosync[3427]: [SERV ]
Service engine loaded: corosync cluster quorum service
v0.1 [3]
Sep 21 17:07:20 iron corosync[3427]: [QB ]
server name: quorum
Sep 21 17:07:20 iron corosync[3427]: [TOTEM ]
adding new UDPU member {10.100.30.38}
Sep 21 17:07:20 iron corosync[3427]: [TOTEM ]
adding new UDPU member {10.100.30.37}
Sep 21 17:07:20 iron corosync[3427]: [TOTEM ] A new
membership (
10.100.30.38:104)
was formed. Members joined: 1
Sep 21 17:07:20 iron corosync[3427]: [VOTEQ ]
Waiting for all cluster members. Current votes: 1
expected_votes: 2
Sep 21 17:07:20 iron corosync[3427]: [VOTEQ ]
Waiting for all cluster members. Current votes: 1
expected_votes: 2
Sep 21 17:07:20 iron corosync[3427]: [VOTEQ ]
Waiting for all cluster members. Current votes: 1
expected_votes: 2
Sep 21 17:07:20 iron corosync[3427]: [QUORUM]
Members[1]: 1
Sep 21 17:07:20 iron corosync[3427]: [MAIN ]
Completed service synchronization, ready to provide
service.
Sep 21 17:07:20 iron corosync[3427]: [TOTEM ] A new
membership (
10.100.30.37:108)
was formed. Members joined: 1
Sep 21 17:08:21 iron corosync: Starting Corosync
Cluster Engine (corosync): [FAILED]
Sep 21 17:08:21 iron systemd: corosync.service:
control process exited, code=exited status=1
Sep 21 17:08:21 iron systemd: Failed to start
Corosync Cluster Engine.
Sep 21 17:08:21 iron systemd: Unit corosync.service
entered failed state.
Sep 21 17:07:19 cobalt systemd: Starting Preprocess
NFS configuration...
Sep 21 17:07:19 cobalt systemd: Starting RPC Port
Mapper.
Sep 21 17:07:19 cobalt systemd: Reached target RPC
Port Mapper.
Sep 21 17:07:19 cobalt systemd: Starting Host and
Network Name Lookups.
Sep 21 17:07:19 cobalt systemd: Reached target Host
and Network Name Lookups.
Sep 21 17:07:19 cobalt systemd: Starting RPC bind
service...
Sep 21 17:07:19 cobalt systemd: Started Preprocess
NFS configuration.
Sep 21 17:07:19 cobalt systemd: Started RPC bind
service.
Sep 21 17:07:19 cobalt systemd: Starting NFS status
monitor for NFSv2/3 locking....
Sep 21 17:07:19 cobalt rpc.statd[2662]: Version
1.3.0 starting
Sep 21 17:07:19 cobalt rpc.statd[2662]: Flags:
TI-RPC
Sep 21 17:07:19 cobalt systemd: Started NFS status
monitor for NFSv2/3 locking..
Sep 21 17:07:19 cobalt systemd: Starting
NFS-Ganesha file server...
Sep 21 17:07:19 cobalt systemd: Started NFS-Ganesha
file server.
Sep 21 17:07:19 cobalt kernel: warning:
`ganesha.nfsd' uses 32-bit capabilities (legacy
support in use)
Sep 21 17:07:19 cobalt logger: setting up
rd-ganesha-ha
Sep 21 17:07:19 cobalt rpc.statd[2662]: Received
SM_UNMON_ALL request from
cobalt.int.rdmedia.com
while not monitoring any hosts
Sep 21 17:07:19 cobalt logger: setting up cluster
rd-ganesha-ha with the following cobalt iron
Sep 21 17:07:20 cobalt systemd: Stopped Pacemaker
High Availability Cluster Manager.
Sep 21 17:07:20 cobalt systemd: Stopped Corosync
Cluster Engine.
Sep 21 17:07:20 cobalt systemd: Reloading.
Sep 21 17:07:20 cobalt systemd:
[/usr/lib/systemd/system/dm-event.socket:10] Unknown
lvalue 'RemoveOnStop' in section 'Socket'
Sep 21 17:07:20 cobalt systemd:
[/usr/lib/systemd/system/lvm2-lvmetad.socket:9]
Unknown lvalue 'RemoveOnStop' in section 'Socket'
Sep 21 17:07:20 cobalt systemd: Reloading.
Sep 21 17:07:20 cobalt systemd:
[/usr/lib/systemd/system/dm-event.socket:10] Unknown
lvalue 'RemoveOnStop' in section 'Socket'
Sep 21 17:07:20 cobalt systemd:
[/usr/lib/systemd/system/lvm2-lvmetad.socket:9]
Unknown lvalue 'RemoveOnStop' in section 'Socket'
Sep 21 17:07:20 cobalt systemd: Starting Corosync
Cluster Engine...
Sep 21 17:07:20 cobalt corosync[2816]: [MAIN ]
Corosync Cluster Engine ('2.3.4'): started and ready
to provide service.
Sep 21 17:07:20 cobalt corosync[2816]: [MAIN ]
Corosync built-in features: dbus systemd xmlconf snmp
pie relro bindnow
Sep 21 17:07:20 cobalt corosync[2817]: [TOTEM ]
Initializing transport (UDP/IP Unicast).
Sep 21 17:07:20 cobalt corosync[2817]: [TOTEM ]
Initializing transmit/receive security (NSS) crypto:
none hash: none
Sep 21 17:07:21 cobalt corosync[2817]: [TOTEM ] The
network interface [10.100.30.37] is now up.
Sep 21 17:07:21 cobalt corosync[2817]: [SERV ]
Service engine loaded: corosync configuration map
access [0]
Sep 21 17:07:21 cobalt corosync[2817]: [QB ]
server name: cmap
Sep 21 17:07:21 cobalt corosync[2817]: [SERV ]
Service engine loaded: corosync configuration service
[1]
Sep 21 17:07:21 cobalt corosync[2817]: [QB ]
server name: cfg
Sep 21 17:07:21 cobalt corosync[2817]: [SERV ]
Service engine loaded: corosync cluster closed process
group service v1.01 [2]
Sep 21 17:07:21 cobalt corosync[2817]: [QB ]
server name: cpg
Sep 21 17:07:21 cobalt corosync[2817]: [SERV ]
Service engine loaded: corosync profile loading
service [4]
Sep 21 17:07:21 cobalt corosync[2817]: [QUORUM]
Using quorum provider corosync_votequorum
Sep 21 17:07:21 cobalt corosync[2817]: [VOTEQ ]
Waiting for all cluster members. Current votes: 1
expected_votes: 2
Sep 21 17:07:21 cobalt corosync[2817]: [SERV ]
Service engine loaded: corosync vote quorum service
v1.0 [5]
Sep 21 17:07:21 cobalt corosync[2817]: [QB ]
server name: votequorum
Sep 21 17:07:21 cobalt corosync[2817]: [SERV ]
Service engine loaded: corosync cluster quorum service
v0.1 [3]
Sep 21 17:07:21 cobalt corosync[2817]: [QB ]
server name: quorum
Sep 21 17:07:21 cobalt corosync[2817]: [TOTEM ]
adding new UDPU member {10.100.30.37}
Sep 21 17:07:21 cobalt corosync[2817]: [TOTEM ]
adding new UDPU member {10.100.30.38}
Sep 21 17:07:21 cobalt corosync[2817]: [TOTEM ] A
new membership (
10.100.30.37:100)
was formed. Members joined: 1
Sep 21 17:07:21 cobalt corosync[2817]: [VOTEQ ]
Waiting for all cluster members. Current votes: 1
expected_votes: 2
Sep 21 17:07:21 cobalt corosync[2817]: [VOTEQ ]
Waiting for all cluster members. Current votes: 1
expected_votes: 2
Sep 21 17:07:21 cobalt corosync[2817]: [VOTEQ ]
Waiting for all cluster members. Current votes: 1
expected_votes: 2
Sep 21 17:07:21 cobalt corosync[2817]: [QUORUM]
Members[1]: 1
Sep 21 17:07:21 cobalt corosync[2817]: [MAIN ]
Completed service synchronization, ready to provide
service.
Sep 21 17:07:21 cobalt corosync[2817]: [TOTEM ] A
new membership (
10.100.30.37:108)
was formed. Members joined: 1
Sep 21 17:07:21 cobalt corosync[2817]: [VOTEQ ]
Waiting for all cluster members. Current votes: 1
expected_votes: 2
Sep 21 17:07:21 cobalt corosync[2817]: [QUORUM]
Members[1]: 1
Sep 21 17:07:21 cobalt corosync[2817]: [MAIN ]
Completed service synchronization, ready to provide
service.
Sep 21 17:08:50 cobalt systemd: corosync.service
operation timed out. Terminating.
Sep 21 17:08:50 cobalt corosync: Starting Corosync
Cluster Engine (corosync):
Sep 21 17:08:50 cobalt systemd: Failed to start
Corosync Cluster Engine.
Sep 21 17:08:50 cobalt systemd: Unit
corosync.service entered failed state.
Sep 21 17:08:55 cobalt logger: warning: pcs
property set no-quorum-policy=ignore failed
Sep 21 17:08:55 cobalt logger: warning: pcs
property set stonith-enabled=false failed
Sep 21 17:08:55 cobalt logger: warning: pcs
resource create nfs_start ganesha_nfsd
ha_vol_mnt=/var/run/gluster/shared_storage --clone
failed
Sep 21 17:08:56 cobalt logger: warning: pcs
resource delete nfs_start-clone failed
Sep 21 17:08:56 cobalt logger: warning: pcs
resource create nfs-mon ganesha_mon --clone failed
Sep 21 17:08:56 cobalt logger: warning: pcs
resource create nfs-grace ganesha_grace --clone failed
Sep 21 17:08:57 cobalt logger: warning pcs resource
create cobalt-cluster_ip-1 ocf:heartbeat:IPaddr ip=
cidr_netmask=32 op monitor interval=15s failed
Sep 21 17:08:57 cobalt logger: warning: pcs
resource create cobalt-trigger_ip-1
ocf:heartbeat:Dummy failed
Sep 21 17:08:57 cobalt logger: warning: pcs
constraint colocation add cobalt-cluster_ip-1 with
cobalt-trigger_ip-1 failed
Sep 21 17:08:57 cobalt logger: warning: pcs
constraint order cobalt-trigger_ip-1 then
nfs-grace-clone failed
Sep 21 17:08:57 cobalt logger: warning: pcs
constraint order nfs-grace-clone then
cobalt-cluster_ip-1 failed
Sep 21 17:08:57 cobalt logger: warning pcs resource
create iron-cluster_ip-1 ocf:heartbeat:IPaddr ip=
cidr_netmask=32 op monitor interval=15s failed
Sep 21 17:08:57 cobalt logger: warning: pcs
resource create iron-trigger_ip-1 ocf:heartbeat:Dummy
failed
Sep 21 17:08:57 cobalt logger: warning: pcs
constraint colocation add iron-cluster_ip-1 with
iron-trigger_ip-1 failed
Sep 21 17:08:57 cobalt logger: warning: pcs
constraint order iron-trigger_ip-1 then
nfs-grace-clone failed
Sep 21 17:08:58 cobalt logger: warning: pcs
constraint order nfs-grace-clone then
iron-cluster_ip-1 failed
Sep 21 17:08:58 cobalt logger: warning: pcs
constraint location cobalt-cluster_ip-1 rule
score=-INFINITY ganesha-active ne 1 failed
Sep 21 17:08:58 cobalt logger: warning: pcs
constraint location cobalt-cluster_ip-1 prefers
iron=1000 failed
Sep 21 17:08:58 cobalt logger: warning: pcs
constraint location cobalt-cluster_ip-1 prefers
cobalt=2000 failed
Sep 21 17:08:58 cobalt logger: warning: pcs
constraint location iron-cluster_ip-1 rule
score=-INFINITY ganesha-active ne 1 failed
Sep 21 17:08:58 cobalt logger: warning: pcs
constraint location iron-cluster_ip-1 prefers
cobalt=1000 failed
Sep 21 17:08:58 cobalt logger: warning: pcs
constraint location iron-cluster_ip-1 prefers
iron=2000 failed
Sep 21 17:08:58 cobalt logger: warning pcs cluster
cib-push /tmp/tmp.nXTfyA1GMR failed
Sep 21 17:08:58 cobalt logger: warning: scp
ganesha-ha.conf to cobalt failed
BTW, I'm using CentOS 7. There are multiple network
interfaces on the servers, could that be a problem?