If I recall correctly, VMware doesn't do multicast properly. I'm not
sure though, I don't use it.

Try unicast with no RRP. See if that works.

On 23/11/13 23:16, Slava Bendersky wrote:
> Hello Digimer,
> All machines are RHEL 6.4 running on VMware; there is no physical
> switch, only the VMware one. I set rrp to none and the cluster is formed.
> With this config I am getting constant error messages.
>
> [root@eusipgw01 ~]# cat /etc/redhat-release
> Red Hat Enterprise Linux Server release 6.4 (Santiago)
>
> [root@eusipgw01 ~]# rpm -qa | grep corosync
> corosync-1.4.1-15.el6.x86_64
> corosynclib-1.4.1-15.el6.x86_64
>
>
> [2013-11-23 22:46:20] WARNING[2057] res_corosync.c: CPG mcast failed (6)
> [2013-11-23 22:46:20] WARNING[2057] res_corosync.c: CPG mcast failed (6)
>
> iptables
>
> -A INPUT -i eth1 -p udp -m state --state NEW -m udp --dport 5404:5407 -j NFLOG --nflog-prefix "dmz_ext2fw: " --nflog-group 2
> -A INPUT -i eth1 -m pkttype --pkt-type multicast -j NFLOG --nflog-prefix "dmz_ext2fw: " --nflog-group 2
> -A INPUT -i eth1 -m pkttype --pkt-type unicast -j NFLOG --nflog-prefix "dmz_ext2fw: " --nflog-group 2
> -A INPUT -i eth1 -p igmp -j NFLOG --nflog-prefix "dmz_ext2fw: " --nflog-group 2
> -A INPUT -j ACCEPT
>
>
> ------------------------------------------------------------------------
> *From: *"Digimer" <lists@xxxxxxxxxx>
> *To: *"Slava Bendersky" <volga629@xxxxxxxxxxxxx>
> *Cc: *discuss@xxxxxxxxxxxx
> *Sent: *Saturday, November 23, 2013 10:34:00 PM
> *Subject: *Re: information request
>
> I don't think you ever said what OS you have. I've never had to set
> anything in sysctl.conf on RHEL/CentOS 6. Did you try disabling RRP
> entirely? If you have a managed switch, make sure persistent multicast
> groups are enabled or try a different switch entirely.
>
> *Something* is interrupting your network traffic. What does
> iptables-save show? Are these physical or virtual machines?
>
> The more information about your environment that you can share, the
> better we can help.
>
> On 23/11/13 22:29, Slava Bendersky wrote:
>> Hello Digimer,
>> As an idea, might it be some settings in sysctl.conf?
>>
>> Slava.
>>
>>
>> ------------------------------------------------------------------------
>> *From: *"Slava Bendersky" <volga629@xxxxxxxxxxxxx>
>> *To: *"Digimer" <lists@xxxxxxxxxx>
>> *Cc: *discuss@xxxxxxxxxxxx
>> *Sent: *Saturday, November 23, 2013 10:27:22 PM
>> *Subject: *Re: information request
>>
>> Hello Digimer,
>> Yes, I set it to passive, and selinux is disabled.
>>
>> [root@eusipgw01 ~]# sestatus
>> SELinux status: disabled
>> [root@eusipgw01 ~]# cat /etc/corosync/corosync.conf
>> totem {
>>     version: 2
>>     token: 160
>>     token_retransmits_before_loss_const: 3
>>     join: 250
>>     consensus: 300
>>     vsftype: none
>>     max_messages: 20
>>     threads: 0
>>     nodeid: 2
>>     rrp_mode: passive
>>     interface {
>>         ringnumber: 0
>>         bindnetaddr: 10.10.10.0
>>         mcastaddr: 226.94.1.1
>>         mcastport: 5405
>>     }
>> }
>>
>> logging {
>>     fileline: off
>>     to_stderr: yes
>>     to_logfile: yes
>>     to_syslog: off
>>     logfile: /var/log/cluster/corosync.log
>>     debug: off
>>     timestamp: on
>>     logger_subsys {
>>         subsys: AMF
>>         debug: off
>>     }
>> }
>>
>>
>> Slava.
>>
>> ------------------------------------------------------------------------
>> *From: *"Digimer" <lists@xxxxxxxxxx>
>> *To: *"Slava Bendersky" <volga629@xxxxxxxxxxxxx>
>> *Cc: *"Steven Dake" <sdake@xxxxxxxxxx>, discuss@xxxxxxxxxxxx
>> *Sent: *Saturday, November 23, 2013 7:04:43 PM
>> *Subject: *Re: information request
>>
>> First up, I'm not Steven. Secondly, did you follow Steven's
>> recommendation to not use active RRP? Does the cluster form with no RRP
>> at all? Is selinux enabled?
>>
>> On 23/11/13 18:29, Slava Bendersky wrote:
>>> Hello Steven,
>>> In multicast the log is filling with this message:
>>>
>>> Nov 24 00:26:28 corosync [TOTEM ] A processor failed, forming new configuration.
>>> Nov 24 00:26:28 corosync [TOTEM ] A processor joined or left the membership and a new membership was formed.
>>> Nov 24 00:26:31 corosync [CPG ] chosen downlist: sender r(0) ip(10.10.10.1) ; members(old:2 left:0)
>>> Nov 24 00:26:31 corosync [MAIN ] Completed service synchronization, ready to provide service.
>>>
>>> In udpu it is not working at all.
>>>
>>> Slava.
>>>
>>>
>>> ------------------------------------------------------------------------
>>> *From: *"Digimer" <lists@xxxxxxxxxx>
>>> *To: *"Slava Bendersky" <volga629@xxxxxxxxxxxxx>
>>> *Cc: *"Steven Dake" <sdake@xxxxxxxxxx>, discuss@xxxxxxxxxxxx
>>> *Sent: *Saturday, November 23, 2013 6:05:56 PM
>>> *Subject: *Re: information request
>>>
>>> So multicast works with the firewall disabled?
>>>
>>> On 23/11/13 17:28, Slava Bendersky wrote:
>>>> Hello Steven,
>>>> I disabled iptables and there is no difference, the error message is
>>>> the same, but at least in multicast it wasn't generating the error.
>>>>
>>>>
>>>> Slava.
>>>>
>>>> ------------------------------------------------------------------------
>>>> *From: *"Digimer" <lists@xxxxxxxxxx>
>>>> *To: *"Slava Bendersky" <volga629@xxxxxxxxxxxxx>, "Steven Dake" <sdake@xxxxxxxxxx>
>>>> *Cc: *discuss@xxxxxxxxxxxx
>>>> *Sent: *Saturday, November 23, 2013 4:37:36 PM
>>>> *Subject: *Re: information request
>>>>
>>>> Does either mcast or unicast work if you disable the firewall? If so,
>>>> then at least you know for sure that iptables is the problem.
>>>>
>>>> The link here shows the iptables rules I use (for corosync in mcast and
>>>> other apps):
>>>>
>>>> https://alteeve.ca/w/AN!Cluster_Tutorial_2#Configuring_iptables
>>>>
>>>> digimer
>>>>
>>>> On 23/11/13 16:12, Slava Bendersky wrote:
>>>>> Hello Steven,
>>>>> This is what I see when set up through UDPU:
>>>>>
>>>>> Nov 23 22:08:13 corosync [MAIN ] Compatibility mode set to whitetank. Using V1 and V2 of the synchronization engine.
>>>>> Nov 23 22:08:13 corosync [TOTEM ] adding new UDPU member {10.10.10.1}
>>>>> Nov 23 22:08:16 corosync [MAIN ] Totem is unable to form a cluster because of an operating system or network fault. The most common cause of this message is that the local firewall is configured improperly.
>>>>>
>>>>>
>>>>> Might I be missing some firewall rules? I allowed unicast.
>>>>>
>>>>> Slava.
>>>>>
>>>>> ------------------------------------------------------------------------
>>>>> *From: *"Steven Dake" <sdake@xxxxxxxxxx>
>>>>> *To: *"Slava Bendersky" <volga629@xxxxxxxxxxxxx>
>>>>> *Cc: *discuss@xxxxxxxxxxxx
>>>>> *Sent: *Saturday, November 23, 2013 10:33:31 AM
>>>>> *Subject: *Re: information request
>>>>>
>>>>>
>>>>> On 11/23/2013 08:23 AM, Slava Bendersky wrote:
>>>>>
>>>>> Hello Steven,
>>>>>
>>>>> My setup
>>>>>
>>>>> 10.10.10.1 primary server -----EoIP tunnel vpn ipsec----- dr server 10.10.10.2
>>>>>
>>>>> Both servers have 2 interfaces: eth0, which is the default gw out,
>>>>> and eth1, where corosync lives.
>>>>>
>>>>> Iptables:
>>>>>
>>>>> -A INPUT -i eth1 -p udp -m state --state NEW -m udp --dport 5404:5407
>>>>> -A INPUT -i eth1 -m pkttype --pkt-type multicast
>>>>> -A INPUT -i eth1 -p igmp
>>>>>
>>>>>
>>>>> Corosync.conf
>>>>>
>>>>> totem {
>>>>>     version: 2
>>>>>     token: 160
>>>>>     token_retransmits_before_loss_const: 3
>>>>>     join: 250
>>>>>     consensus: 300
>>>>>     vsftype: none
>>>>>     max_messages: 20
>>>>>     threads: 0
>>>>>     nodeid: 2
>>>>>     rrp_mode: active
>>>>>     interface {
>>>>>         ringnumber: 0
>>>>>         bindnetaddr: 10.10.10.0
>>>>>         mcastaddr: 226.94.1.1
>>>>>         mcastport: 5405
>>>>>     }
>>>>> }
>>>>>
>>>>> Join message
>>>>>
>>>>> [root@eusipgw01 ~]# corosync-objctl | grep member
>>>>> runtime.totem.pg.mrp.srp.members.2.ip=r(0) ip(10.10.10.2)
>>>>> runtime.totem.pg.mrp.srp.members.2.join_count=1
>>>>> runtime.totem.pg.mrp.srp.members.2.status=joined
>>>>> runtime.totem.pg.mrp.srp.members.1.ip=r(0) ip(10.10.10.1)
>>>>> runtime.totem.pg.mrp.srp.members.1.join_count=254
>>>>> runtime.totem.pg.mrp.srp.members.1.status=joined
>>>>>
>>>>> Is it possible that ping is sent out of the wrong interface?
>>>>>
>>>>> Slava,
>>>>>
>>>>> I wouldn't expect so.
>>>>>
>>>>> Which version?
>>>>>
>>>>> Have you tried udpu instead? If not, it is preferable to multicast
>>>>> unless you want absolute performance on cpg groups. In most cases the
>>>>> performance difference is very small and not worth the trouble of
>>>>> setting up multicast in your network.
>>>>>
>>>>> Fabio had indicated rrp active mode is broken. I don't know the
>>>>> details, but try passive RRP - it is actually better than active
>>>>> IMNSHO :)
>>>>>
>>>>> Regards
>>>>> -steve
>>>>>
>>>>> Slava.
>>>>>
>>>>>
>>>>> ------------------------------------------------------------------------
>>>>> *From: *"Steven Dake" <sdake@xxxxxxxxxx>
>>>>> *To: *"Slava Bendersky" <volga629@xxxxxxxxxxxxx>, discuss@xxxxxxxxxxxx
>>>>> *Sent: *Saturday, November 23, 2013 6:01:11 AM
>>>>> *Subject: *Re: information request
>>>>>
>>>>>
>>>>> On 11/23/2013 12:29 AM, Slava Bendersky wrote:
>>>>>
>>>>> Hello Everyone,
>>>>> Corosync runs on a box with 2 Ethernet interfaces.
>>>>> I am getting this message:
>>>>> CPG mcast failed (6)
>>>>>
>>>>> Thank you in advance for any information.
>>>>>
>>>>>
>>>>> https://github.com/corosync/corosync/blob/master/include/corosync/corotypes.h#L84
>>>>>
>>>>> This can occur because:
>>>>> a) firewall is enabled - there should be something in the logs
>>>>>    telling you to properly configure the firewall
>>>>> b) a config change is in progress - this is a normal response, and
>>>>>    you should try the request again
>>>>> c) a bug in the synchronization code is resulting in a blocked
>>>>>    unsynced cluster
>>>>>
>>>>> c is very unlikely at this point.
>>>>>
>>>>> 2 ethernet interfaces = rrp mode, bonding, or something else?
>>>>>
>>>>> Digimer needs moar infos :)
>>>>>
>>>>> Regards
>>>>> -steve
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> discuss mailing list
>>>>> discuss@xxxxxxxxxxxx
>>>>> http://lists.corosync.org/mailman/listinfo/discuss
>>>>>
>>>>
>>>>
>>>> --
>>>> Digimer
>>>> Papers and Projects: https://alteeve.ca/w/
>>>> What if the cure for cancer is trapped in the mind of a person without
>>>> access to education?
--
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without
access to education?
_______________________________________________
discuss mailing list
discuss@xxxxxxxxxxxx
http://lists.corosync.org/mailman/listinfo/discuss
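
For reference, the "unicast with no RRP" suggestion at the top of the thread might look roughly like the totem section below. This is only a sketch, not a tested configuration: it assumes the corosync 1.4.x udpu syntax (member blocks inside the interface section) and takes the two addresses (10.10.10.1 and 10.10.10.2), the bind network and the port from the configs quoted earlier in the thread. Timing values (token, join, consensus), nodeid and the logging section are left out here and would stay as in the existing file.

```
totem {
    version: 2

    # Assumption: use UDP unicast instead of multicast.
    transport: udpu

    # rrp_mode is deliberately omitted; with a single ring it defaults to none.

    interface {
        ringnumber: 0
        bindnetaddr: 10.10.10.0
        mcastport: 5405

        # Assumption: these are the two cluster nodes named in the thread.
        # Every node, including the local one, is listed as a member.
        member {
            memberaddr: 10.10.10.1
        }
        member {
            memberaddr: 10.10.10.2
        }
    }
}
```

With mcastport set to 5405, corosync also uses port 5404 (mcastport - 1), so both UDP ports need to be accepted on eth1 on both nodes. The same member list should be present on each node.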