TCP failover doesn't work as expected

Donat Zenichev <donat@xxxxxxx> · Mon, 5 Mar 2018 12:29:37 +0200

Hi community.
I don't know if this mailing list is still alive, but will try to find 
answers here.

I'm trying to build up a tcp failover cluster.
I'm trying to save and restore active tcp sessions (that belong to 
master side) on the salve side, when master experience the failover. So 
that, I would have needed tcp sockets opened on slave side (that were 
indeed started on the master side).

The main goal is to make it working for kamailio (SER) daemon. I'm 
trying to reach real-time HA cluster for calls that are being on the 
line and save them when master experience the failure.

The "." is changed to "_dot_" cuz your firewall blocks all my messages.

What I've already done:
1. Created well-formed rule set for iptables:
-A INPUT -i eth0 -m state --state RELATED,ESTABLISHED -j ACCEPT
-A INPUT -i eth0 -d 10_dot_100_dot_100_dot_27/32 -j DROP
-A INPUT -m state --state INVALID -j LOG

-P FORWARD DROP
-A FORWARD -i eth0 -m state --state RELATED,ESTABLISHED -j ACCEPT
-A FORWARD -i eth0 -d 10_dot_100_dot_100_dot_27/32 -j DROP
-A FORWARD -m state --state INVALID -j LOG

where 10_dot_100_dot_100_dot_27 - VIP address
The same rule set is stored on the slave side.

2. Configured conntracd:
Sync {
    Mode FTFW {
        DisableExternalCache Off
        CommitTimeout 1800
        PurgeTimeout 5
    }

    UDP {
        IPv4_address 10_dot_100_dot_100_dot_28
        IPv4_Destination_Address 10_dot_100_dot_100_dot_29
        Port 3780
        Interface eth0
        SndSocketBuffer 1249280
        RcvSocketBuffer 1249280
        Checksum on
    }

    Options {
        ExpectationSync {
                sip
                ftp
        }
    }
}

General {
    Nice -20
    HashSize 32768
    HashLimit 131072
    LogFile on
    Syslog on
    LockFile /var/lock/conntrack.lock
    UNIX {
        Path /var/run/conntrackd.ctl
        Backlog 20
    }
    NetlinkBufferSize 2097152
    NetlinkBufferSizeMaxGrowth 8388608
    Filter From Userspace {
        Protocol Accept {
            TCP
            UDP
            ICMP
        }
        Address Ignore {
        }
    }
}

where 10_dot_100_dot_100_dot_28 - master and 10_dot_100_dot_100_dot_29 - 
salve.
The same config file is stored on the slave side, but addresses in UDP 
section are swapped.
I tried to use Address Ignore block, where I made an effort to add ip 
addresses belong to the node, but with this one it didn't work at all - 
there was no exchange of conntrackd traffic between cluster nodes. So I 
leaved it empty.

3. Configured keepalived:

vrrp_instance E1 {
    interface eth0
    state BACKUP
    virtual_router_id 61
    advert_int 1
    authentication {
        auth_type PASS
        auth_pass herepassword
    }
    virtual_ipaddress {
        10_dot_100_dot_100_dot_27/27 dev eth0
    }
    nopreempt
    garp_master_delay 1

    notify_master "/etc/conntrackd/primary-backup.sh primary"
    notify_backup "/etc/conntrackd/primary-backup.sh backup"
    notify_fault "/etc/conntrackd/primary-backup.sh fault"
}

where primary-backup.sh is a script, that is provided with conntrackd 
libraries.
You will ask me, why I don't use dedicated link for conntrackd? I used 
it for a while, but as matter of fact it didn't change anything, so I 
simplified the assignment for myself and made it deprecated.

How the process of failover looks like for the current moment:

1. I use telnet/ssh/ftp to connect to VIP address located (for current 
moment) on master side;
1.1. Master side experience a fail - I bring down the eth0 link;
2. Backup node see the problem and execute: 
/etc/conntrackd/primary-backup.sh primary
    so the following sequence of conntrackd command are executed:
    /usr/sbin/conntrackd -C /etc/conntrackd/conntrackd.conf -c
    /usr/sbin/conntrackd -C /etc/conntrackd/conntrackd.conf -f
    /usr/sbin/conntrackd -C /etc/conntrackd/conntrackd.conf -R
    /usr/sbin/conntrackd -C /etc/conntrackd/conntrackd.conf -B

3. I can see the needed telnet/ssh/ftp session on the backup node by 
command: conntrackd -i
    It has state  - ESTABLISHED state (I'm confident that this is 
session I need, cuz I remember the client's port was used for connection 
on master node).

4. But when I try to send packets (commands) from my client, server 
resets the TCP session with [R] flag. Tcpdump output on the backups node 
shows only 2 rows:

11:12:26.621001 IP telnet_dot_client_dot_test.56238 > 
10_dot_100_dot_100_dot_27.telnet: Flags [P.], seq 1380562259:1380562261, 
ack 1731331297, win 237, options [nop,nop,TS val 43239685 ecr 81905017], 
length 2
11:12:26.621083 IP 10_dot_100_dot_100_dot_27.telnet > 
telnet_dot_client_dot_test.56238: Flags [R], seq 1731331297, win 0, length 0

As you can see, firewall accepts the traffic (in INPUT and FORWARD 
chains),this means that session exists on the backup internal 
cache/kernel table (otherwise iptables would drop the packet), but it 
then resets it, why?
I tried to test it with ssh, telnet and ftp. No success at all.
I also tried to remove flushing command, so that sequence was changed to :
/usr/sbin/conntrackd -C /etc/conntrackd/conntrackd.conf -c
/usr/sbin/conntrackd -C /etc/conntrackd/conntrackd.conf -R

and it also didn't work.

So if someone has the needed experience, please don't be apathetic, help 
a bit.
At least I need a hint where to look for a problem.

--
BR, Donat Zenichev
Wnet VoIP Team
Tel:  +380(44) 5-900-800
--
To unsubscribe from this list: send the line "unsubscribe netfilter" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html