Re: [PATCH net-next] bonding: allow bond in mode balance-alb to work properly in bridge -try4.1

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Tue, May 26, 2009 at 06:59:53PM CEST, dada1@xxxxxxxxxxxxx wrote:
>Jiri Pirko a écrit :
>> [PATCH net-next] bonding: allow bond in mode balance-alb to work properly in bridge -try4.1
>> 
>> Hi all.
>> 
>> The problem is described in following bugzilla:
>> https://bugzilla.redhat.com/show_bug.cgi?id=487763
>> 
>> Basically here's what's going on. In every mode, bonding interface uses the same
>> mac address for all enslaved devices (except fail_over_mac). Only balance-alb
>> will simultaneously use multiple MAC addresses across different slaves. When you
>> put this kind of bond device into a bridge it will only add one of mac adresses
>> into a hash list of mac addresses, say X. This mac address is marked as local.
>> But this bonding interface also has mac address Y. Now then packet arrives with
>> destination address Y, this address is not marked as local and the packed looks
>> like it needs to be forwarded. This packet is then lost which is wrong.
>> 
>> Notice that interfaces can be added and removed from bond while it is in bridge.
>> 
>> ***
>> When the multiple addresses for bridge port approach failed to solve this issue
>> due to STP I started to think other way to solve this. I returned to previous
>> solution but tweaked one.
>> 
>> This patch solves the situation in the bonding without touching bridge code.
>> For every incoming frame to bonding the destination address is compared to
>> current address of the slave device from which tha packet came. If these two
>> match destination address is replaced by mac address of the master. This address
>> is known by bridge so it is delivered properly.
>> 
>> I experimentally tried that this works as good as searching through the slave
>> list (v4 of this patch).
>> 
>> I was forced to create a new header because I need to use
>> compare_ether_addr_64bits() (defined in linux/etherdevice.h) in
>> linux/netdevice.h. I've hit some cross include issues. I think that it's good
>> to have skb_bond_should_drop() in a separate file anyway.
>> 
>> Jirka
>> 
>> 
>> Signed-off-by: Jiri Pirko <jpirko@xxxxxxxxxx>
>> 
>> diff --git a/include/linux/bonding.h b/include/linux/bonding.h
>> new file mode 100644
>> index 0000000..3081ddb
>> --- /dev/null
>> +++ b/include/linux/bonding.h
>> @@ -0,0 +1,78 @@
>> +/*
>> + * include/linux/bonding.h
>> + *
>> + * Copyright (C) 2009 Jiri Pirko <jpirko@xxxxxxxxxx>
>> + *
>> + * This program is free software; you can redistribute it and/or modify
>> + * it under the terms of the GNU General Public License version 2
>> + * as published by the Free Software Foundation.
>> + *
>> + * Bonding device helpers.
>> + */
>> +
>> +#ifndef _LINUX_BONDING_H
>> +#define _LINUX_BONDING_H
>> +
>> +#ifdef __KERNEL__
>> +
>> +#include <linux/skbuff.h>
>> +#include <linux/netdevice.h>
>> +#include <linux/if.h>
>> +#include <linux/etherdevice.h>
>> +#include <linux/if_ether.h>
>> +
>> +static inline void skb_bond_set_mac_by_master(struct sk_buff *skb,
>> +					      struct net_device *dev,
>> +					      struct net_device *master)
>> +{
>> +	unsigned char *dest = eth_hdr(skb)->h_dest;
>> +
>> +	if (compare_ether_addr_64bits(dest, master->dev_addr) &&
>> +	    !compare_ether_addr_64bits(dest, dev->dev_addr))
>> +		memcpy(dest, master->dev_addr, ETH_ALEN);
>
>But couldnt we test skb->pkt_type == PACKET_HOST instead,
>Or eth_type_trans() not yet called at this point ?
>
>I would suggest :
>
>if (skb->pkt_type == PACKET_HOST)
>	memcpy(dest, master->dev_addr, ETH_ALEN);

Yes Eric, you are right, good point.

eth_type_trans() is called in any driver before and
compare_ether_addr_64bits(dest, dev->dev_addr) is done there. So it's safe
to use this here (and save some ticks). I'm going to make a new patch and
test it.

		Jirka
>
>> +}
>> +
>> +/* On bonding slaves other than the currently active slave, suppress
>> + * duplicates except for 802.3ad ETH_P_SLOW, alb non-mcast/bcast, and
>> + * ARP on active-backup slaves with arp_validate enabled.
>> + */
>> +static inline int skb_bond_should_drop(struct sk_buff *skb)
>> +{
>> +	struct net_device *dev = skb->dev;
>> +	struct net_device *master = dev->master;
>> +
>> +	if (master) {
>> +		if (master->priv_flags & IFF_MASTER_ARPMON)
>> +			dev->last_rx = jiffies;
>> +
>> +		if ((master->priv_flags & IFF_MASTER_ALB) && master->br_port) {
>> +			/* Do address unmangle. The local destination address
>> +			 * will be always the one master has. Provides the right
>> +			 * functionality in a bridge.
>> +			 */
>> +			skb_bond_set_mac_by_master(skb, dev, master);
>> +		}
>> +
>> +		if (dev->priv_flags & IFF_SLAVE_INACTIVE) {
>> +			if ((dev->priv_flags & IFF_SLAVE_NEEDARP) &&
>> +			    skb->protocol == __cpu_to_be16(ETH_P_ARP))
>> +				return 0;
>> +
>> +			if (master->priv_flags & IFF_MASTER_ALB) {
>> +				if (skb->pkt_type != PACKET_BROADCAST &&
>> +				    skb->pkt_type != PACKET_MULTICAST)
>> +					return 0;
>> +			}
>> +			if (master->priv_flags & IFF_MASTER_8023AD &&
>> +			    skb->protocol == __cpu_to_be16(ETH_P_SLOW))
>> +				return 0;
>> +
>> +			return 1;
>> +		}
>> +	}
>> +	return 0;
>> +}
>> +
>> +#endif /* __KERNEL__ */
>> +
>> +#endif	/* _LINUX_BONDING_H */
>> diff --git a/include/linux/netdevice.h b/include/linux/netdevice.h
>> index ae3c209..06e24ae 100644
>> --- a/include/linux/netdevice.h
>> +++ b/include/linux/netdevice.h
>> @@ -1897,39 +1897,6 @@ static inline void netif_set_gso_max_size(struct net_device *dev,
>>  	dev->gso_max_size = size;
>>  }
>>  
>> -/* On bonding slaves other than the currently active slave, suppress
>> - * duplicates except for 802.3ad ETH_P_SLOW, alb non-mcast/bcast, and
>> - * ARP on active-backup slaves with arp_validate enabled.
>> - */
>> -static inline int skb_bond_should_drop(struct sk_buff *skb)
>> -{
>> -	struct net_device *dev = skb->dev;
>> -	struct net_device *master = dev->master;
>> -
>> -	if (master) {
>> -		if (master->priv_flags & IFF_MASTER_ARPMON)
>> -			dev->last_rx = jiffies;
>> -
>> -		if (dev->priv_flags & IFF_SLAVE_INACTIVE) {
>> -			if ((dev->priv_flags & IFF_SLAVE_NEEDARP) &&
>> -			    skb->protocol == __cpu_to_be16(ETH_P_ARP))
>> -				return 0;
>> -
>> -			if (master->priv_flags & IFF_MASTER_ALB) {
>> -				if (skb->pkt_type != PACKET_BROADCAST &&
>> -				    skb->pkt_type != PACKET_MULTICAST)
>> -					return 0;
>> -			}
>> -			if (master->priv_flags & IFF_MASTER_8023AD &&
>> -			    skb->protocol == __cpu_to_be16(ETH_P_SLOW))
>> -				return 0;
>> -
>> -			return 1;
>> -		}
>> -	}
>> -	return 0;
>> -}
>> -
>>  extern struct pernet_operations __net_initdata loopback_net_ops;
>>  
>>  static inline int dev_ethtool_get_settings(struct net_device *dev,
>> diff --git a/net/8021q/vlan_core.c b/net/8021q/vlan_core.c
>> index 7f7de1a..c6eae40 100644
>> --- a/net/8021q/vlan_core.c
>> +++ b/net/8021q/vlan_core.c
>> @@ -2,6 +2,7 @@
>>  #include <linux/netdevice.h>
>>  #include <linux/if_vlan.h>
>>  #include <linux/netpoll.h>
>> +#include <linux/bonding.h>
>>  #include "vlan.h"
>>  
>>  /* VLAN rx hw acceleration helper.  This acts like netif_{rx,receive_skb}(). */
>> diff --git a/net/core/dev.c b/net/core/dev.c
>> index 241613f..221b43f 100644
>> --- a/net/core/dev.c
>> +++ b/net/core/dev.c
>> @@ -127,6 +127,7 @@
>>  #include <linux/jhash.h>
>>  #include <linux/random.h>
>>  #include <trace/napi.h>
>> +#include <linux/bonding.h>
>>  
>>  #include "net-sysfs.h"
>>  
>> --
>> To unsubscribe from this list: send the line "unsubscribe netdev" in
>> the body of a message to majordomo@xxxxxxxxxxxxxxx
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>> 
>> 
>
>
_______________________________________________
Bridge mailing list
Bridge@xxxxxxxxxxxxxxxxxxxxxxxxxx
https://lists.linux-foundation.org/mailman/listinfo/bridge


[Index of Archives]     [Netdev]     [AoE Tools]     [Linux Wireless]     [Kernel Newbies]     [Security]     [Linux for Hams]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux RAID]     [Linux Admin]     [Samba]     [Video 4 Linux]

  Powered by Linux