On 9/7/11 5:34 AM, "Michael S. Tsirkin" <mst@xxxxxxxxxx> wrote: > On Tue, Sep 06, 2011 at 03:35:40PM -0700, Roopa Prabhu wrote: >> This patch is an attempt at providing address filtering support for macvtap >> devices in PASSTHRU mode. Its still a work in progress. >> Briefly tested for basic functionality. Wanted to get some feedback on the >> direction before proceeding. >> > > Good work, thanks. > Thanks. >> I have hopefully CC'ed all concerned people. > > kvm crowd might also be interested. > Try using ./scripts/get_maintainer.pl as well. > Thanks for the tip. Expanded CC list a bit more. >> PASSTHRU mode today sets the lowerdev in promiscous mode. In PASSTHRU mode >> there is a 1-1 mapping between macvtap device and physical nic or VF. And all >> filtering is done in lowerdev hw. The lowerdev does not need to be in >> promiscous mode as long as the guest filters are passed down to the lowerdev. >> This patch tries to remove the need for putting the lowerdev in promiscous >> mode. >> I have also referred to the thread below where TUNSETTXFILTER was mentioned >> in >> this context: >> http://patchwork.ozlabs.org/patch/69297/ >> >> This patch basically passes the addresses got by TUNSETTXFILTER to macvlan >> lowerdev. >> >> I have looked at previous work and discussions on this for qemu-kvm >> by Michael Tsirkin, Alex Williamson and Dragos Tatulea >> http://patchwork.ozlabs.org/patch/78595/ >> http://patchwork.ozlabs.org/patch/47160/ >> https://patchwork.kernel.org/patch/474481/ >> >> Redhat bugzilla by Michael Tsirkin: >> https://bugzilla.redhat.com/show_bug.cgi?id=655013 >> >> I used Michael's qemu-kvm patch for testing the changes with KVM >> >> I would like to cover both MAC and vlan filtering in this work. >> >> Open Questions/Issues: >> - There is a need for vlan filtering to complete the patch. It will require >> a new tap ioctl cmd for vlans. >> Some ideas on this are: >> >> a) TUNSETVLANFILTER: This will entail we send the whole vlan bitmap filter >> (similar to tun_filter for addresses). Passing the vlan id's to lower >> device will mean going thru the whole list of vlans every time. >> >> OR >> >> b) TUNSETVLAN with vlan id and flag to set/unset >> >> Does option 'b' sound ok ? >> >> - In this implementation we make the macvlan address list same as the address >> list that came in the filter with TUNSETTXFILTER. This will not cover cases >> where the macvlan device needs to have other addresses that are not >> necessarily in the filter. Is this a problem ? > > What cases do you have in mind? > This patch targets only macvlan PASSTHRU mode and for PASSTHRU mode I don't see a problem with uc/mc address list being the same in all the stacked netdevs in the path. I called that out above to make sure I was not missing any case in PASSTHRU mode where this might be invalid. Otherwise I don't see a problem in the simple PASSTHRU use case this patch supports. >> - The patch currently only supports passing of IFF_PROMISC and IFF_MULTICAST >> filter flags to lowerdev >> >> This patch series implements the following >> 01/3 - macvlan: Add support for unicast filtering in macvlan >> 02/3 - macvlan: Add function to set addr filter on lower device in passthru >> mode >> 03/3 - macvtap: Add support for TUNSETTXFILTER >> >> Please comment. Thanks. >> >> Signed-off-by: Roopa Prabhu <roprabhu@xxxxxxxxx> >> Signed-off-by: Christian Benvenuti <benve@xxxxxxxxx> >> Signed-off-by: David Wang <dwang2@xxxxxxxxx> > > The security isn't lower than with promisc, so I don't see > a problem with this as such. > > There are more features we'll want down the road though, > so let's see whether the interface will be able to > satisfy them in a backwards compatible way before we > set it in stone. Here's what I came up with: > > How will the filtering table be partitioned within guests? Since this patch supports macvlan PASSTHRU mode only, in which the lower device has 1-1 mapping to the guest nic, it does not require any partitioning of filtering table within guests. Unless I missed understanding something. If the lower device were being shared by multiple guest network interfaces (non PASSTHRU mode), only then we will need to maintain separate filter tables for each guest network interface in macvlan and forward the pkt to respective guest interface after a filter lookup. This could affect performance too I think. I chose to support PASSTHRU Mode only at first because its simpler and all code additions are in control path only. > > A way to limit what the guest can do would also be useful. > How can this be done? selinux? I vaguely remember a thread on the same context.. had a suggestion to maintain pre-approved address lists and allow guest filter registration of only those addresses for security. This seemed reasonable. Plus the ability to support additional address registration from guest could be made configurable (One of your ideas again from prior work). I am not an selinux expert, but I am thinking we can use it to only allow or disallow access or operations to the macvtap device. (?). I will check more on this. > > Any thoughts on spoofing filtering? I can only think of checking addresses against an allowed address list. Don't know of any other ways. Any hints ? In any case I am assuming all the protection/security measures should be taken at the layer calling the TUNSETTXFILTER ie..In macvtap virtualization use case its libvirt or qemu-kvm. No ? > > Would it be possible to make the filtering programmable > using netlink, e.g. ethtool, ip, or some such? Should be possible via ethtool or ip calling ioctl TUNSETTXFILTER. Are you thinking of macvlan having a netlink interface to set filter and not ioctl ?. Sure. But I was thinking the point of implementing TUNSETTXFILTER was to maintain compatibility with the generic tap interface that does the same thing. And having both the netlink op and ioctl interface might not be clean ?. Sorry if I misunderstood your question. > That would make this useful for bridged setups besides > macvtap/virtualization. > Thanks for the comments. -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html