On Thu, Feb 28, 2019 at 5:05 PM Jakub Kicinski <kubakici@xxxxx> wrote: > > On Thu, 28 Feb 2019 16:20:28 -0800, Siwei Liu wrote: > > On Thu, Feb 28, 2019 at 11:56 AM Jakub Kicinski wrote: > > > On Thu, 28 Feb 2019 14:36:56 -0500, Michael S. Tsirkin wrote: > > > > > It is a bit of a the chicken or the egg situation ;) But users can > > > > > just blacklist, too. Anyway, I think this is far better than module > > > > > parameters > > > > > > > > Sorry I'm a bit confused. What is better than what? > > > > > > I mean that blacklist net_failover or module param to disable > > > net_failover and handle in user space are better than trying to solve > > > the renaming at kernel level (either by adding module params that make > > > the kernel rename devices or letting user space change names of running > > > devices if they are slaves). > > > > Before I was aksed to revive this old mail thread, I knew the > > discussion could end up with something like this. Yes, theoretically > > there's a point - basically you don't believe kernel should take risk > > in fixing the issue, so you push back the hope to something in > > hypothesis that actually wasn't done and hard to get done in reality. > > It's not too different than saying "hey, what you're asking for is > > simply wrong, don't do it! Go back to modify userspace to create a > > bond or team instead!" FWIW I want to emphasize that the debate for > > what should be the right place to implement this failover facility: > > userspace versus kernel, had been around for almost a decade, and no > > real work ever happened in userspace to "standardize" this in the > > Linux world. > > Let me offer you my very subjective opinion of why "no real work ever > happened in user space". The actors who have primary interest to get > the auto-bonding working are HW vendors trying to either convince > customers to use SR-IOV, or being pressured by customers to make SR-IOV > easier to consume. HW vendors hire driver developers, not user space > developers. So the solution we arrive at is in the kernel for a non > technical reason (Conway's law, sort of). > > $ cd NetworkManager/ > $ git log --pretty=format:"%ae" | \ > grep '\(mellanox\|intel\|broadcom\|netronome\)' | sort | uniq -c > 81 andrew.zaborowski@xxxxxxxxx > 2 David.Woodhouse@xxxxxxxxx > 2 ismo.puustinen@xxxxxxxxx > 1 michael.i.doherty@xxxxxxxxx > > Andrew works on WiFi. > I'm sorry, but we don't use NetworkManager in our cloud images at all. We sufferd from lots of problems when booting from remote iSCSI disk with NetworkManager enabled, and it looks like those issues are still there while that's not (my subjective impression) a network config tool mainly targeting desktop and WiFi users ever cares about. At least a sign of lack of sufficient testing was made there. >From cloud service provider perspective, we always prefer single central solution than speak to various distro vendors with their own network daemons/config tools thus different solutions. It's hard to coordicate all efforts in one place. From my personal perspetive, the in-kernel auto-slave solution is nothing technically inferior than any userspace implementation, and every major OS/cloud providers choose to implement this in-kernel model for the same reason. I don't want to argue more if there's value or not for net_failover to be in Linux kernel, given that it's already there I think it's better to move on. We have done extensive work in reporting (actually, fix them internally before posting) issues to the dracut, udev, initramfs-tools, and cloud-init community. Although as claimed the 3-netdev should be transparent to userspace in general, the reality is opposite: the effort is nothing differenet than bring up a new type of virutal bond than any existing userspace tool would otherwise expect for a regular physical netdev. If there's ever concern about breaking userspace, I bet no one ever tries to start using it. If they did they know what I am saying. The dup MAC address setting and plugging order are totally new to userspace that none of userspace tools fail to know how to plumb failover interface in a proper way, if without fixing them one or another. -Siwei > I have asked the NetworkManager folks to implement this feature last > year when net_failover got dangerously close to getting merged, and > they said they were never approached with this request before, much less > offered code that solve it. Unfortunately before they got around to it > net_failover was merged already, and they didn't proceed. > > So to my knowledge nobody ever tried to solve this in user space. > I don't think net_failover is particularly terrible, or that renaming > of primary in the kernel is the end of the world, but I'd appreciate if > you could point me to efforts to solve it upstream in user space > components, or acknowledge that nobody actually tried that. _______________________________________________ Virtualization mailing list Virtualization@xxxxxxxxxxxxxxxxxxxxxxxxxx https://lists.linuxfoundation.org/mailman/listinfo/virtualization