On Tue, Feb 18, 2020 at 12:11:47PM -0500, Dennis Dalessandro wrote:
> On 2/18/2020 9:04 AM, Leon Romanovsky wrote:
> > On Fri, Feb 14, 2020 at 01:13:53PM -0500, Dennis Dalessandro wrote:
> > > Was there any discussion on the upgrade scenario for existing deployments as
> > > far as device-rename changing node descriptions?
> > >
> > > If someone is running an older version of rdma-core they are going to have a
> > > certain set of node descriptions for each node. This could be in logs, or
> > > configuration databases, who knows what. Now if they upgrade to a new
> > > version of rdma-core their node descriptions all automatically change out
> > > from under them by default.
> > >
> > > Of course the admin could disable the rename prior to upgrade and as Leon
> > > pointed out previously the upgrade won't remove the disablement file. The
> > > problem is they would have to know to do that ahead of time.
> >
> > Dennis,
> >
> > It was discussed and the conclusion was that most if not all users are
> > using one of two upgrade and strategy.
>
> Do you have a pointer to a thread I can read, I apparently missed it?

First, we started to talk about it even before patches were sent.
See this summary from LPC 2017:
 * the sysadmin will be able to disable this for "backward support"
https://lore.kernel.org/linux-rdma/20170917125603.GA5788@mtr-leonro.local/

Second, during the submission too, just need to continue to google it :)

>
> > First option is to rely on distro and every distro behaves differently
> > in such cases, some of them won't change anything till their last user
> > dies :) and others more dynamic with more up-to-date packages already
> > adopted our default.
>
> This is the issue I see. The problem is when the distro doesn't know any
> better and pulls in a new rdma-core and breaks things unintentionally. Up to
> date is good, but up to date that brings with it what is essentially an ABI
> breakage is not.

ABI breakage is a strong term; luckily enough, it is not defined at all.
We never considered dmesg prints, device names, or device ordering as an
ABI. You can't rely on debug features either; they can disappear too.

So the bottom line: the expectation is that distros fix all broken
software before enabling device renaming, and their bugs are no excuse
to declare ABI breakage.

>
> > Second option is to use numerous OFED stacks, which are expected to
> > provide full upgrade to all components which will work smoothly.
>
> Yeah I'm sure OFED will handle things for themselves.

In the end, OFED stacks behave like "mini-distros", so if they manage to
handle it, distros should do the same.

Thanks