On Mon, 1 Jun 2020 at 22:25, Alan Maguire <alan.maguire@xxxxxxxxxx> wrote: > > On Mon, 1 Jun 2020, Daniel Borkmann wrote: > > > On 6/1/20 7:48 PM, Alan Maguire wrote: > > > On Wed, 13 May 2020, Lorenz Bauer wrote: > > > > > >>>> Option 1: always downgrade UNNECESSARY to NONE > > >>>> - Easiest to back port > > >>>> - The helper is safe by default > > >>>> - Performance impact unclear > > >>>> - No escape hatch for Cilium > > >>>> > > >>>> Option 2: add a flag to force CHECKSUM_NONE > > >>>> - New UAPI, can this be backported? > > >>>> - The helper isn't safe by default, needs documentation > > >>>> - Escape hatch for Cilium > > >>>> > > >>>> Option 3: downgrade to CHECKSUM_NONE, add flag to skip this > > >>>> - New UAPI, can this be backported? > > >>>> - The helper is safe by default > > >>>> - Escape hatch for Cilium (though you'd need to detect availability of > > >>>> the > > >>>> flag somehow) > > >>> > > >>> This seems most reasonable to me; I can try and cook a proposal for > > >>> tomorrow as > > >>> potential fix. Even if we add a flag, this is still backportable to stable > > >>> (as > > >>> long as the overall patch doesn't get too complex and the backport itself > > >>> stays > > >>> compatible uapi-wise to latest kernels. We've done that before.). I happen > > >>> to > > >>> have two ixgbe NICs on some of my test machines which seem to be setting > > >>> the > > >>> CHECKSUM_UNNECESSARY, so I'll run some experiments from over here as well. > > >> > > >> Great! I'm happy to test, of course. > > > > > > I had a go at implementing option 3 as a few colleagues ran into this > > > problem. They confirmed the fix below resolved the issue. Daniel is > > > this roughly what you had in mind? I can submit a patch for the bpf > > > tree if that's acceptable with the new flag. Do we need a few > > > tests though? > > > > Coded this [0] up last week which Lorenz gave a spin as well. Originally > > wanted to > > get it out Friday night, but due to internal release stuff it got too late Fri > > night > > and didn't want to rush it at 3am anymore, so the series as fixes is going out > > tomorrow > > morning [today was public holiday in CH over here]. > > > > Looks great! Although I've only seen this issue arise > for cases where csum_level == 0, should we also > add "skb->csum_level = 0;" when we reset the > ip_summed value? FWIW I had the same reaction. Maybe it's worth adding after all, Daniel? -- Lorenz Bauer | Systems Engineer 6th Floor, County Hall/The Riverside Building, SE1 7PB, UK www.cloudflare.com