On Mon, 2023-04-17 at 15:32 +0530, Karthik M wrote: > If the average value in mesh metrics calculation > has been rounded to 0 (success), this resets it to > the smallest nonzero value (similarly to the initialization) > to avoid a case where a single failure would result in > an average value that goes beyond the value > of 95 (Link Failure Threshold). > > Signed-off-by: Tamizh Chelvam Raja <quic_tamizhr@xxxxxxxxxxx> > Signed-off-by: Karthik M <quic_karm@xxxxxxxxxxx> > --- > Changes since v1: > - Altered the comment to mention "mesh" and thershold value. > - Checkpatch done This is v2 now, how did an explicit "v1" happen? :) Anyway ... > +++ b/net/mac80211/mesh_hwmp.c > @@ -298,10 +298,23 @@ void ieee80211s_update_metric(struct ieee80211_local *local, > { > struct ieee80211_tx_info *txinfo = st->info; > int failed; > + u32 fail_avg; > struct rate_info rinfo; > > failed = !(txinfo->flags & IEEE80211_TX_STAT_ACK); > > + fail_avg = ewma_mesh_fail_avg_read(&sta->mesh->fail_avg); > + if (!fail_avg) { > + /* If the average value in mesh metrics calculation > + * has been rounded to 0 (success), this resets it to > + * the smallest nonzero value (similarly to the initialization) > + * to avoid a case where a single failure would result in > + * an average value that goes beyond the value > + * of 95 (Link Failure Threshold) > + */ > + ewma_mesh_fail_avg_add(&sta->mesh->fail_avg, 1); > + } > + > /* moving average, scaled to 100. > * feed failure as 100 and success as 0 > */ This still seems really non-intuitive to me. It seems to me this is down to the special "0 means init" behaviour, that you don't want, because your values actually fluctate between 0 and 100, and you can actually legitimately reach 0 with a lot of successes. But to me it's really non-intuitive, if not counter-intuitive, to say "oh yeah my values are 0 to 100 inclusive, but I can't ever deal with reaching 0 because then I jump to 100 immediately". That doesn't make much sense to me? I mean, I guess I can see where this patch makes some sense like from a code point of view, this is sort of the minimal code change you could make to make the existing code work, but ... I'd argue you're optimising to the wrong metric here, "minimal code changes to fix the bug" should normally not be your metric, it should be "code changes that make this clearer and avoid the problem", or something like that? Anyway I guess that's all a long-winded way of saying that I don't really agree with this change. To me, basically, I see two ways to solve this: 1) we have DECLARE_EWMA_ZERO_VALID() or something like that which *doesn't* treat 0 as an uninitialized value, and either has a separate "not initialized yet" bit (but that's iffy storage wise), or simply has another argument to _init() for the initial value or so. 2) you don't just don't use 0 and 100 but say 1 and 100, that results in basically the same behaviour, but avoids the special 0. johannes