Re: [PATCH 05/21] libmultipath: lookup_binding: add comment about the algorithm

Martin Wilck <mwilck@xxxxxxxx> · Thu, 07 Sep 2023 10:22:58 +0200

On Wed, 2023-09-06 at 17:43 -0500, Benjamin Marzinski wrote:
> On Fri, Sep 01, 2023 at 08:02:18PM +0200, mwilck@xxxxxxxx wrote:
> > From: Martin Wilck <mwilck@xxxxxxxx>
> > 
> > When I read this code, I always get confused. Adding comments to
> > explain the algorithm.
> > 
> > Signed-off-by: Martin Wilck <mwilck@xxxxxxxx>
> > ---
> >  libmultipath/alias.c | 35 +++++++++++++++++++++++++++++++++++
> >  1 file changed, 35 insertions(+)
> > 
> > diff --git a/libmultipath/alias.c b/libmultipath/alias.c
> > index f7834d1..e61eb91 100644
> > --- a/libmultipath/alias.c
> > +++ b/libmultipath/alias.c
> > @@ -172,6 +172,41 @@ lookup_binding(FILE *f, const char *map_wwid,
> > char **map_alias,
> >                 alias = strtok_r(buf, " \t", &saveptr);
> >                 if (!alias) /* blank line */
> >                         continue;
> > +
> > +               /*
> > +                * Find an unused index - explanation of the
> > algorithm
> > +                *
> > +                * ID: 1 = mpatha, 2 = mpathb, ...
> > +                *
> > +                * We assume the bindings are unsorted. The only
> > constraint
> > +                * is that no ID occurs more than once. IDs that
> > occur in the
> > +                * bindings are called "used".
> > +                *
> > +                * We call the list 1,2,3,..., exactly in this
> > order, the list
> > +                * of "expected" IDs. The variable "id" always
> > holds the next
> > +                * "expected" ID, IOW the last "expected" ID
> > encountered plus 1.
> > +                * Thus all IDs below "id" are known to be used.
> > However, at the
> > +                * end of the loop, the value of "id" isn't
> > necessarily unused.
> > +                *
> > +                * "smallest_bigger_id" is the smallest used ID
> > that was
> > +                * encountered while it was larger than the next
> > "expected" ID
> > +                * at that iteration. Let X be some used ID. If all
> > IDs below X
> > +                * are used and encountered in the right sequence
> > before X, "id"
> > +                * will be > X when the loop ends. Otherwise, X was
> > encountered
> > +                * "out of order", the condition (X > id) holds
> > when X is
> > +                * encountered, and "smallest_bigger_id" will be
> > set to X; i.e.
> > +                * it will be less or equal than X when the loop
> > ends.
> > +                *
> > +                * At the end of the loop, (id <
> > smallest_bigger_id) means that
> > +                * the value of "id" had been encountered neither
> > in order nor
> > +                * out of order, and is thus unused. (id >=
> > smallest_bigger_id)
> 
> I know the check is (id >= smallest_bigger_id), but as long as no ID
> occurs more than once, id can never actually be bigger than
> smallest_bigger_id since id only gets incremented when (curr_id ==
> id)
> and if smallest_bigger_id is not INT_MAX, then smallest_bigger_id
> already occured once in the file before id was incremented to equal
> it.
> This means it can't occur again, so id can never get incremented past
> it. Not this this really matters, so

Right. Like you, I thought this didn't matter for the algorithm as
such, and that explaining it would make the lengthy explanation even
lengthier.

Btw, if you have a simpler explanation of this algorithm, please
provide it ;-)

Martin

> 
> Reviewed-by: Benjamin Marzinski <bmarzins@xxxxxxxxxx>
> 
> > +                * means that "id"'s value is in use. In this case,
> > we play safe
> > +                * and use "biggest_id + 1" as the next value to
> > try.
> > +                *
> > +                * biggest_id is always > smallest_bigger_id,
> > except in the
> > +                * "perfectly ordered" case.
> > +                */
> > +
> >                 curr_id = scan_devname(alias, prefix);
> >                 if (curr_id == id) {
> >                         if (id < INT_MAX)
> > -- 
> > 2.41.0
> 

--
dm-devel mailing list
dm-devel@xxxxxxxxxx
https://listman.redhat.com/mailman/listinfo/dm-devel