Re: Change in rebalance graph

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



AFR had similar problem 2 years back and we started persisting afr children and graph always gets generated with what the xattrs will be in that order as an option. You can check afr_pending_xattrs_init() for it. On the glusterd side we give an identifer for each of the bricks.

The main reason we did this is because we didn't want to change this logic if new xlators get introduced in between client xlator and afr. What will you do if you add one more xlator between readdir-ahead and dht in future?

Now that I am seeing this mail and also Facebook guys also expressed that there should be a way for gluster to identify if it is talking to right bricks recently when they met us. May be we should let each xlator choose an identifier for its subvolumes and persist it on the bricks and use that instead of depending on mgmt to do the right things.

I understand this patch is already solving your problem. I see that it is merged as well but it is not future proof.


On Wed, May 17, 2017 at 10:03 AM, Raghavendra Gowdappa <rgowdapp@xxxxxxxxxx> wrote:
dht: Add readdir-ahead in rebalance graph if parallel-readdir is on

    Issue:
    The value of linkto xattr is generally the name of the dht's
    next subvol, this requires that the next subvol of dht is not
    changed for the life time of the volume. But with parallel
    readdir enabled, the readdir-ahead loaded below dht, is optional.
    The linkto xattr for first subvol, when:
    - parallel readdir is enabled : "<volname>-readdir-head-0"
    - plain distribute volume : "<volname>-client-0"
    - distribute replicate volume : "<volname>-afr-0"

    The value of linkto xattr is "<volname>-readdir-head-0" when
    parallel readdir is enabled, and is "<volname>-client-0" if
    its disabled. But the dht_lookup takes care of healing if it
    cannot identify which linkto subvol, the xattr points to.

    In dht_lookup_cbk, if linkto xattr is found to be "<volname>-client-0"
    and parallel readdir is enabled, then it cannot understand the
    value "<volname>-client-0" as it expects "<volname>-readdir-head-0".
    In that case, dht_lookup_everywhere is issued and then the linkto file
    is unlinked and recreated with the right linkto xattr. The issue is
    when parallel readdir is enabled, mount point accesses the file
    that is currently being migrated. Since rebalance process doesn't
    have parallel-readdir feature, it expects "<volname>-client-0"
    where as mount expects "<volname>-readdir-head-0". Thus at some point
    either the mount or rebalance will fail.

    Solution:
    Enable parallel-readdir for rebalance as well and then do not
    allow enabling/disabling parallel-readdir if rebalance is in
    progress.

    Change-Id: I241ab966bdd850e667f7768840540546f5289483
    BUG: 1436090
    Signed-off-by: Poornima G <pgurusid@xxxxxxxxxx>
    Reviewed-on: https://review.gluster.org/17056
    Smoke: Gluster Build System <jenkins@xxxxxxxxxxxxxxxxx>
    NetBSD-regression: NetBSD Build System <jenkins@xxxxxxxxxxxxxxxxx>
    CentOS-regression: Gluster Build System <jenkins@xxxxxxxxxxxxxxxxx>


Patch: https://review.gluster.org/17056
owners: Me and Poornima

PS: readdir-ahead is loaded in rebalance graph on top of each subvolume of DHT.

regards,
Raghavendra
_______________________________________________
Gluster-devel mailing list
Gluster-devel@xxxxxxxxxxx
http://lists.gluster.org/mailman/listinfo/gluster-devel



--
Pranith
_______________________________________________
Gluster-devel mailing list
Gluster-devel@xxxxxxxxxxx
http://lists.gluster.org/mailman/listinfo/gluster-devel

[Index of Archives]     [Gluster Users]     [Ceph Users]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Security]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux