Re: An algorithm to fix uneven CRUSH distributions in Ceph

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi,

I will have a look in more detail but the first thing that has come to
my mind is Coordinate Descent
(https://en.wikipedia.org/wiki/Coordinate_descent), so it might make
sense it converges to some point. The fact that it converges always to
zero or near zero error seems more related maybe to this particular
problem and I wonder if it's possible to engineer some instance where
the algorithm gets stuck on some local minimum.

Cheers,
Pedro.


2017-05-12 13:26 GMT+02:00 Loic Dachary <loic@xxxxxxxxxxx>:
> Hi Pedro,
>
> After significant testing I believe the algorithm described at http://dachary.org/?p=4055 works. I'll publish an implementation next week with python-crush[1]. There is however something I don't understand other than intuitively. Why does it converge in all cases ? The core of the algorithm is:
>
> repeat while the distribution improves
>          run a simulation
>          lower the weight of the most overfilled device
>          increase the weight of the most underfilled device
>
> Which we can do because we can run a simulation with all the values. We always know the full set of values (PGs in the Ceph parlance but you can think of them as files) that are distributed to devices. The loss function is the sum of the absolute value of the difference between the expected distribution and the actual distribution for each device. The Kullback-Liebler divergence also works and has a fancier name but that does not seem to be better.
>
> With a better understanding of why it works we may be able to come up with a faster faster implementation and get closer to the perfect distribution in some cases.
>
> I'm curious to know what you think about that :-)
>
> Cheers
>
> [1] http://crush.readthedocs.io/
> --
> Loïc Dachary, Artisan Logiciel Libre
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux