Re: Upmap balancing - pools grouped together?

David Zafman <dzafman@xxxxxxxxxx> · Tue, 17 Mar 2020 15:10:51 -0700

Hi Andras,

In 14.2.8 (the latest) version of Nautilus, each pool is balanced 
individually.   Also, the latest version balances within 5 PGs by 
default.  That could be a spread of 10, 5 below the mean and 5 above.  I 
see a spread of 13 in your results.  So either you are running an 
earlier Nautilus version or the mgr hasn't finished balancing.  If you 
are running an earlier version, it would balance pools that use the same 
crush rule together.

I'm working on a hybrid approach that will balance pools by rule, but 
pools will be grouped with a configurable PG count limit.

Currently, we don't balance based on size.

I don't think we know how much space a given PG uses.  Unless Bluestore 
can or does efficiently track the space utilization, we can't reasonably 
balance based on size.

David

On 3/16/20 1:29 PM, Andras Pataki wrote:
I've been trying the upmap balancer on a new Nautilus cluster.  We 
three main pools, a triple replicated pool (id:1) and two 6+3 erasure 
coded pools (id: 4 and 5).  The balancer does a very nice job on the 
triple replicated pool, but does something strange on the EC pools.  
Here is a sample of the PG counts on OSDs (each line) per pool (columns):

OSD    1   2   3   4   5    ALL

   0  25   -   -  80  34    139
   1  26   -   -  74  42    142
   2  25   -   -  74  42    141
   3  26   -   -  75  41    142
   4  25   -   -  83  31    139
   5  26   -   -  80  35    141
   6  26   -   -  79  36    141
   7  26   -   -  72  44    142
   8  25   -   -  74  42    141
   9  26   -   -  78  38    142
  10  26   -   -  70  46    142
  11  25   -   -  78  36    139
  12  26   -   -  74  41    141
  13  26   -   -  78  37    141
  14  26   -   -  76  40    142
  15  25   -   -  82  33    140
  16  26   -   -  77  37    140
  17  26   -   -  73  43    142
  18  26   -   -  77  37    140
  19  26   -   -  79  35    140
  20  26   -   -  74  42    142
  21  26   -   -  78  36    140
  22  26   -   -  79  35    140
  23  26   -   -  81  34    141
  24  26   -   -  77  38    141
  25  26   -   -  79  35    140

Pool #1 has 25 or 26 PGs on each OSD extremely consistently.  But it 
looks like pool #4 and #5 have a lot of variance, but their *sum* 
seems to be well balanced (and hence the total PGs per OSD is also 
very consistent).  As if the balancer was balancing the two pools 
together as one.  Both pools are EC with 6+3 profile, and use the same 
crush rule.  Could that be the reason?  If not - any ideas?

Also - a separate balancer related question: is there a way to have 
the balancer balace the size on each OSD per pool as opposed to the 
number of PGs?  If not currently - is it something hard to implement?

Andras

_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx