OSD weighting

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Dear devs,

Playing around with ceph and gradually moving it from a toy thing into production I wanted ceph to actually make its run for the money (so to speak). I have assembled number of OSDs which are really built on different hardware: starting from old P4 with 512MB of RAM and ending up with high end Dell server, including mixture of 100 and 1000 mbit networks. I will not really speak about performance of MONs and MDSes as they do fairly well does not matter what I throw to them. But with OSDs it is different story. Even one full OSD will stall whole ceph - I've read that it is normal and good way of fighting it is to have periodic health check to see that no OSD is approaching full status. However I believe it would be better if ceph will reduce weighting for OSDs approaching full status so it will effectively prevent OSD getting full. Should be reasonably simple to implement and will not cause major grief if some OSD will go past near full status to full status quickly and unnoticed. I guess reweight-by-utilization is an attempt to address the issue based on CPU performance.

In the mean time I have reverted back to manual weighting of OSDs and I found that there no clear explanation on how weights actually applied. I've seen suggestion to keep weight equivalent to number of TBs on OSD. Doing so in single rack has achieved expected result: data has spread itself proportionally to OSDs sizes. But when I started to move OSDs from toy rack into production rack I also have changed weights for racks in pool. So I had 6 OSDs and I moved 2 of them. I have changed toyrack weight to 4.000 and productionrack to 2.000. Waited for data to settle just to find out that disk use is no longer proportional. Then I have changed rack weights to total amount of TBs in the rack, data reshuffled and settled but again did not achieved expected result. So I guess function of weights: racks, hosts and devices is not straight forward as I thought originally. This begs clear explanation of how weights are used in case of straw algo.

Regards,
Vladimir
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux