Re: Storage node refurbishing, a "freeze" OSD feature would be nice

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Mon, 31 Aug 2015 08:57:23 +0200 Udo Lembke wrote:

> Hi Christian,
> for my setup "b" takes too long - too much data movement and stress to
> all nodes. I have simply (with replica 3) "set noout", reinstall one
> node (with new filesystem on the OSDs, but leave them in the crushmap)
> and start all OSDs (at friday night) - takes app. less than one day for
> rebuild (11*4TB 1*8TB). Do also stress the other nodes, but less than
> with weigting to zero.
> 
I will hopefully have a good idea of what times and impacts I'm looking at
after a).
But I think that doing a massive push-pull like that will be too long
for the maintenance windows we have, also the number of OSDs will change in
the old nodes.

Christian

> Udo
> 
> On 31.08.2015 06:07, Christian Balzer wrote:
> > 
> > Hello,
> > 
> > I'm about to add another storage node to small firefly cluster here and
> > refurbish 2 existing nodes (more RAM, different OSD disks).
> > 
> > Insert rant about not going to start using ceph-deploy as I would have
> > to set the cluster to no-in since "prepare" also activates things due
> > to the udev magic...
> > 
> > This cluster is quite at the limits of its IOPS capacity (the HW was
> > requested ages ago, but the mills here grind slowly and not particular
> > fine either), so the plan is to:
> > 
> > a) phase in the new node (lets call it C), one OSD at a time (in the
> > dead of night)
> > b) empty out old node A (weight 0), one OSD at a time. When
> > done, refurbish and bring it back in, like above.
> > c) repeat with 2nd old node B.
> > 
> > Looking at this it's obvious where the big optimization in this
> > procedure would be, having the ability to "freeze" the OSDs on node B.
> > That is making them ineligible for any new PGs while preserving their
> > current status. 
> > So that data moves from A to C (which is significantly faster than A
> > or B) and then back to A when it is refurbished, avoiding any heavy
> > lifting by B.
> > 
> > Does that sound like something other people might find useful as well
> > and is it feasible w/o upsetting the CRUSH applecart?
> > 
> > Christian
> > 
> 
> 


-- 
Christian Balzer        Network/Systems Engineer                
chibi@xxxxxxx   	Global OnLine Japan/Fusion Communications
http://www.gol.com/
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux