Re: rbd volume upgrades

Yehuda Sadeh <yehuda@xxxxxxxxxxx> · Fri, 9 Nov 2012 12:31:27 -0800



On Fri, Nov 9, 2012 at 12:30 PM, Yehuda Sadeh <yehuda@xxxxxxxxxxx> wrote:
> On Fri, Nov 9, 2012 at 12:26 PM, Josh Durgin <josh.durgin@xxxxxxxxxxx> wrote:
>> On 11/09/2012 12:09 PM, Alex Elder wrote:
>>>
>>> On 11/09/2012 02:03 PM, Josh Durgin wrote:
>>>>
>>>> On 11/09/2012 11:44 AM, Yehuda Sadeh wrote:
>>>>>
>>>>> On Fri, Nov 9, 2012 at 11:30 AM, Josh Durgin <josh.durgin@xxxxxxxxxxx>
>>>>> wrote:
>>>>>>
>>>>>> On 11/09/2012 11:08 AM, Yehuda Sadeh wrote:
>>>>>>>
>>>>>>>
>>>>>>> On Fri, Nov 9, 2012 at 11:04 AM, Josh Durgin <josh.durgin@xxxxxxxxxxx>
>>>>>>> wrote:
>>>>>>>>
>>>>>>>>
>>>>>>>> On 11/09/2012 11:01 AM, Gregory Farnum wrote:
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> I was asked today if there's a way to upgrade RBD volumes from v1 to
>>>>>>>>> v2. I didn't think so, but wanted
>>>>>>>>> 1) to make sure I'm right,
>>>>>>>>> 2) to ask how hard it would be,
>>>>>>>>> 3) to ask if we haven't done it because it didn't occur to us or
>>>>>>>>> because it's too hard.
>>>>>>>>> -Greg
>>>>>>>>>
>>>>>>>>
>>>>>>>> This was addressed in the original discussions about format 2.
>>>>>>>>
>>>>>>>> You need to export and then import the volume as format 2. Format 2
>>>>>>>> uses
>>>>>>>> different names for objects, so providing an 'upgrade' path would
>>>>>>>> still
>>>>>>>> require copying all the data around.
>>>>>>>>
>>>>>>> Couldn't we just set a flag in the header specifying the object naming
>>>>>>> version, which would then only require updating the header?
>>>>>>>
>>>>>>> Yehuda
>>>>>>
>>>>>>
>>>>>>
>>>>>> The header was separated from the id object to allow renames to work
>>>>>> while the image was in use or with cloning. The whole header format
>>>>>> changed and moved to a different object as a result. It would be
>>>>>> messy to implement this kind of upgrade, and doesn't provide much
>>>>>> benefit when there's an easy way to convert already. If someone really
>>>>>> wanted it, it could be implemented, but otherwise I don't think it's
>>>>>> worth adding. It would have to be added to the upcoming kernel
>>>>>> layering support too.
>>>>>>
>>>>>
>>>>> The assumption is that when you upgrade you don't go back, so the fact
>>>>> that the header was separated from the id object doesn't change much.
>>>>> An upgrade process would be the same as creating a new v2 image,
>>>>> having object names (prefix?) that set as the original object names,
>>>>> and with a version field that specifies that these are a v1 names.
>>>>>
>>>>> The problem that I see with converting v1 to v2 through copy is that
>>>>> (besides the cumbersome and potentially very long process) we will end
>>>>> up turning sparse data objects into fully written data objects, which
>>>>> will affect the data consumption.
>>>>
>>>>
>>>> That's a good point about export. It would be good to make export create
>>>> sparse files as well, but since it doesn't yet, the in-place upgrade
>>>> would be better for space usage.
>>>
>>>
>>> Plus!  It looks like you don't even need a flag.
>>>
>>> I think if you simply recorded the old-format object prefix in the
>>> new format header, all would be fine.  The format of the object
>>> id has not changed between v1 and v2, just the object prefix.
>>
>>
>> You still need a flag to tell whether there should be an 'rbd_data.' prefix
>> (format 2) or an 'rb.' prefix (format 1) before the object_prefix
>> stored in the header.
>>
>
> So maybe instead of having a format version it'll just be a string
> that specifies either 'rb.' or 'rbd_.'?

that is 'rbd_data.'
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html