Re: OSDs failing to start after host reboot

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Mon, Jan 29, 2018 at 1:37 PM, Andre Goree <andre@xxxxxxxxxx> wrote:
> On 2018/01/29 12:28 pm, Alfredo Deza wrote:
>>
>> On Mon, Jan 29, 2018 at 10:55 AM, Andre Goree <andre@xxxxxxxxxx> wrote:
>>>
>>> On my OSD node that I built with ceph-ansible, the OSDs are failing to
>>> start
>>> after a reboot.
>>
>>
>> This is not uncommon for ceph-disk unfortunately, and one of the
>> reasons we have introduced ceph-volume. There are a few components
>> that can
>> cause this, you may find that rebooting your node will yield different
>> results, some times other OSDs will come up (or all of them even!)
>>
>> If you search the tracker, or even this mailing list, you will see
>> this is nothing new.
>>
>> ceph-ansible has the ability to deploy using ceph-volume, which
>> doesn't suffer from the same caveats, you might want to try it out (if
>> possible)
>>
>>
>
> Thank you, yes I did see that this apparently happens (happened?) often
> after many hours of internet searching.  Very unfortunate.
>
> The only issue I see with ceph-volume (at least with ceph-ansible) is that
> it MUST use LVM, which we'd like to avoid.  But if we cannot reboot our OSD
> hosts for fear of them not being able to come back online, that is perhaps
> something we'll have to reconsider.
>
> Does ceph-volume work without LVM when manually creating things?

Yes it does! It even accepts previously created OSDs (either manual or
via ceph-disk) and can manage them for you.

That means: it will disable the problematic ceph-disk/udev interaction
by overriding the systemd units, and will map the newly captured OSD
details
to ceph-volume systemd units.

You will need to perform a 'scan' of the running OSD (although there
is functionality to scan a partition that is not mounted as well), so
that the details needed to manage it
will get persisted.

Make sure that the JSON output looks correct, so that the systemd
units can have correct data.

More details at:

http://docs.ceph.com/docs/master/ceph-volume/simple/

>
>
>
> --
> Andre Goree
> -=-=-=-=-=-
> Email     - andre at drenet.net
> Website   - http://blog.drenet.net
> PGP key   - http://www.drenet.net/pubkey.html
> -=-=-=-=-=-
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux