Re: OSD crash with segfault Luminous 12.2.4

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



"NOTE: a copy of the executable, or `objdump -rdS <executable>` is
needed to interpret this."

Have you ever wondered what this means and why it's there? :)

This is at least something you can try. it may provide useful
information, it may not.

This stack looks like it is either corrupted, or possibly not in ceph
but in one of the linked libraries or glibc itself. If it's the
former, it probably won't tell us anything. If it's the latter you
will need the relevant debuginfo installed to get meaningful output
and note that it will probably take a while. '<executable>' in this
case is ceph-osd of course.

Alternatively, if you can upload a coredump and an sosreport (so I can
validate exact versions of all packages installed) I can try and take
a look.

On Fri, Mar 23, 2018 at 9:20 PM, Dietmar Rieder
<dietmar.rieder@xxxxxxxxxxx> wrote:
> Hi,
>
>
> I encountered one more two days ago, and I opened a ticket:
>
> http://tracker.ceph.com/issues/23431
>
> In our case it is more like 1 every two weeks, for now...
> And it is affecting different OSDs on different hosts.
>
> Dietmar
>
> On 03/23/2018 11:50 AM, Oliver Freyermuth wrote:
>> Hi together,
>>
>> I notice exactly the same, also the same addresses, Luminous 12.2.4, CentOS 7.
>> Sadly, logs are equally unhelpful.
>>
>> It happens randomly on an OSD about once per 2-3 days (of the 196 total OSDs we have). It's also not a container environment.
>>
>> Cheers,
>>       Oliver
>>
>> Am 08.03.2018 um 15:00 schrieb Dietmar Rieder:
>>> Hi,
>>>
>>> I noticed in my client (using cephfs) logs that an osd was unexpectedly
>>> going down.
>>> While checking the osd logs for the affected OSD I found that the osd
>>> was seg faulting:
>>>
>>> [....]
>>> 2018-03-07 06:01:28.873049 7fd9af370700 -1 *** Caught signal
>>> (Segmentation fault) **
>>>  in thread 7fd9af370700 thread_name:safe_timer
>>>
>>>   ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b)
>>> luminous (stable)
>>>    1: (()+0xa3c611) [0x564585904611]
>>>     2: (()+0xf5e0) [0x7fd9b66305e0]
>>>      NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>>> needed to interpret this.
>>> [...]
>>>
>>> Should I open a ticket for this? What additional information is needed?
>>>
>>>
>>> I put the relevant log entries for download under [1], so maybe someone
>>> with more
>>> experience can find some useful information therein.
>>>
>>> Thanks
>>>   Dietmar
>>>
>>>
>>> [1] https://expirebox.com/download/6473c34c80e8142e22032469a59df555.html
>>>
>>>
>>>
>>> _______________________________________________
>>> ceph-users mailing list
>>> ceph-users@xxxxxxxxxxxxxx
>>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>>
>>
>>
>>
>>
>> _______________________________________________
>> ceph-users mailing list
>> ceph-users@xxxxxxxxxxxxxx
>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>
>
>
> --
> _________________________________________
> D i e t m a r  R i e d e r, Mag.Dr.
> Innsbruck Medical University
> Biocenter - Division for Bioinformatics
> Innrain 80, 6020 Innsbruck
> Phone: +43 512 9003 71402
> Fax: +43 512 9003 73100
> Email: dietmar.rieder@xxxxxxxxxxx
> Web:   http://www.icbi.at
>
>
>
> _______________________________________________
> ceph-users mailing list
> ceph-users@xxxxxxxxxxxxxx
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>



-- 
Cheers,
Brad
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux