Re: OSD crash with segfault Luminous 12.2.4

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Tue, Mar 27, 2018 at 9:04 PM, Dietmar Rieder
<dietmar.rieder@xxxxxxxxxxx> wrote:
> Thanks Brad!

Hey Dietmar,

yw.

>
> I added some information to the ticket.
> Unfortunately I still could not grab a coredump, since there was no
> segfault lately.

OK. That may help to get us started. Getting late here for me so I'll
take a look at this tomorrow.

Thanks!

>
>  http://tracker.ceph.com/issues/23431
>
> Maybe Oliver has something to add as well.
>
>
> Dietmar
>
>
> On 03/27/2018 11:37 AM, Brad Hubbard wrote:
>> "NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>> needed to interpret this."
>>
>> Have you ever wondered what this means and why it's there? :)
>>
>> This is at least something you can try. it may provide useful
>> information, it may not.
>>
>> This stack looks like it is either corrupted, or possibly not in ceph
>> but in one of the linked libraries or glibc itself. If it's the
>> former, it probably won't tell us anything. If it's the latter you
>> will need the relevant debuginfo installed to get meaningful output
>> and note that it will probably take a while. '<executable>' in this
>> case is ceph-osd of course.
>>
>> Alternatively, if you can upload a coredump and an sosreport (so I can
>> validate exact versions of all packages installed) I can try and take
>> a look.
>>
>> On Fri, Mar 23, 2018 at 9:20 PM, Dietmar Rieder
>> <dietmar.rieder@xxxxxxxxxxx> wrote:
>>> Hi,
>>>
>>>
>>> I encountered one more two days ago, and I opened a ticket:
>>>
>>> http://tracker.ceph.com/issues/23431
>>>
>>> In our case it is more like 1 every two weeks, for now...
>>> And it is affecting different OSDs on different hosts.
>>>
>>> Dietmar
>>>
>>> On 03/23/2018 11:50 AM, Oliver Freyermuth wrote:
>>>> Hi together,
>>>>
>>>> I notice exactly the same, also the same addresses, Luminous 12.2.4, CentOS 7.
>>>> Sadly, logs are equally unhelpful.
>>>>
>>>> It happens randomly on an OSD about once per 2-3 days (of the 196 total OSDs we have). It's also not a container environment.
>>>>
>>>> Cheers,
>>>>       Oliver
>>>>
>>>> Am 08.03.2018 um 15:00 schrieb Dietmar Rieder:
>>>>> Hi,
>>>>>
>>>>> I noticed in my client (using cephfs) logs that an osd was unexpectedly
>>>>> going down.
>>>>> While checking the osd logs for the affected OSD I found that the osd
>>>>> was seg faulting:
>>>>>
>>>>> [....]
>>>>> 2018-03-07 06:01:28.873049 7fd9af370700 -1 *** Caught signal
>>>>> (Segmentation fault) **
>>>>>  in thread 7fd9af370700 thread_name:safe_timer
>>>>>
>>>>>   ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b)
>>>>> luminous (stable)
>>>>>    1: (()+0xa3c611) [0x564585904611]
>>>>>     2: (()+0xf5e0) [0x7fd9b66305e0]
>>>>>      NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>>>>> needed to interpret this.
>>>>> [...]
>>>>>
>>>>> Should I open a ticket for this? What additional information is needed?
>>>>>
>>>>>
>>>>> I put the relevant log entries for download under [1], so maybe someone
>>>>> with more
>>>>> experience can find some useful information therein.
>>>>>
>>>>> Thanks
>>>>>   Dietmar
>>>>>
>>>>>
>>>>> [1] https://expirebox.com/download/6473c34c80e8142e22032469a59df555.html
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> ceph-users mailing list
>>>>> ceph-users@xxxxxxxxxxxxxx
>>>>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>>>>
>>>>
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> ceph-users mailing list
>>>> ceph-users@xxxxxxxxxxxxxx
>>>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>>>
>>>
>>>
>>> --
>>> _________________________________________
>>> D i e t m a r  R i e d e r, Mag.Dr.
>>> Innsbruck Medical University
>>> Biocenter - Division for Bioinformatics
>>> Innrain 80, 6020 Innsbruck
>>> Phone: +43 512 9003 71402
>>> Fax: +43 512 9003 73100
>>> Email: dietmar.rieder@xxxxxxxxxxx
>>> Web:   http://www.icbi.at
>>>
>>>
>>>
>>> _______________________________________________
>>> ceph-users mailing list
>>> ceph-users@xxxxxxxxxxxxxx
>>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>>
>>
>>
>>
>
>
> --
> _________________________________________
> D i e t m a r  R i e d e r, Mag.Dr.
> Innsbruck Medical University
> Biocenter - Division for Bioinformatics
> Innrain 80, 6020 Innsbruck
> Phone: +43 512 9003 71402
> Fax: +43 512 9003 73100
> Email: dietmar.rieder@xxxxxxxxxxx
> Web:   http://www.icbi.at
>
>



-- 
Cheers,
Brad
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux