Re: OSD crash with segfault Luminous 12.2.4

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Thanks Brad!

I added some information to the ticket.
Unfortunately I still could not grab a coredump, since there was no
segfault lately.

 http://tracker.ceph.com/issues/23431

Maybe Oliver has something to add as well.


Dietmar


On 03/27/2018 11:37 AM, Brad Hubbard wrote:
> "NOTE: a copy of the executable, or `objdump -rdS <executable>` is
> needed to interpret this."
> 
> Have you ever wondered what this means and why it's there? :)
> 
> This is at least something you can try. it may provide useful
> information, it may not.
> 
> This stack looks like it is either corrupted, or possibly not in ceph
> but in one of the linked libraries or glibc itself. If it's the
> former, it probably won't tell us anything. If it's the latter you
> will need the relevant debuginfo installed to get meaningful output
> and note that it will probably take a while. '<executable>' in this
> case is ceph-osd of course.
> 
> Alternatively, if you can upload a coredump and an sosreport (so I can
> validate exact versions of all packages installed) I can try and take
> a look.
> 
> On Fri, Mar 23, 2018 at 9:20 PM, Dietmar Rieder
> <dietmar.rieder@xxxxxxxxxxx> wrote:
>> Hi,
>>
>>
>> I encountered one more two days ago, and I opened a ticket:
>>
>> http://tracker.ceph.com/issues/23431
>>
>> In our case it is more like 1 every two weeks, for now...
>> And it is affecting different OSDs on different hosts.
>>
>> Dietmar
>>
>> On 03/23/2018 11:50 AM, Oliver Freyermuth wrote:
>>> Hi together,
>>>
>>> I notice exactly the same, also the same addresses, Luminous 12.2.4, CentOS 7.
>>> Sadly, logs are equally unhelpful.
>>>
>>> It happens randomly on an OSD about once per 2-3 days (of the 196 total OSDs we have). It's also not a container environment.
>>>
>>> Cheers,
>>>       Oliver
>>>
>>> Am 08.03.2018 um 15:00 schrieb Dietmar Rieder:
>>>> Hi,
>>>>
>>>> I noticed in my client (using cephfs) logs that an osd was unexpectedly
>>>> going down.
>>>> While checking the osd logs for the affected OSD I found that the osd
>>>> was seg faulting:
>>>>
>>>> [....]
>>>> 2018-03-07 06:01:28.873049 7fd9af370700 -1 *** Caught signal
>>>> (Segmentation fault) **
>>>>  in thread 7fd9af370700 thread_name:safe_timer
>>>>
>>>>   ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b)
>>>> luminous (stable)
>>>>    1: (()+0xa3c611) [0x564585904611]
>>>>     2: (()+0xf5e0) [0x7fd9b66305e0]
>>>>      NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>>>> needed to interpret this.
>>>> [...]
>>>>
>>>> Should I open a ticket for this? What additional information is needed?
>>>>
>>>>
>>>> I put the relevant log entries for download under [1], so maybe someone
>>>> with more
>>>> experience can find some useful information therein.
>>>>
>>>> Thanks
>>>>   Dietmar
>>>>
>>>>
>>>> [1] https://expirebox.com/download/6473c34c80e8142e22032469a59df555.html
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> ceph-users mailing list
>>>> ceph-users@xxxxxxxxxxxxxx
>>>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>>>
>>>
>>>
>>>
>>>
>>> _______________________________________________
>>> ceph-users mailing list
>>> ceph-users@xxxxxxxxxxxxxx
>>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>>
>>
>>
>> --
>> _________________________________________
>> D i e t m a r  R i e d e r, Mag.Dr.
>> Innsbruck Medical University
>> Biocenter - Division for Bioinformatics
>> Innrain 80, 6020 Innsbruck
>> Phone: +43 512 9003 71402
>> Fax: +43 512 9003 73100
>> Email: dietmar.rieder@xxxxxxxxxxx
>> Web:   http://www.icbi.at
>>
>>
>>
>> _______________________________________________
>> ceph-users mailing list
>> ceph-users@xxxxxxxxxxxxxx
>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>
> 
> 
> 


-- 
_________________________________________
D i e t m a r  R i e d e r, Mag.Dr.
Innsbruck Medical University
Biocenter - Division for Bioinformatics
Innrain 80, 6020 Innsbruck
Phone: +43 512 9003 71402
Fax: +43 512 9003 73100
Email: dietmar.rieder@xxxxxxxxxxx
Web:   http://www.icbi.at


Attachment: signature.asc
Description: OpenPGP digital signature

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux