Re: ceph node crashed with these errors "kernel: ceph: build_snap_context" (maybe now it is urgent?)

On Mon, Dec 2, 2019 at 10:27 AM Marc Roos <M.Roos@xxxxxxxxxxxxxxxxx> wrote:
>
>
> I have asked about this before[1]. Since the Nautilus upgrade I have
> been seeing these errors, with a total node failure as a result(?). I
> was not expecting this in my 'low load' setup. Could someone help
> resolve this now? I have also been waiting quite some time to get
> access to https://tracker.ceph.com/issues.

Hi Marc,

I seem to recall that some anti-spam measures were put in place on the
tracker.  Is your account waiting for manual approval?  If so, David
should be able to help.

>
>
>
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9287 ffff911a9a26bd00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9283 ffff911d34e69d00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9276 ffff911d34e69c00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c926c ffff912068b92c00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9268 ffff912068b93000 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c926d ffff912068b92900 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c928a ffff912118e5be00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9272 ffff9119950d9500 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9269 ffff911940f3d000 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9270 ffff911748427c00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c926b ffff91169b000600 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9281 ffff91169b000500 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9288 ffff9115844d2500 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c927d ffff9115844d2e00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9280 ffff91186401b000 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9267 ffff9121535ecc00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c927c ffff9121cecb1e00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9271 ffff9121cecb0400 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9279 ffff911d26646300 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c927f ffff911d26646900 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9275 ffff9121cecb1700 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9259 ffff91170c9f6600 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9257 ffff9118ef2a8000 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c924e ffff911a1e091800 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9262 ffff911a1e090c00 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9266 ffff9115e3859500 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c924f ffff9118aefd1300 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c925f ffff91170c9f6100 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9252 ffff9115e3859800 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9256 ffff912045dc5300 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9254 ffff91170c9f6900 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9261 ffff91170c9f7100 fail -12
> Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020d4ec4 ffff9118aefd0000 fail -12

The kernel client is failing to allocate memory: -12 is -ENOMEM.
"Low load" isn't very specific.  Can you describe the setup and the
workload in more detail?
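
For anyone reading along: the trailing -12 in the quoted log lines is a
negated errno, and errno 12 is ENOMEM on Linux, which is why this reads
as an allocation failure. A minimal Python sketch for decoding and
tallying such messages follows. The regex and the function name
tally_failures are illustrative assumptions based only on the message
shape quoted above, and it expects each message on a single line, as in
the raw syslog.

```python
import errno
import re
from collections import Counter

# Assumed message shape, taken from the quoted log:
#   build_snap_context <hex id> <hex pointer> fail <negative errno>
PATTERN = re.compile(r"build_snap_context ([0-9a-f]+) ([0-9a-f]+) fail (-?\d+)")


def tally_failures(lines):
    """Count build_snap_context failures by symbolic errno name."""
    counts = Counter()
    for line in lines:
        match = PATTERN.search(line)
        if match is None:
            continue  # not a build_snap_context failure line
        code = abs(int(match.group(3)))
        # Fall back to the raw number if the platform has no name for it.
        counts[errno.errorcode.get(code, str(code))] += 1
    return counts


# Two lines copied from the quoted log above.
sample = [
    "Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9287 ffff911a9a26bd00 fail -12",
    "Dec 1 03:14:36 c04 kernel: ceph: build_snap_context 100020c9283 ffff911d34e69d00 fail -12",
]
print(tally_failures(sample))
```

Feeding it the full kernel log (for example the lines of
/var/log/messages) would show whether every failure is ENOMEM and how
many there were in total.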

How many snapshots do you have?

Do you keep track of memory consumption on the node?
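
If memory is not being tracked yet, one low-effort starting point is to
sample /proc/meminfo periodically and log a few fields. A rough Python
sketch, under the assumption that the standard Linux /proc/meminfo
format is available; the function names, the choice of fields, and the
sample values are illustrative and not taken from this thread.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into {field: integer value}.

    Most fields are reported in kB; a few (e.g. HugePages_Total) are
    plain counts and carry no unit.
    """
    info = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        if not sep:
            continue  # skip anything that is not "Name: value"
        parts = rest.split()
        if parts and parts[0].isdigit():
            info[name.strip()] = int(parts[0])
    return info


def summarize(info, fields=("MemTotal", "MemAvailable", "Slab", "SUnreclaim")):
    """Pick out the fields of interest, skipping any that are absent."""
    return {field: info[field] for field in fields if field in info}


# Made-up sample input; on a real node, read the text from /proc/meminfo.
SAMPLE = """MemTotal:       32780236 kB
MemAvailable:    1024000 kB
Slab:            5120000 kB
SUnreclaim:      4096000 kB
HugePages_Total:       0
"""

print(summarize(parse_meminfo(SAMPLE)))
```

Logging this once a minute with a timestamp would show whether
available memory was shrinking, or unreclaimable slab was growing, in
the run-up to 03:14:36.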

Finally, you say "crash" in the subject.  Does the kernel actually
crash, or does it perhaps lock up?  If it actually crashes, do you have
the panic message?

Thanks,

                Ilya