slow operation observed for _collection_list

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi,

Having slow ops and laggy pgs due to osd is not accessible (octopus 15.2.14 version and 15.2.10 also).
At the time when slow ops started, in the osd log I can see:

"7f2a8d68f700  1 heartbeat_map is_healthy 'OSD::osd_op_tp thread 0x7f2a70de5700' had timed out after 15"

And this blocks the io until the radosgateway didn't restart itself.
Is this a bug or something else?

In the ceph.log I can see also that specific osd is reported failed from another osds:

2021-10-29T05:49:34.386857+0700 mon.server-3s01 (mon.0) 3576376 : cluster [DBG] osd.7 reported failed by osd.31
2021-10-29T05:49:34.454037+0700 mon.server-3s01 (mon.0) 3576377 : cluster [DBG] osd.7 reported failed by osd.22
2021-10-29T05:49:34.666758+0700 mon.server-3s01 (mon.0) 3576379 : cluster [DBG] osd.7 reported failed by osd.6
2021-10-29T05:49:34.807714+0700 mon.server-3s01 (mon.0) 3576382 : cluster [DBG] osd.7 reported failed by osd.11

Here is the osd log: https://justpaste.it/4x4h2
Here is the ceph.log itself: https://justpaste.it/5bk8k
Here is some additional information regarding memory usage and backtrace...: https://justpaste.it/1tmjg

Thank you
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux