OSD log bound mismatch

Hello Cephers, 

I am having issues with two OSDs, which are either flapping or crashing without recovering. These OSDs have generated about 100 MB of logs in a couple of hours, which I can share if anyone is interested. I am running Firefly with the latest updates on Ubuntu 12.04 with the latest LTS kernel.

Looking at the OSD logs, I see a bunch of these entries:

2014-09-26 15:24:08.998918 7f73cb194700 0 log [ERR] : 5.108 log bound mismatch, info (53757'2809698,54690'2817536] actual [53757'2809532,54690'2817536] 

followed by slow requests like these: 

2014-09-26 15:24:16.798355 7f73e247c700 0 log [WRN] : slow request 31.463701 seconds old, received at 2014-09-26 15:23:45.334567: osd_op(client.37190249.0:6372257 rbd_data.3a0cd42ae8944a.000000000000280d [set-alloc-hint object_size 4194304 write_size 4194304,write 2203648~4096] 5.27e2bd53 ack+ondisk+write e54691) v4 currently waiting for subops from 8 
2014-09-26 15:24:16.798358 7f73e247c700 0 log [WRN] : slow request 31.004246 seconds old, received at 2014-09-26 15:23:45.794022: osd_op(client.38862536.0:2001456 rbd_data.250f7505e5edd7.0000000000000f4f [stat,set-alloc-hint object_size 4194304 write_size 4194304,write 3813376~4096] 5.5a3d6aa3 ack+ondisk+write e54691) v4 currently waiting for missing object 
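
With a log this size, a quick tally of which placement groups report the mismatch and what the slow requests are blocked on may be easier to share than the raw file. Below is a minimal Python sketch, assuming every entry follows the same format as the lines quoted above; the regular expressions and the function name summarize_osd_log are hypothetical helpers written for illustration, not part of Ceph.

```python
import re
from collections import Counter

# Both patterns are modeled only on the log lines quoted above;
# other Ceph releases may word these messages differently.
MISMATCH_RE = re.compile(
    r"log \[ERR\] : (?P<pg>\S+) log bound mismatch, "
    r"info \((?P<info_tail>[^,\s]+),(?P<info_head>[^\]\s]+)\] "
    r"actual \[(?P<actual_tail>[^,\s]+),(?P<actual_head>[^\]\s]+)\]"
)
SLOW_RE = re.compile(
    r"slow request (?P<age>\d+(?:\.\d+)?) seconds old, "
    r".* currently (?P<state>.+?)\s*$"
)


def summarize_osd_log(lines):
    """Count mismatch errors per PG and slow requests per blocking state."""
    mismatches = Counter()   # PG id -> number of "log bound mismatch" entries
    slow_states = Counter()  # text after "currently" -> number of slow requests
    for line in lines:
        m = MISMATCH_RE.search(line)
        if m:
            mismatches[m.group("pg")] += 1
            continue
        s = SLOW_RE.search(line)
        if s:
            slow_states[s.group("state")] += 1
    return mismatches, slow_states
```

Passing an open file object for the OSD log to summarize_osd_log returns two counters: one keyed by PG id (5.108 in the entry above) and one keyed by the wait reason (for example "waiting for missing object").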

The cluster seems to be suffering, and the guest VMs are running with a bit of lag.

Any idea how to fix these issues? 

Cheers 