ceph: get_reply unknown tid


 



Hi,

Last night I kept my Ceph cluster online with bonnie++ running on a
Rados Block Device. This caused some errors on the client (I already
sent Yehuda an e-mail about that), and right now I'm seeing the
same messages with the Ceph filesystem.

To do a more random I/O test, I'm downloading about 500GB of small images
(about 3 million files) to my Ceph filesystem.

The rsync started fine but kept stalling and stalling, while my dmesg
was filling up with the following messages:

[ 1875.923024] ceph: get_reply unknown tid 25902 from osd3
[ 1875.924294] ceph: get_reply unknown tid 25905 from osd3
[ 1875.924316] ceph: get_reply unknown tid 25907 from osd3
..
..
[ 2563.040170] ceph:  tid 40963 timed out on osd3, will reset osd
[ 2623.040071] ceph:  tid 40963 timed out on osd3, will reset osd
..
..
[ 2813.209679] ceph: get_reply unknown tid 41671 from osd3
[ 2813.209792] ceph: get_reply unknown tid 41723 from osd3
[ 2813.210610] ceph: get_reply unknown tid 41763 from osd3

root@ceph-client:~# dmesg |grep osd3|wc -l
1065
root@ceph-client:~#

These lines were produced in less than 10 minutes.
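
The grep above counts every line that mentions osd3. To separate the
"unknown tid" replies from the timeouts, something like the following
could be used. It is only a rough sketch: the two patterns are taken
from the dmesg lines quoted above, and the function name and the
"unknown_tid" / "timed_out" labels are made up for this example.

```python
import re
from collections import Counter

# Patterns copied from the two kinds of dmesg lines shown in this mail.
UNKNOWN_TID = re.compile(r"ceph:\s+get_reply unknown tid (\d+) from (osd\d+)")
TIMED_OUT = re.compile(r"ceph:\s+tid (\d+) timed out on (osd\d+)")


def summarize(lines):
    """Count messages per (osd, kind); 'summarize' is a hypothetical helper."""
    counts = Counter()
    for line in lines:
        match = UNKNOWN_TID.search(line)
        if match:
            counts[(match.group(2), "unknown_tid")] += 1
            continue
        match = TIMED_OUT.search(line)
        if match:
            counts[(match.group(2), "timed_out")] += 1
    return counts


# Small demo on three of the lines quoted above.
sample = [
    "[ 1875.923024] ceph: get_reply unknown tid 25902 from osd3",
    "[ 1875.924294] ceph: get_reply unknown tid 25905 from osd3",
    "[ 2563.040170] ceph:  tid 40963 timed out on osd3, will reset osd",
]
for (osd, kind), n in sorted(summarize(sample).items()):
    print(osd, kind, n)
```

To run it on a live client, the saved dmesg output can be passed in
line by line, for example summarize(open("dmesg.txt")), where dmesg.txt
is a file you created yourself with "dmesg > dmesg.txt".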

Sage recommended that I use an OSD journal, so right now all 5 of my OSDs
are running with a 5GB OSD journal on /dev/sda8.

While my writes are stalling and these messages are appearing, reads are
doing fine, but any new writes stall too.

The load on osd3 is really high and the I/O wait is about 85%, which
would suggest that the disk subsystem can't keep up with the writes.

The local disk is a 500GB S-ATA disk (7200RPM) which easily keeps up
with the speed my rsync is going at when writing to its local filesystem
directly.

The other 4 OSDs have the same local disk and are running fine; their
I/O wait is about 3%.

There were no errors to be found in the logfile of "osd3".

After the stall I killed my "rsync" processes and ran a sync. The sync
took about 15 minutes to flush the buffers, while only 55,000 files had
been written with a total size of 2.9GB.

Now that the sync is done, the load on osd3 has gone back to normal.

I assume faster disk subsystems would speed this process up, but
right now all my OSDs have a single 500GB S-ATA disk, which should be
sufficient for these simple tests.

Reproducing these messages is easy: just start writing a lot of small
files to a Ceph filesystem and the OSD messages will appear after about
5 minutes.
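
If you don't have a pile of small images at hand, a small-file writer
along these lines might do as a stand-in. This is not what I ran (I used
rsync); it is only a sketch. The function name, the 50 KiB default (close
to the average of 2.9GB over 55,000 files), the 1000 files per directory
and the mount point in the comment are all assumptions.

```python
import os


def write_small_files(target_dir, count, size_bytes=50 * 1024, per_dir=1000):
    """Write `count` files of `size_bytes` each below target_dir.

    Hypothetical helper: files are spread over subdirectories of
    `per_dir` files each so no single directory grows too large.
    Returns the number of files written.
    """
    payload = os.urandom(size_bytes)  # same random payload for every file
    written = 0
    for i in range(count):
        subdir = os.path.join(target_dir, "d%05d" % (i // per_dir))
        if i % per_dir == 0:
            os.makedirs(subdir, exist_ok=True)
        path = os.path.join(subdir, "f%08d.bin" % i)
        with open(path, "wb") as fh:
            fh.write(payload)
        written += 1
    return written


# Example call (the mount point is an assumption, adjust it to your setup):
# write_small_files("/mnt/ceph/smallfiles", 100000)
```

Keep dmesg open in another terminal on the client while it runs.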

Restarting the OSD helps for about 10 minutes, but eventually the
messages come back again.

Right now osd3 is giving these problems, but last night it was osd4
that gave me the same messages.

-- 
Kind regards,

Wido den Hollander
Head of System Administration / CSO
Phone Support Netherlands: 0900 9633 (45 cpm)
Phone Support Belgium: 0900 70312 (45 cpm)
Phone Direct: (+31) (0)20 50 60 104
Fax: +31 (0)20 50 60 111
E-mail: support@xxxxxxxxxxxx
Website: http://www.pcextreme.nl
Knowledge base: http://support.pcextreme.nl/
Network status: http://nmc.pcextreme.nl





