Need help with synchronizing ceph mons

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi,

I have some problems with my ceph monitor nodes in my Cluster.

I had 5 mons in the cluster. On all 5 nodes the leveldb store grew up to about 80 – 90 GB in size. To get rid of it I triggerd a compaction with the following command on one node.

  ceph tell mon.d compact

The Monitor compacted his data to about 5 GB. After this the mon tried to synchronize his data with the other mons. And start my problems.

After a short time (20 – 30 sec) of streaming data from another mon node The stream breaks and further the sender reads oviously all the other data in his store with maximum speed. At this point the cluster lost his leader and trying to elect a new one. The leader election only works if the reading of the data is done.

I tried to remove the mon from the cluster completly and rejoin it as a new one but while syncing I experience the same issue. So currently the cluster has only 4 Mons.

While further investigation and testing I lost another mon which wants to sync data after starting with the same behavior.

It seems the Node which streams data while syncing is in stress with reading and sending data. I tried to limit the network bandwith of the joining node to reduce the load. I also tried to set ionice -c3 on the process which does all the disk IO while reading. But nothing helped.

Because the cluster is productive I don't want to expermient further more without knowing what's going on.

Does anyone have any ideas what's going on and how I can try to fix this?

I am using ceph version 0.67.11 (bc8b67bef6309a32361be76cd11fb56b057ea9d2)
5 Monitor Nodes with SSD as leveldb store
24 OSD Hosts with 1416 OSDs

Thank you
Manuel

--
Manuel Lausch

Systemadministrator
Cloud Backend Services

1&1 Mail & Media Development  & Technology GmbH | Brauerstraße 48 | 76135 Karlsruhe | Germany
Phone: +49 721 91374-1847
E-Mail: manuel.lausch@xxxxxxxx | Web: www.1und1.de

Amtsgericht Montabaur, HRB 5452

Geschäftsführer: Frank Einhellinger, Hans-Henning Kettler, Jan Oetjen


Member of United Internet

Diese E-Mail kann vertrauliche und/oder gesetzlich geschützte Informationen enthalten. Wenn Sie nicht der bestimmungsgemäße Adressat sind oder diese E-Mail irrtümlich erhalten haben, unterrichten Sie bitte den Absender und vernichten Sie diese E-Mail. Anderen als dem bestimmungsgemäßen Adressaten ist untersagt, diese E-Mail zu speichern, weiterzuleiten oder ihren Inhalt auf welche Weise auch immer zu verwenden.

This e-mail may contain confidential and/or privileged information. If you are not the intended recipient of this e-mail, you are hereby notified that saving, distribution or use of the content of this e-mail in any way is prohibited. If you have received this e-mail in error, please notify the sender and delete the e-mail.

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com





[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux