Hi,

I can share negative test results (on Jewel 10.2.6). All tests were performed while actively writing to CephFS from a single client (about 1300 MB/sec). The cluster consists of 8 nodes with 8 OSDs each (2 SSD for journals and metadata, 6 HDD RAID6 for data); MON/MDS are on dedicated nodes. There are 2 MDS in total, active/standby.

- Crashing one node resulted in write hangs for 17 minutes. Repeating the test resulted in CephFS hanging forever.
- Restarting the active MDS resulted in a successful failover to the standby. Then, after the standby became active and the restarted MDS became standby, the new active MDS was restarted. CephFS hung for 12 minutes.

P.S. Planning to repeat the tests on 10.2.7 or higher.
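For anyone who wants to reproduce hang measurements like the ones above, here is a minimal client-side sketch. It is not the tool used for these tests; the function names, the probe file path, and the one-second interval are all assumptions made for illustration. The idea is to append small fsync'ed blocks to a file on the CephFS mount and report the longest gap between completed writes, since a blocked write shows up as a large gap.

```python
import os
import time


def longest_stall(timestamps):
    """Return the longest gap (seconds) between consecutive write completions."""
    gaps = [later - earlier for earlier, later in zip(timestamps, timestamps[1:])]
    return max(gaps, default=0.0)


def probe_writes(path, count=10, interval=1.0, block=b"x" * 4096):
    """Append small blocks to `path`, fsync each one, and record completion times.

    `path` is a hypothetical probe file on the CephFS mount. If the
    filesystem hangs, write/fsync blocks, so the next timestamp is delayed
    by roughly the length of the hang.
    """
    stamps = []
    with open(path, "ab") as f:
        for _ in range(count):
            f.write(block)
            f.flush()
            os.fsync(f.fileno())
            stamps.append(time.monotonic())
            time.sleep(interval)
    return stamps
```

Usage would be something like longest_stall(probe_writes("/mnt/cephfs/probe.bin", count=3600)), where the mount point is only an example. The reported value includes the sleep interval, so a healthy run shows a gap of about one second.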
--
Dmitry Glushenok
Jet Infosystems