Just wanted to drop a note to the group that I had my cluster go sideways yesterday, and the root of the problem was networking again. Using iperf I discovered that one of my nodes was only moving data at 1.7Mb / s. Moving that node to a different switch port with a different cable has resolved the problem. It took awhile to track down because none of the server-side error metrics for disk or network showed anything was amiss, and I didn't think to test network performance (as suggested in another thread) until well into the process.
Check networking first!
QH
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com