We have a single VM that is acting odd. We had 7 SSD OSDs (out of 40) go down over a period of about 12 hours. These are a cache tier and have size 4, min_size 2. I'm not able to make heads or tails of the error and hoped someone here could help.
2016-01-14 23:09:54.559121 osd.136 [ERR] 13.503 copy from f8bedd03/rbd_data.48a6325f5e3f87.000000000000683d/head//13 to f8bedd03/rbd_data.48a6325f5e3f87.000000000000683d/head//13 data digest 0x92bc163c != source 0x8fe2d0a9
The PG fully recovered then the error was
2016-01-15 00:39:25.321469 osd.12 [ERR] 13.503 copy from f8bedd03/rbd_data.48a6325f5e3f87.000000000000683d/head//13 to f8bedd03/rbd_data.48a6325f5e3f87.000000000000683d/head//13 data digest 0x92bc163c != source 0x8fe2d0a9
A deep scrub of the PG comes back clean and a hash of the files on all OSDs match. The file system on this vm keeps going read only.
The osd file system is EXT4 and this is 0.94.5.
Thanks,
Robert LeBlanc
Sent from a mobile device please excuse any typos.
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com