Re: Ceph PG Incomplete = Cluster unusable

On Fri, Jan 9, 2015 at 7:17 AM, Robert LeBlanc <robert@xxxxxxxxxxxxx> wrote:
> Protect against bit rot. Checked on read and on deep scrub.

There are still issues (at least in firefly) with FDCache and with scrub
completing despite corrupted on-disk data, so checksumming throughout
will not cover every possible corruption case (at least not until it
becomes possible to invalidate FDCache on demand). Since this thread
raised the consistency question, it is worth mentioning this too. Most
of the time this issue will not hit anyone, because hardware failures
usually look different from a single corrupted file, but such a case is
possible to imagine, especially with SSDs. I suspect that not everyone
is familiar with the problem, so in short: it looks like we *may*
corrupt certain data blocks in a filestore, and because of FDCache they
will not be revealed even by a deep scrub, and the problem may persist
after an OSD restart. This issue concerns me a lot after I hit
misbehavior of the recovery procedure in the middle of December, as my
issue may be related to the one described above.
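
To illustrate the general idea of a cached file descriptor hiding what
is actually on disk, here is a minimal Python sketch. It is not Ceph
code and does not reproduce the FDCache bug itself. It assumes one
particular mechanism (the file at a path gets replaced while an older
descriptor for it is still held open), which may or may not match what
happens in filestore. The names sha256_fd, sha256_path and demo are
made up for this example, and it relies on POSIX rename semantics.

```python
import hashlib
import os
import tempfile


def sha256_fd(fd):
    """Hash everything readable through an already-open descriptor."""
    os.lseek(fd, 0, os.SEEK_SET)
    digest = hashlib.sha256()
    while True:
        chunk = os.read(fd, 65536)
        if not chunk:
            break
        digest.update(chunk)
    return digest.hexdigest()


def sha256_path(path):
    """Hash the file currently reachable by name, using a fresh descriptor."""
    fd = os.open(path, os.O_RDONLY)
    try:
        return sha256_fd(fd)
    finally:
        os.close(fd)


def demo():
    """Return (expected, via_cached_fd, via_fresh_open) checksums."""
    with tempfile.TemporaryDirectory() as workdir:
        path = os.path.join(workdir, "object")
        with open(path, "wb") as f:
            f.write(b"good data")
        expected = hashlib.sha256(b"good data").hexdigest()

        # Stand-in for a descriptor kept in a cache (hypothetical).
        cached_fd = os.open(path, os.O_RDONLY)
        try:
            # Assumption: the "corruption" replaces the file under the
            # same name, so the cached descriptor still points at the
            # old inode.
            replacement = path + ".new"
            with open(replacement, "wb") as f:
                f.write(b"bad data!")
            os.replace(replacement, path)

            via_cached = sha256_fd(cached_fd)
            via_fresh = sha256_path(path)
        finally:
            os.close(cached_fd)
        return expected, via_cached, via_fresh
```

In this sketch a checksum computed through the cached descriptor still
matches the expected value, while a checksum computed after re-opening
the path does not. A check that only ever goes through the cached
descriptor would therefore report the object as clean.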