On Sat, 4 Apr 2015, huang jun wrote:
> hi,ceph
>
> Last week, we add 48 OSDs to our exist cluster, the special thing is during
> the backfill and recovery procedure, some osds going to 100% full(we
> have set osd_full_ratio to 0.99).
> In each full OSD, we find one PG that the PG's acting and up set
> didn't include this OSD anymore, then move the PG data to stale disk,

Note: copying data like this is generally not safe!  Even if you preserve
xattrs, there is some metadata kept in leveldb that isn't copied.  The new
ceph-objectstore-tool has an export/import pg function that should be
used instead!

> and then restart the full OSDs, the backfill and recovery finally fininshed. And
> "ceph -s" shows all PG state is "active+clean". So we didn't copy the backuped
> data back.

This sounds fine.  Removing PGs like this generally *does* work (although
it will leave some junk behind in leveldb).

> But now we have a problem, we queried a file, find that one object of the file
> is lost, but the PG status is ok
>
> root@node1:~# ceph osd map data 10000024b56.00000001
> osdmap e64986 pool 'data' (0) object '10000024b56.00000001' -> pg
> 0.6b96f1d3 (0.11d3) -> up ([85,22], p85) acting ([85,22], p85)
>
> root@node1:/var/lib/ceph/osd/ceph-85/current/0.11d3_head# find ./
> -name 10000024b56*
>
> root@node5:/var/lib/ceph/osd/ceph-22/current/0.11d3_head# find ./
> -name 10000024b56*
> root@node5:/var/lib/ceph/osd/ceph-22/current/0.11d3_head# ls -R *|grep
> 10000024b56
>
> As you can see, there are no object 10000024b56.00000001 found both in
> osd.85 and osd.82.

Are you sure that object should exist (or existed before)?  CephFS only
creates the object if data was written to it; the empty parts of a sparse
file may not have an object at all.

>
> we check all backuped PG data, but with no luck.
>
> Does the object to PG refection changes after add OSDs? i think only
> the PG to OSD reflection changes.
>
> how to recovery lost objects?
>
> Any tips or hints are welcome.
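
On the object-to-PG question: as far as I know, the object-to-PG mapping
depends only on the hash of the object name and the pool's pg_num, so
adding OSDs alone should not change it; only the PG-to-OSD (CRUSH) mapping
moves.  Below is a minimal Python sketch of the "stable mod" step, modeled
on ceph_stable_mod.  The helper names and the pg_num value of 8192 are
assumptions for illustration (the pool's real pg_num is not stated in this
thread); the hash 0x6b96f1d3 is taken from the "ceph osd map" output quoted
above.

```python
def stable_mod(x, b, bmask):
    """Map hash x onto [0, b) so that most results stay put as b grows.

    Modeled on ceph_stable_mod: b is pg_num, bmask is the next
    power of two at or above b, minus one.
    """
    if (x & bmask) < b:
        return x & bmask
    return x & (bmask >> 1)


def pg_mask(pg_num):
    """Smallest (2^n - 1) mask that covers pg_num - 1 (hypothetical helper)."""
    return (1 << (pg_num - 1).bit_length()) - 1


def object_hash_to_pg(hash32, pg_num):
    """Return the PG seed for an already-hashed object name.

    Note that no OSD count or OSD id appears anywhere in this step.
    """
    return stable_mod(hash32, pg_num, pg_mask(pg_num))


if __name__ == "__main__":
    obj_hash = 0x6B96F1D3   # from the 'ceph osd map' output in this thread
    assumed_pg_num = 8192   # ASSUMPTION: not stated in the thread
    seed = object_hash_to_pg(obj_hash, assumed_pg_num)
    print("0.%x" % seed)    # prints 0.11d3 under the assumed pg_num
```

Under that assumption the sketch reproduces the 0.11d3 shown by "ceph osd
map".  The result would only differ if pg_num itself had been changed (for
example, an assumed pg_num of 4096 gives 0.1d3), which is worth ruling out
before looking elsewhere.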

Nothing here would have obviously made those objects get lost.  How many
such objects are there?

sage