Re: how to recovery lost objects

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sat, 4 Apr 2015, huang jun wrote:
> hi,ceph
> 
> Last week, we  add 48 OSDs to our exist cluster, the special thing is during
> the  backfill and recovery procedure,  some osds going to 100% full(we
> have set osd_full_ratio to 0.99).
> In each full OSD, we find one PG  that the PG's acting and up set
> didn't include this OSD anymore, then move the PG data to stale disk,

Note: copying data like this is generally not safe!  Even if you preserve 
xattrs tehre is some metadata kept in leveldb that isn't copied.  The new 
ceph-objectstore-tool has a new export/import pg function that should be 
used instead!

> and then restart the full OSDs, the backfill and recovery finally fininshed. And
> "ceph -s" shows all PG state is "active+clean". So we didn't copy the backuped
> data back.

This sounds fine.  Removing PGs like this generally *does* work (although 
it will leave some junk behind in leveldb).

> But now  we have a problem, we queried a file, find that one object of the file
> is lost, but the PG status is ok
> 
> root@node1:~# ceph osd  map data 10000024b56.00000001
> osdmap e64986 pool 'data' (0) object '10000024b56.00000001' -> pg
> 0.6b96f1d3 (0.11d3) -> up ([85,22], p85) acting ([85,22], p85)
> 
> root@node1:/var/lib/ceph/osd/ceph-85/current/0.11d3_head# find ./
> -name 10000024b56*
> 
> root@node5:/var/lib/ceph/osd/ceph-22/current/0.11d3_head#  find ./
> -name 10000024b56*
> root@node5:/var/lib/ceph/osd/ceph-22/current/0.11d3_head# ls -R *|grep
> 10000024b56
> 
> As you can see, there are no object 10000024b56.00000001 found both in
> osd.85 and osd.82.

Are you sure that object should exist (or existed before)?  CephFS only 
creates the object if data was written to it; the empty parts of a 
sparse file may not have an object at all.
> 
> we check all backuped PG data, but with no luck.
> 
> Does the object to PG refection changes after add OSDs? i think only
> the PG to OSD reflection changes.
> 
> how to recovery lost objects?
> 
> Any tips or hints are welcome.

Nothing here would have obviously made those objects get lost.  How many 
such objects are there?

sage
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux