Why import temp objects when clear_temp_objects() will just remove it on
osd start-up?
If we need the temp objects for replay purposes, does it matter if a
split has occurred after the original export happened?
Or can we just import all temporary objects without regards to split
and assume that after replay the clear_temp_objects() will
clean them up?
David
On 7/6/15 1:28 PM, Sage Weil wrote:
On Fri, 19 Jun 2015, David Zafman wrote:
This ghobject_t which has a pool of -3 is part of the export. This caused
the assert:
Read -3/1c/temp_recovering_1.1c_33'50_39_head/head
This was added by "osd: use per-pool temp poolid for temp objects"
18eb2a5fea9b0af74a171c3717d1c91766b15f0c in your branch.
You should skip it on export or recreate it on import with special handling.
Ah, that makes sense. I think we should include these temp objects in the
export, though, and make cot understand that they are part of the pool.
We moved the "clear temp objects on startup" logic into teh OSD, which I
think will be useful for e.g. multiobject transactions (where we'll want
some objects that are internal/hidden to persist across peering intervals
and restarts).
Looking at your wip-temp-zafman, I think the first patch needs to be
dropped: include the temp objects, and I assume the meta one (which
has the pg log and other critical pg metadata).
Not sure where to change cot to handle the temp objects though?
Thanks!
sage
David
On 6/19/15 7:38 PM, David Zafman wrote:
Have not seen this as an assert before. Given the code below in do_import()
of master branch the assert is impossible (?).
if (!curmap.have_pg_pool(pgid.pgid.m_pool)) {
cerr << "Pool " << pgid.pgid.m_pool << " no longer exists" << std::endl;
// Special exit code for this error, used by test code
return 10; // Positive return means exit status
}
David
On 6/19/15 7:25 PM, Sage Weil wrote:
Hey David,
On this run
/a/sage-2015-06-18_15:51:18-rados-wip-temp---basic-multi/939648
ceph-objectstore-tool is failing to import a pg because the pool doesn't
exist. It looks like the thrasher is doing an export+import and racing
with a test that is tearing down a pool. The crash is
ceph version 9.0.1-955-ge274efa
(e274efa450e99a68c02bcb713c8837d7809f1ec3)
1: ceph-objectstore-tool() [0xa26335]
2: (()+0xfcb0) [0x7f10cef18cb0]
3: (gsignal()+0x35) [0x7f10cd5af425]
4: (abort()+0x17b) [0x7f10cd5b2b8b]
5: (__gnu_cxx::__verbose_terminate_handler()+0x11d) [0x7f10cdf0269d]
6: (()+0xb5846) [0x7f10cdf00846]
7: (()+0xb5873) [0x7f10cdf00873]
8: (()+0xb596e) [0x7f10cdf0096e]
9: (ceph::__ceph_assert_fail(char const*, char const*, int, char
const*)+0x259) [0xb0ce09]
10: (ObjectStoreTool::get_object(ObjectStore*, coll_t,
ceph::buffer::list&, OSDMap&, bool*)+0x143f) [0x64829f]
11: (ObjectStoreTool::do_import(ObjectStore*, OSDSuperblock&, bool,
std::string)+0x13dd) [0x64a62d]
12: (main()+0x3017) [0x632037]
13: (__libc_start_main()+0xed) [0x7f10cd59a76d]
14: ceph-objectstore-tool() [0x639119]
I don't think this is related to my branch.. but maybe? Have you seen
this? I rebased onto latest master yesterday.
sage
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html