Re: random thoughts on past_intervals

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Yeah, that's pretty much where I'm going.
-Sam

On Fri, Dec 9, 2016 at 3:24 PM, Sage Weil <sweil@xxxxxxxxxx> wrote:
> On Fri, 9 Dec 2016, Samuel Just wrote:
>> Currently, never.  However, I'm thinking that we might want to retain
>> the freedom to not send the structure if it's really big.  And
>> actually, we won't ever need to extend a received past_intervals
>> structure to the current epoch since if the interval changed, we'd
>> throw out the whole message.
>
> I'd say throw out the incremental build code, then, and assert
> past_intervals is present at notify time; we can re-add something to
> recalculate the whole past_intervals if it becomes necessary in the
> future.
>
> sage
>
>
>> -Sam
>>
>> On Fri, Dec 9, 2016 at 2:16 PM, Sage Weil <sweil@xxxxxxxxxx> wrote:
>> > On Fri, 9 Dec 2016, Samuel Just wrote:
>> >> In particular, we don't need PG::generate_past_intervals duplicating
>> >> the logic in build_past_intervals_parallel since constructed PG
>> >> objects only ever need to maintain a consistent past_intervals
>> >> structure, never build it from scratch.
>> >
>> > Sounds good to me.
>> >
>> > My main question is:
>> >
>> >> On Fri, Dec 9, 2016 at 11:59 AM, Samuel Just <sjust@xxxxxxxxxx> wrote:
>> >> > There's code for dealing with some odd past_intervals configurations
>> >> > including doesn't go back far enough and doesn't go forward far
>> >> > enough.  I *think* we can simplify this as follows:
>> >> > 1) Once the PG object is constructed and in memory, past_intervals
>> >> > extends from history.last_epoch_started to the PG's current map
>> >> > 2) On disk, either 1) is true or the set is empty (after an import)
>> >> >
>> >> > On boot, the OSD generates past_intervals in parallel for any PGs
>> >> > without them (and perhaps verifies 1) for the rest).  On receipt of a
>> >> > Notify creating a PG, the OSD generates the past_intervals structure
>> >> > before instantiating the PG using the same process as on boot --
>> >> > starting with the included past_intervals if present (may not extend
>> >> > to the current map, and so may need to be extended).
>> >
>> > When does this actually happen?  If PGs are always in state 1, can we
>> > instead ensure that PG notify will always include past_intervals and that
>> > a received notify will never require us to go off do the (slow) work of
>> > loading up old maps to generate old intervals?
>> >
>> > Then, we can focus our efforts on make the past_intervals representation
>> > compact (e.g., by discarding redundant intervals)...
>> >
>> > sage
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@xxxxxxxxxxxxxxx
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>>
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux