Re: Implications of pglog_hardlimit

Josh Baergen <jbaergen@xxxxxxxxxxxxxxxx> · Tue, 29 Nov 2022 16:32:00 -0700

It's also possible you're running into large pglog entries - any
chance you're running RGW and there's an s3:CopyObject workload
hitting an object that was uploaded with MPU?
https://tracker.ceph.com/issues/56707

If that's the case, you can inject a much smaller value for
osd_min_pg_log_entries and osd_max_pg_log_entries (ceph tell osd.*
config set osd_min_pg_log_entries 500 - repeat for max) to relieve
memory pressure.

Josh

On Tue, Nov 29, 2022 at 3:10 PM Frank Schilder <frans@xxxxxx> wrote:
>
> Hi, it sounds like you might be affected by the pg_log dup bug:
>
> # Check if any OSDs are affected by the pg dup problem
> sudo -i ceph tell "osd.*" perf dump | grep -e pglog -e "osd\\."
>
> If any osd_pglog_items>>1M check https://www.clyso.com/blog/osds-with-unlimited-ram-growth/
>
> Best regards,
> =================
> Frank Schilder
> AIT Risø Campus
> Bygning 109, rum S14
>
> ________________________________________
> From: Gregory Farnum <gfarnum@xxxxxxxxxx>
> Sent: 29 November 2022 22:25:54
> To: Joshua Timmer
> Cc: ceph-users@xxxxxxx
> Subject:  Re: Implications of pglog_hardlimit
>
> On Tue, Nov 29, 2022 at 1:18 PM Joshua Timmer <mrjoshuatimmer@xxxxxxxxx>
> wrote:
>
> > I've got a cluster in a precarious state because several nodes have run
> > out of memory due to extremely large pg logs on the osds. I came across
> > the pglog_hardlimit flag which sounds like the solution to the issue,
> > but I'm concerned that enabling it will immediately truncate the pg logs
> > and possibly drop some information needed to recover the pgs. There are
> > many in degraded and undersized states right now as nodes are down. Is
> > it safe to enable the flag in this state? The cluster is running
> > luminous 12.2.13 right now.
>
>
> The hard limit will truncate the log, but all the data goes into the
> backing bluestore/filestore instance at the same time. The pglogs are used
> for two things:
> 1) detecting replayed client operations and sending the same answer back on
> replays, so shorter logs means a shorter time window of detection but
> shouldn’t be an issue;
> 2) enabling log-based recovery of pgs where OSDs with overlapping logs can
> identify exactly which objects have been modified and only moving them.
>
> So if you set the hard limit, it’s possible you’ll induce more backfill as
> fewer logs overlap. But no data will be lost.
> -Greg
>
>
> > _______________________________________________
> > ceph-users mailing list -- ceph-users@xxxxxxx
> > To unsubscribe send an email to ceph-users-leave@xxxxxxx
> >
> >
> >
> _______________________________________________
> ceph-users mailing list -- ceph-users@xxxxxxx
> To unsubscribe send an email to ceph-users-leave@xxxxxxx
> _______________________________________________
> ceph-users mailing list -- ceph-users@xxxxxxx
> To unsubscribe send an email to ceph-users-leave@xxxxxxx
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx