Patrick;

I agree with Ranjan, though not in the particulars.

The issue is that "oversized" is ambiguous, though "undersized" is also ambiguous.

I personally prefer unambiguous error messages which also suggest solutions, like:
"1 MDSs reporting cache exceeds 'mds cache memory limit,' of: <value>."

My 2 cents.

Thank you,

Dominic L. Hilsbos, MBA
Director - Information Technology
Perform Air International Inc.
DHilsbos@xxxxxxxxxxxxxx
www.PerformAir.com

-----Original Message-----
From: ceph-users [mailto:ceph-users-bounces@xxxxxxxxxxxxxx] On Behalf Of Patrick Donnelly
Sent: Thursday, December 05, 2019 11:41 AM
To: Ranjan Ghosh
Cc: Ceph Users
Subject: Re: HEALTH_WARN 1 MDSs report oversized cache

On Thu, Dec 5, 2019 at 9:45 AM Ranjan Ghosh <ghosh@xxxxxx> wrote:
> Ah, that seems to have fixed it. Hope it stays that way. I've raised it
> to 4 GB. Thanks to you both!

Just be aware the warning could come back. You just moved the goal posts.

The 1GB default is probably too low for most deployments; I have a PR
to increase this: https://github.com/ceph/ceph/pull/32042

> Although I have to say that the message is IMHO *very* misleading: "1
> MDSs report oversized cache" sounds to me like the cache is too large
> (i.e. wasting RAM unnecessarily). Shouldn't the message rather be "1
> MDSs report *undersized* cache"? Weird.

No. It means the MDS cache is larger than its target. This means the
MDS cannot trim its cache to go back under the limit. This could be
for many reasons but probably due to clients not releasing
capabilities, perhaps due to a bug.

--
Patrick Donnelly, Ph.D.
He / Him / His
Senior Software Engineer
Red Hat Sunnyvale, CA
GPG: 19F28A586F808C2402351B93C3301A3E258DD79D
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com