On 08/23/2017 01:33 AM, Sage Weil wrote:
One other trick that has been used here: if you look inside the PG
directories on the OSDs and find that they are mostly empty then it's
possible some of the memory and peering overhead is related to
empty and useless PG instances on the wrong OSDs. You can write a script
to find empty directories (or ones that only contain the single pgmeta
object with a mostly-empty name) and remove them (using
ceph-objectstore-tool). (For safety I'd recommend doing
ceph-objectstore-tool export first, just in case there is some useful
metadata there.)
That will only help if most of the pg dirs look empty, though. If so,
it's worth a shot!
The other thing we once did was use a kludge patch to trim the
past_intervals metadata, which was respnosible for most of the memory
usage. I can't tell from the profile in this thread if that is the case
or not. There is a patch floating around in git somewhere that can be
reused if it looks like that is the thing consuming the memory.
sage
we ll try the empty pg search. not sure how much is there, but i
randomly checked and found a few.
as for the "kludge" patch, where can I find it. I searched the git repo,
but could not identify it. did not know what to look for specifically.
also, what would we need to better know if the patch would be useful?
e.g. if we need another/more mem profiling.
we installed a test cluster of 4 nodes and replicated the issue there,
and we are testing various scenarios there. if any one cares to
replicate it i can elaborate on the steps.
if all failed, do you think moving pgs out of the current dir is safe?
we are trying to test it, but we ll never be sure 100%
thanks
ali
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html