Re: CPU load spikes when CentOS tries to reclaim 'cached' memory

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



We saw very similar issues with a CentOS server with 40 cores (32 virtualized) when moving from a physical server to a virtual server (I think it had 128GB RAM).   Never had the problem on a physical server.  We checked the same things as noted here, but never found a bug.   We really thought it had something to do with NUMA zone reclaim, but could never prove that.    In our case it was all kernel time in the guest, all CPUs at 100%.   Sometimes it would last for a few seconds or minutes.  Sometimes we would go days without a problem, and then it would completely tank.  

If you figure out what is going on, I would like to know  (especially if it is virtualized).

Deron



On Thu, Jun 5, 2014 at 12:23 PM, Merlin Moncure <mmoncure@xxxxxxxxx> wrote:
On Thu, Jun 5, 2014 at 10:58 AM, Jeff Janes <jeff.janes@xxxxxxxxx> wrote:
> This sounds like a kernel problem, probably either the zone reclaim issue,
> or the transparent huge pages issue.

I at first thought maybe same, but I don't think THP was introduced
until 2.6.38...OP is running 2.6.32-431.11.2.el6.x86_6.  Maybe it's
NUMA related, but would not be idiomatic of NUMA issues as I
understand them (poor memory utilization/high IO utilization).  Would
be a very cheap/easy thing to try though.

Is this server virtualized?

merlin


--
Sent via pgsql-performance mailing list (pgsql-performance@xxxxxxxxxxxxxx)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-performance


[Postgresql General]     [Postgresql PHP]     [PHP Users]     [PHP Home]     [PHP on Windows]     [Kernel Newbies]     [PHP Classes]     [PHP Books]     [PHP Databases]     [Yosemite]

  Powered by Linux