Re: core system is getting unresponsive because over 300 cpu load

Victor Yegorov <vyegorov@xxxxxxxxx> · Wed, 11 Oct 2017 00:01:48 +0300

2017-10-10 23:40 GMT+03:00 pinker <pinker@xxxxxxx>:
We've got problem with a very serious repetitive incident on our core

system. Namely, cpu load spikes to 300-400 and the whole db becomes

unresponsive. From db point of view nothing special is happening, memory

looks fine, disks io's are ok and the only problem is huge cpu load. Kernel

parameters that are increasing with load are always the same:

Can you provide output of `iostat -myx 10` at the “peak” moments, please?

Also, it'd be good to look in more detailed bgwriter/checkpointer stats.
You can find more details in this post: http://blog.postgresql-consulting.com/2017/03/deep-dive-into-postgres-stats_27.html
(You might want to reset 'shared' stats here.)

-- 
Victor Yegorov