Re: 3.8.2 : Node not healing

Krutika Dhananjay <kdhananj@xxxxxxxxxx> · Mon, 15 Aug 2016 12:47:11 +0530

Could you please attacj the brick logs and glustershd logs?
Also share the volume configuration please (`gluster volume info`).

-Krutika

On Mon, Aug 15, 2016 at 12:19 PM, Lindsay Mathieson <lindsay.mathieson@xxxxxxxxx> wrote:
Moved to a new subject as its now an issue on our cluster.

As an experiment I killed glusterfsd on one node. System kept

chugging allong fine with no hiccups. I ran a few disk intensive VM's

on that node and others, no real slow down. Monitoring it with "heal

statistics heal-count'

heal-count got up to approx 2500 shards and restarted glusterfsd by

restarting the gluster-service (glusterd).

heal-count stopped rising, but what is concerning is that it doesn't

seem to be going back down. 45min later at its stable at 2439 files

needing healed and glusterfsd is thrashing the CPU's on that node

(1000%!)

The glfsheal log has no entries at all.

Previously (3.7.x) when I've done this test, heals kicked in very rapidly.

At three hours later, still no progress in heal at all. VM's on other

nodes getting occasional read timeouts.

heal-count = 2550, and not changing.

--

Lindsay

_______________________________________________

Gluster-users mailing list

Gluster-users@xxxxxxxxxxx

http://www.gluster.org/mailman/listinfo/gluster-users

_______________________________________________
Gluster-users mailing list
Gluster-users@xxxxxxxxxxx
http://www.gluster.org/mailman/listinfo/gluster-users