On 08/26/2013 11:34 PM, Bobby Jacob wrote:
> Hi, I have the same issue. Sometimes, the files are not getting replicated between the bricks.
> Glusterd service and glusterfsd are running. When the files replication is not happening, these are the outcomes of the commands.

Probably a minor semantic issue, but files do not get replicated *between* the bricks. They're replicated *to* the bricks. Sometimes realizing that difference helps people diagnose their issue better.

> Service glusterfsd does not restart. It just stops when I try to restart it. Also self-heal commands says no heal required. I have to reboot the server for the file replication to start off.

You don't start glusterfsd. I assume you're trying "service glusterfsd start". That won't work. glusterd has to start glusterfsd or glusterfsd won't know what it's supposed to do.

To start the brick service if it's (for some reason) down, first check the brick log and find out why it's down. Correct the problem or file the bug report. Then, for the volume that the brick is missing from, "gluster volume start $vol force" will start the missing brick.

>
> Root> ps -ef | grep glusterfsd (gives me the following services running)
>
> root 1928 1 0 Aug12 ? 00:03:00 /usr/sbin/glusterfsd -s localhost --volfile-id gfscloud.KWTTESTGSNODE002.mnt-cloud_gfs -p /var/lib/glusterd/vols/gfscloud/run/KWTTESTGSNODE002-mnt-cloud_gfs.pid -S /tmp/216f54af381565662c717863ab3b6312.socket --brick-name /mnt/cloud_gfs -l /var/log/glusterfs/bricks/mnt-cloud_gfs.log --xlator-option *-posix.glusterd-uuid=684855a9-5208-4480-a52b-f2bc8caeb992 --brick-port 24011 --xlator-option gfscloud-server.listen-port=24011
>
> Root> gluster volume status.
>
> Status of volume: gfscloud
> Gluster process                                  Port    Online  Pid
> ------------------------------------------------------------------------------
> Brick KWTTESTGSNODE002:/mnt/cloud_gfs            24011   Y       1928
> Brick ZAJILTESTGSNODE001:/mnt/cloud_gfs          24011   Y       1588
> NFS Server on localhost                          38467   Y       1934
> Self-heal Daemon on localhost                    N/A     Y       1940
> NFS Server on ZAJILTESTGSNODE001                 38467   Y       1611
> Self-heal Daemon on ZAJILTESTGSNODE001           N/A     Y       1620
>
>
> Thanks & Regards,
> Bobby Jacob
>
> -----Original Message-----
> From: gluster-users-bounces at gluster.org [mailto:gluster-users-bounces at gluster.org] On Behalf Of Vijay Bellur
> Sent: Saturday, July 27, 2013 6:35 PM
> To: Paulo Silva
> Cc: gluster-users at gluster.org
> Subject: Re: Healing in glusterfs 3.3.1
>
> On 07/24/2013 07:27 PM, Paulo Silva wrote:
>> Hi,
>>
>> I have a glusterfs 3.3.1 setup with 2 servers and a replicated volume
>> used by 4 clients.
>>
>> Sometimes from some clients I can't access some of the files. After I
>> force a full heal on the brick I see several files healed. Is this
>> behavior normal?
>>
> With 3.3.1, healing happens in two modes:
>
> a) index heal - based on changes recorded in indices on bricks. The indices are consumed by self-heal daemons on each server for performing healing and is the default mode of operation.
>
> b) full heal - works through a full crawl of the volume. In this mode, self-heal daemon inspects changelogs of all files and directories it encounters during a crawl. This mode is triggered when volume heal full command is used.
>
> Normally, a) is faster and addresses most healing cases. Do you have more details on files that require a full heal to be performed for healing completion?
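
For reference, the two modes described in the quoted text map onto the heal subcommands roughly as sketched below. This is only an illustration, assuming the volume is named gfscloud as in the status output above. heal_cmd is a hypothetical helper, not part of gluster: it prints the command rather than running it, so the command can be checked first.

```shell
#!/bin/sh
# heal_cmd is a hypothetical helper (not shipped with gluster).
# It prints the heal command for a volume and mode instead of executing it.
heal_cmd() {
    vol=$1
    mode=${2:-index}   # index heal is the default mode
    case "$mode" in
        index) echo "gluster volume heal $vol" ;;       # heal what the brick indices recorded
        full)  echo "gluster volume heal $vol full" ;;  # crawl the whole volume
        info)  echo "gluster volume heal $vol info" ;;  # list entries still waiting for heal
        *)     echo "unknown mode: $mode" >&2; return 1 ;;
    esac
}

heal_cmd gfscloud index
heal_cmd gfscloud full
heal_cmd gfscloud info
```

Running the "info" form before and after a heal is one way to see whether the index heal actually picked the files up, which speaks to the question about files that seem to need a full heal.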
>
> Regards,
> Vijay
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> http://supercolony.gluster.org/mailman/listinfo/gluster-users
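
To make the "find the missing brick first" step concrete, here is a minimal sketch that picks out bricks whose Online column is not "Y" from "gluster volume status" output. offline_bricks is a hypothetical helper, and the sample row marked "N" is invented for illustration; in the status output quoted above both bricks are online.

```shell
#!/bin/sh
# offline_bricks is a hypothetical helper: it reads "gluster volume status"
# output on stdin and prints each brick whose Online column is not "Y".
# Online is the second-to-last field; the last field is the Pid.
offline_bricks() {
    awk '$1 == "Brick" && $(NF-1) != "Y" { print $2 }'
}

# Sample rows in the layout shown in this thread.
# The second row (Online = N) is made up for the example.
sample='Brick KWTTESTGSNODE002:/mnt/cloud_gfs 24011 Y 1928
Brick ZAJILTESTGSNODE001:/mnt/cloud_gfs 24011 N N/A'

printf '%s\n' "$sample" | offline_bricks
```

On a live server the input would come from "gluster volume status gfscloud | offline_bricks". If that prints a brick, read its log under /var/log/glusterfs/bricks/ to find out why it went down before running "gluster volume start gfscloud force".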