Re: Replica 2 cluster not replicating

Marcus <marcus.pedersen@xxxxxx> · Wed, 3 May 2017 08:47:03 +0200



    Some additional info:

    I read somewhere that there has been a problem when the directory
    structures are too deep.

    On my file servers there is a depth of 9-10 directories, is this
    still a problem?

    
    The error I find in the error log is:

    performing entry selfheal on 70764b1d-a022-4489-b7e1-df6073765d66

    remote operation failed. Path: (null)
    (00000000-0000-0000-0000-000000000000) [No data available]

    
    If I look in .glusterfs at the gfid on both machines I see that some
    files are missing on the second server.

    But the content of .glusterfs matches what is stored in the file
    structure, so there is no mismatch there.
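
For comparing a gfid on both servers, the following is a minimal sketch. It assumes the usual brick backend layout, in which a gfid is stored at .glusterfs/<first two hex characters>/<next two>/<full gfid>. The function name gfid_to_backend_path is hypothetical, and the brick root /vol in the usage comment is taken from the add-brick command quoted further down.

```shell
#!/bin/sh
# Print the location under BRICK_ROOT/.glusterfs where a gfid is expected.
# Assumption: standard backend layout .glusterfs/aa/bb/<gfid>, where aa and
# bb are the first and second pair of hex characters of the gfid.
# Usage (run on each server):
#   ls -l "$(gfid_to_backend_path /vol 70764b1d-a022-4489-b7e1-df6073765d66)"
gfid_to_backend_path() {
    brick=$1
    gfid=$2
    first=$(printf '%s' "$gfid" | cut -c1-2)
    second=$(printf '%s' "$gfid" | cut -c3-4)
    printf '%s/.glusterfs/%s/%s/%s\n' "$brick" "$first" "$second" "$gfid"
}
```

Running ls -l on the printed path on both servers shows whether the entry exists on each brick. For a directory the entry is normally a symlink, for a regular file a hard link.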

    
    I have written a script that runs ls on all files in the
    structure on a client.

    This triggers a heal and the servers are busy working.

    I am not done yet as this takes a while.

    But this is not the way you are supposed to heal, right?
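
A script of the kind described above could look roughly like this. It is a sketch only, not the script actually used: the function name lookup_all is hypothetical, and the mount point /mymount in the usage comment is the example path from the quoted message below. It uses stat instead of ls, on the assumption that any lookup through the client has the same effect.

```shell
#!/bin/sh
# Stat every file and directory below a client mount point so that each
# entry is looked up through the Gluster client.
# Usage: lookup_all /mymount
lookup_all() {
    # "-exec ... {} +" passes many paths to each stat call instead of
    # starting one process per file; the stat output itself is discarded.
    find "$1" -exec stat {} + > /dev/null
}
```

The function returns a non-zero status if find or stat reports an error, for example when the mount point does not exist.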

    
    Best regards

    Marcus

    
    On 02/05/17 11:12, Marcus wrote:

    
    Hi all!
      

      I have set up a replicated gluster cluster on two identical
      machines with replica 2.
      

      I run CentOS 7 and gluster version 3.8.11.
      

      I started out by creating a distributed single-node gluster
      brick.
      

      When I created the brick there was already about 11 TB of data in
      the directory.
      

      We used the distributed brick for a couple of weeks before I had
      the other server.
      

      I configured the second server and created replicated bricks,
      where the second server contained no data.
      

      gluster volume add-brick vol replica 2 transport tcp
      192.168.1.2:/vol
      

      gluster volume add-brick vol2 replica 2 transport tcp
      192.168.1.2:/vol2
      

      Everything went fine: files started to replicate and the second
      server started to be populated with data.
      

      Now, two weeks later, they are still out of sync; the second server
      is still missing data.
      

      If I run: gluster volume heal vol info
      

      There are still thousands of files that need to be synced from the
      first server.
      

      Most of the files are shown as gfids and not as file names.
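
To put a number on how many pending entries are listed as bare gfids and how many as paths, the heal info output can be piped through a small filter. This is a sketch that assumes each pending entry appears on its own line, either as <gfid:...> or as a path starting with /. The function name count_heal_entries is hypothetical.

```shell
#!/bin/sh
# Read "gluster volume heal VOL info" output on stdin and count entries
# listed as bare gfids versus entries listed as paths.
# Assumption: one entry per line, "<gfid:...>" or a path beginning with "/".
# Usage: gluster volume heal vol info | count_heal_entries
count_heal_entries() {
    awk '
        /^<gfid:/ { gfid++; next }   # entry shown only by its gfid
        /^\//     { path++; next }   # entry shown with a resolved path
        END       { printf "gfid=%d path=%d\n", gfid, path }
    '
}
```

The counts are summed over all bricks in the output. Lines such as the brick name, status and "Number of entries" are ignored.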
      

      I have tried to run: gluster volume heal vol
      

      and: gluster volume heal vol full
      

      It reports back that it starts successfully.
      

      I have rebooted both machines.
      

      Still same status.
      

      If I go to a client and use ls on specific files that are not in
      sync they become synchronized.
      

      As in:
      

      ls /mymount/mydir/file1
      

      ls /mymount/mydir/file2
      

      Both file1 and file2 get synchronized.
      

      But I cannot do that on thousands of files, right?
      

      How do I get these bricks synchronized and up and running as exact
      replicas?
      

      Many thanks in advance!
      

      Best regards
      

      Marcus
      

      _______________________________________________
      

      Gluster-users mailing list
      

      Gluster-users@xxxxxxxxxxx
      

      http://lists.gluster.org/mailman/listinfo/gluster-users
      

    -- 

      
 Marcus Pedersén

      System administrator

      
              Interbull Centre

              Department of Animal Breeding & Genetics — SLU

              Box 7023, SE-750 07

              Uppsala, Sweden

              
              Visiting address:

              Room 55614, Ulls väg 26, Ultuna

              Uppsala

              Sweden

              
              Tel: +46-(0)18-67 1962

            