Re: sharding in glusterfs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 





On Wed, Sep 19, 2018 at 11:37 AM Ashayam Gupta <ashayam.gupta@xxxxxxxxxxxxxx> wrote:
Please find our workload details as requested by you :

* Only 1 write-mount point as of now
* Read-Mount : Since we auto-scale our machines this can be as big as 300-400 machines during peak times
* >" multiple concurrent reads means that Reads will not happen until the file is completely written to"  Yes , in our current scenario we can ensure that indeed this is the case.

But when you say it only supports single writer workload we would like to understand the following scenarios with respect to multiple writers and the current behaviour of glusterfs with sharding 
  • Multiple Writer writes to different files 
When I say multiple writers, I mean multiple mounts. Since you were saying earlier there is only one mount which does all writes, everything should work as expected.
  • Multiple Writer writes to same file 
    • they write to same file but different shards of same file
    • they write to same file (no gurantee if they write to different shards)
As long as the above happens from same mount, things should be fine. Otherwise there could be problems.
 
There might be some more cases which are known to you , would be helpful if you can describe us about those scenarios as well or may point us to the relevant documents.
Also it would be helpful if you can suggest the most stable version of glusterfs with sharding feature to use , since we would like to use this in production.

It has been stable for a while, so use any of the latest maintained releases like 3.12.x or 4.1.x

As I was mentioning already, sharding is mainly tested with VM/gluster-block workloads. So there could be some corner cases with single writer workload which we never ran into for the VM/block workloads we test. But you may run into them. Do let us know and we can take a look if you find something out of the ordinary. What I would suggest is to use one of the maintained releases and run the workloads you have for some time to test things out, once you feel confident, you can put it in production.

HTH

Thanks
Ashayam Gupta

On Tue, Sep 18, 2018 at 11:00 AM Pranith Kumar Karampuri <pkarampu@xxxxxxxxxx> wrote:


On Mon, Sep 17, 2018 at 4:14 AM Ashayam Gupta <ashayam.gupta@xxxxxxxxxxxxxx> wrote:
Hi All,

We are currently using glusterfs for storing large files with write-once and multiple concurrent reads, and were interested in understanding one of the features of glusterfs called sharding for our use case.

So far from the talk given by the developer [https://www.youtube.com/watch?v=aAlLy9k65Gw] and the git issue [https://github.com/gluster/glusterfs/issues/290] , we know that it was developed for large VM images as use case and the second link does talk about a more general purpose usage , but we are not clear if there are some issues if used for non-VM image large files [which is the use case for us].

Therefore it would be helpful if we can have some pointers or more information about the more general use-case scenario for sharding and any shortcomings if any , in case we use it for our scenario which is non-VM large files with write-once and multiple concurrent reads.Also it would be very helpful if you can suggest the best approach/settings for our use case scenario.

Sharding is developed for Big file usecases and at the moment only supports single writer workload. I also added the maintainers for sharding to the thread. May be giving a bit of detail about access pattern w.r.t. number of mounts that are used for writing/reading would be helpful. I am assuming write-once and multiple concurrent reads means that Reads will not happen until the file is completely written to. Could you explain  a bit more about the workload?
 

Thanks
Ashayam Gupta
_______________________________________________
Gluster-users mailing list
Gluster-users@xxxxxxxxxxx
https://lists.gluster.org/mailman/listinfo/gluster-users


--
Pranith


--
Pranith
_______________________________________________
Gluster-users mailing list
Gluster-users@xxxxxxxxxxx
https://lists.gluster.org/mailman/listinfo/gluster-users

[Index of Archives]     [Gluster Development]     [Linux Filesytems Development]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux