Re: Introducing me, questions on general improvements in gluster re. latency and throughput

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Welcome first of all, glad to see interest in improving Gluster performance.
I believe there's a huge potential for improvements in different areas you can look into.
Certainly zero-copy (or just eliminating extra copies we have today in the code) would be most welcome.

We seem to have some lock contention in some areas of Gluster. Depending on the setup and workload, there might be different ones.
Certainly reducing some of them could lead to reduced latency and CPU consumption.

RDMA has been dropped as it was not properly tested and therefore maintained. I do believe there is value, but it is not a major bottleneck at the moment - 10g from a performance perspective provides a good enough alternative, and latency is still something we should work in the code.
The network stack for example, may be contributing to increased latency.

Removal of translators that are not needed in the stack might help performance as well. We have many translators, but few are needed in most cases.

Y.


On Thu, Jun 11, 2020 at 10:52 AM Federico Strati <strati.federico@xxxxxxxxx> wrote:
Dear All,

I just started working for a company named A3Cube, who produces HPC
supercomputers.

I was assigned the task to investigate which improvements to gluster are
viable

in order to lead to overall better performance in latency and throughput.

I'm quite new to SDS and so pardon me if some questions are naive.

 From what I've understood so far, possible bottlenecks are

in FUSE and transport.

Generally speaking, if you have time to just drop me some pointers,

1] FUSE + splice has never been considered (issue closed without real
discussions)

(probably because it conflicts with the general architecture and in
particular

with the write-behind translator)

Recently, it has been announced a new userspace fs kernel module, ZUFS,
whose aim

is to zero copy and improving vastly over FUSE: would you be interested

in investigating it ?

(ZUFS: https://github.com/NetApp/zufs-zuf ;
https://lwn.net/Articles/756625/)

2] Transport over RDMA (Infiniband) has been recently dropped:

may I ask you what considerations have been made ?

3] I would love to hear what you consider real bottlenecks in gluster

right now regarding latency and thruput.

Thanks in advance

Kind regards

Federico

_______________________________________________

Community Meeting Calendar:

Schedule -
Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC
Bridge: https://bluejeans.com/441850968




Gluster-devel mailing list
Gluster-devel@xxxxxxxxxxx
https://lists.gluster.org/mailman/listinfo/gluster-devel

_______________________________________________

Community Meeting Calendar:

Schedule -
Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC
Bridge: https://bluejeans.com/441850968




Gluster-devel mailing list
Gluster-devel@xxxxxxxxxxx
https://lists.gluster.org/mailman/listinfo/gluster-devel


[Index of Archives]     [Gluster Users]     [Ceph Users]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Security]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux