As your issue is Network, consider changing the MTU if the infrastructure is allowing it.
The tuned profiles are very important, as they control ratios for dumping data in memory to disk (this case gluster over network). You want to avoid keeping a lot of data in client's memory(in this case the gluster server), just to unleash it over network.
These 2 can be implemented online and I do not expect any issues.
Filesystem of bricks is important because the faster they soak data, the faster gluster can take more.
Of course, you need to reproduce it in test.
Also consider checking if there is any kind of backup running on the bricks. I have seen too many 'miracles' :D
Best Regards,
Strahil Nikolov
On Jan 8, 2020 01:03, David Cunningham <dcunningham@xxxxxxxxxxxxx> wrote:
Hi Strahil,Thanks for that. The queue/scheduler file for the relevant disk reports "noop [deadline] cfq", so deadline is being used. It is using ext4, and I've verified that the MTU is 1500.We could change the filesystem from ext4 to xfs, but in this case we're not looking to tinker around the edges and get a small performance improvement - we need a very large improvement on the 114MBps of network traffic to make it usable.I think what we really need to do first is to reproduce the problem in testing, and then come back to possible solutions.On Tue, 7 Jan 2020 at 22:15, Strahil Nikolov <hunter86_bg@yahoo.com> wrote:To find the scheduler , find all pvs of the LV is providing your storage[root@ovirt1 ~]# df -Th /gluster_bricks/data_fast
Filesystem Type Size Used Avail Use% Mounted on
/dev/mapper/gluster_vg_nvme-gluster_lv_data_fast xfs 100G 39G 62G 39% /gluster_bricks/data_fast[root@ovirt1 ~]# pvs | grep gluster_vg_nvme
/dev/mapper/vdo_nvme gluster_vg_nvme lvm2 a-- <1024.00g 0[root@ovirt1 ~]# cat /etc/vdoconf.yml
####################################################################
# THIS FILE IS MACHINE GENERATED. DO NOT EDIT THIS FILE BY HAND.
####################################################################
config: !Configuration
vdos:vdo_nvme: !VDOServicedevice: /dev/disk/by-id/nvme-ADATA_SX8200PNP_2J1120011596[root@ovirt1 ~]# ll /dev/disk/by-id/nvme-ADATA_SX8200PNP_2J1120011596
lrwxrwxrwx. 1 root root 13 Dec 17 20:21 /dev/disk/by-id/nvme-ADATA_SX8200PNP_2J1120011596 -> ../../nvme0n1
[root@ovirt1 ~]# cat /sys/block/nvme0n1/queue/scheduler
[none] mq-deadline kyberNote: If device is under multipath , you need to check all paths (you can get them from 'multipath -ll' command).The only scheduler you should avoid is "cfq" which was default for RHEL 6 & SLES 11.XFS has better performance that ext-based systems.Another tuning is to use Red hat's tuned profiles for gluster. You can extract them from (or newer if you find) ftp://ftp.redhat.com/redhat/linux/enterprise/7Server/en/RHS/SRPMS/redhat-storage-server-3.4.2.0-1.el7rhgs.src.rpmAbout MTU - it's reducing the ammount of packages that the kernel has to process - but requires infrastructure to support that too. You can test by setting MTU on both sides to 9000 and then run 'tracepath remote-ip'. Also run a ping with large size without do not fragment flag -> 'ping -M do -s 8900 <
________ Community Meeting Calendar: APAC Schedule - Every 2nd and 4th Tuesday at 11:30 AM IST Bridge: https://bluejeans.com/441850968 NA/EMEA Schedule - Every 1st and 3rd Tuesday at 01:00 PM EDT Bridge: https://bluejeans.com/441850968 Gluster-users mailing list Gluster-users@xxxxxxxxxxx https://lists.gluster.org/mailman/listinfo/gluster-users