Re: Slow clients after git pull

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello,

Our problem is that the webpage is on a autoscaling group, so the created machine is not always updated and needs to have the latest data always. I've tried several ways to do it:
  • Local Storage synced: Sometimes the sync fails and data is not updated
  • NFS: If NFS server goes down, all clients die
  • Two NFS Server synced+Monit: when a NFS server is down umount freezes and is not able to change to the other NFS server
  • GlusterFS: Too slow for webpages
CephFS is near to NFS on speed and have auto recovery if one node goes down (clients connects to other MDS automatically).

About to use RBD, my problem is that I need a FS, because Nginx is not able to read directly from Ceph in other ways.
About S3 and similar, I've also tried AWS NFS method but is much slower (even more than GlusterFS).

My problem is that CephFS fits what I need.

Doing tests I've noticed that maybe the file is updated on ceph node while client has file sessions open, so until I remount the FS that sessions continue opened. When I open the files with vim I notice that is a bit slower while is updating the repository, but after the update it works as fast as before.

It fails even on Jewel so I think that maybe the only way to do it is to create a task to remount the FS when I deploy.

Greetings and thanks!!


2018-03-01 15:29 GMT+01:00 David Turner <drakonstein@xxxxxxxxx>:
Using CephFS for something like this is about the last thing I would do.  Does it need to be on a networked posix filesystem that can be mounted on multiple machines at the same time?  If so, then you're kinda stuck and we can start looking at your MDS hardware and see if there are any MDS settings that need to be configured differently for this to work.

If you don't NEED CephFS, then I would recommend utilizing an RBD for something like this.  Its limitation is only being able to be mapped to 1 server at a time, but that's decent enough for most failover scenarios for build setups.  If you need to failover, unmap it from the primary and map it to another server to resume workloads.

Hosting websites out of CephFS also seems counter-intuitive.  Have you looked at S3 websites?  RGW supports configuring websites out of a bucket that might be of interest.  Your RGW daemon configuration could easily become an HA website with an LB in front of them.

I'm biased here a bit, but I don't like to use networked filesystems unless nothing else can be worked out or the software using it is 3rd party and just doesn't support anything else.

On Thu, Mar 1, 2018 at 9:05 AM Daniel Carrasco <d.carrasco@xxxxxxxxx> wrote:
Hello,

I've tried to change a lot of things on configuration and use ceph-fuse but nothing makes it work better... When I deploy the git repository it becomes much slower until I remount the FS (just executing systemctl stop nginx && umount /mnt/ceph && mount -a && systemctl start nginx). It happen when the FS gets a lot of IO because when I execute Rsync I got the same problem.

I'm thinking about to downgrade to a lower version of ceph like for example jewel to see if works better. I know that will be deprecated soon, but I don't know what other tests I can do...

Greetings!!

2018-02-28 17:11 GMT+01:00 Daniel Carrasco <d.carrasco@xxxxxxxxx>:
Hello,

I've created a Ceph cluster with 3 nodes and a FS to serve a webpage. The webpage speed is good enough (near to NFS speed), and have HA if one FS die.
My problem comes when I deploy a git repository on that FS. The server makes a lot of IOPS to check the files that have to update and then all clients starts to have problems to use the FS (it becomes much slower).
In a normal usage the web takes about 400ms to load, and when the problem start it takes more than 3s. To fix the problem I just have to remount the FS on clients, but I can't remount the FS on every deploy...

While is deploying I see how the CPU on MDS is a bit higher, but when it ends the CPU usage goes down again, so look like is not a problem of CPU.

My config file is:
[global]
fsid = bf56854......e611c08
mon_initial_members = fs-01, fs-02, fs-03
mon_host = 10.50.0.94,10.50.1.216,10.50.2.52
auth_cluster_required = cephx
auth_service_required = cephx
auth_client_required = cephx

public network = 10.50.0.0/22
osd pool default size = 3

##
### OSD
##
[osd]
  osd_mon_heartbeat_interval = 5
  osd_mon_report_interval_max = 10
  osd_heartbeat_grace = 15
  osd_fast_fail_on_connection_refused = True
  osd_pool_default_pg_num = 128
  osd_pool_default_pgp_num = 128
  osd_pool_default_size = 2
  osd_pool_default_min_size = 2

##
### Monitores
##
[mon]
  mon_osd_min_down_reporters = 1

##
### MDS
##
[mds]
  mds_cache_memory_limit = 792723456
  mds_bal_mode = 1  

##
### Client
##
[client]
  client_cache_size = 32768
  client_mount_timeout = 30
  client_oc_max_objects = 2000
  client_oc_size = 629145600
  client_permissions = false
  rbd_cache = true
  rbd_cache_size = 671088640

My cluster and clients uses Debian 9 with latest ceph version (12.2.4). The clients uses kernel modules to mount the share, because are a bit faster than fuse modules. The deploy is done on one of the Ceph nodes, that have the FS mounted by kernel module too.
My cluster is not a high usage cluster, so have all daemons on one machine (3 machines with OSD, MON, MGR and MDS). All OSD has a copy of the data, only one MGR is active and two of the MDS are active with one on standby. The clients mount the FS using the three MDS IP addresses and just now don't have any request because is not published.

Someone knows what can be happening?, because all works fine (even on other cluster I did with an high load), but just deploy the git repository and all start to work very slow.

Thanks!!


--
_________________________________________

      Daniel Carrasco Marín
      Ingeniería para la Innovación i2TIC, S.L.
      Tlf:  +34 911 12 32 84 Ext: 223
      www.i2tic.com
_________________________________________



--
_________________________________________

      Daniel Carrasco Marín
      Ingeniería para la Innovación i2TIC, S.L.
      Tlf:  +34 911 12 32 84 Ext: 223
      www.i2tic.com
_________________________________________
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



--
_________________________________________

      Daniel Carrasco Marín
      Ingeniería para la Innovación i2TIC, S.L.
      Tlf:  +34 911 12 32 84 Ext: 223
      www.i2tic.com
_________________________________________
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux