Re: Spurious disconnections / connectivity loss

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Daniel Maher wrote:
Gordan Bobic wrote:

That's hardly unexpected. If you are using client-side replicate, I'd expect to see the bandwidth requirements multiply with the number of replicas. For all clustered configurations (not limited to glfs) I use a separate LAN for cluster communication to ensure best possible throughput/latencies, and specifically in case of glfs, I do server side replicate so that the replicate traffic gets offloaded to that private cluster LAN, so the bandwidth requirements to the clients can be kept down to sane levels.

If you're willing to describe your setup further, i'd love to hear about it. I'm currently using client-side replication, and for the reasons of scalability you described, i'd like to move to a server-side replication setup.

In particular, i'm interested in how you handle the connectivity between the clients and the servers vis-à-vis load balancing (if any) and availability. For example, in your configuration, how does a given client « know » which server to speak to, and what happens that server becomes inaccessible ?

Since client connections persist, you could use something as naive as DNS round-robin load balancing - it'll do a good enough job in most cases, if you have lots of clients. In terms of fail-over, that's more tricky. NFS over UDP (with unfsd) works relatively gracefully with just failing over the IP to one of the surviving servers (use something like RedHat Cluster to handle the IP resource fail-over). Unfortunately, glfs protocol itself doesn't handle disconnects gracefully - you just end up with a "transport end point not connected" and you have to umount+remount to get the volume back, which is messy and most definitely not transparent. The obvious disadvantage of unfsd is performance (and it's pretty dire, no two ways about it), although as I mentioned in a thread here a while back, glfs protocol for client connection doesn't seem to yield noticable benefits over unfsd, due to it's own fuse overheads.

Performance translators help a lot, but unfortunately, last time I tested, they destabilized things too much and I had to remove them.

I realise that this may not be necessarily be appropriate discussion for the devel mailing list, but iirc, you're not on the user list, hence the reply here.

Yeah, I should probably sign up to the users list at some point. Most of my posts are about possible bug reports which wouldn't be particularly useful on the users list so I never bothered signing up to it.

Gordan




[Index of Archives]     [Gluster Users]     [Ceph Users]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Security]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux