webserver setup with 6 nodes

freedman at FreeFormIT.com (Keith Freedman) · Tue, 10 Mar 2009 04:08:47 -0700

At 03:11 AM 3/10/2009, Christian Meisinger wrote:
>hi there.
>
>
>i've 6 load-balanced webservers running with apache 2.0.
>right now we use unison to upload file changes from the developers to server 1
>and then 'sync' those changes to 2, 3 andsoon.
>
>additionally if a file is created on one of the servers (like a 
>temporary download created with php)
>we 'sync' that file with scp. for safety there are unisons running 
>every 10mins on some servers
>to ensure all files are available on every server.
>
>right now it's not a problem with only 6 servers but i'm pretty sure 
>it will become a big problem
>with more servers...
>
>
>so i started some tests GlusterFS 2.0.0rc2.
>right now i think two setups may suit us:
>
>layout 1:
>   node1-replicate-node2 \
>   node3-replicate-node4 - --> distribute  ( like raid0 over 3 x raid1 )
>   node5-replicate-node6 /
>
>or
>
>layout 2:
>   node1-replicate-node2-replicate-node3 \
>   node4-replicate-node5-replicate-node6 - --> distribute  ( like 
> raid0 over 2 x raid5 :) )
>
>i think layout 1 should be ok, because until now i've never seen 2 
>servers crash at the same time.

I'd definitely recommend layout 1 over layout 2.  the more nodes in 
an AFR brick the more performance issues you may have.

>my problem is: what happens if we get additional servers?

if you add them in pairs, you're in good shape, otherwise, you need 
to do a 3-node afr brick until you have an even number of servers 
then do something different.

>i've tested layout 1 with 4 nodes, shutdown GLFS, added 
>node5-replicate-node6 to the distrubite translator and started
>again. files were ok but i got alot of GLFS errors in the log files.
>can i safely ignore them?

it depends on which errors they are..  you'd have to use your judgement.
whenever I know a node has failed in an AFR pair, I typicall turn on 
"favorite-child" in the other one, remount, then bring up the down 
server.  then run ls -lR  on the server that didn't crash.
once done, remount without favorite child.   but this way you don't 
get the i/o errors (although I think those are fixed in Rc3 or 4) you 
might be seeing.

>i know the manual says: use distribute for fresh installations else unify.
>still i'm curious if it would work with distribute.
>
>btw. for GlusterFS version 2.0 is AFR = replicate?

yes