Re: a union to two stripes to fourteen mirrors...

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Jerker Nyberg wrote:

Hi,

I'm trying out different configurations of GlusterFS. I have 7 nodes each with two 320 GB disks where 300 GB om each disk is for the distributed file system.

Each node is called N. Every file system is on the server side mirrored to the other disk on the next node, wrapped around so that the last node mirrors its disk to the first. invented. The real config is included in the end of this mail.

Pseudodefinitions:

fs(1) = a file system on the first disk
fs(2) = a file system on the second disk
n(I, fs(J)) = the fs J on node I
afr(N .. M) = mirror the volumes
stripe(N .. M) = stripe the volumes

Server:

Forw(N) = afr(fs(1), node(N+1, fs(2))
Back(N) = afr(fs(2), node(N-1, fs(1))

Client:

FStr(N .. M) = stripe(n(N, Forw(N)) .. n(N+i, Forw(N+1)) .. n(M, Forw(M)))
BStr(N .. M) = stripe(n(N, Back(N)) .. n(N+i, Back(N+1)) .. n(M, Back(M)))
mount /glusterfs = union(FStr(1 .. 7), BStr(1..7))



The goal was to get good performance but also redundancy. But this setup will not will it? The stripes will not work when a part of is gone and the union will not not magically find the other part of a file on the other stripe? And where to put the union namespace for good performance?

But my major question is this: I tried to stripe a single stripe (not using union on the client, just striping on the servers which in turn mirrored) When rsync'ing in data on it on a single server things worked fine, but when I put some load on it from the other nodes (dd'ing in and out some large files) the glusterfsd's on the first server died... Do you want me to check this up more and try to reproduce and narrow down the problem, or is this kind of setup in general not a good idea?

I don't think the developers would ever consider a situation where glusterfsd dies acceptable, so I'm sure they would want info on why it dies.

I can't seem to correlate your spec files with the layout you gave a above. My understanding of your spec files makes it look like this to me:

Server:

Forw(N) = afr(fs(1), node(N+1, fs(2))

Client:

FStr(N .. M) = stripe(n(1, Forw(1)) .. n(N, Forw(N)))


In either case you have a problem if you are looking for high availability, as the loss of any single node will reduce the cluster to an unusable state (unless striping does some stuff I don't know about). Since each AFR is defined on the main node itself, the client doesn't know it's actually putting files on node N and N+1, it's only talking to N. If N dies, the client has no way of knowing it can still read and write from the other AFR member, N+1. Moving the AFR config to the client would fix this.

Maybe that's why you had the Back(N) stuff, but I'm not sure I understand what you were trying to say with that... Were you trying to define the same afr share through two different servers with two different (but equivalent) configs?

--

-Kevan Benson
-A-1 Networks




[Index of Archives]     [Gluster Users]     [Ceph Users]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Security]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux