Re: clustered file system of choice

"Simon Billis" <simon@xxxxxxxxxx> · Thu, 17 Jun 2010 15:41:02 +0100

Boris Epstein sent a missive on 2010-06-16:

> Hi all,
> 
> I am just trying to consider my options for storing a large mass of
> data (tens of terrabytes of files) and one idea is to build a
> clustered FS of some kind. Has anybody had any experience with that?
> Any recommendations?
> 
> Thanks in advance for any and all advice.

Take a look at hadoop http://hadoop.apache.org and specifically HDFS (hadoop
distributed file system) http://hadoop.apache.org/hdfs/ I've used it in
conjunction with nutch across 20 odd servers (circa 10TB). When I used it
the down side was a single metadata node, but this may have changed by now.
The data is stored redundantly across the nodes and doesn't seem to require
any special hardware (I ran it on dell 1425's).

HTH

Simon.

_______________________________________________
CentOS mailing list
CentOS@xxxxxxxxxx
http://lists.centos.org/mailman/listinfo/centos