Re: Hadoop on ceph

2012/3/17 Greg Farnum <gregory.farnum@xxxxxxxxxxxxx>:
> On Friday, March 16, 2012 at 2:24 PM, Andrey Stepachev wrote:
>> 2012/3/16 Noah Watkins <jayhawk@xxxxxxxxxxx (mailto:jayhawk@xxxxxxxxxxx)>:
>> >
>> > On Mar 16, 2012, at 8:37 AM, Sage Weil wrote:
>> >
>> > > Hi Andrey,
>> > >
>> > > On Fri, 16 Mar 2012, Andrey Stepachev wrote:
>> > >
>> > > possible). I take it TestDFSIO is a standard Hadoop benchmark?
>> >
>> > Yes, it is. There are a number of benchmarks that ship with Hadoop. Although this is untested, one reason you might be seeing throughput issues is the standard read/write interface, which copies bytes across the JNI boundary. On the short list for the next set of Java wrappers is using the ByteBuffer interface (NIO) to avoid this copying.
>>
>> I'm not sure the problem is on the Java side. All disks are loaded at 100%, so
>> I think the problem is clearly on the OSD side. But I want to test your new
>> integration and see if anything changes. You may be right and I may be wrong.
>
>
> Those are some awfully slow disks.
Replication was 3.

> I don't know exactly what this test measures, but if you're write-constrained on the HDFS side then Ceph will definitely be slower due to little things like the journaling that it does. And that is a data safety issue where Ceph is paying much higher costs than HDFS does.

This test runs 36 mappers on 6 hosts. Each mapper writes its own file
in parallel.
The same hosts also run the Ceph filesystem: 6 OSDs, 1 active MDS + 2 standby,
and 3 mons. The mappers were executed on those same hosts.
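
To put rough numbers on the journaling cost for this layout, here is a
back-of-the-envelope sketch. It is a model, not a measurement. It assumes
each OSD writes every byte twice (journal plus object store) with the journal
on the same disk as the data, that HDFS writes each replica once, and that
each host has a single data disk. None of these were verified on this
cluster, and the function names are only illustrative.

```python
# Rough model of raw disk writes generated per byte written by a mapper.
# ASSUMPTION: a Ceph OSD writes data once to its journal and once to the
# object store, and both live on the same disk.
# ASSUMPTION: HDFS writes each replica once, with no journal copy.
# ASSUMPTION: one data disk per host.

MAPPERS = 36      # mappers in the test
HOSTS = 6         # hosts, one OSD each
REPLICATION = 3   # replication factor used in the test


def raw_writes_per_byte(replication: int, journal_copies: int) -> int:
    """Bytes hitting disks cluster-wide for each byte a client writes."""
    return replication * (1 + journal_copies)


def per_disk_load(bytes_per_mapper: float, replication: int,
                  journal_copies: int) -> float:
    """Bytes written to each disk if load spreads evenly over all hosts."""
    total = MAPPERS * bytes_per_mapper * raw_writes_per_byte(
        replication, journal_copies)
    return total / HOSTS


hdfs = raw_writes_per_byte(REPLICATION, journal_copies=0)
ceph = raw_writes_per_byte(REPLICATION, journal_copies=1)
print(f"HDFS: {hdfs}x, Ceph with on-disk journal: {ceph}x, "
      f"ratio {ceph / hdfs:.1f}x")
```

Under these assumptions the journal alone would at most double the raw write
volume per disk. This is only an upper bound on the journaling cost, not an
explanation of the measured difference.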

> But it doesn't mean that Ceph is necessarily slower on good hardware. :)

I don't mean that Ceph is slower in general.
The test shows that Hadoop on Ceph is 1.5 times slower than Hadoop on HDFS.
Hadoop has a specific write workload, and on that workload Hadoop on HDFS
shines right now.
Reads are also better with HDFS, since Hadoop can read locally from a replica.

> -Greg
>
>



-- 
Andrey.
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

