Hi sage, thanks for your help. i have built up the system and run postmark and iozone to get some performance data. my system contains two machine, one is set as monitor, mds and osd. another is set as mds and osd. each osd is a SSD with ext3. i compared this system with another system that only use a SSD with ext3 on a local machine without distributing data to other machines. to me, the performance of ceph is much poorer than a local environment. likely from mb/s to kb/s decrease...... is this normal, i read the ceph paper, seems that the performance is not that bad. i am a little confused. Thanks, Sheng On Wed, Oct 26, 2011 at 1:24 PM, Sage Weil <sage@xxxxxxxxxxxx> wrote: > On Wed, 26 Oct 2011, sheng qiu wrote: >> hi Sage, >> >> it's a file. but the file is on a separated device from the osd. > > Is it on NFS or something? Or maybe SELinux/AppArmor is mucking things > up? > > Try putting the journal on a raw block device, or a file on a local file > system, and you shouldn't see this. > > sage > > >> >> On Wed, Oct 26, 2011 at 11:38 AM, Sage Weil <sage@xxxxxxxxxxxx> wrote: >> > On Wed, 26 Oct 2011, sheng qiu wrote: >> >> thanks for the reply. i met a new problem. now i can mount the ceph >> >> using cfuse. and write small single file. but it will crash when file >> >> size increased to some number. the osd.log is: >> >> 2011-10-26 08:59:20.156983 7febdca06700 journal FileJournal::write_bl >> >> : write_fd failed: error 13: Permission denied >> > >> > We're getting an error when trying to write to the journal device. Is >> > that a file or a block device? I'm not sure what would cause a write >> > EACCES on an already-open file... >> > >> > And strace -f log might on the ceph-osd might tell us more. >> > >> > sage >> > >> >> 2011-10-26 08:59:20.157000 7febdca06700 journal FileJournal::do_write: >> >> write_bl(pos=10481664) failed >> >> os/FileJournal.cc: In function 'void >> >> FileJournal::do_write(ceph::bufferlist&)', in thread '0x7febdca06700' >> >> os/FileJournal.cc: 817: FAILED assert(0) >> >> ceph version 0.34 (commit:2f039eeeb745622b866d80feda7afa055e15f6d6) >> >> 1: (FileJournal::do_write(ceph::buffer::list&)+0x367) [0x6b61f7] >> >> 2: (FileJournal::write_thread_entry()+0x3e6) [0x6b97c6] >> >> 3: (FileJournal::Writer::entry()+0xd) [0x6a50dd] >> >> 4: (()+0x69ca) [0x7febe38119ca] >> >> 5: (clone()+0x6d) [0x7febe22f870d] >> >> ceph version 0.34 (commit:2f039eeeb745622b866d80feda7afa055e15f6d6) >> >> 1: (FileJournal::do_write(ceph::buffer::list&)+0x367) [0x6b61f7] >> >> 2: (FileJournal::write_thread_entry()+0x3e6) [0x6b97c6] >> >> 3: (FileJournal::Writer::entry()+0xd) [0x6a50dd] >> >> 4: (()+0x69ca) [0x7febe38119ca] >> >> 5: (clone()+0x6d) [0x7febe22f870d] >> >> *** Caught signal (Aborted) ** >> >> in thread 0x7febdca06700 >> >> ceph version 0.34 (commit:2f039eeeb745622b866d80feda7afa055e15f6d6) >> >> 1: ./cosd() [0x5c69d4] >> >> 2: (()+0xf8f0) [0x7febe381a8f0] >> >> 3: (gsignal()+0x35) [0x7febe2245a75] >> >> 4: (abort()+0x180) [0x7febe22495c0] >> >> 5: (__gnu_cxx::__verbose_terminate_handler()+0x115) [0x7febe2afb8e5] >> >> 6: (()+0xcad16) [0x7febe2af9d16] >> >> 7: (()+0xcad43) [0x7febe2af9d43] >> >> 8: (()+0xcae3e) [0x7febe2af9e3e] >> >> 9: (ceph::__ceph_assert_fail(char const*, char const*, int, char >> >> const*)+0x39f) [0x5cbb1f] >> >> 10: (FileJournal::do_write(ceph::buffer::list&)+0x367) [0x6b61f7] >> >> 11: (FileJournal::write_thread_entry()+0x3e6) [0x6b97c6] >> >> 12: (FileJournal::Writer::entry()+0xd) [0x6a50dd] >> >> 13: (()+0x69ca) [0x7febe38119ca] >> >> 14: (clone()+0x6d) [0x7febe22f870d] >> >> >> >> does any one have idea about this? >> >> >> >> Thanks, >> >> Sheng >> >> >> >> On Mon, Oct 24, 2011 at 2:02 PM, sheng qiu <herbert1984106@xxxxxxxxx> wrote: >> >> > hi, >> >> > >> >> > is there any way to debug ceph to find out where it crash? >> >> > >> >> > Thanks, >> >> > Sheng >> >> > >> >> > On Mon, Oct 24, 2011 at 11:55 AM, Tommi Virtanen >> >> > <tommi.virtanen@xxxxxxxxxxxxx> wrote: >> >> >> On Sun, Oct 23, 2011 at 12:22, sheng qiu <herbert1984106@xxxxxxxxx> wrote: >> >> >>> does ceph only support ext3 and btrfs? i mean if i have another file >> >> >>> system, it support xattr, but it does not support journals, can i >> >> >>> close the journal config and run it with ceph? >> >> >> >> >> >> Ceph supports OSD storage on POSIX-compliant filesystems with extended >> >> >> attributes. There are some edge cases that are still problematic, see >> >> >> the email with subject "ceph on non-btrfs file systems" that Sage just >> >> >> sent for a better discussion of those. >> >> >> >> >> >> I'm not sure what you mean by a filesystem "not supporting journals", >> >> >> the ceph-osd journal is just a file it writes to. I'm not sure what >> >> >> you mean by "closing the journal config"; you can disable the journal, >> >> >> but that will likely just make things slower; once again, using a >> >> >> journal is possible on any filesystem, btrfs just makes it more >> >> >> effective. >> >> >> >> >> >>> i tried to do that by modifying the ceph.conf file (close the journal >> >> >>> on osd). the mkcephfs seems ok, but when i start the service, cosd >> >> >>> will crash. >> >> >>> here's the log: >> >> >>> *** Caught signal (Aborted) ** >> >> >>> in thread 0x7f1845ce6720 >> >> >>> ceph version 0.34 (commit:2f039eeeb745622b866d80feda7afa055e15f6d6) >> >> >>> 1: ./cosd() [0x5c6304] >> >> >> ... >> >> >>> 9: (CrushWrapper::decode(ceph::buffer::list::iterator&)+0xac) [0x5611fc] >> >> >>> 10: (OSDMap::decode(ceph::buffer::list&)+0x8aa) [0x5622aa] >> >> >>> 11: (OSD::get_map(unsigned int)+0x221) [0x52f441] >> >> >> >> >> >> That sounds more like the osdmap is corrupt, somehow. Anyone else have >> >> >> more specific suggestions? >> >> >> >> >> > >> >> > >> >> > >> >> > -- >> >> > Sheng Qiu >> >> > Texas A & M University >> >> > Room 302 Wisenbaker >> >> > email: herbert1984106@xxxxxxxxx >> >> > College Station, TX 77843-3259 >> >> > >> >> >> >> >> >> >> >> -- >> >> Sheng Qiu >> >> Texas A & M University >> >> Room 302 Wisenbaker >> >> email: herbert1984106@xxxxxxxxx >> >> College Station, TX 77843-3259 >> >> -- >> >> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in >> >> the body of a message to majordomo@xxxxxxxxxxxxxxx >> >> More majordomo info at http://vger.kernel.org/majordomo-info.html >> >> >> >> >> >> >> >> -- >> Sheng Qiu >> Texas A & M University >> Room 302 Wisenbaker >> email: herbert1984106@xxxxxxxxx >> College Station, TX 77843-3259 >> -- >> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in >> the body of a message to majordomo@xxxxxxxxxxxxxxx >> More majordomo info at http://vger.kernel.org/majordomo-info.html >> >> -- Sheng Qiu Texas A & M University Room 302 Wisenbaker email: herbert1984106@xxxxxxxxx College Station, TX 77843-3259 -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html