sorry for delay reply. Please have a try https://github.com/ceph/ceph/commit/ae4a8162eacb606a7f65259c6ac236e144bfef0a. 2015-12-21 0:10 GMT+08:00 Willem Jan Withagen <wjw@xxxxxxxxxxx>: > Hi, > > Most of the Ceph is getting there in the most crude and rough state. > So beneath is a status update on what is not working for me jet. > > Especially help with the aligment problem in os/FileJournal.cc would be > appricated... It would allow me to run ceph-osd and run more tests to > completion. > > What would happen if I comment out this test, and ignore the fact that > thing might be unaligned? > Is it a performance/paging issue? > Or is data going to be corrupted? > > --WjW > > PASS: src/test/run-cli-tests > ============================================================================ > Testsuite summary for ceph 10.0.0 > ============================================================================ > # TOTAL: 1 > # PASS: 1 > # SKIP: 0 > # XFAIL: 0 > # FAIL: 0 > # XPASS: 0 > # ERROR: 0 > ============================================================================ > > gmake test: > ============================================================================ > Testsuite summary for ceph 10.0.0 > ============================================================================ > # TOTAL: 119 > # PASS: 95 > # SKIP: 0 > # XFAIL: 0 > # FAIL: 24 > # XPASS: 0 > # ERROR: 0 > ============================================================================ > > The folowing notes can be made with this: > 1) the run-cli-tests run to completion because I excluded the RBD tests > 2) gmake test has the following tests FAIL: > FAIL: unittest_erasure_code_plugin > FAIL: ceph-detect-init/run-tox.sh > FAIL: test/erasure-code/test-erasure-code.sh > FAIL: test/erasure-code/test-erasure-eio.sh > FAIL: test/run-rbd-unit-tests.sh > FAIL: test/ceph_objectstore_tool.py > FAIL: test/test-ceph-helpers.sh > FAIL: test/cephtool-test-osd.sh > FAIL: test/cephtool-test-mon.sh > FAIL: test/cephtool-test-mds.sh > FAIL: test/cephtool-test-rados.sh > FAIL: test/mon/osd-crush.sh > FAIL: test/osd/osd-scrub-repair.sh > FAIL: test/osd/osd-scrub-snaps.sh > FAIL: test/osd/osd-config.sh > FAIL: test/osd/osd-bench.sh > FAIL: test/osd/osd-reactivate.sh > FAIL: test/osd/osd-copy-from.sh > FAIL: test/libradosstriper/rados-striper.sh > FAIL: test/test_objectstore_memstore.sh > FAIL: test/ceph-disk.sh > FAIL: test/pybind/test_ceph_argparse.py > FAIL: test/pybind/test_ceph_daemon.py > FAIL: ../qa/workunits/erasure-code/encode-decode-non-regression.sh > > Most of the fails are because ceph-osd crashed consistently on: > -1 journal bl.is_aligned(block_size) 0 > bl.is_n_align_sized(CEPH_MINIMUM_BLOCK_SIZE) 1 > -1 journal block_size 131072 CEPH_MINIMUM_BLOCK_SIZE 4096 > CEPH_PAGE_SIZE 4096 header.alignment 131072 > bl buffer::list(len=131072, buffer::ptr(0~131072 0x805319000 in raw > 0x805319000 len 131072 nref 1)) > os/FileJournal.cc: In function 'void FileJournal::align_bl(off64_t, > bufferlist &)' thread 805217400 time 2015-12-19 13:43:06.706797 > os/FileJournal.cc: 1045: FAILED assert(0 == "bl should be align") > > This is bugging me already for a few days, but I haven't found an easy > way to debug this, run it in gdb while being live or in post-mortum. > > Further: > A) unittest_erasure_code_plugin failes on the fact that there is a > different error code returned when dlopen-ing a non existent library. > load dlopen(.libs/libec_invalid.so): Cannot open > ".libs/libec_invalid.so"load dlsym(.libs/libec_missing_version.so, _ > _erasure_code_init): Undefined symbol > "__erasure_code_init"test/erasure-code/TestErasureCodePlugin.cc:88: Failure > Value of: instance.factory("missing_version", g_conf->erasure_code_dir, > profile, &erasure_code, &cerr) > Actual: -2 > Expected: -18 > load dlsym(.libs/libec_missing_entry_point.so, __erasure_code_init): > Undefined symbol "__erasure_code_init"erasure_co > de_init(fail_to_initialize,.libs): (3) No such processload > __erasure_code_init()did not register fail_to_registerload > : example erasure_code_init(example,.libs): (17) File existsload: > example [ FAILED ] ErasureCodePluginRegistryTest. > all (330 ms) > > B) ceph-detect-init/run-tox.sh failes on the fact that I need to work in > FreeBSD in the tests. > > C) ./gtest/include/gtest/internal/gtest-port.h:1358:: Condition > has_owner_ && pthread_equal(owner_, pthread_se > lf()) failed. The current thread is not holding the mutex @0x161ef20 > ./test/run-rbd-unit-tests.sh: line 9: 78053 Abort trap > (core dumped) unittest_librbd > > Which I think I found some commit comments about in either trac or git > about FreeBSD not being able to do things to its own thread. Got to look > into this. > > D) Fix some of the other python code to work as expected. > -- > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > the body of a message to majordomo@xxxxxxxxxxxxxxx > More majordomo info at http://vger.kernel.org/majordomo-info.html -- Regards, Xinze Chi -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html