---------- Forwarded message ----------
From: Xinze Chi (信泽) <xmdxcxz@xxxxxxxxx>
Date: 2015-12-21 8:59 GMT+08:00
Subject: Re: FreeBSD Building and Testing
To: Willem Jan Withagen <wjw@xxxxxxxxxxx>

Please try again with this patch
https://github.com/XinzeChi/ceph/commit/f4d5bd01a2e498e850e3a43fb1233ba40d8d1781
and tell me which patch fixes the bug.

Thanks.

2015-12-21 8:45 GMT+08:00 Xinze Chi (信泽) <xmdxcxz@xxxxxxxxx>:
> Sorry for the delayed reply. Please try
> https://github.com/ceph/ceph/commit/ae4a8162eacb606a7f65259c6ac236e144bfef0a.
>
> 2015-12-21 0:10 GMT+08:00 Willem Jan Withagen <wjw@xxxxxxxxxxx>:
>> Hi,
>>
>> Most of Ceph is getting there, in a crude and rough state.
>> Below is a status update on what is not working for me yet.
>>
>> Help with the alignment problem in os/FileJournal.cc would be especially
>> appreciated. It would allow me to run ceph-osd and run more tests to
>> completion.
>>
>> What would happen if I comment out this test and ignore the fact that
>> things might be unaligned?
>> Is it a performance/paging issue?
>> Or is data going to be corrupted?
>>
>> --WjW
>>
>> PASS: src/test/run-cli-tests
>> ============================================================================
>> Testsuite summary for ceph 10.0.0
>> ============================================================================
>> # TOTAL: 1
>> # PASS: 1
>> # SKIP: 0
>> # XFAIL: 0
>> # FAIL: 0
>> # XPASS: 0
>> # ERROR: 0
>> ============================================================================
>>
>> gmake test:
>> ============================================================================
>> Testsuite summary for ceph 10.0.0
>> ============================================================================
>> # TOTAL: 119
>> # PASS: 95
>> # SKIP: 0
>> # XFAIL: 0
>> # FAIL: 24
>> # XPASS: 0
>> # ERROR: 0
>> ============================================================================
>>
>> The following notes can be made about this:
>> 1) run-cli-tests runs to completion because I excluded the RBD tests
>> 2) gmake test has the following tests FAIL:
>> FAIL: unittest_erasure_code_plugin
>> FAIL: ceph-detect-init/run-tox.sh
>> FAIL: test/erasure-code/test-erasure-code.sh
>> FAIL: test/erasure-code/test-erasure-eio.sh
>> FAIL: test/run-rbd-unit-tests.sh
>> FAIL: test/ceph_objectstore_tool.py
>> FAIL: test/test-ceph-helpers.sh
>> FAIL: test/cephtool-test-osd.sh
>> FAIL: test/cephtool-test-mon.sh
>> FAIL: test/cephtool-test-mds.sh
>> FAIL: test/cephtool-test-rados.sh
>> FAIL: test/mon/osd-crush.sh
>> FAIL: test/osd/osd-scrub-repair.sh
>> FAIL: test/osd/osd-scrub-snaps.sh
>> FAIL: test/osd/osd-config.sh
>> FAIL: test/osd/osd-bench.sh
>> FAIL: test/osd/osd-reactivate.sh
>> FAIL: test/osd/osd-copy-from.sh
>> FAIL: test/libradosstriper/rados-striper.sh
>> FAIL: test/test_objectstore_memstore.sh
>> FAIL: test/ceph-disk.sh
>> FAIL: test/pybind/test_ceph_argparse.py
>> FAIL: test/pybind/test_ceph_daemon.py
>> FAIL: ../qa/workunits/erasure-code/encode-decode-non-regression.sh
>>
>> Most of the failures are because ceph-osd consistently crashes on:
>> -1 journal bl.is_aligned(block_size) 0
>> bl.is_n_align_sized(CEPH_MINIMUM_BLOCK_SIZE) 1
>> -1 journal block_size 131072 CEPH_MINIMUM_BLOCK_SIZE 4096
>> CEPH_PAGE_SIZE 4096 header.alignment 131072
>> bl buffer::list(len=131072, buffer::ptr(0~131072 0x805319000 in raw
>> 0x805319000 len 131072 nref 1))
>> os/FileJournal.cc: In function 'void FileJournal::align_bl(off64_t,
>> bufferlist &)' thread 805217400 time 2015-12-19 13:43:06.706797
>> os/FileJournal.cc: 1045: FAILED assert(0 == "bl should be align")
>>
>> This has been bugging me for a few days already, but I haven't found an
>> easy way to debug it, either by running it live in gdb or post-mortem.
>>
>> Further:
>> A) unittest_erasure_code_plugin fails because a different error code is
>> returned when dlopen-ing a nonexistent library.
>> load dlopen(.libs/libec_invalid.so): Cannot open
>> ".libs/libec_invalid.so"load dlsym(.libs/libec_missing_version.so,
>> __erasure_code_init): Undefined symbol
>> "__erasure_code_init"test/erasure-code/TestErasureCodePlugin.cc:88: Failure
>> Value of: instance.factory("missing_version", g_conf->erasure_code_dir,
>> profile, &erasure_code, &cerr)
>> Actual: -2
>> Expected: -18
>> load dlsym(.libs/libec_missing_entry_point.so, __erasure_code_init):
>> Undefined symbol "__erasure_code_init"
>> erasure_code_init(fail_to_initialize,.libs): (3) No such processload
>> __erasure_code_init()did not register fail_to_registerload
>> : example erasure_code_init(example,.libs): (17) File existsload:
>> example [ FAILED ] ErasureCodePluginRegistryTest.all (330 ms)
>>
>> B) ceph-detect-init/run-tox.sh fails because I still need to work
>> FreeBSD into the tests.
>>
>> C) ./gtest/include/gtest/internal/gtest-port.h:1358:: Condition
>> has_owner_ && pthread_equal(owner_, pthread_self()) failed. The current
>> thread is not holding the mutex @0x161ef20
>> ./test/run-rbd-unit-tests.sh: line 9: 78053 Abort trap
>> (core dumped) unittest_librbd
>>
>> I think I found some commit comments about this in either trac or git,
>> about FreeBSD not being able to do things to its own thread. I have to
>> look into this.
>>
>> D) Fix some of the other Python code to work as expected.
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@xxxxxxxxxxxxxxx
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
>
>
> --
> Regards,
> Xinze Chi

--
Regards,
Xinze Chi