On 10/24/2014 08:21 AM, Xu (Simon) Chen wrote:
Hey folks, I am trying to enable OpenStack to use RBD as image backend: https://bugs.launchpad.net/nova/+bug/1226351 For some reason, nova-compute segfaults due to librados crash: ./log/SubsystemMap.h: In function 'bool ceph::log::SubsystemMap::should_gather(unsigned int, int)' thread 7f1b477fe700 time 2014-10-24 03:20:17.382769 ./log/SubsystemMap.h: 62: FAILED assert(sub < m_subsys.size()) ceph version 0.80.5 (38b73c67d375a2552d8ed67843c8a65c2c0feba6) 1: (()+0x42785) [0x7f1b4c4db785] 2: (ObjectCacher::flusher_entry()+0xfda) [0x7f1b4c53759a] 3: (ObjectCacher::FlusherThread::entry()+0xd) [0x7f1b4c54a16d] 4: (()+0x6b50) [0x7f1b6ea93b50] 5: (clone()+0x6d) [0x7f1b6df3e0ed] NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this. terminate called after throwing an instance of 'ceph::FailedAssertion' Aborted I feel that there is some concurrency issue, since this sometimes happen before and sometimes after this line: https://github.com/openstack/nova/blob/master/nova/virt/libvirt/rbd_utils.py#L208 Any idea what are the potential causes of the crash? Thanks. -Simon
This is http://tracker.ceph.com/issues/8912, fixed in the latest firefly and dumpling releases. _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com