Thanks. I'm glad to hear it is fixed in the new version. Wiping the OSD worked.

On Wed, Feb 13, 2013 at 2:08 PM, Mike Dawson <mike.dawson@xxxxxxxxxxxxxxxx> wrote:
> Mandell,
>
> A few of us saw a similar failure on 0.56.1.
>
> http://tracker.ceph.com/issues/3770
>
> Sam Just patched the issue for 0.56.2. My understanding is Sam's patch
> prevents the issue in the future, but doesn't repair a previously damaged
> OSD.
>
> If you have good replication (or a good backup), I have had luck removing
> the affected OSD, formatting, and re-adding it. I believe Sam may have a
> manual process to fix it if you can't wipe this OSD.
>
> Good Luck,
> Mike
>
>
>
> On 2/13/2013 2:57 PM, Mandell Degerness wrote:
>>
>> I'm getting this error on one of my OSD's when I try to start it.
>>
>> I can gather more complete log data if no-one recognizes the error from
>> this:
>>
>> Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.612847
>> 7f4f607e7780 0 filestore(/mnt/osd96) mount found snaps <>
>> Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.615147
>> 7f4f607e7780 0 filestore(/mnt/osd96) mount: enabling WRITEAHEAD
>> journal mode: btrfs not detected
>> Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.658965
>> 7f4f607e7780 1 journal _open /mnt/osd96/journal fd 30: 8589934592
>> bytes, block size 4096 bytes, directio = 1, aio = 0
>> Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.720091
>> 7f4f607e7780 1 journal _open /mnt/osd96/journal fd 30: 8589934592
>> bytes, block size 4096 bytes, directio = 1, aio = 0
>> Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.721871
>> 7f4f607e7780 -1 osd/OSD.cc: In function 'OSDMapRef
>> OSD::get_map(epoch_t)' thread 7f4f607e7780 time 2013-02-13
>> 19:30:04.721278
>> osd/OSD.cc: 4029: FAILED assert(_get_map_bl(epoch, bl))
>>
>> ceph version 0.48.1argonaut
>> (commit:a7ad701b9bd479f20429f19e6fea7373ca6bba7c)
>> 1: (OSD::get_map(unsigned int)+0x560) [0x7f4f60a411e0]
>> 2: (OSD::init()+0x5a3) [0x7f4f60a53ce3]
>> 3: (main()+0x4462) [0x7f4f6096d182]
>> 4: (__libc_start_main()+0xfd) [0x7f4f5e64b26d]
>> 5: (()+0x16e829) [0x7f4f60968829]
>> NOTE: a copy of the executable, or `objdump -rdS <executable>` is
>> needed to interpret this.
>> Feb 13 19:30:04 node-192-168-8-14 ceph-osd: --- begin dump of recent
>> events ---
>> --
>> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
>> the body of a message to majordomo@xxxxxxxxxxxxxxx
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>
>
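
For anyone following the "remove the affected OSD, format, re-add" route Mike mentions, here is a minimal sketch of the removal half only, written as a dry run that prints commands rather than executing them. It assumes the manual OSD removal sequence from the Ceph documentation (`ceph osd out`, `ceph osd crush remove`, `ceph auth del`, `ceph osd rm`); the exact steps may differ between releases, so check the docs for your version. The function name `osd_removal_commands` is hypothetical, and the OSD id 96 is only inferred from the `/mnt/osd96` path in the log above. Formatting the disk and re-adding the OSD depend on how the cluster was deployed and are not covered here.

```python
from typing import List


def osd_removal_commands(osd_id: int) -> List[List[str]]:
    """Build (but do not run) the ceph CLI calls that remove one OSD.

    Assumption: this mirrors the manual removal procedure in the Ceph
    docs. The ceph-osd daemon must also be stopped after marking the
    OSD out; how to stop it depends on the init system, so that step
    is deliberately left out of the returned list.
    """
    if osd_id < 0:
        raise ValueError("osd_id must be non-negative")
    name = f"osd.{osd_id}"
    return [
        ["ceph", "osd", "out", str(osd_id)],       # stop placing data on it
        ["ceph", "osd", "crush", "remove", name],  # drop it from the CRUSH map
        ["ceph", "auth", "del", name],             # delete its auth key
        ["ceph", "osd", "rm", str(osd_id)],        # remove it from the OSD map
    ]


if __name__ == "__main__":
    # Dry run: print the commands for review instead of executing them.
    for cmd in osd_removal_commands(96):
        print(" ".join(cmd))
```

Only do this when replication or a backup is known to be good, as Mike notes, since the data on the wiped OSD is gone afterwards.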