Re: OSD failure on start

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Mandell,

A few of us saw a similar failure on 0.56.1.

http://tracker.ceph.com/issues/3770

Sam Just patched the issue for 0.56.2. My understanding is Sam's patch prevents the issue in the future, but doesn't repair a previously damaged OSD.

If you have good replication (or a good backup), I have had luck removing the affected OSD, formatting, and re-adding it. I believe Sam may have a manual process to fix it if you can't wipe this OSD.

Good Luck,
Mike


On 2/13/2013 2:57 PM, Mandell Degerness wrote:
I'm getting this error on one of my OSD's when I try to start it.

I can gather more complete log data if no-one recognizes the error from this:

Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.612847
7f4f607e7780  0 filestore(/mnt/osd96) mount found snaps <>
Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.615147
7f4f607e7780  0 filestore(/mnt/osd96) mount: enabling WRITEAHEAD
journal mode: btrfs not detected
Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.658965
7f4f607e7780  1 journal _open /mnt/osd96/journal fd 30: 8589934592
bytes, block size 4096 bytes, directio = 1, aio = 0
Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.720091
7f4f607e7780  1 journal _open /mnt/osd96/journal fd 30: 8589934592
bytes, block size 4096 bytes, directio = 1, aio = 0
Feb 13 19:30:04 node-192-168-8-14 ceph-osd: 2013-02-13 19:30:04.721871
7f4f607e7780 -1 osd/OSD.cc: In function 'OSDMapRef
OSD::get_map(epoch_t)' thread 7f4f607e7780 time 2013-02-13
19:30:04.721278
osd/OSD.cc: 4029: FAILED assert(_get_map_bl(epoch, bl))

  ceph version 0.48.1argonaut (commit:a7ad701b9bd479f20429f19e6fea7373ca6bba7c)
  1: (OSD::get_map(unsigned int)+0x560) [0x7f4f60a411e0]
  2: (OSD::init()+0x5a3) [0x7f4f60a53ce3]
  3: (main()+0x4462) [0x7f4f6096d182]
  4: (__libc_start_main()+0xfd) [0x7f4f5e64b26d]
  5: (()+0x16e829) [0x7f4f60968829]
  NOTE: a copy of the executable, or `objdump -rdS <executable>` is
needed to interpret this.
Feb 13 19:30:04 node-192-168-8-14 ceph-osd: --- begin dump of recent events ---
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux