Hi Martin- On Sat, 14 Jan 2012, Sage Weil wrote: > Hi Martin- > > On Sat, 14 Jan 2012, Martin Mailand wrote: > > > Hi > > one of four OSD died during the update to v0.40 with an Assertion > > os/FileStore.cc: 2438: FAILED assert(0 == "unexpected error") > > Even after a complete shutdown of the cluster an a new start with all OSD at > > the same version, this osd did not start. > > > > The OSD Log it attached. > > It's trying to replay a transaction that appears to be invalid because the > .2 clone is smaller than it thinks. Is this the first time the OSD > crashed, or did it crash once, and you cranked up logs and generated > this one? If you have the previous log, that would be helpful... it > should have a similar tranasction dump but a different stack trace. I pushed a wip-osd-dump-journal branch to git that will make ceph-osd -i <whatever> --dump-journal > /tmp/foo.txt dump the contents of your entire osd journal (sans data) to a text file. Do you mind sending that along as well? I'd like to see what is in the journal _after_ the event that is failing (if anything). Thanks! sage > > Also, are any of the 6 patches on top of 0.40 related to the filestore or > osd? > > Thanks! > sage > > -- > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > the body of a message to majordomo@xxxxxxxxxxxxxxx > More majordomo info at http://vger.kernel.org/majordomo-info.html > > -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html