Re: OSD Journal Failure Behavior

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Loss of the journal will kill any osds using that journal.  With
btrfs, we theoretically could run without a journal, but it would be
impractically slow.  In the case of parallel journaling, loss of the
journal can result in the loss of updates which have been reported to
the client as committed, but replacing the journal device will be
enough to allow recovery from the replicas to occur.  In the case of
write ahead, loss of the journal can additionally result in
inconsistent data on the osd.  Thus, with write ahead, you should
recreate the osd and allow recovery to occur.

The --mkjournal option is used to recreate the journal on a fresh device.

This information does need to find its way into our documentation,
thanks for the heads up!

-Sam

On Fri, May 11, 2012 at 11:31 AM, Calvin Morrow <calvin.morrow@xxxxxxxxx> wrote:
> The Ceph Wiki (http://ceph.com/wiki/OSD_journal) does a pretty good
> job explaining the purpose of the journal and various modes available.
>  What isn't clear is what happens during the failure of a journal.
>
> With the use of btrfs enabling parallel journaling, it sounds like
> failure of a journal device would still enable OSD writes to occur
> albeit at a slower pace.  In writeahead mode, failure of the journal
> seems like it would take all OSDs using that device for journaling
> offline.
>
> Is this a correct assessment?  If the journal is a single point of
> failure to a system with potentially multiple OSDs ... it might be
> worth mentioning in one of the "building a cluster" or hardware pages
> about journal redundancy.
>
> Also, it would be nice to know what steps are required to replace a
> journal.  (I think this is just an update of the cluster.conf with the
> new journal device and a restart of the OSD process, correct?)
>
> Best regards,
> Calvin
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux