Re: OSD trashed by simple reboot (Debian Jessie, systemd?)

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Righty - I'll see if I can replicate what you see if I setup an 0.80.9 cluster using the same workstation hardware (WD Raptors and Intel 520s) that showed up the issue previously at 0.83 (I wonder if I never tried a fresh install using the 0.80.* tree)...

May be a few days...

On 05/06/15 16:49, Christian Balzer wrote:

Hello,

On Fri, 05 Jun 2015 16:33:46 +1200 Mark Kirkwood wrote:

Well, whatever it is, I appear to not be the only one after all:
https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=773361

Looking quickly at the relevant code:

FileJournal::stop_writer() in src/os/FileJpurnal.cc

I see that we didn't start seeing the (original) issue until changes in
0.83, which suggests that 0.80 tree might not be doing the same thing.
*However* I note that I'm not happy with the placement of the two thread
join operations in there - it *looks* to me like 0.80 could in fact be
vulnerable to the same journal corrupting problem, so if it occurs again
might be interesting to apply the gist of
https://github.com/ceph/ceph/pull/2764 and see if it helps (ahem - of
course would be best if this was on a test system)!

Alas this is neither a test cluster, nor do I have things set up to compile
from source here ATM.

Christian


_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com




[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux