Re: 2 replications,flapping can not stop for a very long time

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



hi, do you set both public_network and cluster_network, but just cut
off the cluster_network?
And do you have not only one osd on the same host?
If so, maybe you can not get stable, now the osd have peers in the
prev and next osd id,
they can exchange ping message.
you cut off the cluster_network, the outbox peer osds can not detect
the ping, they
reports the osd failure to MON, and MON gather enough reporters and
reports, then the osd will
be marked down.
But the osd can reports to MON bc the public_network is ok,  MON
thinks the osd wronly marked down, mark it to UP.
So flapping happens again and again.

2015-09-12 20:26 GMT+08:00 zhao.mingyue@xxxxxxx <zhao.mingyue@xxxxxxx>:
>
> Hi,
> I'm testing reliability of ceph recently, and I have met the flapping problem.
> I have 2 replications, and cut off the cluster network ,now  flapping can not stop,I have wait more than 30min, but status of osds are still not stable;
>     I want to know about  when monitor recv reports from osds ,how it can mark one osd down?
>     (reports && reporter && grace) need to satisfied some conditions, how to calculate the grace?
> and how long will the flapping  stop?Does the flapping must be stopped by configure,such as configure an osd lost?
> Can someone help me ?
> Thanks~
> -------------------------------------------------------------------------------------------------------------------------------------
> 本邮件及其附件含有杭州华三通信技术有限公司的保密信息,仅限于发送给上面地址中列出
> 的个人或群组。禁止任何其他人以任何形式使用(包括但不限于全部或部分地泄露、复制、
> 或散发)本邮件中的信息。如果您错收了本邮件,请您立即电话或邮件通知发件人并删除本
> 邮件!
> This e-mail and its attachments contain confidential information from H3C, which is
> intended only for the person or entity whose address is listed above. Any use of the
> information contained herein in any way (including, but not limited to, total or partial
> disclosure, reproduction, or dissemination) by persons other than the intended
> recipient(s) is prohibited. If you receive this e-mail in error, please notify the sender
> by phone or email immediately and delete it!



-- 
thanks
huangjun
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux