Re: many report failed after mon election

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, 12 Sep 2013, Dominik Mostowiec wrote:
> Hi,
> Today i have some issues with ceph cluster.
> After new mon election many osd has been marked failed.
> Some time later osd boot and i think recover because meny slow request appear.
> Cluster come back after about 20minutes.

Was there some other event that triggered the mon election?  There's not 
much here to go on except that several elections were called and by 
different monitors, which suggests something was not quite right.

sage


> 
> cluster:
> ceph version 0.56.6
> 6 servers x 26 osd
> 
> 2013-09-12 07:11:40.920384 mon.1 10.177.64.5:6789/0 353 : [INF] mon.3
> calling new monitor election
> 2013-09-12 07:12:40.992532 mon.3 10.177.64.7:6789/0 364 : [INF] mon.4
> calling new monitor election
> 2013-09-12 07:12:41.024954 mon.4 10.177.64.8:6789/0 360 : [INF] mon.2
> calling new monitor election
> 2013-09-12 07:13:02.782203 mon.2 10.177.64.6:6789/0 336 : [INF] mon.1
> calling new monitor election
> 2013-09-12 07:13:02.783778 mon.3 10.177.64.7:6789/0 366 : [INF] mon.4
> calling new monitor election
> 2013-09-12 07:13:10.852842 mon.3 10.177.64.7:6789/0 367 : [INF] mon.4
> calling new monitor election
> 2013-09-12 16:17:09.484277 mon.4 10.177.64.8:6789/0 363 : [INF] mon.2
> calling new monitor election
> 2013-09-12 16:17:09.497337 mon.3 10.177.64.7:6789/0 368 : [INF] mon.4
> calling new monitor election
> 2013-09-12 16:17:09.523787 mon.0 10.177.64.4:6789/0 4369021 : [INF]
> mon.0 calling new monitor election
> 2013-09-12 16:17:14.525282 mon.0 10.177.64.4:6789/0 4369022 : [INF]
> mon.0@0 won leader election with quorum 0,1,2,3,4
> ...
> 2013-09-12 16:17:14.689555 mon.0 10.177.64.4:6789/0 4369027 : [DBG]
> osd.130 10.177.64.9:6801/1401 reported failed by osd.121
> 10.177.64.7:6909/29496
> 2013-09-12 16:17:14.689584 mon.0 10.177.64.4:6789/0 4369028 : [DBG]
> osd.131 10.177.64.9:6810/2435 reported failed by osd.121
> 10.177.64.7:6909/29496
> 2013-09-12 16:17:14.689600 mon.0 10.177.64.4:6789/0 4369029 : [DBG]
> osd.132 10.177.64.9:6846/2885 reported failed by osd.121
> 10.177.64.7:6909/29496
> 2013-09-12 16:17:14.689615 mon.0 10.177.64.4:6789/0 4369030 : [DBG]
> osd.134 10.177.64.9:6855/3223 reported failed by osd.121
> 10.177.64.7:6909/29496
> 2013-09-12 16:17:14.689630 mon.0 10.177.64.4:6789/0 4369031 : [DBG]
> osd.136 10.177.64.9:6865/3559 reported failed by osd.121
> 10.177.64.7:6909/29496
> 2013-09-12 16:17:14.689645 mon.0 10.177.64.4:6789/0 4369032 : [DBG]
> osd.141 10.177.64.9:6904/4259 reported failed by osd.121
> 10.177.64.7:6909/29496
> 
> -- 
> Pozdrawiam
> Dominik
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> 
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com




[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux