Re: async shutdown

On Tue, 27 Dec 2016, Sage Weil wrote:
> I'm chasing a pretty reproducible crash of the rocksdb librados 
> env backend where it asserts on crypto shutdown with this PR applied:
> 
> 	https://github.com/ceph/ceph/pull/12624
> 
> This is generally caused by a memory leak of some crypto object (e.g., 
> CryptoKey).  If I run the test in a loop with ms type = simple it's fine, 
> but with async it crashes after about 10 minutes.  On one run valgrind 
> reported a leak from AsyncConnection, although it didn't make any 
> sense (recv_buf?).  But my suspicion is that the 
> AsyncMessenger::shutdown() call to stack->drain() isn't sufficient to make 
> sure all of the connections have actually been destroyed, which means the 
> cct and crypto shutdown sometimes assert.
> 
> Does that theory make sense?  Any suggestions how to make sure that 
> the AsyncConnections are reliably reaped during librados shutdown?

...and as soon as I hit send my msg/simple run in a loop also 
crashed.  Never mind!
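
For reference, the invariant behind the theory quoted above is that no connection may still be alive when shutdown starts tearing down the cct and the crypto state. Below is a minimal C++ sketch of checking that invariant. It is a toy model only: Connection, Messenger, live_keys and live_connections() are hypothetical names made up for illustration, not the real AsyncMessenger or AsyncConnection interfaces, and the key counter merely stands in for leaked crypto objects such as CryptoKey.

```cpp
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical stand-in for a connection that owns one crypto object.
// The counter it points at models "crypto objects still allocated".
struct Connection {
    explicit Connection(int* live) : live_keys(live) { ++*live_keys; }
    ~Connection() { --*live_keys; }
    Connection(const Connection&) = delete;
    Connection& operator=(const Connection&) = delete;

    int* live_keys;
};

// Hypothetical messenger that tracks its connections weakly, so that a
// shutdown path can ask whether every connection was really destroyed
// rather than assuming that a drain call took care of it.
struct Messenger {
    std::vector<std::weak_ptr<Connection>> tracked;

    std::shared_ptr<Connection> connect(int* live_keys) {
        auto c = std::make_shared<Connection>(live_keys);
        tracked.push_back(c);  // weak reference: does not keep c alive
        return c;
    }

    // Number of tracked connections that still have an owner somewhere.
    std::size_t live_connections() const {
        std::size_t n = 0;
        for (const auto& w : tracked) {
            if (!w.expired()) {
                ++n;
            }
        }
        return n;
    }
};
```

In this model a shutdown routine would refuse to proceed (or would log the offenders) while live_connections() is non-zero, which turns a late crypto-shutdown assert into an earlier and more specific failure.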

sage


> 
> Thanks!
> sage
> 
> 
> BTW here's an example of the crash, although it's not particularly 
> informative in and of itself:
> 
> http://pulpito.ceph.com/sage-2016-12-27_18:31:32-rados-wip-sage-testing---basic-smithi/670919
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> 