Hi, The default cache size is 32M, the tcmalloc documentation is outdated. As Somnath mentioned, the tcmalloc fix is to make the env effective as without this fix the library does not use exported value of ' TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES'. The degenerated case is hit less frequently with higher value of the cache size but we still do encounter the issue. We are not very sure of what is leading to this, the hypothesis so far is - Change in OSDs mem allocation profile causing the tcmalloc to bring different size segments to the thread cache - Change in load on the shard threads (the distribution is uneven) one less active threads due to I/O started on a different pool, this may cause tcmalloc to move memory to these threads If you want to test with increased cache value, you can export this value in the /etc/init/ceph-osd.conf upstart script Regards, Chaitanya -----Original Message----- From: ceph-devel-owner@xxxxxxxxxxxxxxx [mailto:ceph-devel-owner@xxxxxxxxxxxxxxx] On Behalf Of Mark Nelson Sent: 28 April 2015 02:03 To: Milosz Tanski; Alexandre DERUMIER; ceph-devel; Somnath Roy Subject: Re: Hitting tcmalloc bug even with patch applied On 04/27/2015 03:24 PM, Milosz Tanski wrote: > > > On 4/27/15 8:06 AM, Alexandre DERUMIER wrote: >> Hi, >> >> I'm hitting the tcmalloc even with patch apply. >> It's mainly occur when I try to bench fio with a lot jobs (20 - 40 >> jobs) >> >> Does It need to tuned something in osd environnement variable ? >> >> >> I double check it with >> >> #g++ -o gperftest gperftest.c -ltcmalloc # export >> TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES=67108864 >> # ./gperftest >> Tcmalloc OK! Internal and Env cache size are same:67108864 >> >> >> perf top >> ------- >> 10.04% libtcmalloc.so.4.1.2 [.] tcmalloc::ThreadCache::ReleaseToCentralCache >> 8.19% libtcmalloc.so.4.1.2 [.] tcmalloc::CentralFreeList::FetchFromSpans >> 3.89% libtcmalloc.so.4.1.2 [.] tcmalloc::CentralFreeList::ReleaseToSpans >> 2.04% libtcmalloc.so.4.1.2 [.] tcmalloc::CentralFreeList::ReleaseListToSpans >> 1.79% libtcmalloc.so.4.1.2 [.] operator new >> 1.25% ceph-osd [.] ConfFile::load_from_buffer >> 1.21% libtcmalloc.so.4.1.2 [.] operator delete >> 1.14% [kernel] [k] _raw_spin_lock >> 1.08% libstdc++.so.6.0.19 [.] std::basic_string<char, std::char_traits<char>, std::allocator<char> >::basic_string >> 1.04% [kernel] [k] __schedule >> 1.00% libpthread-2.17.so [.] pthread_mutex_trylock >> 0.90% [kernel] [k] native_write_msr_safe >> 0.89% [kernel] [k] __switch_to >> 0.79% [kernel] [k] _raw_spin_lock_irqsave >> 0.73% [kernel] [k] copy_user_enhanced_fast_string >> > > This is obviously going to be more painful but .... can you perform a capture for one OSD process using, pref record -p $OSD_PID. Ideally one with a callgraph and one without. > > That can be helpful to investigate further. Can see which parts of those tcmalloc functions are the biggest offer in terms of time. We can also see if there's a new/delete pastern in OSD code that is somehow trigger this degenerate case. If on a newish (3.11+) kernel that has libunwind compiled into perf, I've found that dwarf callgraphs are much more detailed. The frequency may need to be lowered to make it work well. -F 100 or something perhaps. > >> >> >> Regards, >> >> Alexandre >> -- >> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in >> the body of a message to majordomo@xxxxxxxxxxxxxxx >> More majordomo info at http://vger.kernel.org/majordomo-info.html >> > -- > To unsubscribe from this list: send the line "unsubscribe ceph-devel" in > the body of a message to majordomo@xxxxxxxxxxxxxxx > More majordomo info at http://vger.kernel.org/majordomo-info.html > -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html ________________________________ PLEASE NOTE: The information contained in this electronic mail message is intended only for the use of the designated recipient(s) named above. If the reader of this message is not the intended recipient, you are hereby notified that you have received this message in error and that any review, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this communication in error, please notify the sender by telephone or e-mail (as shown above) immediately and destroy any and all copies of this message in your possession (whether hard copies or electronically stored copies). ��.n��������+%������w��{.n����z��u���ܨ}���Ơz�j:+v�����w����ޙ��&�)ߡ�a����z�ޗ���ݢj��w�f