Re: Giant + nfs over cephfs hang tasks

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Ilya, here is what I got shortly after starting the dd test:



[  288.307993]
[  288.308004] =========================================================
[  288.308008] [ INFO: possible irq lock inversion dependency detected ]
[  288.308014] 3.18.0-rc6-ceph-00024-g72ca172 #1 Tainted: G            E
[  288.308019] ---------------------------------------------------------
[  288.308023] kswapd1/87 just changed the state of lock:
[  288.308027]  (&xfs_dir_ilock_class){++++-+}, at: [<ffffffffa0682d44>] xfs_ilock+0x134/0x160 [xfs]
[  288.308072] but this lock took another, RECLAIM_FS-unsafe lock in the past:
[  288.308076]  (&mm->mmap_sem){++++++}
[  288.308076]
[  288.308076] and interrupts could create inverse lock ordering between them.
[  288.308076]
[  288.308084]
[  288.308084] other info that might help us debug this:
[  288.308089]  Possible interrupt unsafe locking scenario:
[  288.308089]
[  288.308094]        CPU0                    CPU1
[  288.308097]        ----                    ----
[  288.308100]   lock(&mm->mmap_sem);
[  288.308104]                                local_irq_disable();
[  288.308109]                                lock(&xfs_dir_ilock_class);
[  288.308114]                                lock(&mm->mmap_sem);
[  288.308120]   <Interrupt>
[  288.308122]     lock(&xfs_dir_ilock_class);
[  288.308127]
[  288.308127]  *** DEADLOCK ***
[  288.308127]
[  288.308133] 3 locks held by kswapd1/87:
[  288.308136]  #0:  (shrinker_rwsem){++++..}, at: [<ffffffff8117551f>] shrink_slab+0x3f/0x140
[  288.308151]  #1:  (&type->s_umount_key#27){++++.+}, at: [<ffffffff811d8c14>] grab_super_passive+0x44/0x90
[  288.308165]  #2:  (&pag->pag_ici_reclaim_lock){+.+...}, at: [<ffffffffa067acd4>] xfs_reclaim_inodes_ag+0xb4/0x400 [xfs]
[  288.308192]
[  288.308192] the shortest dependencies between 2nd lock and 1st lock:
[  288.308206]  -> (&mm->mmap_sem){++++++} ops: 27039227 {
[  288.308214]     HARDIRQ-ON-W at:
[  288.308218]                       [<ffffffff810a7209>] __lock_acquire+0x629/0x1c90
[  288.308229]                       [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.308236]                       [<ffffffff8173ae99>] down_write+0x49/0x80
[  288.308244]                       [<ffffffff811dcd03>] do_execve_common.isra.25+0x283/0x6e0
[  288.308253]                       [<ffffffff811dd178>] do_execve+0x18/0x20
[  288.308259]                       [<ffffffff8106ff4e>] ____call_usermodehelper+0x11e/0x170
[  288.308269]                       [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.308276]     HARDIRQ-ON-R at:
[  288.308280]                       [<ffffffff810a6f23>] __lock_acquire+0x343/0x1c90
[  288.308287]                       [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.308294]                       [<ffffffff8118d833>] might_fault+0x93/0xc0
[  288.308304]                       [<ffffffff813b7a80>] __clear_user+0x20/0x70
[  288.308314]                       [<ffffffff813b7afe>] clear_user+0x2e/0x40
[  288.308320]                       [<ffffffff8122a4cd>] padzero+0x2d/0x40
[  288.308329]                       [<ffffffff8122b0bf>] load_elf_binary+0x9cf/0x1880
[  288.308336]                       [<ffffffff811db9f0>] search_binary_handler+0xa0/0x1e0
[  288.308343]                       [<ffffffff811dcfa2>] do_execve_common.isra.25+0x522/0x6e0
[  288.308351]                       [<ffffffff811dd178>] do_execve+0x18/0x20
[  288.308358]                       [<ffffffff8106ff4e>] ____call_usermodehelper+0x11e/0x170
[  288.308366]                       [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.308373]     SOFTIRQ-ON-W at:
[  288.308376]                       [<ffffffff810a6f54>] __lock_acquire+0x374/0x1c90
[  288.308384]                       [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.308391]                       [<ffffffff8173ae99>] down_write+0x49/0x80
[  288.308398]                       [<ffffffff811dcd03>] do_execve_common.isra.25+0x283/0x6e0
[  288.308406]                       [<ffffffff811dd178>] do_execve+0x18/0x20
[  288.308412]                       [<ffffffff8106ff4e>] ____call_usermodehelper+0x11e/0x170
[  288.308420]                       [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.308427]     SOFTIRQ-ON-R at:
[  288.308431]                       [<ffffffff810a6f54>] __lock_acquire+0x374/0x1c90
[  288.308438]                       [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.308445]                       [<ffffffff8118d833>] might_fault+0x93/0xc0
[  288.308452]                       [<ffffffff813b7a80>] __clear_user+0x20/0x70
[  288.308458]                       [<ffffffff813b7afe>] clear_user+0x2e/0x40
[  288.308464]                       [<ffffffff8122a4cd>] padzero+0x2d/0x40
[  288.308470]                       [<ffffffff8122b0bf>] load_elf_binary+0x9cf/0x1880
[  288.308477]                       [<ffffffff811db9f0>] search_binary_handler+0xa0/0x1e0
[  288.308485]                       [<ffffffff811dcfa2>] do_execve_common.isra.25+0x522/0x6e0
[  288.308493]                       [<ffffffff811dd178>] do_execve+0x18/0x20
[  288.308499]                       [<ffffffff8106ff4e>] ____call_usermodehelper+0x11e/0x170
[  288.308507]                       [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.308514]     RECLAIM_FS-ON-W at:
[  288.308518]                          [<ffffffff810a96fa>] mark_held_locks+0x6a/0x90
[  288.308526]                          [<ffffffff810a9e17>] lockdep_trace_alloc+0x67/0xd0
[  288.308534]                          [<ffffffff8116b394>] __alloc_pages_nodemask+0x84/0xba0
[  288.308544]                          [<ffffffff811af917>] alloc_pages_current+0xf7/0x170
[  288.308553]                          [<ffffffff81190729>] __pmd_alloc+0x39/0x160
[  288.308559]                          [<ffffffff8119a69e>] move_page_tables+0x62e/0x6b0
[  288.308566]                          [<ffffffff811db1cb>] shift_arg_pages+0xcb/0x1a0
[  288.308574]                          [<ffffffff811db47b>] setup_arg_pages+0x1db/0x200
[  288.308581]                          [<ffffffff8122ab45>] load_elf_binary+0x455/0x1880
[  288.308588]                          [<ffffffff811db9f0>] search_binary_handler+0xa0/0x1e0
[  288.308596]                          [<ffffffff811dcfa2>] do_execve_common.isra.25+0x522/0x6e0
[  288.308603]                          [<ffffffff811dd178>] do_execve+0x18/0x20
[  288.308610]                          [<ffffffff8106ff4e>] ____call_usermodehelper+0x11e/0x170
[  288.308617]                          [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.308624]     RECLAIM_FS-ON-R at:
[  288.308628]                          [<ffffffff810a96fa>] mark_held_locks+0x6a/0x90
[  288.308635]                          [<ffffffff810a9e17>] lockdep_trace_alloc+0x67/0xd0
[  288.308643]                          [<ffffffff8116b394>] __alloc_pages_nodemask+0x84/0xba0
[  288.308651]                          [<ffffffff811af917>] alloc_pages_current+0xf7/0x170
[  288.308658]                          [<ffffffff81166bee>] __get_free_pages+0xe/0x40
[  288.308666]                          [<ffffffff81166c36>] get_zeroed_page+0x16/0x20
[  288.308673]                          [<ffffffff8118ff89>] __pud_alloc+0x29/0xb0
[  288.308679]                          [<ffffffff81191873>] handle_mm_fault+0xec3/0x10b0
[  288.308686]                          [<ffffffff81047f32>] __do_page_fault+0x1c2/0x580
[  288.308694]                          [<ffffffff8104844c>] do_page_fault+0xc/0x10
[  288.308700]                          [<ffffffff8173f4b2>] page_fault+0x22/0x30
[  288.308707]                          [<ffffffff813b7afe>] clear_user+0x2e/0x40
[  288.308713]                          [<ffffffff8122a4cd>] padzero+0x2d/0x40
[  288.308719]                          [<ffffffff8122b0bf>] load_elf_binary+0x9cf/0x1880
[  288.308726]                          [<ffffffff811db9f0>] search_binary_handler+0xa0/0x1e0
[  288.308734]                          [<ffffffff811dcfa2>] do_execve_common.isra.25+0x522/0x6e0
[  288.308741]                          [<ffffffff811dd178>] do_execve+0x18/0x20
[  288.308748]                          [<ffffffff8106ff4e>] ____call_usermodehelper+0x11e/0x170
[  288.308755]                          [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.308762]     INITIAL USE at:
[  288.308766]                      [<ffffffff810a6ff1>] __lock_acquire+0x411/0x1c90
[  288.308773]                      [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.308779]                      [<ffffffff8173ae99>] down_write+0x49/0x80
[  288.308785]                      [<ffffffff811dcd03>] do_execve_common.isra.25+0x283/0x6e0
[  288.308793]                      [<ffffffff811dd178>] do_execve+0x18/0x20
[  288.308800]                      [<ffffffff8106ff4e>] ____call_usermodehelper+0x11e/0x170
[  288.308807]                      [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.308814]   }
[  288.308816]   ... key      at: [<ffffffff81eaa388>] __key.49413+0x0/0x8
[  288.308824]   ... acquired at:
[  288.308827]    [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.308833]    [<ffffffff8118d833>] might_fault+0x93/0xc0
[  288.308839]    [<ffffffff811e92b4>] filldir+0xa4/0x120
[  288.308845]    [<ffffffffa0674308>] xfs_dir2_leaf_getdents+0x2d8/0x370 [xfs]
[  288.308866]    [<ffffffffa0674556>] xfs_readdir+0x1b6/0x1c0 [xfs]
[  288.308884]    [<ffffffffa067606b>] xfs_file_readdir+0x2b/0x30 [xfs]
[  288.308902]    [<ffffffff811e93ee>] iterate_dir+0xbe/0x150
[  288.308908]    [<ffffffff811e959c>] SyS_getdents+0x8c/0x110
[  288.308914]    [<ffffffff8173d716>] system_call_fastpath+0x16/0x1b
[  288.308920]
[  288.308922] -> (&xfs_dir_ilock_class){++++-+} ops: 233164 {
[  288.308930]    HARDIRQ-ON-W at:
[  288.308933]                     [<ffffffff810a7209>] __lock_acquire+0x629/0x1c90
[  288.308941]                     [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.308947]                     [<ffffffff810a33ef>] down_write_nested+0x4f/0x80
[  288.308955]                     [<ffffffffa0682d44>] xfs_ilock+0x134/0x160 [xfs]
[  288.308976]                     [<ffffffffa0685fcd>] xfs_inactive_ifree+0x8d/0x230 [xfs]
[  288.308997]                     [<ffffffffa0686229>] xfs_inactive+0xb9/0x170 [xfs]
[  288.309017]                     [<ffffffffa068c0d8>] xfs_fs_evict_inode+0xa8/0x110 [xfs]
[  288.309038]                     [<ffffffff811f1ed8>] evict+0xb8/0x190
[  288.309045]                     [<ffffffff811f2815>] iput+0x105/0x190
[  288.309050]                     [<ffffffff811ecd58>] d_delete+0x168/0x1a0
[  288.309057]                     [<ffffffff811e0ec5>] vfs_rmdir+0xf5/0x170
[  288.309063]                     [<ffffffff811e66eb>] do_rmdir+0x1cb/0x1e0
[  288.309069]                     [<ffffffff811e6ee6>] SyS_rmdir+0x16/0x20
[  288.309075]                     [<ffffffff8173d716>] system_call_fastpath+0x16/0x1b
[  288.309083]    HARDIRQ-ON-R at:
[  288.309086]                     [<ffffffff810a6f23>] __lock_acquire+0x343/0x1c90
[  288.309093]                     [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.309100]                     [<ffffffff810a3387>] down_read_nested+0x57/0x70
[  288.309107]                     [<ffffffffa0682d0a>] xfs_ilock+0xfa/0x160 [xfs]
[  288.309126]                     [<ffffffffa0682df5>] xfs_ilock_data_map_shared+0x35/0x40 [xfs]
[  288.309145]                     [<ffffffffa0683fe2>] xfs_lookup+0x62/0x160 [xfs]
[  288.309164]                     [<ffffffffa068095b>] xfs_vn_lookup+0x5b/0xa0 [xfs]
[  288.309183]                     [<ffffffff811df12d>] lookup_real+0x1d/0x60
[  288.309189]                     [<ffffffff811e2d38>] do_last+0x7a8/0xf80
[  288.309195]                     [<ffffffff811e4c44>] path_openat+0xc4/0x660
[  288.309201]                     [<ffffffff811e6a33>] do_filp_open+0x43/0xa0
[  288.309207]                     [<ffffffff811d4462>] do_sys_open+0x132/0x220
[  288.309213]                     [<ffffffff811d456e>] SyS_open+0x1e/0x20
[  288.309219]                     [<ffffffff8173d716>] system_call_fastpath+0x16/0x1b
[  288.309226]    SOFTIRQ-ON-W at:
[  288.309230]                     [<ffffffff810a6f54>] __lock_acquire+0x374/0x1c90
[  288.309237]                     [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.309243]                     [<ffffffff810a33ef>] down_write_nested+0x4f/0x80
[  288.309250]                     [<ffffffffa0682d44>] xfs_ilock+0x134/0x160 [xfs]
[  288.309268]                     [<ffffffffa0685fcd>] xfs_inactive_ifree+0x8d/0x230 [xfs]
[  288.309287]                     [<ffffffffa0686229>] xfs_inactive+0xb9/0x170 [xfs]
[  288.309305]                     [<ffffffffa068c0d8>] xfs_fs_evict_inode+0xa8/0x110 [xfs]
[  288.309326]                     [<ffffffff811f1ed8>] evict+0xb8/0x190
[  288.309332]                     [<ffffffff811f2815>] iput+0x105/0x190
[  288.309338]                     [<ffffffff811ecd58>] d_delete+0x168/0x1a0
[  288.309344]                     [<ffffffff811e0ec5>] vfs_rmdir+0xf5/0x170
[  288.309350]                     [<ffffffff811e66eb>] do_rmdir+0x1cb/0x1e0
[  288.309356]                     [<ffffffff811e6ee6>] SyS_rmdir+0x16/0x20
[  288.309362]                     [<ffffffff8173d716>] system_call_fastpath+0x16/0x1b
[  288.309369]    SOFTIRQ-ON-R at:
[  288.309373]                     [<ffffffff810a6f54>] __lock_acquire+0x374/0x1c90
[  288.309380]                     [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.309387]                     [<ffffffff810a3387>] down_read_nested+0x57/0x70
[  288.309393]                     [<ffffffffa0682d0a>] xfs_ilock+0xfa/0x160 [xfs]
[  288.309412]                     [<ffffffffa0682df5>] xfs_ilock_data_map_shared+0x35/0x40 [xfs]
[  288.309431]                     [<ffffffffa0683fe2>] xfs_lookup+0x62/0x160 [xfs]
[  288.309449]                     [<ffffffffa068095b>] xfs_vn_lookup+0x5b/0xa0 [xfs]
[  288.309467]                     [<ffffffff811df12d>] lookup_real+0x1d/0x60
[  288.309473]                     [<ffffffff811e2d38>] do_last+0x7a8/0xf80
[  288.309479]                     [<ffffffff811e4c44>] path_openat+0xc4/0x660
[  288.309485]                     [<ffffffff811e6a33>] do_filp_open+0x43/0xa0
[  288.309491]                     [<ffffffff811d4462>] do_sys_open+0x132/0x220
[  288.309497]                     [<ffffffff811d456e>] SyS_open+0x1e/0x20
[  288.309502]                     [<ffffffff8173d716>] system_call_fastpath+0x16/0x1b
[  288.309510]    IN-RECLAIM_FS-W at:
[  288.309513]                        [<ffffffff810a6fb0>] __lock_acquire+0x3d0/0x1c90
[  288.309521]                        [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.309528]                        [<ffffffff810a33ef>] down_write_nested+0x4f/0x80
[  288.309535]                        [<ffffffffa0682d44>] xfs_ilock+0x134/0x160 [xfs]
[  288.309552]                        [<ffffffffa067aa0c>] xfs_reclaim_inode+0x12c/0x340 [xfs]
[  288.309572]                        [<ffffffffa067aea7>] xfs_reclaim_inodes_ag+0x287/0x400 [xfs]
[  288.309592]                        [<ffffffffa067bda3>] xfs_reclaim_inodes_nr+0x33/0x40 [xfs]
[  288.309611]                        [<ffffffffa068b855>] xfs_fs_free_cached_objects+0x15/0x20 [xfs]
[  288.309632]                        [<ffffffff811d8dd8>] super_cache_scan+0x178/0x180
[  288.309640]                        [<ffffffff8117393e>] shrink_slab_node+0x15e/0x310
[  288.309647]                        [<ffffffff811755e0>] shrink_slab+0x100/0x140
[  288.309654]                        [<ffffffff81178306>] kswapd_shrink_zone+0x116/0x1a0
[  288.309661]                        [<ffffffff8117925b>] kswapd+0x4bb/0x9a0
[  288.309667]                        [<ffffffff8107a664>] kthread+0xe4/0x100
[  288.309674]                        [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.309682]    RECLAIM_FS-ON-R at:
[  288.309685]                        [<ffffffff810a96fa>] mark_held_locks+0x6a/0x90
[  288.309693]                        [<ffffffff810a9e17>] lockdep_trace_alloc+0x67/0xd0
[  288.309700]                        [<ffffffff8116b394>] __alloc_pages_nodemask+0x84/0xba0
[  288.309708]                        [<ffffffff811b1503>] alloc_pages_vma+0xf3/0x1c0
[  288.309715]                        [<ffffffff811916ce>] handle_mm_fault+0xd1e/0x10b0
[  288.309722]                        [<ffffffff81047f32>] __do_page_fault+0x1c2/0x580
[  288.309729]                        [<ffffffff8104844c>] do_page_fault+0xc/0x10
[  288.309735]                        [<ffffffff8173f4b2>] page_fault+0x22/0x30
[  288.309741]                        [<ffffffffa0674308>] xfs_dir2_leaf_getdents+0x2d8/0x370 [xfs]
[  288.309761]                        [<ffffffffa0674556>] xfs_readdir+0x1b6/0x1c0 [xfs]
[  288.309780]                        [<ffffffffa067606b>] xfs_file_readdir+0x2b/0x30 [xfs]
[  288.309799]                        [<ffffffff811e93ee>] iterate_dir+0xbe/0x150
[  288.309806]                        [<ffffffff811e959c>] SyS_getdents+0x8c/0x110
[  288.309813]                        [<ffffffff8173d716>] system_call_fastpath+0x16/0x1b
[  288.309821]    INITIAL USE at:
[  288.309824]                    [<ffffffff810a6ff1>] __lock_acquire+0x411/0x1c90
[  288.309831]                    [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.309837]                    [<ffffffff810a3387>] down_read_nested+0x57/0x70
[  288.309844]                    [<ffffffffa0682d0a>] xfs_ilock+0xfa/0x160 [xfs]
[  288.309863]                    [<ffffffffa0682df5>] xfs_ilock_data_map_shared+0x35/0x40 [xfs]
[  288.309882]                    [<ffffffffa0683fe2>] xfs_lookup+0x62/0x160 [xfs]
[  288.309900]                    [<ffffffffa068095b>] xfs_vn_lookup+0x5b/0xa0 [xfs]
[  288.309919]                    [<ffffffff811df12d>] lookup_real+0x1d/0x60
[  288.309924]                    [<ffffffff811e2d38>] do_last+0x7a8/0xf80
[  288.309930]                    [<ffffffff811e4c44>] path_openat+0xc4/0x660
[  288.309936]                    [<ffffffff811e6a33>] do_filp_open+0x43/0xa0
[  288.309941]                    [<ffffffff811d4462>] do_sys_open+0x132/0x220
[  288.309947]                    [<ffffffff811d456e>] SyS_open+0x1e/0x20
[  288.309953]                    [<ffffffff8173d716>] system_call_fastpath+0x16/0x1b
[  288.309960]  }
[  288.309962]  ... key      at: [<ffffffffa06decf4>] xfs_dir_ilock_class+0x0/0xfffffffffffd130c [xfs]
[  288.309983]  ... acquired at:
[  288.309985]    [<ffffffff810a607b>] check_usage_forwards+0x10b/0x150
[  288.309992]    [<ffffffff810a6a8b>] mark_lock+0x18b/0x2e0
[  288.309997]    [<ffffffff810a6fb0>] __lock_acquire+0x3d0/0x1c90
[  288.310003]    [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.310009]    [<ffffffff810a33ef>] down_write_nested+0x4f/0x80
[  288.310015]    [<ffffffffa0682d44>] xfs_ilock+0x134/0x160 [xfs]
[  288.310032]    [<ffffffffa067aa0c>] xfs_reclaim_inode+0x12c/0x340 [xfs]
[  288.310050]    [<ffffffffa067aea7>] xfs_reclaim_inodes_ag+0x287/0x400 [xfs]
[  288.310068]    [<ffffffffa067bda3>] xfs_reclaim_inodes_nr+0x33/0x40 [xfs]
[  288.310086]    [<ffffffffa068b855>] xfs_fs_free_cached_objects+0x15/0x20 [xfs]
[  288.310107]    [<ffffffff811d8dd8>] super_cache_scan+0x178/0x180
[  288.310113]    [<ffffffff8117393e>] shrink_slab_node+0x15e/0x310
[  288.310118]    [<ffffffff811755e0>] shrink_slab+0x100/0x140
[  288.310124]    [<ffffffff81178306>] kswapd_shrink_zone+0x116/0x1a0
[  288.310130]    [<ffffffff8117925b>] kswapd+0x4bb/0x9a0
[  288.310135]    [<ffffffff8107a664>] kthread+0xe4/0x100
[  288.310141]    [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.310147]
[  288.310149]
[  288.310149] stack backtrace:
[  288.310156] CPU: 8 PID: 87 Comm: kswapd1 Tainted: G            E  3.18.0-rc6-ceph-00024-g72ca172 #1
[  288.310162] Hardware name: Supermicro X9DRH-7TF/7F/iTF/iF/X9DRH-7TF/7F/iTF/iF, BIOS 3.0b 04/28/2014
[  288.310169]  ffffffff821208e0 ffff8804676ab608 ffffffff81733b38 0000000000000007
[  288.310177]  ffff8804676ab670 ffff8804676ab658 ffffffff810a5f68 ffffffff821208e0
[  288.310184]  ffffffff81a7cbe0 ffff8804676ab674 0000000000000000 ffff88046763cc50
[  288.310192] Call Trace:
[  288.310200]  [<ffffffff81733b38>] dump_stack+0x4e/0x68
[  288.310206]  [<ffffffff810a5f68>] print_irq_inversion_bug.part.41+0x1e8/0x1f0
[  288.310213]  [<ffffffff810a607b>] check_usage_forwards+0x10b/0x150
[  288.310220]  [<ffffffff810a6a8b>] mark_lock+0x18b/0x2e0
[  288.310226]  [<ffffffff810a5f70>] ? print_irq_inversion_bug.part.41+0x1f0/0x1f0
[  288.310234]  [<ffffffff811c9185>] ? __mem_cgroup_threshold+0x5/0x1d0
[  288.310241]  [<ffffffff810a6fb0>] __lock_acquire+0x3d0/0x1c90
[  288.310247]  [<ffffffff810a6ff1>] ? __lock_acquire+0x411/0x1c90
[  288.310266]  [<ffffffffa0682d44>] ? xfs_ilock+0x134/0x160 [xfs]
[  288.310272]  [<ffffffff810a8e9e>] lock_acquire+0x9e/0x140
[  288.310289]  [<ffffffffa0682d44>] ? xfs_ilock+0x134/0x160 [xfs]
[  288.310295]  [<ffffffff810a33ef>] down_write_nested+0x4f/0x80
[  288.310312]  [<ffffffffa0682d44>] ? xfs_ilock+0x134/0x160 [xfs]
[  288.310329]  [<ffffffffa0682d44>] xfs_ilock+0x134/0x160 [xfs]
[  288.310347]  [<ffffffffa067aa0c>] ? xfs_reclaim_inode+0x12c/0x340 [xfs]
[  288.310364]  [<ffffffffa067aa0c>] xfs_reclaim_inode+0x12c/0x340 [xfs]
[  288.310382]  [<ffffffffa067aea7>] xfs_reclaim_inodes_ag+0x287/0x400 [xfs]
[  288.310400]  [<ffffffffa067ad00>] ? xfs_reclaim_inodes_ag+0xe0/0x400 [xfs]
[  288.310418]  [<ffffffffa067bda3>] xfs_reclaim_inodes_nr+0x33/0x40 [xfs]
[  288.310438]  [<ffffffffa068b855>] xfs_fs_free_cached_objects+0x15/0x20 [xfs]
[  288.310445]  [<ffffffff811d8dd8>] super_cache_scan+0x178/0x180
[  288.310451]  [<ffffffff8117393e>] shrink_slab_node+0x15e/0x310
[  288.310457]  [<ffffffff811755e0>] shrink_slab+0x100/0x140
[  288.310463]  [<ffffffff81178306>] kswapd_shrink_zone+0x116/0x1a0
[  288.310469]  [<ffffffff8117925b>] kswapd+0x4bb/0x9a0
[  288.310475]  [<ffffffff81178da0>] ? mem_cgroup_shrink_node_zone+0x1c0/0x1c0
[  288.310481]  [<ffffffff8107a664>] kthread+0xe4/0x100
[  288.310488]  [<ffffffff8107a580>] ? flush_kthread_worker+0xf0/0xf0
[  288.310494]  [<ffffffff8173d66c>] ret_from_fork+0x7c/0xb0
[  288.310500]  [<ffffffff8107a580>] ? flush_kthread_worker+0xf0/0xf0



I've not seen any hang tasks just yet. The server seems to continue working. I will do more testing and get back to you with more info.

Andrei




From: "Ilya Dryomov" <ilya.dryomov@xxxxxxxxxxx>
To: "Andrei Mikhailovsky" <andrei@xxxxxxxxxx>
Cc: "ceph-users" <ceph-users@xxxxxxxxxxxxxx>
Sent: Friday, 28 November, 2014 5:28:28 PM
Subject: Re: Giant + nfs over cephfs hang tasks

On Fri, Nov 28, 2014 at 8:20 PM, Ilya Dryomov <ilya.dryomov@xxxxxxxxxxx> wrote:
> On Fri, Nov 28, 2014 at 8:19 PM, Ilya Dryomov <ilya.dryomov@xxxxxxxxxxx> wrote:
>> On Fri, Nov 28, 2014 at 8:13 PM, Andrei Mikhailovsky <andrei@xxxxxxxxxx> wrote:
>>> Ilya, yes I do! LIke these from different osds:
>>>
>>> [ 4422.212204] libceph: osd13 192.168.168.201:6819 socket closed (con state
>>> OPEN)
>>
>> Can you by any chance try a kernel from [1] ?  It's based on Ubuntu
>> config and unless you are doing something fancy should boot your box.
>> You have to install it only on the client box of course.
>>
>> This may be related to the bug I'm currently trying to nail down and
>> I'd like to know if the latest bits make any difference.
>
> [1] http://gitbuilder.ceph.com/kernel-deb-precise-x86_64-basic/ref/testing/linux-image-3.18.0-rc6-ceph-00024-g72ca172_3.18.0-rc6-ceph-00024-g72ca172-1_amd64.deb

It's currently rebuilding because of an unrelated patch and will be
overwritten once gitbuilder is done.  If it's not there by the time you
try use this link:

http://gitbuilder.ceph.com/kernel-deb-precise-x86_64-basic/sha1/72ca172a582d656930f413c3733401b8a5c120db/linux-image-3.18.0-rc6-ceph-00024-g72ca172_3.18.0-rc6-ceph-00024-g72ca172-1_amd64.deb

Thanks,

                Ilya

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux