Transparent Huge pages hanging on 5.1.x/5.2.0 kernels?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello,

In the last few weeks, one of my build boxes started hanging at the end of a build with a zombie ld.lld process stuck in the kernel:

[97199.634549] CPU: 14 PID: 72214 Comm: ld.lld Kdump: loaded Not tainted 5.2.0-1.fc31.x86_64 #1
[97199.634550] Hardware name: Supermicro SYS-5038K-i-NF9/K1SPE, BIOS 1.0b 04/13/2017
[97199.634551] RIP: 0010:compact_zone+0x4d0/0xce0
[97199.634553] Code: 41 c6 47 78 01 e9 52 fc ff ff 4c 89 f7 48 89 ea 4c 89 e6 e8 22 8e 02 00 49 89 c6 e9 d7 fd ff ff 8b 4c 24 10 4c 89 e2 4c 89 ee <4c> 89 ff e8 e8 e0 ff ff 49 89 c4 48 85 c0 0f 84 bd fe ff ff 45 8b
[97199.634555] RSP: 0018:ffffac6a53c879c0 EFLAGS: 00000202
[97199.634557] RAX: 0000000000000001 RBX: 000000000619f200 RCX: 000000000000000c
[97199.634558] RDX: 000000000619f000 RSI: 000000000619ee20 RDI: ffff95f77ffc8330
[97199.634559] RBP: ffff95fb7ffd4d00 R08: 0000000000000007 R09: 000000000619f000
[97199.634561] R10: 0000000000000000 R11: 0000000000000003 R12: 000000000619f000
[97199.634562] R13: 000000000619ee20 R14: fffffb58467b8000 R15: ffffac6a53c87a90
[97199.634563] FS:  00007ffff10fd700(0000) GS:ffff95f5fb780000(0000) knlGS:0000000000000000
[97199.634566] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[97199.634567] CR2: 00007fff08001378 CR3: 00000054737f6000 CR4: 00000000001406e0
[97199.634568] Call Trace:
[97199.634569]  compact_zone_order+0xde/0x140
[97199.634570]  try_to_compact_pages+0xcc/0x2a0
[97199.634570]  __alloc_pages_direct_compact+0x8c/0x170
[97199.634571]  __alloc_pages_slowpath+0x248/0xdf0
[97199.634572]  ? get_vtime_delta+0x13/0xe0
[97199.634573]  ? finish_task_switch+0x12f/0x2a0
[97199.634574]  __alloc_pages_nodemask+0x2f2/0x340
[97199.634575]  do_huge_pmd_anonymous_page+0x130/0x910
[97199.634576]  __handle_mm_fault+0xfd7/0x1ac0
[97199.634577]  handle_mm_fault+0xc4/0x1f0
[97199.634577]  do_user_addr_fault+0x1f6/0x450
[97199.634578]  do_page_fault+0x33/0x120
[97199.634579]  ? page_fault+0x8/0x30
[97199.634580]  page_fault+0x1e/0x30

This bug seems to go away if I comment out the following lines from my boot script:

# echo always > /sys/kernel/mm/transparent_hugepage/enabled
# echo always > /sys/kernel/mm/transparent_hugepage/defrag

What can I do to debug this further?

Dave





[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux