Re: stable-rc 5.4: libhugetlbfs fallocate_stress.sh: Unable to handle kernel paging request at virtual address ffff00006772f000

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, 14 May 2020 at 22:01, Mike Kravetz <mike.kravetz@xxxxxxxxxx> wrote:
>
> On 5/13/20 11:40 PM, Michal Hocko wrote:
> > On Wed 13-05-20 23:11:40, Naresh Kamboju wrote:
> >> While running libhugetlbfs fallocate_stress.sh on stable-rc 5.4 branch kernel
> >> on arm64 hikey device. The following kernel Internal error: Oops:
> >> crash dump noticed.
> >
> > Is the same problem reproducible on vanilla 5.4 without any stable
> > patches?
> >
>
> Or, an earlier version of 5.4-stable?  Nothing in the changelog for 5.4.41
> looks related to this issue.  There was an arm specific hugetlb change
> "arm64: hugetlb: avoid potential NULL dereference", but that is pretty
> straight forward.
>
> I'm guessing this may not reproduce easily.  To help reproduce, you could
> change the
> #define FALLOCATE_ITERATIONS 100000
> in .../libhugetlbfs/tests/fallocate_stress.c to a larger number to force
> the stress test to run longer.

Sorry i did not get a chance to run as per your suggestion.
But this issue is reproducible on stable-rc 5.4.46-rc1 on arm64 hikey device

./runltp -p -q -f hugetlb
<>
ksm05.c:78: PASS: still alive.
ksm05.c:78: PASS: still alive.
ksm05.c:78: PASS: still alive.
ksm05.c:78: PASS: still alive.
[  383.751513] oom01 invoked oom-killer:
gfp_mask=0x100cca(GFP_HIGHUSER_MOVABLE), order=0, oom_score_adj=0
[  384.715831]   EC = 0x25: DABT (current EL), IL = 32 bits
[  384.725478] CPU: 0 PID: 10948 Comm: oom01 Not tainted
5.4.46-rc1-00035-g12a5ce113626 #1
[  384.730887]   SET = 0, FnV = 0
[  384.739060] Hardware name: HiKey Development Board (DT)
[  384.739066] Call trace:
[  384.739081]  dump_backtrace+0x0/0x140
[  384.739090]  show_stack+0x14/0x20
[  384.742209]   EA = 0, S1PTW = 0
[  384.746701] dwmmc_k3 f723d000.dwmmc0: Unexpected interrupt latency
[  384.747550]  dump_stack+0xb4/0xf8
[  384.747559]  dump_header+0x44/0x1ec
[  384.747565]  oom_kill_process+0x1d4/0x1d8
[  384.747572]  out_of_memory+0x170/0x4e0
[  384.750070] Data abort info:
[  384.753813]  __alloc_pages_slowpath+0x954/0x9f8
[  384.753819]  __alloc_pages_nodemask+0x21c/0x280
[  384.753826]  alloc_pages_vma+0x88/0x210
[  384.753836]  __handle_mm_fault+0x638/0x1080
[  384.757236]   ISV = 0, ISS = 0x00000047
[  384.760428]  handle_mm_fault+0xdc/0x1a8
[  384.760436]  do_page_fault+0x130/0x460
[  384.760442]  do_translation_fault+0x5c/0x78
[  384.760450]  do_mem_abort+0x3c/0x98
[  384.766776]   CM = 0, WnR = 1
[  384.770154]  el0_da+0x1c/0x20
[  384.773735] swapper pgtable: 4k pages, 48-bit VAs, pgdp=00000000013c5000
[  384.777949] Mem-Info:
[  384.781679] [ffff0000641ff000] pgd=0000000077ff7003,
pud=0000000077e0d003, pmd=0000000077cec003, pte=0000000000000000
[  384.781694] Internal error: Oops: 96000047 [#1] PREEMPT SMP
[  384.781698] Modules linked in: wl18xx wlcore mac80211 cfg80211
hci_uart snd_soc_audio_graph_card btbcm snd_soc_simple_card_utils
crct10dif_ce wlcore_sdio adv7511 bluetooth kirin_drm cec dw_drm_dsi
rfkill drm_kms_helper drm fuse
[  384.784854] active_anon:472313 inactive_anon:2168 isolated_anon:0
[  384.784854]  active_file:63 inactive_file:0 isolated_file:0
[  384.784854]  unevictable:0 dirty:0 writeback:0 unstable:0
[  384.784854]  slab_reclaimable:2625 slab_unreclaimable:7426
[  384.784854]  mapped:202 shmem:2175 pagetables:1188 bounce:0
[  384.784854]  free:5469 free_pcp:1684 free_cma:14
[  384.789304] CPU: 5 PID: 10945 Comm: oom01 Not tainted
5.4.46-rc1-00035-g12a5ce113626 #1
[  384.789309] Hardware name: HiKey Development Board (DT)
[  384.789315] pstate: 80000005 (Nzcv daif -PAN -UAO)
[  384.789328] pc : clear_page+0x10/0x24
[  384.789339] lr : __cpu_clear_user_page+0xc/0x18
[  384.794000] Node 0 active_anon:1889252kB inactive_anon:8672kB
active_file:412kB inactive_file:0kB unevictable:0kB isolated(anon):0kB
isolated(file):0kB mapped:800kB dirty:0kB writeback:0kB shmem:8700kB
shmem_thp: 0kB shmem_pmdmapped: 0kB anon_thp: 1478656kB
writeback_tmp:0kB unstable:0kB all_unreclaimable? yes
[  384.797884] sp : ffff8000190ebc10
[  384.797888] x29: ffff8000190ebc10 x28: ffff000066cf2a00
[  384.797895] x27: 0000000000000002 x26: fffffe00019f2400
[  384.797901] x25: ffff00006e51df00 x24: 0000000000001000
[  384.802205] Node 0 DMA32 free:21876kB min:22528kB low:28160kB
high:33792kB active_anon:1889252kB inactive_anon:8672kB
active_file:0kB inactive_file:484kB unevictable:0kB writepending:0kB
present:2061364kB managed:1995396kB mlocked:0kB kernel_stack:2800kB
pagetables:4752kB bounce:0kB free_pcp:6864kB local_pcp:1320kB
free_cma:56kB
[  384.806099] x23: 0000000000000000 x22: fffffe0001700000
[  384.806106] x21: 0000000000000000 x20: 0000fffef3800000
[  384.806112] x19: ffff000066cf2a00 x18: 0000000000000000
[  384.806117] x17: 0000000000000000 x16: 0000000000000000
[  384.810066] lowmem_reserve[]: 0 0 0
[  384.813873] x15: 0000000000000000 x14: 0000000000000000
[  384.813879] x13: 0000000000000000 x12: 0000000000000000
[  384.813885] x11: 0000000000000000 x10: 0000000000000000
[  384.813891] x9 : ffff800066671000 x8 : 0000000000000200
[  384.818192] Node 0 DMA32: 899*4kB (UME) 471*8kB (UMEC) 205*16kB
(UEC) 153*32kB (UMEC) 61*64kB (UME) 15*128kB (UE) 2*256kB (ME) 0*512kB
0*1024kB 0*2048kB 0*4096kB = 21876kB
[  384.821730] x7 : ffff800066671000 x6 : 0000000000000000
[  384.821737] x5 : 0000000000000000 x4 : 0000020000200000
[  384.821743] x3 : 0000000000007fc0 x2 : 0000000000000004
[  384.821748] x1 : 0000000000000040 x0 : ffff0000641ff000
[  384.821754] Call trace:
[  384.821762]  clear_page+0x10/0x24
[  384.821771]  clear_subpage+0x54/0x90
[  384.821780]  clear_huge_page+0x6c/0x208
[  384.824842] Node 0 hugepages_total=0 hugepages_free=0
hugepages_surp=0 hugepages_size=1048576kB
[  384.827843]  do_huge_pmd_anonymous_page+0x1a4/0x7a0
[  384.827851]  __handle_mm_fault+0x83c/0x1080
[  384.827857]  handle_mm_fault+0xdc/0x1a8
[  384.827863]  do_page_fault+0x130/0x460
[  384.827872]  do_translation_fault+0x5c/0x78
[  384.834786] Node 0 hugepages_total=0 hugepages_free=0
hugepages_surp=0 hugepages_size=32768kB
[  384.837038]  do_mem_abort+0x3c/0x98
[  384.837044]  el0_da+0x1c/0x20
[  384.837056] Code: d53b00e1 12000c21 d2800082 9ac12041 (d50b7420)
[  384.847921] Node 0 hugepages_total=0 hugepages_free=0
hugepages_surp=0 hugepages_size=2048kB
[  384.853582] ---[ end trace 298eea3ec03b10c2 ]---
[  384.853619] note: oom01[10945] exited with preempt_count 1
[  384.874237] Node 0 hugepages_total=0 hugepages_free=0
hugepages_surp=0 hugepages_size=64kB
[  385.070710] dwmmc_k3 f723d000.dwmmc0: Unexpected interrupt latency
[  385.551002] 2384 total pagecache pages
[  385.563572] 0 pages in swap cache
[  385.575536] Swap cache stats: add 0, delete 0, find 0/0
[  385.589403] Free swap  = 0kB
[  385.600885] Total swap = 0kB
[  385.612403] 515341 pages RAM
[  385.623860] 0 pages HighMem/MovableOnly
[  385.636339] 16492 pages reserved
[  385.648140] 32768 pages cma reserved
[  385.660271] 0 pages hwpoisoned
[  385.671865] Tasks state (memory values in pages):
[  385.685192] [  pid  ]   uid  tgid total_vm      rss pgtables_bytes
swapents oom_score_adj name
[  385.702711] [    377]     0   377     3387      315    61440
0             0 systemd-journal
[  385.720672] [    413]     0   413     3520      310    49152
0         -1000 systemd-udevd
[  385.738467] [    435]   993   435     1526       79    53248
0             0 systemd-network
[  385.756400] [    459]   992   459     1665       99    49152
0             0 systemd-resolve
[  385.774283] [    463]     0   463      553       21    40960
0             0 tee-supplicant
[  385.792002] [    464]     0   464     1479      111    45056
0             0 systemd-logind
[  385.809585] [    472]   995   472     1197      105    45056
0             0 avahi-daemon
[  385.826656] [    473]   995   473     1166       66    45056
0             0 avahi-daemon
[  385.843371] [    474]     0   474      771       19    40960
0             0 syslogd
[  385.859600] [    475]     0   475      771       18    45056
0             0 klogd
[  385.875602] [    476]     0   476     1382       62    49152
0             0 bluetoothd
[  385.892028] [    479]   996   479     1151      187    45056
0          -900 dbus-daemon
[  385.908563] [    481]     0   481    78394      563   106496
0             0 NetworkManager
[  385.925332] [    482]     0   482      698      133    40960
0             0 crond
[  385.941236] [    527] 65534   527      629       44    40960
0             0 dnsmasq
[  385.957226] [    529]     0   529      578       32    40960
0             0 agetty
[  385.973067] [    530]     0   530     1173      107    49152
0             0 login
[  385.988832] [    531]     0   531      578       32    40960
0             0 agetty
[  386.004670] [    536]     0   536     2385      148    49152
0             0 wpa_supplicant
[  386.020865] [    537]   998   537   115916     1319   131072
0             0 polkitd
[  386.036060] [    563]     0   563    24661      430    69632
0             0 dhclient
[  386.051325] [    602]     0   602     1899      214    57344
0             0 systemd
[  386.066521] [    603]     0   603     2569      477    61440
0             0 (sd-pam)
[  386.081811] [    607]     0   607      910      102    40960
0             0 sh
[  386.096558] [    611]     0   611     1039       81    45056
0             0 su
[  386.110911] [    612]     0   612      910       97    40960
0             0 sh
[  386.124866] [    615]     0   615      756       55    40960
0             0 lava-test-runne
[  386.139888] [   1327]     0  1327      756       50    40960
0             0 lava-test-shell
[  386.154903] [   1328]     0  1328      756       52    36864
0             0 sh
[  386.168797] [   1330]     0  1330      822      133    40960
0             0 ltp.sh
[  386.183055] [   1348]     0  1348      822      133    40960
0             0 ltp.sh
[  386.197278] [   1349]     0  1349      822      133    40960
0             0 ltp.sh
[  386.211413] [   1350]     0  1350      822      133    40960
0             0 ltp.sh
[  386.225573] [   1351]     0  1351      921      230    45056
0             0 runltp
[  386.239776] [   1352]     0  1352      452       15    40960
0             0 tee
[  386.253702] [   1426]     0  1426      451       28    40960
0             0 ltp-pan
[  386.267740] [  10933]     0 10933      494       18    32768
0             0 oom01
[  386.281352] [  10934]     0 10934      527       31    36864
0             0 oom01
[  386.294984] [  10944]     0 10944  5519894   467709  3833856
0             0 oom01
[  386.308646] oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),cpuset=/,mems_allowed=0,global_oom,task_memcg=/,task=oom01,pid=10944,uid=0
[  386.326954] Out of memory: Killed process 10944 (oom01)
total-vm:22079576kB, anon-rss:1870836kB, file-rss:0kB, shmem-rss:0kB,
UID:0 pgtables:3744kB oom_score_adj:0
ksm05.c:78: PASS: still alive.
ksm05.c:78: PASS: still alive.



> --
> Mike Kravetz



[Index of Archives]     [Linux Kernel]     [Kernel Development Newbies]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite Hiking]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux