Re: mm: derefing NULL vma->vm_mm when unmapping

Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx> · Mon, 30 Jun 2014 15:07:28 -0700

On Mon, 30 Jun 2014 09:49:57 -0400 Sasha Levin <levinsasha928@xxxxxxxxx> wrote:

> Hi all,
> 
> While fuzzing with trinity inside a KVM tools guest running the latest -next
> kernel I've stumbled on the following spew:
> 
> [  761.704089] BUG: unable to handle kernel NULL pointer dereference at           (null)
> [  761.704089] IP: mm_find_pmd (mm/rmap.c:570)

Does this mean it oopsed in mm_find_pmd()'s call to pgd_offset()?

> [  761.704089] PGD 51223067 PUD 50a09067 PMD 0
> [  761.704089] Oops: 0000 [#1] PREEMPT SMP DEBUG_PAGEALLOC
> [  761.704089] Dumping ftrace buffer:
> [  761.704089]    (ftrace buffer empty)
> [  761.704089] Modules linked in:
> [  761.704089] CPU: 4 PID: 20723 Comm: trinity-c131 Tainted: G        W      3.16.0-rc3-next-20140630-sasha-00023-g44434d4-dirty #756
> [  761.704089] task: ffff88004e3c0000 ti: ffff88004e0b8000 task.ti: ffff88004e0b8000
> [  761.704089] RIP: mm_find_pmd (mm/rmap.c:570)
> [  761.704089] RSP: 0000:ffff88004e0bbaa8  EFLAGS: 00010246
> [  761.704089] RAX: 0000000000000000 RBX: 0000000000a65000 RCX: ffff88004e0bbb30
> [  761.704089] RDX: 0000000000000000 RSI: 0000000000a65000 RDI: ffff880000146000
> [  761.704089] RBP: ffff88004e0bbaa8 R08: 0000000000000000 R09: 0000000000000000
> [  761.704089] R10: ffff88004e3c0000 R11: 0000000000000000 R12: ffffea000d766e00
> [  761.704089] R13: ffff88004e0bbb30 R14: ffff880000146000 R15: 0000000000000000
> [  761.704089] FS:  00007f0293c61700(0000) GS:ffff880144e00000(0000) knlGS:0000000000000000
> [  761.704089] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> [  761.704089] CR2: 0000000000000000 CR3: 000000004e3be000 CR4: 00000000000006a0
> [  761.704089] Stack:
> [  761.704089]  ffff88004e0bbae8 ffffffff9c2d0815 800000035d9b8805 ffff880000146000
> [  761.704089]  ffffea000d766e00 ffff88000b4c4e58 ffff880034d7d200 0000000000000302
> [  761.704089]  ffff88004e0bbb68 ffffffff9c2d1491 ffff88004e0bbb28 ffffffff9f57c58a
> [  761.704089] Call Trace:
> [  761.704089] __page_check_address (mm/rmap.c:618)
> [  761.704089] try_to_unmap_one (mm/rmap.c:1133)
> [  761.704089] ? down_read (kernel/locking/rwsem.c:45 (discriminator 2))
> [  761.704089] ? page_lock_anon_vma_read (./arch/x86/include/asm/atomic.h:118 mm/rmap.c:491)
> [  761.704089] ? page_lock_anon_vma_read (mm/rmap.c:448)
> [  761.704089] rmap_walk (mm/rmap.c:1634 mm/rmap.c:1705)
> [  761.704089] try_to_unmap (mm/rmap.c:1527)
> [  761.704089] ? page_remove_rmap (mm/rmap.c:1124)
> [  761.704089] ? invalid_migration_vma (mm/rmap.c:1483)
> [  761.704089] ? try_to_unmap_one (mm/rmap.c:1391)
> [  761.704089] ? anon_vma_prepare (mm/rmap.c:448)
> [  761.704089] ? invalid_mkclean_vma (mm/rmap.c:1478)
> [  761.704089] ? page_get_anon_vma (mm/rmap.c:405)
> [  761.704089] migrate_pages (mm/migrate.c:912 mm/migrate.c:955 mm/migrate.c:1142)
> [  761.704089] ? perf_trace_mm_numa_migrate_ratelimit (mm/migrate.c:1590)
> [  761.704089] migrate_misplaced_page (mm/migrate.c:1750)
> [  761.704089] __handle_mm_fault (mm/memory.c:3162 mm/memory.c:3212 mm/memory.c:3322)
> [  761.704089] handle_mm_fault (include/linux/memcontrol.h:124 mm/memory.c:3348)
> [  761.704089] ? __do_page_fault (arch/x86/mm/fault.c:1163)
> [  761.704089] __do_page_fault (arch/x86/mm/fault.c:1230)
> [  761.704089] ? vtime_account_user (kernel/sched/cputime.c:687)
> [  761.704089] ? get_parent_ip (kernel/sched/core.c:2550)
> [  761.704089] ? context_tracking_user_exit (include/linux/vtime.h:89 include/linux/jump_label.h:115 include/trace/events/context_tracking.h:47 kernel/context_tracking.c:180)
> [  761.704089] ? preempt_count_sub (kernel/sched/core.c:2606)
> [  761.704089] ? context_tracking_user_exit (kernel/context_tracking.c:184)
> [  761.704089] ? __this_cpu_preempt_check (lib/smp_processor_id.c:63)
> [  761.704089] ? trace_hardirqs_off_caller (kernel/locking/lockdep.c:2638 (discriminator 2))
> [  761.704089] trace_do_page_fault (arch/x86/mm/fault.c:1313 include/linux/jump_label.h:115 include/linux/context_tracking_state.h:27 include/linux/context_tracking.h:45 arch/x86/mm/fault.c:1314)
> [  761.704089] do_async_page_fault (arch/x86/kernel/kvm.c:264)
> [  761.704089] async_page_fault (arch/x86/kernel/entry_64.S:1322)
> [  761.704089] Code: 00 48 8b 5d f0 4c 8b 65 f8 c9 c3 66 0f 1f 44 00 00 66 66 66 66 90 55 48 89 f2 48 8b 47 40 48 c1 ea 27 48 89 e5 81 e2 ff 01 00 00 <48> 8b 3c d0 40 f6 c7 01 75 0c 31 f6 e9 af 00 00 00 0f 1f 44 00
> All code
> ========
>    0:	00 48 8b             	add    %cl,-0x75(%rax)
>    3:	5d                   	pop    %rbp
>    4:	f0 4c 8b 65 f8       	lock mov -0x8(%rbp),%r12
>    9:	c9                   	leaveq
>    a:	c3                   	retq
>    b:	66 0f 1f 44 00 00    	nopw   0x0(%rax,%rax,1)
>   11:	66 66 66 66 90       	data32 data32 data32 xchg %ax,%ax
>   16:	55                   	push   %rbp
>   17:	48 89 f2             	mov    %rsi,%rdx
>   1a:	48 8b 47 40          	mov    0x40(%rdi),%rax

0x40 is mm_struct.pgd

>   1e:	48 c1 ea 27          	shr    $0x27,%rdx
>   22:	48 89 e5             	mov    %rsp,%rbp
>   25:	81 e2 ff 01 00 00    	and    $0x1ff,%edx
>   2b:*	48 8b 3c d0          	mov    (%rax,%rdx,8),%rdi		<-- trapping instruction

So we seem to have mm->pgd == NULL?
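
For reference, the arithmetic in the disassembly can be modelled in
userspace roughly as below.  This is only a sketch: PGDIR_SHIFT and
PTRS_PER_PGD carry the usual x86-64 4-level values, while pgd_index_of()
and pgd_entry_address() are made-up helper names, not kernel functions.

```c
#include <stdint.h>
#include <stddef.h>

/* x86-64, 4-level paging: each top-level (PGD) entry covers 2^39 bytes. */
#define PGDIR_SHIFT	39
#define PTRS_PER_PGD	512

/* Mirrors "shr $0x27,%rdx; and $0x1ff,%edx" from the oops disassembly. */
static inline uint64_t pgd_index_of(uint64_t address)
{
	return (address >> PGDIR_SHIFT) & (PTRS_PER_PGD - 1);
}

/*
 * Address read by "mov (%rax,%rdx,8),%rdi": base + index * 8.
 * pgd_base models the value loaded from 0x40(%rdi), i.e. mm->pgd
 * (hypothetical helper, for illustration only).
 */
static inline uint64_t pgd_entry_address(uint64_t pgd_base, uint64_t address)
{
	return pgd_base + pgd_index_of(address) * sizeof(uint64_t);
}
```

Plugging in the register dump: RSI is 0xa65000, which gives index 0
(matching RDX), and RAX is 0, so the load goes to address 0.  That agrees
with CR2 being 0.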

dump_pagetable() was able to locate the pgd OK when it printed "PGD
51223067 PUD 50a09067 PMD 0", but it plucks the pgd out of the physical
pagetables, not out of the mm_struct.
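
The entries on that line can be decoded by hand.  A minimal sketch,
assuming the standard x86-64 page-table entry layout; the PTE_* macro
names and the two helpers are invented here for illustration and are not
the kernel's own definitions:

```c
#include <stdint.h>
#include <stdbool.h>

/* Low flag bits of an x86-64 page-table entry (illustrative names). */
#define PTE_PRESENT	(1ULL << 0)
#define PTE_RW		(1ULL << 1)
#define PTE_USER	(1ULL << 2)
#define PTE_ACCESSED	(1ULL << 5)
#define PTE_DIRTY	(1ULL << 6)

/* Bits 12..51 hold the physical address of the next-level table. */
#define PTE_ADDR_MASK	0x000ffffffffff000ULL

/* True when the entry's present bit is set. */
static inline bool entry_present(uint64_t entry)
{
	return (entry & PTE_PRESENT) != 0;
}

/* Physical address of the table (or page) the entry points at. */
static inline uint64_t entry_phys(uint64_t entry)
{
	return entry & PTE_ADDR_MASK;
}
```

With these, 0x51223067 and 0x50a09067 both decode as present entries
with flags 0x067 (present, writable, user, accessed, dirty), and "PMD 0"
is simply a not-present entry, as expected when the walk is for
address 0.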

Dunno.  You're under KVM and tracing is enabled, yes?  I don't
immediately see how that would affect it.

>   2f:	40 f6 c7 01          	test   $0x1,%dil
>   33:	75 0c                	jne    0x41
>   35:	31 f6                	xor    %esi,%esi
>   37:	e9 af 00 00 00       	jmpq   0xeb
>   3c:	0f 1f 44 00 00       	nopl   0x0(%rax,%rax,1)
> 
> Code starting with the faulting instruction
> ===========================================
>    0:	48 8b 3c d0          	mov    (%rax,%rdx,8),%rdi
>    4:	40 f6 c7 01          	test   $0x1,%dil
>    8:	75 0c                	jne    0x16
>    a:	31 f6                	xor    %esi,%esi
>    c:	e9 af 00 00 00       	jmpq   0xc0
>   11:	0f 1f 44 00 00       	nopl   0x0(%rax,%rax,1)
> [  761.704089] RIP mm_find_pmd (mm/rmap.c:570)
> [  761.704089]  RSP <ffff88004e0bbaa8>
> [  761.704089] CR2: 0000000000000000
> 
> As I didn't see any code changes around that part I'm thinking that it's a locking
> issue that got messed up somewhere rather than a missing '!= NULL' check.
