Re: mm: hung task (handle_pte_fault)

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, 23 Mar 2012 12:45:03 +0200
Sasha Levin <levinsasha928@xxxxxxxxx> wrote:

> Hi guys,
> 
> During fuzzing using trinity inside a KVM tools guest with latest
> linux-next, I seem to be getting it hung once in a while, with the
> following spew:
> 
> [ 1441.420617] INFO: task trinity:2706 blocked for more than 120 seconds.
> [ 1441.421894] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs"
> disables this message.
> [ 1441.424493] trinity         D 0000000000000000  3472  2706  16846 0x00000004
> [ 1441.426749]  ffff880029dbbb38 0000000000000086 ffff880029dbbae8
> ffff880029dbbfd8
> [ 1441.428582]  00000000001d45c0 ffff880029dba000 00000000001d45c0
> 00000000001d45c0
> [ 1441.430156]  00000000001d45c0 00000000001d45c0 ffff880029dbbfd8
> 00000000001d45c0
> [ 1441.432371] Call Trace:
> [ 1441.433042]  [<ffffffff81176dd0>] ? file_read_actor+0x1d0/0x1d0
> [ 1441.434251]  [<ffffffff827148d4>] schedule+0x24/0x70
> [ 1441.435314]  [<ffffffff82714e67>] io_schedule+0x87/0xd0
> [ 1441.436357]  [<ffffffff81176dd9>] sleep_on_page+0x9/0x10
> [ 1441.437442]  [<ffffffff82712cb7>] __wait_on_bit+0x57/0x80
> [ 1441.438584]  [<ffffffff81177bff>] ? __lock_page_or_retry+0x8f/0xd0
> [ 1441.439948]  [<ffffffff811775de>] wait_on_page_bit+0x6e/0x80
> [ 1441.440859]  [<ffffffff810d6d20>] ? autoremove_wake_function+0x40/0x40
> [ 1441.441700]  [<ffffffff810dbfbe>] ? up_read+0x1e/0x40
> [ 1441.442428]  [<ffffffff81177c36>] __lock_page_or_retry+0xc6/0xd0
> [ 1441.443270]  [<ffffffff81178490>] filemap_fault+0x440/0x4e0
> [ 1441.444072]  [<ffffffff811991cf>] __do_fault+0x7f/0x5f0
> [ 1441.444829]  [<ffffffff81112c00>] ?
> add_lock_to_list.clone.18.clone.27+0xd0/0xe0
> [ 1441.445886]  [<ffffffff8119cd27>] handle_pte_fault+0xf7/0x1e0
> [ 1441.446740]  [<ffffffff8119e1ce>] handle_mm_fault+0x1ce/0x330
> [ 1441.447537]  [<ffffffff8119e53c>] __get_user_pages+0x14c/0x640
> [ 1441.448399]  [<ffffffff811129ae>] ? put_lock_stats.clone.19+0xe/0x40
> [ 1441.449288]  [<ffffffff81117b1d>] ? __lock_acquired+0x19d/0x270
> [ 1441.450164]  [<ffffffff811a0087>] __mlock_vma_pages_range+0x87/0xa0
> [ 1441.451127]  [<ffffffff811a0129>] do_mlock_pages+0x89/0x160
> [ 1441.451932]  [<ffffffff811a0b71>] sys_mlockall+0x111/0x1a0
> [ 1441.452761]  [<ffffffff827176bd>] system_call_fastpath+0x1a/0x1f
> [ 1441.453659] no locks held by trinity/2706.
> [ 1441.454267] Kernel panic - not syncing: hung_task: blocked tasks
> 
> According to the logs, it's not the direct result of anything specific
> happening, so I can't give an exact scenario to reproduce it. It does
> happen rather often.

The task is waiting for IO to complete against a page, and it isn't
happening.

There are quite a lot of things which could cause this, alas.  VM,
readahead, scheduler, core wait/wakeup code, IO system, interrupt
system (if it happens outside KVM, I guess).

So....  ugh.  Hopefully someone will hit this in a situation where it
can be narrowed down or bisected.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@xxxxxxxxx.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@xxxxxxxxx";> email@xxxxxxxxx </a>


[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [ECOS]     [Asterisk Internet PBX]     [Linux API]