Re: [Bug 75101] New: [bisected] s2disk / hibernate blocks on "Saving 506031 image data pages () ..."

On 6/13/2014 12:02 AM, Johannes Weiner wrote:
On Tue, May 06, 2014 at 01:45:01AM +0200, Rafael J. Wysocki wrote:
On 5/6/2014 1:33 AM, Johannes Weiner wrote:
Hi Oliver,

On Mon, May 05, 2014 at 11:00:13PM +0200, Oliver Winker wrote:
Hello,

1) Attached a full function-trace log + other SysRq outputs, see [1]
attached.

I saw bdi_...() calls in the s2disk paths, but didn't check in detail.
It is probably more efficient if one of you guys looks directly.
Thanks, this looks interesting.  balance_dirty_pages() wakes up the
bdi_wq workqueue as it should:

[  249.148009]   s2disk-3327    2.... 48550413us : global_dirty_limits <-balance_dirty_pages_ratelimited
[  249.148009]   s2disk-3327    2.... 48550414us : global_dirtyable_memory <-global_dirty_limits
[  249.148009]   s2disk-3327    2.... 48550414us : writeback_in_progress <-balance_dirty_pages_ratelimited
[  249.148009]   s2disk-3327    2.... 48550414us : bdi_start_background_writeback <-balance_dirty_pages_ratelimited
[  249.148009]   s2disk-3327    2.... 48550414us : mod_delayed_work_on <-balance_dirty_pages_ratelimited
but the worker wakeup doesn't actually do anything:
[  249.148009] kworker/-3466    2d... 48550431us : finish_task_switch <-__schedule
[  249.148009] kworker/-3466    2.... 48550431us : _raw_spin_lock_irq <-worker_thread
[  249.148009] kworker/-3466    2d... 48550431us : need_to_create_worker <-worker_thread
[  249.148009] kworker/-3466    2d... 48550432us : worker_enter_idle <-worker_thread
[  249.148009] kworker/-3466    2d... 48550432us : too_many_workers <-worker_enter_idle
[  249.148009] kworker/-3466    2.... 48550432us : schedule <-worker_thread
[  249.148009] kworker/-3466    2.... 48550432us : __schedule <-worker_thread

My suspicion is that this fails because the bdi_wq is frozen at this
point and so the flush work never runs until resume, whereas before my
patch the effective dirty limit was high enough so that image could be
written in one go without being throttled; followed by an fsync() that
then writes the pages in the context of the unfrozen s2disk.

Does this make sense?  Rafael?  Tejun?
Well, it does seem to make sense to me.
From what I see, this is a deadlock in the userspace suspend model and
just happened to work by chance in the past.

Well, it had been working for quite a while, so it was a rather large opportunity
window it seems. :-)

Can we patch suspend-utils as follows?

Perhaps we can.  Let's ask the new maintainer.

Rodolfo, do you think you can apply the patch below to suspend-utils?

Alternatively, suspend-utils
could clear the dirty limits before it starts writing and restore them
post-resume.
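
The save-and-restore alternative above could be sketched roughly as follows. This is only an illustration and not code from suspend-utils: the helper names (read_long_from, write_long_to, swap_limit) are hypothetical, and which tunable to touch (for example /proc/sys/vm/dirty_ratio) and what value counts as "cleared" are assumptions that the thread does not settle.

```c
#include <stdio.h>

/* Hypothetical helper: read one integer from a sysctl-style file.
 * Returns 0 on success, -1 on failure. */
int read_long_from(const char *path, long *val)
{
	FILE *f = fopen(path, "r");
	int ok;

	if (!f)
		return -1;
	ok = (fscanf(f, "%ld", val) == 1);
	fclose(f);
	return ok ? 0 : -1;
}

/* Hypothetical helper: overwrite a sysctl-style file with one integer.
 * Returns 0 on success, -1 on failure. */
int write_long_to(const char *path, long val)
{
	FILE *f = fopen(path, "w");

	if (!f)
		return -1;
	if (fprintf(f, "%ld\n", val) < 0) {
		fclose(f);
		return -1;
	}
	return fclose(f) == 0 ? 0 : -1;
}

/* Remember the current value in *saved, then install new_val.
 * The caller writes *saved back after resume. */
int swap_limit(const char *path, long new_val, long *saved)
{
	if (read_long_from(path, saved) < 0)
		return -1;
	return write_long_to(path, new_val);
}
```

A caller would invoke swap_limit() before writing the image and write_long_to(path, saved) after resume. Writing under /proc/sys/vm requires root, and the saved value would also have to be restored on every error path.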

That (and the patch too) doesn't seem to address the problem with existing suspend-utils
binaries, however.

Rafael


---
From 73d6546d5e264130e3d108c97d8317f86dc11149 Mon Sep 17 00:00:00 2001
From: Johannes Weiner <hannes@xxxxxxxxxxx>
Date: Thu, 12 Jun 2014 17:43:05 -0400
Subject: [patch] s2disk: fix buffered IO throttling deadlock in frozen state

s2disk uses buffered IO when writing the snapshot image to disk.  If
it runs into the dirty limits, the kernel forces it to wait until the
flusher threads clean some of the dirty pages.  However, at this point
s2disk already froze the system, including the flusher infrastructure,
and the whole operation deadlocks.

Open the resume device with O_SYNC to force flushing any dirty pages
directly from the write() context before they accumulate and engage
dirty throttling.

Signed-off-by: Johannes Weiner <hannes@xxxxxxxxxxx>
---
 suspend.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/suspend.c b/suspend.c
index 479ce58555f7..1b9bed81f58a 100644
--- a/suspend.c
+++ b/suspend.c
@@ -2436,7 +2436,7 @@ int main(int argc, char *argv[])
 		suspend_error("Could not create %s/%s.", chroot_path, "resume");
 		goto Umount;
 	}
-	resume_fd = open("resume", O_RDWR);
+	resume_fd = open("resume", O_RDWR | O_SYNC);
 	if (resume_fd < 0) {
 		ret = errno;
 		suspend_error("Could not open the resume device.");

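For readers outside the suspend-utils tree, the effect of the one-line change can be illustrated with a small standalone sketch. The helper names open_synced and write_all are hypothetical and do not exist in suspend.c; only the open() flags mirror the patch. With O_SYNC, each write() returns only after the data has been handed to the storage device, so the writer should not build up a backlog of dirty pages that it would later be throttled on.

```c
#include <errno.h>
#include <fcntl.h>
#include <stddef.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper: open an existing file or device node the way
 * the patched suspend.c opens "resume". Returns the fd, or -1. */
int open_synced(const char *path)
{
	return open(path, O_RDWR | O_SYNC);
}

/* Hypothetical helper: write the whole buffer, retrying on short
 * writes and EINTR. Returns 0 on success or a negative errno value. */
int write_all(int fd, const void *buf, size_t len)
{
	const char *p = buf;

	while (len > 0) {
		ssize_t n = write(fd, p, len);

		if (n < 0) {
			if (errno == EINTR)
				continue;
			return -errno;
		}
		p += n;
		len -= (size_t)n;
	}
	return 0;
}
```

The trade-off is throughput: synchronous writes are typically slower than buffered ones, which is presumably acceptable here because the alternative is a hang.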