Hi,

Sorry for the delay, only today I was able to look into this again.

On 21:01 Wed 24 Nov, Rafael J. Wysocki wrote:
> > I'm trying to debug freezes upon resume from suspend-to-RAM on my
> > Thinkpad X61s. The userland is a mostly stable x86 Gentoo Linux with
> > recent X stack. The freeze does not occur always, but it is
> > reproducible after a few successful suspend-resume cycles. I can
> > reproduce it because affected kernels will not complete 10
> > suspend-resume cycles, so this has been my criterion for all the tests I
> > have done. When the freeze occurs I have seen various things happening.
> > In most instances the X screen comes up in the state before the resume,
> > works for a few milliseconds and then freezes. In this state the
> > computer is not reachable by ssh and the only option is to reboot. In
> > other instances I have also seen X crashing, dropping me to a console
> > and then freezing. In one instance (still with gentoo-sources, before I
> > started testing with mainline) I was able to ssh into the system and
> > found spurious things in the log. Please see the attached file
> > 'messages'. On 12:08:33 I connect via ssh as root. I see the process X
> > occupying 100% CPU and it cannot be killed (kill -9 has no effect). I
> > tried to reboot the machine with init x6, but this failed. Browsing the
> > list archive it might be related to what Linus reported here:
> >
> > https://lists.linux-foundation.org/pipermail/linux-pm/2010-November/029299.html
>
> This warning means that resume took more than 10 s, which is suspicious, but
> not a bug by itself.
>
> > These problems appear to me as a regression since everything is
> > reproducibly fine with kernels <=2.6.34, and 2.6.35 seems to be fine too
> > (although I have tested that less). I started a bisect, see below.
>
> Well, it would be good if you could find the commit that broke things for
> you, but it might be a few different things - ACPI, PCI, graphics driver, etc.
>
> Please check if you can reproduce the symptom using pm_test:
>
> # echo core > /sys/power/pm_test
> # echo mem > /sys/power/state
>
> (it should simulate a suspend-resume cycle and get back to the command prompt
> in 5-10 sec). You can try to run that in a loop and see if that breaks things.

I tried this and I can reproduce the problem in this way. Here are some
additional observations:

-) In the tries I did with the debug procedure the problem always
   occurred on exactly the 4th resume.
-) I killed X before doing this to make sure that it is not involved,
   and indeed, the problem persists.
-) When the resume has failed I can still switch consoles with Alt+F?,
   but any keys typed into the new console are lost, i.e. I cannot log
   in or do anything else. The machine is not reachable by ssh. On the
   console where I was logged in I do get visual "^C" feedback, but
   "^C" has no effect. Everything is frozen.

Regarding the bisect, I consider my attempts failed. Even using git
bisect skip I am constantly running into non-bootable kernels. How do
people use git bisect? It seems quite useless to me, as all these
patches depend on each other. Probably I would have to track down the
issues of the non-booting kernels, but I'm not very motivated to debug
the failures that occur during debugging... What else can I do to track
this down?
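For reference, the pm_test procedure quoted above can be run in a loop
roughly as follows. This is only a minimal sketch, not the exact commands
used for the tests reported here: the function name pm_test_loop and the
PM_TEST_FILE / PM_STATE_FILE overrides are hypothetical names introduced
for illustration, and the default cycle count of 10 simply mirrors the
10-cycle criterion mentioned earlier. The sync before each cycle is an
added precaution so that logs reach the disk before a possible freeze.

```shell
#!/bin/sh
# Sketch: repeat the simulated suspend-resume cycle from the quoted
# instructions. PM_TEST_FILE and PM_STATE_FILE are hypothetical overrides;
# the defaults are the sysfs paths given in the mail.

pm_test_loop() {
    cycles="${1:-10}"
    pm_test="${PM_TEST_FILE:-/sys/power/pm_test}"
    pm_state="${PM_STATE_FILE:-/sys/power/state}"

    # Select the "core" test level, as suggested in the quoted reply.
    echo core > "$pm_test" || return 1

    i=1
    while [ "$i" -le "$cycles" ]; do
        echo "cycle $i of $cycles"
        # Flush pending writes so the log survives a hard reboot.
        sync
        # Trigger the (simulated) suspend; returns after the test delay.
        echo mem > "$pm_state" || { echo "cycle $i failed" >&2; return 1; }
        i=$((i + 1))
    done

    # Switch test mode off again so later suspends are real ones.
    echo none > "$pm_test"
}
```

Run as root, for example with `pm_test_loop 10`. If the machine freezes,
the last "cycle N of M" line printed shows on which resume it happened.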
The failure starts to show between

625c94df22fb4247c8ccd54635a8ec77cb60d651 (Fri Aug 13) (bad)
and
ab69bcd66fb4be64edfc767365cb9eb084961246 (Fri Aug 6) (good)

Thanks,
Thomas

For reference, here is the log again:

$ git bisect log
git bisect start
# bad: [3561d43fd289f590fdae672e5eb831b8d5cf0bf6] Linux 2.6.37-rc3
git bisect bad 3561d43fd289f590fdae672e5eb831b8d5cf0bf6
# good: [9fe6206f400646a2322096b56c59891d530e8d51] Linux 2.6.35
git bisect good 9fe6206f400646a2322096b56c59891d530e8d51
# bad: [a126681810a327bee60a58ac5ad77f4518cf7a5f] [media] staging/lirc: ioctl portability fixups
git bisect bad a126681810a327bee60a58ac5ad77f4518cf7a5f
# skip: [faa38b5e0e092914764cdba9f83d31a3f794d182] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound-2.6
git bisect skip faa38b5e0e092914764cdba9f83d31a3f794d182
# good: [0aa094331b19e54f928e2ac083285ff68d91c69b] [SCSI] be2iscsi: Remove debug print in IO path
git bisect good 0aa094331b19e54f928e2ac083285ff68d91c69b
# good: [c14c643b3d91cc741425c058968672228c310927] e1000e: update driver version number
git bisect good c14c643b3d91cc741425c058968672228c310927
# skip: [d71048e22f47725a5808ea2e4e1e72fa36c1a788] Merge branch 'omap-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tmlind/linux-omap-2.6
git bisect skip d71048e22f47725a5808ea2e4e1e72fa36c1a788
# bad: [06c10884486a63a1e4ff657aaa51e848e64b9dc3] ARM: relax ioremap prohibition (309caa9) for -final and -stable
git bisect bad 06c10884486a63a1e4ff657aaa51e848e64b9dc3
# bad: [625c94df22fb4247c8ccd54635a8ec77cb60d651] [S390] dasd: fix format string types
git bisect bad 625c94df22fb4247c8ccd54635a8ec77cb60d651
# good: [ab69bcd66fb4be64edfc767365cb9eb084961246] Merge git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/driver-core-2.6
git bisect good ab69bcd66fb4be64edfc767365cb9eb084961246
# skip: [9e0ba741aabdf129d561e62d207c75146adef5e8] drivers/char/vt.c:vc_do_resize(): local var `end' should be unsigned long
git bisect skip 9e0ba741aabdf129d561e62d207c75146adef5e8
# skip: [6b8e8282611ea35845dcff0cb321a7d735fc3155] drivers/hwmon/coretemp.c: remove unneeded #ifdef CONFIG_HOTPLUG_CPU
git bisect skip 6b8e8282611ea35845dcff0cb321a7d735fc3155
# good: [b0c983375df8aafdddf250009a7bb929225e6011] Staging: dt3155: Change include to linux header
git bisect good b0c983375df8aafdddf250009a7bb929225e6011
# skip: [5f248c9c251c60af3403902b26e08de43964ea0b] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs-2.6
git bisect skip 5f248c9c251c60af3403902b26e08de43964ea0b
# skip: [b70a3e0702dee2ed9435e06a8bde7d9fa2228895] All filesystems that need invalidate_inode_buffers() are doing that explicitly
git bisect skip b70a3e0702dee2ed9435e06a8bde7d9fa2228895
# skip: [c2fab1242a16f8e66b680d664322c3811e7b003d] ARM: 6306/1: fix inverted MMC card detect in U300
git bisect skip c2fab1242a16f8e66b680d664322c3811e7b003d

--
Thomas Kahle
http://dev.gentoo.org/~tomka/