Am Dienstag 28 September 2010 schrieb Martin Steigerwald: > Am Sonntag 26 September 2010 schrieb Nigel Cunningham: > > Hi. > > > > On 26/09/10 08:10, Martin Steigerwald wrote: > > > Hi Nigel. > > > > > > Am Samstag 25 September 2010 schrieb Nigel Cunningham: > > >> On 26/09/10 05:58, Martin Steigerwald wrote: > > >>> Am Samstag 25 September 2010 schrieb Martin Steigerwald: > > >>>> Hi Nigel and Rafael, > > >>>> > > >>>> Am Samstag 25 September 2010 schrieb Nigel Cunningham: > > >>>>> Add support for submitting reads before they're needed. This > > >>>>> greatly improves the speed of resuming: > > >>>>> > > >>>>> From > > >>>>> > > >>>>> PM: Image read at 66 MB/s. > > >>>>> > > >>>>> to > > >>>>> > > >>>>> PM: Image read at 229 MB/s. > > >>>>> > > >>>>> ...and removes the need for the sync_read flag. > > >>>> > > >>>> So > > >>>> > > >>>> martin@shambhala:~/Computer/Shambhala/Kernel/2.6.36/tuxonice-hea > > >>>> d> git branch -av | grep for-rafael > > >>>> * for-rafael d4e7490 Hibernate: Implement > > >>>> readahead when resuming > > >>>> > > >>>> remotes/origin/for-rafael d4e7490 Hibernate: Implement > > >>>> > > >>>> readahead when resuming > > >>> > > >>> [...] > > >>> > > >>>> basically seems to work. > > >>> > > >>> [...] > > >>> > > >>>> I tried 5 times: > > >>>> > > >>>> - one with just kdm started worked nicely and really rather > > >>>> fast! > > >>>> > > >>>> - four with a full blown KDE 4.5.1 session with OpenGL > > >>>> compositing > > >>>> > > >>>> - one seemed to hang prior to reinitializing the Radeon KMS > > >>>> DRM setup - three other worked just fine > > >>>> > > >>>> I do not think that the hang is related to your changes, Nigel. > > >>>> The kernel remained stuck at the lower initial resolution and > > >>>> didn't seem to initialize the radeon KMS framebuffers at > > >>>> 1400x1050 properly. I didn't see this with 2.6.35 and userspace > > >>>> software suspend. > > >>> > > >>> I am not so sure anymore. > > >>> > > >>> I got another one of these hangs with the 2.6.36-rc5 mentioned > > >>> above. See IMG_3871.jpg for the exact display were it hung. I was > > >>> able to switch view with Alt-F1 or something like that. And then > > >>> got IMG_3873.jpg. But nothing happened anymore. Find these on: > > >>> > > >>> http://martin-steigerwald.de/tmp/tuxonice/hang-after-resume-with- > > >>> pm - patches-for-rafael/ > > >>> > > >>> I now tried in kernel suspend to disk with > > >>> > > >>> martin@shambhala:~> cat /proc/version > > >>> Linux version 2.6.35.5-tp42-vmembase-0-pm-avoid-oom-dirty > > >>> (martin@shambhala) (gcc version 4.4.5 20100728 (prerelease) > > >>> (Debian 4.4.4-8) ) #4 PREEMPT Sat Sep 25 13:29:53 CEST 2010 > > >>> > > >>> which doesn't contain your patches, Nigel, for about 5 or 6 times > > >>> and I did not see that hang. > > >>> > > >>> So maybe something in your patches, even if just the debug output > > >>> I mentioned, or something in 2.6.36-rc5 triggers that hang. > > >>> > > >>> I will test 2.6.35.5 for a bit longer to make sure that there is > > >>> no hang on resume prior to loading. I need to reboot this one > > >>> now too, cause after one of the resume attempts USB stopped > > >>> working with: > > >>> > > >>> Sep 25 21:36:47 shambhala kernel: usb 1-3: New USB device found, > > >>> idVendor=1307, idProduct=0330 > > >>> Sep 25 21:36:47 shambhala kernel: usb 1-3: New USB device > > >>> strings: Mfr=1, Product=2, SerialNumber=3 > > >>> Sep 25 21:36:47 shambhala kernel: usb 1-3: Product: Mass Storage > > >>> Device Sep 25 21:36:47 shambhala kernel: usb 1-3: Manufacturer: > > >>> Generic Sep 25 21:36:47 shambhala kernel: usb 1-3: SerialNumber: > > >>> 00000000000006 Sep 25 21:36:47 shambhala kernel: scsi3 : > > >>> usb-storage 1-3:1.0 Sep 25 21:36:48 shambhala kernel: BUG: unable > > >>> to handle kernel NULL pointer dereference at 0000002c > > >>> Sep 25 21:36:48 shambhala kernel: IP: [<c125b5de>] > > >>> cfq_get_queue+0x33e/0x550 > > >>> Sep 25 21:36:48 shambhala kernel: *pde = 00000000 > > >>> Sep 25 21:36:48 shambhala kernel: Oops: 0000 [#1] PREEMPT > > > > > > [...] > > > > > >>> exited with preempt_count 1 > > >>> > > >>> Maybe the USB system has an powermanagement related issue that in > > >>> kernel suspend triggers more easily than userspace software > > >>> suspend? I didn't see this one before. But well, this is a bug I > > >>> will report in > > >>> bugzilla.kernel.org. > > >>> > > >>> Ciao, > > >> > > >> Okay. Can you try without the last patch, and confirm that it's > > >> reliable (albeit slower) then? > > > > > > Well as I told already (later on) the USB problem seems to be > > > related to my testing of systemd in Debian - I do not get how this > > > could be the case, but it works, when I remove init=/bin/systemd. > > > I think I will report this as a kernel bug nonetheless, cause when > > > I apply that "userspace shouldn't be able to let the kernel oops" > > > paradigm then its a kernel bug. > > > > > > Should I test without the readahead patch regarding whether that > > > hang after resume, prior to activating Radeon KMS framebuffer is > > > fixed as well? > > > > Yes, please. I only whipped up the readahead patch last night. Since > > it's single threaded, there's a lot less to go wrong compared to > > TuxOnIce, but it's still possible that there's some bug I haven't > > noticed yet. It would be good to be able to see it's definitely that > > patch. > > > > > I am a bit unsure on what to test next. My current thoughts are: > > > > > > - Test whether 2.6.35.5 is stable with unpatched in kernel suspend. > > > Currently in progress. Cause before that I only know its stable > > > with userspace software suspend. > > > > > > - Apply your patches on top of 2.6.35.5 and test that. > > > > > > - If that works, it appears to be a problem introduced by > > > 2.6.36-rc5 > > > > > > - Then I'd possibly test 2.6.36-rc5 with unpatched in kernel > > > suspend. > > > > > > - If that doesn't work, it appears to be an issue with your > > > patches > > > > > > - Then I test without readahead patch. > > > > > > Tell me when you have any different suggestions. > > > > > > Ciao, > > > > I'd start with 2.6.36-rc5 unpatched, and only fall back to 2.6.35 if > > that's unreliable. > > Done so. 2.6.36-rc5-tp42-vmembase-0-dirty without your pm patches is > stable. So these hangs on resume seem somehow related to your patches > or to the debug output that you left in. > > Now compiling > > martin@shambhala:~/Computer/Shambhala/Kernel/2.6.36/tuxonice-head> git > branch -v > * (no branch) 3394a84 Hibernation: Remove now-empty routines. > [...] > > Which is one commit before > > for-rafael 14c1209 Hibernate: Implement readahead when resuming > > I also enabled CONFIG_PM_DEBUG and CONFIG_PM_ADVANCED_DEBUG. > > Tell me, when I instead should try that suspend-next stuff. ;) Done six cycles, so far so good: shambhala:~> grep "PM: Image.*at " /var/log/syslog Sep 28 11:32:06 shambhala kernel: PM: Image written at 63 MB/s. Sep 28 11:32:06 shambhala kernel: PM: Image read at 32 MB/s. Sep 28 11:35:00 shambhala kernel: PM: Image written at 65 MB/s. Sep 28 11:35:00 shambhala kernel: PM: Image read at 32 MB/s. Sep 28 11:38:43 shambhala kernel: PM: Image written at 65 MB/s. Sep 28 11:38:43 shambhala kernel: PM: Image read at 33 MB/s. Sep 28 11:41:15 shambhala kernel: PM: Image written at 66 MB/s. Sep 28 11:41:15 shambhala kernel: PM: Image read at 33 MB/s. Sep 28 11:42:57 shambhala kernel: PM: Image written at 66 MB/s. Sep 28 11:42:57 shambhala kernel: PM: Image read at 34 MB/s. Sep 28 11:45:16 shambhala kernel: PM: Image written at 66 MB/s. Sep 28 11:45:16 shambhala kernel: PM: Image read at 34 MB/s. I think I will keep 2.6.36-rc5-tp42-hiber-wri-accel-vmembase-0-00133- g3394a84-dirty (for-rafael minus readahead patch) at least till tomorrow for some longer term testing. Read obviously is not sped up so far. You get my Tested-By line when that kernel works till tomorrow or so ;). What next to test? suspend-next? Last attempts with (currently not reproducably stably working) TuxOnIce: Sep 9 01:04:22 shambhala kernel: TuxOnIce debugging info: Sep 9 01:04:22 shambhala kernel: - TuxOnIce core : 3.2-rc2 Sep 9 01:04:22 shambhala kernel: - Kernel Version : 2.6.36-rc3-tp42- toi-3.2-rc1-vmembase-0-05032-g60140c1-dirty Sep 9 01:04:22 shambhala kernel: - Compiler vers. : 4.4 Sep 9 01:04:22 shambhala kernel: - Attempt number : 1 Sep 9 01:04:22 shambhala kernel: - Parameters : 0 667656 0 1 900 0 Sep 9 01:04:22 shambhala kernel: - Overall expected compression percentage: 50. Sep 9 01:04:22 shambhala kernel: - Checksum method is 'md4'. Sep 9 01:04:22 shambhala kernel: 0 pages resaved in atomic copy. Sep 9 01:04:22 shambhala kernel: - Compressor is 'lzo'. Sep 9 01:04:22 shambhala kernel: Compressed 923176960 bytes into 393309240 (57 percent compression). Sep 9 01:04:22 shambhala kernel: - Block I/O active. Sep 9 01:04:22 shambhala kernel: Used 96698 pages from swap on /dev/sda2. Sep 9 01:04:22 shambhala kernel: - Max outstanding reads 1620. Max writes 9. Sep 9 01:04:22 shambhala kernel: Memory_needed: 1024 x (4096 + 220 + 76) = 4497408 bytes. Sep 9 01:04:22 shambhala kernel: Free mem throttle point reached 0. Sep 9 01:04:22 shambhala kernel: - Swap Allocator enabled. Sep 9 01:04:22 shambhala kernel: Swap available for image: 1234586 pages. Sep 9 01:04:22 shambhala kernel: - File Allocator active. Sep 9 01:04:22 shambhala kernel: Storage available for image: 0 pages. Sep 9 01:04:22 shambhala kernel: - I/O speed: Write 50 MB/s, Read 98 MB/s. Sep 9 01:04:22 shambhala kernel: - Extra pages : 11320 used/20000. Sep 9 01:04:22 shambhala kernel: - Result : Succeeded. shambhala:~> zgrep "I/O speed" /var/log/syslog.3.gz Sep 9 01:04:22 shambhala kernel: - I/O speed: Write 50 MB/s, Read 98 MB/s. Sep 9 10:22:58 shambhala kernel: - I/O speed: Write 49 MB/s, Read 98 MB/s. Sep 10 10:21:20 shambhala kernel: - I/O speed: Write 47 MB/s, Read 99 MB/s. Sep 11 10:19:52 shambhala kernel: - I/O speed: Write 49 MB/s, Read 100 MB/s. Sep 11 16:54:56 shambhala kernel: - I/O speed: Write 46 MB/s, Read 93 MB/s. So in-kernel-suspend actually seems to write faster, but TuxOnIce had lzo compressor active. For compression it would be nice to have two stats: Reading / writing compressed as well as uncompressed data? Cause 50 MiB/s compressed data might equal or even exceed 66 MiB/s uncompressed data. Thanks, -- Martin 'Helios' Steigerwald - http://www.Lichtvoll.de GPG: 03B0 0D6C 0040 0710 4AFA B82F 991B EAAC A599 84C7
Attachment:
signature.asc
Description: This is a digitally signed message part.
_______________________________________________ linux-pm mailing list linux-pm@xxxxxxxxxxxxxxxxxxxxxxxxxx https://lists.linux-foundation.org/mailman/listinfo/linux-pm