https://bugzilla.kernel.org/show_bug.cgi?id=77181

Bug ID: 77181
Summary: radeon -- GPU lockup when hibernating or waking up
Product: Drivers
Version: 2.5
Kernel Version: 3.15.0-rc8
Hardware: x86-64
OS: Linux
Tree: Mainline
Status: NEW
Severity: normal
Priority: P1
Component: Video(DRI - non Intel)
Assignee: drivers_video-dri@xxxxxxxxxxxxxxxxxxxx
Reporter: grawity@xxxxxxxxx
Regression: No

With 3.15-rc* kernels (including -rc8 built today), hibernation (suspend to
disk) doesn't work properly; the system hangs for ~10 seconds and I get a "GPU
lockup" message before it continues. The same happens both when hibernating,
and when resuming from hibernation. (Suspend-to-RAM, however, works perfectly.)

In the last few attempts, during resume it recovered from the lockup, but
remained unstable afterwards (see second "GPU softreset" log at the bottom).

The same happens both with Xorg running, and without.

Computer: ASUS K52JT.206 laptop, using UEFI.

01:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Robson CE [Radeon HD 6370M/7370M] [1002:68e4]

dmesg after resuming from hibernate:

[ 235.036435] IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
[ 235.609112] PM: Hibernation mode set to 'platform'
[ 235.765779] wlan0: deauthenticating from 24:a4:3c:ae:df:83 by local choice (Reason: 3=DEAUTH_LEAVING)
[ 235.773778] cfg80211: Calling CRDA to update world regulatory domain
[ 235.773990] IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
[ 235.776029] cfg80211: World regulatory domain updated:
[ 235.776034] cfg80211: [snip]
[ 235.817885] IPv6: ADDRCONF(NETDEV_UP): wlan0: link is not ready
[ 235.883507] PM: Syncing filesystems ... done.
[ 237.132171] Freezing user space processes ... (elapsed 0.023 seconds) done.
[ 237.155971] PM: Marking nosave pages: [mem 0x000a0000-0x000fffff]
[ 237.155974] PM: Marking nosave pages: [mem 0x029a9000-0x029a9fff]
[ 237.155975] PM: Marking nosave pages: [mem 0x029b8000-0x029b9fff]
[ 237.155976] PM: Marking nosave pages: [mem 0x029c3000-0x029c3fff]
[ 237.155978] PM: Marking nosave pages: [mem 0xbed06000-0xbed91fff]
[ 237.155980] PM: Marking nosave pages: [mem 0xbed95000-0xbed9bfff]
[ 237.155981] PM: Marking nosave pages: [mem 0xbed9d000-0xbee02fff]
[ 237.155984] PM: Marking nosave pages: [mem 0xbf800000-0xffffffff]
[ 237.156474] PM: Basic memory bitmaps created
[ 237.156517] PM: Preallocating image memory... done (allocated 362140 pages)
[ 237.269398] PM: Allocated 1448560 kbytes in 0.11 seconds (13168.72 MB/s)
[ 237.269400] Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
[ 237.270653] Suspending console(s) (use no_console_suspend to debug)
[ 237.274839] pciehp 0000:00:1c.1:pcie04: slot(1): Link Up event
[ 237.283819] pciehp 0000:00:1c.5:pcie04: slot(5): Link Up event
[ 237.377141] pciehp 0000:00:1c.1:pcie04: Device 0000:03:00.0 already exists at 0000:03:00, cannot hot-add
[ 237.377145] pciehp 0000:00:1c.1:pcie04: Cannot add device at 0000:03:00
[ 237.387169] pciehp 0000:00:1c.5:pcie04: Device 0000:05:00.0 already exists at 0000:05:00, cannot hot-add
[ 237.387172] pciehp 0000:00:1c.5:pcie04: Cannot add device at 0000:05:00
[ 237.802212] PM: freeze of devices complete after 531.509 msecs
[ 237.802648] PM: late freeze of devices complete after 0.431 msecs
[ 237.803625] PM: noirq freeze of devices complete after 0.974 msecs
[ 237.803969] ACPI: Preparing to enter system sleep state S4
[ 237.805209] PM: Saving platform NVS memory
[ 237.805540] Disabling non-boot CPUs ...
[ 237.806878] kvm: disabling virtualization on CPU1
[ 237.907118] smpboot: CPU 1 is now offline
[ 237.908730] kvm: disabling virtualization on CPU2
[ 238.010458] smpboot: CPU 2 is now offline
[ 238.012035] kvm: disabling virtualization on CPU3
[ 238.113805] smpboot: CPU 3 is now offline
[ 238.114503] PM: Creating hibernation image:
[ 238.224525] PM: Need to copy 377652 pages
[ 238.224530] PM: Normal pages needed: 377652 + 1024, available pages: 635664
[ 238.115167] PM: Restoring platform NVS memory
[ 238.115648] microcode: CPU0 sig=0x20655, pf=0x10, revision=0x4
[ 238.115662] Enabling non-boot CPUs ...
[ 238.115712] x86: Booting SMP configuration:
[ 238.115712] smpboot: Booting Node 0 Processor 1 APIC 0x4
[ 238.126823] kvm: enabling virtualization on CPU1
[ 238.129214] microcode: CPU1 sig=0x20655, pf=0x10, revision=0x2
[ 238.129417] microcode: CPU1 updated to revision 0x4, date = 2013-06-28
[ 238.129422] CPU1 is up
[ 238.129447] smpboot: Booting Node 0 Processor 2 APIC 0x1
[ 238.140626] kvm: enabling virtualization on CPU2
[ 238.143072] microcode: CPU2 sig=0x20655, pf=0x10, revision=0x2
[ 238.143249] microcode: CPU2 updated to revision 0x4, date = 2013-06-28
[ 238.143255] CPU2 is up
[ 238.143276] smpboot: Booting Node 0 Processor 3 APIC 0x5
[ 238.154445] kvm: enabling virtualization on CPU3
[ 238.156881] microcode: CPU3 sig=0x20655, pf=0x10, revision=0x4
[ 238.156888] CPU3 is up
[ 238.159235] ACPI: Waking up from system sleep state S4
[ 238.572267] ACPI: \_SB_.SLPB: ACPI_NOTIFY_DEVICE_WAKE event
[ 238.585241] PM: noirq restore of devices complete after 11.564 msecs
[ 238.585438] PM: early restore of devices complete after 0.165 msecs
[ 238.643160] mei_me 0000:00:16.0: irq 47 for MSI/MSI-X
[ 238.643162] usb usb1: root hub lost power or was reset
[ 238.647082] ehci-pci 0000:00:1a.0: cache line size of 64 is not supported
[ 238.647203] usb usb2: root hub lost power or was reset
[ 238.651122] ehci-pci 0000:00:1d.0: cache line size of 64 is not supported
[ 238.651363] snd_hda_intel 0000:00:1b.0: irq 48 for MSI/MSI-X
[ 238.651681] snd_hda_intel 0000:01:00.1: irq 49 for MSI/MSI-X
[ 238.653131] sd 0:0:0:0: [sda] Starting disk
[ 238.658947] switching from power state:
[ 238.658949] ui class: none
[ 238.658951] internal class: boot
[ 238.658952] caps: video
[ 238.658953] uvd vclk: 0 dclk: 0
[ 238.658954] power level 0 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
[ 238.658956] power level 1 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
[ 238.658957] power level 2 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
[ 238.658958] status: c b
[ 238.658958] switching to power state:
[ 238.658958] ui class: performance
[ 238.658960] internal class: none
[ 238.658961] caps: single_disp video
[ 238.658963] uvd vclk: 0 dclk: 0
[ 238.658963] power level 0 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
[ 238.658964] power level 1 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
[ 238.658965] power level 2 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
[ 238.658966] status: r
[ 238.664760] [drm] PCIE GART of 1024M enabled (table at 0x000000000025D000).
[ 238.664880] radeon 0000:01:00.0: WB enabled
[ 238.664883] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000040000c00 and cpu addr 0xffff880001912c00
[ 238.664884] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000040000c0c and cpu addr 0xffff880001912c0c
[ 238.666037] radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x000000000005c418 and cpu addr 0xffffc90010d9c418
[ 238.682922] [drm] ring test on 0 succeeded in 1 usecs
[ 238.682981] [drm] ring test on 3 succeeded in 1 usecs
[ 238.748484] pciehp 0000:00:1c.1:pcie04: Device 0000:03:00.0 already exists at 0000:03:00, cannot hot-add
[ 238.748487] pciehp 0000:00:1c.1:pcie04: Cannot add device at 0000:03:00
[ 238.748525] pciehp 0000:00:1c.5:pcie04: Device 0000:05:00.0 already exists at 0000:05:00, cannot hot-add
[ 238.748529] pciehp 0000:00:1c.5:pcie04: Cannot add device at 0000:05:00
[ 238.860143] [drm] ring test on 5 succeeded in 1 usecs
[ 238.860148] [drm] UVD initialized successfully.
[ 238.860183] [drm] ib test on ring 0 succeeded in 0 usecs
[ 238.860216] [drm] ib test on ring 3 succeeded in 1 usecs
[ 238.965196] usb 2-1: reset high-speed USB device number 2 using ehci-pci
[ 238.978495] ata6: SATA link down (SStatus 0 SControl 300)
[ 238.985171] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 238.985207] ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[ 238.986567] ata1.00: ACPI cmd f5/00:00:00:00:00:a0 (SECURITY FREEZE LOCK) filtered out
[ 238.989278] ata2.00: ACPI cmd ef/10:06:00:00:00:a0 (SET FEATURES) succeeded
[ 238.990012] ata2.00: ACPI cmd ef/90:03:00:00:00:a0 (SET FEATURES) succeeded
[ 238.991760] ata5: SATA link down (SStatus 0 SControl 300)
[ 238.992884] ata2.00: ACPI cmd ef/10:06:00:00:00:a0 (SET FEATURES) succeeded
[ 238.993618] ata2.00: ACPI cmd ef/90:03:00:00:00:a0 (SET FEATURES) succeeded
[ 238.994024] ata2.00: configured for UDMA/100
[ 238.995663] ata1.00: ACPI cmd ef/10:06:00:00:00:a0 (SET FEATURES) succeeded
[ 239.005635] ata1.00: ACPI cmd ef/90:03:00:00:00:a0 (SET FEATURES) succeeded
[ 239.089888] ata1.00: ACPI cmd f5/00:00:00:00:00:a0 (SECURITY FREEZE LOCK) filtered out
[ 239.099627] ata1.00: ACPI cmd ef/10:06:00:00:00:a0 (SET FEATURES) succeeded
[ 239.110256] ata1.00: ACPI cmd ef/90:03:00:00:00:a0 (SET FEATURES) succeeded
[ 239.140436] ata1.00: configured for UDMA/133
[ 249.009247] radeon 0000:01:00.0: ring 5 stalled for more than 10000msec
[ 249.009250] radeon 0000:01:00.0: GPU lockup (waiting for 0x0000000000000004 last fence id 0x0000000000000002 on ring 5)
[ 249.009252] [drm:uvd_v1_0_ib_test] *ERROR* radeon: fence wait failed (-35).
[ 249.009256] [drm:radeon_ib_ring_tests] *ERROR* radeon: failed testing IB on ring 5 (-35).
[ 250.435952] PM: restore of devices complete after 11792.082 msecs
[ 250.436675] PM: Image restored successfully.
[ 250.436698] PM: Basic memory bitmaps freed
[ 250.436699] Restarting tasks ... done.
[ 250.439326] video LNXVIDEO:00: Restoring backlight state
[ 250.456916] jme 0000:05:00.5: irq 50 for MSI/MSI-X
[ 250.457017] jme 0000:05:00.5 eth0: Link is up at ANed: 100 Mbps, Full-Duplex, MDI
[ 250.458065] jme 0000:05:00.5 eth0: Link is down
[ 250.479766] jme 0000:05:00.5 eth0: Link is down
[ 250.479834] IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
[ 250.494725] IPv6: ADDRCONF(NETDEV_UP): wlan0: link is not ready
[ 250.546164] usb 1-1: new high-speed USB device number 3 using ehci-pci
[ 250.670484] hub 1-1:1.0: USB hub found
[ 250.670647] hub 1-1:1.0: 6 ports detected
[ 250.686380] usb 1-1: USB disconnect, device number 3
[ 251.847086] wlan0: authenticate with 24:a4:3c:ae:df:83
[ 251.853958] wlan0: send auth to 24:a4:3c:ae:df:83 (try 1/3)
[ 251.861038] wlan0: authenticated
[ 251.862738] wlan0: associate with 24:a4:3c:ae:df:83 (try 1/3)
[ 251.867237] wlan0: RX AssocResp from 24:a4:3c:ae:df:83 (capab=0x431 status=0 aid=2)
[ 251.867306] wlan0: associated
[ 251.867346] IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
[ 252.248171] jme 0000:05:00.5 eth0: Link is up at ANed: 100 Mbps, Full-Duplex, MDI
[ 252.248519] IPv6: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready

dmesg when it reset itself a bit later:

Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: ring 5 stalled for more than 281610msec
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GPU lockup (waiting for 0x0000000000000005 last fence id 0x0000000000000004 on ring 5)
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: Saved 23 dwords of commands on ring 0.
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GPU softreset: 0x00000009
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GRBM_STATUS = 0xA7702828
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GRBM_STATUS_SE0 = 0x7C000005
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GRBM_STATUS_SE1 = 0x00000007
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: SRBM_STATUS = 0x200800C0
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: SRBM_STATUS2 = 0x00000000
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_008674_CP_STALLED_STAT1 = 0x00000000
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_008678_CP_STALLED_STAT2 = 0x00010800
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_00867C_CP_BUSY_STAT = 0x00028004
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_008680_CP_STAT = 0x80038647
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GRBM_SOFT_RESET=0x00007F6B
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GRBM_STATUS = 0x00003828
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GRBM_STATUS_SE0 = 0x00000007
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GRBM_STATUS_SE1 = 0x00000007
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: SRBM_STATUS = 0x200800C0
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: SRBM_STATUS2 = 0x00000000
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_008674_CP_STALLED_STAT1 = 0x00000000
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_008678_CP_STALLED_STAT2 = 0x00000000
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_00867C_CP_BUSY_STAT = 0x00000000
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_008680_CP_STAT = 0x00000000
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: GPU reset succeeded, trying to resume
Jun 02 10:15:54 rain kernel: switching from power state:
Jun 02 10:15:54 rain kernel: ui class: none
Jun 02 10:15:54 rain kernel: internal class: boot
Jun 02 10:15:54 rain kernel: caps: video
Jun 02 10:15:54 rain kernel: uvd vclk: 0 dclk: 0
Jun 02 10:15:54 rain kernel: power level 0 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: power level 1 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: power level 2 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: status: c b
Jun 02 10:15:54 rain kernel: switching to power state:
Jun 02 10:15:54 rain kernel: ui class: performance
Jun 02 10:15:54 rain kernel: internal class: none
Jun 02 10:15:54 rain kernel: caps: single_disp video
Jun 02 10:15:54 rain kernel: uvd vclk: 0 dclk: 0
Jun 02 10:15:54 rain kernel: power level 0 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: power level 1 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: power level 2 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: status: r
Jun 02 10:15:54 rain kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000000025D000).
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: WB enabled
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000040000c00 and cpu addr 0xffff880001912c00
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000040000c0c and cpu addr 0xffff880001912c0c
Jun 02 10:15:54 rain kernel: radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x000000000005c418 and cpu addr 0xffffc90010d9c418
Jun 02 10:15:54 rain kernel: [drm] ring test on 0 succeeded in 1 usecs
Jun 02 10:15:54 rain kernel: [drm] ring test on 3 succeeded in 1 usecs
Jun 02 10:15:54 rain kernel: [drm] ring test on 5 succeeded in 1 usecs
Jun 02 10:15:54 rain kernel: [drm] UVD initialized successfully.
Jun 02 10:15:54 rain kernel: [drm] ib test on ring 0 succeeded in 0 usecs
Jun 02 10:15:54 rain kernel: [drm] ib test on ring 3 succeeded in 0 usecs
Jun 02 10:15:54 rain kernel: [drm:uvd_v1_0_ib_test] *ERROR* radeon: failed to get create msg (-22).
Jun 02 10:15:54 rain kernel: [drm:radeon_ib_ring_tests] *ERROR* radeon: failed testing IB on ring 5 (-22).
Jun 02 10:15:54 rain kernel: [drm:radeon_pm_resume_dpm] *ERROR* radeon: dpm resume failed
Jun 02 10:15:54 rain kernel: switching from power state:
Jun 02 10:15:54 rain kernel: ui class: none
Jun 02 10:15:54 rain kernel: internal class: boot
Jun 02 10:15:54 rain kernel: caps: video
Jun 02 10:15:54 rain kernel: uvd vclk: 0 dclk: 0
Jun 02 10:15:54 rain kernel: power level 0 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: power level 1 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: power level 2 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: status: c b
Jun 02 10:15:54 rain kernel: switching to power state:
Jun 02 10:15:54 rain kernel: ui class: performance
Jun 02 10:15:54 rain kernel: internal class: none
Jun 02 10:15:54 rain kernel: caps: single_disp video
Jun 02 10:15:54 rain kernel: uvd vclk: 0 dclk: 0
Jun 02 10:15:54 rain kernel: power level 0 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: power level 1 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: power level 2 sclk: 75000 mclk: 80000 vddc: 1100 vddci: 0
Jun 02 10:15:54 rain kernel: status: r