[Bug 107432] Periodic complete system lockup with Vega M and Kernel 4.18-rc6+

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Bug ID 107432
Summary Periodic complete system lockup with Vega M and Kernel 4.18-rc6+
Product DRI
Version unspecified
Hardware x86-64 (AMD64)
OS Linux (All)
Status NEW
Severity normal
Priority medium
Component DRM/AMDgpu
Assignee dri-devel@lists.freedesktop.org
Reporter rstrube@gmail.com

Created attachment 140902 [details]
System log leading up to hard crash

Description:

Periodically my system will begin to slow down dramatically (the mouse cursor
hitches as I try to move it) and I am unable to interact with anything on the
screen.  Eventually the mouse cursor disappears altogether.  Trying to switch
to a tty I do get prompted to login, but after entering my credentials nothing
happens.  It appears to be a hard lockup.  The only solution is to manually
power down my machine and reboot.

This probably happens one or two times a day, normally after starting a new
application.

Hardware:
Dell XPS 15 9575 2 in 1 (Kaby Lake G)

Versions:
Kernel 4.18-rc7
Mesa 18.1.5
Xorg 1.19.6
uCode for Vega M from Linux Firmware git (master) which includes the latest
18.20 uCode from AMD that was recently merged into Linux Firmware

I do have the two sinks available (one for the Intel iGPU and one for the AMD
Vega M), running:

xrandr --listproviders

Lists the following:

Providers: number : 2
Provider 0: id: 0x6f cap: 0x9, Source Output, Sink Offload crtcs: 3 outputs: 7
associated providers: 1 name:modesetting
Provider 1: id: 0x45 cap: 0x6, Sink Output, Source Offload crtcs: 6 outputs: 0
associated providers: 1 name:Unknown AMD Radeon GPU @ pci:0000:01:00.0

And running:

env DRI_PRIME=1 glxinfo | grep "OpenGL renderer"

Lists:

OpenGL renderer string: AMD VEGAM (DRM 3.26.0, 4.18.0-041800rc7-generic, LLVM
6.0.0)

So the Vega M is active and available in my system.

I noticed that this problem started happening after the release of kernel
4.18-rc6 and continues with 4.18-rc7. I've been using 4.18 since rc1 without
issue.  This entry in the changelog caught my eye:

Leo Liu (1):
      drm/amdgpu: Make sure IB tests flushed after IP resume

Not sure if this is at all related, but the reason I bring this up is because 
the errors I see in my logs everytime I encounter this problem are:

kernel: amdgpu 0000:01:00.0: GPU pci config reset
kernel: [drm:amdgpu_device_ip_suspend [amdgpu]] *ERROR* suspend of IP block
<uvd_v6_0> failed -12

Please note that so far I have only encountered this problem when launching
applications that use my Intel iGPU (i.e. I am not setting DRI_PRIME=1).

I've attached my entire log to provide more context.

Thanks!


You are receiving this mail because:
_______________________________________________
dri-devel mailing list
dri-devel@xxxxxxxxxxxxxxxxxxxxx
https://lists.freedesktop.org/mailman/listinfo/dri-devel

[Index of Archives]     [Linux DRI Users]     [Linux Intel Graphics]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [XFree86]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Linux Kernel]     [Linux SCSI]     [XFree86]
  Powered by Linux