Hi everyone, I have some updates. I left the system idle most of the time during the weekend and from time to time I played a video on youtube and turned off the screen. Yesterday night I did the same and today morning I checked the system and it got hung up during the night. This time it took a lot longer to hang, but I think it was related to a Flash animation add that was only present on the youtube page the last time I switched off the screen. The amdgpu always seem to hang when that flash animation is present, from all the crash attempts I have made. There is a memory leak according to kmemleak which I attach along with the crash dmesg log. The kernel and patches are the same as on my previous email. I ended up not changing either the mesa version, nor the kernel version and patches. Regards, LuÃs On Fri, Feb 2, 2018 at 6:46 PM, LuÃs Mendes <luis.p.mendes at gmail.com> wrote: > Hi Christian, Alexander, > > I have enabled kmemleak, but memleak didn't detect anything special, > in fact this time, I don't know why, I didn't get any allocation > failure at all, but the GPU did hang after around 4h 6m of uptime with > Xorg. > The log can be found in attachment. I will try again to see if the > allocation failure reappears, or if it has become less apparent due to > kmemleak scans. > > The kernel stack trace is similar to the GPU hangs I was getting on > earlier kernel versions with Kodi, or Firefox when watching videos > with either one, but if I left Xorg idle, it would remain up and > available without hanging for more than one day. > This stack trace also looks quite similar to what Daniel Andersson > reported in "[BUG] Intermittent hang/deadlock when opening browser tab > with Vega gpu", looks like another demonstration of the same bug on > different architectures. > > Regards, > LuÃs > > On Fri, Feb 2, 2018 at 7:48 AM, Christian König > <ckoenig.leichtzumerken at gmail.com> wrote: >> Hi Luis, >> >> please enable kmemleak in your build and watch out for any suspicious >> messages in the system log. >> >> Regards, >> Christian. >> >> >> Am 02.02.2018 um 00:03 schrieb LuÃs Mendes: >>> >>> Hi Alexander, >>> >>> I didn't notice improvements on this issue with that particular patch >>> applied. It still ends up failing to allocate kernel memory after a >>> few hours of uptime with Xorg. >>> >>> I will try to upgrade to mesa 18.0.0-rc3 and to amd-staging-drm-next >>> head, to see if the issue still occurs with those versions. >>> >>> If you have additional suggestions I'll be happy to try them. >>> >>> Regards, >>> LuÃs Mendes >>> >>> On Thu, Feb 1, 2018 at 2:30 AM, Alex Deucher <alexdeucher at gmail.com> >>> wrote: >>>> >>>> On Wed, Jan 31, 2018 at 6:57 PM, LuÃs Mendes <luis.p.mendes at gmail.com> >>>> wrote: >>>>> >>>>> Hi everyone, >>>>> >>>>> I am getting a new issue with amdgpu with RX460, that is, now I can >>>>> play any videos with Kodi or play web videos with firefox and run >>>>> OpenGL applications without running into any issues, however after >>>>> some uptime with XOrg even when almost inactive I get a kmalloc >>>>> allocation failure, normally followed by a GPU hang a while after the >>>>> the allocation failure. >>>>> I had a terminal window under Ubuntu Mate 17.10 and I was compiling >>>>> code when I got the kernel messages that can be found in attachment. >>>>> >>>>> I am using the kernel as identified on my previous email, which can be >>>>> found below. >>>> >>>> does this patch help? >>>> https://patchwork.freedesktop.org/patch/198258/ >>>> >>>> Alex >>>> >>>>> Regards, >>>>> LuÃs Mendes >>>>> >>>>> On Wed, Jan 31, 2018 at 12:47 PM, LuÃs Mendes <luis.p.mendes at gmail.com> >>>>> wrote: >>>>>> >>>>>> Hi Alexander, >>>>>> >>>>>> I've cherry picked the patch you pointed out into kernel from >>>>>> amd-drm-next-4.17-wip at commit >>>>>> 9ab2894122275a6d636bb2654a157e88a0f7b9e2 ( drm/amdgpu: set >>>>>> DRIVER_ATOMIC flag early) and tested it on ARMv7l and the problem has >>>>>> gone indeed. >>>>>> >>>>>> >>>>>> Working great on ARMv7l with AMD RX460. >>>>>> >>>>>> Thanks, >>>>>> LuÃs Mendes >>>>>> >>>>>> >>>>>> On Tue, Jan 30, 2018 at 6:44 PM, Deucher, Alexander >>>>>> <Alexander.Deucher at amd.com> wrote: >>>>>>> >>>>>>> Fixed with this patch: >>>>>>> >>>>>>> >>>>>>> https://lists.freedesktop.org/archives/amd-gfx/2018-January/018472.html >>>>>>> >>>>>>> >>>>>>> Alex >>>>> >>>>> <> >>>>>>> >>>>>>> __________________ >>>>> >>>>> _______________________________________________ >>>>> amd-gfx mailing list >>>>> amd-gfx at lists.freedesktop.org >>>>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx >>>>> >>> _______________________________________________ >>> amd-gfx mailing list >>> amd-gfx at lists.freedesktop.org >>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx >> >> -------------- next part -------------- ubuntu at linux:~$ sudo cat /sys/kernel/debug/kmemleak [sudo] password for ubuntu: unreferenced object 0xb0fac380 (size 128): comm "Xorg", pid 3750, jiffies 5608934 (age 178088.970s) hex dump (first 32 bytes): 00 4e 9f b9 00 f0 33 bb 80 1a 15 97 00 00 00 00 .N....3......... fa 00 00 00 82 01 00 00 80 00 00 00 80 00 00 00 ................ backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<85099e84>] dm_drm_plane_duplicate_state+0x30/0x60 [amdgpu] [<5c55b4e7>] drm_atomic_get_plane_state+0x74/0x118 [drm] [<03e85711>] drm_atomic_add_affected_planes+0x84/0xb0 [drm] [<d9340120>] drm_atomic_helper_check_modeset+0x4d4/0xb04 [drm_kms_helper] [<3c94e005>] amdgpu_dm_atomic_check+0x44/0xc6c [amdgpu] [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm] [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm] [<8fac31c8>] drm_atomic_helper_set_config+0x9c/0xac [drm_kms_helper] [<a9956a39>] __drm_mode_set_config_internal+0x60/0xe4 [drm] [<617b0b52>] drm_mode_setcrtc+0x3f4/0x598 [drm] [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm] [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm] [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu] [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc [<95adff3a>] SyS_ioctl+0x3c/0x60 unreferenced object 0xa44c5800 (size 1024): comm "Xorg", pid 3750, jiffies 5608934 (age 178088.970s) hex dump (first 32 bytes): 00 70 4c a4 40 05 00 00 00 00 00 00 00 04 00 00 .pL. at ........... 00 00 00 00 18 00 00 00 88 00 00 00 26 03 00 00 ............&... backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<a591e85d>] dc_create_stream_for_sink+0x30/0x15c [amdgpu] [<87d2cda1>] create_stream_for_sink+0x50/0x4b8 [amdgpu] [<4e0dde84>] dm_update_crtcs_state+0x120/0x36c [amdgpu] [<0fc8d7c2>] amdgpu_dm_atomic_check+0x290/0xc6c [amdgpu] [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm] [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm] [<8fac31c8>] drm_atomic_helper_set_config+0x9c/0xac [drm_kms_helper] [<a9956a39>] __drm_mode_set_config_internal+0x60/0xe4 [drm] [<617b0b52>] drm_mode_setcrtc+0x3f4/0x598 [drm] [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm] [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm] [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu] [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc [<95adff3a>] SyS_ioctl+0x3c/0x60 [<4cde0ae2>] ret_fast_syscall+0x0/0x54 unreferenced object 0xb0fac080 (size 128): comm "Xorg", pid 3750, jiffies 5608935 (age 178088.960s) hex dump (first 32 bytes): 00 f0 33 bb 01 00 00 00 ff ff ff ff 08 00 08 00 ..3............. 90 c0 fa b0 90 c0 fa b0 ff ff ff ff 0a 00 0a 00 ................ backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper] [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper] [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu] [<d422bfea>] drm_atomic_commit+0x54/0x60 [drm] [<8fac31c8>] drm_atomic_helper_set_config+0x9c/0xac [drm_kms_helper] [<a9956a39>] __drm_mode_set_config_internal+0x60/0xe4 [drm] [<617b0b52>] drm_mode_setcrtc+0x3f4/0x598 [drm] [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm] [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm] [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu] [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc [<95adff3a>] SyS_ioctl+0x3c/0x60 [<4cde0ae2>] ret_fast_syscall+0x0/0x54 [<1738b98c>] 0xffffffff unreferenced object 0xba604680 (size 128): comm "Xorg", pid 3750, jiffies 5608936 (age 178088.960s) hex dump (first 32 bytes): 00 f0 33 bb 01 00 00 00 ff ff ff ff 04 00 04 00 ..3............. 90 46 60 ba 90 46 60 ba ff ff ff ff 06 00 06 00 .F`..F`......... backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper] [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper] [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu] [<d422bfea>] drm_atomic_commit+0x54/0x60 [drm] [<2b9bc4a3>] drm_atomic_connector_commit_dpms+0xec/0xfc [drm] [<9b7fdbd7>] drm_mode_obj_set_property_ioctl+0x1a8/0x318 [drm] [<be4c5272>] drm_mode_connector_property_set_ioctl+0x4c/0x68 [drm] [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm] [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm] [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu] [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc [<95adff3a>] SyS_ioctl+0x3c/0x60 [<4cde0ae2>] ret_fast_syscall+0x0/0x54 [<1738b98c>] 0xffffffff unreferenced object 0xb948fe00 (size 512): comm "Xorg", pid 3750, jiffies 5608937 (age 178088.950s) hex dump (first 32 bytes): 00 f0 33 bb 01 01 01 00 21 00 00 00 02 00 00 00 ..3.....!....... 02 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<70e004db>] dm_crtc_duplicate_state+0x38/0x84 [amdgpu] [<b13e4c2f>] drm_atomic_get_crtc_state+0x78/0x10c [drm] [<1c6f2b44>] page_flip_common+0x28/0xcc [drm_kms_helper] [<cc45921a>] drm_atomic_helper_page_flip+0x50/0xac [drm_kms_helper] [<c18a310c>] drm_mode_page_flip_ioctl+0x490/0x4f4 [drm] [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm] [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm] [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu] [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc [<95adff3a>] SyS_ioctl+0x3c/0x60 [<4cde0ae2>] ret_fast_syscall+0x0/0x54 [<1738b98c>] 0xffffffff unreferenced object 0xb0face00 (size 128): comm "Xorg", pid 3750, jiffies 5608937 (age 178088.950s) hex dump (first 32 bytes): 00 7a 81 b9 00 f0 33 bb 40 28 84 be 00 00 00 00 .z....3.@(...... 00 00 00 00 00 00 00 00 00 04 00 00 00 03 00 00 ................ backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<85099e84>] dm_drm_plane_duplicate_state+0x30/0x60 [amdgpu] [<5c55b4e7>] drm_atomic_get_plane_state+0x74/0x118 [drm] [<2802153b>] page_flip_common+0x50/0xcc [drm_kms_helper] [<cc45921a>] drm_atomic_helper_page_flip+0x50/0xac [drm_kms_helper] [<c18a310c>] drm_mode_page_flip_ioctl+0x490/0x4f4 [drm] [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm] [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm] [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu] [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc [<95adff3a>] SyS_ioctl+0x3c/0x60 [<4cde0ae2>] ret_fast_syscall+0x0/0x54 [<1738b98c>] 0xffffffff unreferenced object 0xb0facf80 (size 128): comm "Xorg", pid 3750, jiffies 5608937 (age 178088.950s) hex dump (first 32 bytes): 00 f0 33 bb 01 00 00 00 ff ff ff ff 05 00 05 00 ..3............. 90 cf fa b0 90 cf fa b0 ff ff ff ff 06 00 06 00 ................ backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper] [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper] [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu] [<270dc6b5>] drm_atomic_nonblocking_commit+0x54/0x58 [drm] [<2e425aac>] drm_atomic_helper_page_flip+0x9c/0xac [drm_kms_helper] [<c18a310c>] drm_mode_page_flip_ioctl+0x490/0x4f4 [drm] [<31f247ae>] drm_ioctl_kernel+0x68/0xb4 [drm] [<4d074688>] drm_ioctl+0x2cc/0x3b0 [drm] [<83459b01>] amdgpu_drm_ioctl+0x10/0x14 [amdgpu] [<99bb30d0>] do_vfs_ioctl+0xb8/0x8cc [<95adff3a>] SyS_ioctl+0x3c/0x60 [<4cde0ae2>] ret_fast_syscall+0x0/0x54 [<1738b98c>] 0xffffffff unreferenced object 0x9bdcd800 (size 1024): comm "kworker/0:0", pid 5643, jiffies 5609303 (age 178085.640s) hex dump (first 32 bytes): 04 00 00 00 00 01 00 00 00 ff ff ff ff ff ff 00 ................ 2e 83 54 21 34 00 00 00 29 15 01 03 80 30 1b 78 ..T!4...)....0.x backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<f1541e77>] dc_sink_create+0x2c/0x88 [amdgpu] [<b58321e8>] dc_link_detect+0x1f0/0x994 [amdgpu] [<1e160950>] handle_hpd_irq+0x40/0x90 [amdgpu] [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu] [<4bf849a6>] process_one_work+0x204/0x510 [<37e481ef>] worker_thread+0x5c/0x5f0 [<292a33bf>] kthread+0x164/0x16c [<c1559dd0>] ret_from_fork+0x14/0x2c [<1738b98c>] 0xffffffff unreferenced object 0x933a6480 (size 64): comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.440s) hex dump (first 32 bytes): 01 00 00 00 00 a4 81 b9 01 00 00 00 00 c6 56 ba ..............V. 80 27 f4 9b 03 00 00 00 80 62 3a 93 00 00 00 00 .'.......b:..... backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<1a912412>] dm_atomic_state_alloc+0x2c/0x60 [amdgpu] [<1305fb86>] drm_atomic_state_alloc+0x24/0x78 [drm] [<b39c3463>] dm_restore_drm_connector_state+0x6c/0x15c [amdgpu] [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu] [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu] [<4bf849a6>] process_one_work+0x204/0x510 [<37e481ef>] worker_thread+0x5c/0x5f0 [<292a33bf>] kthread+0x164/0x16c [<c1559dd0>] ret_from_fork+0x14/0x2c [<1738b98c>] 0xffffffff unreferenced object 0x9bf42780 (size 128): comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.440s) hex dump (first 32 bytes): 00 f0 33 bb 00 fe 48 b9 00 fe 48 b9 00 fe 71 b1 ..3...H...H...q. 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ backtrace: [<bac96619>] __kmalloc+0x1a0/0x284 [<6315c347>] drm_atomic_state_init+0x50/0xcc [drm] [<b3875b9a>] dm_atomic_state_alloc+0x40/0x60 [amdgpu] [<1305fb86>] drm_atomic_state_alloc+0x24/0x78 [drm] [<b39c3463>] dm_restore_drm_connector_state+0x6c/0x15c [amdgpu] [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu] [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu] [<4bf849a6>] process_one_work+0x204/0x510 [<37e481ef>] worker_thread+0x5c/0x5f0 [<292a33bf>] kthread+0x164/0x16c [<c1559dd0>] ret_from_fork+0x14/0x2c [<1738b98c>] 0xffffffff unreferenced object 0xba56c600 (size 192): comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.440s) hex dump (first 32 bytes): 00 7a 81 b9 00 ce fa b0 00 ce fa b0 00 28 f4 9b .z...........(.. 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ backtrace: [<bac96619>] __kmalloc+0x1a0/0x284 [<81bcb9b0>] drm_atomic_state_init+0x78/0xcc [drm] [<b3875b9a>] dm_atomic_state_alloc+0x40/0x60 [amdgpu] [<1305fb86>] drm_atomic_state_alloc+0x24/0x78 [drm] [<b39c3463>] dm_restore_drm_connector_state+0x6c/0x15c [amdgpu] [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu] [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu] [<4bf849a6>] process_one_work+0x204/0x510 [<37e481ef>] worker_thread+0x5c/0x5f0 [<292a33bf>] kthread+0x164/0x16c [<c1559dd0>] ret_from_fork+0x14/0x2c [<1738b98c>] 0xffffffff unreferenced object 0x933a6280 (size 64): comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.780s) hex dump (first 32 bytes): 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ 00 50 8c b9 80 c2 fa b0 80 c2 fa b0 80 20 f4 9b .P........... .. backtrace: [<b51d17be>] __kmalloc_track_caller+0x1a0/0x284 [<bfb2e13f>] krealloc+0x54/0xc0 [<b6fa0df1>] drm_atomic_get_connector_state+0x134/0x180 [drm] [<1d297503>] dm_restore_drm_connector_state+0x90/0x15c [amdgpu] [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu] [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu] [<4bf849a6>] process_one_work+0x204/0x510 [<37e481ef>] worker_thread+0x5c/0x5f0 [<292a33bf>] kthread+0x164/0x16c [<c1559dd0>] ret_from_fork+0x14/0x2c [<1738b98c>] 0xffffffff unreferenced object 0xb772e000 (size 8192): comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.780s) hex dump (first 32 bytes): 00 c4 dc 9b 00 00 00 00 00 00 00 00 00 00 00 00 ................ 00 00 00 00 00 00 00 00 00 00 00 00 03 00 00 00 ................ backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<ecab1fe2>] dc_create_state+0x28/0x3c [amdgpu] [<3f29ed80>] amdgpu_dm_atomic_check+0xe0/0xc6c [amdgpu] [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm] [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm] [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu] [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu] [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu] [<4bf849a6>] process_one_work+0x204/0x510 [<37e481ef>] worker_thread+0x5c/0x5f0 [<292a33bf>] kthread+0x164/0x16c [<c1559dd0>] ret_from_fork+0x14/0x2c [<1738b98c>] 0xffffffff unreferenced object 0x9bdcc400 (size 1024): comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.790s) hex dump (first 32 bytes): 00 d8 dc 9b 40 05 00 00 00 00 00 00 00 04 00 00 .... at ........... 00 00 00 00 18 00 00 00 88 00 00 00 26 03 00 00 ............&... backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<a591e85d>] dc_create_stream_for_sink+0x30/0x15c [amdgpu] [<87d2cda1>] create_stream_for_sink+0x50/0x4b8 [amdgpu] [<4e0dde84>] dm_update_crtcs_state+0x120/0x36c [amdgpu] [<0fc8d7c2>] amdgpu_dm_atomic_check+0x290/0xc6c [amdgpu] [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm] [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm] [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu] [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu] [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu] [<4bf849a6>] process_one_work+0x204/0x510 [<37e481ef>] worker_thread+0x5c/0x5f0 [<292a33bf>] kthread+0x164/0x16c [<c1559dd0>] ret_from_fork+0x14/0x2c [<1738b98c>] 0xffffffff unreferenced object 0x9bf30000 (size 24632): comm "kworker/0:0", pid 5643, jiffies 5609323 (age 178085.790s) hex dump (first 32 bytes): 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ backtrace: [<8ff04208>] kmalloc_order+0x4c/0x54 [<24677274>] kmalloc_order_trace+0x24/0xc8 [<b006d2b2>] dc_create_transfer_func+0x20/0x30 [amdgpu] [<a9439ec9>] create_stream_for_sink+0xc0/0x4b8 [amdgpu] [<4e0dde84>] dm_update_crtcs_state+0x120/0x36c [amdgpu] [<0fc8d7c2>] amdgpu_dm_atomic_check+0x290/0xc6c [amdgpu] [<4699f226>] drm_atomic_check_only+0x3bc/0x5c4 [drm] [<36cb27b1>] drm_atomic_commit+0x18/0x60 [drm] [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu] [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu] [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu] [<4bf849a6>] process_one_work+0x204/0x510 [<37e481ef>] worker_thread+0x5c/0x5f0 [<292a33bf>] kthread+0x164/0x16c [<c1559dd0>] ret_from_fork+0x14/0x2c [<1738b98c>] 0xffffffff unreferenced object 0xba604f80 (size 128): comm "kworker/0:0", pid 5643, jiffies 5609324 (age 178086.120s) hex dump (first 32 bytes): 00 f0 33 bb 01 00 00 00 ff ff ff ff 08 00 08 00 ..3............. 90 4f 60 ba 90 4f 60 ba ff ff ff ff 0a 00 0a 00 .O`..O`......... backtrace: [<400a53a4>] kmem_cache_alloc_trace+0x180/0x24c [<024c2a79>] drm_atomic_helper_setup_commit+0x1d0/0x4d4 [drm_kms_helper] [<b2a1265a>] drm_atomic_helper_commit+0x44/0x12c [drm_kms_helper] [<a8a36ef6>] amdgpu_dm_atomic_commit+0xc0/0xc8 [amdgpu] [<d422bfea>] drm_atomic_commit+0x54/0x60 [drm] [<ce70d259>] dm_restore_drm_connector_state+0xd8/0x15c [amdgpu] [<02786958>] handle_hpd_irq+0x70/0x90 [amdgpu] [<0ebe63e6>] dm_irq_work_func+0x68/0x78 [amdgpu] [<4bf849a6>] process_one_work+0x204/0x510 [<37e481ef>] worker_thread+0x5c/0x5f0 [<292a33bf>] kthread+0x164/0x16c [<c1559dd0>] ret_from_fork+0x14/0x2c [<1738b98c>] 0xffffffff -------------- next part -------------- Feb 4 23:36:30 linux kernel: [188528.461609] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, last signaled seq=2186227, last emitted seq=2186230 Feb 4 23:36:30 linux kernel: [188528.472965] [drm] IP block:gmc_v8_0 is hung! Feb 4 23:36:30 linux kernel: [188528.472971] [drm] IP block:gfx_v8_0 is hung! Feb 4 23:36:30 linux kernel: [188528.473019] [drm] GPU recovery disabled. Feb 4 23:52:11 linux kernel: [189469.863152] INFO: task amdgpu_cs:0:3799 blocked for more than 120 seconds. Feb 4 23:52:11 linux kernel: [189469.870134] Not tainted 4.15.0-rc8-next2g-g9ab2894-dirty #3 Feb 4 23:52:11 linux kernel: [189469.876354] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Feb 4 23:52:11 linux kernel: [189469.884304] amdgpu_cs:0 D 0 3799 3088 0x00000000 Feb 4 23:52:11 linux kernel: [189469.884309] Backtrace: Feb 4 23:52:11 linux kernel: [189469.884320] [<80b571c8>] (__schedule) from [<80b578cc>] (schedule+0x44/0xa4) Feb 4 23:52:11 linux kernel: [189469.884325] r10:600c0013 r9:b6108000 r8:b6109bd4 r7:00000000 r6:7fffffff r5:81004c48 Feb 4 23:52:11 linux kernel: [189469.884327] r4:ffffe000 Feb 4 23:52:11 linux kernel: [189469.884333] [<80b57888>] (schedule) from [<80b5b4f0>] (schedule_timeout+0x1e0/0x2e8) Feb 4 23:52:11 linux kernel: [189469.884336] r5:81004c48 r4:7fffffff Feb 4 23:52:11 linux kernel: [189469.884344] [<80b5b310>] (schedule_timeout) from [<8065df3c>] (dma_fence_default_wait+0x218/0x2b0) Feb 4 23:52:11 linux kernel: [189469.884348] r10:600c0013 r9:b6108000 r8:b6109bd4 r7:00000000 r6:7fffffff r5:81004c48 Feb 4 23:52:11 linux kernel: [189469.884350] r4:94953e80 Feb 4 23:52:11 linux kernel: [189469.884355] [<8065dd24>] (dma_fence_default_wait) from [<8065d6b4>] (dma_fence_wait_timeout+0x48/0x15c) Feb 4 23:52:11 linux kernel: [189469.884360] r10:ba77b000 r9:b9820000 r8:00000001 r7:91374968 r6:00000000 r5:94953e80 Feb 4 23:52:11 linux kernel: [189469.884361] r4:81096c18 Feb 4 23:52:11 linux kernel: [189469.884566] [<8065d66c>] (dma_fence_wait_timeout) from [<7f1b5bc8>] (amdgpu_ctx_wait_prev_fence+0x48/0x80 [amdgpu]) Feb 4 23:52:11 linux kernel: [189469.884570] r7:91374968 r6:00000001 r5:b6bc60c0 r4:00000001 Feb 4 23:52:11 linux kernel: [189469.884829] [<7f1b5b80>] (amdgpu_ctx_wait_prev_fence [amdgpu]) from [<7f19e780>] (amdgpu_cs_ioctl+0x428/0x1edc [amdgpu]) Feb 4 23:52:11 linux kernel: [189469.884832] r5:b6bc60c0 r4:00000001 Feb 4 23:52:11 linux kernel: [189469.884995] [<7f19e358>] (amdgpu_cs_ioctl [amdgpu]) from [<7f045920>] (drm_ioctl_kernel+0x68/0xb4 [drm]) Feb 4 23:52:11 linux kernel: [189469.884999] r10:00000018 r9:b6109e2c r8:7f19e358 r7:00000021 r6:00000000 r5:b981a400 Feb 4 23:52:11 linux kernel: [189469.885001] r4:ba6ca240 Feb 4 23:52:11 linux kernel: [189469.885041] [<7f0458b8>] (drm_ioctl_kernel [drm]) from [<7f045dec>] (drm_ioctl+0x2cc/0x3b0 [drm]) Feb 4 23:52:11 linux kernel: [189469.885045] r9:00000044 r8:c0186444 r7:ba6ca240 r6:7f19e358 r5:7f2fcba4 r4:81004c48 Feb 4 23:52:11 linux kernel: [189469.885193] [<7f045b20>] (drm_ioctl [drm]) from [<7f180010>] (amdgpu_drm_ioctl+0x10/0x14 [amdgpu]) Feb 4 23:52:11 linux kernel: [189469.885197] r10:b9b28510 r9:b6108000 r8:732c5ac0 r7:0000000c r6:b6426480 r5:732c5ac0 Feb 4 23:52:11 linux kernel: [189469.885199] r4:81004c48 Feb 4 23:52:11 linux kernel: [189469.885324] [<7f180000>] (amdgpu_drm_ioctl [amdgpu]) from [<8028e4b4>] (do_vfs_ioctl+0xb8/0x8cc) Feb 4 23:52:11 linux kernel: [189469.885330] [<8028e3fc>] (do_vfs_ioctl) from [<8028ed04>] (SyS_ioctl+0x3c/0x60) Feb 4 23:52:11 linux kernel: [189469.885334] r10:00000000 r9:b6108000 r8:732c5ac0 r7:c0186444 r6:0000000c r5:b6426480 Feb 4 23:52:11 linux kernel: [189469.885336] r4:b6426481 Feb 4 23:52:11 linux kernel: [189469.885343] [<8028ecc8>] (SyS_ioctl) from [<80108f00>] (ret_fast_syscall+0x0/0x54) Feb 4 23:52:11 linux kernel: [189469.885347] r9:b6108000 r8:801090e4 r7:00000036 r6:c0186444 r5:732c5ac0 r4:c0006400 ubuntu at linux:~$ uptime 12:36:43 up 2 days, 17:22, 3 users, load average: 1.02, 1.03, 1.00