amd powerplay firmware load oops.


 



	Good morning amd-gfx,

	I'd like to share the following oops, which doesn't seem to be 
straightforward, in order to learn what the underlying issue is. The output is 
the following:

Jan 16 06:35:30 bee kernel: [   27.534407] amdgpu: [powerplay] smu not 
running, upload firmware again 
Jan 16 06:35:30 bee kernel: [   27.535717] BUG: unable to handle kernel paging 
request at ffffc91c00430fec
Jan 16 06:35:30 bee kernel: [   27.535757] IP: 
smu7_populate_single_firmware_entry.isra.3+0x68/0xc0 [amdgpu]
Jan 16 06:35:30 bee kernel: [   27.535760] PGD 35d10a067 P4D 35d10a067 PUD 0 
Jan 16 06:35:30 bee kernel: [   27.535764] Oops: 0002 [#1] SMP PTI
Jan 16 06:35:30 bee kernel: [   27.535766] Modules linked in: i915(+) arc4 
amdgpu(+) iwlmvm uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 
videobuf2_core videodev mac80211 media x86_pkg_temp_thermal intel_powerclamp 
btusb joydev 
btrtl coretemp mousedev rtsx_pci_ms btbcm kvm_intel i2c_algo_bit memstick 
drm_kms_helper rtsx_pci_sdmmc kvm iwlwifi mmc_core ttm drm wmi_bmof irqbypass 
snd_hda_intel hci_uart crc32c_intel btintel ghash_clmulni_intel snd_hda_codec 
bluetooth cfg80211 intel_cstate snd_hwdep efi_pstore evdev intel_uncore psmouse 
efivars rtsx_pci input_leds ideapad_laptop intel_rapl_perf serio_raw 
snd_hda_core sparse_keymap thermal syscopyarea wmi i2c_i801 snd_pcm 
ecdh_generic video rfkill 
sysfillrect intel_lpss_acpi sysimgblt intel_lpss fan snd_timer fb_sys_fops 
mfd_core i2c_hid button battery ac snd soundcore
Jan 16 06:35:30 bee kernel: [   27.535809]  backlight tpm_crb acpi_pad 
efivarfs unix dm_zero dm_thin_pool dm_persistent_data dm_bio_prison 
dm_service_time dm_round_robin dm_queue_length dm_multipath dm_log_userspace 
cn dm_flakey dm_delay
 xts aesni_intel crypto_simd cryptd glue_helper aes_x86_64 cbc sha256_generic 
scsi_transport_iscsi r8169 mii fuse xfs nfs lockd grace sunrpc fscache ext4 
mbcache jbd2 multipath linear raid10 raid1 raid0 dm_raid raid456 md_mod 
async_raid6_recov async_memcpy async_pq async_xor xor async_tx raid6_pq libcrc32c 
dm_snapshot dm_bufio dm_crypt dm_mirror dm_region_hash dm_log dm_mod dax 
hid_generic usbhid xhci_pci xhci_hcd ohci_hcd uhci_hcd usb_storage ehci_pci 
ehci_hcd usbcore 
usb_common scsi_transport_fc sr_mod cdrom sg sd_mod ata_piix ahci libahci 
sata_sx4 pata_oldpiix
Jan 16 06:35:30 bee kernel: [   27.535855] CPU: 2 PID: 3661 Comm: udevd Not 
tainted 4.14.12+ #2
Jan 16 06:35:30 bee kernel: [   27.535857] Hardware name: LENOVO 80UV/Lenovo 
ideapad 510S-14IKB, BIOS 2SCN21WW(V2.01) 12/20/2016
Jan 16 06:35:30 bee kernel: [   27.535860] task: ffff880358ce9980 task.stack: 
ffffc900005b4000
Jan 16 06:35:30 bee kernel: [   27.535893] RIP: 
0010:smu7_populate_single_firmware_entry.isra.3+0x68/0xc0 [amdgpu]
Jan 16 06:35:30 bee kernel: [   27.535895] RSP: 0018:ffffc900005b7a38 EFLAGS: 
00010246
Jan 16 06:35:30 bee kernel: [   27.535898] RAX: 000000000000007e RBX: 
0000000000000003 RCX: 000001000f542000
Jan 16 06:35:30 bee kernel: [   27.535900] RDX: 0000000000000004 RSI: 
0000000000000000 RDI: ffff88033e96d660
Jan 16 06:35:30 bee kernel: [   27.535902] RBP: ffffc91c00430fec R08: 
0000000000000001 R09: 0000000000000385
Jan 16 06:35:30 bee kernel: [   27.535905] R10: 0000000000000000 R11: 
0000000000000000 R12: ffff8803404f7148
Jan 16 06:35:30 bee kernel: [   27.535907] R13: ffff8803405a4000 R14: 
000000000000047e R15: ffff88033b8b0000
Jan 16 06:35:30 bee kernel: [   27.535909] FS:  00007fd844368780(0000) 
GS:ffff88046ed00000(0000) knlGS:0000000000000000
Jan 16 06:35:30 bee kernel: [   27.535912] CS:  0010 DS: 0000 ES: 0000 CR0: 
0000000080050033
Jan 16 06:35:30 bee kernel: [   27.535914] CR2: ffffc91c00430fec CR3: 
0000000358864001 CR4: 00000000003606e0
Jan 16 06:35:30 bee kernel: [   27.535916] Call Trace:
Jan 16 06:35:30 bee kernel: [   27.535948]  
smu7_request_smu_load_fw+0x91/0x310 [amdgpu]
Jan 16 06:35:30 bee kernel: [   27.535977]  pp_hw_init+0xbb/0x100 [amdgpu]
Jan 16 06:35:30 bee kernel: [   27.536005]  amdgpu_pp_hw_init+0x34/0x80 
[amdgpu]
Jan 16 06:35:30 bee kernel: [   27.536026]  amdgpu_device_init+0xc43/0x14a0 
[amdgpu]
Jan 16 06:35:30 bee kernel: [   27.536030]  ? 
cache_alloc_debugcheck_after.isra.22+0x195/0x1e0
Jan 16 06:35:30 bee kernel: [   27.536032]  ? 
kmem_cache_alloc_trace+0x1f6/0x230
Jan 16 06:35:30 bee kernel: [   27.536051]  ? 
amdgpu_driver_load_kms+0x25/0x1b0 [amdgpu]
Jan 16 06:35:30 bee kernel: [   27.536069]  amdgpu_driver_load_kms+0x4a/0x1b0 
[amdgpu]
Jan 16 06:35:30 bee kernel: [   27.536082]  drm_dev_register+0x12f/0x1c0 [drm]
Jan 16 06:35:30 bee kernel: [   27.536106]  amdgpu_pci_probe+0x10f/0x140 
[amdgpu]
Jan 16 06:35:30 bee kernel: [   27.536109]  pci_device_probe+0xc8/0x140
Jan 16 06:35:30 bee kernel: [   27.536113]  driver_probe_device+0x293/0x440
Jan 16 06:35:30 bee kernel: [   27.536116]  __driver_attach+0xda/0xe0
Jan 16 06:35:30 bee kernel: [   27.536119]  ? driver_probe_device+0x440/0x440
Jan 16 06:35:30 bee kernel: [   27.536121]  bus_for_each_dev+0x5a/0x90
Jan 16 06:35:30 bee kernel: [   27.536123]  bus_add_driver+0x16a/0x260
Jan 16 06:35:30 bee kernel: [   27.536126]  driver_register+0x57/0xc0
Jan 16 06:35:30 bee kernel: [   27.536128]  ? 0xffffffffa0fe1000
Jan 16 06:35:30 bee kernel: [   27.536130]  do_one_initcall+0x4e/0x190
Jan 16 06:35:30 bee kernel: [   27.536133]  ? 
kmem_cache_alloc_trace+0x1f6/0x230
Jan 16 06:35:30 bee kernel: [   27.536135]  ? do_init_module+0x22/0x1fb
Jan 16 06:35:30 bee kernel: [   27.536137]  do_init_module+0x5b/0x1fb
Jan 16 06:35:30 bee kernel: [   27.536139]  load_module+0x1511/0x1740
Jan 16 06:35:30 bee kernel: [   27.536142]  ? SyS_finit_module+0xaa/0xe0
Jan 16 06:35:30 bee kernel: [   27.536144]  SyS_finit_module+0xaa/0xe0
Jan 16 06:35:30 bee kernel: [   27.536146]  do_syscall_64+0x60/0x190
Jan 16 06:35:30 bee kernel: [   27.536149]  
entry_SYSCALL64_slow_path+0x25/0x25
Jan 16 06:35:30 bee kernel: [   27.536151] RIP: 0033:0x7fd843a7f8f9
Jan 16 06:35:30 bee kernel: [   27.536153] RSP: 002b:00007fff82b37498 EFLAGS: 
00000246 ORIG_RAX: 0000000000000139
Jan 16 06:35:30 bee kernel: [   27.536156] RAX: ffffffffffffffda RBX: 
0000559d6b1b7030 RCX: 00007fd843a7f8f9
Jan 16 06:35:30 bee kernel: [   27.536157] RDX: 0000000000000000 RSI: 
00007fd843d4d565 RDI: 000000000000000f
Jan 16 06:35:30 bee kernel: [   27.536159] RBP: 00007fd843d4d565 R08: 
0000000000000000 R09: 00007fff82b37610
Jan 16 06:35:30 bee kernel: [   27.536161] R10: 000000000000000f R11: 
0000000000000246 R12: 0000000000000000
Jan 16 06:35:30 bee kernel: [   27.536163] R13: 0000559d6b1b80b0 R14: 
0000000000020000 R15: 0000559d6ab658e9
Jan 16 06:35:30 bee kernel: [   27.536165] Code: b6 b1 60 db f2 a0 ff d0 85 c0 
74 1a 83 eb 06 31 c0 83 e3 fb 0f 94 c0 66 89 45 18 48 83 c4 30 31 c0 5b 5d 41 
5c c3 0f b7 44 24 02 <66> 89 5d 00 c7 45 0c 00 00 00 00 c7 45 10 00 00 00 00 
66 89 45 
Jan 16 06:35:30 bee kernel: [   27.536214] RIP: 
smu7_populate_single_firmware_entry.isra.3+0x68/0xc0 [amdgpu] RSP: 
ffffc900005b7a38
Jan 16 06:35:30 bee kernel: [   27.536216] CR2: ffffc91c00430fec
Jan 16 06:35:30 bee kernel: [   27.536218] ---[ end trace 195d074775037a67 
]---
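
	As a reading aid for the trace above: the "Oops: 0002" value is the x86 
page-fault error code, and CR2 (ffffc91c00430fec) matches RBP. The marked bytes 
in the Code line, <66> 89 5d 00, decode to a 16-bit store of %bx through %rbp, 
so the fault looks like a kernel write to an unmapped address. The snippet 
below is only a minimal sketch for decoding the error code; the table and the 
name decode_oops_code are illustrative and not part of any kernel tool.

```python
# Decode the x86 page-fault error code printed on the "Oops:" line.
# Each entry: (bit mask, meaning when set, meaning when clear or None).
PF_BITS = [
    (0x01, "protection violation", "page not present"),
    (0x02, "write access", "read access"),
    (0x04, "user mode", "kernel mode"),
    (0x08, "reserved bit set", None),
    (0x10, "instruction fetch", None),
]


def decode_oops_code(code_hex):
    """Return a list of human-readable flags for a hex error code string."""
    code = int(code_hex, 16)
    parts = []
    for mask, if_set, if_clear in PF_BITS:
        if code & mask:
            parts.append(if_set)
        elif if_clear is not None:
            parts.append(if_clear)
    return parts


if __name__ == "__main__":
    # Value taken from the "Oops: 0002 [#1] SMP PTI" line of this report.
    print(decode_oops_code("0002"))
```

	For this report the three decoded flags are "page not present", "write 
access" and "kernel mode", which is consistent with the "unable to handle 
kernel paging request" message.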

	This oops only appears when the laptop is unplugged from the power cord, and 
it has been there since at least 4.13. This trace is from the latest LTS, 4.14, 
with some patches from Alex that prevent the virtualization path from being 
executed on a VGA device that is not Fiji or Tonga.

	The effect of this problem is a black screen on boot, most likely because 
the i915 module is never allowed to take over the display.

	Best regards.

	José.

