New test on kernel Linux version 5.1.0-050100-generic. Same problem,
system crash after a few seconds.
Full kern.log: https://paste.ee/p/EmLsw
I can do access to my pc through SSH if useful.
starosta
Dne 3.5.2019 v 17:37 Joerg Roedel napsal(a):
On Mon, Apr 29, 2019 at 11:48:47AM +0200, Johan Hovold wrote:
So this is a debian 4.18 kernel seemingly crashing due to a xhci or
iommu issue.
Can you reproduce this on a mainline kernel?
If so, please post the corresponding logs to the lists and CC the xhci
and iommu maintainers (added to CC).
Your kernel is probably missing this upstream fix:
4e50ce03976f iommu/amd: fix sg->dma_address for sg->offset bigger than PAGE_SIZE
Regards,
Joerg
May 6 08:35:39 test-ubnt kernel: [ 0.000000] Linux version 5.1.0-050100-generic (kernel@gloin) (gcc version 8.3.0 (Ubuntu 8.3.0-7ubuntu1)) #201905052130 SMP Mon May 6 01:32:59 UTC 2019
May 6 08:35:39 test-ubnt kernel: [ 0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-5.1.0-050100-generic root=UUID=91232716-4b81-4e1d-9d58-c2ad4d090e93 ro
...
May 6 08:37:31 test-ubnt kernel: [ 123.968163] usbserial: USB Serial deregistering driver FTDI USB Serial Device
May 6 08:37:31 test-ubnt kernel: [ 123.968416] ftdi_sio ttyUSB0: FTDI USB Serial Device converter now disconnected from ttyUSB0
May 6 08:37:31 test-ubnt kernel: [ 123.968438] usbcore: deregistering interface driver ftdi_sio
May 6 08:37:31 test-ubnt kernel: [ 123.968504] ftdi_sio 1-9:1.0: device disconnected
May 6 08:37:36 test-ubnt kernel: [ 129.425748] xhci_hcd 0000:01:00.0: WARN Set TR Deq Ptr cmd failed due to incorrect slot or ep state.
May 6 08:37:36 test-ubnt kernel: [ 129.597852] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xffa02000 flags=0x0020]
May 6 08:37:36 test-ubnt kernel: [ 129.600253] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xffa26000 flags=0x0020]
May 6 08:37:37 test-ubnt kernel: [ 129.985262] xhci_hcd 0000:01:00.0: WARN Set TR Deq Ptr cmd failed due to incorrect slot or ep state.
May 6 08:37:37 test-ubnt kernel: [ 130.157935] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xffb8a000 flags=0x0020]
May 6 08:37:37 test-ubnt kernel: [ 130.159739] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xff9f8000 flags=0x0020]
May 6 08:37:37 test-ubnt kernel: [ 130.544683] xhci_hcd 0000:01:00.0: WARN Set TR Deq Ptr cmd failed due to incorrect slot or ep state.
May 6 08:37:37 test-ubnt kernel: [ 130.716992] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xfea16000 flags=0x0020]
May 6 08:37:37 test-ubnt kernel: [ 130.719369] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xffbfb000 flags=0x0020]
May 6 08:37:38 test-ubnt kernel: [ 131.104350] xhci_hcd 0000:01:00.0: WARN Set TR Deq Ptr cmd failed due to incorrect slot or ep state.
May 6 08:37:38 test-ubnt kernel: [ 131.279479] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xffa3c000 flags=0x0020]
May 6 08:37:38 test-ubnt kernel: [ 131.664426] xhci_hcd 0000:01:00.0: WARN Set TR Deq Ptr cmd failed due to incorrect slot or ep state.
May 6 08:37:38 test-ubnt kernel: [ 131.837112] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xffa32000 flags=0x0020]
May 6 08:37:38 test-ubnt kernel: [ 131.839540] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xffc5d000 flags=0x0020]
May 6 08:37:39 test-ubnt kernel: [ 132.223953] xhci_hcd 0000:01:00.0: WARN Set TR Deq Ptr cmd failed due to incorrect slot or ep state.
May 6 08:37:39 test-ubnt kernel: [ 132.397215] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xffc46000 flags=0x0020]
May 6 08:37:39 test-ubnt kernel: [ 132.398608] amd_iommu_report_page_fault: 5 callbacks suppressed
...
May 6 08:38:07 test-ubnt kernel: [ 160.289771] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xfa532000 flags=0x0020]
May 6 08:38:07 test-ubnt kernel: [ 160.292232] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xfa53d000 flags=0x0020]
May 6 08:38:07 test-ubnt kernel: [ 160.677243] xhci_hcd 0000:01:00.0: WARN Set TR Deq Ptr cmd failed due to incorrect slot or ep state.
May 6 08:38:07 test-ubnt kernel: [ 160.849877] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xfa532000 flags=0x0020]
May 6 08:38:07 test-ubnt kernel: [ 160.852303] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xffc70000 flags=0x0020]
May 6 08:38:08 test-ubnt kernel: [ 161.237342] xhci_hcd 0000:01:00.0: WARN Set TR Deq Ptr cmd failed due to incorrect slot or ep state.
May 6 08:38:08 test-ubnt kernel: [ 161.412400] xhci_hcd 0000:01:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0000 address=0xfe8f0000 flags=0x0020]
May 6 08:38:08 test-ubnt kernel: [ 161.562930] general protection fault: 0000 [#1] SMP NOPTI
May 6 08:38:08 test-ubnt kernel: [ 161.563709] CPU: 0 PID: 1632 Comm: readua Not tainted 5.1.0-050100-generic #201905052130
May 6 08:38:08 test-ubnt kernel: [ 161.564493] Hardware name: System manufacturer System Product Name/PRIME A320M-K, BIOS 4602 03/07/2019
May 6 08:38:08 test-ubnt kernel: [ 161.565323] RIP: 0010:__kmalloc+0xa5/0x220
May 6 08:38:08 test-ubnt kernel: [ 161.566116] Code: 65 49 8b 50 08 65 4c 03 05 80 c5 78 79 4d 8b 38 4d 85 ff 0f 84 2e 01 00 00 41 8b 59 20 49 8b 39 48 8d 4a 01 4c 89 f8 4c 01 fb <48> 33 1b 49 33 99 38 01 00 00 65 48 0f c7 0f 0f 94 c0 84 c0 74 bd
May 6 08:38:08 test-ubnt kernel: [ 161.567913] RSP: 0018:ffffa8614237bcd0 EFLAGS: 00010202
May 6 08:38:08 test-ubnt kernel: [ 161.568840] RAX: 534d1cbf998a8f86 RBX: 534d1cbf998a8f86 RCX: 00000000000062cb
May 6 08:38:08 test-ubnt kernel: [ 161.569801] RDX: 00000000000062ca RSI: 0000000000000cc0 RDI: 0000000000028040
May 6 08:38:08 test-ubnt kernel: [ 161.570779] RBP: ffffa8614237bd00 R08: ffff9cfa9ae28040 R09: ffff9cfa9a807c00
May 6 08:38:08 test-ubnt kernel: [ 161.571778] R10: 0000000000000009 R11: ffff9cfa36fcc100 R12: 0000000000000cc0
May 6 08:38:08 test-ubnt kernel: [ 161.572796] R13: 0000000000000002 R14: ffff9cfa9a807c00 R15: 534d1cbf998a8f86
May 6 08:38:08 test-ubnt kernel: [ 161.573825] FS: 0000000000000000(0000) GS:ffff9cfa9ae00000(0063) knlGS:00000000f7d01700
May 6 08:38:08 test-ubnt kernel: [ 161.574882] CS: 0010 DS: 002b ES: 002b CR0: 0000000080050033
May 6 08:38:08 test-ubnt kernel: [ 161.575907] CR2: 00000000ff82280c CR3: 00000000b59ec000 CR4: 00000000003406f0
May 6 08:38:08 test-ubnt kernel: [ 161.576966] Call Trace:
May 6 08:38:08 test-ubnt kernel: [ 161.578028] ? proc_do_submiturb+0xaf1/0xc70
May 6 08:38:08 test-ubnt kernel: [ 161.579109] proc_do_submiturb+0xaf1/0xc70
May 6 08:38:08 test-ubnt kernel: [ 161.580202] proc_submiturb_compat+0x81/0xb0
May 6 08:38:08 test-ubnt kernel: [ 161.581304] usbdev_do_ioctl+0x930/0xd70
May 6 08:38:08 test-ubnt kernel: [ 161.582415] ? _copy_from_user+0x3e/0x60
May 6 08:38:08 test-ubnt kernel: [ 161.583534] usbdev_compat_ioctl+0x10/0x20
May 6 08:38:08 test-ubnt kernel: [ 161.584622] __ia32_compat_sys_ioctl+0x13d/0x230
May 6 08:38:08 test-ubnt kernel: [ 161.585722] do_fast_syscall_32+0x9b/0x210
May 6 08:38:08 test-ubnt kernel: [ 161.586831] entry_SYSCALL_compat_after_hwframe+0x45/0x4d
May 6 08:38:08 test-ubnt kernel: [ 161.587962] RIP: 0023:0xf7f56929
May 6 08:38:08 test-ubnt kernel: [ 161.589096] Code: 85 d2 74 02 89 0a 5b 5d c3 8b 04 24 c3 8b 14 24 c3 8b 34 24 c3 8b 3c 24 c3 90 90 90 90 90 90 90 90 51 52 55 89 cd 0f 05 cd 80 <5d> 5a 59 c3 90 90 90 90 8d b4 26 00 00 00 00 8d b4 26 00 00 00 00
May 6 08:38:08 test-ubnt kernel: [ 161.591579] RSP: 002b:00000000ff822588 EFLAGS: 00000202 ORIG_RAX: 0000000000000036
May 6 08:38:08 test-ubnt kernel: [ 161.592871] RAX: ffffffffffffffda RBX: 0000000000000009 RCX: 00000000802c550a
May 6 08:38:08 test-ubnt kernel: [ 161.594185] RDX: 0000000057b89bd0 RSI: 0000000000000033 RDI: 0000000000000002
May 6 08:38:08 test-ubnt kernel: [ 161.595508] RBP: 00000000ff8225e8 R08: 0000000000000000 R09: 0000000000000000
May 6 08:38:08 test-ubnt kernel: [ 161.596797] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
May 6 08:38:08 test-ubnt kernel: [ 161.598089] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
May 6 08:38:08 test-ubnt kernel: [ 161.599382] Modules linked in: nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua edac_mce_amd ccp kvm irqbypass crct10dif_pclmul crc32_pclmul snd_hda_codec_realtek ghash_clmulni_intel snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_hda_codec k10temp wmi_bmof snd_hda_core joydev eeepc_wmi asus_wmi snd_hwdep sparse_keymap snd_pcm input_leds snd_timer snd soundcore usbserial mac_hid sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbhid hid amdgpu aesni_intel chash gpu_sched amd_iommu_v2 i2c_algo_bit ttm aes_x86_64 crypto_simd cryptd glue_helper drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops i2c_piix4 drm r8169 ahci realtek libahci wmi video gpio_amdpt gpio_generic [last unloaded: ftdi_sio]
May 6 08:38:08 test-ubnt kernel: [ 161.610416] ---[ end trace e32a9be49aaeee88 ]---
May 6 08:38:08 test-ubnt kernel: [ 161.612558] RIP: 0010:__kmalloc+0xa5/0x220
May 6 08:38:08 test-ubnt kernel: [ 161.614259] Code: 65 49 8b 50 08 65 4c 03 05 80 c5 78 79 4d 8b 38 4d 85 ff 0f 84 2e 01 00 00 41 8b 59 20 49 8b 39 48 8d 4a 01 4c 89 f8 4c 01 fb <48> 33 1b 49 33 99 38 01 00 00 65 48 0f c7 0f 0f 94 c0 84 c0 74 bd
May 6 08:38:08 test-ubnt kernel: [ 161.617879] RSP: 0018:ffffa8614237bcd0 EFLAGS: 00010202
May 6 08:38:08 test-ubnt kernel: [ 161.619761] RAX: 534d1cbf998a8f86 RBX: 534d1cbf998a8f86 RCX: 00000000000062cb
May 6 08:38:08 test-ubnt kernel: [ 161.621665] RDX: 00000000000062ca RSI: 0000000000000cc0 RDI: 0000000000028040
May 6 08:38:08 test-ubnt kernel: [ 161.623586] RBP: ffffa8614237bd00 R08: ffff9cfa9ae28040 R09: ffff9cfa9a807c00
May 6 08:38:08 test-ubnt kernel: [ 161.625513] R10: 0000000000000009 R11: ffff9cfa36fcc100 R12: 0000000000000cc0
May 6 08:38:08 test-ubnt kernel: [ 161.627475] R13: 0000000000000002 R14: ffff9cfa9a807c00 R15: 534d1cbf998a8f86
May 6 08:38:08 test-ubnt kernel: [ 161.629383] FS: 0000000000000000(0000) GS:ffff9cfa9ae00000(0063) knlGS:00000000f7d01700
May 6 08:38:08 test-ubnt kernel: [ 161.631335] CS: 0010 DS: 002b ES: 002b CR0: 0000000080050033
May 6 08:38:08 test-ubnt kernel: [ 161.633289] CR2: 00000000ff82280c CR3: 00000000b59ec000 CR4: 00000000003406f0