On 11/4/20 1:23 PM, Newsmails wrote:
On 11/4/20 1:14 PM, Vlastimil Babka wrote:
On 11/4/20 1:17 AM, Andrew Morton wrote:
(switched to email. Please respond via emailed reply-to-all, not via
the
bugzilla web interface).
On Tue, 03 Nov 2020 20:00:58 +0000
bugzilla-daemon@xxxxxxxxxxxxxxxxxxx wrote:
https://bugzilla.kernel.org/show_bug.cgi?id=210031
Bug ID: 210031
Summary: unable to handle page fault for address - EIP:
khugepaged
Product: Memory Management
Version: 2.5
Kernel Version: 5.9.1
Hardware: All
OS: Linux
Tree: Mainline
Status: NEW
Severity: normal
Priority: P1
Component: Other
Assignee: akpm@xxxxxxxxxxxxxxxxxxxx
Reporter: newsmails@xxxxxxxxxxxxxxx
Regression: No
Thanks. That's a strange looking trace. I'll optimistically cc some
people who have been working in that area lately.
What caused this kernel to be tainted?
laptop Skylake i915 Distribution : slackware 14.2 32 bits
Oct 23 17:38:22 linuxp kernel: [141330.499234] BUG: unable to handle
page fault
for address: 021d202d
Oct 23 17:38:22 linuxp kernel: [141330.499245] #PF: supervisor read
access in
kernel mode
Oct 23 17:38:22 linuxp kernel: [141330.499250] #PF:
error_code(0x0000) -
not-present page
Oct 23 17:38:22 linuxp kernel: [141330.499265] Oops: 0000 [#2] SMP PTI
#2 means this is not the first oops. Do you have the very first?
Yes sorry.
It was a resume too as you will see with the time.
For oct 23 17:38 it is a resume too i think : i think that I hibernated
and i forgot to look at something so i resumed.
It's always 021d202d (3 times) and always where a vma might be accessed (a /proc
file, khugepaged(), acct_collect()) so I would assume a struct vma was corrupted
in the hibernate/resume process.
Could be also firmware related AFAIK and there's I taint flag which means some
buggy firmware workaround is in effect.
Oct 23 13:22:10 linuxp dhcpcd[18199]: dhcpcd not running
Oct 23 15:55:49 linuxp dhcpcd[27045]: dhcpcd not running
Oct 23 15:55:49 linuxp dhcpcd[27053]: dhcpcd not running
Oct 23 15:55:50 linuxp dhcpcd[27061]: dhcpcd not running
Oct 23 15:55:50 linuxp dhcpcd[27067]: dhcpcd not running
Oct 23 17:31:13 linuxp kernel: [140897.356150] iwlwifi 0000:03:00.0:
RF_KILL bit toggled to enable radio.
Oct 23 17:31:16 linuxp kernel: [140900.724013] Bluetooth: hci0:
unexpected event for opcode 0xfc2f
Oct 23 17:31:39 linuxp kernel: [140928.245135] BUG: unable to handle
page fault for address: 021d2001
Oct 23 17:31:39 linuxp kernel: [140928.245147] #PF: supervisor read
access in kernel mode
Oct 23 17:31:39 linuxp kernel: [140928.245152] #PF: error_code(0x0000) -
not-present page
Oct 23 17:31:39 linuxp kernel: [140928.245169] Oops: 0000 [#1] SMP PTI
Oct 23 17:31:39 linuxp kernel: [140928.245179] CPU: 1 PID: 2302 Comm:
Breakpad Server Tainted: G I 5.9.1 #1
Oct 23 17:31:39 linuxp kernel: [140928.245184] Hardware name:
Notebook W65_W67RZ/W65_W67RZ, BIOS 1.05.06
02/22/2016
Oct 23 17:31:39 linuxp kernel: [140928.245197] EIP: m_next+0x1c/0x44
Oct 23 17:31:39 linuxp kernel: [140928.245205] Code: 24 08 d4 e6 c1 e8
1a 77 e2 ff eb d6 cc cc 3e 8d 74 26 00 55 89 e5 57 56 53 8b 40 44 8b 58
0c 39 da 74 24 8b 42 08 85 c0 74 0e <8b> 30 31 ff 89 31 89 79 04 5b 5e
5f 5d c3 be ff ff ff ff 31 ff 85
Oct 23 17:31:39 linuxp kernel: [140928.245212] EAX: 021d2001 EBX:
00000000 ECX: eeb9e56c EDX: eca0f000
Oct 23 17:31:39 linuxp kernel: [140928.245216] ESI: 00000000 EDI:
c128cad0 EBP: e59bff0c ESP: e59bff00
Oct 23 17:31:39 linuxp kernel: [140928.245221] DS: 007b ES: 007b FS:
00d8 GS: 00e0 SS: 0068 EFLAGS: 00010202
Oct 23 17:31:39 linuxp kernel: [140928.245224] CR0: 80050033 CR2:
021d2001 CR3: 2606c000 CR4: 003506f0
Oct 23 17:31:39 linuxp kernel: [140928.245228] DR0: 00000000 DR1:
00000000 DR2: 00000000 DR3: 00000000
Oct 23 17:31:39 linuxp kernel: [140928.245231] DR6: fffe0ff0 DR7: 00000400
Oct 23 17:31:39 linuxp kernel: [140928.245233] Call Trace:
Oct 23 17:31:39 linuxp kernel: [140928.245242] ?
quota_send_warning+0x220/0x220
Oct 23 17:31:39 linuxp kernel: [140928.245248] seq_read+0x2bc/0x3e1
Oct 23 17:31:39 linuxp kernel: [140928.245254] ?
quota_send_warning+0x220/0x220
Oct 23 17:31:39 linuxp kernel: [140928.245260] ? seq_open_private+0x17/0x17
Oct 23 17:31:39 linuxp kernel: [140928.245266] vfs_read+0x85/0x17f
Oct 23 17:31:39 linuxp kernel: [140928.245272] ? mutex_lock+0x10/0x33
Oct 23 17:31:39 linuxp kernel: [140928.245277] ksys_read+0x51/0xb6
Oct 23 17:31:39 linuxp kernel: [140928.245283] __ia32_sys_read+0x15/0x17
Oct 23 17:31:39 linuxp kernel: [140928.245289] do_int80_syscall_32+0x2c/0x39
Oct 23 17:31:39 linuxp kernel: [140928.245295] entry_INT80_32+0xf7/0xf7
Oct 23 17:31:39 linuxp kernel: [140928.245299] EIP: 0xafc787c8
Oct 23 17:31:39 linuxp kernel: [140928.245304] Code: 00 00 c6 47 04 01
8b 47 08 85 c0 75 b6 80 7f 04 00 75 5c 8b 37 ba 00 04 00 00 8d 4c 07 0c
29 c2 b8 03 00 00 00 53 89 f3 cd 80 <5b> 89 c6 3d 01 f0 ff ff 73 32 85
f6 78 37 74 c8 01 77 08 8b 47 08
Oct 23 17:31:39 linuxp kernel: [140928.245308] EAX: ffffffda EBX:
00000040 ECX: a3c194fc EDX: 000003d8
Oct 23 17:31:39 linuxp kernel: [140928.245311] ESI: 00000040 EDI:
a3c194c8 EBP: 995fece8 ESP: 995feccc
Oct 23 17:31:39 linuxp kernel: [140928.245316] DS: 007b ES: 007b FS:
0000 GS: 0033 SS: 007b EFLAGS: 00000216
Oct 23 17:31:39 linuxp kernel: [140928.245320] Modules linked in:
appletalk psnap llc ipv6 fuse uvcvideo videobuf2_vmalloc
videobuf2_memops btusb videobuf2_v4l2 hid_generic btrtl btbcm
videobuf2_common btintel videodev bluetooth mc usbhid hid ecdh_generic
ecc rtsx_pci_sdmmc joydev mmc_core snd_hda_codec_hdmi
snd_hda_codec_realtek snd_hda_codec_generic i2c_dev ledtrig_audio
coretemp i915 hwmon iwlmvm r8169 i2c_algo_bit x86_pkg_temp_thermal
mac80211 drm_kms_helper intel_powerclamp rtsx_pci realtek drm kvm_intel
mdio_devres mfd_core libphy kvm intel_gtt irqbypass crc32_pclmul iwlwifi
snd_hda_intel agpgart psmouse evdev crc32c_intel serio_raw fb_sys_fops
cfg80211 snd_intel_dspcfg snd_hda_codec rfkill snd_hda_core syscopyarea
wmi thermal snd_hwdep battery snd_pcm sysfillrect sysimgblt snd_timer
xhci_pci i2c_i801 button xhci_hcd snd i2c_smbus intel_pch_thermal mei_me
soundcore video mei i2c_core acpi_pad ac loop
Oct 23 17:31:39 linuxp kernel: [140928.245396] CR2: 00000000021d2001
Oct 23 17:31:39 linuxp kernel: [140928.245402] ---[ end trace
c79bfd2669dd9a26 ]---
Oct 23 17:31:39 linuxp kernel: [140928.245408] EIP: m_next+0x1c/0x44
Oct 23 17:31:39 linuxp kernel: [140928.245412] Code: 24 08 d4 e6 c1 e8
1a 77 e2 ff eb d6 cc cc 3e 8d 74 26 00 55 89 e5 57 56 53 8b 40 44 8b 58
0c 39 da 74 24 8b 42 08 85 c0 74 0e <8b> 30 31 ff 89 31 89 79 04 5b 5e
5f 5d c3 be ff ff ff ff 31 ff 85
Oct 23 17:31:39 linuxp kernel: [140928.245417] EAX: 021d2001 EBX:
00000000 ECX: eeb9e56c EDX: eca0f000
Oct 23 17:31:39 linuxp kernel: [140928.245420] ESI: 00000000 EDI:
c128cad0 EBP: e59bff0c ESP: e59bff00
Oct 23 17:31:39 linuxp kernel: [140928.245424] DS: 007b ES: 007b FS:
00d8 GS: 00e0 SS: 0068 EFLAGS: 00010202
Oct 23 17:31:39 linuxp kernel: [140928.245427] CR0: 80050033 CR2:
021d2001 CR3: 2606c000 CR4: 003506f0
Oct 23 17:31:39 linuxp kernel: [140928.245431] DR0: 00000000 DR1:
00000000 DR2: 00000000 DR3: 00000000
Oct 23 17:31:39 linuxp kernel: [140928.245434] DR6: fffe0ff0 DR7: 00000400
Oct 23 17:32:27 linuxp dhcpcd[27294]: dhcpcd not running
Oct 23 17:32:30 linuxp dhcpcd[27305]: dhcpcd not running
Oct 23 17:32:33 linuxp dhcpcd[27314]: dhcpcd not running
Oct 23 17:32:35 linuxp dhcpcd[27325]: dhcpcd not running
Oct 23 17:32:36 linuxp dhcpcd[27331]: dhcpcd not running
Oct 23 17:38:22 linuxp kernel: [141330.499234] BUG: unable to handle
page fault for address: 021d202d
Oct 23 17:38:22 linuxp kernel: [141330.499245] #PF: supervisor read
access in kernel mode
Oct 23 17:38:22 linuxp kernel: [141330.499250] #PF: error_code(0x0000) -
not-present page
Oct 23 17:38:22 linuxp kernel: [141330.499265] Oops: 0000 [#2] SMP PTI
Oct 23 17:38:22 linuxp kernel: [141330.499274] CPU: 0 PID: 37 Comm:
khugepaged Tainted: G D I 5.9.1 #1
Oct 23 17:38:22 linuxp kernel: [141330.499278] Hardware name:
Notebook W65_W67RZ/W65_W67RZ, BIOS 1.05.06
02/22/2016
Oct 23 17:38:22 linuxp kernel: [141330.499289] EIP: khugepaged+0x599/0x2226