Hi,
Hardware: Acer Swift-1 SF114-34-P6U1
SD Host controller: Intel Corporation Device 4dc4 (rev 01) 8086:4dc4
eMMC: Kingston A29128
* A couple of seconds after boot access to the eMMC is completely
blocked. Can be triggered by extensive I/O (e.g, with badblocks -w)
* Tested with stable branch linux-5.14.21 vs. linux-5.15.1
Latest highest version including the bug is a fedora 6.1.6
* Spiked the code with a lot of pr_debug messages. As a side effect the
driver
seems to be much more stable, so to me it appears like a timing issue
where
the driver fails to wait for a specific state change which seems to
appear
less often when slowed down by logging
* Would love to have a decent reference documentation to the SD
Controller
especially a detailed description of the behavior of the register map
in
mmc/host/cqhci.h All intel documents I could find seem to describe
similar but not quite identical register maps
Regards
Sebastian
* dmesg
[ 347.583082] mmc0: cqhci: timeout for tag 3, qcnt 4
[ 347.583086] mmc0: cqhci: ============ CQHCI REGISTER DUMP ===========
[ 347.583114] mmc0: cqhci: Caps: 0x000030c0 | Version: 0x00000510
[ 347.583117] mmc0: cqhci: Config: 0x00001101 | Control: 0x00000000
[ 347.583120] mmc0: cqhci: Int stat: 0x00000000 | Int enab: 0x00000016
[ 347.583123] mmc0: cqhci: Int sig: 0x00000016 | Int Coal: 0x00000000
[ 347.583157] mmc0: cqhci: TDL base: 0x7c2b5000 | TDL up32: 0x00000001
[ 347.583160] mmc0: cqhci: Doorbell: 0x0000000f | TCN: 0x00000000
[ 347.583163] mmc0: cqhci: Dev queue: 0x00000008 | Dev Pend: 0x00000008
[ 347.583166] mmc0: cqhci: Task clr: 0x00000000 | SSC1: 0x00010008
[ 347.583169] mmc0: cqhci: SSC2: 0x00000001 | DCMD rsp: 0x00000800
[ 347.583172] mmc0: cqhci: RED mask: 0xfdf9a080 | TERRI: 0x00000000
[ 347.583175] mmc0: cqhci: Resp idx: 0x0000002f | Resp arg: 0x00000900
[ 347.583176] mmc0: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 347.583182] mmc0: sdhci: Sys addr: 0x00000080 | Version: 0x00001002
[ 347.583185] mmc0: sdhci: Blk size: 0x00000200 | Blk cnt: 0x00000070
[ 347.583188] mmc0: sdhci: Argument: 0x00030000 | Trn mode: 0x00000023
[ 347.583191] mmc0: sdhci: Present: 0x1fff0106 | Host ctl: 0x0000003c
[ 347.583194] mmc0: sdhci: Power: 0x0000000b | Blk gap: 0x00000080
[ 347.583197] mmc0: sdhci: Wake-up: 0x00000000 | Clock: 0x00000007
[ 347.583200] mmc0: sdhci: Timeout: 0x0000000e | Int stat: 0x00000000
[ 347.583203] mmc0: sdhci: Int enab: 0x02ff4000 | Sig enab: 0x02ff4000
[ 347.583206] mmc0: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000000
[ 347.583209] mmc0: sdhci: Caps: 0x546ec881 | Caps_1: 0x80000807
[ 347.583212] mmc0: sdhci: Cmd: 0x00002f3a | Max curr: 0x00000000
[ 347.583214] mmc0: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x00000000
[ 347.583217] mmc0: sdhci: Resp[2]: 0x00000000 | Resp[3]: 0x00000000
[ 347.583219] mmc0: sdhci: Host ctl2: 0x0000000d
[ 347.583223] mmc0: sdhci: ADMA Err: 0x00000000 | ADMA Ptr:
0x000000011eef1218
[ 347.583224] mmc0: sdhci: ============================================
[ 347.583229] mmc0: running CQE recovery
[ 347.690108] mmc0: Reset 0x2 never completed.
[ 347.690109] mmc0: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 347.690115] mmc0: sdhci: Sys addr: 0x00000080 | Version: 0x00001002
[ 347.690118] mmc0: sdhci: Blk size: 0x00000200 | Blk cnt: 0x00000070
[ 347.690121] mmc0: sdhci: Argument: 0x00030000 | Trn mode: 0x00000023
[ 347.690124] mmc0: sdhci: Present: 0x1fff0106 | Host ctl: 0x0000003c
[ 347.690127] mmc0: sdhci: Power: 0x0000000b | Blk gap: 0x00000080
[ 347.690130] mmc0: sdhci: Wake-up: 0x00000000 | Clock: 0x00000007
[ 347.690133] mmc0: sdhci: Timeout: 0x0000000e | Int stat: 0x00000000
[ 347.690136] mmc0: sdhci: Int enab: 0x00ff0003 | Sig enab: 0x00ff0003
[ 347.690139] mmc0: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000000
[ 347.690141] mmc0: sdhci: Caps: 0x546ec881 | Caps_1: 0x80000807
[ 347.690144] mmc0: sdhci: Cmd: 0x00002f3a | Max curr: 0x00000000
[ 347.690147] mmc0: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x00000000
[ 347.690150] mmc0: sdhci: Resp[2]: 0x00000000 | Resp[3]: 0x00000000
[ 347.690152] mmc0: sdhci: Host ctl2: 0x0000000d
[ 347.690156] mmc0: sdhci: ADMA Err: 0x00000000 | ADMA Ptr:
0x000000011eef1218
[ 347.690157] mmc0: sdhci: ============================================
[ 347.790190] mmc0: Reset 0x4 never completed.
[ 347.790191] mmc0: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 347.790193] mmc0: sdhci: Sys addr: 0x00000080 | Version: 0x00001002
[ 347.790196] mmc0: sdhci: Blk size: 0x00000200 | Blk cnt: 0x00000070
[ 347.790199] mmc0: sdhci: Argument: 0x00030000 | Trn mode: 0x00000023
[ 347.790202] mmc0: sdhci: Present: 0x1fff0106 | Host ctl: 0x0000003c
[ 347.790205] mmc0: sdhci: Power: 0x0000000b | Blk gap: 0x00000080
[ 347.790208] mmc0: sdhci: Wake-up: 0x00000000 | Clock: 0x00000007
[ 347.790211] mmc0: sdhci: Timeout: 0x0000000e | Int stat: 0x00000000
[ 347.790213] mmc0: sdhci: Int enab: 0x00ff0003 | Sig enab: 0x00ff0003
[ 347.790216] mmc0: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000000
[ 347.790219] mmc0: sdhci: Caps: 0x546ec881 | Caps_1: 0x80000807
[ 347.790222] mmc0: sdhci: Cmd: 0x00002f3a | Max curr: 0x00000000
[ 347.790225] mmc0: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x00000000
[ 347.790228] mmc0: sdhci: Resp[2]: 0x00000000 | Resp[3]: 0x00000000
[ 347.790230] mmc0: sdhci: Host ctl2: 0x0000000d
[ 347.790234] mmc0: sdhci: ADMA Err: 0x00000000 | ADMA Ptr:
0x000000011eef1218
[ 347.790234] mmc0: sdhci: ============================================
[ 347.803056] mmc0: Controller never released inhibit bit(s).
[ 347.803058] mmc0: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 347.803077] mmc0: sdhci: Sys addr: 0x00000080 | Version: 0x00001002
[ 347.803080] mmc0: sdhci: Blk size: 0x00000200 | Blk cnt: 0x00000070
[ 347.803083] mmc0: sdhci: Argument: 0x00030000 | Trn mode: 0x00000023
[ 347.803086] mmc0: sdhci: Present: 0x1fff0106 | Host ctl: 0x0000003c
[ 347.803089] mmc0: sdhci: Power: 0x0000000b | Blk gap: 0x00000080
[ 347.803092] mmc0: sdhci: Wake-up: 0x00000000 | Clock: 0x00000007
[ 347.803095] mmc0: sdhci: Timeout: 0x0000000e | Int stat: 0x00000000
[ 347.803098] mmc0: sdhci: Int enab: 0x00ff0003 | Sig enab: 0x00ff0003
[ 347.803101] mmc0: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000000
[ 347.803103] mmc0: sdhci: Caps: 0x546ec881 | Caps_1: 0x80000807
[ 347.803106] mmc0: sdhci: Cmd: 0x00002f3a | Max curr: 0x00000000
[ 347.803109] mmc0: sdhci: Resp[0]: 0x00000900 | Resp[1]: 0x00000000
[ 347.803112] mmc0: sdhci: Resp[2]: 0x00000000 | Resp[3]: 0x00000000
[ 347.803114] mmc0: sdhci: Host ctl2: 0x0000000d
[ 347.803118] mmc0: sdhci: ADMA Err: 0x00000000 | ADMA Ptr:
0x000000011eef1218
[ 347.803119] mmc0: sdhci: ============================================
--
--