Hi Bart, Thank you for your response. On 22/02/2018 18:46, Bart Van Assche wrote: > On 02/22/18 02:58, Jaco Kroon wrote: >> We've been seeing sporadic IO lockups on recent kernels. > > Are you using the legacy I/O stack or blk-mq? If you are not yet using > blk-mq, can you switch to blk-mq + scsi-mq + dm-mq? If the lockup is > reproducible with blk-mq, can you share the output of the following > command: crowsnest ~ # zgrep MQ /proc/config.gz CONFIG_SCSI_MQ_DEFAULT=y # CONFIG_DM_MQ_DEFAULT is not set ... oi, so that's a very valid question. blk-mq is thus off by default, I've now enabled it on the "live" system with "echo 1 > /sys/module/dm_mod/parameters/use_blk_mq". I've also modified the kernel config to set CONFIG_DM_MQ_DEFAULT (I know I can just set this on cmdline too). The only immediately visible effect is that I seem to be more consistently get >300MB/s (and more frequently >400MB) off the array in terms of read speed, where normally I'd expect 200MB consistent with spikes just over 300MB and very infrequently over 400MB. This is a very simple spot check with iotop over approximately a minute. I am seeing I/O errors in dmesg from time to time, this to me hints that potentially it may be something related to some error path that's causing problems. Just so we're clear, we're seeing this happen approximately once a month, so if switching on dm_mod.use_blk_mq solves it then I won't really know beyond a shadow of a doubt, and the only way of "knowing" is if we can get to an uptime of three months or so ... > > (cd /sys/kernel/debug/block && find . -type f -exec grep -aH . {} \;) I don't have a /sys/kernel/debug folder - I've enabled CONFIG_DEBUG_FS and BLK_DEBUG_FS, will reboot at the first opportunity. As a general rule - is there additional overhead to having debugfs enabled? Any other risks that I should be aware of? In essence, are there any disadvantages to just enabling DEBUG_FS as a general rule? I did note that a few extra DEBUG options pop up for other modules ... so my gut is towards leaving this disabled as a general rule and enabling when needed. > > Thanks, Thank you! Kind Regards, Jaco