On 04/29/2016 02:47 PM, Laurence Oberman wrote:
Recovery with 21 LUNS is 300s that have in-flights to abort.
[ ... ]
eh_deadline is set to 10 on the 2 qlogic ports, eh_timeout is set
> to 10 for all devices. In multipath fast_io_fail_tmo=5
I jam one of the target array ports and discard the commands
> effectively black-holing the commands and leave it that way until
> we recover and I watch the I/O. The recovery takes around 300s even
> with all the tuning and this effectively lands up in Oracle cluster
> evictions.
Hello Laurence,
This discussion started as a discussion about the time needed to fail
over from one path to another. How long did it take in your test before
I/O failed over from the jammed port to another port?
Thanks,
Bart.
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html