On Wed, Jun 29, 2016 at 7:17 AM, Zygo Blaxell <u0oo5pgu@xxxxxxxxxxxxxxxxxxxxx> wrote: > OK, but the two links you provided are not examples of these. But there *are* plenty of examples of this. I've run into this personally, before I knew to specifically check the ERC/TLER/whatever configuration on all my drives and pro-actively configure them properly. When the only two options are 1) long kernel timeout and URE is caught and fixed, or 2) short kernel timeout and the drive is detected as failed and kicked from all arrays, then I'll take #1 please. Obviously, trying to detect misconfiguration and drives that don't support ERC/TLER and fixing the timeout accordingly would be better. I agree with others, the current default behavior is unintentionally user-hostile. > Long timeouts don't really serve anyone, even in single-disk cases. This statement is too dogmatic. It depends on the drive. For a drive with the proper features and settings, that is guaranteed to respond in a few seconds unless it has truly totally failed, I agree with you. For a drive with those features but misconfigured (e.g., by default), best is to configure it properly, so in that case I agree with you but changes are needed somewhere to get the configuration to occur automatically. For a consumer drive that lacks those features entirely, I disagree with you. Although for that case, it would be worth having an alarm of some sort be triggered perhaps similar to the EMails generated when an array degrades. That would let the user know that the drive is responding very slowly (probably indicating recoverable read errors) and may fail soon. Again, changes are needed to do that. Eddie -- To unsubscribe from this list: send the line "unsubscribe linux-raid" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html