smartd/smartctl offlining drives

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I have 2 similarly specced machines with Fedora 9 installed using kernel 2.6.26.5-45.fc9.x86_64 and a 16 x SATA md RAID6 behind an LSISAS1068E SAS controller. Both are running a locally compiled version of Samba 3.0.29.

If they are configured to start samba and smartd on boot, these two services are the last to start - samba directly followed by smartd.

This results in the SAS controller progressively offlining each of the 16 drives in turn, as the SMART commands cause the controller to repeatedly reset each drive:

Oct 25 12:39:13 storage smartd[3147]: Device: /dev/sdc, opened
Oct 25 12:39:13 storage smartd[3147]: Device: /dev/sdc, not found in smartd database. Oct 25 12:39:19 storage kernel: mptscsih: ioc0: attempting task abort! (sc=ffff81022d0597c0) Oct 25 12:39:19 storage kernel: sd 8:0:0:0: [sdc] CDB: ATA command pass through(16): 85 08 0e 00 d5 00 01 00 01 00 4f 00 c2 00 b0 00 Oct 25 12:39:20 storage kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000) Oct 25 12:39:21 storage kernel: mptscsih: ioc0: task abort: SUCCESS (sc=ffff81022d0597c0) Oct 25 12:39:31 storage kernel: mptscsih: ioc0: attempting task abort! (sc=ffff81022d0597c0) Oct 25 12:39:31 storage kernel: sd 8:0:0:0: [sdc] CDB: Test Unit Ready: 00 00 00 00 00 00 Oct 25 12:39:32 storage kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000) Oct 25 12:39:32 storage kernel: mptscsih: ioc0: task abort: SUCCESS (sc=ffff81022d0597c0) Oct 25 12:39:32 storage kernel: mptscsih: ioc0: attempting target reset! (sc=ffff81022d0597c0) Oct 25 12:39:32 storage kernel: sd 8:0:0:0: [sdc] CDB: ATA command pass through(16): 85 08 0e 00 d5 00 01 00 01 00 4f 00 c2 00 b0 00
Oct 25 12:39:34 storage kernel: mptscsih: ioc0: Issue of TaskMgmt failed!
Oct 25 12:39:34 storage kernel: mptscsih: ioc0: target reset: FAILED (sc=ffff81022d0597c0) Oct 25 12:39:34 storage kernel: mptscsih: ioc0: attempting bus reset! (sc=ffff81022d0597c0) Oct 25 12:39:34 storage kernel: sd 8:0:0:0: [sdc] CDB: ATA command pass through(16): 85 08 0e 00 d5 00 01 00 01 00 4f 00 c2 00 b0 00 Oct 25 12:39:43 storage kernel: mptscsih: ioc0: bus reset: SUCCESS (sc=ffff81022d0597c0) Oct 25 12:40:03 storage kernel: mptscsih: ioc0: attempting task abort! (sc=ffff81022d0597c0) Oct 25 12:40:03 storage kernel: sd 8:0:0:0: [sdc] CDB: Test Unit Ready: 00 00 00 00 00 00 Oct 25 12:40:04 storage kernel: mptbase: ioc0: LogInfo(0x31140000): Originator={PL}, Code={IO Executed}, SubCode(0x0000) Oct 25 12:40:04 storage kernel: mptscsih: ioc0: task abort: SUCCESS (sc=ffff81022d0597c0) Oct 25 12:40:04 storage kernel: mptscsih: ioc0: attempting host reset! (sc=ffff81022d0597c0)
Oct 25 12:40:04 storage kernel: mptbase: ioc0: Initiating recovery
Oct 25 12:40:19 storage kernel: mptscsih: ioc0: host reset: SUCCESS (sc=ffff81022d0597c0) Oct 25 12:40:19 storage kernel: sd 8:0:0:0: Device offlined - not ready after error recovery Oct 25 12:40:19 storage smartd[3147]: Device: /dev/sdc, Read SMART Error Log Failed Oct 25 12:40:19 storage smartd[3147]: Device: /dev/sdc, no SMART Error log; remove -l error Directive from smartd.conf Oct 25 12:40:19 storage smartd[3147]: Device: /dev/sdc, is SMART capable. Adding to "monitor" list.
Oct 25 12:40:19 storage smartd[3147]: Device: /dev/sdd, opened
Oct 25 12:40:19 storage smartd[3147]: Device: /dev/sdd, not found in smartd database. Oct 25 12:40:26 storage kernel: mptscsih: ioc0: attempting task abort! (sc=ffff81022d207b80) Oct 25 12:40:26 storage kernel: sd 8:0:1:0: [sdd] CDB: ATA command pass through(16): 85 08 0e 00 d5 00 01 00 01 00 4f 00 c2 00 b0 00

and so on until all 16 drives are down.

If samba is disabled on boot, with smartd enabled and samba is then manually started later or smartd is disabled on boot, with samba enabled and smartd started manually, everything works perfectly - smartd and smartctl work perfectly.

I am unsure exactly which component - samba, smartd or mpt driver is at fault here.

Regards,

Richard

--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [SCSI Target Devel]     [Linux SCSI Target Infrastructure]     [Kernel Newbies]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Linux IIO]     [Samba]     [Device Mapper]
  Powered by Linux