scan error after FC link recovery when device was in use

Hi,


With 2.6.17.2, if an FC device is in use (mounted FS) when the link goes down for a 'long' period (>30s, i.e. past the 'no device timeout'), a scan error is reported when the link recovers. After that, the devices aren't detected again (oopses are also generated when reloading the HBA drivers, but this may be a side effect).

We see this issue in point-to-point configurations using:
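
To compare the timeout actually in effect on a given host, here is a minimal sketch in Python. It assumes the FC transport class exposes a per-remote-port `dev_loss_tmo` attribute under `/sys/class/fc_remote_ports/` (this should be the timer behind the "blocked FC remote port time out" message, but check your kernel). The function name `read_dev_loss_tmo` is only illustrative.

```python
from pathlib import Path


def read_dev_loss_tmo(root="/sys/class/fc_remote_ports"):
    """Return {rport_name: dev_loss_tmo in seconds} for each remote port.

    Assumption: each rport-* directory under `root` contains a
    `dev_loss_tmo` file holding an integer number of seconds.
    Ports whose attribute is missing or unreadable are skipped.
    """
    result = {}
    base = Path(root)
    if not base.is_dir():
        return result  # no FC transport class on this host
    for rport in sorted(base.glob("rport-*")):
        try:
            result[rport.name] = int((rport / "dev_loss_tmo").read_text().strip())
        except (OSError, ValueError):
            continue
    return result
```

Calling `read_dev_loss_tmo()` on the affected host gives the value to compare against the length of the link outage. Note that once a remote port has been removed after the timeout, it may no longer be listed.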
- Emulex + DDN
- Emulex + NEC
- QLogic + DDN
- QLogic + NEC

In all cases, we get the message "Unexpected response from lun 0 while scanning, scan aborted" from the scsi_report_lun_scan function (scsi_scan.c), and I suspect something goes wrong in the call to scsi_probe_and_add_lun. We don't get this error if the devices aren't in use (not mounted) when the link failure occurs.

Do you know if this issue has already been encountered?
Is there perhaps an existing patch? (I searched the mailing list for scsi_probe_and_add_lun but didn't find any relevant message.)

Here are the extracts from syslog about these errors:


lpfc + DDN:
===========
...
Jul  3 09:15:30 s_kernel@iotiger2 kernel: lpfc 0000:06:02.0: 0:1305 Link Down Event x2 received Data: x2 x20 x0
Jul  3 09:16:00 s_kernel@iotiger2 kernel: lpfc 0000:06:02.0: 0:0203 Nodev timeout on WWPN 23:0:0:1:ff:3:2:a8 NPort xef Data: x8 x7 x0
Jul  3 09:16:05 s_kernel@iotiger2 kernel: rport-2:0-0: blocked FC remote port time out: removing target and saving binding
Jul  3 09:17:01 s_kernel@iotiger2 kernel: lpfc 0000:06:02.0: 0:1303 Link Up Event x3 received Data: x3 x1 x8 x2
Jul  3 09:17:01 s_kernel@iotiger2 kernel: Vendor: DDN Model: S2A 8500 Rev: 5.22
Jul  3 09:17:01 s_kernel@iotiger2 kernel: Type: Direct-Access ANSI SCSI revision: 03
Jul  3 09:17:01 s_kernel@iotiger2 kernel: SCSI device sdd: 571416576 512-byte hdwr sectors (292565 MB)
Jul  3 09:17:01 s_kernel@iotiger2 kernel: sdd: Write Protect is off
Jul  3 09:17:01 s_kernel@iotiger2 kernel: sdd: Mode Sense: a7 00 10 08
Jul  3 09:17:01 s_kernel@iotiger2 kernel: SCSI device sdd: drive cache: write back w/ FUA
Jul  3 09:17:01 s_kernel@iotiger2 kernel: SCSI device sdd: 571416576 512-byte hdwr sectors (292565 MB)
Jul  3 09:17:01 s_kernel@iotiger2 kernel: sdd: Write Protect is off
Jul  3 09:17:01 s_kernel@iotiger2 kernel: sdd: Mode Sense: a7 00 10 08
Jul  3 09:17:01 s_kernel@iotiger2 kernel: SCSI device sdd: drive cache: write back w/ FUA
Jul  3 09:17:01 s_kernel@iotiger2 kernel: sdd: unknown partition table
Jul  3 09:17:01 s_kernel@iotiger2 kernel: sd 2:0:0:0: Attached scsi disk sdd
Jul  3 09:17:01 s_kernel@iotiger2 kernel: sd 2:0:0:0: Attached scsi generic sg4 type 0
Jul  3 09:17:01 s_kernel@iotiger2 kernel: Vendor: DDN Model: S2A 8500 Rev: 5.22
Jul  3 09:17:01 s_kernel@iotiger2 kernel: Type: Direct-Access ANSI SCSI revision: 03
Jul  3 09:17:01 s_kernel@iotiger2 kernel: error 1
Jul  3 09:17:01 s_kernel@iotiger2 kernel: 2:0:0:0: Unexpected response from lun 0 while scanning, scan aborted
...


lpfc + NEC:
===========
...
Jul  4 06:11:44 s_kernel@iotiger2 kernel: EXT3 FS on sdg, internal journal
Jul  4 06:11:44 s_kernel@iotiger2 kernel: EXT3-fs: recovery complete.
Jul  4 06:11:44 s_kernel@iotiger2 kernel: EXT3-fs: mounted filesystem with ordered data mode.
Jul  4 06:12:03 s_kernel@iotiger2 kernel: lpfc 0000:06:02.1: 1:1305 Link Down Event x2 received Data: x2 x20 x0
Jul  4 06:12:33 s_kernel@iotiger2 kernel: lpfc 0000:06:02.1: 1:0203 Nodev timeout on WWPN 20:6:0:0:13:84:0:35 NPort xdc Data: x8 x7 x0
Jul  4 06:12:38 s_kernel@iotiger2 kernel: rport-3:0-0: blocked FC remote port time out: removing target and saving binding
Jul  4 06:12:42 s_kernel@iotiger2 kernel: lpfc 0000:06:02.1: 1:1303 Link Up Event x3 received Data: x3 x1 x8 x2
Jul  4 06:12:42 s_kernel@iotiger2 kernel: Vendor: NEC Model: iStorage 2000 Rev: 2800
Jul  4 06:12:42 s_kernel@iotiger2 kernel: Type: Direct-Access ANSI SCSI revision: 04
Jul  4 06:12:42 s_kernel@iotiger2 kernel: SCSI device sdh: 138412032 512-byte hdwr sectors (70867 MB)
Jul  4 06:12:42 s_kernel@iotiger2 kernel: sdh: Write Protect is off
Jul  4 06:12:42 s_kernel@iotiger2 kernel: sdh: Mode Sense: 97 00 00 08
Jul  4 06:12:42 s_kernel@iotiger2 kernel: SCSI device sdh: drive cache: write back
Jul  4 06:12:42 s_kernel@iotiger2 kernel: SCSI device sdh: 138412032 512-byte hdwr sectors (70867 MB)
Jul  4 06:12:42 s_kernel@iotiger2 kernel: sdh: Write Protect is off
Jul  4 06:12:42 s_kernel@iotiger2 kernel: sdh: Mode Sense: 97 00 00 08
Jul  4 06:12:42 s_kernel@iotiger2 kernel: SCSI device sdh: drive cache: write back
Jul  4 06:12:42 s_kernel@iotiger2 kernel: sdh: unknown partition table
Jul  4 06:12:42 s_kernel@iotiger2 kernel: sd 3:0:0:0: Attached scsi disk sdh
Jul  4 06:12:42 s_kernel@iotiger2 kernel: sd 3:0:0:0: Attached scsi generic sg12 type 0
Jul  4 06:12:42 s_kernel@iotiger2 kernel: Vendor: NEC Model: iStorage 2000 Rev: 2800
Jul  4 06:12:42 s_kernel@iotiger2 kernel: Type: Direct-Access ANSI SCSI revision: 04
Jul  4 06:12:42 s_kernel@iotiger2 kernel: error 1
Jul  4 06:12:42 s_kernel@iotiger2 kernel: 3:0:0:0: Unexpected response from lun 0 while scanning, scan aborted
...

QLogic+NEC
==========
...
Jul  4 09:47:10 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LIP reset occured (f8ef).
Jul  4 09:47:10 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LOOP DOWN detected (2).
Jul  4 09:47:14 s_kernel@iotiger2 kernel: qla2400 0000:07:01.0: LOOP DOWN detected (2).
Jul  4 09:47:45 s_kernel@iotiger2 kernel: rport-9:0-0: blocked FC remote port time out: removing target and saving binding
Jul  4 09:47:50 s_kernel@iotiger2 kernel: rport-8:0-0: blocked FC remote port time out: removing target and saving binding
Jul  4 09:48:11 s_kernel@iotiger2 kernel: qla2400 0000:07:01.0: LIP reset occured (f7f7).
Jul  4 09:48:11 s_kernel@iotiger2 kernel: qla2400 0000:07:01.0: LIP occured (f7f7).
Jul  4 09:48:11 s_kernel@iotiger2 kernel: qla2400 0000:07:01.0: LOOP UP detected (2 Gbps).
Jul  4 09:48:11 s_kernel@iotiger2 kernel: Vendor: NEC Model: iStorage 2000 Rev: 2800
Jul  4 09:48:11 s_kernel@iotiger2 kernel: Type: Direct-Access ANSI SCSI revision: 04
Jul  4 09:48:11 s_kernel@iotiger2 kernel: SCSI device sdi: 138412032 512-byte hdwr sectors (70867 MB)
Jul  4 09:48:11 s_kernel@iotiger2 kernel: sdi: Write Protect is off
Jul  4 09:48:11 s_kernel@iotiger2 kernel: sdi: Mode Sense: 97 00 00 08
Jul  4 09:48:11 s_kernel@iotiger2 kernel: SCSI device sdi: drive cache: write back
Jul  4 09:48:11 s_kernel@iotiger2 kernel: SCSI device sdi: 138412032 512-byte hdwr sectors (70867 MB)
Jul  4 09:48:11 s_kernel@iotiger2 kernel: sdi: Write Protect is off
Jul  4 09:48:11 s_kernel@iotiger2 kernel: sdi: Mode Sense: 97 00 00 08
Jul  4 09:48:11 s_kernel@iotiger2 kernel: SCSI device sdi: drive cache: write back
Jul  4 09:48:11 s_kernel@iotiger2 kernel: sdi: unknown partition table
Jul  4 09:48:11 s_kernel@iotiger2 kernel: sd 8:0:0:0: Attached scsi disk sdi
Jul  4 09:48:11 s_kernel@iotiger2 kernel: sd 8:0:0:0: Attached scsi generic sg14 type 0
Jul  4 09:48:11 s_kernel@iotiger2 kernel: Vendor: NEC Model: iStorage 2000 Rev: 2800
Jul  4 09:48:11 s_kernel@iotiger2 kernel: Type: Direct-Access ANSI SCSI revision: 04
Jul  4 09:48:11 s_kernel@iotiger2 kernel: error 1
Jul  4 09:48:11 s_kernel@iotiger2 kernel: 8:0:0:0: Unexpected response from lun 0 while scanning, scan aborted
...

QLogic+DDN
==========
...
Jul  4 10:01:31 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LOOP DOWN detected (2).
Jul  4 10:02:06 s_kernel@iotiger2 kernel: rport-9:0-0: blocked FC remote port time out: removing target and saving binding
Jul  4 10:03:05 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LIP reset occured (f7f7).
Jul  4 10:03:05 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LIP occured (f7f7).
Jul  4 10:03:05 s_kernel@iotiger2 kernel: qla2400 0000:07:01.1: LOOP UP detected (2 Gbps).
Jul  4 10:03:05 s_kernel@iotiger2 kernel: Vendor: DDN Model: S2A 8500 Rev: 5.22
Jul  4 10:03:05 s_kernel@iotiger2 kernel: Type: Direct-Access ANSI SCSI revision: 03
Jul  4 10:03:05 s_kernel@iotiger2 kernel: SCSI device sdl: 571416576 512-byte hdwr sectors (292565 MB)
Jul  4 10:03:05 s_kernel@iotiger2 kernel: sdl: Write Protect is off
Jul  4 10:03:05 s_kernel@iotiger2 kernel: sdl: Mode Sense: a7 00 10 08
Jul  4 10:03:05 s_kernel@iotiger2 kernel: SCSI device sdl: drive cache: write back w/ FUA
Jul  4 10:03:05 s_kernel@iotiger2 kernel: SCSI device sdl: 571416576 512-byte hdwr sectors (292565 MB)
Jul  4 10:03:05 s_kernel@iotiger2 kernel: sdl: Write Protect is off
Jul  4 10:03:05 s_kernel@iotiger2 kernel: sdl: Mode Sense: a7 00 10 08
Jul  4 10:03:05 s_kernel@iotiger2 kernel: SCSI device sdl: drive cache: write back w/ FUA
Jul  4 10:03:05 s_kernel@iotiger2 kernel: sdl: unknown partition table
Jul  4 10:03:05 s_kernel@iotiger2 kernel: sd 9:0:0:0: Attached scsi disk sdl
Jul  4 10:03:05 s_kernel@iotiger2 kernel: sd 9:0:0:0: Attached scsi generic sg15 type 0
Jul  4 10:03:05 s_kernel@iotiger2 kernel: Vendor: DDN Model: S2A 8500 Rev: 5.22
Jul  4 10:03:05 s_kernel@iotiger2 kernel: Type: Direct-Access ANSI SCSI revision: 03
Jul  4 10:03:05 s_kernel@iotiger2 kernel: error 1
Jul  4 10:03:05 s_kernel@iotiger2 kernel: 9:0:0:0: Unexpected response from lun 0 while scanning, scan aborted
...
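
For anyone comparing against their own syslog, the pattern common to the four extracts above (link down, remote port removed after the timeout, then "scan aborted" on recovery) can be picked out with a small script. This is only a rough sketch: the names `summarize` and `parse_ts` are made up for illustration, the year is an assumption (syslog timestamps don't carry one), and the regular expressions only cover the message formats shown above. It also does not match remote ports to HBA ports, so when several links drop the delay is measured from the most recent link-down line.

```python
import re
from datetime import datetime

MONTHS = {m: i + 1 for i, m in enumerate(
    "Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec".split())}

# "<Mon> <day> <hh:mm:ss> <source> kernel: <message>", as in the extracts above.
LINE_RE = re.compile(
    r"^(?P<ts>[A-Z][a-z]{2}\s+\d+\s+\d\d:\d\d:\d\d)\s+\S+\s+kernel:\s+(?P<msg>.*)$")
DOWN_RE = re.compile(r"Link Down Event|LOOP DOWN detected")
RPORT_RE = re.compile(r"(rport-\d+:\d+-\d+): blocked FC remote port time out")
ABORT_RE = re.compile(
    r"(\d+:\d+:\d+:\d+): Unexpected response from lun 0 while scanning, scan aborted")


def parse_ts(text, year=2006):
    """Parse a syslog timestamp; the year is assumed, only differences matter."""
    month, day, clock = text.split()
    hour, minute, second = (int(x) for x in clock.split(":"))
    return datetime(year, MONTHS[month], int(day), hour, minute, second)


def summarize(lines):
    """Return a list of (event, id, seconds_since_last_link_down or None)."""
    last_down = None
    events = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # "..." separators, headings, non-kernel lines
        ts, msg = parse_ts(m.group("ts")), m.group("msg")
        if DOWN_RE.search(msg):
            last_down = ts
            continue
        rport = RPORT_RE.search(msg)
        if rport:
            delay = (ts - last_down).total_seconds() if last_down else None
            events.append(("rport_removed", rport.group(1), delay))
            continue
        abort = ABORT_RE.search(msg)
        if abort:
            events.append(("scan_aborted", abort.group(1), None))
    return events
```

Feeding it the lpfc + DDN extract, for example with `summarize(open("messages"))` where `messages` is a placeholder file name, should report the removal of rport-2:0-0 35 seconds after the link-down event, followed by the aborted scan on 2:0:0:0.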

--
Frederic TEMPORELLI