Re: Handling multiple paths to enclosure devices?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 11-07-28 04:33 PM, James Bottomley wrote:
On Thu, 2011-07-28 at 10:05 -0700, Roland Dreier wrote:
Hi,

I'm seeing an issue with the current design of our enclosure
handling.  In a system with a bunch of drives in an enclosure, it's
definitely helpful to have a way to go from sdXX to which slot in the
enclosure that drive is in, and that's what the symlink
/sys/block/sdXX/device/enclosure_device:NN provides.

However, in a system with multiple paths to the enclosure, eg an HBA
with two external SAS ports, both connected to a SAS expander in a
JBOD, ie in lame ASCII graphics, something like:

      +-----+                 /-- drv1
      |     |       +-----+  /---  .
      |     |==SAS==|     |-/----  .
      | HBA |       | exp |------  .
      |     |==SAS==|     |-\----  .
      |     |       +-----+  \---  .
      +-----+                 \-- drvN

So this configuration should form a single wide port and thus not
actually be multiple paths.  However, if you have two HBAs instead of
one (or a non-SAS HBA), I grant it becomes multipath.

Here are some conflicting results for the single HBA case.
I tested two LSI HBAs separately, each with 8 phys and wired 5
of those phys to a LSI SAS-2 expander.

Here are the results for a SAS-1.1 (3 Gbps) 3444E HBA seen
from a SMP DISCOVER on the expander:
  phy   5:T:attached:[5001517e85c3efe5:00  t(SATA)]  3 Gbps
  phy   7:T:attached:[5000c500215725bd:00  t(SSP)]  6 Gbps
  phy  12:T:attached:[500605b00006f263:03  i(SSP+STP+SMP)]  3 Gbps
  phy  20:T:attached:[500605b00006f264:04  i(SSP+STP+SMP)]  3 Gbps
  phy  21:T:attached:[500605b00006f264:05  i(SSP+STP+SMP)]  3 Gbps
  phy  22:T:attached:[500605b00006f264:06  i(SSP+STP+SMP)]  3 Gbps
  phy  23:T:attached:[500605b00006f264:07  i(SSP+STP+SMP)]  3 Gbps
  phy  24:D:attached:[5001517e85c3effd:00  V i(SSP+SMP) t(SSP)]  6 Gbps

That is two ports:
  - a narrow port [HBA phy 3 to expander phy 12]
    note the different SAS address of HBA phy 3: 500605b00006f263
  - a wide port [HBA phys 4-7 to expander phys 20-23]
Should the HBA report as two hosts?? [It only reports as one host
in my test.]

So if a SAS HBA can't handle a wide port with more than
4 phys, it can just change the SAS addresses on one or more
phys at the HBA end.


Here are the results for a SAS-2 (6 Gbps) 9212 HBA seen
from a SMP DISCOVER on the same expander:
  phy   5:T:attached:[5001517e85c3efe5:00  t(SATA)]  3 Gbps
  phy   7:T:attached:[5000c500215725bd:00  t(SSP)]  6 Gbps
  phy  12:T:attached:[500605b001d0d3e0:04  i(SSP+STP+SMP)]  6 Gbps
  phy  20:T:attached:[500605b001d0d3e0:03  i(SSP+STP+SMP)]  6 Gbps
  phy  21:T:attached:[500605b001d0d3e0:02  i(SSP+STP+SMP)]  6 Gbps
  phy  22:T:attached:[500605b001d0d3e0:01  i(SSP+STP+SMP)]  6 Gbps
  phy  23:T:attached:[500605b001d0d3e0:00  i(SSP+STP+SMP)]  6 Gbps
  phy  24:D:attached:[5001517e85c3effd:00  V i(SSP+SMP) t(SSP)]  6 Gbps

Now that is a single wide port (5 phys wide) since all
5 expander phys have the same SAS address (not shown)
and all 5 HBA phys have the same address.

we have two paths to each drive, so each gets two names, sdXX and
sdYY.  However, in drivers/misc/enclosure.c, the code only allows one
device in each component and so what happens is that sdXX gets
discovered, then gets an enclosure_device:NN link, then sdYY is
discovered, so sdXX's enclosure_device:NN link is removed and one is
added for sdYY.  And so if I want to figure out which enclosure slot
sdXX is in, I'm in for a hard time.

It would be a simple matter of writing code to allow all the block
devices in a slot to link back to that slot -- we would have to be a
bit more careful of keeping track of what links exist, but it should
be doable.

The wrinkle is that there are also /sys/class/enclosure/ZZZ/NN/device
symlinks that allow going the other way.  And it's harder to see how
to express multiple block devices in one enclosure slot.

Thoughts on how to improve our enclosure handling?

IN SAS-2 the SMP DISCOVER response should contain slot information.
Looking at sysfs for my test system:

# lsscsi
[1:0:0:0]    disk    ATA      ST3320620AS      3.AA  /dev/sda
[6:0:0:0]    disk    ATA      ST31000528AS     CC38  /dev/sdb
[6:0:1:0]    disk    SEAGATE  ST32000444SS     0006  /dev/sdc
[6:0:2:0]    enclosu Intel    RES2SV240        0600  -

then fetching the corresponding SAS port addresses:

# lsscsi -t
[1:0:0:0]    disk    sata:                           /dev/sda
[6:0:0:0]    disk    sas:0x5001517e85c3efe5          /dev/sdb
[6:0:1:0]    disk    sas:0x5000c500215725bd          /dev/sdc
[6:0:2:0]    enclosu sas:0x5001517e85c3effd          -

Referring to the SMP DISCOVER response above, it can be seen
that /dev/sdc is connected to expander phy 7. So getting the
long form output for a SMP DISCOVER on phy 7 :

# smp_discover -p 7 /dev/bsg/expander-6\:0
Discover response:
  ...
  attached SAS address: 0x5000c500215725bd
  attached phy identifier: 0
  ...
  device slot number: 255
  device slot group number: 255
  device slot group output connector:

Sadly the value of 255 means not available. YMMV

My initial thought is that in a multi-path situation, as above, we get
two enclosures appearing as well (one down each path).  If we
incorporated the idea of topological subtrees into the identity matching
code, we'd end up filling each of the enclosures with the path connected
devices.  That seems to be an easy situation for multi-path drivers to
sort out and one requiring no alteration of the existing enclosure code
(except to do the topological subtree search).

How does that sound?

Does this also solve the problem reported a few weeks back
in which a SES logical unit reported duplicate element
descriptor names?

Doug Gilbert
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [SCSI Target Devel]     [Linux SCSI Target Infrastructure]     [Kernel Newbies]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Linux IIO]     [Samba]     [Device Mapper]
  Powered by Linux