Re: SES Enclosure Management.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 02/16/2012 12:00 PM, Benjamin ESTRABAUD wrote:
On 14/02/12 20:53, Robert Woodworth wrote:
On 02/14/2012 01:42 PM, Joe Landman wrote:
On 02/14/2012 03:31 PM, NeilBrown wrote:
On Tue, 14 Feb 2012 10:30:37 -0700 Robert Woodworth
<robertjwoodworth@xxxxxxxxx>  wrote:

Has anyone ever thought of integrating SES managed enclosures into the
kernel RAID system?  I briefly looked through the archives and have
not found anything on the topic.

Some HW based RAID controllers do this flawlessly now, there is no
reason why the kernel RAID cannot also.  (LSI MegaRAID)

1) When a drive is part if a managed enclosure, the RAID system should
address it by location instead of by enumerated device node.  The SES
device in the enclosure can map the physical slot to a physical drive.
  The RAID admin (mdamd) should be able to add/fail/identify devices
based on slot.

Does this just mean that the admin should using names in /dev/disk/by-path/ rather than /dev/sdXX to address devices? What can md or mdadm do to help?


Not sure on the SES (or SGPIO side), but one of the things we've been doing has been to create a file with disk placement "coordinates", so as to map serial number and device to physical location.


With real SES managed enclosures, you issue a SCSI command to read SES Page1 and Page2 to get the details about the drives in any given slot. This currently works fine in Linux with the sg_utils3 package. From the command line, 'sg_ses -p 2 /dev/sgXX` where the device is the SES device.

Take a look at your systems, if you see a device at /sys/class/enclosure/XXXX/ then you have a managed enclosure attached.


Hi,

True, but your definition of SES "slot 0" might be different from someone else's. The end user will still need to map "slot 0" to a physical slot on his system, instead of mapping a drive's serial number to a physical slot. There is no standard for slot numbering and most vendors use different schemes.
On the boxes I write SES firmware for, we have a sticker label on the slot 'slot1' and its my job to make sure the SES firmware always maps the physical label 'slot1' with the drive in that slot to the data in the SCSI SES page 10 (0x0A).

That is the whole point of the SES device.



2) If the RAID system fails a drive, it should notify the SES
management and turn on the fail bit and the fail LED.

"mdadm --monitor" will run a script on drive failure. This could easily
notify the SES management.

Yes, we are using this now for notifications and logging.


So maybe all we need here is a script to plug in to mdadm... Would you like
to write one?


Just need a "standard" SES (or SGPIO) mechanism to hook into, and we should be able to support this. Right now we have to work through HBA scripts.
A true managed enclosure has nothing to do with the HBA. A managed enclosure provides a device on the SCSI bus and you exclusively communicate with that device regardless of the HBA. Most HW RAIDs (LSI MegaRAID) will hide the SES device exactly like they hide the physical disks.



3) The RAID system should be able to turn on the 'identify' bit and
LEDs for an array or a single drive.

This is fine too, as you pointed out SES gives you information on disks, but in order to map a block device sda to a device in SES, won't you need to rely on its SAS address? or even the SAS address arbitrarily given by the expander to a SATA device?
Look at SCSI SES Page 10 (0x0A), that is the correct mapping of disk to slot on the host side.


Your expander's SES device might display this address for both SAS and SATA drives in SES page 0xa, but most SES devices won't, at least for SATA drives, as the SES spec specifically requires SATA drive's address to be set to 0x0.

So if drive "sd"a fails, you'll be able to tell which expander it is attached using kernel infos, tell its SAS address as you see it, but you won't be able to map it to a particular slot number in SES, and will not be able to set its failed LED reliably.
As stated above. If the firmware on the SES enclosure is written correctly the SES page1 and page2 will correctly map the physical devices *and LEDs* to the devices in the SCSI SES pages. This does work on proper enclosures, yes you can rely on it (on my boxes anyway).

Maybe people have bad experiences with bad firmware, but if you get a box that my expanders and SES firmware is in, they will work correctly.



Again, it sounds like you just need a script to ask mdadm which devices are included in a given array, and then do whatever magic is needed to turn on
the light.
It is fairly easy to extra the device list from the output of
   mdadm --detail --brief --verbose /dev/md/whatever

but it might be good to make it easier to extra from
   mdadm --detail --export /dev/md/whatever

Would you like to write such a script?


I'm currently doing firmware on a managed enclosure.  Although my
vendor only supports LSI MegaRAID, there is no reason why my
enclosures cannot work in the same manner on a kernel RAID system.



Request for comments...

It sounds to me like you just need a few scripts to provide some
enclosure-specific functionality. I would be happy to include them in the
mdadm distribution.

Or maybe there is something that I didn't understand??

Thanks,
NeilBrown


Regards,
Ben.

--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux