RE: Issue in sas_ex_discover_dev() for multiple level of SAS expanders in a domain

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I suppose you have got the log file I attached.
If not, please let me know.
Any update about this?

Eric LI


Internal Use - Confidential
+AD4- -----Original Message-----
+AD4- From: Li, Eric (Honggang)
+AD4- Sent: Thursday, April 25, 2024 1:04 PM
+AD4- To: Jason Yan +ADw-yanaijie+AEA-huawei.com+AD4AOw- John Garry +ADw-john.g.garry+AEA-oracle.com+AD4AOw-
+AD4- james.bottomley+AEA-hansenpartnership.com+ADs- Martin K . Petersen
+AD4- +ADw-martin.petersen+AEA-oracle.com+AD4-
+AD4- Cc: linux-scsi+AEA-vger.kernel.org
+AD4- Subject: RE: Issue in sas+AF8-ex+AF8-discover+AF8-dev() for multiple level of SAS expanders in a
+AD4- domain
+AD4-
+AD4- +AD4------Original Message-----
+AD4- +AD4-From: Jason Yan +ADw-yanaijie+AEA-huawei.com+AD4-
+AD4- +AD4-Sent: Thursday, April 25, 2024 10:58 AM
+AD4- +AD4-To: John Garry +ADw-john.g.garry+AEA-oracle.com+AD4AOw- Li, Eric (Honggang)
+AD4- +AD4APA-Eric.H.Li+AEA-Dell.com+AD4AOw- james.bottomley+AEA-hansenpartnership.com+ADs- Martin K .
+AD4- +AD4-Petersen +ADw-martin.petersen+AEA-oracle.com+AD4-
+AD4- +AD4-Cc: linux-scsi+AEA-vger.kernel.org
+AD4- +AD4-Subject: Re: Issue in sas+AF8-ex+AF8-discover+AF8-dev() for multiple level of SAS
+AD4- +AD4-expanders in a domain
+AD4- +AD4-
+AD4- +AD4-
+AD4- +AD4AWw-EXTERNAL EMAIL+AF0-
+AD4- +AD4-
+AD4- +AD4-On 2024/4/24 18:46, John Garry wrote:
+AD4- +AD4APg- On 24/04/2024 09:59, Li, Eric (Honggang) wrote:
+AD4- +AD4APgA+- Hi,
+AD4- +AD4APgA+-
+AD4- +AD4APgA+- There is an issue in the function sas+AF8-ex+AF8-discover+AF8-dev() when I have
+AD4- +AD4APgA+- multiple SAS expanders chained under one SAS port on SAS controller.
+AD4- +AD4APg-
+AD4- +AD4APg- I think typically we can't and so don't test such a setup.
+AD4- +AD4-
+AD4- +AD4-Eric,
+AD4- +AD4-
+AD4- +AD4-I also don't understand why you need such a setup. Can you explain more
+AD4- +AD4-details of your topology?
+AD4-
+AD4- I believe this is common setup if you want to support large number of drives under
+AD4- one SAS port of SAS controller.
+AD4-
+AD4- +AD4-
+AD4- +AD4APg-
+AD4- +AD4APgA+-
+AD4- +AD4APgA+- In this function, we first check whether the PHY+IBk-s
+AD4- +AD4APgA+- attached+AF8-sas+AF8-address is already present in the SAS domain, and then
+AD4- +AD4APgA+- check if this PHY belongs to an existing port on this SAS expander.
+AD4- +AD4APgA+- I think this has an issue if this SAS expander use a wide port
+AD4- +AD4APgA+- connecting a downstream SAS expander.
+AD4- +AD4APgA+- This is because if the PHY belongs to an existing port on this SAS
+AD4- +AD4APgA+- expander, the attached SAS address of this port must already be
+AD4- +AD4APgA+- present in the domain and it results in disabling that port.
+AD4- +AD4APgA+- I don+IBk-t think that is what we expect.
+AD4- +AD4APgA+-
+AD4- +AD4APgA+- In old release (4.x), at the end of this function, it would make
+AD4- +AD4APgA+- addition sas+AF8-ex+AF8-join+AF8-wide+AF8-port() call for any possibly PHYs that
+AD4- +AD4APgA+- could be added into the SAS port.
+AD4- +AD4APgA+- This will make subsequent PHYs (other than the first PHY of that
+AD4- +AD4APgA+- port) being marked to DISCOVERED so that this function would not be
+AD4- +AD4APgA+- invoked on those subsequent PHYs (in that port).
+AD4- +AD4APgA+- But potential question here is we didn+IBk-t configure the per-PHY
+AD4- +AD4APgA+- routing table for those PHYs.
+AD4- +AD4APgA+- As I don+IBk-t have such SAS expander on hand, I am not sure what+IBk-s
+AD4- +AD4APgA+- impact (maybe just performance/bandwidth impact).
+AD4- +AD4APgA+- But at least, it didn+IBk-t impact the functionality of that port.
+AD4- +AD4APgA+-
+AD4- +AD4APgA+- But in v5.3 or later release, that part of code was removed (in the
+AD4- +AD4APgA+- commit a1b6fb947f923).
+AD4- +AD4APg-
+AD4- +AD4APg- Jason, can you please check this?
+AD4- +AD4-
+AD4- +AD4-The removed code is only for races before we serialize the event
+AD4- +AD4-processing. All PHYs will still be scanned one by one and add to the
+AD4- +AD4-wide port if they have the same address. So are you encountering a real issue? If
+AD4- so, can you share the full log?
+AD4-
+AD4- Yes. We did hit this issue when we upgrade Linux kernel from 4.19.236 to 5.14.21.
+AD4- Full logs attached.
+AD4-
+AD4- +AD4-
+AD4- +AD4-Thanks,
+AD4- +AD4-Jason
+AD4- +AD4-
+AD4- +AD55XU4AUgeYelIp/wE-
+AD4- +AD4-
+AD4- +AD4APg-
+AD4- +AD4APg- Thanks+ACE-
+AD4- +AD4APg-
+AD4- +AD4APgA+- And this caused this problem occurred (downstream port of that SAS
+AD4- +AD4APgA+- expander was disabled and all downstream SAS devices were removed
+AD4- +AD4APgA+- from the domain).
+AD4- +AD4APgA+-
+AD4- +AD4APgA+- Regards.
+AD4- +AD4APgA+- Eric Li
+AD4- +AD4APgA+-
+AD4- +AD4APgA+- SPE, DellEMC
+AD4- +AD4APgA+- 3/F KIC 1, 252+ACM- Songhu Road, YangPu District, SHANGHAI
+AD4- +AD4APgA+- +-86-21-6036-4384
+AD4- +AD4APgA+-
+AD4- +AD4APgA+-
+AD4- +AD4APgA+- Internal Use - Confidential
+AD4- +AD4APg-
+AD4- +AD4APg- .





[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [SCSI Target Devel]     [Linux SCSI Target Infrastructure]     [Kernel Newbies]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Linux IIO]     [Samba]     [Device Mapper]

  Powered by Linux