RE: [PATCH V3 00/25] smartpqi updates

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Can you provide the base OS that you used to build the kernel.org kernel?

Thanks,
Don

-----Original Message-----
From: Don.Brace@xxxxxxxxxxxxx [mailto:Don.Brace@xxxxxxxxxxxxx] 
Sent: Monday, December 28, 2020 9:58 AM
To: buczek@xxxxxxxxxxxxx; Kevin Barnett - C33748 <Kevin.Barnett@xxxxxxxxxxxxx>; Scott Teel - C33730 <Scott.Teel@xxxxxxxxxxxxx>; Justin Lindley - C33718 <Justin.Lindley@xxxxxxxxxxxxx>; Scott Benesh - C33703 <Scott.Benesh@xxxxxxxxxxxxx>; Gerry Morong - C33720 <Gerry.Morong@xxxxxxxxxxxxx>; Mahesh Rajashekhara - I30583 <Mahesh.Rajashekhara@xxxxxxxxxxxxx>; hch@xxxxxxxxxxxxx; jejb@xxxxxxxxxxxxxxxxxx; joseph.szczypek@xxxxxxx; POSWALD@xxxxxxxx
Cc: linux-scsi@xxxxxxxxxxxxxxx; it+linux@xxxxxxxxxxxxx
Subject: RE: [PATCH V3 00/25] smartpqi updates


Subject: Re: [PATCH V3 00/25] smartpqi updates

EXTERNAL EMAIL: Do not click links or open attachments unless you know the content is safe

On 22.12.20 00:30, Don.Brace@xxxxxxxxxxxxx wrote:
> Can you please post your hw configuration and the stress load that you used? Was it fio?

Testsystem is a Dell PowerEdge R730. with two 10 core Intel® Xeon® Processor E5-2687W v3  and 200 GB memory.
Adapter is Adaptec HBA 1100-8e, Firmware 3.21 On it two AIC J3016-01 Enclosures with 16 8TB disks each The disks of each jbod are a combined into a raid6 software raid with xfs on it.
So I have two filesystems with ~100 TB ( 14 * 7.3 TB)

Unfortunately, for the time being, I was only able to reproduce this with a very complex load setup with both, file system activity (two parallel `cp -a` of big directory trees on each filesystem) and switching on and of raid scrubbing at the same time. I'm currently trigger the issue with less complex setups.

I'm not sure at all, whether this is really a problem of the smartpqi driver. Its just the frozen inflight counter seem to hint in the direction of the block layer.

Donald

>>Thanks for sharing your HW setup.
>>I will also setup a similar system. I have two scripts that I run against the driver before I feel satisfied that it will hold up against extreme conditions. One script performs a list of I/O stress tests (to all presented disks (LVs and HBAs): 1) mkfs {xfs, ext4}, 2) mount, 3) test using rsync, 4) fio using file system, 5) umount, 6) fsck, 7) fio to raw disk.

>>The other script continuously issues resets to all of the disks in parallel. Normally any issues will show up within 20 iterations of my scripts. I wait for 50K before I'm happy.

>>I have not tried layering in the dm driver, but that will be added to my tests. There have been a few patches added to both the block layer and dm driver recently.

>>Thanks again,
>>Don.



>
> Dear Don,
>
> just wanted to let you know that I've tested this series (plus the three Depends-on patches you mentioned) on top of Linux v5.10.1 with an Adaptec 1100-8e with fw 3.21.
>
> After three hours of heavy operation (including raid scrubbing!) the 
> driver seems to have lost some requests for the md0 member disks
>
> This is the static picture after all activity has ceased:
>
>       root:deadbird:/scratch/local/# for f in /sys/devices/virtual/block/md?/md/rd*/block/inflight;do echo $f: $(cat $f);done
>       /sys/devices/virtual/block/md0/md/rd0/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd1/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd10/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd11/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd12/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd13/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd14/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd15/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd2/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd3/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd4/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd5/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd6/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd7/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd8/block/inflight: 1 0
>       /sys/devices/virtual/block/md0/md/rd9/block/inflight: 1 0
>       /sys/devices/virtual/block/md1/md/rd0/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd1/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd10/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd11/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd12/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd13/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd14/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd15/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd2/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd3/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd4/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd5/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd6/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd7/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd8/block/inflight: 0 0
>       /sys/devices/virtual/block/md1/md/rd9/block/inflight: 0 0
>
> Best
>     Donald
>
> On 10.12.20 21:34, Don Brace wrote:
>> These patches are based on Martin Peterson's 5.11/scsi-queue tree
>>
>> Note that these patches depend on the following three patches applied 
>>to Martin Peterson's tree:
>>    https://git.kernel.org/pub/scm/linux/kernel/git/mkp/scsi.git
>>    5.11/scsi-queue
>> Depends-on: 5443bdc4cc77 scsi: smartpqi: Update version to 1.2.16-012
>> Depends-on: 408bdd7e5845 scsi: smartpqi: Correct pqi_sas_smp_handler 
>>busy condition
>> Depends-on: 1bdf6e934387 scsi: smartpqi: Correct driver removal with 
>>HBA disks
>>
>> This set of changes consist of:
>>    * Add support for newer controller hardware.
>>      * Refactor AIO and s/g processing code. (No functional changes)
>>      * Add write support for RAID 5/6/1 Raid bypass path (or accelerated I/O path).
>>      * Add check for sequential streaming.
>>      * Add in new PCI-IDs.
>>    * Format changes to re-align with our in-house driver. (No 
>>functional changes.)
>>    * Correct some issues relating to suspend/hibernation/OFA/shutdown.
>>      * Block I/O requests during these conditions.
>>    * Add in qdepth limit check to limit outstanding commands.
>>      to the max values supported by the controller.
>>    * Correct some minor issues found during regression testing.
>>    * Update the driver version.
>>
>> Changes since V1:
>>    * Re-added 32bit calculations to correct i386 compile issues
>>      to patch smartpqi-refactor-aio-submission-code
>>      Reported-by: kernel test robot <lkp@xxxxxxxxx>
>>      
>>https://lists.01.org/hyperkitty/list/kbuild-all@xxxxxxxxxxxx/thread/VM
>>BBGGGE5446SVEOQBRCKBTRRWTSH4AB/
>>
>> Changes since V2:
>>    * Added 32bit division to correct i386 compile issues
>>      to patch smartpqi-add-support-for-raid5-and-raid6-writes
>>      Reported-by: kernel test robot <lkp@xxxxxxxxx>
>>      
>>https://lists.01.org/hyperkitty/list/kbuild-all@xxxxxxxxxxxx/thread/ZC
>>XJJDGPPTTXLZCSCGWEY6VXPRB3IFOQ/
>>
>> ---
>>
>> Don Brace (7):
>>        smartpqi: refactor aio submission code
>>        smartpqi: refactor build sg list code
>>        smartpqi: add support for raid5 and raid6 writes
>>        smartpqi: add support for raid1 writes
>>        smartpqi: add stream detection
>>        smartpqi: add host level stream detection enable
>>        smartpqi: update version to 2.1.6-005
>>
>> Kevin Barnett (14):
>>        smartpqi: add support for product id
>>        smartpqi: add support for BMIC sense feature cmd and feature 
>>bits
>>        smartpqi: update AIO Sub Page 0x02 support
>>        smartpqi: add support for long firmware version
>>        smartpqi: align code with oob driver
>>        smartpqi: enable support for NVMe encryption
>>        smartpqi: disable write_same for nvme hba disks
>>        smartpqi: fix driver synchronization issues
>>        smartpqi: convert snprintf to scnprintf
>>        smartpqi: change timing of release of QRM memory during OFA
>>        smartpqi: return busy indication for IOCTLs when ofa is active
>>        smartpqi: add additional logging for LUN resets
>>        smartpqi: correct system hangs when resuming from hibernation
>>        smartpqi: add new pci ids
>>
>> Mahesh Rajashekhara (1):
>>        smartpqi: fix host qdepth limit
>>
>> Murthy Bhat (3):
>>        smartpqi: add phy id support for the physical drives
>>        smartpqi: update sas initiator_port_protocols and 
>>target_port_protocols
>>        smartpqi: update enclosure identifier in sysf
>>
>>
>>   drivers/scsi/smartpqi/smartpqi.h              |  301 +-
>>   drivers/scsi/smartpqi/smartpqi_init.c         | 3123 ++++++++++-------
>>   .../scsi/smartpqi/smartpqi_sas_transport.c    |   39 +-
>>   drivers/scsi/smartpqi/smartpqi_sis.c          |    4 +-
>>   4 files changed, 2189 insertions(+), 1278 deletions(-)
>>
>
> --
> Donald Buczek
> buczek@xxxxxxxxxxxxx
> Tel: +49 30 8413 1433

--
Donald Buczek
buczek@xxxxxxxxxxxxx
Tel: +49 30 8413 1433




[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [SCSI Target Devel]     [Linux SCSI Target Infrastructure]     [Kernel Newbies]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Linux IIO]     [Samba]     [Device Mapper]

  Powered by Linux