Re: [PATCH v3 17/26] lpfc: Rework EQ/CQ processing to address interrupt coalescing

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 1/26/19 2:18 AM, James Smart wrote:
When driving high iop counts, auto_imax coalescing kick in and drives
the performance to extremely small iops levels.

There are two issues:
1) auto_imax is enabled by default. The auto algorithm, when iops
    gets high divides the iops by the hdwq count and uses that value
    to calculate EQ_Delay. The EQ_Delay is set uniformly on all EQs
    whether they have load or not. The EQ_delay is only manipulated
    every 5s (a long time). Thus there were large 5s swings of no
    interrupt delay followed by large/maximum delay, before repeating.

2) When processing a CQ, the driver got mixed up on the rate of when
    to ring the doorbell to keep the chip appraised of the eqe or cqe
    consumption as well as how how long to sit in the thread and
    process queue entries. Currently, the driver capped its work at
    64 entries (very small) and exited/rearmed the CQ.  Thus, on heavy
    loads, additional overheads were taken to exit and re-enter the
    interrupt handler. Worse, if in the large/maximum coalescing
    windows,k it could be a while before getting back to servicing.

The issues are corrected by the following:
- A change in defaults. Auto_imax is turned OFF and fcp_imax is set
   to 0. Thus all interrupts are immediate.
- Cleanup of field names and their meanings. Existing names were
   non-intuitive or used for duplicate things.
- Added max_proc_limit field, to control the length of time the
   handlers would service completions.
- Reworked EQ handling:
     Added common routine that walks eq, applying notify interval and
       max processing limits. Use queue_claimed to claim ownership of
       the queue while processing. Always rearm the queue whenever the
       common routine is called.
     Rework queue element processing, namely to eliminate hba_index vs
       host_index. Only one index is necessary. The queue entry can be
       marked invalid and the host_index updated immediately after
       eqe processing.
     After rework, xx_release routines are now DB write functions.
       Renamed the routines as such.
     Moved lpfc_sli4_eq_flush(), which does similar action, to same area.
     Replaced the 2 individual loops that walk an eq with a call to the
       common routine.
     Slightly revised lpfc_sli4_hba_handle_eqe() calling syntax.
     Added per-cpu counters to detect interrupt rates and scale
       interrupt coalescing values.
- Reworked CQ handling:
     Added common routine that walks cq, applying notify interval and
       max processing limits. Use queue_claimed to claim ownership of
       the queue while processing. Always rearm the queue whenever the
       common routine is called.
     Rework queue element processing, namely to eliminate hba_index vs
       host_index. Only one index is necessary. The queue entry can be
       marked invalid and the host_index updated immediately after
       cqe processing.
     After rework, xx_release routines are now DB write functions.
       Renamed the routines as such.
     Replaced the 3 individual loops that walk a cq with a call to the
       common routine.
     Redefined lpfc_sli4_sp_handle_mcqe() to commong handler definition with
       queue reference. Add increment for mbox completion to handler.
- Added a new module/sysfs attribute: lpfc_cq_max_proc_limit
   To allow dynamic changing of the CQ max_proc_limit value being used.

Although this leaves an EQ as an immediate interrupt, that interrupt will
only occur if a CQ bound to it is in an armed state and has cqe's to
process.  By staying in the cq processing routine longer, high loads
will avoid generating more interrupts as they will only rearm as the
processing thread exits. The immediately interrupt is also beneficial
to idle or lower-processing CQ's as they get serviced immediately without
being penalized by sharing an EQ with a more loaded CQ.

Signed-off-by: Dick Kennedy <dick.kennedy@xxxxxxxxxxxx>
Signed-off-by: James Smart <jsmart2021@xxxxxxxxx>

Reviewed-by: Hannes Reinecke <hare@xxxxxxxx>

Cheers,

Hannes



[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [SCSI Target Devel]     [Linux SCSI Target Infrastructure]     [Kernel Newbies]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Linux IIO]     [Samba]     [Device Mapper]

  Powered by Linux