Re: [PATCH RFC 0/6] bio_split() error handling rework

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 9/23/24 09:19, John Garry wrote:
On 23/09/2024 06:53, Hannes Reinecke wrote:
On 9/19/24 11:22, John Garry wrote:
bio_split() error handling could be improved as follows:
- Instead of returning NULL for an error - which is vague - return a
   PTR_ERR, which may hint what went wrong.
- Remove BUG_ON() calls - which are generally not preferred - and instead
   WARN and pass an error code back to the caller. Many callers of
   bio_split() don't check the return code. As such, for an error we would
   be getting a crash still from an invalid pointer dereference.

Most bio_split() callers don't check the return value. However, it could
be argued the bio_split() calls should not fail. So far I have just
fixed up the md RAID code to handle these errors, as that is my interest
now.

Sending as an RFC as unsure if this is the right direction.

The motivator for this series was initial md RAID atomic write support in https://lore.kernel.org/linux-block/21f19b4b-4b83-4ca2- a93b-0a433741fd26@xxxxxxxxxx/

There I wanted to ensure that we don't split an atomic write bio, and it
made more sense to handle this in bio_split() (instead of the bio_split()
caller).

John Garry (6):
   block: Rework bio_split() return value
   block: Error an attempt to split an atomic write in bio_split()
   block: Handle bio_split() errors in bio_submit_split()
   md/raid0: Handle bio_split() errors
   md/raid1: Handle bio_split() errors
   md/raid10: Handle bio_split() errors

  block/bio.c                 | 14 ++++++++++----
  block/blk-crypto-fallback.c |  2 +-
  block/blk-merge.c           |  5 +++++
  drivers/md/raid0.c          | 10 ++++++++++
  drivers/md/raid1.c          |  8 ++++++++
  drivers/md/raid10.c         | 18 ++++++++++++++++++
  6 files changed, 52 insertions(+), 5 deletions(-)

You are missing '__bio_split_to_limits()' which looks as it would need to be modified, too.


In __bio_split_to_limits(), for REQ_OP_DISCARD, REQ_OP_SECURE_ERASE, and REQ_OP_WRITE_ZEROES, we indirectly call bio_split(). And bio_split() might error. But functions like bio_split_discard() can return NULL for cases where a split is not required. So I suppose we need to check IS_ERR(split) for those request types mentioned. For NULL being returned, we would still have the __bio_split_to_limits() is "if (split)" check.

Indeed. And then you'll need to modify nvme:

diff --git a/drivers/nvme/host/multipath.c b/drivers/nvme/host/multipath.c
index f72c5a6a2d8e..c99f51e7730e 100644
--- a/drivers/nvme/host/multipath.c
+++ b/drivers/nvme/host/multipath.c
@@ -453,7 +453,7 @@ static void nvme_ns_head_submit_bio(struct bio *bio)
         * pool from the original queue to allocate the bvecs from.
         */
        bio = bio_split_to_limits(bio);
-       if (!bio)
+       if (IS_ERR_OR_NULL(bio))
                return;

        srcu_idx = srcu_read_lock(&head->srcu);

Cheers,

Hannes
--
Dr. Hannes Reinecke                  Kernel Storage Architect
hare@xxxxxxx                                +49 911 74053 688
SUSE Software Solutions GmbH, Frankenstr. 146, 90461 Nürnberg
HRB 36809 (AG Nürnberg), GF: I. Totev, A. McDonald, W. Knoblich





[Index of Archives]     [Linux RAID]     [Linux SCSI]     [Linux ATA RAID]     [IDE]     [Linux Wireless]     [Linux Kernel]     [ATH6KL]     [Linux Bluetooth]     [Linux Netdev]     [Kernel Newbies]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Device Mapper]

  Powered by Linux