Re: [PATCH v4 1/6] libceph: allow requests to return immediately on full conditions if caller wishes

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, 2017-02-10 at 13:37 +0100, Ilya Dryomov wrote:
> On Fri, Feb 10, 2017 at 12:52 PM, Jeff Layton <jlayton@xxxxxxxxxx> wrote:
> > On Fri, 2017-02-10 at 19:41 +0800, Yan, Zheng wrote:
> > > > On 9 Feb 2017, at 22:48, Jeff Layton <jlayton@xxxxxxxxxx> wrote:
> > > > 
> > > > Usually, when the osd map is flagged as full or the pool is at quota,
> > > > write requests just hang. This is not what we want for cephfs, where
> > > > it would be better to simply report -ENOSPC back to userland instead
> > > > of stalling.
> > > > 
> > > > If the caller knows that it will want an immediate error return instead
> > > > of blocking on a full or at-quota error condition then allow it to set a
> > > > flag to request that behavior. Cephfs write requests will always set
> > > > that flag.
> > > > 
> > > > A later patch will deal with requests that were submitted before the new
> > > > map showing the full condition came in.
> > > > 
> > > > Signed-off-by: Jeff Layton <jlayton@xxxxxxxxxx>
> > > > ---
> > > > fs/ceph/addr.c                  | 4 ++++
> > > > fs/ceph/file.c                  | 4 ++++
> > > > include/linux/ceph/osd_client.h | 1 +
> > > > net/ceph/osd_client.c           | 6 ++++++
> > > > 4 files changed, 15 insertions(+)
> > > > 
> > > > diff --git a/fs/ceph/addr.c b/fs/ceph/addr.c
> > > > index 4547bbf80e4f..308787eeee2c 100644
> > > > --- a/fs/ceph/addr.c
> > > > +++ b/fs/ceph/addr.c
> > > > @@ -1040,6 +1040,7 @@ static int ceph_writepages_start(struct address_space *mapping,
> > > > 
> > > >             req->r_callback = writepages_finish;
> > > >             req->r_inode = inode;
> > > > +           req->r_abort_on_full = true;
> > > > 
> > > >             /* Format the osd request message and submit the write */
> > > >             len = 0;
> > > > @@ -1689,6 +1690,7 @@ int ceph_uninline_data(struct file *filp, struct page *locked_page)
> > > >     }
> > > > 
> > > >     req->r_mtime = inode->i_mtime;
> > > > +   req->r_abort_on_full = true;
> > > >     err = ceph_osdc_start_request(&fsc->client->osdc, req, false);
> > > >     if (!err)
> > > >             err = ceph_osdc_wait_request(&fsc->client->osdc, req);
> > > > @@ -1732,6 +1734,7 @@ int ceph_uninline_data(struct file *filp, struct page *locked_page)
> > > >     }
> > > > 
> > > >     req->r_mtime = inode->i_mtime;
> > > > +   req->r_abort_on_full = true;
> > > >     err = ceph_osdc_start_request(&fsc->client->osdc, req, false);
> > > >     if (!err)
> > > >             err = ceph_osdc_wait_request(&fsc->client->osdc, req);
> > > > @@ -1893,6 +1896,7 @@ static int __ceph_pool_perm_get(struct ceph_inode_info *ci,
> > > >     err = ceph_osdc_start_request(&fsc->client->osdc, rd_req, false);
> > > > 
> > > >     wr_req->r_mtime = ci->vfs_inode.i_mtime;
> > > > +   wr_req->r_abort_on_full = true;
> > > >     err2 = ceph_osdc_start_request(&fsc->client->osdc, wr_req, false);
> > > > 
> > > >     if (!err)
> > > 
> > > do you ignore writepage_nounlock() case intentionally?
> > > 
> > > 
> > > 
> > 
> > No. Hmmm...writepage_nounlock calls ceph_osdc_writepages, and it's the
> > only caller so I guess we'll need to set this there. Maybe we should
> > just lift ceph_osdc_writepages into ceph.ko since there are no callers
> > in libceph?
> 
> Set it in ceph_osdc_new_request() -- it's only user is ceph.ko.  It
> should cover all filesystem OSD requests, except for pool check and
> ceph_aio_retry_work().
> 

Ok. The flag will end up being set on reads as well, but we don't
actually do anything with the flag on a read request so that should be
fine. I'll do that.

Thanks,
-- 
Jeff Layton <jlayton@xxxxxxxxxx>
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [CEPH Users]     [Ceph Large]     [Information on CEPH]     [Linux BTRFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]
  Powered by Linux