Re: [PATCH 3/8] pnfsblock: move find lock page logic out of bl_write_pagelist

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 2011-11-10 10:49, tao.peng@xxxxxxx wrote:
> 
>> -----Original Message-----
>> From: linux-nfs-owner@xxxxxxxxxxxxxxx [mailto:linux-nfs-owner@xxxxxxxxxxxxxxx]
>> On Behalf Of Benny Halevy
>> Sent: Thursday, November 10, 2011 4:32 PM
>> To: Peng Tao
>> Cc: linux-nfs@xxxxxxxxxxxxxxx; Trond.Myklebust@xxxxxxxxxx; Peng, Tao
>> Subject: Re: [PATCH 3/8] pnfsblock: move find lock page logic out of
>> bl_write_pagelist
>>
>> On 2011-11-09 17:16, Peng Tao wrote:
>>> Also avoid unnecessary lock_page if page is handled by others.
>>>
>>> Signed-off-by: Peng Tao <peng_tao@xxxxxxx>
>>> ---
>>>  fs/nfs/blocklayout/blocklayout.c |   78
>> ++++++++++++++++++++++++++------------
>>>  1 files changed, 54 insertions(+), 24 deletions(-)
>>>
>>> diff --git a/fs/nfs/blocklayout/blocklayout.c b/fs/nfs/blocklayout/blocklayout.c
>>> index 4ced0b0..e8e13f3 100644
>>> --- a/fs/nfs/blocklayout/blocklayout.c
>>> +++ b/fs/nfs/blocklayout/blocklayout.c
>>> @@ -484,6 +484,56 @@ cleanup:
>>>  	return ret;
>>>  }
>>>
>>> +/* Find or create a zeroing page marked being writeback.
>>> + * Return ERR_PTR on error, NULL to indicate skip this page and page itself
>>> + * to indicate write out.
>>> + */
>>> +static struct page*
>>
>> coding style nit: "page *" rather than "page*"
>>
>>> +bl_find_get_zeroing_page(struct inode *inode, pgoff_t index,
>>> +			struct pnfs_block_extent *cow_read)
>>> +{
>>> +	struct page* page;
>>
>> checkpatch nit:
>> ERROR: "foo* bar" should be "foo *bar"
> Thank you! Will change both.
> 
> 
>>
>>> +	int locked = 0;
>>> +	page = find_get_page(inode->i_mapping, index);
>>> +	if (page)
>>> +		goto check_page;
>>> +
>>> +	page = find_or_create_page(inode->i_mapping, index, GFP_NOFS);
>>> +	if (unlikely(!page)) {
>>> +		dprintk("%s oom\n", __func__);
>>> +		return ERR_PTR(-ENOMEM);
>>> +	}
>>> +	locked = 1;
>>
>> Why not just call find_or_create_page in the first place?
>> dealing with both the locked and unlocked cases here just
>> seems to add unneeded complexity.
> It is to avoid lock page when possible. find_or_create_page() calls find_and_lock_page() but there are cases that the page is already locked by someone (e.g., in nfs_writepage_locked()) and current process has to wait. Pressure tests show many such contention. So I want to optimize it out.

I see.  So isn't there a missing page_cache_release
in the late locking case (see below...)

> 
> Thanks,
> Tao
>>
>> Benny
>>
>>> +
>>> +check_page:
>>> +	/* PageDirty: Other will write this out
>>> +	 * PageWriteback: Other is writing this out
>>> +	 * PageUptodate: It was read before
>>> +	 */
>>> +	if (PageDirty(page) || PageWriteback(page)) {
>>> +		print_page(page);
>>> +		if (locked)
>>> +			unlock_page(page);
>>> +		page_cache_release(page);
>>> +		return NULL;
>>> +	}
>>> +
>>> +	if (!locked) {
>>> +		lock_page(page);
>>> +		if (PageDirty(page) || PageWriteback(page))
>>> +			unlock_page(page);

I suspect page_cache_release is required here too.
Actually, doing the following instead makes sense to me:

	if (!locked) {
		lock_page(page);
		locked = true;	// yeah, make it should be bool, not int
		goto check_page;
	}

Benny

>>> +		return NULL;
>>> +	}
>>> +	if (!PageUptodate(page)) {
>>> +		/* New page, readin or zero it */
>>> +		init_page_for_write(page, cow_read);
>>> +	}
>>> +	set_page_writeback(page);
>>> +	unlock_page(page);
>>> +
>>> +	return page;
>>> +}
>>> +
>>>  static enum pnfs_try_status
>>>  bl_write_pagelist(struct nfs_write_data *wdata, int sync)
>>>  {
>>> @@ -543,32 +593,12 @@ fill_invalid_ext:
>>>  			dprintk("%s zero %dth page: index %lu isect %llu\n",
>>>  				__func__, npg_zero, index,
>>>  				(unsigned long long)isect);
>>> -			page =
>>> -			    find_or_create_page(wdata->inode->i_mapping, index,
>>> -						GFP_NOFS);
>>> -			if (!page) {
>>> -				dprintk("%s oom\n", __func__);
>>> -				wdata->pnfs_error = -ENOMEM;
>>> +			page = bl_find_get_zeroing_page(wdata->inode, index, cow_read);
>>> +			if (unlikely(IS_ERR(page))) {
>>> +				wdata->pnfs_error = PTR_ERR(page);
>>>  				goto out;
>>> -			}
>>> -
>>> -			/* PageDirty: Other will write this out
>>> -			 * PageWriteback: Other is writing this out
>>> -			 * PageUptodate: It was read before
>>> -			 * sector_initialized: already written out
>>> -			 */
>>> -			if (PageDirty(page) || PageWriteback(page)) {
>>> -				print_page(page);
>>> -				unlock_page(page);
>>> -				page_cache_release(page);
>>> +			} else if (page == NULL)
>>>  				goto next_page;
>>> -			}
>>> -			if (!PageUptodate(page)) {
>>> -				/* New page, readin or zero it */
>>> -				init_page_for_write(page, cow_read);
>>> -			}
>>> -			set_page_writeback(page);
>>> -			unlock_page(page);
>>>
>>>  			ret = bl_mark_sectors_init(be->be_inval, isect,
>>>  						       PAGE_CACHE_SECTORS);
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
>> the body of a message to majordomo@xxxxxxxxxxxxxxx
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux