RESEND Re: [PATCH v16 2/7] drm/ttm/pool, drm/ttm/tt: Provide a helper to shrink pages

Thomas Hellström <thomas.hellstrom@xxxxxxxxxxxxxxx> · Tue, 25 Feb 2025 09:26:25 +0100

Hi, Christian,

Ping? I'd really want to get this in before -rc6

Thanks,
Thomas

On Tue, 2025-02-18 at 16:40 +0100, Thomas Hellström wrote:
> Hi, Christian,
> 
> On Wed, 2025-02-05 at 15:02 +0100, Christian König wrote:
> > Am 30.01.25 um 11:13 schrieb Thomas Hellström:
> > > Provide a helper to shrink ttm_tt page-vectors on a per-page
> > > basis. A ttm_backup backend could then in theory get away with
> > > allocating a single temporary page for each struct ttm_tt.
> > > 
> > > This is accomplished by splitting larger pages before trying to
> > > back them up.
> > > 
> > > In the future we could allow ttm_backup to handle backing up
> > > large pages as well, but currently there's no benefit in
> > > doing that, since the shmem backup backend would have to
> > > split those anyway to avoid allocating too much temporary
> > > memory, and if the backend instead inserts pages into the
> > > swap-cache, those are split on reclaim by the core.
> > > 
> > > Due to potential backup- and recover errors, allow partially
> > > swapped
> > > out struct ttm_tt's, although mark them as swapped out stopping
> > > them
> > > from being swapped out a second time. More details in the
> > > ttm_pool.c
> > > DOC section.
> > > 
> > > v2:
> > > - A couple of cleanups and error fixes in ttm_pool_back_up_tt.
> > > - s/back_up/backup/
> > > - Add a writeback parameter to the exported interface.
> > > v8:
> > > - Use a struct for flags for readability (Matt Brost)
> > > - Address misc other review comments (Matt Brost)
> > > v9:
> > > - Update the kerneldoc for the ttm_tt::backup field.
> > > v10:
> > > - Rebase.
> > > v13:
> > > - Rebase on ttm_backup interface change. Update kerneldoc.
> > > - Rebase and adjust ttm_tt_is_swapped().
> > > v15:
> > > - Rebase on ttm_backup return value change.
> > > - Rebase on previous restructuring of ttm_pool_alloc()
> > > - Rework the ttm_pool backup interface (Christian König)
> > > - Remove cond_resched() (Christian König)
> > > - Get rid of the need to allocate an intermediate page array
> > >    when restoring a multi-order page (Christian König)
> > > - Update documentation.
> > > 
> > > Cc: Christian König <christian.koenig@xxxxxxx>
> > > Cc: Somalapuram Amaranath <Amaranath.Somalapuram@xxxxxxx>
> > > Cc: Matthew Brost <matthew.brost@xxxxxxxxx>
> > > Cc: <dri-devel@xxxxxxxxxxxxxxxxxxxxx>
> > > Signed-off-by: Thomas Hellström
> > > <thomas.hellstrom@xxxxxxxxxxxxxxx>
> > > Reviewed-by: Matthew Brost <matthew.brost@xxxxxxxxx>
> > 
> > I've tried to wrap my head around all of this like twenty times in
> > the 
> > last three month, but was always interrupted at some point.
> > 
> > Feel free to add Acked-by: Christian Koenig
> > <christian.koenig@xxxxxxx>.
> > 
> > Sorry,
> > Christian.
> 
> Thanks a lot for all reviewing and comments so far. There are two TTM
> patches left in the series that don't have an ack by you:
> 
> https://patchwork.freedesktop.org/patch/634715/?series=131815&rev=17
> and
> 
> https://patchwork.freedesktop.org/patch/634716/?series=131815&rev=17
> 
> None of them particularly big considering the amount of doc text.
> 
> It'd be great if those could have an ack as well so we could finally
> merge this series.
> 
> Thanks,
> Thomas
> 
> 
> 
> 
> 
> 
> > 
> > > ---
> > >   drivers/gpu/drm/ttm/ttm_pool.c | 554
> > > +++++++++++++++++++++++++++++----
> > >   drivers/gpu/drm/ttm/ttm_tt.c   |  54 ++++
> > >   include/drm/ttm/ttm_pool.h     |   8 +
> > >   include/drm/ttm/ttm_tt.h       |  67 +++-
> > >   4 files changed, 629 insertions(+), 54 deletions(-)
> > > 
> > > diff --git a/drivers/gpu/drm/ttm/ttm_pool.c
> > > b/drivers/gpu/drm/ttm/ttm_pool.c
> > > index c9eba76d5143..ffb7abf52bab 100644
> > > --- a/drivers/gpu/drm/ttm/ttm_pool.c
> > > +++ b/drivers/gpu/drm/ttm/ttm_pool.c
> > > @@ -41,6 +41,7 @@
> > >   #include <asm/set_memory.h>
> > >   #endif
> > >   
> > > +#include <drm/ttm/ttm_backup.h>
> > >   #include <drm/ttm/ttm_pool.h>
> > >   #include <drm/ttm/ttm_tt.h>
> > >   #include <drm/ttm/ttm_bo.h>
> > > @@ -75,6 +76,35 @@ struct ttm_pool_alloc_state {
> > >   	enum ttm_caching tt_caching;
> > >   };
> > >   
> > > +/**
> > > + * struct ttm_pool_tt_restore - State representing restore from
> > > backup
> > > + * @pool: The pool used for page allocation while restoring.
> > > + * @snapshot_alloc: A snapshot of the most recent struct
> > > ttm_pool_alloc_state.
> > > + * @alloced_page: Pointer to the page most recently allocated
> > > from
> > > a pool or system.
> > > + * @first_dma: The dma address corresponding to @alloced_page if
> > > dma_mapping
> > > + * is requested.
> > > + * @alloced_pages: The number of allocated pages present in the
> > > struct ttm_tt
> > > + * page vector from this restore session.
> > > + * @restored_pages: The number of 4K pages restored for
> > > @alloced_page (which
> > > + * is typically a multi-order page).
> > > + * @page_caching: The struct ttm_tt requested caching
> > > + * @order: The order of @alloced_page.
> > > + *
> > > + * Recovery from backup might fail when we've recovered less
> > > than
> > > the
> > > + * full ttm_tt. In order not to loose any data (yet), keep
> > > information
> > > + * around that allows us to restart a failed ttm backup
> > > recovery.
> > > + */
> > > +struct ttm_pool_tt_restore {
> > > +	struct ttm_pool *pool;
> > > +	struct ttm_pool_alloc_state snapshot_alloc;
> > > +	struct page *alloced_page;
> > > +	dma_addr_t first_dma;
> > > +	pgoff_t alloced_pages;
> > > +	pgoff_t restored_pages;
> > > +	enum ttm_caching page_caching;
> > > +	unsigned int order;
> > > +};
> > > +
> > >   static unsigned long page_pool_size;
> > >   
> > >   MODULE_PARM_DESC(page_pool_size, "Number of pages in the
> > > WC/UC/DMA pool");
> > > @@ -199,12 +229,11 @@ static int ttm_pool_apply_caching(struct
> > > ttm_pool_alloc_state *alloc)
> > >   	return 0;
> > >   }
> > >   
> > > -/* Map pages of 1 << order size and fill the DMA address array 
> > > */
> > > +/* DMA Map pages of 1 << order size and return the resulting
> > > dma_address. */
> > >   static int ttm_pool_map(struct ttm_pool *pool, unsigned int
> > > order,
> > > -			struct page *p, dma_addr_t **dma_addr)
> > > +			struct page *p, dma_addr_t *dma_addr)
> > >   {
> > >   	dma_addr_t addr;
> > > -	unsigned int i;
> > >   
> > >   	if (pool->use_dma_alloc) {
> > >   		struct ttm_pool_dma *dma = (void *)p->private;
> > > @@ -218,10 +247,7 @@ static int ttm_pool_map(struct ttm_pool
> > > *pool,
> > > unsigned int order,
> > >   			return -EFAULT;
> > >   	}
> > >   
> > > -	for (i = 1 << order; i ; --i) {
> > > -		*(*dma_addr)++ = addr;
> > > -		addr += PAGE_SIZE;
> > > -	}
> > > +	*dma_addr = addr;
> > >   
> > >   	return 0;
> > >   }
> > > @@ -371,6 +397,190 @@ static unsigned int
> > > ttm_pool_page_order(struct ttm_pool *pool, struct page *p)
> > >   	return p->private;
> > >   }
> > >   
> > > +/*
> > > + * Split larger pages so that we can free each PAGE_SIZE page as
> > > soon
> > > + * as it has been backed up, in order to avoid memory pressure
> > > during
> > > + * reclaim.
> > > + */
> > > +static void ttm_pool_split_for_swap(struct ttm_pool *pool,
> > > struct
> > > page *p)
> > > +{
> > > +	unsigned int order = ttm_pool_page_order(pool, p);
> > > +	pgoff_t nr;
> > > +
> > > +	if (!order)
> > > +		return;
> > > +
> > > +	split_page(p, order);
> > > +	nr = 1UL << order;
> > > +	while (nr--)
> > > +		(p++)->private = 0;
> > > +}
> > > +
> > > +/**
> > > + * DOC: Partial backup and restoration of a struct ttm_tt.
> > > + *
> > > + * Swapout using ttm_backup_backup_page() and swapin using
> > > + * ttm_backup_copy_page() may fail.
> > > + * The former most likely due to lack of swap-space or memory,
> > > the
> > > latter due
> > > + * to lack of memory or because of signal interruption during
> > > waits.
> > > + *
> > > + * Backup failure is easily handled by using a ttm_tt pages
> > > vector
> > > that holds
> > > + * both backup handles and page pointers. This has to be taken
> > > into account when
> > > + * restoring such a ttm_tt from backup, and when freeing it
> > > while
> > > backed up.
> > > + * When restoring, for simplicity, new pages are actually
> > > allocated from the
> > > + * pool and the contents of any old pages are copied in and then
> > > the old pages
> > > + * are released.
> > > + *
> > > + * For restoration failures, the struct ttm_pool_tt_restore
> > > holds
> > > sufficient state
> > > + * to be able to resume an interrupted restore, and that
> > > structure
> > > is freed once
> > > + * the restoration is complete. If the struct ttm_tt is
> > > destroyed
> > > while there
> > > + * is a valid struct ttm_pool_tt_restore attached, that is also
> > > properly taken
> > > + * care of.
> > > + */
> > > +
> > > +/* Is restore ongoing for the currently allocated page? */
> > > +static bool ttm_pool_restore_valid(const struct
> > > ttm_pool_tt_restore *restore)
> > > +{
> > > +	return restore && restore->restored_pages < (1 <<
> > > restore-
> > > > order);
> > > +}
> > > +
> > > +/* DMA unmap and free a multi-order page, either to the relevant
> > > pool or to system. */
> > > +static pgoff_t ttm_pool_unmap_and_free(struct ttm_pool *pool,
> > > struct page *page,
> > > +				       const dma_addr_t
> > > *dma_addr,
> > > enum ttm_caching caching)
> > > +{
> > > +	struct ttm_pool_type *pt = NULL;
> > > +	unsigned int order;
> > > +	pgoff_t nr;
> > > +
> > > +	if (pool) {
> > > +		order = ttm_pool_page_order(pool, page);
> > > +		nr = (1UL << order);
> > > +		if (dma_addr)
> > > +			ttm_pool_unmap(pool, *dma_addr, nr);
> > > +
> > > +		pt = ttm_pool_select_type(pool, caching, order);
> > > +	} else {
> > > +		order = page->private;
> > > +		nr = (1UL << order);
> > > +	}
> > > +
> > > +	if (pt)
> > > +		ttm_pool_type_give(pt, page);
> > > +	else
> > > +		ttm_pool_free_page(pool, caching, order, page);
> > > +
> > > +	return nr;
> > > +}
> > > +
> > > +/* Populate the page-array using the most recent allocated
> > > multi-
> > > order page. */
> > > +static void ttm_pool_allocated_page_commit(struct page
> > > *allocated,
> > > +					   dma_addr_t first_dma,
> > > +					   struct
> > > ttm_pool_alloc_state *alloc,
> > > +					   pgoff_t nr)
> > > +{
> > > +	pgoff_t i;
> > > +
> > > +	for (i = 0; i < nr; ++i)
> > > +		*alloc->pages++ = allocated++;
> > > +
> > > +	alloc->remaining_pages -= nr;
> > > +
> > > +	if (!alloc->dma_addr)
> > > +		return;
> > > +
> > > +	for (i = 0; i < nr; ++i) {
> > > +		*alloc->dma_addr++ = first_dma;
> > > +		first_dma += PAGE_SIZE;
> > > +	}
> > > +}
> > > +
> > > +/*
> > > + * When restoring, restore backed-up content to the newly
> > > allocated page and
> > > + * if successful, populate the page-table and dma-address
> > > arrays.
> > > + */
> > > +static int ttm_pool_restore_commit(struct ttm_pool_tt_restore
> > > *restore,
> > > +				   struct ttm_backup *backup,
> > > +				   const struct
> > > ttm_operation_ctx
> > > *ctx,
> > > +				   struct ttm_pool_alloc_state
> > > *alloc)
> > > +
> > > +{
> > > +	pgoff_t i, nr = 1UL << restore->order;
> > > +	struct page **first_page = alloc->pages;
> > > +	struct page *p;
> > > +	int ret = 0;
> > > +
> > > +	for (i = restore->restored_pages; i < nr; ++i) {
> > > +		p = first_page[i];
> > > +		if (ttm_backup_page_ptr_is_handle(p)) {
> > > +			unsigned long handle =
> > > ttm_backup_page_ptr_to_handle(p);
> > > +
> > > +			if (handle == 0) {
> > > +				restore->restored_pages++;
> > > +				continue;
> > > +			}
> > > +
> > > +			ret = ttm_backup_copy_page(backup,
> > > restore->alloced_page + i,
> > > +						   handle, ctx-
> > > > interruptible);
> > > +			if (ret)
> > > +				break;
> > > +
> > > +			ttm_backup_drop(backup, handle);
> > > +		} else if (p) {
> > > +			/*
> > > +			 * We could probably avoid splitting the
> > > old page
> > > +			 * using clever logic, but ATM we don't
> > > care, as
> > > +			 * we prioritize releasing memory ASAP.
> > > Note that
> > > +			 * here, the old retained page is always
> > > write-back
> > > +			 * cached.
> > > +			 */
> > > +			ttm_pool_split_for_swap(restore->pool,
> > > p);
> > > +			copy_highpage(restore->alloced_page + i,
> > > p);
> > > +			__free_pages(p, 0);
> > > +		}
> > > +
> > > +		restore->restored_pages++;
> > > +		first_page[i] =
> > > ttm_backup_handle_to_page_ptr(0);
> > > +	}
> > > +
> > > +	if (ret) {
> > > +		if (!restore->restored_pages) {
> > > +			dma_addr_t *dma_addr = alloc->dma_addr ?
> > > &restore->first_dma : NULL;
> > > +
> > > +			ttm_pool_unmap_and_free(restore->pool,
> > > restore->alloced_page,
> > > +						dma_addr,
> > > restore-
> > > > page_caching);
> > > +			restore->restored_pages = nr;
> > > +		}
> > > +		return ret;
> > > +	}
> > > +
> > > +	ttm_pool_allocated_page_commit(restore->alloced_page,
> > > restore->first_dma,
> > > +				       alloc, nr);
> > > +	if (restore->page_caching == alloc->tt_caching ||
> > > PageHighMem(restore->alloced_page))
> > > +		alloc->caching_divide = alloc->pages;
> > > +	restore->snapshot_alloc = *alloc;
> > > +	restore->alloced_pages += nr;
> > > +
> > > +	return 0;
> > > +}
> > > +
> > > +/* If restoring, save information needed for
> > > ttm_pool_restore_commit(). */
> > > +static void
> > > +ttm_pool_page_allocated_restore(struct ttm_pool *pool, unsigned
> > > int order,
> > > +				struct page *p,
> > > +				enum ttm_caching page_caching,
> > > +				dma_addr_t first_dma,
> > > +				struct ttm_pool_tt_restore
> > > *restore,
> > > +				const struct
> > > ttm_pool_alloc_state
> > > *alloc)
> > > +{
> > > +	restore->pool = pool;
> > > +	restore->order = order;
> > > +	restore->restored_pages = 0;
> > > +	restore->page_caching = page_caching;
> > > +	restore->first_dma = first_dma;
> > > +	restore->alloced_page = p;
> > > +	restore->snapshot_alloc = *alloc;
> > > +}
> > > +
> > >   /*
> > >    * Called when we got a page, either from a pool or newly
> > > allocated.
> > >    * if needed, dma map the page and populate the dma address
> > > array.
> > > @@ -380,10 +590,11 @@ static unsigned int
> > > ttm_pool_page_order(struct ttm_pool *pool, struct page *p)
> > >    */
> > >   static int ttm_pool_page_allocated(struct ttm_pool *pool,
> > > unsigned int order,
> > >   				   struct page *p, enum
> > > ttm_caching page_caching,
> > > -				   struct ttm_pool_alloc_state
> > > *alloc)
> > > +				   struct ttm_pool_alloc_state
> > > *alloc,
> > > +				   struct ttm_pool_tt_restore
> > > *restore)
> > >   {
> > > -	pgoff_t i, nr = 1UL << order;
> > >   	bool caching_consistent;
> > > +	dma_addr_t first_dma;
> > >   	int r = 0;
> > >   
> > >   	caching_consistent = (page_caching == alloc->tt_caching)
> > > > > PageHighMem(p);
> > > @@ -395,17 +606,20 @@ static int ttm_pool_page_allocated(struct
> > > ttm_pool *pool, unsigned int order,
> > >   	}
> > >   
> > >   	if (alloc->dma_addr) {
> > > -		r = ttm_pool_map(pool, order, p, &alloc-
> > > > dma_addr);
> > > +		r = ttm_pool_map(pool, order, p, &first_dma);
> > >   		if (r)
> > >   			return r;
> > >   	}
> > >   
> > > -	alloc->remaining_pages -= nr;
> > > -	for (i = 0; i < nr; ++i)
> > > -		*alloc->pages++ = p++;
> > > +	if (restore) {
> > > +		ttm_pool_page_allocated_restore(pool, order, p,
> > > page_caching,
> > > +						first_dma,
> > > restore, alloc);
> > > +	} else {
> > > +		ttm_pool_allocated_page_commit(p, first_dma,
> > > alloc, 1UL << order);
> > >   
> > > -	if (caching_consistent)
> > > -		alloc->caching_divide = alloc->pages;
> > > +		if (caching_consistent)
> > > +			alloc->caching_divide = alloc->pages;
> > > +	}
> > >   
> > >   	return 0;
> > >   }
> > > @@ -428,22 +642,24 @@ static void ttm_pool_free_range(struct
> > > ttm_pool *pool, struct ttm_tt *tt,
> > >   				pgoff_t start_page, pgoff_t
> > > end_page)
> > >   {
> > >   	struct page **pages = &tt->pages[start_page];
> > > -	unsigned int order;
> > > +	struct ttm_backup *backup = tt->backup;
> > >   	pgoff_t i, nr;
> > >   
> > >   	for (i = start_page; i < end_page; i += nr, pages += nr)
> > > {
> > > -		struct ttm_pool_type *pt = NULL;
> > > +		struct page *p = *pages;
> > >   
> > > -		order = ttm_pool_page_order(pool, *pages);
> > > -		nr = (1UL << order);
> > > -		if (tt->dma_address)
> > > -			ttm_pool_unmap(pool, tt->dma_address[i],
> > > nr);
> > > +		nr = 1;
> > > +		if (ttm_backup_page_ptr_is_handle(p)) {
> > > +			unsigned long handle =
> > > ttm_backup_page_ptr_to_handle(p);
> > >   
> > > -		pt = ttm_pool_select_type(pool, caching, order);
> > > -		if (pt)
> > > -			ttm_pool_type_give(pt, *pages);
> > > -		else
> > > -			ttm_pool_free_page(pool, caching, order,
> > > *pages);
> > > +			if (handle != 0)
> > > +				ttm_backup_drop(backup, handle);
> > > +		} else if (p) {
> > > +			dma_addr_t *dma_addr = tt->dma_address ?
> > > +				tt->dma_address + i : NULL;
> > > +
> > > +			nr = ttm_pool_unmap_and_free(pool, p,
> > > dma_addr, caching);
> > > +		}
> > >   	}
> > >   }
> > >   
> > > @@ -467,22 +683,11 @@ static unsigned int
> > > ttm_pool_alloc_find_order(unsigned int highest,
> > >   	return min_t(unsigned int, highest, __fls(alloc-
> > > > remaining_pages));
> > >   }
> > >   
> > > -/**
> > > - * ttm_pool_alloc - Fill a ttm_tt object
> > > - *
> > > - * @pool: ttm_pool to use
> > > - * @tt: ttm_tt object to fill
> > > - * @ctx: operation context
> > > - *
> > > - * Fill the ttm_tt object with pages and also make sure to DMA
> > > map
> > > them when
> > > - * necessary.
> > > - *
> > > - * Returns: 0 on successe, negative error code otherwise.
> > > - */
> > > -int ttm_pool_alloc(struct ttm_pool *pool, struct ttm_tt *tt,
> > > -		   struct ttm_operation_ctx *ctx)
> > > +static int __ttm_pool_alloc(struct ttm_pool *pool, struct ttm_tt
> > > *tt,
> > > +			    const struct ttm_operation_ctx *ctx,
> > > +			    struct ttm_pool_alloc_state *alloc,
> > > +			    struct ttm_pool_tt_restore *restore)
> > >   {
> > > -	struct ttm_pool_alloc_state alloc;
> > >   	enum ttm_caching page_caching;
> > >   	gfp_t gfp_flags = GFP_USER;
> > >   	pgoff_t caching_divide;
> > > @@ -491,10 +696,8 @@ int ttm_pool_alloc(struct ttm_pool *pool,
> > > struct ttm_tt *tt,
> > >   	struct page *p;
> > >   	int r;
> > >   
> > > -	ttm_pool_alloc_state_init(tt, &alloc);
> > > -
> > > -	WARN_ON(!alloc.remaining_pages ||
> > > ttm_tt_is_populated(tt));
> > > -	WARN_ON(alloc.dma_addr && !pool->dev);
> > > +	WARN_ON(!alloc->remaining_pages ||
> > > ttm_tt_is_populated(tt));
> > > +	WARN_ON(alloc->dma_addr && !pool->dev);
> > >   
> > >   	if (tt->page_flags & TTM_TT_FLAG_ZERO_ALLOC)
> > >   		gfp_flags |= __GFP_ZERO;
> > > @@ -509,9 +712,9 @@ int ttm_pool_alloc(struct ttm_pool *pool,
> > > struct ttm_tt *tt,
> > >   
> > >   	page_caching = tt->caching;
> > >   	allow_pools = true;
> > > -	for (order = ttm_pool_alloc_find_order(MAX_PAGE_ORDER,
> > > &alloc);
> > > -	     alloc.remaining_pages;
> > > -	     order = ttm_pool_alloc_find_order(order, &alloc)) {
> > > +	for (order = ttm_pool_alloc_find_order(MAX_PAGE_ORDER,
> > > alloc);
> > > +	     alloc->remaining_pages;
> > > +	     order = ttm_pool_alloc_find_order(order, alloc)) {
> > >   		struct ttm_pool_type *pt;
> > >   
> > >   		/* First, try to allocate a page from a pool if
> > > one exists. */
> > > @@ -541,30 +744,120 @@ int ttm_pool_alloc(struct ttm_pool *pool,
> > > struct ttm_tt *tt,
> > >   			r = -ENOMEM;
> > >   			goto error_free_all;
> > >   		}
> > > -		r = ttm_pool_page_allocated(pool, order, p,
> > > page_caching, &alloc);
> > > +		r = ttm_pool_page_allocated(pool, order, p,
> > > page_caching, alloc,
> > > +					    restore);
> > >   		if (r)
> > >   			goto error_free_page;
> > > +
> > > +		if (ttm_pool_restore_valid(restore)) {
> > > +			r = ttm_pool_restore_commit(restore, tt-
> > > > backup, ctx, alloc);
> > > +			if (r)
> > > +				goto error_free_all;
> > > +		}
> > >   	}
> > >   
> > > -	r = ttm_pool_apply_caching(&alloc);
> > > +	r = ttm_pool_apply_caching(alloc);
> > >   	if (r)
> > >   		goto error_free_all;
> > >   
> > > +	kfree(tt->restore);
> > > +	tt->restore = NULL;
> > > +
> > >   	return 0;
> > >   
> > >   error_free_page:
> > >   	ttm_pool_free_page(pool, page_caching, order, p);
> > >   
> > >   error_free_all:
> > > -	caching_divide = alloc.caching_divide - tt->pages;
> > > +	if (tt->restore)
> > > +		return r;
> > > +
> > > +	caching_divide = alloc->caching_divide - tt->pages;
> > >   	ttm_pool_free_range(pool, tt, tt->caching, 0,
> > > caching_divide);
> > >   	ttm_pool_free_range(pool, tt, ttm_cached,
> > > caching_divide,
> > > -			    tt->num_pages -
> > > alloc.remaining_pages);
> > > +			    tt->num_pages - alloc-
> > > > remaining_pages);
> > >   
> > >   	return r;
> > >   }
> > > +
> > > +/**
> > > + * ttm_pool_alloc - Fill a ttm_tt object
> > > + *
> > > + * @pool: ttm_pool to use
> > > + * @tt: ttm_tt object to fill
> > > + * @ctx: operation context
> > > + *
> > > + * Fill the ttm_tt object with pages and also make sure to DMA
> > > map
> > > them when
> > > + * necessary.
> > > + *
> > > + * Returns: 0 on successe, negative error code otherwise.
> > > + */
> > > +int ttm_pool_alloc(struct ttm_pool *pool, struct ttm_tt *tt,
> > > +		   struct ttm_operation_ctx *ctx)
> > > +{
> > > +	struct ttm_pool_alloc_state alloc;
> > > +
> > > +	if (WARN_ON(ttm_tt_is_backed_up(tt)))
> > > +		return -EINVAL;
> > > +
> > > +	ttm_pool_alloc_state_init(tt, &alloc);
> > > +
> > > +	return __ttm_pool_alloc(pool, tt, ctx, &alloc, NULL);
> > > +}
> > >   EXPORT_SYMBOL(ttm_pool_alloc);
> > >   
> > > +/**
> > > + * ttm_pool_restore_and_alloc - Fill a ttm_tt, restoring
> > > previously backed-up
> > > + * content.
> > > + *
> > > + * @pool: ttm_pool to use
> > > + * @tt: ttm_tt object to fill
> > > + * @ctx: operation context
> > > + *
> > > + * Fill the ttm_tt object with pages and also make sure to DMA
> > > map
> > > them when
> > > + * necessary. Read in backed-up content.
> > > + *
> > > + * Returns: 0 on successe, negative error code otherwise.
> > > + */
> > > +int ttm_pool_restore_and_alloc(struct ttm_pool *pool, struct
> > > ttm_tt *tt,
> > > +			       const struct ttm_operation_ctx
> > > *ctx)
> > > +{
> > > +	struct ttm_pool_alloc_state alloc;
> > > +
> > > +	if (WARN_ON(!ttm_tt_is_backed_up(tt)))
> > > +		return -EINVAL;
> > > +
> > > +	if (!tt->restore) {
> > > +		gfp_t gfp = GFP_KERNEL | __GFP_NOWARN;
> > > +
> > > +		ttm_pool_alloc_state_init(tt, &alloc);
> > > +		if (ctx->gfp_retry_mayfail)
> > > +			gfp |= __GFP_RETRY_MAYFAIL;
> > > +
> > > +		tt->restore = kzalloc(sizeof(*tt->restore),
> > > gfp);
> > > +		if (!tt->restore)
> > > +			return -ENOMEM;
> > > +
> > > +		tt->restore->snapshot_alloc = alloc;
> > > +		tt->restore->pool = pool;
> > > +		tt->restore->restored_pages = 1;
> > > +	} else {
> > > +		struct ttm_pool_tt_restore *restore = tt-
> > > >restore;
> > > +		int ret;
> > > +
> > > +		alloc = restore->snapshot_alloc;
> > > +		if (ttm_pool_restore_valid(tt->restore)) {
> > > +			ret = ttm_pool_restore_commit(restore,
> > > tt-
> > > > backup, ctx, &alloc);
> > > +			if (ret)
> > > +				return ret;
> > > +		}
> > > +		if (!alloc.remaining_pages)
> > > +			return 0;
> > > +	}
> > > +
> > > +	return __ttm_pool_alloc(pool, tt, ctx, &alloc, tt-
> > > > restore);
> > > +}
> > > +
> > >   /**
> > >    * ttm_pool_free - Free the backing pages from a ttm_tt object
> > >    *
> > > @@ -582,6 +875,163 @@ void ttm_pool_free(struct ttm_pool *pool,
> > > struct ttm_tt *tt)
> > >   }
> > >   EXPORT_SYMBOL(ttm_pool_free);
> > >   
> > > +/**
> > > + * ttm_pool_drop_backed_up() - Release content of a swapped-out
> > > struct ttm_tt
> > > + * @tt: The struct ttm_tt.
> > > + *
> > > + * Release handles with associated content or any remaining
> > > pages
> > > of
> > > + * a backed-up struct ttm_tt.
> > > + */
> > > +void ttm_pool_drop_backed_up(struct ttm_tt *tt)
> > > +{
> > > +	struct ttm_pool_tt_restore *restore;
> > > +	pgoff_t start_page = 0;
> > > +
> > > +	WARN_ON(!ttm_tt_is_backed_up(tt));
> > > +
> > > +	restore = tt->restore;
> > > +
> > > +	/*
> > > +	 * Unmap and free any uncommitted restore page.
> > > +	 * any tt page-array backup entries already read back
> > > has
> > > +	 * been cleared already
> > > +	 */
> > > +	if (ttm_pool_restore_valid(restore)) {
> > > +		dma_addr_t *dma_addr = tt->dma_address ?
> > > &restore-
> > > > first_dma : NULL;
> > > +
> > > +		ttm_pool_unmap_and_free(restore->pool, restore-
> > > > alloced_page,
> > > +					dma_addr, restore-
> > > > page_caching);
> > > +		restore->restored_pages = 1UL << restore->order;
> > > +	}
> > > +
> > > +	/*
> > > +	 * If a restore is ongoing, part of the tt pages may
> > > have
> > > a
> > > +	 * caching different than writeback.
> > > +	 */
> > > +	if (restore) {
> > > +		pgoff_t mid = restore-
> > > > snapshot_alloc.caching_divide - tt->pages;
> > > +
> > > +		start_page = restore->alloced_pages;
> > > +		WARN_ON(mid > start_page);
> > > +		/* Pages that might be dma-mapped and non-cached
> > > */
> > > +		ttm_pool_free_range(restore->pool, tt, tt-
> > > > caching,
> > > +				    0, mid);
> > > +		/* Pages that might be dma-mapped but cached */
> > > +		ttm_pool_free_range(restore->pool, tt,
> > > ttm_cached,
> > > +				    mid, restore-
> > > >alloced_pages);
> > > +		kfree(restore);
> > > +		tt->restore = NULL;
> > > +	}
> > > +
> > > +	ttm_pool_free_range(NULL, tt, ttm_cached, start_page,
> > > tt-
> > > > num_pages);
> > > +}
> > > +
> > > +/**
> > > + * ttm_pool_backup() - Back up or purge a struct ttm_tt
> > > + * @pool: The pool used when allocating the struct ttm_tt.
> > > + * @tt: The struct ttm_tt.
> > > + * @flags: Flags to govern the backup behaviour.
> > > + *
> > > + * Back up or purge a struct ttm_tt. If @purge is true, then
> > > + * all pages will be freed directly to the system rather than to
> > > the pool
> > > + * they were allocated from, making the function behave
> > > similarly
> > > to
> > > + * ttm_pool_free(). If @purge is false the pages will be backed
> > > up
> > > instead,
> > > + * exchanged for handles.
> > > + * A subsequent call to ttm_pool_restore_and_alloc() will then
> > > read back the content and
> > > + * a subsequent call to ttm_pool_drop_backed_up() will drop it.
> > > + * If backup of a page fails for whatever reason, @ttm will
> > > still
> > > be
> > > + * partially backed up, retaining those pages for which backup
> > > fails.
> > > + * In that case, this function can be retried, possibly after
> > > freeing up
> > > + * memory resources.
> > > + *
> > > + * Return: Number of pages actually backed up or freed, or
> > > negative
> > > + * error code on error.
> > > + */
> > > +long ttm_pool_backup(struct ttm_pool *pool, struct ttm_tt *tt,
> > > +		     const struct ttm_backup_flags *flags)
> > > +{
> > > +	struct ttm_backup *backup = tt->backup;
> > > +	struct page *page;
> > > +	unsigned long handle;
> > > +	gfp_t alloc_gfp;
> > > +	gfp_t gfp;
> > > +	int ret = 0;
> > > +	pgoff_t shrunken = 0;
> > > +	pgoff_t i, num_pages;
> > > +
> > > +	if (WARN_ON(ttm_tt_is_backed_up(tt)))
> > > +		return -EINVAL;
> > > +
> > > +	if ((!ttm_backup_bytes_avail() && !flags->purge) ||
> > > +	    pool->use_dma_alloc || ttm_tt_is_backed_up(tt))
> > > +		return -EBUSY;
> > > +
> > > +#ifdef CONFIG_X86
> > > +	/* Anything returned to the system needs to be cached.
> > > */
> > > +	if (tt->caching != ttm_cached)
> > > +		set_pages_array_wb(tt->pages, tt->num_pages);
> > > +#endif
> > > +
> > > +	if (tt->dma_address || flags->purge) {
> > > +		for (i = 0; i < tt->num_pages; i += num_pages) {
> > > +			unsigned int order;
> > > +
> > > +			page = tt->pages[i];
> > > +			if (unlikely(!page)) {
> > > +				num_pages = 1;
> > > +				continue;
> > > +			}
> > > +
> > > +			order = ttm_pool_page_order(pool, page);
> > > +			num_pages = 1UL << order;
> > > +			if (tt->dma_address)
> > > +				ttm_pool_unmap(pool, tt-
> > > > dma_address[i],
> > > +					       num_pages);
> > > +			if (flags->purge) {
> > > +				shrunken += num_pages;
> > > +				page->private = 0;
> > > +				__free_pages(page, order);
> > > +				memset(tt->pages + i, 0,
> > > +				       num_pages * sizeof(*tt-
> > > > pages));
> > > +			}
> > > +		}
> > > +	}
> > > +
> > > +	if (flags->purge)
> > > +		return shrunken;
> > > +
> > > +	if (pool->use_dma32)
> > > +		gfp = GFP_DMA32;
> > > +	else
> > > +		gfp = GFP_HIGHUSER;
> > > +
> > > +	alloc_gfp = GFP_KERNEL | __GFP_HIGH | __GFP_NOWARN |
> > > __GFP_RETRY_MAYFAIL;
> > > +
> > > +	for (i = 0; i < tt->num_pages; ++i) {
> > > +		s64 shandle;
> > > +
> > > +		page = tt->pages[i];
> > > +		if (unlikely(!page))
> > > +			continue;
> > > +
> > > +		ttm_pool_split_for_swap(pool, page);
> > > +
> > > +		shandle = ttm_backup_backup_page(backup, page,
> > > flags->writeback, i,
> > > +						 gfp,
> > > alloc_gfp);
> > > +		if (shandle < 0) {
> > > +			/* We allow partially shrunken tts */
> > > +			ret = shandle;
> > > +			break;
> > > +		}
> > > +		handle = shandle;
> > > +		tt->pages[i] =
> > > ttm_backup_handle_to_page_ptr(handle);
> > > +		put_page(page);
> > > +		shrunken++;
> > > +	}
> > > +
> > > +	return shrunken ? shrunken : ret;
> > > +}
> > > +
> > >   /**
> > >    * ttm_pool_init - Initialize a pool
> > >    *
> > > diff --git a/drivers/gpu/drm/ttm/ttm_tt.c
> > > b/drivers/gpu/drm/ttm/ttm_tt.c
> > > index 3baf215eca23..00b7c28f2329 100644
> > > --- a/drivers/gpu/drm/ttm/ttm_tt.c
> > > +++ b/drivers/gpu/drm/ttm/ttm_tt.c
> > > @@ -40,6 +40,7 @@
> > >   #include <drm/drm_cache.h>
> > >   #include <drm/drm_device.h>
> > >   #include <drm/drm_util.h>
> > > +#include <drm/ttm/ttm_backup.h>
> > >   #include <drm/ttm/ttm_bo.h>
> > >   #include <drm/ttm/ttm_tt.h>
> > >   
> > > @@ -158,6 +159,8 @@ static void ttm_tt_init_fields(struct ttm_tt
> > > *ttm,
> > >   	ttm->swap_storage = NULL;
> > >   	ttm->sg = bo->sg;
> > >   	ttm->caching = caching;
> > > +	ttm->restore = NULL;
> > > +	ttm->backup = NULL;
> > >   }
> > >   
> > >   int ttm_tt_init(struct ttm_tt *ttm, struct ttm_buffer_object
> > > *bo,
> > > @@ -182,6 +185,13 @@ void ttm_tt_fini(struct ttm_tt *ttm)
> > >   		fput(ttm->swap_storage);
> > >   	ttm->swap_storage = NULL;
> > >   
> > > +	if (ttm_tt_is_backed_up(ttm))
> > > +		ttm_pool_drop_backed_up(ttm);
> > > +	if (ttm->backup) {
> > > +		ttm_backup_fini(ttm->backup);
> > > +		ttm->backup = NULL;
> > > +	}
> > > +
> > >   	if (ttm->pages)
> > >   		kvfree(ttm->pages);
> > >   	else
> > > @@ -253,6 +263,49 @@ int ttm_tt_swapin(struct ttm_tt *ttm)
> > >   }
> > >   EXPORT_SYMBOL_FOR_TESTS_ONLY(ttm_tt_swapin);
> > >   
> > > +/**
> > > + * ttm_tt_backup() - Helper to back up a struct ttm_tt.
> > > + * @bdev: The TTM device.
> > > + * @tt: The struct ttm_tt.
> > > + * @flags: Flags that govern the backup behaviour.
> > > + *
> > > + * Update the page accounting and call ttm_pool_shrink_tt to
> > > free
> > > pages
> > > + * or back them up.
> > > + *
> > > + * Return: Number of pages freed or swapped out, or negative
> > > error
> > > code on
> > > + * error.
> > > + */
> > > +long ttm_tt_backup(struct ttm_device *bdev, struct ttm_tt *tt,
> > > +		   const struct ttm_backup_flags flags)
> > > +{
> > > +	long ret;
> > > +
> > > +	if (WARN_ON(IS_ERR_OR_NULL(tt->backup)))
> > > +		return 0;
> > > +
> > > +	ret = ttm_pool_backup(&bdev->pool, tt, &flags);
> > > +	if (ret > 0) {
> > > +		tt->page_flags &= ~TTM_TT_FLAG_PRIV_POPULATED;
> > > +		tt->page_flags |= TTM_TT_FLAG_BACKED_UP;
> > > +	}
> > > +
> > > +	return ret;
> > > +}
> > > +
> > > +int ttm_tt_restore(struct ttm_device *bdev, struct ttm_tt *tt,
> > > +		   const struct ttm_operation_ctx *ctx)
> > > +{
> > > +	int ret = ttm_pool_restore_and_alloc(&bdev->pool, tt,
> > > ctx);
> > > +
> > > +	if (ret)
> > > +		return ret;
> > > +
> > > +	tt->page_flags &= ~TTM_TT_FLAG_BACKED_UP;
> > > +
> > > +	return 0;
> > > +}
> > > +EXPORT_SYMBOL(ttm_tt_restore);
> > > +
> > >   /**
> > >    * ttm_tt_swapout - swap out tt object
> > >    *
> > > @@ -348,6 +401,7 @@ int ttm_tt_populate(struct ttm_device *bdev,
> > >   		goto error;
> > >   
> > >   	ttm->page_flags |= TTM_TT_FLAG_PRIV_POPULATED;
> > > +	ttm->page_flags &= ~TTM_TT_FLAG_BACKED_UP;
> > >   	if (unlikely(ttm->page_flags & TTM_TT_FLAG_SWAPPED)) {
> > >   		ret = ttm_tt_swapin(ttm);
> > >   		if (unlikely(ret != 0)) {
> > > diff --git a/include/drm/ttm/ttm_pool.h
> > > b/include/drm/ttm/ttm_pool.h
> > > index 160d954a261e..54cd34a6e4c0 100644
> > > --- a/include/drm/ttm/ttm_pool.h
> > > +++ b/include/drm/ttm/ttm_pool.h
> > > @@ -33,6 +33,7 @@
> > >   
> > >   struct device;
> > >   struct seq_file;
> > > +struct ttm_backup_flags;
> > >   struct ttm_operation_ctx;
> > >   struct ttm_pool;
> > >   struct ttm_tt;
> > > @@ -89,6 +90,13 @@ void ttm_pool_fini(struct ttm_pool *pool);
> > >   
> > >   int ttm_pool_debugfs(struct ttm_pool *pool, struct seq_file
> > > *m);
> > >   
> > > +void ttm_pool_drop_backed_up(struct ttm_tt *tt);
> > > +
> > > +long ttm_pool_backup(struct ttm_pool *pool, struct ttm_tt *ttm,
> > > +		     const struct ttm_backup_flags *flags);
> > > +int ttm_pool_restore_and_alloc(struct ttm_pool *pool, struct
> > > ttm_tt *tt,
> > > +			       const struct ttm_operation_ctx
> > > *ctx);
> > > +
> > >   int ttm_pool_mgr_init(unsigned long num_pages);
> > >   void ttm_pool_mgr_fini(void);
> > >   
> > > diff --git a/include/drm/ttm/ttm_tt.h b/include/drm/ttm/ttm_tt.h
> > > index 991edafdb2dd..c736c01ac2ca 100644
> > > --- a/include/drm/ttm/ttm_tt.h
> > > +++ b/include/drm/ttm/ttm_tt.h
> > > @@ -32,11 +32,13 @@
> > >   #include <drm/ttm/ttm_caching.h>
> > >   #include <drm/ttm/ttm_kmap_iter.h>
> > >   
> > > +struct ttm_backup;
> > >   struct ttm_device;
> > >   struct ttm_tt;
> > >   struct ttm_resource;
> > >   struct ttm_buffer_object;
> > >   struct ttm_operation_ctx;
> > > +struct ttm_pool_tt_restore;
> > >   
> > >   /**
> > >    * struct ttm_tt - This is a structure holding the pages,
> > > caching- and aperture
> > > @@ -85,17 +87,22 @@ struct ttm_tt {
> > >   	 * fault handling abuses the DMA api a bit and
> > > dma_map_attrs can't be
> > >   	 * used to assure pgprot always matches.
> > >   	 *
> > > +	 * TTM_TT_FLAG_BACKED_UP: TTM internal only. This is set
> > > if the
> > > +	 * struct ttm_tt has been (possibly partially) backed
> > > up.
> > > +	 *
> > >   	 * TTM_TT_FLAG_PRIV_POPULATED: TTM internal only. DO NOT
> > > USE. This is
> > >   	 * set by TTM after ttm_tt_populate() has successfully
> > > returned, and is
> > >   	 * then unset when TTM calls ttm_tt_unpopulate().
> > > +	 *
> > >   	 */
> > >   #define TTM_TT_FLAG_SWAPPED		BIT(0)
> > >   #define TTM_TT_FLAG_ZERO_ALLOC		BIT(1)
> > >   #define TTM_TT_FLAG_EXTERNAL		BIT(2)
> > >   #define TTM_TT_FLAG_EXTERNAL_MAPPABLE	BIT(3)
> > >   #define TTM_TT_FLAG_DECRYPTED		BIT(4)
> > > +#define TTM_TT_FLAG_BACKED_UP	        BIT(5)
> > >   
> > > -#define TTM_TT_FLAG_PRIV_POPULATED	BIT(5)
> > > +#define TTM_TT_FLAG_PRIV_POPULATED	BIT(6)
> > >   	uint32_t page_flags;
> > >   	/** @num_pages: Number of pages in the page array. */
> > >   	uint32_t num_pages;
> > > @@ -105,11 +112,20 @@ struct ttm_tt {
> > >   	dma_addr_t *dma_address;
> > >   	/** @swap_storage: Pointer to shmem struct file for swap
> > > storage. */
> > >   	struct file *swap_storage;
> > > +	/**
> > > +	 * @backup: Pointer to backup struct for backed up tts.
> > > +	 * Could be unified with @swap_storage. Meanwhile, the
> > > driver's
> > > +	 * ttm_tt_create() callback is responsible for assigning
> > > +	 * this field.
> > > +	 */
> > > +	struct ttm_backup *backup;
> > >   	/**
> > >   	 * @caching: The current caching state of the pages, see
> > > enum
> > >   	 * ttm_caching.
> > >   	 */
> > >   	enum ttm_caching caching;
> > > +	/** @restore: Partial restoration from backup state. TTM
> > > private */
> > > +	struct ttm_pool_tt_restore *restore;
> > >   };
> > >   
> > >   /**
> > > @@ -129,9 +145,38 @@ static inline bool
> > > ttm_tt_is_populated(struct
> > > ttm_tt *tt)
> > >   	return tt->page_flags & TTM_TT_FLAG_PRIV_POPULATED;
> > >   }
> > >   
> > > +/**
> > > + * ttm_tt_is_swapped() - Whether the ttm_tt is swapped out or
> > > backed up
> > > + * @tt: The struct ttm_tt.
> > > + *
> > > + * Return: true if swapped or backed up, false otherwise.
> > > + */
> > >   static inline bool ttm_tt_is_swapped(const struct ttm_tt *tt)
> > >   {
> > > -	return tt->page_flags & TTM_TT_FLAG_SWAPPED;
> > > +	return tt->page_flags & (TTM_TT_FLAG_SWAPPED |
> > > TTM_TT_FLAG_BACKED_UP);
> > > +}
> > > +
> > > +/**
> > > + * ttm_tt_is_backed_up() - Whether the ttm_tt backed up
> > > + * @tt: The struct ttm_tt.
> > > + *
> > > + * Return: true if swapped or backed up, false otherwise.
> > > + */
> > > +static inline bool ttm_tt_is_backed_up(const struct ttm_tt *tt)
> > > +{
> > > +	return tt->page_flags & TTM_TT_FLAG_BACKED_UP;
> > > +}
> > > +
> > > +/**
> > > + * ttm_tt_clear_backed_up() - Clear the ttm_tt backed-up status
> > > + * @tt: The struct ttm_tt.
> > > + *
> > > + * Drivers can use this functionto clear the backed-up status,
> > > + * for example before destroying or re-validating a purged tt.
> > > + */
> > > +static inline void ttm_tt_clear_backed_up(struct ttm_tt *tt)
> > > +{
> > > +	tt->page_flags &= ~TTM_TT_FLAG_BACKED_UP;
> > >   }
> > >   
> > >   /**
> > > @@ -235,6 +280,24 @@ void ttm_tt_mgr_init(unsigned long
> > > num_pages,
> > > unsigned long num_dma32_pages);
> > >   struct ttm_kmap_iter *ttm_kmap_iter_tt_init(struct
> > > ttm_kmap_iter_tt *iter_tt,
> > >   					    struct ttm_tt *tt);
> > >   unsigned long ttm_tt_pages_limit(void);
> > > +
> > > +/**
> > > + * struct ttm_backup_flags - Flags to govern backup behaviour.
> > > + * @purge: Free pages without backing up. Bypass pools.
> > > + * @writeback: Attempt to copy contents directly to swap space,
> > > even
> > > + * if that means blocking on writes to external memory.
> > > + */
> > > +struct ttm_backup_flags {
> > > +	u32 purge : 1;
> > > +	u32 writeback : 1;
> > > +};
> > > +
> > > +long ttm_tt_backup(struct ttm_device *bdev, struct ttm_tt *tt,
> > > +		   const struct ttm_backup_flags flags);
> > > +
> > > +int ttm_tt_restore(struct ttm_device *bdev, struct ttm_tt *tt,
> > > +		   const struct ttm_operation_ctx *ctx);
> > > +
> > >   #if IS_ENABLED(CONFIG_AGP)
> > >   #include <linux/agp_backend.h>
> > >   
> > 
>