[cc +Kirti] On Wed, 18 Apr 2018 18:55:45 +0800 Xu Yandong <xuyandong2@xxxxxxxxxx> wrote: > The task structure in vfio_dma struct used to identify the same > task who map it or other task who shares same adress space is > allowed to unmap. But if the task who map it has exited, mm of > the task has been set to null, we should unmap the vfio dma directly. > > Signed-off-by: Xu Yandong <xuyandong2@xxxxxxxxxx> > --- > Hi all, > When I unplug a vcpu from a VM lanched with a VFIO hostdev device, > I found that the *vfio_dma* mapped by this vcpu task could not be unmaped > in the future, so I send this patch to unmap vfio_dma directly if the > task who mapped it has exited. > > Howerver this patch may introduce a new security risk because any task can > unmap the *vfio_dma* if the mapper task has exited. Well that's unexpected, but adding some debugging code I can clearly see that the map and unmap ioctls are typically called by the various processor threads, which all share the same mm_struct (so accounting is correct regardless of which CPU does the unmap). I don't think the fix below is correct though, it's not for a security risk, but for accounting issue and correctness issues. The pages are mapped and accounted against the users locked memory limits, if we simply bail out, both the IOMMU mappings and the limit accounting are wrong. Perhaps rather than referencing the calling task_struct in the vfio_dma on mapping, we should traverse to the highest parent task sharing the same mm_struct. Kirti, any thoughts since this code originated for mdev support? Thanks, Alex > --- > drivers/vfio/vfio_iommu_type1.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > diff --git a/drivers/vfio/vfio_iommu_type1.c b/drivers/vfio/vfio_iommu_type1.c > index 5c212bf..601a353 100644 > --- a/drivers/vfio/vfio_iommu_type1.c > +++ b/drivers/vfio/vfio_iommu_type1.c > @@ -947,7 +947,7 @@ static int vfio_dma_do_unmap(struct vfio_iommu *iommu, > * Task with same address space who mapped this iova range is > * allowed to unmap the iova range. > */ > - if (dma->task->mm != current->mm) > + if (dma->task->mm && (dma->task->mm != current->mm)) > break; > > if (!RB_EMPTY_ROOT(&dma->pfn_list)) {