On Tue, Jul 14, 2020 at 10:16:14PM -0700, Ram Pai wrote: > On Mon, Jul 13, 2020 at 10:59:41AM +0530, Bharata B Rao wrote: > > On Sat, Jul 11, 2020 at 02:13:43AM -0700, Ram Pai wrote: > > > Merging of pages associated with each memslot of a SVM is > > > disabled the page is migrated in H_SVM_PAGE_IN handler. > > > > > > This operation should have been done much earlier; the moment the VM > > > is initiated for secure-transition. Delaying this operation, increases > > > the probability for those pages to acquire new references , making it > > > impossible to migrate those pages in H_SVM_PAGE_IN handler. > > > > > > Disable page-migration in H_SVM_INIT_START handling. > > > > While it is a good idea to disable KSM merging for all VMAs during > > H_SVM_INIT_START, I am curious if you did observe an actual case of > > ksm_madvise() failing which resulted in subsequent H_SVM_PAGE_IN > > failing to migrate? > > No. I did not find any ksm_madvise() failing. But it did not make sense > to ksm_madvise() everytime a page_in was requested. Hence i proposed > this patch. H_SVM_INIT_START is the right place for ksm_advise(). Indeed yes. Then you may want to update the description which currently seems to imply that this change is being done to avoid issues arising out of delayed KSM unmerging advice. > > > > > > > > > Signed-off-by: Ram Pai <linuxram@xxxxxxxxxx> > > > --- > > > arch/powerpc/kvm/book3s_hv_uvmem.c | 96 +++++++++++++++++++++++++++++--------- > > > 1 file changed, 74 insertions(+), 22 deletions(-) > > > > > > diff --git a/arch/powerpc/kvm/book3s_hv_uvmem.c b/arch/powerpc/kvm/book3s_hv_uvmem.c > > > index 3d987b1..bfc3841 100644 > > > --- a/arch/powerpc/kvm/book3s_hv_uvmem.c > > > +++ b/arch/powerpc/kvm/book3s_hv_uvmem.c > > > @@ -211,6 +211,65 @@ static bool kvmppc_gfn_is_uvmem_pfn(unsigned long gfn, struct kvm *kvm, > > > return false; > > > } > > > > > > +static int kvmppc_memslot_page_merge(struct kvm *kvm, > > > + struct kvm_memory_slot *memslot, bool merge) > > > +{ > > > + unsigned long gfn = memslot->base_gfn; > > > + unsigned long end, start = gfn_to_hva(kvm, gfn); > > > + int ret = 0; > > > + struct vm_area_struct *vma; > > > + int merge_flag = (merge) ? MADV_MERGEABLE : MADV_UNMERGEABLE; > > > + > > > + if (kvm_is_error_hva(start)) > > > + return H_STATE; > > > > This and other cases below seem to be a new return value from > > H_SVM_INIT_START. May be update the documentation too along with > > this patch? > > ok. > > > > > > + > > > + end = start + (memslot->npages << PAGE_SHIFT); > > > + > > > + down_write(&kvm->mm->mmap_sem); > > > > When you rebase the patches against latest upstream you may want to > > replace the above and other instances by mmap_write/read_lock(). > > ok. > > > > > > + do { > > > + vma = find_vma_intersection(kvm->mm, start, end); > > > + if (!vma) { > > > + ret = H_STATE; > > > + break; > > > + } > > > + ret = ksm_madvise(vma, vma->vm_start, vma->vm_end, > > > + merge_flag, &vma->vm_flags); > > > + if (ret) { > > > + ret = H_STATE; > > > + break; > > > + } > > > + start = vma->vm_end + 1; > > > + } while (end > vma->vm_end); > > > + > > > + up_write(&kvm->mm->mmap_sem); > > > + return ret; > > > +} > > > + > > > +static int __kvmppc_page_merge(struct kvm *kvm, bool merge) > > > +{ > > > + struct kvm_memslots *slots; > > > + struct kvm_memory_slot *memslot; > > > + int ret = 0; > > > + > > > + slots = kvm_memslots(kvm); > > > + kvm_for_each_memslot(memslot, slots) { > > > + ret = kvmppc_memslot_page_merge(kvm, memslot, merge); > > > + if (ret) > > > + break; > > > + } > > > + return ret; > > > +} > > > + > > > +static inline int kvmppc_disable_page_merge(struct kvm *kvm) > > > +{ > > > + return __kvmppc_page_merge(kvm, false); > > > +} > > > + > > > +static inline int kvmppc_enable_page_merge(struct kvm *kvm) > > > +{ > > > + return __kvmppc_page_merge(kvm, true); > > > +} > > > + > > > unsigned long kvmppc_h_svm_init_start(struct kvm *kvm) > > > { > > > struct kvm_memslots *slots; > > > @@ -232,11 +291,18 @@ unsigned long kvmppc_h_svm_init_start(struct kvm *kvm) > > > return H_AUTHORITY; > > > > > > srcu_idx = srcu_read_lock(&kvm->srcu); > > > + > > > + /* disable page-merging for all memslot */ > > > + ret = kvmppc_disable_page_merge(kvm); > > > + if (ret) > > > + goto out; > > > + > > > + /* register the memslot */ > > > slots = kvm_memslots(kvm); > > > kvm_for_each_memslot(memslot, slots) { > > > if (kvmppc_uvmem_slot_init(kvm, memslot)) { > > > ret = H_PARAMETER; > > > - goto out; > > > + break; > > > } > > > ret = uv_register_mem_slot(kvm->arch.lpid, > > > memslot->base_gfn << PAGE_SHIFT, > > > @@ -245,9 +311,12 @@ unsigned long kvmppc_h_svm_init_start(struct kvm *kvm) > > > if (ret < 0) { > > > kvmppc_uvmem_slot_free(kvm, memslot); > > > ret = H_PARAMETER; > > > - goto out; > > > + break; > > > } > > > } > > > + > > > + if (ret) > > > + kvmppc_enable_page_merge(kvm); > > > > Is there any use of enabling KSM merging in the failure path here? > > Won't UV terminate the VM if H_SVM_INIT_START fails? If there is no need, > > you can do away with some extra routines above. > > UV will terminate it. But I did not want to tie that assumption into > this function. Hmm ok, but having code around which isn't expected to be executed at all was my concern. Regards, Bharata.