On 07/01/2010 03:52 PM, Alexander Graf wrote:
Don't you use lazy spte updates?
We do, but given enough time, the guest will touch its entire memory.
Oh, so that's the major difference. On PPC we have the HTAB with a
fraction of all the mapped pages in it. We don't have a notion of a full
page table for a guest process. We always only have a snapshot of some
mappings and shadow those lazily.
So at worst, we have HPTEG_CACHE_NUM shadow pages mapped, which would be
(1<< 15) * 4k which again would be at most 128MB of guest memory. We
can't hold more mappings than that anyways, so chances are low we have a
mapping for each hva.
Doesn't that seriously impact performance? A guest that recycles pages
from its lru will touch pages at random from its entire address space.
On bare metal that isn't a problem (I imagine) due to large tlbs. But
virtualized on 4K pages that means the htlb will be thrashed.
But then again I probably do need an rmap for the mmu_notifier magic,
right? But I'd rather prefer to have that code path be slow and the
dirty bitmap invalidation fast than the other way around. Swapping is
slow either way.
It's not just swapping, it's also page ageing. That needs to be
fast. Does ppc have a hardware-set referenced bit? If so, you need a
fast rmap for mmu notifiers.
Page ageing is difficult. The HTAB has a hardware set referenced bit,
but we don't have a guarantee that the entry is still there when we look
for it. Something else could have overwritten it by then, but the entry
could still be lingering around in the TLB.
Whoever's dropping the HTAB needs to update the host struct page, and
also reflect the bit into the guest's HTAB, no?
In fact, on x86 shadow, we don't have an spte for a gpte that is not
accessed, precisely so we know the exact point in time when the accessed
bit is set.
So I think the only reasonable way to implement page ageing is to unmap
pages. And that's slow, because it means we have to map them again on
access. Bleks. Or we could look for the HTAB entry and only unmap them
if the entry is moot.
I think it works out if you update struct page when you clear out an HTAB.
--
error compiling committee.c: too many arguments to function
--
To unsubscribe from this list: send the line "unsubscribe kvm" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html