> I think just disabling guest cpuid might not be enough, since guest > could write to the msr without checking the cpuid. > > Why not just add a bitmap for fixed counter? > e.g. fixed_counter_reject_bitmap > > At the beginning of reprogram_fixed_counter, we could add the check: > > if (test_bit(idx, &kvm->arch.fixed_counter_reject_bitmap)) > return -EACCES; > > (Please test with your old guest and see if they have issues if we > inject #GP when > they try to set the fixed_ctrl msr. If there is, we could drop -EACCESS > above) > > The bitmap could be set at kvm_vm_ioctl_set_pmu_event_filter. intel_pmu_refresh() checks the guest cpuid and sets the number of fixed counters according to that: pmu->nr_arch_fixed_counters = min_t(int, edx.split.num_counters_fixed, INTEL_PMC_MAX_FIXED); and reprogram_fixed_counters()/get_fixed_pmc() respect this so the guest can't just ignore the cpuid. Adding a bitmap does let you do things like disable the first counter but keep the second and third, but given that there are only three and the events are likely to be on a whitelist anyway, it seemed like adding the bitmap wasn't worth it. If you still feel the same way even though we can disable them via the cpuid, I can add this in. > I think it would be better to add more, please see below: > > enum kvm_pmu_action_type { > KVM_PMU_EVENT_ACTION_NONE = 0, > KVM_PMU_EVENT_ACTION_ACCEPT = 1, > KVM_PMU_EVENT_ACTION_REJECT = 2, > KVM_PMU_EVENT_ACTION_MAX > }; > > and do a check in kvm_vm_ioctl_set_pmu_event_filter() > if (filter->action >= KVM_PMU_EVENT_ACTION_MAX) > return -EINVAL; > > This is for detecting the case that we add a new action in > userspace, while the kvm hasn't been updated to support that. > > KVM_PMU_EVENT_ACTION_NONE is for userspace to remove > the filter after they set it. We can achieve the same result by using a reject action with an empty set of events - is there some advantage to "none" over that? I can add that check for valid actions. > > +#define KVM_PMU_EVENT_FILTER_MAX_EVENTS 63 > > Why is this limit needed? Serves to keep the filters on the smaller side and ensures the size calculation can't overflow if users attempt to. Keeping the filter under 4k is nicer for allocation - also, if we want really large filters we might want to do something smarter than a linear traversal of the filter when guests program counters. > I think it looks tidier to wrap the changes above into a function: > > if (kvm_pmu_filter_event(kvm, eventsel & AMD64_RAW_EVENT_MASK_NB)) > return; Okay - I can do that. > > + kvfree(filter); > > Probably better to have it conditionally? > > if (filter) { > synchronize_srcu(); > kfree(filter) > } > > You may want to factor it out, so that kvm_pmu_destroy could reuse. Do you mean kvm_arch_destroy_vm? It looks like that's where kvm_arch members are freed. I can do that. Eric