On Thu, May 29, 2008 at 8:13 PM, Vegard Nossum <vegard.nossum@xxxxxxxxx> wrote: > On Thu, May 29, 2008 at 10:25 AM, Vegard Nossum <vegard.nossum@xxxxxxxxx> wrote: >> On 5/29/08, Peter Teoh <htmldeveloper@xxxxxxxxx> wrote: >>> Hi Vegard, I have some questions for you. >>> >> >> [...] >> >>> Second, your present tried to retain the CR3 value, but change the >>> contents of the page table instead....but has problem with >>> multiprocessor. >>> >>> How about the other way round - change the contents of per-CPU CR3, >>> which will point to a different pagetable - (subtable). Not sure if >>> this is the "per-cpu" pagetable that Ingo is suggesting to you. But >>> I am suggesting a very small subtable - only those entries needed are >>> constructed. After use it will be switch back to the original CR3. >>> Possible? >> >> Hi, >> >> Sorry for the short reply but I am short on time :) >> >> So this is actually a very good idea. I previously disregarded it as >> impossible because you can have for example a CMPS instruction that >> compares (in fact) TWO memory locations). But it shouldn't be worse >> than simply making two entries in that new page table :-) >> >> For this, we would only need as many pages as we have levels in the >> page table, e.g. two or three extra pages per CPU. And we only have to >> copy at most two words per level, i.e. at most 6 reads/writes. This >> will certainly be a killer if it works. >> >> (The most overhead I'm guessing will come from reloading the CR3 twice >> on each #PF. But then again, it might not be so much compared to what >> we already do. I honestly don't know! It's much, much better than >> halting all the other CPUs, though!) >> >> Thanks a lot for this suggestion! I will try to implement the idea and >> see if I can find any fatal consequences. > > Okay, I remembered why this won't work. I just needed to wake up properly. > > Of course we also need to copy all the page tables for the code > itself, the stack, etc. And copying all this for each page fault will > also take too much time. > I may wrong, but I was thinking: if u disabled all forms of interrupt, and ensure that all processing are done via registers only (YES....very unlikely possible ever), avoiding usage of any memory-access related assembly instructions, setup the environment for the CPU and its new pagetable, chnage the CR3 to point there, DO ALL THE NECESSARY memory check (read before write checking etc) there....and then write the results to the registers again....and switch back the CR3 to original table, and then reenable interrupt again. So effectively within this small windows of interrupt-disabled period, since u don't have any chance of jumping /using other codes/data area/global data, memeory related addressing will not be used, and thus the old original page table will not be needed to be copied. Ie, the temporary pagetable (mini) is purely setup for your kmemcheck stuff only....and after that the original pagetable will come in... all conjectures....i may be wrong....have not thought through properly yet.....my problem is when NMI comes in, then it may be necessary to redirect this NMI to another CPU to handle it (since it should still have the original CR3 intact). zzzzzzzzzzzzzzzzzzzzzzzzzzzz....i am sleepy now......12 midnight...good night... > So it seems per-cpu page tables will be the only way to go for SMP > support. Maybe I will find the time to try this during the summer :-) > > Vegard > -- Regards, Peter Teoh -- To unsubscribe from this list: send an email with "unsubscribe kernelnewbies" to ecartis@xxxxxxxxxxxx Please read the FAQ at http://kernelnewbies.org/FAQ