On Sat, Feb 22, 2025 at 08:16:53PM +0000, Michael Kelley wrote: > From: Saurabh Singh Sengar <ssengar@xxxxxxxxxxxxxxxxxxx> Sent: Saturday, February 22, 2025 9:27 AM > > > > On Wed, Feb 19, 2025 at 05:22:36AM +0000, Michael Kelley wrote: > > > From: Saurabh Sengar <ssengar@xxxxxxxxxxxxxxxxxxx> Sent: Saturday, February 15, > > 2025 1:21 AM > > > > > > > > When a Hyper-V framebuffer device is unbind, hyperv_fb driver tries to > > > > release the framebuffer forcefully. If this framebuffer is in use it > > > > produce the following WARN and hence this framebuffer is never released. > > > > > > > > [ 44.111220] WARNING: CPU: 35 PID: 1882 at drivers/video/fbdev/core/fb_info.c:70 > > framebuffer_release+0x2c/0x40 > > > > < snip > > > > > [ 44.111289] Call Trace: > > > > [ 44.111290] <TASK> > > > > [ 44.111291] ? show_regs+0x6c/0x80 > > > > [ 44.111295] ? __warn+0x8d/0x150 > > > > [ 44.111298] ? framebuffer_release+0x2c/0x40 > > > > [ 44.111300] ? report_bug+0x182/0x1b0 > > > > [ 44.111303] ? handle_bug+0x6e/0xb0 > > > > [ 44.111306] ? exc_invalid_op+0x18/0x80 > > > > [ 44.111308] ? asm_exc_invalid_op+0x1b/0x20 > > > > [ 44.111311] ? framebuffer_release+0x2c/0x40 > > > > [ 44.111313] ? hvfb_remove+0x86/0xa0 [hyperv_fb] > > > > [ 44.111315] vmbus_remove+0x24/0x40 [hv_vmbus] > > > > [ 44.111323] device_remove+0x40/0x80 > > > > [ 44.111325] device_release_driver_internal+0x20b/0x270 > > > > [ 44.111327] ? bus_find_device+0xb3/0xf0 > > > > > > > > Fix this by moving the release of framebuffer to fb_ops.fb_destroy function > > > > so that framebuffer framework handles it gracefully > > > > > > These changes look good for solving the specific problem where > > > the reference count WARN is produced. But there is another > > > problem of the same type that happens when doing unbind > > > of a hyperv_fb device that is in use (i.e., /dev/fb0 is open and > > > mmap'ed by some user space program). > > > > > > For this additional problem, there are three sub-cases, > > > depending on what memory gets mmap'ed into user space. > > > Two of the three sub-cases have a problem. > > > > > > 1) When Hyper-V FB uses deferred I/O, the vmalloc dio memory > > > is what get mapped into user space. When hyperv_fb is unbound, > > > the vmalloc dio memory is freed. But the memory doesn't actually > > > get freed if it is still mmap'ed into user space. The deferred I/O > > > mechanism is stopped, but user space can keep writing to the > > > memory even though the pixels don't get copied to the actual > > > framebuffer any longer. When the user space program terminates > > > (or unmaps the memory), the memory will be freed. So this case > > > is OK, though perhaps a bit dubious. > > > > > > 2) When Hyper-V FB is in a Gen 1 VM, and the frame buffer size > > > is <= 4 MiB, a normal kernel allocation is used for the > > > memory that is mmap'ed to user space. If this memory > > > is freed when hyperv_fb is unbound, bad things happen > > > because the memory is still being written to via the user space > > > mmap. There are multiple "BUG: Bad page state in process > > > bash pfn:106c65" errors followed by stack traces. > > > > > > 3) Similarly in a Gen 1 VM, if the frame buffer size is > 4 MiB, > > > CMA memory is allocated (assuming it is available). This CMA > > > memory gets mapped into user space. When hyperv_fb is > > > unbound, that memory is freed. But CMA complains that the > > > ref count on the pages is not zero. Here's the dmesg output: > > > > > > [ 191.629780] ------------[ cut here ]------------ > > > [ 191.629784] 200 pages are still in use! > > > [ 191.629789] WARNING: CPU: 3 PID: 1115 at mm/page_alloc.c:6757 > > free_contig_range+0x15e/0x170 > > > > > > Stack trace is: > > > > > > [ 191.629847] ? __warn+0x97/0x160 > > > [ 191.629849] ? free_contig_range+0x15e/0x170 > > > [ 191.629849] ? report_bug+0x1bb/0x1d0 > > > [ 191.629851] ? console_unlock+0xdd/0x1e0 > > > [ 191.629854] ? handle_bug+0x60/0xa0 > > > [ 191.629857] ? exc_invalid_op+0x1d/0x80 > > > [ 191.629859] ? asm_exc_invalid_op+0x1f/0x30 > > > [ 191.629862] ? free_contig_range+0x15e/0x170 > > > [ 191.629862] ? free_contig_range+0x15e/0x170 > > > [ 191.629863] cma_release+0xc6/0x150 > > > [ 191.629865] dma_free_contiguous+0x34/0x70 > > > [ 191.629868] dma_direct_free+0xd3/0x130 > > > [ 191.629869] dma_free_attrs+0x6b/0x130 > > > [ 191.629872] hvfb_putmem.isra.0+0x99/0xd0 [hyperv_fb] > > > [ 191.629874] hvfb_remove+0x75/0x80 [hyperv_fb] > > > [ 191.629876] vmbus_remove+0x28/0x40 [hv_vmbus] > > > [ 191.629883] device_remove+0x43/0x70 > > > [ 191.629886] device_release_driver_internal+0xbd/0x140 > > > [ 191.629888] device_driver_detach+0x18/0x20 > > > [ 191.629890] unbind_store+0x8f/0xa0 > > > [ 191.629891] drv_attr_store+0x25/0x40 > > > [ 191.629892] sysfs_kf_write+0x3f/0x50 > > > [ 191.629894] kernfs_fop_write_iter+0x142/0x1d0 > > > [ 191.629896] vfs_write+0x31b/0x450 > > > [ 191.629898] ksys_write+0x6e/0xe0 > > > [ 191.629899] __x64_sys_write+0x1e/0x30 > > > [ 191.629900] x64_sys_call+0x16bf/0x2150 > > > [ 191.629903] do_syscall_64+0x4e/0x110 > > > [ 191.629904] entry_SYSCALL_64_after_hwframe+0x76/0x7e > > > > > > For all three cases, I think the memory freeing and iounmap() operations > > > can be moved to the new hvfb_destroy() function so that the memory > > > is cleaned up only when there aren't any users. While these additional > > > changes could be done as a separate patch, it seems to me like they are all > > > part of the same underlying issue as the reference count problem, and > > > could be combined into this patch. > > > > > > Michael > > > > > > > Thanks for your review. > > > > I had considered moving the entire `hvfb_putmem()` function to `destroy`, > > but I was hesitant for two reasons: > > > > 1. I wasn’t aware of any scenario where this would be useful. However, > > your explanation has convinced me that it is necessary. > > 2. `hvfb_release_phymem()` relies on the `hdev` pointer, which requires > > multiple `container_of` operations to derive it from the `info` pointer. > > I was unsure if the complexity was justified, but it seems worthwhile now. > > > > I will move `hvfb_putmem()` to the `destroy` function in V2, and I hope this > > will address all the cases you mentioned. > > > > Yes, that's what I expect needs to happen, though I haven't looked at the > details of making sure all the needed data structures are still around. Like > you, I just had this sense that hvfb_putmem() might need to be moved as > well, so I tried to produce a failure scenario to prove it, which turned out > to be easy. > > Michael I will add this in V2 as well. But I have found an another issue which is not very frequent. [ 176.562153] ------------[ cut here ]------------ [ 176.562159] fb0: fb_WARN_ON_ONCE(pageref->page != page) [ 176.562176] WARNING: CPU: 50 PID: 1522 at drivers/video/fbdev/core/fb_defio.c:67 fb_deferred_io_mkwrite+0x215/0x280 <snip> [ 176.562258] Call Trace: [ 176.562260] <TASK> [ 176.562263] ? show_regs+0x6c/0x80 [ 176.562269] ? __warn+0x8d/0x150 [ 176.562273] ? fb_deferred_io_mkwrite+0x215/0x280 [ 176.562275] ? report_bug+0x182/0x1b0 [ 176.562280] ? handle_bug+0x133/0x1a0 [ 176.562283] ? exc_invalid_op+0x18/0x80 [ 176.562284] ? asm_exc_invalid_op+0x1b/0x20 [ 176.562289] ? fb_deferred_io_mkwrite+0x215/0x280 [ 176.562291] ? fb_deferred_io_mkwrite+0x215/0x280 [ 176.562293] do_page_mkwrite+0x4d/0xb0 [ 176.562296] do_wp_page+0xe8/0xd50 [ 176.562300] ? ___pte_offset_map+0x1c/0x1b0 [ 176.562304] __handle_mm_fault+0xbe1/0x10e0 [ 176.562307] handle_mm_fault+0x17f/0x2e0 [ 176.562309] do_user_addr_fault+0x2d1/0x8d0 [ 176.562314] exc_page_fault+0x85/0x1e0 [ 176.562318] asm_exc_page_fault+0x27/0x30 Looks this is because driver is unbind still Xorg is trying to write to memory which is causing some page faults. I have confirmed PID 1522 is of Xorg. I think this is because we need to cancel the framebuffer deferred work after flushing it. After adding the below in hvfb_remove I don't see this issue anymore. Although as the issue is not very frequent I am not 100% sure. cancel_delayed_work_sync(&info->deferred_work); If you think this is reasonable I can add this as well in V2. - Saurabh