Hi Peter and Babu, On 5/2/2024 1:14 PM, Moger, Babu wrote: > On 5/2/24 12:50, Peter Newman wrote: >> On Thu, May 2, 2024 at 9:25 AM Moger, Babu <babu.moger@xxxxxxx> wrote: >>> On 5/1/24 12:48, Peter Newman wrote: ... >>>> I chose to make this a mount option to simplify the management of the >>>> monitor tracking data structures. They are simply allocated at mount >>>> time and deallocated and unmount. >>> >>> Initially I added it as an mount option. >>> Based on our earlier discussion, we decided to use the assign feature by >>> default if hardware supports it. Users don't have to worry about the details. >>>> >>>> I called the option "mon_assign": The mount option parser calls >>>> resctrl_arch_mon_assign_enable() to determine whether the >>>> implementation supports assignment in some form. If it returns an >>>> error, the mount fails. When successful, the assignable monitor count >>>> is made non-zero in the appropriate rdt_resource, triggering the >>>> behavior change in the FS layer. >>>> >>>> I'm still not sure if it's a good idea to enable monitor assignment by >>>> default. This would be a major disruption in the MBM usage model >>>> triggered by moving software between AMD CPU models. I thought the >>> >>> Why will it be a disruption? Why do you think mount option will solve the >>> problem? As always, there will be option to go back to legacy mode. right? >>> >>>> safest option was to disallow creating more monitoring groups than >>>> monitors unless the option is selected. Given that nobody else >>> >>> Current code allows to create more groups, but it will report "Monitor >>> assignment failed" when it runs out of monitors. >> >> Ok that should be fine then. >> >> However, I don't think it's necessary to support dynamically changing >> the usage model of monitoring groups without remounting. I believe it >> makes it more difficult for the FS code to generically manage monitor >> assignment. > > Are you suggesting to enable ABMC by default when available? I do think ABMC should be enabled by default when available and it looks to be what this series aims to do [1]. The way I reason about this is that legacy user space gets more reliable monitoring behavior without needing to change behavior. I thought there was discussion about communicating to user space when an attempt is made to read data from an event that does not have a counter assigned. Something like below but I did not notice this in this series. # cat /sys/fs/resctrl/mon_data/mon_L3_00/mbm_total_bytes Unassigned > > Then provide the mount option switch back to legacy mode? > I am fine with that if we all agree on that. Why is a mount option needed? I think we should avoid requiring a remount unless required and I would like to understand why it is required here. Peter: could you please elaborate what you mean with it makes it more difficult for the FS code to generically manage monitor assignment? Why would user space be required to recreate all control and monitor groups if wanting to change how memory bandwidth monitoring is done? >From this implementation it has been difficult to understand the impact of switching between ABMC and legacy. Reinette [1] https://lore.kernel.org/lkml/e898059f3c182886b1c16353be7db76d9b852b02.1711674410.git.babu.moger@xxxxxxx/