Re: [PATCH] cxl: Update Soft Reserved resources upon region creation

On 12/12/2024 7:01 PM, Alison Schofield wrote:
> On Thu, Dec 12, 2024 at 05:42:08PM -0500, Gregory Price wrote:
>> On Mon, Dec 02, 2024 at 09:55:42AM -0600, Nathan Fontenot wrote:
>> ... snip ...
>>> diff --git a/kernel/resource.c b/kernel/resource.c
>>> index a83040fde236..8fc4121a1887 100644
>>> --- a/kernel/resource.c
>>> +++ b/kernel/resource.c
>> ... snip ...
>>> +static int srmem_parse_cfmws(union acpi_subtable_headers *hdr,
>>> +			     void *arg, const unsigned long unused)
>>> +{
>>
>> Chiming in a little late to the party here
>>
>> I don't think encoding CXL-specific terminology and functionality
>> directly into kernel/resource.c is wise by any measure.
>>
>> The abstraction here is completely inverted, and this is probably
>> in line with Dan's comments.
>>
>> The comments in e820.c allude to a similar issue
>>
>>     * Prior to inserting SOFT_RESERVED resources we want to check
>>     * for an intersection with potential CXL resources.
>>
>> This is similarly inverted - e820 doesn't know anything about CXL
>> and it shouldn't have to be made aware of CXL. Mucking with
>> e820 is *begging* to be bitten elsewhere in parts of the system
>> that depend on it to be a relatively stable source of truth.
>>
>>
>> This tells me that this patch is trying to solve the wrong problem.
>>
>>
>> Your changelog alludes to supporting hotplug replace
>>
>> """
>>   The current approach of leaving the SOFT RESERVE resource as is can
>>   cause failure during hotplug replace of CXL devices because the 
>>   resource is not available for reuse after teardown of the CXL device.
>> """
>>
>> It sounds like we should be making the SR resource available for
>> re-use through proper teardown and cleanup of the resource tree,
>> rather than trying to change fundamental components like e820.
>>
>> If the driver was capable of using the SOFT_RESERVED region on
>> initial setup, it should be capable of re-using that region.
>>
>>
>> Is the issue here that the hotplug-replaced component has a
>> different capacity? Is it being assigned a new region entirely?
>> Is it exactly the same, but the resource isn't being cleaned up?
>>
>> Can you provide more specifics about the exact hotplug interaction
>> that is happening? That might help understand the issue a bit better.
>>
>>
>> Much of this sounds like we need additional better tear-down
>> management and possibly additional cxl/acpi features to handle 
>> hotplug of these devices - rather than changing resource.c.
> 
> Hi Gregory,
> 
> Never too late, and it serves to revisit and remember to carry
> along some of the why behind this patch as we adopt it to do more.
> Admittedly, the more, has also become the more important here!
> 
> I'm not sure what Nathan first saw that led him to this issue.
> For me it was - 
> 
> BIOS labels a resource Soft Reserved and programs a region using
> that range. Later, the existing cxl path to destroy that region
> does not free up that Soft Reserved range. Users cannot create
> another region in its place. Resource lost. We considered simply
> removing soft reserved resources on region teardown, and you can
> probably find patches on lore doing just that.

Yes, this is the same problem I was seeing and looking to solve.

> 
> But - the problem grew. Sometimes BIOS creates an SR that is not
> aligned with the region they go on to program. Stranded resources.
> That's where the trim and give to DAX path originated.
> 
> But - the problem grew. Sometimes the CXL driver fails to enumerate
> that BIOS defined region. More stranded resources. Let's find those
> too and give them to DAX. This is something we are seeing in the
> wild now and why Dan raised its priority.
> 
> Dan is also suggesting that at that last event - failure to enumerate
> a BIOS defined region, we tear down the entire ACPI0017 topology
> and give everything to DAX.
> 
> What Dan called, "the minimum requirement": all Soft Reserved ranges
> end up as dax-devices sounds like the right guideline moving forward.
> 
> Hope that answers some of your why questions about the SR reuse.
>

Thanks for the summary on where the patch started and where we're
currently at. I think it will help everyone.
 
> It seems the worst problem, failure to enumerate a region, kind of
> drives the solution now, and we should handle the soft reserveds
> all the same. Even if Nathan doesn't implement it in one final
> sweep through the Soft Reserveds, one way or another, those SR
> ranges must end up as dax-device resources - either CXL managed
> or BIOS Soft Reserved.
> 

This leads us to how to design the handling of soft reserves from
a larger system-wide perspective, not just the CXL-specific focus
I have come at this with in my patches.

If we start by setting aside soft reserved resources on a separate
list from iomem resources at boot, drivers could then remove the pieces
they use from that list and release the unused pieces to the iomem tree.
One thing to consider is when to set the soft reserves aside. If CXL
isn't configured we don't want to set them aside at all; Dan mentioned
a possible config option to control this.

For CXL, I think the approach Dan is asking for is to wait for CXL
probe to complete and then walk through all created regions and remove
the intersecting pieces from the soft reserves. Any soft reserves
remaining could then be released and used as dax devices. Dan, please
correct me if I got that wrong.
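
To make the trimming step concrete, here is a rough user-space sketch of
the range arithmetic only. It is not kernel code and not what the patch
does today: struct range, range_subtract(), trim_soft_reserves() and
MAX_RANGES are all made-up names for illustration, and a real version
would operate on struct resource under the resource lock. Ranges are
inclusive, the way struct resource stores start/end.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define MAX_RANGES 16	/* hypothetical fixed bound, for the sketch only */

/* Inclusive address range (illustrative stand-in for struct resource). */
struct range {
	uint64_t start;
	uint64_t end;
};

/*
 * Remove 'cut' (a created region) from 'sr' (a soft reserve).
 * Writes the 0, 1 or 2 surviving pieces to 'out' and returns the count.
 */
static size_t range_subtract(struct range sr, struct range cut,
			     struct range out[2])
{
	size_t n = 0;

	assert(sr.start <= sr.end && cut.start <= cut.end);

	/* No intersection: the soft reserve survives untouched. */
	if (cut.end < sr.start || cut.start > sr.end) {
		out[n++] = sr;
		return n;
	}
	/* Leftover piece below the region. */
	if (cut.start > sr.start)
		out[n++] = (struct range){ sr.start, cut.start - 1 };
	/* Leftover piece above the region. */
	if (cut.end < sr.end)
		out[n++] = (struct range){ cut.end + 1, sr.end };
	return n;
}

/*
 * Walk all regions and remove the intersecting pieces from the soft
 * reserves. Whatever ends up in 'left' is what could be released and
 * handed to dax. Pieces beyond MAX_RANGES are silently dropped here;
 * real code would have to handle that properly.
 */
static size_t trim_soft_reserves(const struct range *sr, size_t nr_sr,
				 const struct range *regions,
				 size_t nr_regions,
				 struct range left[MAX_RANGES])
{
	struct range tmp[MAX_RANGES];
	size_t nr_left = 0, i, j, k;

	for (i = 0; i < nr_sr && nr_left < MAX_RANGES; i++)
		left[nr_left++] = sr[i];

	for (i = 0; i < nr_regions; i++) {
		size_t nr_tmp = 0;

		for (j = 0; j < nr_left; j++) {
			struct range pieces[2];
			size_t n = range_subtract(left[j], regions[i],
						  pieces);

			for (k = 0; k < n && nr_tmp < MAX_RANGES; k++)
				tmp[nr_tmp++] = pieces[k];
		}
		memcpy(left, tmp, nr_tmp * sizeof(*tmp));
		nr_left = nr_tmp;
	}
	return nr_left;
}
```

With a soft reserve that is larger than the region programmed inside it,
this leaves the pieces on either side of the region; with a soft reserve
that exactly matches a region, nothing is left over.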

-Nathan

> --Alison
> 
>>
>>
>> ~Gregory
>>
>>> +	struct acpi_cedt_cfmws *cfmws;
>>> +	struct srmem_arg *args = arg;
>>> +	struct resource cfmws_res;
>>> +	struct resource *res;
>>> +
>>> +	res = args->res;
>>> +
>>> +	cfmws = (struct acpi_cedt_cfmws *)hdr;
>>> +	cfmws_res = DEFINE_RES_MEM(cfmws->base_hpa,
>>> +				   cfmws->base_hpa + cfmws->window_size);
>>> +
>>> +	if (resource_overlaps(&cfmws_res, res)) {
>>> +		args->overlaps += 1;
>>> +		return 1;
>>> +	}
>>> +
>>> +	return 0;
>>> +}
>>> +
>>> +static bool resource_overlaps_cfmws(struct resource *res)
>>> +{
>>> +	struct srmem_arg arg = {
>>> +		.res = res,
>>> +		.overlaps = 0
>>> +	};
>>> +
>>> +	acpi_table_parse_cedt(ACPI_CEDT_TYPE_CFMWS, srmem_parse_cfmws, &arg);
>>> +
>>> +	if (arg.overlaps)
>>> +		return true;
>>> +
>>> +	return false;
>>> +}
>>> +
>>> +int insert_soft_reserve_resource(struct resource *res)
>>> +{
>>> +	if (resource_overlaps_cfmws(res)) {
>>> +		pr_info("Reserving Soft Reserve %pr\n", res);
>>> +		return insert_resource(&srmem_resource, res);
>>> +	}
>>> +
>>> +	return insert_resource(&iomem_resource, res);
>>> +}
>>> +EXPORT_SYMBOL(insert_soft_reserve_resource);
>>> +
>>>  static void __init
>>>  __reserve_region_with_split(struct resource *root, resource_size_t start,
>>>  			    resource_size_t end, const char *name)
>>> -- 
>>> 2.43.0
>>>




