On 1/30/20 4:39 PM, Suman Anna wrote: > On 1/30/20 3:19 PM, Andrew F. Davis wrote: >> On 1/30/20 3:39 PM, Suman Anna wrote: >>> On 1/30/20 2:22 PM, Andrew F. Davis wrote: >>>> On 1/30/20 2:55 PM, Suman Anna wrote: >>>>> On 1/30/20 1:42 PM, Tero Kristo wrote: >>>>>> On 30/01/2020 21:20, Andrew F. Davis wrote: >>>>>>> On 1/30/20 2:18 PM, Tero Kristo wrote: >>>>>>>> On 30/01/2020 20:11, Andrew F. Davis wrote: >>>>>>>>> On 1/16/20 8:53 AM, Tero Kristo wrote: >>>>>>>>>> From: Suman Anna <s-anna@xxxxxx> >>>>>>>>>> >>>>>>>>>> The reserved memory nodes are not assigned to platform devices by >>>>>>>>>> default in the driver core to avoid the lookup for every platform >>>>>>>>>> device and incur a penalty as the real users are expected to be >>>>>>>>>> only a few devices. >>>>>>>>>> >>>>>>>>>> OMAP remoteproc devices fall into the above category and the OMAP >>>>>>>>>> remoteproc driver _requires_ specific CMA pools to be assigned >>>>>>>>>> for each device at the moment to align on the location of the >>>>>>>>>> vrings and vring buffers in the RTOS-side firmware images. So, >>>>>>>>> >>>>>>>>> >>>>>>>>> Same comment as before, this is a firmware issue for only some >>>>>>>>> firmwares >>>>>>>>> that do not handle being assigned vring locations correctly and instead >>>>>>>>> hard-code them. >>>>> >>>>> As for this statement, this can do with some updating. Post 4.20, >>>>> because of the lazy allocation scheme used for carveouts including the >>>>> vrings, the resource tables now have to use FW_RSC_ADDR_ANY and will >>>>> have to wait for the vdev synchronization to happen. >>>>> >>>>>>>> >>>>>>>> I believe we discussed this topic in length in previous version but >>>>>>>> there was no conclusion on it. >>>>>>>> >>>>>>>> The commit desc might be a bit misleading, we are not actually forced to >>>>>>>> use specific CMA buffers, as we use IOMMU to map these to device >>>>>>>> addresses. For example IPU1/IPU2 use internally exact same memory >>>>>>>> addresses, iommu is used to map these to specific CMA buffer. >>>>>>>> >>>>>>>> CMA buffers are mostly used so that we get aligned large chunk of memory >>>>>>>> which can be mapped properly with the limited IOMMU OMAP family of chips >>>>>>>> have. Not sure if there is any sane way to get this done in any other >>>>>>>> manner. >>>>>>>> >>>>>>> >>>>>>> >>>>>>> Why not use the default CMA area? >>>>>> >>>>>> I think using default CMA area getting the actual memory block is not >>>>>> guaranteed and might fail. There are other users for the memory, and it >>>>>> might get fragmented at the very late phase we are grabbing the memory >>>>>> (omap remoteproc driver probe time.) Some chunks we need are pretty large. >>>>>> >>>>>> I believe I could experiment with this a bit though and see, or Suman >>>>>> could maybe provide feedback why this was designed initially like this >>>>>> and why this would not be a good idea. >>>>> >>>>> I have given some explanation on this on v4 as well, but if it is not >>>>> clear, there are restrictions with using default CMA. Default CMA has >>>>> switched to be assigned from the top of the memory (higher addresses, >>>>> since 3.18 IIRC), and the MMUs on IPUs and DSPs can only address >>>>> 32-bits. So, we cannot blindly use the default CMA pool, and this will >>>>> definitely not work on boards > 2 GB RAM. And, if you want to add in any >>>>> firewall capability, then specific physical addresses becomes mandatory. >>>>> >>>> >>>> >>>> If you need 32bit range allocations then >>>> dma_set_mask(dev, DMA_BIT_MASK(32)); >>>> >>>> I'm not saying don't have support for carveouts, just make them >>>> optional, keystone_remoteproc.c does this: >>>> >>>> if (of_reserved_mem_device_init(dev)) >>>> dev_warn(dev, "device does not have specific CMA pool\n"); >>>> >>>> There doesn't even needs to be a warning but that is up to you. >>> >>> It is not exactly an apples to apples comparison. K2s do not have MMUs, >>> and most of our firmware images on K2 are actually running out of the >>> DSP internal memory. >>> >> >> >> So again we circle back to it being a firmware issue, if K2 can get away >> without needing carveouts and it doesn't even have an MMU then certainly >> OMAP/DRA7x class devices can handle it even better given they *do* have >> an IOMMU. Unless someone is hard-coding the IOMMU configuration.. In >> which case we are still just hacking around the problem here with >> mandatory specific address memory carveouts. > > Optional carveouts on OMAP remoteprocs can be an enhancement in the > future, but at the moment, we won't be able to run use-cases without > this. And I have already given some of the reasons for the same here and > on v4. > No reason to be dismissive, my questions are valid. What "use-cases" are we talking about, I have firmware that doesn't need specific carved-out addresses. If you have misbehaving firmware that needs statically carved out memory addresses then you can have carveouts if you want, but it should be optional. If I don't want to pollute my system's memory space with a bunch of carveout holes then I shouldn't have to just because your specific firmware needs them. Andrew > regards > Suman > >> >> Andrew >> >> >>> regards >>> Suman >>> >>>> >>>> Andrew >>>> >>>> >>>>> regards >>>>> Suman >>>>> >>>>>> >>>>>> -Tero >>>>>> >>>>>>> >>>>>>> Andrew >>>>>>> >>>>>>> >>>>>>>> -Tero >>>>>>>> >>>>>>>>> >>>>>>>>> This is not a requirement of the remote processor itself and so it >>>>>>>>> should not fail to probe if a specific memory carveout isn't given. >>>>>>>>> >>>>>>>>> >>>>>>>>>> use the of_reserved_mem_device_init/release() API appropriately >>>>>>>>>> to assign the corresponding reserved memory region to the OMAP >>>>>>>>>> remoteproc device. Note that only one region per device is >>>>>>>>>> allowed by the framework. >>>>>>>>>> >>>>>>>>>> Signed-off-by: Suman Anna <s-anna@xxxxxx> >>>>>>>>>> Signed-off-by: Tero Kristo <t-kristo@xxxxxx> >>>>>>>>>> Reviewed-by: Bjorn Andersson <bjorn.andersson@xxxxxxxxxx> >>>>>>>>>> --- >>>>>>>>>> v5: no changes >>>>>>>>>> >>>>>>>>>> drivers/remoteproc/omap_remoteproc.c | 12 +++++++++++- >>>>>>>>>> 1 file changed, 11 insertions(+), 1 deletion(-) >>>>>>>>>> >>>>>>>>>> diff --git a/drivers/remoteproc/omap_remoteproc.c >>>>>>>>>> b/drivers/remoteproc/omap_remoteproc.c >>>>>>>>>> index 0846839b2c97..194303b860b2 100644 >>>>>>>>>> --- a/drivers/remoteproc/omap_remoteproc.c >>>>>>>>>> +++ b/drivers/remoteproc/omap_remoteproc.c >>>>>>>>>> @@ -17,6 +17,7 @@ >>>>>>>>>> #include <linux/module.h> >>>>>>>>>> #include <linux/err.h> >>>>>>>>>> #include <linux/of_device.h> >>>>>>>>>> +#include <linux/of_reserved_mem.h> >>>>>>>>>> #include <linux/platform_device.h> >>>>>>>>>> #include <linux/dma-mapping.h> >>>>>>>>>> #include <linux/remoteproc.h> >>>>>>>>>> @@ -480,14 +481,22 @@ static int omap_rproc_probe(struct >>>>>>>>>> platform_device *pdev) >>>>>>>>>> if (ret) >>>>>>>>>> goto free_rproc; >>>>>>>>>> + ret = of_reserved_mem_device_init(&pdev->dev); >>>>>>>>>> + if (ret) { >>>>>>>>>> + dev_err(&pdev->dev, "device does not have specific CMA >>>>>>>>>> pool\n"); >>>>>>>>>> + goto free_rproc; >>>>>>>>>> + } >>>>>>>>>> + >>>>>>>>>> platform_set_drvdata(pdev, rproc); >>>>>>>>>> ret = rproc_add(rproc); >>>>>>>>>> if (ret) >>>>>>>>>> - goto free_rproc; >>>>>>>>>> + goto release_mem; >>>>>>>>>> return 0; >>>>>>>>>> +release_mem: >>>>>>>>>> + of_reserved_mem_device_release(&pdev->dev); >>>>>>>>>> free_rproc: >>>>>>>>>> rproc_free(rproc); >>>>>>>>>> return ret; >>>>>>>>>> @@ -499,6 +508,7 @@ static int omap_rproc_remove(struct >>>>>>>>>> platform_device *pdev) >>>>>>>>>> rproc_del(rproc); >>>>>>>>>> rproc_free(rproc); >>>>>>>>>> + of_reserved_mem_device_release(&pdev->dev); >>>>>>>>>> return 0; >>>>>>>>>> } >>>>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>> >>>>>> -- >>>>>> Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki. >>>>>> Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki >>>>> >>> >