Re: [RFC net-next 2/2] net: phy: bcm7xxx: request and manage GPHY clock

Florian Fainelli <f.fainelli@xxxxxxxxx> · Thu, 3 Sep 2020 12:35:12 -0700

On 9/3/2020 12:21 PM, Adam Rudziński wrote:

W dniu 2020-09-03 o 19:17, Florian Fainelli pisze:

On 9/3/2020 10:13 AM, Adam Rudziński wrote:

W dniu 2020-09-03 o 17:21, Florian Fainelli pisze:

On 9/2/2020 11:00 PM, Adam Rudziński wrote:

W dniu 2020-09-03 o 04:13, Florian Fainelli pisze:

On 9/2/2020 3:20 PM, Andrew Lunn wrote:
+    priv->clk = devm_clk_get_optional(&phydev->mdio.dev, 
"sw_gphy");
+    if (IS_ERR(priv->clk))
+        return PTR_ERR(priv->clk);
+
+    /* To get there, the mdiobus registration logic already 
enabled our
+     * clock otherwise we would not have probed this device 
since we would
+     * not be able to read its ID. To avoid artificially 
bumping up the
+     * clock reference count, only do the clock enable from a 
phy_remove ->
+     * phy_probe path (driver unbind, then rebind).
+     */
+    if (!__clk_is_enabled(priv->clk))
+        ret = clk_prepare_enable(priv->clk);

This i don't get. The clock subsystem does reference counting. So 
what
i would expect to happen is that during scanning of the bus, phylib
enables the clock and keeps it enabled until after probe. To keep
things balanced, phylib would disable the clock after probe.

That would be fine, although it assumes that the individual PHY 
drivers have obtained the clocks and called clk_prepare_enable(), 
which is a fair assumption I suppose.

If the driver wants the clock enabled all the time, it can enable it
in the probe method. The common clock framework will then have two
reference counts for the clock, so that when the probe exists, and
phylib disables the clock, the CCF keeps the clock ticking. The PHY
driver can then disable the clock in .remove.

But then the lowest count you will have is 1, which will lead to 
the clock being left on despite having unbound the PHY driver from 
the device (->remove was called). This does not allow saving any 
power unfortunately.

There are some PHYs which will enumerate with the clock disabled. 
They
only need it ticking for packet transfer. Such PHY drivers can 
enable
the clock only when needed in order to save some power when the
interface is administratively down.

Then the best approach would be for the OF scanning code to enable 
all clocks reference by the Ethernet PHY node (like it does in the 
proposed patch), since there is no knowledge of which clock is 
necessary and all must be assumed to be critical for MDIO bus 
scanning. Right before drv->probe() we drop all resources 
reference counts, and from there on ->probe() is assumed to manage 
the necessary clocks.

It looks like another solution may be to use the assigned-clocks 
property which will take care of assigning clock references to 
devices and having those applied as soon as the clock provider is 
available.

Hi Guys,

I've just realized that a PHY may also have a reset signal 
connected. The reset signal may be controlled by the dedicated 
peripheral or by GPIO.

There is already support for such a thing within 
drivers/net/phy/mdio_bus.c though it assumes we could bind the PHY 
device to its driver already.

In general terms, there might be a set of control signals needed to 
enable the PHY. It seems that the clock and the reset would be the 
typical useful options.

Going further with my imagination of how evil the hardware design 
could be, in general the signals for the PHY may have some 
relations to other control signals.

I think that from the software point of view this comes down to 
assumption that the PHY is to be controlled "driver only knows how".

That is all well and good as long as we can actually bind the PHY 
device which its driver, and right now this means that we either have:

- a compatible string in Device Tree which is of the form 
ethernet-phy-id%4x.%4x (see of_get_phy_id) which means that we 
*know* already which PHY we have and we avoid doing reads of 
MII_PHYSID1 and MII_PHYSID2. This is a Linux implementation detail 
that should not have to be known to systems designer IMHO

- a successful read of MII_PHYSID1 and MII_PHYSID2 (or an equivalent 
for the clause 45 PHYs) that allows us to know what PHY device we 
have, which is something that needs to happen eventually.

The problem is when there are essential resources such as clocks, 
regulators, reset signals that must be enabled, respectively 
de-asserted in order for a successful MDIO read of MII_PHYSID1 and 
MII_PHYSID2 to succeed.

There is no driver involvement at that stage because we have no 
phy_device to bind it to *yet*. Depending on what we read from 
MII_PHYSID1/PHY_ID2 we will either successfully bind to the Generic 
PHY driver (assuming we did not read all Fs) or not and we will 
return -ENODEV and then it is game over.

This is the chicken and egg problem that this patch series is 
addressing, for clocks, because we can retrieve clock devices with 
just a device_node reference.

I have an impression that here the effort goes in the wrong 
direction. If I understand correctly, the goal is to have the kernel 
find out what will the driver need to use the phy. But, the kernel 
anyway calls a probe function of the driver, doesn't it? To me it 
looks as if you were trying to do something that the driver 
will/would/might do later, and possibly "undo" it in the meantime. In 
this regard, this becomes kind of a workaround, not solution of the 
problem. Also, having taken a glance at your previous messages, I can 
tell that this is all becoming even more complex.

What is the problem according to you, and what would an acceptable 
solution look like then?

I think that the effort should be to allow any node in the device 
tree to take care about its child nodes by itself, and just "report" 
to the kernel, or "install" in the kernel whatever is necessary, but 
without any initiative of the kernel. Let the drivers be as 
complicated as necessary, not the kernel.

The Device Tree representation is already correct in that it lists 
clocks/regulators/reset signals that are needed by the PHY. The 
problem is its implementation with the Linux device driver model. 
Please read again what I wrote.

The problem which I had, was that kernel was unable to read ID of second 
PHY, because it needed to have the clock enabled, and during probing it 
still didn't have. I don't know what problems are addressed by the 
discussed patches - only the same one, or if something else too.

That is precisely the problem being addressed here, an essential clock 
to the PHY must be enabled for the PHY to be accessible on the MDIO bus.

In my solution, of my problem, which now works well for me, instead of 
teaching kernel any new tricks, I've taught the kernel to listen to the 
driver, by adding a function allowing the driver to register its PHY on 
the MDIO bus at any time. Then, each driver instance (for a particular 
interface) configures whatever is necessary, finds its PHY when probing 
the shared MDIO bus, and tells the kernel(?) to add it to the shared 
MDIO bus. Since each one does that, when the time comes, all PHYs are 
known and all interfaces are up.

Except this is bending the Linux driver model backwards, a driver does 
not register devices, a bus does, and then it binds the driver to that 
device object. For the bus to scan a hardware device, said hardware 
device must be in a functional state to be discovered.

This is just an example. It doesn't need the kernel to mimic the driver. 
But, it requires a bit different structure of the device tree, and I 
guess I'm not aware of tons of reasons for which it's not "the good 
way". Sorry, I can't be more constructive here, I don't have that much 
experience with the kernel. The idea is simple: the driver does the job, 
not the kernel.
You know much better than me how Linux works, so you decide if this 
makes sense, or not, and what does this actually mean in the context of 
the system.

Concerning invalid PHY ID, all Fs are read when the MDIO bus works, but 
the PHY is inactive. Another invalid ID is all 0s. I have seen this 
value when the MDIO bus didn't have the pins assigned, so its signals 
were "trapped" in the processor. Another cause could be a physical fault 
(short?) on the bus. Maybe this case should end up in returning -ENODEV 
too.

Read failures are already treated as such, there is no need to change 
anything here.
--
Florian