[PATCH] i2c: rk3x: adjust the LOW divison based on characteristics of SCL

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Addy,

On Wed, Sep 24, 2014 at 1:23 AM, addy ke <addy.ke at rock-chips.com> wrote:
>
>
> On 2014/9/24 12:10, Doug Anderson wrote:
>> Addy,
>>
>> On Tue, Sep 23, 2014 at 6:55 PM, Addy Ke <addy.ke at rock-chips.com> wrote:
>>> As show in I2C specification:
>>> - Standard-mode:
>>>   the minimum HIGH period of the scl clock is 4.0us
>>>   the minimum LOW period of the scl clock is 4.7us
>>> - Fast-mode:
>>>   the minimum HIGH period of the scl clock is 0.6us
>>>   the minimum LOW period of the scl clock is 1.3us
>>> - Fast-mode plus:
>>>   the minimum HIGH period of the scl clock is 0.26us
>>>   the minimum LOW period of the scl clock is 0.5us
>>> - HS-mode(<1.7MHz):
>>>   the minimum HIGH period of the scl clock is 0.12us
>>>   the minimum LOW period of the scl clock is 0.32us
>>> - HS-mode(<3.4MHz):
>>>   the minimum HIGH period of the scl clock is 0.06us
>>>   the minimum LOW period of the scl clock is 0.16us
>>>
>>> I have measured i2c SCL waveforms in fast-mode by oscilloscope
>>> on rk3288-pinky board. the LOW period of the scl clock is 1.3us.
>>> It is so critical that we must adjust LOW division to increase
>>> the LOW period of the scl clock.
>>>
>>> Thanks Doug for the suggestion about division formula.
>>>
>>> Signed-off-by: Addy Ke <addy.ke at rock-chips.com>
>>> ---
>>>  drivers/i2c/busses/i2c-rk3x.c | 79 +++++++++++++++++++++++++++++++++++++++----
>>>  1 file changed, 72 insertions(+), 7 deletions(-)
>>>
>>> diff --git a/drivers/i2c/busses/i2c-rk3x.c b/drivers/i2c/busses/i2c-rk3x.c
>>> index 93cfc83..49d67b7 100644
>>> --- a/drivers/i2c/busses/i2c-rk3x.c
>>> +++ b/drivers/i2c/busses/i2c-rk3x.c
>>> @@ -428,18 +428,83 @@ out:
>>>         return IRQ_HANDLED;
>>>  }
>>>
>>> +static void rk3x_i2c_get_ratios(unsigned long scl_rate,
>>> +                               unsigned long *high_ratio,
>>> +                               unsigned long *low_ratio)
>>> +{
>>> +       /* As show in I2C specification:
>>> +        * - Standard-mode:
>>> +        *   the minimum HIGH period of the scl clock is 4.0us
>>> +        *   the minimum LOW period of the scl clock is 4.7us
>>> +        * - Fast-mode:
>>> +        *   the minimum HIGH period of the scl clock is 0.6us
>>> +        *   the minimum LOW period of the scl clock is 1.3us
>>> +        * - Fast-mode plus:
>>> +        *   the minimum HIGH period of the scl clock is 0.26us
>>> +        *   the minimum LOW period of the scl clock is 0.5us
>>> +        * - HS-mode(<1.7MHz):
>>> +        *   the minimum HIGH period of the scl clock is 0.12us
>>> +        *   the minimum LOW period of the scl clock is 0.32us
>>> +        * - HS-mode(<3.4MHz):
>>> +        *   the minimum HIGH period of the scl clock is 0.06us
>>> +        *   the minimum LOW period of the scl clock is 0.16us
>>
>> Is the rest of the driver ready for Fast-mode plus or HS mode?  If not
>> then maybe leave those off?  If nothing else the commit message should
>> indicate that this is just being forward thinking.
>>
>>> +        */
>>> +       if (scl_rate <= 100000) {
>>> +               *high_ratio = 40;
>>> +               *low_ratio = 47;
>>> +       } else if (scl_rate <= 400000) {
>>> +               *high_ratio = 6;
>>> +               *low_ratio = 13;
>>> +       } else if (scl_rate <= 1000000) {
>>> +               *high_ratio = 26;
>>> +               *low_ratio = 50;
>>> +       } else if (scl_rate <= 1700000) {
>>> +               *high_ratio = 12;
>>> +               *low_ratio = 32;
>>> +       } else {
>>> +               *high_ratio = 6;
>>> +               *low_ratio = 16;
>>
>> Since it's only the ratio of high to low that matters, you can combine
>> the last two.  12 : 32 == 6 : 16
>>
>>> +       }
>>> +}
>>> +
>>> +static void rk3x_i2c_calc_divs(unsigned long i2c_rate, unsigned long scl_rate,
>>> +                              unsigned long *divh, unsigned long *divl)
>>> +{
>>> +       unsigned long high_ratio, low_ratio;
>>> +       unsigned long ratio_sum;
>>> +
>>> +       rk3x_i2c_get_ratios(scl_rate, &high_ratio, &low_ratio);
>>> +       ratio_sum = high_ratio + low_ratio;
>>> +
>>> +       /* T_high = T_clk * (divh + 1) * 8
>>> +        * T_low = T_clk * (divl + 1) * 8
>>> +        * T_scl = T_high + T_low
>>> +        * T_scl = 1 / scl_rate
>>> +        * T_clk = 1 / i2c_rate
>>> +        * T_high : T_low = high_ratio : low_ratio
>>> +        * ratio_sum = high_ratio + low_ratio
>>> +        *
>>> +        * so:
>>> +        * divh = (i2c_rate * high_ratio) / (scl_rate * ratio_sum * 8) - 1
>>> +        * divl = (i2c_rate * low_ratio) / (scl_rate * ratio_sum * 8) - 1
>>> +        */
>>> +       *divh = DIV_ROUND_UP(i2c_rate * high_ratio, scl_rate * ratio_sum * 8);
>>> +       if (*divh)
>>> +               *divh = *divh - 1;
>>> +
>>> +       *divl = DIV_ROUND_UP(i2c_rate * low_ratio, scl_rate * ratio_sum * 8);
>>> +       if (*divl)
>>> +               *divl = *divl - 1;
>>
>> When I sent you the sample formulas I purposely did it differently
>> than this.  Any reason you changed from my formulas?
>>
>>   div_low = DIV_ROUND_UP(clk_rate * low_ratio, scl_rate * 8 * ratio_sum)
>>   div_high = DIV_ROUND_UP(clk_rate, scl_rate * 8) - div_low
>>
>>   div_low -= 1
>>   if div_high:
>>     div_high -= 1
>>
>> Why did I do it that way?
>>
>> * Assuming i2c_rate and the ratio is non-zero then you can assume that
>> DIV_ROUND_UP gives a value that is >= 1.  No need to test the result
>> against 0.
>>
>> * (I think) you'll get a more accurate clock rate by subtracting.
>>
>> Try running your formula vs. my formula with a ratio of 13 : 6, an i2c
>> rate of 12800000, and an scl rate of 400000
>>
>> Mine will get:
>>   Req = 400000, act = 400000, 1.88 us low, 0.62 us high, low/high = 3.00
>>
>> Yours will get:
>>   Req = 400000, act = 320000, 1.88 us low, 1.25 us high, low/high = 1.50
>>
> yes, you are right. yours is closer to the scl clock what we want to set.
>
> But if (clk_rate * low_ratio) can not be divisible by (scl_rate * 8 * ratio_sum),
> div_low will be round up, and div _high will be round down.
> The gap between div_low and div_high is increased.

Is that important?  As far as I can tell as long as we are meeting the
minimum requirements for low and high hold times then we're OK.


> so maybe we can set:
> div_high = DIV_ROUND_UP(clk_rate * high_ratio, scl_rate * 8 * ratio_sum)
> div_low = DIV_ROUND_UP(clk_rate, scl_rate * 8) - div_low
>
> i2c rate is 128Mhz:
> 1) calculate div_high first:
> div_high = DIV_ROUND_UP(clk_rate * high_ratio, scl_rate * 8 * ratio_sum)
> div_low = DIV_ROUND_UP(clk_rate, scl_rate * 8) - div_low
>
> req = 400000, act = 400000, div_high = 13, div_low = 27
>
> 2) calculate div_low first:
> div_low = DIV_ROUND_UP(clk_rate * low_ratio, scl_rate * 8 * ratio_sum)
> div_high = DIV_ROUND_UP(clk_rate, scl_rate * 8) - div_low
>
> req = 400000, act = 400000, div_high = 12, div_high = 28
>
> I think that the first is more appropriate.

I guess I was biasing towards making low a little longer instead of
biasing towards keeping high and low the same.  I'm not sure the bias
is terribly critical.

In any case, I spent more time testing and I realized that my old
formulas could potentially violate hold times because I wasn't careful
enough.  I've crafted some new code that is more careful to ensure
_both_ minimum hold times and minimum clock rate times.

Do you want to spin your patch with these new formulas, or do you
think I should?


(anyone else on the thread listening, feel free to comment).

---

def test_it(min_low_ns, min_high_ns, clk_rate, scl_rate):
  min_total_ns = min_low_ns + min_high_ns

  # We need the total div to be >= this number so we don't clock too fast.
  min_total_div = DIV_ROUND_UP(clk_rate, scl_rate * 8);

  # These are the min dividers needed for hold times.
  min_low_div = DIV_ROUND_UP(clk_rate * min_low_ns, 8 * 1000000000)
  min_high_div = DIV_ROUND_UP(clk_rate * min_high_ns, 8 * 1000000000)
  min_div_for_hold = (min_low_div + min_high_div)

  if min_div_for_hold > min_total_div:
    # Time needed to meet hold requirements is important.  Just use that
    div_low = min_low_div
    div_high = min_high_div
  else:
    # We've got to distribute some time among the low and high so we
    # don't run too fast.
    extra_div = min_total_div - min_div_for_hold

    # We'll try to split things up perfectly evenly, biasing slightly
    # towards having a higher div for low (spend more time low).
    ideal_low_div = DIV_ROUND_UP(clk_rate * min_low_ns,
                                 scl_rate * 8 * min_total_ns)

    # Handle when the ideal low div is going to take up more than we have
    if ideal_low_div > min_low_div + extra_div:
      assert ideal_low_div == min_low_div + extra_div + 1
      ideal_low_div = min_low_div + extra_div

    # Give low the "ideal" and give high whatever extra is left.
    div_low = ideal_low_div
    div_high = min_high_div + (extra_div - (ideal_low_div - min_low_div))

  # Adjust to the fact that the hardware has an implicit "+1".
  # NOTE: Above calculations always produce div_low > 0 and  div_high > 0.
  div_low -= 1
  div_high -= 1

  T_pclk_us = 1000000. / clk_rate
  T_sclk_us = 1000000. / scl_rate

  T_low_us = T_pclk_us * (div_low + 1) * 8
  T_high_us = T_pclk_us * (div_high + 1) * 8

  T_tot_us = (T_high_us + T_low_us)
  freq = 1000000. / T_tot_us

  if T_low_us * 1000 < min_low_ns:
    print "ERROR: not low long enough"
  if T_high_us * 1000 < min_high_ns:
    print "ERROR: not high long enough"

  print "CLK = %d, Req = %d, act = %.2f, %.2f us low, " \
        "%.2f us high, low/high = %.2f" % (
        clk_rate, scl_rate, freq, T_low_us,
        T_high_us, T_low_us / T_high_us)

  return (clk_rate, scl_rate, freq, T_low_us, T_high_us)

test_it(4700, 4000, 1484000, 100000)
test_it(4700, 4000, 2000001, 100000)
test_it(4700, 4000, 74250000, 99799)
test_it(4700, 4000, 74250000, 99798)
test_it(4700, 4000, 74250000, 99797)
test_it(4700, 4000, 74250000, 100000)

test_it(1300, 600,   5000000, 400000)
test_it(1300, 600,   9400000, 400000)

test_it(1300,  600, 74250000, 400000)
test_it(1300,  600, 12800000, 400000)
test_it(1300,  600,  6400000, 400000)
test_it(1300,  600,  3200000, 400000)
test_it(1300,  600,  1600000, 400000)
test_it(1300,  600,   800000, 400000)

for i in xrange(800000, 74250000, 100):
  test_it(4700, 4000, i, 100000)

for i in xrange(800000, 74250000, 100):
  test_it(1300, 600, i, 400000)



[Index of Archives]     [LM Sensors]     [Linux Sound]     [ALSA Users]     [ALSA Devel]     [Linux Audio Users]     [Linux Media]     [Kernel]     [Gimp]     [Yosemite News]     [Linux Media]

  Powered by Linux