Re: [PATCH] ALSA: usb: improve delay reporting

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 4/5/18 6:49 AM, Georg Chini wrote:
On 04.04.2018 21:24, Pierre-Louis Bossart wrote:
On 04/04/2018 08:57 AM, Georg Chini wrote:

Currently, the kernel uses only the number of USB frames to interpolate the delay between sending or receiving URB's. This limits the granuarity of the delay reports to one millisecond because the frame counter is updated in a millisecond interval.

This patch additionally uses time stamps to achieve higher precision of the reported delay. If the time difference computed using time stamps does not deviate more than one millisecond from the difference computed using frames, it is assumed that the
system time difference delivers the more precise delay estimate.

This will significantly increase the precision of the reported delay in the majority of cases where sound card clock and system clock are almost in sync while falling back to the old method for the rare cases where the two times differ too much.
I am all for better delay estimates but I have mixed feelings about this - and some hardware-related parts are broken.

Well, I only stumbled into this because of the recent delay estimation issues in pulseaudio, so forgive my ignorance about ALSA. If the general idea of the patch is not right, just ignore
my other comments and drop the patch.

It's complicated to change this part of the code. When I added the use of the frame counter some 5 years back, it created all sorts of issues on various platforms that I was not even aware of.

Your implementation raises obvious questions and would need quite a bit of revalidation. In its current state it's not ready for prime time.



The frame counter is already an approximation of the actual delay which doesn't really account for the type of USB interface (Isoc, Async) used on the device side. You'd be adding a lot of precision without really solving the fact that you don't know what the actual playback rate is. You'd really need a time smoother that models playback rate v. system time if you want to be fully accurate below the ms level for all types of USB audio devices.

The goal is not to be fully accurate, only to be somewhat better than the current estimate.

Define 'better'. What is the goal and the expected benefit for applications?

I'll copy here what I already said to Takashi in private communication:

The mismatch between sound card time and system time should not matter on the timescale we are talking about. Let's say we are using 10 USB frames (which is more than I have seen while debugging), so hw_ptr is updated every 10 ms. Assume that the deviation between sound card and system time is 1% (which again should be more than you will ever see). Then using system time, the computed delay can be off by 100 microseconds at maximum, while the delay computed using USB frames
can be off by 2ms. Therefore using system time should be acceptable in that
situation because it delivers higher precision results as can be achieved using
USB frames.

For the rare cases where system time and sound card time are completely different,
the patch falls back to the original method.


In addition I've only seen drivers report the delay based on hardware counters. If we start doing interpolations based on system time, then why not do it for other devices as well with a fix at the core level

I could not do that, I don't have enough ALSA knowledge. A motivation to improve the delay reports for USB devices only could be that they are probably the most
common devices apart from HDA.

Exactly, why not do this for HDAudio then...



And last, I think you have an issue for the pause case and you cannot remove the code that limits the frame counter to 8 bits, this will not work across multiple versions of [e|x]HCI controllers, see below.


As a downside, the possible worst case error increases from 2ms to 3ms.
---
  sound/usb/card.h |  1 +
  sound/usb/pcm.c  | 47 +++++++++++++++++++++++++++++++----------------
  2 files changed, 32 insertions(+), 16 deletions(-)

diff --git a/sound/usb/card.h b/sound/usb/card.h
index ed87cc83eb47..8835d9628cac 100644
--- a/sound/usb/card.h
+++ b/sound/usb/card.h
@@ -149,6 +149,7 @@ struct snd_usb_substream {
        int last_frame_number;          /* stored frame number */
      int last_delay;                 /* stored delay */
+    struct timespec last_update_time;    /* time when last_delay was updated */
        struct {
          int marker;
diff --git a/sound/usb/pcm.c b/sound/usb/pcm.c
index 3cbfae6604f9..6d40cb0b8777 100644
--- a/sound/usb/pcm.c
+++ b/sound/usb/pcm.c
@@ -46,9 +46,8 @@ snd_pcm_uframes_t snd_usb_pcm_delay(struct snd_usb_substream *subs,
      int current_frame_number;
      int frame_diff;
      int est_delay;
-
-    if (!subs->last_delay)
-        return 0; /* short path */

NAK. This was added on purpose by 48779a0b8ffc ('ALSA: usb-audio: fix delay account during pause') And since you kept the code that sets last_delay there is no reason to take this out.

OK, I did not know that this is needed. I took it out because otherwise the delay reported for capture will always be 0. retire_capture_urb() sets last_delay to 0. Does the problem that was fixed in the commit  only apply to playback streams?
Then I could change the if condition accordingly.

That's precisely the issue: the code was contributed by multiple people to work on multiple platforms. What parts do what exactly is complicated to reverse-engineer.



+    int time_diff;
+    struct timespec now;
        current_frame_number = usb_get_current_frame_number(subs->dev);
      /*
@@ -58,9 +57,21 @@ snd_pcm_uframes_t snd_usb_pcm_delay(struct snd_usb_substream *subs,
       */
      frame_diff = (current_frame_number - subs->last_frame_number) & 0xff;
  -    /* Approximation based on number of samples per USB frame (ms),
-       some truncation for 44.1 but the estimate is good enough */
-    est_delay =  frame_diff * rate / 1000;
+    /* Estimate time since last update of last_delay */
+    snd_pcm_gettime(subs->pcm_substream->runtime, &now);
+    time_diff = (now.tv_sec - subs->last_update_time.tv_sec) * 1000000000;
+    time_diff += (int)now.tv_nsec - subs->last_update_time.tv_nsec;
+    time_diff /= 1000;
+
+    if (abs(time_diff - frame_diff * 1000) < 1000) {
that's a 1ms error, this is quite high IMO to assess a divergence between system and frame deltas

Because time_diff is used to interpolate between frames and frames are updated every millisecond, it is perfectly normal if time_diff is for example 900 microseconds larger than frame_diff * 1000. Only when the difference is larger than 1 ms we can conclude that something is wrong. That is the reason why the patch increases the worst case error from 2 ms to 3 ms.

Yeah, well, given that the actual playback rate is not aligned with the frame counter for async interfaces, this test is questionable.



+        /* System time and sound card time do not differ
+         * too much. Use system time to calculate delay */
+        est_delay =  time_diff * rate / 1000000;
+    } else {
+        /* Approximation based on number of samples per USB frame (ms),
+           some truncation for 44.1 but the estimate is good enough */
+        est_delay =  frame_diff * rate / 1000;
+    }
      if (subs->direction == SNDRV_PCM_STREAM_PLAYBACK)
          est_delay = subs->last_delay - est_delay;
      else
@@ -853,6 +864,8 @@ static int snd_usb_pcm_prepare(struct snd_pcm_substream *substream)
      subs->hwptr_done = 0;
      subs->transfer_done = 0;
      subs->last_delay = 0;
+    subs->last_update_time.tv_sec = 0;
+    subs->last_update_time.tv_nsec = 0;
      subs->last_frame_number = 0;
      runtime->delay = 0;
  @@ -1334,7 +1347,9 @@ static void retire_capture_urb(struct snd_usb_substream *subs,
            /* realign last_frame_number */
          subs->last_frame_number = current_frame_number;
-        subs->last_frame_number &= 0xFF; /* keep 8 LSBs */

You can't remove this, this models the fact that only 8 bits are valid in the frame counter.

If the high order bits are not arbitrary, this is not necessary because the value is only used in snd_usb_pcm_delay() and (a - b) & 0xff = (a - b & 0xff) & 0xff. You can prove this by writing a and b as a = 256 * a1 + a2 and b = 256 * b1 + b2 with a2
and b2 smaller than 256.

the number of valid bits varies in different controllers so the goal was to align on the common denominator - 8 bits only.



+
+        /* Take time stamp for delay accounting */
+        snd_pcm_gettime(runtime, &subs->last_update_time);
            spin_unlock_irqrestore(&subs->lock, flags);
          /* copy a data chunk */
@@ -1550,7 +1565,9 @@ static void prepare_playback_urb(struct snd_usb_substream *subs,
        /* realign last_frame_number */
      subs->last_frame_number = usb_get_current_frame_number(subs->dev);
-    subs->last_frame_number &= 0xFF; /* keep 8 LSBs */
same here, can't do this

+
+    /* Take time stamp for delay accounting */
+    snd_pcm_gettime(runtime, &subs->last_update_time);
        if (subs->trigger_tstamp_pending_update) {
          /* this is the first actual URB submitted,
@@ -1597,6 +1614,12 @@ static void retire_playback_urb(struct snd_usb_substream *subs,
          subs->last_delay -= processed;
      runtime->delay = subs->last_delay;
  +    /* realign last_frame_number */
+    subs->last_frame_number = usb_get_current_frame_number(subs->dev);

need to filter on 8 bits..
+
+    /* Take time stamp for delay accounting */
+    snd_pcm_gettime(runtime, &subs->last_update_time);
+
      /*
       * Report when delay estimate is off by more than 2ms.
       * The error should be lower than 2ms since the estimate relies
@@ -1607,14 +1630,6 @@ static void retire_playback_urb(struct snd_usb_substream *subs,
              "delay: estimated %d, actual %d\n",
              est_delay, subs->last_delay);
  -    if (!subs->running) {
-        /* update last_frame_number for delay counting here since
-         * prepare_playback_urb won't be called during pause
-         */
-        subs->last_frame_number =
-            usb_get_current_frame_number(subs->dev) & 0xff;
-    }
-
and why is this removed.

This is removed because the last_frame_number is now always updated,
so it is no longer necessary.

makes no sense to me.


   out:
      spin_unlock_irqrestore(&subs->lock, flags);
  }


_______________________________________________
Alsa-devel mailing list
Alsa-devel@xxxxxxxxxxxxxxxx
http://mailman.alsa-project.org/mailman/listinfo/alsa-devel

_______________________________________________
Alsa-devel mailing list
Alsa-devel@xxxxxxxxxxxxxxxx
http://mailman.alsa-project.org/mailman/listinfo/alsa-devel




[Index of Archives]     [ALSA User]     [Linux Audio Users]     [Kernel Archive]     [Asterisk PBX]     [Photo Sharing]     [Linux Sound]     [Video 4 Linux]     [Gimp]     [Yosemite News]

  Powered by Linux