On Wed, 2015-02-25 at 19:43 +0100, Georg Chini wrote:
> as well as the way of reacting to sink input or source output moving.
>
> The goal is to make sure that the initial latency of the system matches
> the configured one.
>
> While at it, allow to set adjust_time to 0, which means "no adjustments
> at all".
> ---
>  src/modules/module-loopback.c | 329 +++++++++++++++++++++++++++++++++++-------
>  1 file changed, 275 insertions(+), 54 deletions(-)
>
> diff --git a/src/modules/module-loopback.c b/src/modules/module-loopback.c
> index 79bd106..09f2f58 100644
> --- a/src/modules/module-loopback.c
> +++ b/src/modules/module-loopback.c
> @@ -59,7 +59,9 @@ PA_MODULE_USAGE(
>  
>  #define DEFAULT_LATENCY_MSEC 200
>  
> -#define MEMBLOCKQ_MAXLENGTH (1024*1024*16)
> +#define MEMBLOCKQ_MAXLENGTH (1024*1024*32)

Why is this change made? Whatever the reason is, shouldn't it be done in a
separate patch?

> +
> +#define DEFAULT_BUFFER_MARGIN_MSEC 20
>  
>  #define DEFAULT_ADJUST_TIME_USEC (10*PA_USEC_PER_SEC)
>  
> @@ -80,11 +82,21 @@ struct userdata {
>  
>      int64_t recv_counter;
>      int64_t send_counter;
> +    uint32_t sink_adjust_counter;
> +    uint32_t source_adjust_counter;
>  
> -    size_t skip;
>      pa_usec_t latency;
> +    pa_usec_t buffer_latency;
> +    pa_usec_t initial_buffer_latency;
> +    pa_usec_t configured_sink_latency;
> +    pa_usec_t configured_source_latency;

It would be nice to have comments about what each of these different latency
variables means. I'm trying to understand what buffer_latency is... I suppose
it's used to configure the buffering so that the sink, source and
buffer_latency together equal u->latency (i.e. the user-configured loopback
latency). It's 1/4 of u->latency by default, so I guess you try to configure
the sink and source latencies to be 3/8 of u->latency each? buffer_latency
can't be less than 1.667 ms, however. (Where does that number come from?)
Hmm...
When configuring the sink and source latencies, you divide u->latency by 3,
which is consistent with the old code, so maybe it was a typo to divide
u->latency by 4 when setting the default buffer_latency? I'll assume it was a
typo, and that you meant all three latency components to be one third of
u->latency by default.

buffer_latency can't be less than 3/4 of the sink or source latency (where
does that 3/4 come from?), so if the sink or source latency exceeds that
limit, buffer_latency is raised accordingly. That's with dynamic latency
support. If the sink or source doesn't support dynamic latency,
buffer_latency is raised to default_fragment_size_msec + 20 ms. I don't think
it makes sense to use default_fragment_size_msec; that variable is not
guaranteed to have any relation to the sink/source behaviour. Something
derived from max_request would probably be appropriate for sinks. I'm not
sure about sources, maybe the fixed_latency variable could be used.

> +
> +    pa_usec_t source_latency_sum;
> +    pa_usec_t sink_latency_sum;
>      bool in_pop;
> +    bool pop_called;
> +    bool source_sink_changed;
>  
>      struct {
>          int64_t send_counter;
> @@ -189,13 +201,20 @@ static uint32_t rate_controller(
>  static void adjust_rates(struct userdata *u) {
>      size_t buffer;
>      uint32_t old_rate, base_rate, new_rate;
> -    pa_usec_t final_latency, current_buffer_latency, current_latency, corrected_latency;
> +    pa_usec_t final_latency, source_sink_latency, current_buffer_latency, current_latency, corrected_latency;
>      int32_t latency_difference;
>      pa_usec_t snapshot_delay;
>  
>      pa_assert(u);
>      pa_assert_ctl_context();
>  
> +    u->sink_adjust_counter +=1;
> +    u->source_adjust_counter +=1;
> +
> +    /* Latency sums */
> +    u->source_latency_sum += u->latency_snapshot.source_latency;
> +    u->sink_latency_sum += u->latency_snapshot.sink_latency;
> +
>      /* Rates and latencies*/
>      old_rate = u->sink_input->sample_spec.rate;
>      base_rate = u->source_output->sample_spec.rate;
> @@ -210,10 +229,12 @@ static void adjust_rates(struct userdata *u) {
>      snapshot_delay = u->latency_snapshot.source_timestamp - u->latency_snapshot.sink_timestamp;
>      current_latency = u->latency_snapshot.sink_latency + current_buffer_latency + base_rate * u->latency_snapshot.source_latency / old_rate - snapshot_delay;
>  
> -    final_latency = u->latency;
> -
>      /* Latency and latency difference at base rate */
>      corrected_latency = u->latency_snapshot.source_latency + (u->latency_snapshot.sink_latency + current_buffer_latency) * old_rate / base_rate - snapshot_delay;
> +
> +    source_sink_latency = u->sink_latency_sum / u->sink_adjust_counter +
> +                          u->source_latency_sum / u->source_adjust_counter;
> +    final_latency = PA_MAX(u->latency, source_sink_latency + u->buffer_latency);
>      latency_difference = (int32_t)(corrected_latency - final_latency);

If I understand correctly, here you fix the target latency to something more
sensible if the user-configured latency is impossible to reach. Could this be
put into a separate patch? This patch is pretty painful to review, so further
splitting would be welcome.
>  
>      pa_log_debug("Loopback overall latency is %0.2f ms + %0.2f ms + %0.2f ms = %0.2f ms (at the base rate: %0.2f ms, old estimate: %0.2f ms)",
> @@ -227,6 +248,8 @@ static void adjust_rates(struct userdata *u) {
>                  (double) latency_difference / PA_USEC_PER_MSEC,
>                  (int32_t)(old_rate - base_rate));
>  
> +    u->source_sink_changed = false;
> +
>      /* Calculate new rate */
>      new_rate = rate_controller(base_rate, u->adjust_time, latency_difference);
>  
> @@ -253,11 +276,14 @@ static void time_callback(pa_mainloop_api *a, pa_time_event *e, const struct tim
>      adjust_rates(u);
>  }
>  
> -/* Called from main context */
> +/* Called from main context
> + * When source or sink changes, give it a third of a second to settle down, then call adjust_rates for the first time */
>  static void enable_adjust_timer(struct userdata *u, bool enable) {
>      if (enable) {
> -        if (u->time_event || u->adjust_time <= 0)
> +        if (!u->adjust_time)
>              return;
> +        if (u->time_event)
> +            u->core->mainloop->time_free(u->time_event);
>  
>          u->time_event = pa_core_rttime_new(u->module->core, pa_rtclock_now() + 333 * PA_USEC_PER_MSEC, time_callback, u);
>      } else {
> @@ -277,29 +303,58 @@ static void update_adjust_timer(struct userdata *u) {
>          enable_adjust_timer(u, true);
>  }
>  
> +static pa_usec_t get_requested_latency(struct userdata *u) {
> +
> +    return PA_MAX(u->configured_sink_latency + u->buffer_latency, u->latency);
> +}
> +
> +/* Called from all contexts */
> +static void memblockq_adjust(struct userdata *u, int32_t offset, bool allow_push) {
> +    size_t memblock_bytes, requested_buffer_bytes;
> +    pa_usec_t requested_buffer_latency;
> +    size_t buffer_offset;
> +    pa_memchunk silence;
> +
> +    requested_buffer_latency = get_requested_latency(u);
> +    if (offset > 0)
> +       requested_buffer_latency = PA_CLIP_SUB(requested_buffer_latency, (pa_usec_t)offset);
> +    else
> +       requested_buffer_latency = requested_buffer_latency - offset;
> +
> +    requested_buffer_bytes = pa_usec_to_bytes(requested_buffer_latency, &u->sink_input->sample_spec);
> +    memblock_bytes = pa_memblockq_get_length(u->memblockq);
> +
> +    /* Drop audio from queue */
> +    if ((int32_t)(memblock_bytes - requested_buffer_bytes) > 0) {
> +       buffer_offset = memblock_bytes - requested_buffer_bytes;
> +       pa_log_info("Dropping %zd bytes from queue", buffer_offset);
> +       pa_memblockq_drop(u->memblockq, buffer_offset);
> +    }
> +    /* Add silence to queue, will never happen from IO-thread */
> +    else if ((int32_t)(memblock_bytes - requested_buffer_bytes) < 0 && allow_push) {
> +       requested_buffer_bytes = requested_buffer_bytes - memblock_bytes;
> +       pa_log_info("Adding %zd bytes of silence to queue", requested_buffer_bytes);
> +       pa_sink_input_get_silence(u->sink_input, &silence);
> +       while (requested_buffer_bytes >= silence.length) {
> +          pa_memblockq_push_align(u->memblockq, &silence);
> +          requested_buffer_bytes -= silence.length;
> +       }
> +       if (requested_buffer_bytes > 0) {
> +          silence.length = requested_buffer_bytes;
> +          pa_memblockq_push_align(u->memblockq, &silence);
> +       }
> +       pa_memblock_unref(silence.memblock);
> +    }
> +}
> +
>  /* Called from input thread context */
>  static void source_output_push_cb(pa_source_output *o, const pa_memchunk *chunk) {
>      struct userdata *u;
> -    pa_memchunk copy;
>  
>      pa_source_output_assert_ref(o);
>      pa_source_output_assert_io_context(o);
>      pa_assert_se(u = o->userdata);
>  
> -    if (u->skip >= chunk->length) {
> -        u->skip -= chunk->length;
> -        return;
> -    }
> -
> -    if (u->skip > 0) {
> -        copy = *chunk;
> -        copy.index += u->skip;
> -        copy.length -= u->skip;
> -        u->skip = 0;
> -
> -        chunk = &copy;
> -    }
> -
>  
>      pa_asyncmsgq_post(u->asyncmsgq, PA_MSGOBJECT(u->sink_input), SINK_INPUT_MESSAGE_POST, NULL, 0, chunk, NULL);
>      u->send_counter += (int64_t) chunk->length;
>  }
> @@ -339,6 +394,33 @@ static int source_output_process_msg_cb(pa_msgobject *obj, int code, void *data,
>      return pa_source_output_process_msg(obj, code, data, offset, chunk);
>  }
>  
> +static void set_source_output_latency(struct userdata *u, pa_source *source) {
> +     pa_usec_t min_latency, max_latency, buffer_msec, latency;
> +
> +    /* Set lower limit of source latency to 2.333 ms */
> +    latency = PA_MAX(u->latency / 3, 2.333 * PA_USEC_PER_MSEC);

Where does that 2.333 ms come from? Defining a constant for the minimum
source latency would be good.

> +
> +    if(source->flags & PA_SOURCE_DYNAMIC_LATENCY) {
> +       pa_source_get_latency_range(source, &min_latency, &max_latency);
> +       if (min_latency > latency) {
> +          u->buffer_latency = PA_MAX(u->buffer_latency, (pa_usec_t)(min_latency * 0.75));
> +          pa_log_warn("Cannot set requested source latency, adjusting buffer to %0.2f ms", (double)u->buffer_latency / PA_USEC_PER_MSEC);
> +       }
> +       latency = PA_CLAMP(latency, min_latency, max_latency);
> +    }
> +    else {
> +       latency = pa_source_get_latency(source);
> +       if (latency == 0)
> +          latency = pa_source_get_fixed_latency(source);
> +       buffer_msec = u->core->default_fragment_size_msec + DEFAULT_BUFFER_MARGIN_MSEC;
> +       if (u->buffer_latency < buffer_msec * PA_USEC_PER_MSEC) {
> +          pa_log_warn("Fixed latency device, setting buffer latency to %zd.00 ms", buffer_msec);
> +          u->buffer_latency = buffer_msec * PA_USEC_PER_MSEC;
> +       }
> +    }
> +    u->configured_source_latency = pa_source_output_set_requested_latency(u->source_output, latency);
> +}
> +
>  /* Called from output thread context */
>  static void source_output_attach_cb(pa_source_output *o) {
>      struct userdata *u;
> @@ -365,24 +447,10 @@ static void source_output_detach_cb(pa_source_output *o) {
>          pa_rtpoll_item_free(u->rtpoll_item_write);
>          u->rtpoll_item_write = NULL;
>      }
> -}
> -
> -/* Called from output thread context */
> -static void source_output_state_change_cb(pa_source_output *o, pa_source_output_state_t state) {
> -    struct userdata *u;
> -
> -    pa_source_output_assert_ref(o);
> -    pa_source_output_assert_io_context(o);
> -    pa_assert_se(u = o->userdata);
> -
> -    if (PA_SOURCE_OUTPUT_IS_LINKED(state) && o->thread_info.state == PA_SOURCE_OUTPUT_INIT) {
> -
> -        u->skip = pa_usec_to_bytes(PA_CLIP_SUB(pa_source_get_latency_within_thread(o->source),
> -                                               u->latency),
> -                                   &o->sample_spec);
> -
> -        pa_log_info("Skipping %lu bytes", (unsigned long) u->skip);
> -    }
> +   u->source_sink_changed = true;
> +   u->source_latency_sum = 0;
> +   u->source_adjust_counter = 0;
> +   u->buffer_latency = u->initial_buffer_latency;
>  }
>  
>  /* Called from main thread */
> @@ -408,7 +476,12 @@ static bool source_output_may_move_to_cb(pa_source_output *o, pa_source *dest) {
>      if (!u->sink_input || !u->sink_input->sink)
>          return true;
>  
> -    return dest != u->sink_input->sink->monitor_source;
> +    /* We may still be adjusting, so reset rate to default before moving the source */
> +    if (dest != u->sink_input->sink->monitor_source) {
> +       pa_sink_input_set_rate(u->sink_input, u->source_output->sample_spec.rate);
> +       return true;
> +    }
> +    return false;
>  }
>  
>  /* Called from main thread */
> @@ -416,6 +489,7 @@ static void source_output_moving_cb(pa_source_output *o, pa_source *dest) {
>      pa_proplist *p;
>      const char *n;
>      struct userdata *u;
> +    pa_usec_t sink_latency;
>  
>      if (!dest)
>          return;
> @@ -433,6 +507,29 @@ static void source_output_moving_cb(pa_source_output *o, pa_source *dest) {
>      pa_sink_input_update_proplist(u->sink_input, PA_UPDATE_REPLACE, p);
>      pa_proplist_free(p);
>  
> +    /* Set latency and calculate necessary buffer length
> +     * In some profile switching situations the sink will be invalid here. If so,
> +     * skip the buffer adjustment, it will be done when the sink becomes valid */
> +    set_source_output_latency(u, dest);
> +

The comment confused me, because at first I thought it referred only to the
set_source_output_latency() call. Adding an empty line between the comment
and the function call should help. The same goes for the sink input side.

I'm going to sleep now. I can continue reviewing this patch tomorrow, or I
can do it when you send v2, if you split this into more manageable pieces
(I'd prefer the latter). Which one do you think is better?

-- 
Tanu