Vitaly Wool wrote:
Hi,
I did some measurement, having two eMMC connected (one of them with a
root file system mounted) and one rather good SD-card with VFAT. The
resume time for the kernel before these patches were around 600 ms.
After my patches I had around 20 ms.
What do you call "resume time" in this case?
Total kernel resume time.
This is an artificial and unfair definition. No one should care about
that. You can think of a time-to-splashscreen or
time-to-functional-userspace and both are not going to be improved
much with your patch if we run off of a eMMC-based root.
This is again not correct. You must not _always_ assume, just because
you have a eMMC-based root, that requests arrives within 3 s after the
kernel has been resumed, this is completely dependent on the active use
cases. Moreover I think you are missing a very valuable thing about how
the resume sequence is being changed with this patch. I will try to
elaborate a bit more.
Before this patch:
_Every_ MMC and SD card were resumed sequentially. Thus _every_ resume
for _every_ card adds up to the resume time _always_.
After this patch:
1. MMC and SD cards can be is resumed in parallel since the resume for
those cards are deferred to be handled in a work instead.
2. Only those usecases which sends request on MMC and SD cards within ~3
s after the kernel has been resumed, will notice a resume time for that
particular card and of course only for the first request after a resume.
Requests triggered after the ~3 s will be performed on a already resumed
host.
Moreover, I noticed very seldom than any mmc/sd request arrived
within the time were the deferred resumed has not been done. Of
course this will very much depend on what kind of userspace
application that is running and if there is an ongoing file transfer
that were frozen when doing suspend.
...or the wakeup source was the userspace alarm etc. etc.
But, if this happens (deferred resume not done), the total resume
time for that particular userspace application will anyway be heavily
decreased since the other hosts resume time did not affect the
resume time for this application.
I take that by "other hosts" you mean SD card? :)
"Other hosts", are all those hosts holding an eMMC/MMC or an SD-card,
but not that host that there were a request for, before the deferred
resume has finalized.
Ok, what if a rootfs application to be started first has to re-read
say certificates from the SD card? And what if not doing that in time
means QoS degradation?
No, you don't see all the _real_ use cases that you can break with
your patch.
I understand that potentially some userspace application might be
affected. I did not see this a any critical problem which you definitely
pointed out that it might be. But the reason to why I took this approach
is simply because I think it is more a matter of in what context you are
"waiting".
Earlier a userspace application could not even execute until all mmc and
sd cards were resumed and thus waiting a long time for the kernel to be
resumed. After this patch a userspace application will be able to
execute much earlier but could in worst case have to wait for a SD or
MMC card to be resumed when it need access to it.
I think this problem for these userspace applications, should be
possible to solve within each application instead. Do you not think that
should be possible?
If you feel that dot 2 (see above) is kind of what makes this patch more
problematic for you to accept, what do you think about removing the 3 s
resume delay and instead schedule the resume work immediately?
-be async (e. g. start card resume process in resume routine, set
atomic, return success and have wait_event_interruptible_timeout in
block_rq if this atomic is set).
Don't follow you. This is exactly what the patch is doing. Not just
for SD, but also for (e)MMC. I don't see your issue.
No it doesn't. It defers the execution by arbitrary chosen time of 3
seconds.
Execution? Why?
Do you mean that we should implement this for SD cards as well?
Anyway, I don't understand what this should prevent a resume from
being executed for SD/SDIO/(e)MMC at all?
Please elaborate.
I'm not going to, at least not in this thread. The method you propose
is a hack and it can not be accepted at least because it doesn't give
a damn about QoS. The whole idea of "let's unconditionally defer
resume of something for arbitrary amount of time no matter what" is
invalid.
As suggested above. We could easily remove the timer for the scheduled
work. The idea as such will not change for this patch but userspace will
be less affected.
It would have been good to really understand more about the idea you
have of how to solve the long kernel resume time for mmc/sd/sdio with
QoS. If you decide to send out a patch of rfc please keep me on the cc-list.
Thanks, Vitaly
Br
Ulf Hansson
--
To unsubscribe from this list: send the line "unsubscribe linux-mmc" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html