Andreas Mohr wrote:
Hello all, originally I just intended to prepare a patch (to -mm), but it seems better to discuss this first. Currently (2.6.25-rc6) we have: static inline u8 ata_busy_wait(struct ata_port *ap, unsigned int bits, unsigned int max) { u8 status; do { udelay(10); status = ata_chk_status(ap); max--; } while (status != 0xff && (status & bits) && (max > 0)); return status; } This can easily be improved to have merged max handling by doing a single (--max > 0) instead, without any change in behaviour. Building this on i386 showed combined image savings of almost 100 bytes, most likely due to not having to re-fetch max again for the duplicate max access. However we're not finished yet: Since status != 0xff is most certainly a check for the "unplugged hotplug" case, it's a bit wasteful to check this as the very first abort condition. By moving this more towards the end one should be able to improve exit latency of this loop, without altering behaviour here as well (- right??). Since the "(status != 0xff && (status & bits)" part is being used verbatim (ICK, COPY&PASTE!! ;) many times (e.g. also in libata-core.c), IMHO the best way going forward would be to create another fittingly named inline header helper for this combined check which could then have its check order swapped for better exit latency. Those two tweaks alone may already be able to deliver a noticeable speedup of ata operations given that this is frequently used inner libata code.
.. While you're at it, the udelay(10) should really be *much* smaller, or at least broken into a top/bottom pair of udelay(5). I really suspect that much of the time, the status value is satisified on the first iteration, requiring no more than a microsecond or so. Yet we always force it to take at least 10us, or about 15000 instructions worth on a modern CPU. Cheers -- To unsubscribe from this list: send the line "unsubscribe linux-ide" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html