tytso@xxxxxxx writes: >> CFQ currently seems to be preempting any thread doing IO if a request has >> been marked as metadata. I think this is going to be bad for any other IO >> going on. >> >> I wrote a small fio script which is doing buffered writes with bs=32K and I >> am doing fsync on file after every 20 IOs (fsync=20). I am assuming that this >> something close to writting a small file and then doing fsync on that. >> >> With that fio script running I launched firefox and measured the >> time it takes..... it looks like that firefox launching times have >> seems to just almost doubled. > > Vivek, thanks for pointing this out. Sounds like we need to think > carefully about whether the potential unfairness that this patch might > impose on other workloads sharing the file system dominates the > improvements that Jeff found when there's only a single workload > running on the file system. > > I'm tentatively leaning towards pulling this patch so we can do more > testing / benchmarking. Jeff, any thoughts or comments? Yeah, pull it. I just talked this over with Vivek, and looking back at the blktrace data I think I have another idea that might work and be less invasive. Basically, the iozone process is, umm, special. It does the following at startup (from memory, so I might've missed a step): open(fd); truncate(fd,0); close(fd) open(fd); fsync(fd); *then* it does I/O. So, we get a sync cfqq for the iozone process that ends up doing the metadata lookups. Then it does the truncate and of course blocks. At this point, we are waiting for jbd2 to run to sync out its transaction. However, we're idling on the iozone cfqq, so it doesn't get a chance to run for 8ms. If we instead just pass a hint down to the I/O scheduler on fsync, similar to the schedule() call for the cpu scheduler, then I think we can give up our time slice and allow the jbd2 process to run. I'll report back with my findings. Vivek, thanks a ton for the careful (as always) review. Cheers, Jeff -- To unsubscribe from this list: send the line "unsubscribe linux-ext4" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html