Hi, On 2021-07-26 10:19:11 -0400, Theodore Ts'o wrote: > On Sat, Jul 24, 2021 at 02:44:13PM -0700, Andres Freund wrote: > > The phoronix test uses postgres with only one relevant setting adjusted > > (increasing the max connection count). That will end up using a buffer pool of > > 128MB, no huge pages, and importantly is configured to aim for not more than > > 1GB for postgres' journal, which will lead to constant checkpointing. The test > > also only runs for 15 seconds, which likely isn't even enough to "warm up" > > (the creation of the data set here will take longer than the run). > > > > Given that the dataset phoronix is using is about ~16GB of data (excluding > > WAL), and uses 256 concurrent clients running full tilt, using that limited > > postgres settings doesn't end up measuring something particularly interesting > > in my opinion. > I tend to use the phoronix test suite for my performance runs when > testing ext4 changes simply because it's convenient. Can you suggest > a better set configuration settings that I should perhaps use that > might give more "real world" numbers that you would find more > significant? It depends a bit on what you want to test, obviously... At the very least you should 'max_wal_size = 32GB' or such (it'll only use that much if enough WAL is generated within checkpoint timeout, which defaults to 5min). And unfortunately you're not going to get meaningful performance results for a read/write test within 10s, you need to run at least ~11min (so two checkpoints happen). With the default shared_buffers setting of 128MB you are going to simulate a much-larger-than-postgres's-memory workload, albeit one where the page cache *is* big enough on most current machines, unless you limit the size of the page cache considerably. Doing so can be useful to approximate a workload that would take much longer to initialize due to the size. I suggest *not* disabling autovacuum as currently done for performance testing - it's not something many real-world setups can afford to do, so benchmarking FS performance with it disabled doesn't seem like a good idea. FWIW, depending on what kind of thing you want to test, it'd not be hard to come up with a test that less time to initialize. E.g. an insert-only workload without an initial dataset or such. As long as you *do* initialize 16GB of data, I think it'd make sense to measure the time that takes. There's definitely been filesystem level performance changes of that, and it's often going to be more IO intensive. Greetings, Andres Freund