Hi, On 2018-07-23 15:23:04 -0500, Dimitri Maziuk wrote: > I'm playing with logical replication in postgres-10 and I got it to hang > on startup with > > > 2018-07-23 13:15:06.577 CDT [18624] FATAL: the database system is starting up > > 2018-07-23 13:15:06.577 CDT [18624] DEBUG: shmem_exit(1): 0 before_shmem_exit callbacks to make > > 2018-07-23 13:15:06.577 CDT [18624] DEBUG: shmem_exit(1): 0 on_shmem_exit callbacks to make > > 2018-07-23 13:15:06.577 CDT [18624] DEBUG: proc_exit(1): 1 callbacks to make > > 2018-07-23 13:15:06.577 CDT [18624] DEBUG: exit(1) > > 2018-07-23 13:15:06.577 CDT [18624] DEBUG: shmem_exit(-1): 0 before_shmem_exit callbacks to make > > 2018-07-23 13:15:06.577 CDT [18624] DEBUG: shmem_exit(-1): 0 on_shmem_exit callbacks to make > > 2018-07-23 13:15:06.577 CDT [18624] DEBUG: proc_exit(-1): 0 callbacks to make > ... forever ... What precedes that "loop"? Could you describe what exactly you did into that situation? > After poking around: > > # find /var/lib/pgsql/10/data/pg_logical/snapshots -printf . | wc -c > 12727975 Ugh, something absolutely clearly isn't right here. > It looks like ZFS simply can't handle this many files: `ls` etc. hang > forever. ls probably works in general, it's just the sorting that makes it hang. Should work with -f or such. Greetings, Andres Freund