Hi, Ihave following situation: amazon, but not RDS - just plain EC2, with ubuntu system. zone reclaiming is disabled (at least in my system, no idea about host). and there is no "transparent huge page support". today we had at least 3 cases where bunch of abckends (~40) gets "interlocked". Some of them are shown as "waiting" in pg_stat_activity, but not all. simple "insert into table ()" - without any triggers - gets killed after 10 minutes with no visible progress. From what I saw - most of the backends are "locked" (though it's not visible in pg_stat_actrivity) on the same table. Did take coredump from one of the backends, but there are no symbols, so the backtrace is just: root@xxxxxxxxxx:/mnt# gdb --batch --quiet -ex "thread apply all bt full" -ex "quit" /usr/lib/postgresql/9.1/bin/postgres pg.core.21422 [New LWP 21422] [Thread debugging using libthread_db enabled] Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1". Core was generated by `postgres'. #0 0x00007ffa60da2dc7 in semop () from /lib/x86_64-linux-gnu/libc.so.6 Thread 1 (LWP 21422): #0 0x00007ffa60da2dc7 in semop () from /lib/x86_64-linux-gnu/libc.so.6 No symbol table info available. #1 0x00000000005f65e8 in PGSemaphoreLock () No symbol table info available. #2 0x0000000000636125 in LWLockAcquire () No symbol table info available. #3 0x0000000000630f91 in LockAcquireExtended () No symbol table info available. #4 0x000000000062f88c in LockRelationOid () No symbol table info available. #5 0x0000000000470f6d in relation_open () No symbol table info available. #6 0x000000000047b013 in index_open () No symbol table info available. #7 0x000000000057bb4c in ExecOpenIndices () No symbol table info available. #8 0x00000000005894c8 in ExecInitModifyTable () No symbol table info available. #9 0x000000000057266a in ExecInitNode () No symbol table info available. #10 0x0000000000570e4a in standard_ExecutorStart () No symbol table info available. #11 0x0000000000593406 in ?? () No symbol table info available. #12 0x0000000000593947 in SPI_execute_plan_with_paramlist () No symbol table info available. #13 0x00007ff8c34c8aed in ?? () from /usr/lib/postgresql/9.1/lib/plpgsql.so No symbol table info available. #14 0x00007ff8c34c9716 in ?? () from /usr/lib/postgresql/9.1/lib/plpgsql.so No symbol table info available. #15 0x00007ff8c34ca252 in ?? () from /usr/lib/postgresql/9.1/lib/plpgsql.so No symbol table info available. #16 0x00007ff8c34ca252 in ?? () from /usr/lib/postgresql/9.1/lib/plpgsql.so No symbol table info available. #17 0x00007ff8c34cc9dc in ?? () from /usr/lib/postgresql/9.1/lib/plpgsql.so No symbol table info available. #18 0x00007ff8c34ccf01 in plpgsql_exec_trigger () from /usr/lib/postgresql/9.1/lib/plpgsql.so No symbol table info available. #19 0x00007ff8c34c214a in plpgsql_call_handler () from /usr/lib/postgresql/9.1/lib/plpgsql.so No symbol table info available. #20 0x0000000000709d67 in ?? () No symbol table info available. #21 0x0000000000555fda in ?? () No symbol table info available. #22 0x000000000055668f in ?? () No symbol table info available. #23 0x000000000055ba6a in AfterTriggerEndQuery () No symbol table info available. #24 0x000000000056febf in standard_ExecutorFinish () No symbol table info available. #25 0x0000000000645a2a in ?? () No symbol table info available. #26 0x0000000000645c13 in ?? () No symbol table info available. #27 0x0000000000646962 in PortalRun () No symbol table info available. #28 0x000000000064274a in PostgresMain () No symbol table info available. #29 0x0000000000604443 in ?? () No symbol table info available. #30 0x0000000000604eb1 in PostmasterMain () No symbol table info available. #31 0x000000000045a720 in main () No symbol table info available. What could that be, and how to get rid of the problem? Best regards, depesz -- The best thing about modern society is how easy it is to avoid contact with it. http://depesz.com/
Attachment:
signature.asc
Description: Digital signature