Hello All, I've just got hit by this again - lockups on s390. Looks like I have 100% reproducer (just try to build Erlang and it will stuck eventually). * https://koji.fedoraproject.org/koji/taskinfo?taskID=37327589 Where should I open a ticket? Bugzilla.redhat.com or somewhere else? чт, 25 июл. 2019 г. в 17:44, Kevin Fenzi <kevin@xxxxxxxxx>: > > On 7/25/19 3:04 AM, Peter Lemenkov wrote: > > Hello All! > > It started to get stuck again. Right now I'm experiencing this issue > > with RabbitMQ for F-30 and F-31: > > > > * https://koji.fedoraproject.org/koji/taskinfo?taskID=36457376 > > * https://koji.fedoraproject.org/koji/taskinfo?taskID=36457345 > > So, yeah. > > > | |-kojid,31279 /usr/sbin/kojid --fg --force-lock --verbose > > | | `-mock,31584 -tt /usr/libexec/mock/mock -r koji/f30-build-16961487-1222718 --old-chroot --no-clean --target s390x ... > > | | `-rpmbuild,32205 -bb --target s390x --nodeps /builddir/build/SPECS/rabbitmq-server.spec > > | | `-sh,32237 -e /var/tmp/rpm-tmp.GwzEQt > > | | `-make,32238 -j4 VERSION=3.7.16 V=1 > > | | `-sh,32318 -c... > > | | `-make,2112 -C /builddir/build/BUILD/rabbitmq-server-3.7.16/deps/amqp10_client IS_DEP=1 > > | | `-make,2237 --no-print-directory app-build > > | | `-beam.smp,2302 -sbtu -A0 -- -root /usr/lib64/erlang -progname erl -- -home /builddir -- ... > > | | |-{beam.smp},2303 > > | | |-{beam.smp},2304 > > | | |-erl_child_setup,2305 1024 > > | | |-{beam.smp},2306 > > | | |-{beam.smp},2307 > > | | |-{beam.smp},2308 > > | | |-{beam.smp},2309 > > | | |-{beam.smp},2310 > > | | |-{beam.smp},2311 > > | | |-{beam.smp},2312 > > | | |-{beam.smp},2313 > > | | |-{beam.smp},2314 > > | | |-{beam.smp},2315 > > | | |-{beam.smp},2316 > > | | |-{beam.smp},2317 > > | | |-{beam.smp},2318 > > | | |-{beam.smp},2319 > > | | |-{beam.smp},2320 > > | | |-{beam.smp},2321 > > | | |-{beam.smp},2322 > > | | |-{beam.smp},2323 > > | | |-{beam.smp},2324 > > | | `-{beam.smp},2325 > > When I strace the 2302 process: > > strace: Process 2302 attached with 23 threads > [pid 2324] ppoll([{fd=12, events=POLLIN|POLLRDNORM}], 1, NULL, NULL, 8 > <unfinished ...> > [pid 2320] futex(0x3ff58800550, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2321] futex(0x3ff58800590, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2319] futex(0x3ff58800510, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2318] futex(0x3ff588004d0, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2317] futex(0x3ff58800490, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2316] futex(0x3ff58800450, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2315] futex(0x3ff58800410, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2313] futex(0x3ff58800390, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2312] futex(0x3ff58800350, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2308] restart_syscall(<... resuming interrupted > syscall_0xfffffffffffffdfc ...> <unfinish > ed ...> > [pid 2303] read(14, <unfinished ...> > [pid 2302] select(0, NULL, NULL, NULL, NULL <unfinished ...> > [pid 2309] restart_syscall(<... resuming interrupted select ...> > <unfinished ...> > [pid 2323] futex(0x3ff58800610, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2322] futex(0x3ff588005d0, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2314] futex(0x3ff588003d0, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2311] futex(0x3ff58800310, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2310] futex(0x3ff588002d0, FUTEX_WAIT_PRIVATE, 4294967295, NULL > <unfinished ...> > [pid 2306] restart_syscall(<... resuming interrupted > syscall_0xfffffffffffffdfc ...> <unfinish > ed ...> > [pid 2304] futex(0x2aa3d9af520, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished > ...> > [pid 2307] timerfd_settime(11, 0, {it_interval={tv_sec=0, tv_nsec=0}, > it_value={tv_sec=0, tv_n > sec=0}}, NULL) = 0 > [pid 2325] epoll_wait(4, <unfinished ...> > [pid 2307] futex(0x3ff588001d0, FUTEX_WAKE_PRIVATE, 1) = 1 > [pid 2306] <... restart_syscall resumed>) = 0 > [pid 2307] fcntl(2, F_GETFL <unfinished ...> > [pid 2306] timerfd_settime(11, 0, {it_interval={tv_sec=0, tv_nsec=0}, > it_value={tv_sec=23, tv_ > nsec=941692107}}, <unfinished ...> > > and then... it starts going again. So something was stuck and strace > unstuck it? > > So, it looks like some odd signal thing with s390x? > > Not sure, but perhaps we should file a bug and try and track it more? > > kevin > > > _______________________________________________ > devel mailing list -- devel@xxxxxxxxxxxxxxxxxxxxxxx > To unsubscribe send an email to devel-leave@xxxxxxxxxxxxxxxxxxxxxxx > Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ > List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines > List Archives: https://lists.fedoraproject.org/archives/list/devel@xxxxxxxxxxxxxxxxxxxxxxx -- With best regards, Peter Lemenkov. _______________________________________________ devel mailing list -- devel@xxxxxxxxxxxxxxxxxxxxxxx To unsubscribe send an email to devel-leave@xxxxxxxxxxxxxxxxxxxxxxx Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/devel@xxxxxxxxxxxxxxxxxxxxxxx