On 8/28/19 2:13 PM, Peter Lemenkov wrote: > Hello All, > > I've just got hit by this again - lockups on s390. Looks like I have > 100% reproducer (just try to build Erlang and it will stuck > eventually). > > * https://koji.fedoraproject.org/koji/taskinfo?taskID=37327589 > > Where should I open a ticket? Bugzilla.redhat.com or somewhere else? I'm really not fully sure. ;( Perhaps a kernel bug and get the s390x folks involved? kevin -- > > чт, 25 июл. 2019 г. в 17:44, Kevin Fenzi <kevin@xxxxxxxxx>: >> >> On 7/25/19 3:04 AM, Peter Lemenkov wrote: >>> Hello All! >>> It started to get stuck again. Right now I'm experiencing this issue >>> with RabbitMQ for F-30 and F-31: >>> >>> * https://koji.fedoraproject.org/koji/taskinfo?taskID=36457376 >>> * https://koji.fedoraproject.org/koji/taskinfo?taskID=36457345 >> >> So, yeah. >> >>> | |-kojid,31279 /usr/sbin/kojid --fg --force-lock --verbose >>> | | `-mock,31584 -tt /usr/libexec/mock/mock -r koji/f30-build-16961487-1222718 --old-chroot --no-clean --target s390x ... >>> | | `-rpmbuild,32205 -bb --target s390x --nodeps /builddir/build/SPECS/rabbitmq-server.spec >>> | | `-sh,32237 -e /var/tmp/rpm-tmp.GwzEQt >>> | | `-make,32238 -j4 VERSION=3.7.16 V=1 >>> | | `-sh,32318 -c... >>> | | `-make,2112 -C /builddir/build/BUILD/rabbitmq-server-3.7.16/deps/amqp10_client IS_DEP=1 >>> | | `-make,2237 --no-print-directory app-build >>> | | `-beam.smp,2302 -sbtu -A0 -- -root /usr/lib64/erlang -progname erl -- -home /builddir -- ... >>> | | |-{beam.smp},2303 >>> | | |-{beam.smp},2304 >>> | | |-erl_child_setup,2305 1024 >>> | | |-{beam.smp},2306 >>> | | |-{beam.smp},2307 >>> | | |-{beam.smp},2308 >>> | | |-{beam.smp},2309 >>> | | |-{beam.smp},2310 >>> | | |-{beam.smp},2311 >>> | | |-{beam.smp},2312 >>> | | |-{beam.smp},2313 >>> | | |-{beam.smp},2314 >>> | | |-{beam.smp},2315 >>> | | |-{beam.smp},2316 >>> | | |-{beam.smp},2317 >>> | | |-{beam.smp},2318 >>> | | |-{beam.smp},2319 >>> | | |-{beam.smp},2320 >>> | | |-{beam.smp},2321 >>> | | |-{beam.smp},2322 >>> | | |-{beam.smp},2323 >>> | | |-{beam.smp},2324 >>> | | `-{beam.smp},2325 >> >> When I strace the 2302 process: >> >> strace: Process 2302 attached with 23 threads >> [pid 2324] ppoll([{fd=12, events=POLLIN|POLLRDNORM}], 1, NULL, NULL, 8 >> <unfinished ...> >> [pid 2320] futex(0x3ff58800550, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2321] futex(0x3ff58800590, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2319] futex(0x3ff58800510, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2318] futex(0x3ff588004d0, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2317] futex(0x3ff58800490, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2316] futex(0x3ff58800450, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2315] futex(0x3ff58800410, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2313] futex(0x3ff58800390, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2312] futex(0x3ff58800350, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2308] restart_syscall(<... resuming interrupted >> syscall_0xfffffffffffffdfc ...> <unfinish >> ed ...> >> [pid 2303] read(14, <unfinished ...> >> [pid 2302] select(0, NULL, NULL, NULL, NULL <unfinished ...> >> [pid 2309] restart_syscall(<... resuming interrupted select ...> >> <unfinished ...> >> [pid 2323] futex(0x3ff58800610, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2322] futex(0x3ff588005d0, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2314] futex(0x3ff588003d0, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2311] futex(0x3ff58800310, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2310] futex(0x3ff588002d0, FUTEX_WAIT_PRIVATE, 4294967295, NULL >> <unfinished ...> >> [pid 2306] restart_syscall(<... resuming interrupted >> syscall_0xfffffffffffffdfc ...> <unfinish >> ed ...> >> [pid 2304] futex(0x2aa3d9af520, FUTEX_WAIT_PRIVATE, 0, NULL <unfinished >> ...> >> [pid 2307] timerfd_settime(11, 0, {it_interval={tv_sec=0, tv_nsec=0}, >> it_value={tv_sec=0, tv_n >> sec=0}}, NULL) = 0 >> [pid 2325] epoll_wait(4, <unfinished ...> >> [pid 2307] futex(0x3ff588001d0, FUTEX_WAKE_PRIVATE, 1) = 1 >> [pid 2306] <... restart_syscall resumed>) = 0 >> [pid 2307] fcntl(2, F_GETFL <unfinished ...> >> [pid 2306] timerfd_settime(11, 0, {it_interval={tv_sec=0, tv_nsec=0}, >> it_value={tv_sec=23, tv_ >> nsec=941692107}}, <unfinished ...> >> >> and then... it starts going again. So something was stuck and strace >> unstuck it? >> >> So, it looks like some odd signal thing with s390x? >> >> Not sure, but perhaps we should file a bug and try and track it more? >> >> kevin >> >> >> _______________________________________________ >> devel mailing list -- devel@xxxxxxxxxxxxxxxxxxxxxxx >> To unsubscribe send an email to devel-leave@xxxxxxxxxxxxxxxxxxxxxxx >> Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >> List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines >> List Archives: https://lists.fedoraproject.org/archives/list/devel@xxxxxxxxxxxxxxxxxxxxxxx > > >
Attachment:
signature.asc
Description: OpenPGP digital signature
_______________________________________________ devel mailing list -- devel@xxxxxxxxxxxxxxxxxxxxxxx To unsubscribe send an email to devel-leave@xxxxxxxxxxxxxxxxxxxxxxx Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/devel@xxxxxxxxxxxxxxxxxxxxxxx