Re: ❌ FAIL: Stable queue: queue-5.3

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 11/11/19 3:10 PM, Rachel Sibley wrote:
See https://lore.kernel.org/stable/120fa5fd-a82f-443e-156e-a9a11937a9ca@xxxxxxxxxx/
for selinux-policy failures, there was also a panic on the mustang board (aarch64) when
installing the dependencies for thinp sanity test:
https://artifacts.cki-project.org/pipelines/278447/logs/aarch64_host_2_console.log

[ 1168.041811] restraintd[838]: ** Fetching task: 101902649 [/mnt/tests/github.com/CKI-project/tests-beaker/archive/master.zip/storage/lvm/thinp/sanity]
...
[ 1265.060278] Unable to handle kernel write to read-only memory at virtual address 0000300000000020
[ 1265.069118] Mem abort info:
[ 1265.071897]   ESR = 0x96000044
[ 1265.074936]   Exception class = DABT (current EL), IL = 32 bits
[ 1265.080825]   SET = 0, FnV = 0
[ 1265.083862]   EA = 0, S1PTW = 0
[ 1265.086985] Data abort info:
[ 1265.089849]   ISV = 0, ISS = 0x00000044
[ 1265.093663]   CM = 0, WnR = 1
[ 1265.096614] user pgtable: 4k pages, 48-bit VAs, pgdp=00000041e6436000
[ 1265.103023] [0000300000000020] pgd=0000000000000000
[ 1265.107878] Internal error: Oops: 96000044 [#1] SMP
[ 1265.112730] Modules linked in: sctp sunrpc vfat fat xgene_enet at803x mdio_xgene xgene_hwmon xgene_edac xgene_rng mailbox_xgene_slimpro crct10dif_ce ip_tables xfs libcrc32c sdhci_of_arasan sdhci_pltfm i2c_xgene_slimpro sdhci cqhci gpio_dwapb gpio_xgene_sb xhci_plat_hcd gpio_keys
[ 1265.137307] CPU: 3 PID: 454 Comm: kworker/3:1H Not tainted 5.3.10-b260a08.cki #1
[ 1265.144666] Hardware name: AppliedMicro X-Gene Mustang Board/X-Gene Mustang Board, BIOS 3.06.25 Oct 17 2016
[ 1265.154366] Workqueue: kblockd blk_mq_run_work_fn
[ 1265.159048] pstate: 20400085 (nzCv daIf +PAN -UAO)
[ 1265.163816] pc : rb_erase+0x9c/0x3a8
[ 1265.167373] lr : bfq_idle_extract+0x58/0xc0
[ 1265.171534] sp : ffff000011d4bb60
[ 1265.174830] x29: ffff000011d4bb60 x28: ffff8001ef4ac100
[ 1265.180115] x27: ffff00001174dae8 x26: ffff000011727000
[ 1265.185399] x25: ffff8001fcface00 x24: 0000000000000030
[ 1265.190684] x23: ffff8001fcfaec00 x22: 0000000000000008
[ 1265.195968] x21: ffff8001fae05d50 x20: ffff8001b56b3010
[ 1265.201253] x19: ffff8001b56b3098 x18: 00000000fffffffa
[ 1265.206537] x17: 0000000000000000 x16: 0000000000000000
[ 1265.211821] x15: 0000000000000001 x14: ffffffffffffffff
[ 1265.217106] x13: ffff000011d4bc20 x12: ffff000011d4bc14
[ 1265.222390] x11: ffff000010f3f6d0 x10: ffff000011d4bbc0
[ 1265.227674] x9 : 00000000ffffffd8 x8 : 000000000000359a
[ 1265.232959] x7 : 0000000000000018 x6 : 0000000000000004
[ 1265.238243] x5 : 0000000000000000 x4 : 0000300000000020
[ 1265.243527] x3 : ffff000011d4bc19 x2 : 0000000000000000
[ 1265.248812] x1 : ffff8001fae05d58 x0 : ffff8001b56b3098
[ 1265.254096] Call trace:
[ 1265.256530]  rb_erase+0x9c/0x3a8
[ 1265.259741]  bfq_put_idle_entity+0x28/0x50
[ 1265.263816]  bfq_forget_idle+0x74/0x80
[ 1265.267545]  bfq_bfqq_served+0xb4/0x180
[ 1265.271361]  bfq_dispatch_request+0x184/0x6d8
[ 1265.275696]  blk_mq_do_dispatch_sched+0xc4/0x108
[ 1265.280290]  blk_mq_sched_dispatch_requests+0x114/0x190
[ 1265.285487]  __blk_mq_run_hw_queue+0x9c/0x128
[ 1265.289821]  blk_mq_run_work_fn+0x28/0x38
[ 1265.293810]  process_one_work+0x1bc/0x3e8
[ 1265.297798]  worker_thread+0x54/0x440
[ 1265.301441]  kthread+0x104/0x130
[ 1265.304653]  ret_from_fork+0x10/0x18
[ 1265.308211] Code: b2400042 f90000e2 d65f03c0 f9400002 (f9000082)
[ 1265.314275] ---[ end trace 3c2a926246d30ab0 ]---


The panic looks like another issue we've been tracking in Fedora
https://bugzilla.redhat.com/show_bug.cgi?id=1767539
The bfq maintainer has given a possible fix.

On 11/11/19 10:36 AM, CKI Project wrote:
Hello,

We ran automated tests on a patchset that was proposed for merging into this
kernel tree. The patches were applied to:

        Kernel repo: https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
             Commit: 81584694bb70 - Linux 5.3.10

The results of these automated tests are provided below.

     Overall result: FAILED (see details below)
              Merge: OK
            Compile: OK
              Tests: FAILED

All kernel binaries, config files, and logs are available for download here:

   https://artifacts.cki-project.org/pipelines/278447

One or more kernel tests failed:

     ppc64le:
      ❌ selinux-policy: serge-testsuite

     aarch64:
      ❌ selinux-policy: serge-testsuite
      ❌ lvm thinp sanity

     x86_64:
      ❌ selinux-policy: serge-testsuite

We hope that these logs can help you find the problem quickly. For the full
detail on our testing procedures, please scroll to the bottom of this message.

Please reply to this email if you have any questions about the tests that we
ran or if you have any suggestions on how to make future tests more effective.

         ,-.   ,-.
        ( C ) ( K )  Continuous
         `-',-.`-'   Kernel
           ( I )     Integration
            `-'
______________________________________________________________________________

Merge testing
-------------

We cloned this repository and checked out the following commit:

   Repo: https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
   Commit: 81584694bb70 - Linux 5.3.10


We grabbed the 92daf91c1099 commit of the stable queue repository.

We then merged the patchset with `git am`:

   bonding-fix-state-transition-issue-in-link-monitoring.patch
   cdc-ncm-handle-incomplete-transfer-of-mtu.patch
   ipv4-fix-table-id-reference-in-fib_sync_down_addr.patch
   net-ethernet-octeon_mgmt-account-for-second-possible-vlan-header.patch
   net-fix-data-race-in-neigh_event_send.patch
   net-qualcomm-rmnet-fix-potential-uaf-when-unregistering.patch
   net-tls-fix-sk_msg-trim-on-fallback-to-copy-mode.patch
   net-usb-qmi_wwan-add-support-for-dw5821e-with-esim-support.patch
   nfc-fdp-fix-incorrect-free-object.patch
   nfc-netlink-fix-double-device-reference-drop.patch
   nfc-st21nfca-fix-double-free.patch
   qede-fix-null-pointer-deref-in-__qede_remove.patch
   net-mscc-ocelot-don-t-handle-netdev-events-for-other-netdevs.patch
   net-mscc-ocelot-fix-null-pointer-on-lag-slave-removal.patch
   net-tls-don-t-pay-attention-to-sk_write_pending-when-pushing-partial-records.patch
   net-tls-add-a-tx-lock.patch
   selftests-tls-add-test-for-concurrent-recv-and-send.patch
   ipv6-fixes-rt6_probe-and-fib6_nh-last_probe-init.patch
   net-hns-fix-the-stray-netpoll-locks-causing-deadlock-in-napi-path.patch
   net-prevent-load-store-tearing-on-sk-sk_stamp.patch
   net-sched-prevent-duplicate-flower-rules-from-tcf_proto-destroy-race.patch
   net-smc-fix-ethernet-interface-refcounting.patch
   vsock-virtio-fix-sock-refcnt-holding-during-the-shutdown.patch
   r8169-fix-page-read-in-r8168g_mdio_read.patch
   alsa-timer-fix-incorrectly-assigned-timer-instance.patch
   alsa-bebob-fix-to-detect-configured-source-of-sampling-clock-for-focusrite-saffire-pro-i-o-series.patch
   alsa-hda-ca0132-fix-possible-workqueue-stall.patch
   mm-memcontrol-fix-null-ptr-deref-in-percpu-stats-flush.patch
   mm-memcontrol-fix-network-errors-from-failing-__gfp_atomic-charges.patch
   mm-meminit-recalculate-pcpu-batch-and-high-limits-after-init-completes.patch
   mm-thp-handle-page-cache-thp-correctly-in-pagetranscompoundmap.patch
   mm-vmstat-hide-proc-pagetypeinfo-from-normal-users.patch
   dump_stack-avoid-the-livelock-of-the-dump_lock.patch
   mm-slab-make-page_cgroup_ino-to-recognize-non-compound-slab-pages-properly.patch
   btrfs-consider-system-chunk-array-size-for-new-system-chunks.patch
   btrfs-tree-checker-fix-wrong-check-on-max-devid.patch
   btrfs-save-i_size-to-avoid-double-evaluation-of-i_size_read-in-compress_file_range.patch
   tools-gpio-use-building_out_of_srctree-to-determine-srctree.patch
   pinctrl-intel-avoid-potential-glitches-if-pin-is-in-gpio-mode.patch
   perf-tools-fix-time-sorting.patch
   perf-map-use-zalloc-for-map_groups.patch
   drm-radeon-fix-si_enable_smc_cac-failed-issue.patch
   hid-wacom-generic-treat-serial-number-and-related-fields-as-unsigned.patch
   mm-khugepaged-fix-might_sleep-warn-with-config_highpte-y.patch
   soundwire-depend-on-acpi.patch
   soundwire-depend-on-acpi-of.patch
   soundwire-bus-set-initial-value-to-port_status.patch
   blkcg-make-blkcg_print_stat-print-stats-only-for-online-blkgs.patch
   arm64-do-not-mask-out-pte_rdonly-in-pte_same.patch
   asoc-rsnd-dma-fix-ssi9-4-5-6-7-busif-dma-address.patch
   ceph-fix-use-after-free-in-__ceph_remove_cap.patch
   ceph-fix-rcu-case-handling-in-ceph_d_revalidate.patch
   ceph-add-missing-check-in-d_revalidate-snapdir-handling.patch
   ceph-don-t-try-to-handle-hashed-dentries-in-non-o_creat-atomic_open.patch
   ceph-don-t-allow-copy_file_range-when-stripe_count-1.patch
   iio-adc-stm32-adc-fix-stopping-dma.patch
   iio-imu-adis16480-make-sure-provided-frequency-is-positive.patch
   iio-imu-inv_mpu6050-fix-no-data-on-mpu6050.patch
   iio-srf04-fix-wrong-limitation-in-distance-measuring.patch
   arm-sunxi-fix-cpu-powerdown-on-a83t.patch
   arm-dts-imx6-logicpd-re-enable-snvs-power-key.patch
   cpufreq-intel_pstate-fix-invalid-epb-setting.patch
   clone3-validate-stack-arguments.patch
   netfilter-nf_tables-align-nft_expr-private-data-to-64-bit.patch
   netfilter-ipset-fix-an-error-code-in-ip_set_sockfn_get.patch
   intel_th-gth-fix-the-window-switching-sequence.patch
   intel_th-pci-add-comet-lake-pch-support.patch
   intel_th-pci-add-jasper-lake-pch-support.patch
   x86-dumpstack-64-don-t-evaluate-exception-stacks-before-setup.patch
   x86-apic-32-avoid-bogus-ldr-warnings.patch
   smb3-fix-persistent-handles-reconnect.patch
   can-usb_8dev-fix-use-after-free-on-disconnect.patch
   can-flexcan-disable-completely-the-ecc-mechanism.patch
   can-c_can-c_can_poll-only-read-status-register-after-status-irq.patch
   can-peak_usb-fix-a-potential-out-of-sync-while-decoding-packets.patch
   can-rx-offload-can_rx_offload_queue_sorted-fix-error-handling-avoid-skb-mem-leak.patch
   can-gs_usb-gs_can_open-prevent-memory-leak.patch
   can-dev-add-missing-of_node_put-after-calling-of_get_child_by_name.patch
   can-mcba_usb-fix-use-after-free-on-disconnect.patch
   can-peak_usb-fix-slab-info-leak.patch
   configfs-fix-a-deadlock-in-configfs_symlink.patch

Compile testing
---------------

We compiled the kernel for 3 architectures:

     aarch64:
       make options: -j30 INSTALL_MOD_STRIP=1 targz-pkg

     ppc64le:
       make options: -j30 INSTALL_MOD_STRIP=1 targz-pkg

     x86_64:
       make options: -j30 INSTALL_MOD_STRIP=1 targz-pkg


Hardware testing
----------------
We booted each kernel and ran the following tests:

   aarch64:
     Host 1:
        ✅ Boot test
        ✅ Podman system integration test (as root)
        ✅ Podman system integration test (as user)
        ✅ LTP lite
        ✅ Loopdev Sanity
        ✅ jvm test suite
        ✅ AMTU (Abstract Machine Test Utility)
        ✅ LTP: openposix test suite
        ✅ Ethernet drivers sanity
        ✅ Networking socket: fuzz
        ✅ Networking route: pmtu
        ✅ Networking route_func: local
        ✅ Networking route_func: forward
        ✅ audit: audit testsuite test
        ✅ httpd: mod_ssl smoke sanity
        ✅ iotop: sanity
        ✅ tuned: tune-processes-through-perf
        ✅ ALSA PCM loopback test
        ✅ ALSA Control (mixer) Userspace Element test
        ✅ Usex - version 1.9-29
        🚧 ✅ CIFS Connectathon
        🚧 ✅ POSIX pjd-fstest suites

     Host 2:
        ✅ Boot test
        ❌ selinux-policy: serge-testsuite
        ❌ lvm thinp sanity
        ⚡⚡⚡ storage: software RAID testing
        🚧 ⚡⚡⚡ Storage blktests

   ppc64le:
     Host 1:
        ✅ Boot test
        ✅ Podman system integration test (as root)
        ✅ Podman system integration test (as user)
        ✅ LTP lite
        ✅ Loopdev Sanity
        ✅ jvm test suite
        ✅ AMTU (Abstract Machine Test Utility)
        ✅ LTP: openposix test suite
        ✅ Ethernet drivers sanity
        ✅ Networking socket: fuzz
        ✅ Networking route: pmtu
        ✅ Networking route_func: local
        ✅ Networking route_func: forward
        ✅ audit: audit testsuite test
        ✅ httpd: mod_ssl smoke sanity
        ✅ iotop: sanity
        ✅ tuned: tune-processes-through-perf
        ✅ ALSA PCM loopback test
        ✅ ALSA Control (mixer) Userspace Element test
        ✅ Usex - version 1.9-29
        🚧 ✅ CIFS Connectathon
        🚧 ✅ POSIX pjd-fstest suites

     Host 2:
        ✅ Boot test
        ❌ selinux-policy: serge-testsuite
        ✅ lvm thinp sanity
        ✅ storage: software RAID testing
        🚧 ✅ Storage blktests

   x86_64:
     Host 1:
        ✅ Boot test
        ❌ selinux-policy: serge-testsuite
        ✅ lvm thinp sanity
        ✅ storage: software RAID testing
        🚧 ✅ Storage blktests

     Host 2:
        ✅ Boot test
        ✅ Podman system integration test (as root)
        ✅ Podman system integration test (as user)
        ✅ LTP lite
        ✅ Loopdev Sanity
        ✅ jvm test suite
        ✅ AMTU (Abstract Machine Test Utility)
        ✅ LTP: openposix test suite
        ✅ Ethernet drivers sanity
        ✅ Networking socket: fuzz
        ✅ Networking route: pmtu
        ✅ Networking route_func: local
        ✅ Networking route_func: forward
        ✅ audit: audit testsuite test
        ✅ httpd: mod_ssl smoke sanity
        ✅ iotop: sanity
        ✅ tuned: tune-processes-through-perf
        ✅ pciutils: sanity smoke test
        ✅ ALSA PCM loopback test
        ✅ ALSA Control (mixer) Userspace Element test
        ✅ Usex - version 1.9-29
        ✅ stress: stress-ng
        🚧 ✅ CIFS Connectathon
        🚧 ✅ POSIX pjd-fstest suites

   Test sources: https://github.com/CKI-project/tests-beaker
     💚 Pull requests are welcome for new tests or improvements to existing tests!

Waived tests
------------
If the test run included waived tests, they are marked with 🚧. Such tests are
executed but their results are not taken into account. Tests are waived when
their results are not reliable enough, e.g. when they're just introduced or are
being fixed.

Testing timeout
---------------
We aim to provide a report within reasonable timeframe. Tests that haven't
finished running are marked with ⏱. Reports for non-upstream kernels have
a Beaker recipe linked to next to each host.







[Index of Archives]     [Linux Kernel]     [Kernel Development Newbies]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite Hiking]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux