On 11/11/19 3:10 PM, Rachel Sibley wrote:
See https://lore.kernel.org/stable/120fa5fd-a82f-443e-156e-a9a11937a9ca@xxxxxxxxxx/
for the selinux-policy failures. There was also a panic on the Mustang board (aarch64)
while installing the dependencies for the thinp sanity test:
https://artifacts.cki-project.org/pipelines/278447/logs/aarch64_host_2_console.log
[ 1168.041811] restraintd[838]: ** Fetching task: 101902649 [/mnt/tests/github.com/CKI-project/tests-beaker/archive/master.zip/storage/lvm/thinp/sanity]
...
[ 1265.060278] Unable to handle kernel write to read-only memory at virtual address 0000300000000020
[ 1265.069118] Mem abort info:
[ 1265.071897] ESR = 0x96000044
[ 1265.074936] Exception class = DABT (current EL), IL = 32 bits
[ 1265.080825] SET = 0, FnV = 0
[ 1265.083862] EA = 0, S1PTW = 0
[ 1265.086985] Data abort info:
[ 1265.089849] ISV = 0, ISS = 0x00000044
[ 1265.093663] CM = 0, WnR = 1
[ 1265.096614] user pgtable: 4k pages, 48-bit VAs, pgdp=00000041e6436000
[ 1265.103023] [0000300000000020] pgd=0000000000000000
[ 1265.107878] Internal error: Oops: 96000044 [#1] SMP
[ 1265.112730] Modules linked in: sctp sunrpc vfat fat xgene_enet at803x mdio_xgene xgene_hwmon xgene_edac xgene_rng mailbox_xgene_slimpro crct10dif_ce ip_tables xfs libcrc32c sdhci_of_arasan sdhci_pltfm i2c_xgene_slimpro sdhci cqhci gpio_dwapb gpio_xgene_sb xhci_plat_hcd gpio_keys
[ 1265.137307] CPU: 3 PID: 454 Comm: kworker/3:1H Not tainted 5.3.10-b260a08.cki #1
[ 1265.144666] Hardware name: AppliedMicro X-Gene Mustang Board/X-Gene Mustang Board, BIOS 3.06.25 Oct 17 2016
[ 1265.154366] Workqueue: kblockd blk_mq_run_work_fn
[ 1265.159048] pstate: 20400085 (nzCv daIf +PAN -UAO)
[ 1265.163816] pc : rb_erase+0x9c/0x3a8
[ 1265.167373] lr : bfq_idle_extract+0x58/0xc0
[ 1265.171534] sp : ffff000011d4bb60
[ 1265.174830] x29: ffff000011d4bb60 x28: ffff8001ef4ac100
[ 1265.180115] x27: ffff00001174dae8 x26: ffff000011727000
[ 1265.185399] x25: ffff8001fcface00 x24: 0000000000000030
[ 1265.190684] x23: ffff8001fcfaec00 x22: 0000000000000008
[ 1265.195968] x21: ffff8001fae05d50 x20: ffff8001b56b3010
[ 1265.201253] x19: ffff8001b56b3098 x18: 00000000fffffffa
[ 1265.206537] x17: 0000000000000000 x16: 0000000000000000
[ 1265.211821] x15: 0000000000000001 x14: ffffffffffffffff
[ 1265.217106] x13: ffff000011d4bc20 x12: ffff000011d4bc14
[ 1265.222390] x11: ffff000010f3f6d0 x10: ffff000011d4bbc0
[ 1265.227674] x9 : 00000000ffffffd8 x8 : 000000000000359a
[ 1265.232959] x7 : 0000000000000018 x6 : 0000000000000004
[ 1265.238243] x5 : 0000000000000000 x4 : 0000300000000020
[ 1265.243527] x3 : ffff000011d4bc19 x2 : 0000000000000000
[ 1265.248812] x1 : ffff8001fae05d58 x0 : ffff8001b56b3098
[ 1265.254096] Call trace:
[ 1265.256530] rb_erase+0x9c/0x3a8
[ 1265.259741] bfq_put_idle_entity+0x28/0x50
[ 1265.263816] bfq_forget_idle+0x74/0x80
[ 1265.267545] bfq_bfqq_served+0xb4/0x180
[ 1265.271361] bfq_dispatch_request+0x184/0x6d8
[ 1265.275696] blk_mq_do_dispatch_sched+0xc4/0x108
[ 1265.280290] blk_mq_sched_dispatch_requests+0x114/0x190
[ 1265.285487] __blk_mq_run_hw_queue+0x9c/0x128
[ 1265.289821] blk_mq_run_work_fn+0x28/0x38
[ 1265.293810] process_one_work+0x1bc/0x3e8
[ 1265.297798] worker_thread+0x54/0x440
[ 1265.301441] kthread+0x104/0x130
[ 1265.304653] ret_from_fork+0x10/0x18
[ 1265.308211] Code: b2400042 f90000e2 d65f03c0 f9400002 (f9000082)
[ 1265.314275] ---[ end trace 3c2a926246d30ab0 ]---
The panic looks like another issue we've been tracking in Fedora:
https://bugzilla.redhat.com/show_bug.cgi?id=1767539
The bfq maintainer has suggested a possible fix.
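For readers who want to cross-check the "Mem abort info" lines in the log above, here is a minimal sketch that decodes the ESR value by hand. The bit positions follow the ARMv8-A ESR_ELx layout for data aborts as I understand it; the function name `decode_esr` is made up for this example and is not a kernel or CKI helper.

```python
def decode_esr(esr: int) -> dict:
    """Split an AArch64 ESR_ELx value into the fields the oops prints."""
    ec = (esr >> 26) & 0x3F          # exception class, bits [31:26]
    il = (esr >> 25) & 0x1           # instruction length bit
    iss = esr & 0x1FFFFFF            # instruction specific syndrome, bits [24:0]
    fields = {"EC": ec, "IL_bits": 32 if il else 16, "ISS": iss}
    # 0x24 = data abort from a lower EL, 0x25 = data abort at the current EL
    if ec in (0x24, 0x25):
        fields.update(
            ISV=(iss >> 24) & 1,
            SET=(iss >> 11) & 3,
            FnV=(iss >> 10) & 1,
            EA=(iss >> 9) & 1,
            CM=(iss >> 8) & 1,
            S1PTW=(iss >> 7) & 1,
            WnR=(iss >> 6) & 1,      # 1 means the faulting access was a write
            DFSC=iss & 0x3F,         # data fault status code
        )
    return fields


if __name__ == "__main__":
    for name, value in decode_esr(0x96000044).items():
        print(f"{name} = {value:#x}")
```

For 0x96000044 this gives EC = 0x25 (DABT, current EL), WnR = 1 and DFSC = 0x04, which should correspond to a level 0 translation fault on a write. That matches the "pgd=0000000000000000" line for the bogus address 0000300000000020.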
On 11/11/19 10:36 AM, CKI Project wrote:
Hello,
We ran automated tests on a patchset that was proposed for merging into this
kernel tree. The patches were applied to:
Kernel repo: https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
Commit: 81584694bb70 - Linux 5.3.10
The results of these automated tests are provided below.
Overall result: FAILED (see details below)
Merge: OK
Compile: OK
Tests: FAILED
All kernel binaries, config files, and logs are available for download here:
https://artifacts.cki-project.org/pipelines/278447
One or more kernel tests failed:
ppc64le:
❌ selinux-policy: serge-testsuite
aarch64:
❌ selinux-policy: serge-testsuite
❌ lvm thinp sanity
x86_64:
❌ selinux-policy: serge-testsuite
We hope that these logs can help you find the problem quickly. For the full
detail on our testing procedures, please scroll to the bottom of this message.
Please reply to this email if you have any questions about the tests that we
ran or if you have any suggestions on how to make future tests more effective.
  ,-.   ,-.
 ( C ) ( K )  Continuous
  `-',-.`-'   Kernel
    ( I )     Integration
     `-'
______________________________________________________________________________
Merge testing
-------------
We cloned this repository and checked out the following commit:
Repo: https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
Commit: 81584694bb70 - Linux 5.3.10
We grabbed the 92daf91c1099 commit of the stable queue repository.
We then merged the patchset with `git am`:
bonding-fix-state-transition-issue-in-link-monitoring.patch
cdc-ncm-handle-incomplete-transfer-of-mtu.patch
ipv4-fix-table-id-reference-in-fib_sync_down_addr.patch
net-ethernet-octeon_mgmt-account-for-second-possible-vlan-header.patch
net-fix-data-race-in-neigh_event_send.patch
net-qualcomm-rmnet-fix-potential-uaf-when-unregistering.patch
net-tls-fix-sk_msg-trim-on-fallback-to-copy-mode.patch
net-usb-qmi_wwan-add-support-for-dw5821e-with-esim-support.patch
nfc-fdp-fix-incorrect-free-object.patch
nfc-netlink-fix-double-device-reference-drop.patch
nfc-st21nfca-fix-double-free.patch
qede-fix-null-pointer-deref-in-__qede_remove.patch
net-mscc-ocelot-don-t-handle-netdev-events-for-other-netdevs.patch
net-mscc-ocelot-fix-null-pointer-on-lag-slave-removal.patch
net-tls-don-t-pay-attention-to-sk_write_pending-when-pushing-partial-records.patch
net-tls-add-a-tx-lock.patch
selftests-tls-add-test-for-concurrent-recv-and-send.patch
ipv6-fixes-rt6_probe-and-fib6_nh-last_probe-init.patch
net-hns-fix-the-stray-netpoll-locks-causing-deadlock-in-napi-path.patch
net-prevent-load-store-tearing-on-sk-sk_stamp.patch
net-sched-prevent-duplicate-flower-rules-from-tcf_proto-destroy-race.patch
net-smc-fix-ethernet-interface-refcounting.patch
vsock-virtio-fix-sock-refcnt-holding-during-the-shutdown.patch
r8169-fix-page-read-in-r8168g_mdio_read.patch
alsa-timer-fix-incorrectly-assigned-timer-instance.patch
alsa-bebob-fix-to-detect-configured-source-of-sampling-clock-for-focusrite-saffire-pro-i-o-series.patch
alsa-hda-ca0132-fix-possible-workqueue-stall.patch
mm-memcontrol-fix-null-ptr-deref-in-percpu-stats-flush.patch
mm-memcontrol-fix-network-errors-from-failing-__gfp_atomic-charges.patch
mm-meminit-recalculate-pcpu-batch-and-high-limits-after-init-completes.patch
mm-thp-handle-page-cache-thp-correctly-in-pagetranscompoundmap.patch
mm-vmstat-hide-proc-pagetypeinfo-from-normal-users.patch
dump_stack-avoid-the-livelock-of-the-dump_lock.patch
mm-slab-make-page_cgroup_ino-to-recognize-non-compound-slab-pages-properly.patch
btrfs-consider-system-chunk-array-size-for-new-system-chunks.patch
btrfs-tree-checker-fix-wrong-check-on-max-devid.patch
btrfs-save-i_size-to-avoid-double-evaluation-of-i_size_read-in-compress_file_range.patch
tools-gpio-use-building_out_of_srctree-to-determine-srctree.patch
pinctrl-intel-avoid-potential-glitches-if-pin-is-in-gpio-mode.patch
perf-tools-fix-time-sorting.patch
perf-map-use-zalloc-for-map_groups.patch
drm-radeon-fix-si_enable_smc_cac-failed-issue.patch
hid-wacom-generic-treat-serial-number-and-related-fields-as-unsigned.patch
mm-khugepaged-fix-might_sleep-warn-with-config_highpte-y.patch
soundwire-depend-on-acpi.patch
soundwire-depend-on-acpi-of.patch
soundwire-bus-set-initial-value-to-port_status.patch
blkcg-make-blkcg_print_stat-print-stats-only-for-online-blkgs.patch
arm64-do-not-mask-out-pte_rdonly-in-pte_same.patch
asoc-rsnd-dma-fix-ssi9-4-5-6-7-busif-dma-address.patch
ceph-fix-use-after-free-in-__ceph_remove_cap.patch
ceph-fix-rcu-case-handling-in-ceph_d_revalidate.patch
ceph-add-missing-check-in-d_revalidate-snapdir-handling.patch
ceph-don-t-try-to-handle-hashed-dentries-in-non-o_creat-atomic_open.patch
ceph-don-t-allow-copy_file_range-when-stripe_count-1.patch
iio-adc-stm32-adc-fix-stopping-dma.patch
iio-imu-adis16480-make-sure-provided-frequency-is-positive.patch
iio-imu-inv_mpu6050-fix-no-data-on-mpu6050.patch
iio-srf04-fix-wrong-limitation-in-distance-measuring.patch
arm-sunxi-fix-cpu-powerdown-on-a83t.patch
arm-dts-imx6-logicpd-re-enable-snvs-power-key.patch
cpufreq-intel_pstate-fix-invalid-epb-setting.patch
clone3-validate-stack-arguments.patch
netfilter-nf_tables-align-nft_expr-private-data-to-64-bit.patch
netfilter-ipset-fix-an-error-code-in-ip_set_sockfn_get.patch
intel_th-gth-fix-the-window-switching-sequence.patch
intel_th-pci-add-comet-lake-pch-support.patch
intel_th-pci-add-jasper-lake-pch-support.patch
x86-dumpstack-64-don-t-evaluate-exception-stacks-before-setup.patch
x86-apic-32-avoid-bogus-ldr-warnings.patch
smb3-fix-persistent-handles-reconnect.patch
can-usb_8dev-fix-use-after-free-on-disconnect.patch
can-flexcan-disable-completely-the-ecc-mechanism.patch
can-c_can-c_can_poll-only-read-status-register-after-status-irq.patch
can-peak_usb-fix-a-potential-out-of-sync-while-decoding-packets.patch
can-rx-offload-can_rx_offload_queue_sorted-fix-error-handling-avoid-skb-mem-leak.patch
can-gs_usb-gs_can_open-prevent-memory-leak.patch
can-dev-add-missing-of_node_put-after-calling-of_get_child_by_name.patch
can-mcba_usb-fix-use-after-free-on-disconnect.patch
can-peak_usb-fix-slab-info-leak.patch
configfs-fix-a-deadlock-in-configfs_symlink.patch
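As a rough illustration of the merge step, the sketch below applies a list of patch files in order with `git am` and reports the first one that fails. The function name `apply_series`, the injectable `run` parameter, and the abort-on-failure behaviour are assumptions made for this example; it is not a description of how the CKI pipeline is actually implemented.

```python
import subprocess


def apply_series(patches, repo, run=subprocess.run):
    """Apply patches in order with `git am`.

    Returns the name of the first patch that fails to apply, or None if
    the whole series applied cleanly. `run` is injectable so the logic
    can be exercised without a real repository.
    """
    for patch in patches:
        result = run(["git", "-C", str(repo), "am", str(patch)])
        if result.returncode != 0:
            # Leave the tree clean so the failure can be investigated.
            run(["git", "-C", str(repo), "am", "--abort"])
            return str(patch)
    return None
```

A return value of None would correspond to the "Merge: OK" line in the summary above.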
Compile testing
---------------
We compiled the kernel for 3 architectures:
aarch64:
make options: -j30 INSTALL_MOD_STRIP=1 targz-pkg
ppc64le:
make options: -j30 INSTALL_MOD_STRIP=1 targz-pkg
x86_64:
make options: -j30 INSTALL_MOD_STRIP=1 targz-pkg
Hardware testing
----------------
We booted each kernel and ran the following tests:
aarch64:
Host 1:
✅ Boot test
✅ Podman system integration test (as root)
✅ Podman system integration test (as user)
✅ LTP lite
✅ Loopdev Sanity
✅ jvm test suite
✅ AMTU (Abstract Machine Test Utility)
✅ LTP: openposix test suite
✅ Ethernet drivers sanity
✅ Networking socket: fuzz
✅ Networking route: pmtu
✅ Networking route_func: local
✅ Networking route_func: forward
✅ audit: audit testsuite test
✅ httpd: mod_ssl smoke sanity
✅ iotop: sanity
✅ tuned: tune-processes-through-perf
✅ ALSA PCM loopback test
✅ ALSA Control (mixer) Userspace Element test
✅ Usex - version 1.9-29
🚧 ✅ CIFS Connectathon
🚧 ✅ POSIX pjd-fstest suites
Host 2:
✅ Boot test
❌ selinux-policy: serge-testsuite
❌ lvm thinp sanity
⚡⚡⚡ storage: software RAID testing
🚧 ⚡⚡⚡ Storage blktests
ppc64le:
Host 1:
✅ Boot test
✅ Podman system integration test (as root)
✅ Podman system integration test (as user)
✅ LTP lite
✅ Loopdev Sanity
✅ jvm test suite
✅ AMTU (Abstract Machine Test Utility)
✅ LTP: openposix test suite
✅ Ethernet drivers sanity
✅ Networking socket: fuzz
✅ Networking route: pmtu
✅ Networking route_func: local
✅ Networking route_func: forward
✅ audit: audit testsuite test
✅ httpd: mod_ssl smoke sanity
✅ iotop: sanity
✅ tuned: tune-processes-through-perf
✅ ALSA PCM loopback test
✅ ALSA Control (mixer) Userspace Element test
✅ Usex - version 1.9-29
🚧 ✅ CIFS Connectathon
🚧 ✅ POSIX pjd-fstest suites
Host 2:
✅ Boot test
❌ selinux-policy: serge-testsuite
✅ lvm thinp sanity
✅ storage: software RAID testing
🚧 ✅ Storage blktests
x86_64:
Host 1:
✅ Boot test
❌ selinux-policy: serge-testsuite
✅ lvm thinp sanity
✅ storage: software RAID testing
🚧 ✅ Storage blktests
Host 2:
✅ Boot test
✅ Podman system integration test (as root)
✅ Podman system integration test (as user)
✅ LTP lite
✅ Loopdev Sanity
✅ jvm test suite
✅ AMTU (Abstract Machine Test Utility)
✅ LTP: openposix test suite
✅ Ethernet drivers sanity
✅ Networking socket: fuzz
✅ Networking route: pmtu
✅ Networking route_func: local
✅ Networking route_func: forward
✅ audit: audit testsuite test
✅ httpd: mod_ssl smoke sanity
✅ iotop: sanity
✅ tuned: tune-processes-through-perf
✅ pciutils: sanity smoke test
✅ ALSA PCM loopback test
✅ ALSA Control (mixer) Userspace Element test
✅ Usex - version 1.9-29
✅ stress: stress-ng
🚧 ✅ CIFS Connectathon
🚧 ✅ POSIX pjd-fstest suites
Test sources: https://github.com/CKI-project/tests-beaker
💚 Pull requests are welcome for new tests or improvements to existing tests!
Waived tests
------------
If the test run included waived tests, they are marked with 🚧. Such tests are
executed but their results are not taken into account. Tests are waived when
their results are not reliable enough, e.g. when they're just introduced or are
being fixed.
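To make the waiving rule concrete, here is a small sketch that reads result lines in the format used above and lists only the failures that count, i.e. tests marked ❌ without a 🚧 prefix. The names `parse_result` and `blocking_failures` are invented for this example. The report does not define the ⚡⚡⚡ marker, so the sketch simply does not treat it as a failure, which is consistent with the summary at the top listing only the ❌ tests.

```python
WAIVED = "🚧"
PASS, FAIL, OTHER = "✅", "❌", "⚡⚡⚡"


def parse_result(line):
    """Return (waived, marker, test_name), or None for non-result lines."""
    text = line.strip()
    waived = text.startswith(WAIVED)
    if waived:
        text = text[len(WAIVED):].strip()
    for marker in (PASS, FAIL, OTHER):
        if text.startswith(marker):
            return waived, marker, text[len(marker):].strip()
    return None


def blocking_failures(lines):
    """Names of failed tests whose results are not waived."""
    failures = []
    for line in lines:
        parsed = parse_result(line)
        if parsed is None:
            continue
        waived, marker, name = parsed
        if marker == FAIL and not waived:
            failures.append(name)
    return failures


if __name__ == "__main__":
    aarch64_host_2 = [
        "✅ Boot test",
        "❌ selinux-policy: serge-testsuite",
        "❌ lvm thinp sanity",
        "⚡⚡⚡ storage: software RAID testing",
        "🚧 ⚡⚡⚡ Storage blktests",
    ]
    print(blocking_failures(aarch64_host_2))
```

Run on the aarch64 Host 2 lines, this yields the same two tests that the summary reports as failed for aarch64.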
Testing timeout
---------------
We aim to provide a report within a reasonable timeframe. Tests that haven't
finished running are marked with ⏱. Reports for non-upstream kernels have
a Beaker recipe linked next to each host.