On 9/3/20 1:46 PM, Jens Axboe wrote:
On 9/3/20 11:10 AM, Rachel Sibley wrote:
On 9/3/20 1:07 PM, CKI Project wrote:
Hello,
We ran automated tests on a recent commit from this kernel tree:
Kernel repo: https://git.kernel.org/pub/scm/linux/kernel/git/axboe/linux-block.git
Commit: 020ad0333b03 - Merge branch 'for-5.10/block' into for-next
The results of these automated tests are provided below.
Overall result: FAILED (see details below)
Merge: OK
Compile: OK
Tests: PANICKED
All kernel binaries, config files, and logs are available for download here:
https://cki-artifacts.s3.us-east-2.amazonaws.com/index.html?prefix=datawarehouse/2020/09/02/613166
One or more kernel tests failed:
ppc64le:
💥 storage: software RAID testing
aarch64:
💥 storage: software RAID testing
x86_64:
💥 storage: software RAID testing
Hello,
We're seeing a panic for all non s390x arches triggered by swraid test. Seems to be reproducible
for all succeeding pipelines after this one, and we haven't yet seen it in mainline or yesterday's
block tree results.
Thank you,
Rachel
https://cki-artifacts.s3.us-east-2.amazonaws.com/datawarehouse/2020/09/02/613166/build_aarch64_redhat%3A968098/tests/8757835_aarch64_3_console.log
[ 8394.609219] Internal error: Oops: 96000004 [#1] SMP
[ 8394.614070] Modules linked in: raid0 loop raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx dm_log_writes dm_flakey
rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache rfkill sunrpc vfat fat xgene_hwmon xgene_enet at803x mdio_xgene xgene_rng
xgene_edac mailbox_xgene_slimpro drm ip_tables xfs sdhci_of_arasan sdhci_pltfm i2c_xgene_slimpro crct10dif_ce sdhci gpio_dwapb cqhci xhci_plat_hcd
gpio_xgene_sb gpio_keys aes_neon_bs
[ 8394.654298] CPU: 3 PID: 471427 Comm: kworker/3:2 Kdump: loaded Not tainted 5.9.0-rc3-020ad03.cki #1
[ 8394.663299] Hardware name: AppliedMicro X-Gene Mustang Board/X-Gene Mustang Board, BIOS 3.06.25 Oct 17 2016
[ 8394.672999] Workqueue: md_misc mddev_delayed_delete
[ 8394.677853] pstate: 40400085 (nZcv daIf +PAN -UAO BTYPE=--)
[ 8394.683399] pc : percpu_ref_exit+0x5c/0xc8
[ 8394.687473] lr : percpu_ref_exit+0x20/0xc8
[ 8394.691547] sp : ffff800019f33d00
[ 8394.694843] x29: ffff800019f33d00 x28: 0000000000000000
[ 8394.700129] x27: ffff0003c63ae000 x26: ffff8000120b6228
[ 8394.705414] x25: 0000000000000001 x24: ffff0003d8322a80
[ 8394.710698] x23: 0000000000000000 x22: 0000000000000000
[ 8394.715983] x21: 0000000000000000 x20: ffff8000121d2000
[ 8394.721266] x19: ffff0003d8322af0 x18: 0000000000000000
[ 8394.726550] x17: 0000000000000000 x16: 0000000000000000
[ 8394.731834] x15: 0000000000000007 x14: 0000000000000003
[ 8394.737119] x13: 0000000000000000 x12: ffff0003888a1978
[ 8394.742403] x11: ffff0003888a1918 x10: 0000000000000001
[ 8394.747688] x9 : 0000000000000000 x8 : 0000000000000000
[ 8394.752972] x7 : 0000000000000400 x6 : 0000000000000001
[ 8394.758257] x5 : ffff800010423030 x4 : ffff8000121d2e40
[ 8394.763540] x3 : 0000000000000000 x2 : 0000000000000000
[ 8394.768825] x1 : 0000000000000000 x0 : 0000000000000000
[ 8394.774110] Call trace:
[ 8394.776544] percpu_ref_exit+0x5c/0xc8
[ 8394.780273] md_free+0x64/0xa0
[ 8394.783311] kobject_put+0x7c/0x218
[ 8394.786781] mddev_delayed_delete+0x3c/0x50
[ 8394.790944] process_one_work+0x1c4/0x450
[ 8394.794932] worker_thread+0x164/0x4a8
[ 8394.798662] kthread+0xf4/0x120
[ 8394.801787] ret_from_fork+0x10/0x18
[ 8394.805344] Code: 2a0403e0 350002c0 a9400262 52800001 (f9400000)
[ 8394.811407] ---[ end trace 481cab6e1ad73da1 ]---
Ming, I wonder if this is:
commit d0c567d60f3730b97050347ea806e1ee06445c78
Author: Ming Lei <ming.lei@xxxxxxxxxx>
Date: Wed Sep 2 20:26:42 2020 +0800
percpu_ref: reduce memory footprint of percpu_ref in fast path
Rachel, any chance you can do a run with that commit reverted?
Hi Jens, yes we're working on it and will share our findings as soon as the job finishes.