RAID 5 kernel BUG

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi,

I'm having an issue with a new system I've put together with RAID 5. I'm
not suffering data loss, but I'm triggering a kernel bug message, and
since my root file system is underpinned by that RAID device, things
wanting to write hang.

From my reading of the kernel code, it appears to be the
BUG_ON(dev->written); inside the ops_run_biodrain() function. I'm
unclear if it's a hardware issue, or something else is going on. Any
help appreciated.

Please CC me, I am not subscribed.

uname -a:

Linux thrashed 4.18.8-041808-generic #201809150431 SMP Sat Sep 15
08:33:36 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

mdadm --detail output:

/dev/md0:
           Version : 1.2
     Creation Time : Tue Jul  3 22:26:08 2018
        Raid Level : raid5
        Array Size : 23440220160 (22354.34 GiB 24002.79 GB)
     Used Dev Size : 7813406720 (7451.45 GiB 8000.93 GB)
      Raid Devices : 4
     Total Devices : 4
       Persistence : Superblock is persistent

     Intent Bitmap : Internal

       Update Time : Thu Oct  4 14:34:12 2018
             State : clean
    Active Devices : 4
   Working Devices : 4
    Failed Devices : 0
     Spare Devices : 0

            Layout : left-symmetric
        Chunk Size : 512K

Consistency Policy : bitmap

              Name : thrashed:0  (local to host thrashed)
              UUID : 30c060c2:ef971ee5:9da59520:c136e932
            Events : 26360

    Number   Major   Minor   RaidDevice State
       0       8        2        0      active sync   /dev/sda2
       1       8       50        1      active sync   /dev/sdd2
       2       8       18        2      active sync   /dev/sdb2
       3       8       34        3      active sync   /dev/sdc2

BUG output:

[786475.430507] ------------[ cut here ]------------
[786475.430508] kernel BUG at /home/kernel/COD/linux/drivers/md/raid5.c:1762!
[786475.430514] invalid opcode: 0000 [#1] SMP PTI
[786475.430517] CPU: 2 PID: 299 Comm: md0_raid5 Not tainted 4.18.0-041800rc5-generic #201807152130 [786475.430518] Hardware name: Gigabyte Technology Co., Ltd. H370M-D3H/H370M D3H-CF, BIOS F3 03/01/2018
[786475.430523] RIP: 0010:raid_run_ops+0xbcb/0x15e0 [raid456]
[786475.430524] Code: 63 d2 48 8d 04 d0 48 89 45 c8 0f 84 8b 09 00 00 48 8b 7d 98 44 89 f1 31 d2 4c 89 e6 e8 8e 38 c9 ff 48 89 45 98 e9 07 f6 ff ff <0f> 0b 0f 0b e9 85 fb ff ff 8b 4d 90 89 fe e9 12 fd ff ff 4c 8b ad
[786475.430542] RSP: 0018:ffffb1c381333b30 EFLAGS: 00010002
[786475.430544] RAX: 0000000000000178 RBX: ffff8ac962774ed8 RCX: 0000000000000000 [786475.430545] RDX: ffff8ac772f966b8 RSI: ffff8ac8031153f0 RDI: ffff8ac772f965a8 [786475.430546] RBP: ffffb1c381333be8 R08: 0000000000001000 R09: ffffb1c381333ac0 [786475.430548] R10: 0000000000001000 R11: ffff8ac772f965c8 R12: ffff8ac772f96900 [786475.430549] R13: ffff8ac772f965a8 R14: ffff8ac772f96540 R15: ffff8ac9581e7800 [786475.430551] FS: 0000000000000000(0000) GS:ffff8ac96e480000(0000) knlGS:0000000000000000
[786475.430553] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[786475.430555] CR2: 00007fbf70858000 CR3: 0000000256a0a001 CR4: 00000000003606e0
[786475.430556] Call Trace:
[786475.430561]  handle_stripe+0x66c/0x21d0 [raid456]
[786475.430564]  handle_active_stripes.isra.62+0x3d8/0x5c0 [raid456]
[786475.430567]  raid5d+0x407/0x5f0 [raid456]
[786475.430570]  md_thread+0x129/0x170
[786475.430573]  ? handle_active_stripes.isra.62+0x5c0/0x5c0 [raid456]
[786475.430575]  ? md_thread+0x129/0x170
[786475.430577]  ? wait_woken+0x80/0x80
[786475.430579]  kthread+0x121/0x140
[786475.430581]  ? md_rdev_init+0xb0/0xb0
[786475.430583]  ? kthread_create_worker_on_cpu+0x70/0x70
[786475.430586]  ret_from_fork+0x35/0x40
[786475.430587] Modules linked in: ufs qnx4 hfsplus hfs minix ntfs msdos jfs cpuid xfs nls_iso8859_1 snd_hda_codec_hdmi snd_hda_codec_realtek intel_rapl snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel kvm snd_hda_codec snd_hda_core irqbypass ppdev snd_hwdep snd_pcm snd_timer parport_pc intel_cstate wmi_bmof parport snd intel_rapl_perf soundcore intel_pch_thermal mei_me mac_hid mei acpi_pad sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core nfsd auth_rpcgss nfs_acl lockd iscsi_tcp libiscsi_tcp libiscsi grace scsi_transport_iscsi sunrpc ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid1 raid0 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c morus1280_avx2 morus1280_sse2 morus1280_glue morus640_sse2 [786475.430616] morus640_glue aegis256_aesni aegis128l_aesni i915 aegis128_aesni crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc i2c_algo_bit aesni_intel drm_kms_helper aes_x86_64 syscopyarea sysfillrect crypto_simd sysimgblt cryptd fb_sys_fops glue_helper drm e1000e ahci libahci wmi video pinctrl_cannonlake pinctrl_intel
[786475.430629] ---[ end trace d86c65ff224a886f ]---
[786475.430632] RIP: 0010:raid_run_ops+0xbcb/0x15e0 [raid456]
[786475.430633] Code: 63 d2 48 8d 04 d0 48 89 45 c8 0f 84 8b 09 00 00 48 8b 7d 98 44 89 f1 31 d2 4c 89 e6 e8 8e 38 c9 ff 48 89 45 98 e9 07 f6 ff ff <0f> 0b 0f 0b e9 85 fb ff ff 8b 4d 90 89 fe e9 12 fd ff ff 4c 8b ad
[786475.430651] RSP: 0018:ffffb1c381333b30 EFLAGS: 00010002
[786475.430652] RAX: 0000000000000178 RBX: ffff8ac962774ed8 RCX: 0000000000000000 [786475.430654] RDX: ffff8ac772f966b8 RSI: ffff8ac8031153f0 RDI: ffff8ac772f965a8 [786475.430655] RBP: ffffb1c381333be8 R08: 0000000000001000 R09: ffffb1c381333ac0 [786475.430657] R10: 0000000000001000 R11: ffff8ac772f965c8 R12: ffff8ac772f96900 [786475.430658] R13: ffff8ac772f965a8 R14: ffff8ac772f96540 R15: ffff8ac9581e7800 [786475.430660] FS: 0000000000000000(0000) GS:ffff8ac96e480000(0000) knlGS:0000000000000000
[786475.430662] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[786475.430663] CR2: 00007fbf70858000 CR3: 0000000256a0a001 CR4: 00000000003606e0

Thanks,
--
                                        Steve
"I'm a doctor, not a doorstop!"
         - EMH, USS Enterprise



[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux