Hard CPU Lockup when accessing MD RAID5

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Im having some issues on a brand new Supermicro server that we have running in production along side a few other machines which are identical to this server..

The output from the netconsole attached to the server is here:

Apr 12 21:34:45 [75704.964946] NMI watchdog: Watchdog detected hard LOCKUP on cpu 6
Apr 12 21:34:45
Apr 12 21:34:45  [75704.964973] Modules linked in:
Apr 12 21:34:45   ipt_REJECT
Apr 12 21:34:45   nf_reject_ipv4
Apr 12 21:34:45   iptable_mangle
Apr 12 21:34:45   tun
Apr 12 21:34:45   netconsole
Apr 12 21:34:45   configfs
Apr 12 21:34:45   xt_multiport
Apr 12 21:34:45   ip6table_filter
Apr 12 21:34:45   ip6_tables
Apr 12 21:34:45   iptable_filter
Apr 12 21:34:45   ip_tables
Apr 12 21:34:45   x_tables
Apr 12 21:34:45   bridge
Apr 12 21:34:45   stp
Apr 12 21:34:45   llc
Apr 12 21:34:45   bonding
Apr 12 21:34:45   ext4
Apr 12 21:34:45   crc16
Apr 12 21:34:45   mbcache
Apr 12 21:34:45   jbd2
Apr 12 21:34:45   raid1
Apr 12 21:34:45   raid0
Apr 12 21:34:45   raid456
Apr 12 21:34:45   async_raid6_recov
Apr 12 21:34:45   async_memcpy
Apr 12 21:34:45   async_pq
Apr 12 21:34:45   async_xor
Apr 12 21:34:45   xor
Apr 12 21:34:45   async_tx
Apr 12 21:34:45   raid6_pq
Apr 12 21:34:45   md_mod
Apr 12 21:34:45   sr_mod
Apr 12 21:34:45   cdrom
Apr 12 21:34:45   usb_storage
Apr 12 21:34:45   hid_generic
Apr 12 21:34:45   usbhid
Apr 12 21:34:45   hid
Apr 12 21:34:45   sg
Apr 12 21:34:45   sd_mod
Apr 12 21:34:45   x86_pkg_temp_thermal
Apr 12 21:34:45   coretemp
Apr 12 21:34:45   crct10dif_pclmul
Apr 12 21:34:45   crc32_pclmul
Apr 12 21:34:45   crc32c_intel
Apr 12 21:34:45   jitterentropy_rng
Apr 12 21:34:45   sha256_ssse3
Apr 12 21:34:45   sha256_generic
Apr 12 21:34:45   hmac
Apr 12 21:34:45   iTCO_wdt
Apr 12 21:34:45   iTCO_vendor_support
Apr 12 21:34:45   drbg
Apr 12 21:34:45   ansi_cprng
Apr 12 21:34:45   aesni_intel
Apr 12 21:34:45   aes_x86_64
Apr 12 21:34:45   lrw
Apr 12 21:34:45   gf128mul
Apr 12 21:34:45   glue_helper
Apr 12 21:34:45   ablk_helper
Apr 12 21:34:45   cryptd
Apr 12 21:34:45   ahci
Apr 12 21:34:45   libahci
Apr 12 21:34:45   sb_edac
Apr 12 21:34:45   libata
Apr 12 21:34:45   igb
Apr 12 21:34:45   megaraid_sas
Apr 12 21:34:45   xhci_pci
Apr 12 21:34:45   ehci_pci
Apr 12 21:34:45   i2c_algo_bit
Apr 12 21:34:45   xhci_hcd
Apr 12 21:34:45   ehci_hcd
Apr 12 21:34:45   edac_core
Apr 12 21:34:45   ptp
Apr 12 21:34:45   mei_me
Apr 12 21:34:45   lpc_ich
Apr 12 21:34:45   i2c_i801
Apr 12 21:34:45   usbcore
Apr 12 21:34:45   pps_core
Apr 12 21:34:45   mfd_core
Apr 12 21:34:45   mei
Apr 12 21:34:45   usb_common
Apr 12 21:34:45   i2c_core
Apr 12 21:34:45   ioatdma
Apr 12 21:34:45   scsi_mod
Apr 12 21:34:45   dca
Apr 12 21:34:45   ipmi_si
Apr 12 21:34:45   ipmi_msghandler
Apr 12 21:34:45   acpi_power_meter
Apr 12 21:34:45   tpm_tis
Apr 12 21:34:45   tpm
Apr 12 21:34:45   processor
Apr 12 21:34:45   button
Apr 12 21:34:45
Apr 12 21:34:45 [75704.965874] CPU: 6 PID: 25339 Comm: main Not tainted 4.4.1 #2 Apr 12 21:34:45 [75704.965916] Hardware name: Supermicro Super Server/X10DRi-LN4+, BIOS 2.0 12/17/2015
Apr 12 21:34:45  [75704.965979]  0000000000000000
Apr 12 21:34:45   ffffffff812abdf3
Apr 12 21:34:45   0000000000000000
Apr 12 21:34:45   ffffffff810cf5f5
Apr 12 21:34:45
Apr 12 21:34:45  [75704.966054]  ffff881ff2870000
Apr 12 21:34:45   ffffffff810fcea2
Apr 12 21:34:45   0000000000000001
Apr 12 21:34:45   ffff881fffcc5e58
Apr 12 21:34:45
Apr 12 21:34:45  [75704.966134]  ffff881fffccaf00
Apr 12 21:34:45   ffff881fffccb100
Apr 12 21:34:45   ffff881ff2870000
Apr 12 21:34:45   ffffffff8101bc63
Apr 12 21:34:45
Apr 12 21:34:45  [75704.966211] Call Trace:
Apr 12 21:34:45  [75704.966246]  <NMI>
Apr 12 21:34:45   [<ffffffff812abdf3>] ? dump_stack+0x40/0x5d
Apr 12 21:34:45 [75704.966297] [<ffffffff810cf5f5>] ? watchdog_overflow_callback+0xb5/0xd0 Apr 12 21:34:45 [75704.966339] [<ffffffff810fcea2>] ? __perf_event_overflow+0x82/0x1c0 Apr 12 21:34:45 [75704.966384] [<ffffffff8101bc63>] ? intel_pmu_handle_irq+0x1c3/0x3e0 Apr 12 21:34:45 [75704.966431] [<ffffffff8113b5cb>] ? vunmap_page_range+0x1bb/0x320 Apr 12 21:34:45 [75704.966474] [<ffffffff813213e0>] ? ghes_copy_tofrom_phys+0x110/0x1d0 Apr 12 21:34:45 [75704.966519] [<ffffffff81014f53>] ? perf_event_nmi_handler+0x23/0x40 Apr 12 21:34:45 [75704.966560] [<ffffffff81007b85>] ? nmi_handle+0x65/0x100
Apr 12 21:34:45  [75704.966597]  [<ffffffff81007dfe>] ? do_nmi+0x1de/0x360
Apr 12 21:34:45 [75704.970603] [<ffffffff8148f957>] ? end_repeat_nmi+0x1a/0x1e Apr 12 21:34:45 [75704.970644] [<ffffffff810862ca>] ? queued_spin_lock_slowpath+0xea/0x150 Apr 12 21:34:45 [75704.970685] [<ffffffff810862ca>] ? queued_spin_lock_slowpath+0xea/0x150 Apr 12 21:34:45 [75704.970728] [<ffffffff810862ca>] ? queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:45  [75704.970768]  <<EOE>>
Apr 12 21:34:45   [<ffffffffa01b413b>] ? make_request+0x60b/0xbd0 [raid456]
Apr 12 21:34:45  [75704.970838]  [<ffffffff810815c0>] ? wait_woken+0x80/0x80
Apr 12 21:34:45 [75704.970878] [<ffffffff81151ec4>] ? kmem_cache_alloc+0xf4/0x120 Apr 12 21:34:45 [75704.970922] [<ffffffffa017632d>] ? md_make_request+0xdd/0x220 [md_mod] Apr 12 21:34:45 [75704.970969] [<ffffffff81219fde>] ? xfs_map_buffer.isra.12+0x2e/0x60 Apr 12 21:34:45 [75704.971012] [<ffffffff8128691d>] ? generic_make_request+0xed/0x1d0 Apr 12 21:34:45 [75704.971052] [<ffffffff81286a5a>] ? submit_bio+0x5a/0x140 Apr 12 21:34:45 [75704.971098] [<ffffffff81113379>] ? release_pages+0xc9/0x270 Apr 12 21:34:45 [75704.971145] [<ffffffff811a2c01>] ? do_mpage_readpage+0x2d1/0x640 Apr 12 21:34:45 [75704.971187] [<ffffffff811a304d>] ? mpage_readpages+0xdd/0x130 Apr 12 21:34:45 [75704.971226] [<ffffffff8121b510>] ? __xfs_get_blocks+0x750/0x750 Apr 12 21:34:45 [75704.971267] [<ffffffff8121b510>] ? __xfs_get_blocks+0x750/0x750 Apr 12 21:34:45 [75704.971313] [<ffffffff8114ad45>] ? alloc_pages_current+0x85/0x110 Apr 12 21:34:45 [75704.971354] [<ffffffff81111d25>] ? __do_page_cache_readahead+0x165/0x1f0 Apr 12 21:34:45 [75704.971399] [<ffffffff81105902>] ? pagecache_get_page+0x22/0x1a0 Apr 12 21:34:45 [75704.971441] [<ffffffff8110768c>] ? filemap_fault+0x37c/0x400 Apr 12 21:34:45 [75704.971481] [<ffffffff8122474b>] ? xfs_filemap_fault+0x3b/0x80
Apr 12 21:34:45  [75704.971526]  [<ffffffff8112d2da>] ? __do_fault+0x3a/0xc0
Apr 12 21:34:45 [75704.971564] [<ffffffff81130883>] ? handle_mm_fault+0x1063/0x1650 Apr 12 21:34:45 [75704.971614] [<ffffffff8103bdae>] ? __do_page_fault+0x11e/0x370 Apr 12 21:34:45 [75704.971653] [<ffffffff811aa4ff>] ? SyS_epoll_wait+0x8f/0xd0
Apr 12 21:34:45  [75704.971694]  [<ffffffff8148f64f>] ? page_fault+0x1f/0x30
Apr 12 21:34:45 [75705.493640] NMI watchdog: Watchdog detected hard LOCKUP on cpu 12
Apr 12 21:34:45
Apr 12 21:34:45  [75705.493668] Modules linked in:
Apr 12 21:34:45   ipt_REJECT
Apr 12 21:34:45   nf_reject_ipv4
Apr 12 21:34:45   iptable_mangle
Apr 12 21:34:45   tun
Apr 12 21:34:45   netconsole
Apr 12 21:34:45   configfs
Apr 12 21:34:45   xt_multiport
Apr 12 21:34:45   ip6table_filter
Apr 12 21:34:45   ip6_tables
Apr 12 21:34:45   iptable_filter
Apr 12 21:34:45   ip_tables
Apr 12 21:34:45   x_tables
Apr 12 21:34:45   bridge
Apr 12 21:34:45   stp
Apr 12 21:34:45   llc
Apr 12 21:34:45   bonding
Apr 12 21:34:45   ext4
Apr 12 21:34:45   crc16
Apr 12 21:34:45   mbcache
Apr 12 21:34:45   jbd2
Apr 12 21:34:45   raid1
Apr 12 21:34:45   raid0
Apr 12 21:34:45   raid456
Apr 12 21:34:45   async_raid6_recov
Apr 12 21:34:45   async_memcpy
Apr 12 21:34:45   async_pq
Apr 12 21:34:45   async_xor
Apr 12 21:34:45   xor
Apr 12 21:34:45   async_tx
Apr 12 21:34:45   raid6_pq
Apr 12 21:34:45   md_mod
Apr 12 21:34:45   sr_mod
Apr 12 21:34:45   cdrom
Apr 12 21:34:45   usb_storage
Apr 12 21:34:45   hid_generic
Apr 12 21:34:45   usbhid
Apr 12 21:34:45   hid
Apr 12 21:34:45   sg
Apr 12 21:34:45   sd_mod
Apr 12 21:34:45   x86_pkg_temp_thermal
Apr 12 21:34:45   coretemp
Apr 12 21:34:45   crct10dif_pclmul
Apr 12 21:34:45   crc32_pclmul
Apr 12 21:34:45   crc32c_intel
Apr 12 21:34:45   jitterentropy_rng
Apr 12 21:34:45   sha256_ssse3
Apr 12 21:34:45   sha256_generic
Apr 12 21:34:45   hmac
Apr 12 21:34:45   iTCO_wdt
Apr 12 21:34:45   iTCO_vendor_support
Apr 12 21:34:45   drbg
Apr 12 21:34:45   ansi_cprng
Apr 12 21:34:45   aesni_intel
Apr 12 21:34:45   aes_x86_64
Apr 12 21:34:45   lrw
Apr 12 21:34:45   gf128mul
Apr 12 21:34:45   glue_helper
Apr 12 21:34:45   ablk_helper
Apr 12 21:34:45   cryptd
Apr 12 21:34:45   ahci
Apr 12 21:34:45   libahci
Apr 12 21:34:45   sb_edac
Apr 12 21:34:45   libata
Apr 12 21:34:45   igb
Apr 12 21:34:45   megaraid_sas
Apr 12 21:34:45   xhci_pci
Apr 12 21:34:45   ehci_pci
Apr 12 21:34:45   i2c_algo_bit
Apr 12 21:34:45   xhci_hcd
Apr 12 21:34:45   ehci_hcd
Apr 12 21:34:45   edac_core
Apr 12 21:34:45   ptp
Apr 12 21:34:45   mei_me
Apr 12 21:34:45   lpc_ich
Apr 12 21:34:45   i2c_i801
Apr 12 21:34:45   usbcore
Apr 12 21:34:45   pps_core
Apr 12 21:34:45   mfd_core
Apr 12 21:34:45   mei
Apr 12 21:34:45   usb_common
Apr 12 21:34:45   i2c_core
Apr 12 21:34:45   ioatdma
Apr 12 21:34:45   scsi_mod
Apr 12 21:34:45   dca
Apr 12 21:34:45   ipmi_si
Apr 12 21:34:45   ipmi_msghandler
Apr 12 21:34:45   acpi_power_meter
Apr 12 21:34:45   tpm_tis
Apr 12 21:34:45   tpm
Apr 12 21:34:45   processor
Apr 12 21:34:45   button
Apr 12 21:34:45
Apr 12 21:34:45 [75705.494688] CPU: 12 PID: 32350 Comm: main Not tainted 4.4.1 #2 Apr 12 21:34:45 [75705.494728] Hardware name: Supermicro Super Server/X10DRi-LN4+, BIOS 2.0 12/17/2015
Apr 12 21:34:45  [75705.494790]  0000000000000000
Apr 12 21:34:45   ffffffff812abdf3
Apr 12 21:34:45   0000000000000000
Apr 12 21:34:45   ffffffff810cf5f5
Apr 12 21:34:45
Apr 12 21:34:45  [75705.494886]  ffff883ff29a0000
Apr 12 21:34:45   ffffffff810fcea2
Apr 12 21:34:45   0000000000000001
Apr 12 21:34:45   ffff88407fc85e58
Apr 12 21:34:45
Apr 12 21:34:45  [75705.494976]  ffff88407fc8af00
Apr 12 21:34:45   ffff88407fc8b100
Apr 12 21:34:45   ffff883ff29a0000
Apr 12 21:34:45   ffffffff8101bc63
Apr 12 21:34:45
Apr 12 21:34:45  [75705.495064] Call Trace:
Apr 12 21:34:45  [75705.495094]  <NMI>
Apr 12 21:34:45   [<ffffffff812abdf3>] ? dump_stack+0x40/0x5d
Apr 12 21:34:45 [75705.495150] [<ffffffff810cf5f5>] ? watchdog_overflow_callback+0xb5/0xd0 Apr 12 21:34:45 [75705.495193] [<ffffffff810fcea2>] ? __perf_event_overflow+0x82/0x1c0 Apr 12 21:34:45 [75705.495237] [<ffffffff8101bc63>] ? intel_pmu_handle_irq+0x1c3/0x3e0 Apr 12 21:34:45 [75705.495284] [<ffffffff8113b5cb>] ? vunmap_page_range+0x1bb/0x320 Apr 12 21:34:45 [75705.495330] [<ffffffff813213e0>] ? ghes_copy_tofrom_phys+0x110/0x1d0 Apr 12 21:34:45 [75705.495373] [<ffffffff81014f53>] ? perf_event_nmi_handler+0x23/0x40 Apr 12 21:34:45 [75705.495418] [<ffffffff81007b85>] ? nmi_handle+0x65/0x100
Apr 12 21:34:45  [75705.495458]  [<ffffffff81007d2e>] ? do_nmi+0x10e/0x360
Apr 12 21:34:45 [75705.495497] [<ffffffff8148f957>] ? end_repeat_nmi+0x1a/0x1e Apr 12 21:34:45 [75705.495540] [<ffffffff810862ca>] ? queued_spin_lock_slowpath+0xea/0x150 Apr 12 21:34:45 [75705.495581] [<ffffffff810862ca>] ? queued_spin_lock_slowpath+0xea/0x150 Apr 12 21:34:45 [75705.495621] [<ffffffff810862ca>] ? queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:45  [75705.495661]  <<EOE>>
Apr 12 21:34:45   [<ffffffffa01b413b>] ? make_request+0x60b/0xbd0 [raid456]
Apr 12 21:34:45 [75705.495733] [<ffffffff81282d87>] ? blk_rq_init+0x87/0xa0 Apr 12 21:34:45 [75705.495771] [<ffffffff81283e3c>] ? get_request+0x29c/0x6e0
Apr 12 21:34:45  [75705.495812]  [<ffffffff810815c0>] ? wait_woken+0x80/0x80
Apr 12 21:34:45 [75705.495853] [<ffffffffa017632d>] ? md_make_request+0xdd/0x220 [md_mod] Apr 12 21:34:45 [75705.495898] [<ffffffff8128829e>] ? blk_queue_bio+0x15e/0x350 Apr 12 21:34:45 [75705.495937] [<ffffffff8128691d>] ? generic_make_request+0xed/0x1d0 Apr 12 21:34:45 [75705.495978] [<ffffffff81286a5a>] ? submit_bio+0x5a/0x140 Apr 12 21:34:45 [75705.496018] [<ffffffff811a215e>] ? mpage_bio_submit+0x1e/0x30 Apr 12 21:34:45 [75705.496057] [<ffffffff811a3076>] ? mpage_readpages+0x106/0x130 Apr 12 21:34:45 [75705.496102] [<ffffffff8121b510>] ? __xfs_get_blocks+0x750/0x750 Apr 12 21:34:45 [75705.496144] [<ffffffff8121b510>] ? __xfs_get_blocks+0x750/0x750 Apr 12 21:34:45 [75705.496185] [<ffffffff8114ad45>] ? alloc_pages_current+0x85/0x110 Apr 12 21:34:45 [75705.496227] [<ffffffff81111d25>] ? __do_page_cache_readahead+0x165/0x1f0
Apr 12 21:34:45  [75705.496268]  [<ffffffff811344f5>] ? vma_link+0x75/0xb0
Apr 12 21:34:45 [75705.496307] [<ffffffff811120eb>] ? force_page_cache_readahead+0x9b/0xe0 Apr 12 21:34:45 [75705.496352] [<ffffffff8113f876>] ? madvise_willneed+0x76/0x140 Apr 12 21:34:45 [75705.496395] [<ffffffff811301ce>] ? handle_mm_fault+0x9ae/0x1650
Apr 12 21:34:45  [75705.496437]  [<ffffffff81133dcb>] ? find_vma+0x5b/0x70
Apr 12 21:34:45 [75705.496476] [<ffffffff8113fc52>] ? SyS_madvise+0x312/0x6f0 Apr 12 21:34:45 [75705.496515] [<ffffffff8148d9db>] ? entry_SYSCALL_64_fastpath+0x16/0x6e Apr 12 21:34:47 [75707.118049] NMI watchdog: Watchdog detected hard LOCKUP on cpu 15
Apr 12 21:34:47
Apr 12 21:34:47  [75707.118078] Modules linked in:
Apr 12 21:34:47   ipt_REJECT
Apr 12 21:34:47   nf_reject_ipv4
Apr 12 21:34:47   iptable_mangle
Apr 12 21:34:47   tun
Apr 12 21:34:47   netconsole
Apr 12 21:34:47   configfs
Apr 12 21:34:47   xt_multiport
Apr 12 21:34:47   ip6table_filter
Apr 12 21:34:47   ip6_tables
Apr 12 21:34:47   iptable_filter
Apr 12 21:34:47   ip_tables
Apr 12 21:34:47   x_tables
Apr 12 21:34:47   bridge
Apr 12 21:34:47   stp
Apr 12 21:34:47   llc
Apr 12 21:34:47   bonding
Apr 12 21:34:47   ext4
Apr 12 21:34:47   crc16
Apr 12 21:34:47   mbcache
Apr 12 21:34:47   jbd2
Apr 12 21:34:47   raid1
Apr 12 21:34:47   raid0
Apr 12 21:34:47   raid456
Apr 12 21:34:47   async_raid6_recov
Apr 12 21:34:47   async_memcpy
Apr 12 21:34:47   async_pq
Apr 12 21:34:47   async_xor
Apr 12 21:34:47   xor
Apr 12 21:34:47   async_tx
Apr 12 21:34:47   raid6_pq
Apr 12 21:34:47   md_mod
Apr 12 21:34:47   sr_mod
Apr 12 21:34:47   cdrom
Apr 12 21:34:47   usb_storage
Apr 12 21:34:47   hid_generic
Apr 12 21:34:47   usbhid
Apr 12 21:34:47   hid
Apr 12 21:34:47   sg
Apr 12 21:34:47   sd_mod
Apr 12 21:34:47   x86_pkg_temp_thermal
Apr 12 21:34:47   coretemp
Apr 12 21:34:47   crct10dif_pclmul
Apr 12 21:34:47   crc32_pclmul
Apr 12 21:34:47   crc32c_intel
Apr 12 21:34:47   jitterentropy_rng
Apr 12 21:34:47   sha256_ssse3
Apr 12 21:34:47   sha256_generic
Apr 12 21:34:47   hmac
Apr 12 21:34:47   iTCO_wdt
Apr 12 21:34:47   iTCO_vendor_support
Apr 12 21:34:47   drbg
Apr 12 21:34:47   ansi_cprng
Apr 12 21:34:47   aesni_intel
Apr 12 21:34:47   aes_x86_64
Apr 12 21:34:47   lrw
Apr 12 21:34:47   gf128mul
Apr 12 21:34:47   glue_helper
Apr 12 21:34:47   ablk_helper
Apr 12 21:34:47   cryptd
Apr 12 21:34:47   ahci
Apr 12 21:34:47   libahci
Apr 12 21:34:47   sb_edac
Apr 12 21:34:47   libata
Apr 12 21:34:47   igb
Apr 12 21:34:47   megaraid_sas
Apr 12 21:34:47   xhci_pci
Apr 12 21:34:47   ehci_pci
Apr 12 21:34:47   i2c_algo_bit
Apr 12 21:34:47   xhci_hcd
Apr 12 21:34:47   ehci_hcd
Apr 12 21:34:47   edac_core
Apr 12 21:34:47   ptp
Apr 12 21:34:47   mei_me
Apr 12 21:34:47   lpc_ich
Apr 12 21:34:47   i2c_i801
Apr 12 21:34:47   usbcore
Apr 12 21:34:47   pps_core
Apr 12 21:34:47   mfd_core
Apr 12 21:34:47   mei
Apr 12 21:34:47   usb_common
Apr 12 21:34:47   i2c_core
Apr 12 21:34:47   ioatdma
Apr 12 21:34:47   scsi_mod
Apr 12 21:34:47   dca
Apr 12 21:34:47   ipmi_si
Apr 12 21:34:47   ipmi_msghandler
Apr 12 21:34:47   acpi_power_meter
Apr 12 21:34:47   tpm_tis
Apr 12 21:34:47   tpm
Apr 12 21:34:47   processor
Apr 12 21:34:47   button
Apr 12 21:34:47
Apr 12 21:34:47 [75707.119088] CPU: 15 PID: 31940 Comm: main Not tainted 4.4.1 #2 Apr 12 21:34:47 [75707.119134] Hardware name: Supermicro Super Server/X10DRi-LN4+, BIOS 2.0 12/17/2015
Apr 12 21:34:47  [75707.119196]  0000000000000000
Apr 12 21:34:47   ffffffff812abdf3
Apr 12 21:34:47   0000000000000000
Apr 12 21:34:47   ffffffff810cf5f5
Apr 12 21:34:47
Apr 12 21:34:47  [75707.119277]  ffff883ff2a20000
Apr 12 21:34:47   ffffffff810fcea2
Apr 12 21:34:47   0000000000000001
Apr 12 21:34:47   ffff88407fce5e58
Apr 12 21:34:47
Apr 12 21:34:47  [75707.119360]  ffff88407fceaf00
Apr 12 21:34:47   ffff88407fceb100
Apr 12 21:34:47   ffff883ff2a20000
Apr 12 21:34:47   ffffffff8101bc63
Apr 12 21:34:47
Apr 12 21:34:47  [75707.119439] Call Trace:
Apr 12 21:34:47  [75707.119471]  <NMI>
Apr 12 21:34:47   [<ffffffff812abdf3>] ? dump_stack+0x40/0x5d
Apr 12 21:34:47 [75707.119527] [<ffffffff810cf5f5>] ? watchdog_overflow_callback+0xb5/0xd0 Apr 12 21:34:47 [75707.119571] [<ffffffff810fcea2>] ? __perf_event_overflow+0x82/0x1c0 Apr 12 21:34:47 [75707.119614] [<ffffffff8101bc63>] ? intel_pmu_handle_irq+0x1c3/0x3e0 Apr 12 21:34:47 [75707.119657] [<ffffffff8113b5cb>] ? vunmap_page_range+0x1bb/0x320 Apr 12 21:34:47 [75707.119703] [<ffffffff813213e0>] ? ghes_copy_tofrom_phys+0x110/0x1d0 Apr 12 21:34:47 [75707.119758] [<ffffffff81014f53>] ? perf_event_nmi_handler+0x23/0x40 Apr 12 21:34:47 [75707.119800] [<ffffffff81007b85>] ? nmi_handle+0x65/0x100
Apr 12 21:34:47  [75707.119838]  [<ffffffff81007d2e>] ? do_nmi+0x10e/0x360
Apr 12 21:34:47 [75707.119878] [<ffffffff8148f957>] ? end_repeat_nmi+0x1a/0x1e Apr 12 21:34:47 [75707.119920] [<ffffffff810862ca>] ? queued_spin_lock_slowpath+0xea/0x150 Apr 12 21:34:47 [75707.119962] [<ffffffff810862ca>] ? queued_spin_lock_slowpath+0xea/0x150 Apr 12 21:34:47 [75707.120002] [<ffffffff810862ca>] ? queued_spin_lock_slowpath+0xea/0x150
Apr 12 21:34:47  [75707.120042]  <<EOE>>
Apr 12 21:34:47   [<ffffffffa01b413b>] ? make_request+0x60b/0xbd0 [raid456]
Apr 12 21:34:47  [75707.120113]  [<ffffffff810815c0>] ? wait_woken+0x80/0x80
Apr 12 21:34:47 [75707.120152] [<ffffffffa017632d>] ? md_make_request+0xdd/0x220 [md_mod] Apr 12 21:34:47 [75707.120195] [<ffffffff8128691d>] ? generic_make_request+0xed/0x1d0 Apr 12 21:34:47 [75707.120236] [<ffffffff81286a5a>] ? submit_bio+0x5a/0x140 Apr 12 21:34:47 [75707.120277] [<ffffffff8112afaf>] ? workingset_refault+0x4f/0xa0 Apr 12 21:34:47 [75707.120320] [<ffffffff811a215e>] ? mpage_bio_submit+0x1e/0x30 Apr 12 21:34:47 [75707.120359] [<ffffffff811a3076>] ? mpage_readpages+0x106/0x130 Apr 12 21:34:47 [75707.120401] [<ffffffff8121b510>] ? __xfs_get_blocks+0x750/0x750 Apr 12 21:34:47 [75707.120439] [<ffffffff8121b510>] ? __xfs_get_blocks+0x750/0x750 Apr 12 21:34:47 [75707.120481] [<ffffffff8114ad45>] ? alloc_pages_current+0x85/0x110 Apr 12 21:34:47 [75707.120523] [<ffffffff81111d25>] ? __do_page_cache_readahead+0x165/0x1f0
Apr 12 21:34:47  [75707.120564]  [<ffffffff811344f5>] ? vma_link+0x75/0xb0
Apr 12 21:34:47 [75707.120602] [<ffffffff811120c7>] ? force_page_cache_readahead+0x77/0xe0 Apr 12 21:34:47 [75707.120644] [<ffffffff8113f876>] ? madvise_willneed+0x76/0x140 Apr 12 21:34:47 [75707.120683] [<ffffffff811301ce>] ? handle_mm_fault+0x9ae/0x1650
Apr 12 21:34:47  [75707.120722]  [<ffffffff81133dcb>] ? find_vma+0x5b/0x70
Apr 12 21:34:47 [75707.120760] [<ffffffff8113fc52>] ? SyS_madvise+0x312/0x6f0 Apr 12 21:34:47 [75707.120799] [<ffffffff8148d9db>] ? entry_SYSCALL_64_fastpath+0x16/0x6e

Once this starts, a couple of minutes goes by and the machine locks up completely.

I have been unable to locate the problem here, anyone that can point me in the right direction?

Best regards
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux