Re: 4.13.12: frequent AACraid crash with a drive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, Dec 08, 2017 at 02:47:36PM +0300, Andrey Jr. Melnikov wrote:
> In article <20171127152421.GA5171@xxxxxxxxxxx> you wrote:
> > On Mon, Nov 27, 2017 at 07:21:47AM -0800, Marc MERLIN wrote:
> > > Crash log:
> 
> > I should add that sometimes the reset works ok:
> > aacraid: Host adapter abort request.
> > aacraid: Outstanding commands on (10,1,0,0):
> > aacraid: Host adapter reset request. SCSI hang ?
> > aacraid 0000:02:00.0: outstanding cmd: midlevel-0
> > aacraid 0000:02:00.0: outstanding cmd: lowlevel-0
> > aacraid 0000:02:00.0: outstanding cmd: error handler-1
> > aacraid 0000:02:00.0: outstanding cmd: firmware-0
> > aacraid 0000:02:00.0: outstanding cmd: kernel-0
> > aacraid 0000:02:00.0: Controller reset type is 3
> > aacraid 0000:02:00.0: Issuing IOP reset
> > aacraid 0000:02:00.0: IOP reset succeded
> > aacraid: Comm Interface type2 enabled
> > aacraid 0000:02:00.0: Issuing bus rescan
> > sd 10:1:0:0: [sdd] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
> > sd 10:1:0:0: [sdd] tag#0 CDB: ATA command pass through(16) 85 06 20 00 d8 00 00 00 00 00 4f 00 c2 00 b0 00
> 
> > The drive in question seems ok when I test it directly, but it's hard to tell for sure.
> 
> Use 'arcconf GETLOGS 1 DEVICE' and 'arcconf GETLOGS 1 EVENT' to see what happens.
> And arcconf have 'hidden' command 'arcconf GETLOGS 1 UART' which fetch last 2Mb controller console logs. 
> May be it's help to understand what is wrong.

So, things hung today with 4.13 and the patch you suggested.
-       if (aac->thread->pid != current->pid) {
+       if (aac->aif_thread && aac->thread->pid != current->pid) {

arcconf GETLOGS 1 DEVICE never ran or never completed, I'm not very sure.
Eventually the kernel went into a hard lockup.

[  780.924146] aacraid 0000:02:00.0: outstanding cmd: error handler-1
[  780.942717] aacraid 0000:02:00.0: outstanding cmd: firmware-0
[  780.959965] aacraid 0000:02:00.0: outstanding cmd: kernel-0
[  780.976726] aacraid 0000:02:00.0: Controller reset type is 3
[  780.993741] aacraid 0000:02:00.0: Issuing IOP reset
[  810.364663] aacraid 0000:02:00.0: IOP reset succeded
[  810.403457] aacraid: Comm Interface type2 enabled
[  819.445691] aacraid 0000:02:00.0: Issuing bus rescan
[  829.663308] sd 10:1:2:0: [sde] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
[  829.688926] sd 10:1:2:0: [sde] tag#0 CDB: ATA command pass through(16) 85 06 20 00 d8 00 00 00 00 00 4f 00 c2 00 b0 00
[ 3849.880205] aacraid: Host adapter abort request.
[ 3849.880205] aacraid: Outstanding commands on (10,1,2,0):
[ 3849.910020] aacraid: Host adapter reset request. SCSI hang ?
[ 3849.927134] aacraid 0000:02:00.0: outstanding cmd: midlevel-0
[ 3849.944606] aacraid 0000:02:00.0: outstanding cmd: lowlevel-0
[ 3849.961941] aacraid 0000:02:00.0: outstanding cmd: error handler-1
[ 3849.980673] aacraid 0000:02:00.0: outstanding cmd: firmware-0
[ 3849.998174] aacraid 0000:02:00.0: outstanding cmd: kernel-0
[ 3850.015095] aacraid 0000:02:00.0: Controller reset type is 3
[ 3850.032219] aacraid 0000:02:00.0: Issuing IOP reset
[ 3878.373788] aacraid 0000:02:00.0: IOP reset succeded
[ 3878.413062] aacraid: Comm Interface type2 enabled
[ 3887.465850] aacraid 0000:02:00.0: Issuing bus rescan
[ 3897.609211] sd 10:1:2:0: [sde] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
[ 3897.634845] sd 10:1:2:0: [sde] tag#0 CDB: ATA command pass through(16) 85 06 20 00 d8 00 00 00 00 00 4f 00 c2 00 b0 00
[ 4763.539277] nf_conntrack: default automatic helper assignment has been turned off for security reasons and CT-based  firewall rule not found. Use the iptables CT target to attach helpers instead.
[ 5431.337215] perf: interrupt took too long (2504 > 2500), lowering kernel.perf_event_max_sample_rate to 79750
[ 6916.866506] aacraid: Host adapter abort request.
[ 6916.866506] aacraid: Outstanding commands on (10,1,2,0):
[ 6916.897314] aacraid: Host adapter reset request. SCSI hang ?
[ 6916.915264] aacraid 0000:02:00.0: outstanding cmd: midlevel-0
[ 6916.933419] aacraid 0000:02:00.0: outstanding cmd: lowlevel-0
[ 6916.951388] aacraid 0000:02:00.0: outstanding cmd: error handler-1
[ 6916.970849] aacraid 0000:02:00.0: outstanding cmd: firmware-0
[ 6916.988710] aacraid 0000:02:00.0: outstanding cmd: kernel-0
[ 6917.006110] aacraid 0000:02:00.0: Controller reset type is 3
[ 6917.023850] aacraid 0000:02:00.0: Issuing IOP reset
[ 6945.930659] aacraid 0000:02:00.0: IOP reset succeded
[ 6945.971388] aacraid: Comm Interface type2 enabled
[ 6955.022216] aacraid 0000:02:00.0: Issuing bus rescan
[ 6965.043384] sd 10:1:2:0: [sde] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
[ 6965.069561] sd 10:1:2:0: [sde] tag#0 CDB: ATA command pass through(16) 85 06 20 00 d8 00 00 00 00 00 4f 00 c2 00 b0 00
[ 7583.915855] aacraid: Host adapter abort request.
[ 7583.915855] aacraid: Outstanding commands on (10,1,2,0):
[ 7583.946916] aacraid: Host adapter reset request. SCSI hang ?
[ 7583.964678] aacraid 0000:02:00.0: outstanding cmd: midlevel-0
[ 7583.982846] aacraid 0000:02:00.0: outstanding cmd: lowlevel-0
[ 7584.000738] aacraid 0000:02:00.0: outstanding cmd: error handler-1
[ 7584.019938] aacraid 0000:02:00.0: outstanding cmd: firmware-0
[ 7584.037851] aacraid 0000:02:00.0: outstanding cmd: kernel-0
[ 7584.055265] aacraid 0000:02:00.0: Controller reset type is 3
[ 7584.072886] aacraid 0000:02:00.0: Issuing IOP reset
[ 7654.304833] aacraid 0000:02:00.0: IOP reset failed

Looks like getting EVENT hung things more?
gargamel:~# arcconf GETLOGS 1 EVENT
[ 7709.761618] aacraid: Host adapter abort request.
[ 7709.761618] aacraid: Outstanding commands on (10,1,2,0):
[ 7709.793114] aacraid: Host adapter reset request. SCSI hang ?
[ 7709.810694] aacraid 0000:02:00.0: Adapter health - -3
[ 7709.826449] aacraid 0000:02:00.0: outstanding cmd: midlevel-0
[ 7709.844273] aacraid 0000:02:00.0: outstanding cmd: lowlevel-0
[ 7709.862112] aacraid 0000:02:00.0: outstanding cmd: error handler-1
[ 7709.881215] aacraid 0000:02:00.0: outstanding cmd: firmware-0
[ 7709.899024] aacraid 0000:02:00.0: outstanding cmd: kernel-5
[ 7709.916314] aacraid 0000:02:00.0: Controller reset type is 3
[ 7709.933868] aacraid 0000:02:00.0: Issuing IOP reset
[ 7709.949074] BUG: scheduling while atomic: scsi_eh_10/402/0x00000002
[ 7709.968433] Modules linked in: veth ip6table_filter ip6_tables ebtable_nat ebtables ppdev lp xt_addrtype br_netfilter bridge stp llc tun autofs4 softdog binfmt_misc ftdi_sio nfsd auth_rpcgss nfs_acl nfs lockd grace fscache sunrpc ipt_REJECT nf_reject_ipv4 xt_conntrack xt_mark xt_nat xt_tcpudp iptable_mangle iptable_filter lm85 hwmon_vid pl2303 dm_snapshot dm_bufio iptable_nat ip_tables nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_conntrack_ftp ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_nat nf_conntrack x_tables sg st snd_pcm_oss snd_mixer_oss bcache kvm_intel kvm irqbypass snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_cmipci snd_hda_codec snd_mpu401_uart eeepc_wmi snd_opl3_lib asus_wmi snd_hda_core snd_rawmidi sparse_keymap snd_seq_device snd_hwdep rfkill snd_pcm asix hwmon snd_timer
[ 7710.185358]  usbnet rc_ati_x10 tpm_infineon wmi_bmof tpm_tis ati_remote parport_pc tpm_tis_core snd rc_core libphy i915 i2c_i801 usbserial mei_me parport pcspkr tpm soundcore wmi battery lpc_ich evdev input_leds e1000e ptp pps_core fuse raid456 multipath mmc_block mmc_core lrw ablk_helper dm_crypt dm_mod async_raid6_recov async_pq async_xor async_memcpy async_tx crc32c_intel blowfish_x86_64 blowfish_common pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd xhci_pci ehci_pci xhci_hcd ehci_hcd r8169 aacraid sata_sil24 mii usbcore fan thermal [last unloaded: ftdi_sio]
[ 7710.340683] CPU: 5 PID: 402 Comm: scsi_eh_10 Tainted: G     U          4.13.12-amd64-acraid-sysrq-20171018 #3
[ 7710.371133] Hardware name: System manufacturer System Product Name/P8H67-M PRO, BIOS 3904 04/27/2013
[ 7710.399228] Call Trace:
[ 7710.407307]  dump_stack+0x61/0x7d
[ 7710.418024]  __schedule_bug+0x5c/0x6a
[ 7710.429813]  __schedule+0x70/0x5c5
[ 7710.440796]  ? __mod_timer+0x1be/0x1e4
[ 7710.452783]  schedule+0x89/0x9a
[ 7710.462951]  schedule_timeout+0xc1/0x101
[ 7710.475454]  ? call_timer_fn+0x140/0x140
[ 7710.487977]  schedule_timeout_uninterruptible+0x1f/0x21
[ 7710.504441]  ? schedule_timeout_uninterruptible+0x1f/0x21
[ 7710.521319]  msleep+0x1b/0x22
[ 7710.530940]  src_sync_cmd+0x13f/0x22f [aacraid]
[ 7710.545277]  ? aac_nark_init+0x26/0x26 [aacraid]
[ 7710.559854]  aac_src_restart_adapter+0x10f/0x29d [aacraid]
[ 7710.577022]  aac_reset_adapter+0x1b3/0x709 [aacraid]
[ 7710.592646]  aac_eh_reset+0x396/0x3aa [aacraid]
[ 7710.606966]  scsi_try_host_reset+0x5d/0xb1
[ 7710.620021]  scsi_send_eh_cmnd+0x296/0x2dc
[ 7710.633067]  ? __call_rcu.constprop.44+0x10d/0x188
[ 7710.648208]  ? schedule_timeout+0xca/0x101
[ 7710.661220]  scsi_eh_try_stu+0x53/0x7a
[ 7710.673209]  scsi_eh_test_devices+0xcd/0x16e
[ 7710.686765]  scsi_eh_ready_devs+0x824/0x8c4
[ 7710.700118]  scsi_error_handler+0x291/0x523
[ 7710.713404]  ? __schedule+0x4f5/0x5c5
[ 7710.725119]  ? scsi_eh_get_sense+0x1a9/0x1a9
[ 7710.738643]  kthread+0xfb/0x100
[ 7710.748797]  ? init_completion+0x24/0x24
[ 7710.761327]  ? do_fast_syscall_32+0xb7/0xfe
[ 7710.774624]  ret_from_fork+0x25/0x30


I never got my shell back after the first arcconf command, and then got this:
[ 7734.334018] INFO: task nfsd:9608 blocked for more than 120 seconds.
[ 7734.354311]       Tainted: G     U  W       4.13.12-amd64-acraid-sysrq-20171018 #3
[ 7734.378095] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 7734.402398] nfsd            D    0  9608      2 0x00000080
[ 7734.420088] Call Trace:
[ 7734.428262]  __schedule+0x4bb/0x5c5
[ 7734.439559]  schedule+0x89/0x9a
[ 7734.449732]  io_schedule+0x16/0x38
[ 7734.460653]  wait_on_page_bit_common+0x112/0x19b
[ 7734.475177]  ? file_check_and_advance_wb_err+0xc5/0xc5
[ 7734.491257]  wait_on_page_bit+0x3b/0x3d
[ 7734.503394]  read_extent_buffer_pages+0x1c4/0x224
[ 7734.518132]  ? free_root_pointers+0x64/0x64
[ 7734.531253]  btree_read_extent_buffer_pages+0x5d/0xef
[ 7734.547009]  read_tree_block+0x33/0x4e
[ 7734.558856]  read_block_for_search.isra.20+0x211/0x264
[ 7734.574820]  btrfs_search_slot+0x50a/0x70d
[ 7734.587690]  btrfs_lookup_csum+0x52/0xf7
[ 7734.600008]  __btrfs_lookup_bio_sums+0x20f/0x449
[ 7734.614430]  ? kmem_cache_alloc+0xcc/0x143
[ 7734.627263]  btrfs_lookup_bio_sums+0x16/0x18
[ 7734.640619]  btrfs_submit_bio_hook+0xce/0x149
[ 7734.654232]  submit_one_bio+0x5e/0x84
[ 7734.665797]  submit_extent_page+0xa1/0x1c9
[ 7734.678717]  __do_readpage+0x51a/0x5ac
[ 7734.690521]  ? btrfs_create_repair_bio+0xb9/0xb9
[ 7734.704932]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7734.724283]  __do_contiguous_readpages.constprop.29+0xc8/0xec
[ 7734.742094]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7734.761467]  __extent_readpages.constprop.28+0xea/0xfd
[ 7734.777419]  ? lru_cache_add+0xe/0x10

[ 7734.788962]  extent_readpages+0x139/0x1eb
[ 7734.801544]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7734.820908]  btrfs_readpages+0x1f/0x21
[ 7734.832693]  __do_page_cache_readahead+0x1af/0x27b
[ 7734.847610]  ? wb_congested+0x16/0x26 [bcache]
[ 7734.861514]  ondemand_readahead+0x21e/0x22f
[ 7734.874634]  ? ondemand_readahead+0x21e/0x22f
[ 7734.888255]  page_cache_async_readahead+0x63/0x6c
[ 7734.902935]  generic_file_read_iter+0x225/0x552
[ 7734.917086]  generic_file_splice_read+0xd1/0x140
[ 7734.931509]  do_splice_to+0x7d/0x87
[ 7734.942544]  splice_direct_to_actor+0xcc/0x1bd
[ 7734.956807]  ? commit_metadata+0x44/0x44 [nfsd]
[ 7734.971006]  nfsd_splice_read+0x68/0x92 [nfsd]
[ 7734.984884]  nfsd_read+0x117/0x260 [nfsd]
[ 7734.997459]  nfsd3_proc_read+0xc8/0x113 [nfsd]
[ 7735.011330]  nfsd_dispatch+0xfc/0x1d8 [nfsd]
[ 7735.024690]  svc_process_common+0x2e7/0x4f7 [sunrpc]
[ 7735.040137]  ? nfsd_destroy+0x68/0x68 [nfsd]
[ 7735.053502]  svc_process+0xdb/0x115 [sunrpc]
[ 7735.066852]  nfsd+0x100/0x157 [nfsd]
[ 7735.078413]  kthread+0xfb/0x100
[ 7735.088373]  ? init_completion+0x24/0x24
[ 7735.100667]  ret_from_fork+0x25/0x30
[ 7735.111925] INFO: task nfsd:9610 blocked for more than 120 seconds.
[ 7735.131252]       Tainted: G     U  W       4.13.12-amd64-acraid-sysrq-20171018 #3
[ 7735.154754] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 7735.178759] nfsd            D    0  9610      2 0x00000080
[ 7735.195779] Call Trace:
[ 7735.203672]  __schedule+0x4bb/0x5c5
[ 7735.214765]  schedule+0x89/0x9a
[ 7735.224727]  io_schedule+0x16/0x38
[ 7735.235499]  wait_on_page_bit_common+0x112/0x19b
[ 7735.249901]  ? file_check_and_advance_wb_err+0xc5/0xc5
[ 7735.265859]  __lock_page+0x40/0x42
[ 7735.276666]  lock_page+0x19/0x1c
[ 7735.286899]  read_extent_buffer_pages+0x90/0x224
[ 7735.301320]  ? free_root_pointers+0x64/0x64
[ 7735.314403]  btree_read_extent_buffer_pages+0x5d/0xef
[ 7735.330120]  btrfs_read_buffer+0x33/0x35
[ 7735.342764]  read_block_for_search.isra.20+0xa6/0x264
[ 7735.358783]  ? _raw_read_unlock+0xe/0x1e
[ 7735.371138]  btrfs_search_slot+0x50a/0x70d
[ 7735.383975]  btrfs_lookup_csum+0x52/0xf7
[ 7735.396267]  __btrfs_lookup_bio_sums+0x20f/0x449
[ 7735.410988]  ? kmem_cache_alloc+0xcc/0x143
[ 7735.423839]  btrfs_lookup_bio_sums+0x16/0x18
[ 7735.437195]  btrfs_submit_bio_hook+0xce/0x149
[ 7735.450803]  submit_one_bio+0x5e/0x84
[ 7735.462341]  submit_extent_page+0xa1/0x1c9
[ 7735.475436]  __do_readpage+0x51a/0x5ac
[ 7735.487227]  ? btrfs_create_repair_bio+0xb9/0xb9
[ 7735.501597]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7735.520894]  __do_contiguous_readpages.constprop.29+0xc8/0xec
[ 7735.538671]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7735.558299]  __extent_readpages.constprop.28+0xea/0xfd
[ 7735.574267]  ? lru_cache_add+0xe/0x10
[ 7735.585781]  extent_readpages+0x139/0x1eb
[ 7735.598371]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7735.617726]  btrfs_readpages+0x1f/0x21
[ 7735.629542]  __do_page_cache_readahead+0x1af/0x27b
[ 7735.644462]  ondemand_readahead+0x21e/0x22f
[ 7735.657572]  ? ondemand_readahead+0x21e/0x22f
[ 7735.671469]  page_cache_sync_readahead+0x3d/0x3f
[ 7735.685870]  generic_file_read_iter+0x1e9/0x552
[ 7735.700029]  generic_file_splice_read+0xd1/0x140
[ 7735.714496]  do_splice_to+0x7d/0x87
[ 7735.725527]  splice_direct_to_actor+0xcc/0x1bd
[ 7735.739427]  ? commit_metadata+0x44/0x44 [nfsd]
[ 7735.753910]  nfsd_splice_read+0x68/0x92 [nfsd]
[ 7735.768024]  nfsd_read+0x117/0x260 [nfsd]
[ 7735.780650]  nfsd3_proc_read+0xc8/0x113 [nfsd]
[ 7735.794556]  nfsd_dispatch+0xfc/0x1d8 [nfsd]
[ 7735.807934]  svc_process_common+0x2e7/0x4f7 [sunrpc]
[ 7735.823372]  ? nfsd_destroy+0x68/0x68 [nfsd]
[ 7735.836719]  svc_process+0xdb/0x115 [sunrpc]
[ 7735.850052]  nfsd+0x100/0x157 [nfsd]
[ 7735.861318]  kthread+0xfb/0x100
[ 7735.871543]  ? init_completion+0x24/0x24
[ 7735.884112]  ret_from_fork+0x25/0x30
[ 7735.895356] INFO: task nfsd:9611 blocked for more than 120 seconds.
[ 7735.914651]       Tainted: G     U  W       4.13.12-amd64-acraid-sysrq-20171018 #3
[ 7735.937899] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 7735.961908] nfsd            D    0  9611      2 0x00000080
[ 7735.978909] Call Trace:
[ 7735.986809]  __schedule+0x4bb/0x5c5
[ 7735.997799]  schedule+0x89/0x9a
[ 7736.007766]  io_schedule+0x16/0x38
[ 7736.018497]  wait_on_page_bit_common+0x112/0x19b
[ 7736.032871]  ? file_check_and_advance_wb_err+0xc5/0xc5
[ 7736.048829]  wait_on_page_bit+0x3b/0x3d
[ 7736.060899]  read_extent_buffer_pages+0x1c4/0x224
[ 7736.075588]  ? free_root_pointers+0x64/0x64
[ 7736.088698]  btree_read_extent_buffer_pages+0x5d/0xef
[ 7736.104393]  read_tree_block+0x33/0x4e
[ 7736.116248]  read_block_for_search.isra.20+0x211/0x264
[ 7736.132516]  btrfs_search_slot+0x50a/0x70d
[ 7736.145685]  btrfs_lookup_csum+0x52/0xf7
[ 7736.158020]  ? free_extent_buffer+0x78/0x7b
[ 7736.171108]  __btrfs_lookup_bio_sums+0x20f/0x449
[ 7736.185550]  btrfs_lookup_bio_sums+0x16/0x18
[ 7736.198979]  btrfs_submit_bio_hook+0xce/0x149
[ 7736.212619]  submit_one_bio+0x5e/0x84
[ 7736.224398]  submit_extent_page+0xa1/0x1c9
[ 7736.237205]  __do_readpage+0x51a/0x5ac
[ 7736.248978]  ? btrfs_create_repair_bio+0xb9/0xb9
[ 7736.263411]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7736.283011]  __do_contiguous_readpages.constprop.29+0xc8/0xec
[ 7736.301074]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7736.320434]  __extent_readpages.constprop.28+0xea/0xfd
[ 7736.336394]  ? lru_cache_add+0xe/0x10
[ 7736.347932]  extent_readpages+0x139/0x1eb
[ 7736.360487]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7736.379853]  btrfs_readpages+0x1f/0x21
[ 7736.391656]  __do_page_cache_readahead+0x1af/0x27b
[ 7736.406679]  ? wb_congested+0x16/0x26 [bcache]
[ 7736.420555]  ondemand_readahead+0x21e/0x22f
[ 7736.433650]  ? ondemand_readahead+0x21e/0x22f
[ 7736.447580]  page_cache_async_readahead+0x63/0x6c
[ 7736.462543]  generic_file_read_iter+0x225/0x552
[ 7736.476703]  generic_file_splice_read+0xd1/0x140
[ 7736.491081]  do_splice_to+0x7d/0x87
[ 7736.502103]  splice_direct_to_actor+0xcc/0x1bd
[ 7736.515984]  ? commit_metadata+0x44/0x44 [nfsd]
[ 7736.530155]  nfsd_splice_read+0x68/0x92 [nfsd]
[ 7736.544032]  nfsd_read+0x117/0x260 [nfsd]
[ 7736.556613]  nfsd3_proc_read+0xc8/0x113 [nfsd]
[ 7736.570521]  nfsd_dispatch+0xfc/0x1d8 [nfsd]
[ 7736.583905]  svc_process_common+0x2e7/0x4f7 [sunrpc]
[ 7736.599369]  ? nfsd_destroy+0x68/0x68 [nfsd]
[ 7736.612992]  svc_process+0xdb/0x115 [sunrpc]
[ 7736.626614]  nfsd+0x100/0x157 [nfsd]
[ 7736.637834]  kthread+0xfb/0x100
[ 7736.647876]  ? init_completion+0x24/0x24
[ 7736.660154]  ret_from_fork+0x25/0x30
[ 7736.671386] INFO: task nfsd:9613 blocked for more than 120 seconds.
[ 7736.690702]       Tainted: G     U  W       4.13.12-amd64-acraid-sysrq-20171018 #3
[ 7736.714248] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 7736.738295] nfsd            D    0  9613      2 0x00000080
[ 7736.755304] Call Trace:
[ 7736.763502]  __schedule+0x4bb/0x5c5
[ 7736.774792]  schedule+0x89/0x9a
[ 7736.784760]  io_schedule+0x16/0x38
[ 7736.795520]  wait_on_page_bit_common+0x112/0x19b
[ 7736.809917]  ? file_check_and_advance_wb_err+0xc5/0xc5
[ 7736.825853]  wait_on_page_bit+0x3b/0x3d
[ 7736.837887]  read_extent_buffer_pages+0x1c4/0x224
[ 7736.852834]  ? free_root_pointers+0x64/0x64
[ 7736.865934]  btree_read_extent_buffer_pages+0x5d/0xef
[ 7736.881692]  read_tree_block+0x33/0x4e
[ 7736.893508]  read_block_for_search.isra.20+0x211/0x264
[ 7736.909697]  btrfs_search_slot+0x50a/0x70d
[ 7736.922872]  btrfs_lookup_csum+0x52/0xf7
[ 7736.935519]  __btrfs_lookup_bio_sums+0x20f/0x449
[ 7736.949927]  ? kmem_cache_alloc+0xcc/0x143
[ 7736.962790]  btrfs_lookup_bio_sums+0x16/0x18
[ 7736.976152]  btrfs_submit_bio_hook+0xce/0x149
[ 7736.989845]  submit_one_bio+0x5e/0x84
[ 7737.001621]  submit_extent_page+0xa1/0x1c9
[ 7737.014484]  __do_readpage+0x51a/0x5ac
[ 7737.026276]  ? btrfs_create_repair_bio+0xb9/0xb9
[ 7737.040652]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7737.060002]  __do_contiguous_readpages.constprop.29+0xc8/0xec
[ 7737.077807]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7737.097455]  __extent_readpages.constprop.28+0xea/0xfd
[ 7737.113718]  ? lru_cache_add+0xe/0x10
[ 7737.125258]  extent_readpages+0x139/0x1eb
[ 7737.138117]  ? insert_reserved_file_extent.constprop.38+0x29b/0x29b
[ 7737.157756]  btrfs_readpages+0x1f/0x21
[ 7737.169587]  __do_page_cache_readahead+0x1af/0x27b
[ 7737.184789]  ondemand_readahead+0x21e/0x22f
[ 7737.197914]  ? ondemand_readahead+0x21e/0x22f
[ 7737.211567]  page_cache_sync_readahead+0x3d/0x3f
[ 7737.225997]  generic_file_read_iter+0x1e9/0x552
[ 7737.240149]  generic_file_splice_read+0xd1/0x140
[ 7737.254819]  do_splice_to+0x7d/0x87
[ 7737.266141]  splice_direct_to_actor+0xcc/0x1bd
[ 7737.280279]  ? commit_metadata+0x44/0x44 [nfsd]
[ 7737.294739]  nfsd_splice_read+0x68/0x92 [nfsd]
[ 7737.308895]  nfsd_read+0x117/0x260 [nfsd]
[ 7737.321499]  nfsd3_proc_read+0xc8/0x113 [nfsd]
[ 7737.335417]  nfsd_dispatch+0xfc/0x1d8 [nfsd]
[ 7737.348813]  svc_process_common+0x2e7/0x4f7 [sunrpc]
[ 7737.364279]  ? nfsd_destroy+0x68/0x68 [nfsd]
[ 7737.377661]  svc_process+0xdb/0x115 [sunrpc]
[ 7737.391048]  nfsd+0x100/0x157 [nfsd]
[ 7737.402341]  kthread+0xfb/0x100
[ 7737.412309]  ? init_completion+0x24/0x24
[ 7737.424833]  ret_from_fork+0x25/0x30                                                                      
[ 7785.122658] aacraid 0000:02:00.0: IOP reset failed
[ 7785.137536] aacraid 0000:02:00.0: ARC Reset attempt failed
[ 7807.293052] NMI watchdog: Watchdog detected hard LOCKUP on cpu 4
[ 7807.293054] Modules linked in: veth ip6table_filter ip6_tables ebtable_nat ebtables ppdev lp xt_addrtype br_netfilter bridge stp llc tun autofs4 softdog binfmt_misc ftdi_sio nfsd auth_rpcgss nfs_acl nfs lockd grace fscache sunrpc ipt_REJECT nf_reject_ipv4 xt_conntrack xt_mark xt_nat xt_tcpudp iptable_mangle iptable_filter lm85 hwmon_vid pl2303 dm_snapshot dm_bufio iptable_nat ip_tables nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_conntrack_ftp ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_nat nf_conntrack x_tables sg st snd_pcm_oss snd_mixer_oss bcache kvm_intel kvm irqbypass snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_cmipci snd_hda_codec snd_mpu401_uart eeepc_wmi snd_opl3_lib asus_wmi snd_hda_core snd_rawmidi sparse_keymap snd_seq_device snd_hwdep rfkill snd_pcm asix hwmon snd_timer
[ 7807.293072]  usbnet rc_ati_x10 tpm_infineon wmi_bmof tpm_tis ati_remote parport_pc tpm_tis_core snd rc_core libphy i915 i2c_i801 usbserial mei_me parport pcspkr tpm soundcore wmi battery lpc_ich evdev input_leds e1000e ptp pps_core fuse raid456 multipath mmc_block mmc_core lrw ablk_helper dm_crypt dm_mod async_raid6_recov async_pq async_xor async_memcpy async_tx crc32c_intel blowfish_x86_64 blowfish_common pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd xhci_pci ehci_pci xhci_hcd ehci_hcd r8169 aacraid sata_sil24 mii usbcore fan thermal [last unloaded: ftdi_sio]
[ 7807.293089] CPU: 4 PID: 402 Comm: scsi_eh_10 Tainted: G     U  W       4.13.12-amd64-acraid-sysrq-20171018 #3
[ 7807.293089] Hardware name: System manufacturer System Product Name/P8H67-M PRO, BIOS 3904 04/27/2013
[ 7807.293090] task: ffff9cbdf4f441c0 task.stack: ffffb22043870000
[ 7807.293093] RIP: 0010:queued_spin_lock_slowpath+0x76/0x18c
[ 7807.293093] RSP: 0018:ffffb22043873b28 EFLAGS: 00000002
[ 7807.293094] RAX: 0000000000000101 RBX: 0000000000000246 RCX: 0000000000000101
[ 7807.293094] RDX: 0000000000000100 RSI: 0000000000000001 RDI: ffff9cbdf9f88030
[ 7807.293095] RBP: ffffb22043873b28 R08: 0000000000000001 R09: 0000000000000000
[ 7807.293095] R10: ffffb22043873a80 R11: ffffffffba899a20 R12: ffff9cbdf9f88000
[ 7807.293095] R13: 0000000000000000 R14: ffff9cbdf9f88000 R15: 0000000000000003
[ 7807.293096] FS:  0000000000000000(0000) GS:ffff9cbe1e300000(0000) knlGS:0000000000000000
[ 7807.293097] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 7807.293097] CR2: 00000000dfb60000 CR3: 00000007aaf7d000 CR4: 00000000001406e0
[ 7807.293098] Call Trace:
[ 7807.293101]  _raw_spin_lock_irqsave+0x3d/0x44
[ 7807.293103]  __scsi_iterate_devices+0x2c/0x77
[ 7807.293105]  scsi_run_host_queues+0x2e/0x37
[ 7807.293106]  scsi_unblock_requests+0x15/0x17
[ 7807.293110]  aac_reset_adapter+0x5aa/0x709 [aacraid]
[ 7807.293112]  aac_eh_reset+0x396/0x3aa [aacraid]
[ 7807.293113]  scsi_try_host_reset+0x5d/0xb1
[ 7807.293114]  scsi_send_eh_cmnd+0x296/0x2dc
[ 7807.293117]  ? __call_rcu.constprop.44+0x10d/0x188
[ 7807.293119]  ? schedule_timeout+0xca/0x101
[ 7807.293120]  scsi_eh_try_stu+0x53/0x7a
[ 7807.293121]  scsi_eh_test_devices+0xcd/0x16e
[ 7807.293122]  scsi_eh_ready_devs+0x824/0x8c4
[ 7807.293123]  scsi_error_handler+0x291/0x523
[ 7807.293124]  ? __schedule+0x4f5/0x5c5
[ 7807.293126]  ? scsi_eh_get_sense+0x1a9/0x1a9
[ 7807.293126]  kthread+0xfb/0x100
[ 7807.293127]  ? init_completion+0x24/0x24
[ 7807.293129]  ? do_fast_syscall_32+0xb7/0xfe
[ 7807.293130]  ret_from_fork+0x25/0x30
[ 7807.293131] Code: 00 00 eb 02 89 c6 f7 c6 00 ff ff ff 75 2f 83 fe 01 89 ca 89 f0 41 0f 45 d0 f0 0f b1 17 39 c6 75 e3 ff ca 0f 84 16 01 00 00 8b 07 <84> c0 74 04 f3 90 eb f6 66 c7 07 01 00 e9 02 01 00 00 48 c7 c2
[ 7807.293141] Kernel panic - not syncing: Hard LOCKUP
[ 7807.293142] CPU: 4 PID: 402 Comm: scsi_eh_10 Tainted: G     U  W       4.13.12-amd64-acraid-sysrq-20171018 #3
[ 7807.293142] Hardware name: System manufacturer System Product Name/P8H67-M PRO, BIOS 3904 04/27/2013
[ 7807.293142] Call Trace:
[ 7807.293143]  <NMI>
[ 7807.293144]  dump_stack+0x61/0x7d
[ 7807.293147]  panic+0xe7/0x235
[ 7807.293148]  ? 0xffffffffc0199000
[ 7807.293149]  nmi_panic+0x2c/0x37
[ 7807.293151]  watchdog_overflow_callback+0x104/0x119
[ 7807.293153]  __perf_event_overflow+0x78/0xbd
[ 7807.293154]  perf_event_overflow+0x14/0x16
[ 7807.293155]  intel_pmu_handle_irq+0x353/0x405
[ 7807.293158]  ? cycles_2_ns+0x52/0x72
[ 7807.293160]  perf_event_nmi_handler+0x2c/0x47
[ 7807.293161]  ? paravirt_sched_clock+0x9/0xd
[ 7807.293161]  ? sched_clock+0x9/0xb
[ 7807.293163]  ? perf_event_nmi_handler+0x2c/0x47
[ 7807.293165]  nmi_handle+0x55/0x119
[ 7807.293166]  default_do_nmi+0x4b/0xf5
[ 7807.293167]  do_nmi+0x9b/0x115
[ 7807.293168]  end_repeat_nmi+0x1a/0x1e
[ 7807.293169] RIP: 0010:queued_spin_lock_slowpath+0x76/0x18c
[ 7807.293169] RSP: 0018:ffffb22043873b28 EFLAGS: 00000002
[ 7807.293170] RAX: 0000000000000101 RBX: 0000000000000246 RCX: 0000000000000101
[ 7807.293170] RDX: 0000000000000100 RSI: 0000000000000001 RDI: ffff9cbdf9f88030
[ 7807.293171] RBP: ffffb22043873b28 R08: 0000000000000001 R09: 0000000000000000
[ 7807.293171] R10: ffffb22043873a80 R11: ffffffffba899a20 R12: ffff9cbdf9f88000
[ 7807.293171] R13: 0000000000000000 R14: ffff9cbdf9f88000 R15: 0000000000000003
[ 7807.293173]  ? queued_spin_lock_slowpath+0x76/0x18c
[ 7807.293174]  ? queued_spin_lock_slowpath+0x76/0x18c
[ 7807.293174]  </NMI>
[ 7807.293175]  _raw_spin_lock_irqsave+0x3d/0x44
[ 7807.293176]  __scsi_iterate_devices+0x2c/0x77
[ 7807.293177]  scsi_run_host_queues+0x2e/0x37
[ 7807.293178]  scsi_unblock_requests+0x15/0x17
[ 7807.293180]  aac_reset_adapter+0x5aa/0x709 [aacraid]
[ 7807.293182]  aac_eh_reset+0x396/0x3aa [aacraid]
[ 7807.293184]  scsi_try_host_reset+0x5d/0xb1
[ 7807.293185]  scsi_send_eh_cmnd+0x296/0x2dc
[ 7807.293186]  ? __call_rcu.constprop.44+0x10d/0x188
[ 7807.293187]  ? schedule_timeout+0xca/0x101
[ 7807.293188]  scsi_eh_try_stu+0x53/0x7a
[ 7807.293189]  scsi_eh_test_devices+0xcd/0x16e
[ 7807.293190]  scsi_eh_ready_devs+0x824/0x8c4
[ 7807.293191]  scsi_error_handler+0x291/0x523
[ 7807.293192]  ? __schedule+0x4f5/0x5c5
[ 7807.293193]  ? scsi_eh_get_sense+0x1a9/0x1a9
[ 7807.293194]  kthread+0xfb/0x100
[ 7807.293195]  ? init_completion+0x24/0x24
[ 7807.293196]  ? do_fast_syscall_32+0xb7/0xfe
[ 7807.293196]  ret_from_fork+0x25/0x30
[ 7807.293240] Kernel Offset: 0x39000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)

-- 
"A mouse is a device used to point at the xterm you want to type in" - A.S.R.
Microsoft is to operating systems ....
                                      .... what McDonalds is to gourmet cooking
Home page: http://marc.merlins.org/                         | PGP 1024R/763BE901



[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Index of Archives]     [SCSI Target Devel]     [Linux SCSI Target Infrastructure]     [Kernel Newbies]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Linux IIO]     [Samba]     [Device Mapper]

  Powered by Linux