Re: Few questions about (attempting to use) write journal + call traces

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed, May 22, 2019 at 6:18 AM Michal Soltys <soltys@xxxxxxxx> wrote:
>
> Hi,
>
> As I started experimenting with journaled raids:
>
> 1) can another (raid1) device or lvm mirror (using md underneath) be used formally as a write journal ?
> 2) for the testing purposes, are loopback devices expected to work ?
>
> I know I can successfully create both of the above (case scenario below), but any attempt to write to such md device ends with hung tasks and D states.
>
>
> kernel 5.1.3
> mdadm 4.1
> btrfs filesystem with /var subvolume (where losetup files were created with fallocate)
>
> What I did:
>
> - six 10gb files as backings for /dev/loop[0-5]
> - /dev/loop[4-5] - raid1
> - /dev/loop[0-3] - raid5
>
> Now at this point both raids work just fine, separately. The problem starts if I create raid1 as raid5's journal (doesn't matter whether w-t or w-b). Any attempt to write to a device like that instantly ends with D and respective traces in dmesg:

Hi Michal,

Yes, the journal _should_ work with another md, lvm, or loop device.
Could you please share the commands you used in this test?

Thanks,
Song

>
> May 22 14:39:37 hakai kernel: INFO: task dd:899 blocked for more than 245 seconds.
> May 22 14:39:37 hakai kernel:       Not tainted 5.1.3-arch1-1-ARCH #1
> May 22 14:39:37 hakai kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> May 22 14:39:37 hakai kernel: dd              D    0   899    824 0x00000004
> May 22 14:39:37 hakai kernel: Call Trace:
> May 22 14:39:37 hakai kernel:  ? __schedule+0x30b/0x8b0
> May 22 14:39:37 hakai kernel:  ? raid5_unplug+0xb2/0x140 [raid456]
> May 22 14:39:37 hakai kernel:  schedule+0x32/0x80
> May 22 14:39:37 hakai kernel:  io_schedule+0x12/0x40
> May 22 14:39:37 hakai kernel:  wait_on_page_bit+0x138/0x200
> May 22 14:39:37 hakai kernel:  ? add_to_page_cache_lru+0xc0/0xc0
> May 22 14:39:37 hakai kernel:  __filemap_fdatawait_range+0xb9/0x110
> May 22 14:39:37 hakai kernel:  filemap_fdatawait_range+0xe/0x20
> May 22 14:39:37 hakai kernel:  filemap_write_and_wait+0x47/0x70
> May 22 14:39:37 hakai kernel:  __blkdev_put+0x71/0x1e0
> May 22 14:39:37 hakai kernel:  blkdev_close+0x21/0x30
> May 22 14:39:37 hakai kernel:  __fput+0xa5/0x1d0
> May 22 14:39:37 hakai kernel:  task_work_run+0x8f/0xb0
> May 22 14:39:37 hakai kernel:  exit_to_usermode_loop+0xd3/0xe0
> May 22 14:39:37 hakai kernel:  do_syscall_64+0x157/0x180
> May 22 14:39:37 hakai kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xa9
> May 22 14:39:37 hakai kernel: RIP: 0033:0x7f13c4a66348
> May 22 14:39:37 hakai kernel: Code: Bad RIP value.
> May 22 14:39:37 hakai kernel: RSP: 002b:00007ffcde214058 EFLAGS: 00000246 ORIG_RAX: 0000000000000003
> May 22 14:39:37 hakai kernel: RAX: 0000000000000000 RBX: 0000562ed16d5160 RCX: 00007f13c4a66348
> May 22 14:39:37 hakai kernel: RDX: 0000000000080000 RSI: 0000000000000000 RDI: 0000000000000001
> May 22 14:39:37 hakai kernel: RBP: 0000000000000001 R08: 00000000ffffffff R09: 0000000000000000
> May 22 14:39:37 hakai kernel: R10: 0000000000000022 R11: 0000000000000246 R12: 0000000000000001
> May 22 14:39:37 hakai kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 00007f13c4301000
>
> May 22 14:39:37 hakai kernel: INFO: task systemd-udevd:313 blocked for more than 245 seconds.
> May 22 14:39:37 hakai kernel:       Not tainted 5.1.3-arch1-1-ARCH #1
> May 22 14:39:37 hakai kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> May 22 14:39:37 hakai kernel: systemd-udevd   D    0   313      1 0x00000104
> May 22 14:39:37 hakai kernel: Call Trace:
> May 22 14:39:37 hakai kernel:  ? __schedule+0x30b/0x8b0
> May 22 14:39:37 hakai kernel:  schedule+0x32/0x80
> May 22 14:39:37 hakai kernel:  schedule_preempt_disabled+0x14/0x20
> May 22 14:39:37 hakai kernel:  __mutex_lock.isra.1+0x217/0x520
> May 22 14:39:37 hakai kernel:  ? kobj_lookup+0xf1/0x160
> May 22 14:39:37 hakai kernel:  __blkdev_get+0x83/0x540
> May 22 14:39:37 hakai kernel:  blkdev_get+0x108/0x340
> May 22 14:39:37 hakai kernel:  ? preempt_count_add+0x79/0xb0
> May 22 14:39:37 hakai kernel:  ? _raw_spin_lock+0x13/0x30
> May 22 14:39:37 hakai kernel:  ? bd_acquire+0xc0/0xc0
> May 22 14:39:37 hakai kernel:  do_dentry_open+0x13a/0x380
> May 22 14:39:37 hakai kernel:  path_openat+0x2d6/0x1500
> May 22 14:39:37 hakai kernel:  ? mntput_no_expire+0x11/0x1a0
> May 22 14:39:37 hakai kernel:  ? terminate_walk+0xeb/0x100
> May 22 14:39:37 hakai kernel:  do_filp_open+0x93/0x100
> May 22 14:39:37 hakai kernel:  ? __check_object_size+0xc1/0x175
> May 22 14:39:37 hakai kernel:  ? _raw_spin_unlock+0x16/0x30
> May 22 14:39:37 hakai kernel:  do_sys_open+0x186/0x220
> May 22 14:39:37 hakai kernel:  do_syscall_64+0x5b/0x180
> May 22 14:39:37 hakai kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xa9
> May 22 14:39:37 hakai kernel: RIP: 0033:0x7ff24fe549ff
> May 22 14:39:37 hakai kernel: Code: Bad RIP value.
> May 22 14:39:37 hakai kernel: RSP: 002b:00007ffe7b7ce060 EFLAGS: 00000246 ORIG_RAX: 0000000000000101
> May 22 14:39:37 hakai kernel: RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007ff24fe549ff
> May 22 14:39:37 hakai kernel: RDX: 00000000000a0800 RSI: 00005578d289be10 RDI: 00000000ffffff9c
> May 22 14:39:37 hakai kernel: RBP: 00005578d28b3ea0 R08: 00007ff24fb896d0 R09: 00005578d2891fe0
> May 22 14:39:37 hakai kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
> May 22 14:39:37 hakai kernel: R13: 00007ffe7b7cf198 R14: 00007ffe7b7cf188 R15: 00005578d27e1254



[Index of Archives]     [Linux RAID Wiki]     [ATA RAID]     [Linux SCSI Target Infrastructure]     [Linux Block]     [Linux IDE]     [Linux SCSI]     [Linux Hams]     [Device Mapper]     [Device Mapper Cryptographics]     [Kernel]     [Linux Admin]     [Linux Net]     [GFS]     [RPM]     [git]     [Yosemite Forum]


  Powered by Linux