Re: xfsaild in D state seems to be blocking all other i/o sporadically

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Brian!

On 2017-04-19 20:08, Brian Foster wrote:
> On Wed, Apr 19, 2017 at 06:36:00PM +0200, Michael Weissenbacher wrote:
>> On 19.04.2017 16:04, Carlos Maiolino wrote:
>>> If I am not blind, the task currently owning the cpu isn't listed there, which I
>>> suppose to be already running.
>> Correct, the only process utilizing the CPU is xfsaild. It is also the
>> only process doing I/O, and it looks like it's doing only reads. It
>> (xfsaild) never shows up in dmesg as blocked.
>>
> 
> A snippet of tracepoint output from the fs might also be interesting
> when you hit this state. E.g., 'trace-cmd start -e xfs:*' and save the
> output of /sys/kernel/debug/tracing/trace_pipe to a file for a few
> seconds or so.
> 
See:
https://we.tl/7AAyBZxt4R

> BTW, is the original /proc/meminfo output you posted from the system in
> a normal state or when the problem occurs? If the former, could you
> include it while the problem occurs as well?
It was from a normal state.

All the following was captured during the problem occurs.

*** snip ***
/proc/meminfo

Wed Apr 19 20:36:50 CEST 2017
MemTotal:       65886208 kB
MemFree:         2923552 kB
MemAvailable:   43018524 kB
Buffers:              52 kB
Cached:         32416864 kB
SwapCached:          580 kB
Active:         30685760 kB
Inactive:        2051532 kB
Active(anon):     165844 kB
Inactive(anon):   155516 kB
Active(file):   30519916 kB
Inactive(file):  1896016 kB
Unevictable:           0 kB
Mlocked:               0 kB
SwapTotal:      64511996 kB
SwapFree:       64506540 kB
Dirty:            202368 kB
Writeback:             0 kB
AnonPages:        319948 kB
Mapped:            18672 kB
Shmem:               868 kB
Slab:           22817492 kB
SReclaimable:   16332540 kB
SUnreclaim:      6484952 kB
KernelStack:        4048 kB
PageTables:        16516 kB
NFS_Unstable:          0 kB
Bounce:                0 kB
WritebackTmp:          0 kB
CommitLimit:    97455100 kB
Committed_AS:     516532 kB
VmallocTotal:   34359738367 kB
VmallocUsed:           0 kB
VmallocChunk:          0 kB
HardwareCorrupted:     0 kB
AnonHugePages:         0 kB
ShmemHugePages:        0 kB
ShmemPmdMapped:        0 kB
HugePages_Total:       0
HugePages_Free:        0
HugePages_Rsvd:        0
HugePages_Surp:        0
Hugepagesize:       2048 kB
DirectMap4k:       88820 kB
DirectMap2M:     7155712 kB
DirectMap1G:    61865984 kB
*** snip ***

*** snip ***
Wed Apr 19 20:36:51 CEST 2017
slabinfo - version: 2.1
# name            <active_objs> <num_objs> <objsize> <objperslab>
<pagesperslab> : tunables <limit> <batchcount> <sharedfactor> : slabdata
<active_slabs> <num_slabs> <sharedavail>
kcopyd_job             0      0   3240    2    2 : tunables   24   12
8 : slabdata      0      0      0
dm_uevent              0      0   2632    3    2 : tunables   24   12
8 : slabdata      0      0      0
t10_pr_reg_cache       0      0    704   11    2 : tunables   54   27
8 : slabdata      0      0      0
ext4_groupinfo_4k      8     28    144   28    1 : tunables  120   60
8 : slabdata      1      1      0
ext4_inode_cache     328    330   1072    3    1 : tunables   24   12
8 : slabdata    110    110      0
ext4_allocation_context      0      0    128   32    1 : tunables  120
60    8 : slabdata      0      0      0
ext4_io_end            0      0     64   64    1 : tunables  120   60
8 : slabdata      0      0      0
ext4_extent_status     36    198     40   99    1 : tunables  120   60
 8 : slabdata      2      2      0
jbd2_journal_head      0      0    112   36    1 : tunables  120   60
8 : slabdata      0      0      0
jbd2_revoke_table_s      0      0     16  240    1 : tunables  120   60
  8 : slabdata      0      0      0
jbd2_revoke_record_s      0      0     32  124    1 : tunables  120   60
   8 : slabdata      0      0      0
fscrypt_info           0      0     24  163    1 : tunables  120   60
8 : slabdata      0      0      0
fscrypt_ctx            0      0     48   83    1 : tunables  120   60
8 : slabdata      0      0      0
mbcache                0      0     56   71    1 : tunables  120   60
8 : slabdata      0      0      0
kvm_async_pf           0      0    136   30    1 : tunables  120   60
8 : slabdata      0      0      0
kvm_vcpu               0      0  19136    1    8 : tunables    8    4
0 : slabdata      0      0      0
kvm_mmu_page_header      0      0    168   24    1 : tunables  120   60
  8 : slabdata      0      0      0
xfs_dqtrx             14     14    528    7    1 : tunables   54   27
8 : slabdata      2      2      0
xfs_dquot            149    168    472    8    1 : tunables   54   27
8 : slabdata     21     21      0
xfs_rui_item           0      0    664    6    1 : tunables   54   27
8 : slabdata      0      0      0
xfs_rud_item           0      0    144   28    1 : tunables  120   60
8 : slabdata      0      0      0
xfs_icr                0      0    152   26    1 : tunables  120   60
8 : slabdata      0      0      0
xfs_inode         12898650 12907268    960    4    1 : tunables   54
27    8 : slabdata 3226817 3226817      0
xfs_efd_item           0      0    408   10    1 : tunables   54   27
8 : slabdata      0      0      0
xfs_da_state           0      0    480    8    1 : tunables   54   27
8 : slabdata      0      0      0
xfs_btree_cur        105    153    232   17    1 : tunables  120   60
8 : slabdata      9      9      0
bio-2                 52     96    320   12    1 : tunables   54   27
8 : slabdata      8      8      0
ip6-frags              0      0    200   20    1 : tunables  120   60
8 : slabdata      0      0      0
UDPv6                 18     28   1152    7    2 : tunables   24   12
8 : slabdata      4      4      0
tw_sock_TCPv6          0      0    240   17    1 : tunables  120   60
8 : slabdata      0      0      0
request_sock_TCPv6      0      0    304   13    1 : tunables   54   27
 8 : slabdata      0      0      0
TCPv6                  7     12   2112    3    2 : tunables   24   12
8 : slabdata      4      4      0
cfq_queue          99928  99977    240   17    1 : tunables  120   60
8 : slabdata   5881   5881      0
mqueue_inode_cache      1      4    896    4    1 : tunables   54   27
 8 : slabdata      1      1      0
userfaultfd_ctx_cache      0      0    128   32    1 : tunables  120
60    8 : slabdata      0      0      0
dio                    0      0    640    6    1 : tunables   54   27
8 : slabdata      0      0      0
pid_namespace          0      0   2232    3    2 : tunables   24   12
8 : slabdata      0      0      0
posix_timers_cache      0      0    216   18    1 : tunables  120   60
 8 : slabdata      0      0      0
ip4-frags             34    132    184   22    1 : tunables  120   60
8 : slabdata      6      6      0
PING                   0      0    896    4    1 : tunables   54   27
8 : slabdata      0      0      0
UDP                   44     44    960    4    1 : tunables   54   27
8 : slabdata     11     11      0
tw_sock_TCP            1     17    240   17    1 : tunables  120   60
8 : slabdata      1      1      0
request_sock_TCP       0      0    304   13    1 : tunables   54   27
8 : slabdata      0      0      0
TCP                   18     22   1920    2    1 : tunables   24   12
8 : slabdata     11     11      0
hugetlbfs_inode_cache      2      6    600    6    1 : tunables   54
27    8 : slabdata      1      1      0
dquot                  0      0    256   16    1 : tunables  120   60
8 : slabdata      0      0      0
request_queue          3     15   1488    5    2 : tunables   24   12
8 : slabdata      3      3      0
blkdev_ioc           100    234    104   39    1 : tunables  120   60
8 : slabdata      6      6      0
user_namespace         0      0    448    9    1 : tunables   54   27
8 : slabdata      0      0      0
dmaengine-unmap-256      1      3   2112    3    2 : tunables   24   12
  8 : slabdata      1      1      0
dmaengine-unmap-128      1      7   1088    7    2 : tunables   24   12
  8 : slabdata      1      1      0
sock_inode_cache     179    204    640    6    1 : tunables   54   27
8 : slabdata     34     34      0
file_lock_cache       83     95    208   19    1 : tunables  120   60
8 : slabdata      5      5      0
net_namespace          0      0   5888    1    2 : tunables    8    4
0 : slabdata      0      0      0
shmem_inode_cache    775    825    688   11    2 : tunables   54   27
8 : slabdata     75     75      0
taskstats              2     12    328   12    1 : tunables   54   27
8 : slabdata      1      1      0
proc_inode_cache    2024   2190    656    6    1 : tunables   54   27
8 : slabdata    343    365    200
sigqueue          12104736 12110275    160   25    1 : tunables  120
60    8 : slabdata 484411 484411      0
bdev_cache             8     20    832    4    1 : tunables   54   27
8 : slabdata      5      5      0
kernfs_node_cache  29210  29240    120   34    1 : tunables  120   60
8 : slabdata    860    860      0
mnt_cache         652927 827520    384   10    1 : tunables   54   27
8 : slabdata  82752  82752      0
inode_cache        12968  12985    584    7    1 : tunables   54   27
8 : slabdata   1854   1855      0
dentry            12741855 12742065    192   21    1 : tunables  120
60    8 : slabdata 606757 606765    480
iint_cache             0      0     72   56    1 : tunables  120   60
8 : slabdata      0      0      0
buffer_head       127140 127140    104   39    1 : tunables  120   60
8 : slabdata   3260   3260      0
fs_cache             171    378     64   63    1 : tunables  120   60
8 : slabdata      6      6      0
files_cache          158    220    704   11    2 : tunables   54   27
8 : slabdata     20     20      0
signal_cache         425    508   1024    4    1 : tunables   54   27
8 : slabdata    127    127      0
sighand_cache        261    294   2112    3    2 : tunables   24   12
8 : slabdata     98     98      0
task_struct          278    295   3648    1    1 : tunables   24   12
8 : slabdata    278    295      0
cred_jar           15411  17451    192   21    1 : tunables  120   60
8 : slabdata    831    831     60
Acpi-Operand      147223 147504     72   56    1 : tunables  120   60
8 : slabdata   2634   2634      0
Acpi-Parse            69    355     56   71    1 : tunables  120   60
8 : slabdata      5      5      0
Acpi-State             0      0     80   51    1 : tunables  120   60
8 : slabdata      0      0      0
Acpi-Namespace     13264  13365     40   99    1 : tunables  120   60
8 : slabdata    135    135      0
anon_vma_chain     11212  14336     64   64    1 : tunables  120   60
8 : slabdata    224    224     60
anon_vma            6167   7000     72   56    1 : tunables  120   60
8 : slabdata    125    125      0
pid                  312    576    128   32    1 : tunables  120   60
8 : slabdata     18     18      0
numa_policy            5    163     24  163    1 : tunables  120   60
8 : slabdata      1      1      0
trace_event_file    2418   2530     88   46    1 : tunables  120   60
8 : slabdata     55     55      0
ftrace_event_field   3691   3901     48   83    1 : tunables  120   60
 8 : slabdata     47     47      0
pool_workqueue        26     96    256   16    1 : tunables  120   60
8 : slabdata      6      6      0
radix_tree_node   1644760 1707538    576    7    1 : tunables   54   27
  8 : slabdata 243934 243934      0
task_group            43     84    576    7    1 : tunables   54   27
8 : slabdata     12     12      0
dma-kmalloc-4194304      0      0 4194304    1 1024 : tunables    1    1
   0 : slabdata      0      0      0
dma-kmalloc-2097152      0      0 2097152    1  512 : tunables    1    1
   0 : slabdata      0      0      0
dma-kmalloc-1048576      0      0 1048576    1  256 : tunables    1    1
   0 : slabdata      0      0      0
dma-kmalloc-524288      0      0 524288    1  128 : tunables    1    1
 0 : slabdata      0      0      0
dma-kmalloc-262144      0      0 262144    1   64 : tunables    1    1
 0 : slabdata      0      0      0
dma-kmalloc-131072      0      0 131072    1   32 : tunables    8    4
 0 : slabdata      0      0      0
dma-kmalloc-65536      0      0  65536    1   16 : tunables    8    4
0 : slabdata      0      0      0
dma-kmalloc-32768      0      0  32768    1    8 : tunables    8    4
0 : slabdata      0      0      0
dma-kmalloc-16384      0      0  16384    1    4 : tunables    8    4
0 : slabdata      0      0      0
dma-kmalloc-8192       0      0   8192    1    2 : tunables    8    4
0 : slabdata      0      0      0
dma-kmalloc-4096       0      0   4096    1    1 : tunables   24   12
8 : slabdata      0      0      0
dma-kmalloc-2048       0      0   2048    2    1 : tunables   24   12
8 : slabdata      0      0      0
dma-kmalloc-1024       0      0   1024    4    1 : tunables   54   27
8 : slabdata      0      0      0
dma-kmalloc-512        4      8    512    8    1 : tunables   54   27
8 : slabdata      1      1      0
dma-kmalloc-256        0      0    256   16    1 : tunables  120   60
8 : slabdata      0      0      0
dma-kmalloc-128        0      0    128   32    1 : tunables  120   60
8 : slabdata      0      0      0
dma-kmalloc-64         0      0     64   64    1 : tunables  120   60
8 : slabdata      0      0      0
dma-kmalloc-32         4    124     32  124    1 : tunables  120   60
8 : slabdata      1      1      0
dma-kmalloc-192        0      0    192   21    1 : tunables  120   60
8 : slabdata      0      0      0
dma-kmalloc-96         0      0    128   32    1 : tunables  120   60
8 : slabdata      0      0      0
kmalloc-4194304        0      0 4194304    1 1024 : tunables    1    1
 0 : slabdata      0      0      0
kmalloc-2097152        0      0 2097152    1  512 : tunables    1    1
 0 : slabdata      0      0      0
kmalloc-1048576        0      0 1048576    1  256 : tunables    1    1
 0 : slabdata      0      0      0
kmalloc-524288         1      1 524288    1  128 : tunables    1    1
0 : slabdata      1      1      0
kmalloc-262144         3      3 262144    1   64 : tunables    1    1
0 : slabdata      3      3      0
kmalloc-131072         3      3 131072    1   32 : tunables    8    4
0 : slabdata      3      3      0
kmalloc-65536         13     13  65536    1   16 : tunables    8    4
0 : slabdata     13     13      0
kmalloc-32768          4      4  32768    1    8 : tunables    8    4
0 : slabdata      4      4      0
kmalloc-16384         23     23  16384    1    4 : tunables    8    4
0 : slabdata     23     23      0
kmalloc-8192       44121  44121   8192    1    2 : tunables    8    4
0 : slabdata  44121  44121      0
kmalloc-4096       11850  11850   4096    1    1 : tunables   24   12
8 : slabdata  11850  11850     24
kmalloc-2048       18655  18718   2048    2    1 : tunables   24   12
8 : slabdata   9359   9359      0
kmalloc-1024      584913 603244   1024    4    1 : tunables   54   27
8 : slabdata 150811 150811      0
kmalloc-512       856079 1020576    512    8    1 : tunables   54   27
 8 : slabdata 127572 127572      0
kmalloc-256       114928 141968    256   16    1 : tunables  120   60
8 : slabdata   8873   8873    480
kmalloc-192       244249 272643    192   21    1 : tunables  120   60
8 : slabdata  12983  12983      0
kmalloc-96        3139278 3139328    128   32    1 : tunables  120   60
  8 : slabdata  98104  98104      0
kmalloc-64        25714408 33297024     64   64    1 : tunables  120
60    8 : slabdata 520266 520266      0
kmalloc-32        423132 631036     32  124    1 : tunables  120   60
8 : slabdata   5089   5089      0
kmalloc-128       213657 269952    128   32    1 : tunables  120   60
8 : slabdata   8436   8436      0
*** snip ***

*** snip ***
/proc/<pidof_xfsaild>/stack
Wed Apr 19 20:36:52 CEST 2017
[<ffffffff8ef3881e>] ring_buffer_unlock_commit+0x1e/0xa0
[<ffffffff8ef419a7>] trace_buffer_unlock_commit_regs+0x37/0x90
[<ffffffff8eea1e20>] wake_up_q+0x60/0x60
[<ffffffffc0719b5b>] xfs_buf_read_map+0x10b/0x180 [xfs]
[<ffffffffc0719888>] xfs_buf_submit_wait+0x78/0x210 [xfs]
[<ffffffffc074d2bd>] xfs_trans_read_buf_map+0xed/0x300 [xfs]
[<ffffffffc0719b5b>] xfs_buf_read_map+0x10b/0x180 [xfs]
[<ffffffffc074d2bd>] xfs_trans_read_buf_map+0xed/0x300 [xfs]
[<ffffffffc0702b77>] xfs_imap_to_bp+0x67/0xe0 [xfs]
[<ffffffffc072e84b>] xfs_iflush+0xdb/0x240 [xfs]
[<ffffffffc07437f9>] xfs_inode_item_push+0x69/0x140 [xfs]
[<ffffffffc0743886>] xfs_inode_item_push+0xf6/0x140 [xfs]
[<ffffffffc074bf25>] xfsaild+0x385/0x7e0 [xfs]
[<ffffffffc074bba0>] xfs_trans_ail_cursor_first+0x80/0x80 [xfs]
[<ffffffff8ee96d64>] kthread+0x104/0x140
[<ffffffff8ee96d64>] kthread+0x104/0x140
[<ffffffff8ee96c60>] kthread_park+0x80/0x80
[<ffffffff8ee7ba79>] do_group_exit+0x39/0xb0
[<ffffffff8f4370b6>] ret_from_fork+0x26/0x40
[<ffffffffffffffff>] 0xffffffffffffffff
*** snip ***

*** snip ***
iotop --batch -o output

Total DISK READ :    1948.60 K/s | Total DISK WRITE :       0.00 B/s
Actual DISK READ:     974.30 K/s | Actual DISK WRITE:       0.00 B/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root     1948.60 K/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
Total DISK READ :    1924.56 K/s | Total DISK WRITE :       3.94 K/s
Actual DISK READ:    1956.11 K/s | Actual DISK WRITE:       0.00 B/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root     1924.56 K/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
Total DISK READ :       2.53 M/s | Total DISK WRITE :       0.00 B/s
Actual DISK READ:       2.51 M/s | Actual DISK WRITE:      63.77 K/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root        2.53 M/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
Total DISK READ :       2.09 M/s | Total DISK WRITE :       0.00 B/s
Actual DISK READ:       2.10 M/s | Actual DISK WRITE:       0.00 B/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root        2.09 M/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
Total DISK READ :      12.12 M/s | Total DISK WRITE :       0.00 B/s
Actual DISK READ:      12.10 M/s | Actual DISK WRITE:       0.00 B/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root       12.12 M/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
Total DISK READ :       2.30 M/s | Total DISK WRITE :       0.00 B/s
Actual DISK READ:       2.28 M/s | Actual DISK WRITE:       0.00 B/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root        2.30 M/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
Total DISK READ :       4.86 M/s | Total DISK WRITE :       0.00 B/s
Actual DISK READ:       4.88 M/s | Actual DISK WRITE:    2025.26 K/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root        4.86 M/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
Total DISK READ :       2.64 M/s | Total DISK WRITE :       0.00 B/s
Actual DISK READ:       2.62 M/s | Actual DISK WRITE:       0.00 B/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root        2.64 M/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
Total DISK READ :    1899.36 K/s | Total DISK WRITE :       0.00 B/s
Actual DISK READ:    1899.36 K/s | Actual DISK WRITE:       0.00 B/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root     1899.36 K/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
Total DISK READ :    1837.53 K/s | Total DISK WRITE :       0.00 B/s
Actual DISK READ:    1837.53 K/s | Actual DISK WRITE:       0.00 B/s
    TID  PRIO  USER     DISK READ  DISK WRITE  SWAPIN      IO    COMMAND
   1609 be/4 root     1837.53 K/s    0.00 B/s  0.00 %  0.00 %
[xfsaild/md127]
*** snip ***

--
To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [XFS Filesystem Development (older mail)]     [Linux Filesystem Development]     [Linux Audio Users]     [Yosemite Trails]     [Linux Kernel]     [Linux RAID]     [Linux SCSI]


  Powered by Linux