Hi community,
we ran an extensive issue on
our installation of gluster 3.10, which we did
upgraded from 3.8.8 (it's a distribute+replicate, 5
nodes, 3 bricks in replica 2+1 quorum); recently we
noticed a frequent issue where files get duplicated on
the some of the directories; this is visible on the
fuse mount points (RW), but not on the NFS/Ganesha
(RO) mount points.
A sample of an ll output:
---------T 1 48 web_rw 0 Mar
10 11:57 paginazione.shtml
-rw-rw-r-- 1 48 web_rw 272 Feb
18 22:00 paginazione.shtml
As you can see, the file is listed twice, but only one
of the two is good (the name is identical, we verified
that no spurious/hidden characters are present in the
name); the issue maybe is related on how we uploaded the
files on the file system, via incremental rsync on the
fuse mount.
Do anyone have suggestion on how it can happen, how to
solve existing duplication or how to prevent to happen
anymore.
Thanks in advance.
Options Reconfigured:
performance.cache-invalidation:
true
cluster.favorite-child-policy:
mtime
features.cache-invalidation: 1
network.inode-lru-limit: 90000
performance.cache-size: 1024MB
storage.linux-aio: on
nfs.outstanding-rpc-limit: 64
storage.build-pgfid: on
cluster.server-quorum-type:
server
cluster.self-heal-daemon: enable
performance.nfs.io-cache: on
performance.client-io-threads: on
performance.nfs.stat-prefetch: on
performance.nfs.io-threads: on
diagnostics.latency-measurement:
on
diagnostics.count-fop-hits: on
performance.md-cache-timeout: 1
performance.io-thread-count: 16
performance.high-prio-threads: 32
performance.normal-prio-threads:
32
performance.low-prio-threads: 32
performance.least-prio-threads: 1
nfs.acl: off
nfs.rpc-auth-unix: off
diagnostics.client-log-level:
ERROR
diagnostics.brick-log-level:
ERROR
cluster.lookup-unhashed: auto
performance.nfs.quick-read: on
performance.nfs.read-ahead: on
cluster.quorum-type: auto
cluster.locking-scheme: granular
cluster.data-self-heal-algorithm:
full
transport.address-family: inet
performance.readdir-ahead: on
nfs.disable: on
cluster.lookup-optimize: on
cluster.readdir-optimize: on
performance.read-ahead: off
performance.write-behind-window-size:
1MB
client.event-threads: 4
server.event-threads: 16
cluster.granular-entry-heal:
enable
performance.parallel-readdir: on
cluster.server-quorum-ratio: 51