Re: Inconsistent PGs every few days

Konstantin Shalygin <k0ste@xxxxxxxx> · Wed, 8 Aug 2018 12:11:15 +0700

        Hi,

I run a cluster with 7 OSD. The cluster has no much traffic on it. But 
every few days, I get a HEALTH_ERR, because of inconsistent PGs:

root at Sam ~ # ceph status
   cluster:
     id:     c4bfc288-8ba8-4c3a-b3a6-ed95503f50b7
     health: HEALTH_ERR
             3 scrub errors
             Possible data damage: 3 pgs inconsistent

   services:
     mon: 1 daemons, quorum mon1
     mgr: ceph-osd1(active)
     mds: FS-1/1/1 up  {0=ceph-osd1=up:active}
     osd: 11 osds: 8 up, 7 in
     rgw: 1 daemon active

   data:
     pools:   6 pools, 168 pgs
     objects: 901.8 k objects, 2.6 TiB
     usage: 7.9 TiB used, 7.4 TiB / 15 TiB avail
     pgs:     165 active+clean
              3   active+clean+inconsistent

   io:
     client:   641 KiB/s wr, 0 op/s rd, 3 op/s wr

root at Sam ~ # ceph health detail
HEALTH_ERR 3 scrub errors; Possible data damage: 3 pgs inconsistent
OSD_SCRUB_ERRORS 3 scrub errors
PG_DAMAGED Possible data damage: 3 pgs inconsistent
     pg 5.1d is active+clean+inconsistent, acting [6,8,3]
     pg 5.20 is active+clean+inconsistent, acting [3,9,0]
     pg 5.4a is active+clean+inconsistent, acting [6,3,7]

What's the reason for this problem? How can I analyse it?

    Dimitri, what is your hardware? Your RAM is ECC?

    k

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com