Hello List, i have size = 3 and min_size = 2 with 3 Nodes. My OSDs: ceph osd tree ID CLASS WEIGHT TYPE NAME STATUS REWEIGHT PRI-AFF -1 60.17775 root default -2 20.21155 host ceph01 0 hdd 1.71089 osd.0 up 1.00000 1.00000 8 hdd 1.71660 osd.8 up 1.00000 1.00000 9 hdd 2.67029 osd.9 up 1.00000 1.00000 11 hdd 1.71649 osd.11 up 1.00000 1.00000 12 hdd 2.67020 osd.12 up 1.00000 1.00000 14 hdd 2.67020 osd.14 up 1.00000 1.00000 18 hdd 1.71649 osd.18 up 1.00000 1.00000 22 hdd 2.67020 osd.22 up 1.00000 1.00000 23 hdd 2.67020 osd.23 up 1.00000 1.00000 -3 19.08154 host ceph02 2 hdd 2.67029 osd.2 up 1.00000 1.00000 3 hdd 2.70000 osd.3 up 1.00000 1.00000 7 hdd 2.67029 osd.7 up 1.00000 1.00000 13 hdd 2.67020 osd.13 up 1.00000 1.00000 16 hdd 1.59999 osd.16 up 1.00000 1.00000 19 hdd 2.38409 osd.19 up 1.00000 1.00000 24 hdd 2.67020 osd.24 up 1.00000 1.00000 25 hdd 1.71649 osd.25 up 1.00000 1.00000 -4 20.88466 host ceph03 1 hdd 1.71660 osd.1 up 1.00000 1.00000 4 hdd 2.67020 osd.4 up 1.00000 1.00000 5 hdd 1.71660 osd.5 up 1.00000 1.00000 6 hdd 1.71660 osd.6 up 1.00000 1.00000 15 hdd 2.67020 osd.15 up 1.00000 1.00000 17 hdd 1.62109 osd.17 up 1.00000 1.00000 20 hdd 1.71649 osd.20 up 1.00000 1.00000 21 hdd 2.67020 osd.21 up 1.00000 1.00000 27 hdd 1.71649 osd.27 up 1.00000 1.00000 32 hdd 2.67020 osd.32 up 1.00000 1.00000 I replaced two osds on node ceph01 and ran into "HEALTH_ERR". My problem: it waits for the backfilling process? Why did i run into HEALTH_ERR? I thought all data will be available on at least one more node. or even two: HEALTH_ERR 343351/10358292 objects misplaced (3.315%); Reduced data availability: 19 pgs inactive; Degraded data redundancy: 639455/10358292 objects degraded (6.173%), 208 pgs degraded, 204 pgs undersized; application not enabled on 1 pool(s); 29 slow requests are blocked > 32 sec. Implicated osds ; 29 stuck requests are blocked > 4096 sec. Implicated osds 2,19,24 OBJECT_MISPLACED 343351/10358292 objects misplaced (3.315%) PG_AVAILABILITY Reduced data availability: 19 pgs inactive pg 0.4 is stuck inactive for 4227.236803, current state undersized+degraded+remapped+backfilling+peered, last acting [19] pg 0.12 is stuck inactive for 4227.267137, current state undersized+degraded+remapped+backfilling+peered, last acting [13] pg 0.1b is stuck inactive for 4198.153642, current state undersized+degraded+remapped+backfill_wait+peered, last acting [24] pg 0.1f is stuck inactive for 4226.574006, current state undersized+degraded+remapped+backfilling+peered, last acting [19] pg 0.61 is stuck inactive for 4227.316336, current state undersized+degraded+remapped+backfilling+peered, last acting [2] pg 0.85 is stuck inactive for 4227.287134, current state undersized+degraded+remapped+backfill_wait+peered, last acting [13] pg 0.88 is stuck inactive for 4197.261935, current state undersized+degraded+remapped+backfill_wait+peered, last acting [24] pg 0.bd is stuck inactive for 4226.607646, current state undersized+degraded+remapped+backfilling+peered, last acting [2] pg 0.fc is stuck inactive for 4226.642664, current state undersized+degraded+remapped+backfill_wait+peered, last acting [13] pg 0.140 is stuck inactive for 4198.277165, current state undersized+degraded+remapped+backfilling+peered, last acting [2] pg 0.16c is stuck inactive for 4198.268985, current state undersized+degraded+remapped+backfilling+peered, last acting [7] pg 0.21f is stuck inactive for 4198.228206, current state undersized+degraded+remapped+backfilling+peered, last acting [2] pg 0.222 is stuck inactive for 4198.241280, current state undersized+degraded+remapped+backfilling+peered, last acting [2] pg 0.27f is stuck inactive for 4198.201034, current state undersized+degraded+remapped+backfill_wait+peered, last acting [19] pg 0.297 is stuck inactive for 4197.247869, current state undersized+degraded+remapped+backfilling+peered, last acting [24] pg 0.298 is stuck inactive for 4226.572652, current state undersized+degraded+remapped+backfilling+peered, last acting [19] pg 0.2cd is stuck inactive for 4226.643455, current state undersized+degraded+remapped+backfilling+peered, last acting [16] pg 0.314 is stuck inactive for 4227.339749, current state undersized+degraded+remapped+backfilling+peered, last acting [2] pg 0.375 is stuck inactive for 4227.260662, current state undersized+degraded+remapped+backfilling+peered, last acting [19] PG_DEGRADED Degraded data redundancy: 639455/10358292 objects degraded (6.173%), 208 pgs degraded, 204 pgs undersized pg 0.17a is active+undersized+degraded+remapped+backfilling, acting [24,4] pg 0.17f is stuck undersized for 3811.397010, current state active+undersized+degraded+remapped+backfill_wait, last acting [19,17] pg 0.182 is stuck undersized for 10640.416744, current state active+undersized+degraded+remapped+backfill_wait, last acting [14,16] pg 0.184 is stuck undersized for 3938.548717, current state active+undersized+degraded+remapped+backfill_wait, last acting [7,1] pg 0.195 is stuck undersized for 3939.556198, current state active+undersized+degraded+remapped+backfill_wait, last acting [21,16] pg 0.196 is stuck undersized for 4196.543567, current state active+undersized+degraded+remapped+backfilling, last acting [3,20] pg 0.337 is stuck undersized for 3938.457718, current state active+undersized+degraded+remapped+backfill_wait, last acting [15,13] pg 0.33c is stuck undersized for 10715.420596, current state active+undersized+degraded+remapped+backfilling, last acting [2,12] pg 0.340 is stuck undersized for 3811.450013, current state active+undersized+degraded+remapped+backfilling, last acting [21,19] pg 0.345 is stuck undersized for 3939.510525, current state active+undersized+degraded+remapped+backfill_wait, last acting [4,24] pg 0.346 is stuck undersized for 10639.199276, current state active+undersized+degraded+remapped+backfill_wait, last acting [18,2] pg 0.34c is stuck undersized for 3811.523689, current state active+undersized+degraded+remapped+backfill_wait, last acting [2,15] pg 0.351 is stuck undersized for 3811.347509, current state active+undersized+degraded+remapped+backfill_wait, last acting [2,4] pg 0.356 is stuck undersized for 3811.671104, current state active+undersized+degraded+remapped+backfill_wait, last acting [0,24] pg 0.35b is stuck undersized for 4191.430143, current state active+undersized+degraded+remapped+backfilling, last acting [16,20] pg 0.35c is stuck undersized for 3939.514422, current state active+undersized+degraded+remapped+backfill_wait, last acting [4,25] pg 0.35d is stuck undersized for 3938.543293, current state active+undersized+degraded+remapped+backfill_wait, last acting [19,32] pg 0.365 is stuck undersized for 3938.524132, current state active+undersized+degraded+remapped+backfill_wait, last acting [4,25] pg 0.36c is stuck undersized for 10715.466460, current state active+undersized+degraded+remapped+backfilling, last acting [2,14] pg 0.36d is stuck undersized for 3939.540201, current state active+undersized+degraded+remapped+backfill_wait, last acting [3,32] pg 0.370 is stuck undersized for 4191.552409, current state active+undersized+degraded+remapped+backfilling, last acting [13,21] pg 0.371 is stuck undersized for 3938.440298, current state active+undersized+degraded+remapped+backfill_wait, last acting [1,13] pg 0.375 is stuck undersized for 3938.545599, current state undersized+degraded+remapped+backfilling+peered, last acting [19] pg 0.381 is stuck undersized for 3811.517412, current state active+undersized+degraded+remapped+backfill_wait, last acting [4,3] pg 0.38a is stuck undersized for 10640.436011, current state active+undersized+degraded+remapped+backfill_wait, last acting [2,11] pg 0.38b is stuck undersized for 4191.525469, current state active+undersized+degraded+remapped+backfilling, last acting [24,32] pg 0.391 is stuck undersized for 3810.314900, current state active+undersized+degraded+remapped+backfill_wait, last acting [1,19] pg 0.394 is stuck undersized for 3811.492367, current state active+undersized+degraded+remapped+backfill_wait, last acting [3,14] pg 0.397 is stuck undersized for 4191.488161, current state active+undersized+degraded+remapped+backfilling, last acting [7,32] pg 0.39a is stuck undersized for 3941.583783, current state active+undersized+degraded+remapped+backfill_wait, last acting [11,19] pg 0.3a1 is stuck undersized for 3811.656295, current state active+undersized+degraded+remapped+backfilling, last acting [2,4] pg 0.3a5 is stuck undersized for 3939.536321, current state active+undersized+degraded+remapped+backfilling, last acting [24,20] pg 0.3ab is stuck undersized for 10640.435197, current state active+undersized+degraded+remapped+backfill_wait, last acting [2,11] pg 0.3bb is stuck undersized for 10639.374080, current state active+undersized+degraded+remapped+backfill_wait, last acting [14,24] pg 0.3c1 is stuck undersized for 3811.566173, current state active+undersized+degraded+remapped+backfill_wait, last acting [19,17] pg 0.3c3 is stuck undersized for 10641.420944, current state active+undersized+degraded+remapped+backfill_wait, last acting [2,23] pg 0.3c4 is stuck undersized for 3811.554642, current state active+undersized+degraded+remapped+backfill_wait, last acting [19,21] pg 0.3c9 is stuck undersized for 4219.043674, current state active+undersized+degraded+remapped+backfilling, last acting [13,4] pg 0.3cf is stuck undersized for 3941.146510, current state active+undersized+degraded+remapped+backfill_wait, last acting [16,23] pg 0.3d0 is stuck undersized for 3938.433337, current state active+undersized+degraded+remapped+backfill_wait, last acting [1,24] pg 0.3e6 is stuck undersized for 3939.459758, current state active+undersized+degraded+remapped+backfill_wait, last acting [2,4] pg 0.3e9 is stuck undersized for 10640.420901, current state active+undersized+degraded+remapped+backfill_wait, last acting [22,2] pg 0.3eb is stuck undersized for 3811.573977, current state active+undersized+degraded+remapped+backfill_wait, last acting [1,13] pg 0.3ed is stuck undersized for 3939.549283, current state active+undersized+degraded+remapped+backfill_wait, last acting [3,4] pg 0.3f1 is stuck undersized for 3938.542883, current state active+undersized+degraded+remapped+backfill_wait, last acting [16,32] pg 0.3f2 is stuck undersized for 10639.375600, current state active+undersized+degraded+remapped+backfill_wait, last acting [23,13] pg 0.3f3 is stuck undersized for 3811.496577, current state active+undersized+degraded+remapped+backfill_wait, last acting [3,32] pg 0.3f5 is stuck undersized for 4191.587520, current state active+undersized+degraded+remapped+backfilling, last acting [13,21] pg 0.3f7 is stuck undersized for 10639.374420, current state active+undersized+degraded+remapped+backfill_wait, last acting [14,2] pg 0.3fa is stuck undersized for 10640.425955, current state active+undersized+degraded+remapped+backfill_wait, last acting [3,11] pg 0.3fe is stuck undersized for 3939.552615, current state active+undersized+degraded+remapped+backfill_wait, last acting [7,27] POOL_APP_NOT_ENABLED application not enabled on 1 pool(s) application not enabled on pool 'rbdbench' use 'ceph osd pool application enable <pool-name> <app-name>', where <app-name> is 'cephfs', 'rbd', 'rgw', or freeform for custom applications. REQUEST_SLOW 29 slow requests are blocked > 32 sec. Implicated osds 29 ops are blocked > 2097.15 sec REQUEST_STUCK 29 stuck requests are blocked > 4096 sec. Implicated osds 2,19,24 29 ops are blocked > 4194.3 sec osds 2,19,24 have stuck requests > 4194.3 sec Thanks, Mario _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com