Ceph objects unfound


 



  
Hello,
 
After a host reboot, the cluster could not find one object. Before the reboot the cluster was in a stable state with all PGs active+clean; no OSD was out, and no other OSD was restarted during the host reboot. This happened one month ago. We hoped the cluster would eventually find the object, but it has not.
Cluster version: ceph version 16.2.9, ceph-deploy cluster, pool size 2.
 
Attached are ceph.log, the OSD logs, the pg query output, and other logs.
 
Cluster status:
 cluster:
 id: 2517da9e-af62-405e-b71f-1f2e145822f7
 health: HEALTH_ERR
 client is using insecure global_id reclaim
 mons are allowing insecure global_id reclaim
 1/606943089 objects unfound (0.000%)
 Possible data damage: 1 pg recovery_unfound
 Degraded data redundancy: 7252/1219946300 objects degraded (0.001%), 1 pg degraded, 1 pg undersized
 1 pgs not deep-scrubbed in time
 1 pgs not scrubbed in time 

data:
 volumes: 1/1 healthy
 pools: 12 pools, 6560 pgs
 objects: 606.94M objects, 85 TiB
 usage: 169 TiB used, 268 TiB / 438 TiB avail
 pgs: 7252/1219946300 objects degraded (0.001%)
 7250/1219946300 objects misplaced (0.001%)
 1/606943089 objects unfound (0.000%)
 6554 active+clean
 4 active+clean+scrubbing+deep
 1 active+recovery_unfound+undersized+degraded+remapped
 1 active+clean+scrubbing
 
io:
 client: 1.2 GiB/s rd, 1.4 GiB/s wr, 40.87k op/s rd, 72.80k op/s wr
 
progress:
 Global Recovery Event (2w)
 [===========================.] (remaining: 4m)


 
Ceph health detail
 
HEALTH_ERR clients are using insecure global_id reclaim; mons are allowing insecure global_id reclaim; 1/606997573 objects unfound (0.000%); Possible data damage: 1 pg recovery_unfound; Degraded data redundancy: 7294/1220048932 objects degraded (0.001%), 1 pg degraded, 1 pg undersized; 1 pgs not deep-scrubbed in time; 1 pgs not scrubbed in time
...
[WRN] OBJECT_UNFOUND: 1/606997573 objects unfound (0.000%)
 pg 16.1e has 1 unfound objects
[ERR] PG_DAMAGED: Possible data damage: 1 pg recovery_unfound
 pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
[WRN] PG_DEGRADED: Degraded data redundancy: 7294/1220048932 objects degraded (0.001%), 1 pg degraded, 1 pg undersized
 pg 16.1e is stuck undersized for 3h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
[WRN] PG_NOT_DEEP_SCRUBBED: 1 pgs not deep-scrubbed in time
 pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
[WRN] PG_NOT_SCRUBBED: 1 pgs not scrubbed in time
 pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
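
When the health detail output gets long, it can help to group the detail lines under their health check codes and pull out the PG IDs. The following is only a minimal sketch, assuming the text layout shown above (a `[SEVERITY] CODE: summary` header followed by indented detail lines); the helper names `group_health_checks` and `pgs_in` are made up for illustration and are not part of Ceph.

```python
import re

# Excerpt of the `ceph health detail` output quoted above.
HEALTH_DETAIL = """\
[WRN] OBJECT_UNFOUND: 1/606997573 objects unfound (0.000%)
 pg 16.1e has 1 unfound objects
[ERR] PG_DAMAGED: Possible data damage: 1 pg recovery_unfound
 pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
[WRN] PG_NOT_SCRUBBED: 1 pgs not scrubbed in time
 pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
"""

# Header lines look like "[WRN] OBJECT_UNFOUND: summary text".
HEADER = re.compile(r"^\[(\w+)\] (\w+): (.*)$")


def group_health_checks(text):
    """Group detail lines under the health check header that precedes them."""
    checks = {}
    current = None
    for line in text.splitlines():
        m = HEADER.match(line)
        if m:
            severity, code, summary = m.groups()
            current = {"severity": severity, "summary": summary, "details": []}
            checks[code] = current
        elif current is not None and line.strip():
            current["details"].append(line.strip())
    return checks


def pgs_in(details):
    """Return the sorted PG IDs named at the start of the detail lines."""
    found = set()
    for detail in details:
        m = re.match(r"pg (\S+) ", detail)
        if m:
            found.add(m.group(1))
    return sorted(found)


checks = group_health_checks(HEALTH_DETAIL)
for code, check in checks.items():
    print(code, check["severity"], pgs_in(check["details"]))
```

In this excerpt every check points at the same PG, 16.1e.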
 
The PG is acting only on osd.131, even after we moved the PG to other OSDs:
ceph pg map 16.1e
osdmap e723093 pg 16.1e (16.1e) -> up [41,141] acting [131]
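
The mismatch between the up set and the acting set can be checked mechanically. This is a rough sketch that parses the single-line format printed by `ceph pg map` above; the function name `parse_pg_map` is hypothetical and the regular expression only assumes the layout shown in this post.

```python
import re

# Matches e.g. "osdmap e723093 pg 16.1e (16.1e) -> up [41,141] acting [131]"
PG_MAP = re.compile(
    r"osdmap e(\d+) pg (\S+) \(\S+\) -> up \[([\d,]*)\] acting \[([\d,]*)\]"
)


def parse_pg_map(line):
    """Parse one line of `ceph pg map <pgid>` output into a dict."""
    m = PG_MAP.search(line)
    if m is None:
        raise ValueError("unrecognized pg map line: %r" % line)
    epoch, pgid, up, acting = m.groups()

    def to_ids(text):
        return [int(x) for x in text.split(",") if x]

    return {
        "epoch": int(epoch),
        "pgid": pgid,
        "up": to_ids(up),
        "acting": to_ids(acting),
    }


info = parse_pg_map("osdmap e723093 pg 16.1e (16.1e) -> up [41,141] acting [131]")
# The PG counts as remapped when the acting set differs from the up set.
remapped = info["up"] != info["acting"]
print(info["pgid"], "up", info["up"], "acting", info["acting"], "remapped:", remapped)
```

For the line above, the up set is [41, 141] while the acting set is [131], which matches the `remapped` flag in the PG state.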
 
In the ceph osd dump output, the PG is mapped as a pg_temp:
 
ceph osd dump | grep -w 16.1e
pg_temp 16.1e [131]
 
What we did:
- Restarted all OSDs and hosts involved.
- Forced a deep scrub on the PG (the PG can no longer be scrubbed).
- If we stop osd.131, the PG becomes inactive and down, as if it were the only OSD containing the objects: Reduced data availability: 1 pg inactive, 1 pg down
- If we mark osd.131 out, the PG does not move to the new OSD; it remains only on osd.131.
- ceph force recovery
- ceph force repeer
- ceph pg repair 16.1e
- Used ceph-objectstore-tool to search for the unfound object (rbd_data.ad5ab66b8b4567.0000000000011055) on all OSDs involved; the object is present only on osd.41 and osd.131, even though the PG is mapped to other OSDs.
- ceph pg remap: we tried to remap the PG to other OSDs (ceph osd pg-upmap-items 16.1e 131 141), but the PG does not move to the new OSDs; it remains on osd.41 and osd.131 (ceph pg map 16.1e: osdmap e723093 pg 16.1e (16.1e) -> up [41,141] acting [131])
 
Why is this happening?
How can we help the cluster find the lost object?
Can we remove pg_temp 16.1e [131] from upmap (ceph osd dump)?
 
Thank you,
Martin Culcea
 
ceph pg map 16.1e
osdmap e723050 pg 16.1e (16.1e) -> up [41,141] acting [131]
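
The `ceph pg 16.1e query` output that follows is long, so a short script can help compare the peers. This is only a sketch: the `SAMPLE` document is trimmed by hand to a few fields copied from that output, and the helper names `eversion` and `summarize` are hypothetical. A `last_backfill` of `MAX` is generally read as a fully backfilled copy, while `MIN` suggests that backfill has not progressed on that peer; treat that reading as an assumption rather than a diagnosis.

```python
import json

# Hand-trimmed sample with values copied from the query output in this post.
SAMPLE = """
{
  "info": {"pgid": "16.1e", "last_update": "723050'1044649126",
           "last_backfill": "MAX", "incomplete": 0},
  "peer_info": [
    {"peer": "0",   "last_update": "0'0",
     "last_backfill": "MAX", "incomplete": 0},
    {"peer": "41",  "last_update": "723050'1044649126",
     "last_backfill": "MIN", "incomplete": 1},
    {"peer": "42",  "last_update": "723044'1044624668",
     "last_backfill": "MIN", "incomplete": 1},
    {"peer": "141", "last_update": "723050'1044649126",
     "last_backfill": "MIN", "incomplete": 1}
  ]
}
"""


def eversion(text):
    """Turn a version string "epoch'version" into a comparable tuple."""
    epoch, version = text.split("'")
    return int(epoch), int(version)


def summarize(query):
    """Return one summary row per peer, compared against the primary's info."""
    primary_update = eversion(query["info"]["last_update"])
    rows = []
    for peer in query["peer_info"]:
        rows.append({
            "osd": peer["peer"],
            "last_update": peer["last_update"],
            "behind_primary": eversion(peer["last_update"]) < primary_update,
            "last_backfill": peer["last_backfill"],
            "incomplete": bool(peer["incomplete"]),
        })
    return rows


rows = summarize(json.loads(SAMPLE))
for row in rows:
    print(row)
```

To run it against the real data, the output of `ceph pg 16.1e query` can be saved to a file and loaded with `json.load` instead of using `SAMPLE`.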









ceph pg 16.1e query
{
    "snap_trimq": "[]",
    "snap_trimq_len": 0,
    "state": "active+recovery_unfound+undersized+degraded+remapped",
    "epoch": 723050,
    "up": [
        41,
        141
    ],
    "acting": [
        131
    ],
    "backfill_targets": [
        "41",
        "141"
    ],
    "acting_recovery_backfill": [
        "41",
        "131",
        "141"
    ],
    "info": {
        "pgid": "16.1e",
        "last_update": "723050'1044649126",
        "last_complete": "0'0",
        "log_tail": "723049'1044639041",
        "last_user_version": 1044649126,
        "last_backfill": "MAX",
        "purged_snaps": [],
        "history": {
            "epoch_created": 6813,
            "epoch_pool_created": 6813,
            "last_epoch_started": 723046,
            "last_interval_started": 723045,
            "last_epoch_clean": 713865,
            "last_interval_clean": 713864,
            "last_epoch_split": 53947,
            "last_epoch_marked_full": 76013,
            "same_up_since": 723045,
            "same_interval_since": 723045,
            "same_primary_since": 722482,
            "last_scrub": "710579'988167138",
            "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
            "last_deep_scrub": "708883'971173140",
            "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
            "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
            "prior_readable_until_ub": 0
        },
        "stats": {
            "version": "723050'1044649126",
            "reported_seq": 1520851317,
            "reported_epoch": 723050,
            "state": "active+recovery_unfound+undersized+degraded+remapped",
            "last_fresh": "2022-06-29T13:58:01.090917+0300",
            "last_change": "2022-06-29T13:44:37.670390+0300",
            "last_active": "2022-06-29T13:58:01.090917+0300",
            "last_peered": "2022-06-29T13:58:01.090917+0300",
            "last_clean": "2022-06-09T16:18:34.751609+0300",
            "last_became_active": "2022-06-29T13:44:37.586107+0300",
            "last_became_peered": "2022-06-29T13:44:37.586107+0300",
            "last_unstale": "2022-06-29T13:58:01.090917+0300",
            "last_undegraded": "2022-06-29T13:44:37.669704+0300",
            "last_fullsized": "2022-06-29T13:44:37.072980+0300",
            "mapping_epoch": 723045,
            "log_start": "723049'1044639041",
            "ondisk_log_start": "723049'1044639041",
            "created": 6813,
            "last_epoch_clean": 713865,
            "parent": "0.0",
            "parent_split_bits": 0,
            "last_scrub": "710579'988167138",
            "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
            "last_deep_scrub": "708883'971173140",
            "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
            "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
            "log_size": 10085,
            "ondisk_log_size": 10085,
            "stats_invalid": false,
            "dirty_stats_invalid": false,
            "omap_stats_invalid": false,
            "hitset_stats_invalid": false,
            "hitset_bytes_stats_invalid": false,
            "pin_stats_invalid": false,
            "manifest_stats_invalid": false,
            "snaptrimq_len": 0,
            "stat_sum": {
                "num_bytes": 29269213184,
                "num_objects": 7255,
                "num_object_clones": 0,
                "num_object_copies": 14510,
                "num_objects_missing_on_primary": 1,
                "num_objects_missing": 1,
                "num_objects_degraded": 7256,
                "num_objects_misplaced": 7254,
                "num_objects_unfound": 1,
                "num_objects_dirty": 7255,
                "num_whiteouts": 0,
                "num_read": 449932920,
                "num_read_kb": 8094044560,
                "num_write": 1022291571,
                "num_write_kb": 17372688024,
                "num_scrub_errors": 0,
                "num_shallow_scrub_errors": 0,
                "num_deep_scrub_errors": 0,
                "num_objects_recovered": 82882,
                "num_bytes_recovered": 328765462528,
                "num_keys_recovered": 6,
                "num_objects_omap": 0,
                "num_objects_hit_set_archive": 0,
                "num_bytes_hit_set_archive": 0,
                "num_flush": 0,
                "num_flush_kb": 0,
                "num_evict": 0,
                "num_evict_kb": 0,
                "num_promote": 0,
                "num_flush_mode_high": 0,
                "num_flush_mode_low": 0,
                "num_evict_mode_some": 0,
                "num_evict_mode_full": 0,
                "num_objects_pinned": 0,
                "num_legacy_snapsets": 0,
                "num_large_omap_objects": 0,
                "num_objects_manifest": 0,
                "num_omap_bytes": 0,
                "num_omap_keys": 0,
                "num_objects_repaired": 0
            },
            "up": [
                41,
                141
            ],
            "acting": [
                131
            ],
            "avail_no_missing": [
                "42",
                "210",
                "348"
            ],
            "object_location_counts": [
                {
                    "shards": "131",
                    "objects": 7255
                }
            ],
            "blocked_by": [],
            "up_primary": 41,
            "acting_primary": 131,
            "purged_snaps": []
        },
        "empty": 0,
        "dne": 0,
        "incomplete": 0,
        "last_epoch_started": 723046,
        "hit_set_history": {
            "current_last_update": "0'0",
            "history": []
        }
    },
    "peer_info": [
        {
            "peer": "0",
            "pgid": "16.1e",
            "last_update": "0'0",
            "last_complete": "0'0",
            "log_tail": "0'0",
            "last_user_version": 0,
            "last_backfill": "MAX",
            "purged_snaps": [],
            "history": {
                "epoch_created": 0,
                "epoch_pool_created": 0,
                "last_epoch_started": 0,
                "last_interval_started": 0,
                "last_epoch_clean": 0,
                "last_interval_clean": 0,
                "last_epoch_split": 0,
                "last_epoch_marked_full": 0,
                "same_up_since": 0,
                "same_interval_since": 0,
                "same_primary_since": 0,
                "last_scrub": "0'0",
                "last_scrub_stamp": "0.000000",
                "last_deep_scrub": "0'0",
                "last_deep_scrub_stamp": "0.000000",
                "last_clean_scrub_stamp": "0.000000",
                "prior_readable_until_ub": 0
            },
            "stats": {
                "version": "0'0",
                "reported_seq": 0,
                "reported_epoch": 0,
                "state": "unknown",
                "last_fresh": "0.000000",
                "last_change": "0.000000",
                "last_active": "0.000000",
                "last_peered": "0.000000",
                "last_clean": "0.000000",
                "last_became_active": "0.000000",
                "last_became_peered": "0.000000",
                "last_unstale": "0.000000",
                "last_undegraded": "0.000000",
                "last_fullsized": "0.000000",
                "mapping_epoch": 0,
                "log_start": "0'0",
                "ondisk_log_start": "0'0",
                "created": 0,
                "last_epoch_clean": 0,
                "parent": "0.0",
                "parent_split_bits": 0,
                "last_scrub": "0'0",
                "last_scrub_stamp": "0.000000",
                "last_deep_scrub": "0'0",
                "last_deep_scrub_stamp": "0.000000",
                "last_clean_scrub_stamp": "0.000000",
                "log_size": 0,
                "ondisk_log_size": 0,
                "stats_invalid": false,
                "dirty_stats_invalid": false,
                "omap_stats_invalid": false,
                "hitset_stats_invalid": false,
                "hitset_bytes_stats_invalid": false,
                "pin_stats_invalid": false,
                "manifest_stats_invalid": false,
                "snaptrimq_len": 0,
                "stat_sum": {
                    "num_bytes": 0,
                    "num_objects": 0,
                    "num_object_clones": 0,
                    "num_object_copies": 0,
                    "num_objects_missing_on_primary": 0,
                    "num_objects_missing": 7255,
                    "num_objects_degraded": 0,
                    "num_objects_misplaced": 0,
                    "num_objects_unfound": 0,
                    "num_objects_dirty": 0,
                    "num_whiteouts": 0,
                    "num_read": 0,
                    "num_read_kb": 0,
                    "num_write": 0,
                    "num_write_kb": 0,
                    "num_scrub_errors": 0,
                    "num_shallow_scrub_errors": 0,
                    "num_deep_scrub_errors": 0,
                    "num_objects_recovered": 0,
                    "num_bytes_recovered": 0,
                    "num_keys_recovered": 0,
                    "num_objects_omap": 0,
                    "num_objects_hit_set_archive": 0,
                    "num_bytes_hit_set_archive": 0,
                    "num_flush": 0,
                    "num_flush_kb": 0,
                    "num_evict": 0,
                    "num_evict_kb": 0,
                    "num_promote": 0,
                    "num_flush_mode_high": 0,
                    "num_flush_mode_low": 0,
                    "num_evict_mode_some": 0,
                    "num_evict_mode_full": 0,
                    "num_objects_pinned": 0,
                    "num_legacy_snapsets": 0,
                    "num_large_omap_objects": 0,
                    "num_objects_manifest": 0,
                    "num_omap_bytes": 0,
                    "num_omap_keys": 0,
                    "num_objects_repaired": 0
                },
                "up": [],
                "acting": [],
                "avail_no_missing": [],
                "object_location_counts": [],
                "blocked_by": [],
                "up_primary": -1,
                "acting_primary": -1,
                "purged_snaps": []
            },
            "empty": 1,
            "dne": 1,
            "incomplete": 0,
            "last_epoch_started": 0,
            "hit_set_history": {
                "current_last_update": "0'0",
                "history": []
            }
        },
        {
            "peer": "41",
            "pgid": "16.1e",
            "last_update": "723050'1044649126",
            "last_complete": "723050'1044649126",
            "log_tail": "723039'1044614668",
            "last_user_version": 1044611140,
            "last_backfill": "MIN",
            "purged_snaps": [],
            "history": {
                "epoch_created": 6813,
                "epoch_pool_created": 6813,
                "last_epoch_started": 723046,
                "last_interval_started": 723045,
                "last_epoch_clean": 713865,
                "last_interval_clean": 713864,
                "last_epoch_split": 53947,
                "last_epoch_marked_full": 76013,
                "same_up_since": 723045,
                "same_interval_since": 723045,
                "same_primary_since": 722482,
                "last_scrub": "710579'988167138",
                "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "last_deep_scrub": "708883'971173140",
                "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
                "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "prior_readable_until_ub": 0
            },
            "stats": {
                "version": "722254'1039600127",
                "reported_seq": 1443357445,
                "reported_epoch": 722260,
                "state": "peering",
                "last_fresh": "2022-06-27T15:22:20.422891+0300",
                "last_change": "2022-06-27T15:22:20.422694+0300",
                "last_active": "2022-06-09T16:24:10.485928+0300",
                "last_peered": "2022-06-09T16:18:34.751609+0300",
                "last_clean": "2022-06-09T16:18:34.751609+0300",
                "last_became_active": "2022-06-09T14:46:08.142913+0300",
                "last_became_peered": "2022-06-09T14:46:08.142913+0300",
                "last_unstale": "2022-06-27T15:22:20.422891+0300",
                "last_undegraded": "2022-06-27T15:22:20.422891+0300",
                "last_fullsized": "2022-06-27T15:22:20.422891+0300",
                "mapping_epoch": 723045,
                "log_start": "722254'1039590041",
                "ondisk_log_start": "722254'1039590041",
                "created": 6813,
                "last_epoch_clean": 713865,
                "parent": "0.0",
                "parent_split_bits": 0,
                "last_scrub": "710579'988167138",
                "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "last_deep_scrub": "708883'971173140",
                "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
                "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "log_size": 10086,
                "ondisk_log_size": 10086,
                "stats_invalid": false,
                "dirty_stats_invalid": false,
                "omap_stats_invalid": false,
                "hitset_stats_invalid": false,
                "hitset_bytes_stats_invalid": false,
                "pin_stats_invalid": false,
                "manifest_stats_invalid": false,
                "snaptrimq_len": 0,
                "stat_sum": {
                    "num_bytes": 46920675328,
                    "num_objects": 0,
                    "num_object_clones": 0,
                    "num_object_copies": 0,
                    "num_objects_missing_on_primary": 0,
                    "num_objects_missing": 7255,
                    "num_objects_degraded": 0,
                    "num_objects_misplaced": 0,
                    "num_objects_unfound": 0,
                    "num_objects_dirty": 0,
                    "num_whiteouts": 0,
                    "num_read": 0,
                    "num_read_kb": 0,
                    "num_write": 0,
                    "num_write_kb": 0,
                    "num_scrub_errors": 0,
                    "num_shallow_scrub_errors": 0,
                    "num_deep_scrub_errors": 0,
                    "num_objects_recovered": 0,
                    "num_bytes_recovered": 0,
                    "num_keys_recovered": 0,
                    "num_objects_omap": 0,
                    "num_objects_hit_set_archive": 0,
                    "num_bytes_hit_set_archive": 0,
                    "num_flush": 0,
                    "num_flush_kb": 0,
                    "num_evict": 0,
                    "num_evict_kb": 0,
                    "num_promote": 0,
                    "num_flush_mode_high": 0,
                    "num_flush_mode_low": 0,
                    "num_evict_mode_some": 0,
                    "num_evict_mode_full": 0,
                    "num_objects_pinned": 0,
                    "num_legacy_snapsets": 0,
                    "num_large_omap_objects": 0,
                    "num_objects_manifest": 0,
                    "num_omap_bytes": 0,
                    "num_omap_keys": 0,
                    "num_objects_repaired": 0
                },
                "up": [
                    41,
                    141
                ],
                "acting": [
                    131
                ],
                "avail_no_missing": [],
                "object_location_counts": [],
                "blocked_by": [
                    210,
                    348
                ],
                "up_primary": 41,
                "acting_primary": 131,
                "purged_snaps": []
            },
            "empty": 0,
            "dne": 0,
            "incomplete": 1,
            "last_epoch_started": 723046,
            "hit_set_history": {
                "current_last_update": "0'0",
                "history": []
            }
        },
        {
            "peer": "42",
            "pgid": "16.1e",
            "last_update": "723044'1044624668",
            "last_complete": "723044'1044624668",
            "log_tail": "723039'1044614641",
            "last_user_version": 1044624668,
            "last_backfill": "MIN",
            "purged_snaps": [],
            "history": {
                "epoch_created": 6813,
                "epoch_pool_created": 6813,
                "last_epoch_started": 723039,
                "last_interval_started": 723038,
                "last_epoch_clean": 713865,
                "last_interval_clean": 713864,
                "last_epoch_split": 53947,
                "last_epoch_marked_full": 76013,
                "same_up_since": 723045,
                "same_interval_since": 723045,
                "same_primary_since": 722482,
                "last_scrub": "710579'988167138",
                "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "last_deep_scrub": "708883'971173140",
                "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
                "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "prior_readable_until_ub": 0
            },
            "stats": {
                "version": "722477'1039743723",
                "reported_seq": 17,
                "reported_epoch": 722481,
                "state": "peering",
                "last_fresh": "2022-06-27T16:39:50.568020+0300",
                "last_change": "2022-06-27T16:39:50.568020+0300",
                "last_active": "0.000000",
                "last_peered": "0.000000",
                "last_clean": "0.000000",
                "last_became_active": "0.000000",
                "last_became_peered": "0.000000",
                "last_unstale": "2022-06-27T16:39:50.568020+0300",
                "last_undegraded": "2022-06-27T16:39:50.568020+0300",
                "last_fullsized": "2022-06-27T16:39:50.568020+0300",
                "mapping_epoch": 723045,
                "log_start": "722460'1039733641",
                "ondisk_log_start": "722460'1039733641",
                "created": 6813,
                "last_epoch_clean": 713865,
                "parent": "0.0",
                "parent_split_bits": 0,
                "last_scrub": "710579'988167138",
                "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "last_deep_scrub": "708883'971173140",
                "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
                "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "log_size": 10082,
                "ondisk_log_size": 10082,
                "stats_invalid": false,
                "dirty_stats_invalid": false,
                "omap_stats_invalid": false,
                "hitset_stats_invalid": false,
                "hitset_bytes_stats_invalid": false,
                "pin_stats_invalid": false,
                "manifest_stats_invalid": false,
                "snaptrimq_len": 0,
                "stat_sum": {
                    "num_bytes": 0,
                    "num_objects": 0,
                    "num_object_clones": 0,
                    "num_object_copies": 0,
                    "num_objects_missing_on_primary": 0,
                    "num_objects_missing": 0,
                    "num_objects_degraded": 0,
                    "num_objects_misplaced": 0,
                    "num_objects_unfound": 0,
                    "num_objects_dirty": 0,
                    "num_whiteouts": 0,
                    "num_read": 0,
                    "num_read_kb": 0,
                    "num_write": 0,
                    "num_write_kb": 0,
                    "num_scrub_errors": 0,
                    "num_shallow_scrub_errors": 0,
                    "num_deep_scrub_errors": 0,
                    "num_objects_recovered": 0,
                    "num_bytes_recovered": 0,
                    "num_keys_recovered": 0,
                    "num_objects_omap": 0,
                    "num_objects_hit_set_archive": 0,
                    "num_bytes_hit_set_archive": 0,
                    "num_flush": 0,
                    "num_flush_kb": 0,
                    "num_evict": 0,
                    "num_evict_kb": 0,
                    "num_promote": 0,
                    "num_flush_mode_high": 0,
                    "num_flush_mode_low": 0,
                    "num_evict_mode_some": 0,
                    "num_evict_mode_full": 0,
                    "num_objects_pinned": 0,
                    "num_legacy_snapsets": 0,
                    "num_large_omap_objects": 0,
                    "num_objects_manifest": 0,
                    "num_omap_bytes": 0,
                    "num_omap_keys": 0,
                    "num_objects_repaired": 0
                },
                "up": [
                    41,
                    141
                ],
                "acting": [
                    131
                ],
                "avail_no_missing": [],
                "object_location_counts": [],
                "blocked_by": [
                    0,
                    41,
                    131,
                    210,
                    348
                ],
                "up_primary": 41,
                "acting_primary": 131,
                "purged_snaps": []
            },
            "empty": 0,
            "dne": 0,
            "incomplete": 1,
            "last_epoch_started": 723039,
            "hit_set_history": {
                "current_last_update": "0'0",
                "history": []
            }
        },
        {
            "peer": "141",
            "pgid": "16.1e",
            "last_update": "723050'1044649126",
            "last_complete": "723050'1044649126",
            "log_tail": "723039'1044614668",
            "last_user_version": 1044565479,
            "last_backfill": "MIN",
            "purged_snaps": [],
            "history": {
                "epoch_created": 6813,
                "epoch_pool_created": 6813,
                "last_epoch_started": 723046,
                "last_interval_started": 723045,
                "last_epoch_clean": 713865,
                "last_interval_clean": 713864,
                "last_epoch_split": 53947,
                "last_epoch_marked_full": 76013,
                "same_up_since": 723045,
                "same_interval_since": 723045,
                "same_primary_since": 722482,
                "last_scrub": "710579'988167138",
                "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "last_deep_scrub": "708883'971173140",
                "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
                "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "prior_readable_until_ub": 0
            },
            "stats": {
                "version": "0'0",
                "reported_seq": 0,
                "reported_epoch": 0,
                "state": "unknown",
                "last_fresh": "0.000000",
                "last_change": "0.000000",
                "last_active": "0.000000",
                "last_peered": "0.000000",
                "last_clean": "0.000000",
                "last_became_active": "0.000000",
                "last_became_peered": "0.000000",
                "last_unstale": "0.000000",
                "last_undegraded": "0.000000",
                "last_fullsized": "0.000000",
                "mapping_epoch": 723045,
                "log_start": "0'0",
                "ondisk_log_start": "0'0",
                "created": 0,
                "last_epoch_clean": 0,
                "parent": "0.0",
                "parent_split_bits": 0,
                "last_scrub": "0'0",
                "last_scrub_stamp": "0.000000",
                "last_deep_scrub": "0'0",
                "last_deep_scrub_stamp": "0.000000",
                "last_clean_scrub_stamp": "0.000000",
                "log_size": 0,
                "ondisk_log_size": 0,
                "stats_invalid": false,
                "dirty_stats_invalid": false,
                "omap_stats_invalid": false,
                "hitset_stats_invalid": false,
                "hitset_bytes_stats_invalid": false,
                "pin_stats_invalid": false,
                "manifest_stats_invalid": false,
                "snaptrimq_len": 0,
                "stat_sum": {
                    "num_bytes": 0,
                    "num_objects": 0,
                    "num_object_clones": 0,
                    "num_object_copies": 0,
                    "num_objects_missing_on_primary": 0,
                    "num_objects_missing": 7255,
                    "num_objects_degraded": 0,
                    "num_objects_misplaced": 0,
                    "num_objects_unfound": 0,
                    "num_objects_dirty": 0,
                    "num_whiteouts": 0,
                    "num_read": 0,
                    "num_read_kb": 0,
                    "num_write": 0,
                    "num_write_kb": 0,
                    "num_scrub_errors": 0,
                    "num_shallow_scrub_errors": 0,
                    "num_deep_scrub_errors": 0,
                    "num_objects_recovered": 0,
                    "num_bytes_recovered": 0,
                    "num_keys_recovered": 0,
                    "num_objects_omap": 0,
                    "num_objects_hit_set_archive": 0,
                    "num_bytes_hit_set_archive": 0,
                    "num_flush": 0,
                    "num_flush_kb": 0,
                    "num_evict": 0,
                    "num_evict_kb": 0,
                    "num_promote": 0,
                    "num_flush_mode_high": 0,
                    "num_flush_mode_low": 0,
                    "num_evict_mode_some": 0,
                    "num_evict_mode_full": 0,
                    "num_objects_pinned": 0,
                    "num_legacy_snapsets": 0,
                    "num_large_omap_objects": 0,
                    "num_objects_manifest": 0,
                    "num_omap_bytes": 0,
                    "num_omap_keys": 0,
                    "num_objects_repaired": 0
                },
                "up": [
                    41,
                    141
                ],
                "acting": [
                    131
                ],
                "avail_no_missing": [],
                "object_location_counts": [],
                "blocked_by": [],
                "up_primary": 41,
                "acting_primary": 131,
                "purged_snaps": []
            },
            "empty": 0,
            "dne": 0,
            "incomplete": 1,
            "last_epoch_started": 723046,
            "hit_set_history": {
                "current_last_update": "0'0",
                "history": []
            }
        },
        {
            "peer": "210",
            "pgid": "16.1e",
            "last_update": "720573'1025257351",
            "last_complete": "720573'1025257351",
            "log_tail": "720569'1025247341",
            "last_user_version": 1025257351,
            "last_backfill": "MIN",
            "purged_snaps": [],
            "history": {
                "epoch_created": 6813,
                "epoch_pool_created": 6813,
                "last_epoch_started": 723039,
                "last_interval_started": 723038,
                "last_epoch_clean": 713865,
                "last_interval_clean": 713864,
                "last_epoch_split": 53947,
                "last_epoch_marked_full": 76013,
                "same_up_since": 723045,
                "same_interval_since": 723045,
                "same_primary_since": 722482,
                "last_scrub": "710579'988167138",
                "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "last_deep_scrub": "708883'971173140",
                "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
                "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "prior_readable_until_ub": 0
            },
            "stats": {
                "version": "718146'1005652863",
                "reported_seq": 92,
                "reported_epoch": 718151,
                "state": "peering",
                "last_fresh": "2022-06-15T14:30:08.220366+0300",
                "last_change": "2022-06-15T14:30:08.220331+0300",
                "last_active": "0.000000",
                "last_peered": "0.000000",
                "last_clean": "0.000000",
                "last_became_active": "0.000000",
                "last_became_peered": "0.000000",
                "last_unstale": "2022-06-15T14:30:08.220366+0300",
                "last_undegraded": "2022-06-15T14:30:08.220366+0300",
                "last_fullsized": "2022-06-15T14:30:08.220366+0300",
                "mapping_epoch": 723045,
                "log_start": "718129'1005642841",
                "ondisk_log_start": "718129'1005642841",
                "created": 6813,
                "last_epoch_clean": 713865,
                "parent": "0.0",
                "parent_split_bits": 0,
                "last_scrub": "710579'988167138",
                "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "last_deep_scrub": "708883'971173140",
                "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
                "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "log_size": 10022,
                "ondisk_log_size": 10022,
                "stats_invalid": false,
                "dirty_stats_invalid": false,
                "omap_stats_invalid": false,
                "hitset_stats_invalid": false,
                "hitset_bytes_stats_invalid": false,
                "pin_stats_invalid": false,
                "manifest_stats_invalid": false,
                "snaptrimq_len": 0,
                "stat_sum": {
                    "num_bytes": 0,
                    "num_objects": 0,
                    "num_object_clones": 0,
                    "num_object_copies": 0,
                    "num_objects_missing_on_primary": 0,
                    "num_objects_missing": 0,
                    "num_objects_degraded": 0,
                    "num_objects_misplaced": 0,
                    "num_objects_unfound": 0,
                    "num_objects_dirty": 0,
                    "num_whiteouts": 0,
                    "num_read": 0,
                    "num_read_kb": 0,
                    "num_write": 0,
                    "num_write_kb": 0,
                    "num_scrub_errors": 0,
                    "num_shallow_scrub_errors": 0,
                    "num_deep_scrub_errors": 0,
                    "num_objects_recovered": 0,
                    "num_bytes_recovered": 0,
                    "num_keys_recovered": 0,
                    "num_objects_omap": 0,
                    "num_objects_hit_set_archive": 0,
                    "num_bytes_hit_set_archive": 0,
                    "num_flush": 0,
                    "num_flush_kb": 0,
                    "num_evict": 0,
                    "num_evict_kb": 0,
                    "num_promote": 0,
                    "num_flush_mode_high": 0,
                    "num_flush_mode_low": 0,
                    "num_evict_mode_some": 0,
                    "num_evict_mode_full": 0,
                    "num_objects_pinned": 0,
                    "num_legacy_snapsets": 0,
                    "num_large_omap_objects": 0,
                    "num_objects_manifest": 0,
                    "num_omap_bytes": 0,
                    "num_omap_keys": 0,
                    "num_objects_repaired": 0
                },
                "up": [
                    41,
                    141
                ],
                "acting": [
                    131
                ],
                "avail_no_missing": [],
                "object_location_counts": [],
                "blocked_by": [
                    41,
                    348
                ],
                "up_primary": 41,
                "acting_primary": 131,
                "purged_snaps": []
            },
            "empty": 0,
            "dne": 0,
            "incomplete": 1,
            "last_epoch_started": 720232,
            "hit_set_history": {
                "current_last_update": "0'0",
                "history": []
            }
        },
        {
            "peer": "348",
            "pgid": "16.1e",
            "last_update": "720570'1025257287",
            "last_complete": "720570'1025257287",
            "log_tail": "720569'1025247241",
            "last_user_version": 1025257287,
            "last_backfill": "MIN",
            "purged_snaps": [],
            "history": {
                "epoch_created": 6813,
                "epoch_pool_created": 6813,
                "last_epoch_started": 723039,
                "last_interval_started": 723038,
                "last_epoch_clean": 713865,
                "last_interval_clean": 713864,
                "last_epoch_split": 53947,
                "last_epoch_marked_full": 76013,
                "same_up_since": 723045,
                "same_interval_since": 723045,
                "same_primary_since": 722482,
                "last_scrub": "710579'988167138",
                "last_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "last_deep_scrub": "708883'971173140",
                "last_deep_scrub_stamp": "2022-06-03T01:20:13.786232+0300",
                "last_clean_scrub_stamp": "2022-06-09T03:27:36.771392+0300",
                "prior_readable_until_ub": 0
            },
            "stats": {
                "version": "0'0",
                "reported_seq": 0,
                "reported_epoch": 0,
                "state": "unknown",
                "last_fresh": "0.000000",
                "last_change": "0.000000",
                "last_active": "0.000000",
                "last_peered": "0.000000",
                "last_clean": "0.000000",
                "last_became_active": "0.000000",
                "last_became_peered": "0.000000",
                "last_unstale": "0.000000",
                "last_undegraded": "0.000000",
                "last_fullsized": "0.000000",
                "mapping_epoch": 723045,
                "log_start": "0'0",
                "ondisk_log_start": "0'0",
                "created": 0,
                "last_epoch_clean": 0,
                "parent": "0.0",
                "parent_split_bits": 0,
                "last_scrub": "0'0",
                "last_scrub_stamp": "0.000000",
                "last_deep_scrub": "0'0",
                "last_deep_scrub_stamp": "0.000000",
                "last_clean_scrub_stamp": "0.000000",
                "log_size": 0,
                "ondisk_log_size": 0,
                "stats_invalid": false,
                "dirty_stats_invalid": false,
                "omap_stats_invalid": false,
                "hitset_stats_invalid": false,
                "hitset_bytes_stats_invalid": false,
                "pin_stats_invalid": false,
                "manifest_stats_invalid": false,
                "snaptrimq_len": 0,
                "stat_sum": {
                    "num_bytes": 0,
                    "num_objects": 0,
                    "num_object_clones": 0,
                    "num_object_copies": 0,
                    "num_objects_missing_on_primary": 0,
                    "num_objects_missing": 0,
                    "num_objects_degraded": 0,
                    "num_objects_misplaced": 0,
                    "num_objects_unfound": 0,
                    "num_objects_dirty": 0,
                    "num_whiteouts": 0,
                    "num_read": 0,
                    "num_read_kb": 0,
                    "num_write": 0,
                    "num_write_kb": 0,
                    "num_scrub_errors": 0,
                    "num_shallow_scrub_errors": 0,
                    "num_deep_scrub_errors": 0,
                    "num_objects_recovered": 0,
                    "num_bytes_recovered": 0,
                    "num_keys_recovered": 0,
                    "num_objects_omap": 0,
                    "num_objects_hit_set_archive": 0,
                    "num_bytes_hit_set_archive": 0,
                    "num_flush": 0,
                    "num_flush_kb": 0,
                    "num_evict": 0,
                    "num_evict_kb": 0,
                    "num_promote": 0,
                    "num_flush_mode_high": 0,
                    "num_flush_mode_low": 0,
                    "num_evict_mode_some": 0,
                    "num_evict_mode_full": 0,
                    "num_objects_pinned": 0,
                    "num_legacy_snapsets": 0,
                    "num_large_omap_objects": 0,
                    "num_objects_manifest": 0,
                    "num_omap_bytes": 0,
                    "num_omap_keys": 0,
                    "num_objects_repaired": 0
                },
                "up": [
                    41,
                    141
                ],
                "acting": [
                    131
                ],
                "avail_no_missing": [],
                "object_location_counts": [],
                "blocked_by": [],
                "up_primary": 41,
                "acting_primary": 131,
                "purged_snaps": []
            },
            "empty": 0,
            "dne": 0,
            "incomplete": 1,
            "last_epoch_started": 720291,
            "hit_set_history": {
                "current_last_update": "0'0",
                "history": []
            }
        }
    ],
    "recovery_state": [
        {
            "name": "Started/Primary/Active",
            "enter_time": "2022-06-29T13:44:37.073051+0300",
            "might_have_unfound": [
                {
                    "osd": "0",
                    "status": "not queried"
                },
                {
                    "osd": "41",
                    "status": "already probed"
                },
                {
                    "osd": "42",
                    "status": "already probed"
                },
                {
                    "osd": "141",
                    "status": "already probed"
                },
                {
                    "osd": "210",
                    "status": "already probed"
                },
                {
                    "osd": "348",
                    "status": "already probed"
                }
            ],
            "recovery_progress": {
                "backfill_targets": [
                    "41",
                    "141"
                ],
                "waiting_on_backfill": [],
                "last_backfill_started": "MIN",
                "backfill_info": {
                    "begin": "MIN",
                    "end": "MIN",
                    "objects": []
                },
                "peer_backfill_info": [],
                "backfills_in_flight": [],
                "recovering": [],
                "pg_backend": {
                    "pull_from_peer": [],
                    "pushing": []
                }
            }
        },
        {
            "name": "Started",
            "enter_time": "2022-06-29T13:44:36.027931+0300"
        }
    ],
    "scrubber": {
        "epoch_start": "0",
        "active": false
    },
    "agent_state": {}
}
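
A note for anyone reading the query output above: the part most relevant to the unfound object is the "might_have_unfound" list under "recovery_state". Every OSD in it is "already probed" except osd.0, which is "not queried". Below is a minimal Python sketch for pulling that list out of the JSON. It assumes the output was saved with something like `ceph pg 16.1e query > pg_query.json`; the file name and the helper name `unprobed_osds` are made up for illustration and are not part of Ceph.

```python
import json


def unprobed_osds(pg_query):
    """Return (osd, status) pairs from might_have_unfound that were not probed.

    pg_query is the parsed JSON of a `ceph pg <pgid> query` dump.
    """
    result = []
    for state in pg_query.get("recovery_state", []):
        for entry in state.get("might_have_unfound", []):
            if entry.get("status") != "already probed":
                result.append((entry["osd"], entry["status"]))
    return result


# Trimmed copy of the recovery_state section from the output above.
# With a saved dump, use instead (hypothetical file name):
#   with open("pg_query.json") as f:
#       sample = json.load(f)
sample = json.loads("""
{
  "recovery_state": [
    {
      "name": "Started/Primary/Active",
      "might_have_unfound": [
        {"osd": "0",   "status": "not queried"},
        {"osd": "41",  "status": "already probed"},
        {"osd": "42",  "status": "already probed"},
        {"osd": "141", "status": "already probed"},
        {"osd": "210", "status": "already probed"},
        {"osd": "348", "status": "already probed"}
      ]
    },
    {"name": "Started"}
  ]
}
""")

print(unprobed_osds(sample))  # [('0', 'not queried')]
```

For this PG the only candidate that has not been asked for the object is osd.0.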

grep 16.1e /var/log/ceph/ceph-osd.131.log


2022-06-29T12:33:49.213+0300 7f37cefad700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:49:29.272+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:50:46.629+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:54:00.683+0300 7f37cefad700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:04:05.758+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:12:52.468+0300 7f37cefad700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:13:13.188+0300 7f37cafa5700  1 osd.131 pg_epoch: 723033 pg[16.1e( v 723032'1044565479 lc 0'0 (723031'1044555441,723032'1044565479] local-lis/les=722726/722727 n=7259 ec=6813/6813 lis/c=722726/713864 les/c/f=722727/713865/76013 sis=723033 pruub=8.442323685s) [41,131]/[131] backfill=[41,141] r=0 lpr=723033 pi=[713864,723033)/4 crt=723032'1044565479 mlcod 0'0 active+remapped pruub 160422.625000000s@ MUST_REPAIR MUST_DEEP_SCRUB MUST_SCRUB planned REQ_SCRUB m=1 u=1 mbc={}] start_peering_interval up [41,141] -> [41,131], acting [131] -> [131], acting_primary 131 -> 131, up_primary 41 -> 41, role 0 -> 0, features acting 4540138297136906239 upacting 4540138297136906239
2022-06-29T13:13:13.192+0300 7f37cafa5700  1 osd.131 pg_epoch: 723033 pg[16.1e( v 723032'1044565479 lc 0'0 (723031'1044555441,723032'1044565479] local-lis/les=722726/722727 n=7259 ec=6813/6813 lis/c=722726/713864 les/c/f=722727/713865/76013 sis=723033 pruub=8.442323685s) [41,131]/[131] r=0 lpr=723033 pi=[713864,723033)/4 crt=723032'1044565479 mlcod 0'0 remapped pruub 160422.625000000s@ MUST_REPAIR MUST_DEEP_SCRUB MUST_SCRUB planned REQ_SCRUB m=1 mbc={}] state<Start>: transitioning to Primary
2022-06-29T13:13:14.712+0300 7f37cefad700  0 log_channel(cluster) log [INF] : 16.1e continuing backfill to osd.41 from (723031'1044555441,723032'1044565479] MIN to 723032'1044565479
2022-06-29T13:18:01.451+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:27:10.766+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:34:42.955+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:37:06.973+0300 7f37cefad700  1 osd.131 pg_epoch: 723038 pg[16.1e( v 723037'1044611140 lc 0'0 (723036'1044601041,723037'1044611140] local-lis/les=723033/723034 n=7254 ec=6813/6813 lis/c=723033/713864 les/c/f=723034/713865/76013 sis=723038 pruub=15.828340530s) [42,131]/[131] backfill=[41] r=0 lpr=723038 pi=[713864,723038)/4 crt=723037'1044611140 mlcod 0'0 active+remapped pruub 161863.812500000s@ MUST_REPAIR MUST_DEEP_SCRUB MUST_SCRUB planned REQ_SCRUB m=1 u=1 mbc={}] start_peering_interval up [41,131] -> [42,131], acting [131] -> [131], acting_primary 131 -> 131, up_primary 41 -> 42, role 0 -> 0, features acting 4540138297136906239 upacting 4540138297136906239
2022-06-29T13:37:06.973+0300 7f37cefad700  1 osd.131 pg_epoch: 723038 pg[16.1e( v 723037'1044611140 lc 0'0 (723036'1044601041,723037'1044611140] local-lis/les=723033/723034 n=7254 ec=6813/6813 lis/c=723033/713864 les/c/f=723034/713865/76013 sis=723038 pruub=15.828340530s) [42,131]/[131] r=0 lpr=723038 pi=[713864,723038)/4 crt=723037'1044611140 mlcod 0'0 remapped pruub 161863.812500000s@ MUST_REPAIR MUST_DEEP_SCRUB MUST_SCRUB planned REQ_SCRUB m=1 mbc={}] state<Start>: transitioning to Primary
2022-06-29T13:37:08.005+0300 7f37cefad700  0 log_channel(cluster) log [DBG] : 16.1e starting backfill to osd.42 from (722712'1041612441,722712'1041622482] MIN to 723037'1044611140
2022-06-29T13:40:12.043+0300 7f37cefad700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:41:09.416+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:42:27.073+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:42:34.633+0300 7f37cefad700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:42:39.753+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:44:36.027+0300 7f37cafa5700  1 osd.131 pg_epoch: 723045 pg[16.1e( v 723044'1044624668 lc 0'0 (723039'1044614641,723044'1044624668] local-lis/les=723038/723039 n=7265 ec=6813/6813 lis/c=723038/713864 les/c/f=723039/713865/76013 sis=723045 pruub=8.584972382s) [41,141]/[131] backfill=[42] r=0 lpr=723045 pi=[713864,723045)/4 crt=723044'1044624668 mlcod 0'0 active+remapped pruub 162305.609375000s@ MUST_REPAIR MUST_DEEP_SCRUB MUST_SCRUB planned REQ_SCRUB m=1 u=1 mbc={}] start_peering_interval up [42,131] -> [41,141], acting [131] -> [131], acting_primary 131 -> 131, up_primary 42 -> 41, role 0 -> 0, features acting 4540138297136906239 upacting 4540138297136906239
2022-06-29T13:44:36.027+0300 7f37cafa5700  1 osd.131 pg_epoch: 723045 pg[16.1e( v 723044'1044624668 lc 0'0 (723039'1044614641,723044'1044624668] local-lis/les=723038/723039 n=7265 ec=6813/6813 lis/c=723038/713864 les/c/f=723039/713865/76013 sis=723045 pruub=8.584972382s) [41,141]/[131] r=0 lpr=723045 pi=[713864,723045)/4 crt=723044'1044624668 mlcod 0'0 remapped pruub 162305.609375000s@ MUST_REPAIR MUST_DEEP_SCRUB MUST_SCRUB planned REQ_SCRUB m=1 mbc={}] state<Start>: transitioning to Primary
2022-06-29T13:44:37.071+0300 7f37cefad700  0 log_channel(cluster) log [DBG] : 16.1e starting backfill to osd.41 from (723036'1044601041,723037'1044611140] MIN to 723044'1044624668
2022-06-29T13:44:37.099+0300 7f37cefad700  0 log_channel(cluster) log [DBG] : 16.1e starting backfill to osd.141 from (723031'1044555441,723032'1044565479] MIN to 723044'1044624668
2022-06-29T13:46:59.696+0300 7f37cefad700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:47:01.196+0300 7f37cafa5700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:48:47.026+0300 7f37cefad700 -1 log_channel(cluster) log [ERR] : 16.1e has 1 objects unfound and apparently lost
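
The start_peering_interval lines in the osd.131 log above show the up set changing three times within about half an hour while the acting set stays [131]. A small Python sketch to extract those transitions from the log is given below. The regular expression is written only against the line format visible in this log, and the sample lines are shortened (the pg[...] details are replaced with "..."), so treat it as an illustration rather than a general Ceph log parser. The name `peering_changes` is hypothetical.

```python
import re

# Matches the tail of a start_peering_interval line as it appears in the log above.
PEERING_RE = re.compile(
    r"pg_epoch: (?P<epoch>\d+) .*start_peering_interval "
    r"up \[(?P<up_old>[\d,]*)\] -> \[(?P<up_new>[\d,]*)\], "
    r"acting \[(?P<act_old>[\d,]*)\] -> \[(?P<act_new>[\d,]*)\]"
)


def peering_changes(lines):
    """Return (epoch, old_up, new_up, new_acting) for each peering interval change."""
    changes = []
    for line in lines:
        m = PEERING_RE.search(line)
        if m:
            changes.append(
                (int(m["epoch"]), m["up_old"], m["up_new"], m["act_new"])
            )
    return changes


# Shortened copies of three lines from the osd.131 log ("..." marks omitted text).
sample_lines = [
    "2022-06-29T13:13:13.188+0300 7f37cafa5700  1 osd.131 pg_epoch: 723033 pg[16.1e( ... ] "
    "start_peering_interval up [41,141] -> [41,131], acting [131] -> [131], acting_primary 131 -> 131",
    "2022-06-29T13:37:06.973+0300 7f37cefad700  1 osd.131 pg_epoch: 723038 pg[16.1e( ... ] "
    "start_peering_interval up [41,131] -> [42,131], acting [131] -> [131], acting_primary 131 -> 131",
    "2022-06-29T13:44:36.027+0300 7f37cafa5700  1 osd.131 pg_epoch: 723045 pg[16.1e( ... ] "
    "start_peering_interval up [42,131] -> [41,141], acting [131] -> [131], acting_primary 131 -> 131",
    "2022-06-29T13:46:59.696+0300 7f37cefad700 -1 log_channel(cluster) log [ERR] : "
    "16.1e has 1 objects unfound and apparently lost",
]

for epoch, up_old, up_new, acting in peering_changes(sample_lines):
    print(f"epoch {epoch}: up [{up_old}] -> [{up_new}], acting [{acting}]")
```

On a real log, the lines could be read with `open("/var/log/ceph/ceph-osd.131.log")` and passed to the same function.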

grep 16.1e /var/log/ceph/ceph-osd.41.log

2022-06-29T13:13:13.191+0300 7f61c4c26700  1 osd.41 pg_epoch: 723033 pg[16.1e( v 723032'1044565479 (723031'1044555441,723032'1044565479] lb MIN local-lis/les=722713/722714 n=0 ec=6813/6813 lis/c=722713/713864 les/c/f=722714/713865/76013 sis=723033) [41,131]/[131] r=-1 lpr=723033 pi=[713864,723033)/4 luod=0'0 lua=722929'1043551464 crt=723032'1044565479 lcod 723032'1044565478 mlcod 0'0 active+remapped mbc={}] start_peering_interval up [41,141] -> [41,131], acting [131] -> [131], acting_primary 131 -> 131, up_primary 41 -> 41, role -1 -> -1, features acting 4540138297136906239 upacting 4540138297136906239
2022-06-29T13:13:13.195+0300 7f61c4c26700  1 osd.41 pg_epoch: 723033 pg[16.1e( v 723032'1044565479 (723031'1044555441,723032'1044565479] lb MIN local-lis/les=722713/722714 n=0 ec=6813/6813 lis/c=722713/713864 les/c/f=722714/713865/76013 sis=723033) [41,131]/[131] r=-1 lpr=723033 pi=[713864,723033)/4 crt=723032'1044565479 lcod 723032'1044565478 mlcod 0'0 remapped NOTIFY mbc={}] state<Start>: transitioning to Stray
2022-06-29T13:37:06.989+0300 7f61c4c26700  1 osd.41 pg_epoch: 723038 pg[16.1e( v 723037'1044611140 (723036'1044601041,723037'1044611140] lb MIN local-lis/les=722713/722714 n=0 ec=6813/6813 lis/c=722713/713864 les/c/f=722714/713865/76013 sis=723038) [42,131]/[131] r=-1 lpr=723038 pi=[713864,723038)/4 luod=0'0 lua=723032'1044565479 crt=723037'1044611140 lcod 723037'1044611139 mlcod 0'0 active+remapped mbc={}] start_peering_interval up [41,131] -> [42,131], acting [131] -> [131], acting_primary 131 -> 131, up_primary 41 -> 42, role -1 -> -1, features acting 4540138297136906239 upacting 4540138297136906239
2022-06-29T13:37:06.989+0300 7f61c4c26700  1 osd.41 pg_epoch: 723038 pg[16.1e( v 723037'1044611140 (723036'1044601041,723037'1044611140] lb MIN local-lis/les=722713/722714 n=0 ec=6813/6813 lis/c=722713/713864 les/c/f=722714/713865/76013 sis=723038) [42,131]/[131] r=-1 lpr=723038 pi=[713864,723038)/4 crt=723037'1044611140 lcod 723037'1044611139 mlcod 0'0 remapped NOTIFY mbc={}] state<Start>: transitioning to Stray
2022-06-29T13:44:36.023+0300 7f61c4c26700  1 osd.41 pg_epoch: 723045 pg[16.1e( v 723037'1044611140 (723036'1044601041,723037'1044611140] lb MIN local-lis/les=722713/722714 n=0 ec=6813/6813 lis/c=723033/713864 les/c/f=723034/713865/76013 sis=723045) [41,141]/[131] r=-1 lpr=723045 pi=[713864,723045)/4 crt=723037'1044611140 lcod 723037'1044611139 mlcod 0'0 remapped NOTIFY mbc={}] start_peering_interval up [42,131] -> [41,141], acting [131] -> [131], acting_primary 131 -> 131, up_primary 42 -> 41, role -1 -> -1, features acting 4540138297136906239 upacting 4540138297136906239
2022-06-29T13:44:36.023+0300 7f61c4c26700  1 osd.41 pg_epoch: 723045 pg[16.1e( v 723037'1044611140 (723036'1044601041,723037'1044611140] lb MIN local-lis/les=722713/722714 n=0 ec=6813/6813 lis/c=723033/713864 les/c/f=723034/713865/76013 sis=723045) [41,141]/[131] r=-1 lpr=723045 pi=[713864,723045)/4 crt=723037'1044611140 lcod 723037'1044611139 mlcod 0'0 remapped NOTIFY mbc={}] state<Start>: transitioning to Stray

grep -w 16.1e /var/log/ceph/ceph.log

2022-06-29T11:40:00.000887+0300 mon.ceph-nimbus-05 (mon.0) 1023041 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T11:40:00.000908+0300 mon.ceph-nimbus-05 (mon.0) 1023043 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T11:48:32.755122+0300 osd.131 (osd.131) 554 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T11:50:00.000336+0300 mon.ceph-nimbus-05 (mon.0) 1023350 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T11:50:00.000356+0300 mon.ceph-nimbus-05 (mon.0) 1023352 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T11:50:00.000383+0300 mon.ceph-nimbus-05 (mon.0) 1023354 : cluster [ERR]     pg 16.1e is stuck undersized for 24h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T11:50:00.000405+0300 mon.ceph-nimbus-05 (mon.0) 1023356 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T11:50:00.000429+0300 mon.ceph-nimbus-05 (mon.0) 1023358 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T11:50:39.961350+0300 osd.131 (osd.131) 555 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T11:56:04.370136+0300 osd.131 (osd.131) 556 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:00:00.000467+0300 mon.ceph-nimbus-05 (mon.0) 1023653 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T12:00:00.000509+0300 mon.ceph-nimbus-05 (mon.0) 1023655 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T12:00:00.000537+0300 mon.ceph-nimbus-05 (mon.0) 1023657 : cluster [ERR]     pg 16.1e is stuck undersized for 24h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T12:00:00.000663+0300 mon.ceph-nimbus-05 (mon.0) 1023659 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T12:00:00.000695+0300 mon.ceph-nimbus-05 (mon.0) 1023661 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T12:04:28.946537+0300 osd.131 (osd.131) 557 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:09:05.789427+0300 osd.131 (osd.131) 558 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:10:00.000406+0300 mon.ceph-nimbus-05 (mon.0) 1023955 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T12:10:00.000433+0300 mon.ceph-nimbus-05 (mon.0) 1023957 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T12:10:00.000454+0300 mon.ceph-nimbus-05 (mon.0) 1023959 : cluster [ERR]     pg 16.1e is stuck undersized for 25h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T12:10:00.000478+0300 mon.ceph-nimbus-05 (mon.0) 1023961 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T12:10:00.000500+0300 mon.ceph-nimbus-05 (mon.0) 1023963 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T12:13:33.139030+0300 osd.131 (osd.131) 559 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:20:00.000376+0300 mon.ceph-nimbus-05 (mon.0) 1024251 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T12:20:00.000407+0300 mon.ceph-nimbus-05 (mon.0) 1024253 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T12:20:00.000432+0300 mon.ceph-nimbus-05 (mon.0) 1024255 : cluster [ERR]     pg 16.1e is stuck undersized for 25h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T12:20:00.000460+0300 mon.ceph-nimbus-05 (mon.0) 1024257 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T12:20:00.000513+0300 mon.ceph-nimbus-05 (mon.0) 1024259 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T12:27:58.826058+0300 osd.131 (osd.131) 560 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:28:43.300823+0300 osd.131 (osd.131) 561 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:30:00.000288+0300 mon.ceph-nimbus-05 (mon.0) 1024561 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T12:30:00.000311+0300 mon.ceph-nimbus-05 (mon.0) 1024563 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T12:30:00.000334+0300 mon.ceph-nimbus-05 (mon.0) 1024565 : cluster [ERR]     pg 16.1e is stuck undersized for 25h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T12:30:00.000355+0300 mon.ceph-nimbus-05 (mon.0) 1024567 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T12:30:00.000385+0300 mon.ceph-nimbus-05 (mon.0) 1024569 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T12:33:23.373298+0300 osd.131 (osd.131) 562 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:33:49.216370+0300 osd.131 (osd.131) 563 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:40:00.000336+0300 mon.ceph-nimbus-05 (mon.0) 1024869 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T12:40:00.000369+0300 mon.ceph-nimbus-05 (mon.0) 1024871 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T12:40:00.000399+0300 mon.ceph-nimbus-05 (mon.0) 1024873 : cluster [ERR]     pg 16.1e is stuck undersized for 25h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T12:40:00.000429+0300 mon.ceph-nimbus-05 (mon.0) 1024875 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T12:40:00.000465+0300 mon.ceph-nimbus-05 (mon.0) 1024877 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T12:49:29.277633+0300 osd.131 (osd.131) 565 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:50:00.000402+0300 mon.ceph-nimbus-05 (mon.0) 1025177 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T12:50:00.000434+0300 mon.ceph-nimbus-05 (mon.0) 1025179 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T12:50:00.000464+0300 mon.ceph-nimbus-05 (mon.0) 1025181 : cluster [ERR]     pg 16.1e is stuck undersized for 25h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T12:50:00.000496+0300 mon.ceph-nimbus-05 (mon.0) 1025183 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T12:50:00.000526+0300 mon.ceph-nimbus-05 (mon.0) 1025185 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T12:50:46.632326+0300 osd.131 (osd.131) 566 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T12:54:00.688822+0300 osd.131 (osd.131) 567 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:00:00.000350+0300 mon.ceph-nimbus-05 (mon.0) 1025477 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T13:00:00.000371+0300 mon.ceph-nimbus-05 (mon.0) 1025479 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T13:00:00.000401+0300 mon.ceph-nimbus-05 (mon.0) 1025481 : cluster [ERR]     pg 16.1e is stuck undersized for 25h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T13:00:00.000435+0300 mon.ceph-nimbus-05 (mon.0) 1025483 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T13:00:00.000466+0300 mon.ceph-nimbus-05 (mon.0) 1025485 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T13:04:05.760884+0300 osd.131 (osd.131) 568 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:10:00.000364+0300 mon.ceph-nimbus-05 (mon.0) 1025789 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T13:10:00.000389+0300 mon.ceph-nimbus-05 (mon.0) 1025791 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T13:10:00.000414+0300 mon.ceph-nimbus-05 (mon.0) 1025793 : cluster [ERR]     pg 16.1e is stuck undersized for 26h, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T13:10:00.000435+0300 mon.ceph-nimbus-05 (mon.0) 1025795 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T13:10:00.000464+0300 mon.ceph-nimbus-05 (mon.0) 1025797 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T13:12:52.472457+0300 osd.131 (osd.131) 569 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:13:14.718778+0300 osd.131 (osd.131) 570 : cluster [INF] 16.1e continuing backfill to osd.41 from (723031'1044555441,723032'1044565479] MIN to 723032'1044565479
2022-06-29T13:18:01.458724+0300 osd.131 (osd.131) 571 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:20:00.000327+0300 mon.ceph-nimbus-05 (mon.0) 1026102 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T13:20:00.000358+0300 mon.ceph-nimbus-05 (mon.0) 1026104 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T13:20:00.000383+0300 mon.ceph-nimbus-05 (mon.0) 1026106 : cluster [ERR]     pg 16.1e is stuck undersized for 6m, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T13:20:00.000405+0300 mon.ceph-nimbus-05 (mon.0) 1026108 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T13:20:00.000428+0300 mon.ceph-nimbus-05 (mon.0) 1026110 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T13:27:10.770987+0300 osd.131 (osd.131) 572 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:30:00.000300+0300 mon.ceph-nimbus-05 (mon.0) 1026410 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T13:30:00.000325+0300 mon.ceph-nimbus-05 (mon.0) 1026412 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T13:30:00.000349+0300 mon.ceph-nimbus-05 (mon.0) 1026414 : cluster [ERR]     pg 16.1e is stuck undersized for 16m, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T13:30:00.000388+0300 mon.ceph-nimbus-05 (mon.0) 1026416 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T13:30:00.000410+0300 mon.ceph-nimbus-05 (mon.0) 1026418 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T13:34:42.961359+0300 osd.131 (osd.131) 573 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:37:08.007290+0300 osd.131 (osd.131) 574 : cluster [DBG] 16.1e starting backfill to osd.42 from (722712'1041612441,722712'1041622482] MIN to 723037'1044611140
2022-06-29T13:40:00.000306+0300 mon.ceph-nimbus-05 (mon.0) 1026733 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T13:40:00.000324+0300 mon.ceph-nimbus-05 (mon.0) 1026735 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T13:40:00.000342+0300 mon.ceph-nimbus-05 (mon.0) 1026737 : cluster [ERR]     pg 16.1e is stuck undersized for 2m, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T13:40:00.000377+0300 mon.ceph-nimbus-05 (mon.0) 1026739 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T13:40:00.000400+0300 mon.ceph-nimbus-05 (mon.0) 1026741 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
2022-06-29T13:40:12.048711+0300 osd.131 (osd.131) 575 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:41:09.419580+0300 osd.131 (osd.131) 576 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:42:27.078924+0300 osd.131 (osd.131) 577 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:42:34.636260+0300 osd.131 (osd.131) 578 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:42:39.756345+0300 osd.131 (osd.131) 579 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:44:37.073096+0300 osd.131 (osd.131) 580 : cluster [DBG] 16.1e starting backfill to osd.41 from (723036'1044601041,723037'1044611140] MIN to 723044'1044624668
2022-06-29T13:44:37.103043+0300 osd.131 (osd.131) 581 : cluster [DBG] 16.1e starting backfill to osd.141 from (723031'1044555441,723032'1044565479] MIN to 723044'1044624668
2022-06-29T13:46:59.702124+0300 osd.131 (osd.131) 582 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:47:01.201601+0300 osd.131 (osd.131) 583 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:48:47.029613+0300 osd.131 (osd.131) 584 : cluster [ERR] 16.1e has 1 objects unfound and apparently lost
2022-06-29T13:50:00.000384+0300 mon.ceph-nimbus-05 (mon.0) 1027052 : cluster [ERR]     pg 16.1e has 1 unfound objects
2022-06-29T13:50:00.000414+0300 mon.ceph-nimbus-05 (mon.0) 1027054 : cluster [ERR]     pg 16.1e is active+recovery_unfound+undersized+degraded+remapped, acting [131], 1 unfound
2022-06-29T13:50:00.000443+0300 mon.ceph-nimbus-05 (mon.0) 1027056 : cluster [ERR]     pg 16.1e is stuck undersized for 5m, current state active+recovery_unfound+undersized+degraded+remapped, last acting [131]
2022-06-29T13:50:00.000474+0300 mon.ceph-nimbus-05 (mon.0) 1027058 : cluster [ERR]     pg 16.1e not deep-scrubbed since 2022-06-03T01:20:13.786232+0300
2022-06-29T13:50:00.000506+0300 mon.ceph-nimbus-05 (mon.0) 1027060 : cluster [ERR]     pg 16.1e not scrubbed since 2022-06-09T03:27:36.771392+0300
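
The pattern in the ceph.log excerpt above is easier to see when the events for pg 16.1e are tallied. Below is a minimal sketch in Python, assuming the cluster-log line layout shown above (timestamp, daemon, sequence number, `cluster [LEVEL]`, message). The function name `summarize` and the regular expressions are illustrative only and are not part of any Ceph tooling.

```python
import re
from collections import Counter

# Layout assumed from the excerpt above:
# <timestamp> <daemon> (<daemon>) <seq> : cluster [<LEVEL>] <message>
LINE_RE = re.compile(
    r"^(?P<ts>\S+) (?P<who>\S+) \(\S+\) \d+ : cluster \[(?P<level>\w+)\]\s+(?P<msg>.*)$"
)
BACKFILL_RE = re.compile(r"(?:starting|continuing) backfill to (osd\.\d+)")
STUCK_RE = re.compile(r"stuck undersized for (\S+),")


def summarize(lines):
    """Tally unfound reports, backfill targets and 'stuck undersized' ages."""
    unfound = 0
    backfill_targets = Counter()
    stuck_ages = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m is None:
            continue  # skip lines that do not follow the assumed layout
        msg = m.group("msg")
        if "objects unfound and apparently lost" in msg:
            unfound += 1
        b = BACKFILL_RE.search(msg)
        if b:
            backfill_targets[b.group(1)] += 1
        s = STUCK_RE.search(msg)
        if s:
            # keep the timestamp so resets of the age can be lined up with backfills
            stuck_ages.append((m.group("ts"), s.group(1)))
    return unfound, backfill_targets, stuck_ages
```

It can be called as `summarize(open("ceph.log"))`, where the file name is a placeholder. In the excerpt above the reported "stuck undersized" age goes 26h, 6m, 16m, 2m, 5m, and each drop appears to line up with a backfill start or continue message from osd.131, while the unfound report keeps repeating.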

_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx