Our cluster has a little over 100 RBDs. Each RBD is snapshotted with a typical "frequently", hourly, daily, monthly type of schedule.
A while back a 4th monitor was temporarily added to the cluster that took hours to synchronize with the other 3.
While trying to figure out why that addition took so long, we discovered that our monitors have what seems like a really large number of osd_snap keys:
ceph-monstore-tool /var/lib/ceph/mon/xxxxxx dump-keys |awk '{print $1}'|uniq -c
153 auth
2 config
10 health
1441 logm
3 mdsmap
313 mgr
1 mgr_command_descs
3 mgr_metadata
163 mgrstat
1 mkfs
323 mon_config_key
1 mon_sync
6 monitor
1 monitor_store
32 monmap
120 osd_metadata
1 osd_pg_creating
5818618 osd_snap 41338 osdmap
754 paxos
A few questions:
Could this be the cause of the slow addition/synchronization?
Is what looks like an unbounded number of osd_snaps expected?
If trimming/compacting them would help, how would one do that?
Thanks,
Brian
The materials in this message are private and may contain Protected Healthcare Information or other information of a sensitive nature. If you are not the intended recipient, be advised that any unauthorized use, disclosure, copying or the taking of any action in reliance on the contents of this information is strictly prohibited. If you have received this email in error, please immediately notify the sender via telephone or return mail. |
_______________________________________________ ceph-users mailing list -- ceph-users@xxxxxxx To unsubscribe send an email to ceph-users-leave@xxxxxxx