Re: Fencing an entire client cluster from access to Ceph (in kubernetes)

Jason Dillaman <jdillama@xxxxxxxxxx> · Thu, 29 Oct 2020 09:11:53 -0400

On Thu, Oct 29, 2020 at 1:10 AM Madhu Rajanna <mrajanna@xxxxxxxxxx> wrote:
>
> A couple of scenarios we may need to consider for fencing.
>
> * Fencing the workload of a namespace, when we want to move workload only for a namespace, not for
> all namespace (the non-critical workload won't be moved to the secondary site but they need to be started again once the primary cluster is recovered).

Couldn't each Ceph namespace utilize a different CephX user? In the
case of sync (metro) DR or isolating multiple k8s clusters are
independent tenants sharing the same Ceph cluster resources, it seems
like this would just be different CephX users.

> * Few applications which are critical in a namespace may need to be fenced(applications may be running on the different nodes) when they moved to the secondary site.
> * We also need to revert back to the same state(unblocklist) once the primary cluster is recovered.

That was why I was proposing the revocation list idea since there
would be no need to blocklist / unblocklist. The revocation list would
force a client to obtain a new ticket -- but by that time the caps
should have been updated to revoke its privileges as needed.

> * Do we need to consider anything In the case of Async DR? if the primary cluster control plane is dead?

For RBD, this would be a forced-failover so there actually would be no
need for fencing since it's handled as a split-brain event. Plus, the
nature of async DR implies that you have two separate Ceph clusters.

> Or do we need to fence all the clients in the primary cluster in case of DR?
> _______________________________________________
> Dev mailing list -- dev@xxxxxxx
> To unsubscribe send an email to dev-leave@xxxxxxx
>

-- 
Jason
_______________________________________________
Dev mailing list -- dev@xxxxxxx
To unsubscribe send an email to dev-leave@xxxxxxx