Proposal: More pause options for disaster recovery control #360

chlunde · 2022-10-06T09:02:28Z

Just writing down two related ideas here

What problem are you facing?

Disaster recovery or migrating resources to other clusters is hard and scary.

How could Crossplane help solve your problem?

During migration or disaster recovery, it is difficult to set "pause" on every resource individually. It would be nice to be able to pause a whole provider, for example with a CLI argument such as --pause.

It would also be nice to have a pause option which would Observe but not Create/Update/Delete. This would give an operator confidence in what kinds of actions would run when the cluster is unpaused. This might be a different CLI option or annotation.


bobh66 · 2022-10-06T15:01:51Z

Another way to completely disable a provider is to set replicas to 0 in the provider's ControllerConfig.

luebken · 2022-11-03T11:47:16Z

@chlunde so we have two options for disaster recovery use-cases:

As @bobh66 mentioned, setting the replicas to 0 for the ControllerConfig.
Setting the pause annotation for specific resources: https://crossplane.io/docs/v1.10/concepts/managed-resources.html#pausing-reconciliations.

Would that be sufficient for your use-cases? If not, would you mind elaborating on why not?
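For reference, the two options above can be sketched as plain Kubernetes objects built in Python. This is only an illustrative sketch: the annotation key `crossplane.io/paused` comes from the linked pause documentation, the `pkg.crossplane.io/v1alpha1` API version for ControllerConfig is assumed, and the helper names `pause_patch` and `scaled_down_controller_config` are hypothetical. Sending the objects to the API server (with kubectl or a client library) is left out.

```python
import json

# Annotation key described in the linked "pausing reconciliations" docs.
PAUSE_ANNOTATION = "crossplane.io/paused"


def pause_patch(paused: bool) -> dict:
    """Build a JSON merge patch that sets or removes the pause annotation.

    In a JSON merge patch (RFC 7386), a null value removes the key,
    so un-pausing deletes the annotation instead of setting it to "false".
    """
    value = "true" if paused else None
    return {"metadata": {"annotations": {PAUSE_ANNOTATION: value}}}


def scaled_down_controller_config(name: str) -> dict:
    """Build a ControllerConfig manifest that asks for zero provider replicas.

    Assumption: the Provider references this object through its
    controllerConfigRef; the API version below is assumed, not confirmed
    by this thread.
    """
    return {
        "apiVersion": "pkg.crossplane.io/v1alpha1",
        "kind": "ControllerConfig",
        "metadata": {"name": name},
        "spec": {"replicas": 0},
    }


if __name__ == "__main__":
    print(json.dumps(pause_patch(True), indent=2))
    print(json.dumps(scaled_down_controller_config("paused-provider"), indent=2))
```

The patch has to be applied to every managed resource individually, which is the scaling concern raised in the original proposal; the ControllerConfig approach stops the provider as a whole but also stops observation.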

chlunde · 2022-11-09T21:17:09Z

@luebken my main worry with use cases such as

restoring a cluster (recreate, partial restore, go back in time for a namespace) with thousands of managed resources
restoring an external resource from backup and then restoring and re-attaching it to a managed resource

would be that due to some unforeseen issue:

many resources are created twice. For example, due to generateName we get role-HASH2 when we already had role-HASH1. This can happen when only a claim is restored and the composition rendering does not use a predictable name/external-name.
resources are garbage collected, and then deleted, if we only restore managed resources without their claims

So I would like to pause Create/Update/Delete but not Observe, to ensure everything is as expected. Pause (as implemented today) stops observation as well, so it does not give the kind of comfort a terraform plan gives; an observe-only mode might.
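To make the request concrete, here is a minimal, self-contained sketch of what an observe-only mode could look like. It is not Crossplane code: `Mode`, `Resource`, `plan`, and `reconcile` are hypothetical names, and the external system is modelled as a plain dict. The point is only that observation and planning still run while the mutating step is skipped.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Mode(Enum):
    ACTIVE = "active"              # observe and act
    OBSERVE_ONLY = "observe-only"  # observe and report, never mutate
    PAUSED = "paused"              # skip reconciliation entirely


@dataclass
class Resource:
    name: str            # external name the resource maps to
    desired: dict        # desired external state
    deleted: bool = False


def plan(resource: Resource, observed: Optional[dict]) -> str:
    """Decide which action a reconcile would take, without taking it."""
    if resource.deleted:
        return "delete" if observed is not None else "none"
    if observed is None:
        return "create"
    if observed != resource.desired:
        return "update"
    return "none"


def reconcile(resource: Resource, external: dict, mode: Mode) -> str:
    """Reconcile one resource against a dict standing in for the cloud API."""
    if mode is Mode.PAUSED:
        return "skipped"

    observed = external.get(resource.name)  # Observe always runs here
    action = plan(resource, observed)

    if mode is Mode.OBSERVE_ONLY:
        # Report the pending action but leave the external system untouched.
        return "in sync" if action == "none" else f"would {action}"

    if action in ("create", "update"):
        external[resource.name] = dict(resource.desired)
    elif action == "delete":
        del external[resource.name]
    return action


if __name__ == "__main__":
    # The restored claim rendered role-HASH2 although role-HASH1 already exists.
    cloud = {"role-HASH1": {"policy": "read"}}
    restored = Resource(name="role-HASH2", desired={"policy": "read"})
    print(reconcile(restored, cloud, Mode.OBSERVE_ONLY))
    print(sorted(cloud))
```

In this toy model, the duplicate from the generateName example shows up as a pending create before anything is created, which is the terraform-plan-like signal an operator would want before unpausing.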

chlunde added the enhancement New feature or request label Oct 6, 2022

chlunde mentioned this issue Oct 6, 2022

Add support for a pause annotation on composite resources & claims, which pauses reconciliations crossplane/crossplane#3349

Merged

