Key Default Type Description
kubernetes.operator.cluster.health-check.checkpoint-progress.enabled
false Boolean Whether to enable checkpoint progress health check for clusters.
kubernetes.operator.cluster.health-check.checkpoint-progress.window
5 min Duration If no checkpoints are completed within the defined time window, the job is considered unhealthy. This must be bigger than checkpointing interval.
kubernetes.operator.cluster.health-check.enabled
false Boolean Whether to enable health check for clusters.
kubernetes.operator.cluster.health-check.restarts.threshold
64 Integer The threshold which is checked against job restart count within a configured window. If the restart count is reaching the threshold then full cluster restart is initiated.
kubernetes.operator.cluster.health-check.restarts.window
2 min Duration The duration of the time window where job restart count measured.
kubernetes.operator.deployment.readiness.timeout
5 min Duration The timeout for deployments to become ready/stable before being rolled back if rollback is enabled.
kubernetes.operator.deployment.rollback.enabled
false Boolean Whether to enable rolling back failed deployment upgrades.
kubernetes.operator.jm-deployment-recovery.enabled
true Boolean Whether to enable recovery of missing/deleted jobmanager deployments.
kubernetes.operator.jm-deployment.shutdown-ttl
86400000 ms Duration Time after which jobmanager pods of terminal application deployments are shut down.
kubernetes.operator.job.restart.failed
false Boolean Whether to restart failed jobs.
kubernetes.operator.job.upgrade.ignore-pending-savepoint
false Boolean Whether to ignore pending savepoint during job upgrade.
kubernetes.operator.job.upgrade.last-state-fallback.enabled
true Boolean Enables last-state fallback for savepoint upgrade mode. When the job is not running thus savepoint cannot be triggered but HA metadata is available for last state restore the operator can initiate the upgrade process when the flag is enabled.
kubernetes.operator.job.upgrade.last-state.max.allowed.checkpoint.age
(none) Duration Max allowed checkpoint age for initiating last-state upgrades on running jobs. If a checkpoint is not available within the desired age (and nothing in progress) a savepoint will be triggered.
kubernetes.operator.periodic.savepoint.interval
0 ms Duration Interval at which periodic savepoints will be triggered. The triggering schedule is not guaranteed, savepoints will be triggered as part of the regular reconcile loop.
kubernetes.operator.pod-template.merge-arrays-by-name
false Boolean Configure the array merge behaviour during pod merging. Arrays can be either merged by position or name matching.
kubernetes.operator.savepoint.cleanup.enabled
true Boolean Whether to enable clean up of savepoint history.
kubernetes.operator.savepoint.format.type
CANONICAL

Enum

Type of the binary format in which a savepoint should be taken.

Possible values:
  • "CANONICAL": A canonical, common for all state backends format. It lets you switch state backends.
  • "NATIVE": A format specific for the chosen state backend, in its native binary format. Might be faster to take and restore from than the canonical one.
kubernetes.operator.savepoint.history.max.age
86400000 ms Duration Maximum age for savepoint history entries to retain. Due to lazy clean-up, the most recent savepoint may live longer than the max age.
kubernetes.operator.savepoint.history.max.count
10 Integer Maximum number of savepoint history entries to retain.
kubernetes.operator.savepoint.trigger.grace-period
1 min Duration The interval before a savepoint trigger attempt is marked as unsuccessful.
kubernetes.operator.user.artifacts.http.header
(none) Map Custom HTTP header for HttpArtifactFetcher. The header will be applied when getting the session job artifacts. Expected format: headerKey1:headerValue1,headerKey2:headerValue2.