"When true (requires --untaintControllerEnabled), the Untaint controller removes the startup taint only after both the node Agent and Datadog CSI node-server pods are Ready. Requires Pod watch coverage of CSI namespaces (DD_CSIDRIVER_WATCH_NAMESPACE).")
 // DatadogAgentInternal
 flag.BoolVar(&opts.createControllerRevisions, "createControllerRevisions", false, "Enable creation of ControllerRevision snapshots on each DDA spec change")
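The help text for `--untaintControllerWaitForCSIDriver` says it requires `--untaintControllerEnabled`, and the docs in this change say the operator exits on invalid combinations. A minimal sketch of such a check follows. The `untaintOptions` struct, both function names, the help strings, and the error text are hypothetical; only the two flag names come from the change, and the operator's real validation code is not shown here.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
)

// untaintOptions holds the two flags discussed above. The struct and its
// field names are hypothetical; only the flag names come from the change.
type untaintOptions struct {
	enabled          bool
	waitForCSIDriver bool
}

// validateUntaintOptions rejects the one combination the docs call invalid:
// waiting for the CSI driver while the Untaint controller itself is disabled.
func validateUntaintOptions(opts untaintOptions) error {
	if opts.waitForCSIDriver && !opts.enabled {
		return errors.New("--untaintControllerWaitForCSIDriver requires --untaintControllerEnabled=true")
	}
	return nil
}

// parseUntaintFlags registers both flags on a private FlagSet (help strings
// are placeholders), parses args, and then validates the combination.
func parseUntaintFlags(args []string) (untaintOptions, error) {
	var opts untaintOptions
	fs := flag.NewFlagSet("operator", flag.ContinueOnError)
	fs.BoolVar(&opts.enabled, "untaintControllerEnabled", false, "placeholder help text")
	fs.BoolVar(&opts.waitForCSIDriver, "untaintControllerWaitForCSIDriver", false, "placeholder help text")
	if err := fs.Parse(args); err != nil {
		return opts, err
	}
	return opts, validateUntaintOptions(opts)
}

func main() {
	// Only the dependent flag is set, so validation should fail.
	_, err := parseUntaintFlags([]string{"--untaintControllerWaitForCSIDriver=true"})
	fmt.Println("invalid combination rejected:", err != nil)
}
```

A caller that wants the documented exit-on-error behavior could log the returned error and call `os.Exit(1)`; returning an error instead keeps the check easy to unit test.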
| `true` | `true` | Wait for Agent **and** CSI node-server Ready; widened Pod cache (agent + `DD_CSIDRIVER_WATCH_NAMESPACE` namespaces); startup toleration on Agent and, when the DatadogCSIDriver controller is enabled, on the CSI node DaemonSet. |
+
+`--untaintControllerWaitForCSIDriver` requires `--untaintControllerEnabled=true` (the operator exits on invalid combinations).

-When this flag is enabled, the operator injects a toleration for
+When `--untaintControllerEnabled` is enabled, the operator injects a toleration for
 `agent.datadoghq.com/not-ready=presence:NoSchedule` into the node Agent
 DaemonSet (or ExtendedDaemonSet) pod template, unless an equivalent toleration
-is already present. When **`--datadogCSIDriverEnabled`** is also true, the same
+is already present. When **`--untaintControllerWaitForCSIDriver`** is also true **and**
+the DatadogCSIDriver controller is running (`--datadogCSIDriverEnabled=true`), the same
 toleration is injected into the **Datadog CSI node-server** DaemonSet pod
 template so the CSI workload can schedule on tainted nodes before the taint is
 removed.
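The "inject unless an equivalent toleration is already present" rule described in this hunk can be sketched as follows. This is an illustration, not the operator's code: the local `Toleration` struct only mirrors the relevant fields of the Kubernetes `core/v1` type so the example has no dependencies, the function names are hypothetical, and the equivalence rule (same key, compatible effect, and either `Exists` or a matching value) is an assumption about what "equivalent" means here.

```go
package main

import "fmt"

// Toleration mirrors the fields of the Kubernetes core/v1 Toleration type
// that matter for this sketch. It is a local stand-in, not the real type.
type Toleration struct {
	Key      string
	Operator string // "Equal" or "Exists"; empty is treated as "Equal"
	Value    string
	Effect   string // empty is treated as matching every effect
}

// startupToleration matches the taint named in the docs:
// agent.datadoghq.com/not-ready=presence:NoSchedule
var startupToleration = Toleration{
	Key:      "agent.datadoghq.com/not-ready",
	Operator: "Equal",
	Value:    "presence",
	Effect:   "NoSchedule",
}

// covers reports whether t already tolerates the startup taint.
// Assumed rule: keys must match, an empty effect matches any effect,
// and "Exists" ignores the value. Empty-key wildcards are not handled.
func covers(t Toleration) bool {
	if t.Key != startupToleration.Key {
		return false
	}
	if t.Effect != "" && t.Effect != startupToleration.Effect {
		return false
	}
	if t.Operator == "Exists" {
		return true
	}
	return t.Value == startupToleration.Value
}

// ensureStartupToleration returns the tolerations with the startup
// toleration appended, unless an equivalent one is already present.
// The input slice is never modified.
func ensureStartupToleration(existing []Toleration) []Toleration {
	for _, t := range existing {
		if covers(t) {
			return existing
		}
	}
	out := make([]Toleration, 0, len(existing)+1)
	out = append(out, existing...)
	return append(out, startupToleration)
}

func main() {
	once := ensureStartupToleration(nil)
	twice := ensureStartupToleration(once)
	fmt.Println(len(once), len(twice)) // the second call adds nothing
}
```

Making the helper idempotent means it can run on every reconcile of the pod template without accumulating duplicate tolerations.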
@@ -104,7 +116,7 @@ Metrics, under the `untaint` Prometheus subsystem:
 Kubernetes Events (gated by `DD_UNTAINT_CONTROLLER_EVENTS_ENABLED=true`):

 - `TaintRemoved` (Normal) — taint removed after the Agent became Ready, or (when
-the Datadog CSI driver controller is also enabled) after both the Agent and
+`--untaintControllerWaitForCSIDriver` is enabled) after both the Agent and
 CSI node-server pods became Ready.
 - `UntaintTimeout` — a timeout fired. Normal under `remove`, Warning under `keep`. Message carries the reason, elapsed time, and policy.
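The condition behind `TaintRemoved`, as the updated text describes it, reduces to a small predicate: Agent readiness is always required, and CSI node-server readiness is required only when `--untaintControllerWaitForCSIDriver` is set. The sketch below is an assumption-laden illustration; `nodeState` and `shouldRemoveTaint` are hypothetical names, and timeout handling (`UntaintTimeout`) is deliberately left out.

```go
package main

import "fmt"

// nodeState is a hypothetical summary of the pods observed on one node.
type nodeState struct {
	agentReady         bool
	csiNodeServerReady bool
}

// shouldRemoveTaint reports whether the startup taint can be removed.
// The Agent must always be Ready; the CSI node-server only gates removal
// when waitForCSIDriver (--untaintControllerWaitForCSIDriver) is true.
func shouldRemoveTaint(waitForCSIDriver bool, s nodeState) bool {
	if !s.agentReady {
		return false
	}
	if waitForCSIDriver {
		return s.csiNodeServerReady
	}
	return true
}

func main() {
	agentOnly := nodeState{agentReady: true}
	fmt.Println(shouldRemoveTaint(false, agentOnly)) // Agent alone is enough
	fmt.Println(shouldRemoveTaint(true, agentOnly))  // still waiting for CSI
}
```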