Skip to content

Commit ee37633

Browse files
committed
feat: Launchers - Kubernetes - Fail the Kubernetes Job as soon as one of its Pods fails
1 parent ee37a16 commit ee37633

1 file changed

Lines changed: 2 additions & 0 deletions

File tree

cloud_pipelines_backend/launchers/kubernetes_launchers.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -976,6 +976,8 @@ def launch_container_task(
976976
completion_mode="Indexed",
977977
# backoff_limit=0,
978978
backoff_limit_per_index=0,
979+
# Without explicit max_failed_indexes=0, the job waits for all pods to end and then succeeds ("Complete") despite pod failures!
980+
max_failed_indexes=0,
979981
completions=num_nodes,
980982
parallelism=num_nodes,
981983
),

0 commit comments

Comments
 (0)