ci: cap the E2E job with timeout-minutes #145

Open
viniciusdc wants to merge 1 commit into main from ci/e2e-timeout
@viniciusdc

Collaborator

Summary

Add a timeout-minutes: 30 cap to the test-e2e job. It had no timeout, so a hung setup step ran until GitHub's 6-hour default, holding a runner the entire time.
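For reference, the change amounts to a single key on the job. The fragment below is a minimal sketch, not the actual workflow file: the job id, the 30-minute value, the step name, and the script path come from this PR, while the runner label, the checkout step, and the way the script is invoked are assumptions for illustration.

```yaml
# Sketch only. Assumed: runner label, checkout step, and how install.sh is invoked.
jobs:
  test-e2e:
    runs-on: ubuntu-latest      # assumed runner label
    timeout-minutes: 30         # job-level cap; without it GitHub's default is 360 (6 hours)
    steps:
      - uses: actions/checkout@v4                  # assumed
      - name: Install foundational services
        run: ./dev/scripts/services/install.sh     # invocation assumed
```

Because the key sits at the job level, it bounds the whole job no matter which step stalls, so no per-step timeout is needed for the backstop to work.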

Why

On a recent PR the Install foundational services step (dev/scripts/services/install.sh) wedged — apparently on an unbounded readiness wait — and the job sat in-progress for 40+ minutes with no end in sight (it would have run to 6h). That step runs before the operator is even deployed, so it's an environment/setup hang, not a code failure. A job-level cap is the robust backstop regardless of which inner command stalls.

A healthy E2E run finishes well under 30 minutes (the foundational step normally completes in ~2 min), so the cap only trips on a genuine hang, failing the job at the 30-minute mark instead of letting it hold a runner for up to 6 hours.

Scope

Deliberately minimal and standalone. The strategic fix — replacing the hand-rolled dev/scripts/services/* setup with the nebari-dev/action-nebari-sandbox platform profile — is a larger, multi-repo effort tracked separately. This just stops the bleeding in the meantime.

Test plan

  • YAML parses; jobs.test-e2e.timeout-minutes == 30.
  • No behavior change on healthy runs; only bounds hangs.

…runner

The E2E job had no timeout, so when a setup step hangs (a foundational-services
install recently wedged on an unbounded readiness wait, well before the operator
is even deployed) the job sits in progress until GitHub's 6h default, holding a
runner the whole time. Add a 30-minute job cap: a healthy run finishes well
under that, so this only bites genuine hangs, failing them after 30 minutes.

This is a standalone resilience fix. Replacing the hand-rolled foundational
setup (dev/scripts/services/*) with the nebari-sandbox action is tracked
separately.

2 participants