fix(ci): increase GPU speech unit test timeouts (TTS: 30 min, ASR: 60 min)#15651

Merged
ko3n1g merged 2 commits into main from ko3n1g/fix/tts-ci-timeout on Apr 29, 2026

@ko3n1g ko3n1g commented Apr 28, 2026

Claude summary

Two timeouts fixed in cicd-main-speech.yml:

L0_Unit_Tests_GPU_TTS: timeout: 30
The MagpieTTS inference tests load real checkpoints and run end-to-end inference, taking ~8–9 minutes each. With the default 10-minute per-attempt budget, all 3 retries timed out consistently on both main and PRs.

L0_Unit_Tests_GPU_ASR: timeout: 60
The ASR test suite runs past the previous 30-minute per-attempt limit. A recent run took 61 minutes total (18:12 → 19:14), indicating two timed-out attempts followed by a successful third, wasting ~60 minutes of runner time. Raised to 60 minutes so a single attempt covers the full suite.
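The runner-time figures above follow from simple arithmetic, sketched below in Python. This is only an illustration under stated assumptions: each timed-out attempt is assumed to consume its full per-attempt budget, retry overhead is ignored, and the helper name `wasted_minutes` is hypothetical (it does not appear in the workflow or in this PR).

```python
def wasted_minutes(timeout_min: int, timed_out_attempts: int) -> int:
    """Runner minutes consumed by attempts that hit the per-attempt timeout.

    Assumption: a timed-out attempt uses its whole budget, and any
    setup or teardown time between attempts is ignored.
    """
    return timeout_min * timed_out_attempts


# TTS before this PR: 10-minute budget, all 3 attempts timed out.
tts_wasted = wasted_minutes(timeout_min=10, timed_out_attempts=3)

# ASR before this PR: 30-minute budget, two timed-out attempts.
asr_wasted = wasted_minutes(timeout_min=30, timed_out_attempts=2)

print(f"TTS: {tts_wasted} min lost to timeouts")
print(f"ASR: {asr_wasted} min lost to timeouts")
```

Under these assumptions the ASR figure matches the ~60 minutes of wasted runner time quoted above. The point of raising a timeout is to let a single attempt finish the suite, so no attempt's runtime is discarded.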

Example failing run: https://github.com/NVIDIA-NeMo/NeMo/actions/runs/25060055760/job/73434530111

The TTS test suite includes magpietts_inference tests that load real
model checkpoints and run full inference, taking ~8-9 minutes. With
the default 10-minute timeout per attempt the step consistently times
out across all retry attempts. Raise the timeout to 30 minutes,
matching the ASR unit test budget.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>

The ASR test suite consistently runs past the 30-minute per-attempt
limit, burning through all retries and adding ~60 minutes of wasted
runner time. Raise to 60 minutes so a single attempt covers the full
suite.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: oliver könig <okoenig@nvidia.com>
@ko3n1g ko3n1g changed the title from "fix(ci): increase L0_Unit_Tests_GPU_TTS timeout to 30 minutes" to "fix(ci): increase GPU speech unit test timeouts (TTS: 30 min, ASR: 60 min)" on Apr 28, 2026
@github-actions

[🤖]: Hi @ko3n1g 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully.

So it might be time to merge this PR or get some approvals.

@ko3n1g ko3n1g merged commit 0f7e262 into main Apr 29, 2026
135 checks passed
@ko3n1g ko3n1g deleted the ko3n1g/fix/tts-ci-timeout branch April 29, 2026 07:07