fix: Catch `can't start new thread` #453

Merged
ko3n1g merged 2 commits into main from ko3n1g/fix/catch-no-new-thread on Mar 11, 2026

ko3n1g (Contributor) commented Mar 11, 2026

No description provided.

Signed-off-by: oliver könig <okoenig@nvidia.com>
ko3n1g added the r0.8.0 label (Cherry-pick PR to the r0.8.0 release branch) Mar 11, 2026
malay-nagda previously approved these changes Mar 11, 2026
)
time.sleep(thread_retry_delay)
thread_retry_delay = min(thread_retry_delay * 2, 60)
continue

Review comment (Contributor):

What are the consequences of constantly hitting the `can't start new thread` error? The number of tries is not incremented here. Is there a point where something is really wrong and we should just fail?

ko3n1g (Contributor, Author) replied:

hmm.. yeah, let's change that to exponential backoff with maybe 5 retries
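
A minimal sketch of what the agreed change might look like, not the code merged in this PR. It assumes the error surfaces as the `RuntimeError("can't start new thread")` that CPython raises from `Thread.start()`. The helper name `start_thread_with_backoff`, the 1-second initial delay, and the injectable `thread_factory` and `sleep` parameters are all hypothetical; only the doubling step, the 60-second cap, and the idea of about 5 retries come from the discussion above.

```python
import threading
import time


def start_thread_with_backoff(
    target,
    max_retries=5,          # from the review reply: "maybe 5 retries"
    initial_delay=1.0,      # assumption: the starting delay is not shown in the PR excerpt
    max_delay=60.0,         # matches the min(..., 60) cap in the quoted snippet
    thread_factory=threading.Thread,  # hypothetical hook, mainly for testing
    sleep=time.sleep,                 # hypothetical hook, mainly for testing
):
    """Start a thread, retrying a bounded number of times on thread exhaustion."""
    delay = initial_delay
    for attempt in range(max_retries + 1):
        thread = thread_factory(target=target, daemon=True)
        try:
            thread.start()
            return thread
        except RuntimeError as err:
            # Only retry the specific thread-exhaustion error; re-raise anything else.
            if "can't start new thread" not in str(err):
                raise
            # Retries are counted, so a persistent failure eventually propagates.
            if attempt == max_retries:
                raise
            sleep(delay)
            delay = min(delay * 2, max_delay)
```

Because each failure counts as an attempt, a process that cannot create threads at all fails after the last retry instead of looping forever, which is the concern raised in the review comment.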

Signed-off-by: oliver könig <okoenig@nvidia.com>
ko3n1g merged commit a30587f into main Mar 11, 2026
24 of 25 checks passed
svcnvidia-nemo-ci pushed a commit that referenced this pull request Mar 11, 2026
* fix: Catch `can't start new thread`

Signed-off-by: oliver könig <okoenig@nvidia.com>

* add exponential backoff

Signed-off-by: oliver könig <okoenig@nvidia.com>

---------

Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
ko3n1g added a commit that referenced this pull request Mar 11, 2026
* fix: Catch `can't start new thread`

* add exponential backoff

---------

Signed-off-by: oliver könig <okoenig@nvidia.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: oliver könig <okoenig@nvidia.com>