[Autotuner] Long-lived worker pool for parallel precompile#2128

Closed
choijon5 wants to merge 1 commit into main from choijon5/stack/31

Conversation

@choijon5 choijon5 (Contributor) commented Apr 28, 2026

This should prevent CUDA sticky errors from bringing down the entire process.
The pool is on par with fork in both performance and total compile time, and 2x faster than spawn.

CI shows that "pool" is either a win or on par with "fork" for both compile time and perf.
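
For readers unfamiliar with the pattern, here is a minimal sketch of a long-lived worker pool that keeps a fatal worker error from reaching the parent process. It is not the code in this PR: `WORKER_SOURCE`, `start_worker`, and `WorkerPool` are hypothetical names, the "compile" step is a placeholder, and `os._exit(1)` stands in for a sticky CUDA error that leaves a process unusable. Workers are started once and reused across jobs, which is presumably where the savings over a per-config spawn come from.

```python
import json
import queue
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor

# Hypothetical worker loop: one JSON job per stdin line, one JSON reply per stdout line.
WORKER_SOURCE = r"""
import json, os, sys
for line in sys.stdin:
    job = json.loads(line)
    if job["config"] < 0:
        os._exit(1)  # stand-in for a fatal error that poisons the process
    print(json.dumps({"config": job["config"], "ok": True}), flush=True)
"""


def start_worker():
    """Start one long-lived worker process that talks over pipes."""
    return subprocess.Popen(
        [sys.executable, "-c", WORKER_SOURCE],
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
        text=True,
    )


class WorkerPool:
    def __init__(self, size):
        self.size = size
        self.idle = queue.Queue()
        for _ in range(size):
            self.idle.put(start_worker())

    def run_one(self, config):
        worker = self.idle.get()  # check out a worker; one job per worker at a time
        try:
            worker.stdin.write(json.dumps({"config": config}) + "\n")
            worker.stdin.flush()
            reply = worker.stdout.readline()
        except OSError:
            reply = ""
        if reply:
            result = json.loads(reply)
        else:
            # Empty reply means the worker died: record the failure and replace it.
            worker.kill()
            worker.wait()
            worker.stdout.close()
            worker = start_worker()
            result = {"config": config, "ok": False}
        self.idle.put(worker)
        return result

    def map(self, configs):
        """Run all configs in parallel; results come back in input order."""
        with ThreadPoolExecutor(max_workers=self.size) as threads:
            return list(threads.map(self.run_one, configs))

    def close(self):
        while not self.idle.empty():
            worker = self.idle.get()
            worker.stdin.close()  # EOF on stdin ends the worker loop
            worker.wait()
            worker.stdout.close()
```

In this sketch, a config that kills its worker shows up as `{"ok": False}` in the results while the parent and the remaining workers keep going, and the dead worker is replaced so the pool stays at full size.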

choijon5 added a commit that referenced this pull request Apr 28, 2026
stack-info: PR: #2128, branch: choijon5/stack/31
@choijon5 choijon5 force-pushed the choijon5/stack/31 branch from 733ba65 to 16925d5 April 28, 2026 07:09
@meta-cla meta-cla Bot added the CLA Signed label Apr 28, 2026
choijon5 added a commit that referenced this pull request Apr 28, 2026
stack-info: PR: #2128, branch: choijon5/stack/31
@choijon5 choijon5 changed the base branch from choijon5/stack/30 to main April 28, 2026 07:12
@choijon5 choijon5 force-pushed the choijon5/stack/31 branch 14 times, most recently from 711fcec to a72dcba May 1, 2026 22:38