feat: GPU yield — prevent Metal from starving macOS WindowServer
Every 8 tokens, insert a 50 μs Task.sleep so generation briefly
yields the GPU. This prevents heavy inference from freezing the
macOS UI (WindowServer). Applied to all 4 generation loops:
- Chat streaming
- Chat non-streaming
- Text streaming
- Text non-streaming
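
The yield described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: `yieldInterval`, `yieldNanoseconds`, `shouldYield(afterTokenCount:)`, `generate(maxTokens:nextToken:)`, and the `nextToken` closure are hypothetical names, and only the values 8 and 50 μs come from the message. It assumes the pause works by suspending the generation task so that no new GPU work is submitted for a moment.

```swift
import Foundation

// Hypothetical constants; the values are taken from the commit message.
let yieldInterval = 8                  // tokens generated between yields
let yieldNanoseconds: UInt64 = 50_000  // 50 μs expressed in nanoseconds

/// Returns true when the loop should pause after `tokenCount` tokens.
func shouldYield(afterTokenCount tokenCount: Int) -> Bool {
    tokenCount > 0 && tokenCount % yieldInterval == 0
}

/// Stand-in for one generation loop. `nextToken` is a hypothetical closure
/// that returns the next token, or nil when generation is finished.
func generate(
    maxTokens: Int,
    nextToken: () -> String?
) async -> (tokens: [String], yields: Int) {
    var tokens: [String] = []
    var yields = 0
    while tokens.count < maxTokens, let token = nextToken() {
        tokens.append(token)
        if shouldYield(afterTokenCount: tokens.count) {
            // Suspend this task briefly so no new GPU work is queued.
            // try? discards the CancellationError thrown if the task is cancelled.
            try? await Task.sleep(nanoseconds: yieldNanoseconds)
            yields += 1
        }
    }
    return (tokens, yields)
}
```

Keeping the check in one small helper would let all four loops share the same interval and duration, so the values can be tuned in one place.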