Replies: 2 comments 5 replies
I would also like this: either a timeout that automatically unloads the model and then reloads it on the next request, or explicit load/unload via the API, similar to Ollama.
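Roughly the kind of behaviour I mean, as a client-side sketch (the `load_model`/`unload_model` helpers below are just placeholders, not anything KoboldCpp exposes today):

```python
import threading
import time

IDLE_TIMEOUT = 300  # seconds of inactivity before the model is unloaded

# Placeholder stand-ins for whatever the engine would actually do;
# these are NOT real KoboldCpp calls.
def load_model(path):
    print(f"[load] {path} -> VRAM")
    return {"path": path}

def unload_model(model):
    print(f"[unload] {model['path']} freed from VRAM")

class LazyModel:
    """Unload after an idle timeout, reload lazily on the next request."""

    def __init__(self, path):
        self.path = path
        self.model = None
        self.last_used = time.time()
        self.lock = threading.Lock()
        threading.Thread(target=self._reaper, daemon=True).start()

    def generate(self, prompt):
        with self.lock:
            if self.model is None:                 # reload on demand
                self.model = load_model(self.path)
            self.last_used = time.time()
        return f"(generated text for: {prompt!r})"  # placeholder output

    def _reaper(self):
        while True:
            time.sleep(10)
            with self.lock:
                if self.model and time.time() - self.last_used > IDLE_TIMEOUT:
                    unload_model(self.model)
                    self.model = None
```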
Thanks so much for adding the feature. It properly times out and unloads the model. Is there a setting to have it automatically reload the last model used when it receives the next API query from SillyTavern? Keep up the amazing work with this engine!
It would be great if the API could load a model into VRAM upon receiving a request (if not already loaded), and unload a model from VRAM on request (like keep_alive in Ollama). Having the model always loaded while the KoboldCpp server is running is problematic when using ComfyUI workflows alongside other services, models, and generations (e.g. image generation) that need VRAM. This might also allow model swapping/changing via the API.
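For reference, this is roughly how keep_alive behaves in Ollama's HTTP API (the host and model name here are just assumptions for illustration): a generate request can say how long the model should stay resident, and a follow-up request with keep_alive set to 0 unloads it immediately, freeing VRAM for the ComfyUI side of the workflow.

```python
import requests

OLLAMA = "http://localhost:11434"   # default Ollama endpoint (assumed)
MODEL = "llama3"                    # example model name (assumed)

# Normal generation; keep_alive controls how long the model stays in VRAM
# after the response (Ollama's default is 5 minutes).
resp = requests.post(f"{OLLAMA}/api/generate", json={
    "model": MODEL,
    "prompt": "Hello",
    "stream": False,
    "keep_alive": "5m",
})
print(resp.json()["response"])

# A prompt-less request with keep_alive: 0 asks Ollama to unload the model
# right away, so the VRAM is free for image generation etc.
requests.post(f"{OLLAMA}/api/generate", json={
    "model": MODEL,
    "keep_alive": 0,
})
```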