MyApp/_posts/2025-04-29_ai-server-2025-04.md
11 additions & 10 deletions
@@ -1,13 +1,11 @@
 ---
-title: AI Server improvements
+title: AI Server - April 2025 Update
 summary: New SOTA LLMs added, support for thinking responses, Ollama Vision Models & Generate API
 tags: [ai-server,ai,gpt,service-reference,c#,js]
 author: Demis Bellot
 image: ./img/posts/ai-server/bg.webp
 ---

-## AI Server improvements
-
 AI Server is our Free OSS Docker self-hosted private gateway to manage API access to multiple LLM APIs, Ollama endpoints, Media APIs, Comfy UI and FFmpeg Agents that's designed as a one-stop solution to manage an organization's AI integrations for all their System Apps with its developer friendly HTTP JSON APIs that supports any programming language or framework.
@@ -16,6 +14,8 @@ AI Server is our Free OSS Docker self-hosted private gateway to manage API acces
 Introducing AI Server
 :::

+## Support for new Models
+
 This release continues to see a number of improvements to AI Server starting with adding support for popular LLM models added during this release, including:
By default [ImageToText](/ai-server/image-to-text) uses a purpose-specific **Florence 2 Vision model** with ComfyUI for its functionality which is capable of generating a very short description about an image, e.g:
@@ -92,3 +85,11 @@ To support Ollama's vision models AI Server added a new feature pipeline around
 - [OllamaGeneration](https://openai.servicestack.net/ui/OllamaGeneration): Synchronous invocation of Ollama's Generate API
 - [QueueOllamaGeneration](https://openai.servicestack.net/ui/QueueOllamaGeneration): Asynchronous or Web Callback invocation of Ollama's Generate API
 - [GetOllamaGenerationStatus](https://openai.servicestack.net/ui/GetOllamaGenerationStatus): Get the generation status of an Ollama Generate API
+
+## Support for Thinking Responses
+
+With the rise and popularity of **Thinking** Models we've added custom rendering of *thinking* responses in a collapsible