---
title: "Embeddings"
description: "EmbeddingRuntime SPI — runtime-agnostic text embedding across Spring AI, LangChain4j, Semantic Kernel, Embabel, and the Built-in client"
---

# Embeddings

`EmbeddingRuntime` is the sibling SPI to `AgentRuntime` for text-embedding
generation. Each supported LLM framework ships an implementation discovered
through `ServiceLoader`, so your RAG pipeline can swap embedding backends by
changing one dependency — exactly the same contract as `@AiEndpoint` and
`@AiTool` across chat runtimes.

## Why a separate SPI

Prior to 4.0.36 the `modules/rag` package consumed provider-specific embedding
APIs directly (Spring AI `EmbeddingModel`, LangChain4j `EmbeddingModel`, etc.),
which meant RAG code locked you into one backend. `EmbeddingRuntime` lifts
that into a runtime-agnostic SPI with a uniform API:

- `float[] embed(String text)` — embed a single text into a vector
- `List<float[]> embedAll(List<String> texts)` — batch variant (preferred for
  amortizing per-request overhead)
- `int dimensions()` — the vector length, or `-1` if the runtime cannot
  answer without a network call
- `boolean isAvailable()` — whether this runtime's native API is on the
  classpath AND a concrete embedding model has been wired
- `String name()` — human-readable ID: `"spring-ai"`, `"langchain4j"`,
  `"semantic-kernel"`, `"embabel"`, `"built-in"`
- `int priority()` — selection priority when multiple runtimes are present
  (higher wins)

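The method list above can be sketched as a plain interface. This is an illustrative paraphrase, not the actual `org.atmosphere.ai` source, and the `HashingEmbeddingRuntime` toy implementation below is invented purely to show how the pieces fit together:

```java
import java.util.ArrayList;
import java.util.List;

// Sketch of the SPI surface listed above; the real interface lives in
// org.atmosphere.ai and may differ in modifiers and default methods.
interface EmbeddingRuntimeSketch {
    float[] embed(String text);
    List<float[]> embedAll(List<String> texts);
    int dimensions();      // -1 when unknown without a network call
    boolean isAvailable();
    String name();
    int priority();        // higher wins when several runtimes are present
}

// Invented toy implementation: deterministic hash-bucket vectors,
// useful only to demonstrate the contract, never for real retrieval.
class HashingEmbeddingRuntime implements EmbeddingRuntimeSketch {
    private static final int DIM = 8;

    @Override public float[] embed(String text) {
        float[] v = new float[DIM];
        for (int i = 0; i < text.length(); i++) {
            v[i % DIM] += text.charAt(i);   // fold characters into DIM buckets
        }
        return v;
    }

    @Override public List<float[]> embedAll(List<String> texts) {
        List<float[]> out = new ArrayList<>(texts.size());
        for (String t : texts) out.add(embed(t));   // naive batch: one call per text
        return out;
    }

    @Override public int dimensions()      { return DIM; }
    @Override public boolean isAvailable() { return true; }
    @Override public String name()         { return "hashing-toy"; }
    @Override public int priority()        { return 0; }
}
```

A real adapter would delegate `embedAll` to the framework's native batch API instead of looping, which is exactly why the SPI exposes it separately.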
## Runtime priorities

The default resolver picks the highest-priority runtime whose
`isAvailable()` returns `true`. Priorities are stable across releases so
adapter wrappers always win over the zero-dependency fallback:

| Runtime | Priority | Status |
|---------|---------:|--------|
| Spring AI (`spring-ai`) | **200** | Available when a Spring-managed `EmbeddingModel` bean is wired |
| LangChain4j (`langchain4j`) | **190** | Available when a `dev.langchain4j.model.embedding.EmbeddingModel` instance is injected |
| Semantic Kernel (`semantic-kernel`) | **180** | Available when a `TextEmbeddingGenerationService` is supplied. Uses `Mono.block()` at the sync boundary and unwraps `List<Float>` → `float[]` |
| Embabel (`embabel`) | **170** | Thin pass-through over `com.embabel.common.ai.model.EmbeddingService` |
| Built-in (`built-in`) | **50** | Zero-dependency OpenAI-compatible `/v1/embeddings` client. Fallback used when no framework-native `EmbeddingModel` is wired. |

The Built-in runtime sits below every adapter so a framework-native
`EmbeddingModel` always wins when present. Direct `OpenAiCompatibleClient`
callers bypass the resolver and use the Built-in implementation unconditionally.

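The selection rule itself is small enough to sketch. The stand-in types below are invented for illustration; the real `EmbeddingRuntimeResolver` discovers its candidates through `ServiceLoader` rather than taking a list:

```java
import java.util.Comparator;
import java.util.List;
import java.util.Optional;

// Minimal stand-in for the SPI, just enough to demonstrate resolution.
interface RuntimeCandidate {
    boolean isAvailable();
    int priority();
    String name();
}

class ResolverSketch {
    // In the real resolver the candidates come from
    // ServiceLoader.load(EmbeddingRuntime.class); here they are passed in.
    static Optional<RuntimeCandidate> resolve(List<RuntimeCandidate> discovered) {
        return discovered.stream()
                .filter(RuntimeCandidate::isAvailable)
                .max(Comparator.comparingInt(RuntimeCandidate::priority));
    }

    // Helper to build a fixed candidate for experimentation.
    static RuntimeCandidate candidate(String name, int priority, boolean available) {
        return new RuntimeCandidate() {
            @Override public boolean isAvailable() { return available; }
            @Override public int priority()        { return priority; }
            @Override public String name()         { return name; }
        };
    }
}
```

With both a wired Spring AI adapter (200) and the Built-in fallback (50) on the classpath, `spring-ai` wins; if the Spring model is never wired, its `isAvailable()` stays `false` and the fallback is chosen.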
## Auto-discovery

Add the corresponding `atmosphere-*` dependency to your project:

```xml
<properties>
  <atmosphere.version>4.0.36</atmosphere.version>
</properties>

<!-- Spring AI embedding runtime (priority 200) -->
<dependency>
  <groupId>org.atmosphere</groupId>
  <artifactId>atmosphere-spring-ai</artifactId>
  <version>${atmosphere.version}</version>
</dependency>

<!-- or LangChain4j (priority 190) -->
<dependency>
  <groupId>org.atmosphere</groupId>
  <artifactId>atmosphere-langchain4j</artifactId>
  <version>${atmosphere.version}</version>
</dependency>

<!-- or Semantic Kernel (priority 180) -->
<dependency>
  <groupId>org.atmosphere</groupId>
  <artifactId>atmosphere-semantic-kernel</artifactId>
  <version>${atmosphere.version}</version>
</dependency>
```

On application startup Atmosphere scans the classpath via
`ServiceLoader<EmbeddingRuntime>` and picks the highest-priority
`isAvailable()` runtime. No code changes are needed when swapping backends.

## Programmatic usage

```java
import java.util.List;

import org.atmosphere.ai.EmbeddingRuntime;
import org.atmosphere.ai.EmbeddingRuntimeResolver;

// Resolve the active runtime (highest-priority available)
EmbeddingRuntime runtime = EmbeddingRuntimeResolver.resolve()
    .orElseThrow(() -> new IllegalStateException(
        "No EmbeddingRuntime available — add atmosphere-spring-ai or " +
        "atmosphere-langchain4j to the classpath, or configure AiConfig " +
        "for the Built-in fallback"));

// Single embedding
float[] vector = runtime.embed("Atmosphere is the unified AI runtime abstraction on the JVM.");
System.out.println("Dimensions: " + vector.length);

// Batch embedding (preferred for multiple inputs)
List<float[]> vectors = runtime.embedAll(List.of(
    "First document",
    "Second document",
    "Third document"));
```

`EmbeddingRuntimeResolver.resolveAll()` returns all available runtimes in
priority order if you need to fan out or pick a specific backend manually.

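Picking a specific backend by name is then a simple filter over that priority-ordered list. The sketch below uses an invented `Rt` record in place of real runtimes; with the real API the list would come from `EmbeddingRuntimeResolver.resolveAll()`:

```java
import java.util.List;
import java.util.Optional;

class PickByName {
    // Invented stand-in for an EmbeddingRuntime, carrying just the
    // two fields this example needs.
    record Rt(String name, int priority) {}

    // resolveAll() is documented to return runtimes in priority order,
    // so the first name match is also the highest-priority match.
    static Optional<Rt> pick(List<Rt> inPriorityOrder, String wanted) {
        return inPriorityOrder.stream()
                .filter(rt -> rt.name().equals(wanted))
                .findFirst();
    }
}
```

An empty `Optional` means the requested backend was never discovered or never became available, which is worth surfacing as a configuration error rather than silently falling back.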
## Wiring a framework-native embedding model

The adapter runtimes (Spring AI, LangChain4j, Semantic Kernel, Embabel)
wrap a framework-managed `EmbeddingModel` / `EmbeddingService` /
`TextEmbeddingGenerationService` instance. Wire it via the adapter's
static setter during startup:

### Spring AI

```java
@Configuration
public class EmbeddingConfig {

    @Bean
    EmbeddingModel openAiEmbeddingModel() {
        EmbeddingModel model = new OpenAiEmbeddingModel(...);
        // Hand the Spring-managed model to Atmosphere's adapter at creation
        // time (@PostConstruct methods cannot take parameters).
        SpringAiEmbeddingRuntime.setEmbeddingModel(model);
        return model;
    }
}
```


### LangChain4j

```java
var lc4jEmbedder = OpenAiEmbeddingModel.builder()
    .apiKey(System.getenv("OPENAI_API_KEY"))
    .build();
LangChain4jEmbeddingRuntime.setEmbeddingModel(lc4jEmbedder);
```

### Semantic Kernel

```java
var skService = OpenAITextEmbeddingGenerationService.builder()
    .withApiKey(System.getenv("OPENAI_API_KEY"))
    .withModelId("text-embedding-3-small")
    .build();
SemanticKernelEmbeddingRuntime.setEmbeddingService(skService);
```

### Embabel

```java
EmbabelEmbeddingRuntime.setEmbeddingService(embabelEmbeddingService);
```

### Built-in (zero-dep fallback)

No wiring needed — the Built-in runtime reads `AiConfig.baseUrl` + `apiKey`
at call time and POSTs to `/v1/embeddings` on any OpenAI-compatible endpoint
(OpenAI, Azure OpenAI, Gemini's OpenAI gateway, Ollama, LocalAI, etc.).

```java
// Override the default model name if needed
var builtIn = new BuiltInEmbeddingRuntime();
builtIn.setEmbeddingModel("text-embedding-3-large");
```
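To make the wire contract concrete, the request body such a client must produce can be hand-rolled in a few lines. This is an illustrative sketch of the public `/v1/embeddings` request shape, not Atmosphere's internal serialization:

```java
import java.util.List;
import java.util.stream.Collectors;

// Builds the {"model": ..., "input": [...]} body the OpenAI-compatible
// /v1/embeddings endpoint expects. Escaping here is minimal (backslashes
// and quotes only); a real client would use a proper JSON serializer.
class EmbeddingsRequestSketch {
    static String body(String model, List<String> inputs) {
        String joined = inputs.stream()
                .map(s -> "\"" + s.replace("\\", "\\\\").replace("\"", "\\\"") + "\"")
                .collect(Collectors.joining(", "));
        return "{\"model\": \"" + model + "\", \"input\": [" + joined + "]}";
    }
}
```

The response carries one `embedding` array per input under `data`, in input order, which maps directly onto `embedAll`'s `List<float[]>` return shape.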

## Contract tests — `AbstractEmbeddingRuntimeContractTest`

Every concrete `EmbeddingRuntime` ships with a contract-test subclass of
`AbstractEmbeddingRuntimeContractTest`. The base assertions exercise
`embed()`, `embedAll()`, `dimensions()`, and `isAvailable()` with a
deterministic fake embedder so the bridge plumbing is validated without
live network calls.

The six parity assertions are:

1. `runtimeHasStableName()` — `name()` returns a non-blank, stable identifier
2. `embedSingleTextReturnsVectorOfExpectedDimension()` — single-text round-trip
3. `embedAllReturnsVectorPerInputInOrder()` — batch round-trip preserves order
4. `embedAllWithEmptyListReturnsEmptyList()` — edge case
5. `runtimeIsAvailableAfterFakeInjection()` — availability gate flips on injection
6. `dimensionsAccessorIsNonNegativeOrMinusOne()` — dimension advertising contract

If you add a new `EmbeddingRuntime` implementation, subclass the base and
supply a deterministic fake embedder via `installFakeEmbedder()` — no
need to write the same assertions again.

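For a feel of what the base class checks, the six assertions can be approximated in plain Java. Everything below (the `Rt` stand-in, the fake, the checks) is invented for illustration; the real suite runs through JUnit via `AbstractEmbeddingRuntimeContractTest`:

```java
import java.util.List;

class ContractSketch {
    // Stand-in for the SPI surface the contract exercises.
    interface Rt {
        float[] embed(String text);
        List<float[]> embedAll(List<String> texts);
        int dimensions();
        boolean isAvailable();
        String name();
    }

    // Deterministic fake: one-dimensional vector holding the text length.
    static Rt fake() {
        return new Rt() {
            @Override public float[] embed(String t) { return new float[] { t.length() }; }
            @Override public List<float[]> embedAll(List<String> ts) {
                return ts.stream().map(this::embed).toList();
            }
            @Override public int dimensions()      { return 1; }
            @Override public boolean isAvailable() { return true; }
            @Override public String name()         { return "fake"; }
        };
    }

    static void runContract(Rt rt) {
        // 1. stable, non-blank name
        if (rt.name() == null || rt.name().isBlank()) throw new AssertionError("name");
        // 2. single-text round-trip matches the advertised dimension
        if (rt.embed("abc").length != rt.dimensions()) throw new AssertionError("embed");
        // 3. batch returns one vector per input, in order
        List<float[]> batch = rt.embedAll(List.of("a", "bb"));
        if (batch.size() != 2 || batch.get(0)[0] != 1 || batch.get(1)[0] != 2)
            throw new AssertionError("embedAll order");
        // 4. empty batch -> empty list
        if (!rt.embedAll(List.of()).isEmpty()) throw new AssertionError("empty");
        // 5. availability gate is open once a fake is injected
        if (!rt.isAvailable()) throw new AssertionError("available");
        // 6. dimensions() is non-negative or the -1 sentinel
        if (rt.dimensions() < -1) throw new AssertionError("dimensions");
    }
}
```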
## Capabilities matrix

| Runtime | `embed()` | `embedAll()` batched | `dimensions()` | Notes |
|---------|:---------:|:--------------------:|:--------------:|-------|
| Spring AI | ✅ | ✅ native batch | ✅ | `model.dimensions()` |
| LangChain4j | ✅ | ✅ native batch via `TextSegment` | ✅ | `model.dimension()` |
| Semantic Kernel | ✅ | ✅ native batch | `-1` | `Mono.block()` sync boundary |
| Embabel | ✅ | ✅ native batch | ✅ | 1:1 pass-through |
| Built-in | ✅ | ✅ via `/v1/embeddings` | ✅ from config | OpenAI-compatible wire format |

## See also

- [AI / LLM Reference](../../reference/ai/) — `AgentRuntime` SPI and capability matrix
- [RAG Module](../../reference/ai/#rag) — how RAG pipelines consume `EmbeddingRuntime`
- [Spring AI Integration](../../integrations/spring-ai/)
- [LangChain4j Integration](../../integrations/langchain4j/)