Skip to content

Commit 72014c3

Browse files
Add costing sample workbook screenshots
1 parent f2aa540 commit 72014c3

3 files changed

Lines changed: 8 additions & 0 deletions

File tree

samples/costing/README.md

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -255,6 +255,14 @@ The AI Gateway tab shows per-client token consumption and estimated costs when A
255255

256256
![AI Gateway - Model & Caller Breakdown](screenshots/AIGateway-04.png)
257257

258+
![AI Gateway - Token & PTU Summary](screenshots/AI-Gateway-Token-PTU.png)
259+
260+
### Per-Request Detail Tab
261+
262+
The Per-Request Detail tab provides a row-level drill-in across every AI request, joining gateway logs with LLM diagnostic data so you can inspect a single call end to end. The `AI Delivery Mode` and `Usage Provenance` columns make it easy to confirm whether a streaming request supplied its own `usage` chunk or relied on the APIM policy fragment to inject one.
263+
264+
![Per-Request Detail](screenshots/Per-Request%20Detail.png)
265+
258266
### Streaming vs Non-Streaming Verification
259267

260268
When `enable_foundry = True`, the multi-caller traffic phase alternates between non-streaming and streaming chat completions for every business unit. The **AI Gateway** tab includes a *Streaming vs Non-Streaming Breakdown* group with:
336 KB
Loading
427 KB
Loading

0 commit comments

Comments
 (0)