Skip to content

Commit 568aa0c

Browse files
committed
publish model problem post
1 parent e19a636 commit 568aa0c

1 file changed

Lines changed: 6 additions & 17 deletions

File tree

content/posts/the-model-problem-nobody-warned-me-about.md

Lines changed: 6 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -1,20 +1,9 @@
1-
+++
2-
title = "The Model Problem Nobody Warned Me About"
3-
date = "2026-04-05T15:23:52+02:00"
4-
#dateFormat = "2006-01-02" # This value can be configured for per-post date formatting
5-
author = ""
6-
authorTwitter = "" #do not include @
7-
cover = ""
8-
tags = ["models", "agents"]
9-
keywords = ["openrouter", "deepseek", "mistral", "ai agents"]
10-
description = "Picking a model used to be simple. Bigger is better, newer is better, benchmark wins mean production wins. That's not how it works when you're actually running something.
11-
I've spent the last few weeks debugging a bridge between a Matrix channel and an OpenRouter-backed agent. The technical problems were annoying but solvable. Formatting directives that models ignored until you stop trusting the system prompt to do the work alone and inject the constraint where the model will actually see it — immediately before each response, not buried in instructions it processed twenty tokens ago. Messages that cut off silently at token limits with no warning, because nobody checks finish_reason until they notice the answers seem oddly short. Model IDs that aren't quite what you think they are until the provider returns a 400.
12-
Each one had a fix. None were obvious until they were.
13-
The harder problem was model selection itself."
14-
showFullContent = false
15-
readingTime = false
16-
hideComments = false
17-
+++
1+
---
2+
title: "The Model Problem Nobody Warned Me About"
3+
date: 2026-04-05
4+
draft: false
5+
tags: ["models", "agents"]
6+
---
187
Picking a model used to be simple. Bigger is better, newer is better, benchmark wins mean production wins. That's not how it works when you're actually running something.
198

209
I've spent the last few weeks debugging a bridge between a Matrix channel and an OpenRouter-backed agent. The technical problems were annoying but solvable. Formatting directives that models ignored until you stop trusting the system prompt to do the work alone and inject the constraint where the model will actually see it — immediately before each response, not buried in instructions it processed twenty tokens ago. Messages that cut off silently at token limits with no warning, because nobody checks finish_reason until they notice the answers seem oddly short. Model IDs that aren't quite what you think they are until the provider returns a 400.

0 commit comments

Comments
 (0)