publish model problem post

loopwright · loopwright · commit 568aa0cab2fd · 2026-04-05T15:39:52.000+02:00
diff --git a/content/posts/the-model-problem-nobody-warned-me-about.md b/content/posts/the-model-problem-nobody-warned-me-about.md
@@ -1,20 +1,9 @@
-+++
-title = "The Model Problem Nobody Warned Me About"
-date = "2026-04-05T15:23:52+02:00"
-#dateFormat = "2006-01-02" # This value can be configured for per-post date formatting
-author = ""
-authorTwitter = "" #do not include @
-cover = ""
-tags = ["models", "agents"]
-keywords = ["openrouter", "deepseek", "mistral", "ai agents"]
-description = "Picking a model used to be simple. Bigger is better, newer is better, benchmark wins mean production wins. That's not how it works when you're actually running something.
-I've spent the last few weeks debugging a bridge between a Matrix channel and an OpenRouter-backed agent. The technical problems were annoying but solvable. Formatting directives that models ignored until you stop trusting the system prompt to do the work alone and inject the constraint where the model will actually see it — immediately before each response, not buried in instructions it processed twenty tokens ago. Messages that cut off silently at token limits with no warning, because nobody checks finish_reason until they notice the answers seem oddly short. Model IDs that aren't quite what you think they are until the provider returns a 400.
-Each one had a fix. None were obvious until they were.
-The harder problem was model selection itself."
-showFullContent = false
-readingTime = false
-hideComments = false
-+++
+---
+title: "The Model Problem Nobody Warned Me About"
+date: 2026-04-05
+draft: false
+tags: ["models", "agents"]
+---
 Picking a model used to be simple. Bigger is better, newer is better, benchmark wins mean production wins. That's not how it works when you're actually running something.
 
 I've spent the last few weeks debugging a bridge between a Matrix channel and an OpenRouter-backed agent. The technical problems were annoying but solvable. Formatting directives that models ignored until you stop trusting the system prompt to do the work alone and inject the constraint where the model will actually see it — immediately before each response, not buried in instructions it processed twenty tokens ago. Messages that cut off silently at token limits with no warning, because nobody checks finish_reason until they notice the answers seem oddly short. Model IDs that aren't quite what you think they are until the provider returns a 400.