Commit fa1f8cb

chore: remove recommendation section and internal architecture references
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Committed-By-Agent: claude
1 parent 0095fb6 commit fa1f8cb

1 file changed

Lines changed: 2 additions & 9 deletions

docs/plans/2026-05-11-cache-sidecar-architecture.md

@@ -147,7 +147,7 @@ The current approach is time-based pruning with a fixed buffer. There is no way
 
 **Core framing:** For paying customers, denying valid usage is WORSE than allowing brief overage. Overage is billable and self-corrects on the next checkpoint. Denial loses revenue and erodes trust. Exception: free-tier and fraud-prevention contexts need strict enforcement (never permissive).
 
-**Background:** Metronome processes events across multiple partitions with variable lag. Out-of-order processing is real. External apps may send events the sidecar never sees.
+**Background:** Event processing has variable lag and out-of-order delivery is real. External apps may send events the sidecar never sees.
 
 ### Comparison
 
@@ -187,20 +187,13 @@ Query Metronome's usage API to confirm specific events appear in aggregated usag
 
 ### Watermark-based
 
-Metronome exposes `watermark_low` / `watermark_high` in balance responses. Prune events <= `watermark_low` (confirmed across all partitions). Events between low and high are ambiguous.
+Metronome exposes `watermark_low` / `watermark_high` in balance responses. Prune events <= `watermark_low` (confirmed processed). Events between low and high are ambiguous.
 
 - **Overage risk:** None in strict mode. Tiny in lenient mode (ambiguous-zone events).
 - **False denial risk:** None.
 - **Right choice:** Long-term production steady-state. Precise, minimal drift, no polling overhead.
 - **Deploy:** Blocked. Watermark metadata in balance responses is pending vendor support. Once available, days to integrate.
 
-### Recommendation
-
-- **Default (paying customers):** Time-based pruning + TTL floor in lenient mode. Brief overage is billable and self-corrects. Ship immediately.
-- **Strict (free-tier / fraud):** Same strategy, conservative tuning -- long TTL (10 min+), short buffer, strict mode. Deny rather than permit.
-- **Long-term:** Watermark-based. Precise, safe, minimal drift. Upgrade when available.
-- **High-value VIPs:** Event confirmation. Zero drift, provably correct. Reserve for accounts where API cost is justified.
-
 ---
 
 ## Pricing Strategy (as implemented)
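
Reviewer sketch (not part of the commit): the watermark-based rule in the second hunk could look roughly like the Python below. Everything here is an assumption made for illustration. `PendingEvent`, its `position` field, `prune_pending`, and the `lenient` flag are hypothetical names, and the watermarks are treated as integers comparable to an event's position. The doc says watermark metadata is still pending vendor support, so the real field types are unknown. The sketch also assumes that strict mode keeps ambiguous-zone events in the local cache while lenient mode prunes them, which is one reading of the overage bullet.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PendingEvent:
    """Hypothetical locally cached usage event awaiting confirmation."""
    event_id: str
    position: int  # assumed ordering key, comparable to the watermarks
    amount: int


def prune_pending(events, watermark_low, watermark_high, lenient=False):
    """Return the events that should stay in the local cache.

    Events at or below watermark_low are treated as confirmed and dropped.
    Events in (watermark_low, watermark_high] are ambiguous: kept in strict
    mode, dropped in lenient mode (assumed semantics, see lead-in).
    """
    if watermark_low > watermark_high:
        raise ValueError("watermark_low must not exceed watermark_high")
    cutoff = watermark_high if lenient else watermark_low
    return [e for e in events if e.position > cutoff]


events = [
    PendingEvent("a", 5, 1),
    PendingEvent("b", 10, 1),
    PendingEvent("c", 15, 1),
    PendingEvent("d", 20, 1),
]
strict_kept = prune_pending(events, watermark_low=10, watermark_high=15)
lenient_kept = prune_pending(events, watermark_low=10, watermark_high=15, lenient=True)
```

With these inputs, strict mode keeps events `c` and `d`, and lenient mode keeps only `d`. Under the stated assumptions, the ambiguous event `c` is the only one whose handling differs between the two modes.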
