You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/plans/2026-05-11-cache-sidecar-architecture.md
+2-9Lines changed: 2 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -147,7 +147,7 @@ The current approach is time-based pruning with a fixed buffer. There is no way
147
147
148
148
**Core framing:** For paying customers, denying valid usage is WORSE than allowing brief overage. Overage is billable and self-corrects on the next checkpoint. Denial loses revenue and erodes trust. Exception: free-tier and fraud-prevention contexts need strict enforcement (never permissive).
149
149
150
-
**Background:**Metronome processes events across multiple partitions with variable lag. Out-of-order processing is real. External apps may send events the sidecar never sees.
150
+
**Background:**Event processing has variable lag and out-of-order delivery is real. External apps may send events the sidecar never sees.
151
151
152
152
### Comparison
153
153
@@ -187,20 +187,13 @@ Query Metronome's usage API to confirm specific events appear in aggregated usag
187
187
188
188
### Watermark-based
189
189
190
-
Metronome exposes `watermark_low` / `watermark_high` in balance responses. Prune events <= `watermark_low` (confirmed across all partitions). Events between low and high are ambiguous.
190
+
Metronome exposes `watermark_low` / `watermark_high` in balance responses. Prune events <= `watermark_low` (confirmed processed). Events between low and high are ambiguous.
191
191
192
192
-**Overage risk:** None in strict mode. Tiny in lenient mode (ambiguous-zone events).
193
193
-**False denial risk:** None.
194
194
-**Right choice:** Long-term production steady-state. Precise, minimal drift, no polling overhead.
195
195
-**Deploy:** Blocked. Watermark metadata in balance responses is pending vendor support. Once available, days to integrate.
196
196
197
-
### Recommendation
198
-
199
-
-**Default (paying customers):** Time-based pruning + TTL floor in lenient mode. Brief overage is billable and self-corrects. Ship immediately.
200
-
-**Strict (free-tier / fraud):** Same strategy, conservative tuning -- long TTL (10 min+), short buffer, strict mode. Deny rather than permit.
201
-
-**Long-term:** Watermark-based. Precise, safe, minimal drift. Upgrade when available.
202
-
-**High-value VIPs:** Event confirmation. Zero drift, provably correct. Reserve for accounts where API cost is justified.
0 commit comments