@@ -43,52 +43,66 @@ raglogs explains incidents.
4343## The killer commands
4444
4545``` bash
46- raglogs explain --since 30m
47- ```
48-
49- ```
50- Incident summary
51-
52- Window: 2026-03-12 22:00:00 UTC → 2026-03-12 22:30:00 UTC
53- Services affected: api, billing-worker
54- Primary issue: Stripe signature verification failed for endpoint /webhooks/stripe
55- Secondary effects: POST /api/checkout 500 Internal Server Error — upstream billing error (39 events)
56- Likely trigger: Deploy completed for billing-worker v2.4.1 at 21:58:15 UTC
57-
58- Evidence:
59- - 184 similar errors in billing-worker
60- - No comparable error volume in prior 24h baseline
61- - First error spike occurred 2m after deploy trigger
62- - Endpoint '/webhooks/stripe' referenced in 100% of primary failures
63- - 39 checkout 500s in api began after webhook error spike
64-
65- Confidence: medium-high
46+ raglogs explain --since 2h
47+ ```
48+
49+ ```
50+ ╭──────────────────────────────────────────────────────── raglogs explain ─────────────────────────────────────────────────────────╮
51+ │ Incident summary │
52+ │ │
53+ │ Window: 2026-03-12T22:33:30 to 2026-03-12T23:33:30 │
54+ │ │
55+ │ Services affected: billing-worker, api │
56+ │ │
57+ │ Primary issue: A surge of 184 Stripe signature verification failures occurred in the billing-worker service at the │
58+ │ /webhooks/stripe endpoint, starting about 2 minutes after deployment of billing-worker version v2.4.1. │
59+ │ │
60+ │ Secondary effects: Following the primary failures, the api service experienced 39 checkout requests returning 500 Internal Server │
61+ │ Errors due to upstream billing errors, along with 25 checkout requests showing high latency. Additionally, billing-worker logged │
62+ │ webhook retry attempts for failed events. │
63+ │ │
64+ │ Likely trigger: Deployment of billing-worker version v2.4.1 at 22:38:29, immediately followed by application start, appears to │
65+ │ have introduced the Stripe signature verification failures. │
66+ │ │
67+ │ Confidence: high │
68+ ╰───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
6669```
6770
6871``` bash
6972raglogs timeline --since 2h
7073```
7174
7275```
73- Incident timeline 2026-03-12 21:58:00 UTC → 2026-03-12 23:58:00 UTC
76+ 22:38:29 deploy Deploy completed for billing-worker version v2.4.1 · deployment-controller
77+ 22:38:30 startup Application started billing-worker v2.4.1 on port 8080 · billing-worker
7478
75- 21:58:14 deploy Deploy completed for billing-worker version v2.4.1 · deployment-controller
76- 21:58:15 startup Application started billing-worker v2.4.1 on port 8080 · billing-worker
79+ 22:40:31 error ↑ Stripe signature verification failed for endpoint /webhooks/stripe
80+ 184 events · billing-worker · 49 min span
7781
78- 22:00:10 error ↑ Stripe signature verification failed for endpoint /webhooks/stripe
79- 184 events · billing-worker · 49 min span
82+ 22:42:00 effect POST /api/checkout 500 Internal Server Error — upstream billing error
83+ 39 events · api · 48 min span
84+ 22:42:50 effect Webhook retries (2 retry events)
85+ 2 events · billing-worker
8086
81- 22:01:27 effect POST /api/checkout 200 OK latency=<duration> (high latency detected)
82- 25 events · api · 43 min span
83-
84- 22:01:49 effect Webhook retries (2 retry events)
85- 2 events · billing-worker
87+ 22:45:29 effect POST /api/checkout 200 OK latency=<duration> (high latency detected)
88+ 25 events · api · 44 min span
89+ ```
8690
87- 22:02:56 effect POST /api/checkout 500 Internal Server Error — upstream billing error
88- 39 events · api · 45 min span
91+ ``` sh
92+ raglogs ask ' why did stripe fail?'
93+ ```
8994
90- 22:11:25 symptom Webhook queue growing, 251 events pending processing
91- 2 events · billing-worker · 30 min span
95+ ```
96+ ╭─────────────────────────────────────────────────────────── raglogs ask ───────────────────────────────────────────────────────────╮
97+ │ Stripe failed because the signature verification for incoming webhook requests to the /webhooks/stripe endpoint failed │
98+ │ repeatedly. This caused the billing-worker service to reject or fail processing Stripe webhook events, likely disrupting payment │
99+ │ or billing workflows. The errors were consistently observed between 22:54 and 23:30 UTC on 2026-03-12. │
100+ │ │
101+ │ Key supporting evidence: │
102+ │ - 500 errors logged with the message "Stripe signature verification failed for endpoint /webhooks/stripe" │
103+ │ - Errors occurred in the billing-worker service │
104+ │ - Time window of errors: 2026-03-12T22:54:49 to 2026-03-12T23:30:29 UTC │
105+ ╰───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
92106```
93107
94108` explain ` answers ** what happened** .
@@ -98,6 +112,8 @@ Together they work like `git log` and `git blame` — but for incidents.
98112
99113Both outputs are fully deterministic. No LLM required.
100114
115+ ` ask ` answers ** questions you didn’t think to ask ahead of time** .
116+
101117---
102118
103119## Why raglogs
0 commit comments