docs: three-latencies explanation (producer / subscriber / end-to-end)

Nik Samokhvalov · Nik Samokhvalov · commit defdc4d0132c · 2026-04-18T16:58:55.000-07:00
Name the three distinct latencies in any Postgres queue and explain why
PgQue's batch-ticker model makes #1 and #2 sub-ms while bounding #3 by
the tick cadence (not by load). Addresses recurring confusion about the
apparent contradiction between sub-ms consumer-path latency and the
~1 s end-to-end delivery bound.

- README.md: brief paragraph + 3-bullet list, new "Three latencies"
  subsection between "Latency trade-off" and "Comparison".
- docs/pgq-concepts.md: detailed version with per-latency physics,
  tick-frequency trade-off table, comparison to pgmq's poll-on-demand
  model, when-to-pick guidance, provenance link.

Uses actual pgque.sql column names: queue_ticker_max_lag (3s),
queue_ticker_idle_period (1min idle-decelerator), queue_ticker_max_count
(500). The 1-second cadence comes from the pg_cron schedule set by
pgque.start(), not from queue_ticker_idle_period.
diff --git a/README.md b/README.md
@@ -16,6 +16,7 @@
 
 - [Why PgQue](#why-pgque)
 - [Latency trade-off](#latency-trade-off)
+- [Three latencies](#three-latencies)
 - [Comparison](#comparison)
 - [Installation](#installation)
 - [Roles and grants](#roles-and-grants)
@@ -65,6 +66,14 @@ Ways to reduce delivery latency: tune tick frequency and queue thresholds; use `
 
 If your top priority is single-digit-millisecond dispatch, PgQue is the wrong tool. If your priority is **stability under load without bloat**, that is where PgQue fits.
 
+## Three latencies
+
+"Queue latency" is three numbers, not one. PgQue makes #1 and #2 sub-ms and bounds #3 by whatever tick cadence you configure:
+
+1. **Producer latency** — `send` / `insert_event`. Sub-ms.
+2. **Subscriber latency** — `next_batch` + `get_batch_events`. Sub-ms.
+3. **End-to-end delivery** — `send` → consumer visibility. ≈ tick period. **Tunable, not floored.** Default `pg_cron` at 1 s → ~500 ms average; sub-ms e2e is achievable with aggressive ticking (staggered `pg_cron` jobs, in-tick `pg_sleep` loop — see [concept doc](docs/pgq-concepts.md#three-latencies)). Trade-off: more ticks mean more `tick`/`subscription` metadata churn, which at very high rates warrants rotating those metadata tables as well. Under sustained load the ticker keeps firing at its configured rate — batch size absorbs the load, e2e does not inflate.
+
 ## Comparison
 
 | Feature | PgQue | PgQ | PGMQ | River | Que | pg-boss |
diff --git a/docs/pgq-concepts.md b/docs/pgq-concepts.md
@@ -66,3 +66,81 @@ below; the function auto-prefixes `queue_` internally.
 > produce huge batches consumers can't handle.
 
 — Kreen & Pihlak, PgCon 2009
+
+## Three latencies
+
+"Queue latency" is three numbers, not one. Conflating them confuses
+design discussion — each reflects a different bottleneck, and PgQue's
+trade-offs only make sense once they are separated.
+
+| # | Name | What it is | PgQue | Bottleneck |
+|---|---|---|---|---|
+| 1 | Producer | `send` / `insert_event` → durable | sub-ms (~high-µs; ~86k ev/s PL/pgSQL single-INSERT in prelim bench) | WAL flush, triggers |
+| 2 | Subscriber | `next_batch` + `get_batch_events` returning an already-built batch | sub-ms (snapshot SELECT, no SKIP LOCKED scan; ~2.4M ev/s consumer read) | how "next work" is located |
+| 3 | End-to-end | `send` → consumer visibility | ≈ tick period + consumer poll interval | ticker cadence (tunable) |
+
+#3 is the one application behavior depends on (SLAs, retries, perceived
+staleness). You can have #1 and #2 in microseconds and still have #3 in
+seconds: #3 is set by the tick cadence, not by how fast #1 and #2 are.
+
+### End-to-end is tunable, not floored
+
+**The default 1-second tick is a `pg_cron` schedule, not a design floor.**
+PgQue's e2e is bounded by whatever tick cadence you configure. Sub-ms
+e2e is achievable with more aggressive ticking:
+
+- **Staggered `pg_cron` jobs.** Schedule N jobs at `1 second` each, offset
+  by `1/N` s via a shared coordinating lock, to get effective tick periods
+  down to ~10 ms (N=100) or ~1 ms (N=1000).
+- **In-tick sleep loop.** A single cron job that loops 100 times per
+  invocation, ticking and then calling `pg_sleep(0.01)`: same effective
+  cadence, fewer scheduler wakeups.
+- **Native sub-second cron.** Future `pg_cron` may support sub-second
+  schedules directly, removing the workaround.
+
+Trade-off at very high tick rates: every tick UPDATEs `pgque.tick` and
+`pgque.subscription`, so more ticks mean more dead tuples on those metadata
+tables while an old xmin is held. The event tables stay bloat-free
+(TRUNCATE rotation); metadata-table bloat is a separate problem, handled
+by extending the same rotation pattern to those tables. At sufficiently
+high tick rates that mitigation becomes necessary.
+
+Rough guidance:
+
+| Effective tick period | Average e2e | Notes |
+|---|---|---|
+| `1 second` (default) | ~500 ms | pgqd-compatible, minimal metadata churn |
+| `250 ms` | ~125 ms | 4× metadata writes, still cheap |
+| `10 ms` staggered | ~5 ms | needs coordinated jobs or in-tick sleep |
+| `1 ms` staggered | sub-ms | kHz-range; metadata-table rotation recommended |
+
+Per-queue thresholds (`queue_ticker_max_lag` default `3 seconds`,
+`queue_ticker_max_count` default 500, `queue_ticker_idle_period` default
+`1 minute` idle-decelerator) are set through `pgque.set_queue_config()`.
+
+### Load behavior: PgQue vs. UPDATE/DELETE designs
+
+The key property of the tick model: **e2e does not grow with load.** The
+ticker fires at its configured rate regardless of backlog, so under
+pressure batch size grows (up to `queue_ticker_max_count`) — not e2e.
+
+UPDATE/DELETE-based systems use a different model: a consumer call
+returns messages immediately, marking them consumed via UPDATE (claim)
+and DELETE (ack) rather than advancing a snapshot cursor. So e2e ≈
+consumer poll interval — sub-ms when the consumer is actively polling,
+up to the poll interval otherwise. Drain rate is
+`batch_size / poll_interval`; if producers outrun that, queue depth
+grows and e2e grows with it until consumers scale out. Separately, those
+UPDATEs and DELETEs produce dead tuples that autovacuum cannot reclaim
+under MVCC pressure (long-running tx, idle-in-transaction, lagging
+logical replication slot, physical standby with
+`hot_standby_feedback=on`) — the bloat failure mode
+[PgQue avoids by construction](../README.md#why-pgque).
+
+### When to pick which
+
+Pick PgQue if you want batching efficiency and bloat immunity and can
+configure a tick cadence that meets your SLA (the default 1 s or a faster
+one). Pick an UPDATE/DELETE-based system if you need always-hot
+single-digit-ms delivery for synchronous request/response patterns, MVCC
+pressure is low in your environment, and that system's API fits better.
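
The "Rough guidance" table in the docs/pgq-concepts.md hunk follows from simple arithmetic: if events arrive uniformly at random within a tick interval, the wait until the next tick averages half the tick period and never exceeds one full period. Below is a minimal Python sketch of that model. It assumes uniform arrivals and ignores the consumer poll interval and the time a tick itself takes; the function names are illustrative and are not part of PgQue.

```python
def average_e2e_ms(tick_period_ms: float) -> float:
    """Mean wait until the next tick, assuming uniform arrival within a tick."""
    if tick_period_ms <= 0:
        raise ValueError("tick period must be positive")
    return tick_period_ms / 2.0


def worst_case_e2e_ms(tick_period_ms: float) -> float:
    """An event that just missed a tick waits one full period."""
    if tick_period_ms <= 0:
        raise ValueError("tick period must be positive")
    return tick_period_ms


def metadata_write_multiplier(tick_period_ms: float,
                              baseline_ms: float = 1000.0) -> float:
    """Tick-driven metadata writes relative to the default 1-second cadence."""
    if tick_period_ms <= 0:
        raise ValueError("tick period must be positive")
    return baseline_ms / tick_period_ms


if __name__ == "__main__":
    # The four tick periods listed in the rough-guidance table.
    for period in (1000.0, 250.0, 10.0, 1.0):
        print(f"tick={period:g} ms  avg={average_e2e_ms(period):g} ms  "
              f"worst={worst_case_e2e_ms(period):g} ms  "
              f"writes x{metadata_write_multiplier(period):g}")
```

Under this model the 1 ms row averages 0.5 ms, which is where the table's "sub-ms" entry comes from, and the 250 ms row writes metadata four times as often as the default.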