github
diff --git a/‎crates/consistent-hashing/README.md‎
Lines changed: 23 additions & 1 deletion b/‎crates/consistent-hashing/README.md‎
Lines changed: 23 additions & 1 deletion
diff --git a/‎crates/consistent-hashing/benchmarks/performance.rs‎
Lines changed: 31 additions & 4 deletions b/‎crates/consistent-hashing/benchmarks/performance.rs‎
Lines changed: 31 additions & 4 deletions
diff --git a/‎crates/consistent-hashing/examples/bounded_load.rs‎
Lines changed: 140 additions & 0 deletions b/‎crates/consistent-hashing/examples/bounded_load.rs‎
Lines changed: 140 additions & 0 deletions
@@ -17,7 +17,7 @@ where `N` is the number of nodes and `R` is the number of replicas.
 
 | Algorithm               | Lookup per key<br>(no replication)                                       | Node add/remove | Memory         | Lookup with replication                       |
 |-------------------------|--------------------------------------------------------------------------|-----------------|----------------|-----------------------------------------------|
-| Hash ring (with vnodes) | O(log N): binary search over N points; O(1): with specialized structures | O(log N)        | O(N)           | O(log N + R): Take next R distinct successors |
+| Hash ring (with vnodes) | O(log(V·N)): binary search; V = 100–200 virtual nodes per physical node  | O(V·log(V·N))   | O(V·N)         | O(log(V·N) + R): walk to next R distinct nodes |
 | Rendezvous              | O(N): max score                                                          | O(1)            | O(N) node list | O(N log R): pick top R scores                 |
 | Jump consistent hash    | O(log(N)) expected                                                       | 0               | O(1)           | O(R log N)                                    |
 | AnchorHash              | O(1) expected                                                            | O(1)            | O(N)           | Not native                                    |
@@ -37,6 +37,28 @@ Why replication matters
 - Distributes read/write load across multiple owners, reducing hotspots.
 - Enables fast recovery and higher tail-latency resilience.
 
+## Applications beyond replication
+
+The `ConsistentChooseK` iterator produces a per-key ranking of all `n` nodes in priority order — consistently and with zero memory overhead. This ranking is a strict superset of simple replication and enables drop-in replacements for several well-known algorithms that traditionally require maintaining expensive data structures such as hash rings.
+
+### Bounded-load consistent hashing
+
+[Consistent Hashing with Bounded Loads](https://research.google/pubs/pub46580/) (Mirrokni et al., 2018) caps the maximum load any single node may receive. When a key's preferred node is full, it overflows to the next candidate. Classic implementations walk a hash ring to find successors, requiring O(V·N) memory for the ring where V is the number of virtual nodes per physical node (typically V > 100–200 for acceptable load variance). Lookups cost O(log(V·N)) via binary search.
+
+With `ConsistentChooseK`, the ranking iterator directly yields each key's preference list on the fly — no ring required. Assignment becomes: iterate tokens round by round, and for each token advance its ranking iterator until a node with remaining capacity is found. This achieves the same bounded-load guarantees with O(k) for k keys and O(k) time to extract the k-th key.
+
+See [`examples/bounded_load.rs`](examples/bounded_load.rs) for a working implementation.
+
+### Power of two choices
+
+The [power of two choices](https://www.eecs.harvard.edu/~michaelm/postscripts/mythesis.pdf) paradigm (Mitzenmacher, 2001; Azar et al., 1999) assigns each key to the least-loaded of two (or d) randomly chosen nodes. This reduces maximum load from O(log n / log log n) to O(log log n / log d) with high probability.
+
+Traditionally this requires drawing d independent random nodes per key. However, the original algorithm ignores the corner case where multiple independent hash functions collide on the same node, effectively reducing the number of distinct choices below d. With `ConsistentChooseK`, the first d elements from the ranking iterator are guaranteed to be distinct nodes. The choices are also consistent across time — the same key always considers the same d candidates — so reassignment only happens when a node actually joins or leaves.
+
+### Priority-based failover
+
+In active-passive or tiered architectures, each key needs a deterministic failover order. The ranking iterator provides exactly this: the first node is the primary, the second is the hot standby, and so on. When a node fails, the next node in the ranking takes over — consistently for all keys that had the failed node at the same rank position, and without any coordination or ring rebalancing.
+
 ## ConsistentChooseK algorithm
 
 The following functions summarize the core algorithmic innovation as a minimal Rust excerpt.
 
@@ -1,5 +1,5 @@
 use std::{
-    hash::{DefaultHasher, Hash, Hasher},
+    hash::{DefaultHasher, Hash},
     hint::black_box,
     time::Duration,
 };
@@ -36,12 +36,11 @@ fn throughput_benchmark(c: &mut Criterion) {
                 b.iter_batched(
                     || &keys,
                     |keys| {
-                        let mut res = Vec::with_capacity(k);
                         for key in keys {
                             let mut h = DefaultHasher::default();
                             key.hash(&mut h);
                             black_box(
-                                ConsistentChooseKHasher::new(h, k).prev_with_vec(*n + k, &mut res),
+                                ConsistentChooseKHasher::new_with_k(h, *n + k, k),
                             );
                         }
                     },
@@ -53,13 +52,41 @@ fn throughput_benchmark(c: &mut Criterion) {
     group.finish();
 }
 
+fn append_vs_new_with_k(c: &mut Criterion) {
+    let mut group = c.benchmark_group("append_vs_new_with_k");
+    group.plot_config(PlotConfiguration::default().summary_scale(AxisScale::Logarithmic));
+    for n in [10usize, 100, 1000, 10000] {
+        for k in [2, 3, 10, 100] {
+            group.bench_function(
+                BenchmarkId::new(format!("new_with_k/k_{k}"), n),
+                |b| {
+                    b.iter(|| {
+                        let h = DefaultHasher::default();
+                        black_box(ConsistentChooseKHasher::new_with_k(h, n + k, k));
+                    })
+                },
+            );
+            group.bench_function(BenchmarkId::new(format!("append/k_{k}"), n), |b| {
+                b.iter(|| {
+                    let h = DefaultHasher::default();
+                    let mut iter = ConsistentChooseKHasher::new(h, n + k);
+                    black_box(for _ in 0..k {
+                        iter.grow_k();
+                    })
+                })
+            });
+        }
+    }
+    group.finish();
+}
+
 criterion_group!(
     name = benches;
     config = Criterion::default()
                 .warm_up_time(Duration::from_millis(500))
                 .measurement_time(Duration::from_millis(4000))
                 .nresamples(1000);
 
-    targets = throughput_benchmark,
+    targets = throughput_benchmark, append_vs_new_with_k,
 );
 criterion_main!(benches);
@@ -0,0 +1,140 @@
+//! Bounded-load consistent hashing example.
+//!
+//! Pure consistent hashing selects each node with equal probability, but for
+//! small workloads (e.g. 64 tokens across 24 machines) random variance causes
+//! highly skewed assignments. This example layers a capacity cap on top of
+//! ConsistentChooseK to enforce near-perfect balance.
+//!
+//! Assignment uses round-robin over replicas: first assign every token's
+//! most-preferred machine, then every token's second-preferred, etc. This
+//! ensures all tokens compete fairly for each replica round.
+//!
+//! Run with:  cargo run --example bounded_load
+
+use std::hash::{DefaultHasher, Hash};
+
+use consistent_hashing::ConsistentChooseKHasher;
+
+/// Round-robin bounded-load assignment.
+///
+/// For each replica round r = 0..k, iterate over all tokens and assign each
+/// to its next most-preferred node that still has capacity. This gives every
+/// token equal priority within each round.
+fn bounded_load_assign(
+    rankings: &[Vec<usize>],
+    k: usize,
+    n: usize,
+    max_load: usize,
+) -> (Vec<Vec<usize>>, Vec<usize>) {
+    let mut load = vec![0usize; n];
+    let num_tokens = rankings.len();
+    let mut assignments = vec![Vec::with_capacity(k); num_tokens];
+    let mut cursors = vec![0usize; num_tokens];
+
+    for _round in 0..k {
+        for (token, ranking) in rankings.iter().enumerate() {
+            while cursors[token] < ranking.len() {
+                let node = ranking[cursors[token]];
+                cursors[token] += 1;
+                if load[node] < max_load {
+                    load[node] += 1;
+                    assignments[token].push(node);
+                    break;
+                }
+            }
+        }
+    }
+    (assignments, load)
+}
+
+fn main() {
+    let num_tokens: usize = 64;
+    let k: usize = 2; // replicas per token
+    let n: usize = 24; // machines
+    let total = num_tokens * k;
+    let cap = total.div_ceil(n); // ceil(128/24) = 6
+
+    println!("Parameters: {num_tokens} tokens, k={k} replicas, {n} machines");
+    println!("Total assignments: {total},  capacity cap per machine: {cap}");
+    println!(
+        "Perfect balance: {}×{} + {}×{}\n",
+        n - total % n,
+        total / n,
+        total % n,
+        total / n + 1
+    );
+
+    // ── Unbounded ────────────────────────────────────────────────────────
+    let unbounded: Vec<Vec<usize>> = (0..num_tokens as u64)
+        .map(|key| {
+            let mut h = DefaultHasher::default();
+            key.hash(&mut h);
+            ConsistentChooseKHasher::new(h, n).take(k).collect()
+        })
+        .collect();
+    let mut unbounded_load = vec![0usize; n];
+    for a in &unbounded {
+        for &node in a {
+            unbounded_load[node] += 1;
+        }
+    }
+
+    // ── Bounded (round-robin) ────────────────────────────────────────────
+    let rankings: Vec<Vec<usize>> = (0..num_tokens as u64)
+        .map(|key| {
+            let mut h = DefaultHasher::default();
+            key.hash(&mut h);
+            ConsistentChooseKHasher::new(h, n).collect()
+        })
+        .collect();
+    let (bounded, bounded_load) = bounded_load_assign(&rankings, k, n, cap);
+
+    // ── Display ──────────────────────────────────────────────────────────
+    println!("{:<12} {:>10} {:>10}", "Machine", "Unbounded", "Bounded");
+    println!("{:-<12} {:->10} {:->10}", "", "", "");
+    for i in 0..n {
+        println!(
+            "{:<12} {:>10} {:>10}",
+            i, unbounded_load[i], bounded_load[i]
+        );
+    }
+
+    let ub_min = *unbounded_load.iter().min().unwrap();
+    let ub_max = *unbounded_load.iter().max().unwrap();
+    let b_min = *bounded_load.iter().min().unwrap();
+    let b_max = *bounded_load.iter().max().unwrap();
+    println!("{:-<12} {:->10} {:->10}", "", "", "");
+    println!(
+        "{:<12} {:>10} {:>10}",
+        "spread",
+        ub_max - ub_min,
+        b_max - b_min
+    );
+
+    // ── Consistency check: what happens when we add one machine? ─────────
+    let n2 = n + 1;
+    let cap2 = (num_tokens * k).div_ceil(n2);
+    let rankings2: Vec<Vec<usize>> = (0..num_tokens as u64)
+        .map(|key| {
+            let mut h = DefaultHasher::default();
+            key.hash(&mut h);
+            ConsistentChooseKHasher::new(h, n2).collect()
+        })
+        .collect();
+    let (bounded2, _) = bounded_load_assign(&rankings2, k, n2, cap2);
+
+    let mut changes = 0;
+    for (before, after) in bounded.iter().zip(bounded2.iter()) {
+        for node in before {
+            if !after.contains(node) {
+                changes += 1;
+            }
+        }
+    }
+    println!("\nConsistency: adding machine {n} → {n2}");
+    println!(
+        "  {changes}/{total} assignments changed ({:.1}%),  ideal ≈ {:.1}%",
+        changes as f64 / total as f64 * 100.0,
+        k as f64 / n2 as f64 * 100.0
+    );
+}