diff --git a/README.md b/README.md
index b4d9121..f9d9e1d 100644
--- a/README.md
+++ b/README.md
@@ -6,15 +6,16 @@
   </picture>
 </p>
 
-# SoftTorch
+# Soft differentiable programming in PyTorch
 
 [![PyPI version](https://img.shields.io/pypi/v/softtorch)](https://pypi.org/project/softtorch/)
 [![Python version](https://img.shields.io/pypi/pyversions/softtorch)](https://pypi.org/project/softtorch/)
 [![License](https://img.shields.io/pypi/l/softtorch)](https://github.com/a-paulus/softtorch/blob/main/LICENSE)
+[![arXiv paper](https://img.shields.io/badge/arXiv-paper-salmon)](https://arxiv.org/abs/2603.08824)
 
 Looking for JAX? See [SoftJAX](https://github.com/a-paulus/softjax).
 
-## In a nutshell
+## What is SoftTorch?
 
 SoftTorch provides soft differentiable drop-in replacements for traditionally non-differentiable functions in [PyTorch](https://pytorch.org), including
 
@@ -28,8 +29,7 @@ All operators offer multiple modes (controlling smoothness or boundedness of the
 
 All operators also support straight-through estimation, using the non-differentiable function in the forward pass and the soft relaxation in the backward pass.
 
-SoftTorch functions are drop-in replacements for their non-differentiable PyTorch counterparts.
-Special care is needed for functions operating on indices, as we relax discrete indices into distributions over indices, which modifies the shape of returned/accepted values.
+*Note, while SoftTorch is designed to provide direct drop-in replacements for PyTorch's operators, soft axis-wise operators return a probability distribution over indices (instead of an index), effectively changing the shape of the function's output.*
 
 
 ## Installation
@@ -44,351 +44,152 @@ pip install softtorch
 Available at https://a-paulus.github.io/softtorch/.
 
 
-## Quick example
-```python
-import torch
-import softtorch as st
-
-x = torch.tensor([-0.2, -1.0, 0.3, 1.0])
-
-# Elementwise functions
-print("\nTorch absolute:", torch.abs(x))
-print("SoftTorch absolute (hard mode):", st.abs(x, mode="hard"))
-print("SoftTorch absolute (soft mode):", st.abs(x))
-
-print("\nTorch clamp:", torch.clamp(x, -0.5, 0.5))
-print("SoftTorch clamp (hard mode):", st.clamp(x, -0.5, 0.5, mode="hard"))
-print("SoftTorch clamp (soft mode):", st.clamp(x, -0.5, 0.5))
-
-print("\nTorch heaviside:", torch.heaviside(x, torch.tensor(0.5)))
-print("SoftTorch heaviside (hard mode):", st.heaviside(x, mode="hard"))
-print("SoftTorch heaviside (soft mode):", st.heaviside(x))
-
-print("\nTorch ReLU:", torch.nn.functional.relu(x))
-print("SoftTorch ReLU (hard mode):", st.relu(x, mode="hard"))
-print("SoftTorch ReLU (soft mode):", st.relu(x))
-
-print("\nTorch round:", torch.round(x))
-print("SoftTorch round (hard mode):", st.round(x, mode="hard"))
-print("SoftTorch round (soft mode):", st.round(x))
-
-print("\nTorch sign:", torch.sign(x))
-print("SoftTorch sign (hard mode):", st.sign(x, mode="hard"))
-print("SoftTorch sign (soft mode):", st.sign(x))
-```
-```
-Torch absolute: tensor([0.2000, 1.0000, 0.3000, 1.0000])
-SoftTorch absolute (hard mode): tensor([0.2000, 1.0000, 0.3000, 1.0000])
-SoftTorch absolute (soft mode): tensor([0.1523, 0.9999, 0.2715, 0.9999])
-
-Torch clamp: tensor([-0.2000, -0.5000,  0.3000,  0.5000])
-SoftTorch clamp (hard mode): tensor([-0.2000, -0.5000,  0.3000,  0.5000])
-SoftTorch clamp (soft mode): tensor([-0.1952, -0.4993,  0.2873,  0.4993])
-
-Torch heaviside: tensor([0., 0., 1., 1.])
-SoftTorch heaviside (hard mode): tensor([0., 0., 1., 1.])
-SoftTorch heaviside (soft mode): tensor([0.1192, 0.0000, 0.9526, 1.0000])
-
-Torch ReLU: tensor([0.0000, 0.0000, 0.3000, 1.0000])
-SoftTorch ReLU (hard mode): tensor([0.0000, 0.0000, 0.3000, 1.0000])
-SoftTorch ReLU (soft mode): tensor([0.0127, 0.0000, 0.3049, 1.0000])
-
-Torch round: tensor([-0., -1.,  0.,  1.])
-SoftTorch round (hard mode): tensor([-0., -1.,  0.,  1.])
-SoftTorch round (soft mode): tensor([-0.0465, -1.0000,  0.1189,  1.0000])
-
-Torch sign: tensor([-1., -1.,  1.,  1.])
-SoftTorch sign (hard mode): tensor([-1., -1.,  1.,  1.])
-SoftTorch sign (soft mode): tensor([-0.7616, -0.9999,  0.9051,  0.9999])
-```
+## Quick examples
 
+**Robust median regression:**
+Minimize the median absolute residual to be robust to outliers.
 ```python
-# Tensor-valued operators
-print("\nTorch max:", torch.max(x))
-print("SoftTorch max (hard mode):", st.max(x, mode="hard"))
-print("SoftTorch max (soft mode):", st.max(x))
-
-print("\nTorch min:", torch.min(x))
-print("SoftTorch min (hard mode):", st.min(x, mode="hard"))
-print("SoftTorch min (soft mode):", st.min(x))
-
-print("\nTorch sort:", torch.sort(x).values)
-print("SoftTorch sort (hard mode):", st.sort(x, mode="hard").values)
-print("SoftTorch sort (soft mode):", st.sort(x).values)
-
-print("\nTorch quantile:", torch.quantile(x, q=0.2))
-print("SoftTorch quantile (hard mode):", st.quantile(x, q=0.2, mode="hard"))
-print("SoftTorch quantile (soft mode):", st.quantile(x, q=0.2))
-
-print("\nTorch median:", torch.median(x))
-print("SoftTorch median (hard mode):", st.median(x, mode="hard"))
-print("SoftTorch median (soft mode):", st.median(x))
-
-print("\nTorch topk:", torch.topk(x, k=3).values)
-print("SoftTorch topk (hard mode):", st.topk(x, k=3, mode="hard").values)
-print("SoftTorch topk (soft mode):", st.topk(x, k=3).values)
-
-print("\nTorch rank:", torch.argsort(torch.argsort(x)))
-print("SoftTorch rank (hard mode):", st.rank(x, mode="hard", descending=False))
-print("SoftTorch rank (soft mode):", st.rank(x, descending=False))
-```
-```
-Torch max: tensor(1.)
-SoftTorch max (hard mode): tensor(1.)
-SoftTorch max (soft mode): tensor(0.8874)
-
-Torch min: tensor(-1.)
-SoftTorch min (hard mode): tensor(-1.)
-SoftTorch min (soft mode): tensor(-0.8996)
-
-Torch sort: tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (hard mode): tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (soft mode): tensor([-0.8792, -0.1641,  0.2767,  0.8738])
+import torch, softtorch as st
 
-Torch quantile: tensor(-0.5200)
-SoftTorch quantile (hard mode): tensor(-0.5200)
-SoftTorch quantile (soft mode): tensor(-0.4501)
+torch.manual_seed(0)
+X = torch.randn(20, 3)
+w_true = torch.tensor([1.0, -2.0, 0.5])
+y = X @ w_true
+y[0] = 1e6  # inject outlier
 
-Torch median: tensor(-0.2000)
-SoftTorch median (hard mode): tensor(-0.2000)
-SoftTorch median (soft mode): tensor(-0.1641)
+def median_regression_loss(w, X, y, mode="smooth"):
+    residuals = y - X @ w
+    return st.median(st.abs(residuals, mode=mode), mode=mode)
 
-Torch topk: tensor([ 1.0000,  0.3000, -0.2000])
-SoftTorch topk (hard mode): tensor([ 1.0000,  0.3000, -0.2000])
-SoftTorch topk (soft mode): tensor([ 0.8738,  0.2767, -0.1641])
+w = torch.zeros(3, requires_grad=True)
+hard_loss = median_regression_loss(w, X, y, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, w)[0])
+soft_loss = median_regression_loss(w, X, y, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, w)[0])
 
-Torch rank: tensor([1, 0, 2, 3])
-SoftTorch rank (hard mode): tensor([2., 1., 3., 4.])
-SoftTorch rank (soft mode): tensor([1.9950, 1.0548, 3.0239, 3.9228])
-```
-
-```python
-# Sort: sweep over methods
-print("\nTorch sort:", torch.sort(x).values)
-print("SoftTorch sort (softsort):", st.sort(x, method="softsort", softness=0.1).values)
-print("SoftTorch sort (neuralsort):", st.sort(x, method="neuralsort", softness=0.1).values)
-print("SoftTorch sort (fast_soft_sort):", st.sort(x, method="fast_soft_sort", softness=2.0).values)
-print("SoftTorch sort (ot):", st.sort(x, method="ot", softness=0.1).values)
-print("SoftTorch sort (sorting_network):", st.sort(x, method="sorting_network", softness=0.1).values)
-
-# Sort: sweep over modes
-print("\nTorch sort:", torch.sort(x).values)
-for mode in ["hard", "smooth", "c0", "c1", "c2"]:
-    print(f"SoftTorch sort ({mode}):", st.sort(x, softness=0.5, mode=mode).values)
+w = torch.zeros(3)
+for _ in range(50):
+    w.requires_grad_(True)
+    loss = median_regression_loss(w, X, y)
+    g = torch.autograd.grad(loss, w)[0]
+    w = (w - 0.1 * g).detach()
+print("Learned w:", w, " (true:", w_true, ")")
 ```
 ```
-Torch sort: tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (softsort): tensor([-0.8996, -0.1705,  0.2847,  0.8874])
-SoftTorch sort (neuralsort): tensor([-0.8792, -0.1641,  0.2767,  0.8738])
-SoftTorch sort (fast_soft_sort): tensor([-0.7462, -0.1971,  0.2938,  0.8569])
-SoftTorch sort (ot): tensor([-0.7324, -0.2396,  0.3286,  0.7434])
-SoftTorch sort (sorting_network): tensor([-0.7999, -0.2672,  0.3847,  0.7863])
-
-Torch sort: tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (hard): tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (smooth): tensor([-0.6057, -0.1997,  0.2729,  0.6281])
-SoftTorch sort (c0): tensor([-1.0000, -0.6313,  0.6525,  0.9824])
-SoftTorch sort (c1): tensor([-0.9982, -0.5432,  0.5814,  0.9837])
-SoftTorch sort (c2): tensor([-0.9978, -0.4905,  0.5425,  0.9903])
+Hard grad: tensor([ 0.2103,  0.1772, -0.8305])
+Soft grad: tensor([ 0.0731,  0.7100, -0.2970])
+Learned w: tensor([ 1.0000, -2.0000,  0.5000])  (true: tensor([ 1.0000, -2.0000,  0.5000]) )
 ```
 
+**Top-k feature selection:**
+Discover which features of a trained model are important.
 ```python
-# Operators returning indices
-print("\nTorch argmax:", torch.argmax(x))
-print("SoftTorch argmax (hard mode):", st.argmax(x, mode="hard"))
-print("SoftTorch argmax (soft mode):", st.argmax(x))
-
-print("\nTorch argmin:", torch.argmin(x))
-print("SoftTorch argmin (hard mode):", st.argmin(x, mode="hard"))
-print("SoftTorch argmin (soft mode):", st.argmin(x))
-
-print("\nTorch argquantile:", "Not implemented in standard PyTorch")
-print("SoftTorch argquantile (hard mode):", st.argquantile(x, q=0.2, mode="hard"))
-print("SoftTorch argquantile (soft mode):", st.argquantile(x, q=0.2))
-
-print("\nTorch argmedian:", torch.median(x, dim=0).indices)
-print("SoftTorch argmedian (hard mode):", st.median(x, mode="hard", dim=0).indices)
-print("SoftTorch argmedian (soft mode):", st.median(x, dim=0).indices)
-
-print("\nTorch argsort:", torch.argsort(x))
-print("SoftTorch argsort (hard mode):", st.argsort(x, mode="hard"))
-print("SoftTorch argsort (soft mode):", st.argsort(x))
-
-print("\nTorch argtopk:", torch.topk(x, k=3).indices)
-print("SoftTorch argtopk (hard mode):", st.topk(x, k=3, mode="hard").indices)
-print("SoftTorch argtopk (soft mode):", st.topk(x, k=3).indices)
-```
-```
-Torch argmax: tensor(3)
-SoftTorch argmax (hard mode): tensor([0., 0., 0., 1.])
-SoftTorch argmax (soft mode): tensor([0.0215, 0.0022, 0.1176, 0.8586])
-
-Torch argmin: tensor(1)
-SoftTorch argmin (hard mode): tensor([0., 1., 0., 0.])
-SoftTorch argmin (soft mode): tensor([0.0922, 0.8885, 0.0169, 0.0023])
-
-Torch argquantile: Not implemented in standard PyTorch
-SoftTorch argquantile (hard mode): tensor([0.6000, 0.4000, 0.0000, 0.0000])
-SoftTorch argquantile (soft mode): tensor([0.5403, 0.3693, 0.0902, 0.0001])
-
-Torch argmedian: tensor(0)
-SoftTorch argmedian (hard mode): tensor([1., 0., 0., 0.])
-SoftTorch argmedian (soft mode): tensor([0.8009, 0.0491, 0.1498, 0.0002])
-
-Torch argsort: tensor([1, 0, 2, 3])
-SoftTorch argsort (hard mode): tensor([[0., 1., 0., 0.],
-        [1., 0., 0., 0.],
-        [0., 0., 1., 0.],
-        [0., 0., 0., 1.]])
-SoftTorch argsort (soft mode): tensor([[0.1494, 0.8496, 0.0009, 0.0000],
-        [0.8009, 0.0491, 0.1498, 0.0002],
-        [0.1418, 0.0001, 0.7899, 0.0681],
-        [0.0011, 0.0000, 0.1784, 0.8205]])
-
-Torch argtopk: tensor([3, 2, 0])
-SoftTorch argtopk (hard mode): tensor([[0., 0., 0., 1.],
-        [0., 0., 1., 0.],
-        [1., 0., 0., 0.]])
-SoftTorch argtopk (soft mode): tensor([[0.0011, 0.0000, 0.1784, 0.8205],
-        [0.1418, 0.0001, 0.7899, 0.0681],
-        [0.8009, 0.0491, 0.1498, 0.0002]])
-```
-
+n_features, k = 10, 3
+torch.manual_seed(42)
+X = torch.randn(100, n_features)
+w_model = torch.tensor([0, 2.0, 0, -1.5, 0, 0, 0, 5.0, 0, 0])
+y = X @ w_model + 0.1 * torch.randn(100)
+
+def feature_selection_loss(g, X, y, w_model, mode="smooth"):
+    _, soft_idx = st.topk(g, k=k, mode=mode, gated_grad=False)
+    mask = soft_idx.sum(dim=0)
+    y_pred = (X * mask) @ w_model
+    return torch.mean(st.abs(y_pred - y))
+
+g = torch.zeros(n_features, requires_grad=True)
+hard_loss = feature_selection_loss(g, X, y, w_model, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, g)[0] if hard_loss.requires_grad else torch.zeros_like(g))
+soft_loss = feature_selection_loss(g, X, y, w_model, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, g)[0])
+
+g = torch.zeros(n_features)
+for _ in range(5):
+    g.requires_grad_(True)
+    loss = feature_selection_loss(g, X, y, w_model)
+    g_grad = torch.autograd.grad(loss, g)[0]
+    g = (g - 0.001 * g_grad).detach()
+print("Selected features:", torch.topk(g, k=k).indices)
+```
+```
+Hard grad: tensor([0., 0., 0., 0., 0., 0., 0., 0., 0., 0.])
+Soft grad: tensor([  2359.3386,     62.9980,   2359.3386,   -890.2852,   2359.3386,
+          2359.3386,   2359.3386, -15688.0829,   2359.3386,   2359.3386])
+Selected features: tensor([7, 3, 1])
+```
+
+**Differentiable threshold filtering:**
+Learn a threshold that gates inputs.
 ```python
-y = torch.tensor([0.2, -0.5, 0.5, -1.0])
-
-# Comparison operators
-print("\nTorch greater:", torch.greater(x, y))
-print("SoftTorch greater (hard mode):", st.greater(x, y, mode="hard"))
-print("SoftTorch greater (soft mode):", st.greater(x, y))
-
-print("\nTorch greater equal:", torch.greater_equal(x, y))
-print("SoftTorch greater equal (hard mode):", st.greater_equal(x, y, mode="hard"))
-print("SoftTorch greater equal (soft mode):", st.greater_equal(x, y))
-
-print("\nTorch less:", torch.less(x, y))
-print("SoftTorch less (hard mode):", st.less(x, y, mode="hard"))
-print("SoftTorch less (soft mode):", st.less(x, y))
+x = torch.tensor([0.2, 0.8, 0.5, 1.2, 0.1])
+target_sum = 2.0  # sum of values above threshold = 2.0 (i.e. 0.8 + 1.2)
 
-print("\nTorch less equal:", torch.less_equal(x, y))
-print("SoftTorch less equal (hard mode):", st.less_equal(x, y, mode="hard"))
-print("SoftTorch less equal (soft mode):", st.less_equal(x, y))
+def filter_loss(t, x, target, mode="smooth"):
+    mask = st.greater(x, t, mode=mode)
+    return (torch.sum(mask * x) - target) ** 2
 
-print("\nTorch eq:", torch.eq(x, y))
-print("SoftTorch eq (hard mode):", st.eq(x, y, mode="hard"))
-print("SoftTorch eq (soft mode):", st.eq(x, y))
+t = torch.tensor(0.0, requires_grad=True)
+hard_loss = filter_loss(t, x, target_sum, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, t)[0] if hard_loss.requires_grad else torch.zeros_like(t))
+soft_loss = filter_loss(t, x, target_sum, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, t)[0])
 
-print("\nTorch not equal:", torch.not_equal(x, y))
-print("SoftTorch not equal (hard mode):", st.not_equal(x, y, mode="hard"))
-print("SoftTorch not equal (soft mode):", st.not_equal(x, y))
-
-print("\nTorch isclose:", torch.isclose(x, y))
-print("SoftTorch isclose (hard mode):", st.isclose(x, y, mode="hard"))
-print("SoftTorch isclose (soft mode):", st.isclose(x, y))
+t = torch.tensor(0.0)
+for _ in range(20):
+    t.requires_grad_(True)
+    loss = filter_loss(t, x, target_sum)
+    t_grad = torch.autograd.grad(loss, t)[0]
+    t = (t - 0.1 * t_grad).detach()
+print("Learned threshold:", t)
 ```
 ```
-Torch greater: tensor([False, False, False,  True])
-SoftTorch greater (hard mode): tensor([0., 0., 0., 1.])
-SoftTorch greater (soft mode): tensor([0.0180, 0.0067, 0.1192, 1.0000])
-
-Torch greater equal: tensor([False, False, False,  True])
-SoftTorch greater equal (hard mode): tensor([0., 0., 0., 1.])
-SoftTorch greater equal (soft mode): tensor([0.0180, 0.0067, 0.1192, 1.0000])
-
-Torch less: tensor([ True,  True,  True, False])
-SoftTorch less (hard mode): tensor([1., 1., 1., 0.])
-SoftTorch less (soft mode): tensor([0.9820, 0.9933, 0.8808, 0.0000])
-
-Torch less equal: tensor([ True,  True,  True, False])
-SoftTorch less equal (hard mode): tensor([1., 1., 1., 0.])
-SoftTorch less equal (soft mode): tensor([0.9820, 0.9933, 0.8808, 0.0000])
-
-Torch eq: tensor([False, False, False, False])
-SoftTorch eq (hard mode): tensor([0., 0., 0., 0.])
-SoftTorch eq (soft mode): tensor([0.0414, 0.0143, 0.3580, 0.0000])
-
-Torch not equal: tensor([True, True, True, True])
-SoftTorch not equal (hard mode): tensor([1., 1., 1., 1.])
-SoftTorch not equal (soft mode): tensor([0.9586, 0.9857, 0.6420, 1.0000])
-
-Torch isclose: tensor([False, False, False, False])
-SoftTorch isclose (hard mode): tensor([0., 0., 0., 0.])
-SoftTorch isclose (soft mode): tensor([0.0414, 0.0143, 0.3580, 0.0000])
+Hard grad: tensor(0.)
+Soft grad: tensor(-0.6600)
+Learned threshold: tensor(0.6211)
 ```
 
+**Rule-based classifier:**
+Learn decision boundaries `[lo, hi]` for a rule using soft logic and straight-through estimation. The rule is true if any element of a feature is inside `[lo, hi]`.
 ```python
-# Logical operators
-fuzzy_a = torch.tensor([0.1, 0.2, 0.8, 1.0])
-fuzzy_b = torch.tensor([0.7, 0.3, 0.1, 0.9])
-bool_a = fuzzy_a >= 0.5
-bool_b = fuzzy_b >= 0.5
-
-print("\nTorch AND:", torch.logical_and(bool_a, bool_b))
-print("SoftTorch AND:", st.logical_and(fuzzy_a, fuzzy_b))
-
-print("\nTorch OR:", torch.logical_or(bool_a, bool_b))
-print("SoftTorch OR:", st.logical_or(fuzzy_a, fuzzy_b))
+x = torch.tensor([[0.2, 0.8], [0.5, 0.3], [0.9, 0.1], [0.4, 0.7], [0.1, 0.4], [0.2, 0.7], [0.4, 0.1], [0.4, 0.7],
+               [0.7, 0.29], [0.3, 0.3], [0.61, 0.25], [0.4, 0.6], [0.0, 0.1], [0.5, 0.3], [0.4, 0.9], [0.1, 0.57]])
+labels = torch.tensor([0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0,
+                    0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0])
 
-print("\nTorch NOT:", torch.logical_not(bool_a))
-print("SoftTorch NOT:", st.logical_not(fuzzy_a))
+@st.st
+def rule_loss(params, x, labels, mode="smooth"):
+    lo, hi = params[0], params[1]
+    above = st.greater(x, lo, mode=mode)
+    below = st.less(x, hi, mode=mode)
+    in_range = st.logical_and(above, below)
+    preds = st.any(in_range, dim=-1)
+    return ((preds - labels) ** 2).sum()
 
-print("\nTorch XOR:", torch.logical_xor(bool_a, bool_b))
-print("SoftTorch XOR:", st.logical_xor(fuzzy_a, fuzzy_b))
+params = torch.tensor([0.0, 1.0], requires_grad=True)
+hard_loss = rule_loss(params, x, labels, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, params)[0] if hard_loss.requires_grad else torch.zeros_like(params))
+soft_loss = rule_loss(params, x, labels, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, params)[0])
 
-print("\nTorch ALL:", torch.all(bool_a))
-print("SoftTorch ALL:", st.all(fuzzy_a))
-
-print("\nTorch ANY:", torch.any(bool_a))
-print("SoftTorch ANY:", st.any(fuzzy_a))
-
-# Selection operators
-print("\nTorch Where:", torch.where(bool_a, x, y))
-print("SoftTorch Where:", st.where(fuzzy_a, x, y))
+params = torch.tensor([0.0, 1.0])
+for _ in range(20):
+    params.requires_grad_(True)
+    loss = rule_loss(params, x, labels)
+    p_grad = torch.autograd.grad(loss, params)[0]
+    params = (params - 0.01 * p_grad).detach()
+print("Learned [lo, hi]:", params)
 ```
 ```
-Torch AND: tensor([False, False, False,  True])
-SoftTorch AND: tensor([0.0700, 0.0600, 0.0800, 0.9000])
-
-Torch OR: tensor([ True, False,  True,  True])
-SoftTorch OR: tensor([0.7300, 0.4400, 0.8200, 1.0000])
-
-Torch NOT: tensor([ True,  True, False, False])
-SoftTorch NOT: tensor([0.9000, 0.8000, 0.2000, 0.0000])
-
-Torch XOR: tensor([ True, False,  True, False])
-SoftTorch XOR: tensor([0.6411, 0.3464, 0.7256, 0.1000])
-
-Torch ALL: tensor(False)
-SoftTorch ALL: tensor(0.0160)
-
-Torch ANY: tensor(True)
-SoftTorch ANY: tensor(1.)
-
-Torch Where: tensor([ 0.2000, -0.5000,  0.3000,  1.0000])
-SoftTorch Where: tensor([ 0.1600, -0.6000,  0.3400,  1.0000])
+Hard grad: tensor([0., 0.])
+Soft grad: tensor([-4.2777,  1.4152])
+Learned [lo, hi]: tensor([0.2925, 0.5999])
 ```
 
-```python
-# Straight-through operators: Use hard function on forward and soft on backward
-print("Straight-through ReLU:", st.relu_st(x))
-print("Straight-through sort:", st.sort_st(x).values)
-print("Straight-through argtopk:", st.topk_st(x, k=3).indices)
-print("Straight-through greater:", st.greater_st(x, y))
-# And many more...
-```
-```
-Straight-through ReLU: tensor([0.0000, 0.0000, 0.3000, 1.0000])
-Straight-through sort: tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-Straight-through argtopk: tensor([[0., 0., 0., 1.],
-        [0., 0., 1., 0.],
-        [1., 0., 0., 0.]])
-Straight-through greater: tensor([0., 0., 0., 1.])
-```
+<img src="docs/examples/quick_example_optimization.svg" alt="Optimization trajectories" width="100%">
 
 
 ## Citation
 
-If this library helped your academic work, please consider citing:
+If this library helped your academic work, please consider citing: ([arXiv link](https://arxiv.org/abs/2603.08824))
 
 ```bibtex
 @article{paulus2026softjax,
@@ -400,14 +201,14 @@ If this library helped your academic work, please consider citing:
 }
 ```
 
-Also consider starring the project [on GitHub](https://github.com/a-paulus/softtorch)!
+(Also consider starring the project [on GitHub](https://github.com/a-paulus/softtorch))
 
 Special thanks and credit go to [Patrick Kidger](https://kidger.site) for the awesome [JAX repositories](https://github.com/patrick-kidger) that served as the basis for the documentation of this project.
 
 
 ## Feedback
 
-This project is still relatively young, if you have any suggestions for improvement or other feedback, please [reach out](mailto:paulus.anselm@gmail.com) or raise a GitHub issue!
+If you have any suggestions for improvement or other feedback, please [reach out](mailto:paulus.anselm@gmail.com) or raise a GitHub issue!
 
 
 ## See also
diff --git a/docs/.citation.md b/docs/.citation.md
deleted file mode 100644
index b8ab08a..0000000
--- a/docs/.citation.md
+++ /dev/null
@@ -1,13 +0,0 @@
-If this library helped your academic work, please consider citing:
-
-```bibtex
-@article{paulus2026softjax,
-  title={{SoftJAX} \& {SoftTorch}: Empowering Automatic Differentiation Libraries with Informative Gradients},
-  author={Paulus, Anselm and Geist, A.\ Ren\'e and Musil, V\'it and Hoffmann, Sebastian and Beker, Onur and Martius, Georg},
-  journal={arXiv preprint},
-  year={2026},
-  eprint={2603.08824}
-}
-```
-
-Also consider starring the project [on GitHub](https://github.com/a-paulus/softtorch)!
diff --git a/docs/quick_example.py b/docs/examples/long_example.py
similarity index 100%
rename from docs/quick_example.py
rename to docs/examples/long_example.py
diff --git a/docs/manifold_points.ipynb b/docs/examples/manifold_points.ipynb
similarity index 100%
rename from docs/manifold_points.ipynb
rename to docs/examples/manifold_points.ipynb
diff --git a/docs/paper_examples.py b/docs/examples/paper_examples.py
similarity index 100%
rename from docs/paper_examples.py
rename to docs/examples/paper_examples.py
diff --git a/docs/examples/quick_example.py b/docs/examples/quick_example.py
new file mode 100644
index 0000000..9572c18
--- /dev/null
+++ b/docs/examples/quick_example.py
@@ -0,0 +1,200 @@
+import matplotlib.pyplot as plt
+import torch
+import softtorch as st
+
+torch.set_printoptions(precision=4, sci_mode=False)
+torch.set_default_dtype(torch.float64)
+
+
+# 1. Median regression
+# Minimize the median absolute residual to be robust to outliers.
+
+torch.manual_seed(0)
+X = torch.randn(20, 3)
+w_true = torch.tensor([1.0, -2.0, 0.5])
+y = X @ w_true
+y[0] = 1e6  # inject outlier
+
+
+def median_regression_loss(w, X, y, mode="smooth"):
+    residuals = y - X @ w
+    return st.median(st.abs(residuals, mode=mode), mode=mode)
+
+
+w = torch.zeros(3, requires_grad=True)
+hard_loss = median_regression_loss(w, X, y, mode="hard")
+print("=== 1. Robust median regression ===")
+print("Hard grad:", torch.autograd.grad(hard_loss, w)[0])
+soft_loss = median_regression_loss(w, X, y, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, w)[0])
+
+ws = []
+w = torch.zeros(3)
+for _ in range(50):
+    ws.append(w.tolist())
+    w.requires_grad_(True)
+    loss = median_regression_loss(w, X, y)
+    g = torch.autograd.grad(loss, w)[0]
+    w = (w - 0.1 * g).detach()
+print("Learned w:", w, " (true:", w_true, ")")
+
+
+# 2. Top-k feature selection
+# Discover which features of a trained model are important.
+# 10 features total, only 3 informative — learn gating scores to find them.
+
+n_features, k = 10, 3
+torch.manual_seed(42)
+X = torch.randn(100, n_features)
+w_model = torch.tensor([0, 2.0, 0, -1.5, 0, 0, 0, 5.0, 0, 0])
+y = X @ w_model + 0.1 * torch.randn(100)
+
+
+def feature_selection_loss(g, X, y, w_model, mode="smooth"):
+    _, soft_idx = st.topk(g, k=k, mode=mode, gated_grad=False)
+    mask = soft_idx.sum(dim=0)
+    y_pred = (X * mask) @ w_model
+    return torch.mean(st.abs(y_pred - y))
+
+
+g = torch.zeros(n_features, requires_grad=True)
+print("\n=== 2. Top-k feature selection ===")
+hard_loss = feature_selection_loss(g, X, y, w_model, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, g)[0] if hard_loss.requires_grad else torch.zeros_like(g))
+soft_loss = feature_selection_loss(g, X, y, w_model, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, g)[0])
+
+gs = []
+g = torch.zeros(n_features)
+for _ in range(5):
+    gs.append(g.tolist())
+    g.requires_grad_(True)
+    loss = feature_selection_loss(g, X, y, w_model)
+    g_grad = torch.autograd.grad(loss, g)[0]
+    g = (g - 0.001 * g_grad).detach()
+print("Selected features:", torch.topk(g, k=k).indices)
+
+
+# 3. Differentiable filter
+# Learn a threshold that gates inputs.
+
+x_filt = torch.tensor([0.2, 0.8, 0.5, 1.2, 0.1])
+target_sum = 2.0  # sum of values above threshold should equal 2.0 (= 0.8 + 1.2)
+
+
+def filter_loss(t, x, target, mode="smooth"):
+    mask = st.greater(x, t, mode=mode)
+    return (torch.sum(mask * x) - target) ** 2
+
+
+t = torch.tensor(0.0, requires_grad=True)
+print("\n=== 3. Differentiable threshold filtering ===")
+hard_loss = filter_loss(t, x_filt, target_sum, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, t)[0] if hard_loss.requires_grad else torch.zeros_like(t))
+soft_loss = filter_loss(t, x_filt, target_sum, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, t)[0])
+
+ts = []
+t = torch.tensor(0.0)
+for _ in range(20):
+    ts.append(float(t))
+    t.requires_grad_(True)
+    loss = filter_loss(t, x_filt, target_sum)
+    t_grad = torch.autograd.grad(loss, t)[0]
+    t = (t - 0.1 * t_grad).detach()
+print("Learned threshold:", t)
+
+
+# 4. Differentiable rule-based classifier
+# Learn decision boundaries: classify positive if ANY feature is in [lo, hi].
+# The rule is true if any element of a feature is inside `[lo, hi]`.
+x_rules = torch.tensor([[0.2, 0.8], [0.5, 0.3], [0.9, 0.1], [0.4, 0.7],
+                         [0.1, 0.4], [0.2, 0.7], [0.4, 0.1], [0.4, 0.7],
+                         [0.7, 0.29], [0.3, 0.3], [0.61, 0.25], [0.4, 0.6],
+                         [0.0, 0.1], [0.5, 0.3], [0.4, 0.9], [0.1, 0.57]])
+labels = torch.tensor([0.0, 1.0, 0.0, 1.0,
+                        1.0, 0.0, 1.0, 1.0,
+                        0.0, 1.0, 0.0, 1.0,
+                        0.0, 1.0, 1.0, 1.0])
+
+
+@st.st
+def rule_loss(params, x, labels, mode="smooth"):
+    lo, hi = params[0], params[1]
+    above = st.greater(x, lo, mode=mode)
+    below = st.less(x, hi, mode=mode)
+    in_range = st.logical_and(above, below)
+    preds = st.any(in_range, dim=-1)
+    return ((preds - labels) ** 2).sum()
+
+
+params = torch.tensor([0.0, 1.0], requires_grad=True)
+print("\n=== 4. Differentiable rule-based classifier ===")
+hard_loss = rule_loss(params, x_rules, labels, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, params)[0] if hard_loss.requires_grad else torch.zeros_like(params))
+soft_loss = rule_loss(params, x_rules, labels, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, params)[0])
+
+params_hist = []
+params = torch.tensor([0.0, 1.0])
+for _ in range(20):
+    params_hist.append(params.tolist())
+    params.requires_grad_(True)
+    loss = rule_loss(params, x_rules, labels)
+    p_grad = torch.autograd.grad(loss, params)[0]
+    params = (params - 0.01 * p_grad).detach()
+print("Learned [lo, hi]:", params)
+
+
+# ── Plot ─────────────────────────────────────────────────────────────────────
+palette = ["#00bfff", "#e7a1e5", "#6dd1ac", "#e1be6a", "#368f80", "#889fd9", "#f4836d", "#cecece"]
+informative = {i for i, v in enumerate(w_model) if v != 0}
+
+fig, axes = plt.subplots(1, 4, figsize=(8, 2.5))
+
+for ax in axes:
+    ax.spines["top"].set_visible(False)
+    ax.spines["right"].set_visible(False)
+    ax.tick_params(labelsize=7)
+    ax.set_xlabel("Iteration", fontsize=7)
+    ax.yaxis.set_major_locator(plt.MaxNLocator(3))
+    ax.margins(x=0)
+
+ws = torch.tensor(ws)
+for i in range(ws.shape[1]):
+    axes[0].plot(ws[:, i], color=palette[i], label=f"w[{i}]")
+    axes[0].axhline(w_true[i], color=palette[i], ls="--", alpha=0.3)
+axes[0].set_title("Median regression", fontsize=8)
+axes[0].legend(fontsize=6)
+
+gs = torch.tensor(gs)
+for i in range(gs.shape[1]):
+    if i in informative:
+        if i == 1:
+            kw = {"lw": 1.5, "color": "#6dd1ac", "label": "Informative"}
+        else:
+            kw = {"lw": 1.5, "color": "#6dd1ac", "label": None}
+    else:
+        if i == 4:
+            kw = {"alpha": 0.2, "color": "#889fd9", "label": "Uninformative"}
+        else:
+            kw = {"alpha": 0.2, "color": "#889fd9", "label": None}
+    axes[1].plot(gs[:, i], **kw)
+axes[1].set_title("Top-k feature selection", fontsize=8)
+axes[1].legend(fontsize=6, title="Feature scores", title_fontsize=6)
+
+axes[2].plot(ts, color=palette[0])
+for xi in x_filt:
+    axes[2].axhline(xi, ls="--", color=palette[-1], alpha=0.5)
+axes[2].set_title("Threshold filtering", fontsize=8)
+
+params_hist = torch.tensor(params_hist)
+axes[3].plot(params_hist[:, 1], color=palette[0], label="higher bound")
+axes[3].plot(params_hist[:, 0], color=palette[2], label="lower bound")
+axes[3].axhline(0.3, ls="--", color=palette[2], alpha=0.5)
+axes[3].axhline(0.6, ls="--", color=palette[0], alpha=0.5)
+axes[3].set_title("Rule classifier", fontsize=8)
+axes[3].legend(fontsize=6)
+
+fig.tight_layout()
+fig.savefig("docs/examples/quick_example_optimization.svg", bbox_inches="tight", transparent=True)
diff --git a/docs/examples/quick_example_optimization.svg b/docs/examples/quick_example_optimization.svg
new file mode 100644
index 0000000..7c0f8ea
--- /dev/null
+++ b/docs/examples/quick_example_optimization.svg
@@ -0,0 +1,2096 @@
+<?xml version="1.0" encoding="utf-8" standalone="no"?>
+<!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"
+  "http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd">
+<svg xmlns:xlink="http://www.w3.org/1999/xlink" width="568.877969pt" height="172.268125pt" viewBox="0 0 568.877969 172.268125" xmlns="http://www.w3.org/2000/svg" version="1.1">
+ <metadata>
+  <rdf:RDF xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:cc="http://creativecommons.org/ns#" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
+   <cc:Work>
+    <dc:type rdf:resource="http://purl.org/dc/dcmitype/StillImage"/>
+    <dc:date>2026-04-07T13:18:58.074655</dc:date>
+    <dc:format>image/svg+xml</dc:format>
+    <dc:creator>
+     <cc:Agent>
+      <dc:title>Matplotlib v3.10.8, https://matplotlib.org/</dc:title>
+     </cc:Agent>
+    </dc:creator>
+   </cc:Work>
+  </rdf:RDF>
+ </metadata>
+ <defs>
+  <style type="text/css">*{stroke-linejoin: round; stroke-linecap: butt}</style>
+ </defs>
+ <g id="figure_1">
+  <g id="patch_1">
+   <path d="M 0 172.268125 
+L 568.877969 172.268125 
+L 568.877969 0 
+L 0 0 
+L 0 172.268125 
+z
+" style="fill: none"/>
+  </g>
+  <g id="axes_1">
+   <g id="patch_2">
+    <path d="M 31.197969 140.51875 
+L 140.511719 140.51875 
+L 140.511719 19.27875 
+L 31.197969 19.27875 
+L 31.197969 140.51875 
+z
+" style="fill: none"/>
+   </g>
+   <g id="matplotlib.axis_1">
+    <g id="xtick_1">
+     <g id="line2d_1">
+      <defs>
+       <path id="m1349860ead" d="M 0 0 
+L 0 3.5 
+" style="stroke: #000000; stroke-width: 0.8"/>
+      </defs>
+      <g>
+       <use xlink:href="#m1349860ead" x="31.197969" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_1">
+      <!-- 0 -->
+      <g transform="translate(28.971094 152.837656) scale(0.07 -0.07)">
+       <defs>
+        <path id="DejaVuSans-30" d="M 2034 4250 
+Q 1547 4250 1301 3770 
+Q 1056 3291 1056 2328 
+Q 1056 1369 1301 889 
+Q 1547 409 2034 409 
+Q 2525 409 2770 889 
+Q 3016 1369 3016 2328 
+Q 3016 3291 2770 3770 
+Q 2525 4250 2034 4250 
+z
+M 2034 4750 
+Q 2819 4750 3233 4129 
+Q 3647 3509 3647 2328 
+Q 3647 1150 3233 529 
+Q 2819 -91 2034 -91 
+Q 1250 -91 836 529 
+Q 422 1150 422 2328 
+Q 422 3509 836 4129 
+Q 1250 4750 2034 4750 
+z
+" transform="scale(0.015625)"/>
+       </defs>
+       <use xlink:href="#DejaVuSans-30"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_2">
+     <g id="line2d_2">
+      <g>
+       <use xlink:href="#m1349860ead" x="53.506897" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_2">
+      <!-- 10 -->
+      <g transform="translate(49.053147 152.837656) scale(0.07 -0.07)">
+       <defs>
+        <path id="DejaVuSans-31" d="M 794 531 
+L 1825 531 
+L 1825 4091 
+L 703 3866 
+L 703 4441 
+L 1819 4666 
+L 2450 4666 
+L 2450 531 
+L 3481 531 
+L 3481 0 
+L 794 0 
+L 794 531 
+z
+" transform="scale(0.015625)"/>
+       </defs>
+       <use xlink:href="#DejaVuSans-31"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(63.623047 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_3">
+     <g id="line2d_3">
+      <g>
+       <use xlink:href="#m1349860ead" x="75.815826" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_3">
+      <!-- 20 -->
+      <g transform="translate(71.362076 152.837656) scale(0.07 -0.07)">
+       <defs>
+        <path id="DejaVuSans-32" d="M 1228 531 
+L 3431 531 
+L 3431 0 
+L 469 0 
+L 469 531 
+Q 828 903 1448 1529 
+Q 2069 2156 2228 2338 
+Q 2531 2678 2651 2914 
+Q 2772 3150 2772 3378 
+Q 2772 3750 2511 3984 
+Q 2250 4219 1831 4219 
+Q 1534 4219 1204 4116 
+Q 875 4013 500 3803 
+L 500 4441 
+Q 881 4594 1212 4672 
+Q 1544 4750 1819 4750 
+Q 2544 4750 2975 4387 
+Q 3406 4025 3406 3419 
+Q 3406 3131 3298 2873 
+Q 3191 2616 2906 2266 
+Q 2828 2175 2409 1742 
+Q 1991 1309 1228 531 
+z
+" transform="scale(0.015625)"/>
+       </defs>
+       <use xlink:href="#DejaVuSans-32"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(63.623047 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_4">
+     <g id="line2d_4">
+      <g>
+       <use xlink:href="#m1349860ead" x="98.124754" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_4">
+      <!-- 30 -->
+      <g transform="translate(93.671004 152.837656) scale(0.07 -0.07)">
+       <defs>
+        <path id="DejaVuSans-33" d="M 2597 2516 
+Q 3050 2419 3304 2112 
+Q 3559 1806 3559 1356 
+Q 3559 666 3084 287 
+Q 2609 -91 1734 -91 
+Q 1441 -91 1130 -33 
+Q 819 25 488 141 
+L 488 750 
+Q 750 597 1062 519 
+Q 1375 441 1716 441 
+Q 2309 441 2620 675 
+Q 2931 909 2931 1356 
+Q 2931 1769 2642 2001 
+Q 2353 2234 1838 2234 
+L 1294 2234 
+L 1294 2753 
+L 1863 2753 
+Q 2328 2753 2575 2939 
+Q 2822 3125 2822 3475 
+Q 2822 3834 2567 4026 
+Q 2313 4219 1838 4219 
+Q 1578 4219 1281 4162 
+Q 984 4106 628 3988 
+L 628 4550 
+Q 988 4650 1302 4700 
+Q 1616 4750 1894 4750 
+Q 2613 4750 3031 4423 
+Q 3450 4097 3450 3541 
+Q 3450 3153 3228 2886 
+Q 3006 2619 2597 2516 
+z
+" transform="scale(0.015625)"/>
+       </defs>
+       <use xlink:href="#DejaVuSans-33"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(63.623047 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_5">
+     <g id="line2d_5">
+      <g>
+       <use xlink:href="#m1349860ead" x="120.433683" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_5">
+      <!-- 40 -->
+      <g transform="translate(115.979933 152.837656) scale(0.07 -0.07)">
+       <defs>
+        <path id="DejaVuSans-34" d="M 2419 4116 
+L 825 1625 
+L 2419 1625 
+L 2419 4116 
+z
+M 2253 4666 
+L 3047 4666 
+L 3047 1625 
+L 3713 1625 
+L 3713 1100 
+L 3047 1100 
+L 3047 0 
+L 2419 0 
+L 2419 1100 
+L 313 1100 
+L 313 1709 
+L 2253 4666 
+z
+" transform="scale(0.015625)"/>
+       </defs>
+       <use xlink:href="#DejaVuSans-34"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(63.623047 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="text_6">
+     <!-- Iteration -->
+     <g transform="translate(71.008828 163.612344) scale(0.07 -0.07)">
+      <defs>
+       <path id="DejaVuSans-49" d="M 628 4666 
+L 1259 4666 
+L 1259 0 
+L 628 0 
+L 628 4666 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-74" d="M 1172 4494 
+L 1172 3500 
+L 2356 3500 
+L 2356 3053 
+L 1172 3053 
+L 1172 1153 
+Q 1172 725 1289 603 
+Q 1406 481 1766 481 
+L 2356 481 
+L 2356 0 
+L 1766 0 
+Q 1100 0 847 248 
+Q 594 497 594 1153 
+L 594 3053 
+L 172 3053 
+L 172 3500 
+L 594 3500 
+L 594 4494 
+L 1172 4494 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-65" d="M 3597 1894 
+L 3597 1613 
+L 953 1613 
+Q 991 1019 1311 708 
+Q 1631 397 2203 397 
+Q 2534 397 2845 478 
+Q 3156 559 3463 722 
+L 3463 178 
+Q 3153 47 2828 -22 
+Q 2503 -91 2169 -91 
+Q 1331 -91 842 396 
+Q 353 884 353 1716 
+Q 353 2575 817 3079 
+Q 1281 3584 2069 3584 
+Q 2775 3584 3186 3129 
+Q 3597 2675 3597 1894 
+z
+M 3022 2063 
+Q 3016 2534 2758 2815 
+Q 2500 3097 2075 3097 
+Q 1594 3097 1305 2825 
+Q 1016 2553 972 2059 
+L 3022 2063 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-72" d="M 2631 2963 
+Q 2534 3019 2420 3045 
+Q 2306 3072 2169 3072 
+Q 1681 3072 1420 2755 
+Q 1159 2438 1159 1844 
+L 1159 0 
+L 581 0 
+L 581 3500 
+L 1159 3500 
+L 1159 2956 
+Q 1341 3275 1631 3429 
+Q 1922 3584 2338 3584 
+Q 2397 3584 2469 3576 
+Q 2541 3569 2628 3553 
+L 2631 2963 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-61" d="M 2194 1759 
+Q 1497 1759 1228 1600 
+Q 959 1441 959 1056 
+Q 959 750 1161 570 
+Q 1363 391 1709 391 
+Q 2188 391 2477 730 
+Q 2766 1069 2766 1631 
+L 2766 1759 
+L 2194 1759 
+z
+M 3341 1997 
+L 3341 0 
+L 2766 0 
+L 2766 531 
+Q 2569 213 2275 61 
+Q 1981 -91 1556 -91 
+Q 1019 -91 701 211 
+Q 384 513 384 1019 
+Q 384 1609 779 1909 
+Q 1175 2209 1959 2209 
+L 2766 2209 
+L 2766 2266 
+Q 2766 2663 2505 2880 
+Q 2244 3097 1772 3097 
+Q 1472 3097 1187 3025 
+Q 903 2953 641 2809 
+L 641 3341 
+Q 956 3463 1253 3523 
+Q 1550 3584 1831 3584 
+Q 2591 3584 2966 3190 
+Q 3341 2797 3341 1997 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-69" d="M 603 3500 
+L 1178 3500 
+L 1178 0 
+L 603 0 
+L 603 3500 
+z
+M 603 4863 
+L 1178 4863 
+L 1178 4134 
+L 603 4134 
+L 603 4863 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-6f" d="M 1959 3097 
+Q 1497 3097 1228 2736 
+Q 959 2375 959 1747 
+Q 959 1119 1226 758 
+Q 1494 397 1959 397 
+Q 2419 397 2687 759 
+Q 2956 1122 2956 1747 
+Q 2956 2369 2687 2733 
+Q 2419 3097 1959 3097 
+z
+M 1959 3584 
+Q 2709 3584 3137 3096 
+Q 3566 2609 3566 1747 
+Q 3566 888 3137 398 
+Q 2709 -91 1959 -91 
+Q 1206 -91 779 398 
+Q 353 888 353 1747 
+Q 353 2609 779 3096 
+Q 1206 3584 1959 3584 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-6e" d="M 3513 2113 
+L 3513 0 
+L 2938 0 
+L 2938 2094 
+Q 2938 2591 2744 2837 
+Q 2550 3084 2163 3084 
+Q 1697 3084 1428 2787 
+Q 1159 2491 1159 1978 
+L 1159 0 
+L 581 0 
+L 581 3500 
+L 1159 3500 
+L 1159 2956 
+Q 1366 3272 1645 3428 
+Q 1925 3584 2291 3584 
+Q 2894 3584 3203 3211 
+Q 3513 2838 3513 2113 
+z
+" transform="scale(0.015625)"/>
+      </defs>
+      <use xlink:href="#DejaVuSans-49"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(29.492188 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(68.701172 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(130.224609 0)"/>
+      <use xlink:href="#DejaVuSans-61" transform="translate(171.337891 0)"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(232.617188 0)"/>
+      <use xlink:href="#DejaVuSans-69" transform="translate(271.826172 0)"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(299.609375 0)"/>
+      <use xlink:href="#DejaVuSans-6e" transform="translate(360.791016 0)"/>
+     </g>
+    </g>
+   </g>
+   <g id="matplotlib.axis_2">
+    <g id="ytick_1">
+     <g id="line2d_6">
+      <defs>
+       <path id="m4787e62a98" d="M 0 0 
+L -3.5 0 
+" style="stroke: #000000; stroke-width: 0.8"/>
+      </defs>
+      <g>
+       <use xlink:href="#m4787e62a98" x="31.197969" y="116.638144" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_7">
+      <!-- −1.5 -->
+      <g transform="translate(7.2 119.297597) scale(0.07 -0.07)">
+       <defs>
+        <path id="DejaVuSans-2212" d="M 678 2272 
+L 4684 2272 
+L 4684 1741 
+L 678 1741 
+L 678 2272 
+z
+" transform="scale(0.015625)"/>
+        <path id="DejaVuSans-2e" d="M 684 794 
+L 1344 794 
+L 1344 0 
+L 684 0 
+L 684 794 
+z
+" transform="scale(0.015625)"/>
+        <path id="DejaVuSans-35" d="M 691 4666 
+L 3169 4666 
+L 3169 4134 
+L 1269 4134 
+L 1269 2991 
+Q 1406 3038 1543 3061 
+Q 1681 3084 1819 3084 
+Q 2600 3084 3056 2656 
+Q 3513 2228 3513 1497 
+Q 3513 744 3044 326 
+Q 2575 -91 1722 -91 
+Q 1428 -91 1123 -41 
+Q 819 9 494 109 
+L 494 744 
+Q 775 591 1075 516 
+Q 1375 441 1709 441 
+Q 2250 441 2565 725 
+Q 2881 1009 2881 1497 
+Q 2881 1984 2565 2268 
+Q 2250 2553 1709 2553 
+Q 1456 2553 1204 2497 
+Q 953 2441 691 2322 
+L 691 4666 
+z
+" transform="scale(0.015625)"/>
+       </defs>
+       <use xlink:href="#DejaVuSans-2212"/>
+       <use xlink:href="#DejaVuSans-31" transform="translate(83.789062 0)"/>
+       <use xlink:href="#DejaVuSans-2e" transform="translate(147.412109 0)"/>
+       <use xlink:href="#DejaVuSans-35" transform="translate(179.199219 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="ytick_2">
+     <g id="line2d_7">
+      <g>
+       <use xlink:href="#m4787e62a98" x="31.197969" y="61.529053" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_8">
+      <!-- 0.0 -->
+      <g transform="translate(13.065781 64.188506) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+       <use xlink:href="#DejaVuSans-2e" transform="translate(63.623047 0)"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(95.410156 0)"/>
+      </g>
+     </g>
+    </g>
+   </g>
+   <g id="line2d_8">
+    <path d="M 31.197969 61.529053 
+L 33.428862 61.797785 
+L 35.659754 62.00891 
+L 37.890647 62.111653 
+L 40.12154 62.052247 
+L 42.352433 61.780581 
+L 44.583326 61.273915 
+L 46.814219 60.570718 
+L 49.045112 59.736809 
+L 51.276004 58.821197 
+L 53.506897 57.85139 
+L 55.73779 56.841627 
+L 57.968683 55.799304 
+L 60.199576 54.728573 
+L 62.430469 53.632216 
+L 64.661362 52.512573 
+L 66.892254 51.371958 
+L 69.123147 50.212785 
+L 71.35404 49.037549 
+L 73.584933 47.848739 
+L 75.815826 46.648755 
+L 78.046719 45.439812 
+L 80.277612 44.223875 
+L 82.508504 43.002588 
+L 84.739397 41.777203 
+L 86.97029 40.54849 
+L 89.201183 39.31662 
+L 91.432076 38.081024 
+L 93.662969 36.840246 
+L 95.893862 35.591872 
+L 98.124754 34.332688 
+L 100.355647 33.059286 
+L 102.58654 31.769384 
+L 104.817433 30.464136 
+L 107.048326 29.152535 
+L 109.279219 27.863106 
+L 111.510112 26.677173 
+L 113.741004 25.746792 
+L 115.971897 25.167219 
+L 118.20279 24.941548 
+L 120.433683 24.845672 
+L 122.664576 24.812039 
+L 124.895469 24.797832 
+L 127.126362 24.792954 
+L 129.357254 24.79085 
+L 131.588147 24.790145 
+L 133.81904 24.789832 
+L 136.049933 24.789731 
+L 138.280826 24.789684 
+L 140.511719 24.78967 
+" clip-path="url(#p434b40172d)" style="fill: none; stroke: #00bfff; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_9">
+    <path d="M 31.197969 24.789659 
+L 140.511719 24.789659 
+" clip-path="url(#p434b40172d)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #00bfff; stroke-opacity: 0.3; stroke-width: 1.5"/>
+   </g>
+   <g id="line2d_10">
+    <path d="M 31.197969 61.529053 
+L 33.428862 64.137722 
+L 35.659754 66.734176 
+L 37.890647 69.291373 
+L 40.12154 71.78158 
+L 42.352433 74.181637 
+L 44.583326 76.480137 
+L 46.814219 78.689822 
+L 49.045112 80.835016 
+L 51.276004 82.934179 
+L 53.506897 84.99766 
+L 55.73779 87.030729 
+L 57.968683 89.036116 
+L 60.199576 91.015549 
+L 62.430469 92.970607 
+L 64.661362 94.903152 
+L 66.892254 96.815456 
+L 69.123147 98.710149 
+L 71.35404 100.590087 
+L 73.584933 102.458194 
+L 75.815826 104.317337 
+L 78.046719 106.170228 
+L 80.277612 108.019362 
+L 82.508504 109.866968 
+L 84.739397 111.714952 
+L 86.97029 113.56482 
+L 89.201183 115.417554 
+L 91.432076 117.273426 
+L 93.662969 119.131753 
+L 95.893862 120.990626 
+L 98.124754 122.846727 
+L 100.355647 124.695424 
+L 102.58654 126.531422 
+L 104.817433 128.350348 
+L 107.048326 130.151453 
+L 109.279219 131.935269 
+L 111.510112 133.636092 
+L 113.741004 134.777266 
+L 115.971897 134.916226 
+L 118.20279 134.971792 
+L 120.433683 134.993023 
+L 122.664576 135.002736 
+L 124.895469 135.005612 
+L 127.126362 135.007111 
+L 129.357254 135.007508 
+L 131.588147 135.007736 
+L 133.81904 135.007791 
+L 136.049933 135.007826 
+L 138.280826 135.007833 
+L 140.511719 135.007839 
+" clip-path="url(#p434b40172d)" style="fill: none; stroke: #e7a1e5; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_11">
+    <path d="M 31.197969 135.007841 
+L 140.511719 135.007841 
+" clip-path="url(#p434b40172d)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #e7a1e5; stroke-opacity: 0.3; stroke-width: 1.5"/>
+   </g>
+   <g id="line2d_12">
+    <path d="M 31.197969 61.529053 
+L 33.428862 60.437989 
+L 35.659754 59.411581 
+L 37.890647 58.462812 
+L 40.12154 57.620657 
+L 42.352433 56.937078 
+L 44.583326 56.443632 
+L 46.814219 56.100981 
+L 49.045112 55.837346 
+L 51.276004 55.599432 
+L 53.506897 55.357929 
+L 55.73779 55.098286 
+L 57.968683 54.813726 
+L 60.199576 54.501643 
+L 62.430469 54.161923 
+L 64.661362 53.7961 
+L 66.892254 53.406795 
+L 69.123147 52.997264 
+L 71.35404 52.571014 
+L 73.584933 52.131502 
+L 75.815826 51.681942 
+L 78.046719 51.225188 
+L 80.277612 50.763705 
+L 82.508504 50.299568 
+L 84.739397 49.8345 
+L 86.97029 49.369924 
+L 89.201183 48.907022 
+L 91.432076 48.446785 
+L 93.662969 47.990038 
+L 95.893862 47.537372 
+L 98.124754 47.088903 
+L 100.355647 46.643715 
+L 102.58654 46.198947 
+L 104.817433 45.748592 
+L 107.048326 45.282533 
+L 109.279219 44.787281 
+L 111.510112 44.254112 
+L 113.741004 43.740811 
+L 115.971897 43.41617 
+L 118.20279 43.249579 
+L 120.433683 43.197077 
+L 122.664576 43.172544 
+L 124.895469 43.164919 
+L 127.126362 43.16127 
+L 129.357254 43.16018 
+L 131.588147 43.159633 
+L 133.81904 43.159478 
+L 136.049933 43.159396 
+L 138.280826 43.159374 
+L 140.511719 43.159362 
+" clip-path="url(#p434b40172d)" style="fill: none; stroke: #6dd1ac; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_13">
+    <path d="M 31.197969 43.159356 
+L 140.511719 43.159356 
+" clip-path="url(#p434b40172d)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #6dd1ac; stroke-opacity: 0.3; stroke-width: 1.5"/>
+   </g>
+   <g id="patch_3">
+    <path d="M 31.197969 140.51875 
+L 31.197969 19.27875 
+" style="fill: none; stroke: #000000; stroke-width: 0.8; stroke-linejoin: miter; stroke-linecap: square"/>
+   </g>
+   <g id="patch_4">
+    <path d="M 31.197969 140.51875 
+L 140.511719 140.51875 
+" style="fill: none; stroke: #000000; stroke-width: 0.8; stroke-linejoin: miter; stroke-linecap: square"/>
+   </g>
+   <g id="text_9">
+    <!-- Median regression -->
+    <g transform="translate(49.202344 13.27875) scale(0.08 -0.08)">
+     <defs>
+      <path id="DejaVuSans-4d" d="M 628 4666 
+L 1569 4666 
+L 2759 1491 
+L 3956 4666 
+L 4897 4666 
+L 4897 0 
+L 4281 0 
+L 4281 4097 
+L 3078 897 
+L 2444 897 
+L 1241 4097 
+L 1241 0 
+L 628 0 
+L 628 4666 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-64" d="M 2906 2969 
+L 2906 4863 
+L 3481 4863 
+L 3481 0 
+L 2906 0 
+L 2906 525 
+Q 2725 213 2448 61 
+Q 2172 -91 1784 -91 
+Q 1150 -91 751 415 
+Q 353 922 353 1747 
+Q 353 2572 751 3078 
+Q 1150 3584 1784 3584 
+Q 2172 3584 2448 3432 
+Q 2725 3281 2906 2969 
+z
+M 947 1747 
+Q 947 1113 1208 752 
+Q 1469 391 1925 391 
+Q 2381 391 2643 752 
+Q 2906 1113 2906 1747 
+Q 2906 2381 2643 2742 
+Q 2381 3103 1925 3103 
+Q 1469 3103 1208 2742 
+Q 947 2381 947 1747 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-20" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-67" d="M 2906 1791 
+Q 2906 2416 2648 2759 
+Q 2391 3103 1925 3103 
+Q 1463 3103 1205 2759 
+Q 947 2416 947 1791 
+Q 947 1169 1205 825 
+Q 1463 481 1925 481 
+Q 2391 481 2648 825 
+Q 2906 1169 2906 1791 
+z
+M 3481 434 
+Q 3481 -459 3084 -895 
+Q 2688 -1331 1869 -1331 
+Q 1566 -1331 1297 -1286 
+Q 1028 -1241 775 -1147 
+L 775 -588 
+Q 1028 -725 1275 -790 
+Q 1522 -856 1778 -856 
+Q 2344 -856 2625 -561 
+Q 2906 -266 2906 331 
+L 2906 616 
+Q 2728 306 2450 153 
+Q 2172 0 1784 0 
+Q 1141 0 747 490 
+Q 353 981 353 1791 
+Q 353 2603 747 3093 
+Q 1141 3584 1784 3584 
+Q 2172 3584 2450 3431 
+Q 2728 3278 2906 2969 
+L 2906 3500 
+L 3481 3500 
+L 3481 434 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-73" d="M 2834 3397 
+L 2834 2853 
+Q 2591 2978 2328 3040 
+Q 2066 3103 1784 3103 
+Q 1356 3103 1142 2972 
+Q 928 2841 928 2578 
+Q 928 2378 1081 2264 
+Q 1234 2150 1697 2047 
+L 1894 2003 
+Q 2506 1872 2764 1633 
+Q 3022 1394 3022 966 
+Q 3022 478 2636 193 
+Q 2250 -91 1575 -91 
+Q 1294 -91 989 -36 
+Q 684 19 347 128 
+L 347 722 
+Q 666 556 975 473 
+Q 1284 391 1588 391 
+Q 1994 391 2212 530 
+Q 2431 669 2431 922 
+Q 2431 1156 2273 1281 
+Q 2116 1406 1581 1522 
+L 1381 1569 
+Q 847 1681 609 1914 
+Q 372 2147 372 2553 
+Q 372 3047 722 3315 
+Q 1072 3584 1716 3584 
+Q 2034 3584 2315 3537 
+Q 2597 3491 2834 3397 
+z
+" transform="scale(0.015625)"/>
+     </defs>
+     <use xlink:href="#DejaVuSans-4d"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(86.279297 0)"/>
+     <use xlink:href="#DejaVuSans-64" transform="translate(147.802734 0)"/>
+     <use xlink:href="#DejaVuSans-69" transform="translate(211.279297 0)"/>
+     <use xlink:href="#DejaVuSans-61" transform="translate(239.0625 0)"/>
+     <use xlink:href="#DejaVuSans-6e" transform="translate(300.341797 0)"/>
+     <use xlink:href="#DejaVuSans-20" transform="translate(363.720703 0)"/>
+     <use xlink:href="#DejaVuSans-72" transform="translate(395.507812 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(434.371094 0)"/>
+     <use xlink:href="#DejaVuSans-67" transform="translate(495.894531 0)"/>
+     <use xlink:href="#DejaVuSans-72" transform="translate(559.371094 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(598.234375 0)"/>
+     <use xlink:href="#DejaVuSans-73" transform="translate(659.757812 0)"/>
+     <use xlink:href="#DejaVuSans-73" transform="translate(711.857422 0)"/>
+     <use xlink:href="#DejaVuSans-69" transform="translate(763.957031 0)"/>
+     <use xlink:href="#DejaVuSans-6f" transform="translate(791.740234 0)"/>
+     <use xlink:href="#DejaVuSans-6e" transform="translate(852.921875 0)"/>
+    </g>
+   </g>
+   <g id="legend_1">
+    <g id="patch_5">
+     <path d="M 103.705469 94.009062 
+L 136.311719 94.009062 
+Q 137.511719 94.009062 137.511719 92.809062 
+L 137.511719 66.988438 
+Q 137.511719 65.788438 136.311719 65.788438 
+L 103.705469 65.788438 
+Q 102.505469 65.788438 102.505469 66.988438 
+L 102.505469 92.809062 
+Q 102.505469 94.009062 103.705469 94.009062 
+z
+" style="fill: #ffffff; opacity: 0.8; stroke: #cccccc; stroke-linejoin: miter"/>
+    </g>
+    <g id="line2d_14">
+     <path d="M 104.905469 70.6475 
+L 110.905469 70.6475 
+L 116.905469 70.6475 
+" style="fill: none; stroke: #00bfff; stroke-width: 1.5; stroke-linecap: square"/>
+    </g>
+    <g id="text_10">
+     <!-- w[0] -->
+     <g transform="translate(121.705469 72.7475) scale(0.06 -0.06)">
+      <defs>
+       <path id="DejaVuSans-77" d="M 269 3500 
+L 844 3500 
+L 1563 769 
+L 2278 3500 
+L 2956 3500 
+L 3675 769 
+L 4391 3500 
+L 4966 3500 
+L 4050 0 
+L 3372 0 
+L 2619 2869 
+L 1863 0 
+L 1184 0 
+L 269 3500 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-5b" d="M 550 4863 
+L 1875 4863 
+L 1875 4416 
+L 1125 4416 
+L 1125 -397 
+L 1875 -397 
+L 1875 -844 
+L 550 -844 
+L 550 4863 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-5d" d="M 1947 4863 
+L 1947 -844 
+L 622 -844 
+L 622 -397 
+L 1369 -397 
+L 1369 4416 
+L 622 4416 
+L 622 4863 
+L 1947 4863 
+z
+" transform="scale(0.015625)"/>
+      </defs>
+      <use xlink:href="#DejaVuSans-77"/>
+      <use xlink:href="#DejaVuSans-5b" transform="translate(81.787109 0)"/>
+      <use xlink:href="#DejaVuSans-30" transform="translate(120.800781 0)"/>
+      <use xlink:href="#DejaVuSans-5d" transform="translate(184.423828 0)"/>
+     </g>
+    </g>
+    <g id="line2d_15">
+     <path d="M 104.905469 79.454375 
+L 110.905469 79.454375 
+L 116.905469 79.454375 
+" style="fill: none; stroke: #e7a1e5; stroke-width: 1.5; stroke-linecap: square"/>
+    </g>
+    <g id="text_11">
+     <!-- w[1] -->
+     <g transform="translate(121.705469 81.554375) scale(0.06 -0.06)">
+      <use xlink:href="#DejaVuSans-77"/>
+      <use xlink:href="#DejaVuSans-5b" transform="translate(81.787109 0)"/>
+      <use xlink:href="#DejaVuSans-31" transform="translate(120.800781 0)"/>
+      <use xlink:href="#DejaVuSans-5d" transform="translate(184.423828 0)"/>
+     </g>
+    </g>
+    <g id="line2d_16">
+     <path d="M 104.905469 88.26125 
+L 110.905469 88.26125 
+L 116.905469 88.26125 
+" style="fill: none; stroke: #6dd1ac; stroke-width: 1.5; stroke-linecap: square"/>
+    </g>
+    <g id="text_12">
+     <!-- w[2] -->
+     <g transform="translate(121.705469 90.36125) scale(0.06 -0.06)">
+      <use xlink:href="#DejaVuSans-77"/>
+      <use xlink:href="#DejaVuSans-5b" transform="translate(81.787109 0)"/>
+      <use xlink:href="#DejaVuSans-32" transform="translate(120.800781 0)"/>
+      <use xlink:href="#DejaVuSans-5d" transform="translate(184.423828 0)"/>
+     </g>
+    </g>
+   </g>
+  </g>
+  <g id="axes_2">
+   <g id="patch_6">
+    <path d="M 171.586719 140.51875 
+L 280.900469 140.51875 
+L 280.900469 19.27875 
+L 171.586719 19.27875 
+L 171.586719 140.51875 
+z
+" style="fill: none"/>
+   </g>
+   <g id="matplotlib.axis_3">
+    <g id="xtick_6">
+     <g id="line2d_17">
+      <g>
+       <use xlink:href="#m1349860ead" x="171.586719" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_13">
+      <!-- 0 -->
+      <g transform="translate(169.359844 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_7">
+     <g id="line2d_18">
+      <g>
+       <use xlink:href="#m1349860ead" x="198.915156" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_14">
+      <!-- 1 -->
+      <g transform="translate(196.688281 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-31"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_8">
+     <g id="line2d_19">
+      <g>
+       <use xlink:href="#m1349860ead" x="226.243594" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_15">
+      <!-- 2 -->
+      <g transform="translate(224.016719 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-32"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_9">
+     <g id="line2d_20">
+      <g>
+       <use xlink:href="#m1349860ead" x="253.572031" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_16">
+      <!-- 3 -->
+      <g transform="translate(251.345156 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-33"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_10">
+     <g id="line2d_21">
+      <g>
+       <use xlink:href="#m1349860ead" x="280.900469" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_17">
+      <!-- 4 -->
+      <g transform="translate(278.673594 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-34"/>
+      </g>
+     </g>
+    </g>
+    <g id="text_18">
+     <!-- Iteration -->
+     <g transform="translate(211.397578 163.612344) scale(0.07 -0.07)">
+      <use xlink:href="#DejaVuSans-49"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(29.492188 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(68.701172 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(130.224609 0)"/>
+      <use xlink:href="#DejaVuSans-61" transform="translate(171.337891 0)"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(232.617188 0)"/>
+      <use xlink:href="#DejaVuSans-69" transform="translate(271.826172 0)"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(299.609375 0)"/>
+      <use xlink:href="#DejaVuSans-6e" transform="translate(360.791016 0)"/>
+     </g>
+    </g>
+   </g>
+   <g id="matplotlib.axis_4">
+    <g id="ytick_3">
+     <g id="line2d_22">
+      <g>
+       <use xlink:href="#m4787e62a98" x="171.586719" y="120.599114" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_19">
+      <!-- 0 -->
+      <g transform="translate(160.132969 123.258567) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+      </g>
+     </g>
+    </g>
+    <g id="ytick_4">
+     <g id="line2d_23">
+      <g>
+       <use xlink:href="#m4787e62a98" x="171.586719" y="71.742279" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_20">
+      <!-- 8 -->
+      <g transform="translate(160.132969 74.401732) scale(0.07 -0.07)">
+       <defs>
+        <path id="DejaVuSans-38" d="M 2034 2216 
+Q 1584 2216 1326 1975 
+Q 1069 1734 1069 1313 
+Q 1069 891 1326 650 
+Q 1584 409 2034 409 
+Q 2484 409 2743 651 
+Q 3003 894 3003 1313 
+Q 3003 1734 2745 1975 
+Q 2488 2216 2034 2216 
+z
+M 1403 2484 
+Q 997 2584 770 2862 
+Q 544 3141 544 3541 
+Q 544 4100 942 4425 
+Q 1341 4750 2034 4750 
+Q 2731 4750 3128 4425 
+Q 3525 4100 3525 3541 
+Q 3525 3141 3298 2862 
+Q 3072 2584 2669 2484 
+Q 3125 2378 3379 2068 
+Q 3634 1759 3634 1313 
+Q 3634 634 3220 271 
+Q 2806 -91 2034 -91 
+Q 1263 -91 848 271 
+Q 434 634 434 1313 
+Q 434 1759 690 2068 
+Q 947 2378 1403 2484 
+z
+M 1172 3481 
+Q 1172 3119 1398 2916 
+Q 1625 2713 2034 2713 
+Q 2441 2713 2670 2916 
+Q 2900 3119 2900 3481 
+Q 2900 3844 2670 4047 
+Q 2441 4250 2034 4250 
+Q 1625 4250 1398 4047 
+Q 1172 3844 1172 3481 
+z
+" transform="scale(0.015625)"/>
+       </defs>
+       <use xlink:href="#DejaVuSans-38"/>
+      </g>
+     </g>
+    </g>
+    <g id="ytick_5">
+     <g id="line2d_24">
+      <g>
+       <use xlink:href="#m4787e62a98" x="171.586719" y="22.885443" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_21">
+      <!-- 16 -->
+      <g transform="translate(155.679219 25.544897) scale(0.07 -0.07)">
+       <defs>
+        <path id="DejaVuSans-36" d="M 2113 2584 
+Q 1688 2584 1439 2293 
+Q 1191 2003 1191 1497 
+Q 1191 994 1439 701 
+Q 1688 409 2113 409 
+Q 2538 409 2786 701 
+Q 3034 994 3034 1497 
+Q 3034 2003 2786 2293 
+Q 2538 2584 2113 2584 
+z
+M 3366 4563 
+L 3366 3988 
+Q 3128 4100 2886 4159 
+Q 2644 4219 2406 4219 
+Q 1781 4219 1451 3797 
+Q 1122 3375 1075 2522 
+Q 1259 2794 1537 2939 
+Q 1816 3084 2150 3084 
+Q 2853 3084 3261 2657 
+Q 3669 2231 3669 1497 
+Q 3669 778 3244 343 
+Q 2819 -91 2113 -91 
+Q 1303 -91 875 529 
+Q 447 1150 447 2328 
+Q 447 3434 972 4092 
+Q 1497 4750 2381 4750 
+Q 2619 4750 2861 4703 
+Q 3103 4656 3366 4563 
+z
+" transform="scale(0.015625)"/>
+       </defs>
+       <use xlink:href="#DejaVuSans-31"/>
+       <use xlink:href="#DejaVuSans-36" transform="translate(63.623047 0)"/>
+      </g>
+     </g>
+    </g>
+   </g>
+   <g id="line2d_25">
+    <path d="M 171.586719 120.599114 
+L 198.915156 135.007841 
+L 226.243594 135.007685 
+L 253.572031 135.007528 
+L 280.900469 135.007372 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #889fd9; stroke-opacity: 0.2; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_26">
+    <path d="M 171.586719 120.599114 
+L 198.915156 120.983849 
+L 226.243594 120.983986 
+L 253.572031 120.984122 
+L 280.900469 120.984259 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #6dd1ac; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_27">
+    <path d="M 171.586719 120.599114 
+L 198.915156 135.007841 
+L 226.243594 135.007685 
+L 253.572031 135.007528 
+L 280.900469 135.007372 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #889fd9; stroke-opacity: 0.2; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_28">
+    <path d="M 171.586719 120.599114 
+L 198.915156 115.162049 
+L 226.243594 115.163238 
+L 253.572031 115.164427 
+L 280.900469 115.165617 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #6dd1ac; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_29">
+    <path d="M 171.586719 120.599114 
+L 198.915156 135.007841 
+L 226.243594 135.007685 
+L 253.572031 135.007528 
+L 280.900469 135.007372 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #889fd9; stroke-opacity: 0.2; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_30">
+    <path d="M 171.586719 120.599114 
+L 198.915156 135.007841 
+L 226.243594 135.007685 
+L 253.572031 135.007528 
+L 280.900469 135.007372 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #889fd9; stroke-opacity: 0.2; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_31">
+    <path d="M 171.586719 120.599114 
+L 198.915156 135.007841 
+L 226.243594 135.007685 
+L 253.572031 135.007528 
+L 280.900469 135.007372 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #889fd9; stroke-opacity: 0.2; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_32">
+    <path d="M 171.586719 120.599114 
+L 198.915156 24.790354 
+L 226.243594 24.790122 
+L 253.572031 24.789891 
+L 280.900469 24.789659 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #6dd1ac; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_33">
+    <path d="M 171.586719 120.599114 
+L 198.915156 135.007841 
+L 226.243594 135.007685 
+L 253.572031 135.007528 
+L 280.900469 135.007372 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #889fd9; stroke-opacity: 0.2; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_34">
+    <path d="M 171.586719 120.599114 
+L 198.915156 135.007841 
+L 226.243594 135.007685 
+L 253.572031 135.007528 
+L 280.900469 135.007372 
+" clip-path="url(#pe6bc2e38a2)" style="fill: none; stroke: #889fd9; stroke-opacity: 0.2; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="patch_7">
+    <path d="M 171.586719 140.51875 
+L 171.586719 19.27875 
+" style="fill: none; stroke: #000000; stroke-width: 0.8; stroke-linejoin: miter; stroke-linecap: square"/>
+   </g>
+   <g id="patch_8">
+    <path d="M 171.586719 140.51875 
+L 280.900469 140.51875 
+" style="fill: none; stroke: #000000; stroke-width: 0.8; stroke-linejoin: miter; stroke-linecap: square"/>
+   </g>
+   <g id="text_22">
+    <!-- Top-k feature selection -->
+    <g transform="translate(180.773594 13.27875) scale(0.08 -0.08)">
+     <defs>
+      <path id="DejaVuSans-54" d="M -19 4666 
+L 3928 4666 
+L 3928 4134 
+L 2272 4134 
+L 2272 0 
+L 1638 0 
+L 1638 4134 
+L -19 4134 
+L -19 4666 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-70" d="M 1159 525 
+L 1159 -1331 
+L 581 -1331 
+L 581 3500 
+L 1159 3500 
+L 1159 2969 
+Q 1341 3281 1617 3432 
+Q 1894 3584 2278 3584 
+Q 2916 3584 3314 3078 
+Q 3713 2572 3713 1747 
+Q 3713 922 3314 415 
+Q 2916 -91 2278 -91 
+Q 1894 -91 1617 61 
+Q 1341 213 1159 525 
+z
+M 3116 1747 
+Q 3116 2381 2855 2742 
+Q 2594 3103 2138 3103 
+Q 1681 3103 1420 2742 
+Q 1159 2381 1159 1747 
+Q 1159 1113 1420 752 
+Q 1681 391 2138 391 
+Q 2594 391 2855 752 
+Q 3116 1113 3116 1747 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-2d" d="M 313 2009 
+L 1997 2009 
+L 1997 1497 
+L 313 1497 
+L 313 2009 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-6b" d="M 581 4863 
+L 1159 4863 
+L 1159 1991 
+L 2875 3500 
+L 3609 3500 
+L 1753 1863 
+L 3688 0 
+L 2938 0 
+L 1159 1709 
+L 1159 0 
+L 581 0 
+L 581 4863 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-66" d="M 2375 4863 
+L 2375 4384 
+L 1825 4384 
+Q 1516 4384 1395 4259 
+Q 1275 4134 1275 3809 
+L 1275 3500 
+L 2222 3500 
+L 2222 3053 
+L 1275 3053 
+L 1275 0 
+L 697 0 
+L 697 3053 
+L 147 3053 
+L 147 3500 
+L 697 3500 
+L 697 3744 
+Q 697 4328 969 4595 
+Q 1241 4863 1831 4863 
+L 2375 4863 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-75" d="M 544 1381 
+L 544 3500 
+L 1119 3500 
+L 1119 1403 
+Q 1119 906 1312 657 
+Q 1506 409 1894 409 
+Q 2359 409 2629 706 
+Q 2900 1003 2900 1516 
+L 2900 3500 
+L 3475 3500 
+L 3475 0 
+L 2900 0 
+L 2900 538 
+Q 2691 219 2414 64 
+Q 2138 -91 1772 -91 
+Q 1169 -91 856 284 
+Q 544 659 544 1381 
+z
+M 1991 3584 
+L 1991 3584 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-6c" d="M 603 4863 
+L 1178 4863 
+L 1178 0 
+L 603 0 
+L 603 4863 
+z
+" transform="scale(0.015625)"/>
+      <path id="DejaVuSans-63" d="M 3122 3366 
+L 3122 2828 
+Q 2878 2963 2633 3030 
+Q 2388 3097 2138 3097 
+Q 1578 3097 1268 2742 
+Q 959 2388 959 1747 
+Q 959 1106 1268 751 
+Q 1578 397 2138 397 
+Q 2388 397 2633 464 
+Q 2878 531 3122 666 
+L 3122 134 
+Q 2881 22 2623 -34 
+Q 2366 -91 2075 -91 
+Q 1284 -91 818 406 
+Q 353 903 353 1747 
+Q 353 2603 823 3093 
+Q 1294 3584 2113 3584 
+Q 2378 3584 2631 3529 
+Q 2884 3475 3122 3366 
+z
+" transform="scale(0.015625)"/>
+     </defs>
+     <use xlink:href="#DejaVuSans-54"/>
+     <use xlink:href="#DejaVuSans-6f" transform="translate(44.083984 0)"/>
+     <use xlink:href="#DejaVuSans-70" transform="translate(105.265625 0)"/>
+     <use xlink:href="#DejaVuSans-2d" transform="translate(168.742188 0)"/>
+     <use xlink:href="#DejaVuSans-6b" transform="translate(204.826172 0)"/>
+     <use xlink:href="#DejaVuSans-20" transform="translate(262.736328 0)"/>
+     <use xlink:href="#DejaVuSans-66" transform="translate(294.523438 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(329.728516 0)"/>
+     <use xlink:href="#DejaVuSans-61" transform="translate(391.251953 0)"/>
+     <use xlink:href="#DejaVuSans-74" transform="translate(452.53125 0)"/>
+     <use xlink:href="#DejaVuSans-75" transform="translate(491.740234 0)"/>
+     <use xlink:href="#DejaVuSans-72" transform="translate(555.119141 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(593.982422 0)"/>
+     <use xlink:href="#DejaVuSans-20" transform="translate(655.505859 0)"/>
+     <use xlink:href="#DejaVuSans-73" transform="translate(687.292969 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(739.392578 0)"/>
+     <use xlink:href="#DejaVuSans-6c" transform="translate(800.916016 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(828.699219 0)"/>
+     <use xlink:href="#DejaVuSans-63" transform="translate(890.222656 0)"/>
+     <use xlink:href="#DejaVuSans-74" transform="translate(945.203125 0)"/>
+     <use xlink:href="#DejaVuSans-69" transform="translate(984.412109 0)"/>
+     <use xlink:href="#DejaVuSans-6f" transform="translate(1012.195312 0)"/>
+     <use xlink:href="#DejaVuSans-6e" transform="translate(1073.376953 0)"/>
+    </g>
+   </g>
+   <g id="legend_2">
+    <g id="patch_9">
+     <path d="M 214.908906 94.009062 
+L 276.700469 94.009062 
+Q 277.900469 94.009062 277.900469 92.809062 
+L 277.900469 66.988438 
+Q 277.900469 65.788438 276.700469 65.788438 
+L 214.908906 65.788438 
+Q 213.708906 65.788438 213.708906 66.988438 
+L 213.708906 92.809062 
+Q 213.708906 94.009062 214.908906 94.009062 
+z
+" style="fill: #ffffff; opacity: 0.8; stroke: #cccccc; stroke-linejoin: miter"/>
+    </g>
+    <g id="text_23">
+     <!-- Feature scores -->
+     <g transform="translate(223.894844 72.7475) scale(0.06 -0.06)">
+      <defs>
+       <path id="DejaVuSans-46" d="M 628 4666 
+L 3309 4666 
+L 3309 4134 
+L 1259 4134 
+L 1259 2759 
+L 3109 2759 
+L 3109 2228 
+L 1259 2228 
+L 1259 0 
+L 628 0 
+L 628 4666 
+z
+" transform="scale(0.015625)"/>
+      </defs>
+      <use xlink:href="#DejaVuSans-46"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(52.019531 0)"/>
+      <use xlink:href="#DejaVuSans-61" transform="translate(113.542969 0)"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(174.822266 0)"/>
+      <use xlink:href="#DejaVuSans-75" transform="translate(214.03125 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(277.410156 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(316.273438 0)"/>
+      <use xlink:href="#DejaVuSans-20" transform="translate(377.796875 0)"/>
+      <use xlink:href="#DejaVuSans-73" transform="translate(409.583984 0)"/>
+      <use xlink:href="#DejaVuSans-63" transform="translate(461.683594 0)"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(516.664062 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(577.845703 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(616.708984 0)"/>
+      <use xlink:href="#DejaVuSans-73" transform="translate(678.232422 0)"/>
+     </g>
+    </g>
+    <g id="line2d_35">
+     <path d="M 216.108906 79.454375 
+L 222.108906 79.454375 
+L 228.108906 79.454375 
+" style="fill: none; stroke: #6dd1ac; stroke-width: 1.5; stroke-linecap: square"/>
+    </g>
+    <g id="text_24">
+     <!-- Informative -->
+     <g transform="translate(232.908906 81.554375) scale(0.06 -0.06)">
+      <defs>
+       <path id="DejaVuSans-6d" d="M 3328 2828 
+Q 3544 3216 3844 3400 
+Q 4144 3584 4550 3584 
+Q 5097 3584 5394 3201 
+Q 5691 2819 5691 2113 
+L 5691 0 
+L 5113 0 
+L 5113 2094 
+Q 5113 2597 4934 2840 
+Q 4756 3084 4391 3084 
+Q 3944 3084 3684 2787 
+Q 3425 2491 3425 1978 
+L 3425 0 
+L 2847 0 
+L 2847 2094 
+Q 2847 2600 2669 2842 
+Q 2491 3084 2119 3084 
+Q 1678 3084 1418 2786 
+Q 1159 2488 1159 1978 
+L 1159 0 
+L 581 0 
+L 581 3500 
+L 1159 3500 
+L 1159 2956 
+Q 1356 3278 1631 3431 
+Q 1906 3584 2284 3584 
+Q 2666 3584 2933 3390 
+Q 3200 3197 3328 2828 
+z
+" transform="scale(0.015625)"/>
+       <path id="DejaVuSans-76" d="M 191 3500 
+L 800 3500 
+L 1894 563 
+L 2988 3500 
+L 3597 3500 
+L 2284 0 
+L 1503 0 
+L 191 3500 
+z
+" transform="scale(0.015625)"/>
+      </defs>
+      <use xlink:href="#DejaVuSans-49"/>
+      <use xlink:href="#DejaVuSans-6e" transform="translate(29.492188 0)"/>
+      <use xlink:href="#DejaVuSans-66" transform="translate(92.871094 0)"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(128.076172 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(189.257812 0)"/>
+      <use xlink:href="#DejaVuSans-6d" transform="translate(228.621094 0)"/>
+      <use xlink:href="#DejaVuSans-61" transform="translate(326.033203 0)"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(387.3125 0)"/>
+      <use xlink:href="#DejaVuSans-69" transform="translate(426.521484 0)"/>
+      <use xlink:href="#DejaVuSans-76" transform="translate(454.304688 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(513.484375 0)"/>
+     </g>
+    </g>
+    <g id="line2d_36">
+     <path d="M 216.108906 88.26125 
+L 222.108906 88.26125 
+L 228.108906 88.26125 
+" style="fill: none; stroke: #889fd9; stroke-opacity: 0.2; stroke-width: 1.5; stroke-linecap: square"/>
+    </g>
+    <g id="text_25">
+     <!-- Uninformative -->
+     <g transform="translate(232.908906 90.36125) scale(0.06 -0.06)">
+      <defs>
+       <path id="DejaVuSans-55" d="M 556 4666 
+L 1191 4666 
+L 1191 1831 
+Q 1191 1081 1462 751 
+Q 1734 422 2344 422 
+Q 2950 422 3222 751 
+Q 3494 1081 3494 1831 
+L 3494 4666 
+L 4128 4666 
+L 4128 1753 
+Q 4128 841 3676 375 
+Q 3225 -91 2344 -91 
+Q 1459 -91 1007 375 
+Q 556 841 556 1753 
+L 556 4666 
+z
+" transform="scale(0.015625)"/>
+      </defs>
+      <use xlink:href="#DejaVuSans-55"/>
+      <use xlink:href="#DejaVuSans-6e" transform="translate(73.193359 0)"/>
+      <use xlink:href="#DejaVuSans-69" transform="translate(136.572266 0)"/>
+      <use xlink:href="#DejaVuSans-6e" transform="translate(164.355469 0)"/>
+      <use xlink:href="#DejaVuSans-66" transform="translate(227.734375 0)"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(262.939453 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(324.121094 0)"/>
+      <use xlink:href="#DejaVuSans-6d" transform="translate(363.484375 0)"/>
+      <use xlink:href="#DejaVuSans-61" transform="translate(460.896484 0)"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(522.175781 0)"/>
+      <use xlink:href="#DejaVuSans-69" transform="translate(561.384766 0)"/>
+      <use xlink:href="#DejaVuSans-76" transform="translate(589.167969 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(648.347656 0)"/>
+     </g>
+    </g>
+   </g>
+  </g>
+  <g id="axes_3">
+   <g id="patch_10">
+    <path d="M 311.975469 140.51875 
+L 421.289219 140.51875 
+L 421.289219 19.27875 
+L 311.975469 19.27875 
+L 311.975469 140.51875 
+z
+" style="fill: none"/>
+   </g>
+   <g id="matplotlib.axis_5">
+    <g id="xtick_11">
+     <g id="line2d_37">
+      <g>
+       <use xlink:href="#m1349860ead" x="311.975469" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_26">
+      <!-- 0 -->
+      <g transform="translate(309.748594 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_12">
+     <g id="line2d_38">
+      <g>
+       <use xlink:href="#m1349860ead" x="340.742245" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_27">
+      <!-- 5 -->
+      <g transform="translate(338.51537 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-35"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_13">
+     <g id="line2d_39">
+      <g>
+       <use xlink:href="#m1349860ead" x="369.509021" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_28">
+      <!-- 10 -->
+      <g transform="translate(365.055271 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-31"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(63.623047 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_14">
+     <g id="line2d_40">
+      <g>
+       <use xlink:href="#m1349860ead" x="398.275798" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_29">
+      <!-- 15 -->
+      <g transform="translate(393.822048 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-31"/>
+       <use xlink:href="#DejaVuSans-35" transform="translate(63.623047 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="text_30">
+     <!-- Iteration -->
+     <g transform="translate(351.786328 163.612344) scale(0.07 -0.07)">
+      <use xlink:href="#DejaVuSans-49"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(29.492188 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(68.701172 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(130.224609 0)"/>
+      <use xlink:href="#DejaVuSans-61" transform="translate(171.337891 0)"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(232.617188 0)"/>
+      <use xlink:href="#DejaVuSans-69" transform="translate(271.826172 0)"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(299.609375 0)"/>
+      <use xlink:href="#DejaVuSans-6e" transform="translate(360.791016 0)"/>
+     </g>
+    </g>
+   </g>
+   <g id="matplotlib.axis_6">
+    <g id="ytick_6">
+     <g id="line2d_41">
+      <g>
+       <use xlink:href="#m4787e62a98" x="311.975469" y="135.007841" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_31">
+      <!-- 0.0 -->
+      <g transform="translate(293.843281 137.667294) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+       <use xlink:href="#DejaVuSans-2e" transform="translate(63.623047 0)"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(95.410156 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="ytick_7">
+     <g id="line2d_42">
+      <g>
+       <use xlink:href="#m4787e62a98" x="311.975469" y="89.083598" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_32">
+      <!-- 0.5 -->
+      <g transform="translate(293.843281 91.743052) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+       <use xlink:href="#DejaVuSans-2e" transform="translate(63.623047 0)"/>
+       <use xlink:href="#DejaVuSans-35" transform="translate(95.410156 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="ytick_8">
+     <g id="line2d_43">
+      <g>
+       <use xlink:href="#m4787e62a98" x="311.975469" y="43.159356" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_33">
+      <!-- 1.0 -->
+      <g transform="translate(293.843281 45.818809) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-31"/>
+       <use xlink:href="#DejaVuSans-2e" transform="translate(63.623047 0)"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(95.410156 0)"/>
+      </g>
+     </g>
+    </g>
+   </g>
+   <g id="line2d_44">
+    <path d="M 311.975469 135.007841 
+L 317.728824 128.945511 
+L 323.482179 120.590254 
+L 329.235535 110.316779 
+L 334.98889 100.253035 
+L 340.742245 90.496649 
+L 346.4956 82.778326 
+L 352.248956 79.417819 
+L 358.002311 78.35235 
+L 363.755666 78.060461 
+L 369.509021 77.985487 
+L 375.262377 77.966609 
+L 381.015732 77.96188 
+L 386.769087 77.960697 
+L 392.522442 77.960402 
+L 398.275798 77.960328 
+L 404.029153 77.960309 
+L 409.782508 77.960305 
+L 415.535863 77.960303 
+L 421.289219 77.960303 
+" clip-path="url(#p0c56ad0c37)" style="fill: none; stroke: #00bfff; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_45">
+    <path d="M 311.975469 116.638144 
+L 421.289219 116.638144 
+" clip-path="url(#p0c56ad0c37)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #cecece; stroke-opacity: 0.5; stroke-width: 1.5"/>
+   </g>
+   <g id="line2d_46">
+    <path d="M 311.975469 61.529053 
+L 421.289219 61.529053 
+" clip-path="url(#p0c56ad0c37)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #cecece; stroke-opacity: 0.5; stroke-width: 1.5"/>
+   </g>
+   <g id="line2d_47">
+    <path d="M 311.975469 89.083598 
+L 421.289219 89.083598 
+" clip-path="url(#p0c56ad0c37)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #cecece; stroke-opacity: 0.5; stroke-width: 1.5"/>
+   </g>
+   <g id="line2d_48">
+    <path d="M 311.975469 24.789659 
+L 421.289219 24.789659 
+" clip-path="url(#p0c56ad0c37)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #cecece; stroke-opacity: 0.5; stroke-width: 1.5"/>
+   </g>
+   <g id="line2d_49">
+    <path d="M 311.975469 125.822992 
+L 421.289219 125.822992 
+" clip-path="url(#p0c56ad0c37)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #cecece; stroke-opacity: 0.5; stroke-width: 1.5"/>
+   </g>
+   <g id="patch_11">
+    <path d="M 311.975469 140.51875 
+L 311.975469 19.27875 
+" style="fill: none; stroke: #000000; stroke-width: 0.8; stroke-linejoin: miter; stroke-linecap: square"/>
+   </g>
+   <g id="patch_12">
+    <path d="M 311.975469 140.51875 
+L 421.289219 140.51875 
+" style="fill: none; stroke: #000000; stroke-width: 0.8; stroke-linejoin: miter; stroke-linecap: square"/>
+   </g>
+   <g id="text_34">
+    <!-- Threshold filtering -->
+    <g transform="translate(330.160469 13.27875) scale(0.08 -0.08)">
+     <defs>
+      <path id="DejaVuSans-68" d="M 3513 2113 
+L 3513 0 
+L 2938 0 
+L 2938 2094 
+Q 2938 2591 2744 2837 
+Q 2550 3084 2163 3084 
+Q 1697 3084 1428 2787 
+Q 1159 2491 1159 1978 
+L 1159 0 
+L 581 0 
+L 581 4863 
+L 1159 4863 
+L 1159 2956 
+Q 1366 3272 1645 3428 
+Q 1925 3584 2291 3584 
+Q 2894 3584 3203 3211 
+Q 3513 2838 3513 2113 
+z
+" transform="scale(0.015625)"/>
+     </defs>
+     <use xlink:href="#DejaVuSans-54"/>
+     <use xlink:href="#DejaVuSans-68" transform="translate(61.083984 0)"/>
+     <use xlink:href="#DejaVuSans-72" transform="translate(124.462891 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(163.326172 0)"/>
+     <use xlink:href="#DejaVuSans-73" transform="translate(224.849609 0)"/>
+     <use xlink:href="#DejaVuSans-68" transform="translate(276.949219 0)"/>
+     <use xlink:href="#DejaVuSans-6f" transform="translate(340.328125 0)"/>
+     <use xlink:href="#DejaVuSans-6c" transform="translate(401.509766 0)"/>
+     <use xlink:href="#DejaVuSans-64" transform="translate(429.292969 0)"/>
+     <use xlink:href="#DejaVuSans-20" transform="translate(492.769531 0)"/>
+     <use xlink:href="#DejaVuSans-66" transform="translate(524.556641 0)"/>
+     <use xlink:href="#DejaVuSans-69" transform="translate(559.761719 0)"/>
+     <use xlink:href="#DejaVuSans-6c" transform="translate(587.544922 0)"/>
+     <use xlink:href="#DejaVuSans-74" transform="translate(615.328125 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(654.537109 0)"/>
+     <use xlink:href="#DejaVuSans-72" transform="translate(716.060547 0)"/>
+     <use xlink:href="#DejaVuSans-69" transform="translate(757.173828 0)"/>
+     <use xlink:href="#DejaVuSans-6e" transform="translate(784.957031 0)"/>
+     <use xlink:href="#DejaVuSans-67" transform="translate(848.335938 0)"/>
+    </g>
+   </g>
+  </g>
+  <g id="axes_4">
+   <g id="patch_13">
+    <path d="M 452.364219 140.51875 
+L 561.677969 140.51875 
+L 561.677969 19.27875 
+L 452.364219 19.27875 
+L 452.364219 140.51875 
+z
+" style="fill: none"/>
+   </g>
+   <g id="matplotlib.axis_7">
+    <g id="xtick_15">
+     <g id="line2d_50">
+      <g>
+       <use xlink:href="#m1349860ead" x="452.364219" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_35">
+      <!-- 0 -->
+      <g transform="translate(450.137344 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_16">
+     <g id="line2d_51">
+      <g>
+       <use xlink:href="#m1349860ead" x="481.130995" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_36">
+      <!-- 5 -->
+      <g transform="translate(478.90412 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-35"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_17">
+     <g id="line2d_52">
+      <g>
+       <use xlink:href="#m1349860ead" x="509.897771" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_37">
+      <!-- 10 -->
+      <g transform="translate(505.444021 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-31"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(63.623047 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="xtick_18">
+     <g id="line2d_53">
+      <g>
+       <use xlink:href="#m1349860ead" x="538.664548" y="140.51875" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_38">
+      <!-- 15 -->
+      <g transform="translate(534.210798 152.837656) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-31"/>
+       <use xlink:href="#DejaVuSans-35" transform="translate(63.623047 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="text_39">
+     <!-- Iteration -->
+     <g transform="translate(492.175078 163.612344) scale(0.07 -0.07)">
+      <use xlink:href="#DejaVuSans-49"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(29.492188 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(68.701172 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(130.224609 0)"/>
+      <use xlink:href="#DejaVuSans-61" transform="translate(171.337891 0)"/>
+      <use xlink:href="#DejaVuSans-74" transform="translate(232.617188 0)"/>
+      <use xlink:href="#DejaVuSans-69" transform="translate(271.826172 0)"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(299.609375 0)"/>
+      <use xlink:href="#DejaVuSans-6e" transform="translate(360.791016 0)"/>
+     </g>
+    </g>
+   </g>
+   <g id="matplotlib.axis_8">
+    <g id="ytick_9">
+     <g id="line2d_54">
+      <g>
+       <use xlink:href="#m4787e62a98" x="452.364219" y="135.007841" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_40">
+      <!-- 0.0 -->
+      <g transform="translate(434.232031 137.667294) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+       <use xlink:href="#DejaVuSans-2e" transform="translate(63.623047 0)"/>
+       <use xlink:href="#DejaVuSans-30" transform="translate(95.410156 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="ytick_10">
+     <g id="line2d_55">
+      <g>
+       <use xlink:href="#m4787e62a98" x="452.364219" y="90.920568" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_41">
+      <!-- 0.4 -->
+      <g transform="translate(434.232031 93.580021) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+       <use xlink:href="#DejaVuSans-2e" transform="translate(63.623047 0)"/>
+       <use xlink:href="#DejaVuSans-34" transform="translate(95.410156 0)"/>
+      </g>
+     </g>
+    </g>
+    <g id="ytick_11">
+     <g id="line2d_56">
+      <g>
+       <use xlink:href="#m4787e62a98" x="452.364219" y="46.833295" style="stroke: #000000; stroke-width: 0.8"/>
+      </g>
+     </g>
+     <g id="text_42">
+      <!-- 0.8 -->
+      <g transform="translate(434.232031 49.492749) scale(0.07 -0.07)">
+       <use xlink:href="#DejaVuSans-30"/>
+       <use xlink:href="#DejaVuSans-2e" transform="translate(63.623047 0)"/>
+       <use xlink:href="#DejaVuSans-38" transform="translate(95.410156 0)"/>
+      </g>
+     </g>
+    </g>
+   </g>
+   <g id="line2d_57">
+    <path d="M 452.364219 24.789659 
+L 458.117574 26.349503 
+L 463.870929 28.603077 
+L 469.624285 32.004575 
+L 475.37764 37.062313 
+L 481.130995 43.982472 
+L 486.88435 52.316668 
+L 492.637706 60.316934 
+L 498.391061 65.302434 
+L 504.144416 67.427696 
+L 509.897771 68.316123 
+L 515.651127 68.658138 
+L 521.404482 68.79833 
+L 527.157837 68.85228 
+L 532.911192 68.87437 
+L 538.664548 68.882885 
+L 544.417903 68.886367 
+L 550.171258 68.887711 
+L 555.924613 68.88826 
+L 561.677969 68.888472 
+" clip-path="url(#p58b81c82c6)" style="fill: none; stroke: #00bfff; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_58">
+    <path d="M 452.364219 135.007841 
+L 458.117574 130.293035 
+L 463.870929 124.314403 
+L 469.624285 117.577738 
+L 475.37764 111.250245 
+L 481.130995 105.951673 
+L 486.88435 102.087552 
+L 492.637706 100.491157 
+L 498.391061 101.195327 
+L 504.144416 102.132803 
+L 509.897771 102.487553 
+L 515.651127 102.661256 
+L 521.404482 102.721956 
+L 527.157837 102.749855 
+L 532.911192 102.75957 
+L 538.664548 102.763972 
+L 544.417903 102.765512 
+L 550.171258 102.766205 
+L 555.924613 102.766449 
+L 561.677969 102.766558 
+" clip-path="url(#p58b81c82c6)" style="fill: none; stroke: #6dd1ac; stroke-width: 1.5; stroke-linecap: square"/>
+   </g>
+   <g id="line2d_59">
+    <path d="M 452.364219 101.942386 
+L 561.677969 101.942386 
+" clip-path="url(#p58b81c82c6)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #6dd1ac; stroke-opacity: 0.5; stroke-width: 1.5"/>
+   </g>
+   <g id="line2d_60">
+    <path d="M 452.364219 68.876932 
+L 561.677969 68.876932 
+" clip-path="url(#p58b81c82c6)" style="fill: none; stroke-dasharray: 5.55,2.4; stroke-dashoffset: 0; stroke: #00bfff; stroke-opacity: 0.5; stroke-width: 1.5"/>
+   </g>
+   <g id="patch_14">
+    <path d="M 452.364219 140.51875 
+L 452.364219 19.27875 
+" style="fill: none; stroke: #000000; stroke-width: 0.8; stroke-linejoin: miter; stroke-linecap: square"/>
+   </g>
+   <g id="patch_15">
+    <path d="M 452.364219 140.51875 
+L 561.677969 140.51875 
+" style="fill: none; stroke: #000000; stroke-width: 0.8; stroke-linejoin: miter; stroke-linecap: square"/>
+   </g>
+   <g id="text_43">
+    <!-- Rule classifier -->
+    <g transform="translate(479.377344 13.27875) scale(0.08 -0.08)">
+     <defs>
+      <path id="DejaVuSans-52" d="M 2841 2188 
+Q 3044 2119 3236 1894 
+Q 3428 1669 3622 1275 
+L 4263 0 
+L 3584 0 
+L 2988 1197 
+Q 2756 1666 2539 1819 
+Q 2322 1972 1947 1972 
+L 1259 1972 
+L 1259 0 
+L 628 0 
+L 628 4666 
+L 2053 4666 
+Q 2853 4666 3247 4331 
+Q 3641 3997 3641 3322 
+Q 3641 2881 3436 2590 
+Q 3231 2300 2841 2188 
+z
+M 1259 4147 
+L 1259 2491 
+L 2053 2491 
+Q 2509 2491 2742 2702 
+Q 2975 2913 2975 3322 
+Q 2975 3731 2742 3939 
+Q 2509 4147 2053 4147 
+L 1259 4147 
+z
+" transform="scale(0.015625)"/>
+     </defs>
+     <use xlink:href="#DejaVuSans-52"/>
+     <use xlink:href="#DejaVuSans-75" transform="translate(64.982422 0)"/>
+     <use xlink:href="#DejaVuSans-6c" transform="translate(128.361328 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(156.144531 0)"/>
+     <use xlink:href="#DejaVuSans-20" transform="translate(217.667969 0)"/>
+     <use xlink:href="#DejaVuSans-63" transform="translate(249.455078 0)"/>
+     <use xlink:href="#DejaVuSans-6c" transform="translate(304.435547 0)"/>
+     <use xlink:href="#DejaVuSans-61" transform="translate(332.21875 0)"/>
+     <use xlink:href="#DejaVuSans-73" transform="translate(393.498047 0)"/>
+     <use xlink:href="#DejaVuSans-73" transform="translate(445.597656 0)"/>
+     <use xlink:href="#DejaVuSans-69" transform="translate(497.697266 0)"/>
+     <use xlink:href="#DejaVuSans-66" transform="translate(525.480469 0)"/>
+     <use xlink:href="#DejaVuSans-69" transform="translate(560.685547 0)"/>
+     <use xlink:href="#DejaVuSans-65" transform="translate(588.46875 0)"/>
+     <use xlink:href="#DejaVuSans-72" transform="translate(649.992188 0)"/>
+    </g>
+   </g>
+   <g id="legend_3">
+    <g id="patch_16">
+     <path d="M 498.237344 41.6925 
+L 557.477969 41.6925 
+Q 558.677969 41.6925 558.677969 40.4925 
+L 558.677969 23.47875 
+Q 558.677969 22.27875 557.477969 22.27875 
+L 498.237344 22.27875 
+Q 497.037344 22.27875 497.037344 23.47875 
+L 497.037344 40.4925 
+Q 497.037344 41.6925 498.237344 41.6925 
+z
+" style="fill: #ffffff; opacity: 0.8; stroke: #cccccc; stroke-linejoin: miter"/>
+    </g>
+    <g id="line2d_61">
+     <path d="M 499.437344 27.137812 
+L 505.437344 27.137812 
+L 511.437344 27.137812 
+" style="fill: none; stroke: #00bfff; stroke-width: 1.5; stroke-linecap: square"/>
+    </g>
+    <g id="text_44">
+     <!-- higher bound -->
+     <g transform="translate(516.237344 29.237812) scale(0.06 -0.06)">
+      <defs>
+       <path id="DejaVuSans-62" d="M 3116 1747 
+Q 3116 2381 2855 2742 
+Q 2594 3103 2138 3103 
+Q 1681 3103 1420 2742 
+Q 1159 2381 1159 1747 
+Q 1159 1113 1420 752 
+Q 1681 391 2138 391 
+Q 2594 391 2855 752 
+Q 3116 1113 3116 1747 
+z
+M 1159 2969 
+Q 1341 3281 1617 3432 
+Q 1894 3584 2278 3584 
+Q 2916 3584 3314 3078 
+Q 3713 2572 3713 1747 
+Q 3713 922 3314 415 
+Q 2916 -91 2278 -91 
+Q 1894 -91 1617 61 
+Q 1341 213 1159 525 
+L 1159 0 
+L 581 0 
+L 581 4863 
+L 1159 4863 
+L 1159 2969 
+z
+" transform="scale(0.015625)"/>
+      </defs>
+      <use xlink:href="#DejaVuSans-68"/>
+      <use xlink:href="#DejaVuSans-69" transform="translate(63.378906 0)"/>
+      <use xlink:href="#DejaVuSans-67" transform="translate(91.162109 0)"/>
+      <use xlink:href="#DejaVuSans-68" transform="translate(154.638672 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(218.017578 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(279.541016 0)"/>
+      <use xlink:href="#DejaVuSans-20" transform="translate(320.654297 0)"/>
+      <use xlink:href="#DejaVuSans-62" transform="translate(352.441406 0)"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(415.917969 0)"/>
+      <use xlink:href="#DejaVuSans-75" transform="translate(477.099609 0)"/>
+      <use xlink:href="#DejaVuSans-6e" transform="translate(540.478516 0)"/>
+      <use xlink:href="#DejaVuSans-64" transform="translate(603.857422 0)"/>
+     </g>
+    </g>
+    <g id="line2d_62">
+     <path d="M 499.437344 35.944687 
+L 505.437344 35.944687 
+L 511.437344 35.944687 
+" style="fill: none; stroke: #6dd1ac; stroke-width: 1.5; stroke-linecap: square"/>
+    </g>
+    <g id="text_45">
+     <!-- lower bound -->
+     <g transform="translate(516.237344 38.044687) scale(0.06 -0.06)">
+      <use xlink:href="#DejaVuSans-6c"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(27.783203 0)"/>
+      <use xlink:href="#DejaVuSans-77" transform="translate(88.964844 0)"/>
+      <use xlink:href="#DejaVuSans-65" transform="translate(170.751953 0)"/>
+      <use xlink:href="#DejaVuSans-72" transform="translate(232.275391 0)"/>
+      <use xlink:href="#DejaVuSans-20" transform="translate(273.388672 0)"/>
+      <use xlink:href="#DejaVuSans-62" transform="translate(305.175781 0)"/>
+      <use xlink:href="#DejaVuSans-6f" transform="translate(368.652344 0)"/>
+      <use xlink:href="#DejaVuSans-75" transform="translate(429.833984 0)"/>
+      <use xlink:href="#DejaVuSans-6e" transform="translate(493.212891 0)"/>
+      <use xlink:href="#DejaVuSans-64" transform="translate(556.591797 0)"/>
+     </g>
+    </g>
+   </g>
+  </g>
+ </g>
+ <defs>
+  <clipPath id="p434b40172d">
+   <rect x="31.197969" y="19.27875" width="109.31375" height="121.24"/>
+  </clipPath>
+  <clipPath id="pe6bc2e38a2">
+   <rect x="171.586719" y="19.27875" width="109.31375" height="121.24"/>
+  </clipPath>
+  <clipPath id="p0c56ad0c37">
+   <rect x="311.975469" y="19.27875" width="109.31375" height="121.24"/>
+  </clipPath>
+  <clipPath id="p58b81c82c6">
+   <rect x="452.364219" y="19.27875" width="109.31375" height="121.24"/>
+  </clipPath>
+ </defs>
+</svg>
diff --git a/docs/index.md b/docs/index.md
index d5eb303..8bef4dc 100644
--- a/docs/index.md
+++ b/docs/index.md
@@ -3,9 +3,12 @@
   <img src="_static/logo/softtorch_logo_white_transparent.png#only-dark" style="width:50%; max-width:360px; height:auto;">
 </p>
 
-# SoftTorch
+# Soft differentiable programming in PyTorch
 
-## In a nutshell
+Looking for JAX? See [SoftJAX](https://github.com/a-paulus/softjax).
+
+
+## What is SoftTorch?
 
 SoftTorch provides soft differentiable drop-in replacements for traditionally non-differentiable functions in [PyTorch](https://pytorch.org), including
 
@@ -19,8 +22,7 @@ All operators offer multiple modes (controlling smoothness or boundedness of the
 
 All operators also support straight-through estimation, using the non-differentiable function in the forward pass and the soft relaxation in the backward pass.
 
-SoftTorch functions are drop-in replacements for their non-differentiable PyTorch counterparts.
-Special care is needed for functions operating on indices, as we relax discrete indices into distributions over indices, which modifies the shape of returned/accepted values.
+*Note, while SoftTorch is designed to provide direct drop-in replacements for PyTorch's operators, soft axis-wise operators return a probability distribution over indices (instead of an index), effectively changing the shape of the function's output.*
 
 
 ## Installation
@@ -30,353 +32,163 @@ pip install softtorch
 ```
 
 
-## Quick example
-```python
-import torch
-import softtorch as st
-
-x = torch.tensor([-0.2, -1.0, 0.3, 1.0])
-
-# Elementwise functions
-print("\nTorch absolute:", torch.abs(x))
-print("SoftTorch absolute (hard mode):", st.abs(x, mode="hard"))
-print("SoftTorch absolute (soft mode):", st.abs(x))
-
-print("\nTorch clamp:", torch.clamp(x, -0.5, 0.5))
-print("SoftTorch clamp (hard mode):", st.clamp(x, -0.5, 0.5, mode="hard"))
-print("SoftTorch clamp (soft mode):", st.clamp(x, -0.5, 0.5))
-
-print("\nTorch heaviside:", torch.heaviside(x, torch.tensor(0.5)))
-print("SoftTorch heaviside (hard mode):", st.heaviside(x, mode="hard"))
-print("SoftTorch heaviside (soft mode):", st.heaviside(x))
-
-print("\nTorch ReLU:", torch.nn.functional.relu(x))
-print("SoftTorch ReLU (hard mode):", st.relu(x, mode="hard"))
-print("SoftTorch ReLU (soft mode):", st.relu(x))
-
-print("\nTorch round:", torch.round(x))
-print("SoftTorch round (hard mode):", st.round(x, mode="hard"))
-print("SoftTorch round (soft mode):", st.round(x))
-
-print("\nTorch sign:", torch.sign(x))
-print("SoftTorch sign (hard mode):", st.sign(x, mode="hard"))
-print("SoftTorch sign (soft mode):", st.sign(x))
-```
-```
-Torch absolute: tensor([0.2000, 1.0000, 0.3000, 1.0000])
-SoftTorch absolute (hard mode): tensor([0.2000, 1.0000, 0.3000, 1.0000])
-SoftTorch absolute (soft mode): tensor([0.1523, 0.9999, 0.2715, 0.9999])
-
-Torch clamp: tensor([-0.2000, -0.5000,  0.3000,  0.5000])
-SoftTorch clamp (hard mode): tensor([-0.2000, -0.5000,  0.3000,  0.5000])
-SoftTorch clamp (soft mode): tensor([-0.1952, -0.4993,  0.2873,  0.4993])
-
-Torch heaviside: tensor([0., 0., 1., 1.])
-SoftTorch heaviside (hard mode): tensor([0., 0., 1., 1.])
-SoftTorch heaviside (soft mode): tensor([0.1192, 0.0000, 0.9526, 1.0000])
-
-Torch ReLU: tensor([0.0000, 0.0000, 0.3000, 1.0000])
-SoftTorch ReLU (hard mode): tensor([0.0000, 0.0000, 0.3000, 1.0000])
-SoftTorch ReLU (soft mode): tensor([0.0127, 0.0000, 0.3049, 1.0000])
-
-Torch round: tensor([-0., -1.,  0.,  1.])
-SoftTorch round (hard mode): tensor([-0., -1.,  0.,  1.])
-SoftTorch round (soft mode): tensor([-0.0465, -1.0000,  0.1189,  1.0000])
-
-Torch sign: tensor([-1., -1.,  1.,  1.])
-SoftTorch sign (hard mode): tensor([-1., -1.,  1.,  1.])
-SoftTorch sign (soft mode): tensor([-0.7616, -0.9999,  0.9051,  0.9999])
-```
+## Quick examples
 
+**Robust median regression:**
+Minimize the median absolute residual to be robust to outliers.
 ```python
-# Tensor-valued operators
-print("\nTorch max:", torch.max(x))
-print("SoftTorch max (hard mode):", st.max(x, mode="hard"))
-print("SoftTorch max (soft mode):", st.max(x))
-
-print("\nTorch min:", torch.min(x))
-print("SoftTorch min (hard mode):", st.min(x, mode="hard"))
-print("SoftTorch min (soft mode):", st.min(x))
-
-print("\nTorch sort:", torch.sort(x).values)
-print("SoftTorch sort (hard mode):", st.sort(x, mode="hard").values)
-print("SoftTorch sort (soft mode):", st.sort(x).values)
-
-print("\nTorch quantile:", torch.quantile(x, q=0.2))
-print("SoftTorch quantile (hard mode):", st.quantile(x, q=0.2, mode="hard"))
-print("SoftTorch quantile (soft mode):", st.quantile(x, q=0.2))
-
-print("\nTorch median:", torch.median(x))
-print("SoftTorch median (hard mode):", st.median(x, mode="hard"))
-print("SoftTorch median (soft mode):", st.median(x))
-
-print("\nTorch topk:", torch.topk(x, k=3).values)
-print("SoftTorch topk (hard mode):", st.topk(x, k=3, mode="hard").values)
-print("SoftTorch topk (soft mode):", st.topk(x, k=3).values)
-
-print("\nTorch rank:", torch.argsort(torch.argsort(x)))
-print("SoftTorch rank (hard mode):", st.rank(x, mode="hard", descending=False))
-print("SoftTorch rank (soft mode):", st.rank(x, descending=False))
-```
-```
-Torch max: tensor(1.)
-SoftTorch max (hard mode): tensor(1.)
-SoftTorch max (soft mode): tensor(0.8874)
+import torch, softtorch as st
 
-Torch min: tensor(-1.)
-SoftTorch min (hard mode): tensor(-1.)
-SoftTorch min (soft mode): tensor(-0.8996)
+torch.manual_seed(0)
+X = torch.randn(20, 3)
+w_true = torch.tensor([1.0, -2.0, 0.5])
+y = X @ w_true
+y[0] = 1e6  # inject outlier
 
-Torch sort: tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (hard mode): tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (soft mode): tensor([-0.8792, -0.1641,  0.2767,  0.8738])
+def median_regression_loss(w, X, y, mode="smooth"):
+    residuals = y - X @ w
+    return st.median(st.abs(residuals, mode=mode), mode=mode)
 
-Torch quantile: tensor(-0.5200)
-SoftTorch quantile (hard mode): tensor(-0.5200)
-SoftTorch quantile (soft mode): tensor(-0.4501)
+w = torch.zeros(3, requires_grad=True)
+hard_loss = median_regression_loss(w, X, y, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, w)[0])
+soft_loss = median_regression_loss(w, X, y, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, w)[0])
 
-Torch median: tensor(-0.2000)
-SoftTorch median (hard mode): tensor(-0.2000)
-SoftTorch median (soft mode): tensor(-0.1641)
-
-Torch topk: tensor([ 1.0000,  0.3000, -0.2000])
-SoftTorch topk (hard mode): tensor([ 1.0000,  0.3000, -0.2000])
-SoftTorch topk (soft mode): tensor([ 0.8738,  0.2767, -0.1641])
-
-Torch rank: tensor([1, 0, 2, 3])
-SoftTorch rank (hard mode): tensor([2., 1., 3., 4.])
-SoftTorch rank (soft mode): tensor([1.9950, 1.0548, 3.0239, 3.9228])
+w = torch.zeros(3)
+for _ in range(50):
+    w.requires_grad_(True)
+    loss = median_regression_loss(w, X, y)
+    g = torch.autograd.grad(loss, w)[0]
+    w = (w - 0.1 * g).detach()
+print("Learned w:", w, " (true:", w_true, ")")
 ```
-
-```python
-# Sort: sweep over methods
-print("\nTorch sort:", torch.sort(x).values)
-print("SoftTorch sort (softsort):", st.sort(x, method="softsort", softness=0.1).values)
-print("SoftTorch sort (neuralsort):", st.sort(x, method="neuralsort", softness=0.1).values)
-print("SoftTorch sort (fast_soft_sort):", st.sort(x, method="fast_soft_sort", softness=2.0).values)
-print("SoftTorch sort (ot):", st.sort(x, method="ot", softness=0.1).values)
-print("SoftTorch sort (sorting_network):", st.sort(x, method="sorting_network", softness=0.1).values)
-
-# Sort: sweep over modes
-print("\nTorch sort:", torch.sort(x).values)
-for mode in ["hard", "smooth", "c0", "c1", "c2"]:
-    print(f"SoftTorch sort ({mode}):", st.sort(x, softness=0.5, mode=mode).values)
 ```
-```
-Torch sort: tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (softsort): tensor([-0.8996, -0.1705,  0.2847,  0.8874])
-SoftTorch sort (neuralsort): tensor([-0.8792, -0.1641,  0.2767,  0.8738])
-SoftTorch sort (fast_soft_sort): tensor([-0.7462, -0.1971,  0.2938,  0.8569])
-SoftTorch sort (ot): tensor([-0.7324, -0.2396,  0.3286,  0.7434])
-SoftTorch sort (sorting_network): tensor([-0.7999, -0.2672,  0.3847,  0.7863])
-
-Torch sort: tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (hard): tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-SoftTorch sort (smooth): tensor([-0.6057, -0.1997,  0.2729,  0.6281])
-SoftTorch sort (c0): tensor([-1.0000, -0.6313,  0.6525,  0.9824])
-SoftTorch sort (c1): tensor([-0.9982, -0.5432,  0.5814,  0.9837])
-SoftTorch sort (c2): tensor([-0.9978, -0.4905,  0.5425,  0.9903])
+Hard grad: tensor([ 0.2103,  0.1772, -0.8305])
+Soft grad: tensor([ 0.0731,  0.7100, -0.2970])
+Learned w: tensor([ 1.0000, -2.0000,  0.5000])  (true: tensor([ 1.0000, -2.0000,  0.5000]) )
 ```
 
+**Top-k feature selection:**
+Discover which features of a trained model are important.
 ```python
-# Operators returning indices
-print("\nTorch argmax:", torch.argmax(x))
-print("SoftTorch argmax (hard mode):", st.argmax(x, mode="hard"))
-print("SoftTorch argmax (soft mode):", st.argmax(x))
-
-print("\nTorch argmin:", torch.argmin(x))
-print("SoftTorch argmin (hard mode):", st.argmin(x, mode="hard"))
-print("SoftTorch argmin (soft mode):", st.argmin(x))
-
-print("\nTorch argquantile:", "Not implemented in standard PyTorch")
-print("SoftTorch argquantile (hard mode):", st.argquantile(x, q=0.2, mode="hard"))
-print("SoftTorch argquantile (soft mode):", st.argquantile(x, q=0.2))
-
-print("\nTorch argmedian:", torch.median(x, dim=0).indices)
-print("SoftTorch argmedian (hard mode):", st.median(x, mode="hard", dim=0).indices)
-print("SoftTorch argmedian (soft mode):", st.median(x, dim=0).indices)
-
-print("\nTorch argsort:", torch.argsort(x))
-print("SoftTorch argsort (hard mode):", st.argsort(x, mode="hard"))
-print("SoftTorch argsort (soft mode):", st.argsort(x))
-
-print("\nTorch argtopk:", torch.topk(x, k=3).indices)
-print("SoftTorch argtopk (hard mode):", st.topk(x, k=3, mode="hard").indices)
-print("SoftTorch argtopk (soft mode):", st.topk(x, k=3).indices)
-```
-```
-Torch argmax: tensor(3)
-SoftTorch argmax (hard mode): tensor([0., 0., 0., 1.])
-SoftTorch argmax (soft mode): tensor([0.0215, 0.0022, 0.1176, 0.8586])
-
-Torch argmin: tensor(1)
-SoftTorch argmin (hard mode): tensor([0., 1., 0., 0.])
-SoftTorch argmin (soft mode): tensor([0.0922, 0.8885, 0.0169, 0.0023])
-
-Torch argquantile: Not implemented in standard PyTorch
-SoftTorch argquantile (hard mode): tensor([0.6000, 0.4000, 0.0000, 0.0000])
-SoftTorch argquantile (soft mode): tensor([0.5403, 0.3693, 0.0902, 0.0001])
-
-Torch argmedian: tensor(0)
-SoftTorch argmedian (hard mode): tensor([1., 0., 0., 0.])
-SoftTorch argmedian (soft mode): tensor([0.8009, 0.0491, 0.1498, 0.0002])
-
-Torch argsort: tensor([1, 0, 2, 3])
-SoftTorch argsort (hard mode): tensor([[0., 1., 0., 0.],
-        [1., 0., 0., 0.],
-        [0., 0., 1., 0.],
-        [0., 0., 0., 1.]])
-SoftTorch argsort (soft mode): tensor([[0.1494, 0.8496, 0.0009, 0.0000],
-        [0.8009, 0.0491, 0.1498, 0.0002],
-        [0.1418, 0.0001, 0.7899, 0.0681],
-        [0.0011, 0.0000, 0.1784, 0.8205]])
-
-Torch argtopk: tensor([3, 2, 0])
-SoftTorch argtopk (hard mode): tensor([[0., 0., 0., 1.],
-        [0., 0., 1., 0.],
-        [1., 0., 0., 0.]])
-SoftTorch argtopk (soft mode): tensor([[0.0011, 0.0000, 0.1784, 0.8205],
-        [0.1418, 0.0001, 0.7899, 0.0681],
-        [0.8009, 0.0491, 0.1498, 0.0002]])
-```
-
+n_features, k = 10, 3
+torch.manual_seed(42)
+X = torch.randn(100, n_features)
+w_model = torch.tensor([0, 2.0, 0, -1.5, 0, 0, 0, 5.0, 0, 0])
+y = X @ w_model + 0.1 * torch.randn(100)
+
+def feature_selection_loss(g, X, y, w_model, mode="smooth"):
+    _, soft_idx = st.topk(g, k=k, mode=mode, gated_grad=False)
+    mask = soft_idx.sum(dim=0)
+    y_pred = (X * mask) @ w_model
+    return torch.mean(st.abs(y_pred - y))
+
+g = torch.zeros(n_features, requires_grad=True)
+hard_loss = feature_selection_loss(g, X, y, w_model, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, g)[0] if hard_loss.requires_grad else torch.zeros_like(g))
+soft_loss = feature_selection_loss(g, X, y, w_model, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, g)[0])
+
+g = torch.zeros(n_features)
+for _ in range(5):
+    g.requires_grad_(True)
+    loss = feature_selection_loss(g, X, y, w_model)
+    g_grad = torch.autograd.grad(loss, g)[0]
+    g = (g - 0.001 * g_grad).detach()
+print("Selected features:", torch.topk(g, k=k).indices)
+```
+```
+Hard grad: tensor([0., 0., 0., 0., 0., 0., 0., 0., 0., 0.])
+Soft grad: tensor([  2359.3386,     62.9980,   2359.3386,   -890.2852,   2359.3386,
+          2359.3386,   2359.3386, -15688.0829,   2359.3386,   2359.3386])
+Selected features: tensor([7, 3, 1])
+```
+
+**Differentiable threshold filtering:**
+Learn a threshold that gates inputs.
 ```python
-y = torch.tensor([0.2, -0.5, 0.5, -1.0])
-
-# Comparison operators
-print("\nTorch greater:", torch.greater(x, y))
-print("SoftTorch greater (hard mode):", st.greater(x, y, mode="hard"))
-print("SoftTorch greater (soft mode):", st.greater(x, y))
-
-print("\nTorch greater equal:", torch.greater_equal(x, y))
-print("SoftTorch greater equal (hard mode):", st.greater_equal(x, y, mode="hard"))
-print("SoftTorch greater equal (soft mode):", st.greater_equal(x, y))
+x = torch.tensor([0.2, 0.8, 0.5, 1.2, 0.1])
+target_sum = 2.0  # sum of values above threshold = 2.0 (i.e. 0.8 + 1.2)
 
-print("\nTorch less:", torch.less(x, y))
-print("SoftTorch less (hard mode):", st.less(x, y, mode="hard"))
-print("SoftTorch less (soft mode):", st.less(x, y))
+def filter_loss(t, x, target, mode="smooth"):
+    mask = st.greater(x, t, mode=mode)
+    return (torch.sum(mask * x) - target) ** 2
 
-print("\nTorch less equal:", torch.less_equal(x, y))
-print("SoftTorch less equal (hard mode):", st.less_equal(x, y, mode="hard"))
-print("SoftTorch less equal (soft mode):", st.less_equal(x, y))
+t = torch.tensor(0.0, requires_grad=True)
+hard_loss = filter_loss(t, x, target_sum, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, t)[0] if hard_loss.requires_grad else torch.zeros_like(t))
+soft_loss = filter_loss(t, x, target_sum, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, t)[0])
 
-print("\nTorch eq:", torch.eq(x, y))
-print("SoftTorch eq (hard mode):", st.eq(x, y, mode="hard"))
-print("SoftTorch eq (soft mode):", st.eq(x, y))
-
-print("\nTorch not equal:", torch.not_equal(x, y))
-print("SoftTorch not equal (hard mode):", st.not_equal(x, y, mode="hard"))
-print("SoftTorch not equal (soft mode):", st.not_equal(x, y))
-
-print("\nTorch isclose:", torch.isclose(x, y))
-print("SoftTorch isclose (hard mode):", st.isclose(x, y, mode="hard"))
-print("SoftTorch isclose (soft mode):", st.isclose(x, y))
+t = torch.tensor(0.0)
+for _ in range(20):
+    t.requires_grad_(True)
+    loss = filter_loss(t, x, target_sum)
+    t_grad = torch.autograd.grad(loss, t)[0]
+    t = (t - 0.1 * t_grad).detach()
+print("Learned threshold:", t)
 ```
 ```
-Torch greater: tensor([False, False, False,  True])
-SoftTorch greater (hard mode): tensor([0., 0., 0., 1.])
-SoftTorch greater (soft mode): tensor([0.0180, 0.0067, 0.1192, 1.0000])
-
-Torch greater equal: tensor([False, False, False,  True])
-SoftTorch greater equal (hard mode): tensor([0., 0., 0., 1.])
-SoftTorch greater equal (soft mode): tensor([0.0180, 0.0067, 0.1192, 1.0000])
-
-Torch less: tensor([ True,  True,  True, False])
-SoftTorch less (hard mode): tensor([1., 1., 1., 0.])
-SoftTorch less (soft mode): tensor([0.9820, 0.9933, 0.8808, 0.0000])
-
-Torch less equal: tensor([ True,  True,  True, False])
-SoftTorch less equal (hard mode): tensor([1., 1., 1., 0.])
-SoftTorch less equal (soft mode): tensor([0.9820, 0.9933, 0.8808, 0.0000])
-
-Torch eq: tensor([False, False, False, False])
-SoftTorch eq (hard mode): tensor([0., 0., 0., 0.])
-SoftTorch eq (soft mode): tensor([0.0414, 0.0143, 0.3580, 0.0000])
-
-Torch not equal: tensor([True, True, True, True])
-SoftTorch not equal (hard mode): tensor([1., 1., 1., 1.])
-SoftTorch not equal (soft mode): tensor([0.9586, 0.9857, 0.6420, 1.0000])
-
-Torch isclose: tensor([False, False, False, False])
-SoftTorch isclose (hard mode): tensor([0., 0., 0., 0.])
-SoftTorch isclose (soft mode): tensor([0.0414, 0.0143, 0.3580, 0.0000])
+Hard grad: tensor(0.)
+Soft grad: tensor(-0.6600)
+Learned threshold: tensor(0.6211)
 ```
 
+**Rule-based classifier:**
+Learn decision boundaries `[lo, hi]` for a rule using soft logic and straight-through estimation. The rule is true if any element of a feature is inside `[lo, hi]`.
 ```python
-# Logical operators
-fuzzy_a = torch.tensor([0.1, 0.2, 0.8, 1.0])
-fuzzy_b = torch.tensor([0.7, 0.3, 0.1, 0.9])
-bool_a = fuzzy_a >= 0.5
-bool_b = fuzzy_b >= 0.5
-
-print("\nTorch AND:", torch.logical_and(bool_a, bool_b))
-print("SoftTorch AND:", st.logical_and(fuzzy_a, fuzzy_b))
-
-print("\nTorch OR:", torch.logical_or(bool_a, bool_b))
-print("SoftTorch OR:", st.logical_or(fuzzy_a, fuzzy_b))
-
-print("\nTorch NOT:", torch.logical_not(bool_a))
-print("SoftTorch NOT:", st.logical_not(fuzzy_a))
-
-print("\nTorch XOR:", torch.logical_xor(bool_a, bool_b))
-print("SoftTorch XOR:", st.logical_xor(fuzzy_a, fuzzy_b))
-
-print("\nTorch ALL:", torch.all(bool_a))
-print("SoftTorch ALL:", st.all(fuzzy_a))
-
-print("\nTorch ANY:", torch.any(bool_a))
-print("SoftTorch ANY:", st.any(fuzzy_a))
-
-# Selection operators
-print("\nTorch Where:", torch.where(bool_a, x, y))
-print("SoftTorch Where:", st.where(fuzzy_a, x, y))
-```
-```
-Torch AND: tensor([False, False, False,  True])
-SoftTorch AND: tensor([0.0700, 0.0600, 0.0800, 0.9000])
-
-Torch OR: tensor([ True, False,  True,  True])
-SoftTorch OR: tensor([0.7300, 0.4400, 0.8200, 1.0000])
-
-Torch NOT: tensor([ True,  True, False, False])
-SoftTorch NOT: tensor([0.9000, 0.8000, 0.2000, 0.0000])
+x = torch.tensor([[0.2, 0.8], [0.5, 0.3], [0.9, 0.1], [0.4, 0.7], [0.1, 0.4], [0.2, 0.7], [0.4, 0.1], [0.4, 0.7],
+               [0.7, 0.29], [0.3, 0.3], [0.61, 0.25], [0.4, 0.6], [0.0, 0.1], [0.5, 0.3], [0.4, 0.9], [0.1, 0.57]])
+labels = torch.tensor([0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0, 1.0,
+                    0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 1.0, 1.0])
+
+@st.st
+def rule_loss(params, x, labels, mode="smooth"):
+    lo, hi = params[0], params[1]
+    above = st.greater(x, lo, mode=mode)
+    below = st.less(x, hi, mode=mode)
+    in_range = st.logical_and(above, below)
+    preds = st.any(in_range, dim=-1)
+    return ((preds - labels) ** 2).sum()
+
+params = torch.tensor([0.0, 1.0], requires_grad=True)
+hard_loss = rule_loss(params, x, labels, mode="hard")
+print("Hard grad:", torch.autograd.grad(hard_loss, params)[0] if hard_loss.requires_grad else torch.zeros_like(params))
+soft_loss = rule_loss(params, x, labels, mode="smooth")
+print("Soft grad:", torch.autograd.grad(soft_loss, params)[0])
+
+params = torch.tensor([0.0, 1.0])
+for _ in range(20):
+    params.requires_grad_(True)
+    loss = rule_loss(params, x, labels)
+    p_grad = torch.autograd.grad(loss, params)[0]
+    params = (params - 0.01 * p_grad).detach()
+print("Learned [lo, hi]:", params)
+```
+```
+Hard grad: tensor([0., 0.])
+Soft grad: tensor([-4.2777,  1.4152])
+Learned [lo, hi]: tensor([0.2925, 0.5999])
+```
+
+<img src="examples/quick_example_optimization.svg" alt="Optimization trajectories" width="100%">
 
-Torch XOR: tensor([ True, False,  True, False])
-SoftTorch XOR: tensor([0.6411, 0.3464, 0.7256, 0.1000])
-
-Torch ALL: tensor(False)
-SoftTorch ALL: tensor(0.0160)
+## Citation
 
-Torch ANY: tensor(True)
-SoftTorch ANY: tensor(1.)
+If this library helped your academic work, please consider citing: ([arXiv link](https://arxiv.org/abs/2603.08824))
 
-Torch Where: tensor([ 0.2000, -0.5000,  0.3000,  1.0000])
-SoftTorch Where: tensor([ 0.1600, -0.6000,  0.3400,  1.0000])
+```bibtex
+@article{paulus2026softjax,
+  title={{SoftJAX} \& {SoftTorch}: Empowering Automatic Differentiation Libraries with Informative Gradients},
+  author={Paulus, Anselm and Geist, A.\ Ren\'e and Musil, V\'it and Hoffmann, Sebastian and Beker, Onur and Martius, Georg},
+  journal={arXiv preprint},
+  year={2026},
+  eprint={2603.08824}
+}
 ```
 
-```python
-# Straight-through operators: Use hard function on forward and soft on backward
-print("Straight-through ReLU:", st.relu_st(x))
-print("Straight-through sort:", st.sort_st(x).values)
-print("Straight-through argtopk:", st.topk_st(x, k=3).indices)
-print("Straight-through greater:", st.greater_st(x, y))
-# And many more...
-```
-```
-Straight-through ReLU: tensor([0.0000, 0.0000, 0.3000, 1.0000])
-Straight-through sort: tensor([-1.0000, -0.2000,  0.3000,  1.0000])
-Straight-through argtopk: tensor([[0., 0., 0., 1.],
-        [0., 0., 1., 0.],
-        [1., 0., 0., 0.]])
-Straight-through greater: tensor([0., 0., 0., 1.])
-```
-
-The outputs were generated with `docs/quick_example.py`.
-
-
-## Citation
-
---8<-- ".citation.md"
+(Also consider starring the project [on GitHub](https://github.com/a-paulus/softtorch))
 
 Special thanks and credit go to [Patrick Kidger](https://kidger.site) for the awesome [JAX repositories](https://github.com/patrick-kidger) that served as the basis for the documentation of this project.
 
@@ -388,7 +200,7 @@ Have a look at the [All of SoftTorch](./all-of-softtorch.ipynb) page.
 
 ## Feedback
 
-This project is still relatively young, if you have any suggestions for improvement or other feedback, please [reach out](mailto:paulus.anselm@gmail.com) or raise a GitHub issue!
+If you have any suggestions for improvement or other feedback, please [reach out](mailto:paulus.anselm@gmail.com) or raise a GitHub issue!
 
 
 ## See also
diff --git a/mkdocs.yml b/mkdocs.yml
index cd8dca4..2de3530 100644
--- a/mkdocs.yml
+++ b/mkdocs.yml
@@ -77,7 +77,6 @@ plugins:
             - "_overrides"
             - "_static/README.md"
             - "examples/.ipynb_checkpoints"
-            - "examples/"
     - mkdocs-jupyter:
         include_requirejs: false
         custom_mathjax_url: "https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.7/latest.js?config=TeX-AMS_CHTML-full,Safe"
@@ -130,7 +129,7 @@ nav:
     - All of Softtorch: 'all-of-softtorch.ipynb'
     - Examples:
         - 'plots.ipynb'
-        - 'manifold_points.ipynb'
+        - 'examples/manifold_points.ipynb'
     - API:
         - 'api/softtorch_operators.md'
         - 'api/straight_through.md'
diff --git a/src/softtorch/functions.py b/src/softtorch/functions.py
index 110a1ec..71981a1 100644
--- a/src/softtorch/functions.py
+++ b/src/softtorch/functions.py
@@ -1440,8 +1440,10 @@ def topk(
             ot_kwargs=ot_kwargs,
         )  # (..., k, ..., [n])
         if not gated_grad:
-            soft_index = soft_index.detach()
-        values = take_along_dim(x, soft_index, dim=dim)  # (..., k, ...)
+            soft_index_tmp = soft_index.detach()
+        else:
+            soft_index_tmp = soft_index
+        values = take_along_dim(x, soft_index_tmp, dim=dim)  # (..., k, ...)
     return torch.return_types.topk((values, soft_index))