title = "Multiplicative Hashing Functions -- Notes on Primes, Golden Ratio, and Evil"
draft = false

# Optional featured image (relative to `static/img/` folder).
[header]
caption = ""

## Introduction
Mapping partitions (or keys) to slices (or buckets) in a distributed or sharded system has a large impact on performance.
Different hash-based solutions for this problem exist; each has drawbacks.
The goal is to choose a hash function that is simple to implement and gives acceptable performance with as few collisions as possible.

The problem is defined as follows. Given a logical partition number $P$, compute the corresponding slice number $S = s(P)$.
Here $0 ≤ P < 2^{32}$, $0 ≤ S < M$, and $M$ denotes the table size (e.g. $M = 2^{14}$).
In our "binary" world, assumptions that input data (partition numbers) have uniform distribution are not always correct.
Therefore, the hash function $s: P \to S$ must be designed very carefully.
In addition to providing uniform distribution of hash values (slice numbers), it has to add some randomness.
Luckily, this field is well studied: the well-known textbooks of Cormen and Knuth give a good introduction to it.
The latter has a more detailed explanation; therefore, without hesitation, we make use of Knuth's book (Section 6.4),
usually explaining in two paragraphs what the book says in just one sentence.

## Basic Ideas

A first approach is the division-remainder method with $M$ a power of 2:

$s(P) = P \bmod M$,

where $M = 2^{14}$ is the table size (number of slices). A variant is to take $M$ prime instead:

$s(P) = P \bmod M$,

where $M = 16411$ is a prime $> 2^{14}$. Both forms are called the "Division Remainder Method". These notes focus on another method, the "Multiplicative Method":

$$f(P) = A \cdot P \bmod W$$

```c
uint32_t slice(uint32_t P) {
    /* A = 2654435769 = floor(2^32 / phi), the golden-ratio multiplier;
       the shift keeps the top 14 bits of A * P mod 2^32. */
    return (2654435769u * P) >> 18;
}
```

It works well for arbitrary input data and allows using the same number of slices $M = 2^{14}$. As the reader can see, the multiplicative version uses only a multiplication and a logical shift; on some architectures it can be faster than computing a modulo.

## Details for Math Fans

Actually, this condition on $M$ is too strong.
To satisfy this property, it is sufficient that $M ≠ 2^i$ holds.
For example, $M = 15 · 12 · 97 = 17460$ (15 modules, 12 disks, 97 is a prime) is also "good".

Since $M$ is prime, it seems that the following pattern is not common: $P = S + M · i$. But actually, primes that are close to a power of 2 are also not good. Knuth recommends choosing $M$ such that the condition $r^j ≡ ± a \pmod M$ does not hold for any small integers $a$ and $j$, where $r$ denotes the radix of the computation. From Knuth's explanation, the right value of $r$ for our case is not entirely clear: is it $r=2$, $r=16$, or $r=256$? It seems the answer depends heavily on the type of input data.

By Knuth, if $r=2$, the chosen $M$ is not so good, since $M = 16411 = 2^{14} + 27$, and hence $2^{14} ≡ -27 \pmod M$. For $r=16$, we get that $16^4 ≡ -108 \pmod M$; for $r=256$, $256^2 ≡ -108 \pmod M$.

Knuth explains that such $M$ may produce a hash code that is a simple composition of the key digits (in the base-$r$ system). Instead of trying to unpack this explanation, we give some intuition. Working with numbers, a programmer usually chooses powers of 2 for the sizes of structures and buffers (e.g., $2^{10}$ bytes). He then defines a format for such data and introduces headers (e.g., a header of 20 bytes). Hence, the size of the data without the header becomes very close to a power of 2 (in our example, $2^{10} - 20 = 1004$). On the other hand, embedding this structure into an outer packet (assume the outer header is 30 bytes) makes the total size also close to a power of 2 ($2^{10} + 30 = 1054$). As a result, most numbers in our "binary" world are either powers of 2 or close to them. Therefore, such a choice of $M$ increases collisions. In other words, not only powers of 2 are *evil*, but primes close to them are *evil* too.

As an example of a "good" prime, let's consider $M = 24571$. It is a bit smaller than the midpoint of $2^{14}$ and $2^{15}$.

We show the implementation of $p()$ in C code for the multiplicative hashing only.