Skip to content

Commit 5b09496

Browse files
committed
New Post
1 parent 48a8f87 commit 5b09496

1 file changed

Lines changed: 74 additions & 0 deletions

File tree

Lines changed: 74 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,74 @@
1+
---
2+
title: "Expectations are Linear"
3+
date: 2026-04-26T09:03:28-04:00
4+
draft: false
5+
tags: []
6+
math: true
7+
medium_enabled: false
8+
---
9+
10+
> As an example, he asked me, in more words, what the expected rank when flipping over the top card of a deck of cards was (A=1, J=11, Q=12, K=13). This is easy to compute directly as 7. Then he asked me the expectation of the *sum of the top two cards*.
11+
>
12+
> \- [From "Expectation and Copysets" by Justin Jaffray](https://buttondown.com/jaffray/archive/expectation-and-copysets/)
13+
14+
What does your intuition say the answer is? Justin continues by stating that computing this expectation is as easy as summing their individual expectations.
15+
$$
16+
E[X + Y] = E[X] + E[Y]
17+
$$
18+
In other words, **expectations are linear**. I recommend reading his entire blog post. It's great and also talks about how this property is used in databases today. After a high-level explanation, he says:
19+
20+
> The fact that expectation is linear is easy to show if you just look at the definition, which we will not do here, but I trust you are capable of if you are interested and have not already seen it.
21+
22+
In this episode of *Exercise for the Reader* ([last episode](/blog/implications-prenex-normal-form/)), we'll look at the definition and show why this property holds. This is true regardless of the underlying probability distribution and whether or not we're sampling with replacement.
23+
24+
As Justin stated, let's start with the definition of expectation and then split the sum:
25+
$$
26+
\begin{align*}
27+
E[X + Y] &= \sum_{x \in X} \sum_{y \in Y} (x + y) \cdot P(X = x, Y=y) \\\\
28+
&= (\sum_{x \in X} \sum_{y \in Y} x \cdot P(X = x, Y = y)) + (\sum_{x \in X} \sum_{y \in Y} y \cdot P(X = x, Y = y))
29+
\end{align*}
30+
$$
31+
Notice that the left-hand-side of the multiplication does not depend on both variables anymore. Also it doesn't matter whether we do the summation over $X$ first or $Y$. Therefore, we can bring that variable out of the inner sum and simplify this to:
32+
$$
33+
E[X + Y] = (\sum_{x \in X} x \sum_{y \in Y} P(X = x, Y = y)) + (\sum_{y \in Y} y \sum_{x \in X} P(X = x, Y = y))
34+
$$
35+
We can then perform *marginalization* to substitute $\sum_{y \in Y} P(X = x, Y = y)$ with $P(X = x)$ and do the same for the right hand side of the sum.
36+
$$
37+
\begin{align*}
38+
E[X + Y] &= (\sum_{x \in X}xP(X = x)) + (\sum_{y \in y}yP(Y=y))) \\\\
39+
&= E[x] + E[Y]
40+
\end{align*}
41+
$$
42+
43+
---
44+
45+
Why can we marginalize? For me to show why, we need to peel back the curtain on the notation.
46+
47+
The set $\Omega$ contains the outcomes of all the events that we're concerned about. So, if we are considering events $X$ and $Y$ with outcomes $x_i$ and $y_i$, respectively. Then, our event space $\Omega$ is equal to $\\{ (x_i, y_i) \mid x_i \in X, y_i \in Y\\}$.
48+
49+
Therefore when we say $X = x$, what we really mean is the set of outcomes where that is true. In mathematical terms, $\\{\omega \in \Omega \mid X(\omega) = x\\}$.
50+
51+
Now, let's show why $\sum_{y \in Y} P(X = x, Y = y) = P(X = x)$.
52+
$$
53+
\sum_{y \in Y}P(X = x, Y = y) = \sum_{y \in Y} P(\\{\omega \in \Omega \mid X(\omega) = x\\} \cap \\{\omega \in \Omega \mid Y(\omega) = y\\})
54+
$$
55+
One of the three Kolmogorov axioms of probability is **countable additivity**. This is defined as:
56+
$$
57+
\sum_{x \in A}P(X = x) = P(\bigcup_{x \in A}X =x )
58+
$$
59+
Substituting that in and simplifying, we get:
60+
$$
61+
\begin{align*}
62+
\sum_{y \in Y}P(X = x, Y = y) &= P(\bigcup_{y \in Y}(\\{\omega \in \Omega \mid X(\omega) = x\\} \cap \\{\omega \in \Omega \mid Y(\omega) = y\\})) \\\\
63+
&= P(\\{\omega \in \Omega \mid X(\omega) = x\\} \cap \bigcup_{y \in Y}\\{\omega \in \Omega \mid Y(\omega) = y\\})
64+
\end{align*}
65+
$$
66+
Notice that the right hand side of the term is just $\Omega$. We can then simplify to,
67+
$$
68+
\begin{align*}
69+
\sum_{y \in Y}P(X = x, Y = y) &= P(\\{\omega \in \Omega \mid X(\omega) = x\\} \cap \Omega) \\\\
70+
&= P(\\{\omega \in \Omega \mid X(\omega) = x\\}) \\\\
71+
&= P(X = x)
72+
\end{align*}
73+
$$
74+
Since countable additivity is an axiom, we'll stop our derivations there. See you next time.

0 commit comments

Comments
 (0)