You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: lectures/likelihood_ratio_process.md
+29-10Lines changed: 29 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -37,8 +37,8 @@ Among the things that we'll learn are
37
37
38
38
* How a likelihood ratio process is a key ingredient in frequentist hypothesis testing
39
39
* How a **receiver operator characteristic curve** summarizes information about a false alarm probability and power in frequentist hypothesis testing
40
-
* How a statistician can combine frequentist probabilities of type I and type II errors to form posterior probabilities of mistakes in a model selection or a classification problem
41
-
* How likelihood ratios helped Lawrence Blume and David Easley formulate an answer to the question ''If you're so smart, why aren't you rich?'' {cite}`blume2006if`
40
+
* How a statistician can combine frequentist probabilities of type I and type II errors to form posterior probabilities of mistakes in a model selection or in an individual-classification problem
41
+
* How likelihood ratios helped Lawrence Blume and David Easley formulate an answer to ''If you're so smart, why aren't you rich?'' {cite}`blume2006if`
42
42
* How to use a Kullback-Leibler divergence to quantify the difference between two probability distributions with the same support
43
43
* How during World War II the United States Navy devised a decision rule for doing quality control on lots of ammunition, a topic that sets the stage for {doc}`this lecture <wald_friedman>`
44
44
* A peculiar property of likelihood ratio processes
@@ -673,7 +673,7 @@ where $L_t=\prod_{j=1}^{t}\frac{f(w_j)}{g(w_j)}$ is the likelihood ratio process
673
673
674
674
(For the proof, see [this note](https://nowak.ece.wisc.edu/ece830/ece830_fall11_lecture7.pdf).)
675
675
676
-
{eq}`eq:kl_likelihood_link` tells us that:
676
+
Equation {eq}`eq:kl_likelihood_link` tells us that:
677
677
- When $K_g < K_f$ (i.e., $g$ is closer to $h$ than $f$ is), the expected log likelihood ratio is negative, so $L\left(w^t\right) \rightarrow 0$.
678
678
- When $K_g > K_f$ (i.e., $f$ is closer to $h$ than $g$ is), the expected log likelihood ratio is positive, so $L\left(w^t\right) \rightarrow + \infty$.
679
679
@@ -782,12 +782,31 @@ In the [next section](hetero_agent), we will see an application of these ideas.
782
782
783
783
784
784
(hetero_agent)=
785
-
## Consumption and Heterogeneous Beliefs
785
+
## Heterogeneous Beliefs and Financial Markets
786
786
787
787
A likelihood ratio process lies behind Lawrence Blume and David Easley's answer to their question
788
788
''If you're so smart, why aren't you rich?'' {cite}`blume2006if`.
789
789
790
-
Here we'll provide an example that illustrates basic components of their analysis.
790
+
Blume and Easley constructed formal models to study how differences of opinions about probabilities governing risky income processes would influence outcomes and be reflected in prices of stocks, bonds, and insurance policies that individuals use to share and hedge risks.
791
+
792
+
```{note}
793
+
{cite}`alchian1950uncertainty` and {cite}`friedman1953essays` can conjectured that, by rewarding traders with more realistic probability models, competitive markets in financial securities put wealth in the hands of better informed traders and help
794
+
make prices of risky assets reflect realistic probability assessments.
795
+
```
796
+
797
+
798
+
Here we'll provide an example that illustrates basic components of Blume and Easley's analysis.
799
+
800
+
We'll focus only on their analysis of an environment with complete markets in which trades in all conceivable risky securities are possible.
801
+
802
+
We'll study two alternative arrangements:
803
+
804
+
* perfect socialism in which individuals surrender their endowments of consumption goods each period to a central planner who then dictatorially allocates those goods
805
+
* a decentralized system of competitive markets in which selfish price-taking individuals voluntarily trade with each other in competitive markets
806
+
807
+
The fundamental theorems of welfare economics will apply and assure us that these two arrangements end up producing exactly the same allocation of consumption goods to individuals **provided** that the social planner assigns an appropriate set of **Pareto weights**.
808
+
809
+
791
810
792
811
Let the random variable $s_t \in (0,1)$ at time $t =0, 1, 2, \ldots$ be distributed according to the same Beta distribution with parameters
793
812
$\theta = \{\theta_1, \theta_2\}$.
@@ -816,7 +835,7 @@ $$c^1(s_t) = y_t^1 = s_t. $$
816
835
817
836
But in our model, agent 1 is not alone.
818
837
819
-
### Nature and beliefs
838
+
### Nature and agents' beliefs
820
839
821
840
Nature draws i.i.d. sequences $\{s_t\}_{t=0}^\infty$ from $\pi_t(s^t)$.
822
841
@@ -921,7 +940,7 @@ Notice how social welfare criterion {eq}`eq:welfareW` takes into account both ag
921
940
922
941
This means that the social planner knows and respects
923
942
924
-
* the one period utility function $u(\cdot) = \ln(\cdot)$
943
+
* each agent's one period utility function $u(\cdot) = \ln(\cdot)$
925
944
* each agent $i$'s probability model $\{\pi_t^i(s^t)\}_{t=0}^\infty$
926
945
927
946
Consequently, we anticipate that these objects will appear in the social planner's rule for allocating the aggregate endowment each period.
We find that $KL(f,g) > KL(g,f)$ and $KL(h,g) > KL(h,f)$.
1223
1242
1224
-
The first inequality tells us that the average "surprise" or "inefficiency" of using belief $g$ when nature chooses $f$ is greater than the "surprise" of using belief $f$ when nature chooses $g$.
1243
+
The first inequality tells us that the average "surprise" from having belief $g$ when nature chooses $f$ is greater than the "surprise" from having belief $f$ when nature chooses $g$.
1225
1244
1226
1245
This explains the difference between the first two panels we noted above.
1227
1246
@@ -1392,7 +1411,7 @@ We consider two alternative timing protocols.
1392
1411
* Timing protocol 1 is for the model selection problem
1393
1412
* Timing protocol 2 is for the individual classification problem
1394
1413
1395
-
**Timing Protocol 1:** Nature flips a coin once at time $t=-1$ and with probability $\pi_{-1}$ generates a sequence $\{w_t\}_{t=1}^T$
1414
+
**Timing Protocol 1:** Nature flips a coin only **once** at time $t=-1$ and with probability $\pi_{-1}$ generates a sequence $\{w_t\}_{t=1}^T$
1396
1415
of IID draws from $f$ and with probability $1-\pi_{-1}$ generates a sequence $\{w_t\}_{t=1}^T$
**Timing Protocol 2.** At each time $t \geq 0$, nature flips a coin and with probability $\pi_{-1}$ draws $w_t$ from $f$ and with probability $1-\pi_{-1}$ draws $w_t$ from $g$.
1442
+
**Timing Protocol 2.**Nature flips a coin **often**. At each time $t \geq 0$, nature flips a coin and with probability $\pi_{-1}$ draws $w_t$ from $f$ and with probability $1-\pi_{-1}$ draws $w_t$ from $g$.
1424
1443
1425
1444
Here is Python code that we'll use to implement timing protocol 2.
0 commit comments