Skip to content

Commit 517a83f

Browse files
Tom's Mar 15 edits
1 parent cb0e274 commit 517a83f

4 files changed

Lines changed: 281 additions & 66 deletions

File tree

lectures/_static/quant-econ.bib

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -224,6 +224,14 @@ @book{Burns_2023
224224
address = {New York}
225225
}
226226

227+
@book{lucas1981rational,
228+
title={Rational expectations and econometric practice},
229+
author={Lucas, Robert E and Sargent, Thomas J},
230+
year={1981},
231+
publisher={U of Minnesota Press},
232+
address = {Minneapolis, Minnesota}
233+
}
234+
227235
@article{Orcutt_Winokur_69,
228236
issn = {00129682, 14680262},
229237
abstract = {Monte Carlo techniques are used to study the first order autoregressive time series model with unknown level, slope, and error variance. The effect of lagged variables on inference, estimation, and prediction is described, using results from the classical normal linear regression model as a standard. In particular, use of the t and x^2 distributions as approximate sampling distributions is verified for inference concerning the level and residual error variance. Bias in the least squares estimate of the slope is measured, and two bias corrections are evaluated. Least squares chained prediction is studied, and attempts to measure the success of prediction and to improve on the least squares technique are discussed.},

lectures/_toc.yml

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -104,7 +104,8 @@ parts:
104104
- file: cross_product_trick
105105
- file: perm_income
106106
- file: perm_income_cons
107-
- file: theil
107+
- file: theil_1
108+
- file: theil_2
108109
- file: lq_inventories
109110
- caption: Optimal Growth
110111
numbered: true

lectures/theil_1.md

Lines changed: 192 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,192 @@
1+
---
2+
jupytext:
3+
text_representation:
4+
extension: .md
5+
format_name: myst
6+
kernelspec:
7+
display_name: Python 3
8+
language: python
9+
name: python3
10+
---
11+
12+
(certainty_equiv_robustness)=
13+
```{raw} jupyter
14+
<div id="qe-notebook-header" align="right" style="text-align:right;">
15+
<a href="https://quantecon.org/" title="quantecon.org">
16+
<img style="width:250px;display:inline;" width="250px" src="https://assets.quantecon.org/img/qe-menubar-logo.svg" alt="QuantEcon">
17+
</a>
18+
</div>
19+
```
20+
21+
# Certainty Equivalence
22+
23+
```{index} single: Certainty Equivalence; Robustness
24+
```
25+
26+
```{index} single: LQ Control; Permanent Income
27+
```
28+
29+
```{contents} Contents
30+
:depth: 2
31+
```
32+
33+
34+
In addition to what's in Anaconda, this lecture will need the following libraries:
35+
36+
```{code-cell} ipython3
37+
---
38+
tags: [hide-output]
39+
---
40+
!pip install quantecon
41+
```
42+
43+
44+
## The Central Problem of Empirical Economics
45+
46+
The papers collected in {cite}`lucas1981rational` address a single overarching question: given observations on an agent's behavior in a particular economic environment, what can we infer about how that behavior **would have differed** had the environment been altered? This is the problem of policy-invariant structural inference.
47+
48+
The difficulty is immediate. Observations arise under one environment; we wish to predict behavior under another. Unless we understand *why* the agent behaves as he does—that is, unless we recover the deep objectives that rationalize observed decisions—estimated behavioral relationships are silent on this question.
49+
50+
---
51+
52+
## A Formal Setup
53+
54+
Consider a single decision maker whose situation at date $t$ is fully described by two state variables $(x_t, z_t)$.
55+
56+
**The environment** $z_t \in S_1$ is selected by "nature" and evolves exogenously according to
57+
58+
```{math}
59+
:label: eq:z_transition
60+
z_{t+1} = f(z_t,\, \epsilon_t),
61+
```
62+
63+
where the innovations $\epsilon_t \in \mathcal{E}$ are i.i.d. draws from a fixed c.d.f. $\Phi(\cdot) : \mathcal{E} \to [0,1]$. The function $f : S_1 \times \mathcal{E} \to S_1$ is called the **decision maker's environment**.
64+
65+
**The endogenous state** $x_t \in S_2$ is under partial control of the agent. Each period the agent selects an action $u_t \in U$. A fixed technology $g : S_1 \times S_2 \times U \to S_2$ governs the transition
66+
67+
```{math}
68+
:label: eq:x_transition
69+
x_{t+1} = g(z_t,\, x_t,\, u_t).
70+
```
71+
72+
**The decision rule** $h : S_1 \times S_2 \to U$ maps the agent's current situation into an action:
73+
74+
```{math}
75+
:label: eq:decision_rule
76+
u_t = h(z_t,\, x_t).
77+
```
78+
79+
The econometrician observes (some or all of) the process $\{z_t, x_t, u_t\}$, the joint motion of which is determined by {eq}`eq:z_transition`, {eq}`eq:x_transition`, and {eq}`eq:decision_rule`.
80+
81+
---
82+
83+
## The Lucas Critique: Why Estimated Rules Are Not Enough
84+
85+
Suppose we have estimated $f$, $g$, and $h$ from a long time series generated under a fixed environment $f_0$. This gives us $h_0 = T(f_0)$, where $T$ is the (unknown) functional mapping environments into optimal decision rules. But this single estimate, however precise, **reveals nothing** about how $T(f)$ varies with $f$.
86+
87+
Policy evaluation requires knowledge of the entire map $f \mapsto T(f)$. Under an environment change $f_0 \to f_1$, agents will in general revise their decision rules $h_0 \to h_1 = T(f_1)$, rendering the estimated rule $h_0$ invalid for forecasting behavior under $f_1$.
88+
89+
The only nonexperimental path forward is to recover the **return function** $V$ from which $h$ is derived as the solution to an optimization problem, and then re-solve that problem under the counterfactual environment $f_1$.
90+
91+
---
92+
93+
## An Optimization Problem
94+
95+
Assume the agent selects $h$ to maximize the expected discounted sum of current-period returns $V : S_1 \times S_2 \times U \to \mathbb{R}$:
96+
97+
```{math}
98+
:label: eq:objective
99+
E_0\!\left\{\sum_{t=0}^{\infty} \beta^t\, V(z_t,\, x_t,\, u_t)\right\}, \qquad 0 < \beta < 1,
100+
```
101+
102+
given initial conditions $(z_0, x_0)$, the environment $f$, and the technology $g$. Here $E_0\{\cdot\}$ denotes expectation conditional on $(z_0, x_0)$ with respect to the distribution of $\{z_1, z_2, \ldots\}$ induced by {eq}`eq:z_transition`.
103+
104+
In principle, knowledge of $V$ (together with $g$ and $f$) allows one to compute $h = T(f)$ theoretically and hence to trace out $T(f)$ for any counterfactual $f$. The empirical question is whether $V$ can itself be recovered from observations on $\{f, g, h\}$—a problem of structural identification that, at this level of generality, is formidably difficult.
105+
106+
:::{note}
107+
The decision rule is in general a functional $h = T(f, g, V)$. The dependence on $g$ and $V$ is suppressed in the main text but made explicit when needed.
108+
:::
109+
110+
---
111+
112+
## A Linear-Quadratic Specialization and Certainty Equivalence
113+
114+
Progress at the level of generality of Section 4 requires restricting the primitives. The most productive restriction, exploited in the bulk of the volume, imposes **quadratic** $V$ and **linear** $g$, which forces $h$ to be linear. Beyond computational tractability, this specialization delivers a striking structural result: the **certainty equivalence** theorem of Simon {cite}`simon1956dynamic` and Theil {cite}`theil1957note`.
115+
116+
### The Composite Decomposition of $h$
117+
118+
Under quadratic $V$ and linear $g$, the optimal decision rule $h$ decomposes into two components applied in sequence.
119+
120+
**Step 1 — Forecasting.** Define the infinite sequence of optimal point forecasts of all current and future states of nature:
121+
122+
```{math}
123+
:label: eq:forecast_sequence
124+
\tilde{z}_t \;=\; \bigl(z_t,\;\; {}_{t+1}z_t^e,\;\; {}_{t+2}z_t^e,\;\ldots\bigr) \;\in\; S_1^\infty,
125+
```
126+
127+
where ${}_{t+j}z_t^e$ denotes the least-mean-squared-error forecast of $z_{t+j}$ formed at time $t$. The optimal forecast sequence is a (generally nonlinear) function of the current state:
128+
129+
```{math}
130+
:label: eq:forecast_rule
131+
\tilde{z}_t = h_2(z_t).
132+
```
133+
134+
The function $h_2 : S_1 \to S_1^\infty$ depends entirely on the environment $(f, \Phi)$ and is obtained as the solution to a **pure forecasting problem**, with no reference to preferences or technology.
135+
136+
**Step 2 — Optimization.** Given the forecast sequence $\tilde{z}_t$, the optimal action is a **linear** function of $\tilde{z}_t$ and $x_t$:
137+
138+
```{math}
139+
:label: eq:optimization_rule
140+
u_t = h_1(\tilde{z}_t,\, x_t).
141+
```
142+
143+
The function $h_1 : S_1^\infty \times S_2 \to U$ depends entirely on preferences $(V)$ and technology $(g)$ but **not** on the stochastic environment $(f, \Phi)$.
144+
145+
The full decision rule is therefore the **composite**:
146+
147+
```{math}
148+
:label: eq:composite_rule
149+
\boxed{h(z_t, x_t) \;=\; h_1\!\bigl[h_2(z_t),\; x_t\bigr].}
150+
```
151+
152+
### The Separation Principle
153+
154+
{eq}`eq:composite_rule` embodies a clean **separation** of the two sources of dependence in $h$:
155+
156+
| Component | Depends on | Independent of |
157+
|-----------|-----------|----------------|
158+
| $h_1$ (optimization) | $V$, $g$ | $f$, $\Phi$ |
159+
| $h_2$ (forecasting) | $f$, $\Phi$ | $V$, $g$ |
160+
161+
Since policy analysis concerns changes in $f$, and since $h_1$ is invariant to $f$, the policy analyst need only re-solve the forecasting problem $h_2 = S(f)$ under the new environment, keeping $h_1$ fixed. The relationship of original interest, $h = T(f)$, then follows directly from {eq}`eq:composite_rule`.
162+
163+
### Certainty Equivalence and Perfect Foresight
164+
165+
The name "certainty equivalence" reflects a further implication of the LQ structure: the function $h_1$ can be derived as if the agent **knew the future path $z_{t+1}, z_{t+2}, \ldots$ with certainty** — i.e., by solving the deterministic problem in which $\tilde{z}_t$ is treated as the realized path rather than a forecast. The stochasticity of the environment affects actions only through the forecast $\tilde{z}_t$; conditional on $\tilde{z}_t$, the optimization problem is deterministic.
166+
167+
This means the LQ problem decouples into:
168+
169+
* **Dynamic optimization under perfect foresight** — solve for $h_1$ from $(V, g)$ by treating $\tilde{z}_t$ as known. This is a standard deterministic LQ regulator problem and is independent of the environment $(f, \Phi)$.
170+
171+
* **Optimal linear prediction** — solve for $h_2 = S(f)$ from $(f, \Phi)$ using least-squares forecasting theory. If $f$ is itself linear, $h_2$ is also linear and reduces to a standard Kalman/Wiener prediction formula.
172+
173+
### Cross-Equation Restrictions
174+
175+
A hallmark of the rational expectations hypothesis as it appears in this framework is that it ties together what would otherwise be free parameters in different equations. The requirement that $\tilde{z}_t = h_2(z_t) = S(f)(z_t)$ — i.e., that agents' forecasts be *optimal* with respect to the *actual* law of motion $f$ — imposes **cross-equation restrictions** between the parameters of the forecasting rule $h_2$ and the parameters of the environment $f$. These restrictions, rather than any conditions on distributed lags within a single equation, are the operative empirical content of rational expectations.
176+
177+
---
178+
179+
## A Trouble with Ad Hoc Expectations
180+
181+
Prior practice, exemplified by the adaptive expectations mechanisms of Friedman {cite}`Friedman1956` and Cagan {cite}`Cagan`, directly postulated a particular form of {eq}`eq:forecast_rule`:
182+
183+
```{math}
184+
:label: eq:adaptive_expectations
185+
\theta_t^e = \lambda \sum_{i=0}^{\infty} (1-\lambda)^i\, \theta_{t-i}, \qquad 0 < \lambda < 1,
186+
```
187+
188+
treating the coefficient $\lambda$ as a free parameter to be estimated from data, with no reference to the underlying environment $f$.
189+
190+
The deficiency is not that {eq}`eq:adaptive_expectations` is a distributed lag — linear forecasting rules are perfectly acceptable simplifications. The deficiency is that the **coefficients** of the distributed lag are left unrestricted by theory. The mapping $h_2 = S(f)$ shows that optimal forecasting coefficients are *determined* by $f$: when $f$ changes, $h_2$ changes, and so does $h$. An estimated $\lambda$ calibrated under $f_0$ is therefore non-structural and will give incorrect predictions whenever $f$ is altered. This is the econometric content of the critique that Muth's paper delivers.
191+
192+
Rational expectations equates the subjective distribution that agents use in forming $\tilde{z}_t$ to the objective distribution $f$ that actually generates the data, thereby closing the model and eliminating free parameters in $h_2$.

0 commit comments

Comments
 (0)