|
| 1 | +--- |
| 2 | +jupytext: |
| 3 | + text_representation: |
| 4 | + extension: .md |
| 5 | + format_name: myst |
| 6 | +kernelspec: |
| 7 | + display_name: Python 3 |
| 8 | + language: python |
| 9 | + name: python3 |
| 10 | +--- |
| 11 | + |
| 12 | +(certainty_equiv_robustness)= |
| 13 | +```{raw} jupyter |
| 14 | +<div id="qe-notebook-header" align="right" style="text-align:right;"> |
| 15 | + <a href="https://quantecon.org/" title="quantecon.org"> |
| 16 | + <img style="width:250px;display:inline;" width="250px" src="https://assets.quantecon.org/img/qe-menubar-logo.svg" alt="QuantEcon"> |
| 17 | + </a> |
| 18 | +</div> |
| 19 | +``` |
| 20 | + |
| 21 | +# Certainty Equivalence |
| 22 | + |
| 23 | +```{index} single: Certainty Equivalence; Robustness |
| 24 | +``` |
| 25 | + |
| 26 | +```{index} single: LQ Control; Permanent Income |
| 27 | +``` |
| 28 | + |
| 29 | +```{contents} Contents |
| 30 | +:depth: 2 |
| 31 | +``` |
| 32 | + |
| 33 | + |
| 34 | +In addition to what's in Anaconda, this lecture will need the following libraries: |
| 35 | + |
| 36 | +```{code-cell} ipython3 |
| 37 | +--- |
| 38 | +tags: [hide-output] |
| 39 | +--- |
| 40 | +!pip install quantecon |
| 41 | +``` |
| 42 | + |
| 43 | + |
| 44 | +## The Central Problem of Empirical Economics |
| 45 | + |
| 46 | +The papers collected in {cite}`lucas1981rational` address a single overarching question: given observations on an agent's behavior in a particular economic environment, what can we infer about how that behavior **would have differed** had the environment been altered? This is the problem of policy-invariant structural inference. |
| 47 | + |
| 48 | +The difficulty is immediate. Observations arise under one environment; we wish to predict behavior under another. Unless we understand *why* the agent behaves as he does—that is, unless we recover the deep objectives that rationalize observed decisions—estimated behavioral relationships are silent on this question. |
| 49 | + |
| 50 | +--- |
| 51 | + |
| 52 | +## A Formal Setup |
| 53 | + |
| 54 | +Consider a single decision maker whose situation at date $t$ is fully described by two state variables $(x_t, z_t)$. |
| 55 | + |
| 56 | +**The environment** $z_t \in S_1$ is selected by "nature" and evolves exogenously according to |
| 57 | + |
| 58 | +```{math} |
| 59 | +:label: eq:z_transition |
| 60 | +z_{t+1} = f(z_t,\, \epsilon_t), |
| 61 | +``` |
| 62 | + |
| 63 | +where the innovations $\epsilon_t \in \mathcal{E}$ are i.i.d. draws from a fixed c.d.f. $\Phi(\cdot) : \mathcal{E} \to [0,1]$. The function $f : S_1 \times \mathcal{E} \to S_1$ is called the **decision maker's environment**. |
| 64 | + |
| 65 | +**The endogenous state** $x_t \in S_2$ is under partial control of the agent. Each period the agent selects an action $u_t \in U$. A fixed technology $g : S_1 \times S_2 \times U \to S_2$ governs the transition |
| 66 | + |
| 67 | +```{math} |
| 68 | +:label: eq:x_transition |
| 69 | +x_{t+1} = g(z_t,\, x_t,\, u_t). |
| 70 | +``` |
| 71 | + |
| 72 | +**The decision rule** $h : S_1 \times S_2 \to U$ maps the agent's current situation into an action: |
| 73 | + |
| 74 | +```{math} |
| 75 | +:label: eq:decision_rule |
| 76 | +u_t = h(z_t,\, x_t). |
| 77 | +``` |
| 78 | + |
| 79 | +The econometrician observes (some or all of) the process $\{z_t, x_t, u_t\}$, the joint motion of which is determined by {eq}`eq:z_transition`, {eq}`eq:x_transition`, and {eq}`eq:decision_rule`. |
| 80 | + |
| 81 | +--- |
| 82 | + |
| 83 | +## The Lucas Critique: Why Estimated Rules Are Not Enough |
| 84 | + |
| 85 | +Suppose we have estimated $f$, $g$, and $h$ from a long time series generated under a fixed environment $f_0$. This gives us $h_0 = T(f_0)$, where $T$ is the (unknown) functional mapping environments into optimal decision rules. But this single estimate, however precise, **reveals nothing** about how $T(f)$ varies with $f$. |
| 86 | + |
| 87 | +Policy evaluation requires knowledge of the entire map $f \mapsto T(f)$. Under an environment change $f_0 \to f_1$, agents will in general revise their decision rules $h_0 \to h_1 = T(f_1)$, rendering the estimated rule $h_0$ invalid for forecasting behavior under $f_1$. |
| 88 | + |
| 89 | +The only nonexperimental path forward is to recover the **return function** $V$ from which $h$ is derived as the solution to an optimization problem, and then re-solve that problem under the counterfactual environment $f_1$. |
| 90 | + |
| 91 | +--- |
| 92 | + |
| 93 | +## An Optimization Problem |
| 94 | + |
| 95 | +Assume the agent selects $h$ to maximize the expected discounted sum of current-period returns $V : S_1 \times S_2 \times U \to \mathbb{R}$: |
| 96 | + |
| 97 | +```{math} |
| 98 | +:label: eq:objective |
| 99 | +E_0\!\left\{\sum_{t=0}^{\infty} \beta^t\, V(z_t,\, x_t,\, u_t)\right\}, \qquad 0 < \beta < 1, |
| 100 | +``` |
| 101 | + |
| 102 | +given initial conditions $(z_0, x_0)$, the environment $f$, and the technology $g$. Here $E_0\{\cdot\}$ denotes expectation conditional on $(z_0, x_0)$ with respect to the distribution of $\{z_1, z_2, \ldots\}$ induced by {eq}`eq:z_transition`. |
| 103 | + |
| 104 | +In principle, knowledge of $V$ (together with $g$ and $f$) allows one to compute $h = T(f)$ theoretically and hence to trace out $T(f)$ for any counterfactual $f$. The empirical question is whether $V$ can itself be recovered from observations on $\{f, g, h\}$—a problem of structural identification that, at this level of generality, is formidably difficult. |
| 105 | + |
| 106 | +:::{note} |
| 107 | +The decision rule is in general a functional $h = T(f, g, V)$. The dependence on $g$ and $V$ is suppressed in the main text but made explicit when needed. |
| 108 | +::: |
| 109 | + |
| 110 | +--- |
| 111 | + |
| 112 | +## A Linear-Quadratic Specialization and Certainty Equivalence |
| 113 | + |
| 114 | +Progress at the level of generality of Section 4 requires restricting the primitives. The most productive restriction, exploited in the bulk of the volume, imposes **quadratic** $V$ and **linear** $g$, which forces $h$ to be linear. Beyond computational tractability, this specialization delivers a striking structural result: the **certainty equivalence** theorem of Simon {cite}`simon1956dynamic` and Theil {cite}`theil1957note`. |
| 115 | + |
| 116 | +### The Composite Decomposition of $h$ |
| 117 | + |
| 118 | +Under quadratic $V$ and linear $g$, the optimal decision rule $h$ decomposes into two components applied in sequence. |
| 119 | + |
| 120 | +**Step 1 — Forecasting.** Define the infinite sequence of optimal point forecasts of all current and future states of nature: |
| 121 | + |
| 122 | +```{math} |
| 123 | +:label: eq:forecast_sequence |
| 124 | +\tilde{z}_t \;=\; \bigl(z_t,\;\; {}_{t+1}z_t^e,\;\; {}_{t+2}z_t^e,\;\ldots\bigr) \;\in\; S_1^\infty, |
| 125 | +``` |
| 126 | + |
| 127 | +where ${}_{t+j}z_t^e$ denotes the least-mean-squared-error forecast of $z_{t+j}$ formed at time $t$. The optimal forecast sequence is a (generally nonlinear) function of the current state: |
| 128 | + |
| 129 | +```{math} |
| 130 | +:label: eq:forecast_rule |
| 131 | +\tilde{z}_t = h_2(z_t). |
| 132 | +``` |
| 133 | + |
| 134 | +The function $h_2 : S_1 \to S_1^\infty$ depends entirely on the environment $(f, \Phi)$ and is obtained as the solution to a **pure forecasting problem**, with no reference to preferences or technology. |
| 135 | + |
| 136 | +**Step 2 — Optimization.** Given the forecast sequence $\tilde{z}_t$, the optimal action is a **linear** function of $\tilde{z}_t$ and $x_t$: |
| 137 | + |
| 138 | +```{math} |
| 139 | +:label: eq:optimization_rule |
| 140 | +u_t = h_1(\tilde{z}_t,\, x_t). |
| 141 | +``` |
| 142 | + |
| 143 | +The function $h_1 : S_1^\infty \times S_2 \to U$ depends entirely on preferences $(V)$ and technology $(g)$ but **not** on the stochastic environment $(f, \Phi)$. |
| 144 | + |
| 145 | +The full decision rule is therefore the **composite**: |
| 146 | + |
| 147 | +```{math} |
| 148 | +:label: eq:composite_rule |
| 149 | +\boxed{h(z_t, x_t) \;=\; h_1\!\bigl[h_2(z_t),\; x_t\bigr].} |
| 150 | +``` |
| 151 | + |
| 152 | +### The Separation Principle |
| 153 | + |
| 154 | +{eq}`eq:composite_rule` embodies a clean **separation** of the two sources of dependence in $h$: |
| 155 | + |
| 156 | +| Component | Depends on | Independent of | |
| 157 | +|-----------|-----------|----------------| |
| 158 | +| $h_1$ (optimization) | $V$, $g$ | $f$, $\Phi$ | |
| 159 | +| $h_2$ (forecasting) | $f$, $\Phi$ | $V$, $g$ | |
| 160 | + |
| 161 | +Since policy analysis concerns changes in $f$, and since $h_1$ is invariant to $f$, the policy analyst need only re-solve the forecasting problem $h_2 = S(f)$ under the new environment, keeping $h_1$ fixed. The relationship of original interest, $h = T(f)$, then follows directly from {eq}`eq:composite_rule`. |
| 162 | + |
| 163 | +### Certainty Equivalence and Perfect Foresight |
| 164 | + |
| 165 | +The name "certainty equivalence" reflects a further implication of the LQ structure: the function $h_1$ can be derived as if the agent **knew the future path $z_{t+1}, z_{t+2}, \ldots$ with certainty** — i.e., by solving the deterministic problem in which $\tilde{z}_t$ is treated as the realized path rather than a forecast. The stochasticity of the environment affects actions only through the forecast $\tilde{z}_t$; conditional on $\tilde{z}_t$, the optimization problem is deterministic. |
| 166 | + |
| 167 | +This means the LQ problem decouples into: |
| 168 | + |
| 169 | + * **Dynamic optimization under perfect foresight** — solve for $h_1$ from $(V, g)$ by treating $\tilde{z}_t$ as known. This is a standard deterministic LQ regulator problem and is independent of the environment $(f, \Phi)$. |
| 170 | + |
| 171 | + * **Optimal linear prediction** — solve for $h_2 = S(f)$ from $(f, \Phi)$ using least-squares forecasting theory. If $f$ is itself linear, $h_2$ is also linear and reduces to a standard Kalman/Wiener prediction formula. |
| 172 | + |
| 173 | +### Cross-Equation Restrictions |
| 174 | + |
| 175 | +A hallmark of the rational expectations hypothesis as it appears in this framework is that it ties together what would otherwise be free parameters in different equations. The requirement that $\tilde{z}_t = h_2(z_t) = S(f)(z_t)$ — i.e., that agents' forecasts be *optimal* with respect to the *actual* law of motion $f$ — imposes **cross-equation restrictions** between the parameters of the forecasting rule $h_2$ and the parameters of the environment $f$. These restrictions, rather than any conditions on distributed lags within a single equation, are the operative empirical content of rational expectations. |
| 176 | + |
| 177 | +--- |
| 178 | + |
| 179 | +## A Trouble with Ad Hoc Expectations |
| 180 | + |
| 181 | +Prior practice, exemplified by the adaptive expectations mechanisms of Friedman {cite}`Friedman1956` and Cagan {cite}`Cagan`, directly postulated a particular form of {eq}`eq:forecast_rule`: |
| 182 | + |
| 183 | +```{math} |
| 184 | +:label: eq:adaptive_expectations |
| 185 | +\theta_t^e = \lambda \sum_{i=0}^{\infty} (1-\lambda)^i\, \theta_{t-i}, \qquad 0 < \lambda < 1, |
| 186 | +``` |
| 187 | + |
| 188 | +treating the coefficient $\lambda$ as a free parameter to be estimated from data, with no reference to the underlying environment $f$. |
| 189 | + |
| 190 | +The deficiency is not that {eq}`eq:adaptive_expectations` is a distributed lag — linear forecasting rules are perfectly acceptable simplifications. The deficiency is that the **coefficients** of the distributed lag are left unrestricted by theory. The mapping $h_2 = S(f)$ shows that optimal forecasting coefficients are *determined* by $f$: when $f$ changes, $h_2$ changes, and so does $h$. An estimated $\lambda$ calibrated under $f_0$ is therefore non-structural and will give incorrect predictions whenever $f$ is altered. This is the econometric content of the critique that Muth's paper delivers. |
| 191 | + |
| 192 | +Rational expectations equates the subjective distribution that agents use in forming $\tilde{z}_t$ to the objective distribution $f$ that actually generates the data, thereby closing the model and eliminating free parameters in $h_2$. |
0 commit comments