A little financial-econometric history

The issues that have cropped up in applying present value ideas to government finance, in my last post, caused me to write up a little financial-econometric history, which seems worth passing on to blog readers. The lessons of the 1980s and 1990s are fading with time, and we should avoid having to re-learn such hard-won lessons. (Warning: this post uses mathjax to display equations.)

Faced with a present value relation, say \[ p_{t}=E_{t}\sum_{j=1}^{\infty}\beta^{j}d_{t+j}, \] what could be more natural than to model dividends, say as an AR(1), \[ d_{t+1}=\rho_{d}d_{t}+\varepsilon_{t+1}, \] to calculate the model-implied price, \[ E_{t}\sum_{j=1}^{\infty}\beta^{j}d_{t+j}=\frac{\beta\rho_{d}}{1-\beta\rho_{d} }d_{t}, \] and to compare the result to \(p_{t}\)? The result is a disaster -- prices do not move one for one with dividends, and they move all over the place with no discernible movement in expected dividends.

More generally, forecast dividends with any VAR that doesn't include prices, or use analyst or survey dividend forecasts. Discount back the forecasts, and you get nothing like the current price. Tests of the permanent income hypothesis based on AR(1) or VAR models for income showed the same failures.

These sorts of tests looked like failures of the basic present value relation. At the time, it seemed that markets were pretty efficient based on one-period returns, and likewise consumption growth isn't that predictable. But prices way far form present values seems to say markets are nuts. Similarly, consumption so far from VAR forecasts of permanent income suggested consumers face all sorts of constraints.

With the advantages of hindsight we see three crucial mistakes. 1) Prices and dividends are not stationary. That is quickly repaired by transforming to price-dividend ratios and dividend growth rates. 2) Discount rates are not constant. We'll quickly add time-varying discount rates, which (spoiler) becomes the bottom line focus of the whole debate. My focus today, 3) People in the economy have more information than we do.

Of the many lessons of 1980s financial and macroeconometrics, one of the most central is this: your test should allow people in the economy to have information we don't include in our forecasts. Too many tests still fail this test of tests.

To be clear, as illustrative exercises and models, there is nothing wrong with these calculations. They are really simple general equilibrium models. Such models are very useful for generating patterns reminiscent of those in the data and illustrating mechanisms. But they are easily falsifiable as tests. They typically contain 100% \(R^{2}\) predictions, as do my examples.

Leaving price out of the VAR really does count as a mistake. The true valuation equation is \[ p_{t}=E\left( \left. \sum_{j=0}^{\infty}\frac{u^{\prime}(c_{t+j})} {u^{\prime}(c_{t})}d_{t+j}\right\vert \Omega_{t}\right) \] where \(\Omega_{t}\) denotes the agents' information set. This relationship conditions down to the VAR information set \(x_{t}\) \[ p_{t}=E\left( \left. \sum_{j=0}^{\infty}\frac{u^{\prime}(c_{t+j})} {u^{\prime}(c_{t})}d_{t+j}\right\vert x_{t}\right) \] only if the VAR contains the price \(p_{t} \in x_{t}\), or if agents' information is the same as the VAR \(\Omega_{t}=x_{t}\). (I'm ignoring nonlinearities. This is a blog post.)

Finance responded. First came Shiller's (and LeRoy and Porter's) volatility tests. The present value equation implies \[ var\left( p_{t}\right) \leq var\left( \sum_{j=1}^{\infty}\beta^{j} d_{t+j}\right) . \] This implication is robust to agents that have more information. \(var\left[ E(x|\Omega)\right] \leq var(x).\) It holds even if people know dividends ex ante. And it's a bloody disaster too -- prices are far more volatile, as Shiller's famous plot dramatized. But this test still suffers from nonstationary prices -- the variance is infinite -- and no time-varying expected returns.

The resolution of all of these issues came with Campbell and Shiller'sanalysis. (And a little of mine. Summary in "Discount rates.") We start with a linearization of the one-period return, \[ r_{t+1}=\rho pd_{t+1}-pd_{t}+\Delta d_{t+1}. \] where \(r\) is log return, \(pd\) is log price/dividend ratio, \(\Delta d\) is log dividend growth, and \(\rho\) is a constant of linearization a bit less than one. Iterate forward and take expectations, to a present value relation \[ pd_{t}=E_{t}\sum_{j=1}^{\infty}\rho^{j-1}\left( \Delta d_{t+j}-r_{t+j} \right) . \] Problem 1 is solved -- this is a relationship among stationary variables. Problem 2 is solved -- we allow time-varying expected returns. Now, make a VAR forecast of the right hand side including the pd ratio in the VAR. Let's not repeat that mistake. Compute the right hand side and...You get an identity, \(pd_{t}=pd_{t}\).

How do we now test present value relations? The answer is, we don't. You can't test present value relations per se.

What happened? Write the VAR \[ x_{t+1}=Ax_{t}+\varepsilon_{t+1}. \] and use \(a\) for selector matrices, \(r_{t}=a_{r}^{\prime}x_{t}\), etc. The test is then to compare \(pd_{t}=a_{pd}^{\prime}x_{t}\) with the expectation, i.e. to see if \[ a_{pd}^{\prime}=(?)\ (a_{d}^{\prime}-a_{r}^{\prime})(I-\rho A)^{-1}A \] applied to any \(x_{t}\). But look at the definition of return. Taking its expected value, it says \[ \left( a_{r}^{\prime}-a_{d}^{\prime}\right) A=-a_{pd}^{\prime}(I-\rho A). \] so long as \((I-\rho A)\) is invertible -- eigenvalues of \(A\) less than \(\rho^{-1}\) -- the present value "test" just reiterates the return identity. You recover \(pd_{t}=pd_{t}\) exactly. Once we allow time-varying expected returns, there is no separate present value identity to test.

Campbell and Shiller are far from vacuous! We use present value identities to measure whether prices move in ways that correspond to dividend forecasts or return forecasts, and the nature and timing of those forecasts. The finding that most of the action is in the returns is deeply important. But we abandon the idea that we are going to test the present value relation -- or that any such test is more than a test of restrictions on the expected return process. There's plenty to argue about there, but that's all there is to argue about any more.

The Campbell-Shiller identity also allows us to put to rest another 1980s puzzle. Volatility tests seemed like something new and different. Sure, returns aren't really predictable but prices are way too volatile to be "rational." But multiply by \(pd_{t} -E(pd_{t})\) and take expectations, and you get \[ var\left( pd_{t}\right) =cov\left( pd_{t},\sum_{j=1}^{\infty}\rho ^{j-1}\Delta d_{t+j}\right) -cov\left( pd_{t},\sum_{j=1}^{\infty}\rho ^{j-1}r_{t+j}\right) \] \[ 1=\beta\left( \sum_{j=1}^{\infty}\rho^{j-1}\Delta d_{t+j}, pd_t\right) -\beta\left(\sum_{j=1}^{\infty}\rho^{j-1}r_{t+j}, pd_t\right) \] where \(\beta(y,x)\) is the regression coefficient of \(y\) on \(x\). Volatility tests are the same thing as long-run forecasting regressions.

So, asset pricing has come full circle, really. In the 1960s, it seemed that one could test market efficiency by trying to forecast returns. The discount factor existence theorems removed that hope. (I have in mind the "joint hypothesis" theorem of Fama's Efficient Markets Review, the Roll Critique, and of course Harrison and Kreps.) All there is to argue about is whether risk premiums make sense. The volatility tests and present value calculations looked like another way to cleanly test efficiency. Sure, return forecasts are mired in joint hypothesis / time-varying discount rate problems, but we can see that present values are nuts. In retrospect, present values per se add nothing to the argument. There is one and only one argument -- whether the large, time-varying, business-cycle related, long-horizon expected returns we see are "correctly" connected to the economy, or whether those discount rates reflect institutional frictions (institutional finance) or nutty investors (behavioral finance). That's plenty interesting, but that's all there is.

More generally, I think we have all learned (or should have learned) that it is a bad idea to try to test whole classes of theories. All theories rely on auxiliary assumptions. All we can do is to understand and evaluate those auxiliary assumptions.

Why write up this ancient history? Well, it might be useful perspective for asset pricing PhD students to understand how we got to where we are all these years ago, and perhaps to avoid some of the obvious temptations to make past mistakes.

More to the point, the study of government debt is in danger of forgetting this difficult and contentious knowledge and re-fighting old battles. We also look at a present value relation, the value of government debt equals the present value of real primary surpluses. \[ \frac{B_{t-1}}{P_{t}}=b_{t}=E_{t}\sum_{j=0}^{\infty}\frac{\Lambda_{t+j} }{\Lambda_{t}}s_{t+j}, \] where \( \Lambda_t \) is a discount factor.

What could be more natural than to make a VAR forecast of surpluses, add a discount factor model, and calculate what the value of debt should be? If the VAR does not include the value of debt, and if the discount factor model does not replicate bond returns, the answer comes out far from the value of debt. This is the Jiang, Lustig, VanNieuwerburgh and Xiaolan "puzzle." (I don't mean to pick on them. This procedure -- and its attendant fallacies, viewed through asset-pricing 20/20 hindsight glasses -- pervades the empirical literature. Reading this paper and corresponding with them just brought these issues to the fore and helped me to clarify them.)

If the VAR does include the value of government debt, and if you discount at observed bond returns, you get an identity. You can't test the present value relation, but you can measure the relative importance of discount rates and surpluses/deficits in accounting for the value of debt. That's what I do in "the fiscal roots of inflation," also summarized in the fiscal theory of the price level. As in the asset pricing context, this measurement says discount rate movements move to center stage, which is interesting. It's not a sexy "test," or "puzzle," but at least it's right.

To be specific, the one-period linearized government debt identity is \[ \rho v_{t+1}=v_{t}+r_{t+1}^{n}-\pi_{t+1}-g_{t+1}-s_{t+1} \] where \(v\) = log debt/GDP, \(r^{n}\)= nominal government bond return, \(\pi=\) inflation, \(g=\) GDP growth and \(s=\) surplus /GDP ratio scaled by steady state debt/GDP and \(\rho=e^{-(r-g)}\). Iterating forward and taking expectations, \[ v_{t}=E_{t}\sum_{j=0}^{\infty}\rho^{j-1}\left[ s_{t+1+j}-\left( r_{t+1+j}^{n}-\pi_{t+1+j}\right) +g_{t+1+j}\right] . \] Now, if you run a VAR that includes \(v_{t}\) to forecast the variables on the right hand side including returns, if you then calculate the VAR based expected present value, you recover \(v_{t}=v_{t}\) exactly. The VAR forecast produces exactly the observed value of debt.

To be specific, the one-period government debt identity implies that the VAR coefficients must satisfy \[ (I-\rho A)a_{v}^{\prime}=\left( -a_{r^{n}}^{\prime}+a_{\pi}^{\prime} +a_{g}^{\prime}+a_{s}^{\prime}\right) A \] These are not restrictions we need to impose. Since the data, if properly constructed, must obey the identity, the estimated parameters will automatically obey this restriction.

Now, let us try to test the present value relation. We compute the terms on the right hand side from the VAR as \[ \left( a_{s}^{\prime}+a_{g}^{\prime}-a_{r^{n}}^{\prime}+a_{\pi}^{\prime }\right) \left( I-\rho A\right) ^{-1}Ax_{t}. \] so the present value holds if \[ a_{v}^{\prime}\overset{?}{=}\left( a_{s}^{\prime}+a_{g}^{\prime}-a_{r^{n} }^{\prime}+a_{\pi}^{\prime}\right) \left( I-\rho A\right) ^{-1}A. \] So long as the variables are stationary, this restriction is identical to the restriction coming from the one-period identity. The constructed present value of surpluses comes out to be each day's value of debt, exactly, and by construction. We're looking at a tautology, not a test.

With this background, how can Jiang et. al. report anything but \(v_{t}=v_{t}\) with debt in the VAR? The only way is that their discount factor model disagrees with the VAR forecast of bond returns. We're back to arguing about discount factors, where we are cursed to remain.

A caveat: I summarize here what I see as the consensus of a literature in its current state. The existence of an infinite period present value formula does not yet have the simple elegance of the theorems on existence of finite period present value formulas, at least in my understanding. In part, my comments reflect here the general loss of interest in the "rational bubble" or violation of the transversality condition as a practical alternative. A rational bubble term, a nonzero value of the last term in \[ \frac{B_{t-1}}{P_{t}}=E_{t}\sum_{t=0}^{\infty}\frac{1}{R^{j}}s_{t+j} +\lim_{T\rightarrow\infty}\left( \frac{1}{R^{T}}\frac{B_{t-1+T}}{P_{t+T} }\right) \] for example, implies that the value of debt has a greater-than unit root. One can argue some more about a greater than unit root in the debt to GDP ratio (and price-dividend ratio). apply unit root tests, with predictable results.

However, there is resurgent interest in bubble terms, and present value sums that don't converge, and consequent government debt that never needs to be repaid, so maybe the future will improve on these lessons. (Notably, see Olivier Blanchard and Markus Brunnermeier, Sebastian Merkel and Yuliy Sannikov in the context of government debt.) But these are questions for the future, not a reminder of simple fallacies we learned at great pain to avoid in the past.

from The Grumpy Economist https://ift.tt/31ORkEg

Monday, 6 July 2020

A little financial-econometric history

Artikel Terkait A little financial-econometric history :

0 comments:

Post a Comment

Entri Populer

Labels

Blog Archive