Add Journal to My Library
The Review of Financial Studies
, Volume Advance Article – Feb 8, 2018

45 pages

/lp/ou_press/news-shocks-and-the-production-based-term-structure-of-equity-returns-uTVJnreDqS

- Publisher
- Oxford University Press
- Copyright
- © The Author(s) 2018. Published by Oxford University Press on behalf of The Society for Financial Studies. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
- ISSN
- 0893-9454
- eISSN
- 1465-7368
- D.O.I.
- 10.1093/rfs/hhy015
- Publisher site
- See Article on Publisher Site

Abstract We propose a production-based general equilibrium model to study the link between timing of cash flows and expected returns, both in the cross-section of stocks and along the aggregate equity term structure. Our model incorporates long-run growth news with time-varying volatility and slow learning about the exposure that firms have with respect to these shocks. Our framework provides a unified explanation of the stylized features of the slope of the term structure of equity returns, its variations over the business cycle, and the negative relationship between cash-flow duration and expected returns in the cross-section of book-to-market-sorted portfolios. Received May 27, 2017; editorial decision October 12, 2017 by Editor Itay Goldstein. The link between the timing of equity cash flows and equity expected returns has been studied, both in the cross-section of stocks and in the aggregate. Working with aggregate cash-flow strips, Binsbergen, Brandt, and Koijen (2012), Binsbergen et al. (2013), and Binsbergen and Koijen (2016, 2017) document several stylized facts on the term structure of equity returns, that is, the relationship between the return on claims to aggregate dividend strips and their maturity. First, the slope of the term structure varies substantially over time and is significantly negative in the Great Recession. Second, the returns on short-term dividend claims have higher volatility but lower market beta than an index on aggregate dividends. Third, the capital asset pricing model (CAPM) $$\beta$$s of claims to aggregate dividends are countercyclical and this time variation of $$\beta$$s decreases with maturity. In addition, the literature on the cross section of expected returns documents a negative relationship between cash-flow duration and expected returns in book-to-market-sorted portfolios (Da 2009; Dechow, Sloan, and Soliman 2004). In this article, we propose a novel production-based model to provide a unified explanation of the relationship between the timing of cash flows and their expected returns both for the aggregate stock market dividends and for the cross-section of book-to-market-sorted portfolios. We construct a vintage capital model in which individual firms have imperfect information about their productivity and have to learn about it over time. In this setting, the endogenous response of firms’ investment and payout to news about future productivity can explain the relationship between cash-flow duration and risk premiums documented in the above literature. While the term structure of real interest rates is determined by the properties of the stochastic discount factor alone, the term structure of equity returns depends on the dynamics of both the stochastic discount factor and that of the cash-flow process. In endowment economy models (see, among others, Lettau and Wachter 2007, 2011; Santos and Veronesi 2010), dividends are exogenously specified. On the other hand, in investment-based partial equilibrium models (e.g., those of Lin and Zhang 2013, Liu, Whited, and Zhang 2009, Zhang 2005) the stochastic discount factor is often taken as given. Therefore, the empirical evidence on the term structure of equity returns does not provide a direct discipline on the aforementioned models. However, the negative relationship between dividend maturity and expected returns during recessions does provide a litmus test of general equilibrium production models in which both payouts and the pricing kernel are simultaneously and endogenously determined. We start by showing that the equity term structure evidence constitutes a challenge in a large class of neoclassical growth models (henceforth RBC). In a setting with production, the total payout to investors is given by \begin{equation*} \text{Payout}=\text{(1 - Labor Share)}\times \text{Output} - \text{Cost of Investment}. \end{equation*} In the data, the volatility of labor share is fairly small, about $$2\%$$ per year (Choi and Rois-Rull 2009), and is therefore calibrated to be constant in most RBC models. However, RBC models produce an investment process that is highly volatile and procyclical with respect to contemporaneous productivity shocks. As a result, over the short horizon, investment acts like a hedge and the total payout is countercyclical. This endogenous correlation structure implies that short-maturity dividend strips command a negative risk premium and that the term structure of equity returns is unambiguously upward sloping along the cycle. Both implications are inconsistent with the empirical evidence on equity term structure.1 Our model resolves the above puzzle by building on the long-run risk framework of Bansal and Yaron (2004). In our model, investment responds strongly and positively to contemporaneous productivity shocks, like in the data. However, its reaction to news about future productivity shocks is negative upon impact. As a result, the total payout to the household increases upon the arrival of good long-run news. Therefore, the impulse response to contemporaneous productivity shocks leads to an upward-sloping term structure of dividends, while the response of investment to news shocks provides a mechanism for a high risk premium for short-term dividend strips and a downward-sloping term structure over short maturities that is absent in the RBC model. Guided by the above theoretical insights, we provide novel evidence for the time-varying relative volatility of news shocks and contemporaneous productivity shocks that can account for the time variation in the slope of term structure. We show that the volatility of the persistent component of productivity shocks exhibits substantial variation over time, and it peaks during recessions. When incorporated into our model, our productivity-based volatility factors produce a procyclical term structure slope which turns negative during severe recessions, consistent with the data. In addition, the presence of two risk factors, long-run and short-run productivity risks, allows our model to capture the failure of CAPM in the data. The key feature of our model, that is, that investment negatively responds to news shocks upon impact, is due to a novel learning mechanism. In our economy, firms have heterogeneous exposure to aggregate productivity shocks. Adolescent firms have limited information about their exposure to aggregate shocks but receive noisy signals from which they learn over time. Adolescent firms therefore are less capable of taking advantage of advances in aggregate productivity, and the correlation between their output and aggregate productivity is lower than that of firms with full information. In this setup, upon a positive news shock about aggregate productivity, investment does not immediately increase because the learning mechanism dampens the substitution effect: new investment creates adolescent firms that are less capable of taking advantage of technological progress and hence do not represent appealing investment opportunities. At the same time, most of the existing mature firms have full information, and their productivity is expected to rise in the future because they can take full advantage of the new productivity frontier. The income effect therefore is positive and dominates. As a result, consumption increases but investment falls upon positive news shocks. Over time, as news materializes and productivity eventually goes up, so does investment. Like in Ai, Croce, and Li (2013), we model assets in place as physical capital, and the stock of new business ideas and investment opportunities as intangible capital. This setup allows us to microfound value stocks (stocks with a high book-to-market ratio) as the claim to physical asset-intensive firms and growth stocks (stocks with a low book-to-market ratio) as the equity of intangible capital-intensive firms. Because physical capital and intangible capital are complements, the negative response of physical investment with respect to positive news shocks is associated with drops in the payoff to claims to intangible capital. Equivalently, intangible capital provides insurance against news shocks, and hence it commands a lower risk premium, consistent with the value premium empirical evidence. As a result, our model also accounts for the negative relationship between cash-flow duration and expected returns in the cross-section. Several other papers have proposed alternative economic channels for the downward-sloping term structure of equity returns. In endowment economies, Andries, Eisenbach, and Schmalz (2017) focus on the preference side and propose horizon-dependent risk aversion as an explanation for the term structure of equity risk compensation. Croce, Lettau, and Ludvigson (2015) obtain a downward-sloping term structure in a long-run risk model with limited information and bounded rationality. Hasler and Marfe (2017) present a rare-disaster model with recursive preferences and study their implications on the term structure of interest rates and the term structure of dividends. Belo, Colin-Dufresne, and Goldstein (2015) study the implications of capital structure and corporate payout decisions on the term structure of equity returns. In production economies, Kogan et al. (2017) show that their model with investment-specific shocks is also consistent with the negative slope of the term structure of equity returns.2Favilukis and Lin (2016) and Marfe (2017) produce a downward-sloping term structure of equity returns by means of wage rigidity and a time-varying labor share. Our paper is also related to the literature on the cross-section of equity returns, specifically the value premium. Berk, Green, and Naik (1999), Gomes, Kogan, and Zhang (2003), Carlson, Fisher, and Giammarino (2004), and Cooper (2006) propose equilibrium models of the value premium by explicitly modeling the heterogeneous risk exposure of assets in place and growth options. Zhang (2005) and Gala (2005) focus on models of adjustment cost. Dechow, Sloan, and Soliman (2004) and Da (2009) provide empirical evidence on the difference in cash-flow duration for value versus growth stocks. None of the above-mentioned papers focuses on the variations of the slope of the term structure of dividends over the business cycle or links it to the empirical evidence on the time-varying volatility of productivity news shocks, nor do they study the link between cash-flow duration and expected returns along the aggregate equity term structure and in the cross-section of stocks jointly. The learning mechanism that we emphasize in this paper is related to the literature that studies the impact of learning on asset market valuations. David (1997), Veronesi (2000), and Ai (2010) study how learning and information affect both asset valuations and the risk premium on the equity market. Pastor and Veronesi (2009) present a model in which learning affects the life-cycle dynamics of firms and their exposure to aggregate risks. The implication of their model that young firms are less exposed to aggregate shock than older firms is consistent with ours. 1. Model Setup The key element of our model is that firms learn about their exposure to aggregate productivity over time. In equilibrium, heterogeneity in information translates into heterogeneity in risk exposures. In this section, we first describe a tractable analytical framework that models learning with heterogeneous productivity. We then incorporate learning into a general equilibrium model with production and derive the equilibrium conditions. 1.1 Aggregation with learning We provide aggregation results supporting the key learning mechanism of our model, that is, that when firms are uncertain about their exposure to aggregate productivity shocks, more information allows them to take better advantage of aggregate technological progress, and therefore they feature a high exposure to aggregate shocks. 1.1.1 The static problem We start with a static setup similar to that of Melitz (2003) and Hsieh and Klenow (2009). Consider a group of firms that produce intermediate inputs, $$y_{j}$$, that can be transformed into output $$Y$$ using a CES production function: \begin{equation} Y=\left[ \int \left( y_{j}\right) ^{\frac{\eta -1}{\eta }}dj\right] ^{\frac{ \eta }{\eta -1}}, \end{equation} (1) Firm $$j$$ combines capital and labor to produce output using a Cobb-Douglas production technology, \begin{equation} y_{j}=A_{j}k_{j}^{\alpha }n_{j}^{1-\alpha }. \end{equation} (2) We assume that $$A_{j}=e^{\beta _{j}\Delta a}$$, where $$\Delta a$$ is a common shock that affects the productivity of all firms and $$\beta _{j}$$ is the firm-specific exposure to the common shock $$\Delta a$$. To facilitate a closed-form solution, we assume that conditioning on the common shock $$ \Delta a$$, $$\beta _{j}\sim i.i.d.N(\mu ,\frac{1}{\Delta a}\sigma ^{2})$$. Before making its production decision, firm $$j$$ observes only a noisy signal of its own exposure, $$sig_{j}$$: \begin{equation} sig_{j}=\beta _{j}+\varepsilon _{j}, \end{equation} (3) where $$\varepsilon _{j}\sim i.i.d.N.(0,\frac{1}{\Delta a}\tau ^{2})$$. The signal $$sig_{j}$$ helps firm $$j$$ make more efficient capital and labor choices. The parameter $$\tau ^{2}$$ determines the level of noise in firm signals. When $$\tau =0$$, firms have perfect information about their exposure to the common shock. As $$\tau ^{2}$$ increases, firms are less certain about their exposure to common shocks, and input choices are less efficient. In the extreme case in which $$\tau \rightarrow \infty $$, signals are not informative at all. We define the aggregate production function as \begin{gather} F\left( K,N\right) \equiv \max_{\left\{ k_{j},n_{j}\right\} }\left[ \int \left( A_{j}k_{j}^{\alpha }n_{j}^{1-\alpha }\right) ^{\frac{\eta -1}{\eta } }dj\right] ^{\frac{\eta }{\eta -1}} \\ \end{gather} (4) \begin{gather} subject\ to\int k_{j}=K, \\ \end{gather} (5) \begin{gather} \int n_{j}=N, \end{gather} (6) where for each $$j$$, the choices of $$\left\{ k_{j},n_{j}\right\} $$ must be measurable with respect to firm $$j$$’s information. That is, $$k_{j}$$ and $$ n_{j}$$ can only be functions of the signal $$sig_{j}$$. In Lemma 3 in the appendix, we prove that the optimality of resource allocation implies that the aggregate production can be written as $$Y=\mathbf{A}K^{\alpha }N^{1-\alpha }$$, where$$\ \ \ $$ \begin{equation} \mathbf{A}=\left[ \int \left( E_{s}\left[ A^{\frac{\eta -1}{\eta }}\right] \right) ^{\eta }ds\right] ^{\frac{1}{\eta -1}}. \end{equation} (7) For simplicity, we assume that $$\mu =1-\frac{1}{2}\frac{\left( \eta -1\right) }{\eta }\sigma ^{2}$$ throughout the paper. As we show below, this is a normalization assumption that implies that the exposure of aggregate productivity to $$\Delta a$$ is 1 in the case of no information ($$\tau =\infty $$). The following lemma provides the functional form of the aggregate productivity $$\mathbf{A}$$ and analyze its elasticity with respect to the common shock $$\Delta a$$. Lemma 1. The aggregate production function is given by \begin{equation*} F\left( K,N\right) = \mathbf{A}K^{\alpha }N^{1-\alpha }, \end{equation*} where $$\ln \mathbf{A}=\lambda \left( \tau ^{2}\right) \Delta a$$, and $$ \lambda \left( \tau ^{2}\right) $$ is defined as \begin{equation} \lambda \left( \tau ^{2}\right) \equiv \left( 1+\frac{1}{2}\frac{\left( \eta -1\right) ^{2}}{\eta }\frac{\sigma ^{4}}{\sigma ^{2}+\tau ^{2}}\right) . \end{equation} (8) The exposure of aggregate productivity to common shocks, $$\lambda (\tau ^{2}),$$ is decreasing in the amount of noise in the signals, $$\tau ^{2}$$. In addition, \begin{equation*} \lim_{\tau ^{2}\rightarrow \infty }\lambda \left( \tau ^{2}\right) =1, \end{equation*} and \begin{equation} \lambda ^{\ast }\equiv \lim_{\tau ^{2}\rightarrow 0}\lambda \left( \tau ^{2}\right) =1+\frac{1}{2}\frac{\left( \eta -1\right) ^{2}}{\eta }\sigma ^{2}. \end{equation} (9) Proof. See Appendix A. ■ The result of the above lemma is intuitive. Better information allows firms to allocate capital and labor more efficiently across each other and, as a result, the level of the aggregate productivity shock $$\mathbf{A}$$ increases with information precision because $$\lambda (\tau ^{2})$$ is decreasing in $$% \tau ^{2}$$. The upper bound on the exposure is attained under full information and is denoted by $$\lambda ^{\ast }$$. Below, we extend the above setup to a multiperiod setting. 1.1.2 The infinite-horizon setting To build our fully dynamic model, we first present an aggregation result where firm $$j$$ productivity, $$A_{j,t}$$, is determined by the following stochastic growth process: \begin{equation} A_{j,t}=\exp \left[ \sum_{s=0}^{t}\beta _{j,s}\Delta a_{s}\right] , \end{equation} (10) where $$\left\{ \Delta a_{s}\right\} _{s=0}^{t}$$ is a sequence of shocks common across all firms. For $$s=0,1,\cdots ,t$$, $$\beta _{j,s}$$ is the exposure of firm $$j$$’s productivity with respect to the common shock $$\Delta a_{s}$$. Here, we assume that each firm observes only one signal about its own exposure to the current-period common productivity shock, $$\beta _{j,t}$$, in every period $$t$$. For simplicity, we also assume that firms learn only from the observed signals but not from the history of realized output. To avoid keeping track of the distribution of firms with heterogeneous information, we assume that there are two groups of firms: (1) mature firms, which know the exact value of $$\left\{ \beta _{j,s}\right\} _{s=0}^{t}$$, and (2) adolescent firms, which in each period $$t$$, observe a noisy signal about $$\beta _{j,t}$$. Let $$\hat{K}$$ denote the total measure of mature firms and $$\bar{K}$$ be the total measure of adolescent firms. Also, let $$\hat{N}$$ and $$\bar{N}$$ denote the total labor input of mature firms and adolescent firms, respectively. Extending the setup of the static model to the dynamic environment, we assume that the distribution of $$\beta _{j,t}$$ is $$i.i.d.$$ across $$j$$ and $$t$$, and follows a normal distribution with mean $$\mu =1-\frac{1}{2}\frac{\left( \eta -1\right) }{\eta }\sigma ^{2}$$ and variance $$\frac{1}{\Delta a_{t}}\sigma ^{2}$$. In each period $$t$$, adolescent firms observe a signal for $$\beta _{j,t}$$ of the following form: \begin{equation} sig_{j,t}=\beta _{j,t}+\varepsilon _{j,t}, \end{equation} (11) where $$\varepsilon _{j,t}\sim N(0,\frac{1}{\Delta a_{t}}\tau _{t}^{2})$$ for all $$t$$. Given these assumptions, we can derive the posterior distribution of $$ A_{j,t}$$ and apply Equation (7) to recover the aggregate production function, which is given by the following lemma. Lemma 2. The total output of all mature firms, $$\hat{Y}_{t}$$, is \begin{equation} \hat{Y}_{t}=\hat{\mathbf{A}}_{t}\hat{K}_{t}^{\alpha }\hat{N}_{t}^{1-\alpha },\ \ with\ \ \ln \hat{\mathbf{A}}_{t}=\exp \left[ \sum_{s=0}^{t}\lambda ^{\ast }\Delta a_{s}\right] , \end{equation} (12) where $$\lambda ^{\ast }$$ is defined like in Lemma 1. The total output of all adolescent firms, $$\bar{Y}_{t}$$, is \begin{equation} \bar{Y}_{t}=\bar{\mathbf{A}}_{t}\bar{K}_{t}^{\alpha }\bar{N}_{t}^{1-\alpha },\ \ with\ \ \ln \bar{\mathbf{A}}_{t}=\exp \left[ \sum_{s=0}^{t}\lambda \left( \tau _{s}^{2}\right) \Delta a_{s}\right] , \end{equation} (13) where $$\lambda \left( \cdot \right) $$ is defined like in Equation (8). Proof. See Appendix A. ■ The above lemma highlights the basic learning mechanism in our model. Because adolescent firms have less information about their $$\left\{ \beta _{j,s}\right\} _{s=0}^{t}$$, they are less capable of taking advantage of technological progress, and hence their aggregate productivity has a lower elasticity with respect to common shocks than that of mature firms ($$\lambda \left( \tau _{s}^{2}\right) \leq \lambda ^{\ast },\ \forall s$$). Under our specification, Equation (12) implies that the law of motion of the productivity of mature firms satisfies \begin{equation} \ln \hat{\mathbf{A}}_{t+1}-\ln \hat{\mathbf{A}}_{t}=\lambda ^{\ast }\Delta a_{t+1}, \end{equation} (14) and the law of motion of adolescent firms is \begin{equation} \ln \bar{\mathbf{A}}_{t+1}-\ln \bar{\mathbf{A}}_{t}=\lambda \left( \tau _{t}^{2}\right) \Delta a_{t+1}. \end{equation} (15) 1.1.3 Aggregation with perpetual learning It is clear from the above discussion that adolescent firms are less sensitive to aggregate productivity shocks because they do not fully observe their exposures, $$\left\{ \beta _{j,s}\right\} _{s=0}^{t}$$. If we allow for long-term growth, that is, $$E[\Delta a_{t+1}]>0$$, with $$\lambda \left( \tau _{s}^{2}\right) <\lambda ^{\ast }\ \forall s$$, the lower exposure to aggregate productivity implies that adolescent firms will be less productive than mature firms on average. With $$E[\Delta a_{t+1}]>0$$, Equations (14) and (15) together imply that this difference will accumulate over time, and the economy cannot have a balanced growth path. To guarantee balanced growth, we keep the specification of productivity in Equation (10) and allow for perpetual learning, that is, we allow firms to receive new signals about the entire history of their exposure coefficients in every period $$t$$. We describe this process below: In period $$0$$, $$\ln A_{j,0}=\beta _{j,0}\Delta a_{0}$$, where $$\beta _{j,0}\sim N(\mu ,\frac{1}{\Delta a_{0}}\sigma ^{2})$$ and adolescent firms have no additional information about their $$\beta _{j,0}$$. In period $$1$$, $$\ln A_{j,1}=\beta _{j,0}\Delta a_{0}+\beta _{j,1}\Delta a_{1}$$, where $$\beta _{j,s}\sim N\left( \mu ,\frac{1}{\Delta a_{s}}\sigma ^{2}\right)$$ and $$s=0,1$$. Each adolescent firm observes a signal, $$s_{0}^{1}=\beta _{j,0}+\varepsilon _{0}^{1}$$, where $$\varepsilon _{0}^{1}\sim N(0,\frac{1}{\Delta a_{0}}\tau _{0}^{2})$$ to update its belief about $$\beta_{j,0}$$ and lower its posterior variance to $$\frac{1}{\Delta a_0} \frac{1}{\sigma^{-2}+\tau_{0}^{-2}}$$. In period $$2$$, $$\ln A_{j,1}=\beta _{j,0}\Delta a_{0}+\beta _{j,1}\Delta a_{1} + \beta _{j,2}\Delta a_{2}$$, where $$\beta _{j,s}\sim N\left( \mu ,\frac{1}{\Delta a_{s}}\sigma ^{2}\right)$$ and $$s=0,1,2$$. Each adolescent firm observes a signal on $$\beta _{j,1}$$, $$s_{1}^{1}=\beta _{j,1}+\varepsilon _{1}^{1}$$, where $$\varepsilon _{1}^{1}\sim N(0,\frac{1}{\Delta a_{1}}\tau _{0}^{2})$$ to lower is posterior variance to $$\frac{1}{\Delta a_0} \frac{1}{\sigma^{-2}+\tau_{0}^{-2}}$$. In addition, under perpetual learning, this firm also receives a signal about its previous exposure, $$\beta _{j,0}$$, $$s_{0}^{1}=\beta _{j,0}+\varepsilon _{0}^{1}$$, where $$\varepsilon _{0}^{1}\sim N(0,\frac{1}{\Delta a_{0}}\tau _{1}^{2})$$. As a result, it lowers further the posterior variance of $$\beta_{j,0}$$ to $$\frac{1}{\Delta a_0} \frac{1}{\sigma^{-2}+\tau_{0}^{-2}+\tau_{1}^{-2}}$$. Similarly, for $$t=3,4,\cdots $$, each adolescent firm observes a sequence of signals, $$s_{s}^{t}=\beta _{j,s}+\varepsilon _{s}^{t}$$, where $$ \varepsilon _{s}^{t}\sim N(0,\frac{1}{\Delta a_{s}}\tau _{t-s-1}^{2})$$ for $$ s=0, 1, \cdots ,t-1$$ and updates its believes on all previous exposure coefficients, $$\{\beta_{j,s}\}_{s=0}^{t-1}$$. In this setup, over time, firms will be constantly learning their exposures and improving their productivity, which allows us to modify Equation (15) and ensure balanced growth. In Appendix A, we show that the sequence $$\{\tau_t\}_{t=0}^{\infty}$$ can be specified as a function of a parameter $$\rho_s\in(0,1)$$ so that the ratio between the productivity of adolescent firms and that of mature firms, $$\chi _{t+1}\equiv\ln \left( \frac{\hat{\mathbf{A}}_{t+1}}{\bar{\mathbf{A} _{t+1}}}\right)$$, is stationary and follows an AR(1) process: \begin{equation} \chi _{t+1}=\ln \left( \frac{\hat{\mathbf{A}}_{t+1}}{\bar{\mathbf{A}}_{t+1}} \right) =\rho _{s}\chi _{t}+\left( \lambda^* -1\right) \Delta a_{t+1}. \end{equation} (16) In addition, the law of motion of $$\bar{\mathbf{A}}_{t}$$ can be specified recursively with $$ \chi _{t}$$ being the only state variable: \begin{equation} \ln \bar{\mathbf{A}}_{t+1}-\ln \bar{\mathbf{A}}_{t}=\left( 1-\rho _{s}\right) \chi _{t}+\Delta a_{t+1}. \end{equation} (17) Together with (14), the above two equations fully specify the aggregate productivity of adolescent firms and mature firms. 1.1.4 Summary of the micro-foundation of learning At the micro-level, $$\sigma $$ and the sequence $$\left\{ \tau _{t}^{2}\right\} _{t=0}^{\infty }$$ are the primitive parameters of the model. The parameter $$\sigma $$ is the dispersion of firms’ exposure to the economy-wide common productivity. Intuitively, higher dispersion implies more benefit of reallocating resources across firms. As shown in Equation (9), this implies that mature firms who have complete information about $$\left\{ \beta _{j,s}\right\} _{s=0}^{t}$$ are more exposed to aggregate shocks. Thanks to perpetual learning, adolescent firms can eventually obtain full information about their exposures. This condition rules out permanent gaps between the productivity of adolescent and mature firms and guarantees balanced growth. As shown in Equations (A8)–(A9), the sequence of variances of the signals, $$\left\{ \tau _{t}^{2}\right\} _{t=0}^{\infty }$$, is increasing in $$\rho _{s}$$. Intuitively, higher values of $$\tau _{t}^{2}$$ imply that adolescent firms’ information is less precise and, as a result, the productivity gap between adolescent firms and mature firms can persist for many periods. In our quantitative exercise, we do not directly specify the micro parameters $$\sigma $$ and $$\left\{ \tau _{t}^{2}\right\} _{t=1}^{\infty }$$. Rather, we calibrate the macro parameters $$\lambda ^{\ast }$$ and $$\rho _{s}$$ from empirical evidence on the difference in the exposure of young and old firms with respect to aggregate productivity shocks. Finally, given the dynamics of the productivity of adolescent and mature firms, $$\bar{\mathbf{A}}_{t}$$ and $$\hat{\mathbf{A}}_{t}$$, we specify aggregation production as the solution to the following optimal resource allocation problem: \begin{align} F(\hat{\mathbf{A}}_{t},\bar{\mathbf{A}}_{t},\hat{K}_{t},\bar{K} _{t},N_{t})&=\max_{\hat{N}_{t},\bar{N}_{t}}\left\{ \hat{\mathbf{A}}\hat{K} _{t}^{\alpha }\hat{N}_{t}^{1-\alpha }+\bar{\mathbf{A}}_{t}\bar{K} _{t}^{\alpha }\bar{N}_{t}^{1-\alpha }\right\} \\ \textit{subject to }\hat{N_{t}}+\bar{N_{t}}&=N_{t}. \notag \end{align} (18) Despite featuring substantial heterogeneity across firms, the production side of our model can be summarized by the production of a representative firm with the production function $$Y_{t}=F(\hat{\mathbf{A}}_{t},\bar{\mathbf{ A}}_{t},\hat{K}_{t},\bar{K}_{t},N_{t})$$, where the law of motion of productivity is given by Equations (14), (17), and (16). 1.2 The full model 1.2.1 Preferences Time is discrete and infinite, with $$t=1,2,3,\cdots $$. The representative agent has Kreps and Porteus (1978) preferences, like in Epstein and Zin (1989): \begin{equation} V_{t}=\left\{ \left( 1-\beta \right) u\left( C_{t},N_{t}\right) ^{1-\frac{1}{ \psi }}+\beta \left( E_{t}\left[ V_{t+1}^{1-\gamma }\right] \right) ^{\frac{ 1-1/\psi }{1-\gamma }}\right\} ^{\frac{1}{1-1/\psi }}, \end{equation} (19) where $$C_{t}$$ and $$N_{t}$$ denote, respectively, the total consumption and total hours worked at time $$t$$. For simplicity, we consider a Cobb-Douglas aggregator for consumption and leisure: \begin{equation*} u\left( C_{t},N_{t}\right) = C_t^{o}(1-N_t)^{1-o}. \end{equation*} We normalize $$N_t=1$$ in the case of inelastic labor supply, that is, when $$o=1$$. 1.2.2 Output producers The specification of aggregate output and individual firm output are as summarized in Equations (14), (17), (16), and (18). Following the long-run risks literature, we specify the stochastic process for the common productivity $$ \Delta a_{t}$$ as follows: \begin{align} \Delta a_{t+1} &={\mu +x_{t}+e}^{{\sigma _{a}}}{\varepsilon _{a,t+1}},\\ x_{t+1} &=\rho x_{t}+e^{\sigma _{x}}\varepsilon _{x,t+1}, \notag \\ \left[ \begin{array}{c} \varepsilon _{a,t+1} \\ \varepsilon _{x,t+1} \end{array} \right] &\sim i.i.d.N\left( \left[ \begin{array}{c} 0 \\ 0 \end{array} \right] ,\left[ \begin{array}{cc} 1 & 0 \\ 0 & 1 \end{array} \right] \right) ,\quad t=0,1,2,\cdots . \notag \end{align} (20) According to the above specification, short-run productivity shocks, $$ \varepsilon _{a,t+1}$$, affect contemporaneous output directly but have no effect on future productivity growth. Shocks to long-run productivity, represented by $$\varepsilon _{x,t+1}$$, carry news about future productivity growth rates but do not affect current output. The log standard deviations of these shocks, $${\sigma _{a}}$$ and $${\sigma _{x}}$$, are constant over time. 1.2.3 Firm dynamics For parsimony, we assume that all firms are subject to the same exit rate, $$\delta $$. For tractability, we assume that in each period the surviving adolescent firms, $$(1-\delta )\bar{K}_{t}$$, become mature with a constant probability $$\phi $$. Under this assumption, the law of motion of the mass of mature firms, $$\widehat{K}$$, can be written as \begin{equation*} \hat{K}_{t+1}=\left( 1-\delta \right) \hat{K}_{t}+\left( 1-\delta \right) \phi \bar{K}_{t}. \end{equation*} Note that in our setup, maturity and age are positively but not perfectly correlated. The parameter $$\phi $$ determines the speed of transition probability from adolescence to maturity in each period. New firms are created by combining ideas and physical investment goods. We use $$S_{t}$$ to denote the total measure of ideas or, equivalently, the total stock of intangible capital at time $$t$$. Like in Ai, Croce, and Li (2013), the total measure of new firms that can be created with total investment $$I_{t}$$ is determined by a concave and constant return-to-scale production function $$ G\left( I_{t},S_{t}\right) $$. Under these conditions, the law of motion of the total measure of young firms, $$\bar{K}_{t}$$, can be written as \begin{equation} \bar{K}_{t+1}=\left( 1-\delta \right) \left( 1-\phi \right) \bar{K} _{t}+G\left( I_{t},S_{t}\right) . \end{equation} (21) 1.2.4 Intangible capital We now specify the law of motion of intangible capital. Let $$S_{t}$$ denote the total stock of intangibles available at time $$t$$. We follow Ai, Croce, and Li (2013) in modeling intangible capital as a stock of growth options: \begin{equation} S_{t+1}=\left[ S_{t}-G\left( I_{t},S_{t}\right) \right] \times \left( 1-\delta _{S}\right) + J_{t}, \end{equation} (22) where $$J_t$$ represents intangible investments at time $$t$$. Each growth option can be used to build one unit of new firms. Under this normalization, $$G\left(I_t,S_t\right)$$ is also the total amount of growth options exercised at time $$t$$. Ai, Croce, and Li (2013) provide a micro-foundation for the aggregator $$G\left( I_{t}, S_{t} \right)$$ by modeling explicitly the competition among ideas with heterogeneous quality. We adopt an aggregator $$G$$ with constant elasticity of substitution between physical investment and intangible capital, \begin{equation} G\left(I,S\right) = \left( \nu I^{1-\frac{1}{\eta }}+\left( 1-\nu \right) S^{1-\frac{1}{\eta }}\right) ^{\frac{1}{1-1/\eta }}, \end{equation} (23) which conforms well to the data on the cross-section of book-to-market ratios, like in Ai, Croce, and Li (2013). Equation (22) can therefore be interpreted as follows. At time $$t$$, the agent has a mass $$S_{t}$$ of available growth options. If options are exercised optimally and the total amount of investment goods used to exercise options is $$I_{t}$$, then $$\left[ S_{t}-G\left( I_{t},S_{t}\right) \right] \times \left( 1-\delta _{S}\right) $$ is the total amount of growth options left at the end of the period after depreciation. $$ J_{t}$$ is the amount of growth options newly produced in period $$t$$. To complete the model, we note that consumption, investment in physical capital, and investment in intangible capital must sum up to total output: \begin{equation*} C_{t}+I_{t}+J_{t}=F(\hat{\mathbf{A}}_{t},\bar{\mathbf{A}}_{t},\hat{K}_{t}, \bar{K}_{t},N_{t}). \end{equation*} 1.3 Equilibrium conditions In our economy, standard welfare theorems apply, and we can construct equilibrium prices and quantities from the solution to the planner’s problem. Let $$\Lambda _{t,t+1}$$ be the one-step-ahead stochastic discount factor: \begin{equation} \Lambda _{t,t+1}=\beta \left( \frac{C_{t+1}}{C_{t}}\right) ^{-1}\left( \frac{ u_{t+1}}{u_{t}}\right) ^{1-\frac{1}{\psi }}\left( \frac{V_{t+1}}{E_{t}\left[ V_{t+1}^{1-\gamma }\right] ^{\frac{1}{1-\gamma }}}\right) ^{\frac{1}{\psi } -\gamma }. \end{equation} (24) Given the equilibrium quantities, we can show that the cum-dividend price of young firms, $$p_{\bar{K},t}$$, that of the mature firms, $$p_{\hat{K},t}$$, and that of the ideas, $$p_{S,t}$$, must jointly satisfy the following recursions: \begin{align} p_{\bar{K},t} &=\alpha \bar{\mathbf{A}}_{t}\left( \frac{\bar{K}_{t}}{\bar{N} _{t}}\right) ^{\alpha -1}+\left( 1-\delta \right) \left\{ \left( 1-\phi \right) E\left[ \Lambda _{t,t+1}p_{\bar{K},t+1}\right] +\phi E\left[ \Lambda _{t,t+1}p_{\hat{K},t+1}\right] \right\} , \\ \end{align} (25) \begin{align} p_{\hat{K},t} &=\alpha \hat{\mathbf{A}}_{t}\left( \frac{\hat{K}_{t}}{\hat{N} _{t}}\right) ^{\alpha -1}+\left( 1-\delta \right) E\left[ \Lambda _{t,t+1}p_{ \hat{K},t+1}\right] , \\ \end{align} (26) \begin{align} p_{S,t} &=\frac{1-\nu }{\nu }\left( \frac{I_{t}}{S_{t}}\right) ^{\frac{1}{ \eta }}+\left( 1-\delta _{S}\right) E\left[ \Lambda _{t+1}p_{S,t+1}\right] . \end{align} (27) According to Equation (25), the value of adolescent firms is determined by the marginal product of its capital in the current period, $$ \alpha \bar{\mathbf{A}}_{t}\left( \frac{\bar{K}_{t}}{\bar{N}_{t}}\right) ^{\alpha -1}$$, plus the continuation value of their future payoffs. Conditional on surviving to the next period with probability $$1-\delta $$, adolescent firms become mature with probability $$\phi $$ and pay $$p_{\bar{K} ,t+1}$$ going forward. With probability $$1-\phi $$, they stay in adolescence; that is, they continue to have limited information on their $$\beta $$s and pay the continuation value $$p_{\hat{K},t+1}$$. Equation (26) implies that the cum-dividends marginal value of mature firms equals the expected present value of the marginal product of old capital adjusted for the survival probability $$1-\delta$$. Similarly, Equation (27) states that the cum-dividend value of a unit of intangible capital is equal to the present value of its marginal product, $$ \frac{1-\nu }{\nu }\left( \frac{I_{t}}{S_{t}}\right) ^{\frac{1}{\eta }}$$, accounting for the survival probability of $$1-\delta _{S}$$. Using the above notation, the optimality for investment in physical capital and intangible capital can be written as \begin{align} E\left[ \Lambda _{t+1}p_{\bar{K},t+1}\right] -\frac{1}{G_{I}\left( I_{t},S_{t}\right) } &=\left( 1-\delta _{S}\right) E\left[ \Lambda _{t+1}p_{S,t+1}\right] , \\ \end{align} (28) \begin{align} 1 &=E\left[ \Lambda _{t+1}p_{S,t+1}\right] . \end{align} (29) The left-hand side of Equation (28) measures the net marginal benefit of exercising an additional option, that is, the present value of one additional unit of young physical capital net of the $$ \frac{1}{G_{I}\left( I_{t},S_{t}\right) }$$ exercise cost. The right-hand side of Equation (28) is, in contrast, the opportunity cost of exercising an additional option, that is, the market value of an unexercised option adjusted for the probability of death. Finally, Equation (29) prescribes that intangible investment must be set so that the ex-dividend value of growth options equals their marginal production cost. 1.4 Term structures Given a sequence of cash flows, $$\{CF_{t}\}_{t=0}^{\infty }$$, the time $$t$$ present value of the time $$t+n$$ component of the cash-flow sequence is denoted by $$P_{t,t+n}$$ and can be computed as follows: \begin{equation} P_{t,t+n}=E_{t}[\Lambda _{t,t+n}CF_{t+n}]\ \ n=1,2,\ldots , \notag \end{equation} where $$\Lambda _{t,t+n}=\Lambda _{t,t+1}\times \Lambda _{t+1,t+2}\times \cdots \times \Lambda _{t+n-1,t+n}$$ is the $$n$$-step-ahead discount factor that can be computed from the one-step-ahead stochastic discount factors. The one-period return of the claim to $$CF_{t+n}$$ from period $$t$$ to $$t+1$$ is simply $$\frac{P_{t+1,t+n}}{P_{t,t+n}}$$. We are interested in studying the risk premium, $$RP_t(n)$$, on this return for different maturities $$n$$: \begin{equation*} RP_t\left(n\right)=E_t\left[\frac{P_{t+1,t+n}}{P_{t,t+n}} - r^f_t\right] ,\quad n=1,2,\cdots, \end{equation*} where $$r^f_t = \frac{1}{E[\Lambda_{t,t+1}]}$$ is the one-period risk-free interest rate. The term structure of a cash-flow sequence $$ \{CF_t\}_{t=0}^{\infty}$$ refers to the link between $$RP_t(n)$$ and $$n$$. Borrowing the terminology from the literature on the term structure of interest rates, we will call $$RP_t(n)$$ the risk premium on the zero-coupon equity with maturity $$n$$. While the term structure of real interest rates is determined by the properties of the stochastic discount factor alone, the term structure of equity returns depends on the dynamics of both the stochastic discount factor and that of the cash-flow process. Our goal is to study the slope of the term structure of equities in the general equilibrium model we developed above, where both the stochastic discount factor and the cash flows are endogenously determined in equilibrium. Binsbergen, Brandt, and Koijen (2012) and Binsbergen et al. (2013) present evidence for substantial variations in the slope of the term structure over time. In particular, Binsbergen, Brandt, and Koijen (2012) document a significant negative slope of the term structure for the aggregate stock market during recessions. Standard RBC models predict an unambiguously positive slope in the term structure of equity returns, and are therefore inconsistent with the data. In the rest of the paper, we study the term structure of equity return in our learning model in two steps. In Section 2, we analyze our learning model with homoscedastic productivity shocks and contrast it with the standard RBC model. Although without time-varying volatility the slope of the term structure is constant over time, this analysis allows us to demonstrate that our learning mechanism creates a downward-sloping term structure, especially when long-run productivity shocks are important. Guided by this intuition, in Section 3 we provide empirical evidence on the time-varying volatility of productivity shocks and incorporate this feature into our learning model. We show that countercyclical variations in the relative volatility of productivity shocks allow our model to account for the variation in both the slope of the term structure and the market equity premium. 2. The Unconditional Term Structure In this section we calibrate our benchmark learning model and compare it to one without learning and without intangible capital, which we call RBC. This is essentially the production economy studied in Croce (2014), and it can be obtained as a special case of our setting under two conditions: (1) $$\tau _{j,t}^{2}=0,\ \forall j,\forall t$$, that is, all firms have full information about their productivity, and (2) the $$ G\left( I,S\right) $$ function in Equation (21) is replaced by the following capital adjustment cost function: \begin{equation*} G\left( I,K\right) =K\left[ \alpha _{0}+\frac{\alpha _{1}}{1-1/\xi }\left( \frac{I}{K}\right) ^{1-\frac{1}{\xi }}\right] . \end{equation*} We first describe our calibration and then present the quantitative results. 2.1 Calibration We list our calibrated parameter values in Table 1. The discount rate ($$\beta $$), risk aversion ($$\gamma $$), and intertemporal elasticity of substitution ($$\psi $$) are set like in standard long-run risk models. The weight of leisure in the utility function ($$o$$) is chosen to match the average share of hours worked, that is, $$N=1/3$$ in steady state. Table 1 Calibrated parameter values Preference parameters $$\quad$$ Effective risk aversion $$\gamma\cdot o $$ 10 $$\quad$$ Intertemporal elasticity of substitution $$\psi $$ 2 $$\quad$$ Discount factor $$\beta $$ 0.98 $$\quad$$ Leisure weight $$o$$ 0.33 Technology parameters $$\quad$$ Capital share $$\alpha $$ 0.3 $$\quad$$ Depreciation rate of physical capital (%) $$\delta$$ 9.9 $$\quad$$ Depreciation rate of intangible capital (%) $$\delta _{S}$$ 9.9 $$\quad$$ Weight on physical investment in $$G$$ (%) $$\nu $$ 92.5 $$\quad$$ Elasticity of substitution in $$G$$ $$\eta $$ 12 Learning parameters $$\quad$$ Percentage share of firms with limited information $$\phi$$ 0.70 $$\quad$$ transitioning to full information $$\quad$$ Productivity exposure of firms with full information $$\lambda^*$$ 6 $$\quad$$ Diffusion of information: cointegration speed $$\rho_s$$ 0.96 Common productivity parameters $$\quad$$ Average growth rate $$\lambda^*\mu$$ 00.02 $$\quad$$ Volatility of short-run risk (%) $$\lambda^* \exp(\sigma _{a})$$ 5.65 $$\quad$$ Relative volatility of long-run risk $$\exp(\sigma _{x})/\exp(\sigma _{a})$$ 00.12 $$\quad$$ Autocorrelation of expected growth $$\rho_x $$ 0.965 Preference parameters $$\quad$$ Effective risk aversion $$\gamma\cdot o $$ 10 $$\quad$$ Intertemporal elasticity of substitution $$\psi $$ 2 $$\quad$$ Discount factor $$\beta $$ 0.98 $$\quad$$ Leisure weight $$o$$ 0.33 Technology parameters $$\quad$$ Capital share $$\alpha $$ 0.3 $$\quad$$ Depreciation rate of physical capital (%) $$\delta$$ 9.9 $$\quad$$ Depreciation rate of intangible capital (%) $$\delta _{S}$$ 9.9 $$\quad$$ Weight on physical investment in $$G$$ (%) $$\nu $$ 92.5 $$\quad$$ Elasticity of substitution in $$G$$ $$\eta $$ 12 Learning parameters $$\quad$$ Percentage share of firms with limited information $$\phi$$ 0.70 $$\quad$$ transitioning to full information $$\quad$$ Productivity exposure of firms with full information $$\lambda^*$$ 6 $$\quad$$ Diffusion of information: cointegration speed $$\rho_s$$ 0.96 Common productivity parameters $$\quad$$ Average growth rate $$\lambda^*\mu$$ 00.02 $$\quad$$ Volatility of short-run risk (%) $$\lambda^* \exp(\sigma _{a})$$ 5.65 $$\quad$$ Relative volatility of long-run risk $$\exp(\sigma _{x})/\exp(\sigma _{a})$$ 00.12 $$\quad$$ Autocorrelation of expected growth $$\rho_x $$ 0.965 This table reports the parameter values used for our annual calibrations. The benchmark model features both tangible and intangible capital, as well as full- and limited-information firms. Table 1 Calibrated parameter values Preference parameters $$\quad$$ Effective risk aversion $$\gamma\cdot o $$ 10 $$\quad$$ Intertemporal elasticity of substitution $$\psi $$ 2 $$\quad$$ Discount factor $$\beta $$ 0.98 $$\quad$$ Leisure weight $$o$$ 0.33 Technology parameters $$\quad$$ Capital share $$\alpha $$ 0.3 $$\quad$$ Depreciation rate of physical capital (%) $$\delta$$ 9.9 $$\quad$$ Depreciation rate of intangible capital (%) $$\delta _{S}$$ 9.9 $$\quad$$ Weight on physical investment in $$G$$ (%) $$\nu $$ 92.5 $$\quad$$ Elasticity of substitution in $$G$$ $$\eta $$ 12 Learning parameters $$\quad$$ Percentage share of firms with limited information $$\phi$$ 0.70 $$\quad$$ transitioning to full information $$\quad$$ Productivity exposure of firms with full information $$\lambda^*$$ 6 $$\quad$$ Diffusion of information: cointegration speed $$\rho_s$$ 0.96 Common productivity parameters $$\quad$$ Average growth rate $$\lambda^*\mu$$ 00.02 $$\quad$$ Volatility of short-run risk (%) $$\lambda^* \exp(\sigma _{a})$$ 5.65 $$\quad$$ Relative volatility of long-run risk $$\exp(\sigma _{x})/\exp(\sigma _{a})$$ 00.12 $$\quad$$ Autocorrelation of expected growth $$\rho_x $$ 0.965 Preference parameters $$\quad$$ Effective risk aversion $$\gamma\cdot o $$ 10 $$\quad$$ Intertemporal elasticity of substitution $$\psi $$ 2 $$\quad$$ Discount factor $$\beta $$ 0.98 $$\quad$$ Leisure weight $$o$$ 0.33 Technology parameters $$\quad$$ Capital share $$\alpha $$ 0.3 $$\quad$$ Depreciation rate of physical capital (%) $$\delta$$ 9.9 $$\quad$$ Depreciation rate of intangible capital (%) $$\delta _{S}$$ 9.9 $$\quad$$ Weight on physical investment in $$G$$ (%) $$\nu $$ 92.5 $$\quad$$ Elasticity of substitution in $$G$$ $$\eta $$ 12 Learning parameters $$\quad$$ Percentage share of firms with limited information $$\phi$$ 0.70 $$\quad$$ transitioning to full information $$\quad$$ Productivity exposure of firms with full information $$\lambda^*$$ 6 $$\quad$$ Diffusion of information: cointegration speed $$\rho_s$$ 0.96 Common productivity parameters $$\quad$$ Average growth rate $$\lambda^*\mu$$ 00.02 $$\quad$$ Volatility of short-run risk (%) $$\lambda^* \exp(\sigma _{a})$$ 5.65 $$\quad$$ Relative volatility of long-run risk $$\exp(\sigma _{x})/\exp(\sigma _{a})$$ 00.12 $$\quad$$ Autocorrelation of expected growth $$\rho_x $$ 0.965 This table reports the parameter values used for our annual calibrations. The benchmark model features both tangible and intangible capital, as well as full- and limited-information firms. Both the capital share ($$\alpha $$) and capital depreciation rate ($$\delta $$) are standard in the RBC literature. The parameters governing the accumulation of growth options are chosen in the spirit of Ai, Croce, and Li (2013). For parsimony, the depreciation of intangible capital ($$\delta _{S}$$) is set equal to that of physical capital. The shape of the aggregator $$G\left( I,S\right) $$ is determined by two parameters, the weight on physical investment, $$\nu $$, and the elasticity $$\eta $$. Like in Ai, Croce, and Li (2013), we choose them to jointly match the steady-state consumption-tangible investment ratio and the consumption-intangible investment ratio.3 Our calibration of the parameters of the aggregate productivity shocks is standard in the long-run productivity risk literature. We calibrate $$\mu $$ and $$\sigma _{a}$$ to match the mean and the volatility, respectively, of output growth in the US economy in our sample period, 1929–2007. We set $$ e^{\sigma _{x}-\sigma _{a}}=0.12$$ and $$\rho =0.965$$, in the spirit of Croce (2014). We calibrate the parameters for idiosyncratic productivity shocks to match moments of the joint distribution of firm age and exposure to aggregate productivity shocks. Using Compustat data, Ai, Croce, and Li (2013) document a strong positive correlation between firm age and the exposure of firm-level productivity to measured aggregate productivity shocks. In our model, the parameter $$\phi $$ is the rate of transition to maturity, and the parameter $$ \lambda^*$$ governs the difference between the exposure of adolescent and mature firms to aggregate shocks. We simultaneously calibrate $$\phi $$ and $$\lambda^*$$ to target the moments of the conditional distribution of firm exposure to aggregate productivity shocks as a function of firm age. Note that $$\lambda^*$$ is the difference between the exposure of mature and adolescent firms, and $$\phi $$ determines the fraction of mature firms as a function of firm age. Jointly, the two parameters pin down the average exposure of productivity shocks for firms of all ages. We therefore choose $$\phi $$ and $$\lambda^*$$ jointly to target regression coefficients of the exposure-age relationship in the data. We describe in Appendix B the details of this calculation, which yields $$\phi =0.70$$ and $$\lambda^*=6$$. The persistence of the cointegration residual $$\zeta _{t}=0.96$$ is obtained by estimating the autocorrelation of the log productivity difference of the top 20 and bottom 20 percentiles of the firm age distribution. When calibrating the RBC model, we retain the same calibration except for two modifications. We set the subjective discount factor to $$0.99$$ to match the average risk-free interest rate.4 We also lower the volatility of short-run shocks to 4% ($$\lambda^* e^{\sigma _{a}}=4\%$$) to match the volatility of total output in the data. A lower level of $$\lambda^* e^{\sigma _{a}}$$ matches the same level of volatility of output as the learning model because all firms are fully exposed to shocks to $$\Delta a_{t+1}$$. Additionally, we set the adjustment cost parameter $$\xi =1.27$$ to obtain an annual market risk premium the same as our benchmark learning model, $$4\%$$ per year. 2.2 Quantitative results We report the quantitative implications of our benchmark model and the RBC model in Table 2. We make several observations. First, our benchmark model and the RBC model have similar implications for macroeconomic quantities, except that the RBC model produces a significantly lower volatility of investment. Due to the absence of adjustment costs, the ratio of the volatility of investment relative to that of consumption in our benchmark model is $$4.03$$, much closer to its empirical counterpart, $$5.29$$. Table 2 Vintage and intangible capital models Data Benchmark RBC A. Aggregate Quantities E[$$I/Y$$] 00.15 (00.05) 0.17 0.29 $$\sigma(\Delta c)$$ 02.53 (00.56) 2.98 2.76 $$\sigma(\Delta i)/\sigma(\Delta c)$$ 05.29 (00.50) 4.03 1.48 $$\sigma(\Delta j)/\sigma(\Delta i)$$ 00.50 (00.07) 1.04 – $$\sigma(\Delta n)$$ 02.07 (00.21) 1.51 0.10 $$AC_1 (\Delta c)$$ 00.49 (00.15) 0.41 0.21 $$\rho_{\Delta c,\Delta n} $$ 00.28 (00.07) 0.55 0.47 $$\rho_{\Delta c,\Delta i} $$ 00.39 (00.15) 0.69 0.91 B. Asset Prices $$E[r^f]$$ 00.89 (00.97) 0.44 0.96 $$E[r^{L,ex}_K]$$ 05.70 (02.25) 4.05 4.02 $$E[r_K^L-r_S^L]$$ 04.32 (01.39) 3.83 – $$RP(2)$$ 10.08 (05.04) 6.77 $$-$$26.76 Data Benchmark RBC A. Aggregate Quantities E[$$I/Y$$] 00.15 (00.05) 0.17 0.29 $$\sigma(\Delta c)$$ 02.53 (00.56) 2.98 2.76 $$\sigma(\Delta i)/\sigma(\Delta c)$$ 05.29 (00.50) 4.03 1.48 $$\sigma(\Delta j)/\sigma(\Delta i)$$ 00.50 (00.07) 1.04 – $$\sigma(\Delta n)$$ 02.07 (00.21) 1.51 0.10 $$AC_1 (\Delta c)$$ 00.49 (00.15) 0.41 0.21 $$\rho_{\Delta c,\Delta n} $$ 00.28 (00.07) 0.55 0.47 $$\rho_{\Delta c,\Delta i} $$ 00.39 (00.15) 0.69 0.91 B. Asset Prices $$E[r^f]$$ 00.89 (00.97) 0.44 0.96 $$E[r^{L,ex}_K]$$ 05.70 (02.25) 4.05 4.02 $$E[r_K^L-r_S^L]$$ 04.32 (01.39) 3.83 – $$RP(2)$$ 10.08 (05.04) 6.77 $$-$$26.76 All entries for the models are obtained from repetitions of small samples. Data refer to the United States and span the sample 1930–2007, unless otherwise stated. Numbers in parentheses are GMM Newey-West adjusted standard errors. $$E[ r_{K}^{L}-r_{S}^{L}] $$ and $$E[ r_{K}^{L,ex}] $$ measure the levered spread between tangible and intangible capital returns, and the levered excess returns of tangible capital, respectively. We assume a constant leverage of three, consistent with Garcia-Feijo and Jorgensen (2010). In the data, we use the Fama-French HML factor and the market excess return factor as the counterparts of $$E[ r_{K}^{L}-r_{S}^{L}] $$ and $$E[ r_{K}^{L,ex}] $$, respectively. The annualized empirical counterpart of the risk premium on the zero-coupon equity with maturity of 2 years, $$RP(2)$$, is from Binsbergen, Brandt, and Koijen (2012). Volatility and correlations are denoted as $$\sigma(\cdot)$$ and $$\rho_{\cdot,\cdot}$$, respectively. Our annual calibrations are reported in Table 1. Table 2 Vintage and intangible capital models Data Benchmark RBC A. Aggregate Quantities E[$$I/Y$$] 00.15 (00.05) 0.17 0.29 $$\sigma(\Delta c)$$ 02.53 (00.56) 2.98 2.76 $$\sigma(\Delta i)/\sigma(\Delta c)$$ 05.29 (00.50) 4.03 1.48 $$\sigma(\Delta j)/\sigma(\Delta i)$$ 00.50 (00.07) 1.04 – $$\sigma(\Delta n)$$ 02.07 (00.21) 1.51 0.10 $$AC_1 (\Delta c)$$ 00.49 (00.15) 0.41 0.21 $$\rho_{\Delta c,\Delta n} $$ 00.28 (00.07) 0.55 0.47 $$\rho_{\Delta c,\Delta i} $$ 00.39 (00.15) 0.69 0.91 B. Asset Prices $$E[r^f]$$ 00.89 (00.97) 0.44 0.96 $$E[r^{L,ex}_K]$$ 05.70 (02.25) 4.05 4.02 $$E[r_K^L-r_S^L]$$ 04.32 (01.39) 3.83 – $$RP(2)$$ 10.08 (05.04) 6.77 $$-$$26.76 Data Benchmark RBC A. Aggregate Quantities E[$$I/Y$$] 00.15 (00.05) 0.17 0.29 $$\sigma(\Delta c)$$ 02.53 (00.56) 2.98 2.76 $$\sigma(\Delta i)/\sigma(\Delta c)$$ 05.29 (00.50) 4.03 1.48 $$\sigma(\Delta j)/\sigma(\Delta i)$$ 00.50 (00.07) 1.04 – $$\sigma(\Delta n)$$ 02.07 (00.21) 1.51 0.10 $$AC_1 (\Delta c)$$ 00.49 (00.15) 0.41 0.21 $$\rho_{\Delta c,\Delta n} $$ 00.28 (00.07) 0.55 0.47 $$\rho_{\Delta c,\Delta i} $$ 00.39 (00.15) 0.69 0.91 B. Asset Prices $$E[r^f]$$ 00.89 (00.97) 0.44 0.96 $$E[r^{L,ex}_K]$$ 05.70 (02.25) 4.05 4.02 $$E[r_K^L-r_S^L]$$ 04.32 (01.39) 3.83 – $$RP(2)$$ 10.08 (05.04) 6.77 $$-$$26.76 All entries for the models are obtained from repetitions of small samples. Data refer to the United States and span the sample 1930–2007, unless otherwise stated. Numbers in parentheses are GMM Newey-West adjusted standard errors. $$E[ r_{K}^{L}-r_{S}^{L}] $$ and $$E[ r_{K}^{L,ex}] $$ measure the levered spread between tangible and intangible capital returns, and the levered excess returns of tangible capital, respectively. We assume a constant leverage of three, consistent with Garcia-Feijo and Jorgensen (2010). In the data, we use the Fama-French HML factor and the market excess return factor as the counterparts of $$E[ r_{K}^{L}-r_{S}^{L}] $$ and $$E[ r_{K}^{L,ex}] $$, respectively. The annualized empirical counterpart of the risk premium on the zero-coupon equity with maturity of 2 years, $$RP(2)$$, is from Binsbergen, Brandt, and Koijen (2012). Volatility and correlations are denoted as $$\sigma(\cdot)$$ and $$\rho_{\cdot,\cdot}$$, respectively. Our annual calibrations are reported in Table 1. Second, both models produce a significant equity premium, but they yield very different term structures. In our benchmark model, the term structure of equity is downward sloping over short maturities, whereas that in the RBC model is upward sloping. The risk premium on the claim to the zero-coupon equity with 2-year maturity ($$RP(2)$$) in our benchmark model is $$6.77\%$$ per year, close to the evidence reported by Binsbergen, Brandt, and Koijen (2012), who show that a strategy long in a dividend strip with a maturity of 1.9 years and short in a dividend strip with a maturity of 0.9 years pays an average annual excess return of $$10.10\%$$. In contrast, $$RP(2)$$ in the RBC model is $$-26.76\%$$, and the high market risk premium is obtained by a very high right tail of the term structure.5 Third, our benchmark model also produces a significant spread between the return on physical capital and the return on intangible capital. This implication of our model is consistent with the value premium evidence, that is, that physical capital-intensive firms have a higher return than do intangible capital-intensive firms. To understand the above implications, we compare the impulse response functions of quantities and prices of our benchmark model with learning to those in the Croce (2014) model. In Figure 1 we depict the response of quantities (left panel) and asset prices (right panel) to short-run shocks. The responses to long-run productivity shocks are shown in Figure 2. Figure 1 View largeDownload slide Impulse response functions for short-run shocks This figure shows percentage annual log-deviations from the steady state upon the realization of a positive short-run shock. Returns are not levered. Both the RBC and benchmark models feature short- and long-run productivity risk. The RBC model has convex adjustment costs. The benchmark capital model features limited information and learning and is calibrated as detailed in Table 1. Figure 1 View largeDownload slide Impulse response functions for short-run shocks This figure shows percentage annual log-deviations from the steady state upon the realization of a positive short-run shock. Returns are not levered. Both the RBC and benchmark models feature short- and long-run productivity risk. The RBC model has convex adjustment costs. The benchmark capital model features limited information and learning and is calibrated as detailed in Table 1. Figure 2 View largeDownload slide Impulse response functions for long-run shocks This figure shows percentage annual log-deviations from the steady state upon the realization of a positive long-run shock. Returns are not levered. Both the RBC and benchmark models feature short- and long-run productivity risk. The RBC model has convex adjustment costs. The benchmark capital model features limited information and learning and is calibrated as detailed in Table 1. Figure 2 View largeDownload slide Impulse response functions for long-run shocks This figure shows percentage annual log-deviations from the steady state upon the realization of a positive long-run shock. Returns are not levered. Both the RBC and benchmark models feature short- and long-run productivity risk. The RBC model has convex adjustment costs. The benchmark capital model features limited information and learning and is calibrated as detailed in Table 1. 2.2.1 Contemporaneous productivity shocks Focusing on contemporaneous shocks to productivity, or short-run shocks, in Figure 1, we note that the benchmark capital model produces responses qualitatively similar to those in the RBC model, but with a few differences. First, in the absence of adjustment costs, physical investment responds strongly to short-run productivity shocks. As a result, our learning capital model generates a high volatility of investment, like in the data. The significant adjustment cost helps the RBC model produce a high level of equity premium, but at the cost of a counterfactually low level of volatility of investment. Second, in our benchmark model, the response of the return on intangible capital ($$R_{S}^{ex}$$) to short-run productivity shocks is significantly smaller than that of the physical capital ($$R_{K}^{ex}$$). This result contributes to the generation of a value premium in our model and can be explained as follows. Short-run productivity shocks directly affect the payoff of physical capital owned by existing firms. In addition, most of the existing firms are mature firms, and their marginal product of capital is more sensitive to these shocks than is that of adolescent firms. In contrast, productivity shocks affect the payoff of intangibles only indirectly: a growth option benefits from productivity shocks because option exercise leads to creation of an adolescent firm. Because adolescent firms are less able to take advantage of technological progress, $$R_{S}^{ex}$$ is less sensitive to contemporaneous productivity shocks than $$R_{K}^{ex}$$. Third, in both models investment responds positively to contemporaneous productivity shocks, whereas short-term dividends respond negatively. This creates a force in both models that pushes up the slope of the term structure of equity returns. The negative response of dividends to productivity shocks is the implication of the general-equilibrium resource constraint: \begin{equation*} D_{t}=Y_{t}-W_{t}N_{t}-I_{t}-J_{t}=\alpha Y_{t}-I_{t}-J_{t}, \end{equation*} where $$W_{t}$$ is the equilibrium wage, and the second equality results from our constant factor shares. Because investment responds strongly and positively to contemporaneous productivity shocks, dividends must respond negatively. To understand the negative slope of the term structure of equity returns in our learning model, we need to turn to the impulse responses with respect to news shocks. 2.2.2 News about future productivity shocks Like in Bansal and Yaron (2004), Croce (2014), and Gourio (2012), news about future consumption growth requires a significant compensation under recursive preferences.6 In Figure 2, we plot the impulse response functions produced by our learning model (solid line) and by the RBC model (dashed line) with respect to shocks to news about future productivity, $$\varepsilon _{x,t+1}$$. The impulse responses of investment are the key to understanding the difference in the asset pricing implications of the two models. First, note that in the RBC model investment responds positively to news shocks. With an IES of $$ \psi =2$$, upon the arrival of positive news about future productivity shocks, the substitution effect dominates, and investment rises. As for the case of positive short-run shocks, increases in investment are associated with decreases in dividends. This pattern of the impulse response makes short-term dividends less risky than long-term dividends, and hence it reinforces the upward-sloping term structure of equity produced by short-run productivity shocks. In contrast, in our learning model, on impact, investment responds negatively to positive news shocks. Over time, as news about future productivity materializes, investment gradually goes up. Intuitively, a positive news shock does not increase current period productivity, and its effect realizes slowly over time. On one hand, the substitution effect is moderate because new investment builds adolescent firms, which cannot take full advantage of the rise in productivity. On the other hand, the income effect is strong because all existing mature firms immediately benefit from the positive productivity shock. As a result, investment immediately drops and a higher dividend payout is used to support more consumption. Over time, as the effect of the news materializes, investment eventually picks up, and dividends fall correspondingly. This pattern of impulse response has strong implications for asset prices. First, even though investment drops upon the arrival of positive news, the return to physical capital responds strongly and positively to news, allowing our learning model to produce a high equity premium without resorting to adjustment costs. Note that equity is the claim to existing firms, most of which are mature and respond strongly to productivity shocks. As a result, the return on physical capital ($$R_{K}^{ex}$$) rises immediately. In contrast, in the RBC model, the strong reaction of $$ R_{K}^{ex}$$ to news shocks is achieved by assuming a high adjustment cost, which produces a counterfactually low level of the volatility of investment. Second, because short-term dividends respond positively to news, they are more risky and require a greater compensation. The impulse response of dividends in our learning model reflects that of the term structure of equity returns. As a result, our learning model generates a downward sloping term structure over short maturities, in contrast to the RBC model. Third, the negative response of physical investment with respect to positive news about future productivity shocks also implies that the return to intangible capital declines, therefore providing a hedge against these shocks. Because intangible capital and physical investment are complements—less tangible investment implies that a smaller fraction of growth options can be exercised—the decline of physical investment is associated with a lower market value of growth options and therefore a lower return $$R_{S}^{ex}$$. At the equilibrium, news shocks enhance the value premium generated in our benchmark model. 2.2.3 Sensitivity Analysis In Appendix D, we show that our results are robust to a range of plausible values for the speed of information diffusion ($$\rho_s$$) and the speed of learning ($$\phi$$). Furthermore, we show that our results continue to hold even without intangible capital ($$\nu=1$$). 3. Dynamics of the Term Structure Slope The analysis in the previous section implies that news shocks and contemporaneous productivity shocks have opposite effects on the slope of the term structure of equity. Because investment responds positively to contemporaneous productivity shocks, the aggregate payout has to react negatively due to the resource constraint. As a result, short-term dividends must be less risky than long-term dividends. In contrast, under our learning friction the income effect dominates upon positive news shocks, investment declines, and short-term dividends increase. As a result, news shocks make short-term dividends riskier and generate a downward-sloping term structure. Naturally, if the relative importance of news shocks and contemporaneous productivity shocks is time varying, then our model has the potential to account for the time-varying slope of the term structure. Guided by this theoretical insight, in the rest of this section we present and estimate a model of time-varying productivity volatility. We then recalibrate our learning model to incorporate our novel empirical evidence. By doing so, we connect the time-varying slope of the term structure to fundamental macroeconomic volatility factors. 3.1 A model with time-varying volatility 3.1.1 Model specification We replace the homoscedastic model in Equation (20) with the following specification of the productivity process: \begin{align} \log \frac{A_{t+1}}{A_{t}}\equiv \Delta a_{t+1} &={\mu +x_{t}+e^{\bar{\sigma }_{a}+\sigma _{a,t}}\varepsilon _{a,t+1}}, \\ x_{t+1} &=\rho _{x}x_{t}+e^{\bar{\sigma}_{x}+\zeta _{t}+\sigma _{a,t}}\varepsilon _{x,t+1}, \notag \\ \end{align} (30) \begin{align} \sigma _{a,t+1} &=\rho _{\sigma }\sigma _{a,t}+\sigma _{\sigma }\epsilon _{\sigma ,t+1}, \end{align} (31) where $$\left[ \varepsilon _{a,t+1},\varepsilon _{x,t+1},\varepsilon _{\sigma ,t+1}\right] $$ is a vector of standard normal shocks i.i.d. over time. The process $$\sigma _{a,t}$$ is the time-varying stochastic log-volatility for contemporaneous productivity shocks. The key element in our estimation is the term $$e^{\zeta _{t}}$$, which captures the variations in the relative volatility between long-run and short-run productivity shocks. For parsimony, in the model we assume that the relative volatility is a negative log-linear function of the state variable $$x_{t}$$: \begin{equation} \zeta _{t}=-\beta _{\zeta |x}x_{t}. \end{equation} (32) In what follows, we present an empirical procedure to estimate the time-varying conditional volatilities of short- and long-run productivity growth shocks, and we investigate their properties to motivate our specification in (32). The empirical estimation also provides us guidance on the key parameters that are new to the stochastic volatility model, namely, $$\rho _{\sigma }$$, $$\sigma _{\sigma }$$, and $$\beta _{\zeta |x}$$. 3.1.2 Estimation procedure We use a quarterly sample ranging from 1947:Q1 to 2013:Q4. Our main goal is to obtain measures of the conditional volatilities of the long-run and short-run productivity shocks and further construct a measure of the ratio of the two volatilities, $$\zeta _{t}$$. Because none of the above quantities is directly observable, we adopt the following procedure to construct estimates of them. First, like in Croce (2014), we project the one-period-ahead productivity growth, $$\Delta a_{t+1}$$, on a set of predictive factors, $$F_{t}$$, \begin{equation} \Delta a_{t+1}=\mu +\beta ^{x}F_{t}+u_{a,t+1}, \end{equation} (33) and use the fitted value to construct our estimates of the latent predictive component of productivity, $$x_{t}=\widehat{\beta ^{x}}F_{t}$$. The short-run shocks are identified using the residual of the regression above, $$ u_{a,t+1}$$, and innovations in news shocks are constructed as the residual of the following estimated AR(1) process: \begin{equation} x_{t+1}=\rho_x x_{t+1}+u_{x,t+1}. \end{equation} (34) Second, we take the residuals, $$u_{a,t+1}$$ and $$u_{x,t+1}$$, and construct measures of their conditional variances. For robustness, we adopt two alternative approaches. We call the first approach the predictive approach. That is, we regress the realized variances of $$u_{a,t+1}$$ and $$u_{x,t+1}$$ on the same vector of predictive variables, $$F_{t}$$: \begin{equation} \log \left( \frac{1}{h}\sum_{j=1}^{h}u_{a,t+j}^{2}\right) =\beta _{a,0}^{\sigma }+\beta _{a}^{\sigma }F_{t}+error, \end{equation} (35) \begin{equation} \log \left( \frac{1}{h}\sum_{j=1}^{h}u_{x,t+j}^{2}\right) =\beta _{x,0}^{\sigma }+\beta _{x}^{\sigma }F_{t}+error. \end{equation} (36) We set $$h=4$$, so that realized variance is measured as the sum of the squared innovations in the next four quarters. This procedure allows us to construct the demeaned conditional log standard deviation as the square root of the predictable component of the realized variances: $$\widehat{\sigma} _{a,t} =\frac{1}{2} \left( \widehat{\beta }_{a}^{\sigma }F_{t}\right) $$ and $$\widehat{\sigma}_{x,t}= \frac{1}{2}\left( \widehat{\beta }_{x}^{\sigma }F_{t}\right) $$. Our second approach is the GARCH approach, in which we replace Equations (35) and (36) with two GARCH(1,1) models. Third, we construct the key object of interest, the relative volatility of long-run versus short-run shocks, as $$\widehat{\zeta}_{t}=\widehat{\sigma}_{x,t}-\widehat{\sigma} _{a,t}$$ and investigate its empirical properties. In our benchmark estimation, we use the four factors proposed by Bansal and Shaliastovich (2013) plus the integrated volatility of stock market returns.7 To ensure robustness, we also consider an alternative specification with the thirteen factors proposed by Jurado, Ng, and Ludvigson (2015). Their factors are principal components extracted from a very wide cross-section of both macroeconomic and financial indicators and have significant predictive power for aggregate volatility. These factors are available only over the shorter sample of 1960:Q3–2011:Q4. In Table 3 we summarize the results based on our benchmark specification. We report our robustness results in the appendix, Table C.1. Across all procedures, we estimate all coefficients jointly by continuous GMM and use as many orthogonality conditions as parameters. Hence, our inference accounts for all possible layers of estimation uncertainty, and our point estimates are equivalent to those of a multistep OLS procedure. Standard errors are computed using the GMM efficient weighting matrix and are Newey-West adjusted. All data are quarterly to better capture variability in the conditional volatility of productivity.8 Table 3 Time-varying volatility in productivity $$\rho_{\sigma}$$ $$\sigma_{\sigma}$$ $$\rho_{x}$$ $$\rho_{\zeta}$$ $$\beta_{\zeta_t|x_t}$$ $$\frac{StD(\widehat{\zeta_t})}{StD(\zeta_t)}$$ $$\beta_{\Delta y_{t+1|t}|\zeta_t}$$ $$\beta_{\Delta y_{t+2|t}|\zeta_t}$$ Est. 0.91$$^{***}$$ 0.09$$^{***}$$ 0.77$$^{***}$$ 0.92$$^{***}$$ –30.69$$^{***}$$ 0.71 –0.02$$^{*}$$ –0.02$$^{**}$$ SE 0.05 0.02 0.08 0.09 12.06 – 0.01 0.01 $$\rho_{\sigma}$$ $$\sigma_{\sigma}$$ $$\rho_{x}$$ $$\rho_{\zeta}$$ $$\beta_{\zeta_t|x_t}$$ $$\frac{StD(\widehat{\zeta_t})}{StD(\zeta_t)}$$ $$\beta_{\Delta y_{t+1|t}|\zeta_t}$$ $$\beta_{\Delta y_{t+2|t}|\zeta_t}$$ Est. 0.91$$^{***}$$ 0.09$$^{***}$$ 0.77$$^{***}$$ 0.92$$^{***}$$ –30.69$$^{***}$$ 0.71 –0.02$$^{*}$$ –0.02$$^{**}$$ SE 0.05 0.02 0.08 0.09 12.06 – 0.01 0.01 Data refer to the United States and span the sample 1947:Q1–2016:Q4. All $$t$$-stats are based on GMM Newey-West adjusted standard errors. We jointly estimate the set of Equations (30)–(36) and the following Equations: \begin{align*} \zeta_t &= const +\rho_{\zeta}\zeta_{t-1} + \epsilon_{\zeta,t}\\ \zeta_t &= const + \beta_{\zeta_t|x_t} x_t+ resid_t\\ \Delta y_{t+j|t} &= const+\beta_{y_{t+j|t}|\zeta_t}\zeta_t + resid_t\quad j=1,2, \end{align*} where $$\Delta y_{t+j|t}$$ denotes real output growth over $$j$$ periods. In this table, $$\widehat{\zeta}_t={\widehat{\beta}_{\zeta_t|x_t}}x_t$$. Our five factors are the price-dividend ratio, the 3-month Treasury-bill yield, the 3- and 5-year Treasury bond yields, and the integrated volatility of stock market returns. We denote $$p$$-values smaller than .01, .05, and .10 by $$^{***}$$, $$^{**}$$, and $$^*$$, respectively. Table 3 Time-varying volatility in productivity $$\rho_{\sigma}$$ $$\sigma_{\sigma}$$ $$\rho_{x}$$ $$\rho_{\zeta}$$ $$\beta_{\zeta_t|x_t}$$ $$\frac{StD(\widehat{\zeta_t})}{StD(\zeta_t)}$$ $$\beta_{\Delta y_{t+1|t}|\zeta_t}$$ $$\beta_{\Delta y_{t+2|t}|\zeta_t}$$ Est. 0.91$$^{***}$$ 0.09$$^{***}$$ 0.77$$^{***}$$ 0.92$$^{***}$$ –30.69$$^{***}$$ 0.71 –0.02$$^{*}$$ –0.02$$^{**}$$ SE 0.05 0.02 0.08 0.09 12.06 – 0.01 0.01 $$\rho_{\sigma}$$ $$\sigma_{\sigma}$$ $$\rho_{x}$$ $$\rho_{\zeta}$$ $$\beta_{\zeta_t|x_t}$$ $$\frac{StD(\widehat{\zeta_t})}{StD(\zeta_t)}$$ $$\beta_{\Delta y_{t+1|t}|\zeta_t}$$ $$\beta_{\Delta y_{t+2|t}|\zeta_t}$$ Est. 0.91$$^{***}$$ 0.09$$^{***}$$ 0.77$$^{***}$$ 0.92$$^{***}$$ –30.69$$^{***}$$ 0.71 –0.02$$^{*}$$ –0.02$$^{**}$$ SE 0.05 0.02 0.08 0.09 12.06 – 0.01 0.01 Data refer to the United States and span the sample 1947:Q1–2016:Q4. All $$t$$-stats are based on GMM Newey-West adjusted standard errors. We jointly estimate the set of Equations (30)–(36) and the following Equations: \begin{align*} \zeta_t &= const +\rho_{\zeta}\zeta_{t-1} + \epsilon_{\zeta,t}\\ \zeta_t &= const + \beta_{\zeta_t|x_t} x_t+ resid_t\\ \Delta y_{t+j|t} &= const+\beta_{y_{t+j|t}|\zeta_t}\zeta_t + resid_t\quad j=1,2, \end{align*} where $$\Delta y_{t+j|t}$$ denotes real output growth over $$j$$ periods. In this table, $$\widehat{\zeta}_t={\widehat{\beta}_{\zeta_t|x_t}}x_t$$. Our five factors are the price-dividend ratio, the 3-month Treasury-bill yield, the 3- and 5-year Treasury bond yields, and the integrated volatility of stock market returns. We denote $$p$$-values smaller than .01, .05, and .10 by $$^{***}$$, $$^{**}$$, and $$^*$$, respectively. 3.1.3 Estimation results We summarize our main estimation results as follows. First, the estimated relative volatility, $$\zeta _{t}$$, exhibits significant variation over time. In Table C.1, we report a Wald test for the hypothesis that both volatilities have the same loadings on our predictive factors: $$\beta _{x}^{\sigma }=\beta _{a}^{\sigma }$$. Under this null hypothesis, relative volatility would be constant. We strongly reject this null across all of our specifications. In Figure 3, we plot our constructed time series of $$\zeta _{t}$$ (top panel) and $$\sigma_{a,t}$$ (bottom panel) and use shaded areas to indicate NBER-defined recessions. Our second result refers to the countercyclicality of $$\zeta$$, that is, our relative volatility process tends to spike up right before recessions, and it negatively predicts future economic growth. In Table 3, we report the results of a regression of future output growth, $$\Delta y_{t+j}\equiv \ln Y_{t+j}-\ln Y_{t}$$, on $$\zeta _{t}$$. The regression coefficient is unambiguously negative and statistically significant, consistent with the patterns depicted in Figure 3. Figure 3 View largeDownload slide Volatility factors in productivity ($$\zeta_t$$ and $$ \sigma_{a,t}$$) This figure shows the estimated relative volatility process, $$ \zeta_t$$ (top panel), and the conditional volatility of the productivity short-run shock, $$\sigma_{a,t}$$ (bottom panel), obtained through the methods described in Section 3.1. The main empirical features of these processes are reported in Table 3. Gray bars denote NBER recession periods. Figure 3 View largeDownload slide Volatility factors in productivity ($$\zeta_t$$ and $$ \sigma_{a,t}$$) This figure shows the estimated relative volatility process, $$ \zeta_t$$ (top panel), and the conditional volatility of the productivity short-run shock, $$\sigma_{a,t}$$ (bottom panel), obtained through the methods described in Section 3.1. The main empirical features of these processes are reported in Table 3. Gray bars denote NBER recession periods. Third, the estimated $$x_{t}$$ and $$\zeta _{t}$$ processes are persistent and highly correlated with each other (see Figure C.1 for our estimated $$x$$). In our benchmark estimation, the sample autocorrelations of $$x_{t}$$ and $$\zeta _{t}$$ are $$0.80$$ and $$0.90$$, respectively. The Wald test of the hypothesis that these autocorrelation coefficients are zero has a $$p$$-value smaller than 1% (see Table C.1). The high correlation between relative volatility $$\zeta$$ and expected growth $$x$$ is reflected by the fact that when we project $$\zeta_t$$ on $$x_t$$, we can explain 75% of the standard deviation of relative volatility (Table 3). Given that both $$x_{t}$$ and $$\zeta _{t}$$ have significant predictive powers for future economic growth and that they are highly correlated, our specification of $$\zeta _{t}$$ in Equation (32) is an efficient way to summarize the dynamics of relative volatility without introducing additional state variables into the model. Figure C.1 View largeDownload slide Fitted long-run productivity risk This figure shows the expected productivity growth estimated as detailed in Section 3. The estimation is based on the benchmark specification with the four factors of Bansal and Shaliastovich (2013) and integrated stock market volatility. Quarterly U.S. data start in 1947:Q1 and end in 2016:Q4. Gray bars denote NBER recession periods. Figure C.1 View largeDownload slide Fitted long-run productivity risk This figure shows the expected productivity growth estimated as detailed in Section 3. The estimation is based on the benchmark specification with the four factors of Bansal and Shaliastovich (2013) and integrated stock market volatility. Quarterly U.S. data start in 1947:Q1 and end in 2016:Q4. Gray bars denote NBER recession periods. We use these estimation results to guide our calibration. Specifically, we set $$\beta _{\phi |x}=-30.7$$ according to its point estimate. We also calibrate $$\rho _{\sigma }=0.91$$ and $$\sigma _{\sigma }=0.09$$ to match the point estimates of the autocorrelation coefficient and volatility for the $$\sigma _{a,t}$$ process. We now turn to the quantitative implications of our model. 3.2 Quantitative results 3.2.1 The slope of the term structure of equity returns To illustrate the relationship between the conditional volatility of productivities and the slope of the term structure, we report the implications of our model for the slope of the term structure for different combinations of the conditional volatilities in Table 4. $$RP\left( 7\right) -RP\left( 2\right)$$ is the levered model-implied spread between the expected return on a zero-coupon equity with a 7-year maturity and that with a 2-year maturity. We also report the market equity premium as $$E_{t}\left[ r^{Lev}\right]$$, assuming a financial leverage of 3. Table 4 Productivity volatility factors in our model with learning A. Conditional risk premiums Short-run vol. Low High Long-run relative vol. Low High Low High $$RP_t(7) - RP_t(2)$$ 1.82 $$-$$1.55 5.23 $$-$$6.00 $$E_t[r^{L,ex}]$$ 1.16 $$-$$1.35 4.59 $$-$$5.38 $$E_t[r^L_{K,t+1}-r^L_{S,t+1}]$$ 1.10 $$-$$1.31 4.34 $$-$$5.23 $$MD^S/MD^K$$ 1.9 $$-$$1.9 1.9 $$-$$1.9 A. Conditional risk premiums Short-run vol. Low High Long-run relative vol. Low High Low High $$RP_t(7) - RP_t(2)$$ 1.82 $$-$$1.55 5.23 $$-$$6.00 $$E_t[r^{L,ex}]$$ 1.16 $$-$$1.35 4.59 $$-$$5.38 $$E_t[r^L_{K,t+1}-r^L_{S,t+1}]$$ 1.10 $$-$$1.31 4.34 $$-$$5.23 $$MD^S/MD^K$$ 1.9 $$-$$1.9 1.9 $$-$$1.9 B. Conditional second moments and CAPM Maturity Vol. SR $$\alpha$$ $$\beta$$ $$\partial \beta/\partial x$$ Spot equity excess returns 2 8.46 0.37 4.21 $$-$$0.81$$-$$ $$-$$257.8$$-$$ 7 5.73 0.23 0.31 0.73 $$-$$46.7$$-$$ Forward equity excess returns 2 8.37 0.38 4.22 $$-$$0.79$$-$$ $$-$$255.5$$-$$ 7 5.71 0.26 0.42 0.78 $$-$$31.7$$-$$ Bonds excess returns 2 0.16 $$-$$0.26$$-$$ $$-$$0.02$$-$$ $$-$$0.02$$-$$ $$-$$2.3$$-$$ 7 0.70 $$-$$0.25$$-$$ $$-$$0.18$$-$$ $$-$$0.05$$-$$ $$-$$15.0$$-$$ B. Conditional second moments and CAPM Maturity Vol. SR $$\alpha$$ $$\beta$$ $$\partial \beta/\partial x$$ Spot equity excess returns 2 8.46 0.37 4.21 $$-$$0.81$$-$$ $$-$$257.8$$-$$ 7 5.73 0.23 0.31 0.73 $$-$$46.7$$-$$ Forward equity excess returns 2 8.37 0.38 4.22 $$-$$0.79$$-$$ $$-$$255.5$$-$$ 7 5.71 0.26 0.42 0.78 $$-$$31.7$$-$$ Bonds excess returns 2 0.16 $$-$$0.26$$-$$ $$-$$0.02$$-$$ $$-$$0.02$$-$$ $$-$$2.3$$-$$ 7 0.70 $$-$$0.25$$-$$ $$-$$0.18$$-$$ $$-$$0.05$$-$$ $$-$$15.0$$-$$ All entries are obtained from our benchmark model augmented by time-varying volatility factors, as described in Section 3. Our baseline annual calibration is reported in Table 1, and the additional parameters are specified in Section 3. Excess returns are levered by a factor of three, consistent with Garcia-Feijo and Jorgensen (2010). In panel B, all entries refer to the case of low short-run volatility risk and high relative volatility. SR denotes the Sharpe ratio and $$\alpha$$ and $$\beta$$ are obtained from a conditional CAPM regression. The forward equity excess return is obtained by going long in zero-coupon equity and short in a bond of the same maturity. Table 4 Productivity volatility factors in our model with learning A. Conditional risk premiums Short-run vol. Low High Long-run relative vol. Low High Low High $$RP_t(7) - RP_t(2)$$ 1.82 $$-$$1.55 5.23 $$-$$6.00 $$E_t[r^{L,ex}]$$ 1.16 $$-$$1.35 4.59 $$-$$5.38 $$E_t[r^L_{K,t+1}-r^L_{S,t+1}]$$ 1.10 $$-$$1.31 4.34 $$-$$5.23 $$MD^S/MD^K$$ 1.9 $$-$$1.9 1.9 $$-$$1.9 A. Conditional risk premiums Short-run vol. Low High Long-run relative vol. Low High Low High $$RP_t(7) - RP_t(2)$$ 1.82 $$-$$1.55 5.23 $$-$$6.00 $$E_t[r^{L,ex}]$$ 1.16 $$-$$1.35 4.59 $$-$$5.38 $$E_t[r^L_{K,t+1}-r^L_{S,t+1}]$$ 1.10 $$-$$1.31 4.34 $$-$$5.23 $$MD^S/MD^K$$ 1.9 $$-$$1.9 1.9 $$-$$1.9 B. Conditional second moments and CAPM Maturity Vol. SR $$\alpha$$ $$\beta$$ $$\partial \beta/\partial x$$ Spot equity excess returns 2 8.46 0.37 4.21 $$-$$0.81$$-$$ $$-$$257.8$$-$$ 7 5.73 0.23 0.31 0.73 $$-$$46.7$$-$$ Forward equity excess returns 2 8.37 0.38 4.22 $$-$$0.79$$-$$ $$-$$255.5$$-$$ 7 5.71 0.26 0.42 0.78 $$-$$31.7$$-$$ Bonds excess returns 2 0.16 $$-$$0.26$$-$$ $$-$$0.02$$-$$ $$-$$0.02$$-$$ $$-$$2.3$$-$$ 7 0.70 $$-$$0.25$$-$$ $$-$$0.18$$-$$ $$-$$0.05$$-$$ $$-$$15.0$$-$$ B. Conditional second moments and CAPM Maturity Vol. SR $$\alpha$$ $$\beta$$ $$\partial \beta/\partial x$$ Spot equity excess returns 2 8.46 0.37 4.21 $$-$$0.81$$-$$ $$-$$257.8$$-$$ 7 5.73 0.23 0.31 0.73 $$-$$46.7$$-$$ Forward equity excess returns 2 8.37 0.38 4.22 $$-$$0.79$$-$$ $$-$$255.5$$-$$ 7 5.71 0.26 0.42 0.78 $$-$$31.7$$-$$ Bonds excess returns 2 0.16 $$-$$0.26$$-$$ $$-$$0.02$$-$$ $$-$$0.02$$-$$ $$-$$2.3$$-$$ 7 0.70 $$-$$0.25$$-$$ $$-$$0.18$$-$$ $$-$$0.05$$-$$ $$-$$15.0$$-$$ All entries are obtained from our benchmark model augmented by time-varying volatility factors, as described in Section 3. Our baseline annual calibration is reported in Table 1, and the additional parameters are specified in Section 3. Excess returns are levered by a factor of three, consistent with Garcia-Feijo and Jorgensen (2010). In panel B, all entries refer to the case of low short-run volatility risk and high relative volatility. SR denotes the Sharpe ratio and $$\alpha$$ and $$\beta$$ are obtained from a conditional CAPM regression. The forward equity excess return is obtained by going long in zero-coupon equity and short in a bond of the same maturity. We choose two levels of the volatility of the contemporaneous productivity shock, $$5.65\%$$ for the high-volatility regime, which corresponds to the average volatility for the pre-World War II period, and $$2.83\%$$ for the low-volatility regime, which corresponds to the post-World War II period. We set the low relative volatility to be $$4.5\%$$ and the high relative volatility to be $$5\%$$ to illustrate the effect of relative volatility. The first column of Table 4, panel A (“Low/Low”), refers to a case of moderation in both short-run volatility and long-run risk. In this setting, our model delivers an upward-sloping term structure over the 7-year horizon, as long-run news is not sizeable enough to make the term structure downward sloping over short maturities. Both the conditional value premium and the conditional aggregate equity premium are below average due to the assumed moderation scenario. In the second column of Table 4, panel A (“Low/High”), we consider a scenario in which the relative volatility of long-run risk is 25% higher than in the low state. In this case, the long-run shocks are sizable enough to make short-term dividends risker than dividends with a maturity of seven years, that is, the term structure slopes downward. Both the conditional value premium and the conditional equity premium, in contrast, increase with respect to the figures reported in the first column. The next two columns in panel A focus on the case of higher short-run volatility. An increase in short-run volatility simultaneously (1) magnifies both the equity and the value premium and (2) expands the absolute value of the term structure spread. Hence, the sign of the term structure spread solely depends on relative volatility, whereas the magnitude of the spread depends on the amount of short-run volatility. We formally test this model-implied relationship between relative volatility and the slope of the term structure in Section 4. 3.2.2 Implications for the cross-section Empirically, high book-to-market-ratio stocks (value stocks) earn a higher average return than low book-to-market stocks (growth stocks). Because the cash flow of value stocks has a shorter duration than that of growth stocks (Da 2009; Dechow, Sloan, and Soliman 2004), in endowment economies where value and growth are both claims to shares of aggregate dividend, the existence of the value premium requires a downward-sloping term structure of equity returns (Lettau and Wachter 2007; Santos and Veronesi 2010; Lettau and Wachter 2011; Croce, Lettau, and Ludvigson 2015). For these models, the Binsbergen et al. (2013) evidence on the time-varying sign of the slope of the term structure of equity return is challenging. Specifically, the upward-sloping term structure observed during boom periods would imply a growth premium, as opposed to the value premium that we observe in the data. Our production-based economy is not subject to this problem, as the value premium depends on endogenous, heterogeneous exposure of tangible and intangible capital to fundamental shocks. In contrast to prior literature, duration is not the key determinant of risk premiums. More specifically, as we have shown in Section 2, our model generates a value premium because growth options have endogenously lower exposure to news shocks than value stocks. This feature remains unchanged with time-varying volatility. In panel A of Table 4, we report our model-implied spread between value and growth stocks, $$E\left[ r_{K}^{L}-r_{S}^{L}\right]$$, for different combinations of conditional volatilities. We also report the Macaulay duration of the cash flows of value and growth portfolios implied by our model, where the Macaulay duration is calculated using the steady-state discount rate. The existence of the value premium and the negative relation between expected returns and duration in the cross-section of stocks are a robust outcome across all scenarios, regardless of the sign of the slope of the term structure. 3.2.3 Conditional second moments and CAPM Binsbergen, Brandt, and Koijen (2012), Binsbergen et al. (2013), and Binsbergen and Koijen (2017) document several facts regarding the term structure of equity returns. First, both the risk premium and Sharpe ratio for short-maturity claims to zero-coupon equity are higher than for the aggregate stock market (Binsbergen and Koijen 2017). Second, the returns on short-term dividend claims are risky as measured by volatility, but safe as measured by market betas (Binsbergen and Koijen 2017). Third, the CAPM $$\beta$$ of claims to aggregate dividends is countercyclical and this time variation of $$\beta$$ decreases with maturity (Binsbergen et al. 2013). These results are documented around the period of the Great Recession, which according to our estimation is a period with low volatility of short-run productivity shocks (post-World War II) and higher long-run volatility. For consistency, we simulate our model conditioning on this combination of values for our volatility state processes, and we report the model-implied moments in panel B of Table 4. Binsbergen and Koijen (2017) report data on both the returns on claims to equity dividend strips (spot equity returns) and those on their futures contract (forward equity returns). Forward returns are just the spot equity returns less the returns of a bond of equal maturity, and they help in separating the effect of the term structure of interest rates from the effect of the term structure of the equity premium. We report key moments for all of these returns in panel B of Table 4 and make the following observations. First, short-term dividends in our model have a higher risk premium as well as a higher Sharpe ratio, consistent with the pattern reported in Binsbergen and Koijen (2017). These features of our model are due to short-term dividends that have a larger exposure to news shocks, which require a higher market price of risk than contemporaneous productivity shocks. Second, short-term dividends have a higher return volatility, but a lower CAPM $$\beta$$. In our model, short-term dividends have a higher volatility because the effect of productivity shocks on cash flows decays over time, as shown in the impulse response functions in Figure 4. In addition, the failure of CAPM can be explained by the presence of multiple shocks in our model. Note that under recursive preferences, all three shocks $$\left[\varepsilon _{a,t+1},\varepsilon _{x,t+1},\varepsilon _{\sigma ,t+1}\right]$$ carry a risk premium, but they are independent of each other and have different market prices of risk. Short-term dividends are more sensitive to news shocks, which carry a high market price of risk but do not generate very volatile responses in the returns. As a result, short-term dividends have a high Sharpe ratio, but a low CAPM $$\beta$$ and high $$\alpha$$ compared to longer-maturity cash flows. Figure 4 View largeDownload slide Volatility factors and term structure slope This figure shows both the realized (Binsbergen et al. 2013) and the estimated term structure spread (TSS) between 7- and 2-year zero-coupon equities. The fitted TSS values are obtained by estimating different versions of Equation (38) over the sample 2002:Q4–2010:Q4. Values outside of this sample period are based on extrapolations. The relative volatility process, $$\zeta_t$$, and the conditional volatility of the productivity short-run shock, $$\sigma_{a,t}$$ (denoted as SR vol.), are obtained through the methods described in Section 3. Gray bars denote NBER recession periods. Figure 4 View largeDownload slide Volatility factors and term structure slope This figure shows both the realized (Binsbergen et al. 2013) and the estimated term structure spread (TSS) between 7- and 2-year zero-coupon equities. The fitted TSS values are obtained by estimating different versions of Equation (38) over the sample 2002:Q4–2010:Q4. Values outside of this sample period are based on extrapolations. The relative volatility process, $$\zeta_t$$, and the conditional volatility of the productivity short-run shock, $$\sigma_{a,t}$$ (denoted as SR vol.), are obtained through the methods described in Section 3. Gray bars denote NBER recession periods. Third, consistent with the evidence provided by Binsbergen et al. (2013), the CAPM $$\beta$$s in our model are countercyclical. In the rightmost column of panel B, Table 4, we report the model-implied sensitivity of CAPM $$\beta$$ with respect to the expected growth rate of the economy. All partial derivatives are clearly negative for equities. They also decrease with respect to maturity, like in Binsbergen et al. (2013), because the effect of news shocks decays over time, as shown in the impulse response functions in Figure 2. Consistent with the evidence provided in Binsbergen and Koijen (2017), these results also apply to forward equity excess returns, because they are driven by the term structure of the risk premium and not by that of the risk-free interest rate. 4. Testable Implications of the Learning Mechanism The key features that distinguish our setting from the standard RBC model are the learning mechanism and time-varying relative volatility of long-run versus short-run shocks. In this section, we formally test several implications of these features of our model. 4.1.4 News shocks and payouts The key implication of the learning mechanism in our model is the response of investment and dividends to news shocks. In the standard RBC model, investment responds positively, and therefore dividends respond negatively, to news shocks. The opposite happens in our model for the term structure of equity returns. In this section, we directly test this implication of our model using evidence on macroeconomic quantities. We show that the aggregate payout has a negative exposure to short-run shocks but a positive exposure to growth news shocks, like in our setting with learning. We proceed in two steps. First, we measure aggregate dividends using the accounting identity implied by our model, $$D_t= Y_t-I_t -J_t- W_tN_t$$. Because our model abstracts away from leverage and capital structure decisions, $$D_t$$ in our model cannot be directly compared to stock market dividends. We therefore use the model to guide our empirical measurement. Both output, $$Y_t$$, and investment, $$I_t+J_t$$, are from table 1.1.5 of the NIPA system. We exclude both government expenditure and net exports, to be consistent with the model, and use the CPI index for all urban consumers to obtain real values. Like in Choi and Rois-Rull (2009), we estimate labor income to be 65% of total output. For robustness, we also consider the aggregate dividends reported in the Flow of Funds Accounts dataset for nonfinancial firms over the sample period 1952:Q1–2016:Q4, in the spirit of Belo, Colin-Dufresne, and Goldstein (2015).9 Due to data limitations, we focus on quarterly observations that are available only starting from 1947:Q1. To maximize sample length, our data include observations through 2016:Q4. Our main results are robust to the exclusion of the Great Recession period. In our second step, we estimate the following equation: \begin{align} Z_t - E_{t-1}\left[Z_t\right]&= \beta_{srr}e^{\sigma_{a,t-1}}\epsilon_{a,t}+ \beta_{lrr}e^{\sigma_{a,t-1}+\zeta_{t-1}}\epsilon_{x,t} + \beta_{vol}\epsilon_{\sigma,t}\notag\\ &\quad+ \beta_{rel\_vol}\epsilon_{\zeta,t}+ {resid}_t, \label{Eq: Empirical Regression} \\ E_{t-1}\left[Z_t\right] &= \beta_0 + \rho Z_{t-1} + \beta_{x} x_{t-1} + \beta_{\sigma} \sigma_{a,t-1} + \beta_{\zeta}\zeta_{t-1}, \notag \end{align} (37) where $$Z_t$$ is either the investment-to-output ratio, $$\frac{I_t}{Y_t}$$, or the dividends-to-output ratio, $$\frac{D_t}{Y_t}$$. We divide our main variables by output for three reasons: (i) since $$D_t$$ can be negative, we cannot just focus on growth rates; (ii) this is a common way to detrend our variables; and (iii) according to the model, it does not affect our ability to identify the sensitivity of our variables to news shocks, as total output is nearly invariant upon the arrival of pure news shocks. In the model, a linear approximation of the equilibrium dividend and investment processes suggests the dependence of these variables on both contemporaneous productivity innovations ($$\epsilon_{a,t},\epsilon_{x,t},\epsilon_{\sigma,t},\epsilon_{\zeta,t}$$) and predetermined state variables. For the sake of parsimony, we use the lagged values of either $$\frac{I_{t-1}}{Y_{t-1}}$$ or $$\frac{D_{t-1}}{Y_{t-1}}$$ to capture the role of the endogenous state variables (i.e., capital stocks) to avoid additional measurement errors. Under the null of the model, this is an innocuous assumption. We also control for the predetermined value of the long-run component, $$x_{t-1}$$, relative long-run volatility, $$\zeta_{t-1}$$, and short-run conditional volatility, $$\sigma_{a,t-1}$$. Our main findings are reported in panel A of Table 5. Since neither our dividends series nor our regressors are standardized, magnitudes are not directly comparable. As a result, we only discuss the sign of our estimates. The data suggest that the response of the aggregate payout to short-run news is negative, as predicted by standard production-economy models. Most importantly, the immediate response of aggregate investment to long-run news is negative, implying that the aggregate payout increases with positive new shocks, consistent with our model. Table 5 News shocks, payout, and asset prices A. Payout exposures $$\beta_{srr}$$ $$\beta_{lrr}$$ $$\beta_{vol}$$ $$\beta_{rel\_vol}$$ Adj. $$R^2$$ Aggregate investment 0.078$$^{***}$$ –0.150$$^{***}$$ –0.097$$^{***}$$ –0.090$$^{***}$$ 0.937 (0.014) (0.064) (0.033) (0.039) Aggregate payout –0.051$$^{***}$$ 0.160$$^{***}$$ 0.098$$^{***}$$ 0.099$$^{***}$$ 0.938 (0.015) (0.064) (0.034) (0.040) FoF dividends –0.020$$^{**}$$ 0.022$$^{*}$$ 0.004 –0.007 0.884 (0.012) (0.035) (0.025) (0.027) A. Payout exposures $$\beta_{srr}$$ $$\beta_{lrr}$$ $$\beta_{vol}$$ $$\beta_{rel\_vol}$$ Adj. $$R^2$$ Aggregate investment 0.078$$^{***}$$ –0.150$$^{***}$$ –0.097$$^{***}$$ –0.090$$^{***}$$ 0.937 (0.014) (0.064) (0.033) (0.039) Aggregate payout –0.051$$^{***}$$ 0.160$$^{***}$$ 0.098$$^{***}$$ 0.099$$^{***}$$ 0.938 (0.015) (0.064) (0.034) (0.040) FoF dividends –0.020$$^{**}$$ 0.022$$^{*}$$ 0.004 –0.007 0.884 (0.012) (0.035) (0.025) (0.027) B. Time-varying volatility factors and asset prices $$R^{ex}_{mkt}$$ $$HML$$ $$TSS$$ Rel.vol. 0.423$$^{*}$$ 0.080 –0.630$$^{***}$$ (0.273) (0.219) (0.092) SR-vol. 0.419 $$^{***}$$ 0.095 0.528$$^{***}$$ (0.173) (0.144) (0.045) Adj. $$R^2$$ 0.027 –0.009 0.699 Adj. $$R^2$$ (no SR-vol.) 0.010 –0.006 0.463 Sample: 1947:Q3–2016:Q4 1947:Q3–2016:Q4 2002:Q4–2010:Q4 B. Time-varying volatility factors and asset prices $$R^{ex}_{mkt}$$ $$HML$$ $$TSS$$ Rel.vol. 0.423$$^{*}$$ 0.080 –0.630$$^{***}$$ (0.273) (0.219) (0.092) SR-vol. 0.419 $$^{***}$$ 0.095 0.528$$^{***}$$ (0.173) (0.144) (0.045) Adj. $$R^2$$ 0.027 –0.009 0.699 Adj. $$R^2$$ (no SR-vol.) 0.010 –0.006 0.463 Sample: 1947:Q3–2016:Q4 1947:Q3–2016:Q4 2002:Q4–2010:Q4 In panel A, data are from the United States and span the sample 1947:Q1–2016:Q4. The statistics reported refer to the regression specified in Equation (37). In panel B, we report the estimates from the following regressions: \begin{align*} Z_{t+1} &= const. + \beta^z_{x}x_{t} + \beta^z_{\phi}\phi_{t} + \beta^z_{\sigma_a}\sigma_{a,t} + resid, \quad Z\in\{R^{ex}_{mkt}; HML\},\\ TSS_{t} &= const. + \beta^{TSS}_{x}x_{t} + \beta^{TSS}_{\phi}\phi_{t} + \beta^{TSS}_{\sigma_a}\sigma_{a,t}\cdot\text{sign}(TSS_t) +resid, \end{align*} where $$R^{ex}_{mkt}$$ and $$HML$$ are the Fama-French quarterly market excess return and HML factors, respectively. $$TSS$$ denotes the term structure spread between 7- and 2-year zero-coupon equities (Binsbergen et al. 2013). The factors $$x_t$$, $$\sigma_{a,t}$$, and $$\zeta_t$$ are estimated according to the procedure described in Section 3. Our estimates are from the five-factor specification with volatility estimated through projection methods. The sign$$(TSS_t)$$ term accounts for the opposite impact of volatility on the spread depending on whether the term structure is either upward or downward sloping. For each regression, we report (1) our estimates for the exposure to both short-run conditional volatility ($$\sigma_{a,t}$$) and relative volatility ($$\zeta_t$$); (2) the $$p$$-value associated with the null that the signs of our exposure coefficients are opposite to those estimated (we denote $$p$$-values smaller than 1%, 5%, and 10% by $$^{***}$$, $$^{**}$$, and $$^*$$, respectively); and (3) the adjusted $$R^2$$ from each regression with and without the inclusion of short-run volatility ($$\sigma_t$$). Numbers in parentheses are GMM Newey-West adjusted standard errors. Table 5 News shocks, payout, and asset prices A. Payout exposures $$\beta_{srr}$$ $$\beta_{lrr}$$ $$\beta_{vol}$$ $$\beta_{rel\_vol}$$ Adj. $$R^2$$ Aggregate investment 0.078$$^{***}$$ –0.150$$^{***}$$ –0.097$$^{***}$$ –0.090$$^{***}$$ 0.937 (0.014) (0.064) (0.033) (0.039) Aggregate payout –0.051$$^{***}$$ 0.160$$^{***}$$ 0.098$$^{***}$$ 0.099$$^{***}$$ 0.938 (0.015) (0.064) (0.034) (0.040) FoF dividends –0.020$$^{**}$$ 0.022$$^{*}$$ 0.004 –0.007 0.884 (0.012) (0.035) (0.025) (0.027) A. Payout exposures $$\beta_{srr}$$ $$\beta_{lrr}$$ $$\beta_{vol}$$ $$\beta_{rel\_vol}$$ Adj. $$R^2$$ Aggregate investment 0.078$$^{***}$$ –0.150$$^{***}$$ –0.097$$^{***}$$ –0.090$$^{***}$$ 0.937 (0.014) (0.064) (0.033) (0.039) Aggregate payout –0.051$$^{***}$$ 0.160$$^{***}$$ 0.098$$^{***}$$ 0.099$$^{***}$$ 0.938 (0.015) (0.064) (0.034) (0.040) FoF dividends –0.020$$^{**}$$ 0.022$$^{*}$$ 0.004 –0.007 0.884 (0.012) (0.035) (0.025) (0.027) B. Time-varying volatility factors and asset prices $$R^{ex}_{mkt}$$ $$HML$$ $$TSS$$ Rel.vol. 0.423$$^{*}$$ 0.080 –0.630$$^{***}$$ (0.273) (0.219) (0.092) SR-vol. 0.419 $$^{***}$$ 0.095 0.528$$^{***}$$ (0.173) (0.144) (0.045) Adj. $$R^2$$ 0.027 –0.009 0.699 Adj. $$R^2$$ (no SR-vol.) 0.010 –0.006 0.463 Sample: 1947:Q3–2016:Q4 1947:Q3–2016:Q4 2002:Q4–2010:Q4 B. Time-varying volatility factors and asset prices $$R^{ex}_{mkt}$$ $$HML$$ $$TSS$$ Rel.vol. 0.423$$^{*}$$ 0.080 –0.630$$^{***}$$ (0.273) (0.219) (0.092) SR-vol. 0.419 $$^{***}$$ 0.095 0.528$$^{***}$$ (0.173) (0.144) (0.045) Adj. $$R^2$$ 0.027 –0.009 0.699 Adj. $$R^2$$ (no SR-vol.) 0.010 –0.006 0.463 Sample: 1947:Q3–2016:Q4 1947:Q3–2016:Q4 2002:Q4–2010:Q4 In panel A, data are from the United States and span the sample 1947:Q1–2016:Q4. The statistics reported refer to the regression specified in Equation (37). In panel B, we report the estimates from the following regressions: \begin{align*} Z_{t+1} &= const. + \beta^z_{x}x_{t} + \beta^z_{\phi}\phi_{t} + \beta^z_{\sigma_a}\sigma_{a,t} + resid, \quad Z\in\{R^{ex}_{mkt}; HML\},\\ TSS_{t} &= const. + \beta^{TSS}_{x}x_{t} + \beta^{TSS}_{\phi}\phi_{t} + \beta^{TSS}_{\sigma_a}\sigma_{a,t}\cdot\text{sign}(TSS_t) +resid, \end{align*} where $$R^{ex}_{mkt}$$ and $$HML$$ are the Fama-French quarterly market excess return and HML factors, respectively. $$TSS$$ denotes the term structure spread between 7- and 2-year zero-coupon equities (Binsbergen et al. 2013). The factors $$x_t$$, $$\sigma_{a,t}$$, and $$\zeta_t$$ are estimated according to the procedure described in Section 3. Our estimates are from the five-factor specification with volatility estimated through projection methods. The sign$$(TSS_t)$$ term accounts for the opposite impact of volatility on the spread depending on whether the term structure is either upward or downward sloping. For each regression, we report (1) our estimates for the exposure to both short-run conditional volatility ($$\sigma_{a,t}$$) and relative volatility ($$\zeta_t$$); (2) the $$