Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

When is discretionary fiscal policy effective?

When is discretionary fiscal policy effective? 1IntroductionSince the Global Financial Crisis, there has been worldwide resurgence in the use of discretionary fiscal policy. Both stimulus and austerity have been enacted in many countries, with policies often being a mix of tax and spending changes. By its nature, “discretionary” implies choice, including choice about timing. Thus, if the effects of discretionary fiscal policy depend nonlinearly on economic conditions at the time when the policy is undertaken, it opens up important questions about when different policies would be comparatively more or less effective, questions that would simply not be relevant under linearity.This paper addresses three key questions about the timing and type of discretionary fiscal policy: (i) When do discretionary government spending increases and tax cuts provide more or less effective stimulus to the economy? (ii) Do the effects of government spending differ from the effects of taxes? (iii) Is austerity more or less effective than stimulus? In answering these questions, we make three contributions to the literature on nonlinear state-dependent effects of fiscal policy.First, we examine the exact nature and robustness of state dependence in the effectiveness of fiscal policy by considering an informationally-sufficient medium-scale threshold vector autoregressive (TVAR) model and by comparing and testing many different possible threshold variables. Using U.S. data and identifying government spending and tax shocks via sign restrictions, we find strong empirical support for nonlinearity related to economic slack in the relationship between government spending and aggregate output and between taxes and aggregate output, both when considering dollar-for-dollar and cumulative multiplier responses. The measure of economic slack that we find most closely relates to the nonlinear relationship between fiscal policy and aggregate output is the model-averaged output gap developed by Morley and Panovska (2019) based on earlier research by Morley and Piger (2012). The model averaging approach addresses uncertainty about the appropriate forecasting model for aggregate output by averaging implied estimates of the output gap from forecast-based trend-cycle decompositions for a large set of similarly-fitting reduced-form time series models. However, it is important to note that we find generally robust results in terms of the timing and implications of the nonlinearity for various conventionally used measures of slack. Meanwhile, we show that structural shocks identified from our medium-scale model pass the conventional informational sufficiency tests. Thus, our results suggest that previous findings in favor of nonlinearity are not simply due to omitted variables or a failure to account for fiscal foresight.Second, using evolving-regime generalized impulse response analysis, we demonstrate that tax cuts and government spending increases have similarly large expansionary effects during deep recessions and sluggish recoveries, but they are much less effective, especially in the case of government spending increases, when the economy is in a robust expansion. Meanwhile, tax increases and government spending cuts are most contractionary during deep recessions, and, as a result, are largely self-defeating if the goal of fiscal austerity implemented during an economic crisis is to bring down the debt-to-GDP ratio. Overall, we find that austerity has larger dollar-for-dollar effects on output than stimulus across the business cycle.Third, we investigate and determine the roles of consumption and investment in driving the effects of both government spending and taxes on aggregate output. Our results imply that the effectiveness of discretionary government spending shocks, including its state dependence, is almost entirely due to the response of consumption. The responses of both consumption and investment to discretionary tax changes are state dependent, but investment plays the larger quantitative role.The rest of our paper is organized as follows. Section 2 reviews the relevant previous literature. Section 3 presents our empirical model. Section 4 examines the evidence for nonlinearity and state dependence in the effects of government spending and taxes on aggregate output. Section 5 reports evolving-regime impulse response analysis to investigate when discretionary changes in government spending and taxes are comparatively more or less effective. Section 6 explores the roles of consumption and investment in driving the state-dependent effects of fiscal policy on aggregate output. Section 7 concludes.The technical estimation details are provided in Appendix A. Some additional results are provided in the online Supplementary Appendix B.2Previous literature2.1Nonlinear effects of fiscal policyRecent theoretical research highlights potential channels through which fiscal policy shocks transmit nonlinearly. Michaillat (2014) shows that public employment can have much larger multiplier effects when the unemployment rate is high than when it is low. Canzoneri et al. (2016) emphasize the role of the credit channel, and McManus et al. (2018) emphasize the importance of the credit channel when credit constraints are occasionally binding and endogenous. Gali et al. (2007) showed that a fiscal multiplier could be large when the ratio of rule-of-thumb consumers is large and stimulus policies work primarily through the consumption channel. Leeper et al. (2017) showed that fiscal multipliers can be persistently high when government spending interacts favorably with consumer preferences.The empirical literature on fiscal policy multipliers has grown rapidly and has many different strands. Our analysis builds on and merges several of these. Most closely related, a number of studies with smaller-scale nonlinear vector autoregressive (VAR) models find state-dependent effects of discretionary changes in government spending – see, for example, Auerbach and Gorodnichenko (2012), Auerbach and Gorodnichenko (2013), Bachmann and Sims (2012), Baum et al. (2012), Caggiano et al. (2015), Candelon and Lieb (2013), Fazzari et al. (2015), (FMP henceforth), and Mumtaz and Sunder-Plassmann (2019). However, the existence of state dependence does not seem settled. Studies using a narrative approach and military spending shocks to identify the effects of government spending often find little support for state dependence–see, for example, Owyang et al. (2013) and Ramey and Zubairy (2018).Some recent empirical studies have also considered asymmetries in the effects of stimulus versus austerity, mostly in a sign-dependent framework. Jones et al. (2015) find that tax cuts have significant positive effects on US output, while tax increases have no substantial negative effects, but these results are reversed for the UK. Barnichon and Matthes (2017) find that government spending cuts have larger effects than increases, with the results driven primarily by very strong negative responses of output to government spending decreases during recessions. Guajardo et al. (2014) and Jorda and Taylor (2016) find large decreases in output in response to exogenous fiscal consolidations. Alesina et al. (2015) show that for a panel of 16 OECD countries, fiscal consolidations based on spending cuts are less costly in terms of output loss than consolidations based on tax increases. In a state-dependent framework, Klein (2017) finds that austerity has large negative effects on output when the level of private debt is high.2.2Shock identificationDifferent strands of the fiscal literature have also taken varied approaches to shock identification. The three most popular are the timing approach, the narrative approach, and the sign restriction approach.Variations of the timing approach are used by, for example, Blanchard and Perotti (2002), Auerbach and Gorodnichenko (2012), and FMP. The timing approach entails imposing a restrictions such as, for example, government spending not responding to business cycle shocks within a quarter.The narrative approach uses government spending shocks or tax shocks constructed by examining historical announcements about changes in government spending and taxes unrelated to the business cycle or overall economic conditions. Ramey (2011), Owyang et al. (2013), Ramey and Zubairy (2018), Cloyne (2013), Romer and Romer (2010), and Jones et al. (2015), inter alia, use the narrative approach, sometimes combined with timing restrictions. However, many studies that use the narrative approach consider only military spending shocks or narrative measures limited to large consolidations. This means that many of the observations for the narrative shock series are equal to zero for a large part of the sample, which makes exploring state dependence challenging econometrically.The sign restriction approach defines the number of structural shocks of interest (which can be smaller than the number of variables in the VAR model) and restricts the sign of the response of variables over particular horizons. This approach is usually considered more agnostic than the timing approach because it effectively nests the timing restrictions. In a linear setting, Mountford and Uhlig (2009) find that deficit-financed tax cuts increase output more than deficit-financed increases in government spending. Candelon and Lieb (2013) extend the model to a nonlinear setting, and find that there is strong evidence of nonlinearity in the response of output to government spending shocks, but that the multipliers are always lower than one.Despite the rapidly growing fiscal literature, few studies have formally considered whether both the effects of government spending and taxes are nonlinear in a joint model or whether state dependence could imply sign asymmetry. Notably, a theoretical model with endogenous credit constraints would imply both state dependence and sign dependence. For example, in a model in the spirit of (McManus et al., 2018), a cut in transfers to impatient households that keeps the households constrained would have larger effects than an increase. Because our medium-scale model described in the next section embeds detailed information about fiscal and other macroeconomic variables, we are able to address a number of potential problems in identifying discretionary government spending and tax changes shocks separately, while evolving-regime generalized impulse responses presented in Section 5 allow us to consider the presence of sign asymmetry under state dependence.3A medium-scale TVAR model3.1Reduced-form model and estimation methodWe construct a TVAR model and consider different possible threshold variables. Let Ytdenote the vector containing the endogenous variables. The TVAR model splits the stochastic process for Ytinto two different regimes. Within each regime, the process for Ytis linear, but Ytcan evolve endogenously between regimes. Let qt−ddenote the threshold variable that determines the prevailing regime, where the integer d is the delay lag for a regime switch. If the threshold variable qt−dcrosses c at time t−d, the dynamics of the TVAR model change at time t. Defining an indicator function I[.] that equals 1 when qt−dexceeds the threshold c and 0 otherwise, the full model can be written in a single equation as(1)Yt=Φ01+Φ11(L)Yt−1+(Φ02+Φ12(L)Yt−1)I[qt−d>c]+εt.$${Y}_{t}={\text{{\Phi}}}_{0}^{1}+{\text{{\Phi}}}_{1}^{1}\left(L\right){Y}_{t-1}+\left({\text{{\Phi}}}_{0}^{2}+{\text{{\Phi}}}_{1}^{2}\left(L\right){Y}_{t-1}\right)I\left[{q}_{t-d}{ >}c\right]+{\varepsilon }_{t}.$$The dynamics of the system when qt−dis below c are given by Φ01${\text{{\Phi}}}_{0}^{1}$and the lag polynomial matrix Φ11(L)${\text{{\Phi}}}_{1}^{1}\left(L\right)$, and by Φ02${\text{{\Phi}}}_{0}^{2}$and the lag polynomial matrix Φ12(L)${\text{{\Phi}}}_{1}^{2}\left(L\right)$when qt−dis above c. The disturbances εt${\varepsilon }_{t}$are assumed to be nid with mean zero and variance-covariance matrix Σ that is assumed fixed across regimes.We show that our results are robust to relaxing this assumption in Supplementary Appendix B.For our benchmark specification, Ytincludes nine variables: log real federal consumption and investment spending, log real federal transfer payments to persons, log real federal interest payments on debt, log real transfer taxes, log of other tax revenues in real terms, log real GDP, a measure of slack, an interest rate (measured using the Federal Funds Rate or the Wu and Xia (2016), shadow rate during the zero-lower-bound period), and inflation (calculated using the GDP deflator). The sample period for the benchmark model is 1967Q1–2015Q4. All fiscal variables are converted to real terms using the GDP deflator, and all nominal series were obtained from NIPA-BEA.By focusing on federal variables only, we are able to trace out the impact on public debt, a variable of obvious interest in debates about fiscal policy. In particular, if the total federal debt at time t is Dt, thenDt=Dt−1+Gt+Gttransfer+Gtinterest−Tttransfer−Ttother$${D}_{t}={D}_{t-1}+{G}_{t}+{G}_{t}^{transfer}+{G}_{t}^{interest}-{T}_{t}^{transfer}-{T}_{t}^{other}$$and the debt-to-GDP ratio can be calculated asdt=dt−1*Yt−1/Yt+Gt+Gttransfer+Gtinterest−Tttransfer−TtotherYt,$${d}_{t}={d}_{t-1}\text{{\ast}}{Y}_{t-1}/{Y}_{t}+\frac{{G}_{t}+{G}_{t}^{transfer}+{G}_{t}^{interest}-{T}_{t}^{transfer}-{T}_{t}^{other}}{{Y}_{t}}\text{,}$$where dtis the ratio at time t.An alternative way to track debt would be to account for the evolution of interest rates and inflation and to directly account for debt in the VAR, as in Favero and Giavazzi (2012). The results for a model with debt in the VAR are discussed in Supplementary Appendix B.Most fiscal stimulus or austerity that involves discretionary changes in government spending is usually implemented by changes in government consumption or investment. Nonetheless, transfer and interest payments may have sizable effects on the debt-to-output ratio. Furthermore, government transfer payments to persons are strongly affected by the state of the business cycle and respond to movements in output. Changes in transfer payments are occasionally used as a fiscal policy tool (a notable example is the extension of unemployment benefits during the Great Recession), although most movements in transfer payments are likely to be endogenous.When we perform a variance decomposition for our benchmark model, we find that the bulk of the variation in transfer payments is explained by business cycle shocks (61% on impact, 91% after 8 quarters).Likewise, it is important to split taxes into two sub-components: transfer taxes, which depend on the state of the business cycle and are rarely used as a discretionary fiscal policy tool, and federal tax receipts net of transfer taxes (the federal equivalent of Blanchard and Perotti’s 2002, tax series).We estimate the parameters Φij${\text{{\Phi}}}_{i}^{j}$, the threshold c, the delay lag d, and the number of lags included in the TVAR model using Bayesian methods (technical details are provided in Appendix A). A Bayesian approach has two advantages in highly parametrized models such as the TVAR model. First, conventional frequentist tests can be severely underpowered. The Bayesian approach circumvents this problem by allowing us to directly compare the linear to the nonlinear model using marginal likelihoods. The marginal likelihoods are calculated based on Chib and Jeliazkov (2001) algorithm and models are evaluated using the implied Bayes factors. In addition, motivated by concerns described by Campolieti et al. (2014), we also report the expected posterior likelihoods and the highest posterior density for all of the models. Second, the impulse responses for the endogenously evolving system have nonstandard distributions that will be highly non-Gaussian and depend on the history and the size or sign of the shocks, even when the true values of parameters are known. The Bayesian sampler conveniently produces the entire posterior distribution for c, Φij${\text{{\Phi}}}_{i}^{j}$and Σ conditional on the data and the entire posterior distribution of the impulse responses.3.2Impulse responses and shock identificationThe main empirical questions we consider are whether the effects of government spending differ across regimes defined by economic slack and whether, conditional on any state dependence, austerity has effects that are significantly different from a mirrored effect of stimulus of the same magnitude. Rejecting linearity using Bayesian model comparison directly implies that at least one of the impulse responses to at least one identified structural shock is different across regimes. However, the nature and degree of this asymmetry can be evaluated only by looking at the impulse response functions themselves.The main impulse responses that we consider reflect, after appropriate conversion, the dollar-for-dollar responses of a variable of interest (for example output) to a one-time policy shock. However, these may provide an incomplete picture of the overall effects from a policy shock. If a researcher is interested in calculating multipliers in a more conventional sense, the cumulative change in output scaled by the cumulative change in government spending or taxes may be the more appropriate measure.We compute the cumulative multipliers as the cumulative impulse response of a outcome variable, like output, divided by a cumulative impulse response of a policy variable. In particular, multiplierhorizon=H=(Σt=1t=HIFR(Y,h))/(Σt=1t=HIFR(Fiscal,h))${multiplier}_{horizon=H}=\left({\text{{\Sigma}}}_{t=1}^{t=H}IFR\left(Y,h\right)\right)/\left({\text{{\Sigma}}}_{t=1}^{t=H}IFR\left(Fiscal,h\right)\right)$where IRF(Y,h) is the impulse response of the variable of interest to the fiscal policy shock at horizon h and IRF(Fiscal,h) is the impulse response of G or T to its own shock at horizon h.The dollar-for-dollar responses and the cumulative multipliers provide related, but slightly different pieces of information. In particular, the dollar-for-dollar responses address the question of how output responds today (or at some future horizon) to a policy change today and are close to what, for example, the Congressional Budget Office releases and is reported by the media when they estimate the effects of a policy at a given horizon (although the CBO also calculates and reports the cumulative net effects). For that reason, and because the literature is not unanimous about reporting cumulative or dollar-for-dollar responses, we consider both.In constructing impulse responses (or functions of the impulse responses, in the case of multipliers), the structural shocks need to be identified using a plausible orthogonal decomposition of the variance-covariance matrix Σ. When imposing sign restrictions, we take an approach that is similar in spirit to Mountford and Uhlig (2009). However, following recent developments in the time-series literature that show the penalty function approach used by Mountford and Uhlig (2009) may bias the impulse responses and lead to artificially narrow credibility intervals, we construct the impulse responses using the efficient sampler proposed by Arias et al. (2018).It is important to note that we also incorporate uncertainty about the threshold estimate. For the restricted linear version of the model, we use the Arias et al. (2018) algorithm directly. For the nonlinear version of the model, the priors are symmetric across regimes. In particular, the prior distributions are centered around the posterior means for the linear model. Figure A1 in Appendix A plots the impulse responses for the linear model, which correspond to the impulse responses implied by the priors for the nonlinear model. In Appendix A we also conduct an experiment where the priors for the nonlinear model are centered around zero to illustrate that the posteriors are not substantially affected by the informative priors.Our focus is on four structural shocks identified using sign restrictions on the impulse responses, summarized in Table 1.Table 1:Sign Identification.ResponseShockGTransPayIntPayTransTaxOtherTaxYslackiπG+++????+/????T????+++????BC????+++++++++??MP??????/− − −?+++− − −Question marks indicate that the sign is left unrestricted.The four identified shocks are a government spending shock, a tax shock, a “business cycle” shock, and a monetary policy shock. The first sign in each cell of Table 1 shows the assumed direction of the effect of a shock on the response variable on impact; the second and third signs in each cell are the assumed signs in the first and second quarter following the shock. All shocks are assumed to be orthogonal to one another, which differs from Mountford and Uhlig (2009), who do not impose the restriction that tax shocks are orthogonal to government spending shocks.Our results are robust to relaxing this assumption, as shown in Supplementary Appendix B.A positive business cycle shock is restricted to increase output, tax revenues, and the measure of slack on impact and for 2 quarters following the shock.“Slack” is conventionally defined as the difference between some measure of economic activity and its long run trend. Thus, large negative values imply there is a lot of slack in the economy. A positive business cycle shock would, therefore, increase capacity utilization or the output gap (i.e., making the gap less negative or more positive). However, in the cases where the unemployment rate is used as a measure of slack, we reverse the sign of the restricted responses. Also, note that when we consider the responses of consumption and investment in Section 5, we assume that a positive business cycle shock increases consumption and investment. As in Mountford and Uhlig (2009), a positive business cycle shock in this context could be consistent with either demand or supply shocks, and we remain agnostic about the ultimate source of business cycle fluctuations. Note that we do not impose any restrictions on the responses of the interest rate to a business cycle shock, although we find that the posterior responses indicate that the interest rate also increases in response to a positive business cycle shock. These results are available upon request.Meanwhile, a positive monetary policy shock is specified to increase the interest rate contemporaneously and for the subsequent two quarters, while decreasing inflation on impact and for the subsequent two quarters. That is, a “positive” monetary shock is contractionary in the sense of having a disinflationary and negative liquidity effect. However, because there is conflicting evidence from the monetary policy literature (see, for example, Lo and Piger (2005), Alpanda and Zubairy (2019)) about whether the responses of output to monetary policy can vary and possibly be insignificant at some points of the business cycle, we do not impose the restriction that output falls in response to a contractionary monetary policy shock, although our main results do not change when we impose this restriction. A positive tax shock is assumed to increase tax revenues contemporaneously and for two quarters following the shock. Similarly, a positive government spending shock increases government consumption and investment contemporaneously and for two quarters following the shock.We also consider an alternative identification scheme where the restrictions are imposed for four quarters and a restriction scheme where transfer payment are countercyclical. The responses look very similar to the responses presented in the next sections and are available upon request.Following previous results from the fiscal literature, we also impose the restriction that output increases on impact in response to a positive government spending shock (see, for example, FMP and Auerbach and Gorodnichenko 2013) and that exogenous tax increases decrease output on impact (see, for example, Romer and Romer 2010). Even studies that find no evidence of state dependence or studies that find that government spending multipliers decline sharply after the first quarter find positive multipliers on impact (for example, see Ramey and Zubairy 2018). The responses of output are then left unrestricted after impact.We make an assumption about the response of output on impact in order to speed up the Bayesian estimation, including the calculation of marginal likelihoods. However, the estimates presented in the next sections do not hinge upon this assumption. When the response of output to government spending or taxes is left unrestricted, almost all of the posterior distribution of the response of output to a positive government spending shock or a negative tax shock is above zero at horizon zero. Therefore, the slightly more restrictive prior is supported by the data. The full set of responses for different identification schemes is available upon request.The responses to negative shocks are restricted to have the opposite signs to those shown in Table 1. In the case where we consider the evolving-state impulse responses, the responses are constructed assuming that the economy evolves endogenously from one regime to another, with an orthogonalization accepted if the sign restrictions hold for two quarters even if the economy evolves from one regime to another. The technical details of the impulse response calculation are discussed in Appendix A.4Evidence for nonlinearity and state dependenceBecause the main goal of this paper is to explore when different types of discretionary fiscal policy are effective in the presence of state dependence, we first need to establish what evidence there is for nonlinearity when considering a large enough model to ensure informational sufficiency. To do this, we consider the choice of threshold variable, assess the evidence of state dependence in the dollar-for-dollar and cumulative multipliers, and then perform formal tests to demonstrate that our shocks can be considered “structural” in the sense they are orthogonal both to survey forecasts and to information from other macroeconomic variables.4.1Choice of threshold variableTwo key issues complicate the choice of threshold variable. First, any proposed measure of slack may not accurately capture the true degree of under (or over) utilization of resources in the economy. Second, economic slack may not actually be what triggers nonlinear responses of output to fiscal policy.A full discussion of other potential econometric issues when selecting threshold variables, such as breaks and stationarity issues is included in Supplementary Appendix B. Supplementary Appendix B also includes checks that our results are robust with respect to different measures of slack, as well as to different subsamples and alternative identification schemes.4.1.1Measures of slackEven focusing on the output gap (i. e., the difference between actual and potential log real GDP) as a measure of slack, large discrepancies arise when using different models to estimate the output gap (see, inter alia, Morley and Piger 2012; Morley and Panovska 2019; Perron and Wada 2016). To address this model uncertainty, the measure of slack that we use in our benchmark TVAR model is the model-averaged output gap (MAOG) from Morley and Panovska (2019).For this paper, we re-calculate the MAOG using the full available data sample for US real GDP and treat the resulting output gap as data in our TVAR model.The MAOG is calculated using equal weights on estimated output gaps from a large set of linear and nonlinear time series models (we refer the reader to the original study for technical details). Morley and Panovska (2019) show the MAOG approach performs very well in matching business cycle dates and correspondence to narrower measures of slack, not just for the US, but for a large group of OECD countries.We note that the extant nonlinear fiscal spending multiplier literature has used many different observed variables as potential proxies of slack. For example, in FMP, we considered capacity utilization. Meanwhile, a large number of studies use the CBO output gap (see, for example Baum et al. 2012). Auerbach and Gorodnichenko (2012, 2013 use various combinations of moving averages of output growth rates, whereas Ramey and Zubairy (2018) use the unemployment rate. Given this variety of slack measures, two immediate questions arise: First, which measure of slack drives possible nonlinearity in our medium-scale TVAR model? Second, which measure of slack is the “right” measure when modeling the macroeconomy as a whole? To address these questions, we consider three sets of models. The first set is based on our benchmark specification, but using different measures of slack in the VAR and as a threshold variable. This set of models helps us assess which measure of slack drives the nonlinearity in our model. The second set of models also covers different measures of slack, but corresponds to a smaller VAR that excludes fiscal instruments and only includes output, inflation, interest rates, and the measure of slack. The third set of models also covers different measures of slack, but corresponds to a small-scale fiscal model that includes federal consumption and investment, federal revenues, output, inflation, interest rates, and the measure of slack.For the three sets of models, Table 2 reports the various threshold estimates, and different measures of fit, including the log marginal likelihood, across different measures of slack in the VAR and as threshold variables.All threshold variables are adjusted for structural breaks. Details are provided in Supplementary Appendix B.The implied Bayes factors strongly favor the TVAR model over the linear counterpart in all cases. This result is particularly notable for the benchmark medium-scale model because evidence of nonlinearity for the smaller models could have been due to omitted variables included in our larger model. Meanwhile, the MAOG is the preferred measure of slack in almost every case, both for the benchmark model and for the smaller models. Furthermore, while the threshold estimates for some of the other variables change across different models, the estimated thresholds for the MAOG as the threshold variable are fairly robust across the different specifications. Taken together, these results suggest that the MAOG is a good measure of economic slack and driver of nonlinearities in macroeconomic dynamics.Table 2:Model comparison: Linear versus nonlinear models with different measures of slack.Benchmark VAR: measure of slack in the VARThreshold variablecapacity utilizationunemployment rateCBOgapMAOGLinear (none)−2670.03−2525.25−2430.87−2394.05−2676.38−2523.81−2429.42−2389.91−822.73−507.72−506.87−833.73capacity utilization−2238.54−2137.32−2028.08−1938.70−2239.00−1.35−2029.01−2.00−2029.47−0.30−1938.02 −1.33−339.26 (−1.66, −1.04)−339.26(−2.63, −0.28)−344.21(−1.79, 0.23)−568.77(−1.82, −0.91)unemployment rate−2294.43−2152.21−1764.52−1958.42 0.13−2293.77 0.59−2151.00 0.06−1761.30 0.88−1958.63(−0.05, 0.38)−339.26(−0.06, 0.73)−362.84(−0.29, 0.38)−341.39(0.06, 1.04)−542.63CBOgap−2231.54−2089.74−1597.20−1916.56−2231.70−1.64−2091.77−1.66−1598.00−2.00−1918.22−1.64−303.37(−1.95, −0.84)−329.92(−1.95, −0.66)−312.61(−2.16, −1.30)−487.05(−1.94, −1.06)MAOG−1937.81−2085.69−1579.30−1873.11−1937.00−0.69−2084.27 −0.74−1578.33 −0.71−1870.00−0.74−269.66(−0.74, −0.52)−329.92(−0.80, −0.39)−309.65(−0.82, −0.39)−450.37 (−0.86, −0.51)Var with output, inflation, interest rates, and a measure of slack: measure of slack in the VARThreshold variablecapacity utilizationunemployment rateCBOgapMAOGLinear (none)−667.09−510.15−442.52−365.37−664.22−509.23−441.52−364.22−194.27−162.52−405.22−240.38capacity utilization−569.70−423.05−339.42−247.23−567.99−3.03−422.99−3.03−339.40−3.02−244.22−3.03−84.37(−4.84, −0.86)−64.22(−3.51, −0.82)−110.95(−3.42. −1.52)−91.11(−3.38, −0.47)unempoyment rate−592.15−428.39−94.83−274.90−592.910.74−426.220.73−92.111.20−276.210.73−109.22(−0.04, 1.03)−62.19(−0.07, 0.87)−63.59(−0.05.1.39)−90.23(−0.06, 0.89)CBOgap−585.11−420.45−237.37−268.55−584.00−1.64−420.40−0.69−235.22−2.17−266.211−0.41−111.96(−1.96, −0.22)−54.11(−2.71, −0.21)−89.65(−2.42, −0.41)−80.29(−1.96, 0.14)MAOG−569.55−408.14−50.10−211.84−567.01−1.02−407.22−1.03−49.99−1.49−210.22−1.21−84.62(−1.34, −0.02)−44.07(−1.34, −0.60)−35.09(−1.51, −0.51)−58.57(−1.34, −0.95)Var with federal spending, federal revenues, output, inflation, interest rates, and a measure of slack: measure of slack in the VARThreshold variablecapacity utilizationunemployment rateCBOgapMAOGLinear (none)−1490.90−1335.36−1243.82−1196.54−1488.20−1333.00−1242.01−1194.24−433.76−459.44−466.73−363.01Capacity Utilization−1266.93−1118.90−11036.46−936.63−1262.90−3.00−1118.00−2.90−1134.00−3.01−930.10−3.13−182.07(−3.50, −0.89)−139.18(−3.52, −1.71)−174.33(−3.45, −2.52)−317.02(−3.47, −0.94)unemployment rate−1336.69−1185.00−772.61−1028.60−1334.920.79−1184.500.80−771.511.09−1022.510.43−176.22(−0.16, 0.99)−223.16(−0.00, 1.08)−223.16(0.09, 1.21)−261.95(−0.07, 0.96)CBOgap−1296.73−1147.56−917.49−990.20−1292.00−1.91−1148.00−1.56−915.00−1.86−986.02−1.64−170.02(−2.01, −1.01)−128.33(−2.40, −0.72)−374.00(−2.41, −1.05)−320.58(−1.95, 0.76)MAOG−1253.99−1110.35−613.42−942.67−1253.80−1.16−1109.00−1.02−612.00−1.11−942.02−0.74−161.22(−1.51, −0.36)−104.56(−2.22, −0.37)−99.52(−1.25, −0.55)−263.00(−2.01, −0.31)Each cell reports the log likelihood obtained from maximum likelihood estimation, the expected posterior log likelihood obtained Bayesian estimation, and the log marginal likelihood (top, middle, bottom). The second entry is the threshold estimate, including 90% credibility intervals, obtained from the posterior Bayesian distribution. The best model fit for each measure of slack is reported in bold.4.1.2Slack versus alternative threshold variablesTo account for the possibility that nonlinearity could actually be driven not so much by the degree of slack in the economy, but more by the direction of change in economic activity or fiscal policy, we also consider the following possible threshold variables: a 4-quarter moving average change in log output, a 4-quarter moving average change in the log of government spending and consumption, and a 4-quarter moving average change in log tax revenues (net of transfer taxes).We also considered longer moving averages of output growth, as in Auerbach and Gorodnichenko (2012). Results were similar to those for the 4-quarter moving average and are available upon request.For completeness, we also consider the level of the ex-ante real interest rate (based on static expectations) as a threshold variable to allow for the possibility that any asymmetry is related more to the stance of monetary policy rather than fiscal policy. Figure 1 plots all of the possible threshold variables that relate to economic slack, growth rates, and policy changes. The left panels display the measures of slack and the right panels display the additional threshold variables related to growth rates and policy changes.Figure 1:Threshold variables and estimated thresholds.Recent developments in the fiscal and monetary literature have also indicated that household debt or household debt overhang could be a potential channel for nonlinear transmission of policy (Alpanda and Zubairy 2019; Bernadini and Peersman 2018; Klein 2017). Furthermore, the empirical literature has also related the effectiveness of fiscal policy to the level of the government debt (see, inter alia, the now highly controversial study by Reinhard and Rogoff (2009)). Thus, we also consider household and Federal debt-output ratios as potential threshold variables. Figure 2 plots the debt levels and the debt overhang. As shown in the figure, both debt ratios are clearly nonstationary. For the sake of direct comparison with previous studies on federal debt, we consider using the ratios in levels (while fully acknowledging that this could be problematic and could lead to incorrect inference, as discussed in detail in Supplementary Appendix B). We also consider their overhang levels, which are stationary. Following the previous literature (inter alia Klein 2017 or Alpanda and Zubairy 2019), overhang is defined as the difference between the ratio and its long run trend, where trend in this case is estimated using the low frequency output of the HP filter with λ=104$\lambda ={10}^{4}$.Figure 2:Household and federal debt-to-GDP ratios.The threshold variables are adjusted for any structural breaks. The estimated threshold (median) and 90% credibility intervals from Tables 2 and 3 are also displayed.Table 3:Model comparison: slack versus growth versus debt as threshold variables in a TVAR model with MAOG as the measure of slack.Threshold variableMeasure of slack in the VAR modelMAOGLinear model (none)−2394.05−2389.91−833.73Moving Average Output Growth−1879.45−1.51−1878.00(−1.67, −0.89)−454.37Moving Average Government Spending Growth−1966.62 3.93−1962.79(1.06, 4.45)−503.65Moving Average Taxes Growth−1924.22 −3.31−1923.16(−4.79, 1.34)−490.12Real Interest Rate−1953.92 −0.21−1953.00(−0.62, 0.87)−515.22Household Debt Ratio−1958.2547.25−1956.22(n/a)−600.22 Federal Debt Ratio−1992.85 34.58−1990.11(n/a)−751.00Household Debt Overhang−1995.11−0.75−1994.22(−0.42, 0.85)−671.00Federal Debt Overhang−1939.31−1.385−1933.22(−2.02, 2.10)−688.95MAOG−1873.11−0.74−1870.00(−0.86, −0.51)−450.37 Each cell reports the log likelihood obtained using maximum likelihood estimation, the expected posterior log likelihood obtained Bayesian estimation, and the log marginal likelihood (top, middle, bottom). The second entry is the threshold estimate, including 90% credibility intervals, obtained from the posterior Bayesian distribution. The growth rate threshold variables are 4-quarter moving averages of log differences. The real interest rate is an ex ante measure given static expectations. The best model fit is reported in bold.The trends are estimated using an HP filter with λ equal to 104. The estimated threshold (median) and 90% credibility intervals from Tables 3 and 4 are also displayed.Table 4:Output multipliers: cumulative scaled responses at selected horizons.HorizonGovernment spendingTaxesExcess slackClose to potentialExcess slackClose to potential1 year1.151.24−6.24−2.222 years1.210.72−5.37−3.193 years1.230.38−4.51−3.684 years1.180.28−4.28−4.075 years1.150.26−4.50−4.33Table 3 reports the results of a model comparison for a specification where slack (namely the MAOG) is used as the threshold variable versus specifications with policy or debt threshold variables. Similar to Table 2, the MAOG is strongly preferred as the threshold variable. Looking back at Figure 1, the threshold estimates for all measures of slack split the sample up into periods of excess slack (recessions and their immediate recoveries) and “normal” times when the economy is closer to potential. Related to this delineation of the sample and, as can be seen by comparing Tables 2 and 3, any of the slack variables is strongly preferred as a threshold variable compared to the policy variables, while the MAOG is preferred over the moving average of output growth, even though both identify similar dates for the regimes, as shown in Figure 1. The models in which debt ratios are included as threshold variables in levels perform worse than any of the models that include a measure of slack as a potential threshold variable. Furthermore, the estimates are quite imprecise when considering policy measures as threshold variables. In the case of debt overhang, the nonlinear model outperforms the linear model, but, again, all model selection criteria strongly prefer the model with the MAOG as the threshold variable. Therefore, in our remaining analysis, we focus on the model with the MAOG as a measure of slack and as a threshold variable.However, to ensure that our results are not driven solely by a particular choice of the measure of slack, we also consider the robustness of responses using the other measures of slack. The results for this robustness analysis are reported in Supplementary Appendix B.4.2Fixed-state responsesBefore considering evolving-regime responses to both government spending and tax shocks in the next section, we first establish that our identified government spending and tax shocks have state-dependent effects. Figure 3 plots the dollar-for-dollar responses and Table 4 summarizes the responses for the cumulative multipliers at select horizons.It is important to note that both the dollar-for-dollar responses and the cumulative multipliers are highly nonlinear. They peak at different horizons, and the credibility intervals are not symmetric around the median or the highest posterior density response. For convenience, we provide summary tables with the estimated cumulative multipliers for various outcome variables (Table 4 presents the responses of output and Table 6 in Section 6 presents the responses of consumption and investment) at select horizons. The companion graphs that show the full extent of the nonlinearities for the cumulative multipliers by plotting them with their 90% intervals at all horizons are provided in Supplementary Appendix B.The left panels in Figure 3 display the fixed-regime responses for the “excess slack” regime (defined by the estimated threshold), the middle panels display fixed-regime responses when the economy is close to potential, and the right panels display the posterior differences for the responses between the two regimes. Both median responses and 90% credibility intervals are reported.The dollar-for-dollar responses are converted from log-to-log responses using the average Gt/Ytratios for the corresponding regimes for each draw of the threshold parameter. The Gt/Ytratios vary considerably over time, which could potentially affect the size of the multiplier when converting log-to-log responses to dollar-for-dollar responses. In this section, we report the fixed-regime responses using the average ratios to illustrate the pattern of nonlinearity. When we construct the history-dependent evolving-regime responses in the next section, we address this issue directly and all of the responses are converted using the Gt/Ytratio at each point in time. However, it is worth noting that the responses in Figure 3 are comparable in magnitude and in shape to the responses that would be obtained using the Gt/Ytratio for each point in time.Figure 3:Dollar-for-dollar effects of government spending and taxes on output.In the excess slack regime, output responds with a large and persistent increase to a positive shock to government spending. By contrast, when the economy is close to potential, an increase in government spending temporarily increases output on impact, but the response dies out and becomes negative after two years. Tax cuts have similar state dependence. Flipping the sign of the displayed response to a tax hike, the fixed-regime impulse responses suggest that a tax cut would increase output in both regimes. A tax cut in the excess slack regime increases output by $2 (dollar for dollar) and is significant for 13 quarters, whereas the response is smaller and dies out after 7 quarters when the economy is close to potential, peaking at $1.3 and becoming insignificant after 7 quarters.The impulse responses for the linear VAR model for the benchmark case when the MAOG is used as a measure of slack are reported in Appendix A. These responses correspond to the responses implied by the prior in each regime for the nonlinear model. As pointed out by (Arias et al., 2018), the impulse responses obtained using their approach have much wider credibility bands than the responses reported by Mountford and Uhlig (2009). However, when we allow for nonlinearity, the impulse responses are more precisely estimated, implying that some of the uncertainty about the linear responses could be due to state dependence that is not accounted for. Furthermore, the posterior distribution for the excess slack regime looks substantially different from the posterior distribution for the linear model, thus indicating that the findings in favor of state dependence are not driven by our choice of priors.The posterior differences in the right panels make it clear that the state dependence is significant. Meanwhile, the cumulative tax multipliers in Table 4 are larger than the cumulative spending multipliers, especially at longer horizons, with both spending and tax multipliers exhibiting clear state dependence. The spending multipliers are larger and more persistent in the excess slack regime than when the economy is close to potential. The spending multipliers peak early when the economy is close to potential, and start declining after less than a year. Similarly, the tax multipliers are larger in magnitude in the excess slack regime than when the economy is close to potential.The figure displays the fixed-regime responses of output to a government spending shock (first row) and a tax shock (second row) in the excess slack regime (left column) and close to potential regime (middle column) for the benchmark model. The right column plots the posterior differences in responses across the two regimes. Posterior medians and 90% credibility intervals are reported.4.3Informational sufficiencyThe impulse responses in Figure 3 are presented under the assumption that the VAR model includes sufficient information to correctly identify the fiscal shocks. However, if shocks are correlated with other information available to economic agents that is not included in the VAR model, the estimated impulse responses can be biased. For example, Ramey (2011) shows possible “fiscal foresight” about shocks is a problem for a small Blanchard-Perotti type VAR model for which the shocks identified from the VAR model can be Granger-caused by forecasts of those shocks and are, therefore, likely to be anticipated by economic agents.We follow Forni and Gambetti (2014) to assess whether the shocks from our TVAR model are unanticipated.We have also benchmarked the shocks from our model against commonly-used narrative measures. The spending shocks pick up most of Ramey (2011) military spending news shocks and our tax shocks most of Romer and Romer (2010) unanticipated tax shocks. Our results are also robust to an alternative identification scheme that accounts for fiscal foresight following Mountford and Uhlig (2009). Both sets of results are available upon request.We regress the structural shocks for government spending, in our case from each iteration of the Bayesian sampler for our benchmark model, on the Survey of Professional Forecasters’ forecasts of Real Federal Government Consumption Expenditures and Gross Investment in the subsequent four quarters, calculated based on mean responses and taking into account compounding. Our assumption with this test is that the SPF forecasters aggregate relevant information about the anticipated component of government spending. Meanwhile, because the residuals for a Bayesian VAR model may not necessarily be orthogonal to the VAR information set, we also perform a Granger causality test where we regress the structural shocks on a model that includes both the right-hand-side variables from Equation (1) and the SPF forecasts. In either case, we are unable to reject the null of information sufficiency. For the case where we do not include the conditioning variables, the p-values range from [0.21, 0.98] across different draws of the sampler, with the median p-value being 0.56. For the case where we include the conditioning variables, the p-values range from [0.47, 1.00], with the median value being 0.83. Therefore, our medium-scale TVAR model appears informationally sufficient when identifying government spending shocks.We also conduct orthogonality tests by checking whether either the spending or tax shocks are correlated with lags of principle components extracted from a large macroeconomic dataset that proxies for information available to economic agents. In particular, we use the FRED MD Stock and Watson dataset (see McCracken and Ng 2016 for details).To obtain the principal components, we convert the data to quarterly frequency, drop series that are not available for the period 1967–2015, and use growth rates for series that are nonstationary. We take a conservative approach and include up to seven components in our orthogonality tests.Different conventional tests indicate that the relevant number of principal components for the dataset is between 1 and 7. Table 5 reports the results for 2 lags of each principal component at a time and for 2 lags of all seven principal components at the same time. We cannot reject the null of orthogonality for any of the principal components. The p-values are especially large for the TVAR models. This result is consistent with Forni and Gambetti (2016), who show that, while a small fiscal VAR model is insufficient to identify fiscal shocks, a larger VAR model that includes forward-looking variables such as inflation and interest rates (and exchange rates in their case) is sufficient. Therefore, our shocks appear to be “structural” in the sense that they are not correlated with other information at time t about the macroeconomy and are thus possibly also “structural” in the conventional SVAR sense.Table 5:Orthogonality tests.ShocksPC1PC2PC3PC4PC5PC6PC7PC1-7G Linear VAR0.8720.9530.9350.6200.3540.7850.1620.919T Linear VAR0.8540.9970.8610.1220.2250.2750.0900.181G TVAR0.5330.4320.1960.6980.8980.3270.4560.881T TVAR0.5620.2670.7900.8510.1450.5960.8990.736Each cell reports the p-value for an F-test where the null is that the highest posterior density estimates of shocks are orthogonal to the lags of the principal components.5Evolving-regime impulse response analysisThe responses reported in Figure 3 and in Table 4 embed three different sources of uncertainty: uncertainty about the threshold estimate, uncertainty about the TVAR parameters, and uncertainty about the orthogonalization matrix that identifies the shocks. Even though we account for all of these different sources of uncertainty and we use 90% credibility bands, which are conservative in the fiscal literature, there is clear evidence of state dependence in the responses of output to fiscal policy. Notably, the posterior differences in the right columns of Figure 3 are large in magnitude and are highly likely to be different from zero. The cumulative multipliers exhibit similar state dependence.In this section, we turn to exploring implications of state dependence in more realistic scenarios for fiscal policy spending and tax shocks in which the economy is allowed to evolve endogenously from one regime to another. This approach allows us to consider possible sign and size asymmetries and to determine when discretionary fiscal policy is comparatively more or less effective. While the fixed-regime responses are useful for testing state dependence across regimes, the responses within a regime are be linear by construction – i.e., they are proportional to the size and sign of a shock. However, if the economy is allowed to evolve across regimes, threshold models allow (but do not impose) the possibility that negative shocks can have different proportional effects for positive shocks or that shocks of different magnitudes have non-proportional effects.The evolving-regime analysis requires specification of the history of the economy prior to the shock because the effects of the shock will depend on the system’s proximity to the threshold. For our generalized impulse response calculations, we focus on three particular histories of interest from a policy perspective: a strong expansion, a deep recession, and a sluggish recovery, defined as follows:–1996Q1: a robust expansion, when the economy is usually classified as being above or close to potential according to our various threshold variables and threshold estimates;–2008Q3: a deep recession, when the economy is clearly classified as being in the excess slack regime;–2012–2014: a sluggish recovery, when the economy is close to the estimated threshold for at least some of our threshold variables and threshold estimates.For each history, we calculate the responses to an increase in government spending and taxes and to decreases in government spending and taxes.Averaging over other similar histories of robust expansions, deep recessions, or sluggish recoveries produces very similar results to considering these specific dates.The shocks are scaled to 1% of GDP and 3% of GDP to consider possible size asymmetries. Sign restrictions are simply reversed for negative shocks. To address the computational burden when calculating the generalized impulse response functions, we abstract from the parameter uncertainty and fix the parameters at their highest posterior density values, although we know from fixed-regime responses in the previous section and as noted above that there is evidence of state dependence even when taking this parameter uncertainty into account.Figure 4 plots the responses of output to changes in government spending and taxes when the economy starts in robust expansion and Figure 5 plots the responses of output when the economy starts in a deep recession. In both cases, a shock scaled to 1% of GDP is considered. The top panels of the figures display the responses to government spending, the bottom panels plot the responses to tax changes. The left column displays the responses to positive shocks (higher government spending or higher taxes), the middle panel displays the response to a negative shock (scaled by −1 for ease of comparison), and the right panel displays the difference between the scaled response to a contractionary shock and the response to an expansionary shock.Figure 4:Sign-dependent effects of government spending and taxes on output in a robust expansion.Figure 5:Sign-dependent effects of government spending and taxes on output in a deep recession.The results in Figure 4 show that contractionary shocks (i.e., cuts in government spending or tax increases) have somewhat larger effects, on average, than expansionary shocks when the economy starts in a robust expansion. However, the difference is economically small and not significant. Tax cuts appear to be more efficient at stimulating output than increases in government spending ($1.7 vs. $0.6 after one year), which is consistent with Mountford and Uhlig (2009). The magnitude of the peak responses to tax shocks is also in line with, for example, the responses obtained by Romer and Romer (2010). However, our results for tax increases stand in contrast to the findings of Jones et al. (2015), who find that tax increases do not affect output, but decreases have a strong positive effect. Our results, by contrast, indicate that tax increases have a strong contractionary effect on output across the business cycle.Meanwhile, as shown in Figure 5, the effects of contractionary shocks are more persistent and larger than the effects of expansionary shocks when the economy starts in a deep recession. Cuts in government spending decrease output by $1.7 after 9 quarters. Tax increases decrease output by almost $3 after 10 quarters. The responses to stimulative shocks are smaller than the responses to austerity shocks. Both increases in taxes and decreases in government spending significantly decrease output (the response is different from zero at all horizons for tax increases and for two years for spending cuts). These results indicate that, if the aim of discretionary policy is to stimulate the economy, either government spending or tax cuts could be used in deep recessions, but tax cuts should be used when the economy is in a robust expansion.Size and sign asymmetries might be particularly relevant in a sluggish recovery when the economy is close to the threshold and different shocks can influence the probability of crossing it. Figure 6 plots the dollar-for-dollar responses of output to “small” (1% of GDP) and “large” (3% of GDP) shocks, both positive and negative, when the economy starts in a sluggish recovery. Figure 7 then plots posterior differences between responses to positive and negative shocks, responses to large positive and large negative shocks, responses to small and large negative shocks, and responses to small and large positive shocks.Figure 6:Sign-dependent and size-dependent effects of government spending and taxes on output in a sluggish recovery.Figure 7:Differences in sign and size effects of government spending and taxes on output in a sluggish recovery.The figure displays the evolving-regime responses of output to a government spending shock (first row) and a tax shock (second row) for the benchmark model in a robust expansion. The left columns plot the responses to a positive shock, the middle column plots the response to a negative shock (scaled by −1 for ease of comparison), and the right column plots the difference in magnitude of (scaled) responses for positive and negative shocks. The shocks are equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.The figure displays the evolving-regime responses of output to a government spending shock (first row) and a tax shock (second row) for the benchmark model in a deep recession. The left columns plot the responses to a positive shock, the middle column plots the scaled (by −1 for ease of comparison) response to a negative shock, and the right column plots the difference in magnitude of the scaled responses for positive and negative shocks. The shocks are equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.Figures 6 and 7 illustrate that the responses to contractionary shocks are larger than the responses to expansionary shocks. This is particularly pronounced when we consider large shocks. Large contractionary shocks have very persistent effects. By contrast, large expansionary shocks have positive effects in the short run that quickly die out as the economy gets closer to potential. The responses to large expansionary shocks (positive government spending shocks or negative tax shocks) are proportionally smaller than the responses to smaller expansionary shocks. In particular, when the economy is above the threshold, the dollar-for-dollar responses are dampened. By contrast, the responses to different sized cuts in government spending or tax increases are proportionally more similar.We also use evolving-regime impulse responses to trace out the effects of different shocks on government debt. While contractionary shocks have stronger effects on output, dollar for dollar, than expansionary shocks, the difference for government spending is much smaller in a robust expansion. This implies both that austerity could sometimes be self-defeating and that different fiscal policies of the same magnitude could have different effects on the debt-to-GDP ratio. Figure 8 plots the effects of stimulus on the debt-to-GDP ratio, while Figure 9 plots the effects of austerity. The top rows display the responses to a change in government spending and the bottom rows display the responses to tax changes. The left columns display the evolving responses when the economy starts in a deep recession, the middle columns display the responses when the economy starts in a robust expansion, and the right columns display the posterior differences.Figure 8:State-dependent effects of fiscal stimulus on the debt-to-GDP ratio with evolving regimes.Figure 9:State-dependent effects of austerity on the debt-to-GDP ratio with evolving regimes.Figure 8 shows that a decrease in taxes immediately raises the debt-to-GDP ratio and this increase is significant, albeit temporary. Because the increase in output is larger when taxes are cut in a deep recession, the rise in the debt-to-GDP ratio from a tax cut is significantly smaller than in a robust expansion.As Figure 9 shows, even though there is a substantial amount of uncertainty associated with the responses of the debt-to-GDP ratio to cuts in government spending, the posterior differences indicate that, at medium horizons at least, government spending cuts implemented in a robust expansion decrease the debt-to-GDP ratio more than government spending cuts implemented in a deep recession. The posterior difference peaks at 0.5% of GDP at intermediate horizons (Quarter 6 through Quarter 12). Similarly, increases in taxes reduce the debt-to-GDP ratio more if implemented in a robust expansion, but the difference is most pronounced at short horizons.The figure displays evolving-regime responses of output to a government spending shock (first row) and a tax shock (second row) for the benchmark model in a sluggish recovery. The first column plots the responses to a positive shock equal to 1% of GDP, the second column plots responses to a positive shock equal to 3% of GDP, scaled by 1/3 for ease of comparison. The third column plots responses to a negative shock equal to 1% of GDP (scaled by – 1), and the fourth column plots responses to a negative shock equal to 3% of GDP (scaled by – 1/3). Posterior medians and 90% credibility intervals are reported.The figure displays the differences in evolving-regime responses of output to a government spending shock (first row) and a tax shock (second row) for the benchmark model in a sluggish recovery. The first column plots the difference in magnitude of (scaled) responses for positive and negative shocks equal to 1% of GDP, the second column plots the difference in magnitude of (scaled) responses for positive and negative shocks equal to 3% of GDP, the third column plots the difference in magnitude of (scaled) responses for a negative shock equal to 3% of GDP and a negative shock equal to 1% of GDP, and the fourth column plots the difference in magnitude of (scaled) responses for a positive shock equal to 3% of GDP and a positive shock equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.The figure displays the evolving-regime responses of the debt-to-GDP ratio to a positive government spending shock (first row) and a negative tax shock (second row) for the benchmark model in a deep recession (left column) and in a robust expansion (middle column). The right column plots the difference between the responses in a deep recession and a robust expansion. The shocks are equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.The figure displays the evolving-regime responses of the debt-to-GDP ratio to a negative government spending shock (first row) and a positive tax shock (second row) for the benchmark model in a deep recession (left column) and in a robust expansion (middle column). The right column plots the difference between the responses in a deep recession and a robust expansion. The shocks are equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.6Responses of consumption and investmentIn this section, we disaggregate the responses of output by considering the separate responses of consumption and investment to government spending and tax shocks. This allows us to consider different channels through which policy effects propagate. The impulse responses are calculated following the same approach used to calculate the impulse responses for output, although the impact responses of consumption and investment to fiscal shocks are intentionally left unrestricted as there is a less clear consensus on the underlying source of the response on output than there is about the overall response.Figures 10 and 11 plot the fixed-regime responses of consumption and investment, respectively, while Table 6 summarizes the results for the cumulative multipliers at select horizons.Evolving-regime responses behave as would be expected given the fixed-regime responses and are not reported to conserve space.There is strong evidence in Figure 10 for state dependence in the response of consumption to government spending shocks, which is also confirmed in the cumulative multipliers in Table 6. A clear implication of this result is that consumption drives much of the state dependence in the response of output documented previously. In particular, consumption responds strongly in the excess slack regime, while the response is much smaller, although still significant, when the economy is closer to potential. Similarly, the response of consumption to tax shocks is stronger in the excess slack regime. The response pattern of consumption to government spending shocks is consistent with theoretical models that incorporate a time-varying share of rule-of-thumb consumers, models featuring habit formation in which government spending and consumption are complements, as in Leeper et al. (2017) and the endogenous credit constraint model by McManus et al. (2018).Figure 10:Dollar-for-dollar effects of government spending and taxes on consumption.Figure 11:Dollar-for-dollar effects of government spending and taxes on investment.Table 6:Consumption and investment multipliers: cumulative scaled responses at selected horizons.HorizonGovernment spendingTaxesExcess slackClose to potentialExcess slackClose to potentialConsumption1 year0.670.43−0.80−0.302 years0.990.36−1.19−0.803 years1.290.32−1.44−0.904 years1.490.32−0.29−0.845 years1.520.32−0.12−0.83Investment1 year0.03−0.16−1.61−0.252 years−0.02−0.22−1.30−0.513 years−0.09−0.25−0.39−0.304 years−0.13−0.27−0.14−0.365 years−0.14−0.28−0.23−0.41There is no evidence in Figure 11 of state dependence in the response of investment to government spending shocks, again confirmed in Table 6. However, there is evidence that investment responds very differently to government spending and tax shocks, dollar for dollar, and some evidence of state dependence in the responses of investment to tax shocks. Investment does not increase significantly in response to government spending shocks, but it responds significantly to tax changes. Thus, the responses of output to discretionary changes in taxes appears to be driven by both consumption and investment.The figure displays the fixed-regime responses of output to a government spending shock (first row) and a tax shock (second row) in the excess slack regime (left column) and close to potential regime (middle column) for the benchmark model. The right column plots the posterior differences in responses across the two regimes. Posterior medians and 90% credibility intervals are reported.The figure displays the fixed-regime responses of output to a government spending shock (first row) and a tax shock (second row) in the excess slack regime (left column) and close to potential regime (middle column) for the benchmark model. The right column plots the posterior differences in responses across the two regimes. Posterior medians and 90% credibility intervals are reported.7ConclusionsOur analysis has considered when discretionary changes in government spending and taxes are comparatively more or less effective. We have found strong and robust empirical evidence in favor of nonlinearity and state dependence in the relationship between both types of fiscal policy and aggregate output. In particular, estimates from a threshold structural vector autoregressive model imply different responses of the economy both to government spending and to taxes during periods of excess slack compared to when the economy is closer to potential.If the aim of discretionary fiscal policy is to stimulate the economy during periods of excess slack, both government spending multipliers and tax multipliers are high and work primarily through a consumption channel. However, when the economy is closer to potential, tax cuts have larger effects than government spending increases and work primarily through an investment channel. Notably, austerity designed to lower the government debt-to-GDP ratio is largely self-defeating in deep recessions. In particular, if the aim of austerity is to reduce the debt-to-GDP ratio, our results suggest it will have smaller negative effects on economic activity if pursued when the economy is in a robust expansion.Research since the influential study by Auerbach and Gorodnichenko (2012) has debated the importance of state dependence and nonlinearity in the response of the economy to fiscal policy. While many studies find nonlinearity, important questions have been raised about robustness of the evidence, such as those in Ramey and Zubairy (2018). By exploring these issues with a relatively large model that nests much of the earlier research and allows us look extensively at robustness to various choices of threshold variables and shock identification, we find strong, robust support for nonlinearity in the effects of fiscal shocks that can help guide the timing and structure of future fiscal interventions. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Studies in Nonlinear Dynamics & Econometrics de Gruyter

When is discretionary fiscal policy effective?

Loading next page...
 
/lp/de-gruyter/when-is-discretionary-fiscal-policy-effective-a0LpkLdYQj

References

References for this paper are not available at this time. We will be adding them shortly, thank you for your patience.

Publisher
de Gruyter
Copyright
© 2020 Steven M. Fazzari et al., published by De Gruyter, Berlin/Boston
ISSN
1558-3708
eISSN
1558-3708
DOI
10.1515/snde-2018-0113
Publisher site
See Article on Publisher Site

Abstract

1IntroductionSince the Global Financial Crisis, there has been worldwide resurgence in the use of discretionary fiscal policy. Both stimulus and austerity have been enacted in many countries, with policies often being a mix of tax and spending changes. By its nature, “discretionary” implies choice, including choice about timing. Thus, if the effects of discretionary fiscal policy depend nonlinearly on economic conditions at the time when the policy is undertaken, it opens up important questions about when different policies would be comparatively more or less effective, questions that would simply not be relevant under linearity.This paper addresses three key questions about the timing and type of discretionary fiscal policy: (i) When do discretionary government spending increases and tax cuts provide more or less effective stimulus to the economy? (ii) Do the effects of government spending differ from the effects of taxes? (iii) Is austerity more or less effective than stimulus? In answering these questions, we make three contributions to the literature on nonlinear state-dependent effects of fiscal policy.First, we examine the exact nature and robustness of state dependence in the effectiveness of fiscal policy by considering an informationally-sufficient medium-scale threshold vector autoregressive (TVAR) model and by comparing and testing many different possible threshold variables. Using U.S. data and identifying government spending and tax shocks via sign restrictions, we find strong empirical support for nonlinearity related to economic slack in the relationship between government spending and aggregate output and between taxes and aggregate output, both when considering dollar-for-dollar and cumulative multiplier responses. The measure of economic slack that we find most closely relates to the nonlinear relationship between fiscal policy and aggregate output is the model-averaged output gap developed by Morley and Panovska (2019) based on earlier research by Morley and Piger (2012). The model averaging approach addresses uncertainty about the appropriate forecasting model for aggregate output by averaging implied estimates of the output gap from forecast-based trend-cycle decompositions for a large set of similarly-fitting reduced-form time series models. However, it is important to note that we find generally robust results in terms of the timing and implications of the nonlinearity for various conventionally used measures of slack. Meanwhile, we show that structural shocks identified from our medium-scale model pass the conventional informational sufficiency tests. Thus, our results suggest that previous findings in favor of nonlinearity are not simply due to omitted variables or a failure to account for fiscal foresight.Second, using evolving-regime generalized impulse response analysis, we demonstrate that tax cuts and government spending increases have similarly large expansionary effects during deep recessions and sluggish recoveries, but they are much less effective, especially in the case of government spending increases, when the economy is in a robust expansion. Meanwhile, tax increases and government spending cuts are most contractionary during deep recessions, and, as a result, are largely self-defeating if the goal of fiscal austerity implemented during an economic crisis is to bring down the debt-to-GDP ratio. Overall, we find that austerity has larger dollar-for-dollar effects on output than stimulus across the business cycle.Third, we investigate and determine the roles of consumption and investment in driving the effects of both government spending and taxes on aggregate output. Our results imply that the effectiveness of discretionary government spending shocks, including its state dependence, is almost entirely due to the response of consumption. The responses of both consumption and investment to discretionary tax changes are state dependent, but investment plays the larger quantitative role.The rest of our paper is organized as follows. Section 2 reviews the relevant previous literature. Section 3 presents our empirical model. Section 4 examines the evidence for nonlinearity and state dependence in the effects of government spending and taxes on aggregate output. Section 5 reports evolving-regime impulse response analysis to investigate when discretionary changes in government spending and taxes are comparatively more or less effective. Section 6 explores the roles of consumption and investment in driving the state-dependent effects of fiscal policy on aggregate output. Section 7 concludes.The technical estimation details are provided in Appendix A. Some additional results are provided in the online Supplementary Appendix B.2Previous literature2.1Nonlinear effects of fiscal policyRecent theoretical research highlights potential channels through which fiscal policy shocks transmit nonlinearly. Michaillat (2014) shows that public employment can have much larger multiplier effects when the unemployment rate is high than when it is low. Canzoneri et al. (2016) emphasize the role of the credit channel, and McManus et al. (2018) emphasize the importance of the credit channel when credit constraints are occasionally binding and endogenous. Gali et al. (2007) showed that a fiscal multiplier could be large when the ratio of rule-of-thumb consumers is large and stimulus policies work primarily through the consumption channel. Leeper et al. (2017) showed that fiscal multipliers can be persistently high when government spending interacts favorably with consumer preferences.The empirical literature on fiscal policy multipliers has grown rapidly and has many different strands. Our analysis builds on and merges several of these. Most closely related, a number of studies with smaller-scale nonlinear vector autoregressive (VAR) models find state-dependent effects of discretionary changes in government spending – see, for example, Auerbach and Gorodnichenko (2012), Auerbach and Gorodnichenko (2013), Bachmann and Sims (2012), Baum et al. (2012), Caggiano et al. (2015), Candelon and Lieb (2013), Fazzari et al. (2015), (FMP henceforth), and Mumtaz and Sunder-Plassmann (2019). However, the existence of state dependence does not seem settled. Studies using a narrative approach and military spending shocks to identify the effects of government spending often find little support for state dependence–see, for example, Owyang et al. (2013) and Ramey and Zubairy (2018).Some recent empirical studies have also considered asymmetries in the effects of stimulus versus austerity, mostly in a sign-dependent framework. Jones et al. (2015) find that tax cuts have significant positive effects on US output, while tax increases have no substantial negative effects, but these results are reversed for the UK. Barnichon and Matthes (2017) find that government spending cuts have larger effects than increases, with the results driven primarily by very strong negative responses of output to government spending decreases during recessions. Guajardo et al. (2014) and Jorda and Taylor (2016) find large decreases in output in response to exogenous fiscal consolidations. Alesina et al. (2015) show that for a panel of 16 OECD countries, fiscal consolidations based on spending cuts are less costly in terms of output loss than consolidations based on tax increases. In a state-dependent framework, Klein (2017) finds that austerity has large negative effects on output when the level of private debt is high.2.2Shock identificationDifferent strands of the fiscal literature have also taken varied approaches to shock identification. The three most popular are the timing approach, the narrative approach, and the sign restriction approach.Variations of the timing approach are used by, for example, Blanchard and Perotti (2002), Auerbach and Gorodnichenko (2012), and FMP. The timing approach entails imposing a restrictions such as, for example, government spending not responding to business cycle shocks within a quarter.The narrative approach uses government spending shocks or tax shocks constructed by examining historical announcements about changes in government spending and taxes unrelated to the business cycle or overall economic conditions. Ramey (2011), Owyang et al. (2013), Ramey and Zubairy (2018), Cloyne (2013), Romer and Romer (2010), and Jones et al. (2015), inter alia, use the narrative approach, sometimes combined with timing restrictions. However, many studies that use the narrative approach consider only military spending shocks or narrative measures limited to large consolidations. This means that many of the observations for the narrative shock series are equal to zero for a large part of the sample, which makes exploring state dependence challenging econometrically.The sign restriction approach defines the number of structural shocks of interest (which can be smaller than the number of variables in the VAR model) and restricts the sign of the response of variables over particular horizons. This approach is usually considered more agnostic than the timing approach because it effectively nests the timing restrictions. In a linear setting, Mountford and Uhlig (2009) find that deficit-financed tax cuts increase output more than deficit-financed increases in government spending. Candelon and Lieb (2013) extend the model to a nonlinear setting, and find that there is strong evidence of nonlinearity in the response of output to government spending shocks, but that the multipliers are always lower than one.Despite the rapidly growing fiscal literature, few studies have formally considered whether both the effects of government spending and taxes are nonlinear in a joint model or whether state dependence could imply sign asymmetry. Notably, a theoretical model with endogenous credit constraints would imply both state dependence and sign dependence. For example, in a model in the spirit of (McManus et al., 2018), a cut in transfers to impatient households that keeps the households constrained would have larger effects than an increase. Because our medium-scale model described in the next section embeds detailed information about fiscal and other macroeconomic variables, we are able to address a number of potential problems in identifying discretionary government spending and tax changes shocks separately, while evolving-regime generalized impulse responses presented in Section 5 allow us to consider the presence of sign asymmetry under state dependence.3A medium-scale TVAR model3.1Reduced-form model and estimation methodWe construct a TVAR model and consider different possible threshold variables. Let Ytdenote the vector containing the endogenous variables. The TVAR model splits the stochastic process for Ytinto two different regimes. Within each regime, the process for Ytis linear, but Ytcan evolve endogenously between regimes. Let qt−ddenote the threshold variable that determines the prevailing regime, where the integer d is the delay lag for a regime switch. If the threshold variable qt−dcrosses c at time t−d, the dynamics of the TVAR model change at time t. Defining an indicator function I[.] that equals 1 when qt−dexceeds the threshold c and 0 otherwise, the full model can be written in a single equation as(1)Yt=Φ01+Φ11(L)Yt−1+(Φ02+Φ12(L)Yt−1)I[qt−d>c]+εt.$${Y}_{t}={\text{{\Phi}}}_{0}^{1}+{\text{{\Phi}}}_{1}^{1}\left(L\right){Y}_{t-1}+\left({\text{{\Phi}}}_{0}^{2}+{\text{{\Phi}}}_{1}^{2}\left(L\right){Y}_{t-1}\right)I\left[{q}_{t-d}{ >}c\right]+{\varepsilon }_{t}.$$The dynamics of the system when qt−dis below c are given by Φ01${\text{{\Phi}}}_{0}^{1}$and the lag polynomial matrix Φ11(L)${\text{{\Phi}}}_{1}^{1}\left(L\right)$, and by Φ02${\text{{\Phi}}}_{0}^{2}$and the lag polynomial matrix Φ12(L)${\text{{\Phi}}}_{1}^{2}\left(L\right)$when qt−dis above c. The disturbances εt${\varepsilon }_{t}$are assumed to be nid with mean zero and variance-covariance matrix Σ that is assumed fixed across regimes.We show that our results are robust to relaxing this assumption in Supplementary Appendix B.For our benchmark specification, Ytincludes nine variables: log real federal consumption and investment spending, log real federal transfer payments to persons, log real federal interest payments on debt, log real transfer taxes, log of other tax revenues in real terms, log real GDP, a measure of slack, an interest rate (measured using the Federal Funds Rate or the Wu and Xia (2016), shadow rate during the zero-lower-bound period), and inflation (calculated using the GDP deflator). The sample period for the benchmark model is 1967Q1–2015Q4. All fiscal variables are converted to real terms using the GDP deflator, and all nominal series were obtained from NIPA-BEA.By focusing on federal variables only, we are able to trace out the impact on public debt, a variable of obvious interest in debates about fiscal policy. In particular, if the total federal debt at time t is Dt, thenDt=Dt−1+Gt+Gttransfer+Gtinterest−Tttransfer−Ttother$${D}_{t}={D}_{t-1}+{G}_{t}+{G}_{t}^{transfer}+{G}_{t}^{interest}-{T}_{t}^{transfer}-{T}_{t}^{other}$$and the debt-to-GDP ratio can be calculated asdt=dt−1*Yt−1/Yt+Gt+Gttransfer+Gtinterest−Tttransfer−TtotherYt,$${d}_{t}={d}_{t-1}\text{{\ast}}{Y}_{t-1}/{Y}_{t}+\frac{{G}_{t}+{G}_{t}^{transfer}+{G}_{t}^{interest}-{T}_{t}^{transfer}-{T}_{t}^{other}}{{Y}_{t}}\text{,}$$where dtis the ratio at time t.An alternative way to track debt would be to account for the evolution of interest rates and inflation and to directly account for debt in the VAR, as in Favero and Giavazzi (2012). The results for a model with debt in the VAR are discussed in Supplementary Appendix B.Most fiscal stimulus or austerity that involves discretionary changes in government spending is usually implemented by changes in government consumption or investment. Nonetheless, transfer and interest payments may have sizable effects on the debt-to-output ratio. Furthermore, government transfer payments to persons are strongly affected by the state of the business cycle and respond to movements in output. Changes in transfer payments are occasionally used as a fiscal policy tool (a notable example is the extension of unemployment benefits during the Great Recession), although most movements in transfer payments are likely to be endogenous.When we perform a variance decomposition for our benchmark model, we find that the bulk of the variation in transfer payments is explained by business cycle shocks (61% on impact, 91% after 8 quarters).Likewise, it is important to split taxes into two sub-components: transfer taxes, which depend on the state of the business cycle and are rarely used as a discretionary fiscal policy tool, and federal tax receipts net of transfer taxes (the federal equivalent of Blanchard and Perotti’s 2002, tax series).We estimate the parameters Φij${\text{{\Phi}}}_{i}^{j}$, the threshold c, the delay lag d, and the number of lags included in the TVAR model using Bayesian methods (technical details are provided in Appendix A). A Bayesian approach has two advantages in highly parametrized models such as the TVAR model. First, conventional frequentist tests can be severely underpowered. The Bayesian approach circumvents this problem by allowing us to directly compare the linear to the nonlinear model using marginal likelihoods. The marginal likelihoods are calculated based on Chib and Jeliazkov (2001) algorithm and models are evaluated using the implied Bayes factors. In addition, motivated by concerns described by Campolieti et al. (2014), we also report the expected posterior likelihoods and the highest posterior density for all of the models. Second, the impulse responses for the endogenously evolving system have nonstandard distributions that will be highly non-Gaussian and depend on the history and the size or sign of the shocks, even when the true values of parameters are known. The Bayesian sampler conveniently produces the entire posterior distribution for c, Φij${\text{{\Phi}}}_{i}^{j}$and Σ conditional on the data and the entire posterior distribution of the impulse responses.3.2Impulse responses and shock identificationThe main empirical questions we consider are whether the effects of government spending differ across regimes defined by economic slack and whether, conditional on any state dependence, austerity has effects that are significantly different from a mirrored effect of stimulus of the same magnitude. Rejecting linearity using Bayesian model comparison directly implies that at least one of the impulse responses to at least one identified structural shock is different across regimes. However, the nature and degree of this asymmetry can be evaluated only by looking at the impulse response functions themselves.The main impulse responses that we consider reflect, after appropriate conversion, the dollar-for-dollar responses of a variable of interest (for example output) to a one-time policy shock. However, these may provide an incomplete picture of the overall effects from a policy shock. If a researcher is interested in calculating multipliers in a more conventional sense, the cumulative change in output scaled by the cumulative change in government spending or taxes may be the more appropriate measure.We compute the cumulative multipliers as the cumulative impulse response of a outcome variable, like output, divided by a cumulative impulse response of a policy variable. In particular, multiplierhorizon=H=(Σt=1t=HIFR(Y,h))/(Σt=1t=HIFR(Fiscal,h))${multiplier}_{horizon=H}=\left({\text{{\Sigma}}}_{t=1}^{t=H}IFR\left(Y,h\right)\right)/\left({\text{{\Sigma}}}_{t=1}^{t=H}IFR\left(Fiscal,h\right)\right)$where IRF(Y,h) is the impulse response of the variable of interest to the fiscal policy shock at horizon h and IRF(Fiscal,h) is the impulse response of G or T to its own shock at horizon h.The dollar-for-dollar responses and the cumulative multipliers provide related, but slightly different pieces of information. In particular, the dollar-for-dollar responses address the question of how output responds today (or at some future horizon) to a policy change today and are close to what, for example, the Congressional Budget Office releases and is reported by the media when they estimate the effects of a policy at a given horizon (although the CBO also calculates and reports the cumulative net effects). For that reason, and because the literature is not unanimous about reporting cumulative or dollar-for-dollar responses, we consider both.In constructing impulse responses (or functions of the impulse responses, in the case of multipliers), the structural shocks need to be identified using a plausible orthogonal decomposition of the variance-covariance matrix Σ. When imposing sign restrictions, we take an approach that is similar in spirit to Mountford and Uhlig (2009). However, following recent developments in the time-series literature that show the penalty function approach used by Mountford and Uhlig (2009) may bias the impulse responses and lead to artificially narrow credibility intervals, we construct the impulse responses using the efficient sampler proposed by Arias et al. (2018).It is important to note that we also incorporate uncertainty about the threshold estimate. For the restricted linear version of the model, we use the Arias et al. (2018) algorithm directly. For the nonlinear version of the model, the priors are symmetric across regimes. In particular, the prior distributions are centered around the posterior means for the linear model. Figure A1 in Appendix A plots the impulse responses for the linear model, which correspond to the impulse responses implied by the priors for the nonlinear model. In Appendix A we also conduct an experiment where the priors for the nonlinear model are centered around zero to illustrate that the posteriors are not substantially affected by the informative priors.Our focus is on four structural shocks identified using sign restrictions on the impulse responses, summarized in Table 1.Table 1:Sign Identification.ResponseShockGTransPayIntPayTransTaxOtherTaxYslackiπG+++????+/????T????+++????BC????+++++++++??MP??????/− − −?+++− − −Question marks indicate that the sign is left unrestricted.The four identified shocks are a government spending shock, a tax shock, a “business cycle” shock, and a monetary policy shock. The first sign in each cell of Table 1 shows the assumed direction of the effect of a shock on the response variable on impact; the second and third signs in each cell are the assumed signs in the first and second quarter following the shock. All shocks are assumed to be orthogonal to one another, which differs from Mountford and Uhlig (2009), who do not impose the restriction that tax shocks are orthogonal to government spending shocks.Our results are robust to relaxing this assumption, as shown in Supplementary Appendix B.A positive business cycle shock is restricted to increase output, tax revenues, and the measure of slack on impact and for 2 quarters following the shock.“Slack” is conventionally defined as the difference between some measure of economic activity and its long run trend. Thus, large negative values imply there is a lot of slack in the economy. A positive business cycle shock would, therefore, increase capacity utilization or the output gap (i.e., making the gap less negative or more positive). However, in the cases where the unemployment rate is used as a measure of slack, we reverse the sign of the restricted responses. Also, note that when we consider the responses of consumption and investment in Section 5, we assume that a positive business cycle shock increases consumption and investment. As in Mountford and Uhlig (2009), a positive business cycle shock in this context could be consistent with either demand or supply shocks, and we remain agnostic about the ultimate source of business cycle fluctuations. Note that we do not impose any restrictions on the responses of the interest rate to a business cycle shock, although we find that the posterior responses indicate that the interest rate also increases in response to a positive business cycle shock. These results are available upon request.Meanwhile, a positive monetary policy shock is specified to increase the interest rate contemporaneously and for the subsequent two quarters, while decreasing inflation on impact and for the subsequent two quarters. That is, a “positive” monetary shock is contractionary in the sense of having a disinflationary and negative liquidity effect. However, because there is conflicting evidence from the monetary policy literature (see, for example, Lo and Piger (2005), Alpanda and Zubairy (2019)) about whether the responses of output to monetary policy can vary and possibly be insignificant at some points of the business cycle, we do not impose the restriction that output falls in response to a contractionary monetary policy shock, although our main results do not change when we impose this restriction. A positive tax shock is assumed to increase tax revenues contemporaneously and for two quarters following the shock. Similarly, a positive government spending shock increases government consumption and investment contemporaneously and for two quarters following the shock.We also consider an alternative identification scheme where the restrictions are imposed for four quarters and a restriction scheme where transfer payment are countercyclical. The responses look very similar to the responses presented in the next sections and are available upon request.Following previous results from the fiscal literature, we also impose the restriction that output increases on impact in response to a positive government spending shock (see, for example, FMP and Auerbach and Gorodnichenko 2013) and that exogenous tax increases decrease output on impact (see, for example, Romer and Romer 2010). Even studies that find no evidence of state dependence or studies that find that government spending multipliers decline sharply after the first quarter find positive multipliers on impact (for example, see Ramey and Zubairy 2018). The responses of output are then left unrestricted after impact.We make an assumption about the response of output on impact in order to speed up the Bayesian estimation, including the calculation of marginal likelihoods. However, the estimates presented in the next sections do not hinge upon this assumption. When the response of output to government spending or taxes is left unrestricted, almost all of the posterior distribution of the response of output to a positive government spending shock or a negative tax shock is above zero at horizon zero. Therefore, the slightly more restrictive prior is supported by the data. The full set of responses for different identification schemes is available upon request.The responses to negative shocks are restricted to have the opposite signs to those shown in Table 1. In the case where we consider the evolving-state impulse responses, the responses are constructed assuming that the economy evolves endogenously from one regime to another, with an orthogonalization accepted if the sign restrictions hold for two quarters even if the economy evolves from one regime to another. The technical details of the impulse response calculation are discussed in Appendix A.4Evidence for nonlinearity and state dependenceBecause the main goal of this paper is to explore when different types of discretionary fiscal policy are effective in the presence of state dependence, we first need to establish what evidence there is for nonlinearity when considering a large enough model to ensure informational sufficiency. To do this, we consider the choice of threshold variable, assess the evidence of state dependence in the dollar-for-dollar and cumulative multipliers, and then perform formal tests to demonstrate that our shocks can be considered “structural” in the sense they are orthogonal both to survey forecasts and to information from other macroeconomic variables.4.1Choice of threshold variableTwo key issues complicate the choice of threshold variable. First, any proposed measure of slack may not accurately capture the true degree of under (or over) utilization of resources in the economy. Second, economic slack may not actually be what triggers nonlinear responses of output to fiscal policy.A full discussion of other potential econometric issues when selecting threshold variables, such as breaks and stationarity issues is included in Supplementary Appendix B. Supplementary Appendix B also includes checks that our results are robust with respect to different measures of slack, as well as to different subsamples and alternative identification schemes.4.1.1Measures of slackEven focusing on the output gap (i. e., the difference between actual and potential log real GDP) as a measure of slack, large discrepancies arise when using different models to estimate the output gap (see, inter alia, Morley and Piger 2012; Morley and Panovska 2019; Perron and Wada 2016). To address this model uncertainty, the measure of slack that we use in our benchmark TVAR model is the model-averaged output gap (MAOG) from Morley and Panovska (2019).For this paper, we re-calculate the MAOG using the full available data sample for US real GDP and treat the resulting output gap as data in our TVAR model.The MAOG is calculated using equal weights on estimated output gaps from a large set of linear and nonlinear time series models (we refer the reader to the original study for technical details). Morley and Panovska (2019) show the MAOG approach performs very well in matching business cycle dates and correspondence to narrower measures of slack, not just for the US, but for a large group of OECD countries.We note that the extant nonlinear fiscal spending multiplier literature has used many different observed variables as potential proxies of slack. For example, in FMP, we considered capacity utilization. Meanwhile, a large number of studies use the CBO output gap (see, for example Baum et al. 2012). Auerbach and Gorodnichenko (2012, 2013 use various combinations of moving averages of output growth rates, whereas Ramey and Zubairy (2018) use the unemployment rate. Given this variety of slack measures, two immediate questions arise: First, which measure of slack drives possible nonlinearity in our medium-scale TVAR model? Second, which measure of slack is the “right” measure when modeling the macroeconomy as a whole? To address these questions, we consider three sets of models. The first set is based on our benchmark specification, but using different measures of slack in the VAR and as a threshold variable. This set of models helps us assess which measure of slack drives the nonlinearity in our model. The second set of models also covers different measures of slack, but corresponds to a smaller VAR that excludes fiscal instruments and only includes output, inflation, interest rates, and the measure of slack. The third set of models also covers different measures of slack, but corresponds to a small-scale fiscal model that includes federal consumption and investment, federal revenues, output, inflation, interest rates, and the measure of slack.For the three sets of models, Table 2 reports the various threshold estimates, and different measures of fit, including the log marginal likelihood, across different measures of slack in the VAR and as threshold variables.All threshold variables are adjusted for structural breaks. Details are provided in Supplementary Appendix B.The implied Bayes factors strongly favor the TVAR model over the linear counterpart in all cases. This result is particularly notable for the benchmark medium-scale model because evidence of nonlinearity for the smaller models could have been due to omitted variables included in our larger model. Meanwhile, the MAOG is the preferred measure of slack in almost every case, both for the benchmark model and for the smaller models. Furthermore, while the threshold estimates for some of the other variables change across different models, the estimated thresholds for the MAOG as the threshold variable are fairly robust across the different specifications. Taken together, these results suggest that the MAOG is a good measure of economic slack and driver of nonlinearities in macroeconomic dynamics.Table 2:Model comparison: Linear versus nonlinear models with different measures of slack.Benchmark VAR: measure of slack in the VARThreshold variablecapacity utilizationunemployment rateCBOgapMAOGLinear (none)−2670.03−2525.25−2430.87−2394.05−2676.38−2523.81−2429.42−2389.91−822.73−507.72−506.87−833.73capacity utilization−2238.54−2137.32−2028.08−1938.70−2239.00−1.35−2029.01−2.00−2029.47−0.30−1938.02 −1.33−339.26 (−1.66, −1.04)−339.26(−2.63, −0.28)−344.21(−1.79, 0.23)−568.77(−1.82, −0.91)unemployment rate−2294.43−2152.21−1764.52−1958.42 0.13−2293.77 0.59−2151.00 0.06−1761.30 0.88−1958.63(−0.05, 0.38)−339.26(−0.06, 0.73)−362.84(−0.29, 0.38)−341.39(0.06, 1.04)−542.63CBOgap−2231.54−2089.74−1597.20−1916.56−2231.70−1.64−2091.77−1.66−1598.00−2.00−1918.22−1.64−303.37(−1.95, −0.84)−329.92(−1.95, −0.66)−312.61(−2.16, −1.30)−487.05(−1.94, −1.06)MAOG−1937.81−2085.69−1579.30−1873.11−1937.00−0.69−2084.27 −0.74−1578.33 −0.71−1870.00−0.74−269.66(−0.74, −0.52)−329.92(−0.80, −0.39)−309.65(−0.82, −0.39)−450.37 (−0.86, −0.51)Var with output, inflation, interest rates, and a measure of slack: measure of slack in the VARThreshold variablecapacity utilizationunemployment rateCBOgapMAOGLinear (none)−667.09−510.15−442.52−365.37−664.22−509.23−441.52−364.22−194.27−162.52−405.22−240.38capacity utilization−569.70−423.05−339.42−247.23−567.99−3.03−422.99−3.03−339.40−3.02−244.22−3.03−84.37(−4.84, −0.86)−64.22(−3.51, −0.82)−110.95(−3.42. −1.52)−91.11(−3.38, −0.47)unempoyment rate−592.15−428.39−94.83−274.90−592.910.74−426.220.73−92.111.20−276.210.73−109.22(−0.04, 1.03)−62.19(−0.07, 0.87)−63.59(−0.05.1.39)−90.23(−0.06, 0.89)CBOgap−585.11−420.45−237.37−268.55−584.00−1.64−420.40−0.69−235.22−2.17−266.211−0.41−111.96(−1.96, −0.22)−54.11(−2.71, −0.21)−89.65(−2.42, −0.41)−80.29(−1.96, 0.14)MAOG−569.55−408.14−50.10−211.84−567.01−1.02−407.22−1.03−49.99−1.49−210.22−1.21−84.62(−1.34, −0.02)−44.07(−1.34, −0.60)−35.09(−1.51, −0.51)−58.57(−1.34, −0.95)Var with federal spending, federal revenues, output, inflation, interest rates, and a measure of slack: measure of slack in the VARThreshold variablecapacity utilizationunemployment rateCBOgapMAOGLinear (none)−1490.90−1335.36−1243.82−1196.54−1488.20−1333.00−1242.01−1194.24−433.76−459.44−466.73−363.01Capacity Utilization−1266.93−1118.90−11036.46−936.63−1262.90−3.00−1118.00−2.90−1134.00−3.01−930.10−3.13−182.07(−3.50, −0.89)−139.18(−3.52, −1.71)−174.33(−3.45, −2.52)−317.02(−3.47, −0.94)unemployment rate−1336.69−1185.00−772.61−1028.60−1334.920.79−1184.500.80−771.511.09−1022.510.43−176.22(−0.16, 0.99)−223.16(−0.00, 1.08)−223.16(0.09, 1.21)−261.95(−0.07, 0.96)CBOgap−1296.73−1147.56−917.49−990.20−1292.00−1.91−1148.00−1.56−915.00−1.86−986.02−1.64−170.02(−2.01, −1.01)−128.33(−2.40, −0.72)−374.00(−2.41, −1.05)−320.58(−1.95, 0.76)MAOG−1253.99−1110.35−613.42−942.67−1253.80−1.16−1109.00−1.02−612.00−1.11−942.02−0.74−161.22(−1.51, −0.36)−104.56(−2.22, −0.37)−99.52(−1.25, −0.55)−263.00(−2.01, −0.31)Each cell reports the log likelihood obtained from maximum likelihood estimation, the expected posterior log likelihood obtained Bayesian estimation, and the log marginal likelihood (top, middle, bottom). The second entry is the threshold estimate, including 90% credibility intervals, obtained from the posterior Bayesian distribution. The best model fit for each measure of slack is reported in bold.4.1.2Slack versus alternative threshold variablesTo account for the possibility that nonlinearity could actually be driven not so much by the degree of slack in the economy, but more by the direction of change in economic activity or fiscal policy, we also consider the following possible threshold variables: a 4-quarter moving average change in log output, a 4-quarter moving average change in the log of government spending and consumption, and a 4-quarter moving average change in log tax revenues (net of transfer taxes).We also considered longer moving averages of output growth, as in Auerbach and Gorodnichenko (2012). Results were similar to those for the 4-quarter moving average and are available upon request.For completeness, we also consider the level of the ex-ante real interest rate (based on static expectations) as a threshold variable to allow for the possibility that any asymmetry is related more to the stance of monetary policy rather than fiscal policy. Figure 1 plots all of the possible threshold variables that relate to economic slack, growth rates, and policy changes. The left panels display the measures of slack and the right panels display the additional threshold variables related to growth rates and policy changes.Figure 1:Threshold variables and estimated thresholds.Recent developments in the fiscal and monetary literature have also indicated that household debt or household debt overhang could be a potential channel for nonlinear transmission of policy (Alpanda and Zubairy 2019; Bernadini and Peersman 2018; Klein 2017). Furthermore, the empirical literature has also related the effectiveness of fiscal policy to the level of the government debt (see, inter alia, the now highly controversial study by Reinhard and Rogoff (2009)). Thus, we also consider household and Federal debt-output ratios as potential threshold variables. Figure 2 plots the debt levels and the debt overhang. As shown in the figure, both debt ratios are clearly nonstationary. For the sake of direct comparison with previous studies on federal debt, we consider using the ratios in levels (while fully acknowledging that this could be problematic and could lead to incorrect inference, as discussed in detail in Supplementary Appendix B). We also consider their overhang levels, which are stationary. Following the previous literature (inter alia Klein 2017 or Alpanda and Zubairy 2019), overhang is defined as the difference between the ratio and its long run trend, where trend in this case is estimated using the low frequency output of the HP filter with λ=104$\lambda ={10}^{4}$.Figure 2:Household and federal debt-to-GDP ratios.The threshold variables are adjusted for any structural breaks. The estimated threshold (median) and 90% credibility intervals from Tables 2 and 3 are also displayed.Table 3:Model comparison: slack versus growth versus debt as threshold variables in a TVAR model with MAOG as the measure of slack.Threshold variableMeasure of slack in the VAR modelMAOGLinear model (none)−2394.05−2389.91−833.73Moving Average Output Growth−1879.45−1.51−1878.00(−1.67, −0.89)−454.37Moving Average Government Spending Growth−1966.62 3.93−1962.79(1.06, 4.45)−503.65Moving Average Taxes Growth−1924.22 −3.31−1923.16(−4.79, 1.34)−490.12Real Interest Rate−1953.92 −0.21−1953.00(−0.62, 0.87)−515.22Household Debt Ratio−1958.2547.25−1956.22(n/a)−600.22 Federal Debt Ratio−1992.85 34.58−1990.11(n/a)−751.00Household Debt Overhang−1995.11−0.75−1994.22(−0.42, 0.85)−671.00Federal Debt Overhang−1939.31−1.385−1933.22(−2.02, 2.10)−688.95MAOG−1873.11−0.74−1870.00(−0.86, −0.51)−450.37 Each cell reports the log likelihood obtained using maximum likelihood estimation, the expected posterior log likelihood obtained Bayesian estimation, and the log marginal likelihood (top, middle, bottom). The second entry is the threshold estimate, including 90% credibility intervals, obtained from the posterior Bayesian distribution. The growth rate threshold variables are 4-quarter moving averages of log differences. The real interest rate is an ex ante measure given static expectations. The best model fit is reported in bold.The trends are estimated using an HP filter with λ equal to 104. The estimated threshold (median) and 90% credibility intervals from Tables 3 and 4 are also displayed.Table 4:Output multipliers: cumulative scaled responses at selected horizons.HorizonGovernment spendingTaxesExcess slackClose to potentialExcess slackClose to potential1 year1.151.24−6.24−2.222 years1.210.72−5.37−3.193 years1.230.38−4.51−3.684 years1.180.28−4.28−4.075 years1.150.26−4.50−4.33Table 3 reports the results of a model comparison for a specification where slack (namely the MAOG) is used as the threshold variable versus specifications with policy or debt threshold variables. Similar to Table 2, the MAOG is strongly preferred as the threshold variable. Looking back at Figure 1, the threshold estimates for all measures of slack split the sample up into periods of excess slack (recessions and their immediate recoveries) and “normal” times when the economy is closer to potential. Related to this delineation of the sample and, as can be seen by comparing Tables 2 and 3, any of the slack variables is strongly preferred as a threshold variable compared to the policy variables, while the MAOG is preferred over the moving average of output growth, even though both identify similar dates for the regimes, as shown in Figure 1. The models in which debt ratios are included as threshold variables in levels perform worse than any of the models that include a measure of slack as a potential threshold variable. Furthermore, the estimates are quite imprecise when considering policy measures as threshold variables. In the case of debt overhang, the nonlinear model outperforms the linear model, but, again, all model selection criteria strongly prefer the model with the MAOG as the threshold variable. Therefore, in our remaining analysis, we focus on the model with the MAOG as a measure of slack and as a threshold variable.However, to ensure that our results are not driven solely by a particular choice of the measure of slack, we also consider the robustness of responses using the other measures of slack. The results for this robustness analysis are reported in Supplementary Appendix B.4.2Fixed-state responsesBefore considering evolving-regime responses to both government spending and tax shocks in the next section, we first establish that our identified government spending and tax shocks have state-dependent effects. Figure 3 plots the dollar-for-dollar responses and Table 4 summarizes the responses for the cumulative multipliers at select horizons.It is important to note that both the dollar-for-dollar responses and the cumulative multipliers are highly nonlinear. They peak at different horizons, and the credibility intervals are not symmetric around the median or the highest posterior density response. For convenience, we provide summary tables with the estimated cumulative multipliers for various outcome variables (Table 4 presents the responses of output and Table 6 in Section 6 presents the responses of consumption and investment) at select horizons. The companion graphs that show the full extent of the nonlinearities for the cumulative multipliers by plotting them with their 90% intervals at all horizons are provided in Supplementary Appendix B.The left panels in Figure 3 display the fixed-regime responses for the “excess slack” regime (defined by the estimated threshold), the middle panels display fixed-regime responses when the economy is close to potential, and the right panels display the posterior differences for the responses between the two regimes. Both median responses and 90% credibility intervals are reported.The dollar-for-dollar responses are converted from log-to-log responses using the average Gt/Ytratios for the corresponding regimes for each draw of the threshold parameter. The Gt/Ytratios vary considerably over time, which could potentially affect the size of the multiplier when converting log-to-log responses to dollar-for-dollar responses. In this section, we report the fixed-regime responses using the average ratios to illustrate the pattern of nonlinearity. When we construct the history-dependent evolving-regime responses in the next section, we address this issue directly and all of the responses are converted using the Gt/Ytratio at each point in time. However, it is worth noting that the responses in Figure 3 are comparable in magnitude and in shape to the responses that would be obtained using the Gt/Ytratio for each point in time.Figure 3:Dollar-for-dollar effects of government spending and taxes on output.In the excess slack regime, output responds with a large and persistent increase to a positive shock to government spending. By contrast, when the economy is close to potential, an increase in government spending temporarily increases output on impact, but the response dies out and becomes negative after two years. Tax cuts have similar state dependence. Flipping the sign of the displayed response to a tax hike, the fixed-regime impulse responses suggest that a tax cut would increase output in both regimes. A tax cut in the excess slack regime increases output by $2 (dollar for dollar) and is significant for 13 quarters, whereas the response is smaller and dies out after 7 quarters when the economy is close to potential, peaking at $1.3 and becoming insignificant after 7 quarters.The impulse responses for the linear VAR model for the benchmark case when the MAOG is used as a measure of slack are reported in Appendix A. These responses correspond to the responses implied by the prior in each regime for the nonlinear model. As pointed out by (Arias et al., 2018), the impulse responses obtained using their approach have much wider credibility bands than the responses reported by Mountford and Uhlig (2009). However, when we allow for nonlinearity, the impulse responses are more precisely estimated, implying that some of the uncertainty about the linear responses could be due to state dependence that is not accounted for. Furthermore, the posterior distribution for the excess slack regime looks substantially different from the posterior distribution for the linear model, thus indicating that the findings in favor of state dependence are not driven by our choice of priors.The posterior differences in the right panels make it clear that the state dependence is significant. Meanwhile, the cumulative tax multipliers in Table 4 are larger than the cumulative spending multipliers, especially at longer horizons, with both spending and tax multipliers exhibiting clear state dependence. The spending multipliers are larger and more persistent in the excess slack regime than when the economy is close to potential. The spending multipliers peak early when the economy is close to potential, and start declining after less than a year. Similarly, the tax multipliers are larger in magnitude in the excess slack regime than when the economy is close to potential.The figure displays the fixed-regime responses of output to a government spending shock (first row) and a tax shock (second row) in the excess slack regime (left column) and close to potential regime (middle column) for the benchmark model. The right column plots the posterior differences in responses across the two regimes. Posterior medians and 90% credibility intervals are reported.4.3Informational sufficiencyThe impulse responses in Figure 3 are presented under the assumption that the VAR model includes sufficient information to correctly identify the fiscal shocks. However, if shocks are correlated with other information available to economic agents that is not included in the VAR model, the estimated impulse responses can be biased. For example, Ramey (2011) shows possible “fiscal foresight” about shocks is a problem for a small Blanchard-Perotti type VAR model for which the shocks identified from the VAR model can be Granger-caused by forecasts of those shocks and are, therefore, likely to be anticipated by economic agents.We follow Forni and Gambetti (2014) to assess whether the shocks from our TVAR model are unanticipated.We have also benchmarked the shocks from our model against commonly-used narrative measures. The spending shocks pick up most of Ramey (2011) military spending news shocks and our tax shocks most of Romer and Romer (2010) unanticipated tax shocks. Our results are also robust to an alternative identification scheme that accounts for fiscal foresight following Mountford and Uhlig (2009). Both sets of results are available upon request.We regress the structural shocks for government spending, in our case from each iteration of the Bayesian sampler for our benchmark model, on the Survey of Professional Forecasters’ forecasts of Real Federal Government Consumption Expenditures and Gross Investment in the subsequent four quarters, calculated based on mean responses and taking into account compounding. Our assumption with this test is that the SPF forecasters aggregate relevant information about the anticipated component of government spending. Meanwhile, because the residuals for a Bayesian VAR model may not necessarily be orthogonal to the VAR information set, we also perform a Granger causality test where we regress the structural shocks on a model that includes both the right-hand-side variables from Equation (1) and the SPF forecasts. In either case, we are unable to reject the null of information sufficiency. For the case where we do not include the conditioning variables, the p-values range from [0.21, 0.98] across different draws of the sampler, with the median p-value being 0.56. For the case where we include the conditioning variables, the p-values range from [0.47, 1.00], with the median value being 0.83. Therefore, our medium-scale TVAR model appears informationally sufficient when identifying government spending shocks.We also conduct orthogonality tests by checking whether either the spending or tax shocks are correlated with lags of principle components extracted from a large macroeconomic dataset that proxies for information available to economic agents. In particular, we use the FRED MD Stock and Watson dataset (see McCracken and Ng 2016 for details).To obtain the principal components, we convert the data to quarterly frequency, drop series that are not available for the period 1967–2015, and use growth rates for series that are nonstationary. We take a conservative approach and include up to seven components in our orthogonality tests.Different conventional tests indicate that the relevant number of principal components for the dataset is between 1 and 7. Table 5 reports the results for 2 lags of each principal component at a time and for 2 lags of all seven principal components at the same time. We cannot reject the null of orthogonality for any of the principal components. The p-values are especially large for the TVAR models. This result is consistent with Forni and Gambetti (2016), who show that, while a small fiscal VAR model is insufficient to identify fiscal shocks, a larger VAR model that includes forward-looking variables such as inflation and interest rates (and exchange rates in their case) is sufficient. Therefore, our shocks appear to be “structural” in the sense that they are not correlated with other information at time t about the macroeconomy and are thus possibly also “structural” in the conventional SVAR sense.Table 5:Orthogonality tests.ShocksPC1PC2PC3PC4PC5PC6PC7PC1-7G Linear VAR0.8720.9530.9350.6200.3540.7850.1620.919T Linear VAR0.8540.9970.8610.1220.2250.2750.0900.181G TVAR0.5330.4320.1960.6980.8980.3270.4560.881T TVAR0.5620.2670.7900.8510.1450.5960.8990.736Each cell reports the p-value for an F-test where the null is that the highest posterior density estimates of shocks are orthogonal to the lags of the principal components.5Evolving-regime impulse response analysisThe responses reported in Figure 3 and in Table 4 embed three different sources of uncertainty: uncertainty about the threshold estimate, uncertainty about the TVAR parameters, and uncertainty about the orthogonalization matrix that identifies the shocks. Even though we account for all of these different sources of uncertainty and we use 90% credibility bands, which are conservative in the fiscal literature, there is clear evidence of state dependence in the responses of output to fiscal policy. Notably, the posterior differences in the right columns of Figure 3 are large in magnitude and are highly likely to be different from zero. The cumulative multipliers exhibit similar state dependence.In this section, we turn to exploring implications of state dependence in more realistic scenarios for fiscal policy spending and tax shocks in which the economy is allowed to evolve endogenously from one regime to another. This approach allows us to consider possible sign and size asymmetries and to determine when discretionary fiscal policy is comparatively more or less effective. While the fixed-regime responses are useful for testing state dependence across regimes, the responses within a regime are be linear by construction – i.e., they are proportional to the size and sign of a shock. However, if the economy is allowed to evolve across regimes, threshold models allow (but do not impose) the possibility that negative shocks can have different proportional effects for positive shocks or that shocks of different magnitudes have non-proportional effects.The evolving-regime analysis requires specification of the history of the economy prior to the shock because the effects of the shock will depend on the system’s proximity to the threshold. For our generalized impulse response calculations, we focus on three particular histories of interest from a policy perspective: a strong expansion, a deep recession, and a sluggish recovery, defined as follows:–1996Q1: a robust expansion, when the economy is usually classified as being above or close to potential according to our various threshold variables and threshold estimates;–2008Q3: a deep recession, when the economy is clearly classified as being in the excess slack regime;–2012–2014: a sluggish recovery, when the economy is close to the estimated threshold for at least some of our threshold variables and threshold estimates.For each history, we calculate the responses to an increase in government spending and taxes and to decreases in government spending and taxes.Averaging over other similar histories of robust expansions, deep recessions, or sluggish recoveries produces very similar results to considering these specific dates.The shocks are scaled to 1% of GDP and 3% of GDP to consider possible size asymmetries. Sign restrictions are simply reversed for negative shocks. To address the computational burden when calculating the generalized impulse response functions, we abstract from the parameter uncertainty and fix the parameters at their highest posterior density values, although we know from fixed-regime responses in the previous section and as noted above that there is evidence of state dependence even when taking this parameter uncertainty into account.Figure 4 plots the responses of output to changes in government spending and taxes when the economy starts in robust expansion and Figure 5 plots the responses of output when the economy starts in a deep recession. In both cases, a shock scaled to 1% of GDP is considered. The top panels of the figures display the responses to government spending, the bottom panels plot the responses to tax changes. The left column displays the responses to positive shocks (higher government spending or higher taxes), the middle panel displays the response to a negative shock (scaled by −1 for ease of comparison), and the right panel displays the difference between the scaled response to a contractionary shock and the response to an expansionary shock.Figure 4:Sign-dependent effects of government spending and taxes on output in a robust expansion.Figure 5:Sign-dependent effects of government spending and taxes on output in a deep recession.The results in Figure 4 show that contractionary shocks (i.e., cuts in government spending or tax increases) have somewhat larger effects, on average, than expansionary shocks when the economy starts in a robust expansion. However, the difference is economically small and not significant. Tax cuts appear to be more efficient at stimulating output than increases in government spending ($1.7 vs. $0.6 after one year), which is consistent with Mountford and Uhlig (2009). The magnitude of the peak responses to tax shocks is also in line with, for example, the responses obtained by Romer and Romer (2010). However, our results for tax increases stand in contrast to the findings of Jones et al. (2015), who find that tax increases do not affect output, but decreases have a strong positive effect. Our results, by contrast, indicate that tax increases have a strong contractionary effect on output across the business cycle.Meanwhile, as shown in Figure 5, the effects of contractionary shocks are more persistent and larger than the effects of expansionary shocks when the economy starts in a deep recession. Cuts in government spending decrease output by $1.7 after 9 quarters. Tax increases decrease output by almost $3 after 10 quarters. The responses to stimulative shocks are smaller than the responses to austerity shocks. Both increases in taxes and decreases in government spending significantly decrease output (the response is different from zero at all horizons for tax increases and for two years for spending cuts). These results indicate that, if the aim of discretionary policy is to stimulate the economy, either government spending or tax cuts could be used in deep recessions, but tax cuts should be used when the economy is in a robust expansion.Size and sign asymmetries might be particularly relevant in a sluggish recovery when the economy is close to the threshold and different shocks can influence the probability of crossing it. Figure 6 plots the dollar-for-dollar responses of output to “small” (1% of GDP) and “large” (3% of GDP) shocks, both positive and negative, when the economy starts in a sluggish recovery. Figure 7 then plots posterior differences between responses to positive and negative shocks, responses to large positive and large negative shocks, responses to small and large negative shocks, and responses to small and large positive shocks.Figure 6:Sign-dependent and size-dependent effects of government spending and taxes on output in a sluggish recovery.Figure 7:Differences in sign and size effects of government spending and taxes on output in a sluggish recovery.The figure displays the evolving-regime responses of output to a government spending shock (first row) and a tax shock (second row) for the benchmark model in a robust expansion. The left columns plot the responses to a positive shock, the middle column plots the response to a negative shock (scaled by −1 for ease of comparison), and the right column plots the difference in magnitude of (scaled) responses for positive and negative shocks. The shocks are equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.The figure displays the evolving-regime responses of output to a government spending shock (first row) and a tax shock (second row) for the benchmark model in a deep recession. The left columns plot the responses to a positive shock, the middle column plots the scaled (by −1 for ease of comparison) response to a negative shock, and the right column plots the difference in magnitude of the scaled responses for positive and negative shocks. The shocks are equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.Figures 6 and 7 illustrate that the responses to contractionary shocks are larger than the responses to expansionary shocks. This is particularly pronounced when we consider large shocks. Large contractionary shocks have very persistent effects. By contrast, large expansionary shocks have positive effects in the short run that quickly die out as the economy gets closer to potential. The responses to large expansionary shocks (positive government spending shocks or negative tax shocks) are proportionally smaller than the responses to smaller expansionary shocks. In particular, when the economy is above the threshold, the dollar-for-dollar responses are dampened. By contrast, the responses to different sized cuts in government spending or tax increases are proportionally more similar.We also use evolving-regime impulse responses to trace out the effects of different shocks on government debt. While contractionary shocks have stronger effects on output, dollar for dollar, than expansionary shocks, the difference for government spending is much smaller in a robust expansion. This implies both that austerity could sometimes be self-defeating and that different fiscal policies of the same magnitude could have different effects on the debt-to-GDP ratio. Figure 8 plots the effects of stimulus on the debt-to-GDP ratio, while Figure 9 plots the effects of austerity. The top rows display the responses to a change in government spending and the bottom rows display the responses to tax changes. The left columns display the evolving responses when the economy starts in a deep recession, the middle columns display the responses when the economy starts in a robust expansion, and the right columns display the posterior differences.Figure 8:State-dependent effects of fiscal stimulus on the debt-to-GDP ratio with evolving regimes.Figure 9:State-dependent effects of austerity on the debt-to-GDP ratio with evolving regimes.Figure 8 shows that a decrease in taxes immediately raises the debt-to-GDP ratio and this increase is significant, albeit temporary. Because the increase in output is larger when taxes are cut in a deep recession, the rise in the debt-to-GDP ratio from a tax cut is significantly smaller than in a robust expansion.As Figure 9 shows, even though there is a substantial amount of uncertainty associated with the responses of the debt-to-GDP ratio to cuts in government spending, the posterior differences indicate that, at medium horizons at least, government spending cuts implemented in a robust expansion decrease the debt-to-GDP ratio more than government spending cuts implemented in a deep recession. The posterior difference peaks at 0.5% of GDP at intermediate horizons (Quarter 6 through Quarter 12). Similarly, increases in taxes reduce the debt-to-GDP ratio more if implemented in a robust expansion, but the difference is most pronounced at short horizons.The figure displays evolving-regime responses of output to a government spending shock (first row) and a tax shock (second row) for the benchmark model in a sluggish recovery. The first column plots the responses to a positive shock equal to 1% of GDP, the second column plots responses to a positive shock equal to 3% of GDP, scaled by 1/3 for ease of comparison. The third column plots responses to a negative shock equal to 1% of GDP (scaled by – 1), and the fourth column plots responses to a negative shock equal to 3% of GDP (scaled by – 1/3). Posterior medians and 90% credibility intervals are reported.The figure displays the differences in evolving-regime responses of output to a government spending shock (first row) and a tax shock (second row) for the benchmark model in a sluggish recovery. The first column plots the difference in magnitude of (scaled) responses for positive and negative shocks equal to 1% of GDP, the second column plots the difference in magnitude of (scaled) responses for positive and negative shocks equal to 3% of GDP, the third column plots the difference in magnitude of (scaled) responses for a negative shock equal to 3% of GDP and a negative shock equal to 1% of GDP, and the fourth column plots the difference in magnitude of (scaled) responses for a positive shock equal to 3% of GDP and a positive shock equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.The figure displays the evolving-regime responses of the debt-to-GDP ratio to a positive government spending shock (first row) and a negative tax shock (second row) for the benchmark model in a deep recession (left column) and in a robust expansion (middle column). The right column plots the difference between the responses in a deep recession and a robust expansion. The shocks are equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.The figure displays the evolving-regime responses of the debt-to-GDP ratio to a negative government spending shock (first row) and a positive tax shock (second row) for the benchmark model in a deep recession (left column) and in a robust expansion (middle column). The right column plots the difference between the responses in a deep recession and a robust expansion. The shocks are equal to 1% of GDP. Posterior medians and 90% credibility intervals are reported.6Responses of consumption and investmentIn this section, we disaggregate the responses of output by considering the separate responses of consumption and investment to government spending and tax shocks. This allows us to consider different channels through which policy effects propagate. The impulse responses are calculated following the same approach used to calculate the impulse responses for output, although the impact responses of consumption and investment to fiscal shocks are intentionally left unrestricted as there is a less clear consensus on the underlying source of the response on output than there is about the overall response.Figures 10 and 11 plot the fixed-regime responses of consumption and investment, respectively, while Table 6 summarizes the results for the cumulative multipliers at select horizons.Evolving-regime responses behave as would be expected given the fixed-regime responses and are not reported to conserve space.There is strong evidence in Figure 10 for state dependence in the response of consumption to government spending shocks, which is also confirmed in the cumulative multipliers in Table 6. A clear implication of this result is that consumption drives much of the state dependence in the response of output documented previously. In particular, consumption responds strongly in the excess slack regime, while the response is much smaller, although still significant, when the economy is closer to potential. Similarly, the response of consumption to tax shocks is stronger in the excess slack regime. The response pattern of consumption to government spending shocks is consistent with theoretical models that incorporate a time-varying share of rule-of-thumb consumers, models featuring habit formation in which government spending and consumption are complements, as in Leeper et al. (2017) and the endogenous credit constraint model by McManus et al. (2018).Figure 10:Dollar-for-dollar effects of government spending and taxes on consumption.Figure 11:Dollar-for-dollar effects of government spending and taxes on investment.Table 6:Consumption and investment multipliers: cumulative scaled responses at selected horizons.HorizonGovernment spendingTaxesExcess slackClose to potentialExcess slackClose to potentialConsumption1 year0.670.43−0.80−0.302 years0.990.36−1.19−0.803 years1.290.32−1.44−0.904 years1.490.32−0.29−0.845 years1.520.32−0.12−0.83Investment1 year0.03−0.16−1.61−0.252 years−0.02−0.22−1.30−0.513 years−0.09−0.25−0.39−0.304 years−0.13−0.27−0.14−0.365 years−0.14−0.28−0.23−0.41There is no evidence in Figure 11 of state dependence in the response of investment to government spending shocks, again confirmed in Table 6. However, there is evidence that investment responds very differently to government spending and tax shocks, dollar for dollar, and some evidence of state dependence in the responses of investment to tax shocks. Investment does not increase significantly in response to government spending shocks, but it responds significantly to tax changes. Thus, the responses of output to discretionary changes in taxes appears to be driven by both consumption and investment.The figure displays the fixed-regime responses of output to a government spending shock (first row) and a tax shock (second row) in the excess slack regime (left column) and close to potential regime (middle column) for the benchmark model. The right column plots the posterior differences in responses across the two regimes. Posterior medians and 90% credibility intervals are reported.The figure displays the fixed-regime responses of output to a government spending shock (first row) and a tax shock (second row) in the excess slack regime (left column) and close to potential regime (middle column) for the benchmark model. The right column plots the posterior differences in responses across the two regimes. Posterior medians and 90% credibility intervals are reported.7ConclusionsOur analysis has considered when discretionary changes in government spending and taxes are comparatively more or less effective. We have found strong and robust empirical evidence in favor of nonlinearity and state dependence in the relationship between both types of fiscal policy and aggregate output. In particular, estimates from a threshold structural vector autoregressive model imply different responses of the economy both to government spending and to taxes during periods of excess slack compared to when the economy is closer to potential.If the aim of discretionary fiscal policy is to stimulate the economy during periods of excess slack, both government spending multipliers and tax multipliers are high and work primarily through a consumption channel. However, when the economy is closer to potential, tax cuts have larger effects than government spending increases and work primarily through an investment channel. Notably, austerity designed to lower the government debt-to-GDP ratio is largely self-defeating in deep recessions. In particular, if the aim of austerity is to reduce the debt-to-GDP ratio, our results suggest it will have smaller negative effects on economic activity if pursued when the economy is in a robust expansion.Research since the influential study by Auerbach and Gorodnichenko (2012) has debated the importance of state dependence and nonlinearity in the response of the economy to fiscal policy. While many studies find nonlinearity, important questions have been raised about robustness of the evidence, such as those in Ramey and Zubairy (2018). By exploring these issues with a relatively large model that nests much of the earlier research and allows us look extensively at robustness to various choices of threshold variables and shock identification, we find strong, robust support for nonlinearity in the effects of fiscal shocks that can help guide the timing and structure of future fiscal interventions.

Journal

Studies in Nonlinear Dynamics & Econometricsde Gruyter

Published: Sep 14, 2021

Keywords: E32; E62; C32; austerity; Bayesian; government spending; nonlinear dynamics; sign restrictions; vector autoregression

There are no references for this article.