TY - JOUR AU - Farhi,, Emmanuel AB - Abstract Aggregate production functions are reduced-form relationships that emerge endogenously from input–output interactions between heterogeneous producers and factors in general equilibrium. We provide a general methodology for analyzing such aggregate production functions by deriving their first- and second-order properties. Our aggregation formulas provide nonparametric characterizations of the macroelasticities of substitution between factors and of the macrobias of technical change in terms of microsufficient statistics. They allow us to generalize existing aggregation theorems and to derive new ones. We relate our results to the famous Cambridge–Cambridge controversy. 1. Introduction The aggregate production function is pervasive in macroeconomics. The vast majority of macroeconomic models postulate that real GDP or aggregate output Y can be written as arising from some specific parametric function Y = F(L1, …, LN, A), where Li is a primary factor input and A indexes different production technologies. By far the most common variant takes the form Y = AF(AKK, ALL), where A, AK, and AL index Hicks-neutral, capital-augmenting, and labor-augmenting technical change, and F is a CES function.1 From the early 1950s to the late 1960s, the aggregate production function became a central focus of a dispute commonly called the Cambridge–Cambridge controversy. The attackers were the post-Keynesians, based primarily in and associated with Cambridge, England, and the defenders were the neoclassicals, based primarily in and associated with Cambridge, Massachusetts.2 A primary point of contention surrounded the validity of the neoclassical aggregate production function. To modern economists, the archetypal example of the neoclassical approach is Solow’s famous growth model (Solow 1956), which uses an aggregate production function with capital and labor to model the process of economic growth. The debate kicked off with Joan Robinson’s (1953) paper criticizing the aggregate production function as a “powerful tool of miseducation”. The post-Keynesians (Robinson, Sraffa, and Pasinetti, among others) criticized the aggregate production function, and specifically, the aggregation of the capital stock into a single index number. They were met in opposition by the neoclassicals (Solow, Samuelson, and Hahn, among others) who rallied in defense of the aggregate production function. Eventually, the English Cambridge prevailed against the American Cambridge, decisively showing that aggregate production functions with an aggregate capital stock do not always exist. They did this through a series of ingenious, though perhaps exotic looking, “re-switching” examples. These examples demonstrated that at the macrolevel, “fundamental laws” such as diminishing returns may not hold for the aggregate capital stock, even if, at the microlevel, there are diminishing returns for every capital good. This means that a neoclassical aggregate production function could not be used to study the distribution of income in such economies. However, despite winning the battle, the English side arguably lost the war. Although exposed as a fiction, the “neoclassical” approach to modeling the production technology of an economy was nevertheless very useful. It was adopted and built upon by the real business cycle and growth literatures starting in the 1980s. Reports of the death of the aggregate production function turned out to be greatly exaggerated, as nearly all workhorse macroeconomic models now postulate an exogenous aggregate production function. Why did Robinson and Sraffa fail to convince macroeconomists to abandon aggregate production functions? One answer is the old adage: you need a model to beat a model. Once we abandon the aggregate production function, we need something to replace it with. Although the post-Keynesians were effective in dismantling this concept, they were not able to offer a preferable alternative. For his part, Sraffa advocated a disaggregated approach, one that took seriously “the production of commodities by means of commodities” (the title of his magnum opus). However, his impact was limited. Clean theoretical results were hard to come by and conditions under which factors of production could be aggregated were hopelessly restrictive.3 In a world lacking both computational power and data, and in lieu of powerful theorems, it is little wonder that workaday macroeconomists decided to work with Solow’s parsimonious aggregate production function instead. After all, it was easy to work with and only needed a sparing amount of data to be calibrated, typically having just one or two free parameters (the labor share and the elasticity of substitution between capital and labor).4 Of course, today’s world is awash in an ocean of microdata and access to computational power is cheap and plentiful, so old excuses no longer apply. Macroeconomic theory must evolve to take advantage of and make sense of detailed microlevel data. This paper is a contribution to this project. We fully take on board the lessons of the Cambridge–Cambridge controversy and allow for as many factors as necessary to ensure the existence of aggregate production functions.5 Instead of desperately seeking to aggregate factors, we focus on aggregating over heterogeneous producers in competitive general equilibrium. Under the assumptions of homothetic final demand and no distortions, such aggregation endogenously gives rise to aggregate production functions.6 The key difference between our approach and that of most of the rest of the literature that follows the Solow-Swan paradigm is that we treat aggregate production functions as endogenous reduced-form objects rather than structural ones. In other words, we do not impose an arbitrary parametric structure on aggregate production functions at the outset and instead derive their properties as a function of deeper structural microeconomic primitives. Our contribution is to fully characterize these endogenous aggregate production functions, up to the second order, for a general class of competitive disaggregated economies with an arbitrary number of factors and producers, arbitrary patterns of input–output linkages, arbitrary microeconomic elasticities of substitution, and arbitrary microeconomic technology shifters. Our sufficient-statistic formulas lead to general aggregation results expressing the macroeconomic elasticities of substitution between factors and the macroeconomic bias of technical change in terms of microeconomic elasticities of substitution and characteristics of the production network. The benefits of microfoundations do not require lengthy elaboration. First, they address the Lucas critique by grounding aggregate production functions in deep structural parameters that can be taken to be constant across counterfactuals driven by shocks or policy. Second, they allow us to understand the macroeconomic implications of microeconomic phenomena. Third, they allow to unpack the microeconomic implications of macroeconomic phenomena. This development can be put in a broader perspective by drawing an analogy with the shifting attitudes of economists toward aggregate consumption functions. In the wake of the Rational Expectations Revolution and the Lucas critique, economists abandoned aggregate consumption functions—functions that postulated a parametric relationship between aggregate consumption and aggregate income without deriving this relationship from microeconomic theory. This has become all the more true with the rise of heterogeneous-agent models following the early contributions of Bewley (1986), Aiyagari (1994), Huggett (1993), and Krusell and Smith (1998). However, the aggregate production function, which does much the same thing on the production side of the economy was left largely unexamined. By deriving an aggregate production function from first-principles, this paper provides microeconomic foundations for the aggregate production function building explicitly on optimizing microeconomic behavior. We restrict attention to situations where the aggregate production function, a function mapping endowments and technologies to aggregate output, exists, because final demand is homothetic and there are no distortions. The aggregate production function then depends on the structure of final demand. It is possible to define an alternative notion, the aggregate distance function, which separates technology from final demand. The aggregate distance function must then be combined with final demand to compute general-equilibrium comparative statics. Our object of interest is the aggregate production function, which takes these two steps at once. This choice, which is guided by our focus on general-equilibrium comparative statics, is without loss of generality. Indeed, it is possible to use our results for the aggregate production function to characterize the aggregate distance function by using different specifications of final demand. Aggregate production functions may fail to exist if there is no single quantity index corresponding to final output; this happens if final demand is nonhomothetic either because there is a representative agent with nonhomothetic preferences or because there are heterogeneous agents with different preferences. Furthermore, aggregate production functions also fail to exist in economies with distortions. Extended notions of aggregate production functions with distortions and nonhomothetic final demand can be defined. However, they are less useful in the sense that their properties cannot anymore be tied to interesting observables: their first and second derivatives do not correspond to factor shares, elasticities of substitution between factors, and bias of technical change. In this paper, we confine ourselves to economies with homothetic final demand and without distortions. In other papers (see Baqaee and Farhi 2018, 2019b), we have developed an alternative “propagation-equations” methodology to cover economies with nonhomothetic final demand and with distortions. These propagation equations generalize equations (5) and (6) in Proposition 2 and equations (8) and (9) in Proposition 7. They fully characterize the elasticities of sales shares and factor shares to factor supplies, factor prices, and technology shocks. They can be used along the exact same lines as in this paper to express the macroeconomic elasticities of substitution between factors and the macroeconomic bias of technical change as a function of microeconomic primitives. This shows precisely how to extend our results to economies with nonhomothetic final demand and with distortions. The outline of the paper is as follows. In Section 2 we set up the basic model, introduce the aggregate cost and production functions as dual ways of representing an economy’s production possibilities, and define the notions of macroeconomic elasticities of substitution between factors and of the bias of technical change. In Section 3, we define and characterize the properties of aggregate cost functions for the case of nested-CES economies. In Section 4, we define and characterize the properties of aggregate production functions for the case of nested-CES economies. In Section 5, we review some classic aggregation theorems and provide new ones. We revisit the Cambridge–Cambridge controversy, and represent some of the classic arguments via our framework and language. In Section 6, we provide some simple theoretical examples to illustrate the results developed in Sections 3 and 4: Hicksian and non-Hicksian examples with two and three factors; an example showing how to capture Houthakker (1955) within our framework; and an example of factor-biased technical change in a task-based model. We also present a simple quantitative application to capital-skill complementarity à la Griliches (1969) in the US economy, taking into account the multiplicity of sectors and their input–output linkages. We put it to use to revisit the analysis in Krusell et al. (2000) of the role of these complementarities in the evolution over time of the skill premium. In Section 7, we review several extensions. First, we generalize the results of Sections 3 and 4 to non-nested-CES economies with two simple tricks. Second, we explain how to use our results to separate technology from final demand by characterizing the aggregate distance and associated cost functions. Third, we explain how to generalize our results to economies where final demand is nonhomothetic and with distortions. We conclude in Section 8. 2. Setup In this section, we setup the model and notation, define the equilibrium, the aggregate production, and the aggregate cost function. 2.1. Environment The model has a set of producers N, and a set of factors F with supply functions Lf. We write N + F for the union of these two sets. With some abuse of notation, we also denote by N and F the cardinalities of these sets. What distinguishes goods from factors is the fact that goods are produced from factors and goods, whereas factors are produced ex nihilo. The output of each producer is produced using intermediate inputs and factors, and is sold as an intermediate good to other producers and as a final good. Final demand is a constant-returns-to-scale aggregator $$ \begin{equation} Y=\mathcal {D}_{0}(c_{1},\ldots ,c_{N}), \end{equation}$$ (1) where ci represents the use of good i in final demand and Y is real output. Each good i is produced with some constant-returns-to-scale production function. Hence, we can write the production function of each producer as $$ \begin{equation} y_{i}=A_{i}F_{i}(x_{i1},\ldots ,x_{iN},L_{i1},\ldots ,L_{iF}), \end{equation}$$ (2) where yi is the total output of i, |$x_{ij}$| is the use of input j, and |$L_{if}$| is the use of factor f. The variable Ai is a Hicks-neutral productivity shifter. We will sometimes use the unit-cost function |$A^{-1}_{i} C_{i}(p_{1},\ldots ,p_{N},w_{1},\ldots ,w_{F})$| associated with the production function Fi. Finally, the economy-wide resource constraints for goods j and factors f are given by $$ \begin{align} c_{j}+\sum _{i\in N}x_{ij}&=y_{j}, \end{align}$$ (3) $$ \begin{align} \sum _{i\in N}L_{if}&=L_{f}. \end{align}$$ (4) This framework is more general than it might appear. First, although we have assumed constant-returns-to-scale production functions, our analysis also covers the case of decreasing-returns-to-scale production functions by adding producer-specific fixed factors.7|$^,$|8|$^,$|9 Similarly, although we have assumed that, at the level of individual producers, technical change is Hicks neutral, our analysis also covers the case of biased factor- or input-augmenting technical change: for example, to capture factor-f-augmenting technical change for firm i, we simply introduce a new fictitious producer that linearly transforms factor f into factor f for firm i and study a Hicks-neutral technology shock to this fictitious producer.10 Finally, although we refer to each producer as producing one good, our framework actually allows for joint production by multi-product producers: for example, to capture a producer i producing goods i and i′ using intermediate inputs and factors, we represent good i′ as an input entering negatively in the production and cost functions for good i.11 2.2. Feasible and Competitive Equilibrium Allocations We first define feasible allocations. Definition (Feasible Allocations). A feasible allocation is a set of intermediate input choices |$x_{ij}$|⁠, factor input choices |$L_{if}$|⁠, outputs yi, final demands ci, and real output Y, such that (1)–(4) hold. Next we define equilibrium allocations. Equilibrium allocations are feasible allocations that arise as part of a competitive equilibrium. Definition (Equilibrium Allocations). An equilibrium allocation is a set of prices pi and |$w_{f}$| for goods and factors, intermediate input choices |$x_{ij}$|⁠, factor input choices |$L_{if}$|⁠, outputs yi, final demands ci, and real output Y, such that: final demand maximizes Y subject to (1) and to the budget constraint |$\sum _{i=1}^{N}\!p_{i}c_{i}=\sum _{f=1}^{F}\!w_{f}L_{f}$|⁠; each producer i maximizes its profits piyi − ∑j ∈ Npj|$x_{ij}$| − ∑f ∈F|$w_{f}$||$L_{if}$| subject to (2), taking prices pj and wages |$w_{f}$| as given; the markets for all goods i and factors f clear so that (3) and (4) hold. Instead of fixing factor supplies Lf, we can also define feasible and equilibrium allocations for given factor prices |$w_{f}$| and level of income E allocated to final demand. The welfare theorems apply in our environment. Equilibrium allocations are efficient and coincide with the solutions of the planning problems introduced in what follows, which define the aggregate production and cost functions. We will make use of these theorems to go back and forth between those properties most easily seen using the equilibrium decentralization and those that arise more naturally using the planning problem. Going forward, and to make the exposition more intuitive, we slightly abuse notation in the following way. For each factor f, we interchangeably use the notation |$w_{f}$| or pN+f to denote its wage, the notation |$L_{if}$| or xi(N+f) to denote its use by producer i, and the notation Lf or yf or to denote total factor supply. We define final demand as an additional good produced by producer 0 according to the final demand aggregator. We interchangeably use the notation c0i or x0i to denote the consumption of good i in final demand. We write 1 + N for the union of the sets {0} and N, and 1 + N + F for the union of the sets {0}, N, and F. 2.3. Aggregate Production and Cost Functions The aggregate production function is defined as the solution of the following planning problem: $$ \begin{align*} F(L_{1},\ldots ,L_{F},A_{1},\ldots ,A_{N})=\max Y \end{align*}$$ subject to (1)–(4). It is homogeneous of degree one in the factor supplies L1, …, LF. As already discussed previously, this production function also indexes the equilibrium level of real output as a function of productivity shocks Ai and factor supplies Lf. The aggregate cost function is defined as the solution of the dual planning problem that seeks to minimize the expenditure necessary to achieve real output Y given factor prices |$w_{f}$|⁠: $$ \begin{align*} C(w_{1},\ldots ,w_{F},A_{1},\ldots A_{N},Y)=\min E \end{align*}$$ subject to (1)–(3), and E = ∑f ∈F|$w_{f}$||$L_{f}$|⁠. It is homogeneous of degree one in the factor prices |$w_{1}$|⁠, …, |$w_{F}$|⁠. The aggregate cost function is also homogeneous of degree one in aggregate output Y so that we can write it as YC(⁠|$w_{1}$|⁠, …, |$w_{F}$|⁠, A1, …, AN), where with some abuse of notation, C now denotes the aggregate unit-cost function. Most of the results in the rest of the paper characterize the log derivatives of the aggregate cost function with respect to productivities or factor prices, which coincide with the corresponding log derivatives of the aggregate unit-cost function, and so both can be used interchangeably. To fix ideas, the reader can focus on the aggregate cost function. The primary difference between the aggregate production function and the aggregate cost function is that the former takes the factor quantities as given, whereas the latter takes the factor prices as given. The goal of this paper is to characterize the aggregate production and cost functions up to the second order as a function of microeconomic primitives such as microeconomic elasticities of substitution and the input–output network. Propositions 1 and 6 characterize the Jacobians (first derivatives) and Propositions 2 and 7 the Hessians (second derivatives) of the aggregate production and cost functions. In economic terms, this means that we seek to characterize not only macroeconomic marginal products of factors and factor demands (first-order properties) but also macroeconomic elasticities of substitution between factors and the sensitivities of marginal products of factors and factor demands to technical change (second-order properties). As mentioned before, we have put ourselves under conditions where the existence of an “aggregate output” good can be taken for granted because final demand is homothetic. Given our definitions, the properties of the aggregate production and cost functions depend on final demand. One might want to consider alternative notions that remove this dependence: the aggregate distance and associated cost functions. One might also want to study economies where there is no aggregate output good because final demand is not homothetic, or economies with distortions. Our results can be used to fully and precisely accommodate these desiderata. Detailed explanations can be found in Sections A.3 and A.4. 2.4. Macroeconomic Elasticities of Substitution between Factors As is well known, there is no unambiguous way to generalize the standard Hicksian notion of elasticity of substitution between factors (Hicks 1932) when there are more than two factors, and several concepts have been proposed in the literature. Invariably, all competing definitions of the elasticity of substitution are computed via the Jacobian and Hessian of a function. Since we characterize both of these in general, our results can be used to compute all the different notions of the elasticity of substitution. In this paper, we follow Blackorby and Russell (1989) who advocate using the definition due to Morishima (1967). They argue that Morishima Elasticities of Substitution (MESs) are appealing because they extend the standard Hicksian notion while preserving some of its desirable properties: an MES is a measure of the inverse-curvature of isoquants; it is a sufficient statistic for the effect on relative factor shares of changes in relative factor prices; it is a log derivative of a quantity ratio to a price ratio.12 Definition (MESs for the Aggregate Production Function). The MES |$\sigma _{fg}^{F}$| between factors f and g in the aggregate production function is defined as $$ \begin{equation*} \frac{1}{\sigma _{fg}^{F}}=\frac{d\log \left(\frac{dF}{dL_{f}}/\frac{dF}{dL_{g}}\right)}{d\log (L_{g}/L_{f})}=1+\frac{d\log \left(\frac{d\log F}{d\log L_{f}}/\frac{d\log F}{d\log L_{g}}\right)}{d\log (L_{g}/L_{f})}. \end{equation*}$$ Definition (MESs for the Aggregate Cost Function). The MES |$\sigma _{fg}^{C}$| between factors f and g in the aggregate cost function is defined as $$ \begin{equation*} \sigma _{fg}^{C}=\frac{d\log \left(\frac{dC}{dw_{f}}/\frac{dC}{dw_{g}}\right)}{d\log (w_{g}/w_{f})}=1+\frac{d\log \left(\frac{d\log C}{d\log w_{f}}/\frac{d\log C}{d\log w_{g}}\right)}{d\log (w_{g}/w_{f})}. \end{equation*}$$ Note that the ratios $$ \begin{equation*} \frac{dF/dL_{f}}{d F/d L_{g}} \quad {and}\quad \frac{d\log F/d\log L_{f}}{d\log F/d\log L_{g}} \end{equation*}$$ are homogeneous of degree zero in L1, …, LF . Similarly, the ratios $$ \begin{equation*} \frac{d C/dw_{f}}{d C/d w_{g}} \quad \mbox{and}\quad \frac{d\log C/d\log w_{f}}{d\log C/d\log w_{g}} \end{equation*}$$ are homogeneous of degree zero in |$w_{1}$|⁠, …, |$w_{F}$|⁠. These definitions exploit this homogeneity to write these ratios as functions of L1/Lf, …, LF/Lf and |$w_{1}$|/|$w_{\mathit f}$|⁠, …, |$w_{\mathit F}$|/|$w_{\mathit f}$|⁠, respectively. Therefore, underlying the definition of |$\sigma _{fg}^{F}$| are variations in Lg/Lf, holding Lh/Lf constant for h ≠ g, that is, variations in Lg, holding Lh constant for h ≠ g. Similarly, underlying the definition of |$\sigma _{fg}^{C}$| are variations in |$w_{g}$|/|$w_{\mathit f}$|⁠, holding |$w_{h}$|/|$w_{\mathit f}$| constant for h ≠ g, that is, variations in |$w_{\mathit g}$|⁠, holding |$w_{h}$| constant for h ≠ g. As we shall see in Propositions 1 and 6, d log F/d log Lh and d log C/d log |$w_{h}$| are equal to the factor shares Λh in the competitive equilibria of the corresponding economies. MESs therefore pin down the elasticities of relative factor shares to relative factor supplies or relative factor prices: $$ \begin{equation*} 1-\frac{1}{\sigma _{fg}^{F}}=-\frac{d\log (\Lambda _{f}/\Lambda _{g})}{d\log (L_{g}/L_{f})}\quad \text{and}\quad \sigma _{fg}^{C}-1=\frac{d\log (\Lambda _{f}/\Lambda _{g})}{d\log (w_{g}/w_{f})}. \end{equation*}$$ Similarly, dF/dLh is equal to the wage rate |$w_{h}$| and dC/d|$w_{h}$| to the factor demand per unit of output Lh in the competitive equilibria of the corresponding economies, which can be viewed as homogeneous-of-degree-zero functions of L1, …, LF and |$w_{1}$|⁠, …, |$w_{\mathit F}$|⁠, respectively. MESs therefore pin down the elasticities of factor prices to factor supplies and of factor demands to factor prices: $$ \begin{equation*} \frac{1}{\sigma _{fg}^{F}}=\frac{d\log (w_{f}/w_{g})}{d\log (L_{g}/L_{f})}\quad \text{and}\quad \sigma _{fg}^{C}=\frac{d\log (L_{f}/L_{g})}{d\log (w_{g}/w_{f})}. \end{equation*}$$ MESs between factors in the aggregate production and cost functions can be directly expressed as a function of the Jacobians and Hessians of these functions: $$ \begin{align*} 1-\frac{1}{\sigma _{fg}^{F}}& =\frac{d^{2}\log F/(d\log L_{g})^{2}}{d\log F/d\log L_{g}}-\frac{d^{2}\log F/(d\log L_{f}d\log L_{g})}{d\log F/d\log L_{f}},\\ \sigma _{fg}^{C}-1& =\frac{d^{2}\log C/(d\log w_{f}d\log w_{g})}{d\log C/d\log w_{f}}-\frac{d^{2}\log C/(d\log w_{g})^{2}}{d\log C/d\log w_{g}}. \end{align*}$$ MESs between factors in the aggregate production function are typically not symmetric so that |$\sigma _{fg}^{F}\ne \sigma _{gf}^{F}$| and |$\sigma _{fg}^{C}\ne \sigma _{gf}^{C}$| in general. Moreover, MESs between factors in the aggregate production and cost functions are typically not equal to each other, so |$\sigma _{fg}^{F}\ne \sigma _{gf}^{C}$| in general. The “Hicksian” case where there are only two factors of production f and g is special in this regard since in this case, the MESs for the cost and production function are the same, and symmetric, so that we get |$\sigma _{fg}^{F}=\sigma _{gf}^{F}$|⁠, |$\sigma _{fg}^{C}=\sigma _{gf}^{C}$|⁠, and |$\sigma _{fg}^{F}=\sigma _{fg}^{C}$|⁠. The proof is standard and can be found in Hicks (1932) and in Russell (2017) for example. Consider the case where the aggregate production function and the associated aggregate cost function are of the CES form with $$ \begin{equation*} F(L_1, \ldots , L_N,A) =\bar{Y}\frac{A}{\bar{A}}\left(\sum _{i=1}^N \omega _i \left(\frac{L_i}{\bar{L}_i}\right)^{\frac{\sigma -1}{\sigma }}\right)^{\frac{\sigma }{\sigma -1}}, \end{equation*}$$ $$ \begin{equation*} C(w_1, \ldots , w_N,A) = \frac{\bar{A}}{A}\left(\sum _{i=1}^N \omega _i \left(\frac{w_i}{\bar{w}_i}\right)^{1-\sigma } \right)^{\frac{1}{1-\sigma }}, \end{equation*}$$ where bar variables correspond to some particular point and ωi denotes the share of factor i at this point. Then with our definitions, the MESs in the aggregate production and cost functions between factor f and factor g are given by |$\sigma _{fg}^F = \sigma _{fg}^C = \sigma$|⁠. More generally, if the aggregate production and cost functions are of the nested-CES form, and if two factors belong to the same CES nest, then the MES between these two factors is equal to the elasticity of substitution of the nest; more generally, if two factors enter together with other factors only through a nested-CES subaggregate, then the MES between these two factors is only a function of the elasticities of substitution in the nested-CES subaggregate. However, even when the economy with disaggregated production is of the nested-CES form as described in Section 2.7, the aggregate production and cost functions that describe its production possibility frontier are typically not of the nested-CES form except in simple cases with limited heterogeneity and simple input–output network structures. MESs between factors in the aggregate production and cost functions are macroeconomic elasticities of substitution. They incorporate general equilibrium effects and typically do not coincide with any microeconomic elasticity of substitution. Our results in Propositions 3 and 8 deliver formulas for the MESs between factors as a function of microeconomic primitives such as microeconomic elasticities of substitution and the input–output network. 2.5. Macroeconomic Bias of Technical Change We now present our definitions of the macroeconomic bias of technical change. These definitions generalize the definitions proposed by Hicks (1932) to the case of multiple factors. We present these definitions directly in terms of the Jacobians and Hessians of the aggregate production and cost functions. We later relate them to the elasticities of relative factor shares to technology shocks. Definition (Bias of Technical Change for the Aggregate Production Function). The bias |$B^F_{fgj}$| in the aggregate production function toward factor f versus factor g of technical change driven by a technology shock to producer j is defined as $$ \begin{equation*} \frac{B^F_{fgj}}{1+B^F_{fgj}}=\frac{d \log \left(\frac{d\log F}{d\log L_f} / \frac{d\log F}{d\log L_g} \right)}{d\log A_j}. \end{equation*}$$ Definition (Bias of Technical Change for the Aggregate Cost Function). The bias |$B^C_{fgj}$| in the aggregate cost function toward factor f versus factor g of technical change driven by a technology shock to producer j is defined as $$ \begin{equation*} B^C_{fgj}=\frac{d \log \left(\frac{d\log C}{d\log w_f} / \frac{d\log C}{d\log w_g} \right)}{d\log A_j}. \end{equation*}$$ As already alluded to, and as we shall see in Propositions 1 and 6, d log F/d log Lh and d log C/d log |$w_{\mathit h}$| are equal to the factor shares Λh in the competitive equilibria of the corresponding economies. The macroeconomic biases of technical change in the aggregate production and cost functions therefore pin down the elasticities with respect to technology shocks of relative factor shares as well as of relative factor prices and of relative factor demands, holding, respectively, factor supplies or factor prices constant:13 $$ \begin{align*} \frac{B^F_{fgj}}{1+B^F_{fgj}} = \frac{d \log (\Lambda _f /\Lambda _g )}{d\log A_j}=\frac{d \log (w_f /w_g )}{d\log A_j} \end{align*}$$ and $$ \begin{align*} B^C_{fgj} = \frac{d \log (\Lambda _f /\Lambda _g) }{d\log A_j}=\frac{d \log (L_f /L_g )}{d\log A_j}. \end{align*}$$ Technological bias in the aggregate production and cost functions can be directly expressed as a function of the Jacobians and Hessians of these functions: $$ \begin{align*} \frac{B^F_{fgj}}{1+B^F_{fgj}} & =\frac{d^2 \log F /(d\log A_j d\log L_f)}{d\log F /d\log L_f}-\frac{d^2 \log F /(d\log A_j d\log L_g)}{d\log F /d\log L_g},\\ B^C_{fgj} & =\frac{d^2 \log C /(d\log A_j d\log w_f)}{d\log C /d\log w_f}-\frac{d^2 \log C /(d\log A_j d\log w_g)}{d\log C /d\log w_g}. \end{align*}$$ Even in the Hicksian case where there are only two factors, the biases of technology in the aggregate production and cost functions do not necessarily coincide so that |$B^F_{fgj} \ne B^C_{fgj}$| in general. Consider the case where the aggregate production and cost functions are of the CES form with $$ \begin{equation*} F(L_1, \ldots , L_N,A_1,\ldots ,A_N) = \bar{Y}\left(\sum _{i=1}^N \omega _i \left(\frac{A_i}{\bar{A}_i}\frac{L_i}{\bar{L}_i}\right)^{\frac{\sigma -1}{\sigma }}\right)^{\frac{\sigma }{\sigma -1}}, \end{equation*}$$ $$ \begin{equation*} C(w_1, \ldots , w_N,A_1,\ldots ,A_N) = \left(\sum _{i=1}^N \omega _i \left(\frac{\bar{A}_i}{A_i} \frac{w_i}{\bar{w}_i}\right)^{1-\sigma } \right)^{\frac{1}{1-\sigma }}, \end{equation*}$$ where bar variables correspond to some particular point, ωi denotes the share of factor i at this point, and Ai is a technological shock augmenting factor i. Then with our definitions, the biases in the aggregate production and cost functions toward factor f versus factor g of technical change driven by a technology shock augmenting factor j are given |$B_{fgj}^F = B_{fgj}^C= \sigma -1$| if j = f, |$B_{fgj}^F = B_{fgj}^C= -(\sigma -1)$| if j = g, and |$B_{fgj}^F = B_{fgj}^C=0$| otherwise. More generally, if the aggregate production and cost functions are of the nested-CES form with factor-augmenting technical change, and if two factors belong to the same CES nest, then the bias toward the first factor versus the second factor of a technology shock is equal to (minus) the elasticity of substitution minus one of the nest if the technology shock augments the first (second) of the two factors, and zero otherwise; more generally, if two factors enter together with other factors only through a nested-CES subaggregate, then the bias between these two factors is nonzero only for technology shocks that augment the factors in the nested-CES subaggregate, and then it is only a function of the elasticities of substitution in the nested-CES subaggregate. However, even when the economy with disaggregated production is of the nested-CES form as described in Section 2.7, the aggregate production and cost functions that describe its production possibility frontier are typically not of the nested-CES form with factor-augmenting technical change except in simple cases with limited heterogeneity and simple input–output network structures. The bias of technical change in the aggregate production and cost functions are macroeconomic in nature. They incorporate general equilibrium effects and typically do not coincide with any microeconomic elasticity of substitution minus one. Our results in Propositions 5 and 10 deliver formulas for the bias of technical change as a function of microeconomic primitives such as microeconomic elasticities of substitution and the input–output network. 2.6. Input–Output Definitions To state our results, we require some input–output notation and definitions. We define input–output objects such as input–output matrices, Leontief inverse matrices, and Domar weights. These definitions arise most naturally in the equilibrium decentralization of the corresponding the planning solution (for the aggregate production function or the aggregate cost function, respectively). Input–Output Matrix. We define the input–output matrix to be the (1 + N + F) × (1 + N + F) matrix Ω whose |${\mathit{ij}}$|th element is equal to i’s expenditures on inputs from j as a share of its total revenues $$ \begin{equation*} \Omega _{ij}\equiv \frac{p_{j}x_{ij}}{p_{i}y_{i}}. \end{equation*}$$ Note that input–output matrix Ω includes expenditures by producers on factor inputs as well as expenditures by consumers for final consumption. By Shephard’s lemma, |$\Omega _{\mathit{i\,j}}$| is also the elasticity of the cost of i to the price of j, holding the prices of all other producers constant. Leontief Inverse Matrix. We define the Leontief inverse matrix as $$ \begin{equation*} \Psi \equiv (I-\Omega )^{-1}=I+\Omega +\Omega ^{2}+\cdots . \end{equation*}$$ Although the input–output matrix Ω records the direct exposures of one producer to another, the Leontief inverse matrix Ψ records instead the direct and indirect exposures through the production network. This can be seen most clearly by noting that (Ωn)|${\mathit{i\,j}}$| measures the weighted sums of all paths of length n from producer i to producer j. By Shephard’s lemma, |$\Psi _{\mathit{i\,j}}$| is also the elasticity of the cost of i to the price of j holding fixed the prices of factors but taking into account how the price of all other goods in the economy will change. GDP and Domar Weights. GDP or nominal output is the total sum of all final expenditures $$ \begin{equation*} {\rm {GDP}}=\sum _{i\in N}p_{i}c_{i} = \sum _{i \in N} p_i x_{0i}. \end{equation*}$$ We define the Domar weight λi of producer i to be its sales share as a fraction of GDP $$ \begin{equation*} \lambda _{i}\equiv \frac{p_{i}y_{i}}{{\rm {GDP}}}. \end{equation*}$$ Note that ∑i ∈Nλi > 1 in general since some sales are not final sales but intermediate sales. For expositional convenience, for a factor f, we sometimes use Λf instead of λf. Note that the Domar weight Λf of factor f is simply its total income share. We can also define the vector b to be final demand expenditures as a share of GDP $$ \begin{equation*} b_{i}=\frac{p_{i}c_{i}}{{\rm {GDP}}} = \frac{p_{i}x_{0i}}{{\rm {GDP}}}=\Omega _{0i}. \end{equation*}$$ The accounting identity $$ \begin{equation*} p_{i}y_{i}=p_{i}c_{i}+\sum _{j\in N}p_{i}x_{ji}=\Omega _{0i}{\rm {GDP}}+\sum _{j\in N}\Omega _{ji}\lambda _{j}{\rm {GDP}} \end{equation*}$$ links the revenue-based Domar weights to the Leontief inverse via $$ \begin{equation*} \lambda ^{\prime }=b^{\prime }\Psi =b^{\prime }I+b^{\prime }\Omega +b^{\prime }\Omega ^{2}+\cdots \, . \end{equation*}$$ Another way to see this is that the ith element of b′Ωn measures the weighted sum of all paths of length n from producer i to final demand. 2.7. Nested-CES Economies We call an economy nested CES if all the production functions of all the producers (including final demand) are of the nested-CES form with Hicks-neutral technical change at the level of each nest. Following Baqaee and Farhi (2019), any nested-CES economy, with an arbitrary number of producers, factors, CES nests, elasticities, and intermediate input use, can be rewritten in what we call standard form, which is more convenient to study. A CES economy in standard form is defined by a tuple (ω, θ, F). The (1 + N + F) × (1 + N + F) matrix ω is a matrix of input–output parameters. The (1 + N) × 1 vector θ is a vector of microeconomic elasticities of substitution. Each good i is produced with the production function $$ \begin{equation*} \frac{y_{i}}{\bar{y}_{i}}=\frac{A_{i}}{\bar{A}_{i}}\left(\sum _{j\in 1+N+F}\omega _{ij}\left(\frac{x_{ij}}{\bar{x}_{ij}}\right)^{\frac{\theta _{i}-1}{\theta _{i}}}\right)^{\frac{\theta _{i}}{\theta _{i}-1}}, \end{equation*}$$ where |$x_{ij}$| are intermediate inputs from j used by i. Throughout the paper, variables with overlines are normalizing constants equal to the values at some initial allocation. We represent final demand Y as the purchase of good 0 from producer 0 producing the final good. For the most part, we assume that |$A_{0}=\bar{A}_{0}$| and abstract away from this parameter.14 Through a relabeling, this structure can represent any nested-CES economy with an arbitrary pattern of nests and elasticities. Intuitively, by relabeling each CES aggregator to be a new producer, we can have as many nests as desired. To facilitate the exposition in the paper, and due to their ubiquity in the literature, we present our baseline results for nested-CES economies in Sections 3 and 4. We then explain how to generalize them for arbitrary economies in Section A.1.15 3. Aggregate Cost Functions In this section, we provide a general characterization of aggregate cost functions up to the second order for nested-CES economies. We refer the reader to Sections 6 and 6.5 for some simple theoretical and quantitative examples, and to Section A.1 for a generalization to non-nested-CES economies. 3.1. First-Order Characterization The following proposition characterizes the first derivatives (gradient) of the aggregate cost function. Proposition 1 (Gradient).  The first derivatives of the aggregate cost function are given by the sales shares of goods and factors $$ \begin{equation*} \frac{d\log C}{d\log w_{\mathit f}}=\Lambda _{f}\quad \text{and}\quad \frac{d\log C}{d\log A_{i}}=-\lambda _{i}. \end{equation*}$$ The proposition follows directly from Hulten’s theorem (Hulten 1978). It shows that the elasticity of the aggregate cost function C to the price of factor f is given by the share Λf of this factor in GDP. Similarly, the elasticity of the aggregate cost function C to the productivity of producer i is given by the negative of the sales share λi of this producer in GDP. The proposition is fully general and applies even when the economy is not of the nested-CES form. Incidentally, Proposition 1 confirms that the aggregate cost function C is homogeneous of degree one in factor prices, since ∑f∈F Λf = 1. It also confirms that C is homogeneous of degree one in aggregate output Y since |${d}\log C/ {d}\log A_0 = 1$|⁠. 3.2. Second-Order Characterization The following proposition characterizes the second derivatives (Hessian) of the aggregate cost function. Proposition 2 (Hessian). The second derivatives of the aggregate cost function are determined by the elasticities of the sales shares of goods and factors $$ \begin{align*} \frac{d^{2}\log C}{d\log w_{f}d\log w_{g}} & =\frac{d\Lambda _{f}}{d\log w_{g}},\\ \frac{d^{2}\log C}{d\log A_{j}d\log A_{i}} & =-\frac{d\lambda _{i}}{d\log A_{j}},\\ \frac{d^{2}\log C}{d\log A_{j}d\log w_{f}} & =\frac{d\Lambda _{f}}{d\log A_{j}}, \end{align*}$$ where the elasticities of the sales shares are given by $$ \begin{equation} d\log \lambda _{i}=\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\lambda _{i}}\operatorname{Cov}_{\Omega ^{(k)}}\left(\sum _{j\in N}\Psi _{(j)}d\log A_{j}-\sum _{g\in F}\Psi _{(g)}d\log w_{g},\Psi _{(i)}\right), \end{equation}$$ (5) and the elasticities of the factor shares are given by $$ \begin{equation} d\log \Lambda _{f}=\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\Lambda _{f}}\operatorname{Cov}_{\Omega ^{(k)}}\left(\sum _{j\in N}\Psi _{(j)}d\log A_{j}-\sum _{g\in F}\Psi _{(g)}d\log w_{g},\Psi _{(f)}\right). \end{equation}$$ (6) The shares propagation equations (5) and (6) are taken directly from Baqaee and Farhi (2019). Although Baqaee and Farhi (2019) focuses on the second-order macroeconomic impact of microeconomic shocks d2 log C/(d log Ajd log Ai), in this paper, we focus instead on d2 log C/(d log |$w_{\mathit f}$|d log |$w_{\mathit g}$|⁠), which as we will show in Section 3.3, determines the macroeconomic elasticities of substitution between factors, as well as on d2 log C/(d log Ajd log |$w_{\mathit f}$|⁠), which determines the elasticity of factor shares to technical change, that is, the bias of technical change. Of course, equation (6) is obtained simply by letting i = f in (5). This proposition shows that these equations, which characterize the elasticities of the shares of goods and factors to productivity shocks and factor prices, completely pin down the second derivatives of the aggregate cost function. In these equations, we make use of the input–output covariance operator introduced by Baqaee and Farhi (2019): $$ \begin{equation} \operatorname{Cov}_{\Omega ^{(k)}}\left(\Psi {}_{(j)},\Psi _{(i)}\right)=\sum _{l\in N+F}\Omega _{kl}\Psi _{lj}\Psi _{li}-\left(\sum _{l\in N+F}\Omega _{kl}\Psi _{lj}\right)\left(\sum _{l\in N+F}\Omega _{kl}\Psi _{li}\right), \end{equation}$$ (7) where Ω(k) corresponds to the kth row of Ω, Ψ(j) to jth column of Ψ, and Ψ(i) to the ith column of Ψ. In words, this is the covariance between the jth column of Ψ and the ith column of Ψ using the kth row of Ω as the distribution. Since the rows of Ω always sum to one for a reproducible (nonfactor) good k, we can formally think of this as a covariance, and for a nonreproducible good, the operator just returns 0. To gain some intuition, consider for example the elasticity d log Λf/d log |$w_{\mathit g}$| of the share Λf of factor f to the price of factor g in equation (6). Imagine a shock d log |$w_{\mathit g}$| < 0 that reduces the wage of factor g. For fixed relative factor prices, every producer k will substitute across its inputs in response to this shock. Suppose that θk > 1, so that producer k substitutes expenditure toward those inputs l that are more reliant on factor g, captured by Ψlg, and the more so, the higher θk − 1. Now, if those inputs are also more reliant on factor f, captured by a high |$\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(g)},\Psi _{(f)})$|⁠, then substitution by k will increase expenditure on factor f and hence the income share of factor f. These substitutions, which happen at the level of each producer k, must be summed across producers. The intuition for d log Λf/d log Aj in equation (6) as well as for d log λi/d log |$w_{\mathit g}$| and d log λi/d log Aj in equation (5) is similar. 3.3. Macroeconomic Elasticities of Substitution Between Factors We can leverage Proposition 2 to characterize the macroeconomic elasticities of substitution between factors in the aggregate cost function. Proposition 3 (MESs). The MESs between factors in the aggregate cost function are given by $$ \begin{equation*} \sigma _{fg}^{C}=\sum _{k\in 1+N}\theta _{k}\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}\left(\Psi _{(g)},\Psi _{(g)}/\Lambda _{g}-\Psi _{(f)}/\Lambda _{f}\right), \end{equation*}$$ where $$ \begin{equation*} \sum _{k\in 1+N}\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}\left(\Psi _{(g)},\Psi _{(g)}/\Lambda _{g}-\Psi _{(f)}/\Lambda _{f}\right)=1. \end{equation*}$$ This proposition shows that MESs |$\sigma _{fg}^{C}$| between factors in the aggregate cost function are weighted averages of the microeconomic elasticities of substitution θk in production with weights given by sufficient statistics of the input–output network |$\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(g)},\Psi _{(g)}/\Lambda _{g}-\Psi _{(f)}/\Lambda _{f})$|⁠. These weights capture the change in expenditure for factor f versus g from substitution by producer k in response to a change in the price of factor f. This implies the following network-irrelevance result, already uncovered in Baqaee and Farhi (2019), in the knife-edge case where all the microeconomic elasticities of substitution are identical. Proposition 4 (Network Irrelevance). If all microeconomic elasticities of substitution θk are equal to the same value θk = θ, then MESs |$\sigma _{fg}^C$| between factors in the aggregate cost function are also equal to that value |$\sigma _{fg}^{C}=\theta$|⁠. 3.4. Macroeconomic Bias of Technical Change We can also leverage Proposition 2 to characterize the macroeconomic bias of technical change in the aggregate cost function. Proposition 5 (Bias of Technical Change). The biases toward one factor versus another of the different technology shocks in the aggregate cost function are given by $$ \begin{equation*} B_{fgj}^{C}=\sum _{k\in 1+N}(\theta _{k}-1)\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}\left(\Psi _{(j)},\Psi _{(f)}/\Lambda _f-\Psi _{(g)}/\Lambda _g\right). \end{equation*}$$ This proposition shows that biases |$B_{fgj}^{C}$| are weighted sums of the departures from one θk − 1 of the microeconomic elasticities of substitution with weights given by sufficient statistics of the input–output network |$\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(j)},\Psi _{(f)}/\Lambda _{f}-\Psi _{(g)}/\Lambda _{g})$|⁠. These weights capture the change in expenditure for factor f versus g as a result substitution by producer k in response to a technology shock to producer j. The network-irrelevance result for MESs in the aggregate cost function stated in Proposition 4 does not extend to the bias of technical change. In general, the network matters for the bias of technical change, even when all the microeconomic elasticities of substitution θk are identical. The Cobb–Douglas is the one case where it does not: when all the microeconomic elasticities θk are unitary so that θk = 1, technical change in unbiased with |$B_{fgj}^C=0$| for all f, g, and j, no matter what the structure of the network is. 4. Aggregate Production Functions In this section, we provide a general characterization of aggregate production functions up to the second order for nested-CES economies. We refer the reader to Sections 6 and 6.5 for some simple theoretical and quantitative examples, and to Section A.1 for a generalization to non-nested-CES economies. 4.1. First-Order Characterization The following proposition characterizes the first derivatives (gradient) of the aggregate production function. Proposition 6 (Gradient). The first derivatives of the aggregate production function are given by the sales shares of goods and factors $$ \begin{equation*} \frac{d\log F}{d\log L_{f}}=\Lambda _{f}\quad \text{and}\quad \frac{d\log F}{d\log A_{i}}=\lambda _{i}. \end{equation*}$$ The proposition follows directly from Hulten’s theorem (Hulten 1978). It shows that the elasticity of the aggregate production function F to the supply of factor f is given by the share Λf of this factor in GDP. Similarly, the elasticity of the aggregate production function F to the productivity of producer i is given by the sales share λi of this producer in GDP. The proposition is fully general and applies even when the economy is not of the nested-CES form. Once again, Proposition 6 confirms that the aggregate production function is homogeneous of degree one with respect to factor quantities since ∑f ∈ FΛf = 1. 4.2. Second-Order Characterization The following proposition characterizes the second derivatives (Hessian) of the aggregate production function. Proposition 7 (Hessian). The second derivatives of the aggregate production function are given by the elasticities of the sales shares of goods and factors $$ \begin{align*} \frac{d^{2}\log F}{d\log L_{f}d\log L_{g}} & =\frac{d\Lambda _{f}}{d\log L_{g}},\\ \frac{d^{2}\log F}{d\log A_{j}d\log A_{i}} & =\frac{d\lambda _{i}}{d\log A_{j}},\\ \frac{d^{2}\log F}{d\log A_{j}d\log L_{f}} & =\frac{d\Lambda _{f}}{d\log A_{j}}, \end{align*}$$ where the elasticities of the sales shares are given by $$ \begin{align} & d\log \lambda _{i}=\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\lambda _{i}}\operatorname{Cov}_{\Omega ^{(k)}}\left(\sum _{j\in N}\Psi _{(j)}d\log A_{j}+\sum _{g\in F}\Psi _{(g)}d\log L_{g},\Psi _{(i)}\right)\nonumber \\ &\qquad \qquad {}-\sum _{h\in F}\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\lambda _{i}}\operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi _{(h)},\Psi _{(i)}\big)d\log \Lambda _{h}, \end{align}$$ (8) and where the elasticities of the factor shares solve the following system of linear equations $$ \begin{align} &d\log \Lambda _{f}=\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\Lambda _{f}}\operatorname{Cov}_{\Omega ^{(k)}}\left(\sum _{j\in N}\Psi _{(j)}d\log A_{j}+\sum _{g\in F}\Psi _{(g)}d\log L_{g},\Psi _{(f)}\right)\nonumber \\ &\qquad \qquad \quad {}-\sum _{h\in F}\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\Lambda _{f}}\operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi _{(h)},\Psi _{(f)}\big)d\log \Lambda _{h}. \end{align}$$ (9) The shares propagation equations (8) and (9) are taken directly from Baqaee and Farhi (2019). Although Baqaee and Farhi (2019) focuses on the second-order macroeconomic impact of microeconomic shocks d2 log F/(d log Ajd log Ai), in this paper, we focus instead on d2 log F/(d log Lfd log Lg), which as we will show in Section 4.3, determines the macroeconomic elasticities of substitution between factors, as well as on d2 log F/(d log Ajd log |$w_{\mathit f}$|⁠), which determines the elasticity of factor shares to technical change, that is, the bias of technical change. The difference with the characterization of the second-order aggregate cost function in Section 3.2 is that: the elasticities of the factor shares show up in equation (8) for the elasticities of the sales shares; the elasticities of the sales shares are now given by a system of linear equations. As we shall see, this is because shocks trigger changes in relative demand for factors, which given fixed factor supplies, lead to changes in factor prices. To gain some intuition, consider for example the vector of elasticities d log Λ/d log Lg of factor shares to the supply of factor g. Note that as observed in Baqaee and Farhi (2019), we can rewrite the system of linear factor share propagation equations (9) as $$ \begin{equation} \frac{d\log \Lambda }{d\log L_{g}}=\Gamma \frac{d\log \Lambda }{d\log L_{g}}+\delta _{(g)}, \end{equation}$$ (10) with $$ \begin{align*} \Gamma _{fh}=-\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\Lambda _{f}}\operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi _{(h)},\Psi _{(f)}\big) \end{align*}$$ and $$ \begin{align*} \delta _{fg}=\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\Lambda _{f}}\operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi _{(g)},\Psi _{(f)}\big). \end{align*}$$ We call δ the factor share impulse matrix. Its gth column encodes the direct or first-round effects of a shock to the supply of factor g on factor income shares, taking relative factor prices as given. We call Γ the factor share propagation matrix. It encodes the effects of changes in relative factor prices on factor income shares, and it is independent of the source of the shock g. Consider a shock d log Lg > 0 that increases the supply of factor g. If we fix relative factor shares, the relative price of this factor declines by −d log Lg. Every producer k will substitute across its inputs in response to this shock. Suppose that θk > 1, so that producer k substitutes expenditure toward those inputs l that are more reliant on factor g, captured by |$\Psi _{\mathit{lg}}$|⁠, and the more so, the higher θk − 1. Now, if those inputs are also more reliant on factor f, captured by a high |$\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(g)},\Psi _{(f)})$|⁠, then substitution by k will increase expenditure on factor f and hence the income share of factor f. These substitutions, which happen at the level of each producer k, must be summed across producers. This first round of changes in the demand for factors triggers changes in relative factor prices, which then sets off additional rounds of substitution in the economy that we must account for, and this is the role Γ plays. For a given set of factor prices, the shock to g affects the demand for each factor, hence factor income shares and in turn factor prices, as measured by the F × 1 vector δ(g) given by the gth column of δ. These changes in factor prices then cause further substitution through the network, leading to additional changes in factor demands and prices. The impact of the change in the relative price of factor h on the share of factor f is measured by the fhth element of the F × F matrix Γ. The movements in factor shares are the fixed point of this process, that is, the solution of equation (10): $$ \begin{align*} \frac{d\log \Lambda }{d\log L_{g}}=(I-\Gamma )^{-1}\delta _{(g)}, \end{align*}$$ where I is the F × F identity matrix. The intuition for the elasticities of factor share to productivity shocks d log Λ/d log Aj in equation (9) and for the elasticities of sales shares of goods to factor supplies d log λ/d log Lg and to productivities d log λ/d log Aj in equation (8) are similar. 4.3. Macroeconomic Elasticities of Substitution Between Factors As in Section 3.3, we can leverage Proposition 7 to characterize the macroeconomic elasticities of substitution between factors in the aggregate production function. Proposition 8 (MESs). The MESs between factors in the aggregate production function are given by $$ \begin{equation*} 1-\frac{1}{\sigma _{fg}^{F}}=\big(I_{(g)}-I_{(f)}\big)^{\prime }(I-\Gamma )^{-1}\delta _{(g)}, \end{equation*}$$ where Γ is the F × F factor share propagation matrix defined by $$ \begin{equation*} \Gamma _{hh^{\prime }}=-\sum _{k\in 1+N}(\theta _{k}-1)\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi _{(h^{\prime })},\Psi _{(h)}/\Lambda _{h}\big), \end{equation*}$$ δ is the F × F factor share impulse matrix defined by $$ \begin{equation*} \delta _{hh^{\prime }}=\sum _{k\in 1+N}(\theta _{k}-1)\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi _{(h^{\prime })},\Psi _{(h)}/\Lambda _{h}\big) \end{equation*}$$ δ(g) is its gth column, I is the F × F identity matrix, and I(f) and I(g) are its fth and gth columns. In Section 3.3, we showed that the MES |$\sigma _{fg}^{C}$| between factors in the aggregate cost function are weighted averages of the microeconomic elasticities of substitution θk in production with weights given by sufficient statistics |$\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(g)},\Psi _{(g)}/\Lambda _{g}-\Psi _{(f)}/\Lambda _{f})$| of the input–output network. For the aggregate production function, the MESs |$\sigma _{fg}^{F}$| between factors are still determined by microeconomic elasticities of substitution θk and by sufficient statistics |$\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(h^{\prime })},\Psi _{(h)}/\Lambda _{h})$| of the input–output network. However, they are no longer weighted averages of the microeconomic elasticities of substitution, and they depend on a longer list of input–output network sufficient statistics. In fact, |$\sigma _{fg}^{F}$| is now a nonlinear function of the sufficient statistics |$(\theta _{k}-1)\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(h^{\prime })},\Psi _{(h)}/\Lambda _{h})$|⁠. There are two special cases where |$\sigma _{fg}^F$| becomes a weighted average of the microeconomic elasticities θi. The first case is the “Hicksian” case when there are only two factors. The second case is when all the microeconomic elasticities of substitution are identical, which follows from the following network-irrelevance result established in uncovered in Baqaee and Farhi (2019). Proposition 9 (Network Irrelevance). If all microeconomic elasticities of substitution θk are equal to the same value θk = θ, then MESs |$\sigma _{fg}^F$| between factors in the aggregate production function are also equal to that value |$\sigma _{fg}^F=\theta$|⁠. 4.4. Macroeconomic Bias of Technical Change We can also leverage Proposition 7 to characterize the macroeconomic bias of technical change in the aggregate production function. Proposition 10 (Bias of Technical Change). The biases toward one factor versus another of the different technology shocks in the aggregate production function are given by $$ \begin{equation*} \frac{B_{fgj}^{F}}{1+B_{fgj}^{F}}=\big(I_{(f)}-I_{(g)}\big)^{\prime }(I-\Gamma )^{-1}\hat{\delta }_{(j)}, \end{equation*}$$ where Γ is the F × F factor share propagation matrix defined by $$ \begin{equation*} \Gamma _{hh^{\prime }}=-\sum _{k\in 1+N}(\theta _{k}-1)\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi _{(h^{\prime })},\Psi _{(h)}/\Lambda _{h}\big), \end{equation*}$$|$\hat{\delta }$| is the F × 1 + N factor share impulse matrix defined by $$ \begin{equation*} \hat{\delta }_{hj}=\sum _{k\in 1+N}(\theta _{k}-1)\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi _{(j)},\Psi _{(h)}/\Lambda _{h}\big), \end{equation*}$$|$\hat{\delta }_{(j)}$| is its jth column, I is the F × F identity matrix, and I(f) and I(g) are its fth and gth columns. In Section 3.4, we showed that the bias |$B_{fgj}^{C}$| of technical change in the aggregate cost function was a weighted sum of the departure from one θk − 1 of the microeconomic elasticities of substitution in production with weights given by sufficient statistics |$\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(j)},\Psi _{(f)}/\Lambda _{f}-\Psi _{(g)}/\Lambda _{g})$| of the input–output network. For the aggregate production function, |$B_{fgj}^{F}$| is determined by departures from one θk − 1 of microeconomic elasticities of substitution and by sufficient statistics |$\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(j)},\Psi _{(h)}/\Lambda _{h})$| and |$\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(h^{\prime })},\Psi _{(h)}/\Lambda _{h})$| of the input–output network. However, it is no longer a weighted sum of the departures from one of the microeconomic elasticities of substitution, and it depends on a longer list of input–output network sufficient statistics. In fact, |$B_{fgj}^{F}$| is now a nonlinear function of the sufficient statistics |$(\theta _{k}-1)\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(j)},\Psi _{(h)}/\Lambda _{h})$| and |$(\theta _{k}-1)\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(h^{\prime })},\Psi _{(h)}/\Lambda _{h})$|⁠. As in case of the aggregate cost function, the network-irrelevance result for MESs in the aggregate production function stated in Proposition 9 does not extend to the bias of technical change. In general, the network matters for the bias of technical change, even when all the microeconomic elasticities of substitution θk are identical. Once again, the Cobb–Douglas is the one case where it does not: when all the microeconomic elasticities θk are unitary so that θk = 1, technical change in unbiased with |$B_{fgj}^F=0$| for all f, g, and j, no matter what the structure of the network is. As already mentioned, in Sections 6 and 6.5, we will present some simple theoretical and quantitative examples to illustrate the results of Sections 3 and 4. In Section 7, we will also generalize these results to non-nested-CES economies. Before doing so however, we now turn to the questions of factor aggregation and network factorization and relate our results to the Cambridge–Cambridge controversy. 5. Factor Aggregation, Network Factorization, and the Cambridge–Cambridge Controversy “The production function has been a powerful instrument of miseducation. The student of economic theory is taught to write Y = F(K, L) where L is a quantity of labor, K a quantity of capital and Y a rate of output of commodities. He is instructed to [...] measure L in man-hours of labor; he is told something about the index-number problem involved in choosing a unit of output; and then he is hurried on [...], in the hope that he will forget to ask in what units K is measured. Before ever he does ask, he has become a professor, and so sloppy habits of thought are handed on from one generation to the next.” — Robinson (1953) As described earlier, the Cambridge–Cambridge controversy was a decades-long debate about the foundations of the aggregate production function. The broader context of the controversy was a clash between two views of the origins of the returns to capital. The first one is the Marxist view of the return to capital as a rent determined by political economy and monopolization. The second one is the marginalist view of the competitive return to capital determined by technology, returns to scale, and scarcity. The marginalist view is encapsulated in the “three key parables” of neoclassical writers (Jevons, Bohm-Bawerk, Wicksell, Clark) identified by Samuelson (1966): (1) the rate of interest is determined by technology (r = FK); (2) there are diminishing returns to capital (K/Y and K/L are decreasing in r); and (3) the distribution of income is determined by relative factor scarcity (r/|$ w $| is decreasing in K/L). These parables are consequences of having a per-period neoclassical aggregate production function F(K, L) that has decreasing returns in each of its arguments. In his famous “Summing Up” QJE paper (Samuelson 1966), Samuelson, speaking for the Cambridge US camp, finally conceded to the Cambridge UK camp and admitted that indeed, capital could not be aggregated. He produced an example of an economy with “reswitching”: an economy where, as the interest rate decreases, the economy switches from one technique to the other and then back to the original technique. This results in a nonmonotonic relationship between the capital–labor ratio as a function of the rate of interest r. Since the corresponding capital–labor and capital–output ratios are nonmonotonic functions of the rate of interest, this economy violates the first two of the three key parables. It is impossible to represent the equilibrium of the economy with a simple neoclassical model with a neoclassical aggregate production function with capital and labor, and where output can be used for consumption and investment. Importantly, this result is established using valuations to compute the value of the capital stock index as sum of the values of the existing vintages of techniques, that is, the net-present-value of present and future payments to nonlabor net of the net-present-value of present and future investments. The value of the capital stock depends on the rate of interest. Basically, the physical interpretation of capital is lost when it is aggregated in this financial way, and so are basic technical properties such as decreasing returns.16 The reactions to the Cambridge–Cambridge controversy were diverse. Post-Keynesians, like Pasinetti, considered neoclassical theory to have been “shattered” by their critiques.17 Samuelson (and others like Franklin Fisher) on the other hand became invested in the view that one should develop disaggregated models of production. For example, Samuelson concluded his “A Summing Up” paper with this: “Pathology illuminates healthy physiology [...] If this causes headaches for those nostalgic for the old time parables of neoclassical writing, we must remind ourselves that scholars are not born to live an easy existence.” — Samuelson (1966). Solow was more ambivalent: “There is a highbrow answer to this question and a lowbrow one. The highbrow answer is that the theory of capital is after all just a part of the fundamentally microeconomic theory of the allocation of resources, necessary to allow for the fact that commodities can be transformed into other commodities over time. Just as the theory of resource allocation has as its “ dual” a theory of competitive pricing, so the theory of capital has as its “ dual” a theory of intertemporal pricing involving rentals, interest rates, present values, and the like. The lowbrow answer, I suppose, is that theory is supposed to help us understand real problems, and the problems that cannot be understood without capital-theoretic notions are those connected with saving and investment. Therefore the proper scope of capital theory is the elucidation of the causes and consequences of acts of saving and investment. Where the highbrow approach tends to be technical, disaggregated, and exact, the lowbrow view tends to be pecuniary, aggregative, and approximate. A middlebrow like myself sees virtue in each of these ways of looking at capital theory. I am personally attracted by what I have described as the lowbrow view of the function of capital theory. But as so often happens, I think the highbrow view offers indispensable help in achieving the lowbrow objective.” — Solow (1963). In the mid-1960s, an “MIT school” arose, which attempted to make progress on the study of disaggregated models of production. It impact was limited. Of course it did not help that the reswitching example that concluded the Cambridge–Cambridge controversy seemed so exotic. Later, developments in the growth literature, the arrival of real business cycle models, and the rational expectations revolution shifted the mainstream of the profession (with a few notable exceptions) away from these questions of heterogeneity and aggregation and toward dynamics and expectations. In our opinion, the general neglect of these questions is unfortunate, and we hope that our work will contribute to reviving interest in these important topics. This section can be seen as a historical detour to make contact with the issues that preoccupied the protagonists of the Cambridge–Cambridge controversy. First, in Section 5.1, we show that generically, capital, or for that matter, any group of distinct factors, cannot be physically aggregated. Second, in Section 5.2, we give useful sufficient conditions for the possibility of physically factorizing the production network into components that can be represented via a subaggregate production functions. Third, in Section 5.3, we show how to capture Samuelson’s reswitching example showing that capital cannot be linearly aggregated financially with valuations using our formalism. The general lesson from this section is that the details of the production network matter, that outside of very knife-edge special cases, aggregating factors violates the structure of the network, and hence that it also changes the properties of the model. As a result, attempting to capture a disaggregated model of production by directly postulating an aggregate model does not work outside of very special cases. 5.1. Factor Aggregation We study the aggregate production and cost functions of an economy with more than three factors. For brevity, we only treat the case of the aggregate cost function in nested-CES economies. The analysis of the general non-nested-CES case is similar, using the generalizations presented in Section 7. Similar proofs can be given in the case of the aggregate production function.18 We also abstract from productivity shocks in our discussion (hold them fixed), but similar reasoning can be extended to productivity shocks. Consider a nontrivial partition {Fi}i∈I of the set factors F, that is, such that there exists an element of the partition comprising strictly more than 1 and strictly less than F factors. We say factors can be aggregated according in the partition {Fi}i∈I if there exists a set of functions |$\tilde{C}$| and |$\tilde{g}_i$| such that $$ \begin{equation*} C(w_1, \ldots , w_F,Y) = \tilde{C}(g_{1}(\lbrace w_f\rbrace _{f \!\in F_1}), \ldots , g_{I}(\lbrace w_f\rbrace _{f \!\in F_I}),Y). \end{equation*}$$ Similarly, we say that factors can be aggregated in to the partition {Fi}i∈I, up to an nth order approximation, if there exists a set of functions |$\tilde{C}$| and |$\tilde{g}_i$| such that for all m ≤ n and (f1, f2, …, fm) ∈ Fm, $$ \begin{equation*} \frac{d^m \log C(w_1, \ldots , w_F,Y) }{d w_{f_1}\cdots d w_{f_m}} = \frac{d^m \log \tilde{C}(\tilde{g}_{1}(\lbrace w_f\rbrace _{f \!\in F_1}), \ldots , \tilde{g}_{I}(\lbrace w_f\rbrace _{i\in F_I}),Y)}{d w_{f_1}\cdots d w_{f_m}} . \end{equation*}$$ In words, the factors can be aggregated up to the nth order, if there exists a separable function whose derivatives coincide with C up to the nth order. A strict subset Fi of factors can always be aggregated locally to the first order by matching the shares of these factors in revenue.19 But this aggregation fails to the second order, and by implication, it also fails globally. Indeed, and abstracting from productivity shocks, by the Leontief-Sono theorem, the strict subset Fi of factors can be globally aggregated in the aggregate cost function if and only if |$C_{w_{\mathit{f}}}/C_{w_{\mathit{g}}}$|⁠, is independent of |$w_{\mathit h}$| for all |$(f,g)\in F_{i}^{2}$| and h ∈ F − Fi. This is equivalent to the condition that $$ \begin{equation*} \frac{d^{2}\log C}{d\log w_{h}d\log w_{f}}-\frac{d^{2}\log C}{d\log w_{h}d\log w_{g}}=0 \end{equation*}$$ or equivalently that |$\sigma _{hf}^{C}=\sigma _{hg}^{C}$| for all |$(f,g)\in F_{i}^{2}$|⁠, h ∈ F − Fi, and vector of factor prices. Using Proposition 2, this equation can be rewritten as $$ \begin{equation*} \sum _{k\in 1+N}(\theta _{k}-1)\lambda _{k}\operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi _{(h)},\Psi _{(f)}/\Lambda _{f}-\Psi _{(g)}/\Lambda _{g}\big)=0. \end{equation*}$$ It is clear that this property is not generic: starting with an economy where this property holds, it is possible to slightly perturb the economy and make it fail. Indeed, suppose that the property holds at the original economy for a given vector of factor prices. Consider a set Fi an element of the partition comprising strictly more than 1 and strictly less than F factors. If |$\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(h)},\Psi _{(f)}/\Lambda _{f}-\Psi _{(g)}/\Lambda _{g})\ne 0$| for some k, |$(f,g)\in F_{i}^{2}$|⁠, and h ∈ F − Fi, then it is enough to perturb the elasticity θk to make the property fail. If |$\operatorname{Cov}_{\Omega ^{(k)}}(\Psi _{(h)},\Psi _{(f)}/\Lambda _{f}-\Psi _{(g)}/\Lambda _{g})=0$| for all k, |$(f,g)\in F_{i}^{2}$|⁠, and h ∈ F − Fi, then we need to perturb the network to bring ourselves back to the previous case. It is enough to introduce a new producer producing only from factors h and f and selling only to final demand, with a small share ϵ, and scale down the other expenditure share in final demand by 1 − ϵ. We can choose the exposures of the new producer to h and f such that |$\operatorname{Cov}_{\Omega ^{(0)}}(\Psi _{(h)},\Psi _{(f)}/\Lambda _{f}-\Psi _{(g)}/\Lambda _{g})\ne 0$| for ϵ ≠ 0 small enough. This leads us the following proposition. Proposition 11 (Conditions for Factor Aggregation). Consider an economy with more than three factors and a nontrivial partition {Fi}i∈I of the set F of factors. In the aggregate production function, the factors can be aggregated in to the partition if and only if |$\sigma _{hf}^{F}=\sigma _{hg}^{F}$| for all i ∈ I, |$(f,g)\in F_{i}^{2}$|⁠, h ∈ F − Fi, and vector of factor supplies. Similarly, in the aggregate cost function, the factors can be aggregated according to the partition if and only if |$\sigma _{hf}^{C}=\sigma _{hg}^{C}$| for all i ∈ I, |$(f,g)\in F_{i}^{2}$|⁠, h ∈ F − Fi, and vector of factor prices. The conditions for factor aggregation according to a given partition in the aggregate production and cost functions are equivalent. Generically, these properties do not hold. The capital-aggregation theorem of Fisher (1965) can be seen through the lens of this proposition. It considers an economy with firms producing perfectly substitutable goods using firm-specific capital and labor. It shows that the different capital stocks can be aggregated into a single capital index in the aggregate production function if and only if all the firms have the same production function up to a capital-efficiency term. In this case, and only in the case, all the MESs between the different firm-specific capital stocks and labor in the aggregate production function are all equal to each other, and are equal to the elasticity of substitution between capital and labor of the common firm production function. Proposition 11 also shows that generically, capital, or indeed any other factor, cannot be aggregated. In other words, disaggregated production models cannot be avoided. Our approach in the previous sections acknowledges this reality and start with as many disaggregated factors as is necessary to describe technology. Our results take disaggregated models and seek to characterize their properties in terms of standard constructs such as the aggregate production and cost functions, marginal products of factors and factor demands, and elasticities of substitution between factors. 5.2. Network Factorization and Subaggregate Production Functions There is one frequently occurring network structure under which we can establish a powerful form of network aggregation. This result can easily be conveyed at a high level of generality without requiring the economy to be of the nested-CES form. We need the following definition. Definition. Let I be a subset of nodes. Let M be the set of nodes j ∉ I with |$\Omega _{\mathit{i\,j}}$| ≠ 0 for some i ∈ I. We say (r, I, M) is an island, if There is a unique node r ∈ I such that |$\Omega _{\mathit{i\,r}}$| = 0 for every i ∈ I. |$\Omega _{\mathit{j\,i}}$| = 0 for every i ∈ I − {r} and every j ∉ I. |$\Omega _{\mathit{k\,j}}$| = 0 for every j ∈ M and k ∉ I. We call r the export of the island. We say M are imports of the island. The imports of the island can be factors or nonfactors. With some abuse of notation, we denote by |$x_{\mathit{r\,i}}$| the total imports of good i ∈ M of the whole island (r, I, M). In the case where f ∈ M is a factor, we also use the notation |$L_{\mathit{r\,f}}$|⁠. See Figure 1 for a graphical illustration. Note that the requirement that the export r of the island not be used as an intermediate input by other producers in the island is merely a representation convention: if it is not the case, we can always introduce a fictitious producer that transforms the good into an export using a one-to-one technology. The same remark applies to the requirement that imports of the island not be used by producers outside of the island: if a particular import is used by another producer outside of the island, we can always introduce a fictitious producer that transforms the good into an import using a one-to-one technology. Figure 1. Open in new tabDownload slide Illustration of an island (r, I, M) within a broader network. The nodes in the island I are in blue, the imports M are in green, and the export of the island is denoted by r. The figure only shows the island, its imports, and its export. This island is embedded in a broader network that is not explicitly represented in the figure. Figure 1. Open in new tabDownload slide Illustration of an island (r, I, M) within a broader network. The nodes in the island I are in blue, the imports M are in green, and the export of the island is denoted by r. The figure only shows the island, its imports, and its export. This island is embedded in a broader network that is not explicitly represented in the figure. Given an island (r, I, M), we can define an associated island subaggregate production function with the island’s imports as factors and its exports as aggregate output: $$ \begin{equation*} F_{r}\!\left(\lbrace x_{ri}\rbrace _{i \in M}\!\!, \lbrace A_i\rbrace _{i \in I}\right) = \max y_r, \end{equation*}$$ $$ \begin{eqnarray} \text{subj. to:}\quad & y_j=A_jF_j\big(\lbrace x_{jk}\rbrace _{k\in I-\lbrace r\rbrace + M}\big)\quad (j \in I),\nonumber\\ & \displaystyle \sum _{i\in I}x_{ij}=x_{rj} \quad \left(j\in I-\lbrace r\rbrace + M\right). \end{eqnarray}$$ (11) With some abuse of notation, we use the same symbol Fr to denote the endogenous island subaggregate production function that we have used to denote the exogenous production function of producer r. The arguments of the latter are the intermediate inputs used by producer r whereas those of the former are the imports of the island and the productivities of the different producers in the island. The island subaggregate production function can be characterized using the same methods that we have employed for the economy-wide aggregate production function throughout the paper. The planning problem defining the economy-wide aggregate production function can then be rewritten by replacing all the nodes in the island by its subaggregate production function: $$ \begin{align*} F(L_1, \ldots , L_N, A_1, \ldots , A_N)&= \max \mathcal {D}_{0}(c_{1},\ldots ,c_{N})\\ \text{subj. to:} & \quad y_{i}=A_{i}F_{i}\!\big(\lbrace x_{ij} \rbrace _{j\in N-I+\lbrace r \rbrace + F}\big)\quad (i \in N-I),\\ & \quad y_{r}=F_{r}\!\left(\lbrace x_{ri}\rbrace _{i \in M}\!\!, \lbrace A_i\rbrace _{i \in I}\right)\!,\\ & \quad c_{i}+\sum _{j\in N-I+\lbrace r\rbrace }x_{ji}=y_{i}\quad (i \in N-I+\lbrace r\rbrace ),\\ & \quad \sum _{i\in N-I+\lbrace r\rbrace}x_{if}=L_{f} \quad (f \in F), \end{align*}$$ where Fr is the island subaggregate production function. So, if the economy contains islands, then the economy-wide aggregate production function can be derived in two stages: by first solving the island component planning problems (11) giving rise to the island subaggregate production functions, and then by solving the economy-wide problem giving rise to the economy-wide aggregate production function that uses the island subaggregate production functions. To describe this recursive structure, we say that the production network has been factorized. Proposition 12 (Network Factorization). Let (r, I, M) denote an island. Then the economy-wide aggregate production function depend only on {Ai}i∈I and {|$x_{\mathit{ri}}$|}i∈M = {yi}i∈M via the island aggregate production function Fr({|$x_{\mathit{ri}}$|}i ∈ M, {Ai}). In particular, if all the imports of the island are factors so that M ⊆ F, then the factors can be aggregated according to the partition {M, F − M}.20 The factor-aggregation theorems of Fisher (1982) and Fisher (1983) correspond to special cases of the conditions in the second part of this proposition. The first part of the proposition is particularly useful in disaggregated intertemporal models. In some cases, intertemporal linkages can be represented via a set of capital stocks and their laws of motions via production functions with investment as inputs. This is the case, for example, of the post-Keynesian reswitching model studied in the next section. Even though will not pursue this particular representation of the example, it will prove helpful in understanding some of the obstacles preventing the aggregation of this model into a simple one-good neoclassical growth model. 5.3. Reswitching Revisited We now turn to the post-Keynesian reswitching example in Samuelson (1966). Samuelson’s example features an economy with two goods in every period: labor and output. Labor is in unit supply. Output is used for consumption, labor can be used to produce output using two different production functions (called “techniques”). The first technique combines 2 units of labor at t − 2 and 6 units of labor at t to produce one unit of output at t. The second technique uses 7 units of labor at t − 1 to produce one unit of output at t. Both techniques are assumed to have constant-returns-to-scale. We focus on the steady state of this economy, taking the gross interest rate R = 1 + r as given, where r is the net interest rate. The interest rate R is varied by changing the rate of time preferences β = 1/R of the agent. By comparing the unit costs of production, it is easy to see that the second technique dominates for high and low values of the interest rate, and that the first technique dominates for intermediate values of the interest rate. Indeed, at a gross interest rate of one (a net interest rate of zero), the second technique is preferred because it has a lower total labor requirement (7 vs. 8); and at a high interest rate, the second technique is preferred because the two-period delay in production of the first technique is too costly. Therefore, the economy features reswitching: as the interest rate is increased, it switches from the second to the first technique and then switches back to the second technique. This post-Keynesian example can be obtained as a limit of the sort of nested-CES economies that we consider, provided that we use the Arrow–Debreu formalism of indexing goods and factors by dates and to think about capital stocks as intermediate goods. The corresponding production network is represented in Figure 2. The diagram shows how in different periods, labor can be combined with intermediate goods produced from past labor to produce new intermediate and final goods. Figure 2. Open in new tabDownload slide The production network underlying Samuelson’s reswitching example. The arrows indicate the flow of goods. The green nodes are primary factors, and Y is aggregate output in this economy, which is perfectly substitutable across consumption units at different dates. Figure 2. Open in new tabDownload slide The production network underlying Samuelson’s reswitching example. The arrows indicate the flow of goods. The green nodes are primary factors, and Y is aggregate output in this economy, which is perfectly substitutable across consumption units at different dates. In Figure 3 we plot some steady-values for this economy as a function of the interest rate. The capital–labor and capital–output ratios are nonmonotonic functions of the interest rate, where the aggregate capital stock is computed via financial valuations as the net-present-value of the payments to capital or equivalently at its replacement cost. Using the notation in Figure 2, this means that the aggregate capital stock in period t is computed as |$K_t = p_{x_{1,t}} x_{1,t} + p_{y_{2,t}} y_{2,t} = 2 w_{t-2} x_{1,t} + 7 w_{t-1} y_{2,t}.$| Figure 3. Open in new tabDownload slide Samuelson’s reswitching example. Figure 3. Open in new tabDownload slide Samuelson’s reswitching example. We could alternatively represent this economy using per-period production functions for consumption and for investments, where the factors would be the capital stocks corresponding to the quantities of the different vintages of the two techniques and labor, and the laws of motions for the different capital stocks would combine previous capital stocks and investments to produce new capital stocks. These different production functions would correspond to a factorization of the production network into separate islands. For our purposes here, it is more convenient instead to work directly with the disaggregated economy in Arrow–Debreu intertemporal form, which is characterized by the propagation equations (8) and (9). These equations are associated with an intertemporal production function characterizing the production of an intertemporal aggregate of consumption goods in all dates as a function labor in all dates. For convenience, we consider a smoother version of the post-Keynesian example by imaging that the two techniques produce different goods that enter in consumption via a CES aggregator with a finite elasticity of substitution. The example obtains in the limit when that elasticity of substitution goes to infinity. As can be seen in Figure 4, the properties of the smoothed-out example resemble those of the original example.21 Figure 4. Open in new tabDownload slide Samuelson’s smoothed-out reswitching example. Figure 4. Open in new tabDownload slide Samuelson’s smoothed-out reswitching example. Now consider a simple one-good neoclassical growth model with a per-period neoclassical aggregate production function with capital and labor as its two arguments, where output can be used for investment and for consumption. In such a model, the homogeneous capital stock can be computed via financial valuations or equivalently at its replacement cost. The question we now ask is whether we could represent the disaggregated post-Keynesian example as a version of the simple neoclassical model with an aggregate capital stock given by the sum of the values of the heterogeneous capital stocks in the disaggregated post-Keynesian example. The nonmonotonicity of the capital–labor and capital–output ratios as a function of the interest rate shows that this is not possible. The simple neoclassical model could match the investment share, the capital share, the value of capital, and the value of the capital–output and capital–labor ratios of the original steady state of the disaggregated model, but not across steady states associated with different values of the interest rate. In other words, aggregation via financial valuation fails.22 The production network of the simple one-good neoclassical growth economy is represented in Figure 5. The nodes (yt, lt, kt) form an island where yt is the export of the island, and kt is the import of the island. Hence the relative price of kt and lt depends only on the quantity of kt and lt, and the problem can be studied in isolation via a neoclassical production function. The production function of the post-Keynesian reswitching example is different and it cannot be factorized into the same islands and represented with the same production functions. In other words, physical aggregation also fails. Figure 5. Open in new tabDownload slide The production network underlying a simple one-good neoclassical growth model of an economy with a per-period production function with capital and labor as its two arguments, where output can be used for investment and for consumption. The arrows indicate the flow of goods. The green nodes are primary factors, and Y is aggregate output in this economy, which is perfectly substitutable across consumption units at different dates. Figure 5. Open in new tabDownload slide The production network underlying a simple one-good neoclassical growth model of an economy with a per-period production function with capital and labor as its two arguments, where output can be used for investment and for consumption. The arrows indicate the flow of goods. The green nodes are primary factors, and Y is aggregate output in this economy, which is perfectly substitutable across consumption units at different dates. One way to frame the lesson more generally is that the details of the production network matter. Aggregating factors changes the production network, and hence aggregation changes the properties of the model. 6. Simple Illustrative Examples In this section, we provide four simple theoretical examples and a simple quantitative illustration. The first example is Hicksian in the sense that there are only two factors: the MESs in the aggregate production and cost functions are identical and are symmetric. The second example is non-Hicksian since it has three factors: the MESs in the aggregate production and cost functions are different and are asymmetric in general. The third example is the famous example of Houthakker (1955). The fourth example works out the macroeconomic bias of technical change that is capital augmenting at the microeconomic level in a disaggregated “task-based” model; it also shows that such a model can give rise to richer and more complex patterns than simpler models based on an aggregate production function in the sense that such technical change can be capital biased but not necessarily capital augmenting at the macroeconomic level. The fifth example is a simple quantitative illustration to capital-skill complementarity à la Griliches (1969) in the US economy, taking into account the multiplicity of sectors and their input–output linkages. We use the analysis to revisit the influential analysis in Krusell et al. (2000) of the role of these complementarities in the evolution over time of the skill premium. 6.1. A Hicksian Example with Two Factors Our first example features two factors of production and producers with different factor intensities. A similar example is analyzed in Oberfield and Raval (2014), building on Sato (1975).23 Each producer 1 ≤ i ≤ N produces from capital (Ki) and labor (Li) according to $$ \begin{equation*} \frac{y_{i}}{\bar{y}_{i}}=\left(\omega _{iK}\left(\frac{K_{i}}{\bar{K}_{i}}\right)^{\frac{\theta _{iKL-1}}{\theta _{iKL}}}+\omega _{iL}\left(\frac{L_{i}}{\bar{L}_{i}}\right)^{\frac{\theta _{iKL-1}}{\theta _{iKL}}}\right)^{\frac{\theta _{iKL}}{\theta _{iKL}-1}} \end{equation*}$$ and the final demand aggregator is $$ \begin{equation*} \frac{Y}{\bar{Y}}=\left(\sum _{i=1}^{N}\omega _{Di}\left(\frac{y_{i}}{\bar{y}_{i}}\right)^{\frac{\theta _{D}-1}{\theta _{D}}}\right)^{\frac{\theta _{D}}{\theta _{D}-1}} \end{equation*}$$ with ωiK = 1 − ωiL and |$\sum _{i=1}^{N}\omega _{Di}=1$|⁠. Sales shares for goods and factors are given by λD = 1, λi = ωDi, |$\Lambda _{K}=\sum _{i=1}^{N}\lambda _{i}\omega _{iK}$|⁠, and |$\Lambda _{L}=\sum _{i=1}^{N}\lambda _{i}\omega _{iL}$|⁠. For this example economy, the MES between capital and labor in the aggregate cost and production functions satisfy |$\sigma _{LK}^{C}=\sigma _{KL}^{C}=\sigma _{LK}^{F}=\sigma _{KL}^{F}=\sigma _{LK}$|⁠, where σLK is given by $$ \begin{equation*} \sigma _{LK}=\sum _{i=1}^{N}\theta _{iKL}\lambda _{i}\frac{\omega _{iK}(1-\omega _{iK})}{\Lambda _{K}(1-\Lambda _{K})}+\theta _{D}\frac{\sum _{i=1}^{N}\lambda _{i}(\omega _{iK}-\Lambda _{K})^{2}}{\Lambda _{K}(1-\Lambda _{K})}, \end{equation*}$$ where $$ \begin{equation*} \sum _{i=1}^{N}\lambda _{i}\frac{\omega _{iK}(1-\omega _{iK})}{\Lambda _{K}(1-\Lambda _{K})}+\frac{\sum _{i=1}^{N}\lambda _{i}(\omega _{iK}-\Lambda _{K})^{2}}{\Lambda _{K}(1-\Lambda _{K})}=1. \end{equation*}$$ The MES between capital and labor σLK is a weighted average of the microeconomic elasticities of substitution between capital and labor θiKL and of the elasticity of substitution θD across producers in final demand. The weight on θiKL increases with its sales share λi and the heterogeneity in factor shares ωiK(1 − ωiK) relative to the economy-wide heterogeneity in factor shares ΛK(1 − ΛK). It is zero when ωiK = 0 or ωiK = 1. The weight on θD increases in the heterogeneity in capital exposure across producers |$\sum _{i=1}^{N}\lambda _{i}(\omega _{iK}-\Lambda _{K})^{2}$| relative to the economy-wide heterogeneity in factor exposures ΛK(1 − ΛK). It is zero when ωiK = ΛK for all i. 6.2. A Non-Hicksian Example with Three Factors Our second example extends the first example to include three factors. Each producer 1 ≤ i ≤ N produces from capital (Ki), skilled labor (Hi), and unskilled labor (Li), according to $$ \begin{align*} \frac{y_{i}}{\bar{y}_{i}}&=\Bigg (\omega _{iKH}\left(\omega _{iK}\left(\frac{K_{i}}{\bar{K}_{i}}\right)^{\frac{\theta _{iKH-1}}{\theta _{iKH}}}+\omega _{iH}\left(\frac{H_{i}}{\bar{H}_{i}}\right)^{\frac{\theta _{iKH-1}}{\theta _{iKH}}}\right)^{\frac{\theta _{iKH}}{\theta _{iKH}-1}\frac{\theta _{iKHL-1}}{\theta _{iKHL}}}\\ &\quad\quad {}+\, \omega _{iL}\!\left(\frac{L_{i}}{\bar{L}_{i}}\right)^{\frac{\theta _{iKHL-1}}{\theta _{iKHL}}}\vphantom{\omega _{iK}\left(\frac{K_{i}}{\bar{K}_{i}}\right)^{\frac{\theta _{iKH-1}}{\theta _{iKH}}}}\Bigg)^{\frac{\theta _{iKHL-1}}{\theta _{iKHL}}} \end{align*}$$ and the final demand aggregator is $$ \begin{equation*} \frac{Y}{\bar{Y}}=\left(\sum _{i=1}^{N}\omega _{Di}\left(\frac{y_{i}}{\bar{y}_{i}}\right)^{\frac{\theta _{D}-1}{\theta _{D}}}\right)^{\frac{\theta _{D}}{\theta _{D}-1}}, \end{equation*}$$ with ωiKH = 1 − ωiL, ωiK = 1 − ωiH, and |$\sum _{i=1}^{N}\omega _{Di}=1$|⁠. This economy can be written in normal form by introducing fictitious producers indexed by iKH producing a bundle of capital and skilled labor to be used as an input by producer i. Sales shares for goods and factors are given by λD = 1, λi = ωDi, λiKH = λiωiKH, |$\Lambda _{K}=\sum _{i=1}^{N}\lambda _{i}\omega _{iKH}\omega _{iK}$|⁠, |$\Lambda _{H}=\sum _{i=1}^{N}\lambda _{i}\omega _{iKH}\omega _{iH}$|⁠, and |$\Lambda _{L}=\sum _{i=1}^{N}\lambda _{i}\omega _{iL}$|⁠. We start with the MESs in the aggregate cost function. For the sake of illustration, we focus on the MESs |$\sigma _{LK}^{C}$| and |$\sigma _{KL}^{C}$|⁠. We have $$ \begin{eqnarray*} \sigma _{LK}^{C}&=&\theta _{D}\sum _{i=1}^{N}\lambda _{i}\omega _{iKH}\omega _{iK}\left(\frac{\omega _{iKH}\omega _{iK}}{\Lambda _{K}}-\frac{\omega _{iL}}{\Lambda _{L}}\right)\\ && +\, \sum _{i=1}^{N}\theta _{iKHL}\left[\lambda _{i}\omega _{iK}\frac{\omega _{iKH}\omega _{iK}}{\Lambda _{K}}+\lambda _{i}\omega _{iKH}\omega _{iK}\left(-\frac{\omega _{iKH}\omega _{iK}}{\Lambda _{K}}+\frac{\omega _{iL}}{\Lambda _{L}}\right)\right]\\ && +\, \sum _{i=1}^{N}\theta _{iKH}\lambda _{i}\omega _{iH}\frac{\omega _{iKH}\omega _{iK}}{\Lambda _{K}},\\ \sigma _{KL}^{C}&=&\theta _{D}\sum _{i=1}^{N}\lambda _{i}\omega _{iL}\left(\frac{\omega _{iL}}{\Lambda _{L}}-\frac{\omega _{iKH}\omega _{iK}}{\Lambda _{K}}\right)\\ && +\, \sum _{i=1}^{N}\theta _{iKHL}\left[\lambda _{i}\frac{\omega _{iL}}{\Lambda _{L}}+\lambda _{i}\omega _{iL}\left(\frac{\omega _{iKH}\omega _{iK}}{\Lambda _{K}}-\frac{\omega _{iL}}{\Lambda _{L}}\right)\right]. \end{eqnarray*}$$ As is apparent from these formulas, in general, |$\sigma _{LK}^{C}\ne \sigma _{KL}^{C}$|⁠. For instance, and by contrast with |$\sigma _{LK}^{C}$|⁠, |$\sigma _{KL}^{C}$| does not depend on the microeconomic elasticities of substitution θiKH between capital and skilled labor. This follows from two observations: variations underlying the definition |$\sigma _{LK}^{C}$| vary |$w_{\mathit K}$| while keeping |$w_{\mathit H}$| and |$w_{\mathit L}$| constant whereas variations underlying the definition |$\sigma _{KL}^{C}$| vary |$w_{\mathit L}$| while keeping |$w_{\mathit K}$| and |$w_{\mathit H}$| constant; capital and skilled labor always enter in the CES nest iKH with elasticity θiKH whereas unskilled labor does not. In the special case where capital intensities are uniform across producers, |$\sigma _{LK}^{C}$| is independent of θD, and similarly, in the special case where labor intensities are uniform across producers, |$\sigma _{KL}^{C}$| is independent of θD. In general, and although verifying it requires some steps of algebra, both |$\sigma _{LK}^{C}$| and |$\sigma _{KL}^{C}$| are weighted averages of the microeconomic elasticities of substitution. The expressions for the MESs in the aggregate production function |$\sigma _{KL}^{F}$| and |$\sigma _{LK}^{F}$| are more complex, and we omit them for brevity. Obtaining these equations requires solving a system of equations of two equations in two unknowns for the changes in factor shares d log ΛL and d log ΛK in response to a change d log K and d log L, respectively (after having used the equation ΛKd log ΛK + ΛHd log ΛH + ΛLd log ΛL = 0 to substitute out d log ΛH). In general, we have |$\sigma _{LK}^{F}\ne \sigma _{KL}^{C}$|⁠, |$\sigma _{LK}^{F}\ne \sigma _{LK}^{C}$|⁠, and |$\sigma _{LK}^{F}\ne \sigma _{KL}^{F}$|⁠. Moreover, |$\sigma _{LK}^{F}$| and |$\sigma _{LK}^{F}$| are not weighted averages, or even linear functions, of the microeconomic elasticities of substitution. These expressions simplify drastically in the case where factor intensities and microeconomic elasticities of substitution are uniform across producers so that ωiK, ωiH, ωiL, ωiKH, θiKHL, and θiKH are independent of i. In this case, the aggregate production and cost functions are of the nested-CES form. In particular, we get |$\sigma _{KL}^{C}=\sigma _{KL}^{F}=\theta _{KHL}$|⁠, |$\sigma _{LK}^{C}=\theta _{KHL}\Lambda _{K}/(\Lambda _{K}+\Lambda _{H})+\theta _{KH}\Lambda _{H}/(\Lambda _{K}+\Lambda _{H})$|⁠, and |$\sigma _{LK}^{F}=(\Lambda _{K}+\Lambda _{H})/(\Lambda _{K}/\theta _{KHL}+\Lambda _{H}/\theta _{KH})$|⁠. Hence we see that in this simple case, |$\sigma _{LK}^{C}$| and |$\sigma _{LK}^{F}$| are, respectively, the arithmetic and harmonic averages of the microeconomic elasticities θKHL and θKH with weights ΛK/(ΛK + ΛH) and ΛH/(ΛK + ΛH) and are therefore different in general. 6.3. Houthakker (1955) Houthakker (1955) described how a disaggregated economy with fixed proportions and decreasing returns at the microeconomic level could give rise to a Cobb–Douglas aggregate production function with decreasing returns when the distribution of technical requirements across producers is a double Pareto. The model illustrates a divorce between microeconomic elasticities of substitutions between factors equal to 0, and macroeconomic elasticities of substitutions between factors equal to 1. The model is a particular limit case of our general model. In this section, we explain how to capture it using our formalism.24 The model features individual cells. Each individual j cell can produce up to φj units of output, where each unit of output requires a1, j units of factor L1 and a2, j units of factor L2. Using output as the numeraire, the unit is active in equilibrium if 1 − a1, j|$w_{1}$| − a2, j|$w_{2}$| ≥ 0. The total capacity of cells for which a1, j lies between a1 and a1 + da1 and for which a2, j lies between a2 and a2 + da2 can be represented by φ(a1, a2)da1da2, where φ is the input–output distribution for the set of cells concerned. Total output and total factor demand are then given by $$ \begin{align*} Y & =\int _{0}^{1/w_{1}}\int _{0}^{(1+a_{1}w_{1})/w_{2}}\phi (a_{1},a_{2})da_{1}da_{2},\\ L_{1} & =\int _{0}^{1/w_{1}}\int _{0}^{(1+a_{1}w_{1})/w_{2}}a_{1}\phi (a_{1},a_{2})da_{1}da_{2},\\ L_{2} & =\int _{0}^{1/w_{1}}\int _{0}^{(1+a_{1}w_{1})/w_{2}}a_{2}\phi (a_{1},a_{2})da_{1}da_{2}. \end{align*}$$ The last two equations implicitly give w1 and w2 as functions of L1 and L2, and plugging these functions back into the first equation gives output Y = F(L1, L2) as a function of L1 and L2, thereby describing the aggregate production function of this economy. Characterizing this production function is difficult, and so Houthakker focused on the special case where the distribution of unit requirements is double Pareto with |$\phi (a_{1},a_{2})=Aa_{1}^{\alpha _{1}-1}a_{2}^{\alpha _{2}-1}$|⁠. He showed that in this case, the production function is given by $$ \begin{equation*} F(L_{1},L_{2})=\Theta L_{1}^{\frac{\alpha _{1}}{\alpha _{1}+\alpha _{2}+1}}L_{2}^{\frac{\alpha _{2}}{\alpha _{1}+\alpha _{2}+1}}, \end{equation*}$$ where $$ \begin{equation*} \Theta =(\alpha _{1}+\alpha _{2}+1)\left(\frac{AB(\alpha _{1}+1,\alpha _{2}+1)}{\alpha _{1}^{\alpha _{1}+1}\alpha _{2}^{\alpha _{2}+1}}\right)^{\frac{1}{\alpha _{1}+\alpha _{2}+1}}, \end{equation*}$$ where B is the beta function given by |$B(\alpha _{1}+1,\alpha _{2}+1)=\int _{0}^{1}t^{\alpha _{1}}(1-t)^{\alpha _{2}}.$| To capture the model using our formalism, one first has to introduce more factors, because of decreasing returns to scale at the micro level. Specifically, we assume that over and above the factors L1 and L2, there is a different fixed factor Lj in unit supply for each producer j. Producer j produces output according to a Leontief aggregate min {l1/a1, l2/a2, φjlj}, where l1 is its use of factor L1, l2 its use of factor L2, and lj its use of factor Lj. The outputs of the different producers are then aggregated using a CES aggregate. Houthakker’s model obtains in the limit where the elasticity of substitution of this CES aggregator goes to infinity. In this limit, the wage of the fixed factor of producer j can be computed as (1 − a1, j|$w_{1}$| − a2, j|$w_{2}$|⁠)+ so that payments to all factors exhaust the revenues of producer j. It is possible to obtain Houthakker’s formulas in the particular case where the distribution is double Pareto by specializing our general formulas, but the calculations are tedious and so we refrain from doing so. The reason for this difficulty is that there are many factors: the two nonfixed factors and all the fixed factors. Our formulas solve for all these changes in shares simultaneously as the solution of a large system of linear equations. In the double Pareto case, it is actually possible to sidestep this difficulty and to solve directly for the changes in the shares in the two non-fixed factors, which turn out to be zero. This means that although the individual changes in the shares accruing to the fixed factors are nonzero, their sum is zero. Since the changes in these individual shares is not of direct interest for the question at hand, the direct method is preferable. Indeed, it is straightforward to see that $$ \begin{align*} Y & =\frac{(\alpha _{1}+\alpha _{2}+1)AB(\alpha _{1}+1,\alpha _{2}+1)}{\alpha _{1}\alpha _{2}w_{1}^{\alpha _{1}}w_{2}^{\alpha _{2}}},\\ L_{1} & =\frac{AB(\alpha _{1}+1,\alpha _{2}+1)}{\alpha _{2}w_{1}^{\alpha _{1}+1}w_{2}^{\alpha _{2}}},\\ L_{2} & =\frac{AB(\alpha _{1}+1,\alpha _{2}+1)}{\alpha _{1}w_{1}^{\alpha _{1}}w_{2}^{\alpha _{2}+1}}. \end{align*}$$ This immediately implies that $$ \begin{align*} \frac{w_{1}L_{1}}{Y} & =\frac{\alpha _{1}}{\alpha _{1}+\alpha _{2}+1},\\ \frac{w_{2}L_{2}}{Y} & =\frac{\alpha _{2}}{\alpha _{1}+\alpha _{2}+1}. \end{align*}$$ This in turn immediately implies that $$ \begin{equation*} \sigma _{L_{1},L_{2}}^{F}=\sigma _{L_{2},L_{1}}^{F}=\sigma _{L_{1},L_{2}}^{C}=\sigma _{L_{2},L_{1}}^{C}=1 \end{equation*}$$ as well as Houthakker’s result that |$F(L_{1},L_{2})=\Theta L_{1}^{\alpha _{1}/(\alpha _{1}+\alpha _{2})}L_{2}^{\alpha _{2}/(\alpha _{1}+\alpha _{2})}$|⁠. 6.4. Capital-Biased Technical Change in a Task-Based Model In this section, we consider an example taken from Baqaee and Farhi (2018) and inspired by Acemoglu and Restrepo (2019). We compute the bias of technical change and explain its dependence on the microeconomic pattern of sales shares, factor intensities, and microeconomic elasticities of substitution. We then show that in a “task-based” economy with disaggregated production, a possible consequence of capital-augmenting technical change and automation at the microeconomic level is a simultaneous decline in both the labor share of income and the real wage at the macroeconomic level. This cannot happen in a simpler economy with an aggregate production function with capital-augmenting technical since such technical change would always increase the real wage. In other words, technical change that is capital augmenting at the microeconomic level is capital biased but not capital augmenting at the macroeconomic level. The impact of technical change is therefore richer and more complex in models of disaggregated production. Assume that each producer, associated to a “task”, produces from capital and labor according to $$ \begin{equation*} \frac{y_{i}}{\bar{y}_{i}}=\left(\omega _{iL}\left(\frac{\tilde{L}_{i}}{\bar{\tilde{L}}_{i}}\right)^{\frac{\theta _{KL-1}}{\theta _{KL}}}+\omega _{iK}\left(\frac{\tilde{K}_{i}}{\bar{\tilde{K}}_{i}}\right)^{\frac{\theta _{KL-1}}{\theta _{KL}}}\right)^{\frac{\theta _{KL}}{\theta _{KL}-1}} \end{equation*}$$ with $$ \begin{equation*} \tilde{K}_{i}=\frac{A_{iK}}{\bar{A}_{iK}}K_{i}\qquad \textrm{and}\qquad \tilde{L}_{i}=\frac{A_{iL}}{\bar{A}_{iL}}L_{i} \end{equation*}$$ and ωiK = 1 − ωiL. The consumer values the output of these tasks according to a CES aggregator $$ \begin{equation*} \frac{Y}{\bar{Y}}=\left(\sum _{i=1}^{N}\omega _{Di}\left(\frac{y_{i}}{\bar{y}_{i}}\right)^{\frac{\theta _{D}-1}{\theta _{D}}}\right)^{\frac{\theta _{D}}{\theta _{D}-1}}, \end{equation*}$$ with |$\sum _{i=1}^{N}\omega _{Di}=1$|⁠. Sales shares for goods and factors are given by λD = 1, λi = ωDi, |$\Lambda _{K}=\sum _{i=1}^{N}\lambda _{i}\omega _{iK}$|⁠, and |$\Lambda _{L}=\sum _{i=1}^{N}\lambda _{i}\omega _{iL}$|⁠. Capital-biased technical change is modeled as a shock d log AkK > 0. Using our formulas, we can characterize the responses of the labor share and of the wage. The biases toward K versus L of this technology shock in the aggregate cost function and production functions are given by $$ \begin{equation*} B^C_{KLkK}=(\theta _{KL}-1)\lambda _{k}\frac{\omega _{kK}}{\Lambda _K}\frac{\omega _{kL}}{\Lambda _{L}}+(\theta _D-1)\lambda _k\frac{\omega _{kK}}{\Lambda _K}\left(1-\frac{\omega _{kL}}{\Lambda _L}\right) \end{equation*}$$ and $$ \begin{equation*} \frac{B^F_{KLkK}}{1+B^F_{KLkK}}=\frac{(\theta _{KL}-1)\lambda _{k}\frac{\omega _{kK}}{\Lambda _K}\frac{\omega _{kL}}{\Lambda _{L}}+(\theta _D-1)\lambda _k\frac{\omega _{kK}}{\Lambda _K}\big(1-\frac{\omega _{kL}}{\Lambda _L}\big)}{1+(\theta _{KL}-1)\sum _{i=1}^N\lambda _{i}\frac{\omega _{iK}}{\Lambda _{K}}\frac{\omega _{iL}}{\Lambda _{L}}+(\theta _D-1)\frac{1}{\Lambda _L\Lambda _K}\operatorname{Var}_{\lambda }\big(\omega _{(L)}\big)}. \end{equation*}$$ When there is a single task k so that λk = 1, ωkL = ΛL, and ωkK = Λk, the aggregate production function is CES with elasticity of substitution θKL between capital and labor, and the biases in the aggregate cost and production functions coincide |$B^C_{KLkL}=B^F_{KLkL}=\theta _{KL}-1$|⁠. In general however, these two biases are different |$B^C_{KLkL}\ne B^F_{KLkL}$|⁠. The bias in the aggregate cost function |$B^C_{KLkL}$| is a linear function of the different microeconomic elasticities of substitution whereas the bias in the aggregate production function |$B^F_{KLkL}$| is a nonlinear function of these elasticities. However, the signs of the two biases are identical. A capital-augmenting shock to task k is more likely to be biased toward capital versus labor at the macroeconomic level when: (i) capital and labor are substitutes at the microeconomic level with θKL > 1; and (ii) tasks are substitutes with θD > 1 and task k is more capital intensive than the average task with ωkL/ΛL < 1, or tasks are complements with θD < 1 and task k is more labor intensive than the average task with ωkL/ΛL > 1. The intuition for (i) is straightforward: in response to a positive shock, producer k substitutes expenditure toward capital if θKL > 1 and toward labor if θKL < 1. The intuition for (ii) is the following: a positive shock reduces the price of task k; if θD > 1, the household substitutes expenditure toward task k, resulting in the reallocation of factors toward task k, which increases the overall expenditure on capital if ωkL/ΛL < 1 and reduces it otherwise; if θD < 1, the household substitutes expenditure away from task k, resulting in the reallocation of factors away from task k, which increases the overall expenditure on capital if ωkL/ΛL > 1 and reduces it otherwise. We now turn our attention to the effect of technical change on the real wage, holding factor supplies constant. For simplicity, we focus on the case where final demand is Cobb Douglas across tasks with θD = 1. We also assume that capital and labor are substitutes at the microeconomic level with θKL > 1, so that a capital-augmenting shock to task k is biased toward capital versus labor at the macroeconomic level, that is, a positive shock increases the capital share and decreases the labor share. As we shall now see, the effect of such a shock on the real wage is ambiguous:25|$^,$|26 $$ \begin{equation*} \frac{d\log w_{L}}{d\log A_{kK}}=\lambda _{k}\omega _{kK}\frac{1+(\theta _{KL}-1)\sum _{i}\lambda _{i}\big(\frac{\omega _{iL}}{\Lambda _{L}}-\frac{\omega _{kL}}{\Lambda _{L}}\big)\frac{\omega _{iK}}{\Lambda _{K}}}{1+(\theta _{KL}-1)\sum _{i}\lambda _{i}\frac{\omega _{iL}}{\Lambda _{L}}\frac{\omega _{iK}}{\Lambda _{K}}}. \end{equation*}$$ If task k is more labor intensive than the average task with ωkL/ΛL > 1, and capital and labor are highly substitutable with a high-enough value of θKL, then the real wage falls in response to a positive shock. This is because as task k substitutes away from labor and toward capital, labor is reallocated to other tasks who use labor less intensively. This reallocation of labor reduces the marginal product of labor and hence the real wage. These patterns cannot be generated in a simpler economy with an aggregate production function with capital-augmenting technical change since such a shock always increases the marginal product of labor and hence the real wage.27 6.5. Quantitative Application: Capital-Skill Complementarity and the Skill Premium In this section, we briefly summarize a simple quantitative application of the results in Sections 3 and 4. We refer the reader to Appendix B for the details of the analysis. We show how to use these results to study capital-skill complementarity à la Griliches (1969) in the US economy, taking into account the multiplicity of sectors and their input–output linkages. We use the analysis to revisit the influential analysis in Krusell et al. (2000) of the role of these complementarities in the evolution over time of the skill premium. Krusell et al. (2000) studies the relationship between the increasing skill premium and the rapid decline in the relative price of equipment investment goods. They find that complementarity between capital goods and high-skill labor can explain a large part of the increase in the skill premium. They use an aggregate model and directly postulate an exogenous aggregate production function. We revisit their analysis in the context of a disaggregated model where the aggregate production function emerges endogenously. We find that moving from the aggregate model to the disaggregated model reduces the MES between capital and high-skil labor from 0.67 to 0.66 and the MES between capital a low-skill labor from 1.05 to 0.93. These differences are enough to reduce by 20% the contribution of the decline in the relative price of equipment investment to the increase of the skill premium. These particular results notwithstanding, we stress that our goal in this application is more to demonstrate quantitatively the dependence of macroeconomic elasticities of substitution on microeconomic primitives than to draw strong implications for the question of whether and how much complementarities between capital and skilled labor can explain the behavior of the skill premium. 7. Extensions In Appendix A, we present a number of extensions. First, in Appendix A.1, we explain how to generalize our results to arbitrary economies beyond the nested-CES case with Hicks neutral technology shocks at the level of each nest. In Appendix A.3, we introduce alternative notions for the aggregate production and cost functions that separate final demand from technology and explain how to use our results to characterize these objects. In Appendix A.4, we explain how to generalize our results to economies where final demand is non-homothetic or where there are distortions. 8. Conclusion This paper is part of a broader agenda to bring macrotheory closer to microdata. As microdata becomes more plentiful, parsimonious reduced-form aggregate production functions look more antiquated. This paper takes a step toward realizing this goal by providing an organizing framework and some general characterizations of microfounded aggregate production functions. The difference in this approach, compared to the classical analysis, is that it takes advantage of the information in granular microdata and allows for data-intensive aggregation to arrive at the behavior of macroeconomic aggregates. We have several other papers in this agenda. In Baqaee and Farhi (2019), we focus on the nonlinear macroeconomic impact on aggregate output of microeconomic shocks in efficient models. In Baqaee and Farhi (2019b), we extend our analysis to inefficient economies. In Baqaee and Farhi (2018), we characterize how shocks affect nonaggregate outcomes, namely, how shocks propagate from one producer to another, and how microeconomic variables comove with one another in a production network, and we also show how to combine input–output production networks with heterogeneous agents and nonhomothetic final demand in closed economies. In Baqaee and Farhi (2019a), we show how to take these elements into account in trade models. However, we think we have only scratched the surface of what is left to be done. Extending the analysis to allow for, amongst other things, entry, exit, and tractable stochastic dynamics, are further steps that need to be taken before theory can start to approximate reality. We are actively pursuing these questions in ongoing research. Acknowledgements The editor in charge of this paper was Claudio Michelacci. Acknowledgments: This paper was written for the 2018 BBVA lecture of the European Economic Association. Farhi gratefully acknowledges the travel support from the BBVA Foundation, which allowed him to present the research at the ASSA meetings in Philadelphia and at a public lecture at FBBVA headquarters in Madrid. We thank Maria Voronina for excellent research assistance. We thank Natalie Bau and Elhanan Helpman for useful conversations. Appendix A: Extensions In this section, we present a number of extensions. First, in Section A.1, we explain how to generalize our results to arbitrary economies beyond the nested-CES case with Hicks neutral technology shocks at the level of each nest. In Section A.3, we introduce alternative notions for the aggregate production and cost functions that separate final demand from technology and explain how to use our results to characterize these objects. In Section A.4, we explain how to generalize our results to economies where final demand is non-homothetic or where there are distortions. A.1 Beyond CES In Sections 3 and 4, we confined our characterization of aggregate cost and production functions to the general class of nested-CES economies with Hicks-neutral technology shocks at the level of each nest. Now each producer i has a production function Fi(xi1, …, xiN, Li1, …, LiF, Ai) with associated cost function Ci(p1, …, pN, |$w_{1}$|⁠, …, |$w_{\mathit F}$|⁠, Ai). Generically, we can normalize the technology shock Ai so that d log F/d log Ai = d log C/d log Ai = 1 at the point of interest. We proceed in two successive steps. In the first step, we continue to assume that technology shocks are Hicks neutral at the producer level and explain how to deal with non-CES producers.28 We then explain how to extend our results using the input–output substitution operator, which is a generalization of the input–output covariance operator defined in equation (7). In the second step, we relax the assumption of Hicks-neutral technical change at the producer level and explain how to deal with completely general biased technical change at the producer level.29 Dealing with Non-CES Producers. We start with the case where technical change is Hicks neutral at the producer level. We introduce the input–output substitution operator. We show how it can be used to generalize the results of Sections 3 and 4. Definition (Micreconomic Allen–Uzawa Elasticities of Substitution). For a producer k, let θk(l, l′) denote the Allen–Uzawa elasticity of substitution in cost between inputs l ≠ l′ in the cost of producer k: $$ \begin{equation*} \theta _{k}(l,l^{\prime })=\frac{C_{k}d^{2}C_{k}/(dp_{l}dp_{l^{\prime }})}{(dC_{k}/dp_{l})(dC_{k}/dp_{l^{\prime }})}=\frac{\epsilon _{k}(l,l^{\prime })}{\Omega _{kl^{\prime }}}, \end{equation*}$$ where ϵk(l, l′) is the elasticity of the demand by producer k for input l with respect to the price |$p_{l^{\prime }}$| of input l′, and |$\Omega _{kl^{\prime }}$| is the expenditure share in cost of input l′. This definition applies to good inputs and factor inputs using the aforementioned notation pN+f = |$w_{\mathit f}$|⁠. Because of the symmetry of partial derivatives, we have θk(l, l′) = θk(l′, l). Because of the homogeneity of degree one of the cost function, we have the homogeneity identity |$\sum _{^{l^{\prime }\in N+F}}\Omega _{kl^{\prime }}\theta _k(l,l^{\prime })=0$|⁠. Following Baqaee and Farhi (2019), we introduce the input–output substitution operator for producer k: $$ \begin{align} \Phi _{k}\big(\Psi _{(i)},\Psi _{(j)}\big)&=-\sum _{^{(l,l^{\prime })\in (N+F)^{2}}}\Omega _{kl}\left[\delta _{ll^{\prime }}+\Omega _{kl^{\prime }}(\theta _k(l,l^{\prime })-1)\right]\Psi _{li}\Psi _{l^{\prime }j}, \end{align}$$ (A.1) $$ \begin{align} =\frac{1}{2}E_{\Omega ^{(k)}}\big((\theta _{k}(l,l^{\prime })-1)(\Psi _{i}(l)-\Psi _{i}(l^{\prime }))(\Psi _{j}(l)-\Psi _{j}(l^{\prime }))\big), \end{align}$$ (A.2) where |$\delta _{ll^{\prime }}$| is the Kronecker symbol, Ψi(l) = |$\Psi _{\mathit{li}}$|⁠, |$\Psi _{j}(l^{\prime })=\Psi _{l^{\prime }j}$|⁠, and the expectation in the second line is over l and l′. The second line can be obtained from the first using the symmetry of Allen–Uzawa elasticities and the homogeneity identity. When the production function of k is CES with elasticity of substitution θk, the cross Allen–Uzawa elasticities θk(l, l′) are identical θk(l, l′) = θk for l ≠ l′, the own Allen–Uzawa elasticities are given by θk(l, l) = −θk(1 − Ωkl)/Ωkl, and we recover $$ \begin{equation*} \Phi _k\big(\Psi _{(i)}, \Psi _{(j)}\big) = (\theta _k -1) \operatorname{Cov}_{\Omega ^{(k)}}\big(\Psi {}_{(j)},\Psi _{(i)}\big). \end{equation*}$$ Even outside of the CES case, the input–output substitution operator Φk(Ψ(i), Ψ(j)) shares many properties with a covariance operator. For example, it is immediate to verify that: Φk(Ψ(i), Ψ(j)) is bilinear in Ψ(i) and Ψ(j); Φk(Ψ(i), Ψ(j)) is symmetric in Ψ(i) and Ψ(j); Φk(Ψ(i), Ψ(j)) = 0 whenever Ψ(i) or Ψ(j) is constant. All of our results in Sections 3 and 4 can be generalized to non-nested-CES economies. All that is needed is to replace terms of the form |$(\theta _{k}-1) \operatorname{Cov}_{\Omega ^{(k)}}(\Psi {}_{(j)},\Psi _{(i)})$| by Φk(Ψ(i), Ψ(j)). For example, the result in Proposition 3 for the MES between factors in the aggregate cost function becomes $$ \begin{equation*} \sigma _{fg}^{C}-1=\sum _{k\in 1+N}\lambda _{k}\Phi _{k}\big(\Psi _{(g)},\Psi _{(g)}/\Lambda _{g}-\Psi _{(f)}/\Lambda _{f}\big). \end{equation*}$$ Just like in the nested-CES case, |$\sigma _{fg}^{C}$| is a weighted average of the microeconomic elasticities of substitution θk(l, l′) and is equal to θ if the microeconomic elasticities of substitution are all equal to θ. Intuitively, Φk(Ψ(i), Ψ(j)) captures the way in which k redirects expenditure share toward i in response to one percent change in the price of j. To see this, we make use of the following well-known result (see, e.g., Russell 2017): the elasticity of the expenditure share of producer k on input l with respect to the price of input l′ is given by |$\delta _{ll^{\prime }}+\Omega _{kl^{\prime }}(\theta _{k}(l,l^{\prime })-1)$|⁠.30 Equation (A.1) says that the way k redirects expenditure share toward i in response to unit proportional decrease in the price of j depends on considering, for each pair of inputs l and l′, how much the proportional decrease |$\Psi _{l^{\prime }j}$| in the price of l′ induced by the decline in the price of j causes k to increase its expenditure share on l (measured by |$-\Omega _{kl}[\delta _{ll^{\prime }}+\Omega _{kl^{\prime }}(\theta _{k}(l,l^{\prime })-1)\Psi _{l^{\prime }j}$|⁠), and on the exposure of l to i (measured by |$\Psi _{\mathit{li}}$|⁠). Equation (A.2) says that the way k redirects expenditure share toward i in shares in response to a decline in the price of j depends on considering, for each pair of inputs l and l′, whether or not increased exposure to j (measured by Ψj(l) − Ψj(l′)), is aligned with increased exposure to i (measured by Ψi(l) − Ψi(l′)), and whether l and l′ are complements or substitutes (measured by (θk(l, l′) − 1)). Dealing with Biased Technical Change at the Producer Level. We now further generalize the results of Sections 3 and 4 to the case where technical change is biased at the producer level. This generalization matters only for our results on the bias of technical change in the aggregate production and cost functions. It does not change anything to our results on the MESs between factors in the aggregate production and cost functions. Definition (Microeconomic Bias of Technical Change). For a producer k, we denote by |$b_{ijk}^C$| the microeconomic bias toward input i versus j of a technology shock to producer k, defined by the elasticity of the ratio of the cost shares of inputs i versus j of producer k with respect to a technology shock to producer k, holding input prices constant: $$ \begin{equation*} b_{ijk}^{C}=\frac{d\log (\Omega _{ki}/\Omega _{kj})}{d\log A_k}. \end{equation*}$$ Similarly, we denote by |$b_{ik}^C$| the bias toward input i of a technology shock to producer k, defined by elasticity of the cost share of input i of producer k, holding input prices constant: $$ \begin{equation*} b_{ik}^{C}=\frac{d\log \Omega _{ki}}{d\log A_k}=\sum _{j\in N} \Omega _{kj}b_{ijk}^C. \end{equation*}$$ Note that we have defined the bias of technical change toward inputs, be they factors or intermediate goods. These notions are purely microeconomic. They are defined at the level of an individual producer, and do not embed any general equilibrium whatsoever. In what follows, we will rely on the following properties: $$ \begin{equation*} \sum _{i\in N} \Omega _{ki} b_{ik}^C=0 \quad \mbox{and} \quad \sum _{i\in N} \sum _{j\in N} \Omega _{ki} \Omega _{kj}b_{ijk}^C=0. \end{equation*}$$ In the presence of biased technical change at the producer level, the only difference to the analysis in Section A.1 is that we now must include new forcing terms in the propagation equations (5) and (6) in Proposition 2 and (8) and (9) in Proposition 7. More precisely the following term must be added on the right-hand sides of equations (5) and (8) for d log λi: $$ \begin{equation*} \sum _{k\in 1+N}\sum _{j\in 1+N}\frac{\lambda _k}{\lambda _i} \operatorname{Cov}_{\Omega ^{(k)}}\left(b_{(k)}^C, \Psi _{(i)}\right) d\log A_k, \end{equation*}$$ and the following term must be added on the right-hand sides of equations (6) and (9) for d log Λf: $$ \begin{equation*} \sum _{k\in 1+N}\sum _{j\in 1+N} \frac{\lambda _k}{\Lambda _f} \operatorname{Cov}_{\Omega ^{(k)}}\left(b_{(k)}^C, \Psi _{(f)}\right) d\log A_k, \end{equation*}$$ where |$b_{(k)}^C$| is the kth column of |$b_{jk}^C$|⁠. These terms account for the direct and indirect effects through the network of the changes in input expenditures of each producer k due to the bias of its technical change d log Ak. Of course, these new terms do not appear in the results on the MESs between factors in the aggregate production and cost functions that do not depend on the nature of technical change. But they do enter the results on the bias of technical change in the aggregate production and cost functions. For example, the result in Proposition 5 for the bias of technical change in the aggregate cost function becomes $$ \begin{align*} B_{fgj}^{C}&=\lambda _j \operatorname{Cov}_{\Omega ^{(j)}}\left(b_{(j)}^C, \Psi _{(f)}/\Lambda _f-\Psi _{(g)}/\Lambda _g\right) \\ &\quad {}+\sum _{k\in 1+N}\lambda _{k}\Phi _k\left(\Psi _{(j)},\Psi _{(f)}/\Lambda _f-\Psi _{(g)}/\Lambda _g\right). \end{align*}$$ A.2 Hybrid Model with Infinitely Elastic and Inelastic Factors We assume that some of the factors F∞ are in infinitely elastic supply and that the others F0 are in fixed supply, with F = F∞ ∪ F0. We then have $$ \begin{align*} {d}\log \lambda _{i}&=\!\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\lambda _{i}}\operatorname{Cov}_{\Omega ^{(k)}}\!\!\left(\sum _{j\in N}\Psi _{(j)}d\log A_{j}\!+\!\sum _{g\in F}\Psi _{(g)}d\log L_{g},\Psi _{(i)}\right)\\ -\!\!\!\!\!\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\Lambda _{f}}\operatorname{Cov}_{\Omega ^{(k)}}\!\left(\sum _{h\in F_0} \Psi _{(h)}d\log \Lambda _{h} \!-\!\sum _{h\in F_{\infty }}\Psi _{(h)}{d}\log Y,\Psi _{(i)}\right)\!, \end{align*}$$ where the elasticities of output and of factor shares solve the following system of linear equations $$ \begin{align*} {d}\log Y&=\frac{1}{\sum _{f\in F_0} \Lambda _f} \Bigg [ \sum \sum _{j\in N}\lambda _j {d}\log A_j -\sum _{g\in {F_\infty }} \Lambda _g {d}\log w_g \\ &\quad {} +\sum _{g\in F_0}\Lambda _g {d}\log L_g -\sum _{g\in F_0}\Lambda _d {d}\log \Lambda _g \Bigg ], \end{align*}$$ $$ \begin{eqnarray*} &&{d}\log \Lambda _{f}=\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\Lambda _{f}}\operatorname{Cov}_{\Omega ^{(k)}}\left(\sum _{j\in N}\Psi _{(j)}d\log A_{j} +\sum _{g\in F}\Psi _{(g)}d\log L_{g},\Psi _{(f)}\right) \nonumber\\ && -\, \!\!\!\!\!\sum _{k\in 1+N}(\theta _{k}-1)\frac{\lambda _{k}}{\Lambda _{f}}\operatorname{Cov}_{\Omega ^{(k)}}\left(\sum _{h\in F_0} \Psi _{(h)}d\log \Lambda _{h} -\sum _{h\in F_{\infty }}\Psi _{(h)}{d}\log Y,\Psi _{(f)}\right)\!. \end{eqnarray*}$$ A.3 Separating Technology from Final Demand As mentioned before, in this paper, we have put ourselves under conditions where the existence of an aggregate output good can be taken for granted because final demand is homothetic. Given our definitions, the properties of the aggregate production and cost functions depend on final demand. It is possible to define alternative notions that do not depend on final demand: the aggregate distance function and its associated aggregate cost function. These alternative notions must then be combined with final demand to derive general-equilibrium comparative static results. Our definitions intentionally take these two steps at once. We take this route because we want to put the focus on general-equilibrium comparative statics. However, our results can also be used to characterize the properties of aggregate distance and cost functions. This is because we have full flexibility in specifying final demand, meaning that the aggregate distance function and its associated cost function can be analyzed by specifying final demand to be Leontief (i.e., setting the elasticity of substitution between final goods to be zero). To see this, we first define the aggregate distance function and the associated cost function. Define the aggregate technology set $$ \begin{equation*} \Gamma (A)=\lbrace (L,c) : L { can produce } c\rbrace , \end{equation*}$$ where L is a vector of factors, c is a vector of final consumptions, and A is a vector of productivities. The aggregate distance function is defined as follows: $$ \begin{equation*} D(L,A,c) = \max _{\lbrace \delta : (L/\delta ,c)\in \Gamma (A) \rbrace }\delta , \end{equation*}$$ so that (L, c) ∈ Γ(A) if and only if D(L, A, c) ≥ 1. Similarly, and with some abuse of notation, we can define the aggregate cost function $$ \begin{equation*} C(w,c,A)=\min _{\lbrace L: (L,c)\in \Gamma (A) \rbrace } wL. \end{equation*}$$ Whenever an output good |$\mathcal {D}_0(c)$| exists because final demand is homothetic, the aggregate production function can be recovered from the aggregate distance function using $$ \begin{equation*} \frac{F(L,A)}{Y}=\max _{\lbrace c : \mathcal {D}_0(c)\ge Y\rbrace } D(L,A,c). \end{equation*}$$ Going in the other direction is also possible. To capture an aggregate distance function via an aggregate production function, for a given vector of final output c, suppose that final demand has a Leontief form |$\mathcal {D}_0=\mathcal {D}^c_0$|⁠, where |$\mathcal {D}^c_0(\hat{c})=\min \lbrace \hat{c}_1/c_1,\ldots ,\hat{c}_N/c_N \rbrace$|⁠, and then compute the associated aggregate production function Fc to recover $$ \begin{equation*} D(L,A,c)=\frac{F^c(L,A)}{\mathcal {D}^c_0(c)}=F^c(L,A). \end{equation*}$$ Basically, the aggregate distance function is obtained by removing the possibility of substitution in final demand. Similar considerations apply to the relationship between the aggregate cost functions C(⁠|$ w $|⁠, Y, A) and C(⁠|$ w $|⁠, c, A). Therefore, and although we do not present them in this form, our results can be used to fully characterize the aggregate distance function D(L, A, c) and the aggregate cost function C(⁠|$ w $|⁠, c, A). A.4 Non-Homothetic Final Demand and Distortions One advantage of the alternative notions defined in Section A.3 is that they can be applied to economies that do not possess an “aggregate output” good. However, as already mentioned, these notions must then be combined with final demand to derive general-equilibrium comparative statics. In Baqaee and Farhi (2018), we develop an alternative approach based on “propagation equations” that maintains the focus on general-equilibrium comparative static results. These equations generalize equations (5) and (6) in Proposition 2 and equations (8) and (9) in Proposition 7. These equations allow us to characterize the first- and second-order properties of real GDP when it is defined as a Divisa index (and does not correspond to any physical quantity) and hence to compute macroeconomic elasticities of substitution between factors and the macroeconomic bias of technical change along the same lines as in this paper. This methodology generalizes our results to environments with non-homothetic final demand. It also has the advantage of allowing us to also deal with economies with distortions in the same unified framework. Appendix B: Quantitative Application in Section 6.5 We show how to use these results to study capital-skill complementarity à la Griliches (1969) in the US economy, taking into account the multiplicity of sectors and their input–output linkages. We use the analysis to revisit the influential analysis in Krusell et al. (2000) of the role of these complementarities in the evolution over time of the skill premium. Krusell et al. (2000) studies the relationship between the increasing skill premium and the rapid decline in the relative price of equipment investment goods. They find that complementarity between capital goods and high-skill labor can explain a large part of the increase in the skill premium. They use an aggregate model and directly postulate an exogenous aggregate production function. We revisit their analysis in the context of a disaggregated model where the aggregate production function emerges endogenously. We find that moving from the aggregate model to the disaggregated model reduces the MES between capital and high-skil labor from 0.67 to 0.66 and the MES between capital a low-skill labor from 1.05 to 0.93. These differences are enough to reduce by 20% the contribution of the decline in the relative price of equipment investment to the increase of the skill premium. These particular results notwithstanding, we stress that our goal in this application is more to demonstrate quantitatively the dependence of macroeconomic elasticities of substitution on microeconomic primitives than to draw strong implications for the question of whether and how much complementarities between capital and skilled labor can explain the behavior of the skill premium. Krusell et al. (2000) work with a nested CES aggregate production function of the form $$ \begin{equation*} \frac{Y}{\bar{Y}}=\left(\omega _{L} \left(\frac{X}{\bar{X}}\right)^{\frac{\theta _2-1}{\theta _2}} + (1-\omega _L) \left(\frac{L}{\bar{L}}\right)^{\frac{\theta _2-1}{\theta _2}}\right)^{\frac{\theta _2}{\theta _2-1}}, \end{equation*}$$ with $$ \begin{equation*} \frac{X}{\bar{X}}=\left(\omega _{H}\left( \frac{{H}}{\bar{{H}}}\right)^{\frac{\theta _{1}-1}{\theta _1}}+\omega _{K}\left(\frac{K}{\bar{K}}\right)^{\frac{\theta _1 -1}{\theta _{1}}}\right)^{\frac{\theta _{1}}{\theta _{1}-1}}. \end{equation*}$$ Here L represents low-skill labor, H high-skill labor, and K is capital. They estimate the model on macrodata using the first-order conditions of the model and find an elasticity of substitution between skilled labor and capital to be θ1 = 0.67 and an elasticity of substitution between unskilled labor and the composite factor X to be θ2 = 1.67. They argue that these estimates are plausible by noting that they lie in the middle of the range of estimates in the microeconomic literature reported by Johnson (1997) and Hamermesh (1996). They then use their estimated model to perform a counterfactual and conclude that the decrease in the price of equipment investment goods can explain a large part of the increase in the skill premium over time. We revisit their exercise in a calibrated disaggregated model with 66 sectors and input–output linkages. We consider a model with 5 distinct microelasticities of substitution (θ0, θ1, θ2, θ3, θ4). The parameter θ0 is the elasticity of substitution across industries in consumption, θ1 is the elasticity of substitution between high-skilled labor and capital, θ2 is the elasticity of substitution between low-skilled labor and a composite factor consisting of high skilled labor and capital, θ3 is the elasticity of substitution across value-added and intermediate inputs, and θ4 is the elasticity of substitution across intermediate inputs. We consider a benchmark calibration with values for the elasticities (θ0, θ1, θ2, θ3, θ4) = (0.9, 0.67, 1.67, 0.5, 0.0001) informed by estimates from the microeconomic literature, which as we shall see, points to strong complementarities at the sectoral level (θ4 = 0.0001) and between value added and intermediates (θ3 = 0.5). For the elasticities of substitution between skilled labor and capital and between unskilled labor and the composite factor consisting of high-skilled labor and capital and skilled labor, we pick θ1 = 0.67 and θ2 = 1.67, which we are consistent with the microeconomic literature surveyed in reported by Johnson (1997) and Hamermesh (1996). We set the elasticity of substitution in consumption θ0 = 0.9, following Atalay (2017), Herrendorf, Rogerson, and Valentinyi (2013), and Oberfield and Raval (2014), all of whom use an elasticity of substitution in consumption (across industries) of slightly less than one. For the elasticity of substitution across value-added and intermediate inputs, we set θ3 = 0.5. This accords with the estimates of Atalay (2017), who estimates this parameter to be between 0.4 and 0.8, as well as Boehm, Flaaen, and Pandalai-Nayar (2019), who estimate this elasticity to be close to zero. Finally, we set the elasticity of substitution across intermediate inputs to be θ4 = 0.001, which matches the estimates of Atalay (2017). The aggregate production function arising endogenously from our model is different from the aggregate production function postulated by Krusell et al. (2000). In the aggregate model of Krusell et al. (2000), macroeconomic elasticities of substitution between factors coincide with their microeconomic counterparts. As a result, they can be estimated using either microeconomic or macroeconomic data. Their particular choices come from a macroeconomic estimation with macroeconomic data using the first-order conditions of their aggregate model. They find that these estimates are consistent with microeconomic estimates in the microeconomic literature reported by Johnson (1997) and Hamermesh (1996). By contrast, in our disaggregated model, the macroeconomic elasticities of substitution between factors are different from their microeconomic counterparts. We therefore rely entirely on microeconomic estimates to justify our choices the microeconomic elasticities of substitution θ1 and θ2. If one takes the view that the economy is described by our model, then their macroeconomic model is mis-specified. Our justification for choosing the same values as them for these elasticities is only that these values are consistent with estimates from the microeconomic literature, and not because they come out of the estimation of their possibly mis-specified macroeconomic model with macroeconomic data. We use input–output tables and the integrated industry-level production account (KLEMS) data from the BEA. These two datasets report, at the industry level, the expenditures of each industry on different types of inputs (supplied from other industries), as well as the compensation of college (high-skill) employees and noncollege (low-skill) employees. We attribute the remainder of each industry’s value-added net of compensation of employees to the industry’s capital stock. Although we calibrate the model with industry-level data, the general methodology can accommodate as much data as available to the researcher. In particular, going from an industry-level model, to a firm or product level model is, conceptually, a very easy step given the generality of Propositions 2 and 7. In Table B.1, we show the MES between the three primary factors for the aggregate model of Krusell et al. (2000) using both the production function (quantity elasticities) and the cost function (price elasticities). Since the aggregate model has a nested CES production function, the MES between capital and college labor in the aggregate production function and in the aggregate cost function are just equal to the corresponding microelasticity θ1 = 0.67. Similarly, the MESs between either capital or college labor and non-college labor are just equal to the corresponding microelasticity θ2 = 1.67. Table B.1. MESs between factors in the aggregate production function and in the aggregate cost function for the aggregate model. Production function Cost function Capital Noncollege College Capital Noncollege College Capital – 1.67 0.67 Capital – 1.67 0.67 Noncollege 1.05 – 0.88 Noncollege 1.28 – 1.07 College 0.67 1.67 – College 0.67 1.67 – Production function Cost function Capital Noncollege College Capital Noncollege College Capital – 1.67 0.67 Capital – 1.67 0.67 Noncollege 1.05 – 0.88 Noncollege 1.28 – 1.07 College 0.67 1.67 – College 0.67 1.67 – Open in new tab Table B.1. MESs between factors in the aggregate production function and in the aggregate cost function for the aggregate model. Production function Cost function Capital Noncollege College Capital Noncollege College Capital – 1.67 0.67 Capital – 1.67 0.67 Noncollege 1.05 – 0.88 Noncollege 1.28 – 1.07 College 0.67 1.67 – College 0.67 1.67 – Production function Cost function Capital Noncollege College Capital Noncollege College Capital – 1.67 0.67 Capital – 1.67 0.67 Noncollege 1.05 – 0.88 Noncollege 1.28 – 1.07 College 0.67 1.67 – College 0.67 1.67 – Open in new tab In Table B.2, we show the same macroeconomic elasticities of substitution (for both quantities and prices) using the disaggregated model. In the disaggregated model, MESs between factors depend not only on their microeconomic counterparts θ1 and θ2 as in the aggregate model, but also on the other microeconomic elasticities of substitution θ0, θ3, and θ4. Note that the values of |$\sigma ^C_{HK}$| and |$\sigma ^F_{HK}$| end up being very similar to those used by Krusell et al. (2000) (respectively, 0.64 vs. 0.67 and 0.67 vs. 0.67) : this is because some of the microeconomic elasticities are higher (θ0 = 0.9 and θ2 = 1.67) and some are lower (θ3 = 0.5 and θ4 = 0.0001), and their corresponding effects basically cancel out. The difference is bigger in the case of |$\sigma ^C_{LK}$| and |$\sigma ^F_{LK}$| (respectively, 1.09 vs. 1.28 and 0.94 vs. 1.05) and the disaggregated model has lower values for these elasticities in part simply because in this case the benchmark microeconomic elasticity is higher to begin with. Table B.2. MESs between factors in the aggregate production function and in the aggregate cost function for the disaggregated model. Production function Cost function Capital Noncollege College Capital Noncollege College Capital – 1.43 0.70 Capital – 1.47 0.72 Noncollege 0.94 – 0.94 Noncollege 1.09 – 1.09 College 0.67 1.59 – College 0.64 1.54 – Production function Cost function Capital Noncollege College Capital Noncollege College Capital – 1.43 0.70 Capital – 1.47 0.72 Noncollege 0.94 – 0.94 Noncollege 1.09 – 1.09 College 0.67 1.59 – College 0.64 1.54 – Open in new tab Table B.2. MESs between factors in the aggregate production function and in the aggregate cost function for the disaggregated model. Production function Cost function Capital Noncollege College Capital Noncollege College Capital – 1.43 0.70 Capital – 1.47 0.72 Noncollege 0.94 – 0.94 Noncollege 1.09 – 1.09 College 0.67 1.59 – College 0.64 1.54 – Production function Cost function Capital Noncollege College Capital Noncollege College Capital – 1.43 0.70 Capital – 1.47 0.72 Noncollege 0.94 – 0.94 Noncollege 1.09 – 1.09 College 0.67 1.59 – College 0.64 1.54 – Open in new tab To demonstrate this dependence in more detail, we show how the MES between capital and skilled labor changes in response to changes in the underlying microeconomic elasticities of substitution in Table B.3, and how the MES between capital and skilled labor changes in response to changes in the underlying microeconomic elasticities of substitution in Table B.4. In both tables, since the MESs in the aggregate cost function are weighted averages of the underlying microelasticities of substitution, the sum of the derivatives along this column add up to 1. The same is not true of the MESs in the aggregate production function. Table B.3. Derivatives of the MES between high-skilled labor and capital w.r.t. microelasticities of substitution. Aggregate model Disaggregated model Cost Production Cost Production σHK 0.67 0.67 0.64 0.66 Consumption: θ0 – – 0.13 0.12 VA versus INT: θ3 – – 0.1 0.09 INT: θ4 – – 0.09 0.08 VA outer nest: θ2 0 0 0.01 0.05 VA inner nest: θ1 1 1 0.67 0.65 Aggregate model Disaggregated model Cost Production Cost Production σHK 0.67 0.67 0.64 0.66 Consumption: θ0 – – 0.13 0.12 VA versus INT: θ3 – – 0.1 0.09 INT: θ4 – – 0.09 0.08 VA outer nest: θ2 0 0 0.01 0.05 VA inner nest: θ1 1 1 0.67 0.65 Open in new tab Table B.3. Derivatives of the MES between high-skilled labor and capital w.r.t. microelasticities of substitution. Aggregate model Disaggregated model Cost Production Cost Production σHK 0.67 0.67 0.64 0.66 Consumption: θ0 – – 0.13 0.12 VA versus INT: θ3 – – 0.1 0.09 INT: θ4 – – 0.09 0.08 VA outer nest: θ2 0 0 0.01 0.05 VA inner nest: θ1 1 1 0.67 0.65 Aggregate model Disaggregated model Cost Production Cost Production σHK 0.67 0.67 0.64 0.66 Consumption: θ0 – – 0.13 0.12 VA versus INT: θ3 – – 0.1 0.09 INT: θ4 – – 0.09 0.08 VA outer nest: θ2 0 0 0.01 0.05 VA inner nest: θ1 1 1 0.67 0.65 Open in new tab Table B.4. Derivatives of the MES between low-skilled labor and capital w.r.t. microelasticities of substitution. Aggregate model Disaggregated model Cost Production Cost Production σLK 1.28 1.05 1.09 0.93 Consumption: θ0 – – 0.11 0.14 VA versus INT: θ3 – – 0.09 0.11 INT: θ4 – – 0.1 0.07 VA outer nest: θ2 0.4 0.97 0.26 0.57 VA inner nest: θ1 0.6 0.24 0.46 0.22 Aggregate model Disaggregated model Cost Production Cost Production σLK 1.28 1.05 1.09 0.93 Consumption: θ0 – – 0.11 0.14 VA versus INT: θ3 – – 0.09 0.11 INT: θ4 – – 0.1 0.07 VA outer nest: θ2 0.4 0.97 0.26 0.57 VA inner nest: θ1 0.6 0.24 0.46 0.22 Open in new tab Table B.4. Derivatives of the MES between low-skilled labor and capital w.r.t. microelasticities of substitution. Aggregate model Disaggregated model Cost Production Cost Production σLK 1.28 1.05 1.09 0.93 Consumption: θ0 – – 0.11 0.14 VA versus INT: θ3 – – 0.09 0.11 INT: θ4 – – 0.1 0.07 VA outer nest: θ2 0.4 0.97 0.26 0.57 VA inner nest: θ1 0.6 0.24 0.46 0.22 Aggregate model Disaggregated model Cost Production Cost Production σLK 1.28 1.05 1.09 0.93 Consumption: θ0 – – 0.11 0.14 VA versus INT: θ3 – – 0.09 0.11 INT: θ4 – – 0.1 0.07 VA outer nest: θ2 0.4 0.97 0.26 0.57 VA inner nest: θ1 0.6 0.24 0.46 0.22 Open in new tab We can use our estimates for the values of the macroelasticities of substitution to revisit the question posed by Krusell et al. (2000) and assess by how much the growth in capital due to the decline in the relative price of equipment has contributed to the widening skill premium. Following them, we ask how factor income shares would change if growth in capital had not accelerated in the mid-1970s. Absent this acceleration in the growth rate, the capital stock in 2015 would be lower. Specifically, maintaining the growth in the capital stock at its 1970s growth rate would be tantamount to a shock of size |${d}\,\log K = -0.37$| to today’s capital stock. In Table B.5, we show the implied change in factor income shares in response to such a shock, for both the aggregate and disaggregated models. In the aggregate counterfactual economy, the capital share would be higher by about 5%, whereas the noncollege share of income would be 7% higher, and the college share of income would be 13% lower. On the other hand, in the disaggregated counterfactual economy, the capital share would be 6% higher, whereas the noncollege share of income would be 4% higher, and the college share would be 13% lower. Hence, viewed through the lens of the disaggregated economy, and compared to the aggregate economy, the accelerated growth in capital has weighed down the capital share more, increased the college share less, and weighed down the noncollege share less (the last effect has been cut by more than half). Table B.5. The (log point) change in factor income shares in response to the shock |${d}\,\log K = -0.37$| in the aggregated and disaggregated model. Capital Noncollege College Aggregate model 0.05 0.07 −0.13 Disaggregated model 0.06 0.04 −0.13 Capital Noncollege College Aggregate model 0.05 0.07 −0.13 Disaggregated model 0.06 0.04 −0.13 Open in new tab Table B.5. The (log point) change in factor income shares in response to the shock |${d}\,\log K = -0.37$| in the aggregated and disaggregated model. Capital Noncollege College Aggregate model 0.05 0.07 −0.13 Disaggregated model 0.06 0.04 −0.13 Capital Noncollege College Aggregate model 0.05 0.07 −0.13 Disaggregated model 0.06 0.04 −0.13 Open in new tab The change in the skill premium, defined as the change in the relative income of college and noncollege labor, can also be estimated using the numbers in Table B.1 and B.2. Specifically, we can write $$ \begin{equation*} \left(\frac{1}{\sigma _{HK}} - \frac{1}{\sigma _{LK}}\right)\times {d}\log K =\left(\frac{{d}\log \Lambda _{H}}{{d}\log K} - \frac{{d}\log \Lambda _{L}}{{d}\log K}\right)\times {d}\log K, \end{equation*}$$ putting the change in skill premium in the aggregate model at −0.20 log points and −0.16 log points in the disaggregated model. The implication is that the disaggregated model reduces by 20% the increase in the skill premium that can be ascribed to the channel running through capital-skill complementarity and the decrease in the relative price of equipment identified by Krusell et al. (2000). This is of course by no means the final word on this question. The broad lesson is that disaggregating matters for some questions and less for others. There is a priori no way to know without doing the analysis. The point we wish to demonstrate is how easy it is to port prior analysis to more realistic production structures through a mechanical application of Propositions 1 and 6—the only limit is the availability of data. Footnotes 1. More precisely, this variant can be written as $$ \begin{equation*} \frac{Y}{\bar{Y}}=\frac{A}{\bar{A}} \left(\omega _K \left(\frac{A_K K}{\bar{A}_K\bar{K}}\right)^{\frac{\sigma -1}{\sigma }} + \omega _L \left(\frac{A_L L}{\bar{A}_L\bar{L}}\right)^{\frac{\sigma -1}{\sigma }}\right)^{\frac{\sigma }{\sigma -1}}, \end{equation*}$$ where bars denote values of output, factors, and productivity shifters, at a specific point, ωL and ωK = 1 − ωL are the labor and capital shares at that point, and σ is the elasticity of substitution between capital and labor. The popular Cobb–Douglas specification obtains in the limit σ → 1. 2. See Cohen and Harcourt (2003) for a retrospective account of the controversy. 3. For a good review, see Felipe and Fisher (2003). 4. The popular specification Y = AF(AKK, ALL) with F a CES function described in details in footnote 1 can be entirely calibrated using the labor share ωL and the elasticity of substitution between capital and labor σ, or even with only the labor share ωL under the common Cobb–Douglas restriction. 5. Since we do not place any restrictions on the number of factors the economy has, we can recreate the famous counterexamples from the Cambridge capital controversy in our environment. In other words, despite having an aggregate production function, our framework can accommodate the classic Cambridge UK critiques. We show exactly how in Section 5. 6. We explain later how to generalize our results regarding macroeconomic elasticities of substitution and the macroeconomic bias of technical change to environments with nonhomothetic final demand and with distortions. 7. This was an observation made by McKenzie (1959). See Section 6.3 for a concrete example in the model of Houthakker (1955). 8. Note that this flexibility also allows us to capture different renditions of assignment/sorting models (see Sattinger 1993 for a survey). In these models: there are distributions of workers and tasks of different types; output in a given task depends on both the type of the worker and the type of the task; the output of different tasks can be complements or substitutes; and tasks and workers may operate under decreasing returns to scale, limiting, for example, the number of workers per task. These models can be captured within our framework by treating different types of workers as different factors, different tasks as different producers, and allowing for producer-specific fixed factor to capture decreasing returns at the task level. 9. Our formulas can also in principle be applied, with increasing-returns to scale, but only to some limited extent, by allowing these quasi-fixed factors to be local “bads” receiving negative payments over some range, but care must be taken because it introduces nonconvexities in the cost-minimization over variable inputs, and our formulas only apply when variable input demand changes smoothly. 10. We generalize our results to factor-biased but not necessarily factor-augmenting technical change at the producer level in Section 7. 11. To satisfactorily capture such features, one probably needs to go beyond the nested-CES case on which we focus for a large part of the paper, and use instead the nonparametric generalization to arbitrary economies provided in Section 7. 12. Stern (2010) points out that although the MESs in cost do characterize the inverse-curvature of the corresponding constant-output isoquants, those in production do not. In the production function case, he defines the symmetric elasticity of complementarity to be the inverse-curvature of the constant-output isoquants, and shows that its inverse, which measures the curvature of the constant-output isoquants, is symmetric and can easily be recovered as a share-weighted harmonic average |$(\Lambda _{f}+\Lambda _{g})/(\Lambda _{f}/\sigma _{fg}^{F}+\Lambda _{g}/\sigma _{gf}^{F})$| of the MESs in production. This concept is the dual of the shadow elasticity of substitution put forth by McFadden (1963), which is symmetric and which can be recovered as share-weighted arithmetic average |$(\Lambda _{f}\sigma _{fg}^{C}+\Lambda _{g}\sigma _{gf}^{C})/(\Lambda _{f}+\Lambda _{g})$| of the MESs in costs. We will focus on characterizing MESs for the aggregate production and cost functions. 13. We can use these measures to compute a measure of bias of technical change toward one factor instead of toward one factor versus another by defining $$ \begin{align*} B^F_{fj}&=\sum _{g\in F} \frac{d\log F}{d\log L_g} B^F_{fgj}= \sum _{g\in F} \Lambda _g B^F_{fgj}=\frac{d \log \Lambda _f }{d\log A_j}=\frac{d \log w_f }{d\log A_j}-\lambda _j\\ \mbox{and}\qquad B^C_{fj}&=\sum _{g\in F} \frac{d\log C}{d\log w_g} B^C_{fgj}= \sum _{g\in F} \Lambda _g B^C_{fgj}=\frac{d \log \Lambda _f }{d\log A_j}=\frac{d \log L_f }{d\log A_j}-\lambda _j, \end{align*}$$ holding, respectively, factor supplies or factor prices constant. 14. Changes in A0 are changes in how each unit of final output affects consumer welfare. This is what Hulten and Nakamura (2017) call “output-saving” technical change. 15. We assume throughout that all microeconomic elasticities of substitution are finite. Some economic models assume that some of these elasticities are infinite. This implies that substitution is not smooth at the producer level, and raises a number of technical issues having to do with varying patterns of partial or full specialization. Our approach can be used to shed light on these models by viewing them as limiting cases of perhaps models with large but finite elasticities. 16. A historical reason for the focus of the controversy on the aggregation of capital as opposed to labor was the view held by the participants there was a natural physical unit in which to measure labor, man-hours. This view rests on the debatable assumption that different forms of labor, such as skilled labor and unskilled labor for example are perfect substitutes. Another historical reason was that some participants in the controversy took the view that labor could be reallocated efficiently across production units in response to shocks whereas capital was stuck in the short run, which they thought made the aggregation of capital more problematic. From the perspective of this paper, the aggregation problem for capital is not meaningfully different from that of labor. In general, outside of knife-edge cases, factors that are not perfectly substitutable or which cannot be reallocated cannot be aggregated. 17. See, for example, Pasinetti et al. (2003). 18. The results also extend the “hybrid” case of an economy where some factors are in inelastic supply and some factors are in perfectly elastic supply, as in the steady state of a Ramsey model. 19. A loglinear approximation of the aggregate cost function is trivially separable in every partition, and is a first-order approximation. By Proposition 1, the log-linear approximation sets the elasticity of the aggregate cost function with respect to the wage of each factor equal to the revenue share of that factor. 20. In this case, we can write $$ \begin{equation*} F(L_1, \ldots , L_N, A_1, \ldots , A_N)=\tilde{F}\left(F_{r}\left(\lbrace L_{f}\rbrace _{f \in M}, \lbrace A_i\rbrace _{i \in I}\right) , \lbrace A_{i}\rbrace _{i \notin I}, \lbrace L_i\rbrace _{i\notin M}\right). \end{equation*}$$ 21. Marglin (1984) shows that reswitching cannot occur with smooth substitution in the sense that the composition of the basket of inputs used to produce each good cannot be the same for all goods for two different interest rates. One lesson of the original example is lost: that for no ranking of the two techniques in terms of “mechanization” or capital intensity is the economy necessarily becoming more mechanized or capital intensive as the interest rate decreases. Nonetheless, the lesson that the capital–labor and capital–output ratios are nonmonotonic functions of the interest rate survives. 22. Relatedly, consider a model with several capital stocks Kf and labor L. Each of the capital stocks depreciates from one period to the next with depreciation rate |$\delta _{K_f}$| can be augmented by investing the final good with productivity |$1/p_{K_f}$|⁠. The user cost of each capital stock is therefore |$p_f(r+\delta _{K_f})$|⁠. To capture a steady state, we assume that the different capital stocks are in infinitely elastic supply at these user costs, but that labor is in fixed supply. Appendix A.2 generalizes our equations to such a hybrid case with both factors in infinitely elastic supply and factors in fixed supply. We can compute the financial value of the aggregate capital stock |$K=\sum _f p_{K_f}K_f$| in the model. We have $$ \begin{multline*} \frac{{d}\log \left( \frac{ \sum _f p_{K_f} K_f}{Y} \right)}{{d}\log r}=-\left(\sum _{f}\frac{p_{K_f}K_{f}}{\sum _g p_{K_g} K_g}\frac{r}{r+\delta _{K_f}}\right) \sum _{f}\frac{p_{K_f}K_{f}\frac{r}{r+\delta _{K_f}}}{\sum _{g}p_{K_g}K_{g}\frac{r}{r+\delta _{K_g}}} \\ \quad\quad\times {}\left(1+\sum _{j}\lambda _{j}(\theta _{j}-1)\operatorname{Cov}_{\Omega ^{(j)}}\left(\sum _{g}\Lambda _{K_g}\frac{r+\delta _{K_f}}{r+\delta _{K_g}}\left(\frac{\Psi _{(K_g)}}{\Lambda _{K_g}}-\frac{\Psi _{(L)}}{\Lambda _{L}}\right),\frac{\Psi _{(K_f)}}{\Lambda _{K_f}}\right)\right). \end{multline*}$$ Knowledge of the aggregate capital stock |$K=\sum _f p_{K_f}K_f$| is not enough to infer the elasticity of the capital output ratio K/Y to the interest rate r. Knowledge of the values of all of the disaggregated capital stocks is required. Aggregation via financial valuations fails. 23. Sato (1975) only considered the case with two producers. 24. Levhari (1968) generalizes Houthakker (1955) by deriving distributions of technical requirements across producers for which the aggregate production function is CES rather than simply Cobb Douglas. Sato (1969) in turn generalizes Levhari (1968) by allowing for microeconomic production to be CES rather than simply Leontief. All these models are particular cases of our general model. 25. We can compute this as a function of the aggregate production function using $$ \begin{equation*} \frac{d\log w_{L}}{d\log A_{kK}}=\frac{d\log F}{d\log A_{kK}}+\frac{d^{2}\log F}{d\log A_{kK}d\log L}/\frac{d\log F}{d\log L}=\lambda _k\omega _{kK}+\frac{d\log \Lambda _{L}}{d\log A_{kK}}. \end{equation*}$$ 26. In the general case where θD ≠ 1, we have $$ \begin{multline*} \frac{d\log w_{L}}{d\log A_{kK}}= \lambda _{k}\omega _{kK}\frac{1+(\theta _{KL}-1)\sum _{i}\lambda _{i}\left(\frac{\omega _{iL}}{\Lambda _{L}}-\frac{\omega _{kL}}{\Lambda _{L}}\right)\frac{\omega _{iK}}{\Lambda _{K}}+(\theta _D-1)\left[ \frac{1}{\Lambda _K\Lambda _L}\operatorname{Var}_{\lambda }\big(\omega _{(L)}\big)+\frac{\omega _{kL}}{\Lambda _L}-1\right]}{1+(\theta _{KL}-1)\sum _{i=1}^N\lambda _{i}\frac{\omega _{iL}}{\Lambda _{L}}\frac{\omega _{iK}}{\Lambda _{K}}+(\theta _D-1)\frac{1}{\Lambda _L\Lambda _K}\operatorname{Var}_{\lambda }\big(\omega _{(L)}\big)}. \end{multline*}$$ 27. Indeed, suppose, for example, that there is a single task so that λk = 1. We get ωkL = ΛL and ωkK = Λk. This implies that |$B^F_{KLkL}=\theta _{KL}-1$| and d log wL/d log AkK = Λk/θKL > 0. The result is true more generally. 28. That technology shocks are Hicks neutral at the producer level means that we can write, with some abuse of notation, Fi(xi1, …, xiN, Li1, …, LiF, Ai) = AiFi(xi1, …, xiN, Li1, …, LiF) and |$C_{i}(p_{1},\ldots ,p_{N},w_{1},\ldots ,w_{F},A_i)=A_i^{-1}C_{i}(p_{1},\ldots ,p_{N},w_{1},\ldots ,w_{F})$| 29. In contrast to our analysis of the nested-CES case, in this section, we purposefully eschew any relabeling of the network via the disaggregation of a producer into an network of producers (an island) or via the introduction of fictitious producers. Factor-biased technical change is modeled directly at the original producer level, where it is not necessarily factor-augmenting. 30. This property is the reason we choose to use Allen–Uzawa elasticities at the producer level: because they easily give the elasticities of cost expenditure shares with respect to input prices. Morishima elasticities |$\sigma ^C_k(l,l^{\prime })=\epsilon _{k}(l,l^{\prime })-\epsilon _{k}(l^{\prime },l^{\prime })$| are better suited instead to give the elasticities of relative cost expenditure shares. Of course, the two concepts are related and so we could have used either one. References Acemoglu Daron Restrepo Pascual ( 2019 ). “ Artificial Intelligence, Automation and Work .” In The Economics of Artificial Intelligence: An Agenda , edited by Goldfarb Avi Gans Joshua Agrawal Ajay . University of Chicago Press . Google Preview WorldCat COPAC Aiyagari S. Rao ( 1994 ). “ Uninsured Idiosyncratic Risk and Aggregate Saving .” Quarterly Journal of Economics , 109 , 659 – 684 . Google Scholar Crossref Search ADS WorldCat Atalay Enghin ( 2017 ). “ How Important are Sectoral Shocks? ” American Economic Journal: Macroeconomics , 9 , 254 – 280 . Google Scholar Crossref Search ADS WorldCat Baqaee David Rezza Farhi Emmanuel ( 2019 ). “ The Macroeconomic Impact of Microeconomic Shocks: Beyond Hulten’s Theorem .” Econometrica, Econometric Society , 87 , 1155 – 1203 . Google Scholar Crossref Search ADS WorldCat Baqaee David Rezza Farhi Emmanuel ( 2018 ). “ Macroeconomics with Heterogeneous Agents and Input-Output Networks .” Working Paper 24684 , National Bureau of Economic Research, Inc., Cambridge, MA, USA , https://ideas.repec.org/p/nbr/nberwo/24684.html. Baqaee David Rezza Farhi Emmanuel ( 2019a ). “ Networks, Barriers, and Trade .” NBER Working Papers 26108, National Bureau of Economic Research, Inc., Cambridge, MA, USA. WorldCat Baqaee David Rezza Farhi Emmanuel ( 2019b ). “ Productivity and Misallocation in General Equilibrium .” Working Paper 24007 , National Bureau of Economic Research, Inc., Cambridge, MA, USA . Bewley Truman ( 1986 ). “ Stationary Monetary Equilibrium with a Continuum of Independently Fluctuating Consumers .” Contributions to Mathematical Economics in Honor of Gérard Debreu . North Holland , Amsterdam , pp. 79 – 102 . Google Preview WorldCat COPAC Blackorby Charles Robert Russell R. ( 1989 ). “ Will the Real Elasticity of Substitution Please Stand Up? (A Comparison of the Allen/Uzawa and Morishima elasticities) .” American Economic Review , 79 ( 1 ), 882 – 888 . WorldCat Boehm Christoph E. Flaaen Aaron Pandalai-Nayar Nitya ( 2019 ). “ Input Linkages and the Transmission of Shocks: Firm-level Evidence from the 2011 Tōhoku Earthquake .” Review of Economics and Statistics , 101 , 60 – 75 . Google Scholar Crossref Search ADS WorldCat Cohen Avi J. Harcourt Geoffrey C. ( 2003 ). “ Retrospectives: Whatever Happened to the Cambridge Capital Theory Controversies? ” Journal of Economic Perspectives , 17 ( 1 ), 199 – 214 . Google Scholar Crossref Search ADS WorldCat Felipe Jesus Fisher Franklin M. ( 2003 ). “ Aggregation in Production Functions: What Applied Economists Should Know .” Metroeconomica , 54 , 208 – 262 . Google Scholar Crossref Search ADS WorldCat Fisher Franklin M. ( 1965 ). “ Embodied Technical Change and the Existence of an Aggregate Capital Stock .” Review of Economic Studies , 32 , 263 – 288 . Google Scholar Crossref Search ADS WorldCat Fisher Franklin M. ( 1982 ). “ Aggregate Production Functions Revisited: the Mobility of Capital and the Rigidity of Thought .” Review of Economic Studies , 49 , 615 – 626 . Google Scholar Crossref Search ADS WorldCat Fisher Franklin M. ( 1983 ). “ On the Simultaneous Existence of Full and Partial Capital Aggregates .” Review of Economic Studies , 50 , 197 – 208 . Google Scholar Crossref Search ADS WorldCat Griliches Zvi ( 1969 ). “ Capital-Skill Complementarity .” Review of Economics and Statistics , 51 , 465 – 468 . Google Scholar Crossref Search ADS WorldCat Hamermesh Daniel S. ( 1996 ). Labor Demand . Princeton University Press . Google Preview WorldCat COPAC Herrendorf Berthold Rogerson Richard Valentinyi Akos ( 2013 ). “ Two Perspectives on Preferences and Structural Transformation .” American Economic Review , 103 ( 7 ), 2752 – 2789 . Google Scholar Crossref Search ADS WorldCat Hicks John ( 1932 ). Theory of Wages . Springer . Google Preview WorldCat COPAC Houthakker Hendrik S. ( 1955 ). “ The Pareto Distribution and the Cobb-Douglas Production Function in Activity Analysis .” Review of Economic Studies , 23 , 27 – 31 . Google Scholar Crossref Search ADS WorldCat Huggett Mark ( 1993 ). “ The Risk-Free Rate in Heterogeneous-Agent Incomplete-Insurance Economies .” Journal of Economic Dynamics and Control , 17 , 953 – 969 . Google Scholar Crossref Search ADS WorldCat Hulten Charles Nakamura Leonard ( 2017 ). “ Accounting for Growth in the Age of the Internet: The Importance of Output-Saving Technical Change .” Working Paper 23315 , National Bureau of Economic Research, Cambridge, MA, USA , http://www.nber.org/papers/w23315. Hulten Charles R. ( 1978 ). “ Growth Accounting with Intermediate Inputs .” Review of Economic Studies , 45 , 511 – 518 . Google Scholar Crossref Search ADS WorldCat Johnson George E. ( 1997 ). “ Changes in Earnings Inequality: The Role of Demand Shifts .” Journal of Economic Perspectives , 11 ( 2 ), 41 – 54 . Google Scholar Crossref Search ADS WorldCat Krusell Per Ohanian Lee E. Ríos-Rull José-Víctor Violante Giovanni L. ( 2000 ). “ Capital-Skill Complementarity and Inequality: A Macroeconomic Analysis .” Econometrica , 68 , 1029 – 1053 . Google Scholar Crossref Search ADS WorldCat Krusell Per Smith Anthony A. Jr . ( 1998 ). “ Income and Wealth Heterogeneity in the Macroeconomy .” Journal of Political Economy , 106 , 867 – 896 . Google Scholar Crossref Search ADS WorldCat Levhari David ( 1968 ). “ A Note on Houthakker’s Aggregate Production Function in A Multifirm Industry .” Econometrica , 36 , 151 – 154 . Google Scholar Crossref Search ADS WorldCat Marglin Stephen A. ( 1984 ). Growth, Distribution, and Prices . Harvard University Press , p. 155 . Google Preview WorldCat COPAC McFadden Daniel ( 1963 ). “ Constant Elasticity of Substitution Production Functions .” Review of Economic Studies , 30 , 73 – 83 . Google Scholar Crossref Search ADS WorldCat McKenzie Lionel W. ( 1959 ). “ On the Existence of General Equilibrium for a Competitive Market .” Econometrica , 27 , 54 – 71 . Google Scholar Crossref Search ADS WorldCat Morishima Michio ( 1967 ). “ A Few Suggestions on the Theory of Elasticity .” Keizai Hyoron (Economic Review) , 16 , 144 – 150 . WorldCat Oberfield Ezra Raval Devesh ( 2014 ). “ Micro Data and Macro Technology .” Working Paper 20452 , National Bureau of Economic Research, Cambridge, MA, USA , http://www.nber.org/papers/w20452. Pasinetti Luigi L. Fisher Franklin M. Felipe Jesus McCombie J. S. L. Greenfield Robert L. ( 2003 ). “ Cambridge Capital Controversies .” Journal of Economic Perspectives , 17 ( 4 ), 227 – 232 . Google Scholar Crossref Search ADS WorldCat Robinson Joan ( 1953 ). “ The Production Function and the Theory of Capital .” The Review of Economic Studies , 21 , 81 – 106 . Google Scholar Crossref Search ADS WorldCat Russell R. Robert ( 2017 ). “ Elasticities of Substitution .” Tech. rep. Samuelson Paul A. ( 1966 ). “ A Summing Up .” Quarterly Journal of Economics , 80 , 568 – 583 . Google Scholar Crossref Search ADS WorldCat Sato Kazuo ( 1969 ). “ Micro and Macro Constant-Elasticity-of-Substitution Production Functions in A Multifirm Industry .” Journal of Economic Theory , 1 , 438 – 453 . Google Scholar Crossref Search ADS WorldCat Sato Kazuo ( 1975 ). Production Functions and Aggregation , Vol. 90 . North-Holland , Amsterdam . Google Preview WorldCat COPAC Sattinger Michael ( 1993 ). “ Assignment Models of the Distribution of Earnings .” Journal of Economic Literature , 31 , 831 – 880 . WorldCat Solow Robert M. ( 1956 ). “ A Contribution to the Theory of Economic Growth .” Quarterly Journal of Economics , 70 , 65 – 94 . Google Scholar Crossref Search ADS WorldCat Solow Robert M. ( 1963 ). Capital Theory and the Rate of Return . North-Holland , Amsterdam . Google Preview WorldCat COPAC Stern David I. ( 2010 ). “ Derivation of the Hicks, or Direct, Elasticity of Substitution Using the Input Distance Function .” Economics Letters , 108 , 349 – 351 . Google Scholar Crossref Search ADS WorldCat © The Author(s) 2019. Published by Oxford University Press on behalf of European Economic Association. This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model) TI - JEEA-FBBVA Lecture 2018: The Microeconomic Foundations of Aggregate Production Functions JF - Journal of the European Economic Association DO - 10.1093/jeea/jvz050 DA - 2019-10-01 UR - https://www.deepdyve.com/lp/oxford-university-press/jeea-fbbva-lecture-2018-the-microeconomic-foundations-of-aggregate-09Z0vZGpp1 SP - 1337 VL - 17 IS - 5 DP - DeepDyve ER -