TY - JOUR AU - Kalouptsidi, Myrto AB - Abstract This article provides a model-based empirical strategy to, (1) detect the presence and gauge the magnitude of government subsidies and (2) quantify their impact on production reallocation across countries, industry prices, costs and consumer surplus. I construct and estimate an industry model that allows for dynamic agents in both demand and supply and apply my strategy to world shipbuilding, a classic target of industrial policy. I find strong evidence consistent with China having intervened and reducing shipyard costs by $$13$$–$$20\%$$, corresponding to $$1.5$$ to $$4.5$$ billion US dollars, between 2006 and 2012. The subsidies led to substantial reallocation of ship production across the world, with Japan, in particular, losing significant market share. They also misaligned costs and production, while leading to minor surplus gains for shippers. 1. Introduction In recent years, Chinese firms have extremely rapidly dominated a number of capital intensive industries, such as steel, auto parts, solar panels, and shipbuilding.1 Government subsidies are often evoked as a possible contributing factor to China’s expansion.2 Yet, even though industrial subsidies have steered industrialization and growth in several countries throughout economic history ($$e.g.$$ in East Asia), little is known about their quantitative impact on production reallocation across countries, industry prices, costs, and surplus. A significant challenge in this task is that government subsidies to industries are notoriously difficult to detect and measure; and in China even more so. Indeed, partly because international trade agreements prohibit direct and in-kind subsidies, “systematic data are non-existent”3 and thus the presence and magnitude of such subsidies are often unknown. This article assesses the consequences of government subsidies on industrial evolution, focusing on the unparalleled and timely Chinese expansion. Since measuring subsidies is a prerequisite for this analysis, I provide a model-based empirical strategy to detect their presence and gauge their magnitude. I apply my empirical strategy to the world shipbuilding industry, a classic target of industrial policy. In $$2006$$, China identified shipbuilding as a “strategic industry” and introduced a plan for its development.4 Within a short time span, its market share doubled from $$25\%$$ to $$50\%$$, leaving Japan, South Korea and Europe trailing behind. Many asserted that China’s rapid rise was driven by hidden government subsidies that reduced shipyard production costs, as well as by a number of new shipyards constructed through the government plan; here I assess the relative contribution of these interventions. I develop and estimate a model of the shipbuilding industry that allows for dynamic agents in both demand and supply. A large number of shipyards offer durable, differentiated ships. Their production decisions are subject to a dynamic feedback because of time to build: shipyards accumulate backlogs, which can affect their future production cost, either positively (expertise acquisition) or negatively (capacity constraints). Production cost is also subject to an aggregate stochastic shock, summarized in the price of steel, a key production input. Every period a large number of identical potential shipowners decide to enter the freight market by buying a new ship from world shipyards. Demand for new ships is driven by demand for international sea transport, which is uncertain and volatile. As ships are long-lived investments for shipowners, demand depends on expectations about future demand and fleet development. The model primitive of interest is the cost function of potentially subsidized firms. As in many industries, however, costs of production are not observed. My strategy amounts to estimating costs from demand variation in a framework of dynamic demand and supply. In the simplest example of a static, perfectly competitive framework, marginal cost is recovered directly from prices. In that case, the detection strategy amounts to testing for a break in observed ship prices in 2006, when China launched its shipbuilding plan. In my setup, there are two complications: (1) new ship price data are scant, and (2) the shipbuilding production decision is dynamic as it affects future costs. To address (1), I add used ship prices; to address (2) I use the shipyard’s optimality conditions resulting from its dynamic optimization. My estimation strategy can be summarized as follows. To estimate shipyard costs, I adopt a novel approach that is inspired by the literature on estimation of dynamic setups, for example Hotz and Miller (1993), Bajari et al. (2007) and Pakes et al. (2007), as well as Xu (2008), Ryan (2012), Collard-Wexler (2013) and others. I first invert the shipyards’ observed choice probabilities to obtain directly the optimal policy thresholds non-parametrically in an ordered-choice dynamic problem; I then show that the latter lead to a closed-form expression for the shipyards’ ex ante optimal per period payoffs, which in turn are sufficient to obtain shipyard value functions. To recover demand for new ships I estimate the willingness to pay using new and used ship prices, as well as shipowner expectations. The estimation allows expectations and value functions to be different before and after $$2006$$, consistent with China’s intervention. Finally, in estimating dynamics, I employ sparse approximation techniques (LASSO) that allow for a large state space, as well as significant flexibility. Solving value functions via LASSO, which to my knowledge has not be used before, can be very useful to other applications with dynamic agents and a large state space. Indeed, I provide a detailed Monte Carlo analysis to show that this method works well compared to more standard methods of solving dynamic problems ($$e.g.$$ state space discretization), which are often intractable in real applications. I use my framework to detect and measure changes in costs that are consistent with subsidies. I find a strong significant decline in Chinese costs equal to about 13–20% of costs, or 1.5–4.5 billion U.S. dollars at the observed production levels. The results hold under flexible cost function specifications, as well as dynamic shipyards whose past production is found to decrease future costs, capturing the benefits of expertise accumulation/learning by doing. I provide evidence that a number of possible alternative explanations for the recovered cost decline cannot adequately account for the observed patterns. For example, the results are robust to many specifications that control flexibly for time-variation. Moreover, costs did not change in other countries. Convincingly, the results hold when I estimate costs on the subset of shipyards that existed prior to 2001. Hence cost declines are not driven by new shipyards, which may have a different technology (though bulk ship production is not characterized by technological innovations to begin with) or may be optimizing under (a more general form of) learning by doing. Next, I use my framework to quantify the contribution of government interventions in China’s seizing of the market. My main counterfactual predicts industry evolution (production reallocation across countries, ship prices, industry costs, and shipper surplus) in the absence of China’s government plan altogether; note that both cost subsidies and investment in new shipbuilding facilities would violate international trade agreements. Moreover, I assess the relative contribution of the new shipbuilding facilities constructed through China’s plan in a second counterfactual that removes the new facilities, but maintains the cost subsidies detected above. Here is a brief summary of my main four findings from these counterfactuals. First, I find that the interventions led to substantial reallocation in production: in the absence of China’s government plan, Chinese market share is cut to less than half, while Japan’s share increases by $$70\%$$. If only new shipyards are removed, China’s share falls from $$50\%$$ to $$40\%$$, suggesting that new shipyards played an important, but not the predominant part in China’s expansion. Second, ship prices experience moderate increases in all countries in the absence of China’s plan, as the latter shifted supply outward. Third, in the presence of subsidies, freight rates decrease moderately because of the larger fleet between $$2006$$ and $$2012$$ and more so over time due to time to build. As a result of China’s plan, cargo shippers gain about 400 million U.S. dollars in shipper surplus over that time period. Comparing this gain to the $$4$$ billion U.S. dollars of cost subsidies alone implies that the benefits of subsidies for shippers are minimal and perhaps the Chinese government is also aspiring to externalities to different sectors ($$e.g.$$ steel, defense). Fourth, subsidies create a wedge in the alignment of market share and production costs: they lead to a large increase in the industry average cost of production (net of subsidies) by shifting production away from low-cost Japanese shipyards towards high-cost Chinese shipyards. Finally, government subsidies are often related to dynamic considerations, and this is true in this application as well: the long construction lags and the positive production feedback resembling learning by doing are important features of shipbuilding. Indeed, if past production decreases future costs, it is important to take this feedback into account when detecting subsidies, as it may be explaining part of the increase in production. In addition, to evaluate the importance of dynamics in the policy analysis, I consider two extra models: (1) a model with no production feedback so that a firm’s backlog does not affect its future production costs; (2) a model with no construction lags in ship production. In the first model, China experiences a lower increase in its market share of 10%; increased backlog decreases costs and thus market share gains multiply and lead to more reallocation favourable to China. This is consistent with the pro-subsidy arguments of infant industries. In the second model with no time to build, demand for ships is higher in the boom, leading to higher overall production, and higher volatility. This article contributes to our understanding of the impact of industrial policy, while recognizing that we most often lack direct information on the measures undertaken by (foreign) governments. The methodology of subsidy detection here relies solely on having a dataset of prices and quantities (a common dataset in the IO literature) and some knowledge about an implementation date. This allows one to estimate the cost function of potentially subsidized firms from demand variation and to test for a “break” at the corresponding date; the idea being that the government lowers costs via subsidies. In that sense, one could apply this approach to any industry after appropriately tailoring the model ($$e.g.$$ competition, dynamic/static behaviour).5 This article contributes to the recent empirical literature on industry dynamics ($$e.g.$$Benkard, 2004; Aguirregabiria and Mira, 2007; Xu, 2008; Ryan, 2012; Collard-Wexler, 2013). Methodologically, it lies closest to Hotz and Miller (1993), Bajari et al. (2007) and Pakes et al. (2007). This literature typically considers either single agent dynamics or dynamic firms and static consumers (one exception is Chen et al., 2013). In contrast, this article allows for dynamics in both demand and supply; these dynamics are important in many durable goods markets ($$e.g.$$ autos, TVs), as well as upstream–downstream industries ($$e.g.$$ airlines–aircraft builders, shipping–shipbuilders). Kalouptsidi (2014) focuses on the downstream industry of bulk shipping and studies its cyclical behaviour and the role of adjustment costs (time to build as in Kydland and Prescott, 1982) in these fluctuations. That article is a starting point in modelling the demand of shipbuilders in this article, yet the focus here is on Chinese government interventions in the upstream industry of shipbuilding. Both the questions and the industry under study are different in the two papers. The paper also naturally contributes to the literature on trade policies. Goldberg (1995) and Berry et al. (1999) consider the impact of voluntary export restraints in the automobile industry. Grossman (1990) provides an excellent survey of the relevant trade literature, while Brander (1995) is a classic reference on the long theoretical literature of strategic trade (see also references within). Not surprisingly, given the constraints in subsidy data availability, there is little empirical work, most of which is in the form of model calibration. Baldwin and Krugman (1987a) and (1987b) explore the impact of trade policies in the wide-bodied jet aircraft and the semiconductor industries, while Baldwin and Flam (1989) in the small commuter aircraft industry. They all discuss the lack of knowledge regarding both the presence and magnitude of subsidies and other policies and compute industrial evolution under different hypothetical scenarios. The remainder of the article is organized as follows: Section 2 provides a description of the environment under study including subsidy issues and features of the industry. Section 3 presents the model of shipbuilding subsidies. Section 4 describes the data used and provides some descriptive evidence. Section 5 presents the empirical strategy and the estimation results starting with static shipyards and robustness checks and then proceeding to dynamic shipyards. Section 6 provides the counterfactual experiments. Section 7 performs a robustness analysis in several dimensions to evaluate the article’s results, while Section 8 concludes. Finally, the Appendix provides details on several aspects of the analysis, as well as a Monte Carlo for the use of sparse approximations in dynamic programming. 2. Environment This section discusses the practical difficulties regarding the detection and impact evaluation of subsidies generally and in China in particular. Then, it provides the key features and history of the shipbuilding industry. 2.1. Subsidy disputes Although government interventions in the form of subsidies are arguably prevalent and subsidy disputes abundant, both domestic and global policy-makers ($$e.g.$$ WTO) have difficulty in designing and adopting appropriate measures to respond to them.6 Indeed, deciding on subsidy complaints is difficult for two reasons; first, “systematic data (on industrial subsidies) are non-existent; reliable sources of information are scarce and mostly incomplete [...] because governments do not systematically provide the information” (WTO, 2006). Therefore, detecting and measuring subsidies becomes a difficult task and plaintiffs need to base their allegations mostly on available self-reported data.7 The second difficulty concerns (dis)proving “injury caused” by the alleged subsidies.8 Ideally, the question to be answered is “how would have this industry evolved absent the alleged subsidies?”; this is clearly a difficult question, which in practice is answered based on a mostly qualitative analysis of industry indicators on a case by case basis. China in particular, has had more trade conflicts than any other country in the world, in more industries and with more countries.9 China provides industry subsidies in the form of free or low-cost loans, as well as subsidies for inputs (including energy), land and technology. Because of institutional and strategic reasons, the information on subsidies that the Chinese government provides has rampant missing and misreported data.10 These subsidies are often transferred through financial institutions, most of which are directed by the central and provincial governments. 2.2. Shipbuilding and China’s plan11 Shipbuilding is one of the major recipients of subsidies globally, along with for example the steel, mining, and automotive industries. It is often seen as a “strategic industry” as it increases industrial and defense capacity, generates employment and has important spill-overs to other industries ($$e.g.$$ iron and steel). Indeed, several of today’s leading economies developed their production technologies and human capital through a phase of heavy industrialization, in which shipbuilding was one of key pillars. In the 1850s, Britain was the world leading shipbuilder, until it was overtaken by Japan in the 1950s, which in turn lost its leading position to Korea, in the 1970s (4.5% of its GDP today). China’s 11$$^{th}$$ National 5-year Economic Plan 2006–2010 was the first to appoint shipbuilding as a “strategic industry” in need of “special oversight and support”; the central government “unveiled an official shipbuilding blueprint to guide the medium and long-term development of the industry”. As part of the national plan, the central government introduced the “Medium and Long-Term Plan for the Shipbuilding Industry”12 which set the goal of China becoming the largest shipbuilding nation within a decade. The plan sets specific output and capacity goals and as such involves investment in shipbuilding infrastructure and financial support for output growth, including increased credit allowance, low input prices, etc. Consistent with these government programs, Figure 1 shows China’s rapid expansion in shipbuilding dry docks (a measure of shipbuilding capacity). It is important to note that the majority of this expansion ($$82\%$$) was realized through the construction of new facilities; so that the industry experienced a massive entry wave in 2005–06. Figure 1 View largeDownload slide Shipbuilding dry docks Figure 1 View largeDownload slide Shipbuilding dry docks In contrast to the capital expansion, which is observed, subsidies that reduce operating costs are not observed. Yet, consistent with such measures, China’s production and market share increased dramatically as the programmes were announced. Table 1 shows that China’s market share more than doubled across all major ship types. Importantly, this market share expansion was driven by both existing shipyards, as well as the new entrant shipyards: the latter accounted for about half ($$55\%$$) of the output expansion in the case of Handysize bulk carriers (on which the empirical analysis will focus). Figure 2 plots Chinese production of all, new and existing yards over time. TABLE 1 China’s average quarterly market share before and after 2006    Chinese market share, pre-2006  Chinese market share, post-2006  Bulk carriers  0.17  0.57  Tankers  0.15  0.28  Containerships  0.16  0.39  Gas carriers (LNG/LPG)  0.07  0.21     Chinese market share, pre-2006  Chinese market share, post-2006  Bulk carriers  0.17  0.57  Tankers  0.15  0.28  Containerships  0.16  0.39  Gas carriers (LNG/LPG)  0.07  0.21  TABLE 1 China’s average quarterly market share before and after 2006    Chinese market share, pre-2006  Chinese market share, post-2006  Bulk carriers  0.17  0.57  Tankers  0.15  0.28  Containerships  0.16  0.39  Gas carriers (LNG/LPG)  0.07  0.21     Chinese market share, pre-2006  Chinese market share, post-2006  Bulk carriers  0.17  0.57  Tankers  0.15  0.28  Containerships  0.16  0.39  Gas carriers (LNG/LPG)  0.07  0.21  Figure 2 View largeDownload slide Chinese production expansion for all, new and existing shipyards Figure 2 View largeDownload slide Chinese production expansion for all, new and existing shipyards Production in China is also split between public and private firms. A substantial number of shipyards operate under the umbrella of two major state-owned enterprises directly administered by the Chinese central government (CSSC and CSIC), or under provincial and local governments. These public firms account for about 40% of annual Chinese ship production on average. It is worth noting that the day-to-day operations and contract bids are handled directly by the individual shipyards, even if the long-term design is carried out at the federal or local level.13 At the same time, a large number of private firms and foreign joint ventures are active. The private firms accounted for about 40% of the output expansion post 2006. Overall, the global industry is fairly unconcentrated at the firm level with hundreds of firms and low market shares; for instance, the average annual market share in China is 0.8%.14 In addition, China’s shipbuilding is mostly geared towards export sales which comprised about 80% of its production in 2006. Finally, commercial ships are the largest factory produced product. Materials account for about half the cost of the ship (steel is about 13%) and labour (mostly low-skill tasks) about 17% of total cost. This article focuses on cargo transportation and in particular, dry bulk shipping, which concerns vessels designed to carry a homogeneous unpacked cargo (mostly raw materials), for individual shippers on non-scheduled routes. The bulk shipping market consists of a large number of small shipowning firms (largest firm has share 3%). See Kalouptsidi (2014) for a detailed description of the bulk shipping industry. 3. Model In this section, I present a dynamic model for world shipbuilders, following the tradition of Hopenhayn (1992) and Ericson and Pakes (1995). In Section 3.1, I start by presenting a toy model of shipbuilding subsidies to showcase the main mechanisms at play and to aid the reader in following the main model of Section 3.2. Indeed, in Section 3.2 I substantially enrich this toy model to make it more realistic and amenable to empirical work. 3.1. A toy model of shipbuilding subsidies The industry consists of $$N_{c}$$ Chinese firms (shipyards) and $$N_{nc}$$ non-Chinese firms, and both $$N_{c}$$ and $$N_{nc}$$ are large. In each period $$t$$, each firm $$j$$ produces quantity (ships) $$q_{jt}$$ at cost $$c_{jt}(q_{jt})$$. The cost function of ship production $$c_{jt}(\cdot)$$ is quadratic in $$q_{jt}$$, and depends on characteristics of firm $$j$$, $$x_{jt}$$. For instance, consider the cost function:   \[ c_{jt}(q_{jt})=c'_{1}x_{jt}q_{jt}+c_{2}q_{jt}^{2}, \] where $$'$$ denotes transpose and $$c_{1},c_{2}$$ are parameters. Shipyards are price-takers in the world market for ships and face price $$P_{t}$$. The ship price fluctuates over time and depends on world market conditions, such as the demand for shipping services and the total fleet in period $$t$$, which captures the competition that shipowners are facing. As shipyards build more ships, they reduce the shipowners’ willingness to pay, since the latter expect lower profits.15 Let the aggregate demand for ships be $$P_{t}=\bar{\eta}_{0t}-\eta_{1}Y_{t}$$, where $$\bar{\eta}_{0t}$$ captures the international trade conditions and $$Y_{t}$$ is the total fleet. The fleet evolves according to the rule: $$Y_{t}=(1-\delta)Y_{t-1}+Q_{t}$$ where $$Q_{t}$$ is total ship construction at $$t$$ and $$\delta$$ captures ship exit or depreciation. Letting $$\eta_{0t}=\bar{\eta}_{0t}-\eta_{1}(1-\delta)Y_{t-1}$$, the demand curve can be written as $$P_{t}=\eta_{0t}-\eta_{1}Q_{t}$$. Shipyard $$j$$ solves:   \[ \max_{q}P_{t}q-c'_{1}x_{jt}q-c_{2}q^{2}. \] For simplicity, let $$x_{jt}$$ only indicate whether the shipyard is Chinese or not, so that $$c_{1}'x_{jt}\in\left\{ c_{c1},c_{nc1}\right\} $$, where subscript $$c$$ stands for Chinese and $$nc$$ for non-Chinese. Assume also that $$c_{c1}>c_{nc1}$$, so that Chinese shipyards are less efficient than non-Chinese shipyards. Shipyard $$j$$’s optimal production is $$q_{jt}^{*}=(P_{t}-c{}_{j1})/2c_{2}$$, for $$j\in\left\{ c,nc\right\} $$. Moreover, the equilibrium world ship price is $$P_{t}=(2c_{2}\eta_{0t}+\eta_{1}c_{c1}N_{c}+\eta_{1}c_{nc1}N_{nc})/(2c_{2}+\eta_{1}(N_{c}+N_{nc}))$$. Now suppose that China provides a subsidy of $$\kappa$$ dollars for every ship built, so that the cost function of Chinese firms becomes $$(c'_{c1}-\kappa)q_{jt}+c_{2}q_{jt}^{2}$$. Then, the optimal production of a Chinese yard becomes $$q_{ct}^{**}=(P_{t}-c'_{c1}+\kappa)/2c_{2}>q_{ct}^{*}$$ and the price falls at $$P_{t}=(2c_{2}\eta_{0t}+\eta_{1}(c_{c1}-\kappa)N_{c}+\eta_{1}c_{nc1}N_{nc})/(2c_{2}+\eta_{1}(N_{c}+N_{nc}))$$. Essentially, Chinese shipyards become more “efficient” due to their lower cost of production and “steal” market share from firms in other countries whose production declines. Next suppose that the Chinese government also sponsors new facilities so that the number of Chinese firms grows from $$N_{c}$$ to $$N_{c}'$$. In this case, the market price declines, total Chinese production rises and total non-Chinese production falls, as long as the efficiency gap between Chinese and non-Chinese firms, $$c_{c1}-c_{1nc}$$, is lower than a certain threshold.16 The higher the per ship subsidy $$\kappa$$, the more likely it is that total Chinese production goes up. In other words, conditional on increasing the number of shipyards $$N_{c}$$, the Chinese government might wish to also subsidize their costs to guarantee higher overall production. From a welfare perspective, there are obvious winners and losers. Non-Chinese firms lose market share and profits. Consumers gain because prices go down. Chinese shipyards gain, but at the cost of subsidies. Figure 3 depicts these equilibria from the firm’s and industry’s perspective. Figure 3 View largeDownload slide Firm and industry demand and supply for ships Figure 3 View largeDownload slide Firm and industry demand and supply for ships Although this toy model illustrates some of the main mechanisms at play, it is too stylized to identify and quantitatively evaluate government subsidies which is the goal of this article. In the next section I enrich the model in a number of important dimensions. First, I model explicitly the entry behaviour of shipowners which essentially constitutes the demand (when a shipowner buys a ship, he effectively enters the shipping industry). Second, building ships may entail dynamic considerations. Ship production does not happen instantly; rather, a ship takes a number of years to be completed (two to five on average) and thus shipyards accumulate backlogs and have ship delivery schedules. This past production (the backlog) may be an important determinant of costs, due to the accumulation of expertise (Thompson, 2001), economies of scale, or capacity constraints. This feature is potentially important both for the identification and the evaluation of the impact of subsidies. Therefore, the model of Section 3.2 involves dynamics in both the demand and supply. Third, I consider discrete production, as each ship is a large project in a firm’s production line. 3.2. A modelling framework for the shipbuilding industry I now turn to the main model. A key input is demand for new ships, which stems from the decisions of downstream shipowners to purchase new vessels. I therefore begin this section by describing shipowner behaviour. Next, I turn to upstream shipbuilder behaviour and I also discuss how government subsidies enter. Variables with superscript “$$o$$” refer to shipowners and “$$y$$” to yards, while subscripts $$i$$ and $$j$$ refer to a particular owner and yard, respectively. 3.2.1. Demand for new ships (shipowners) Time is discrete and the horizon is infinite. There is a finite number of incumbent shipowners (the fleet) and a large number of identical potential entrant shipowners. I assume constant returns to scale, so that a shipowner is indistinguishable from his ship. Ships are long-lived. Shipowner state space: The state variable of ship $$i$$ at time $$t$$, $$s_{it}^{o}$$, includes its: (1) Age $$\in\left\{ 0,1,...,A\right\} $$. (2) Country where built $$\in C$$. while the industry state, $$s_{t}$$, includes the: (1) Distribution of characteristics in $$s_{it}^{o}$$ over the fleet, $$\overline{s_{t}^{o}}\in\mathbb{R}^{A\times||\mathcal{C}||}$$.17 (2) Industry backlog $$b_{t}\in \mathbb{R} ^{J\times\overline{T}}$$, whose $$\left(j,k\right)^{th}$$ element is the number of ships under construction at time $$t$$ and scheduled to be delivered at $$t+k$$ by shipyard $$j=1,...,J$$ and $$\overline{T}$$ is the maximum possible time to build.18 (3) Aggregate demand for shipping services, $$d_{t}\in \mathbb{R}^{+}$$. (4) Price of steel, $$l_{t}\in \mathbb{R} ^{+}$$.19 Shipowner per period payoffs: In period $$t$$, each shipowner $$i$$ chooses how much transportation (voyages) to offer, $$q_{it}^{o}$$. Shipowners face the inverse demand curve:   \begin{equation} P_{t}^{o}=P^{o}\left(d_{t},Q_{t}^{o}\right), \end{equation} (1) where $$P_{t}^{o}$$ is the price per voyage, $$d_{t}$$ captures demand shifters, such as world industrial production and commodity prices and $$Q_{t}^{o}$$ denotes the total voyages offered, so that $$Q_{t}^{o}=\sum_{i}q_{it}^{o}$$. Voyages are a homogeneous good, but shipowners face heterogeneous convex costs of freight. Ship operating costs increase with the ship’s age and may differ based on country of built because of varying quality. I assume that shipowners act as price-takers in the market for freight. Their resulting per period payoffs are $$\pi^{o}\left(s_{it}^{o},s_{t}\right)$$.20 A ship lives a maximum of $$A$$ periods. Every period, a ship can be hit by an exit shock.21 In particular, I assume that a ship at state $$\left(s_{it}^{o},s_{t}\right)$$ exits with probability $$\delta\left(s_{it}^{o},s_{t}\right)$$ and receives a deterministic scrap value $$\phi\left(s_{it}^{o},s_{t}\right)$$. Note that ships exit with certainty at age $$A$$.22 Shipowner behaviour: The ship’s value function, $$V^{o}\left(s_{it}^{o},s_{t}\right)$$, satisfies the following Bellman equation:   \begin{equation} V^{o}\left(s_{it}^{o},s_{t}\right)=\pi^{o}(s_{it}^{o},s_{t})+\delta(s_{it}^{o},s_{t})\phi(s_{it}^{o},s_{t})+(1-\delta(s_{it}^{o},s_{t}))\beta E\left[V^{o}\left(s_{it+1}^{o},s_{t+1}\right)|s_{it}^{o},s_{t}\right], \end{equation} (2) where $$\beta$$ is the discount factor. In words, the value function of a ship at state $$\left(s_{it}^{o},s_{t}\right)$$ equals the profits from cargo transport plus the scrap value which is received with probability $$\delta(s_{it}^{o},s_{t})$$ and the continuation value $$E\left[V^{o}\left(s_{it+1}^{o},s_{t+1}\right)|s_{it}^{o},s_{t}\right]$$, which is received with probability $$1-\delta(s_{it}^{o},s_{t})$$. The only dynamic control of shipowners is entry in the industry: each period, a large number of identical potential entrants simultaneously make entry decisions. There is time to build, in other words, a shipowner begins its operation a number of periods after its entry decision. To enter, shipowners purchase new vessels from world shipyards. Shipyard $$j$$ in period $$t$$ can build a new ship at price $$P_{jt}^{N}$$ and time to build $$T_{jt}$$. The assumption of a large number of homogeneous potential shipowners implies that shipyard prices are bid up to the ships’ values and shipyards can extract all surplus. One can also think of this as a free entry condition in the shipping industry where the entry cost is equal to the shipyard price. Therefore, the following equilibrium condition holds:   \begin{equation} P_{jt}^{N}=E\left[\beta^{T_{jt}}V^{o}\left(s_{it+T_{jt}}^{o},s_{t+T_{jt}}\right)|s_{it}^{o},s_{t}\right],\label{nb market cond} \end{equation} (3) where $$s_{it}^{o}$$ in this case involves ship age equal to zero and the country of yard $$j$$. This is the ship price of the toy model of Section 3.1. In practice, shipowners can also buy a used ship. In this model, since ships are indistinguishable from their owners, transactions in the second-hand market do not affect entry or profits in the industry. In addition, since there is a large number of identical shipowners who share the value of a ship, the price of a ship in the second hand market, $$P_{it}^{SH}$$, equals this value and shipowners are always indifferent between selling their ship and operating it themselves. Therefore, in equilibrium:   \begin{equation} P_{it}^{SH}=V^{o}\left(s_{it}^{o},s_{t}\right). \end{equation} (4) I revisit sales in the empirical part of the article, where second-hand prices are treated as observations on the value function. 3.2.2. Supply of new ships (shipyards) There are $$J$$ long-lived incumbent shipbuilders. Shipyard state space and state transitions: The state variable of shipyard $$j$$ at time $$t$$, $$s_{jt}^{y}$$, includes its: (1) Backlog $$b_{jt}\in \mathbb{R}^{\overline{T}}$$ whose $$k^{th}$$ element is the number of ships under construction scheduled for delivery at time $$t+k$$.23 (2) Country. (3) Other characteristics, such as: age, capital equipment (number of docks and berths, length of largest dock), number of employees. Shipyards also share the industry state, $$s_{t}$$. I allow time itself to be part of the aggregate state as well. Allowing for time to enter the agents’ decision-making is important in this application, as my empirical analysis of detecting government subsidies hinges on allowing time-varying factors to affect costs. The shipyard’s backlog changes based on its production choice (more on this below). The demand state variable $$d_{t}$$ and steel prices $$l_{t}$$ evolve according to exogenous first-order autoregressive processes (see Section 5.1.2). The evolution of all other states (for the shipowner, shipyard, and aggregate) is deterministic (see Section 5.1.2). Shipyard actions and per period payoffs: In period $$t$$, shipyard $$j$$ draws a private iid (across $$j$$ and $$t$$) production cost shock $$\varepsilon_{jt}$$, with $$\varepsilon_{jt}\sim N\left(0,1\right)$$, and makes a discrete production decision $$q_{jt}\in\left\{ 0,1,...,\overline{q}\right\} $$. Shipyard $$j$$ faces production costs, $$C\left(q,s_{jt}^{y},s_{t},\varepsilon_{jt}\right)$$. Even though $$q_{jt}$$ is an integer I assume that the cost function can be defined over the real interval $$\left[0,\overline{q}\right]$$ and that as such it is convex in $$q$$, in line with the toy model of Section 3.1. I also assume that the cost shock $$\varepsilon_{jt}$$ is paid for each produced unit, so that the total per period cost is:   \begin{equation} C\left(q,s_{jt}^{y},s_{t},\varepsilon_{jt}\right)=c\left(q,s_{jt}^{y},s_{t}\right)+\sigma q\varepsilon_{jt}. \end{equation} (5) In this model $$q_{jt}$$ corresponds to the number of ships ordered in period $$t$$ at shipyard $$j$$.24 As mentioned previously, producing a ship takes time and thus period $$t$$’s ship orders enter the shipyard’s backlog and are delivered a number of years later (Kydland and Prescott, 1982). In particular, period $$t$$ production, $$q_{jt}$$, enters $$j$$’s backlog, $$b_{jt}$$, at position $$T_{jt}$$, while the remaining elements of $$b_{jt}$$ move one period closer to delivery with its first element being delivered.25 Under demand uncertainty, therefore, undertaking a ship order becomes a dynamic choice. To capture this dynamic feedback, I assume that the cost function depends on the shipyard’s backlog. As in Jofre-Bonet and Pesendorfer (2003), there are two opposing ways the backlog can impact costs: on one hand, increased backlog can lower costs because of the accumulation of expertise, or economies of scale ($$e.g.$$ in ordering inputs); on the other hand, increased backlog can raise costs because of capacity constraints ($$e.g.$$ less available labour).26 Allowing for past production to affect current costs is potentially important both for measuring and evaluating the impact of subsidies. Time to build is shipyard-specific and in particular, $$T_{jt}=T\left(s_{jt}^{y},s_{t}\right)$$. In words, the time to build that shipyard $$j$$ can offer at $$t$$ depends on the shipyard’s characteristics ($$e.g.$$ backlog, docks/berths), as well as the aggregate state ($$e.g.$$ time, aggregate demand, and queuing); one can think of this as a reduced form capturing different delivery times over shipyards and time. As discussed above, shipyard $$j$$ sells its ships at a price equal to the shipowners’ entry value:   \begin{equation} VE_{j}^{o}\left(s_{t}\right)\equiv E\left[\beta^{T_{jt}}V^{o}\left(s_{it+T_{jt},}^{o}s_{t+T_{jt}}\right)|s_{it}^{o},s_{t}\right],\label{VE def} \end{equation} (6) where $$s_{it+T_{jt}}^{o}$$ has zero ship age and the country of yard $$j$$.27 Note that $$VE_{j}^{o}\left(s_{t}\right)$$ does not explicitly depend on period $$t$$’s production, $$q_{jt}$$; in other words yards do not face a downward sloping demand curve, in line with the toy model of Section 3.1. Indeed, $$q_{jt}$$ affects the willingness to pay for the ship by entering into the total backlog $$b_{t}$$ and from there into the fleet after $$T_{jt}$$ periods. Typically, $$q_{jt}$$ is a small integer, while the total fleet is a large number in the order of thousands. Therefore each shipyard, when making its production decision, can ignore the impact it has on $$VE_{j}^{o}\left(s_{t}\right)$$; note however, that aggregates do matter so that as the total fleet increases, shipowners’ willingness to pay falls, all else equal. This latter impact is captured by the aggregate demand of the toy model of Section 3.1. Shipyard behaviour: Shipyard $$j$$ chooses its production level to solve the Bellman equation:   \begin{equation} V^{y}\left(s_{jt}^{y},s_{t},\varepsilon_{jt}\right)=\max_{0\leq q\leq\overline{q}}VE_{j}^{o}\left(s_{t}\right)q-c\left(q,s_{jt}^{y},s_{t}\right)-\sigma q\varepsilon_{jt}+\beta E\left[V^{y}\left(s_{jt+1}^{y},s_{t+1},\varepsilon_{jt+1}\right)|s_{jt}^{y},s_{t},q\right].\label{yard value function} \end{equation} (7) To ease notation, I also define the continuation value:   \begin{equation} CV^{y}\left(s_{jt}^{y},s_{t},q\right)\equiv E\left[V^{y}\left(s_{jt+1}^{y},s_{t+1},\varepsilon_{jt+1}\right)|s_{jt}^{y},s_{t},q\right].\label{Q def} \end{equation} (8) The expectation in (7) is over demand for shipping services, $$d_{t}$$, steel prices, $$l_{t}$$, production by other shipyards and $$j$$’s future cost shocks. Under convex costs, the shipyard’s optimal policy amounts to comparing each production level $$q$$ to $$q+1$$ and $$q-1$$, as stated in the following intuitive lemma: Lemma 1. If the shipbuilding cost function $$C\left(q,\cdot\right):\left[0,\overline{q}\right]\rightarrow \mathbb{R} $$, is convex in $$q$$, then the shipyard’s optimal policy is given by:  \begin{equation} q^{\ast}\left(s_{jt}^{y},s_{t},\varepsilon_{jt}\right)=\left\{ \begin{array}{c} 0,\,\,\text{if }\varepsilon_{jt}\geq A\left(s_{jt}^{y},s_{t},0\right)\\ \\ q,\text{ if }\varepsilon_{jt}\in\left[\begin{array}{c} A\left(s_{jt}^{y},s_{t},q-1\right), \ A\left(s_{jt}^{y},s_{t},q\right)\end{array}\right]\\ \\ \overline{q},\,\,\text{if }\varepsilon_{jt}\leq A\left(s_{jt}^{y},s_{t},\overline{q}-1\right), \end{array}\right. \end{equation} (9)where  \[ A\left(s_{jt}^{y},s_{t},q\right)\equiv\frac{1}{\sigma}\left[\begin{array}{c} VE_{j}^{o}\left(s_{t}\right)+\left(c\left(q,s_{jt}^{y},s_{t}\right)-c\left(q+1,s_{jt}^{y},s_{t}\right)\right)+\\ \\ +\beta\left(CV^{y}\left(s_{jt}^{y},s_{t},q+1\right)-CV^{y}\left(s_{jt}^{y},s_{t},q\right)\right) \end{array}\right] \]for $$q=0,1,...,\overline{q}-1$$, are the optimal policy thresholds. Proof. See the Appendix. ∥ Timing and equilibrium: The timing in each period is as follows: incumbent and potential entrant shipowners observe their state $$\left(s_{it}^{o},s_{t}\right)$$, while shipbuilders observe their state $$\left(s_{jt}^{y},s_{t}\right)$$. Shipowners are hit by exit shocks and shipbuilders observe their private production cost shocks. Shipyards make production decisions. Next, shipowners receive profits from freight services and shipyards receive profits from new ship production. Exiting ships receive their scrap value $$\phi(s_{it}^{o},s_{t})$$. Finally, states are updated. I consider a competitive equilibrium which consists of an optimal production policy function $$q^{\ast}\left(s_{jt}^{y},s_{t},\varepsilon_{jt}\right)$$ that is given by (9), as well as value functions $$V^{y}\left(s_{jt}^{y},s_{t}\right)$$ and $$V^{o}\left(s_{it}^{o},s_{t}\right)$$ that satisfy (7) and (2) respectively, while all expectations employ $$q^{\ast}\left(s_{jt}^{y},s_{t},\varepsilon_{jt}\right)$$. Existence of equilibrium follows from Doraszelski and Satterthwaite (2009), Hopenhayn (1992) and Jovanovic (1982). Finally, I assume that China’s plan was an unexpected, one-shot, permanent and immediate change from the point of view of industry participants. Here, there are two equilibria; one before and one after 2006. The post-2006 worlds differs in three dimensions: (1) it has a larger number of Chinese shipyards; (2) Chinese shipyards obtain subsidies and thus have a lower cost function; (3) shipyards can have a different number of docks/berths (found in $$s_{jt}^{y}$$).28 4. Data and Descriptive Evidence 4.1. Data I employ five different datasets, all from Clarksons Research. The first reports shipbuilding quarterly production ($$i.e.$$ orders) between Q1-2001 and Q3-2012. For each shipyard and quarter I observe its bulk ship production in tons and numbers, as well as the yard’s backlog and time to build. There are 192 yards that produce Handysize vessels (the segment on which my empirical analysis will focus), of which 119 are Chinese, 41 are Japanese, 21 are South Korean and 11 are European. Of the 119 Chinese shipyards, 57 entered after 2005. The majority of bulk ship production occurs in China and Japan; hence even though I include Europe and South Korea in the estimation and counterfactuals, most comparisons will be made between China and Japan. As shown in Table 2, quarterly production at the shipyard level is quite low and often zero. Japan produces on average more than China, while Europe rarely produces over the sample period. TABLE 2 Sample summary statistics    Mean  St. Dev.  Min  Max  Quarterly firm production (no of ships)  0.35  1.37  0  22  Quarterly firm backlog (no of ships)  3.7  7.2  0  59  Quarterly firm production (no of ships), China  0.28  1.3  0  22  Quarterly firm production (no of ships), Japan  0.48  1.47  0  15  New ship price, million US dollars  28.4  8  6.3  51  Used ship price, million US dollars  9.9  8.6  0.6  56  Firm number of docks/berths, China  3.4  3.4  1  23  Firm number of docks/berths, Japan  2.5  1.96  1  9     Mean  St. Dev.  Min  Max  Quarterly firm production (no of ships)  0.35  1.37  0  22  Quarterly firm backlog (no of ships)  3.7  7.2  0  59  Quarterly firm production (no of ships), China  0.28  1.3  0  22  Quarterly firm production (no of ships), Japan  0.48  1.47  0  15  New ship price, million US dollars  28.4  8  6.3  51  Used ship price, million US dollars  9.9  8.6  0.6  56  Firm number of docks/berths, China  3.4  3.4  1  23  Firm number of docks/berths, Japan  2.5  1.96  1  9  TABLE 2 Sample summary statistics    Mean  St. Dev.  Min  Max  Quarterly firm production (no of ships)  0.35  1.37  0  22  Quarterly firm backlog (no of ships)  3.7  7.2  0  59  Quarterly firm production (no of ships), China  0.28  1.3  0  22  Quarterly firm production (no of ships), Japan  0.48  1.47  0  15  New ship price, million US dollars  28.4  8  6.3  51  Used ship price, million US dollars  9.9  8.6  0.6  56  Firm number of docks/berths, China  3.4  3.4  1  23  Firm number of docks/berths, Japan  2.5  1.96  1  9     Mean  St. Dev.  Min  Max  Quarterly firm production (no of ships)  0.35  1.37  0  22  Quarterly firm backlog (no of ships)  3.7  7.2  0  59  Quarterly firm production (no of ships), China  0.28  1.3  0  22  Quarterly firm production (no of ships), Japan  0.48  1.47  0  15  New ship price, million US dollars  28.4  8  6.3  51  Used ship price, million US dollars  9.9  8.6  0.6  56  Firm number of docks/berths, China  3.4  3.4  1  23  Firm number of docks/berths, Japan  2.5  1.96  1  9  The second dataset is a sample of shipbuilding contracts, between August 1998 and August 2012. It reports the order and delivery dates, the shipyard and price in million U.S. dollars. Unfortunately, prices are reported for only a very small fraction of contracts. Indeed, especially before 2006 most contracts involve missing prices. In addition, for shipyard-quarter combinations that involve zero production, the corresponding price does not exist by default. To deal with these issues, I introduce a dataset of second-hand ship sale transactions, between August 1998 and August 2012. The dataset reports the date of the transaction, the name and age of the ship, as well as the price in million U.S. dollars. I end up with 417 observations of new ship contracts and 2016 observations of second-hand sale contracts (2,434 total), of which 1,172 are pre-2006 and 1261 are post-2006. As shown in Table 2, which reports summary statistics, new ship prices are on average substantially higher than second-hand prices, though this is driven primarily by ship age. Prices are also very volatile, driven mostly by time series variation. Indeed, in Table 3, I regress new and used prices on ship age, country of built, and quarter dummies. The $$R^{2}$$ is high, at 0.82. The age coefficient in the used price regressions suggests that a very young ship in the second-hand market commands a price roughly equal to that of a new ship. Finally, Chinese ships are slightly cheaper than Japanese and South Korean. TABLE 3 Hedonic regression of new and used ships.    I New ship price  II Used ship price  Constant  14.8**  17.1**     (1.5)  (0.69)  Age  –  –0.76**        (0.01)  Built in Europe  –  –0.86**        (0.42)  Built in Japan  1.7*  0.36     (0.93)  (0.36)  Built in South Korea  3.4**  0.53     (0.51)  (0.51)  Quarter dummies  Yes  Yes  $$R^{2}$$  0.82  0.83  Obs.  417  2016     I New ship price  II Used ship price  Constant  14.8**  17.1**     (1.5)  (0.69)  Age  –  –0.76**        (0.01)  Built in Europe  –  –0.86**        (0.42)  Built in Japan  1.7*  0.36     (0.93)  (0.36)  Built in South Korea  3.4**  0.53     (0.51)  (0.51)  Quarter dummies  Yes  Yes  $$R^{2}$$  0.82  0.83  Obs.  417  2016  $$^{*}p<0.1$$, $$^{**}p<0.05$$. Price in million U.S. Dollars TABLE 3 Hedonic regression of new and used ships.    I New ship price  II Used ship price  Constant  14.8**  17.1**     (1.5)  (0.69)  Age  –  –0.76**        (0.01)  Built in Europe  –  –0.86**        (0.42)  Built in Japan  1.7*  0.36     (0.93)  (0.36)  Built in South Korea  3.4**  0.53     (0.51)  (0.51)  Quarter dummies  Yes  Yes  $$R^{2}$$  0.82  0.83  Obs.  417  2016     I New ship price  II Used ship price  Constant  14.8**  17.1**     (1.5)  (0.69)  Age  –  –0.76**        (0.01)  Built in Europe  –  –0.86**        (0.42)  Built in Japan  1.7*  0.36     (0.93)  (0.36)  Built in South Korea  3.4**  0.53     (0.51)  (0.51)  Quarter dummies  Yes  Yes  $$R^{2}$$  0.82  0.83  Obs.  417  2016  $$^{*}p<0.1$$, $$^{**}p<0.05$$. Price in million U.S. Dollars The fourth dataset employed is a snapshot of shipyard characteristics in 2012, such as shipyards’ first year of ship delivery, location, number of dry docks and berths, length of the largest dock, number of employees, total past output. The first year of delivery is used to compute the shipyard’s age.29 The number of docks and berths is a measure of capacity, since production bottlenecks occur during the assembly operations done on the docks/berths. I allow the capital infrastructure of yards ($$i.e.$$ docks/berths and length) to be different before and after China’s plan.30 The adoption of a pre- and post-$$2006$$ capital infrastructure level is consistent with the modeling assumption of two equilibria, before and after China’s plan. I drop shipyards with missing capital measures (docks/berths) so that the end shipyard sample consists of 4,741 shipyard-quarter observations (all results are robust if the full sample is used). As shown in Table 2, a shipyard has on average about three docks/berths and there is sizable heterogeneity. Finally, the fifth dataset consists of quarterly time-series between $$1998$$ and $$2012$$ for the fleet, total backlog, orders of new ships, deliveries and demolitions between $$1998$$ and $$2012$$. This dataset is used to create the states and estimate their transition. I also obtain quarterly time-series of Japan’s steel ship plate commodity price in dollars per ton during the same time period. 4.2. Descriptive evidence What patterns of the raw data are consistent with the presence of subsidies? One might expect that new ship prices should react around 2006. As discussed above, however, the sparsity of new ship prices (especially in the pre era) makes it impossible to explore this. Used ship prices, however, should also display a reaction. Figure 4 uses the hedonic regression of Table 3 and shows that indeed there is a short-lived drop in late 2005, in a period when ship prices are trending upward due to increased demand for freight (shifts in demand are also the reason the price decline is not permanent in the raw data).31 Of course, this finding is not proof of cost subsidies; yet if no drop were observed, one may have been concerned about the impact of this policy. Figure 4 View largeDownload slide Hedonic regression of used ship prices on ship age, country, and quarter dummies Figure 4 View largeDownload slide Hedonic regression of used ship prices on ship age, country, and quarter dummies Despite the importance of a price response, the main insight of this article, in terms of identifying/measuring subsidies, is that production patterns are equally important. Figure 5 depicts the evolution of China’s market share. Between $$2005$$ and $$2006$$, China experiences a large, rapid increase in market share. In this article, I employ precisely this rapid increase in production (in combination with prices) to identify changes in costs that are consistent with the presence of subsidies. At the same time, the constructed model can control for the different factors leading to this increase and provide a measurement of the alleged subsidies, a fundamental input into subsidy disputes. When I come to the results, I discuss alternative explanations for this pattern ($$e.g.$$ productivity changes/technological improvements) and argue that they cannot account for the observed industry evolution and the speed of China’s upswing. Figure 5 View largeDownload slide China’s market share Figure 5 View largeDownload slide China’s market share 5. Model Estimation and Detection of Subsidies To see the main idea behind the subsidy detection method, consider the toy model of Section 3.1; that is a static, perfectly competitive market, so that $$P_{jt}^{N}=MC_{jt}$$ for all $$j$$ and $$t$$. In that case, to detect subsidies one would simply look for a break in observed prices in 2006, since prices are in fact the marginal costs. In my setup, there are two complications: (1) I do not observe enough prices of new ships, and (2) there are dynamics in the production decision. To address (1), I complement with used ship prices; to address (2) I use the shipyard’s first-order condition from its dynamic optimization. Overall the detection methodology relies on a firm level dataset (here, quantities and prices) and some knowledge about the date of the subsidy programme implementation. Although the model is tailored to the shipbuilding industry, one can apply the same idea to other setups under suitable adjustments. The proposed strategy proceeds in two steps. In the first step, I recover the demand curve that shipbuilders face, which coincides with the value that shipowners place on entering the shipping industry. Retrieving this willingness to pay for a new ship amounts to estimating the value function for a new ship, as well as shipowner expectations. The second step inserts the estimated willingness to pay for a ship into the optimization problem of shipbuilders to recover their costs. Each estimation task is described below and followed by the results. A time period is a quarter. I consider Q3-2005 as the first quarter of the post world, consistent with Figures 1 and 4; all results are robust to alternative thresholds around that date. Finally, all results presented are for bulk vessels, in particular, Handysize. There are three good reasons to focus on bulk carriers: China was already an important player before 2006; their production process is not characterized by significant technological advances; product differentiation is limited. 5.1. Estimation of the willingness to pay for a new ship In this step, I create the state space and then estimate state transitions and ship value functions. 5.1.1. State space The ship’s state space includes its age and country of built, as well as the industry state, $$s_{t}$$ (fleet age distribution, backlog, steel prices, and demand for shipping services). All ship states are directly observed in the data except for the demand for shipping services, $$d_{t}$$. I construct $$d_{t}$$ following Kalouptsidi (2014) by estimating the demand curve for shipping services (1) and using its intercept. The empirical analogue of (1) that I estimate is:   \[ P_{t}^{o}=\alpha_{0}^{d}+\alpha_{1}^{d}X_{t}^{d}+\alpha_{2}^{d}Q_{t}^{o}+\varepsilon_{t}^{d}, \] where $$P_{t}^{o}$$ is the price per voyage, $$X_{t}^{d}$$ are demand shifters ($$e.g.$$ world commodity prices and production) and $$Q_{t}^{o}$$ is the number of voyages. This is estimated by IV regression and I set $$d_{t}\equiv\hat{\alpha}_{0}^{d}+\hat{\alpha}_{1}^{d}X_{t}^{d}+\hat{\varepsilon}_{t}^{d}$$. Details and estimates are presented in the Appendix. 5.1.2. State transitions To compute the value of entering the shipping industry, defined in (6), I need shipowner expectations over $$\left(s_{it}^{o},s_{t}\right)$$. The transition of $$s_{it}^{o}$$ is known (age evolves deterministically, while country of built is time invariant). The transition of $$s_{t}$$ is computationally complex: on one hand the dimension of the state space is very large (in the case of four countries, $$\overline{s_{t}^{o}}$$ has dimension $$4\times A$$—where $$A$$ is a ship’s maximum age; while the backlog $$b_{t}$$ has dimension $$J\times\overline{T}$$ which in my sample is in the order of several thousand); on the other hand, updating $$b_{t}$$ requires optimal production policies for all shipyards. Instead of working with the true transitions, I follow Pathak (2015) who assume that $$s_{t}$$ follows a vector autoregressive (VAR) model; this is similar to the first step of two-step estimation procedures for dynamic games ($$e.g.$$Bajari et al., 2007 and Pakes et al., 2007). To deal with the state dimension, I make the following simplifying assumptions, in the spirit of Krussel and Smith (1998). First, I replace the fleet distribution, $$\overline{s_{t}^{o}}$$, with two age groups $$\left(\overline{s_{1t}^{o}},\overline{s_{2t}^{o}}\right)$$: the number of ships below $$20$$ years old and the number of ships above $$20$$ years old. I do not use the distribution of the fleet over country of built because its evolution is extremely slow. Second, I replace the backlog, $$b_{t}$$, with the total backlog $$B_{t}=\sum_{j,l}b_{jtl}$$. In Section 7, I present a number of robustness exercises regarding these simplifying assumptions. I consider the VAR model:   \[ s_{t}=R_{o}+Rs_{t-1}+\xi_{t}, \] where $$\xi_{t}\sim N\left(0,\Sigma_{\xi}\right)$$. Note that the parameter vector $$R_{0}$$ and the parameter matrix $$R$$ can be different before and after 2006: since state transitions are not modeled explicitly, the VAR model embraces equilibrium features of agents’ expectations that are likely to change after China’s intervention. In particular, shipowners are likely to believe that all else equal the supply of ships is permanently higher post 2006. As discussed above, demand $$d_{t}$$ and steel price $$l_{t}$$ are exogenous to the model and are unaffected by the pre/post-2006 regime. Essentially I assume they are autonomous AR(1) processes. In contrast, I allow $$\left(\overline{s_{1t}^{o}},\overline{s_{2t}^{o}},B_{t}\right)$$ to be affected by all variables to account for ship entry and exit. The baseline specification allows only $$R_{0}$$ to change before and after $$2006$$. I estimate the parameters of interest $$\left(R_{0},R,\Sigma_{\xi}\right)$$ via OLS separately for each variable (Hamilton, 1994). Table 13 in the Appendix reports the results and provides more details. All variables are persistent. Signs are also in general as expected: $$\overline{s_{1t}^{o}}$$ is increasing in the backlog and demand and decreasing in steel prices (as steel prices increase, exit increases); $$\overline{s_{2t}^{o}}$$ is decreasing in $$\overline{s_{1t}^{o}}$$, as more young ships increase exit, and increasing in demand which leads to less exit; the backlog is increasing in demand. All eigenvalues of $$R$$ lie inside the unit circle so that the model is stationary. Finally, the post-$$2006$$ world’s steady state has significantly higher fleet. In Section 7, I present a number of robustness checks on the transition process for state variables, including a somewhat less sparse model that lies closer to the actual transitions. 5.1.3. Ship value function The main object entering the willingness to pay for a new ship in (6), is the ship’s value function. To estimate it, I treat prices of new and used ships as observations of the value of entry and the value function respectively. In particular, under the assumption of a large number of identical potential entrant shipowners, ship prices are bid up to valuations. The empirical versions of the equilibrium conditions (3) and (4) are:   \begin{equation} P_{jt}^{N}=E\left[\beta^{T_{jt}}V^{o}\left(s_{it+T_{jt},}^{o}s_{t+T_{jt}}\right)|s_{it}^{o},s_{t}\right]+\zeta_{jt}^{N}\label{estim NB 1} \end{equation} (10)  \begin{equation} P_{it}^{SH}=V^{o}\left(s_{it}^{o},s_{t}\right)+\zeta_{it}^{SH}, \end{equation} (11) where $$\zeta_{it}^{SH}$$ and $$\zeta_{jt}^{N}$$ are measurement error. Kalouptsidi (2014) employs used ship prices alone to non-parametrically estimate ship value functions and provides an extensive discussion on the merits and caveats of this approach, as well as direct and suggestive evidence against worries of sample selection. Here, however, the addition of the new ship prices requires the combination of (10) and (11) in a single estimation step, which is not straightforward. Indeed, even though both $$V$$ and $$VE$$ can be written directly as a function of the state, time to build implies that they cannot be written as the same function of the state. Indeed, in $$V$$ the owner already has the ship, while in $$VE$$ the owner will obtain it in a number of periods. Thus, to combine (10) and (11) in one regression, one needs to model agent expectations. I next describe the approach. I use a flexible linear sieve approximation for the value function:   \[ V^{o}\left(s_{it}^{o},s_{t}\right)=\gamma^{o}f^{o}\left(s_{it}^{o},s_{t}\right), \] where $$f^{o}\left(\cdot\right)$$ is a polynomial function in the ship state $$\left(s_{it}^{o},s_{t}\right)$$ and $$\gamma^{o}$$ is a (sparse) parameter vector. Then:   \begin{equation} P_{it}^{SH}=\gamma^{o}f^{o}\left(s_{it}^{o},s_{t}\right)+\zeta_{it}^{SH} \end{equation} (12) while   \[ P_{jt}^{N}=\gamma^{o}\beta^{T_{jt}}E\left[f^{o}\left(s_{it+T_{jt}}^{o},s_{t+T_{jt}}\right)|s_{it}^{o},s_{t}\right]+\zeta_{jt}^{N}= \]  \begin{equation} =\gamma^{o}\beta^{T_{jt}}\int f^{o}\left(s_{it+T_{jt}}^{o},s_{t+T_{jt}}\right)dP\left(s_{it+T_{jt}}^{o},s_{t+T_{jt}}|s_{it}^{o},s_{t}\right)+\zeta_{jt}^{N}\equiv\gamma^{o}f_{N}\left(s_{it}^{o},s_{t}\right)+\zeta_{jt}^{N},\label{estim NB 2} \end{equation} (13) where $$P\left(s_{it+1}^{o},s_{t+1}|s_{it}^{o},s_{t}\right)$$ is the state transition probability function and can be determined by the VAR estimated above. The parameter vector $$\gamma^{o}$$ enters (12) and (13) linearly; yet even though (12) can be estimated in a straightforward manner, (13) requires the computation of the right-hand side integrals. Indeed, (13) involves the expectation of higher order terms of the following vector:   \begin{equation} s_{t+T}=R^{T}s_{t}+\sum_{k=t+1}^{t+T}R^{t+T-k}(R_{o}+\xi_{k}). \end{equation} (14) In the Appendix, I derive closed-form expressions for the integrals of up to third-order terms in the industry state $$s_{t}$$. As the dimensionality of the state $$\left(s_{it}^{o},s_{t}\right)$$ is large, computing high order polynomial terms quickly leads to a very large number of regressors in (12) and (13). I therefore use the LASSO which is appropriate for sparse regression problems, that is problems that involve a large number of potential regressors, only a small subset of which is important in capturing the regression function accurately. Thus $$\gamma^{o}$$ is estimated from:   \[ \min_{\gamma^{o}}\left\{ \sum_{j,t}\left(P_{jt}^{N}-\gamma^{o}f_{N}\left(s_{it}^{o},s_{t}\right)\right)^{2}+\sum_{i,t} \left(P_{it}^{SH}-\gamma^{o}f^{o}\left(s_{it}^{o},s_{t}\right)\right)^{2}+\lambda\left\vert \gamma^{o}\right\vert _{1}\right\}, \] where $$\lambda$$ penalizes the number of non-zero coefficients in $$\gamma^{0}$$. In this application, the regressors $$f^{o}\left(s_{it}^{o},s_{t}\right)$$ and $$f_{N}\left(s_{it}^{o},s_{t}\right)$$ are third-order polynomials in $$s_{t}$$ and $$s_{it}^{o}$$, as well as interactions of $$s_{it}^{o}$$ and $$s_{t}$$. The discount factor is set to $$0.9877$$ which corresponds to $$5\%$$ annual interest rate. The flexible nature of this empirical approach implies that the parameters $$\gamma^{o}$$ embody equilibrium features which are likely to change in 2006 as agents’ valuations are altered. Therefore, in analogy with the VAR formulation, I allow the value function to change before and after by a 2006 dummy. Figure 6 depicts the estimated value function on the observed states for zero year old ships. Consistent with the raw data, Chinese ships are of somewhat lower value, yet the differences are small.32 Figure 6 View largeDownload slide Value function of a ship at age zero. The 0.95 bootstrap confidence intervals Figure 6 View largeDownload slide Value function of a ship at age zero. The 0.95 bootstrap confidence intervals 5.2. Shipbuilding production cost function I next turn to estimating the shipbuilding cost function. I begin with the simple case where shipbuilders are static to illustrate the estimation strategy followed. I present the estimation results along with several robustness exercises. I then proceed to the case of dynamic shipbuilders. Section 7 provides further robustness analysis. 5.2.1. Static shipbuilders If shipyard $$j$$ is myopic it solves:   \[ \max_{0\leq q\leq\overline{q}}VE_{j}^{o}\left(s_{t}\right)q-c\left(q,s_{jt}^{y},s_{t}\right)-\sigma\varepsilon_{jt}q. \] This is essentially an ordered choice problem, so to estimate the cost function parameters, I maximize the following likelihood function of observed shipyard production decisions:   \begin{equation} \displaystyle\prod\limits _{j,t:q_{jt}=0}\Pr\left(q_{jt}=0|s_{jt}^{y},s_{t}\right)\displaystyle\prod\limits _{q=1}^{\bar{q}-1}\displaystyle\prod\limits _{j,t:q_{jt}=q}\Pr\left(q_{jt}=q|s_{jt}^{y},s_{t}\right)\displaystyle\prod\limits _{j,t:q_{jt}=\overline{q}}\Pr\left(q_{jt}=\overline{q}|s_{jt}^{y},s_{t}\right).\label{likelihood fnc} \end{equation} (15) I consider a cost function of the following flexible form:   \[ c\left(q,s_{jt}^{y},s_{t}\right)=c_{1}'g_{1}\left(s_{jt}^{y},s_{t}\right)q+c_{2}\left(q,s_{jt}^{y},s_{t}\right), \] where $$g_{1}(\cdot)$$ is a known vector-valued function of the shipyard characteristics $$s_{jt}^{y}$$ and aggregate states $$s_{t}$$ (steel prices and time); $$c_{1}$$ is the vector of parameters of the linear component; and $$c_{2}(\cdot)$$ is the cost curvature and is convex in the output $$q$$. As shown below, I consider several flexible specifications for the latter. The simple baseline specification assumes that the shipbuilding cost function is quadratic:   \[ c\left(q,s_{jt}^{y},s_{t}\right)=c_{1}'g_{1}\left(s_{jt}^{y},s_{t}\right)q+c_{2}'g_{2}\left(s_{jt}^{y},s_{t}\right)q^{2} \] with $$\left(c_{1}',c_{2}',\sigma\right)$$ the parameters of interest. In all specifications, the linear component includes country dummies, as well as a China-post 2006 dummy. Testing whether the latter is different than zero provides evidence of a structural change in China’s cost function, for any value of $$s_{jt}^{y}$$, $$s_{t}$$ and $$q$$.33 Table 4 reports the baseline cost function estimates. In all specifications, there is a strongly significant decline in China’s cost after $$2006$$ in the order of 13–20%, as indicated by the China-post dummy. Multiplying this parameter by China’s production, I find that between $$2006$$ and $$2012$$ Chinese subsidies cost between 1.5 and 4.5 billion U.S. Dollars. The results suggest some convexity in costs. The coefficient on backlog is negative, implying cost declines due to expertise or economies of scale.34 This finding is consistent with industry participants’ testimony, who claim that shipyards have incentives to produce ships similar to those they already have under construction. In addition, costs are decreasing in capital measures, as expected. Not surprisingly, Europe is the highest cost producer, while either Japan or China post-2006 are the lowest cost producer depending on the specification. TABLE 4 Baseline static cost estimates Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{\gamma}$$  I  II  III  IV  V  VI  VII  VIII  IX     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  32.00**  32.25**  32.31**  21.48**  32.07**  39.89**  33.66**  30.92**  34.79**     (5.24)  (5.05)  (5.10)  (5.30)  (5.46)  (7.71)  (5.20)  (6.51)  (5.66)  China, POST  –6.36**  –6.14**  –7.24**  –2.78**  –5.92**  –8.37**  –6.19**  –6.14**  6.03**     (2.19)  (1.93)  (2.98)  (1.56)  (2.28)  (2.93)  (1.96)  (2.10)  (1.99)  Europe  32.16**  32.46**  36.02**  21.58**  32.35**  39.72**  33.78**  31.13**  34.95**     (5.68)  (5.40)  (6.85)  (5.56)  (5.65)  (8.28)  (5.46)  (6.50)  (6.16)  Japan  25.91**  26.01**  26.33**  19.54**  25.87**  32.79**  27.57**  24.61**  27.63**     (3.25)  (3.25)  (3.37)  (3.66)  (3.32)  (5.37)  (3.37)  (4.29)  (3.80)  Japan, POST              0.24                             (1.15)              S Korea  33.94**  34.41**  38.51**  25.49**  34.40**  39.76**  35.46**  33.20**  36.06**     (4.58)  (4.65)  (6.10)  (4.84)  (4.73)  (6.45)  (4.52)  (5.65)  (5.22)  Backlog  –0.63**  –0.61**  –0.63**  –0.29**  –0.60**  –0.84**  –0.59**  –0.62**  0.74**     (0.17)  (0.16)  (0.17)  (0.16)  (0.17)  (0.24)  (0.15)  (0.18)  (0.19)  Docks/berths     –0.14  –0.14     –0.14     –0.15  –0.13  0.23        (0.11)  (0.12)     (0.11)     (0.11)  (0.11)  (0.14)  Max length     –0.001  –0.001     –0.001     –0.0009  –0.001  0.001        (0.0008)  (0.0009)     (0.0008)     (0.0008)  (0.0008)  (0.002)  Steel price  0.50**  0.51**  0.52**  0.78**  0.50**  0.50**  0.48**  0.52**  0.57**     (0.17)  (0.17)  (0.16)  (0.36)  (0.17)  (0.21)  (0.17)  (0.15)  (0.23)  $$t$$  0.14**  0.13**  0.04     0.13**  0.19**  0.13**  0.13**  0.15**     (0.05)  (0.05)  (0.06)     (0.06)  (0.07)  (0.05)  (0.05)  (0.05)  China$${*}$$t        0.13                             (0.10)                    Japan$${*}$$t        0.10                             (0.07)                    Year dummies  No  No  No  Yes  No  No  No  No        Curvature: $$c_{2}q^{\gamma}$$  $$c_{2}q^{2}$$  1.23**  1.20**  1.24**  0.53*  1.18**           1.42**     (0.35)  (0.32)  (0.34)  (0.33)  (0.36)           (0.38)  $$c_{2}*\text{Docks/berths}*q^{2}$$                 0.38**                             (0.11)           $$c_{2}q^{3}$$                    0.11**                             (0.03)        $$c_{2}$$                       2.47*     $$c_{2}q^{\gamma}$$:                       (1.57)     $$\gamma$$                       1.72**                             (0.15)     $$\sigma$$  12.07  11.67  12.13  5.11  11.48  17.31  11.48  11.76  13.08     (3.23)  (3.03)  (3.31)  (3.15)  (3.38)  (4.93)  (2.86)  (3.50)  (3.44)  Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{\gamma}$$  I  II  III  IV  V  VI  VII  VIII  IX     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  32.00**  32.25**  32.31**  21.48**  32.07**  39.89**  33.66**  30.92**  34.79**     (5.24)  (5.05)  (5.10)  (5.30)  (5.46)  (7.71)  (5.20)  (6.51)  (5.66)  China, POST  –6.36**  –6.14**  –7.24**  –2.78**  –5.92**  –8.37**  –6.19**  –6.14**  6.03**     (2.19)  (1.93)  (2.98)  (1.56)  (2.28)  (2.93)  (1.96)  (2.10)  (1.99)  Europe  32.16**  32.46**  36.02**  21.58**  32.35**  39.72**  33.78**  31.13**  34.95**     (5.68)  (5.40)  (6.85)  (5.56)  (5.65)  (8.28)  (5.46)  (6.50)  (6.16)  Japan  25.91**  26.01**  26.33**  19.54**  25.87**  32.79**  27.57**  24.61**  27.63**     (3.25)  (3.25)  (3.37)  (3.66)  (3.32)  (5.37)  (3.37)  (4.29)  (3.80)  Japan, POST              0.24                             (1.15)              S Korea  33.94**  34.41**  38.51**  25.49**  34.40**  39.76**  35.46**  33.20**  36.06**     (4.58)  (4.65)  (6.10)  (4.84)  (4.73)  (6.45)  (4.52)  (5.65)  (5.22)  Backlog  –0.63**  –0.61**  –0.63**  –0.29**  –0.60**  –0.84**  –0.59**  –0.62**  0.74**     (0.17)  (0.16)  (0.17)  (0.16)  (0.17)  (0.24)  (0.15)  (0.18)  (0.19)  Docks/berths     –0.14  –0.14     –0.14     –0.15  –0.13  0.23        (0.11)  (0.12)     (0.11)     (0.11)  (0.11)  (0.14)  Max length     –0.001  –0.001     –0.001     –0.0009  –0.001  0.001        (0.0008)  (0.0009)     (0.0008)     (0.0008)  (0.0008)  (0.002)  Steel price  0.50**  0.51**  0.52**  0.78**  0.50**  0.50**  0.48**  0.52**  0.57**     (0.17)  (0.17)  (0.16)  (0.36)  (0.17)  (0.21)  (0.17)  (0.15)  (0.23)  $$t$$  0.14**  0.13**  0.04     0.13**  0.19**  0.13**  0.13**  0.15**     (0.05)  (0.05)  (0.06)     (0.06)  (0.07)  (0.05)  (0.05)  (0.05)  China$${*}$$t        0.13                             (0.10)                    Japan$${*}$$t        0.10                             (0.07)                    Year dummies  No  No  No  Yes  No  No  No  No        Curvature: $$c_{2}q^{\gamma}$$  $$c_{2}q^{2}$$  1.23**  1.20**  1.24**  0.53*  1.18**           1.42**     (0.35)  (0.32)  (0.34)  (0.33)  (0.36)           (0.38)  $$c_{2}*\text{Docks/berths}*q^{2}$$                 0.38**                             (0.11)           $$c_{2}q^{3}$$                    0.11**                             (0.03)        $$c_{2}$$                       2.47*     $$c_{2}q^{\gamma}$$:                       (1.57)     $$\gamma$$                       1.72**                             (0.15)     $$\sigma$$  12.07  11.67  12.13  5.11  11.48  17.31  11.48  11.76  13.08     (3.23)  (3.03)  (3.31)  (3.15)  (3.38)  (4.93)  (2.86)  (3.50)  (3.44)  $$^*p<0.1$$, $$^{**}p<0.05$$. Standard errors computed from 500 bootstrap samples. Time $$t$$ measured in quarters. Countries refer to country dummy variables. Column IX involves only existing shipyards. TABLE 4 Baseline static cost estimates Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{\gamma}$$  I  II  III  IV  V  VI  VII  VIII  IX     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  32.00**  32.25**  32.31**  21.48**  32.07**  39.89**  33.66**  30.92**  34.79**     (5.24)  (5.05)  (5.10)  (5.30)  (5.46)  (7.71)  (5.20)  (6.51)  (5.66)  China, POST  –6.36**  –6.14**  –7.24**  –2.78**  –5.92**  –8.37**  –6.19**  –6.14**  6.03**     (2.19)  (1.93)  (2.98)  (1.56)  (2.28)  (2.93)  (1.96)  (2.10)  (1.99)  Europe  32.16**  32.46**  36.02**  21.58**  32.35**  39.72**  33.78**  31.13**  34.95**     (5.68)  (5.40)  (6.85)  (5.56)  (5.65)  (8.28)  (5.46)  (6.50)  (6.16)  Japan  25.91**  26.01**  26.33**  19.54**  25.87**  32.79**  27.57**  24.61**  27.63**     (3.25)  (3.25)  (3.37)  (3.66)  (3.32)  (5.37)  (3.37)  (4.29)  (3.80)  Japan, POST              0.24                             (1.15)              S Korea  33.94**  34.41**  38.51**  25.49**  34.40**  39.76**  35.46**  33.20**  36.06**     (4.58)  (4.65)  (6.10)  (4.84)  (4.73)  (6.45)  (4.52)  (5.65)  (5.22)  Backlog  –0.63**  –0.61**  –0.63**  –0.29**  –0.60**  –0.84**  –0.59**  –0.62**  0.74**     (0.17)  (0.16)  (0.17)  (0.16)  (0.17)  (0.24)  (0.15)  (0.18)  (0.19)  Docks/berths     –0.14  –0.14     –0.14     –0.15  –0.13  0.23        (0.11)  (0.12)     (0.11)     (0.11)  (0.11)  (0.14)  Max length     –0.001  –0.001     –0.001     –0.0009  –0.001  0.001        (0.0008)  (0.0009)     (0.0008)     (0.0008)  (0.0008)  (0.002)  Steel price  0.50**  0.51**  0.52**  0.78**  0.50**  0.50**  0.48**  0.52**  0.57**     (0.17)  (0.17)  (0.16)  (0.36)  (0.17)  (0.21)  (0.17)  (0.15)  (0.23)  $$t$$  0.14**  0.13**  0.04     0.13**  0.19**  0.13**  0.13**  0.15**     (0.05)  (0.05)  (0.06)     (0.06)  (0.07)  (0.05)  (0.05)  (0.05)  China$${*}$$t        0.13                             (0.10)                    Japan$${*}$$t        0.10                             (0.07)                    Year dummies  No  No  No  Yes  No  No  No  No        Curvature: $$c_{2}q^{\gamma}$$  $$c_{2}q^{2}$$  1.23**  1.20**  1.24**  0.53*  1.18**           1.42**     (0.35)  (0.32)  (0.34)  (0.33)  (0.36)           (0.38)  $$c_{2}*\text{Docks/berths}*q^{2}$$                 0.38**                             (0.11)           $$c_{2}q^{3}$$                    0.11**                             (0.03)        $$c_{2}$$                       2.47*     $$c_{2}q^{\gamma}$$:                       (1.57)     $$\gamma$$                       1.72**                             (0.15)     $$\sigma$$  12.07  11.67  12.13  5.11  11.48  17.31  11.48  11.76  13.08     (3.23)  (3.03)  (3.31)  (3.15)  (3.38)  (4.93)  (2.86)  (3.50)  (3.44)  Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{\gamma}$$  I  II  III  IV  V  VI  VII  VIII  IX     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  32.00**  32.25**  32.31**  21.48**  32.07**  39.89**  33.66**  30.92**  34.79**     (5.24)  (5.05)  (5.10)  (5.30)  (5.46)  (7.71)  (5.20)  (6.51)  (5.66)  China, POST  –6.36**  –6.14**  –7.24**  –2.78**  –5.92**  –8.37**  –6.19**  –6.14**  6.03**     (2.19)  (1.93)  (2.98)  (1.56)  (2.28)  (2.93)  (1.96)  (2.10)  (1.99)  Europe  32.16**  32.46**  36.02**  21.58**  32.35**  39.72**  33.78**  31.13**  34.95**     (5.68)  (5.40)  (6.85)  (5.56)  (5.65)  (8.28)  (5.46)  (6.50)  (6.16)  Japan  25.91**  26.01**  26.33**  19.54**  25.87**  32.79**  27.57**  24.61**  27.63**     (3.25)  (3.25)  (3.37)  (3.66)  (3.32)  (5.37)  (3.37)  (4.29)  (3.80)  Japan, POST              0.24                             (1.15)              S Korea  33.94**  34.41**  38.51**  25.49**  34.40**  39.76**  35.46**  33.20**  36.06**     (4.58)  (4.65)  (6.10)  (4.84)  (4.73)  (6.45)  (4.52)  (5.65)  (5.22)  Backlog  –0.63**  –0.61**  –0.63**  –0.29**  –0.60**  –0.84**  –0.59**  –0.62**  0.74**     (0.17)  (0.16)  (0.17)  (0.16)  (0.17)  (0.24)  (0.15)  (0.18)  (0.19)  Docks/berths     –0.14  –0.14     –0.14     –0.15  –0.13  0.23        (0.11)  (0.12)     (0.11)     (0.11)  (0.11)  (0.14)  Max length     –0.001  –0.001     –0.001     –0.0009  –0.001  0.001        (0.0008)  (0.0009)     (0.0008)     (0.0008)  (0.0008)  (0.002)  Steel price  0.50**  0.51**  0.52**  0.78**  0.50**  0.50**  0.48**  0.52**  0.57**     (0.17)  (0.17)  (0.16)  (0.36)  (0.17)  (0.21)  (0.17)  (0.15)  (0.23)  $$t$$  0.14**  0.13**  0.04     0.13**  0.19**  0.13**  0.13**  0.15**     (0.05)  (0.05)  (0.06)     (0.06)  (0.07)  (0.05)  (0.05)  (0.05)  China$${*}$$t        0.13                             (0.10)                    Japan$${*}$$t        0.10                             (0.07)                    Year dummies  No  No  No  Yes  No  No  No  No        Curvature: $$c_{2}q^{\gamma}$$  $$c_{2}q^{2}$$  1.23**  1.20**  1.24**  0.53*  1.18**           1.42**     (0.35)  (0.32)  (0.34)  (0.33)  (0.36)           (0.38)  $$c_{2}*\text{Docks/berths}*q^{2}$$                 0.38**                             (0.11)           $$c_{2}q^{3}$$                    0.11**                             (0.03)        $$c_{2}$$                       2.47*     $$c_{2}q^{\gamma}$$:                       (1.57)     $$\gamma$$                       1.72**                             (0.15)     $$\sigma$$  12.07  11.67  12.13  5.11  11.48  17.31  11.48  11.76  13.08     (3.23)  (3.03)  (3.31)  (3.15)  (3.38)  (4.93)  (2.86)  (3.50)  (3.44)  $$^*p<0.1$$, $$^{**}p<0.05$$. Standard errors computed from 500 bootstrap samples. Time $$t$$ measured in quarters. Countries refer to country dummy variables. Column IX involves only existing shipyards. The starting point on the curvature of the cost function is a simple quadratic. Columns I and II are the simplest specifications: they control for the shipyard’s backlog, docks/berths, length of the largest dock, as well as for a linear time trend. It is important, however, to reject to the furthest extent possible explanations alternative to subsidies. To do so, in the other columns of Table 4 as well as the following tables, I incorporate rich time variation, perform placebos, adopt a flexible cost curvature, consider only existing shipyards and perform a number of miscellaneous other checks which I discuss next. Time variation and placebos: It is important to control for time-varying factors adequately to alleviate the concern that the estimated cost declines may be driven by unobserved time variation. The results are robust to any parametric function of time I have tried ($$e.g.$$ country specific time trends, polynomial trends); as an example, Specification III of Table 4 adds time trends specific to China and Japan. Specification IV moves away from parametric functions of time and adds year dummies; there is still a significant decline in Chinese costs, not surprisingly somewhat lower, at $$13\%$$. The most flexible specification in terms of time variation is to estimate China-year dummies. As expected, estimates are more noisy, yet as shown in Figure 7, Chinese costs fall post $$2006$$. One could argue that an arbitrary productivity process or technological improvements can also be consistent with these results. The production process of bulk carriers is old, however, without frequent technological advances. In addition, such a productivity process would have to feature a discontinuity in 2006 in China alone. Figure 7 View largeDownload slide China-year dummies Figure 7 View largeDownload slide China-year dummies Specification V of Table 4 shows that no significant changes occur in 2006 in other countries. Indeed, I add a Japan-post 2006 dummy and find that Japan’s costs seem to increase slightly, but the coefficient is not significant (similar findings are obtained if other countries are used). Cost curvature: Adopting a flexible cost specification is undoubtedly crucial in the subsidy measurement, as misspefication can confound the results. I therefore consider a number of alternative specifications for the cost function and show that the retrieved subsidies are robust. Columns VI–VIII of Table 4 generalize the quadratic cost function curvature. Column VI makes $$c_{2}'g_{2}(s_{jt}^{y},s_{t})$$ a linear function of docks/berths, to allow dependence of curvature on capital measures. Results are robust. At the average number of docks/berths the convexity parameter becomes $$1.1$$, close to the simple quadratic estimates. Column VII considers a cubic function of output. Finally, column VIII considers a general power function, so that:   \begin{equation} c_{2}\left(q,s_{jt}^{y},s_{t}\right)=c_{2}q^{\gamma} \end{equation} (16) for $$q=0,1,...,\bar{q}$$. I estimate both $$c_{2}$$ and $$\gamma$$. The estimated power is $$\hat{\gamma}=1.72$$ and a LR test cannot reject that it is different from 2. Next, I turn to a more general cost function that allows for a non-parametric curvature, so that:   \begin{equation} c\left(q,s_{jt}^{y},s_{t}\right)=c_{1}'g_{1}\left(s_{jt}^{y},s_{t}\right)q+c_{q}.\label{eq:flexible cost function} \end{equation} (17) I estimate $$\left\{ c_{0},c_{1},...,c_{\bar{q}}\right\} $$. Note that not all coefficients can be identified; some need to be restricted.35Table 5 presents the results for several different restrictions. In Column I, I set $$c_{0}=0$$ and $$c_{q}=\gamma_{1}q^{2}$$ for $$q=\bar{q}-1,\bar{q}$$; the parameters to be estimated are $$\left\{ c_{1},c_{2},...,c_{\bar{q}-2},\gamma_{1}\right\} $$. Given the large number of observations with $$q=0$$, it might be important to allow for a fixed cost. Thus, in column II of Table 5, I do not restrict $$c_{0}$$ to zero, but rather I set $$c_{q}=\gamma_{1}q^{2}$$ for $$q=\bar{q}-2,\bar{q}-1,\bar{q}$$ and estimate $$\left\{ c_{0},c_{1},c_{2},...,c_{\bar{q}-3},\gamma_{1}\right\} $$. Next, in column III, I allow $$c_{0}$$ to be country-specific and to change in China pre/post 2006. In other words, this specification allows for subsidies in the fixed cost. Interestingly, the estimated parameters suggest that the cost decline is not manifested in the fixed cost, but rather in positive production levels. The recovered subsidy is robust in all the above specifications. TABLE 5 Static cost estimates with flexible cost curvature Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{q}$$  I  II  III  IV     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  34.80**  38.48**  35.83**  28.96**     (5.36)  (7.55)  (5.50)  (6.04)  China, POST  –6.12**  –6.12**  –5.32**  –6.11**     (1.91)  (2.05)  (1.77)  (2.03)  Europe  35.04**  38.72**  39.36**  29.20**     (5.63)  (7.73)  (6.80)  (6.37)  Japan  28.49**  32.18**  31.51**  22.66**     (3.61)  (6.09)  (4.32)  (4.95)  S Korea  37.09**  40.78**  38.47**  31.27**     (4.79)  (7.20)  (5.37)  (5.72)  Backlog  –0.62**  –0.62**  –0.62**  –0.62**     (0.15)  (0.15)  (0.15)  (0.16)  Docks/berths  –0.13  –0.13  –0.13  –0.13     (0.11)  (0.11)  (0.11)  (0.12)  Max length  –0.001  –0.001  –0.0009  –0.001     (0.0008)  (0.0009)  (0.0008)  (0.0009)  Steel price  0.53**  0.53**  0.52**  0.53**     (0.17)  (0.16)  (0.16)  (0.18)  $$t$$  0.13**  0.13**  0.13**  0.13**     (0.04)  (0.05)  (0.05)  (0.05)     Curvature: $$c_{q}$$  $$c_{0}$$     17.49     4.39        (19.17)     (4.73)  $$c_{0}^{China}$$        8.96**              (3.95)     $$c_{0}^{China,Post}$$        0.88              (0.69)     $$c_{0}^{Europe}$$        12.85**              (4.74)     $$c_{0}^{Japan}$$        11.45**              (4.28)     $$c_{0}^{S.Korea}$$        9.12**              (4.08)     $$c_{1}$$  –1.42*  12.38  6.64**  11.63*     (0.81)  (14.70)  (2.72)  (9.75)  $$c_{2}$$  0.02  10.13  5.58**  22.36*     (1.47)  (10.38)  (1.80)  (15.28)  $$c_{3}$$  4.96*  11.38*  8.01**  34.92*     (2.55)  (6.53)  (2.12)  (21.17)  $$c_{4}$$  11.75**  14.47**  12.26**  49.99*     (4.18)  (4.50)  (3.43)  (27.60)  $$c_{5}$$  21.08**  20.11**  19.05**  66.64*     (6.52)  (6.73)  (5.44)  (34.45)  $$c_{6}$$  32.01**  27.35**           (9.31)  (11.00)        $$c_{7}$$  44.51**  36.15**           (12.55)  (15.88)        $$c_{8}$$  57.75**              (16.01)           $$\gamma_{1}$$  0.91**  0.71**  0.76**  3.95     (0.25)  (0.33)  (0.22)  (3.84)  $$\gamma_{2}$$           1.58**              (0.16)  $$\sigma$$  11.77  11.77  11.69  11.75     (2.92)  (2.96)  (2.87)  (3.25)  Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{q}$$  I  II  III  IV     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  34.80**  38.48**  35.83**  28.96**     (5.36)  (7.55)  (5.50)  (6.04)  China, POST  –6.12**  –6.12**  –5.32**  –6.11**     (1.91)  (2.05)  (1.77)  (2.03)  Europe  35.04**  38.72**  39.36**  29.20**     (5.63)  (7.73)  (6.80)  (6.37)  Japan  28.49**  32.18**  31.51**  22.66**     (3.61)  (6.09)  (4.32)  (4.95)  S Korea  37.09**  40.78**  38.47**  31.27**     (4.79)  (7.20)  (5.37)  (5.72)  Backlog  –0.62**  –0.62**  –0.62**  –0.62**     (0.15)  (0.15)  (0.15)  (0.16)  Docks/berths  –0.13  –0.13  –0.13  –0.13     (0.11)  (0.11)  (0.11)  (0.12)  Max length  –0.001  –0.001  –0.0009  –0.001     (0.0008)  (0.0009)  (0.0008)  (0.0009)  Steel price  0.53**  0.53**  0.52**  0.53**     (0.17)  (0.16)  (0.16)  (0.18)  $$t$$  0.13**  0.13**  0.13**  0.13**     (0.04)  (0.05)  (0.05)  (0.05)     Curvature: $$c_{q}$$  $$c_{0}$$     17.49     4.39        (19.17)     (4.73)  $$c_{0}^{China}$$        8.96**              (3.95)     $$c_{0}^{China,Post}$$        0.88              (0.69)     $$c_{0}^{Europe}$$        12.85**              (4.74)     $$c_{0}^{Japan}$$        11.45**              (4.28)     $$c_{0}^{S.Korea}$$        9.12**              (4.08)     $$c_{1}$$  –1.42*  12.38  6.64**  11.63*     (0.81)  (14.70)  (2.72)  (9.75)  $$c_{2}$$  0.02  10.13  5.58**  22.36*     (1.47)  (10.38)  (1.80)  (15.28)  $$c_{3}$$  4.96*  11.38*  8.01**  34.92*     (2.55)  (6.53)  (2.12)  (21.17)  $$c_{4}$$  11.75**  14.47**  12.26**  49.99*     (4.18)  (4.50)  (3.43)  (27.60)  $$c_{5}$$  21.08**  20.11**  19.05**  66.64*     (6.52)  (6.73)  (5.44)  (34.45)  $$c_{6}$$  32.01**  27.35**           (9.31)  (11.00)        $$c_{7}$$  44.51**  36.15**           (12.55)  (15.88)        $$c_{8}$$  57.75**              (16.01)           $$\gamma_{1}$$  0.91**  0.71**  0.76**  3.95     (0.25)  (0.33)  (0.22)  (3.84)  $$\gamma_{2}$$           1.58**              (0.16)  $$\sigma$$  11.77  11.77  11.69  11.75     (2.92)  (2.96)  (2.87)  (3.25)  $$^*p<0.1$$, $$^{**}p<0.05$$. Standard errors computed from 500 bootstrap samples. Time $$t$$ measured in quarters. Countries refer to country dummy variables. TABLE 5 Static cost estimates with flexible cost curvature Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{q}$$  I  II  III  IV     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  34.80**  38.48**  35.83**  28.96**     (5.36)  (7.55)  (5.50)  (6.04)  China, POST  –6.12**  –6.12**  –5.32**  –6.11**     (1.91)  (2.05)  (1.77)  (2.03)  Europe  35.04**  38.72**  39.36**  29.20**     (5.63)  (7.73)  (6.80)  (6.37)  Japan  28.49**  32.18**  31.51**  22.66**     (3.61)  (6.09)  (4.32)  (4.95)  S Korea  37.09**  40.78**  38.47**  31.27**     (4.79)  (7.20)  (5.37)  (5.72)  Backlog  –0.62**  –0.62**  –0.62**  –0.62**     (0.15)  (0.15)  (0.15)  (0.16)  Docks/berths  –0.13  –0.13  –0.13  –0.13     (0.11)  (0.11)  (0.11)  (0.12)  Max length  –0.001  –0.001  –0.0009  –0.001     (0.0008)  (0.0009)  (0.0008)  (0.0009)  Steel price  0.53**  0.53**  0.52**  0.53**     (0.17)  (0.16)  (0.16)  (0.18)  $$t$$  0.13**  0.13**  0.13**  0.13**     (0.04)  (0.05)  (0.05)  (0.05)     Curvature: $$c_{q}$$  $$c_{0}$$     17.49     4.39        (19.17)     (4.73)  $$c_{0}^{China}$$        8.96**              (3.95)     $$c_{0}^{China,Post}$$        0.88              (0.69)     $$c_{0}^{Europe}$$        12.85**              (4.74)     $$c_{0}^{Japan}$$        11.45**              (4.28)     $$c_{0}^{S.Korea}$$        9.12**              (4.08)     $$c_{1}$$  –1.42*  12.38  6.64**  11.63*     (0.81)  (14.70)  (2.72)  (9.75)  $$c_{2}$$  0.02  10.13  5.58**  22.36*     (1.47)  (10.38)  (1.80)  (15.28)  $$c_{3}$$  4.96*  11.38*  8.01**  34.92*     (2.55)  (6.53)  (2.12)  (21.17)  $$c_{4}$$  11.75**  14.47**  12.26**  49.99*     (4.18)  (4.50)  (3.43)  (27.60)  $$c_{5}$$  21.08**  20.11**  19.05**  66.64*     (6.52)  (6.73)  (5.44)  (34.45)  $$c_{6}$$  32.01**  27.35**           (9.31)  (11.00)        $$c_{7}$$  44.51**  36.15**           (12.55)  (15.88)        $$c_{8}$$  57.75**              (16.01)           $$\gamma_{1}$$  0.91**  0.71**  0.76**  3.95     (0.25)  (0.33)  (0.22)  (3.84)  $$\gamma_{2}$$           1.58**              (0.16)  $$\sigma$$  11.77  11.77  11.69  11.75     (2.92)  (2.96)  (2.87)  (3.25)  Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{q}$$  I  II  III  IV     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  34.80**  38.48**  35.83**  28.96**     (5.36)  (7.55)  (5.50)  (6.04)  China, POST  –6.12**  –6.12**  –5.32**  –6.11**     (1.91)  (2.05)  (1.77)  (2.03)  Europe  35.04**  38.72**  39.36**  29.20**     (5.63)  (7.73)  (6.80)  (6.37)  Japan  28.49**  32.18**  31.51**  22.66**     (3.61)  (6.09)  (4.32)  (4.95)  S Korea  37.09**  40.78**  38.47**  31.27**     (4.79)  (7.20)  (5.37)  (5.72)  Backlog  –0.62**  –0.62**  –0.62**  –0.62**     (0.15)  (0.15)  (0.15)  (0.16)  Docks/berths  –0.13  –0.13  –0.13  –0.13     (0.11)  (0.11)  (0.11)  (0.12)  Max length  –0.001  –0.001  –0.0009  –0.001     (0.0008)  (0.0009)  (0.0008)  (0.0009)  Steel price  0.53**  0.53**  0.52**  0.53**     (0.17)  (0.16)  (0.16)  (0.18)  $$t$$  0.13**  0.13**  0.13**  0.13**     (0.04)  (0.05)  (0.05)  (0.05)     Curvature: $$c_{q}$$  $$c_{0}$$     17.49     4.39        (19.17)     (4.73)  $$c_{0}^{China}$$        8.96**              (3.95)     $$c_{0}^{China,Post}$$        0.88              (0.69)     $$c_{0}^{Europe}$$        12.85**              (4.74)     $$c_{0}^{Japan}$$        11.45**              (4.28)     $$c_{0}^{S.Korea}$$        9.12**              (4.08)     $$c_{1}$$  –1.42*  12.38  6.64**  11.63*     (0.81)  (14.70)  (2.72)  (9.75)  $$c_{2}$$  0.02  10.13  5.58**  22.36*     (1.47)  (10.38)  (1.80)  (15.28)  $$c_{3}$$  4.96*  11.38*  8.01**  34.92*     (2.55)  (6.53)  (2.12)  (21.17)  $$c_{4}$$  11.75**  14.47**  12.26**  49.99*     (4.18)  (4.50)  (3.43)  (27.60)  $$c_{5}$$  21.08**  20.11**  19.05**  66.64*     (6.52)  (6.73)  (5.44)  (34.45)  $$c_{6}$$  32.01**  27.35**           (9.31)  (11.00)        $$c_{7}$$  44.51**  36.15**           (12.55)  (15.88)        $$c_{8}$$  57.75**              (16.01)           $$\gamma_{1}$$  0.91**  0.71**  0.76**  3.95     (0.25)  (0.33)  (0.22)  (3.84)  $$\gamma_{2}$$           1.58**              (0.16)  $$\sigma$$  11.77  11.77  11.69  11.75     (2.92)  (2.96)  (2.87)  (3.25)  $$^*p<0.1$$, $$^{**}p<0.05$$. Standard errors computed from 500 bootstrap samples. Time $$t$$ measured in quarters. Countries refer to country dummy variables. Overall, the results reveal that the non-parametric cost curvature is rather close to the simple quadratic curvature, suggesting that the latter is a good approximation. One may worry that it is the restriction that the cost for higher quantities follows a quadratic that leads to an overall quadratic-like function. To explore this possibility, I estimate (17) with $$c_{0}=0$$ and $$c_{q}=\gamma_{1}q^{\gamma_{2}}$$, for $$q=\bar{q}-5,...,\bar{q}$$. Results are in Column IV of Table 5. Interestingly, it turns out $$\hat{\gamma}_{2}=1.58$$.36 Existing shipyards: I next reestimate costs using only shipyards that already existed in $$2001$$. Column IX of Table 4 reports the results, which show that the same cost declines are retrieved when only old shipyards are considered. This finding is consistent with the raw data patterns: existing yards are responsible for a large portion (45%) of the increase in China’s production, as discussed in Section 2. Existing yards also produce 56% of total annual Chinese output on average. Estimating costs using only existing yards speaks to the following two concerns: (1) cost declines are driven by the new facilities built through China’s plan, which perhaps are more modern and have entirely different production capabilities (though, to reiterate, bulk shipbuilding technology is not subject to frequent technological innovations); (2) cost declines are driven by firms’ optimizing production under learning by doing (though in the next section I allow for such expertise accumulation through the backlog). Indeed, existing shipyards do not change technology and have already gone down their learning curve. Other checks: It is worth noting that results are also robust to adding several covariates, such as the shipyard’s age, total past production (capturing total experience), dummy variables for young ages to capture learning by doing (documented in military ships in Thompson, 2001) somewhat more flexibly, administrative region, number of employees (reported only in a subset of yards). Working with tons produced, rather than ships, and thus using a tobit model also does not alter findings. One might also worry that there was a coincident increase in demand for Chinese ships by Chinese shipowners; as discussed in Section 2, however, the overwhelming majority of Chinese built ships are in fact exported during the sample period (about 80%). Another concern is that Chinese shipyards may have switched production towards Handysize vessels and away from other ship types, so that the estimated cost decline reflects a different allocation across products rather than subsidies. As discussed in Section 2, however, the same patterns of increased production are prevalent in virtually all ship types (Table 1). In addition, the market share of Handysize bulk carriers within China remains practically flat over time during the sample. As regional governments in China can play an important role (Section 2), I consider a specification where they implement the subsidy plans at different dates and magnitudes. As no official documentation was found on implementation dates, I consider the first quarter that new shipbuilding docks/berths come online and divide regions into three groups. Results are similar to prior specifications and suggest that the last region to implement, also has the lowest subsidy level. I also explore whether any firms went through restructuring, mergers or changes in ownership. A total of ten Chinese firms are affected by such changes both within and outside my sample. Removing them from the sample leads to the same results. One may be concerned that the estimated cost declines are somehow driven by the estimated willingness to pay, $$VE_{j}^{o}\left(s_{t}\right)$$, (for instance due to the different pre and post expectations and value functions). To address this concern I estimate costs without used prices, using an average quarterly price (across shipyards and countries) of a new ship, constructed by Clarksons. I find that estimated subsidies are significant and of the same magnitude. Finally, although the vast majority of dock expansion in 2006 results from new firm entry, some shipyards do acquire new docks. Estimating the model using only existing shipyards that did not increase their docks gives the same results; this is consistent with the raw data pattern that these shipyards account for 33% of China’s output expansion (recall that new yards account for another 55%).37 5.2.2. Dynamic shipbuilders I now acknowledge that shipyards may be facing dynamic considerations, as past production can affect their current costs. My empirical approach employs sparse approximation techniques to solve the shipyard’s dynamic programming problem. The shipyard’s optimal production now obeys (9). To ease notation, rename the shipyard state $$x=\left(s_{jt}^{y},s_{t}\right)$$ and $$x^{\prime}=\left(s_{jt+1}^{y},s_{t+1}\right)$$ and suppress $$\left(j,t\right)$$. Recall the optimal policy thresholds that define the shipyard’s optimal production (Lemma 1):   \begin{equation} A\left(x,q\right)=\frac{1}{\sigma}\left[VE^{o}\left(x\right)+\left(c\left(q,x\right)-c\left(q+1,x\right)\right)+\beta\left(CV^{y}\left(x,q+1\right)-CV^{y}\left(x,q\right)\right)\right]\label{A def} \end{equation} (19) for $$q=0,1,...,\overline{q}-1$$. To estimate the cost parameters, I maximize the likelihood (15) with choice probabilities:   \begin{equation} \Pr\left(q^{\ast}=0|x\right)\equiv p_{0}\left(x\right)=\Pr\left(\varepsilon\geq A\left(x,0\right)\right) \end{equation} (20)  \[ \Pr\left(q^{\ast}=q|x\right)\equiv p_{q}\left(x\right)=\Pr\left(\varepsilon\leq A\left(x,q-1\right)\right)-\Pr\left(\varepsilon\leq A\left(x,q\right)\right) \]  \[ \Pr\left(q^{\ast}=\overline{q}|x\right)\equiv p_{\overline{q}}\left(x\right)=\Pr\left(\varepsilon\leq A\left(x,\overline{q}-1\right)\right). \] Maximizing this likelihood function would be trivial if the continuation value $$CV^{y}\left(x,q\right)$$ were known. This is the standard difficulty of estimating dynamic setups and to address it, I adopt a novel approach that proceeds in two steps (following the recent literature, $$e.g.$$Hotz and Miller, 1993 and Bajari et al., 2007). First, I invert observed choice probabilities to directly obtain the optimal policy thresholds non-parametrically. Second, I show that the latter lead to a closed-form expression for ex ante optimal per period payoffs, which in turn are sufficient to obtain the value function. I next describe this approach in detail. For the first step, note from (20) that clearly, the choice probabilities are a one-to-one function of the optimal policy thresholds $$A\left(x,q\right)$$. Therefore the latter can be recovered from the observed choice probabilities using:38  \begin{equation} A\left(x,q\right)=\Phi^{-1}\left(1-\sum_{k=0}^{q}p_{k}\left(x\right)\right),\,\,\text{for }q=0,1,...,\overline{q}-1, \end{equation} (21) where $$\Phi\left(\cdot\right)$$ is the standard normal distribution. Note that $$A\left(x,q\right)$$ is (weakly) decreasing in $$q$$. Most important, if $$A\left(x,q\right)$$ is known, so is the optimal policy: for any $$\left(x,\varepsilon\right)$$,   \[ q^{\ast}\left(x,\varepsilon\right)=\widehat{q},\text{ such that }\varepsilon\in\left[A\left(x,\widehat{q}\right),A\left(x,\widehat{q}-1\right)\right]. \] For the second step, I show that once the optimal policy is known, the value function can be recovered in a straightforward manner. Indeed, consider shipyard $$j$$’s Bellman equation (7) which I repeat here for convenience:   \[ V^{y}\left(x,\varepsilon\right)=\max_{0\leq q\leq\overline{q}}\pi^{y}\left(x,q,\varepsilon\right)+\beta E_{\varepsilon^{\prime},x^{\prime}}\left[V^{y}\left(x^{\prime},\varepsilon^{\prime}\right)|x,q\right], \] where $$\pi^{y}\left(x,q,\varepsilon\right)\equiv$$$$VE^{o}\left(x\right)q-c\left(q,x\right)-\sigma q\varepsilon$$. The ex ante value function under the recovered optimal policy becomes:   \begin{equation} V^{y}\left(x\right)\equiv E_{\varepsilon}V^{y}\left(x,\varepsilon\right)=E_{\varepsilon}\left[\pi^{y}\left(x,q^{\ast}\left(x,\varepsilon\right),\varepsilon\right)+\beta E_{\varepsilon^{\prime},x^{\prime}}\left[V^{y}\left(x^{\prime},\varepsilon^{\prime}\right)|x,q^{\ast}\left(x,\varepsilon\right)\right]\right].\label{bellman estimation} \end{equation} (22) If the ex ante per period profit, $$E_{\varepsilon}\pi^{y}\left(x,q^{\ast}\left(x,\varepsilon\right),\varepsilon\right)$$, were known, then one could solve for the ex ante value function from (22). This can be done in several ways, such as state space discretization and matrix inversion, or parametric approximation; I opt for the latter because of the large dimension of the state space. In particular, I approximate the value function by a polynomial function of the state variables, so that:   \[ V^{y}\left(x\right)=\gamma^{y}f^{y}\left(x\right) \] then (22) becomes   \begin{equation} \left(f^{y}\left(x\right)-\beta E_{x',\varepsilon}\left[f^{y}\left(x^{\prime}\right)|x,q^{\ast}\left(x,\varepsilon\right)\right]\right)\gamma^{y}=E_{\varepsilon}\pi^{y}\left(x,q^{\ast}\left(x,\varepsilon\right),\varepsilon\right).\label{bellman estimation LASSO} \end{equation} (23) It is now possible to estimate the approximating parameters $$\gamma^{y}$$ by ordinary regression, provided the ex ante profit can be computed. Note, however, that the large number of states leads to an exploding number of possible terms in $$f^{y}\left(x\right)$$; in addition, choosing which terms to include in $$f^{y}\left(x\right)$$ can be an arduous process. Instead, I estimate the sparse vector $$\gamma^{y}$$ via LASSO, which circumvents these issues. Using LASSO to solve for approximate value functions can be useful to the many empirical applications of dynamic setups. Indeed, in the Appendix I present a Monte Carlo exercise that evaluates the performance of this method and shows that it does very well compared to standard approaches. I now only need to show how $$E_{\varepsilon}\pi^{y}\left(x,q^{\ast}\left(x,\varepsilon\right),\varepsilon\right)$$ is computed. Under the assumption of quadratic costs ($$i.e.$$$$c_{1}\left(x\right)q+c_{2}\left(x\right)q{}^{2}$$, where $$c_{1}(x)=c_{1}'g_{1}(x)$$ and $$c_{2}(x)=c_{2}'g_{2}(x)$$), ex ante per period payoffs become:   \[ E_{\varepsilon}\pi^{y}\left(x,q^{\ast}\left(x,\varepsilon\right),\varepsilon\right)= \]  \begin{eqnarray} & = & E_{\varepsilon}\left[VE^{o}\left(x\right)q^{\ast}\left(x,\varepsilon\right)-c_{1}\left(x\right)q^{\ast}\left(x,\varepsilon\right)+c_{2}\left(x\right)q^{\ast}\left(x,\varepsilon\right)^{2}-\sigma q^{\ast}\left(x,\varepsilon\right)\varepsilon\right]\nonumber \\ \nonumber \\ & = & \left(VE^{o}\left(x\right)-c_{1}\left(x\right)\right)E_{\varepsilon}q^{\ast}\left(x,\varepsilon\right)+c_{2}\left(x\right)E_{\varepsilon}q^{\ast}\left(x,\varepsilon\right)^{2}-\sigma E_{\varepsilon}\left[q^{\ast}\left(x,\varepsilon\right)\varepsilon\right]. \end{eqnarray} (24) I show in the Appendix that $$E_{\varepsilon}q^{\ast}(x,\varepsilon)$$, $$E_{\varepsilon}(q^{\ast}(x,\varepsilon)^{2})$$ and $$E_{\varepsilon}(q^{\ast}(x,\varepsilon)\varepsilon)$$ are simple functions of $$A(x,q)$$. To sum up, the estimation proceeds as follows (further details are in the Appendix): (1) Estimate the policy thresholds $$A\left(x,q\right)$$ using (21). (2) Compute the the ex ante per period profit, $$E_{\varepsilon}\pi^{y}\left(x,q^{\ast}\left(x,\varepsilon\right),\varepsilon\right)$$. (3) At each guess of the parameters $$\left(c_{1}',c_{2}',\sigma\right)$$ in the optimization of the likelihood (15): (a) Solve for the approximate value function parameters $$\gamma^{y}$$ from (23) (b) Using $$\gamma^{y}$$, compute the choice probabilities and update $$\left(c_{1}',c_{2}',\sigma\right)$$. Results:Table 6 gives the Maximum Likelihood estimate of the cost function for dynamic shipyards. The implied subsidy is in the order of 18% or 4.3 billion U.S. dollars paid between 2006 and 2012, similarly to the case of static shipyards. Costs are decreasing in the current backlog, consistent with accumulation of expertise or economies of scale. The magnitude of the backlog coefficient suggests that the cost decline accruing from past production is about 11% of total cost at the observed production choices. More docks/berths, as well as longer docks decrease costs. The estimated cost function of dynamic shipyards is somewhat more convex than the one of static shipyards. Since accumulating a backlog decreases future costs and yards take this into account, higher cost parameters are needed to justify the observed low production levels. TABLE 6 Dynamic shipyards cost estimates Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{2}$$  I  Linear term: $$c_{1}^{\prime}g_{1}(s)q$$     China  43.01**     (9.31)  China, POST  –7.85**     (2.91)  Europe  43.45**     (9.81)  Japan  34.49**     (6.16)  S Korea  44.34**     (8.16)  Backlog  –0.64**     (0.20)  Docks/berths  –0.17     (0.12)  Max length  –0.001     (0.001)  Steel price  0.55**     (0.18)  $$t$$  0.003     (0.04)  Curvature: $$c_{2}q^{2}$$  1.66**     (0.55)  $$\sigma$$  15.46     (5.04)  Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{2}$$  I  Linear term: $$c_{1}^{\prime}g_{1}(s)q$$     China  43.01**     (9.31)  China, POST  –7.85**     (2.91)  Europe  43.45**     (9.81)  Japan  34.49**     (6.16)  S Korea  44.34**     (8.16)  Backlog  –0.64**     (0.20)  Docks/berths  –0.17     (0.12)  Max length  –0.001     (0.001)  Steel price  0.55**     (0.18)  $$t$$  0.003     (0.04)  Curvature: $$c_{2}q^{2}$$  1.66**     (0.55)  $$\sigma$$  15.46     (5.04)  $$^*p<0.1$$, $$^{**}p<0.05$$. Standard errors computed from 500 bootstrap samples. Time $$t$$ measured in quarters. Countries refer to country dummy variables. TABLE 6 Dynamic shipyards cost estimates Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{2}$$  I  Linear term: $$c_{1}^{\prime}g_{1}(s)q$$     China  43.01**     (9.31)  China, POST  –7.85**     (2.91)  Europe  43.45**     (9.81)  Japan  34.49**     (6.16)  S Korea  44.34**     (8.16)  Backlog  –0.64**     (0.20)  Docks/berths  –0.17     (0.12)  Max length  –0.001     (0.001)  Steel price  0.55**     (0.18)  $$t$$  0.003     (0.04)  Curvature: $$c_{2}q^{2}$$  1.66**     (0.55)  $$\sigma$$  15.46     (5.04)  Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{2}$$  I  Linear term: $$c_{1}^{\prime}g_{1}(s)q$$     China  43.01**     (9.31)  China, POST  –7.85**     (2.91)  Europe  43.45**     (9.81)  Japan  34.49**     (6.16)  S Korea  44.34**     (8.16)  Backlog  –0.64**     (0.20)  Docks/berths  –0.17     (0.12)  Max length  –0.001     (0.001)  Steel price  0.55**     (0.18)  $$t$$  0.003     (0.04)  Curvature: $$c_{2}q^{2}$$  1.66**     (0.55)  $$\sigma$$  15.46     (5.04)  $$^*p<0.1$$, $$^{**}p<0.05$$. Standard errors computed from 500 bootstrap samples. Time $$t$$ measured in quarters. Countries refer to country dummy variables. A further interesting feature of the dynamic model is that it allows one to compute the expected value of all new Chinese shipyards that are born through China’s government plan. I find that this equals 16 billion U.S. dollars. One can think of this amount as a rough estimate of the order of magnitude of the costs of building these shipyards. The static and dynamic formulations yield similar results in terms of subsidy detection and have similar fit, with the dynamic model having a somewhat higher likelihood. It is not as surprising that retrieved subsidies are similar in the two models—the increase in China’s market share is too sharp to be driven by dynamic decision-making. In the next section, I explore the implications of the static and dynamic model, as well as of some extra models in which dynamics are shut down. As discussed then, when shipyards accumulate experience that suppresses their future costs, market share gains multiply and lead to a somewhat larger reallocation favourable to China. 6. Quantifying the Implications of Subsidies In this section, I quantitatively assess the degree to which industrial subsidies contribute to China’s rapid emergence as a world leader in the shipbuilding industry. In particular, I evaluate the impact of government interventions on production reallocation across countries, industry prices, costs, and consumer surplus. I use my model to predict the evolution of the industry in the absence of China’s government shipbuilding plan, by removing both the cost subsidies retrieved in Section 5, as well as the new shipbuilding facilities. This counterfactual quantifies the adverse trade effects from these two interventions, which are considered actionable by the WTO. Moreover, I assess the relative contribution of the new shipyards by performing a counterfactual that removes the new facilities but maintains the detected cost subsidies. To implement the main counterfactual of “no interventions”, I assume that shipyards keep their pre-2006 costs and docks and new shipyards are removed. To implement the “no entrants” counterfactual, new shipyards are removed and existing shipyards keep their post-2006 cost functions and docks.39 Details on the implementation of these counterfactuals can be found in the Appendix. As shown in Table 7, the industrial subsidies led to substantial reallocation in production, by increasing China’s market share and decreasing Japan’s share: if the plan is removed, China’s market share falls from $$49\%$$ to 15%, while Japan’s share increases from $$46\%$$ to $$78\%$$. If only the new shipyards are removed, China’s share falls from $$50\%$$ to about $$40\%$$, revealing that new facilities played an important but not the predominant part in China’s expansion. This is consistent with the raw data patterns of Section 2 that indicated that both new and existing facilities contributed to China’s dramatic growth. Figure 8 depicts China’s production in the data, as well as the baseline and two counterfactual worlds. In addition to depicting graphically the evolution of the industry in the absence of subsidies, this graph also shows that the model’s fit is good since the baseline simulation is not far from the data. TABLE 7 Counterfactual results    Baseline  No entrants  No interventions  Market share, China  49%  38%  15%  Market share, Japan  46%  58%  78%  Ship price, China  20.29  20.7  21.7  Ship price, Japan  21.4  21.9  22.8  Total shipyard profits, China  4,807  3,186  827  Total shipyard profits, Japan  4,724  5,040  5,132  Freight rate (price per voyage)  1.22  1.23  1.26  Consumer surplus (shippers)  5,968  5,826  5,569  Industry AVC  1.98  2.3  2.34     Baseline  No entrants  No interventions  Market share, China  49%  38%  15%  Market share, Japan  46%  58%  78%  Ship price, China  20.29  20.7  21.7  Ship price, Japan  21.4  21.9  22.8  Total shipyard profits, China  4,807  3,186  827  Total shipyard profits, Japan  4,724  5,040  5,132  Freight rate (price per voyage)  1.22  1.23  1.26  Consumer surplus (shippers)  5,968  5,826  5,569  Industry AVC  1.98  2.3  2.34  Prices, surplus and average variable cost (AVC) measured in million U.S. Dollars. Profits and surplus refer to the total amount between 2006 and 2012. TABLE 7 Counterfactual results    Baseline  No entrants  No interventions  Market share, China  49%  38%  15%  Market share, Japan  46%  58%  78%  Ship price, China  20.29  20.7  21.7  Ship price, Japan  21.4  21.9  22.8  Total shipyard profits, China  4,807  3,186  827  Total shipyard profits, Japan  4,724  5,040  5,132  Freight rate (price per voyage)  1.22  1.23  1.26  Consumer surplus (shippers)  5,968  5,826  5,569  Industry AVC  1.98  2.3  2.34     Baseline  No entrants  No interventions  Market share, China  49%  38%  15%  Market share, Japan  46%  58%  78%  Ship price, China  20.29  20.7  21.7  Ship price, Japan  21.4  21.9  22.8  Total shipyard profits, China  4,807  3,186  827  Total shipyard profits, Japan  4,724  5,040  5,132  Freight rate (price per voyage)  1.22  1.23  1.26  Consumer surplus (shippers)  5,968  5,826  5,569  Industry AVC  1.98  2.3  2.34  Prices, surplus and average variable cost (AVC) measured in million U.S. Dollars. Profits and surplus refer to the total amount between 2006 and 2012. Figure 8 View largeDownload slide China’s production in the data, the baseline model and the two counterfactuals Figure 8 View largeDownload slide China’s production in the data, the baseline model and the two counterfactuals Table 7 also compares ship prices in the baseline and counterfactual worlds and shows that ship prices are higher for all countries in the absence of China’s subsidization plans, by about 7%. This is not surprising, given that China’s subsidies shifted supply outward. Next, I turn to profits, shipper surplus and costs, shown in the lower half of Table 7. China’s total shipbuilding profits would be substantially lower in the absence of any interventions: from almost 5 billion to less than 1 billion (this is inclusive of the subsidy cost). At the same time, Japan’s profits have gone down because of Chinese subsidies by about 10%. In this model, shipowners neither gain, nor lose from subsidies: because of the free entry condition in shipping, they are always indifferent between buying a ship or not (existing shipowners do lose, however, because of the unexpected negative shock to their asset value). Shippers of cargo, however, gain from subsidies as these lead to higher shipbuilding production and thus to a larger fleet. I use the demand curve estimated in Section 5.1.1 and in the Appendix to compute shipping prices and shipper surplus.40 As shown in Table 7, the freight rate is moderately higher (by $$3\%$$) in the absence of Chinese subsidies. The difference in freight rates, however, increases over time between 2006 and 2012: because of time to build it takes time until the different worlds lead to different fleet levels. Indeed, between 2009 and 2012 freight rates are higher by 5%. As a result, cargo shippers benefit from Chinese subsidies; their consumer surplus is higher by 7% and increases over time (between 2009 and 2012 consumer surplus is higher by 9%). Note that between 2006 and 2012 the cost subsidies alone ($$i.e.$$ ignoring the additional costs of building new yards), are estimated to have cost the government about 4 billion U.S. Dollars and resulted in consumer surplus gains of 400 million U.S. dollars. This calculation implies that a frequent assertion that China developed shipbuilding to benefit from low freight rates for its trade seems to be unsubstantiated; indeed, the benefits of subsidies to shipping are minimal and perhaps the Chinese government is aspiring also to externalities to different sectors ($$e.g.$$ steel, defense) or, even, national pride (Grossman, 1990). Finally, I consider the cost implications of subsidies. I compute the average cost of producing a ship at the industry level and find that as expected, subsidies decrease costs of production; see last row of Table 7. If I decompose the average cost to subsidies and market share allocation, however, this picture is entirely different. As implied by the numbers in Table 7, the change in industry average cost brought about by production subsidies is 0.36 million. The role of subsidies in this change in average variable cost turns out to be $$-3.8$$ million, implying a negative reallocation effect equal to $$0.36-3.8=-3.44$$ million.41 In other words, the subsidization in costs should have led to a much larger decline in the industry average cost of production; but as subsidies shift production away from the low-cost Japanese shipyards towards the high-cost Chinese shipyards, the industry produces at a much higher average cost net of subsidies. The role of dynamics: Government subsidies are often related to dynamic considerations, and this is true in this application as well: the long construction lags and the positive production feedback of learning by doing are important features of shipbuilding.42 I now explore the importance of these dynamics and their interaction with subsidies. To do so, I consider two extra models and contrast them to the estimated one: (1) removes the positive production feedback ($$i.e.$$ it sets the backlog coefficient equal to zero in the cost function); (2) removes the time to build. Table 8 reports some results. TABLE 8 The role of dynamics    I Estimated model  II No dynamic feedback  III No TTB  $$\%\varDelta$$ Market share, China  $$69\%$$  $$60\%$$  $$61\%$$  Average production baseline, China  $$22$$  $$20.7$$  $$28$$  Ship price baseline, China  $$20.29$$  $$21$$  $$21.6$$  $$\%\varDelta$$ Ship price, China  $$-2\%$$  $$-5\%$$  $$-3\%$$  $$\%\varDelta$$ Total shipyard profits, China  $$83\%$$  $$79\%$$  $$79\%$$  $$\%\varDelta$$ Freight rate (price per voyage)  $$-3.3\%$$  $$-3.2\%$$  $$-6\%$$     I Estimated model  II No dynamic feedback  III No TTB  $$\%\varDelta$$ Market share, China  $$69\%$$  $$60\%$$  $$61\%$$  Average production baseline, China  $$22$$  $$20.7$$  $$28$$  Ship price baseline, China  $$20.29$$  $$21$$  $$21.6$$  $$\%\varDelta$$ Ship price, China  $$-2\%$$  $$-5\%$$  $$-3\%$$  $$\%\varDelta$$ Total shipyard profits, China  $$83\%$$  $$79\%$$  $$79\%$$  $$\%\varDelta$$ Freight rate (price per voyage)  $$-3.3\%$$  $$-3.2\%$$  $$-6\%$$  Estimated model refers to the estimated dynamic model. No dynamic feedback sets the parameter of the backlog equal to zero in the cost function. No TTB assumes that there is no time to build. $$\%\varDelta$$ is a % change between the baseline world (Chinese subsidies) and a world with no Chinese interventions. TABLE 8 The role of dynamics    I Estimated model  II No dynamic feedback  III No TTB  $$\%\varDelta$$ Market share, China  $$69\%$$  $$60\%$$  $$61\%$$  Average production baseline, China  $$22$$  $$20.7$$  $$28$$  Ship price baseline, China  $$20.29$$  $$21$$  $$21.6$$  $$\%\varDelta$$ Ship price, China  $$-2\%$$  $$-5\%$$  $$-3\%$$  $$\%\varDelta$$ Total shipyard profits, China  $$83\%$$  $$79\%$$  $$79\%$$  $$\%\varDelta$$ Freight rate (price per voyage)  $$-3.3\%$$  $$-3.2\%$$  $$-6\%$$     I Estimated model  II No dynamic feedback  III No TTB  $$\%\varDelta$$ Market share, China  $$69\%$$  $$60\%$$  $$61\%$$  Average production baseline, China  $$22$$  $$20.7$$  $$28$$  Ship price baseline, China  $$20.29$$  $$21$$  $$21.6$$  $$\%\varDelta$$ Ship price, China  $$-2\%$$  $$-5\%$$  $$-3\%$$  $$\%\varDelta$$ Total shipyard profits, China  $$83\%$$  $$79\%$$  $$79\%$$  $$\%\varDelta$$ Freight rate (price per voyage)  $$-3.3\%$$  $$-3.2\%$$  $$-6\%$$  Estimated model refers to the estimated dynamic model. No dynamic feedback sets the parameter of the backlog equal to zero in the cost function. No TTB assumes that there is no time to build. $$\%\varDelta$$ is a % change between the baseline world (Chinese subsidies) and a world with no Chinese interventions. Let us first compare the baseline to a model where the backlog has no impact on future costs; see columns I and II. The latter model essentially has no dynamics and in this case, Chinese shipyards experience a lower increase in their market share by 10%. In the baseline world, shipyards are accumulating experience that decreases their future costs and thus market share gains multiply and lead to more reallocation favourable to China. This is consistent with the pro-subsidy arguments of infant industries (though note that here learning is private and provides no externalities, so that there is no real rationale for the government to intervene). Figure 9 illustrates that the industry is more volatile when shipyards face a dynamic production feedback (column I). When accumulated backlog lowers costs, firms have an incentive to overproduce. In turn, this overproduction during the boom lowers future prices and in the subsequent bust, firms produce less. In contrast, a world that lacks this dynamic feedback (column II) is smoother. Column II of Table 8 also shows that China produces less on average and prices are higher. Figure 9 View largeDownload slide Number of ships produced by China in the data, the estimated model, the no dynamic feedback counterfactual and the no time to build counterfactual Figure 9 View largeDownload slide Number of ships produced by China in the data, the estimated model, the no dynamic feedback counterfactual and the no time to build counterfactual Next, I consider a world with virtually no time to build (I allow for a one period construction lag to maintain the model’s timing); see column III of Table 8. As argued in Kalouptsidi (2014), time to build tends to smooth out fluctuations. Indeed, in the absence of time to build, firms tend to respond more to shocks: in the case of a positive macro shock, more shipowners enter as ships are delivered instantly and thus shipyards produce more. In contrast, under construction lags (baseline, column I), a positive shock may die out by the time the ship is delivered and thus shipowners demand fewer ships. This is illustrated in Figure 9. As shown in column III of Table 8 as well as Figure 9, under no time to build, average ship production is substantially higher. As the fleet is built instantly, we also see the impact of subsidies on freight rates much faster: the latter are 6% lower (rather than 3% in column I), consistent with the more long-run impact of the baseline world. 7. Robustness analysis In this section I present several robustness exercises to evaluate my empirical strategy. In particular, I explore robustness with respect to: (1) the state space; (2) the VAR model for state transitions; (3) the LASSO penalty parameter in the shipyard value function approximation. The latter is complemented by a Monte Carlo (see the Appendix). This section is self-contained and the reader may omit it and proceed to the conclusion if desired. State space and agent expectations: As described in Section 5.1.2, I employ a parsimonious VAR model to capture agents’ perceptions of the aggregate state transition process. Here, I explore the robustness of this specification. First, I consider a more flexible VAR model that is different before and after 2006 and includes a time trend. I estimate it via LASSO since the number of parameters is now large. In particular, the model sets   \[ s_{t}=\left(R_{0}^{pre}+R^{pre}s_{t-1}+R_{\tau}^{pre}t\right)1\{t<2006\}+\left(R_{0}^{post}+R^{post}s_{t-1}+R_{\tau}^{post}t\right)1\{t\geq2006\}+\xi_{t}. \] Note that the process for the demand and steel prices remains unchanged—that is the corresponding rows of the VAR are the same as in the baseline. The estimated cost parameters are in Column I of Table 9 and are close to the baseline specifications of Section 5.2.1. TABLE 9 Dynamic cost estimates — robustness specifications Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{2}$$  I  II  III     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  41.78**  38.55**  41.08**     (7.78)  (5.48)  (9.23)  China, POST  –8.13**  –6.70**  –9.50**     (2.50)  (1.96)  (3.47)  Europe  41.92**  38.91**  43.59**     (8.36)  (5.55)  (10.34)  Japan  33.37**  31.43**  32.19**     (5.20)  (3.58)  (5.84)  S Korea  42.45**  40.06**  41.64**     (6.99)  (4.85)  (8.15)  Backlog  –0.61**  –0.56**  –0.81**     (0.17)  (0.12)  (0.22)  Docks/berths  –0.16  –0.13*  –0.21     (0.13)  (0.07)  (0.16)  Max length  –0.001  –0.001*  –0.003*     (0.001)  (0.0007)  (0.001)  Steel price  0.62**  0.48**  0.25     (0.17)  (0.13)  (0.23)  $$t$$  0.005  0.03  0.42**     (0.03)  (0.02)  (0.09)  Curvature: $$c_{2}q^{2}$$  1.62**  1.46**  2.12**     (0.46)  (0.31)  (0.60)  $$\sigma$$  15.04  13.56  19.77     (4.16)  (2.78)  (5.36)  Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{2}$$  I  II  III     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  41.78**  38.55**  41.08**     (7.78)  (5.48)  (9.23)  China, POST  –8.13**  –6.70**  –9.50**     (2.50)  (1.96)  (3.47)  Europe  41.92**  38.91**  43.59**     (8.36)  (5.55)  (10.34)  Japan  33.37**  31.43**  32.19**     (5.20)  (3.58)  (5.84)  S Korea  42.45**  40.06**  41.64**     (6.99)  (4.85)  (8.15)  Backlog  –0.61**  –0.56**  –0.81**     (0.17)  (0.12)  (0.22)  Docks/berths  –0.16  –0.13*  –0.21     (0.13)  (0.07)  (0.16)  Max length  –0.001  –0.001*  –0.003*     (0.001)  (0.0007)  (0.001)  Steel price  0.62**  0.48**  0.25     (0.17)  (0.13)  (0.23)  $$t$$  0.005  0.03  0.42**     (0.03)  (0.02)  (0.09)  Curvature: $$c_{2}q^{2}$$  1.62**  1.46**  2.12**     (0.46)  (0.31)  (0.60)  $$\sigma$$  15.04  13.56  19.77     (4.16)  (2.78)  (5.36)  $$^{*}p<0.1$$, $$^{**}p<0.05$$. Standard errors computed from 500 bootstrap samples. Time $$t$$ measured in quarters. Countries refer to country dummy variables. Specification I considers a more flexible VAR model for state transitions. Specification II considers the total fleet and mean age as state variables. Specification III considers the entire fleet age distribution as a state variable. TABLE 9 Dynamic cost estimates — robustness specifications Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{2}$$  I  II  III     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  41.78**  38.55**  41.08**     (7.78)  (5.48)  (9.23)  China, POST  –8.13**  –6.70**  –9.50**     (2.50)  (1.96)  (3.47)  Europe  41.92**  38.91**  43.59**     (8.36)  (5.55)  (10.34)  Japan  33.37**  31.43**  32.19**     (5.20)  (3.58)  (5.84)  S Korea  42.45**  40.06**  41.64**     (6.99)  (4.85)  (8.15)  Backlog  –0.61**  –0.56**  –0.81**     (0.17)  (0.12)  (0.22)  Docks/berths  –0.16  –0.13*  –0.21     (0.13)  (0.07)  (0.16)  Max length  –0.001  –0.001*  –0.003*     (0.001)  (0.0007)  (0.001)  Steel price  0.62**  0.48**  0.25     (0.17)  (0.13)  (0.23)  $$t$$  0.005  0.03  0.42**     (0.03)  (0.02)  (0.09)  Curvature: $$c_{2}q^{2}$$  1.62**  1.46**  2.12**     (0.46)  (0.31)  (0.60)  $$\sigma$$  15.04  13.56  19.77     (4.16)  (2.78)  (5.36)  Cost function $$C(q,s)=c_{1}^{\prime}g_{1}(s)q+c_{2}q^{2}$$  I  II  III     Linear term: $$c_{1}^{\prime}g_{1}(s)q$$  China  41.78**  38.55**  41.08**     (7.78)  (5.48)  (9.23)  China, POST  –8.13**  –6.70**  –9.50**     (2.50)  (1.96)  (3.47)  Europe  41.92**  38.91**  43.59**     (8.36)  (5.55)  (10.34)  Japan  33.37**  31.43**  32.19**     (5.20)  (3.58)  (5.84)  S Korea  42.45**  40.06**  41.64**     (6.99)  (4.85)  (8.15)  Backlog  –0.61**  –0.56**  –0.81**     (0.17)  (0.12)  (0.22)  Docks/berths  –0.16  –0.13*  –0.21     (0.13)  (0.07)  (0.16)  Max length  –0.001  –0.001*  –0.003*     (0.001)  (0.0007)  (0.001)  Steel price  0.62**  0.48**  0.25     (0.17)  (0.13)  (0.23)  $$t$$  0.005  0.03  0.42**     (0.03)  (0.02)  (0.09)  Curvature: $$c_{2}q^{2}$$  1.62**  1.46**  2.12**     (0.46)  (0.31)  (0.60)  $$\sigma$$  15.04  13.56  19.77     (4.16)  (2.78)  (5.36)  $$^{*}p<0.1$$, $$^{**}p<0.05$$. Standard errors computed from 500 bootstrap samples. Time $$t$$ measured in quarters. Countries refer to country dummy variables. Specification I considers a more flexible VAR model for state transitions. Specification II considers the total fleet and mean age as state variables. Specification III considers the entire fleet age distribution as a state variable. Next, I also alter the state space. As a reminder, the baseline industry state space includes the number of ships younger than 20 years old, the number of ships older than 20 years old, the total backlog, demand for shipping services, and steel prices. I consider two alternative specifications of the state space. First, I explore different statistics of the age distribution and in particular, the total fleet and the mean age of the fleet. Similar to above, I use a flexible VAR model for this state’s transition. Results are presented in Column II of Table 9 and are robust. Second, I consider a more detailed state space: the entire ship age distribution ($$i.e.$$ number of ships $$0,1,2,\,...,\,20+$$ years old).43 This is computationally possible due to the sparse approximations which allow one to deal with very large state spaces. Let $$s_{kt}$$ be the number of ships that are $$k$$ years old in period $$t$$. To form the transition of this state I regress $$s_{kt}$$ on $$s_{kt-1}$$ and $$s_{k-1t-1}$$ for $$k=1,...,19$$.44 Similarly, I regress ships of age zero, $$s_{0t}$$, on $$s_{0t-1}$$ and the total backlog. Finally, $$s_{20+t}$$ depends on optimal ship exit and thus all states; similarly total backlog depends on ship entry and thus all states. Therefore, I assume that the total backlog and $$s_{20+t}$$ follow a VAR which is also affected by demand and steel prices. Results from this specification are presented in Column III of Table 9 and again are similar to the baseline specifications. Finally, Table 10 presents a few counterfactual results based on these specifications.45 Comparing these results to Table 7, we see that counterfactual predictions are overall robust across these different specifications. TABLE 10 Counterfactuals performed using robustness Specification I and III of Table 9    Robustness specification I  Robustness specification III     Baseline  No entrants  No interventions  Baseline  No entrants  No interventions  Market share, China  50%  39%  14%  51%  40%  17%  Market share, Japan  43%  56%  79%  42%  55%  77%  Ship price, China  20.14  20.5  21.8  20  20.6  21.4  Ship price, Japan  21.37  21.75  23  23  23.7  24.5  Freight rate (price per voyage)  1.22  1.24  1.26  1.32  1.33  1.35  Consumer surplus (shippers)  5,954  5,810  5,556  4,948  4,879  4,732  Industry AVC  1.88  2.17  2.3  1.73  2.13  2.07     Robustness specification I  Robustness specification III     Baseline  No entrants  No interventions  Baseline  No entrants  No interventions  Market share, China  50%  39%  14%  51%  40%  17%  Market share, Japan  43%  56%  79%  42%  55%  77%  Ship price, China  20.14  20.5  21.8  20  20.6  21.4  Ship price, Japan  21.37  21.75  23  23  23.7  24.5  Freight rate (price per voyage)  1.22  1.24  1.26  1.32  1.33  1.35  Consumer surplus (shippers)  5,954  5,810  5,556  4,948  4,879  4,732  Industry AVC  1.88  2.17  2.3  1.73  2.13  2.07  Prices, surplus and average variable cost (AVC) measured in million U.S. Dollars. Surplus refers to the total amount between 2006 and 2012. Specification I considers a more flexible VAR model for state transitions. Specification III considers the entire fleet age distribution as a state variable. TABLE 10 Counterfactuals performed using robustness Specification I and III of Table 9    Robustness specification I  Robustness specification III     Baseline  No entrants  No interventions  Baseline  No entrants  No interventions  Market share, China  50%  39%  14%  51%  40%  17%  Market share, Japan  43%  56%  79%  42%  55%  77%  Ship price, China  20.14  20.5  21.8  20  20.6  21.4  Ship price, Japan  21.37  21.75  23  23  23.7  24.5  Freight rate (price per voyage)  1.22  1.24  1.26  1.32  1.33  1.35  Consumer surplus (shippers)  5,954  5,810  5,556  4,948  4,879  4,732  Industry AVC  1.88  2.17  2.3  1.73  2.13  2.07     Robustness specification I  Robustness specification III     Baseline  No entrants  No interventions  Baseline  No entrants  No interventions  Market share, China  50%  39%  14%  51%  40%  17%  Market share, Japan  43%  56%  79%  42%  55%  77%  Ship price, China  20.14  20.5  21.8  20  20.6  21.4  Ship price, Japan  21.37  21.75  23  23  23.7  24.5  Freight rate (price per voyage)  1.22  1.24  1.26  1.32  1.33  1.35  Consumer surplus (shippers)  5,954  5,810  5,556  4,948  4,879  4,732  Industry AVC  1.88  2.17  2.3  1.73  2.13  2.07  Prices, surplus and average variable cost (AVC) measured in million U.S. Dollars. Surplus refers to the total amount between 2006 and 2012. Specification I considers a more flexible VAR model for state transitions. Specification III considers the entire fleet age distribution as a state variable. LASSO penalty parameter: Finally, when performing LASSO, one needs to choose the penalty parameter, $$\lambda$$, that trades off fit and sparsity. Results are overall robust to a range of different $$\lambda$$, yet some sensitivity is to be expected, since the parameter $$\lambda$$ determines the level of smoothing.46 To save space, Table 11 presents counterfactual results corresponding to two intermediate values of $$\lambda$$. Those too are close to the baseline specifications. TABLE 11 Counterfactuals performed using robustness specification $$\lambda_{8}$$ and $$\lambda_{17}$$ of Table ??    Robustness specification $$\lambda_{8}$$  Robustness specification $$\lambda_{17}$$     Baseline  No entrants  No interventions  Baseline  No entrants  No interventions  Market share, China  55%  45%  19%  49%  39%  15%  Market share, Japan  39%  51%  72%  46%  57%  79%  Ship price, China  19.8  20.1  21  19.9  20.4  21.3  Ship price, Japan  20.9  21.2  22  21  21.5  22.4  Freight rate (price per voyage)  1.21  1.22  1.263  1.22  1.23  1.25  Consumer surplus (shippers)  6,047  5,984  5,817  6,019  5,916  4,731  Industry AVC  2.76  3.38  3.99  2.65  3.2  3.04     Robustness specification $$\lambda_{8}$$  Robustness specification $$\lambda_{17}$$     Baseline  No entrants  No interventions  Baseline  No entrants  No interventions  Market share, China  55%  45%  19%  49%  39%  15%  Market share, Japan  39%  51%  72%  46%  57%  79%  Ship price, China  19.8  20.1  21  19.9  20.4  21.3  Ship price, Japan  20.9  21.2  22  21  21.5  22.4  Freight rate (price per voyage)  1.21  1.22  1.263  1.22  1.23  1.25  Consumer surplus (shippers)  6,047  5,984  5,817  6,019  5,916  4,731  Industry AVC  2.76  3.38  3.99  2.65  3.2  3.04  Prices, surplus and average variable cost (AVC) measured in million U.S. Dollars. Surplus refers to the total amount between 2006 and 2012. TABLE 11 Counterfactuals performed using robustness specification $$\lambda_{8}$$ and $$\lambda_{17}$$ of Table ??    Robustness specification $$\lambda_{8}$$  Robustness specification $$\lambda_{17}$$     Baseline  No entrants  No interventions  Baseline  No entrants  No interventions  Market share, China  55%  45%  19%  49%  39%  15%  Market share, Japan  39%  51%  72%  46%  57%  79%  Ship price, China  19.8  20.1  21  19.9  20.4  21.3  Ship price, Japan  20.9  21.2  22  21  21.5  22.4  Freight rate (price per voyage)  1.21  1.22  1.263  1.22  1.23  1.25  Consumer surplus (shippers)  6,047  5,984  5,817  6,019  5,916  4,731  Industry AVC  2.76  3.38  3.99  2.65  3.2  3.04     Robustness specification $$\lambda_{8}$$  Robustness specification $$\lambda_{17}$$     Baseline  No entrants  No interventions  Baseline  No entrants  No interventions  Market share, China  55%  45%  19%  49%  39%  15%  Market share, Japan  39%  51%  72%  46%  57%  79%  Ship price, China  19.8  20.1  21  19.9  20.4  21.3  Ship price, Japan  20.9  21.2  22  21  21.5  22.4  Freight rate (price per voyage)  1.21  1.22  1.263  1.22  1.23  1.25  Consumer surplus (shippers)  6,047  5,984  5,817  6,019  5,916  4,731  Industry AVC  2.76  3.38  3.99  2.65  3.2  3.04  Prices, surplus and average variable cost (AVC) measured in million U.S. Dollars. Surplus refers to the total amount between 2006 and 2012. 8. Conclusion Shipbuilding is just one of several industries that China dominated in recent years; solar panels, steel, auto parts and numerous other sectors have exhibited similar trends of rapid market share expansion, as well as capacity surge, often coinciding with national or regional 5-year plans. This pattern can have major consequences for the geography of trade and production, as well as consumer welfare. The role of industrial policy in China’s rapid industrial takeover remains an open question. An important hurdle in answering this question lies in the inherent difficulty of knowing what policies are in place. This article provides a first attempt at measuring Chinese subsidies and quantifying their impact in the case of the world shipbuilding industry. Using a dynamic model of shipbuilding production and firm-level production data, I find strong evidence consistent with subsidies that decreased shipyard costs by 13–20%. Moreover, the policy interventions led to substantial misallocation of production across countries with no significant consumer surplus gains. There are two natural next steps from here; the first is to study the impact of industrial policy on market structure, both domestic and international, by delving deeper into firm entry and exit. The second, and more difficult extension would be to delve into the government’s objectives in providing the various support measures ($$e.g.$$ shipbuilding’s externalities to other sectors such as the steel industry or the readiness of the military). This article has partly ignored these issues and focused on the detection of production subsidies and their impact on production reallocation and surplus; yet both extensions form interesting avenues for future research. Finally, from a methodological perspective, this article features dynamic agents in the demand for and the supply of ships. Moreover, the empirical strategy makes use of sparse approximations to solve the shipbuilders’ dynamic optimization; this approach may be useful in other setups with a large number of states. APPENDIX A. Proof of Lemma 1 The shipyard’s optimal production policy is as follows: shipyard $$j$$ chooses $$q_{jt}=0$$ if   \[ -c\left(0,s_{jt}^{y},s_{t}\right)+\beta CV^{y}\left(s_{jt}^{y},s_{t},0\right)\geq VE_{j}^{o}\left(s_{t}\right)q^{\prime}-c\left(q^{\prime},s_{jt}^{y},s_{t}\right)-q^{\prime}\varepsilon_{jt}+\beta CV^{y}\left(s_{jt}^{y},s_{t},q^{\prime}\right) \] all $$q^{\prime}\in\left\{ 1,...,\overline{q}\right\} ,$$ or   \[ \varepsilon_{jt}\geq VE_{j}^{o}\left(s_{t}\right)+\max_{02006\right\} }_{R_{0}}+\underbrace{\begin{bmatrix}r^{s_{1}s_{1}} & r^{s_{1}s_{2}} & r^{s_{1}B} & r^{s_{1}d} & r^{s_{1}l}\\ r^{s_{2}s_{1}} & r^{s_{2}s_{2}} & r^{s_{2}B} & r^{s_{2}d} & r^{s_{2}l}\\ r^{Bs_{1}} & r^{Bs_{2}} & r^{BB} & r^{Bd} & r^{Bl}\\ 0 & 0 & 0 & r^{d} & 0\\ 0 & 0 & 0 & 0 & r^{l} \end{bmatrix}}_{R}s_{t-1}+\xi_{t} \end{equation} (26) and $$\Sigma_{\xi}$$ is diagonal. I estimate the parameters of interest $$\left(R_{0},R,\Sigma_{\xi}\right)$$ via OLS separately for each variable. Table 13 reports the results. TABLE 13 VAR parameter estimates       $$R_{0}^{pre}$$  $$R_{0}^{post}$$  $$\Sigma_{\xi}$$        $$\overline{s_{1t}^{o}}$$  –1.47  –1.46  7e-05           (0.97)  (0.98)           $$\overline{s_{2t}^{o}}$$  2.53**  2.52**  8e-05           (1.03)  (1.04)           $$B_{t}$$  –8.49  –8.50  0.004           (7.31)  (7.35)           $$d_{t}$$  0.38**  0.15           (0.17)           $$l_{t}$$  0.74*  1.27           (0.40)              $$R_{0}^{pre}$$  $$R_{0}^{post}$$  $$\Sigma_{\xi}$$        $$\overline{s_{1t}^{o}}$$  –1.47  –1.46  7e-05           (0.97)  (0.98)           $$\overline{s_{2t}^{o}}$$  2.53**  2.52**  8e-05           (1.03)  (1.04)           $$B_{t}$$  –8.49  –8.50  0.004           (7.31)  (7.35)           $$d_{t}$$  0.38**  0.15           (0.17)           $$l_{t}$$  0.74*  1.27           (0.40)           $$R$$     $$R$$     $$\overline{s_{1t}^{o}}$$  $$\overline{s_{2t}^{o}}$$  $$B_{t}$$  $$d_{t}$$  $$l_{t}$$  $$\overline{s_{1t}^{o}}$$  1.10**  0.08  0.02**  0.005  –0.003**     (0.04)  (0.09)  (0.004)  (0.003)  (0.001)  $$\overline{s_{2t}^{o}}$$  –0.15**  0.81**  –0.007  0.004  0.002**     (0.04)  (0.10)  (0.005)  (0.004)  (0.001)  $$B_{t}$$  0.16  1.07  0.87**  0.08**  –0.009     (0.30)  (0.71)  (0.03)  (0.03)  (0.007)  $$d_{t}$$  0  0  0  0.81**  0              (0.08)     $$l_{t}$$  0  0  0  0  0.89**                 (0.06)     $$\overline{s_{1t}^{o}}$$  $$\overline{s_{2t}^{o}}$$  $$B_{t}$$  $$d_{t}$$  $$l_{t}$$  $$\overline{s_{1t}^{o}}$$  1.10**  0.08  0.02**  0.005  –0.003**     (0.04)  (0.09)  (0.004)  (0.003)  (0.001)  $$\overline{s_{2t}^{o}}$$  –0.15**  0.81**  –0.007  0.004  0.002**     (0.04)  (0.10)  (0.005)  (0.004)  (0.001)  $$B_{t}$$  0.16  1.07  0.87**  0.08**  –0.009     (0.30)  (0.71)  (0.03)  (0.03)  (0.007)  $$d_{t}$$  0  0  0  0.81**  0              (0.08)     $$l_{t}$$  0  0  0  0  0.89**                 (0.06)  $$^{*}p<0.1$$, $$^{**}p<0.05$$. $$\left(\overline{s_{1t}^{o}},\overline{s_{2t}^{o}},B_{t}\right)$$ are in natural logarithms. TABLE 13 VAR parameter estimates       $$R_{0}^{pre}$$  $$R_{0}^{post}$$  $$\Sigma_{\xi}$$        $$\overline{s_{1t}^{o}}$$  –1.47  –1.46  7e-05           (0.97)  (0.98)           $$\overline{s_{2t}^{o}}$$  2.53**  2.52**  8e-05           (1.03)  (1.04)           $$B_{t}$$  –8.49  –8.50  0.004           (7.31)  (7.35)           $$d_{t}$$  0.38**  0.15           (0.17)           $$l_{t}$$  0.74*  1.27           (0.40)              $$R_{0}^{pre}$$  $$R_{0}^{post}$$  $$\Sigma_{\xi}$$        $$\overline{s_{1t}^{o}}$$  –1.47  –1.46  7e-05           (0.97)  (0.98)           $$\overline{s_{2t}^{o}}$$  2.53**  2.52**  8e-05           (1.03)  (1.04)           $$B_{t}$$  –8.49  –8.50  0.004           (7.31)  (7.35)           $$d_{t}$$  0.38**  0.15           (0.17)           $$l_{t}$$  0.74*  1.27           (0.40)           $$R$$     $$R$$     $$\overline{s_{1t}^{o}}$$  $$\overline{s_{2t}^{o}}$$  $$B_{t}$$  $$d_{t}$$  $$l_{t}$$  $$\overline{s_{1t}^{o}}$$  1.10**  0.08  0.02**  0.005  –0.003**     (0.04)  (0.09)  (0.004)  (0.003)  (0.001)  $$\overline{s_{2t}^{o}}$$  –0.15**  0.81**  –0.007  0.004  0.002**     (0.04)  (0.10)  (0.005)  (0.004)  (0.001)  $$B_{t}$$  0.16  1.07  0.87**  0.08**  –0.009     (0.30)  (0.71)  (0.03)  (0.03)  (0.007)  $$d_{t}$$  0  0  0  0.81**  0              (0.08)     $$l_{t}$$  0  0  0  0  0.89**                 (0.06)     $$\overline{s_{1t}^{o}}$$  $$\overline{s_{2t}^{o}}$$  $$B_{t}$$  $$d_{t}$$  $$l_{t}$$  $$\overline{s_{1t}^{o}}$$  1.10**  0.08  0.02**  0.005  –0.003**     (0.04)  (0.09)  (0.004)  (0.003)  (0.001)  $$\overline{s_{2t}^{o}}$$  –0.15**  0.81**  –0.007  0.004  0.002**     (0.04)  (0.10)  (0.005)  (0.004)  (0.001)  $$B_{t}$$  0.16  1.07  0.87**  0.08**  –0.009     (0.30)  (0.71)  (0.03)  (0.03)  (0.007)  $$d_{t}$$  0  0  0  0.81**  0              (0.08)     $$l_{t}$$  0  0  0  0  0.89**                 (0.06)  $$^{*}p<0.1$$, $$^{**}p<0.05$$. $$\left(\overline{s_{1t}^{o}},\overline{s_{2t}^{o}},B_{t}\right)$$ are in natural logarithms. In addition to the robustness analysis performed in Section 7, I have examined several specifications where the VAR parameters can be time-varying, that is $$\left(R_{ot},R_{t}\right)$$. Indeed, $$\left(R_{ot},R_{t}\right)$$ can vary deterministically ($$e.g.$$ time trend, so that $$R_{ot}=R_{o}+R_{\tau}t$$) or randomly with time (random walk model for $$R_{t}$$ estimated by the Kalman filter). The former requires extension of (14) to the time varying case and adaptation of the higher moments formulas; the latter relies on the standard Kalman updates for the time varying parameters which constitute the state of the Kalman filter. End results are robust. I also experimented heavily with restrictions on $$R_{0t}$$ and $$R_{t}$$ both in terms of before and after $$2006$$, allowing $$\Sigma_{\xi}$$ to be full, as well as parameter restrictions ($$e.g.$$ time to build might imply that $$r^{s_{1}d}=0$$). Finally, I allowed $$d_{t}$$ to be an $$AR\left(2\right)$$. My main findings are robust to these experiments. The chosen specification combines the following desired properties: it is parsimonious, stationary and takes into account the 2006 break. D. Derivation of state expectations in ship value function Here, I derive the expressions required for the LASSO estimation of the value functions of Section 5.1.3. Recall that I approximate the value function with a polynomial function, so that:   \[ V^{o}(x_{t})=\gamma^{o}f^{o}\left(x_{t}\right)=\sum_{i=1}^{d}\gamma_{i}^{o}x_{t}^{\left(i\right)}, \] where $$x_{t}=\left(s_{it}^{o},s_{t}\right)$$, and $$x_{t}^{\left(i\right)}$$ are Kronecker powers, so that $$x_{t}^{\left(2\right)}=x_{t}\otimes x_{t}$$, $$x_{t}^{(3)}=x_{t}^{\left(2\right)}\otimes x_{t}$$, etc. Then, (13) can be written as:   \begin{equation} P_{jt}^{N}=\beta^{T_{jt}}\gamma^{o}E\left(f^{o}\left(x_{t+T_{jt}}\right)|x_{t}\right)=\beta^{T_{jt}}\sum_{i=1}^{d}\gamma_{i}^{o}E\left(x_{t+T_{jt}}^{\left(i\right)}|x_{t}\right)\label{app 1}. \end{equation} (27) The conditional expectation is only necessary for $$s_{t}$$ since $$s_{it}^{o}$$ evolves deterministically. Equation (14) can be written as:   \[ s_{t+T}=A+v, \] where   \begin{eqnarray*} A & = & R^{T}s_{t}+\sum_{k=t+1}^{t+T}R^{t+T-k}R_{0k}\\ v & = & \sum_{k=t+1}^{t+T}R^{t+T-k}\xi_{k}. \end{eqnarray*} Note that conditional on $$s_{t}$$, $$A$$ is constant.47 Moreover, $$v$$ is zero-mean normal with covariance:   \[ \Sigma_{v}=Ev^{\prime}v=\sum_{k=t+1}^{t+T}R^{t+T-k}\Sigma_{\xi}\left(R^{\prime}\right)^{t+T-k}. \] Therefore, (27) becomes:   \[ P_{jt}^{N}=\beta^{T_{jt}}\sum_{i=1}^{d}\gamma_{i}^{o}E\left(\left(A+v\right)^{\left(i\right)}|x_{t}\right). \] I next compute the conditional expectations for up to third-order terms:   \begin{eqnarray*} E\left(A+v|x_{t}\right) & = & A\\ E\left(\left(A+v\right)^{\left(2\right)}|x_{t}\right) & = & A^{\left(2\right)}+vec\left(\Sigma_{v}\right)\\ E\left(\left(A+v\right)^{\left(3\right)}|x_{t}\right) & = & A^{\left(3\right)}+A\otimes vec\left(\Sigma_{v}\right)+vec\left(\Sigma_{v}\right)\otimes A+T_{mm^{2}}A\otimes vec\left(\Sigma_{v}\right), \end{eqnarray*} where $$vec\left(x\right)$$ denotes the vector formed by stacking the columns of $$x$$ one after the other; the matrix $$T_{mm^{2}}$$ is defined as the following linear operator: given a $$L\times n$$ matrix $$A$$, $$T_{Ln}$$ is an $$Ln\times Ln$$ matrix specified by the assignment $$T_{Ln}vec\left(A\right)=vec\left(A^{\prime}\right)$$, ($$A^{\prime}$$ stands for transpose). The first of the above equations is straightforward. To prove the second, use:   \[ E\left(\left(A+v\right)^{\left(2\right)}|x_{t}\right)=E\left(\left(A+v\right)^{\left(2\right)}\right)=A\otimes A+A\otimes E\left(v\right)+E\left(v\right)\otimes A+Ev^{\left(2\right)}. \] It is easy to see that $$Ev^{\left(2\right)}=vec\left(Evv^{\prime}\right)=vec\left(\Sigma_{v}\right)$$ using the property $$vec\left(BXC\right)=\left(C^{\prime}\otimes B\right)vec\left(X\right)$$. Finally,   \begin{eqnarray*} E\left(\left(A+v\right)^{\left(3\right)}|x_{t}\right) & = & A\otimes E\left(A+v\right)^{2}+Ev\otimes\left(A+v\right)^{2}\\ & = & A\otimes\left(A^{\left(2\right)}+vec\left(\Sigma_{v}\right)\right)+E\left(v\otimes A\otimes v\right)+E\left(v\otimes v\otimes A\right)+Ev^{\left(3\right)}. \end{eqnarray*} $$Ev^{\left(3\right)}$$ is zero since $$v$$ is Gaussian. Moreover,   \[ E\left(v\otimes A\otimes v\right)=T_{mm^{2}}\otimes Ev^{\left(2\right)}. \] Indeed, if $$B$$ and $$C$$ are matrices of dimensions $$\left(L,n\right)$$ and $$\left(n,k\right)$$ respectively, then   \[ B\otimes C=T_{pl}\left(C\otimes B\right)T_{nk}. \] E. Statistics of the Optimal Production Lemma 2. The statistics of the optimal production are given by:  \begin{equation} E_{\varepsilon}q^{\ast}\left(x,\varepsilon\right)=\sum_{q=0}^{\overline{q}-1}\Phi\left(A\left(x,q\right)\right) \end{equation} (28)  \begin{equation} E_{\varepsilon}\left[q^{\ast}\left(x,\varepsilon\right)\right]^{2}=2\sum_{q=1}^{\overline{q}}q\Phi\left(A\left(x,q-1\right)\right)-\sum_{q=0}^{\overline{q}-1}\Phi\left(A\left(x,q\right)\right) \end{equation} (29)  \begin{equation} E_{\varepsilon}\left[q^{\ast}\left(x,\varepsilon\right)\varepsilon\right]=-\sum_{q=0}^{\overline{q}-1}\phi\left(A\left(x,q\right)\right), \end{equation} (30)where $$\phi\left(\cdot\right)$$ is the standard normal density. Proof. To derive (28), use (20) to get:   \begin{eqnarray*} E_{\varepsilon}q^{\ast}\left(x,\varepsilon\right) & = & \sum_{q=1}^{\overline{q}}qp_{q}\left(x\right)=\sum_{q=1}^{\overline{q}-1}q\left[\Phi\left(A\left(x,q-1\right)\right)-\Phi\left(A\left(x,q\right)\right)\right]+\overline{q}\Phi\left(A\left(x,\overline{q}-1\right)\right)\\ & = & \sum_{q=0}^{\overline{q}-2}\left(q+1\right)\Phi\left(A\left(x,q\right)\right)-\sum_{q=1}^{\overline{q}-1}q\Phi\left(A\left(x,q\right)\right)+\overline{q}\Phi\left(A\left(x,\overline{q}-1\right)\right)\\ & = & \Phi\left(A\left(x,0\right)\right)+\sum_{q=1}^{\overline{q}-2}\Phi\left(A\left(x,q\right)\right)+\Phi\left(A\left(x,\overline{q}-1\right)\right)=\sum_{q=0}^{\overline{q}-1}\Phi\left(A\left(x,q\right)\right). \end{eqnarray*} Equation (29) follows similarly. Finally, let $$\phi\left(\varepsilon\right)$$ denote the standard normal density. Then,   \[ \int_{a}^{b}\varepsilon\phi\left(\varepsilon\right)=-\frac{1}{2\sqrt{\pi}}\int_{a}^{b}de^{-\frac{1}{2}\varepsilon^{2}}=\phi\left(a\right)-\phi\left(b\right) \] and therefore:   \begin{eqnarray*} E_{\varepsilon}\varepsilon q^{\ast}\left(x,\varepsilon\right) & = & \int\varepsilon q^{\ast}\left(x,\varepsilon\right)\phi\left(\varepsilon\right)d\varepsilon=\\ & = & \sum_{q=1}^{\overline{q}-1}q\int_{A\left(x,q\right)}^{A\left(x,q-1\right)}\varepsilon\phi\left(\varepsilon\right)+\overline{q}\int_{-\infty}^{A\left(x,\overline{q}-1\right)}\varepsilon\phi\left(\varepsilon\right)\\ & = & \sum_{q=1}^{\overline{q}-1}q\left[\phi(A\left(x,q\right))-\phi\left(A\left(x,q-1\right)\right)\right]-\overline{q}\phi\left(A\left(x,\overline{q}-1\right)\right)\\ & = & -\sum_{q=0}^{\overline{q}-1}\phi(A\left(x,q\right)). \end{eqnarray*} ∥ F. Estimating costs for dynamic shipbuilders: Details I provide details on each step performed when estimating the cost function of dynamic shipyards (see Supplementary Material). Step 1: Estimate $$A\left(x,q\right)$$ using (21). In this step, I first compute the choice probabilities $$\left\{ p_{q}\left(x\right)\right\} _{q=0}^{\overline{q}}$$ from observed frequencies. I also include a post-2006 dummy in the state to capture differences in the policy function before and after 2006. As is common in dynamic applications, there are not many observations for all $$q=0,1,...,\overline{q}$$ at each state $$x$$. Therefore, I first cluster the data finely, using the kmeans algorithm, and then compute frequencies on this subset of states. Second, I smooth the frequency matrix using kernels. In particular, I compute the choice probability $$p_{q}\left(x\right)$$ at state $$x$$ using the following formula:   \[ p_{q}\left(x\right)=\sum_{x^{\prime}}w\left(x^{\prime}-x\right)\widetilde{p}_{q}\left(x^{\prime}\right), \] where $$\widetilde{p}_{q}\left(x\right)$$ is the observed frequency count of $$q$$ at state $$x$$ and $$w\left(\cdot\right)$$ is a kernel that appropriately weights the distance of $$x$$ from every other state $$x^{\prime}$$. For numerical states (backlog, docks/berths, length, time, fleet, total backlog, demand, steel price) I use normal kernels with diagonal covariance. For categorical states (country and post-2006 dummy) I use the kernel:   \[ w\left(x^{\prime}-x\right)=\left\{ \begin{array}{c} 1-h,\quad\text{if }x^{\prime}=x\\ h/k_{x},\quad\text{if }x^{\prime}\neq x, \end{array}\right. \] where $$k_{x}$$ is the number of values that $$x$$ can take (in the case of country it is 4, in the case of the post dummy 2) and $$h$$ represents the bandwidth of the kernel. As $$h$$ gets close to 0, this kernel weights states that share the same variable $$x$$.48 Finally, I chose $$\overline{q}=10$$, since $$99.75\%$$ of observations involve $$q\leq10$$. Step 2: Compute the terms $$Eq^{\ast}$$, $$Eq^{\ast2}$$, $$E\varepsilon q$$ using (28), (29), and (30). Step 3: At each guess of the parameters $$\left(c_{1}',c_{2}',\sigma\right)$$ in the optimization of the likelihood (15) the following calculations are performed: Step 3a: Solve for the approximate value function parameters $$\gamma^{y}$$ from (23). Note that the choice probabilities require the continuation value $$CV^{y}\left(x,q\right)=\gamma^{y}E\left[f^{y}\left(x^{\prime}\right)|x,q\right]$$. To estimate $$\gamma^{y}$$ from (23) I need   \begin{equation} E_{\varepsilon,x^{\prime}}\left[f^{y}\left(x^{\prime}\right)|x,q^{\ast}\left(x,\varepsilon\right)\right]=\gamma^{y}\sum_{q=0}^{\overline{q}}p_{q}\left(x\right)E_{x^{\prime}}\left[f^{y}\left(x^{\prime}\right)|x,q\right].\label{exp polyn} \end{equation} (31) I use polynomials of third degree in all variables (fourth degree does not alter the results). The industry state $$s$$ evolves according to the estimated VAR model described in Section 5.1.2, while the expectations of its polynomial powers are given above in this Appendix. I assume that the shipyard’s individual backlog, $$b_{jt}$$, transitions as follows:   \[ b_{jt+1}=\left(1-\delta\right)b_{jt}+q \] that is $$\delta\%$$ of the backlog is delivered and $$q$$ orders of period $$t$$ enter the backlog. I experimented extensively with the above transition rule.49 It was found that the simple model where $$\delta$$ is constant over shipyards and time and equal to the sample mean (which is $$10\%$$) performs equally well compared to more complex models. Given the state transitions it is straightforward to compute (31). I estimate (23) using LASSO in two ways. First, call the LASSO within the likelihood maximization with the regularization parameter chosen using Belloni and Chernozhukov $$\left(2011\right)$$. Second, estimate (23) with LASSO using profits obtained from the static cost estimates. The goal here is to recover the relevant non-zero polynomial terms. I then run OLS within the likelihood with only these terms (and repeat the estimation for many values of the regularization parameter). Results are overall robust to all of the above. Step 3b: Using $$\gamma^{y}$$, compute choice probabilities in the likelihood and update $$(c_{1}',c_{2}',\sigma)$$. A concern in two-step approaches to dynamic frameworks is that the first-stage policy functions (in this case, the non-parametric $$A\left(x,q\right)$$) may be different from the optimal policy computed using the true parameters and value function, that is from (19). To check this, I compute $$A\left(\cdot\right)$$ from (19) and find that it is close to its first-stage estimate. I then re-optimize the likelihood using the new $$A\left(\cdot\right)$$. The parameters that I report result from this loop. G. Counterfactual Computation There are two steps in the implementation of the counterfactual scenarios presented in Section 6 (see Supplementary Material). First, I compute the equilibrium of the model in each scenario. Second, I simulate the model using the observed paths of demand and steel prices which are exogenous. Note that if one is only interested in the “no interventions” counterfactual, one can simply use the pre-2006 expectations and value functions to simulate the model. To predict how the industry would evolve under different counterfactual scenarios I need to obtain shipyards’ optimal policies and value functions under each scenario. I can no longer use the estimated VAR for state transitions, since this formed an approximation to expectations that hid equilibrium features. I therefore turn to the following state transitions for $$\left(\overline{s_{1t}^{o}},\overline{s_{2t}^{o}},B_{t}\right)$$, where $$\overline{s_{1t}^{o}}$$ is the number of ships younger than 20 years old, $$\overline{s_{2t}^{o}}$$ is the number of ships older than 20 years old and $$B_{t}$$ is the total backlog:   \begin{eqnarray} \overline{s_{1t+1}^{o}} & = & \delta B_{t}+\left(1-\rho_{1t}\right)\overline{s_{1t}^{o}}\\ \overline{s_{2t+1}^{o}} & = & \overline{s_{2t}^{o}}+\rho_{1t}\overline{s_{1t}^{o}}-\zeta\left(s_{t}\right)\nonumber \\ \end{eqnarray} (32)  \begin{eqnarray} B_{t+1} & = & (1-\delta)B_{t}+\sum_{j}q_{jt}, \end{eqnarray} (33) where $$\zeta\left(s_{t}\right)$$ is the number of ships that exit at state $$s_{t}$$, $$\rho_{1t}$$ is the percentage of ships that transit from 19 years old and 3 quarters to 20 years old and $$\delta$$ is the percentage of the backlog that is delivered, consistent with the individual backlog transition used in the estimation and described above in this Appendix. In words, the number of young ships $$\overline{s_{1t+1}^{o}}$$ equals last period’s young ships plus deliveries from the total backlog, minus exiting ships (as documented in Kalouptsidi, 2014, there is virtually no exit in ships younger than 20 years old). The number of old ships $$\overline{s_{2t+1}^{o}}$$ equals last period’s old ships plus the aging ships minus exiting ships. Finally, total backlog $$B_{t+1}$$ equals last period’s total backlog minus deliveries, plus total new ship orders. I calibrate $$\rho_{1t}$$ to $$3\%$$ which is the sample average. To predict ship exit $$\zeta\left(s_{t}\right)$$ I follow Kalouptsidi (2014) where the number of exiting ships is regressed on the industry state (in particular, $$\log\zeta_{t}=\beta_{\zeta}s_{t}$$); note that exit rates are extremely low (even during the $$2008$$ crisis). Demand $$d_{t}$$ and steel price $$l_{t}$$ retain their original transition processes, since these are exogenous to this model. To find the equilibrium of the model I use a fixed point algorithm with the goal of recovering the shipyard’s optimal policy function $$p_{q}^{\ast}\left(x\right)$$, for all production levels $$q$$ and states $$x=\left(s^{y},s\right)$$, as well as the shipyard’s value function $$V^{y}\left(x\right)$$. At each iteration $$l$$ the policies $$p_{q}^{l}\left(x\right)$$ are used to update to $$p_{q}^{l+1}\left(x\right)$$ and I keep iterating until $$||p_{q}^{l+1}\left(x\right)-p_{q}^{l}\left(x\right)||\leq eps$$. Each iteration performs the following steps: Step 1: Update the value function using a sparse parametric approximation and LASSO (third-order polynomials are used). The estimation of the value function approximating parameters, $$\widetilde{\gamma}^{y}$$, relies on the approximate Bellman equation:   \[ \left(f^{y}\left(x\right)-\beta E\left[f^{y}\left(x^{\prime}\right)|p_{q}^{l}\left(x\right),x\right]\right)\widetilde{\gamma}^{y,l+1}=E_{\varepsilon}\left[\pi^{y}(x,q,\varepsilon);p_{q}^{l}\left(x\right)\right], \] where ex ante profits are computed by:   \[ E_{\varepsilon}\left[\pi^{y}(x,q,\varepsilon);p_{q}^{l}\left(x\right)\right]=\left[VE\left(x\right)-c_{1}\left(x\right)\right]\sum_{q=0}^{\overline{q}}qp_{q}^{l}\left(x\right)-c_{2}\sum_{q=0}^{\overline{q}}q^{2}p_{q}^{l}\left(x\right)+\sigma\sum_{q=0}^{\overline{q}-1}\phi(A^{l}\left(x,q\right)), \] where $$A\left(x,q\right)=\Phi^{-1}\left(1-\sum_{k=0}^{q}p_{k}^{l}\left(x\right)\right)$$, $$q=0,1,...,\overline{q}-1$$. To derive the above, use (24) and (30). Moreover, to compute $$E\left[f\left(x^{\prime}\right)|p_{q}^{l}\left(x\right),x\right]$$, I simulate $$d_{t}$$ and $$l_{t}$$ one period forward since these are the only stochastic states now. State transitions are computed via (32). Due to computational constraints I have assumed throughout that shipyards keep track of the total backlog rather than the distribution of backlogs. Therefore, at this stage shipyards do not have the full information to predict total orders accurately. To circumvent this issue, I make the simplifying assumption that shipyards believe they are all at the same state and can predict total orders using the total number of firms. Step 2: Update the choice probabilities:   \begin{eqnarray*} p_{0}^{l+1}\left(x\right) & = & 1-\Phi\left(\frac{1}{\sigma}\left(VE^{o}\left(x\right)-c_{1}\left(x\right)-c_{2}+\beta\left(CV^{y,l+1}\left(1\right)-CV^{y,l+1}\left(0\right)\right)\right)\right)\\ p_{q}^{l+1}\left(x\right) & = & \Phi\left(\frac{1}{\sigma}\left(VE^{o}\left(x\right)-c_{1}\left(x\right)-c_{2}\left(2q-1\right)+\beta\left(CV^{y,l+1}\left(q\right)-CV^{y,l+1}\left(q-1\right)\right)\right)\right)-\\ & & -\Phi\left(\frac{1}{\sigma}\left(VE^{o}\left(x\right)-c_{1}\left(x\right)-c_{2}\left(2q+1\right)+\beta\left(CV^{y,l+1}\left(q+1\right)-CV^{y,l+1}\left(q\right)\right)\right)\right)\\ p_{\overline{q}}^{l+1}\left(x\right) & = & 1-\sum_{q=0}^{\overline{q}-1}p_{q}^{l+1}\left(x\right). \end{eqnarray*} I solve the above fixed point under three scenarios: the true post-2006 world, a world with no China intervention and a world without the Chinese new shipyards. These worlds differ in the shipyard cost function, the set of active shipyards and the shipyard capital structure. I perform the fixed point on all data. Finally, to simulate the model, I draw cost shocks $$\varepsilon$$ and obtain the corresponding optimal production using (9) which in turn relies on the value function computed using the parameters $$\widetilde{\gamma}^{y}$$ retrieved by the fixed point algorithm. At each state visited, $$E\left[f\left(x^{\prime}\right)|p_{q}^{\ast}\left(x\right),x\right]$$ is computed using the retrieved equilibrium choice probabilities. H. Monte Carlo: sparse approximations in dynamic programming Here, I present a Monte Carlo to evaluate the performance of the methodology used in the article based on sparse approximations. To do so, I introduce a simple dynamic shipbuilding model motivated by the article, but of much smaller dimensionality. Roughly, the model introduces production dynamics in the toy model of Section 3.1. I show that using sparse approximations performs equally well or better than other standard methods ($$e.g.$$ state space discretization or parametric approximations) both in terms of solving the model to obtain the optimal policy and value function, as well as in terms of estimating the payoff parameters of interest. A simplified shipbuilding model: The agent (a shipyard) maximizes its expected lifetime payoff from ship production. The shipyard’s action is how many units (ships) to produce, $$q\in\{0,1,...,\bar{q}\}$$. Its state comprises of its backlog $$b$$ and an aggregate state $$s$$ with two components: the demand for shipping services and the total fleet. A shipyard gets price $$P(b,s)$$ which is increasing in demand for shipping and decreasing in the total fleet (both variables encoded in $$s$$); the price also depends on the yard’s backlog $$b$$ which affects its delivery date. The backlog transitions deterministically according to the rule: $$b'=(1-\delta)b+q$$, as in the Appendix above. In words, the shipyard delivers $$\delta\%$$ of its backlog every period and production is added to the backlog. It is easy to show that this transition rule confines $$b$$ in the interval $$[0,\bar{q}/\delta]$$. The aggregate state follows a stationary VAR model: $$s'=R_{0}+Rs+\xi$$, where $$\xi\sim N(0,\Sigma_{\xi})$$. The shipyard’s per period payoff is:   \[ \pi(q,b,s,\varepsilon)=P(b,s)q-c_{1}(b)q-c_{2}q^{2}-\sigma q\varepsilon, \] where $$\epsilon\sim N(0,1)$$. Note that the cost function is quadratic in quantity, while the coefficient $$c_{1}$$ depends on the shipyard’s backlog, making the production choice dynamic (I assume that $$c_{1}=c_{1,0}+c_{1,1}b+c_{1,2}b^{2}$$, where $$c_{1,1}<0$$ and $$c_{1,2}>0$$). Table 14 provides the Monte Carlo parameter values for the model.50 TABLE 14 Monte Carlo parameters $$\beta$$  0.95  $$R_{0}$$  $$\left( {\matrix{{200} \cr {1.5} \cr } } \right)$$     $$\delta$$  0.3     $$\bar{q}$$  9           $$c_{1,0}$$  35  $$R$$  $$\left( {\matrix{{0.9} & {2.5} \cr 0 & {0.93} \cr } } \right)$$     $$c_{1,1}$$  –1.1     $$c_{1,2}$$  0.018           $$c_{2}$$  2.5  $$\Sigma_{\xi}$$  $$\left( {\matrix{{3600} & 0 \cr 0 & {0.04} \cr} } \right)$$     $$\sigma$$  8     $$\beta$$  0.95  $$R_{0}$$  $$\left( {\matrix{{200} \cr {1.5} \cr } } \right)$$     $$\delta$$  0.3     $$\bar{q}$$  9           $$c_{1,0}$$  35  $$R$$  $$\left( {\matrix{{0.9} & {2.5} \cr 0 & {0.93} \cr } } \right)$$     $$c_{1,1}$$  –1.1     $$c_{1,2}$$  0.018           $$c_{2}$$  2.5  $$\Sigma_{\xi}$$  $$\left( {\matrix{{3600} & 0 \cr 0 & {0.04} \cr} } \right)$$     $$\sigma$$  8     TABLE 14 Monte Carlo parameters $$\beta$$  0.95  $$R_{0}$$  $$\left( {\matrix{{200} \cr {1.5} \cr } } \right)$$     $$\delta$$  0.3     $$\bar{q}$$  9           $$c_{1,0}$$  35  $$R$$  $$\left( {\matrix{{0.9} & {2.5} \cr 0 & {0.93} \cr } } \right)$$     $$c_{1,1}$$  –1.1     $$c_{1,2}$$  0.018           $$c_{2}$$  2.5  $$\Sigma_{\xi}$$  $$\left( {\matrix{{3600} & 0 \cr 0 & {0.04} \cr} } \right)$$     $$\sigma$$  8     $$\beta$$  0.95  $$R_{0}$$  $$\left( {\matrix{{200} \cr {1.5} \cr } } \right)$$     $$\delta$$  0.3     $$\bar{q}$$  9           $$c_{1,0}$$  35  $$R$$  $$\left( {\matrix{{0.9} & {2.5} \cr 0 & {0.93} \cr } } \right)$$     $$c_{1,1}$$  –1.1     $$c_{1,2}$$  0.018           $$c_{2}$$  2.5  $$\Sigma_{\xi}$$  $$\left( {\matrix{{3600} & 0 \cr 0 & {0.04} \cr} } \right)$$     $$\sigma$$  8     Solving the model: The shipyard’s Bellman equation is:   \begin{equation} V(b,s,\varepsilon)=\max_{q\in\{0,...,\bar{q}\}}\left\{ \pi(q,b,s,\varepsilon)+\beta E_{\varepsilon',s'}V\left((1-\delta)b+q,s',\varepsilon'\right)\right\} \end{equation} (34) while the ex ante value function is $$V(b,s)\equiv E_{\varepsilon}V(b,s,\varepsilon)$$. The shipyard’s optimal policy is characterized by the conditional choice probabilities. Let $$p_{q}(b,s)$$ be the probability of choosing $$q$$ at state $$(b,s)$$. These choice probabilities are determined by optimal thresholds over the range of $$\varepsilon$$, as shown in Lemma 1 of the article. Ideally, one would like to first solve this model “exactly” and then solve it with different “approximate” methodologies (state space discretization, parametric approximations, sparse parametric approximations) to compare their performance. Here, determination of the exact solution is not straightforward as the states are continuous. To address this issue, I discretize the state space very finely and consider this the exact solution (note that all states are bounded). Then, I compare the following three solved models: (1) (coarse) state space discretization; (2) polynomial approximation of the value function; (3) sparse polynomial approximation of the value function. The models are solved via a typical fixed point algorithm that obtains the value function and the optimal policy. The algorithm iterates two steps, similar to those in the paper; one updates the policy function and the other updates the value function. In particular, at iteration $$l$$, the algorithm inherits an ex ante value function $$V^{l}(b,s)$$ and optimal policy given by the choice probabilities $$p_{q}^{l}(b,s)$$ and then performs the following two steps: (1) Compute the optimal policy $$p_{q}^{l+1}(b,s)$$ by calculating the optimal thresholds of Lemma 1 using $$V^{l}(b,s)$$. (2) Update the ex ante value function to $$V^{l+1}(b,s)$$ by first computing the ex ante per period payoff as in the article and then using the Bellman equation (34). There are two implementations here: (a) State space discretization (fine or coarse):   \begin{equation} V^{l+1}(b,s)=\tilde{\pi}(b,s)+\beta{{{\displaystyle \sum_{q=0}^{\bar{q}}p_{q}^{l+1}(b,s)}}}\sum_{s'}V^{l}\left((1-\delta)b+q,s'\right)\Pr(s'|s),\label{eq:update sd} \end{equation} (35) where $$\tilde{\pi}(b,s)\equiv E_{\varepsilon}\left[\pi(q,b,s,\varepsilon);p_{q}^{l}(b,s)\right]$$ is the ex ante payoff under the optimal policy and $$\Pr(s'|s)$$ is the transition probability of the aggregate state $$s$$ in the discretized world (more on this below). (b) Parametric approximations: here the value function is of the form $$V(b,s)=\gamma f(b,s)$$, where $$f(b,s)$$ denotes a vector of polynomial terms in the state variables and $$\gamma$$ a conforming vector of coefficients. To update the value function, use:   \begin{equation} \gamma^{l+1}f(b,s)=\tilde{\pi}(b,s)+\beta\gamma^{l+1}{\displaystyle \sum_{q=0}^{\bar{q}}p_{q}^{l+1}(b,s)E_{s'}}f\left((1-\delta)b+q,s'\right), \end{equation} (36) where $$E_{s'}$$ now employs the VAR model for $$s$$. $$\gamma^{l+1}$$ can be obtained via OLS (standard parametric approximation) or LASSO (sparse parametric approximation). Then, set $$V^{l+1}(b,s)=\gamma^{l+1}f(b,s)$$. Before proceeding to the results, I provide some further details on the construction of the various models. Fine state space discretization (capturing the exact solution): To construct the state space, I use the full range of $$b$$, $$[0,\bar{q}/\delta]$$ and its actual transition. I discretize $$s$$ using twenty-five equidistant grid points for each of the two variables. The resulting number of states exceeds 14,000. To construct the transition of this finite aggregate state, I employ the VAR model as follows: I simulate $$s$$ forward for a long time horizon; then each state of the generated paths is replaced by its closest finite state. The transition probabilities are calculated by the empirical frequencies (this procedure is followed in Kalouptsidi, 2014). Coarse state space discretization: The typical difficulty in using state space discretization to solve dynamic programming problems is that often it cannot accommodate large state spaces. To reflect this difficulty, I consider a coarser discretization of the same state space: I now use five grid points for each of the two aggregate state variables, but still consider the full range of $$b$$. This results in a total of 775 states. I construct the transition matrix following the same procedure as above. It is important to note that since the exact model is itself discretized, this method is likely “positively biased”. Parametric approximations: I explore polynomial approximations of various degrees (second, third, and fourth). Here, I employ the VAR model to compute expectations directly, as in the main body of the article. The size of the state space used to perform the fixed point, is larger than the one of the coarse discretization (around 5,000 grid points); this is in order to capture the benefit of allowing for continuous states when using approximations. Sparse parametric approximations: The dimension of the state space may lead to an explosive number of possible terms to be included in $$f(x)$$. Additionally, choosing which terms to include in the approximation may prove to be an arduous process. A solution which circumvents both issues is to estimate the sparse vector $$\gamma$$ in each iteration via LASSO instead of OLS. This way, the polynomial approximation of the value function can be of higher power, allowing for more flexibility, while term selection takes place at the same time. As in the paper, in most specifications below I use the static model to determine the polynomial terms that are maintained. Figure 10 depicts the shipyard’s value function as a function of the backlog (on average across other states). As shown, the value function obtained from the coarsely discretized model is furthest away from the true. Polynomial and sparse polynomial approximations both do well. Figure 11 plots the evolution of a shipyard’s backlog over thirty-five periods. Again, parametric approximations do better than the coarse state space discretization. Table 15 collects a number of statistics pertaining to the different models. In particular, it compares the models in terms of both the obtained value function and choice probabilities, as well as the production and backlog obtained when simulating the model. The (coarsely) discretized model performs worse than approximations. Both regular and sparse parametric approximations do well and yield similar results. TABLE 15 Model solution Model  Exact  Coarse  Parametric Approx (OLS)  Sparse Parametric Approx (LASSO)  State Space Size  14074  775  5766  5766  5766  5766  5766  5766  Polynomial Degree  –  –  2  3  4  2  3  4  Mean value function  861.495  851.017  855.099  860.496  862.458  857.969  860.333  861.987  Median value function  848.347  843.125  839.304  843.454  847.254  845.277  843.753  842.567  St. Dev. value function  195.598  117.096  201.855  208.150  208.391  201.605  203.648  204.809  Av. distance value function  –  0.119  0.017  0.019  0.019  0.015  0.015  0.016  Av. distance CPs  –  0.210  0.437  0.399  0.399  0.437  0.429  0.419     Simulated paths  Av. distance backlog  –  0.034  0.025  0.024  0.022  0.025  0.024  0.024  St. Dev. backlog  –  0.045  0.038  0.036  0.034  0.038  0.036  0.035  Av. distance production  –  0.062  0.041  0.040  0.037  0.041  0.040  0.039  St. Dev production  –  0.120  0.109  0.105  0.098  0.109  0.107  0.098  Model  Exact  Coarse  Parametric Approx (OLS)  Sparse Parametric Approx (LASSO)  State Space Size  14074  775  5766  5766  5766  5766  5766  5766  Polynomial Degree  –  –  2  3  4  2  3  4  Mean value function  861.495  851.017  855.099  860.496  862.458  857.969  860.333  861.987  Median value function  848.347  843.125  839.304  843.454  847.254  845.277  843.753  842.567  St. Dev. value function  195.598  117.096  201.855  208.150  208.391  201.605  203.648  204.809  Av. distance value function  –  0.119  0.017  0.019  0.019  0.015  0.015  0.016  Av. distance CPs  –  0.210  0.437  0.399  0.399  0.437  0.429  0.419     Simulated paths  Av. distance backlog  –  0.034  0.025  0.024  0.022  0.025  0.024  0.024  St. Dev. backlog  –  0.045  0.038  0.036  0.034  0.038  0.036  0.035  Av. distance production  –  0.062  0.041  0.040  0.037  0.041  0.040  0.039  St. Dev production  –  0.120  0.109  0.105  0.098  0.109  0.107  0.098  Av. distance is the average normalized distance from the true model across states or across the points of the simulated path. TABLE 15 Model solution Model  Exact  Coarse  Parametric Approx (OLS)  Sparse Parametric Approx (LASSO)  State Space Size  14074  775  5766  5766  5766  5766  5766  5766  Polynomial Degree  –  –  2  3  4  2  3  4  Mean value function  861.495  851.017  855.099  860.496  862.458  857.969  860.333  861.987  Median value function  848.347  843.125  839.304  843.454  847.254  845.277  843.753  842.567  St. Dev. value function  195.598  117.096  201.855  208.150  208.391  201.605  203.648  204.809  Av. distance value function  –  0.119  0.017  0.019  0.019  0.015  0.015  0.016  Av. distance CPs  –  0.210  0.437  0.399  0.399  0.437  0.429  0.419     Simulated paths  Av. distance backlog  –  0.034  0.025  0.024  0.022  0.025  0.024  0.024  St. Dev. backlog  –  0.045  0.038  0.036  0.034  0.038  0.036  0.035  Av. distance production  –  0.062  0.041  0.040  0.037  0.041  0.040  0.039  St. Dev production  –  0.120  0.109  0.105  0.098  0.109  0.107  0.098  Model  Exact  Coarse  Parametric Approx (OLS)  Sparse Parametric Approx (LASSO)  State Space Size  14074  775  5766  5766  5766  5766  5766  5766  Polynomial Degree  –  –  2  3  4  2  3  4  Mean value function  861.495  851.017  855.099  860.496  862.458  857.969  860.333  861.987  Median value function  848.347  843.125  839.304  843.454  847.254  845.277  843.753  842.567  St. Dev. value function  195.598  117.096  201.855  208.150  208.391  201.605  203.648  204.809  Av. distance value function  –  0.119  0.017  0.019  0.019  0.015  0.015  0.016  Av. distance CPs  –  0.210  0.437  0.399  0.399  0.437  0.429  0.419     Simulated paths  Av. distance backlog  –  0.034  0.025  0.024  0.022  0.025  0.024  0.024  St. Dev. backlog  –  0.045  0.038  0.036  0.034  0.038  0.036  0.035  Av. distance production  –  0.062  0.041  0.040  0.037  0.041  0.040  0.039  St. Dev production  –  0.120  0.109  0.105  0.098  0.109  0.107  0.098  Av. distance is the average normalized distance from the true model across states or across the points of the simulated path. Figure 10 View largeDownload slide The shipyard’s value function as a function of backlog (average across other states) Figure 10 View largeDownload slide The shipyard’s value function as a function of backlog (average across other states) Figure 11 View largeDownload slide Simulated backlog path Figure 11 View largeDownload slide Simulated backlog path Estimating the model: The second part of the Monte Carlo exercise is to evaluate the use of the different methods as part of an estimation strategy aiming to recover the parameters of the per period payoff function, and in particular those of the cost function, $$\left\{ c_{1,0},c_{1,1},c_{1,2},c_{2},\sigma\right\} $$. I create a dataset by simulating the exact model for fifty periods and 200 shipyards. The estimation procedure closely follows the two-step approach used in the main body of the article: first, I estimate the optimal policy thresholds using the choice probabilities.51 In the second step I maximize the likelihood of the article; different methods differ in how the value function is obtained. When a coarse state space discretization is used, I solve for the value function via a fixed point following (35); when parametric approximations are used, I follow the paper’s procedure, summarized in (36).52 Table 16 presents the results. I allow for higher order terms in the case of sparse approximations to capture the method’s ability to handle larger problems. The recovered parameter estimates via sparse approximations are closer to the true values than regular parametric approximations, though both do well. In contrast, the coarsely discretized model performs worse. Results are also robust over a range of the penalty parameter $$\lambda$$ (though some sensitivity is to be expected since $$\lambda$$ trades off fit and sparsity), as well as to more structured methods of determining the penalty parameter (cross validation, Belloni and Chernozhukov, 2011). Moreover, similar results are obtained when using Chebyshev polynomials. TABLE 16 Model estimation    True  Coarse  Parametric Approx (OLS)  Sparse parametric Approx (LASSO)  Polynomial degree  –  –  2  3  4  $$c_{1,0}$$  35  30.26  36.34  35.68  33.21  $$c_{1,1}$$  –1.1  –1.1  –1.30  –1.26  –1.08  $$c_{1,2}$$  0.018  0.016  0.0237  0.0224  0.0192  $$c_{2}$$  2.5  3.27  2.57  2.61  2.58  $$\sigma$$  8  10.54  8.38  8.49  8.23  Distance  –  0.18  0.12  0.10  0.04     True  Coarse  Parametric Approx (OLS)  Sparse parametric Approx (LASSO)  Polynomial degree  –  –  2  3  4  $$c_{1,0}$$  35  30.26  36.34  35.68  33.21  $$c_{1,1}$$  –1.1  –1.1  –1.30  –1.26  –1.08  $$c_{1,2}$$  0.018  0.016  0.0237  0.0224  0.0192  $$c_{2}$$  2.5  3.27  2.57  2.61  2.58  $$\sigma$$  8  10.54  8.38  8.49  8.23  Distance  –  0.18  0.12  0.10  0.04  Distance is the average normalized distance from the true vector of parameters. TABLE 16 Model estimation    True  Coarse  Parametric Approx (OLS)  Sparse parametric Approx (LASSO)  Polynomial degree  –  –  2  3  4  $$c_{1,0}$$  35  30.26  36.34  35.68  33.21  $$c_{1,1}$$  –1.1  –1.1  –1.30  –1.26  –1.08  $$c_{1,2}$$  0.018  0.016  0.0237  0.0224  0.0192  $$c_{2}$$  2.5  3.27  2.57  2.61  2.58  $$\sigma$$  8  10.54  8.38  8.49  8.23  Distance  –  0.18  0.12  0.10  0.04     True  Coarse  Parametric Approx (OLS)  Sparse parametric Approx (LASSO)  Polynomial degree  –  –  2  3  4  $$c_{1,0}$$  35  30.26  36.34  35.68  33.21  $$c_{1,1}$$  –1.1  –1.1  –1.30  –1.26  –1.08  $$c_{1,2}$$  0.018  0.016  0.0237  0.0224  0.0192  $$c_{2}$$  2.5  3.27  2.57  2.61  2.58  $$\sigma$$  8  10.54  8.38  8.49  8.23  Distance  –  0.18  0.12  0.10  0.04  Distance is the average normalized distance from the true vector of parameters. Acknowledgements I am very thankful to Jan De Loecker, Eduardo Morales, Richard Rogerson for many helpful discussions. The article has also gained a lot from comments by John Asker, Panle Jia Barwick, Allan Collard-Wexler, Gene Grossman, Tom Holmes, Bo Honore, Eugenio Miravete, Ariel Pakes, Paul Scott, Junichi Suzuki, as well as the editor and three anonymous referees. Charis Katsiardis and Daniel Goetz provided excellent research assistance. Sarinka Parry-Jones and Natalie Burrows at Clarksons have been extremely helpful. Finally, I gratefully acknowledge the support of the National Science Foundation (under grant SES-1426933), as well as of the Griswold Center for Economic Policy Studies at Princeton University. supplementary data Supplementary data are available at Review of Economic Studies online. REFERENCES AGUIRREGABIRIA V. and MIRA P. ( 2007), “Sequential Estimation of Dynamic Discrete Games”, Econometrica , 75, 1– 53. Google Scholar CrossRef Search ADS   AMEMIYA T. ( 1984), “Tobit Models: A Survey”, Journal of Econometrics , 24, 3– 61. Google Scholar CrossRef Search ADS   BALDWIN R. E. and FLAM H. ( 1989), “Strategic Trade Policy in the Market for 30-40 Seat Commuter Aircraft”, Weltwritschaftliches Archiv Band , 125, 484– 500. Google Scholar CrossRef Search ADS   BALDWIN R. E. and KRUGMAN P. R. ( 1987a), “Market Access and International Competition: A Simulation Study of 16K Random Access Memory”, in Feenstra R. C., (ed.) Empirical Methods for international Trade  ( Cambridge: The MIT Press). BALDWIN R. E. and KRUGMAN P. R. ( 1987b), “Industrial Policy and International Competition in Wide-Bodied Aircraft” in Baldwin R. E., (ed.) Trade Policy lssues and Empirical Analysis  ( Chicago: University of Chicago Press for the National Bureau of Economic Research). BAJARI P., LANIER C. B. and LEVIN J. ( 2007), “Estimating Dynamic Models of Imperfect Competition”, Econometrica , 75, 1331– 1370. Google Scholar CrossRef Search ADS   BARWICK JIA P. and PATHAK P. A. ( 2012), “The Costs of Free Entry: An Empirical Study of Real Estate Agents in Greater Boston” ( Working Paper). BELLONI A. and CHERNOZHUKOV V. ( 2011), “High-Dimensional Sparse Econometric Models, an Introduction” ( Springer Lecture Notes). BENKARD L. C. ( 2004), “A Dynamic Analysis of the Market for Wide-Bodied Commercial Aircraft”, Review of Economic Studies , 71, 581– 611. Google Scholar CrossRef Search ADS   BERRY S., LEVINSOHN J. and PAKES A. ( 1995), “Automobile Prices in Market Equilibrium”, Econometrica , 7, 841– 890. Google Scholar CrossRef Search ADS   BERRY S., LEVINSOHN J. and PAKES A. ( 1999), “Voluntary Export Restraints on Automobiles: Evaluating a Trade Policy”, American Economic Review , 89, 400– 430. Google Scholar CrossRef Search ADS   BRANDER J. E. ( 1995), “Strategic Trade Policy”, in Grossman G. and Rogoff K., (eds), Handbook of International Economics , Vol. 3, Ch. 27 ( Amsterdam: Elsevier BV) 1395– 1455. CHEN J., ESTEBAN S. and SHUM M. ( 2013), “When do Secondary Markets Harm Firms?”, American Economic Review , 103, 2911– 2934. Google Scholar CrossRef Search ADS   COLLARD-WEXLER A. ( 2013), “Demand Fluctuations in the Ready-Mix Concrete Industry”, Econometrica , 81, 1003– 1037. Google Scholar CrossRef Search ADS   COLLINS G. and GRUBB M. C. ( 2008), “A Comprehensive Survey of China’s Dynamic Shipbuilding Industry” ( China Maritime Studies Institute. U.S. Naval War College). DORASZELSKI U. and SATTERTHWAITE M. ( 2009), “Computable Markov-perfect Industry Dynamics”, Rand Journal of Economics , 41, 215– 243. Google Scholar CrossRef Search ADS   ERICSON R. and PAKES A. ( 1995), “Markov-Perfect Industry Dynamics: A Framework for Empirical Work”, The Review of Economic Studies , 62, 53– 82. Google Scholar CrossRef Search ADS   GOLDBERG K. P. ( 1995), “Product Differentiation and Oligopoly in International Markets: The Case of the U.S. Automobile Industry”, Econometrica , 63, 891– 951. Google Scholar CrossRef Search ADS   GROSSMAN M. G. ( 1990), “Promoting New Industrial Activities: A Survey of Recent Arguments and Evidence”, OECD Economic Studies , 14, 87– 126. HALEY C. V. U. and HALEY G. T. ( 2013), “Subsidies to Chinese Industry: State Capitalism, Business Strategy and Trade Policy” ( New York, NY: Oxford University Press), Google Scholar CrossRef Search ADS   HAMILTON J. D. ( 1994), “Time Series Analysis” ( Princeton, NJ: Princeton University Press). HOPENHAYN H. ( 1992), “Entry, Exit and Firm Dynamics in Long Run Equilibrium”, Econometrica , 60, 1127– 1150. Google Scholar CrossRef Search ADS   HOTZ V. J. and MILLER R. A. ( 1993), “Conditional Choice Probabilities and the Estimation of Dynamic Models”, Review of Economic Studies , 60, 497– 529. Google Scholar CrossRef Search ADS   JOFRE-BONET M. and PESENDORFER M. ( 2003), “Estimation of a Dynamic Auction Game”, Econometrica , 71, 1443– 1489. Google Scholar CrossRef Search ADS   JOVANOVIC B. ( 1982), “Selection and the Evolution of the Industry”, Econometrica , 50, 649– 670. Google Scholar CrossRef Search ADS   KALOUPTSIDI M. ( 2014), “Time to Build and Fluctuations in Bulk Shipping”, American Economic Review , 104, 564– 608. Google Scholar CrossRef Search ADS   KALOUPTSIDI M., SCOTT P. T. and SOUZA-RODRIGUES E. ( 2016), “Identification of Counterfactuals in Dynamic Discrete Choice Models” ( NBER Working Paper 21527). KRUSSEL P. and SMITH A. A. ( 1998), “Income and Wealth Heterogeneity in the Macroeconomy”, Journal of Political Economy , 106, 867– 896. Google Scholar CrossRef Search ADS   KYDLAND E. F. and PRESCOTT E. C. ( 1982), “Time to Build and Aggregate Fluctuations”, Econometrica , 50, 1345– 1370. Google Scholar CrossRef Search ADS   OECD. ( 2008), The Shipbuilding Industry in China . C/WP6(2008)7/REV1. PAKES A., MICHAEL O. and BERRY S. T. ( 2007), “Simple Estimators for the Parameters of Discrete Dynamics Games”, The RAND Journal of Economics , 38, 373– 399. Google Scholar CrossRef Search ADS   PATHAK P. ( 2015), “The Cost of Free Entry: An Empirical Analysis of Real Estate Agents in Greater Boston”, RAND Journal of Economics , 46, 103– 145. Google Scholar CrossRef Search ADS   RYAN P. S. ( 2012), “The Costs of Environmental Regulation in a Concentrated Industry”, Econometrica , 80, 1019– 1062. Google Scholar CrossRef Search ADS   STOPFORD M. ( 1999), Maritime Economics  ( New York: Routledge). SYKES A. O. ( 2009), “The Questionable Case for Subsidies Regulation: A Comparative Perspective” ( Stanford University School of Law, Law and Economics Research Paper Series, no. 380). THOMPSON P. ( 2001), “How Much Did the Liberty Shipbuilders Learn? New Evidence for an Old Case Study”, Journal of Political Economy , 109, 103– 137. Google Scholar CrossRef Search ADS   UNCTAD. ( 2009), Review of Maritime Transport . New York and Geneva: UNCTAD. U.S.–CHINA ECONOMIC AND SECURITY REVIEW COMMISSION. ( 2011), Annual Review . US INTERNATIONAL TRADE COMMISSION. ( 2008), Antidumping and Countervailing Duty Handbook . 13th edn. ( Publication 4056, Washington, DC). WOOLDRIDGE J. ( 2001), “Econometric Analysis of Cross Section and Panel Data” ( Cambridge, MA: The MIT Press). WTO. ( 2006), Agreement on Subsidies and Countervailing Measures . WTO. ( 2006), World Trade Report . XU Y. D. ( 2008), “A Structural Empirical Model of R&D, Firm Heterogeneity and Industry Evolution” ( Working Paper). Footnotes 1. The share of labour intensive products in Chinese exports fell from $$37\%$$ to $$14\%$$ between $$2000$$ and $$2010$$. On a monthly basis, in $$2011$$ the U.S. imported advanced-technology products from China $$560\%$$ more than it exported to China. In contrast, the monthly U.S.–China trade surplus in scrap (used as raw material) grew by $$1187\%$$ between $$2000$$ and $$2010$$. (U.S.–China Economic and Security Review Commission, 2011). 2. “China is the workshop to the world. It is the global economy’s most formidable exporter and its largest manufacturer. The explanations for its success [include the] seemingly endless supply of cheap labour [...] another reason for China’s industrial dominance: subsidies” (“Perverse Advantage”, The Economist, April $$2013$$). 3. WTO (2006). 4. Section 2 provides details. 5. For example, if the industry under consideration involved oligopolists that produce differentiated products, one could estimate a differentiated product demand system ($$e.g.$$Berry et al., 1995) to obtain the cost curve from the firms’ first-order conditions and test for a break. 6. Subsidy disputes are handled both bilaterally ($$e.g.$$ in the U.S. the relevant agencies are the Department of Commerce and the International Trade Commission (ITC)), as well as internationally through the WTO. “The ITC received a total of 1,606 antidumping and countervailing duty petitions during 1980-2007. These cases involved over $65 billion in imports. Thirty-eight percent of the petitions resulted in affirmative determinations by the Commission and Commerce, culminating in the issuance of an antidumping or countervailing duty order” (ITC, 2008) 7. One possible source of information (other than $$e.g.$$ national accounts or individual government measures (Sykes, 2009)) are subsidy “notifications” required by the WTO under SCM; however “the data contain many gaps and shortcomings. Not all Members fulfill the notification requirements. 29 of the currently 149 WTO Members have so far not submitted any notification. [...] Other Members do not provide quantitative information on subsidy programmes or do not provide this information systematically. In most years, information is available for less than half of the WTO membership” (WTO, 2006) 8. For example, in the U.S. “an interested party may file a petition with ITC and Commerce alleging that an industry in the United States is materially injured or threatened with material injury, or that the establishment of an industry is materially retarded, by reason of imports that are being subsidized” (ITC, 2008). “Material injury” amounts to observing the volume of imports, as well as their impact on the domestic industry’s prices and production. WTO’s SCM has similar provisions: complaining countries must provide evidence of adverse trade effects in their own or third country markets (as well as evidence on the existence and nature of subsidies as mentioned above). 9. The rest of the section draws from Haley and Haley (2013). 10. In its 2014 report the U.S.–China Economic and Security Commission (a U.S. government body established to monitor and investigate national security implications of the bilateral trade and economic relationship between the U.S. and China) states that “a full identification of support programmes was not possible for the Secretariat, as they are often the result of internal administrative measures that are not always easy to identify and generally only available in Chinese. In addition, the budget is not a public document hence it is not possible to identify outlays”. 11. This section draws from OECD (2008), Collins and Grubb (2008) and Sykes (2009). 12. The plan was introduced by the National Development and Reform Commission (NDRC) and the Commission of Science, Technology and Industry for National Defense (COSTIND). 13. A significant portion of CSSC and CSIC financial resources have been devoted to expanding China’s shipbuilding infrastructure. As an example, “CSSC had multibillion-dollar projects under way to build new shipyards on Changxing Island in Shanghai and Longxue Island in Guangzhou” (Collins and Grubb, 2008). 14. The maximum market share ever observed in my sample is about 11%, conditional on positive production. 15. Note that each individual shipyard is too small to affect the total fleet (stock of ships) with its period $$t$$ production. 16. The trade-off lies in having more firms, each of whom produces less. To be specific, the equilibrium price is decreasing in $$N_{c}$$ ($$i.e.$$$$\partial P/\partial N_{c}\leq0$$) if and only if $$c_{c1}-c_{nc1}\leq2c_{2}\left(\eta_{0t}-c_{c1}\right)/\eta_{1}N_{nc}$$, ignoring $$\kappa$$. This is less likely to hold if the market is of low value ($$c_{1c}$$ is close to $$\eta_{0t}$$) or the number of non-Chinese firms is very large. 17. That is $$\overline{s_{t}^{o}}$$ is a $$A\times||\mathcal{C}||$$ matrix whose $$(i,j)$$ element is the number of ships that are $$i$$ years old and built in country $$j$$. 18. For instance, suppose no ship takes more than eight periods to be built; then row $$j$$ of $$b_{t}$$ is $$[b_{jt1},b_{jt2},...,b_{jt8}]$$ and indicates the scheduled deliveries of yard $$j$$ for the next eight periods. 19. The steel price is part of the state because it: (1) is a key determinant of shipyard production costs; (2) determines the ship’s scrap value. 20. More accurately, ship profits from freight are $$\pi^{o}\left(s_{it}^{o},\overline{s_{t}^{o}},d_{t}\right)$$; even though the backlog and steel prices do not affect current payoffs, they affect state transitions and scrap values and are thus part of the state. 21. Shipowners scrap their ships by selling them to scrapyards where they are dismantled and their steel hull is recycled. 22. Allowing for endogenous exit is straightforward (see Kalouptsidi, 2014). 23. Note that each $$b_{jt}$$, all $$j$$, is a row in the total backlog matrix $$b_{t}$$. 24. I consider the number of orders as the relevant choice variable (as opposed to using the number of deliveries or smoothing orders) because the observed ship prices are paid at the order date and may be dramatically different from the prevailing prices at the delivery date. 25. Suppose the backlog of yard $$j$$ at $$t$$ is $$b_{jt}=[b_{jt1},b_{jt2},...,b_{jt\bar{T}}]$$ and that the yard received $$q_{jt}$$ orders to be delivered at $$T_{jt}=t+k$$ periods with $$k\neq\bar{T}$$. Then, at $$t+1$$, the yard delivered $$b_{jt1}$$ and $$b_{jt+1}=[b_{jt2},...,b_{jt+k},q_{jt+1}+b_{jt+k+1},b_{jt+k+2},...,b_{jt\bar{T}},0]\in\mathbb{R^{\bar{T}}}$$. 26. Here, the shipyard’s backlog also affects its demand, as it increases the time shipowners have to wait for delivery. 27. Note that the willingness to pay for a new ship from yard $$j$$ depends only on its country of origin, not $$j$$ itself. The hundreds of shipyards encountered in the data make this generalization impossible in this empirical application. 28. I am essentially assuming that China is the only country subsidizing. Alternatively, one can think of the recovered subsidies as the differential pre/post China and in comparison to other countries. 29. Because of time to build, I subtract a number of years from every first delivery year of all shipyards, after consulting with Clarksons’ analysts. The results reported subtract $$3$$ years (similar findings were obtained when the number of years subtracted varies). 30. I manually collected the extra data for before 2006 from a Clarksons publication (‘World Shipyard Monitor’). 31. I have unsuccessfully searched extensively in industry publications for alternative explanations. 32. Pointwise confidence intervals are computed via $$500$$ bootstrap samples, with the resampling done on the error. 33. As $$VE_{j}^{o}\left(s_{t}\right)$$ is estimated, to compute standard errors I create 500 bootstrap samples by re-drawing $$\left(q_{jt},s_{jt}^{y},s_{t}\right)$$ and combine them with the $$500$$ samples drawn to compute confidence intervals for $$VE_{j}^{o}\left(s_{t}\right)$$. I have also used the block-boostrap where I drew shipyards with replacement, and standard errors are unaffected. Finally, I follow Amemiya $$\left(1984\right)$$ and maximize the likelihood over $$\left(\frac{1}{\sigma},\frac{c_{1}}{\sigma},\frac{c_{2}}{\sigma}\right)$$ rather than $$\left(c_{1},c_{2},\sigma\right)$$. 34. Consistent estimation of the backlog coefficient crucially relies on the assumption that $$\epsilon_{jt}$$ is iid. Indeed, serial correlation in $$\epsilon$$ would imply that firm $$j$$’s backlog at $$t$$ (which is determined by past cost shocks) would be correlated with the current $$\epsilon_{jt}$$; this correlation would lead to a biased backlog coefficient and instruments would be required. 35. Roughly, we have $$\bar{q}$$ equations (the choice probabilities) in the following unknowns: $$c_{0},...,c_{10},c_{1}',\sigma$$. As shown in Kalouptsidi et al. (2016), variation in $$VE$$ and $$s_{jt}^{y},s_{t}$$ identifies $$\sigma$$ and the corresponding part of $$c_{1}'$$. Thus we are left with $$\bar{q}-1$$ equations in $$\bar{q}+2$$ unknowns ($$c_{0},...,c_{\bar{q}}$$ and the constant in $$c_{1}'$$). I either fix the first term ($$c_{0}$$) to zero and force the last terms ($$e.g.$$$$c_{\bar{q}},c_{\bar{q}-1}$$) to obey a quadratic; or I allow $$c_{0}$$ to be different from zero and force more terms to follow a quadratic. I later generalize the quadratic assumption. 36. I have also considered the following specification:   \begin{equation} c(q,s_{jt}^{y},s_{t})=c_{1}'g_{1}\left(s_{jt}^{y},s_{t}\right)+c_{q}1\left\{ q\leq\check{q}\right\} +c_{2}q^{2}1\left\{ q>\check{q}\right\} \end{equation} (18) which is in the spirit of Ryan (2012). This specification explores the possibility of a kink in the cost function possibly capturing capacity constraints. Since $$q$$ is discrete, I estimate this for $$\check{q}=1,...,\bar{q}-1$$; note that when $$\check{q}=1$$ we obtain the baseline quadratic model, while when $$\check{q}=\bar{q}-1$$ we get (17) of Column II of Table 5. I find that there is no evidence of a kink, which suggests that in fact, we have a fairly smooth convex function (results available upon request). This does not seem unreasonable: a shipyard can always delay production; hire more workers, increase man-hours. 37. The capacity investment decision is not studied in this article. This does not bias cost (and thus subsidy) estimates in the static model (the first-order condition for optimal production is unaffected by shipyard dock choice). If yards were to choose both production and docks dynamically however, the first-order conditions are no longer decoupled, since firms consider the future impact of all their actions. 38. To show this, begin with $$p_{0}\left(x\right)=1-\Phi\left(A\left(x,0\right)\right)$$, so that $$A\left(x,0\right)=\Phi^{-1}\left(1-p_{0}\left(x\right)\right)$$. Next, $$p_{1}\left(x\right)=\Phi\left(A\left(x,0\right)\right)-\Phi\left(A\left(x,1\right)\right)=1-p_{o}\left(x\right)-\Phi\left(A\left(x,1\right)\right)$$, so that $$A\left(x,1\right)=\Phi^{-1}\left(1-p_{0}\left(x\right)-p_{1}\left(x\right)\right)$$. The general case follows by induction. 39. In the “no interventions” counterfactuals, shipowners maintain their pre-2006 expectations and ship value functions. In the “no entrants” counterfactuals, I assume that shipowners switch to the post-2006 expectations and value functions. In other words, shipowners understand that a change occurred; yet they cannot distinguish between different policies. I feed the observed post-$$2006$$ values for shipping demand and steel prices into the model and simulate shipyard optimal production and ship prices. 40. The cargo shipping demand curve gives the price per voyage as a function of the total number of voyages. I assume that there is a constant fleet utilization rate to map the fleet into voyages. 41. Consider the cost function $$c_{1j}q+c_{2}q^{2}$$, for $$j\in\left\{ \text{China, EU, Japan, South Korea}\right\} $$, which for China becomes $$\left(c_{1China}-\kappa\right)q+c_{2}q^{2}$$ post-2006. The change in the industry average cost because of subsidies is equal to a sum of two terms: the first is $$\kappa\left(q_{China}/\left(q_{China}+q_{EU}+q_{Japan}+q_{SK}\right)\right)$$, while the second includes all other terms and is related to the reallocation of production and market share. The first term equals $$3.44$$ million. 42. Note that the backlog accumulation rule here is identical to a learning and forgetting model (Benkard, 2004), with the forgetting determined by the backlog depreciation parameter. 43. The data does not distinguish the age of ships that are older than 20 years old so this last age group is an absorbing age category. 44. This transition is not deterministic since the data are quarterly rather than annual. However, the recovered coefficients imply a close to uniform age distribution across quarters within each year. Moreover, I use lower degree approximations in the value functions because of the much larger number of states. 45. I do not perform counterfactuals with robustness specification II, as it is not clear how to construct the “true” transition of the mean age and total fleet from firm decisions. 46. Twenty penalty values are generated automatically by the LASSO command and are spaced out between the value that holds all regressors and the value that removes all regressors. Results for all values of $$\lambda$$ are available upon request. 47. It is worth noting that this can be generalized to the time-varying VAR model $$s_{t}=R_{0t}+R_{t}s_{t-1}+\xi_{t}$$, versions of which are employed in the robustness analysis of Section 7. Indeed, in that case we have:   \[ s_{t+T}=\phi\left(t+T,t\right)s_{t}+\sum_{k=t+1}^{t+T}\phi\left(t+T,k\right)\left(R_{0k}+\xi_{k}\right), \] where   \[ \phi\left(t+T,k\right)=\left\{ \begin{array}{c} R_{t+T}R_{t+T-1}...R_{k+1},\quad\text{for}k