Abstract In this article, I study how relative wages and border enforcement affect immigration from Mexico to the U.S. To do this, I develop a discrete choice dynamic programming model where people choose from a set of locations in both the U.S. and Mexico, while accounting for the location of one’s spouse when making decisions. I estimate the model using data on individual immigration decisions from the Mexican Migration Project. Counterfactuals show that a 10% increase in Mexican wages reduces migration rates and durations, overall decreasing the number of years spent in the U.S. by about 5%. A 50% increase in enforcement reduces migration rates and increases durations of stay in the U.S., and the overall effect is a 7% decrease in the number of years spent in the U.S. 1. Introduction Approximately 11 million Mexican immigrants were living illegally in the U.S. in 2015 (Krogstad et al., 2017). This large migrant community affects the economies of both countries. For example, migrants send remittances back home, which support development in Mexico.1 In the U.S., concern about illegal immigration affects political debate and policy. Border enforcement has been increasing since the mid-1980s, and it grew by a factor of 13 between 1986 and 2002 (Massey, 2007). This was a major issue in the 2016 presidential election, where President Trump campaigned on the promise of a wall between the two countries to cut down on illegal immigration. Despite these large concerns about illegal immigration from Mexico, much about the individual decisions and mechanisms remains poorly understood. In this article, I study how wage differentials and U.S. border enforcement affect an individual’s immigration decisions. Given the common pattern of repeat and return migration in the data, changes in policy affect both current and future decisions. For example, increased enforcement not only reduces initial migration rates, but also increases the duration of stay in the U.S. by making it more costly for people to come back to the U.S. after returning home. To capture such intertemporal effects, I analyse this problem in a dynamic setting where people choose from multiple locations each period, following Kennan and Walker’s (2011).2 The model extends Kennan and Walker’s (2011)’s framework in two dimensions. First, I allow for moves across an international border, where people choose from a set of locations which includes both states in the U.S. and in Mexico, necessitating different treatment of illegal and legal immigration. By observing individual legal status, where illegal immigrants crossed the border, and U.S. border enforcement, which varies across locations and time, I can capture various trade-offs of immigration decisions. Secondly, I allow for interactions within the decisions of husbands and wives. The data show that this is important, in that 5.7% of women with a husband in the U.S. move each year, compared to on overall female migration rate of 0.6%, suggesting a positive utility of living in the same place.3 Therefore, a married man living in the U.S. alone will consider the likelihood that his wife will join him, which is endogenous given that she also makes active decisions. This affects reactions to the policy environment. For example, as enforcement increases, a married man living in the U.S. alone knows that his wife is less likely to join him, giving him an extra incentive to return to Mexico. To capture these types of mechanisms, we need a model that allows for interactions within married couples. The most similar paper on Mexico–U.S. immigration is Thom (2010), which estimates a dynamic migration model where men choose which country to live in, focusing on savings decisions as an incentive for repeat and return migration.4 In comparison, in my model, people choose from multiple locations in both countries, allowing for both internal and international migration. I also allow for a relationship between the decisions of married couples, enabling me to study how family interactions affect the counterfactual outcomes. Gemici (2011) studies family migration by estimating a dynamic model of migration decisions with intra-household bargaining using U.S. data. In her model, married couples make a joint decision on where to live together, whereas the data from Mexico show that couples often live in different locations. In this article, I estimate a discrete-choice dynamic programming model where individuals choose from a set of locations in Mexico and the U.S. in each period. Individuals’ choices depend on the location of their spouse. To make this computationally feasible, I model household decisions in a sequential process: first, the household head picks a location, and then the spouse decides where to live. The model differentiates between legal and illegal immigrants, who face different moving costs and a different wage distribution in the U.S.5 Border enforcement, measured as the number of person-hours spent patrolling the border, affects the moving cost only for illegal immigrants. To evaluate the effectiveness of border enforcement, I use a new identification strategy, which accounts for the variation in the allocation of enforcement resources along the border and over time. In the model, individuals who move to the U.S. illegally also choose where to cross the border. The data show that as enforcement at the main crossing point increased, migrants shifted their behaviour and crossed at alternate points.6 Past work, which for the most part uses aggregate enforcement levels, misses this component of the effect of increased border patrol on immigration decisions. I estimate the model using data on individual immigration decisions from the Mexican Migration Project (MMP). I use the estimated model to perform several counterfactuals, finding that increases in Mexican wages decrease both immigration rates and the duration of stays in the U.S. A 10% increase in Mexican wages reduces the average number of years that a person lives in the U.S. by about 5%. Estimation of a dynamic model captures mechanisms that could not be studied in a static model. As enforcement increases, fewer people move, but those that do are more reluctant to return home, knowing that it will be harder to re-enter the U.S. in the future. This increases the duration of stays in the U.S. Policy changes also have differential effects with marital status. As enforcement increases, it becomes harder for women to join their husbands in the U.S., giving married men an extra incentive to return home, and thereby pushing their migration durations downwards. I hold female migration rates constant in the counterfactual to isolate this effect, and then see an even larger increase in men’s durations of stay in the U.S. Overall, simulations show that a 50% increase in enforcement, distributed uniformly along the border, reduces the average amount of time that an individual in the sample spends in the U.S. over a lifetime by approximately 3%. If total enforcement increased by 50%, not uniformly but instead concentrated at the points along the border where it would have the largest effect, the number of years spent in the U.S. per person would decrease by about 7%. Following U.S. policy changes in the 1990s, most new resources were allocated to certain points along the border, and this research suggests that this is the optimal policy from the perspective of reducing illegal immigration rates. The remainder of the article is organized as follows. Section 2 reviews the literature, and Section 3 explains the model. Section 4 details the data, and Section 5 provides descriptive statistics. The estimation is explained in Section 6, and the results are in Section 7. The counterfactuals are in Section 8, and Section 9 concludes the article. 2. Related Literature Wages are understood to be the main driving force behind immigration from Mexico to the U.S. Hanson and Spilimbergo (1999) find that an increase in U.S. wages relative to Mexican wages positively affects apprehensions at the border, implying that more people attempted to move illegally. Rendón and Cuecuecha (2010) estimate a model of job search, savings, and migration, finding that migration and return migration depend not only on wage differentials, but also on job turnover and job-to-job transitions. In my model, the value of a location depends on expected earnings there, allowing for wage differentials to affect migration decisions. I can quantify how responsive migration decisions are to changes in the wage distribution. To estimate the effect of border enforcement on immigration decisions, some research uses the structural break caused by the 1986 Immigration Reform and Control Act (IRCA), one of the first policies aimed at decreasing illegal immigration. This law increased border enforcement and legalized many illegal immigrants living in the U.S. Espenshade (1990, 1994) finds that there was a decline in apprehensions at the U.S. border in the year after IRCA was implemented, but no lasting effect. Using survey data from communities in Mexico, Cornelius (1989) and Donato et al. (1992) find that IRCA had little or no effect on illegal immigration. After the implementation of IRCA, there was a steady increase in border enforcement over time. Hanson and Spilimbergo (1999) find that increased enforcement led to a greater number of apprehensions at the border. This provides one mechanism for increased enforcement to affect moving costs, as immigrants may have to make a greater number of attempts to successfully cross the border. Changes in enforcement can affect not only initial but also return migration decisions, and some of the past literature has looked at this. Angelucci (2012), using the MMP data, finds that border enforcement affects initial and return migration rates. Her framework permits analysis of initial and return migration decisions separately using a reduced-form framework. By estimating a structural model, I can perform counterfactual analyses to calculate the net effect of changes in enforcement on illegal immigration. The model in this article allows for an individual’s characteristics to affect migration decisions. Past literature has studied this, mostly in a static setting, to understand what factors are important. I build on this work by including the relevant characteristics found to impact migration decisions in my dynamic setting. There is a large literature on the selection of migrants, starting with the theoretical model in Borjas (1987), which predicts that migrants will be negatively selected. This is empirically supported in Ibarraran and Lubotsky (2005). However, Chiquiar and Hanson (2005) find that Mexican immigrants in the U.S. are more educated than non-migrants in Mexico. They find evidence of intermediate selection of immigrants, as do Lacuesta (2006) and Orrenius and Zavodny (2005). Past work also looks at the determinants of the duration of stays in the U.S.; for example, see Reyes and Mameesh (2002), Massey and Espinosa (1997), and Lindstrom (1996). 3. Model The basic structure of the model follows Kennan and Walker’s (2011), where each person chooses where to live each period. The value of living in a location depends on the expected wages there, as well as the cost of moving. Since the model is dynamic, individuals also consider the value of being in each location in future periods. At the start of a period, each person sees a set of payoff shocks to living in each location, and then chooses the location with the highest total valuation. The shocks are random, independent and identically distributed (i.i.d.) across locations and time, and unobserved by the econometrician. I assume that the payoff shocks follow a type I extreme value distribution, and solve the model following McFadden (1973) and Rust (1987). I assume a finite horizon, so the model can be solved using backward induction. The model extends Kennan and Walker’s (2011)’s framework in two dimensions: (1) by allowing for moves across an international border, which necessitates different treatment of illegal and legal immigration, and (2) modelling the interactions within married couples. The model includes elements to account for the fact that people are moving across an international border, which is different than domestic migration in a couple of important ways. When deciding where to live, people choose from a set of locations, defined as states, in both the U.S. and in Mexico. Migration decisions are substantially affected by whether or not people can move to the U.S. legally, and to account for this, the model differentiates between legal and illegal migrants. Legal immigration status is assumed to be exogenous to the model, and people can transition to legal status in future periods. Legal immigration status affects wage offers in the U.S., since we expect that legal immigrants will have access to better job opportunities in the U.S. labour market. In addition, U.S. border enforcement only affects the moving costs for illegal immigrants. I assume that all people who choose to move to the U.S. illegally are successful, so the effects of increased enforcement just come through the increased moving cost.7 This is due to an increased cost of hiring a smuggler (Gathmann, 2008) or an increase in the expected number of attempts before successfully crossing. Illegal immigrants moving to the U.S. choose both a location and a border crossing point, where the cost of moving varies at each crossing point due to differences in the fixed costs and enforcement levels at each point.8 In this article, I also extend Kennan and Walker’s (2011)’s framework by allowing for the decisions of married individuals to depend on where their spouse is living. Decisions are made individually, but utility depends on whether a person is in the same location as his spouse. Since individuals’ decisions are related, this is a game between the husband and wife. I solve for a Markov perfect equilibrium (Maskin and Tirole, 1988). I make some assumptions on the timing of decisions to ensure that there is only one equilibrium. For each household, I define a primary and a secondary mover, which empirically is the husband and wife, respectively. In each period, the primary mover picks a location first, so he does not know his spouse’s location when he makes this choice. After the primary mover makes a decision, the secondary mover learns her payoff shocks and decides where to live.9 This setup allows for people to make migration decisions that are affected by the location of their spouse. Single people’s decisions are not affected by a spouse, but they can transition over marital status in future periods, and therefore know that at some point they could have utility differentials based on their spouse’s location. In the remainder of this section, I describe a model without any unobserved heterogeneity. In the estimation, there will be three sources of unobserved heterogeneity, over (1) moving costs, (2) wages in the U.S., and (3) whether or not women choose to participate in the labour market. This is explained in more detail when I discuss the estimation in Section 6. 3.1. Model setup 3.1.1. Primary and secondary movers I solve separate value functions for primary and secondary movers, denoted with superscripts 1 and 2, respectively. In the empirical implementation, men are the primary movers, and women are the secondary movers. A married person’s decisions depend on the location of his spouse, whose characteristics I denote with the superscript $$s$$. Single men and women make decisions as individuals, but know that they could become married in future periods. I account for these differences by keeping track of marital status $$m_t$$, where $$m_t=1$$ is a married person and $$m_t=2$$ is a single person. 3.1.2. State variables People learn their legal status at the start of each period. I assume that once a person is able to immigrate legally, this option remains with that person forever. I use $$z_t$$ to indicate whether or not a person can move to the U.S. legally, where $$z_t=1$$ means a person can move to the U.S. legally and $$z_t=2$$ means that he cannot. State variables also include a person’s location in the previous period ($$\ell_{t-1}$$), their characteristics $$X_t$$, and their marital status $$m_t$$. When a married secondary mover picks a location, the primary mover has already chosen where to live in that period, so the location of the spouse ($$\ell_t^s$$) is known and is part of the state space. For the primary mover, who makes the first decision, the location of the spouse in the previous period ($$\ell_{t-1}^s$$) is part of the state space. The characteristics and legal status of one’s spouse ($$X_t^s$$ and $$z_t^s$$) are also part of the state space. To simplify notation, denote $$\Delta_t$$ as the characteristics and legal status of an individual and his spouse, so $$\Delta_t=\{X_t,z_t,X^s_t,z^s_t\}$$. 3.1.3. Choice set Denote the set of locations in the U.S. as $$J_{U}$$, those in Mexico as $$J_{M}$$, and the set of border crossing points as $$C$$. If moving to the U.S. illegally, a person has to pick both a location and a border crossing point. Denote the choice set as $$J(\ell_{t-1},z_t)$$, where \begin{eqnarray} J(\ell_{t-1},z_t)=\left\{ \begin{array}{ll} J_M\cup (J_U\times C) & \text{if } \ell_{t-1}\in J_M \text{ and } z_t=2\\[3pt] J_M\cup J_U & \text{otherwise.} \end{array}\right. \end{eqnarray} (1) 3.1.4. Payoff shocks I denote the set of payoff shocks at time $$t$$ as $$\eta_t=\{\eta_{jt}\}$$, where $$j$$ indexes locations. I assume that these follow an extreme value type I distribution. 3.1.5. Utility The utility flow depends on a person’s location $$j$$, characteristics $$X_t$$, legal status $$z_t$$, marital status $$m_t$$, and spouse’s location $$\ell_t^s$$, and it is written as $$u(j,X_t,z_t,m_t,\ell_t^s)$$. This allows for utility to depend on wages, which are a function of a person’s characteristics and location. Utility also depends on whether or not a person is at his home location, and increases for married couples who are living in the same place. 3.1.6. Moving costs The moving cost depends on which locations a person is moving between, and that person’s characteristics and legal status. I denote the cost of moving from location $$\ell_{t-1}$$ to location $$j$$ as $$c_t(\ell_{t-1},j,X_t,z_t)$$. The moving cost is normalized to zero if staying at the same location. 3.1.7. Transition probabilities There are transitions over legal status, spouse’s location for married couples, and marital status for people who are single.10 The primary mover is uncertain of his spouse’s location in the current period. For example, if he moves to the U.S., he is not sure whether or not his wife will follow. The secondary mover knows her spouse’s location in the current period, but is unsure of her spouse’s location in the next period. For example, she may move to the U.S. to join her husband, but does not know whether or not he will remain there in the next period. Single people can get married in future periods. Furthermore, if someone gets married, he does not know where his new spouse will be living. Marrying someone who is living in the U.S. will affect decisions differently than marrying someone who is in Mexico. For the primary mover, denote the probability of being in the state with legal status $$z_{t+1}$$, marital status $$m_{t+1}$$, and having a spouse in location $$\ell^s_{t}$$ in this period as $$\rho^1_{t}(z_{t+1},m_{t+1},\ell^s_{t}| j,\Delta_t,m_t,\ell_{t-1}^s)$$. This depends on his location $$j$$, his characteristics, as well as his marital status and his spouse’s previous-period location (if married). For the secondary mover, the transition probability is written as $$\rho^2_{t}(z_{t+1},m_{t+1},\ell^s_{t+1}| j,\Delta_t,m_{t},\ell_{t}^s)$$. 3.2. Value function In this section, I derive the value functions for primary and secondary movers. Because the problem is solved by backward induction and the secondary mover makes the last decision, it is logical to start with the secondary mover’s problem. 3.2.1. Secondary movers The secondary mover’s state space includes her previous-period location, her characteristics and those of her spouse, her marital status, and the location of her spouse. After seeing her payoff shocks, she chooses the location with the highest value: \begin{equation} V^2_t(\ell_{t-1},\Delta_t,m_t,\ell_t^s, \eta_t)= \max_{j\in J(\ell_{t-1},z_t)} v^2_t(j,\ell_{t-1},\Delta_t,m_t,\ell_t^s) +\eta_{jt}. \label{eqn:VF2} \end{equation} (2) The value of living in each location has a deterministic and a random component ($$v_t^2(\cdot)$$ and $$\eta_t$$, respectively). The deterministic component of living in a location consists of the flow payoff plus the discounted expected value of living there at the start of the next period: \begin{eqnarray} v^2_{t}(\cdot)&=& \tilde{v}^2_t(j,\ell_{t-1},\Delta_t,m_t,\ell_t^s)+ \beta \sum_{z_{t+1},m_{t+1},\ell^s_{t+1}}\Big( \rho^2_{t}( z_{t+1},m_{t+1},\ell^s_{t+1}|j,\Delta_t,m_t,\ell_t^s)\notag\\ &\times& E_{\eta}\left[V^2_{t+1}(j,\Delta_{t+1},m_{t+1}, \ell_{t+1}^s, \eta_{t+1})\right]\Big). \label{eqn:deterministic} \end{eqnarray} (3) The flow payoff of living in location $$j$$, denoted as $$\tilde{v}_t(\cdot)$$, consists of utility net of moving costs, and is defined as \begin{equation} \tilde{v}^2_t(j,\ell_{t-1},\Delta_t,m_t,\ell_t^s)= u(j,X_t,z_t,m_t,\ell_t^s)-c_t(\ell_{t-1},j,X_t,z_t). \label{eqn:flow} \end{equation} (4) The second part of the deterministic component in equation (3) is the expected future value of living in a location. The transition probabilities, written as $$\rho^2(\cdot)$$, are over legal status, marital status, and location of primary mover. I integrate out the future payoff shocks using the properties of the extreme value distribution, following McFadden (1973) and Rust (1987). For a given legal status, marital status, and location of primary mover, the expected continuation value is given by \begin{eqnarray} &&E_{\eta}\left[V^2_{t+1} (j,\Delta_{t+1},m_{t+1},\ell^s_{t+1}, \eta_{t+1}) \right]\notag\\ &&=E_{\eta}\left[\max_{k\in J(j,z_{t+1})}v^2_{t+1} (k,j,\Delta_{t+1},m_{t+1},\ell_{t+1}^s) +\eta_{k,t+1}\right]\notag\\ &&=\log\left( \sum_{k\in J(j,z_{t+1})} \exp \Big(v_{t+1}^2(k,j,\Delta_{t+1},m_{t+1},\ell^s_{t+1} )\Big) \right)+\gamma \text{ ,} \label{eqn:expect2} \end{eqnarray} (5) where $$\gamma$$ is Euler’s constant ($$\gamma\approx 0.58$$). I calculate the probability that a person will choose location $$j$$ at time $$t$$, which will be used for two purposes. First, this is the choice probability, necessary to calculate the likelihood function. Secondly, the choice probability is used to calculate the transition probabilities for the primary mover, who is concerned with the probability that his spouse lives in a given location in this period. I assume that he has all of the same information as the secondary mover, but since the primary mover makes the first decision, the secondary mover’s payoff shocks have not yet been realized, so I can only calculate the probability that the secondary mover will make a given decision. Since I assume that the payoff shocks are distributed with an extreme value distribution, the choice probabilities take a logit form, again following McFadden (1973) and Rust (1987). The probability that a person picks location $$j$$ is given by the following formula: \begin{equation} P_t^2(j|\ell_{t-1},\Delta_t,m_t,\ell_t^s)= \frac{\exp\left(v^2_t(j,\ell_{t-1},\Delta_t,m_t,\ell_t^s)\right)} {\sum_{k\in J(\ell_{t-1},z_{t})} \exp\Big(v^2_t(k,\ell_{t-1},\Delta_t,m_t,\ell_t^s)\Big)}\text{ .}\label{eqn:secondary_prob} \end{equation} (6) 3.2.2. Primary movers I define the value function for the primary mover as follows: \begin{equation} V^1_t(\ell_{t-1},\Delta_t,m_t,\ell_{t-1}^s, \eta_t)= \max_{j\in J(\ell_{t-1},z_t)} v^1_t(j,\ell_{t-1},\Delta_t,m_t,\ell_{t-1}^s) +\eta_{jt} \text{ .} \label{eqn:VF1} \end{equation} (7) In comparison with the secondary mover, the primary mover does not know where his spouse is living in this period, and only knows her previous-period location $$\ell_{t-1}^s$$. As before, the deterministic component of living in a location includes the flow utility and the expected continuation value. However, in this case, I do not know the exact flow utility, since the secondary mover’s location has not been determined. I instead calculate the expected flow utility: \begin{eqnarray} &&E_{\ell_t^s}\left[\tilde{v}_t(j, \ell_{t-1},\Delta_t,m_t,\ell_t^s )|\ell_{t-1}^s\right]=\notag\\&& \sum_{k\in J(\ell_{t-1},z_t)} P_t^2(k|\ell_{t-1}^s,\Delta_t^s,m_t^s,j) u(j,X_t,z_t,m_t,k) -c(\ell_{t-1},j,X_t,z_t) \text{ .} \label{eqn:EU} \end{eqnarray} (8) This is calculated using the probability $$P^2_t(\cdot)$$ that the secondary mover will pick a given location, defined in equation (6). Denoting the transition probabilities as $$\rho^1(\cdot)$$, I can write the deterministic component of living in a location as: \begin{eqnarray} v^1_{t}(\cdot)&=& E_{\ell_t^s}\left[\tilde{v}_t( j,\ell_{t-1},\Delta_t,m_t,\ell_t^s )|\ell_{t-1}^s\right] + \beta \sum_{z_{t+1},m_{t+1},\ell^s_{t}}\Big(\rho^1_{t}( z_{t+1},m_{t+1},\ell^s_{t}|j,\Delta_t, m_t, \ell_{t-1}^s)\notag \\ &\times& E_{\eta}\left[V^1_{t+1}(j,\Delta_{t+1} ,m_{t+1},\ell_{t}^s, \eta_{t+1})\right]\Big). \end{eqnarray} (9) For a given state, the continuation value is calculated by integrating over the distribution of future payoff shocks: \begin{eqnarray} &&E_{\eta}\left[V^1_{t+1} (j,\Delta_{t+1},m_{t+1},\ell_{t}^s,\eta_{t+1})\right]\notag \\ &=&E_{\eta}\left[\max_{k\in J^1(j,z_{t+1})}v^1_{t+1} (k,j,\Delta_{t+1},m_{t+1},\ell_{t}^s) +\eta_{j,t+1}\right]\notag\\ &=&\log\left(\sum_{k\in J(j,z_{t+1})} \exp v_{t+1}^1\Big(k,j,\Delta_{t+1},m_{t+1},\ell_{t}^s )\Big) \right)+\gamma. \label{eqn:exp1} \end{eqnarray} (10) I calculate the probabilities that the primary mover picks each location in a period, which are used to calculate the likelihood function. They also are a part of the transition probabilities for the secondary mover. Using the properties of the extreme value distribution, the probability that a primary mover picks location $$j$$ is given by \begin{equation} P_t^1(j|\ell_{t-1},\Delta_t,m_t,\ell_{t-1}^s)= \frac{\exp\Big(v^1_t(j,\ell_{t-1},\Delta_t,m_t, \ell_{t-1}^s)\Big)} {\sum_{k\in J(\ell_{t-1},z_{t})} \exp\Big(v^1_t(k,\ell_{t-1},\Delta_t,m_t, \ell_{t-1}^s)\Big)}\text{ .}\label{eqn:primary_prob} \end{equation} (11) 3.2.3. Transition probabilities In this section, I calculate the transition probabilities. There is uncertainty over future legal status, future marital status (if single), and the location of one’s spouse (if married). I assume that the probability that a person has a given legal status in the next period depends on his characteristics and his current legal status.11 For people who are married, the transition probabilities are also over a spouse’s future decisions. I assume that the agent has the same information as the spouse about the spouse’s future decisions. This means that the probability that a person’s spouse lives in a given location is given by his choice probabilities. A single person can become married in future periods with some probability. If he gets married, there is also uncertainty over where his new spouse is living. Recall that $$\rho^1(\cdot)$$ and $$\rho^2(\cdot)$$ are the transition probabilities for primary and secondary movers. These give the probability that a person has a given legal status, marital status, and if married, has a spouse living in a certain location in the next period. \begin{eqnarray} \rho^1_t(z_{t+1},m_{t+1},\ell^s_t| \ell_t,\Delta_t,m_t, \ell_{t-1}^s)=\left\{ \begin{array}{ll} \delta(z_{t+1}|z_t,X_t)P^2_t(\ell^s_t|\ell_{t-1}^s,\Delta_t^s,m_t^s, \ell_t) & \text{if } m_t=1 \\[3pt] \delta(z_{t+1}|z_t,X_t)\psi^1(m_{t+1},\ell_t^s|X_t,\ell_t) & \text{if } m_t=2 \text{ .}\label{eqn:lambda1} \end{array} \right. \end{eqnarray} (12) \begin{eqnarray} \rho^2_t(z_{t+1},m_{t+1},\ell^s_{t+1}| \ell_t, \Delta_t,m_t,\ell_{t}^s)=\left\{ \begin{array}{ll} \delta(z_{t+1}|z_t,X_t)P^1_{t+1}(\ell^s_{t+1}|\ell_{t}^s,\Delta_{t+1}^s,m_{t+1}^s, \ell_{t}) & \text{if } m_t=1 \\[3pt] \delta(z_{t+1}|z_t,X_t)\psi^2(m_{t+1},\ell_{t+1}^s|X_t,\ell_t) & \text{if } m_t=2\text{ .}\label{eqn:lambda2} \end{array} \right. \end{eqnarray} (13) The function $$\delta(\cdot)$$ gives the probability that a person has a given legal status in the next period. For primary movers, there is uncertainty over where the secondary mover will live in the current period. This is represented by the function $$P^2_t(\cdot)$$, which comes from the secondary mover’s choice probabilities defined in equation (6). Likewise, for secondary movers, there is uncertainty over the primary mover’s location in the next period. This is represented by the function $$P^1_{t+1}(\cdot)$$, which comes from the primary mover’s choice probabilities defined in equation (11). Single people could become married in future periods, and the probability of this happening is written as $$\psi^k(\cdot)$$, with $$k=1,2$$ for primary and secondary movers, respectively. If he gets married, there is a probability his new spouse lives in each location. If he does not get married, then he continues to make decisions as a single person.12 4. Data I estimate the model using data from the MMP, a joint project of Princeton University and the University of Guadalajara.13 The MMP is a repeated cross-sectional data set that started in 1982, and is still ongoing. The project aims to understand the decisions and outcomes relating to immigration for Mexican individuals. To my knowledge, this is the most detailed source of information on immigration decisions between the U.S. and Mexico, most importantly on illegal immigrants, which are underrepresented in most U.S.-based surveys. The survey asks questions on when and where people lived in the U.S., how they got across the border, and what the wage outcomes in the U.S. were, which is the set of information necessary to estimate the model detailed in the previous section. For household heads and spouses, the MMP collects a lifetime migration history, asking people which country and state they lived in each year. This information is used to construct a panel data set that contains each person’s location at each point in time. I also know if and when each person is allowed to move to the U.S. legally. For people who move to the U.S. illegally, the MMP records when and where they cross the border. The MMP also collects information on the remaining members of the household. The inclusion of these respondents allows me to cover a wider age range than if I were to just use the household head and spouse data. Although the MMP does not ask for the lifetime migration histories for this group, it asks many questions related to migration. The survey asks for the migrants’ wages, location, and legal status for their first and last trip to the U.S., as well as their total number of U.S. trips. For people who have moved to the U.S. two or fewer times, I know their full history of U.S. migration, although when they are in Mexico I may not know their precise location. For people who have moved more than two times, there are gaps in the sample for years when a migration is not reported. I will have to integrate over the missing information to compensate for the lack of full histories for each person.14 In addition, in this group, I do not know these people’s marital status at each point in time, and they are also not matched to a spouse in the data, so I cannot include the marriage interactions component of the model for this group. I call this sample the “partial history” sample, whereas I call the group of household heads and spouses the “full history” sample. One question in this article is how changes in border enforcement affect immigration decisions. Border patrol was fairly low and constant up to the 1986 IRCA. Because the data have lifetime histories, the sample spans many years. Computing the value function for each year is costly, so I limit the sample time frame to years in which there are changes in enforcement levels. For this reason, I study behaviour starting in 1980. To avoid an initial condition problem, I only include individuals who were aged 17 in 1980 or after. This leaves me with a sample size of 6,457 for the full history sample, where I observe each person’s location from age 17 until the year surveyed.15 The partial history sample is larger, consisting of 41,069 individuals. One downside of the data is that the MMP sample is not representative of Mexico, as the surveyed communities are mostly those in rural areas with high migration propensities. Western-central Mexico, the region with the highest migration rates historically, is oversampled.16 Over time, the MMP sampling frame has shifted to other areas in Mexico, thus covering areas with lower migration rates. Because the MMP collects retrospective data, I have information on migration decisions in earlier years in these communities that are surveyed later, mitigating this problem somewhat. Another restriction of the data is that the sample misses permanent migrants, because the survey is administered in Mexico.17 Therefore, the results of this article apply to this specific section of the Mexican population. In Online Appendix A, I compare the MMP sample to the Current Population Study (CPS) (restricting the sample to Mexicans living in the U.S.) and to Mexican census data, to get an understanding of the limitations of the data. Table A1 in Online Appendix A shows that the MMP sample has substantially more men than the CPS, which is unsurprising due to the prevalence of temporary migrants in the MMP. The CPS sample also has higher levels of education. Table A2 compares the MMP sample to the Mexican census data. The MMP sample is younger, most likely because of my sample selection criteria explained in the previous paragraph. The MMP sample also has higher education levels. Unlike other data sources, the MMP has wage data when people are in the U.S. illegally, allowing me to estimate the wage distribution for illegal immigrants living in the U.S. In comparison, other datasets report country of birth but not legal status, and I expect that datasets such as the CPS will be biased towards legal immigrants, since illegal immigrants are likely to avoid government surveys. Because legal immigration is relatively rare in the MMP data, I combine MMP wages with CPS data on Mexicans living in the U.S. to get a larger sample size to study the legal wage distribution. The MMP also records wages in Mexico; however, there are limited wage observations per person and the data give imprecise estimates. Therefore, for Mexican wages, I use data from Mexican labour force surveys: the Encuesta Nacional de Ingresos y Gastos de los Hogares (ENIGH) in 1989, 1992, and 1994 , and the Encuesta Nacional de Empleo (ENE) from 1995 to 2004. To measure border enforcement, I use data from U.S. Customs and Border Protection (CBP) on the number of person-hours spent patrolling each sector of the border.18 CBP divides the U.S.–Mexico border into nine regions, and the data report the person-hours spent patrolling each sector. 5. Descriptive Statistics Tables 1 and 2 show the characteristics of the sample, divided into five groups: people who move internally, people who move to the U.S., people who move internally and to the U.S., non-migrants, and people who can immigrate legally. Table 1 shows this information for the full history sample, and Table 2 for the partial history sample. For the partial history sample, there is no information on internal movers, since the MMP has insufficient information to isolate this group. These tables show that most U.S. migrants are male. Each row shows the percentage of a group ($$i.e.$$ internal movers) with a given level of education. People who move to the U.S. have the least education. The literature finds that returns to education are higher in Mexico than in the U.S., possibly explaining why educated people are less likely to immigrate. In addition, illegal immigrants do not have access to the full U.S. labour market, and therefore may not be able to find jobs that require higher levels of education. People who can immigrate legally make up close to 3% of the full history sample and about 2.4% of the partial history sample. Table 1. Characteristics of full history sample Internal movers (%) Moves to U.S. (%) Moves internally and to the U.S. (%) Non-migrant (%) Legal immigrant (%) Whole sample (%) Percent male 60.53 91.51 89.51 50.82 90.63 60.66 Percent married 67.59 81.01 78.40 75.74 92.19 76.24 Average age 29.95 30.13 30.74 29.73 30.86 29.88 Years of education 0–4 16.07 18.03 14.81 17.72 11.46 17.33 5–8 39.47 43.61 43.83 40.48 53.13 41.33 9–11 28.67 30.34 26.54 30.83 22.92 30.17 12 9.42 5.92 8.64 7.66 8.85 7.64 13$$+$$ 6.37 2.10 6.27 3.30 3.65 3.53 Observations 722 1,048 162 4,333 192 6,457 Internal movers (%) Moves to U.S. (%) Moves internally and to the U.S. (%) Non-migrant (%) Legal immigrant (%) Whole sample (%) Percent male 60.53 91.51 89.51 50.82 90.63 60.66 Percent married 67.59 81.01 78.40 75.74 92.19 76.24 Average age 29.95 30.13 30.74 29.73 30.86 29.88 Years of education 0–4 16.07 18.03 14.81 17.72 11.46 17.33 5–8 39.47 43.61 43.83 40.48 53.13 41.33 9–11 28.67 30.34 26.54 30.83 22.92 30.17 12 9.42 5.92 8.64 7.66 8.85 7.64 13$$+$$ 6.37 2.10 6.27 3.30 3.65 3.53 Observations 722 1,048 162 4,333 192 6,457 Notes: Calculated using data from the full history sample in the MMP. For education, the table gives the percentage of each group ($$i.e.$$ internal movers) that has a given level of education. Table 1. Characteristics of full history sample Internal movers (%) Moves to U.S. (%) Moves internally and to the U.S. (%) Non-migrant (%) Legal immigrant (%) Whole sample (%) Percent male 60.53 91.51 89.51 50.82 90.63 60.66 Percent married 67.59 81.01 78.40 75.74 92.19 76.24 Average age 29.95 30.13 30.74 29.73 30.86 29.88 Years of education 0–4 16.07 18.03 14.81 17.72 11.46 17.33 5–8 39.47 43.61 43.83 40.48 53.13 41.33 9–11 28.67 30.34 26.54 30.83 22.92 30.17 12 9.42 5.92 8.64 7.66 8.85 7.64 13$$+$$ 6.37 2.10 6.27 3.30 3.65 3.53 Observations 722 1,048 162 4,333 192 6,457 Internal movers (%) Moves to U.S. (%) Moves internally and to the U.S. (%) Non-migrant (%) Legal immigrant (%) Whole sample (%) Percent male 60.53 91.51 89.51 50.82 90.63 60.66 Percent married 67.59 81.01 78.40 75.74 92.19 76.24 Average age 29.95 30.13 30.74 29.73 30.86 29.88 Years of education 0–4 16.07 18.03 14.81 17.72 11.46 17.33 5–8 39.47 43.61 43.83 40.48 53.13 41.33 9–11 28.67 30.34 26.54 30.83 22.92 30.17 12 9.42 5.92 8.64 7.66 8.85 7.64 13$$+$$ 6.37 2.10 6.27 3.30 3.65 3.53 Observations 722 1,048 162 4,333 192 6,457 Notes: Calculated using data from the full history sample in the MMP. For education, the table gives the percentage of each group ($$i.e.$$ internal movers) that has a given level of education. Table 2. Characteristics of partial history sample Moves to U.S. (%) Non-migrant (%) Legal immigrant (%) Whole sample (%) Percent male 71.85 43.80 65.79 48.94 Percent married 58.72 53.40 70.32 54.68 Average age 26.02 24.92 28.21 25.18 0–4 years education 8.96 9.59 6.64 9.42 5–8 years education 40.05 29.99 36.42 31.80 9–11 years education 34.07 31.48 32.90 31.94 12 years education 11.84 14.39 15.29 13.99 13$$+$$ years education 5.09 14.55 8.75 12.85 Observations 6,742 33,333 994 41,069 Moves to U.S. (%) Non-migrant (%) Legal immigrant (%) Whole sample (%) Percent male 71.85 43.80 65.79 48.94 Percent married 58.72 53.40 70.32 54.68 Average age 26.02 24.92 28.21 25.18 0–4 years education 8.96 9.59 6.64 9.42 5–8 years education 40.05 29.99 36.42 31.80 9–11 years education 34.07 31.48 32.90 31.94 12 years education 11.84 14.39 15.29 13.99 13$$+$$ years education 5.09 14.55 8.75 12.85 Observations 6,742 33,333 994 41,069 Notes: Calculated using data from the partial history sample in the MMP. For education, the table gives the percentage of each group ($$i.e.$$ people that move to the U.S.) that has a given level of education. Table 2. Characteristics of partial history sample Moves to U.S. (%) Non-migrant (%) Legal immigrant (%) Whole sample (%) Percent male 71.85 43.80 65.79 48.94 Percent married 58.72 53.40 70.32 54.68 Average age 26.02 24.92 28.21 25.18 0–4 years education 8.96 9.59 6.64 9.42 5–8 years education 40.05 29.99 36.42 31.80 9–11 years education 34.07 31.48 32.90 31.94 12 years education 11.84 14.39 15.29 13.99 13$$+$$ years education 5.09 14.55 8.75 12.85 Observations 6,742 33,333 994 41,069 Moves to U.S. (%) Non-migrant (%) Legal immigrant (%) Whole sample (%) Percent male 71.85 43.80 65.79 48.94 Percent married 58.72 53.40 70.32 54.68 Average age 26.02 24.92 28.21 25.18 0–4 years education 8.96 9.59 6.64 9.42 5–8 years education 40.05 29.99 36.42 31.80 9–11 years education 34.07 31.48 32.90 31.94 12 years education 11.84 14.39 15.29 13.99 13$$+$$ years education 5.09 14.55 8.75 12.85 Observations 6,742 33,333 994 41,069 Notes: Calculated using data from the partial history sample in the MMP. For education, the table gives the percentage of each group ($$i.e.$$ people that move to the U.S.) that has a given level of education. 5.1. Migration decisions Between 1980 and 2004, an average of 2.5% of the people in the sample living in Mexico moved to the U.S. in each year. Table 3 looks at the effects of family interactions on migration rates.19 The migration behaviour of married men is very similar to that of single men. However, there are stark differences in the migration decisions of married and single women. I compare married women, whose husband is in the U.S. to single women, and show that these married women have substantially higher migration rates.20 This suggests that husband’s decisions have an important effect on female migration decisions. Table 3. Family and migration rates Married men (%) Single men (%) Married women (spouse in U.S.) (%) Single women (%) 0–4 years education 3.44 4.10 1.74 0.81 5–8 years education 4.92 4.55 3.27 1.43 9–11 years education 3.82 3.26 3.45 1.30 12 years education 2.36 2.60 6.25 1.21 13$$+$$ years education 1.14 1.00 10.00 0.58 Total 4.04 3.74% 3.22% 1.17% Married men (%) Single men (%) Married women (spouse in U.S.) (%) Single women (%) 0–4 years education 3.44 4.10 1.74 0.81 5–8 years education 4.92 4.55 3.27 1.43 9–11 years education 3.82 3.26 3.45 1.30 12 years education 2.36 2.60 6.25 1.21 13$$+$$ years education 1.14 1.00 10.00 0.58 Total 4.04 3.74% 3.22% 1.17% Notes: This table calculates average annual Mexico to U.S. migration rates in the full history sample. For married women, I only include those whose husband is living in the U.S. Table 3. Family and migration rates Married men (%) Single men (%) Married women (spouse in U.S.) (%) Single women (%) 0–4 years education 3.44 4.10 1.74 0.81 5–8 years education 4.92 4.55 3.27 1.43 9–11 years education 3.82 3.26 3.45 1.30 12 years education 2.36 2.60 6.25 1.21 13$$+$$ years education 1.14 1.00 10.00 0.58 Total 4.04 3.74% 3.22% 1.17% Married men (%) Single men (%) Married women (spouse in U.S.) (%) Single women (%) 0–4 years education 3.44 4.10 1.74 0.81 5–8 years education 4.92 4.55 3.27 1.43 9–11 years education 3.82 3.26 3.45 1.30 12 years education 2.36 2.60 6.25 1.21 13$$+$$ years education 1.14 1.00 10.00 0.58 Total 4.04 3.74% 3.22% 1.17% Notes: This table calculates average annual Mexico to U.S. migration rates in the full history sample. For married women, I only include those whose husband is living in the U.S. To further analyse the determinants of migration decisions, I estimate the probability that a person who lives in Mexico moves to the U.S. in a given year using probit regressions. The marginal effects are reported in Table 4. The first two columns include both genders, and the third and fourth columns allow for separate effects for men and women, respectively.21 In all regressions but column (4), the effect of age on migration is negative and statistically significant, supporting the human capital model, which predicts that younger people are more likely to move because they have more time to earn higher wages. Using family members as a measure of networks, I find that having a family member in the U.S. makes a person more likely to immigrate. Legal immigrants are more likely to move, as are people who have moved to the U.S. before. Columns (2)–(4) include controls for marital status. Column (2), which includes both men and women, indicates that single men, married men, and married women are more likely to move than single women. Column (3) only includes men, and shows no difference between married and single men. Column (4), which only includes women, again shows that married women whose spouse is in the U.S. are more likely to immigrate than single women. Since married women only move to the U.S. when their husband is in the U.S., it is important to include these sorts of interactions in a model.22 Table 4. Migration probit regression Dependent variable = 1 if moves to the U.S. Whole sample Full history sample Men Women (1) (2) (3) (4) 5–8 years education 0.00680*** 0.00302 0.00242 0.00674** (0.000959) (0.00194) (0.00260) (0.00256) 9–11 years education 0.00511*** –0.000856 –0.00347 0.00713** (0.00102) (0.00217) (0.00289) (0.00259) 12 years education –0.00130 –0.00393 –0.00754 0.00714* (0.00120) (0.00326) (0.00440) (0.00326) 13$$+$$ years education –0.0191*** –0.0144** –0.0203*** 0.00194 (0.00147) (0.00462) (0.00604) (0.00538) Age –0.00365*** –0.00266* –0.00319* –0.0000882 (0.000521) (0.00120) (0.00160) (0.00122) Age squared 0.0000408*** 0.0000164 0.0000193 –0.0000144 (0.0000105) (0.0000231) (0.0000307) (0.0000248) Family in U.S. 0.0104*** 0.0161*** 0.0206*** 0.00427** (0.000722) (0.00149) (0.00201) (0.00140) Legal immigrant 0.0771*** 0.0503*** 0.0627*** 0.0185*** (0.00356) (0.00703) (0.00979) (0.00393) Has moved to U.S. before 0.0465*** 0.0476*** 0.0599*** 0.0141*** (0.00144) (0.00245) (0.00321) (0.00288) Single man 0.0471*** (0.00306) Married man 0.0466*** –0.00119 (0.00317) (0.00219) Married woman 0.0366*** 0.0119*** (0.00480) (0.00179) State fixed effects Yes Yes Yes Yes Time fixed effects Yes Yes Yes Yes Observations 421,638 69,344 50,610 16,288 Dependent variable = 1 if moves to the U.S. Whole sample Full history sample Men Women (1) (2) (3) (4) 5–8 years education 0.00680*** 0.00302 0.00242 0.00674** (0.000959) (0.00194) (0.00260) (0.00256) 9–11 years education 0.00511*** –0.000856 –0.00347 0.00713** (0.00102) (0.00217) (0.00289) (0.00259) 12 years education –0.00130 –0.00393 –0.00754 0.00714* (0.00120) (0.00326) (0.00440) (0.00326) 13$$+$$ years education –0.0191*** –0.0144** –0.0203*** 0.00194 (0.00147) (0.00462) (0.00604) (0.00538) Age –0.00365*** –0.00266* –0.00319* –0.0000882 (0.000521) (0.00120) (0.00160) (0.00122) Age squared 0.0000408*** 0.0000164 0.0000193 –0.0000144 (0.0000105) (0.0000231) (0.0000307) (0.0000248) Family in U.S. 0.0104*** 0.0161*** 0.0206*** 0.00427** (0.000722) (0.00149) (0.00201) (0.00140) Legal immigrant 0.0771*** 0.0503*** 0.0627*** 0.0185*** (0.00356) (0.00703) (0.00979) (0.00393) Has moved to U.S. before 0.0465*** 0.0476*** 0.0599*** 0.0141*** (0.00144) (0.00245) (0.00321) (0.00288) Single man 0.0471*** (0.00306) Married man 0.0466*** –0.00119 (0.00317) (0.00219) Married woman 0.0366*** 0.0119*** (0.00480) (0.00179) State fixed effects Yes Yes Yes Yes Time fixed effects Yes Yes Yes Yes Observations 421,638 69,344 50,610 16,288 Notes: Standard errors, clustered at the household level, in parentheses. $$^{*}p<0.05$$, $$^{**}p<0.01$$, $$^{***}p<0.001$$. Table is reporting marginal effects from a probit regression. The sample includes individuals who were living in Mexico at the start of the period. Column (1) uses the whole sample, and columns (2)–(4) only include the full history sample. For education, the excluded group is people with four or fewer years of education. Married women whose spouse is in Mexico are not included in the regression. Table 4. Migration probit regression Dependent variable = 1 if moves to the U.S. Whole sample Full history sample Men Women (1) (2) (3) (4) 5–8 years education 0.00680*** 0.00302 0.00242 0.00674** (0.000959) (0.00194) (0.00260) (0.00256) 9–11 years education 0.00511*** –0.000856 –0.00347 0.00713** (0.00102) (0.00217) (0.00289) (0.00259) 12 years education –0.00130 –0.00393 –0.00754 0.00714* (0.00120) (0.00326) (0.00440) (0.00326) 13$$+$$ years education –0.0191*** –0.0144** –0.0203*** 0.00194 (0.00147) (0.00462) (0.00604) (0.00538) Age –0.00365*** –0.00266* –0.00319* –0.0000882 (0.000521) (0.00120) (0.00160) (0.00122) Age squared 0.0000408*** 0.0000164 0.0000193 –0.0000144 (0.0000105) (0.0000231) (0.0000307) (0.0000248) Family in U.S. 0.0104*** 0.0161*** 0.0206*** 0.00427** (0.000722) (0.00149) (0.00201) (0.00140) Legal immigrant 0.0771*** 0.0503*** 0.0627*** 0.0185*** (0.00356) (0.00703) (0.00979) (0.00393) Has moved to U.S. before 0.0465*** 0.0476*** 0.0599*** 0.0141*** (0.00144) (0.00245) (0.00321) (0.00288) Single man 0.0471*** (0.00306) Married man 0.0466*** –0.00119 (0.00317) (0.00219) Married woman 0.0366*** 0.0119*** (0.00480) (0.00179) State fixed effects Yes Yes Yes Yes Time fixed effects Yes Yes Yes Yes Observations 421,638 69,344 50,610 16,288 Dependent variable = 1 if moves to the U.S. Whole sample Full history sample Men Women (1) (2) (3) (4) 5–8 years education 0.00680*** 0.00302 0.00242 0.00674** (0.000959) (0.00194) (0.00260) (0.00256) 9–11 years education 0.00511*** –0.000856 –0.00347 0.00713** (0.00102) (0.00217) (0.00289) (0.00259) 12 years education –0.00130 –0.00393 –0.00754 0.00714* (0.00120) (0.00326) (0.00440) (0.00326) 13$$+$$ years education –0.0191*** –0.0144** –0.0203*** 0.00194 (0.00147) (0.00462) (0.00604) (0.00538) Age –0.00365*** –0.00266* –0.00319* –0.0000882 (0.000521) (0.00120) (0.00160) (0.00122) Age squared 0.0000408*** 0.0000164 0.0000193 –0.0000144 (0.0000105) (0.0000231) (0.0000307) (0.0000248) Family in U.S. 0.0104*** 0.0161*** 0.0206*** 0.00427** (0.000722) (0.00149) (0.00201) (0.00140) Legal immigrant 0.0771*** 0.0503*** 0.0627*** 0.0185*** (0.00356) (0.00703) (0.00979) (0.00393) Has moved to U.S. before 0.0465*** 0.0476*** 0.0599*** 0.0141*** (0.00144) (0.00245) (0.00321) (0.00288) Single man 0.0471*** (0.00306) Married man 0.0466*** –0.00119 (0.00317) (0.00219) Married woman 0.0366*** 0.0119*** (0.00480) (0.00179) State fixed effects Yes Yes Yes Yes Time fixed effects Yes Yes Yes Yes Observations 421,638 69,344 50,610 16,288 Notes: Standard errors, clustered at the household level, in parentheses. $$^{*}p<0.05$$, $$^{**}p<0.01$$, $$^{***}p<0.001$$. Table is reporting marginal effects from a probit regression. The sample includes individuals who were living in Mexico at the start of the period. Column (1) uses the whole sample, and columns (2)–(4) only include the full history sample. For education, the excluded group is people with four or fewer years of education. Married women whose spouse is in Mexico are not included in the regression. The data on return migration rates show that 9% of all migrants living in the U.S. move to Mexico each year. Raw statistics show that men have higher return migration rates than women. Suspecting that return migration rates for married men are affected by the location of their wives, in Table 5, looking at only men in the full history sample, I split the sample by marital status and wife’s location. Married men whose wife is in Mexico are much more likely to return home, whereas those whose wife is living in the U.S. have a much lower return migration rate. Table 5. Family and male return migration rates Wife in Mexico (%) Wife in U.S. (%) Single (%) 0–4 years education 40.55 15.38 33.39 5–8 years education 33.59 22.22 31.70 9–11 years education 39.83 16.22 29.43 12 years education 48.84 9.09 26.19 13$$+$$ years education 29.41 0.00 35.09 Total 36.61 17.88 30.96 Wife in Mexico (%) Wife in U.S. (%) Single (%) 0–4 years education 40.55 15.38 33.39 5–8 years education 33.59 22.22 31.70 9–11 years education 39.83 16.22 29.43 12 years education 48.84 9.09 26.19 13$$+$$ years education 29.41 0.00 35.09 Total 36.61 17.88 30.96 Notes: This table reports the average annual return migration rates, using the the full history sample. Table 5. Family and male return migration rates Wife in Mexico (%) Wife in U.S. (%) Single (%) 0–4 years education 40.55 15.38 33.39 5–8 years education 33.59 22.22 31.70 9–11 years education 39.83 16.22 29.43 12 years education 48.84 9.09 26.19 13$$+$$ years education 29.41 0.00 35.09 Total 36.61 17.88 30.96 Wife in Mexico (%) Wife in U.S. (%) Single (%) 0–4 years education 40.55 15.38 33.39 5–8 years education 33.59 22.22 31.70 9–11 years education 39.83 16.22 29.43 12 years education 48.84 9.09 26.19 13$$+$$ years education 29.41 0.00 35.09 Total 36.61 17.88 30.96 Notes: This table reports the average annual return migration rates, using the the full history sample. Using a probit regression, I estimate the probability that a person currently living in the U.S. returns to Mexico in a given year. The marginal effects are shown in Table 6. Columns (1) and (2) use data for both genders, and columns (3) and (4) use data for men and women, respectively.23 All specifications except for column (4) show that that legal immigrants are less likely to return home. Columns (2)–(4) control for marital status, and additionally split the sample for married men based on whether their spouse is living in Mexico or the U.S. Married men with a wife in Mexico are more likely to return migrate than single men, whereas married men whose wife is in the U.S. are less likely to return migrate than single men. This suggests that moving home to be with one’s spouse is a strong incentive for return migration. Table 6. Return migration probit regression Dependent variable$$=$$1 if moves from U.S. to Mexico Whole sample Full history sample Men Women (1) (2) (3) (4) 5–8 years education –0.0263*** 0.0109 0.00962 0.120 (0.00699) (0.0279) (0.0288) (0.0983) 9–11 years education –0.0320*** –0.00796 –0.00648 0.113 (0.00734) (0.0308) (0.0321) (0.101) 12 years education –0.0429*** –0.0125 –0.00367 0.00428 (0.00898) (0.0434) (0.0473) (0.116) 13$$+$$ years education –0.0194 0.0134 0.0515 –0.242 (0.0109) (0.0542) (0.0608) (0.150) Age –0.00495 0.0181 0.0218 0.0410 (0.00321) (0.0133) (0.0138) (0.0455) Age squared 0.000104 –0.000237 –0.000293 –0.000844 (0.0000617) (0.000248) (0.000257) (0.000901) Family in U.S. 0.0313*** –0.0304 –0.0349 0.0480 (0.00482) (0.0208) (0.0218) (0.0519) Legal immigrant –0.0725*** –0.284*** –0.295*** –0.0167 (0.00794) (0.0299) (0.0311) (0.0838) Single man 0.0794* (0.0395) Married man, wife in U.S. –0.0709 –0.149** (0.0631) (0.0530) Married man, wife in Mexico 0.121** 0.0442* (0.0430) (0.0223) Married woman 0.0590 0.0711 (0.0587) (0.0552) State fixed effects Yes Yes Yes Yes Time fixed effects Yes Yes Yes Yes Observations 40,268 5,624 5,185 425 Dependent variable$$=$$1 if moves from U.S. to Mexico Whole sample Full history sample Men Women (1) (2) (3) (4) 5–8 years education –0.0263*** 0.0109 0.00962 0.120 (0.00699) (0.0279) (0.0288) (0.0983) 9–11 years education –0.0320*** –0.00796 –0.00648 0.113 (0.00734) (0.0308) (0.0321) (0.101) 12 years education –0.0429*** –0.0125 –0.00367 0.00428 (0.00898) (0.0434) (0.0473) (0.116) 13$$+$$ years education –0.0194 0.0134 0.0515 –0.242 (0.0109) (0.0542) (0.0608) (0.150) Age –0.00495 0.0181 0.0218 0.0410 (0.00321) (0.0133) (0.0138) (0.0455) Age squared 0.000104 –0.000237 –0.000293 –0.000844 (0.0000617) (0.000248) (0.000257) (0.000901) Family in U.S. 0.0313*** –0.0304 –0.0349 0.0480 (0.00482) (0.0208) (0.0218) (0.0519) Legal immigrant –0.0725*** –0.284*** –0.295*** –0.0167 (0.00794) (0.0299) (0.0311) (0.0838) Single man 0.0794* (0.0395) Married man, wife in U.S. –0.0709 –0.149** (0.0631) (0.0530) Married man, wife in Mexico 0.121** 0.0442* (0.0430) (0.0223) Married woman 0.0590 0.0711 (0.0587) (0.0552) State fixed effects Yes Yes Yes Yes Time fixed effects Yes Yes Yes Yes Observations 40,268 5,624 5,185 425 Notes: Standard errors, clustered at the household level, in parentheses. $$^{*}p<0.05$$, $$^{**}p<0.01$$, $$^{***}p<0.001$$. Table is reporting marginal effects from a probit regression. The sample includes individuals who were living in the U.S. at the start of the period. Column 1 uses the whole sample, and columns (2)–(4) only use the full history sample. The excluded group for education is people with four or fewer years of education. Table 6. Return migration probit regression Dependent variable$$=$$1 if moves from U.S. to Mexico Whole sample Full history sample Men Women (1) (2) (3) (4) 5–8 years education –0.0263*** 0.0109 0.00962 0.120 (0.00699) (0.0279) (0.0288) (0.0983) 9–11 years education –0.0320*** –0.00796 –0.00648 0.113 (0.00734) (0.0308) (0.0321) (0.101) 12 years education –0.0429*** –0.0125 –0.00367 0.00428 (0.00898) (0.0434) (0.0473) (0.116) 13$$+$$ years education –0.0194 0.0134 0.0515 –0.242 (0.0109) (0.0542) (0.0608) (0.150) Age –0.00495 0.0181 0.0218 0.0410 (0.00321) (0.0133) (0.0138) (0.0455) Age squared 0.000104 –0.000237 –0.000293 –0.000844 (0.0000617) (0.000248) (0.000257) (0.000901) Family in U.S. 0.0313*** –0.0304 –0.0349 0.0480 (0.00482) (0.0208) (0.0218) (0.0519) Legal immigrant –0.0725*** –0.284*** –0.295*** –0.0167 (0.00794) (0.0299) (0.0311) (0.0838) Single man 0.0794* (0.0395) Married man, wife in U.S. –0.0709 –0.149** (0.0631) (0.0530) Married man, wife in Mexico 0.121** 0.0442* (0.0430) (0.0223) Married woman 0.0590 0.0711 (0.0587) (0.0552) State fixed effects Yes Yes Yes Yes Time fixed effects Yes Yes Yes Yes Observations 40,268 5,624 5,185 425 Dependent variable$$=$$1 if moves from U.S. to Mexico Whole sample Full history sample Men Women (1) (2) (3) (4) 5–8 years education –0.0263*** 0.0109 0.00962 0.120 (0.00699) (0.0279) (0.0288) (0.0983) 9–11 years education –0.0320*** –0.00796 –0.00648 0.113 (0.00734) (0.0308) (0.0321) (0.101) 12 years education –0.0429*** –0.0125 –0.00367 0.00428 (0.00898) (0.0434) (0.0473) (0.116) 13$$+$$ years education –0.0194 0.0134 0.0515 –0.242 (0.0109) (0.0542) (0.0608) (0.150) Age –0.00495 0.0181 0.0218 0.0410 (0.00321) (0.0133) (0.0138) (0.0455) Age squared 0.000104 –0.000237 –0.000293 –0.000844 (0.0000617) (0.000248) (0.000257) (0.000901) Family in U.S. 0.0313*** –0.0304 –0.0349 0.0480 (0.00482) (0.0208) (0.0218) (0.0519) Legal immigrant –0.0725*** –0.284*** –0.295*** –0.0167 (0.00794) (0.0299) (0.0311) (0.0838) Single man 0.0794* (0.0395) Married man, wife in U.S. –0.0709 –0.149** (0.0631) (0.0530) Married man, wife in Mexico 0.121** 0.0442* (0.0430) (0.0223) Married woman 0.0590 0.0711 (0.0587) (0.0552) State fixed effects Yes Yes Yes Yes Time fixed effects Yes Yes Yes Yes Observations 40,268 5,624 5,185 425 Notes: Standard errors, clustered at the household level, in parentheses. $$^{*}p<0.05$$, $$^{**}p<0.01$$, $$^{***}p<0.001$$. Table is reporting marginal effects from a probit regression. The sample includes individuals who were living in the U.S. at the start of the period. Column 1 uses the whole sample, and columns (2)–(4) only use the full history sample. The excluded group for education is people with four or fewer years of education. One of the motivations for the dynamic model estimated in this article is that repeat migration is common. In the sample, the average number of moves to the U.S. per migrant is 1.64 for men and 1.14 for women , showing that many migrants move more than once.24 Women move less and are less likely to return migrate, implying that when women move, their decision is more likely to be permanent. The average durations illustrate this more clearly. Overall, the average migration duration is 4.4 years. It is slightly higher for legal than illegal movers (4.83 versus 4.35 years, respectively). The average duration for men is 4.15 years, and the average duration for women is 5.20 years, again indicating that when women move, their decision is more likely to be permanent. This section shows that it is crucial to allow for a relationship between spouses’ decisions. The model in this article accounts for the following trends observed in the data: (1) women are more likely to move if their husband is in the U.S., and (2) men are less likely to return migrate if their spouse is living with them in the U.S. By including both male and female decisions in the model, I can study how their interactions affect the counterfactual outcomes. A key component of the model is that individuals are choosing from a set of locations in both the U.S. and Mexico, instead of just picking between the two countries. This is an important contribution of this article, in that most of the past work on Mexico to U.S. migration does not allow for internal migration. Internal migration is fairly common, as close to 30% of the people in the full history sample moves internally, making it important to allow for people to choose from locations in both countries.25 Due to these high rates, changes in wages in Mexico, even outside of one’s home location, could affect the decision on whether or not to move to the U.S. The model accounts for this by letting people choose from a set of locations in both countries. 5.2. Border enforcement To measure border enforcement, I use data from U.S. CBP on the number of person-hours spent patrolling the border. CBP divides the U.S.–Mexico border into nine sectors, as shown in Figure 1, each of which gets a different allocation of resources each year.26Figure 2 shows the number of person-hours spent patrolling each region of the border over time.27 Relative to the levels observed today, border patrol was fairly low in the early 1980s. Enforcement was initially highest at San Diego and grew the fastest there. Enforcement also grew substantially at Tucson and the Rio Grande Valley, although the growth started later than at San Diego. In most of the other sectors, there was a small amount of growth in enforcement, mostly starting in the late 1990s. Figure 1 View largeDownload slide Border patrol sectors. Notes: Map downloaded from U.S. CBP website. Figure 1 View largeDownload slide Border patrol sectors. Notes: Map downloaded from U.S. CBP website. Figure 2 View largeDownload slide Hours patrolling the border Notes: Data on enforcement from U.S. CBP. Figure 2 View largeDownload slide Hours patrolling the border Notes: Data on enforcement from U.S. CBP. Much of the variation in Figure 2 can be explained by changes in U.S. policy. The 1986 IRCA called for increased enforcement along the U.S.–Mexico border. However, changes in enforcement were small until the early 1990s, when new policies further increased border patrol.28 Illegal immigrants surveyed in the MMP reported the closest city in Mexico to where they crossed the border. I use this information to match each individual to a border patrol sector. Figure 3 shows the percentage of illegal immigrants who cross the border at each crossing point in each year. Initially, the largest share of people crossed the border near San Diego. However, as enforcement there increased, fewer people crossed at San Diego. Before 1995, about 50% of illegal immigrants crossed the border at San Diego. This decreased to 27% post-1995. At the same time, the share of people crossing at Tucson increased. I use this variation in behaviour, combined with the changes in enforcement at each sector over time, to identify the effect of border enforcement on immigration decisions.29 Figure 3 View largeDownload slide Border crossing locations (MMP). Notes: In this figure, I use data from the MMP to calculate the share of illegal migrants that cross at each border patrol sector in each year. Figure 3 View largeDownload slide Border crossing locations (MMP). Notes: In this figure, I use data from the MMP to calculate the share of illegal migrants that cross at each border patrol sector in each year. 6. Estimation I estimate the model using maximum likelihood. I assume that a person has 28 location choices, which include 24 locations in Mexico and four in the U.S. The Mexico locations are loosely defined as states; however, some states are grouped when they border each other and have smaller sample sizes.30 The locations in the U.S. are California, Texas, Illinois, and the remainder of states that are grouped into one location choice.31 I restrict decisions so that a married woman cannot move to the U.S. unless her husband is living there. This simplifies computation, and is empirically grounded since it is very rare in the data for the wife to live in the U.S. while the husband is in Mexico. Illegal immigrants moving to the U.S. also choose where to cross the border. The U.S. government divides the border into nine regions. However, very few people in the data cross at some of these points, making identification of the fixed cost of crossing difficult. I reduce the number of crossing points to seven to avoid this problem.32 Therefore, an illegal immigrant has twenty-eight choices in the U.S., the four locations combined with the seven crossing points. I define a time period as one year, and use a one-year discount rate of 0.95. I assume that people solve the model starting at the age of 17 and work until the age of 65. There are three sources of unobserved heterogeneity in the model. The first is over moving cost type, and this is at the household level. In particular, I assume that there are two types, where one group (the stayers) has infinitely high moving costs and will never move to the U.S. The second source of unobserved heterogeneity is over wage outcomes when living in the U.S., and I assume that this is at the individual level. These values are known by the individual but unobserved by the econometrician. The data show that many women do not work, and therefore would not be affected by wage differentials. To account for this, there is a third set of unobserved heterogeneity that allows for women to be a worker or a non-worker type, where decisions of non-worker types are not affected by wages. I integrate over the probability that a woman is a worker type, which is taken from aggregate statistics on female labour force participation from the World Bank’s World Development Indicators.33 Identification of the wage parameters and the fixed cost of moving follows the arguments in Kennan and Walker’s (2011). My model also has the parameters related to illegal immigration, where identification of the border enforcement term comes from comparing the rate that people cross at each border patrol sector over time as enforcement hours are reallocated. The intuition for how these parameters are identified is discussed in Online Appendix B. 6.1. Wages I estimate three sets of wage functions: when people are in Mexico, in the U.S. illegally, and in the U.S. legally. For all three situations, wages have a deterministic and a random component, where the latter is realized each period after a person decides where to live. This means that when making migration decisions, people only consider their expected wage in each location. Wages in Mexico are estimated in a first-stage regression. The MMP data do not have sufficient information on individual wages in Mexico, so I cannot learn about how individual variations in wage draws affect migration decisions.34 Instead, I use data from Mexican labour force surveys, which have more accurate information on Mexican wages in each year, to estimate this wage distribution. Using data from the ENIGH in 1989, 1992, and 1994 and the ENE from 1995 to 2004, I estimate wage regressions in each year: \begin{eqnarray} w^{M}_{ijt} = \beta^{Mt} X_{it} + \gamma^{Mt}_j + \epsilon_{ijt}\text{ .}\label{eqn:wagesmex} \end{eqnarray} (14) In equation (14), $$X_{it}$$ are individual characteristics, $$\beta^{Mt}$$ are the returns to these characteristics when in Mexico at time $$t$$, and $$\gamma^{Mt}$$ are state fixed effects, which also vary over time. The first two columns of Table 7 show the results of the wage regression for Mexican wages in 1989 and 2004, the first and last years where I have these data. The regressions for all years are in Online Appendix A. There are strong returns to education and experience in these data, which have fluctuated significantly over the time period analysed. Note that in equation (14), there is no unobserved heterogeneity in wages, so unobserved types are independent of Mexican wages. I make this assumption due to the lack of reliable wage information in the MMP when individuals live in Mexico. Unfortunately, the lack of individual-level heterogeneity over Mexican wages is a limitation of this analysis. Table 7. Wage regressions in Mexico Dependent variable: Wage in Mexico 1989 2004 1989–2004 (1) (2) (3) Age 2.89*** 1.28*** 1.38*** (0.23) (0.01) (0.005) Age-squared –0.28*** –0.13*** –0.14*** (0.03) (0.002) (0.001) Male 0.78 0.15*** 0.18*** (0.10) (0.005) (0.002) 5–8 years education 1.12*** 0.46*** 0.72*** (0.13) (0.008) (0.02) 9–11 years education 1.74*** 0.95*** 1.36*** (0.14) (0.008) (0.01) 12 years education 2.96*** 1.26*** 2.49*** (0.15) (0.01) (0.02) 13$$+$$ years education 5.60*** 2.87*** 3.99*** (0.15) (0.009) (0.02) 0–4 years education $$\times$$ time –0.02*** (0.0004) 5–8 years education $$\times$$ time –0.02*** (0.001) 9–11 years education $$\times$$ time –0.03*** (0.001) 12 years education $$\times$$ time –0.09*** (0.002) 13+ years education $$\times$$ time –0.78*** (0.001) State fixed effects Yes Yes Yes $$R^{2}$$ 0.19 0.28 0.29 Dependent variable: Wage in Mexico 1989 2004 1989–2004 (1) (2) (3) Age 2.89*** 1.28*** 1.38*** (0.23) (0.01) (0.005) Age-squared –0.28*** –0.13*** –0.14*** (0.03) (0.002) (0.001) Male 0.78 0.15*** 0.18*** (0.10) (0.005) (0.002) 5–8 years education 1.12*** 0.46*** 0.72*** (0.13) (0.008) (0.02) 9–11 years education 1.74*** 0.95*** 1.36*** (0.14) (0.008) (0.01) 12 years education 2.96*** 1.26*** 2.49*** (0.15) (0.01) (0.02) 13$$+$$ years education 5.60*** 2.87*** 3.99*** (0.15) (0.009) (0.02) 0–4 years education $$\times$$ time –0.02*** (0.0004) 5–8 years education $$\times$$ time –0.02*** (0.001) 9–11 years education $$\times$$ time –0.03*** (0.001) 12 years education $$\times$$ time –0.09*** (0.002) 13+ years education $$\times$$ time –0.78*** (0.001) State fixed effects Yes Yes Yes $$R^{2}$$ 0.19 0.28 0.29 Notes: Standard errors in parentheses. $$^{*}p<0.05$$, $$^{**}p<0.01$$, $$^{***}p<0.001$$. Age is divided by 10. For education, the excluded group is people with less than five years of education. The dependent variable is hourly wages, in 2000 dollars using PPP exchange rates. Column (3) has data from 1989, 1992, and 1994–2004. Time is (year-1989). Quadratic and cubic terms for time also included in column (3). Table 7. Wage regressions in Mexico Dependent variable: Wage in Mexico 1989 2004 1989–2004 (1) (2) (3) Age 2.89*** 1.28*** 1.38*** (0.23) (0.01) (0.005) Age-squared –0.28*** –0.13*** –0.14*** (0.03) (0.002) (0.001) Male 0.78 0.15*** 0.18*** (0.10) (0.005) (0.002) 5–8 years education 1.12*** 0.46*** 0.72*** (0.13) (0.008) (0.02) 9–11 years education 1.74*** 0.95*** 1.36*** (0.14) (0.008) (0.01) 12 years education 2.96*** 1.26*** 2.49*** (0.15) (0.01) (0.02) 13$$+$$ years education 5.60*** 2.87*** 3.99*** (0.15) (0.009) (0.02) 0–4 years education $$\times$$ time –0.02*** (0.0004) 5–8 years education $$\times$$ time –0.02*** (0.001) 9–11 years education $$\times$$ time –0.03*** (0.001) 12 years education $$\times$$ time –0.09*** (0.002) 13+ years education $$\times$$ time –0.78*** (0.001) State fixed effects Yes Yes Yes $$R^{2}$$ 0.19 0.28 0.29 Dependent variable: Wage in Mexico 1989 2004 1989–2004 (1) (2) (3) Age 2.89*** 1.28*** 1.38*** (0.23) (0.01) (0.005) Age-squared –0.28*** –0.13*** –0.14*** (0.03) (0.002) (0.001) Male 0.78 0.15*** 0.18*** (0.10) (0.005) (0.002) 5–8 years education 1.12*** 0.46*** 0.72*** (0.13) (0.008) (0.02) 9–11 years education 1.74*** 0.95*** 1.36*** (0.14) (0.008) (0.01) 12 years education 2.96*** 1.26*** 2.49*** (0.15) (0.01) (0.02) 13$$+$$ years education 5.60*** 2.87*** 3.99*** (0.15) (0.009) (0.02) 0–4 years education $$\times$$ time –0.02*** (0.0004) 5–8 years education $$\times$$ time –0.02*** (0.001) 9–11 years education $$\times$$ time –0.03*** (0.001) 12 years education $$\times$$ time –0.09*** (0.002) 13+ years education $$\times$$ time –0.78*** (0.001) State fixed effects Yes Yes Yes $$R^{2}$$ 0.19 0.28 0.29 Notes: Standard errors in parentheses. $$^{*}p<0.05$$, $$^{**}p<0.01$$, $$^{***}p<0.001$$. Age is divided by 10. For education, the excluded group is people with less than five years of education. The dependent variable is hourly wages, in 2000 dollars using PPP exchange rates. Column (3) has data from 1989, 1992, and 1994–2004. Time is (year-1989). Quadratic and cubic terms for time also included in column (3). I use the results of year-by-year regressions to calculate an expected wage for each person in each location in Mexico and year. Because I do not have wage data for every year in the estimation, I need to compute expected wages in the missing years. To do this, I run a wage regression using all of the available data, including time trends in the returns to education, which allows for (1) changes in wage levels over time and (2) changes in the returns to education. The results of this regression are in the third column of Table 7. This allows me to calculate expected wages in Mexico in all years and states, using the year-by-year regressions when possible and the regression with all years of data when I do not have data for that year. To estimate the model, I also need to make assumptions on people’s beliefs on future wages. It is unlikely that people had perfect foresight over what would happen to Mexican wages over this period, especially due to the severe fluctuations in Mexico’s economy. To specify wage expectations, I use the results from the wage regression to impute an expected wage for each person in each location and time, denoted as $$\hat{w}^M_{ijt}$$. I assume that people expect there is some chance (denoted as $$p_{loss}$$) of a large wage drop (at rate $$\alpha$$) in each period that causes them to earn less than this expected wage.35 Then I can write each person’s wage expectations as \begin{equation} E w^M_{ijt}=\left\{ \begin{array}{ll} \hat{w}^M_{ijt} & \text{with probability } 1-p_{\rm loss}\\ (1-\alpha) \hat{w}^M_{ijt} & \text{with probability } p_{\rm loss}\text{ .} \end{array} \right. \end{equation} (15) The probability $$p_{\rm loss}$$ of this wage drop is given by the fraction of years Mexico experienced negative wage growth. The expected wage drop ($$\alpha$$) is equal to the average wage drop in these bad years. For wages in the U.S., the parameters are estimated jointly with the moving cost and utility parameters. There is a separate wage process for legal and illegal immigrants, written as \begin{eqnarray} w^{ill}_{ijt} &=& \beta^{ill} X_{it} + \gamma^{ill}_j + \kappa^{ill}_i + \epsilon^{ill}_{ijt}\label{eqn:wage_illegal}\\ \end{eqnarray} (16) \begin{eqnarray} w^{leg}_{ijt}&=&\beta^{leg}X_{it}+\gamma^{leg}_j+\kappa^{leg}_i+\epsilon^{leg}_{ijt}.\label{eqn:wage_legal} \end{eqnarray} (17) Wages depend on demographic characteristics $$X_{it}$$, which include education, gender, age, and whether or not a person has family living in the U.S.36 I include time trends to allow for changes over time, as well as location fixed effects $$\gamma_j$$.37 The match component, which is the source of unobserved heterogeneity over wages, is written as $$\kappa_i=\{\kappa_i^{ill},\kappa^{leg}_i\}$$. When estimating these terms, I assume the legal and illegal fixed effects are each drawn from a symmetric three point distribution where each value is equally likely. There is a correlation between the unobserved types of husbands and wives. Each individual knows the value of his fixed effect if he were to move to the U.S. For legal immigrants, the MMP only has a small number of observations with wage information, making it difficult to precisely estimate the wage parameters. I combine the MMP wage observations with CPS data to estimate this wage process. I use data on Mexican-born individuals in the CPS, jointly with the MMP wage observations for legal immigrants, to estimate this set of wage parameters.38 For the CPS data, I do not have information on their migration decisions, so these individuals contribute to the likelihood through just their wages. 6.2. Moving costs Here I explain the determinants of moving costs for the mover types in the model. The full parameterization of the moving cost function is explained in Online Appendix C. The cost of moving includes a fixed cost, and also depends on the distance between locations, calculated as the driving distances between the most populous cities in each state.39 The cost of moving also depends on age, which captures other effects of age on immigration that are not accounted for in the model or the wage distribution. The population size of the destination also affects moving costs, to account for the empirical fact that people are more likely to move to larger locations.40 For people moving to the U.S., I allow the moving cost to depend on education. Networks, defined as the people that an individual knows who are already living in the U.S., can affect the cost of moving to the U.S. for that person.41 Empirical evidence shows that migration rates vary across states, suggesting that people from high-migration states have larger networks. I exploit differences in state-level immigration patterns, which have been well-documented empirically, to measure a person’s network. I use the distance to the railroad as a proxy for regional network effects.42 When immigration from Mexico to the U.S. began in the early 1900s, U.S. employers used railroads to transport laborers across the border, meaning that the first migrants came from communities located near the railroad (Durand et al., 2001). These communities still have the highest immigration rates today. U.S. border enforcement affects the border crossing costs for illegal immigrants. However, there is potential endogeneity in that enforcement at each sector could be affected by the number of migrants crossing there. To account for this, I follow Bohn and Pugatch (2015) and use the enforcement levels, lagged by 2 periods, to predict future enforcement. Budget allocations for border enforcement are typically determined two years ahead of time, although extra resources can be allocated when needed due to unexpected shocks. The two-year-lagged values of border enforcement levels represent the best predictor of future enforcement needs before these shocks hit. This controls for endogeneity of enforcement and migration flows at each sector. This setup assumes perfect foresight, which is a strong assumption. I have estimated the model assuming myopic expectations and the results were similar. The cost of moving through a specific border patrol sector depends on the predicted enforcement levels there, as well as a fixed cost of crossing through that point. Some of the border crossing points consistently have low enforcement, yet few people choose to cross there. I assume that there are other reasons, constant across time, that account for this trend, such as being in a desert where it is dangerous to cross. The estimated fixed costs account for these factors. Since the model is dynamic, I need to make assumptions on people’s beliefs on future levels of border enforcement. I assume that people have perfect foresight on border enforcement.43 6.3. Transition rates The transition probabilities defined in Section 3.2.3 are over spouse locations, legal status, and marriage rates. The transitions over spouse’s location come from the choice probabilities in the model. The legal status and marriage transition rates come from the data. Using the MMP data, I estimate the probability that a person switches from illegal to legal status with a probit regression that controls for education, family networks, and gender. I assume the amnesty due to IRCA in 1986 was unanticipated. People could only be legalized under IRCA if they had lived in the U.S. continuously since 1982. Therefore, this policy would only affect immigration decisions if it was anticipated four to five years prior to implementation, making this assumption reasonable. The results of this regression, shown in column (1) of Table A7 in Online Appendix A, indicate that having family in the U.S. and being male strongly affects the probability of being granted legal status. I use the results of this regression to impute a probability that each person is granted legal status, which is used as exogenously given transition rates when estimating the model. In the model, single people know that there is some probability that they will get married in future periods. I estimate marriage rates using a probit regression. Column (2) of Table A7 in Online Appendix A shows how different factors affect the probability of becoming married. I use these results for the transition probabilities in the model estimation. 6.4. Utility function Utility depends on a person’s expected wage, which is a function of his location and characteristics. A person’s utility increases if he is living at his home location, which is defined as the state in which he was born. I allow for utility to increase if a person is in the same country as his spouse. Alternatively, I could have assumed that this depends on being in the same location as one’s spouse, but this would significantly increase computation. My methodology only requires me to track the country of his spouse instead of the exact location, and yet still captures the empirical trend that people make migration decisions to be near their spouse.44 I also allow for higher utility in the U.S. if a person also has family members living there. The full parameterization of the utility function is explained in Online Appendix C. 6.5. Likelihood function In this section, I explain the derivation of the likelihood function; the full details are explained in Online Appendix D. I estimate the model using maximum likelihood. I calculate the likelihood function at the household level, where I integrate over the probability that each household is of a specific moving cost type, the probability that each person has a specific wage fixed effect, and the probability that the woman is a worker type.45 For each household, I observe a history of location choices for the primary and secondary mover. These choices depend on moving cost type $$\tau$$, where I assume there are mover and stayer types, where the stayer types have infinitely high costs of moving to the U.S. Women can be worker or non-worker types, where utility for the non-worker types is not affected by wages. For each person, I observe wage draws when in the U.S. There is unobserved heterogeneity in the wage draws. These are individual specific terms, known by every member of the household and unobserved by the econometrician. I allow for a correlation between the unobserved types of husbands and wives. First, I explain how I calculate the likelihood function conditional on moving cost type and wage type. The migration probabilities for each period come from the choice probabilities defined in equations (11) and (6). For secondary movers, I differentiate between the choice probabilities for worker and non-worker types, where the utility for non-worker types is not affected by wages. I calculate the probability of seeing an observed history for a household when the woman is a worker type and when she is a non-worker type. I then integrate over the probability that the woman is a worker type. The previous explanation was for calculating the likelihood conditional on moving cost and wage type. To calculate the full likelihood, I have to incorporate the probability that a household has moving cost type $$\tau$$ and each individual has a given wage type. I estimate the probability that a household has moving cost type $$\tau$$. I allow for a correlation between the types of husbands and wives, by estimating the probability that a woman with a given wage-type is married to a man with a given type. This allows for assortative matching in the labour market, if the estimates reveal that a high-wage-type man is most likely to be married to a high-wage-type woman.46 7. Results Table 8 reports the utility parameter estimates.47 The results show that people prefer to live at their home location, and that men living in the same location as their spouse have higher utility. There is no statistically significant effect for women, which can be explained due to the assumptions in the model. Because women rarely move to the U.S. without their husband, I assumed that a married woman cannot live in the U.S. unless her husband is there. Without this assumption, I would get a much larger preference for living in the same location as one’s spouse for women, since women do not move to the U.S. without their husbands. In addition, people with family in the U.S. have higher utility when living in the U.S. than those who do not. There are mover and stayer types in the model; the estimation is set so that the fixed cost of moving to the U.S. is infinity for stayer types so they will never choose to make that move. I find that the probability that a household is a mover type is close to 70%. Table 8. Utility parameter estimates Wage term 0.056 (0.0022) Home bias 0.20 (0.0040) With spouse (men) 0.36 (0.053) With spouse (women) 0.032 (0.042) Family in U.S. 0.029 (0.012) Probability (mover type) 0.68 (0.022) Log-likelihood –232,643.05 Wage term 0.056 (0.0022) Home bias 0.20 (0.0040) With spouse (men) 0.36 (0.053) With spouse (women) 0.032 (0.042) Family in U.S. 0.029 (0.012) Probability (mover type) 0.68 (0.022) Log-likelihood –232,643.05 Notes: Standard errors in parentheses. Table 8. Utility parameter estimates Wage term 0.056 (0.0022) Home bias 0.20 (0.0040) With spouse (men) 0.36 (0.053) With spouse (women) 0.032 (0.042) Family in U.S. 0.029 (0.012) Probability (mover type) 0.68 (0.022) Log-likelihood –232,643.05 Wage term 0.056 (0.0022) Home bias 0.20 (0.0040) With spouse (men) 0.36 (0.053) With spouse (women) 0.032 (0.042) Family in U.S. 0.029 (0.012) Probability (mover type) 0.68 (0.022) Log-likelihood –232,643.05 Notes: Standard errors in parentheses. In a separate exercise, I estimated a simpler version of this model, taking away the utility preference for living at the same location as a person’s spouse. This leads to a significant change in the likelihood at the optimal point, where equality of the likelihood with the original and simpler model was rejected by a likelihood ratio test.48 This shows that the inclusion of this part of the model substantially improves its ability to model decisions. Table 9. Immigrant wage estimates Illegal Legal Match probabilities Age 2.63 6.17 Low–low 0.32 (1.14) (0.13) (2.17) Age-squared –0.44 –0.64 Low–medium 0.01 (0.21) (0.016) (2.01) 5–8 years education 1.23 1.49 Medium–low 0.01 (0.20) (0.16) (2.05) 9–11 years education 1.93 2.80 Medium–medium 0.28 (0.21) (0.16) (1.96) 12 years education 2.24 4.83 (0.24) (0.15) 13$$+$$ years education 2.05 6.87 (0.37) (0.16) Family in U.S. –0.51 (0.22) Male 1.31 2.76 (0.28) (0.047) Match component 2.29 0.98 (0.25) (0.60) Constant 0.96 –6.96 (1.44) (0.27) Standard deviation of wages 2.52 5.08 (0.082) (0.078) Illegal Legal Match probabilities Age 2.63 6.17 Low–low 0.32 (1.14) (0.13) (2.17) Age-squared –0.44 –0.64 Low–medium 0.01 (0.21) (0.016) (2.01) 5–8 years education 1.23 1.49 Medium–low 0.01 (0.20) (0.16) (2.05) 9–11 years education 1.93 2.80 Medium–medium 0.28 (0.21) (0.16) (1.96) 12 years education 2.24 4.83 (0.24) (0.15) 13$$+$$ years education 2.05 6.87 (0.37) (0.16) Family in U.S. –0.51 (0.22) Male 1.31 2.76 (0.28) (0.047) Match component 2.29 0.98 (0.25) (0.60) Constant 0.96 –6.96 (1.44) (0.27) Standard deviation of wages 2.52 5.08 (0.082) (0.078) Notes: Standard errors in parentheses. The excluded term is people with less than five years of education. Age is divided by 10. The match components are drawn from a three-point symmetric distribution around zero. The first component in the match probability is for the husband, and the second is for the wife. The wage equations include time trends in education and location fixed effects from the CPS. Table 9. Immigrant wage estimates Illegal Legal Match probabilities Age 2.63 6.17 Low–low 0.32 (1.14) (0.13) (2.17) Age-squared –0.44 –0.64 Low–medium 0.01 (0.21) (0.016) (2.01) 5–8 years education 1.23 1.49 Medium–low 0.01 (0.20) (0.16) (2.05) 9–11 years education 1.93 2.80 Medium–medium 0.28 (0.21) (0.16) (1.96) 12 years education 2.24 4.83 (0.24) (0.15) 13$$+$$ years education 2.05 6.87 (0.37) (0.16) Family in U.S. –0.51 (0.22) Male 1.31 2.76 (0.28) (0.047) Match component 2.29 0.98 (0.25) (0.60) Constant 0.96 –6.96 (1.44) (0.27) Standard deviation of wages 2.52 5.08 (0.082) (0.078) Illegal Legal Match probabilities Age 2.63 6.17 Low–low 0.32 (1.14) (0.13) (2.17) Age-squared –0.44 –0.64 Low–medium 0.01 (0.21) (0.016) (2.01) 5–8 years education 1.23 1.49 Medium–low 0.01 (0.20) (0.16) (2.05) 9–11 years education 1.93 2.80 Medium–medium 0.28 (0.21) (0.16) (1.96) 12 years education 2.24 4.83 (0.24) (0.15) 13$$+$$ years education 2.05 6.87 (0.37) (0.16) Family in U.S. –0.51 (0.22) Male 1.31 2.76 (0.28) (0.047) Match component 2.29 0.98 (0.25) (0.60) Constant 0.96 –6.96 (1.44) (0.27) Standard deviation of wages 2.52 5.08 (0.082) (0.078) Notes: Standard errors in parentheses. The excluded term is people with less than five years of education. Age is divided by 10. The match components are drawn from a three-point symmetric distribution around zero. The first component in the match probability is for the husband, and the second is for the wife. The wage equations include time trends in education and location fixed effects from the CPS. Table 10 shows the parameters of the immigrant wage distribution, for both legal and illegal immigrants. There are stronger returns to education for legal immigrants than for illegal immigrants, reflecting that high-skilled legal immigrants can access jobs that reward these skills.49 The age profile has the standard concave shape for legal immigrants. For illegal immigrants, wages increase slightly at young ages, but then decrease. For the age range that comprises most of the sample, the wage profile is essentially flat, since the steeper drop-off in wages does not begin until older ages. Table 10. Moving cost estimates Mexico to U.S. Return migration Internal migration Fixed cost for men 3.43 3.59 3.53 (0.47) (0.37) (0.12) Fixed cost for women 2.22 6.40 3.55 (0.45) (0.38) (0.12) Distance (legal) 0.60 –0.91 0.0000027 (0.18) (0.086) (0.046) Age 0.0047 0.062 0.13 (0.013) (0.014) (0.0050) Population size 0.0053 –0.00016 –0.014 (0.00034) (0.0013) (0.00091) Distance to railroad 0.30 (0.027) 5–8 years education –0.047 (0.089) 9–11 years education –0.21 (0.084) 12 years education 0.67 (0.12) 13$$+$$ years education 0.98 (0.18) Mexico to U.S. Return migration Internal migration Fixed cost for men 3.43 3.59 3.53 (0.47) (0.37) (0.12) Fixed cost for women 2.22 6.40 3.55 (0.45) (0.38) (0.12) Distance (legal) 0.60 –0.91 0.0000027 (0.18) (0.086) (0.046) Age 0.0047 0.062 0.13 (0.013) (0.014) (0.0050) Population size 0.0053 –0.00016 –0.014 (0.00034) (0.0013) (0.00091) Distance to railroad 0.30 (0.027) 5–8 years education –0.047 (0.089) 9–11 years education –0.21 (0.084) 12 years education 0.67 (0.12) 13$$+$$ years education 0.98 (0.18) Notes: Standard errors in parentheses. Distance measured in thousands of miles. Population divided by 100,000. Table 10. Moving cost estimates Mexico to U.S. Return migration Internal migration Fixed cost for men 3.43 3.59 3.53 (0.47) (0.37) (0.12) Fixed cost for women 2.22 6.40 3.55 (0.45) (0.38) (0.12) Distance (legal) 0.60 –0.91 0.0000027 (0.18) (0.086) (0.046) Age 0.0047 0.062 0.13 (0.013) (0.014) (0.0050) Population size 0.0053 –0.00016 –0.014 (0.00034) (0.0013) (0.00091) Distance to railroad 0.30 (0.027) 5–8 years education –0.047 (0.089) 9–11 years education –0.21 (0.084) 12 years education 0.67 (0.12) 13$$+$$ years education 0.98 (0.18) Mexico to U.S. Return migration Internal migration Fixed cost for men 3.43 3.59 3.53 (0.47) (0.37) (0.12) Fixed cost for women 2.22 6.40 3.55 (0.45) (0.38) (0.12) Distance (legal) 0.60 –0.91 0.0000027 (0.18) (0.086) (0.046) Age 0.0047 0.062 0.13 (0.013) (0.014) (0.0050) Population size 0.0053 –0.00016 –0.014 (0.00034) (0.0013) (0.00091) Distance to railroad 0.30 (0.027) 5–8 years education –0.047 (0.089) 9–11 years education –0.21 (0.084) 12 years education 0.67 (0.12) 13$$+$$ years education 0.98 (0.18) Notes: Standard errors in parentheses. Distance measured in thousands of miles. Population divided by 100,000. Table 11 shows the moving cost parameters (excluding the parts related to illegal immigration). There are three moving cost functions: Mexico to U.S. migration, return migration, and internal migration. The first component of the moving cost is the fixed cost of moving, which I allow to vary with gender. The moving cost also depends on the distance between locations. For Mexico to U.S. (legal) migration, the cost increases in distance, as expected, and I do not see a statistically significant effect of distance on internal migration decisions. For return migration, the moving cost decreases with distance. The location in Illinois has the highest return migration rates, and is the furthest from the border.50 This behaviour is most likely driving this parameter estimate. Moving costs also depend on population size, in that I would expect people to be more likely to move to larger locations.51 For internal migration, the moving cost decreases with population size, indicating that people are more likely to move to larger locations. For Mexico to U.S. migration, the effect is positive but small. Population size is perhaps not an accurate proxy in this case, since migrants may care more about the number of people from their community in a location than the total population size. Table 11. Illegal immigration parameter estimates Distance 1.23 (0.056) Enforcement 0.04 (0.0069) Fixed cost 1.17 (0.39) Crossing point fixed costs El Paso, TX –1.07 (0.26) San Diego, CA –4.01 (0.23) Laredo, TX –0.37 (0.28) Rio Grande Valley, TX 0.065 (0.30) Tucson, AZ –2.05 (0.24) El Centro, TX –2.36 (0.24) Distance 1.23 (0.056) Enforcement 0.04 (0.0069) Fixed cost 1.17 (0.39) Crossing point fixed costs El Paso, TX –1.07 (0.26) San Diego, CA –4.01 (0.23) Laredo, TX –0.37 (0.28) Rio Grande Valley, TX 0.065 (0.30) Tucson, AZ –2.05 (0.24) El Centro, TX –2.36 (0.24) Notes: Standard errors in parentheses. Enforcement measured in 10,000 person-hours. Distance measured in thousands of miles. Table 11. Illegal immigration parameter estimates Distance 1.23 (0.056) Enforcement 0.04 (0.0069) Fixed cost 1.17 (0.39) Crossing point fixed costs El Paso, TX –1.07 (0.26) San Diego, CA –4.01 (0.23) Laredo, TX –0.37 (0.28) Rio Grande Valley, TX 0.065 (0.30) Tucson, AZ –2.05 (0.24) El Centro, TX –2.36 (0.24) Distance 1.23 (0.056) Enforcement 0.04 (0.0069) Fixed cost 1.17 (0.39) Crossing point fixed costs El Paso, TX –1.07 (0.26) San Diego, CA –4.01 (0.23) Laredo, TX –0.37 (0.28) Rio Grande Valley, TX 0.065 (0.30) Tucson, AZ –2.05 (0.24) El Centro, TX –2.36 (0.24) Notes: Standard errors in parentheses. Enforcement measured in 10,000 person-hours. Distance measured in thousands of miles. Table 11 shows the parameter estimates relating to illegal immigration. Distance increases the cost of moving, where the distance is calculated as the distance from the Mexican state to the crossing point plus the distance from the crossing point to the U.S. destination. This allows for the location choices and crossing point decisions to be related. I find that moving costs increase with border enforcement. I estimate a separate fixed cost for each border crossing point. The crossing points with low levels of enforcement, but where people do not cross, have high fixed costs. For example, San Diego is where the greatest share of people cross, but it also has the highest enforcement. Therefore the estimation finds that this point has the lowest fixed costs. 7.1. Model fit To look at the model fit, I first show statistics on annual Mexico to U.S. and return migration rates, comparing the values in the data to model predictions. The first row of Table 12 shows the whole sample, and the second two rows split the sample by legal status. The model fits migration rates for illegal immigrants well, but is unable to match the high migration rates and overestimates return migration rates for legal migrants. Legal immigrants are a small part of the sample. The model allows for different moving costs and wages for legal immigrants, but since most of the other parameters are the same, the model cannot fit the data for legal immigrants well.52 The last four rows split the sample by marital status, first looking at the full history sample and then at the partial history sample. The full history sample is split into married primary movers, married secondary movers, and people who are single. The model underpredicts both the migration rates of married men and women, although it does capture that primary movers are much more likely to move than secondary movers. Table 13 splits the sample by education, looking at the same summary statistics, and again shows that the model is fitting the annual migration rates relatively well. Looking at this along another dimension, Figures 4 and 5 show the annual migration rates over time, and Figures 6 and 7 split the sample by age. The model is fitting the general trends relatively well. However, it is overestimating return migration rates in the later years. Figure 4 View largeDownload slide Model fit: mexico to U.S. migration rates by year. Notes: For each year, I calculate the average Mexico to U.S. migration rate, in the data and as predicted by the model. Figure 4 View largeDownload slide Model fit: mexico to U.S. migration rates by year. Notes: For each year, I calculate the average Mexico to U.S. migration rate, in the data and as predicted by the model. Figure 5 View largeDownload slide Model fit: return migration rates by year. Notes: For each year, I calculate the average return migration rate, in the data and as predicted by the model. Figure 5 View largeDownload slide Model fit: return migration rates by year. Notes: For each year, I calculate the average return migration rate, in the data and as predicted by the model. Figure 6 View largeDownload slide Model fit: Mexico to U.S. migration rates by age. Notes: For each age, I calculate the average Mexico to U.S. migration rate, in the data and as predicted by the model. Figure 6 View largeDownload slide Model fit: Mexico to U.S. migration rates by age. Notes: For each age, I calculate the average Mexico to U.S. migration rate, in the data and as predicted by the model. Figure 7 View largeDownload slide Model fit: return migration rates by age. Notes: For each age, I calculate the average return migration rate, in the data and as predicted by the model. Figure 7 View largeDownload slide Model fit: return migration rates by age. Notes: For each age, I calculate the average return migration rate, in the data and as predicted by the model. Table 12. Model fit: annual migration rates Mexico to U.S. migration rate Return migration rate Model (%) Data (%) Model (%) Data (%) Whole sample 2.60 2.37 10.1 8.50 Illegal immigrants 2.53 2.19 10.1 9.31 Legal immigrants 16.55 40.83 9.78 4.52 Full history sample $$\quad$$ Primary movers 1.45 3.30 22.26 29.03 $$\quad$$ Secondary movers 0.00073 0.0027 33.87 25.48 $$\quad$$ Single people 3.46 2.15 10.89 24.93 Partial history sample 2.63 2.46 9.33 5.64 Mexico to U.S. migration rate Return migration rate Model (%) Data (%) Model (%) Data (%) Whole sample 2.60 2.37 10.1 8.50 Illegal immigrants 2.53 2.19 10.1 9.31 Legal immigrants 16.55 40.83 9.78 4.52 Full history sample $$\quad$$ Primary movers 1.45 3.30 22.26 29.03 $$\quad$$ Secondary movers 0.00073 0.0027 33.87 25.48 $$\quad$$ Single people 3.46 2.15 10.89 24.93 Partial history sample 2.63 2.46 9.33 5.64 Notes: I calculate the model-predicted Mexico to U.S. and return migration rates for all individuals in the sample, and compare them to rates in the data. For Mexico to U.S. migration, I use all people in Mexico at the start of the period. For return migration, I use all people in the U.S. at the start of the period. Table 12. Model fit: annual migration rates Mexico to U.S. migration rate Return migration rate Model (%) Data (%) Model (%) Data (%) Whole sample 2.60 2.37 10.1 8.50 Illegal immigrants 2.53 2.19 10.1 9.31 Legal immigrants 16.55 40.83 9.78 4.52 Full history sample $$\quad$$ Primary movers 1.45 3.30 22.26 29.03 $$\quad$$ Secondary movers 0.00073 0.0027 33.87 25.48 $$\quad$$ Single people 3.46 2.15 10.89 24.93 Partial history sample 2.63 2.46 9.33 5.64 Mexico to U.S. migration rate Return migration rate Model (%) Data (%) Model (%) Data (%) Whole sample 2.60 2.37 10.1 8.50 Illegal immigrants 2.53 2.19 10.1 9.31 Legal immigrants 16.55 40.83 9.78 4.52 Full history sample $$\quad$$ Primary movers 1.45 3.30 22.26 29.03 $$\quad$$ Secondary movers 0.00073 0.0027 33.87 25.48 $$\quad$$ Single people 3.46 2.15 10.89 24.93 Partial history sample 2.63 2.46 9.33 5.64 Notes: I calculate the model-predicted Mexico to U.S. and return migration rates for all individuals in the sample, and compare them to rates in the data. For Mexico to U.S. migration, I use all people in Mexico at the start of the period. For return migration, I use all people in the U.S. at the start of the period. Table 13. Model fit: annual migration rates Mexico to U.S. migration rate Return migration rate Years of education Model (%) Data (%) Model (%) Data (%) 0–4 2.38 2.07 11.55 11.87 5–8 2.99 2.97 10.42 8.93 9–11 3.41 2.69 10.65 8.10 12 1.81 1.92 7.23 6.0 13$$+$$ 0.81 0.79 7.96 7.47 Mexico to U.S. migration rate Return migra