Abstract
We analyze the roles of and interrelationships among school inputs and parental inputs in affecting child development through the specification and estimation of a behavioral model of household migration and maternal employment decisions. We integrate information on these decisions with observations on child outcomes over a 13-year period from the NLSY. We find that the impact of our school quality measures diminish by factors of 2 to 4 after accounting for the fact that families may choose where to live in part based on school characteristics and labor market opportunities. The positive statistical relationship between child outcomes and maternal employment reverses sign and remains statistically significant after controlling for its possible endogeneity. Our estimates imply that when parental responses are taken into account, policy changes in school quality end up having only minor impacts on child test scores.
Keywords: Education production function, Migration, Maternal Employment, School choice
1 Introduction
Understanding the impacts of school inputs and parental behavior on children’s educational development can provide crucial information to both policy makers and parents as they make decisions about the allocation of resources to children. The literature providing estimates of educational production functions, however, provides little consensus about the magnitude or even the direction of the impact of many school and family inputs to children’s development. Two of the most recent reviews of the literature on the impact of school resources and school “quality” on educational outcomes, by Hanushek et al. (1998) and Krueger (1998), present conflicting interpretations of the literature. Similarly, the extensive literature on the impact of parental inputs on educational production as surveyed by Haveman and Wolfe (1995) reports widely varying effect estimates.
One potential reason for the ambiguity in the literature appears, at a first glance, to be due to the fact that it is impossible to collect data on all of the relevant inputs to the educational production function. Some studies have detailed measures of school inputs but almost no information about parental inputs (e.g. Hanushek, 1996, Krueger, 1999). Other studies have good information on what parents do for their children with only limited information on the environment in the schools the children attend (e.g. Harvey, 1999; Moore and Driscoll, 1997). There are also experimental impact studies where children have been randomly placed in different schooling environments (Krueger, 1999). Researchers have used all of these different types of data resources to estimate the impacts on child educational outcomes of the inputs to educational production.
A recent paper by Todd and Wolpin (2003) addresses explicitly several reasons for why studies using different data sources could give rise to different estimates of the impacts of school inputs on child outcomes. Their basic interpretation of the literature is that researchers have failed to use a coherent, common theoretical framework in the interpretation of their empirical results. Todd and Wolpin posit that the education production process uses inputs that are determined by both parents and schools and that the levels of these inputs are influenced by each child’s ability to use the inputs productively. If the data are not rich enough to include measures of both parents’ and schools’ inputs and child background characteristics, and if there are correlations among these different types of inputs, then the observed inputs will be correlated with the implicit error terms. The resulting endogeneity biases will make it difficult to compare estimates based on different data sets. Todd and Wolpin also point out that even randomly assigned, experimentally determined inputs do not resolve such problems. If parents and schools can adjust their input decisions in response to random assignments, then experimentally estimated impacts will convolute the true production impacts of the experimentally-assigned inputs with the impacts of adjustments to other inputs made by parents and schools in response to the experimentally-induced input decisions. While the resulting estimates may still measure useful policy effects, they are not estimates of educational production function parameters.
The problems of estimating an educational production function are not solely due to data limitations. At one extreme is the lack of rigor in defining what researchers would like to learn about educational production that Todd and Wolpin highlight; researchers have been unclear about what they would like to estimate. It is important to define precisely what one can estimate. Additionally, researchers have focused on a diverse set of simple parametric specifications of the production function. This can give rise to unstable estimates that can vary widely and depend crucially on the sample under consideration.
We attempt to address a wide variety of these issues. We start by specifying a formal model of parental decisions about the choice of schools and parental involvement with their children. We implement this formal model by assuming that parents choose their place of residence in part because of employment opportunities and in part because of the characteristics of the schools where they choose to reside. We distinguish between various measures of school quality to assess school inputs, while maternal employment is the main parental input considered. Parents make their place of residence decisions based on location-specific wage offer distributions and school quality measures, but before knowing the exact child outcome and wage offers they might receive. After making a place of residence decision, both parents receive wage offers. While the husband is assumed to always accept the wage, the mother chooses how much time to devote to labor market activities at that wage rate. She does this with the understanding that more hours spent in the labor market could cut back on her inputs to the education production function. Even though she knows the average effect that her work behavior would have on her child’s educational outcomes, she does not know what the child’s actual educational outcomes will be.
An important unanswered question from the literature on education production is whether parental and school inputs are substitutes or complements to each other in the process of education production. Moreover, if substitutability and complementarity are co-existing, it is not clear which one is dominant among families at different socio-economic strata. If the substitutability of two inputs is prevalent it would be expected that a mother spends less time with her children once she puts them in a school with a perceived better quality, holding everything else constant. Similarly, the mother should be more actively involved in educating her children by sacrificing leisure time or working fewer hours in the labor market if she perceives the school quality to be poor. At the extreme, parents may choose to take the sole responsibility of educating their children, reflected by the increasing popularity of home schooling (Behrman and King, 2001). If the hypothesis holds that parental inputs and school inputs are substitutes, then results from studies of the effects of teacher and school quality might be biased downward due to the lack of control for parental inputs. On the other hand, some empirical evidence seems to support the existence of complementarities between school and parental inputs in the production of education. For instance, several educational reform programs actually mandate that the parents be more involved (McMillan, 2000). If this complementarity effect dominates, impact estimates of parental inputs from regressions that do not properly control for school quality are likely to be biased upward.
In summary, this study introduces an economic model of maternal employment and location decisions and their impacts on child development, to provide a more complete understanding of the relationship between parental inputs and school inputs. We estimate the model using paired mother-child data from the National Longitudinal Study of Youth of 1979 (NLSY). Estimating the educational production function as part of this structural model, while explicitly accounting for unobserved heterogeneity, results in significantly different estimates of the production process. For example, we find that the impacts on child outcomes of a residential move and of a mother working full-time (as opposed to not working) change signs while remaining statistically significant after accounting for the possible endogeneity of these decisions. While the estimated effects of several of our school quality measures are statistically significant, they are small in magnitude. On the other hand, we find statistically significant and relatively large negative effects of residential moves and of maternal employment. All else equal, full-time employment of the mother is predicted to reduce a child’s performance on a mathematics test on average by 1.2 percentile points, while a move reduces it by 5.8 percentile points. This study also presents results from several counter-factual policy simulations to analyze: 1) the effects of exogenous changes in several school quality measures on mother’s labor supply decisions, 2) the effects of a change in mothers’ wages on labor supply, schooling choices, and child outcomes, and 3) the impacts of family and school characteristics on children’s test scores. The findings indicate that parents can and do adjust their behavior to mitigate the direct impact of policy interventions on child development. Therefore, our findings have important implications for the optimal design of educational policies as well as social programs aimed at encouraging maternal employment. For example, in assessing the consequences for child development of imposing more stringent work requirements on welfare participants or of increasing tax credits aimed at encouraging mothers to enter the labor force, it is important to not only consider the direct negative effects of associated increases in maternal employment, but also to take into account possible changes in school inputs resulting from policy-induced residential moves. To help mitigate any negative consequences for children, it may also be worthwhile for policy makers to consider simultaneous investments in school inputs, or policies that could reduce moving costs.
2 Background
The literatures on the impacts of school resources and school “quality” on educational outcomes on the one hand and on the impact of parental inputs on educational production on the other are extensive and have been reviewed in detail by others. Here we focus on some recent discussions of potential reasons for the disparate empirical findings in the literature.
One important source of variation across studies has been the measure of child outcome used. For example, a frequently cited paper by Card and Krueger (1992) argued that long-term labor market outcomes (such as earnings) are better output measures than test scores. Unlike several influential studies based on test scores, their paper reported significant school input effects. More recent evidence by Betts (1995, 1996) using NLSY data, however, suggests that the choice of outcome variable is not the reason for the difference in findings.
A second potential source of variability in estimates concerns the data source used. It has been argued that because of input endogeneity issues, as well as measurement error issues, that using student level data is likely to lead to biased estimates; grouped or more aggregated data presumably are less likely to suffer from such problems (Loeb and Bound, 1996; Hanushek et al, 1996). In addition, as pointed out by Todd and Wolpin (2003), one can expect input effect estimates based on experimental data to be very different from those based on non-experimental data, because they generally would estimate entirely different causal effects. The first would include short-term parental responses to the experimental input assignment, while those based on non-experimental data usually aim to estimate production function parameters.
Another important source of heterogeneity across studies is the identification strategy used to deal with the endogeneity of input choices in estimating their effects on educational outcomes. Input choices are likely to depend on parents’ perceptions of a child’s innate ability, which will be unobserved to the evaluator. Evaluations in which the endogeneity of school and parental inputs are ignored are likely to produce unreliable results which could vary considerably with the set of variables included in the regression. The choice of econometric method to solve the endogeneity problem is also likely to be a factor. As argued by Card (1995, 1999), who investigated the variability in estimated returns to years of schooling across studies, if input effects vary across students, schools or locations, or if these effects are nonlinear, different evaluation strategies usually identify local average treatment effects (Imbens and Angrist, 1994) which apply to different sub-populations.
While we believe that all these alternative explanations for the large variation in estimates in the literature have merit and undoubtedly will play a role, it is our view that the specification of the educational production function and the fact that many inputs are often ignored or missing have not received sufficient attention in the literature. In particular, if parental inputs are chosen jointly or in response to school inputs, then this raises serious questions about the validity of instruments that underly most current identification approaches to deal with the endogeneity of school characteristics.
Besides ample evidence of an effect of maternal employment on child outcomes (e.g., Blau and Grossberg, 1992; Parcel and Menaghan, 1994; Bernal, 2007), there exists a growing literature documenting that parents influence and respond to school quality differences. In Tennessee’s STAR class size experiment, for example, because of complaints by the parents of children who had been randomly assigned to larger classes, a teacher aide was subsequently assigned randomly to such classes (Krueger, 1998). Second, studies of housing values indicate that parents value better schools, as differences in school expenditures and average test scores have been found to be factored into cost of housing (e.g. Black, 1999). In making location choices parents have been found to respond strongly to school quality differences as well as employment opportunities, and both incentives appear to be strongly related (Blanchard and Katz, 1992; Nechyba and Strauss, 1998; Bayer, 2000; Ferreyra, 2007, Kennan and Walker, 2005). Hedges and Greenwald (1996) have suggested that the interaction between school inputs and family inputs may explain the small school input effects found in many studies. They propose that the increase in female labor force participation rates and the rising prevalence of single parent households may have offset the positive effects of increased spending and declines in class sizes, to produce no overall improvements in outcomes.
Our approach for investigating the interaction between and endogeneity of school and family inputs is to specify and estimate a structural model of school choice and maternal employment decisions, taking into account the dependence of both choices on parental preferences and financial constraints as well as the child’s innate ability. As in Bayer (2000), who estimates a static equilibrium model of household location and school choice decisions, this allows us to estimate an educational production function while controlling for the non-random sorting of households across locations and schools. In addition, this approach explicitly addresses the endogeneity of the mother’s work decision. Recently Gemici (2007) and Bishop (2007) have estimated dynamic structural migration models, though they focused on much fewer destinations that we consider here.
3 Theory and Estimation
In this section we present our theoretical model and the estimation method that we use to estimate it. The primary concern of this paper is to obtain reliable estimates of an education production function that incorporates both school and parental inputs. To do this, the paper posits a structural model where in each period parents choose a place of residence that determines the school characteristics for their children, a housing price, and a set of local labor market conditions to maximize expected utility. Possibly correlated unobserved determinants influence the educational production function, the utility function, and the wage offer functions for the parents. These heterogeneity terms, in conjunction with a set of intertemporal exclusion restrictions that follow naturally from the decision making process across time periods, allow one to control for the potential endogeneity of the inputs to the production function.
3.1 Overview of the Model
In each period a decision is made on where to live, how much to consume, and how many hours the mother will work. These decisions are made taking into account their expected effects on the distribution of the child’s cognitive achievement. At the beginning of each period t (where each period corresponds to a given child’s age), a family first makes a location decision on the basis of the expected utility of living in each location. Locations are characterized by a set of school quality indicators, local labor market conditions, moving and housing costs and their geographic location. The choice of location k ∈{1, 2, 3, ···, K} in each period will depend on the wage offer distributions in each location, the child achievement score distribution (given school quality), as well as the costs of moving. After choosing a location of residence, the father (if present) receives an earnings offer ys and the mother receives a wage offer w. While fathers are always assumed to accept their earnings offer in each period, which could be zero, the mother makes an employment decision h. Given the location and work choice, school and family inputs produce a child’s cognitive achievement score q.
The time line of this model is described as follows:

Consider a one-period model in which a family i with Ni children has preferences over consumption, xi, a vector of the children’s cognitive achievements, Qi = (qi1, …, qiNi), the mother’s hours devoted to non-market activities, li, and the geographic location of residence, ki. li is defined to include maternal time directly devoted to child education, other household production activities, and leisure. Explicitly, it satisfies
| (1) |
where T is total time available to the mother, and hi represents mother’s hours spent working in the market. It is therefore equivalent to state preferences in terms of h, instead of l.
Let those preferences be represented by the utility function,
In our empirical analysis we model the mother’s hours of work as a discrete choice variable equal to zero (h0), part-time (h1), or full-time (h2) and adopt the following specification of the utility function:
| (2) |
where MS is an indicator for the marital status of the mother, and 1(·) is the indicator function equal to 1 if the argument is true and 0 if not. Note that in household i the child n’s cognitive achievement score qin is assumed to enter as a relative score, measured relative to a power function of the mother’s Armed Forces Qualification Test (AFQT) score, q̄i. This allows high achieving mothers to have different standards in evaluating child success than lower achieving mothers, and it includes as a special case the mother’s score being irrelevant (when α3 = 0). Note that the model characterizes the behavior of both single and multiple-child families. In particular, child quality is additive in the number of children Ni. Thus the importance of school quality and other inputs to child development is proportional to the number of school-aged children in the household. The parameter α4ai measures the disutility of working part-time, when compared to not doing any market work, and α4bi measures the additional disutility from working full-time. A corresponding change in the marginal utility of consumption when working compared to not-working is captured by α1.
The specification also allows for direct geographic preferences for living in each of the four census regions of the U.S., as captured by the parameters α5j (j = 1, …, 4), where the R(k) ∈ {1, 2, 3, 4} denotes the region corresponding to location k. mk−1k measures the psychic cost associated with moving from the previous period’s location k−1 to k, which is further allowed to differ between moves within and across state borders and geographic census regions, as in
| (3) |
The function ST (·) maps each residential location into its corresponding state in the U.S. The stochastic components εh,i = (εh0,i, εh1,i, εh2,i) are maternal evaluations of the unobserved attributes of the three employment states. They are assumed to be independently and identically distributed over different working hours and across individuals and time periods, with an extreme value distribution with variance parameter b1. In addition, ξk,i represents random variation in the evaluation of the unobserved attributes of community k, which is assumed to be independently and identically distributed over different communities, individuals and time. It follows an extreme value distribution with variance parameter b2.
In the empirical model, we introduce two additional sources of heterogeneity in preferences. First, we specify the parameters measuring the disutility from working, α4ai and α4bi, as functions of the mother’s marital status, the number of children in the household between the ages of 0 and 5 (nyi), the number of children between the ages of 6 and 17 (noi), as well as a time-invariant mother-specific heterogeneity component :
Second, we allow for individual heterogeneity in the “reserve consumption” value γ0i by specifying . For families with multiple children, this specification implies that the number of children in the family also affects parental behavior through the disutility of working and the rates of substitution between consumption and leisure in the family’s objective function.
The husband’s earnings are assumed to be stochastic and location specific. The presence of a father directly influences preferences for leisure and consumption. In addition, as described below, it directly affects child outcomes. Our empirical framework implicitly treats the parents’ prior family formation, fertility and education behavior as exogenous in the analysis of current choice behavior. In addition we ignore capital markets, by assuming that parents do not save or borrow. Hence, the budget set when residing in location k can be expressed as
| (4) |
where wki represents the wage offered to the mother in location k, which is observed only after a person has decided to locate in k, and are the earnings of the father. Household non-labor income is denoted by , and cki explicitly measures the average housing cost in location k.1
As will be discussed in greater detail later, the child quality measure qin we use is discrete and is modeled using a very flexible parametric specification for the child quality production function, defined as
| (5) |
where t represents the child’s age, b is a indicator for the child gender, and Skn is a vector of school characteristics in location k believed to influence school outcomes of child n, including child-age/grade specific school quality measures such as per pupil expenditures and the average teacher salary, relative to average earnings in other professions. It also includes the average high school dropout rate, measuring community level differences across school districts, to capture school quality and peer effects. Zi is a vector of mother’s characteristics in the current period including the mother’s education, age, AFQT score, race and marital status.
The specification allows for a (one-period) reduction in the child’s performance due to a move. Unobserved heterogeneity in the child’s intellectual endowments is captured by the inclusion of . This term represents the child’s unobserved endowment at the age at which they enter our data set (age 5 or 6). The stochastic component εqn,i represents all other unobserved factors influencing the child’s outcome at each age, and it is assumed to be independently distributed across individuals and periods. The exact specification of fq and the distribution of εqni will be discussed later.
Similar to the treatment of child achievement scores, the mother’s hourly wage rates are discretized and the wage offer distribution function in location k is specified as a flexible parametric function of the variables describing local labor market conditions ( ), mother characteristics in Zi and the unobserved trait , as in
| (6) |
where εw,i captures stochastic changes over time in the offered wage rate. While each woman is assumed to know the wage offer distribution associated with each location, she is assumed to receive only one wage offer each period and only for the chosen location of residence. The specification of fw and the distribution of εw,i will be discussed later.
New earnings realizations for the husband are modeled in a similar way. They are specified as a flexible parametric distribution of local labor market variables for men, the husband’s characteristics, and the spouse’s unobserved ability endowment , where εy,i are i.i.d. earnings shocks, as in
| (7) |
3.2 Value Functions
Location and work decisions are made sequentially with a family first making a location decision. This is followed by the location specific realizations of the husband’s earnings and the wife’s wage offers, and then the mother makes her work decision. After each of these decisions has been made the child’s achievement score is realized. When deciding on her hours of market work, the mother takes into account the effect of her work decision as well as of the location-specific school quality level on the expected child outcome. Similarly, when the location decision is made, the effect of this decision on subsequent wage offer distributions, work decisions and the effect of these on the distribution of child outcomes also are taken into account. In doing so, both parental choice decisions will depend on the child’s innate ability as in Becker and Tomes (1976). The solution to this stochastic optimization problem can be obtained by solving backwards, first for the optimal work decision for each possible location choice, husband earnings realization and wage offer, and second, by determining the optimal location choice.
If the mother ends up in location k, with the husband earning , the mother receiving wage draw wk, and learning the utility shocks associated with each work decision (εh = (εh0, εh1, εh2)), the expected utility associated with each hours of work choice can be defined as
| (8) |
where Ωk is a set of state variables, Ωk = {t1, t2, …, tN, Sk1, …, Skn, Z, k−1}, μ represents the vector (μm, μs, μc), and x(h) and Q(h) are the values of consumption and the children’s achievement scores when the mother works h hours as implied by (4) and (5). We have dropped the household subscripts, i, to simplify notation.
This requires the calculation of an expectation because the children’s educational outcomes are stochastic (though influenced by mother’s hours of work choice) and realized after the work decision is made. With a finite number of outcome scores, for each hours of work choice, the expected utility to the family after observing her wage draw, her husband’s earnings and utility shocks, and integrating over all possible test score outcomes equals
| (9) |
where Ū [x(h), h, k] is the deterministic component of the utility function less utility derived from the children’s achievement score in equation (2). The first equality follows from the additive separability of the utility function in both errors, combined with the assumed independence (conditional on μ) of (εh, ξk) and εq.
This implies that given the choice of a location k, a wage rate offer wk and husband’s earnings realization , the optimal work decision can be defined as
| (10) |
With εh being independently, and identically distributed extreme value errors, the probability of woman from household i working hj (j = 0, 1, 2) hours conditional on wage and earnings draws wk and equals
| (11) |
At the time of the location decision, while each woman knows the distributions of her wage offer and the husband’s earnings at each location k = 1, 2, ···, K, she knows neither the actual wage she will be offered nor what her husband will earn at each location. Equally important, she does not know the realizations of the utility shocks associated with her future work decision or the shocks to the test score outcome discussed above. For any given offered wage and husband’s earnings level in location k, the expected maximum utility associated with choosing location k is given by
| (12) |
where the expectation is with respect to the extreme value distributed utility shocks εh, b1 is the variance of these shocks, and the maximization is subject to the budget constraint (4).
When making a residential location decision, the actual wage offers and husband earnings realizations in each possible destination are unknown. Integrating the value function defined in equation (12) over G possible (discrete) wage offers (w̄g, g = 1, …, G) and G possible husband’s earnings realizations ( , g′ = 1, …, G) in location k yields
| (13) |
At the time of the location decision ξk is known, and the optimal location decision for the agent can be defined as
| (14) |
Assuming i.i.d. extreme value errors for the ξk, the probability to the researcher that location k is chosen by family i is therefore given by
| (15) |
where Ωi = {(Ωki, Lki); k = 1,.., K}, and Lki represents the local labor market conditions at location k for individual i.
While we do make extensive use of multinomial logit formulations for describing the utility maximizing choices, it is important to note that our estimation strategy does not require the strong independence of irrelevant alternatives assumption. In particular, our use of person and family specific unobservables in the utility and production functions allows for the very real possibility that there can be correlation in utilities across all possible choices.
3.3 Wage, Spousal Earnings and Educational Production Functions
Empirically, the child achievement score q is discretized and the educational production function in (5) is specified using a flexible parametrization of its corresponding “hazard rate.” The hazard representation of the probability density function describes the conditional probability of a given outcome realization conditional on the outcome being equal to or greater than that value. We follow Gilleskie and Mroz (2004) in modeling the probability of an advance to a higher level, conditional on reaching at least a given achievement level p (i.e. one minus the hazard rate) using the logit function
| (16) |
where qin is the observed score of child n in household i, fbq(p) represents the “baseline” hazard at achievement level p and fqin captures the effect of covariates.
The baseline hazard is specified as a third-degree polynomial in the current (potential) achievement level p as follows:
| (17) |
where P is the highest score level. The covariate component is further specified as
| (18) |
where Skn is the vector of school inputs defined earlier, Zi is a vector of characteristics of the mother expected to influence the child’s performance, and is the estimated effect of the unobserved heterogeneity component . Empirically, it is approximated by a third degree polynomial in μc, where μc has a discrete distribution taking on J different values, with corresponding probabilities , j = 1, …, J.
Note that this specification is very flexible, allowing effects of covariates to differ at different levels of the child’s achievement score. In the estimations we also relax several of the separability assumptions implicit in (16) to allow for interactions of the baseline hazard, the age of the child, unobserved heterogeneity and school and parental characteristics by including interactions between fbq (p) and several determinants of fvin. This flexibility allows us to estimate the degree of substitutability or complementarity across all inputs. In addition to increased flexibility, this approach has a computational advantage in that discretization of the outcome variable greatly simplifies the problem of integrating over child outcomes which, as was shown in the value function expressions in section 3.2, now reduces to a simple calculation of a weighted sum.
We similarly model the mother’s wage offer distribution, discretizing the wage data into G groups, with the probability density function Pr(wk = w̄g) of having a wage offer w̄g similarly defined in term of the corresponding conditional hazard probabilities
| (19) |
| (20) |
| (21) |
and is the median local earnings for women with the same education level, race, and age. We allow for interactions between the baseline hazard fbw, and maternal characteristics.
Finally, the spouse’s earnings distribution is modeled similarly using G′ discrete values for ys, and with fwi and fbw,in equations (20) and (21) replaced by
| (22) |
| (23) |
where is the median local wage rate for men with the same education level, race, and age. Higher order terms and interactions between baseline hazard fby and the characteristics of the spouse are also included in fyi.
3.4 Estimation
As discussed below, our data set includes longitudinal information on the place of residence, the mother’s work decision and wage, the husband’s earnings (if married), the children’s achievement test scores, as well as all covariates described in our model, for each household i in our sample. While we observe for each child-year observation the location of residence and the mother’s work choice , we only observe the mother’s wage draw, and husband’s earnings at the location k* that was chosen. We do not observe any wage when the woman is not engaged in market work. It is important to note that our maximum likelihood framework, which accounts for correlated person and family specific unobservable components, controls for endogeneity and self-selection issues in wages, work, and place of residence.
Given the optimal decision rules (10) and (14), and specifying a discrete multinomial distribution for the unobserved heterogeneity vector μi, where μi can take on J sets of different values, the likelihood contribution for household i is given by
| (24) |
The sample likelihood function then is given by
| (25) |
The size of the choice set we consider (which is the set of all counties in the U.S.), makes direct implementation of the likelihood maximization procedure impractical. To deal with the large size of the choice set we apply a random sampling procedure proposed by McFadden (1978). Let C = {1, …, K} be the full choice set, and let D ⊆ C be a subset consisting of K̃ elements. The sampling method is to select the chosen alternative plus K̃− 1 non-chosen alternatives randomly drawn from the set C. Let be the probability that D will be drawn, given the observed choice . Then
As shown by McFadden, the choice probabilities in equation (24) can be replaced by:
| (26) |
Consistency of the resulting maximum likelihood estimators relies on the Independence of Irrelevant Alternatives (IIA) property of the error terms in the discrete choice model. It can be demonstrated that the estimator described in equation (26) possesses the IIA property, but only conditional on a given heterogeneity type μj. Unconditionally, we cannot rely on the IIA property to guarantee consistency. To assess the properties of the estimator in this case, Liu (2004) conducts a Monte-Carlo study to test the sensitivity of parameter estimates to the size of the sampling set K̃. For a simulated model which is comparable to that estimated here, he finds that coefficient estimates and standard errors remained essentially unchanged for choice sample sizes greater than 20. This finding was insensitive to the goodness-of-fit of the full choice set model.
3.5 Assessment of Modeling Assumptions
In our empirical model, we use maternal employment as a proxy for maternal time inputs to the child outcome production process. Ideally, we would like to have detailed diary data that permit us to measure time inputs more accurately. Unfortunately, these data are not available in the NLSY. However, tabulations from the 1997 Time Use Diaries in the PSID’s Child Development Supplement indicate that there is a close connection between the time mothers spend with children in learning activities and her labor supply. Working mothers of children aged less than 13 in two-parent households, spent 30% fewer minutes reading with their children than stay-at-home mothers. Additionally, when children are doing their homework, working mothers spend 20% less time with their children than non-working mothers. Though these simple mean comparisons are not causal effects, they do suggest that the time constraints faced by working mothers could lead them to devote significantly less time to their children’s intellectual development.
While the model in this study is “dynamic” within a decision period (which, as explained in the next section, will correspond to a 2-year period), it assumes that households are fairly myopic in their decision making. For example, the model does not allow parents to choose a current place of residence because that choice could reduce the future costs of moving from there to other locations. Nor does the model allow mothers to make a current employment decision because of its longer term impacts on their future earnings ability. Additionally, the modeling approach does not allow a family to move to a known job at the new destination. These are all potentially important shortcomings.
To some extent these model simplifications are due to the need to develop a tractable empirical model. For example, it would be computationally difficult if not infeasible to develop a dynamic optimization model where, for each of the over 3,000 possible location choices one might choose in the current time period, one would need to model precisely how a current location choice would affect the possible costs of moving to any one of the more than 3,000 possible locations in each and every subsequent time period. However, given the large moving costs we estimate, we suspect that such a planned sequencing of moves might not be one of the primary motivations for the vast majority of residential location decisions. We plan to explore this further in future research.
The assumption in the economic model that parents make a location decision based on location-specific wage offer distributions rather than their exact wage realization (which they are assumed to learn about only after a move) is also motivated in part by technical concerns. It is also, in part, motivated by an appeal to the actual job holding experiences of recent migrants to a new area. Technically, if the model allowed families to move in response to a known job prospect, it would be necessary to model all of their possible turned-down job prospects. Such data is almost never available, and so one would need to impose strong and arbitrary assumptions about the existence, number and values of “offers not taken.” However, it is important to note that most moves to a new area are not associated with long term jobs that were held when they arrived in the new area. In an analysis of data from the Panel Study of Income Dynamics (PSID), for example, we found that only about one half of recent migrants to an area who are working in the month after the move are working for that same employer two years later2. For most migrants, it would seem to be more realistic for them to consider all possible employment prospects in a particular area when undertaking a costly move. Again, the empirical facts suggest that the modeling assumption made here may not be any less reasonable than assuming people know they will earn a specific wage after moving to a new location and evaluating that location only on the basis of that one job prospect.
In our empirical analysis that follows, a county is used to represent a chosen school district. As a result, we are unable to capture switches between school attendance areas accompanying within-county moves. The 2000 Census numbers show that a bit more than one half (54.2%) of all moves from 1995 to 2000 are within a county, and we miss moves of this type. Nevertheless, a series of variance decompositions of school quality measures show that while there is non-negligible variation within counties, the contribution of the across county variation usually is nearly twice as important. This suggests that by focusing on across county moves we at least may be capturing most of the relevant variations in school quality, though certainly not all of the relevant variations.
The model developed in this study, like all empirical models, simplifies reality. But it is important to note that it does capture most of the salient determinants of location choices, schooling choices, and work choices. For the vast majority of people in the U.S., the choice of a residential location determines the school that their children will attend. While some people might move to particular areas because of a known job prospect, for most adults there is at most a moderate term duration of their first job in a new location. Mothers make decisions about working based upon the types of jobs they are offered, and they recognize that their work decisions could affect the distribution of possible developmental outcomes for their children. The model used in this paper to control for the endogeneity of the inputs to the production function for child quality incorporates all of these major features. More importantly, it represents an important first step and provides a useful and reasonable foundation for future research aimed at addressing some of its less realistic assumptions.
3.6 Model Identification
In order to discuss identification issues, it is useful to characterize the structural model estimated in this paper as consisting of three main components: (1) the child outcome equation as a function of two sets of endogenous inputs; maternal employment and school inputs, (2) the mother’s employment choice equation and (3) the school input equation, which corresponds to the residential location decision. Identification of the effect of both sets of inputs on child outcomes relies to a considerable extent on our assumption that a household’s place of residence when the first child reached age 5 or 6 is exogenous to the subsequent residence locations, work decisions, and child outcomes. Because moves are costly, with costs depending on the geographical distance between the current and new location, local labor market conditions and school characteristics in any given period will depend on the family’s initial location. Moreover, there are stochastic, age-varying exogenous changes over time in the distribution of local male and female labor market conditions and in child age/grade-specific school quality measures in each location, and these interact with the family’s initial location to generate possibly substantial variation in current labor market and school conditions. For example, those who face relatively improved labor market conditions in their initial location of residence will be less likely to move. In addition to cross-sectional variation in the family’s initial location and time-variation in geographic labor market conditions for men and women and in school characteristics for a fixed location, the treatment of time-varying fertility and marital status decisions as exogenous provides an additional source of identification. In future work, however, it would be important to explore relaxing these latter exogeneity assumptions.
Panel data relationships like those examined in this study can implicitly provide many additional identification conditions than one might infer by simply counting the number of contemporaneous exogenous variables (e.g. instruments) excluded from the structural equation of interest. There are two primary reasons for this. First, consider the case examined by Bhargava (1991) when one is willing to impose structural parameter stability over the time dimension. As Bhargava (1991) demonstrated, every lag of each instrument could have a separate effect on the “contemporaneous” value of the endogenous explanatory variables. This time dimension for the exogenous time varying instruments creates a multiplicity of “instruments” associated with each “exclusion restriction,” resulting in significantly more variables to control for endogeneity. He demonstrates that over-identification can be obtained under quite weak conditions. In this study the exogenous time-varying variables include the mothers’, children’s, and fathers’ ages, marital status, and age induced variations in the distribution of local labor marker conditions and school characteristics, and many of these factors do not vary deterministically through time. These time-varying exogenous factors interact with each observation’s initial place of residence to generate numerous exclusion restrictions.
A second source of additional identification arises in the context of dynamic nonlinear models. Mroz and Surette (1998) discuss this in more detail. Their discussion exploits the fact that variations in the time-ordering of the exogenous variables provide even higher degrees of over-identification than would be obtained by a simple reference to Bhargava’s (1991) observation as discussed above. The basic idea underlying their additional identification argument is that in dynamic nonlinear models of the type used here the impact of a lagged exogenous variable, say at (t − s), on a current endogenous variable, at t, depends crucially on the entire time series of all exogenous variables before t-s and between t-s and t. In the presence of endogenous determinants that evolve over time, the impact of an exogenous variable dated (t − s) on an endogenous determinant at t will vary depending on the values of the same exogenous variable dated at (t − s − 1) and at (t − 1). Each additional exogenous variable will typically increase dramatically the degree of identification.
By using an explicit sequential choice modeling framework, one can incorporate all such interactions that depend on the precise timing and sequencing of the values of the varying exogenous variables. The maximum likelihood approach we use here automatically incorporates these interactions among the time series properties of the sets of exogenous variables. They do so efficiently, without one having to resort to including numerous time-varying interactions of the exogenous variables in an arbitrary fashion, as would be the case with a more static instrumental variables approach.
It is important to note that this is the only study of the determinants of child test score outcomes we are aware of to consider the endogeneity of both mother’s work behavior and school characteristics. Clearly the results reported here will not provide the last word on the estimation of the effects of mother’s work and school characteristics on child outcomes. But the framework used here provides a useful starting point and lays a foundation for future research aimed at incorporating many additional, potentially important modeling issues to improve our understanding of this important topic.
4 Data Description
Our primary data source is the Geocode version of the NLSY79 data set and its Child-Mother Supplement. The NLSY79 began in 1979 with a national sample of 12,686 youngadults between the ages of 14 and 21. It included a nationally representative sample of 6,111 youths, an over-sample of 5,295 blacks, Hispanics, and economically disadvantaged whites, and a supplemental sample of 1,280 persons in the military in September 1978. Interviews with the military sub-sample were suspended after 1984 and for economically disadvantaged non-Hispanic whites after 1990. Following most of the literature based on NLSY data, we exclude the disadvantaged non-Hispanic white and military over-samples, because they were selected on the basis of potentially endogenous variables. The Black and Hispanic over-samples are included, and race and ethnicity indicators are included as explanatory variables to capture differences in preferences and opportunities. We restrict our analysis further to cases for which we have mother-child data.
Beginning in 1986, the NLSY-Child collected data on all of the children born to the female NLSY respondents. The NLSY-Child sample (through 1998) supplies data on children with mothers between the ages of 33 and 40 at the end of 1997. Children under the age of 15 comprise the majority of this sample. The NLSY-Child biennially interviewed both mother and child. The unit of observation in our analysis is a household, in which at least one child is observed between ages 5 and 15 at the time of interview. Children in the data can have up to six possible time-period specific observations. Because of the structure of the data set, we model some child outcomes at ages 6, 8, 10, 12 and 14, and other children’s outcomes at ages 5, 7, 9, 11, 13, and 15. Consequently, each period in our empirical model corresponds to a 2-year interval. The NLSY-Child contains a set of cognitive and behavioral assessments of each child at these ages. We exploit information on each child’s performance on the Peabody Individual Achievement Test (PIAT) in mathematics. This test is among the most widely used academic achievement assessment instruments that have demonstrably high re-test reliability, concurrent validity, and good psychometric properties (Markwardt, 1989). These scores, nationally normed by age and measured in terms of percentile scores, serve as our child outcome measure.
In our model households make residential location choices based on local labor market conditions, cost of housing, geographic preferences, and the quality of the school associated with that location. Regarding the first, we assume the relevant geographic labor market to be the county of residence. Using the Geocodes in the NLSY, matched with US Census 5-Percent Public-Use Microdata Samples (PUMS 5%) county-level data, we measure labor market conditions in the county by the median wage rate by education level, age, race and sex. We distinguish among at least three levels of education (less than high school, high school, more than high school), and compute the median wage by single year of age for each of two race categories (white, non-white). The local labor markets that each mother and father in the sample face are then characterized by the median wage in each locality corresponding to her/his education level, race, and age. That is, the median wage is used as a location specific determinant of the distribution of wages that the individual expects to face in that location. The school quality indicators we use to measure school inputs at each location are measured at the smallest geographic unit currently available in case of the NLSY, the county level3. The choice set from which families choose their residential location in each period therefore is the set of all 3,141 counties in the U.S. The 1990 Census school district special tabulation (see Appendix A) provides comprehensive geographic information on school district boundaries, permitting us to link the sample households to the school districts where they resided, assuming little temporal variation in the boundaries.
Two school quality indicators we consider are per-pupil average school expenditures and average teacher salaries. Both are measured as weighted averages by population size across all public schools in each county. We measure these variables relative to the average annual earnings of college educated males age 27–40 in the county. A third school and neighborhood characteristic we consider is the average high school dropout rate in each county. These quality measures are extracted from the School District Data Book (SDDB) and Common Core Data (CCD) files. The CCD is a comprehensive, annual, national statistical database of information concerning all public elementary and secondary schools (approximately 95,000) and school districts (approximately 17,000). The special Geocode data, combined with information from the SDDB also provide us with a measure of local housing costs. In particular, a special tabulation from the SDDB offers information on the median housing rent and the median housing value at the school district level. While ideally we would have wanted to model the residential location decision at the school attendance area, it is important to point out that our data set contains finer geographic information than is commonly used to analyze the relationship between school inputs and child outcomes (for example, Card and Krueger 1992).
For the estimation of the production function, we group the percentile scores of the children’s math tests into 10 discrete cells (5:1–10, 15:11–20, …,95:91–100), and model the determinants of how a child “progresses” (within a given period’s outcome distribution) from one cell to the next highest cell through the logit “hazard” model specified in section 3. The child’s score depends upon characteristics of the child such as his/her age and gender, as well as characteristics of the mother such as her age, marital status, schooling level, part-time and full-time work status and AFQT score. This production function also depends upon characteristics of the school district where the family resides, explicitly the high school dropout rate, average teacher salary as a fraction of the average annual earnings of college educated persons aged 27 to 40, and per child school expenditures as a fraction of the average earnings of individuals in the county. Monte-Carlo estimates reported by Gilleskie and Mroz (2004) indicate that little information is lost by using a discretized outcome rather than the more continuous one. That this should be the case becomes much clearer when examines the ability of a small number of percentile points to explain a continuous outcome. For example, the R2 from a regression of a normally distributed random variable on indicators of it falling within each of the five quintiles of the distribution is almost 0.9, and the R2 obtained from using deciles (ten dummy variable indicators) is over 0.95. At least for a normal distribution the discretized outcome clearly reflects nearly all of the information contained in the continuous measure.
To describe the wage offers available to the individual women in each locality k at t, wit, we use a discrete distribution with 10 points of support (i.e. wit ∈ {w̄1t, w̄2t, …, w̄10t}) and model the probability that each woman has an offered wage from each of the 10 categories. As in the child production function model, we use a “hazard” model to describe transitions across categories. Among the arguments in these hazard models, we allow for detailed interactions between individual level covariates and local area wages. In particular, for each of three levels of education (less than high school, high school, and more than high school), we compute the median wage by single year of age for each of two race categories (white, non-white). For each woman, we assign to her in each locality the median wage corresponding to her education level, race, and age. We use these median wages as explanatory variables for the hazard model of the wage offer distribution. We also allow for separate education level effects and effects of the mother’s AFQT score on the probabilities of each of these discrete wage offers. A specification with 10 points of support was similarly adopted for the spouse’s earnings equation. In addition to local median male yearly earnings in the county, the equation includes the father’s race, and his education level as explanatory variables.
Table 1 contains summary statistics for the sample data on parents and children. The table shows that the mean education levels and hourly wage rates of the mothers in our sample increased over the years. This is in part due to self selection into our sample through women’s decision to enter parenthood at younger versus older ages. Table 2 contains the data on school district characteristics broken down into 24 different study regions, which are used to conduct a series of policy simulation exercises. These study regions of residence are defined first by four geographic breakdowns (Northeast, Midwest, South, and West). Within each geographic area we make a further division by three levels of the school dropout rate within each region. We then divide each of these 12 geographic-dropout rate regions into two cells on the basis of the level of per student school expenditures in the school districts. As shown by Table 2, there is considerable regional variability in average values of these school district characteristics.
Table 1.
| A. Summary statistics for parents | ||||||||
|---|---|---|---|---|---|---|---|---|
| Variable | Mean (Standard Deviation) |
|||||||
| 1986 | 1988 | 1990 | 1992 | 1994 | 1996 | 1998 | Overall | |
| Age of mother | 26.540 (1.941) | 28.543 (1.938) | 30.060 (2.116) | 31.800 (2.178) | 33.600 (2.188) | 35.452 (2.183) | 37.340 (2.186) | 32.594 (4.003) |
| Married | 0.509 (0.500) | 0.507 (0.500) | 0.539 (0.499) | 0.568 (0.495) | 0.584 (0.493) | 0.602 (0.490) | 0.620 (0.486) | 0.570 (0.495) |
| High school | 0.516 (0.500) | 0.525 (0.500) | 0.534 (0.499) | 0.525 (0.500) | 0.494 (0.500) | 0.468 (0.499) | 0.447 (0.497) | 0.497 (0.500) |
| More than high school | 0.159 (0.366) | 0.182 (0.386) | 0.243 (0.429) | 0.293 (0.455) | 0.342 (0.475) | 0.393 (0.489) | 0.441 (0.497) | 0.313 (0.464) |
| Non-white | 0.647 (0.478) | 0.646 (0.478) | 0.617 (0.486) | 0.576 (0.494) | 0.564 (0.496) | 0.538 (0.499) | 0.505 (0.500) | 0.575 (0.494) |
| Mother’s AFQT score | 0.354 (0.267) | 0.357 (0.268) | 0.403 (0.271) | 0.433 (0.273) | 0.448 (0.276) | 0.472 (0.279) | 0.495 (0.283) | 0.433 (0.279) |
| Yearly housing cost1 | 0.499 (0.137) | 0.500 (0.136) | 0.504 (0.135) | 0.507 (0.136) | 0.508 (0.138) | 0.511 (0.140) | 0.518 (0.140) | 0.508 (0.138) |
| Move | 0.135 (0.342) | 0.164 (0.371) | 0.143 (0.350) | 0.067 (0.250) | 0.152 (0.359) | 0.091 (0.288) | 0.086 (0.281) | 0.115 (0.319) |
| Net income1 | 1.215 (1.878) | 1.375 (1.810) | 1.741 (2.033) | 2.003 (2.568) | 2.117 (2.432) | 2.548 (2.993) | 2.981 (3.168) | 2.105 (2.610) |
| Mother’s hourly wage rate2 | 0.661 (0.422) | 0.715 (0.768) | 0.781 (0.606) | 0.799 (0.528) | 0.909 (0.862) | 0.927 (0.967) | 1.032 (1.038) | 0.858 (0.805) |
| Father’s yearly earnings1 | 1.813 (1.409) | 1.963 (1.548) | 2.266 (1.752) | 2.342 (1.959) | 2.543 (2.099) | 2.844 (2.312) | 3.236 (2.555) | 2.541 (2.122) |
| Local median hourly wage rate for women2 | 0.809 (0.393) | 0.829 (0.347) | 0.923 (0.381) | 0.977 (0.419) | 1.021 (0.407) | 1.078 (0.430) | 1.102 (0.428) | 0.990 (0.419) |
| Part-time work3 | 0.353 (0.478) | 0.323 (0.468) | 0.329 (0.470) | 0.312 (0.464) | 0.289 (0.453) | 0.271 (0.445) | 0.283 (0.451) | 0.304 (0.460) |
| Full-time work3 | 0.298 (0.457) | 0.387 (0.487) | 0.416 (0.493) | 0.425 (0.495) | 0.462 (0.499) | 0.499 (0.500) | 0.529 (0.499) | 0.444 (0.497) |
| # of young children in HH (0–5) | 1.134 (0.945) | 0.710 (0.803) | 0.652 (0.781) | 0.647 (0.784) | 0.546 (0.738) | 0.437 (0.691) | 0.379 (0.650) | 0.604 (0.788) |
| # of old children in HH (6–17) | 1.127 (0.933) | 1.613 (0.990) | 1.641 (0.981) | 1.828 (0.982) | 1.946 (0.937) | 2.002 (0.945) | 2.011 (0.960) | 1.793 (0.996) |
| Sample size | 1075 | 992 | 1409 | 1688 | 1858 | 1859 | 1739 | 10620 |
| B. Summary statistics for children | ||||||||
|---|---|---|---|---|---|---|---|---|
| Variable | Mean (Standard Deviation) |
|||||||
| 1986 | 1988 | 1990 | 1992 | 1994 | 1996 | 1998 | Overall | |
| Age of child | 7.608 (2.008) | 9.072 (2.323) | 9.389 (2.553) | 9.805 (2.656) | 9.938 (2.617) | 10.063 (2.615) | 10.226 (2.553) | 7.608 (2.008) |
| Non-white | 0.663 (0.473) | 0.656 (0.475) | 0.629 (0.483) | 0.588 (0.492) | 0.570 (0.495) | 0.543 (0.498) | 0.506 (0.500) | 0.663 (0.473) |
| Boy | 0.528 (0.499) | 0.515 (0.500) | 0.504 (0.500) | 0.499 (0.500) | 0.498 (0.500) | 0.502 (0.500) | 0.509 (0.500) | 0.528 (0.499) |
| Dropout rate | 0.064 (0.011) | 0.063 (0.011) | 0.063 (0.011) | 0.063 (0.011) | 0.063 (0.011) | 0.062 (0.010) | 0.062 (0.010) | 0.064 (0.011) |
| Expenditure per pupil1 | 0.126 (0.039) | 0.124 (0.038) | 0.124 (0.038) | 0.124 (0.038) | 0.124 (0.039) | 0.124 (0.040) | 0.123 (0.040) | 0.126 (0.039) |
| Teacher salary1 | 0.760 (0.187) | 0.766 (0.181) | 0.771 (0.195) | 0.769 (0.190) | 0.772 (0.185) | 0.774 (0.187) | 0.770 (0.189) | 0.760 (0.187) |
| PIAT Math score | 0.451 (0.259) | 0.429 (0.252) | 0.445 (0.258) | 0.461 (0.261) | 0.477 (0.266) | 0.511 (0.272) | 0.524 (0.271) | 0.491 (0.259) |
| Sample size | 1493 | 1796 | 2515 | 3056 | 3300 | 3202 | 2873 | 18235 |
Notes: In $10,000’s of 1990 dollars;
In $10’s of 1990 dollars;
See appendix for the definition of part-time and full-time
Note: Measured as a proportion of yearly income of college-graduated prime-aged males. See appendix for details.
Table 2.
Study region definitions and school and other characterisstics
| Census region | Dropout divisions | Expenditure per pupil | Median dropout rate | Median teacher salary1 | Median expenditure per pupil1 | Median Annual Housing Cost | ||
|---|---|---|---|---|---|---|---|---|
| Raw data | Increased for simulation | Raw data | Increased expenditure for simulation | |||||
| Northeast | Low | Low | 4.75 | 5.93 | 0.780 | 0.143 | 0.179 | 0.997 |
| High | 4.82 | 6.03 | 0.867 | 0.205 | 0.257 | 0.898 | ||
| Middle | Low | 5.46 | 6.82 | 0.840 | 0.124 | 0.155 | 0.850 | |
| High | 5.52 | 6.89 | 0.840 | 0.179 | 0.224 | 0.715 | ||
| High | Low | 6.44 | 8.05 | 0.827 | 0.136 | 0.170 | 0.786 | |
| High | 6.88 | 8.60 | 0.976 | 0.181 | 0.226 | 0.676 | ||
| Midwest | Low | Low | 4.60 | 5.75 | 0.873 | 0.059 | 0.074 | 0.519 |
| High | 4.56 | 5.70 | 0.863 | 0.130 | 0.163 | 0.558 | ||
| Middle | Low | 5.47 | 6.84 | 0.782 | 0.082 | 0.102 | 0.588 | |
| High | 5.60 | 7.00 | 0.856 | 0.133 | 0.166 | 0.594 | ||
| High | Low | 6.51 | 8.14 | 0.755 | 0.088 | 0.111 | 0.602 | |
| High | 6.80 | 8.50 | 0.758 | 0.167 | 0.209 | 0.715 | ||
| South | Low | Low | 5.65 | 7.06 | 0.708 | 0.094 | 0.117 | 0.543 |
| High | 5.37 | 6.72 | 0.792 | 0.155 | 0.194 | 0.636 | ||
| Middle | Low | 6.28 | 7.85 | 0.718 | 0.093 | 0.117 | 0.537 | |
| High | 6.43 | 8.04 | 0.773 | 0.150 | 0.187 | 0.655 | ||
| High | Low | 7.74 | 9.68 | 0.749 | 0.102 | 0.128 | 0.606 | |
| High | 7.75 | 9.69 | 0.774 | 0.152 | 0.190 | 0.594 | ||
| West | Low | Low | 5.46 | 6.83 | 0.792 | 0.097 | 0.121 | 0.861 |
| High | 5.56 | 6.95 | 0.959 | 0.146 | 0.182 | 0.652 | ||
| Middle | Low | 6.51 | 8.14 | 0.817 | 0.094 | 0.118 | 0.856 | |
| High | 6.76 | 8.45 | 0.885 | 0.131 | 0.163 | 0.650 | ||
| High | Low | 7.48 | 9.35 | 0.803 | 0.115 | 0.144 | 0.937 | |
| High | 8.18 | 10.22 | 0.847 | 0.136 | 0.170 | 0.507 | ||
| Overall Sample Mean | 6.26 | 7.83 | 0.801 | 0.124 | 0.155 | 0.664 | ||
Notes: Measured as a proportion of yearly income of college-graduated prime-aged males. See appendix for details;
In $10,000’s of 1990 dollars
5 Estimation Results
I. Point Estimates of Utility and Production Function Parameters
Tables A1 to A5 in the appendix report parameter estimates for the production function, utility function, wage offer, and heterogeneity distribution parameters. The estimates in these tables assume that there is unobserved heterogeneity that can influence the production function, the “reserve” consumption in the utility function, the utility costs of full- and part-time work, the mother’s wage offer distribution, and the earnings distribution for the mother’s spouse. We specify four points of support for this unobserved heterogeneity. We permit up to a third order polynomial in the value of the heterogeneity to influence all of the model components, so this is close to a non-parametric specification of the unobservable heterogeneity in this semi-parametric model. We model this heterogeneity as constant throughout the years of observation on the mother’s family.
Given the complex interactions between most of the covariates in the economic model, it is quite difficult to give a simple interpretation to each of the point estimates. Consider, for example, the impact of the mother having a high school degree on the wage hazard model presented in Table A3. The interpretation of this parameter is how a woman being a high school graduate instead of a dropout influences the conditional probability of earning a wage above a given wage level, holding constant the median wage paid to women in her locality who are high school dropouts. Thus it does not incorporate the local labor market returns to a high school diploma. The complete effect of having a high school degree on her wage level is much more complicated than this. To examine the actual marginal effect of having a high school diploma instead of a being a dropout, one would need to consider this impact in conjunction with how the higher median wages offered to high school graduates in that location affects the wage offer distribution. Therefore, we will analyze the estimated marginal effects of these explanatory variables using simulation methods described in the following subsection.
II. Marginal Effect Estimates
In Table 3, we report OLS estimates of various linear specifications of the production function (PIAT math test score). The coefficient estimates of school district characteristics and maternal work decisions are fairly robust. Table 4 presents three sets of marginal, ceteris paribus, effect estimates of the impacts of characteristics on the child’s PIAT Math test score. We obtain these marginal effects by simulating outcomes using the estimated parameters. These simulated effects do not include indirect effects on child outcomes through the effect of changes in the covariates on parental location or work decisions. The first column contains OLS estimates, while columns 2 and 3 contain estimates based upon the “hazard” model for the child scores with controls for unobserved heterogeneity. We simulated how expected test scores would change in response to varying explanatory variables one at a time, normalized to a unit change in the characteristic as is the case with the OLS estimates.
Table 3.
Educational production function estimates marginal effects for alternative OLS specifications
| OLS |
|||
|---|---|---|---|
| Variable | 1 | 2 | 3 |
| Age of mother | −0.027*** | −0.025*** | −0.026*** |
| (0.005) | (0.005) | (0.005) | |
| Age of child | 0.020*** | 0.016** | 0.019** |
| (0.007) | (0.007) | (0.007) | |
| Married | 0.025*** | 0.028*** | 0.026*** |
| (0.005) | (0.004) | (0.004) | |
| High school (mother) | 0.036*** | 0.039*** | 0.035*** |
| (0.005) | (0.005) | (0.005) | |
| More than high school | 0.063*** | 0.066*** | 0.062*** |
| (0.006) | (0.006) | (0.006) | |
| Non-white | −0.058*** | −0.056*** | −0.056*** |
| (0.004) | (0.005) | (0.005) | |
| Boy | −0.001 | −0.001 | −0.001 |
| (0.004) | (0.004) | (0.004) | |
| Dropout rate | – | −0.802*** | −0.790*** |
| – | (0.180) | (0.180) | |
| Teacher salary | – | −0.013 | −0.015 |
| – | (0.010) | (0.010) | |
| Expenditure per pupil | – | 0.158*** | 0.165*** |
| – | (0.047) | (0.047) | |
| Move | 0.011** | 0.011** | 0.011** |
| (0.006) | (0.006) | (0.006) | |
| Part-time work | 0.027*** | – | 0.027*** |
| (0.005) | – | (0.005) | |
| Full-time work | 0.021*** | – | 0.022*** |
| (0.005) | – | (0.005) | |
| AFQT score (mom’s) | 0.273*** | 0.274*** | 0.270*** |
| (0.009) | (0.009) | (0.009) | |
| Intercept | 0.255*** | 0.308*** | 0.298*** |
| (0.017) | (0.022) | (0.022) | |
Notes: Standard errors are in parentheses.
Statistically significantly at the .90 level.
Statistically significantly at the .95 level.
Statistically significantly at the .99 level.
Table 4.
Educational production function estimates marginal effects for OLS and FIML specifications
| Marginal Effects |
|||
|---|---|---|---|
| Variable | OLS | Production function only (no endogeneity Controls) | Full model (with selection and endogeneity controls) |
| Exogenous Variables | |||
| Age of mother | −0.026*** | −0.053*** | −0.063*** |
| (0.005) | (0.009) | (0.005) | |
| Age of child | 0.019** | 0.024*** | 0.019*** |
| (0.007) | (0.008) | (0.005) | |
| Married | 0.026*** | 0.015*** | 0.019*** |
| (0.004) | (0.004) | (0.005) | |
| High school (mother) | 0.035*** | 0.030*** | 0.026*** |
| (0.005) | (0.004) | (0.003) | |
| More than high school | 0.062*** | 0.053*** | 0.062*** |
| (0.006) | (0.005) | (0.004) | |
| Non-white | −0.056*** | −0.048*** | −0.056*** |
| (0.005) | (0.004) | (0.003) | |
| Boy | −0.001 | 0.013*** | 0.007*** |
| (0.004) | (0.003) | (0.002) | |
| AFQT score (mom's) | 0.270*** | 0.270*** | 0.332*** |
| (0.009) | (0.007) | (0.008) | |
| Intercept | 0.298*** | - | - |
| (0.022) | - | - | |
| Endogenous Variables | |||
| Dropout rate | −0.790*** | −0.752*** | −0.216* |
| (0.180) | (0.174) | (0.111) | |
| Teacher salary | −0.015 | −0.011 | −0.012*** |
| (0.010) | (0.012) | (0.004) | |
| Expenditure per pupil | 0.165*** | 0.186*** | 0.091*** |
| (0.047) | (0.045) | (0.028) | |
| Move | 0.011** | 0.016*** | −0.058*** |
| (0.006) | (0.006) | (0.004) | |
| Part-time work | 0.027*** | 0.036*** | −0.009*** |
| (0.005) | (0.005) | (0.003) | |
| Full-time work | 0.022*** | 0.031*** | −0.012*** |
| (0.005) | (0.005) | (0.004) | |
Notes: Standard errors are in parentheses.
Statistically significantly at the .90 level.
Statistically significantly at the .95 level.
Statistically significantly at the .99 level.
The estimates used for the simulation results in Column 2 are based on estimates of the flexible functional form of the production function described in equations (16) to (18), using exactly the same form of the heterogeneity distribution as in the structural model. However, this specification does not incorporate any controls for selection or endogeneity. Its estimates were obtained by estimating the production function separately from the rest of the economic model, with the right-hand side variables treated as independent of the error terms, including the heterogeneity component μc. Note that this is equivalent to restricting the correlations between the heterogeneity terms in the various equations of the model to be zero. These simulated marginal effects are directly comparable to the OLS estimates. The marginal effects in Column 3 were based on estimates of the structural model that incorporated the endogeneity of the location decisions and the endogeneity of the mothers’ hours of work decisions. The standard errors of the marginal effects are calculated using a parametric bootstrap (100 replications) simulated using the estimated covariance matrix of the parameter estimators.
A comparison of the OLS estimates in Column 1 and the simulated derivatives in Column 2 isolates the impact of using a more flexible functional form for the estimation of the child outcome production function. For the most part, the estimated derivatives are quite close for these two models, with most of the absolute differences being less than one standard error of the OLS model. The standard errors in Column 2 are almost always smaller than those in Column 1, suggesting that the nonlinear production function provides more accurate estimates than does OLS.
The estimates of the marginal effects presented in the last column in Table 4 are based on estimates of the structural model which accounts for endogeneity of the parental choices including the location decisions. After controlling for the endogeneity of these variables, most of the impacts of the exogenous variables are fairly close to those from the two models that do not control for the endogeneity of the production function inputs. The impacts of the potentially endogenous variables, however, do change significantly. The negative impact of the local dropout rate on expected child outcome diminishes by more than 74 percent. The negative effect of higher teacher salaries falls by about 20 percent, and the positive impact of expenditures per pupil falls by 44 percent. Overall, the estimated effects of school district characteristics on a child’s test score, diminish considerably after using the structural model to control for endogeneity of the location and work decisions.
The impact of the family changing its county of residence becomes negative and statistically significant after controlling for the endogenous behaviors, while without the endogeneity controls it appears that a move would significantly increase a child’s test score. Specifically, without endogeneity controls, the OLS estimates and estimated marginal effects for the nonlinear functional form suggest that moving improves child test score by 1.1 and 1.6 percentile points respectively. The estimates obtained from the full structural model with endogeneity controls indicate that moving actually causes a 5.8 percentile point drop in child scores. This suggests that parents of children with higher intellectual endowments (higher values of μc) are more likely to move, so that in a reduced-form regression the true negative disruptive effect of relocation on child performance is confounded with non-random selection of children who experienced moves versus those who did not.
The positive impacts of the mother working part time or full-time (instead of not working) both become negative after controlling for the endogeneity of these parental decisions. Without accounting for endogeneity, it would appear that a mother’s working full-time could increase her child’s percentile score by 2.2 points. After controlling for endogeneity the estimated impact of a mother working full-time implies an expected decline in the child’s percentile score of 1.2 points. The estimates from the structural model with endogeneity controls provide significantly different estimated impacts of the effects of school district characteristics and parental work decisions on children’s expected test scores.
The first panel of Table 5 contains the marginal effects of schooling, race, and the mother’s AFQT score on the mothers offered wage distribution. The impact of being non-white and of local labor market conditions (measured by the local median wage) on offered wages increases after controlling for the selection into the labor force, while the magnitude of the mothers AFQT score falls by over 20 percent. The second panel of Table 5 contains the marginal effects on the fathers’ earnings. In the full model, the effects on earnings of higher levels of schooling are slightly lower than in the model without the full set of controls. The estimated effect of race is also smaller in the complete model.
Table 5.
Marginal effect estimates of wage and earnings equations
| A. Mother’s wage equation1: OLS and FIML specifications2 | |||
|---|---|---|---|
| Marginal Effects |
|||
| Variable | OLS | Wage function only (no endogeneity Controls) | Full model (with selection and endogeneity controls) |
| Local median wage | 0.158*** | 0.204*** | 0.224*** |
| (0.024) | (0.018) | (0.024) | |
| High school | 0.029 | 0.122*** | 0.141*** |
| (0.030) | (0.015) | (0.039) | |
| More than high school | 0.196*** | 0.220*** | 0.212*** |
| (0.035) | (0.021) | (0.021) | |
| Non-white | 0.114*** | 0.129*** | 0.302*** |
| (0.022) | (0.012) | (0.022) | |
| Mother AFQT score | 0.652*** | 0.606*** | 0.475*** |
| (0.045) | (0.026) | (0.027) | |
| Intercept | 0.245*** | - | - |
| (0.037) | - | - | |
| B. Father’s earnings equation 1: OLS and FIML specifications2 | |||
|---|---|---|---|
| Marginal Effects |
|||
| Variable | OLS | Earning function only (no endogeneity Controls) | Full model (with selection and endogeneity controls) |
| Local yearly median earnings | 0.027*** | 0.035*** | 0.029*** |
| (0.002) | (0.002) | (0.002) | |
| High school | 0.371*** | 0.520*** | 0.495*** |
| (0.071) | (0.034) | (0.038) | |
| More than high school | 1.271*** | 1.291*** | 1.134*** |
| (0.080) | (0.049) | (0.048) | |
| Non-white | −0.844*** | −0.755*** | −0.545*** |
| (0.052) | (0.032) | (0.035) | |
| Intercept | 1.598*** | - | - |
| (0.083) | - | - | |
Notes: Hourly wage is measured in $10 of 1990 dollars;
Standard errors are in parentheses.
Statistically significantly at the .90 level,
Statistically significantly at the .95 level,
Statistically significantly at the .99 level.
Notes: Earnings are measured in $10,000’s of 1990 dollars;
Standard errors are in parentheses.
Statistically significantly at the .90 level,
Statistically significantly at the .95 level,
Statistically significantly at the .99 level.
III. Goodness of Fit and Policy Simulations
Using the parameter estimates from the full structural model and the corresponding estimated optimal decision rules, we are able to simulate a set of residential location choice and maternal work decisions, and child outcomes for an observed household at each period. We then map the simulated choices of the counties of residence into the 24 study regions defined earlier in the data description section. The simulated frequencies in the second column of Table 6 demonstrate that the structural model fits the observed distribution of residential location decisions, as categorized in terms of the 24 different regions, quite well. A primary reason why this model appears to perform so well is that we treat the initial residential location decision (when the child was either age 5 or age 6) as exogenous. This, in conjunction with the relatively large utility costs of migration that we estimate makes it relatively easy to fit the marginal distribution of residences. The third and fourth columns of Table 6 represent actual and simulated average employment rates of mothers among 24 study regions. The sample and simulated average child test scores are presented in the fifth and sixth columns. For most regions, the model predicts average employment decisions and child scores fairly accurately.
Table 6.
Goodness of fit: Simulated region of residence, employment rate, and PIAT math score
| Location | Employment | Math Score | ||||||
|---|---|---|---|---|---|---|---|---|
| Raw data | Simulated | Raw data | Simulated | Raw data | Simulated | |||
| All regions | - | - | 75.1445 | 75.6247 | 0.4910 | 0.4904 | ||
| Census region | Dropout divisions | Expenditure per pupil | ||||||
| Northeast | Low | Low | 1.460 | 1.508 | 80.000 | 68.021 | 0.590 | 0.548 |
| High | 1.412 | 1.403 | 76.667 | 74.329 | 0.531 | 0.498 | ||
| Middle | Low | 1.733 | 1.846 | 75.543 | 75.628 | 0.555 | 0.517 | |
| High | 3.107 | 3.206 | 69.394 | 78.159 | 0.541 | 0.514 | ||
| High | Low | 2.580 | 2.514 | 63.504 | 73.597 | 0.444 | 0.479 | |
| High | 3.183 | 3.162 | 64.201 | 74.967 | 0.449 | 0.449 | ||
| Midwest | Low | Low | 3.597 | 3.661 | 78.272 | 73.119 | 0.584 | 0.558 |
| High | 3.795 | 3.983 | 84.119 | 73.408 | 0.568 | 0.542 | ||
| Middle | Low | 2.834 | 2.720 | 79.402 | 79.615 | 0.591 | 0.535 | |
| High | 5.716 | 5.490 | 72.652 | 77.776 | 0.483 | 0.495 | ||
| High | Low | 4.529 | 4.573 | 69.647 | 75.541 | 0.470 | 0.494 | |
| High | 3.380 | 3.371 | 64.624 | 75.471 | 0.487 | 0.465 | ||
| South | Low | Low | 7.599 | 7.554 | 77.323 | 76.072 | 0.486 | 0.483 |
| High | 6.620 | 6.532 | 78.378 | 72.407 | 0.463 | 0.470 | ||
| Middle | Low | 8.409 | 8.204 | 79.731 | 77.805 | 0.423 | 0.467 | |
| High | 6.638 | 6.508 | 77.872 | 77.212 | 0.464 | 0.462 | ||
| High | Low | 6.902 | 6.991 | 75.580 | 80.013 | 0.422 | 0.443 | |
| High | 6.733 | 6.933 | 76.084 | 74.152 | 0.430 | 0.465 | ||
| West | Low | Low | 2.034 | 2.079 | 70.370 | 77.364 | 0.483 | 0.502 |
| High | 2.476 | 2.841 | 77.947 | 76.881 | 0.477 | 0.528 | ||
| Middle | Low | 2.985 | 2.867 | 72.871 | 79.884 | 0.501 | 0.464 | |
| High | 3.503 | 3.465 | 73.925 | 77.295 | 0.468 | 0.500 | ||
| High | Low | 8.136 | 7.919 | 69.560 | 70.943 | 0.449 | 0.458 | |
| High | 0.640 | 0.669 | 82.353 | 74.434 | 0.305 | 0.432 | ||
Table 7 presents the simulated overall average moving rate, percentage employed, and average child test scores, given three hypothetical scenarios. In the third column, we increase per pupil school expenditures in each county by 25%. The actual average expenditure levels for each region are presented in Table 2. It appears that parents are re-optimizing their migration and maternal employment decisions, given the exogenous changes to school district characteristics. After increasing the expenditure levels for all regions, there is a reduction in the migration rate and a small increase in the employment rate. Despite an increase in per-pupil expenditures, due to the dominating effect of declined maternal time inputs, the simulated average child test scores are lower than the baseline level by 0.4 percentage point. The fourth column represents a hypothetical 25% increase in the high school dropout rate in each county. In this case, the model predicts a small reduction in the moving rate and a small increase in maternal employment. Combined with the direct negative effect of an overall rise in dropout rates, we find that the resulting average child test scores drop by 1.1 percentage points. Finally, the fifth column represents a 25% across the board increase in wage rates. We find the scale of the migration rate reduction to be similar to that of the previous two experiments, while the rise in employment rate is somewhat greater than the results in columns 3 and 4. With all of the effects on child outcomes being indirect effects in this case, we find a decline in average child test scores that falls between those associated with increases in average per-pupil expenditures and in dropout rates. Simulation results from several additional “local” policy interventions, in which inputs are changed in only a subset of locations, are discussed in Liu et al. (2002). Moreover, the model developed in this paper can be readily adapted to analyze other interesting local policy simulations. For example, Liu (2008) adds family participation in welfare programs to the choice set and conducts policy simulations to study welfare induced migration behaviors and subsequent impact on child outcomes.
Table 7.
Simulated impacts on migration, employment, and math scores
| Simulated |
|||||
|---|---|---|---|---|---|
| Raw data | Baseline | Increase expenditure by 25% | Increase dropout rate by 25% | Increase wage rate by 25% | |
| Biyearly migration rate | 11.34 | 11.92 | 11.42 | 11.41 | 11.41 |
| Employment rate | 75.14 | 75.62 | 75.70 | 75.71 | 76.14 |
| Math score | 0.491 | 0.490 | 0.486 | 0.479 | 0.483 |
6 Conclusions
Estimating the educational production function as part of a structural model provides significantly different estimates of the production process. For the most part, the impacts of the school district characteristics diminish by factors of 2 to 4 after controlling for the fact that families may be choosing where to live because of the school district characteristics and labor market opportunities. We also find that the impacts on child outcomes of having moved and of working full-time (as opposed to not working) to change signs and remain statistically significant after controlling for the possible endogeneity of these decisions. One interpretation of these changes in the estimated production function impacts is that families whose children would anyways perform quite well tend to choose to live in school districts with the highest levels of productive inputs while at the same time working more. This is a standard endogeneity of inputs argument. Inputs to the educational production process remain statistically significant determinants of the child outcomes, but their effects are much smaller than is implied by estimation methods that do not deal with the possibility of endogeneity biases.
When we turn to the estimates of the overall effects of changes in characteristics on child outcomes, a somewhat different story emerges. Since parents can re-optimize by choosing different school districts and hours of work, many of the benefits (or detriments) to students from changing school district characteristics end up having only minor impacts on the child test scores. While we have only considered a small number of simulations, these results do suggest that it is important to recognize that parents’ decisions about work and school districts can offset improvements in the school environments. Additionally, parents can substitute into better school districts in order to make up for the fact that full-time work can leave the parents with less time to spend helping their children learn. The presence of such substitution possibilities might help explain some of the disparate estimates of the educational production function found in the literature.
The results reported in the paper are among the first to explicitly recognize and model the joint decisions by parents of school characteristics for their children and the mother’s labor supply when assessing the impacts of these factors on their children’s developmental outcomes. As such, our model imposes many simplifying assumptions, and future research should assess whether the principle economic findings continue to hold after relaxing some of the more important ones. Among these we identify three important issues to be explored. First, for many localities the characterization of schools by a set of average county-level school quality measures is not very realistic, and it will be important to recognize the within county variability in school characteristics. Data limitations imposed due to confidentiality concerns might hamper progress on this front, but it may be feasible to develop consistent modeling and estimation approached for explicitly dealing with this aggregation issue.
Second, more than 10 percent of children are enrolled in private schools. This important feature of the market for schools is likely to interact with residential location and labor supply decisions and the private school option should therefore be included in future research. Third, it would be useful to consider alternative, more dynamic, characterizations of the educational production process, such as those based on value added measures of child achievement. These important limitations of our preliminary study imply that our findings should be considered more exploratory than definitive. However, we believe that our modeling approach and the corresponding results showing that controlling for the endogeneity of the inputs to the child development process yields substantively different results than approaches that ignore the joint determination of the inputs and outputs, provides a fruitful first step in analyzing this important issue theoretically and empirically.
Acknowledgments
Grants from the National Institutes of Child Health and Development (R01-HD047213) and the Fogarty International Center (RO1-TW05596) provided partial support for this research. We have benefited from helpful comments from participants at the 2002 SITE meetings at Stanford, participants at the 2003 PET meetings at Duke, participants at the 2005 Conference on Structural Models in Labor, Aging, and Health, and seminar participants at the University of Essex, University of Wisconsin-Madison, Northwestern University, Johns Hopkins University, University of Pennsylvania, Laval University, Georgia State University, Ohio State University, Clemson University, University of South Carolina, University of South Florida, and University of North Carolina-Greensboro. The authors would like to thank Donna Gilleski and Ahmed Khwaja for their excellent editorship and two referees for insightful comments. The views and opinions offered in this article do not necessarily reflect those of the Federal Reserve Bank of New York or the Federal Reserve System as a whole.
Appendix A. Appendix Tables
Table A1.
Parameter estimates from the full model production function parameters
| Variable | Estimate | Std. Err. |
|---|---|---|
| Intercept | −0.430 | 0.337 |
| Age of mom (in 10 years) | 0.058 | 0.345 |
| child age (in 10 years) | 0.021 | 0.135 |
| child age squared | −0.064 | 0.020 |
| Married | 0.109 | 0.016 |
| Mother education high school | 0.147 | 0.018 |
| Mother education more than HS | 0.353 | 0.023 |
| Non-white | −0.312 | 0.016 |
| Boy | 0.042 | 0.013 |
| Dropout rate | −0.152 | 3.757 |
| Expenditure per pupil | 1.555 | 0.915 |
| Teacher salary | −0.135 | 0.207 |
| Move | −0.330 | 0.022 |
| Mother part-time work | 0.178 | 0.070 |
| Mother full-time work | 0.200 | 0.071 |
| Dropout rate × Log(P-p) | −0.231 | 1.260 |
| Expense per pupil × Log(P-p) | −0.229 | 0.327 |
| Teacher salary × Log(P-p) | −0.010 | 0.082 |
| Mother part-time × Log(P-p) | −0.147 | 0.035 |
| Mother full-time × Log(P-p) | −0.210 | 0.036 |
| Dropout rate× Child age | −0.784 | 2.895 |
| Expense per pupil × Child age | −0.675 | 0.747 |
| Teacher salary×Child age | −0.120 | 0.151 |
| Part-time × Child age | 0.030 | 0.036 |
| Full-time × Child age | 0.111 | 0.042 |
| Mother’s AFQT score | 0.347 | 0.086 |
| Mother’s AFQT × Log(P-p) | 0.539 | 0.053 |
| Loading on 1st order heterogeneity factor | −0.351 | 0.211 |
| Loading on 2nd order heterogeneity factor | −1.845 | 0.516 |
| Loading on 3rd order heterogeneity factor | 0.949 | 0.327 |
| 1st order of baseline hazard [log(P-p)] | 4.738 | 0.148 |
| 2nd order of baseline hazard [log(P-p)]^2 | 9.140 | 0.063 |
| 3rd order of baseline hazard [log(P-p)]^3 | 6.873 | 0.029 |
| 4th order of baseline hazard [log(P-p)]^4 | 1.551 | 0.012 |
Note: P is the highest level of discretized scores and p is any given discretized score level.
Table A2.
Parameter estimates from the full model utility function parameters
| Variable | Estimates | Std. Err |
|---|---|---|
| Intercept in reserve | 2.765 | 0.490 |
| 1st order discrete factor loading in reserve | −3.317 | 1.355 |
| 2nd order discrete factor loading in reserve | 1.744 | 1.094 |
| Power of consumption (γ1) | 0.003 | 0.002 |
| Married (α0) | 0.163 | 0.061 |
| Any work (α1) | 0.004 | 0.003 |
| Relative power of mother’s AFQT score (α3) | 1.005 | 0.757 |
| Scale on child’s score/mother’s AFQT score (α2) | 2.432 | 5.817 |
| Reserve child’s score/mother’s AFQT score (γ2) | 1.709 | 4.251 |
| Power of child’s score/mother’s AFQT score (γ3) | 0.136 | 0.530 |
| Intercept in part-time leisure | 0.427 | 0.338 |
| 1st order discrete factor loading in part-time leisure | −1.499 | 0.639 |
| 2nd order discrete factor loading in part-time leisure | 0.169 | 0.964 |
| 3rd order discrete factor loading in part-time leisure | 1.460 | 0.843 |
| # of young children (0–5) in part-time leisure | 0.094 | 0.030 |
| # of old children (6–17) in part-time leisure | −0.003 | 0.006 |
| Married in f4a(.) | 0.008 | 0.014 |
| Intercept in full-time leisure | −2.088 | 0.155 |
| 1st order discrete factor loading in full-time leisure | −0.245 | 1.075 |
| 2nd order discrete factor loading in full-time leisure | 3.231 | 0.984 |
| 3rd order discrete factor loading in full-time leisure | −1.079 | 25.172 |
| # of young children (0–5) in full-time leisure | 0.542 | 0.057 |
| # of old children (6–17) in full-time leisure | 0.170 | 0.032 |
| Married in f4b(.) | −0.104 | 0.100 |
| Moving psychic cost (any move) | −3.739 | 1.147 |
| Additional moving psychic cost across states | −1.629 | 0.481 |
| Additional moving psychic cost across census regions | −0.570 | 0.170 |
| Inverse of parameter b Gumbel error on working choice | 3.227 | 0.270 |
| Inverse of parameter b Gumbel error on location choice | 1.969 | 0.578 |
| Dummy for census region – Midwest1 | −0.369 | 0.145 |
| Dummy for census region – South | −0.224 | 0.108 |
| Dummy for census region – West | −0.073 | 0.101 |
Note: Northeast is the base region.
Table A3.
Parameter estimates from the full model mother’s wage equation parameters
| Variable | Estimates | Std. Err. |
|---|---|---|
| Intercept | −1.859 | 0.135 |
| Local wage rate | 0.912 | 0.102 |
| Local wage rate squared | −0.236 | 0.029 |
| Local wage rate × discrete factor | 0.050 | 0.122 |
| High school (mother) | 0.328 | 0.033 |
| More than high school (mother) | 0.548 | 0.043 |
| Non-white | 0.238 | 0.023 |
| Mother’s AFQT score | 0.405 | 0.200 |
| Mother’s AFQT × Log(G-g#) | −1.633 | 0.151 |
| Loading on 1st order heterogeneity factor | −1.232 | 0.223 |
| Loading on 2nd order heterogeneity factor | −0.223 | 0.211 |
| 1st order of baseline hazard [log(G-g)] | 0.058 | 0.058 |
| 2nd order of baseline hazard [log(G-g)]^2 | −0.664 | 0.132 |
| 3rd order of baseline hazard [log(G-g)]^3 | −0.247 | 0.076 |
Note: G is the highest level of discretized wage rates; g is any given discretized wage rate level.
Table A4.
Parameter estimates from the full model father’s earnings function parameters
| Variable | Estimates | Std. Err. |
|---|---|---|
| Intercept | −1.658 | 0.102 |
| Local median earnings | 0.040 | 0.003 |
| Local median earnings × discrete factor | 0.000 | 0.000 |
| High school (father) | −0.019 | 0.004 |
| More than high school (father) | 0.352 | 0.039 |
| Non-white | 0.809 | 0.046 |
| Loading on 1st order discrete factor | −0.618 | 0.033 |
| Loading on 2nd order discrete factor | −3.650 | 0.221 |
| 1st order of baseline hazard [log(G′-g)] | −0.402 | 0.181 |
| 2nd order of baseline hazard [log(G′-g′)]^2 | −1.357 | 0.207 |
| 3rd order of baseline hazard [log(G′-g′)]^3 | −0.964 | 0.213 |
Note: G′ is the highest level of discretized earnings; g is any given discretized earning level.
Table A5.
| A. Parameter estimates from the full model | ||
|---|---|---|
| Variable | Estimates | Std. Err. |
| Probability parameter at 0 | 1.552 | 0.129 |
| Probability parameter at 1/3 | −1.932 | 0.130 |
| Probability parameter at 2/3 | 3.777 | 0.163 |
| B. Discrete probability distribution of heterogeneity types | ||
|---|---|---|
| Heterogeneity factor | Probability | |
| 0 | 0.201 | |
| 1/3 | 0.277 | |
| 2/3 | 0.369 | |
| 1 | 0.153 | |
Appendix B. Summaries of Major Data Sources
B.1. The National Longitudinal Survey of Youth
The original NLSY began in 1979 with a national sample of 12686 young adults between the ages of 14 and 21. It included a nationally representative sample of 6,111 youths, an over-sample of 5,295 blacks, Hispanics, and economically disadvantaged whites, and a supplemental sample of 1,280 persons in the military in September 1978. Interviews with the military sub-sample were suspended after 1984 and for economically disadvantaged non-Hispanic whites after 1990. In this study, we exclude these economically disadvantage non-Hispanic whites and focus on the mothers who had mother-child data from 1986 to 1998.
B.2. The National Longitudinal Survey of Youth - Children Sample
Beginning in 1986, the NLSY-Child collected data on all of the children born to the female NLSY respondents. The NLSY-Child sample (through 1998) supplies data on children with mothers between the ages of 33 and 40 at the end of 1997. Children under the age of 15 comprise the majority of this sample. The NLSY-Child contains a set of cognitive and behavioral assessments. The NLSY-Child sample biennially interviews both mother and child.
B.3. The Common Core of Data (CCD)
This is a database of the U.S. Department of Education’s National Center for Education Statistics that annually collects fiscal and non-fiscal data about all public schools, public school districts and state education agencies in the United States. Grade-specific expenditures per pupil are obtained by aggregating over all public schools within a given county from this data set.
B.4. Top 100 Database of Key Demographic Items, School District Data Book
This is a compact file of key demographic data items, drawn mainly from 1990 Census school district special tabulation. Expenditures per pupil are obtained by counties from this data set to supplement the CCD data for counties that did not report to the U.S. Department of Education.
B.5. Census of Population and Housing, 1990 [United States]: Public Use MicroData Sample: 5-percent Sample
To construct our relative measurements of teacher salary, we select college-graduated white males, who were between 27 and 38 years old and working full-time (35+ hours a week and 40+ weeks a year). The smallest geographical indentifier in the 5-percent PUMS data is Public Use Microdata Area (PUMA), which could include partial, single or multiple counties. We aggregate all PUMA’s to 908 study areas, and any given county exclusively belong to one of them. In this selected sample, the relative median teacher salary in a county is calculated by dividing the median annual wage income of male public non-postsecondary teachers by the median annual wage income of males with occupations other than these teachers. These non-postsecondary teachers include pre-kindergarten and kindergarten teachers, elementary school teachers, and secondary school teachers). These grade-dependent teacher salaries then are used in the education production function estimated in the paper. Similarly, the relative expenditure per pupil is measured by the nominal expenditure per pupil relative to the median annual wage income of all male in the selected MicroData sample.
The empirical model also uses the median wage rates of females in this data set by age, education attainment, and ethnic groups. These median wage rates are treated as important elements of local wage distribution.
B.6. USA Counties
USA Counties is a database produced by the U.S. Bureau of Census. It contains statistical data from the Census Bureau, other federal agencies, and private organizations. High school drop-out rates in 1990 are obtained from this database.
Footnotes
Since moving costs are unobserved and to avoid identification problems, we did not attempt to distinguish between the monetary and psychic costs of moving captured in the model through mk−1k.
The PSID contains detailed information on the exact month when the families moved as well as on job changes.
Individual observations in the standard Geocode version of the NLSY have county Federal Information Processing Standards (FIPS) codes as smallest geographic indicator of their residential location.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
JEL classification: I21; J22; J60; R23
References
- Bayer P. Tiebout sorting and discrete choices: A new explanation for socioeconomic differences in the consumption of school quality. Mimeo: Yale University; 2000. [Google Scholar]
- Becker G, Tomes N. Child endowments and the quantity and quality of children. Journal of Political Economy. 1976;75 (4):352–365. [Google Scholar]
- Behrman JR, King EM. Household schooling behaviors and decentralization. Economics of Education Review. 2001;20:321–341. [Google Scholar]
- Bernal R. The effect of maternal employment and child care on children’s cognitive development. Mimeo: New York University; 2007. [Google Scholar]
- Betts J. Does school quality matter? Evidence from the National Longitudinal Survey of Youth. Review of Economics and Statistics. 1995;77:231–50. [Google Scholar]
- Betts J. Is there a link between school inputs and earnings? Fresh scrutiny of an old literature. In: Burtless G, editor. Does money matter? The effects of school resources on student achievement and adult success. Brookings Institution; Washington, D.C: 1996. pp. 141–91. [Google Scholar]
- Bhargava A. Identification and panel data models with endogenous regressors. Review of Economic Studies. 1991;58 (1):129–140. [Google Scholar]
- Bishop K. A Dynamic model of location choice and hedonic valuation. Mimeo: Duke University; 2007. [Google Scholar]
- Black SE. Do better schools matter? Parental valuation of elementary education. Quarterly Journal of Economics. 1999;114 (2):577–599. [Google Scholar]
- Blanchard OJ, Katz LF. Regional evolutions. Brookings Papers on Economic Activity. 1992;1:1–37. [Google Scholar]
- Blau FD, Grossberg AJ. Maternal labor supply and children’s cognitive development. Review of Economics and Statistics. 1992;74 (3):474–81. [Google Scholar]
- Card D. Earnings, schooling and ability revisited. In: Polachek S, editor. Research in Labor Economics. Vol. 14. 1995. pp. 23–48. [Google Scholar]
- Card D. Handbook of Labor Economics. 3A. North-Holland: Amsterdam; 1999. The causal effect of education on earnings. [Google Scholar]
- Card D, Krueger AB. Does school quality matter? Returns to education and the characteristics of public schools in the United States. Journal of Political Economy. 1992;100 (1):1–40. [Google Scholar]
- Ferreyra M. Estimating the effects of public school vouchers in multi-district economies. American Economic Review. 2007;97(3):789–817. [Google Scholar]
- Gemici A. Family migration and labor market outcomes. Mimeo: New York University; 2007. [Google Scholar]
- Gilleskie D, Mroz T. A flexible approach for estimating the effects of covariates on health expenditures. Journal of Health Economics. 2004;23 (2):391–418. doi: 10.1016/j.jhealeco.2003.09.008. [DOI] [PubMed] [Google Scholar]
- Hanushek EA. School resources and student performance. In: Burtless G, editor. Does money matter? The effects of school resources on student achievement and adult success. Brookings Institution; Washington, D.C: 1996. pp. 43–73. [Google Scholar]
- Hanushek EA. Economic Policy Review. 1. Vol. 4. Federal Reserve Bank of New York; New York: 1998. Conclusions and controversies about the effectiveness of school resources. [Google Scholar]
- Hanushek EA, Rivkin SG, Taylor LL. Aggregation and the estimated effects of school resources. The Review of Economics and Statistics. 1996;78 (4):611–27. [Google Scholar]
- Harvey E. Short-term and long-term effects of early parental employment on children of the National Longitudinal Survey of Youth. Developmental Psychology. 1999;35:445–459. doi: 10.1037//0012-1649.35.2.445. [DOI] [PubMed] [Google Scholar]
- Haveman R, Wolfe B. The determinants of children’s attainments: A review of methods and findings. Journal of Economic Literature. 1995;33:1829–1878. [Google Scholar]
- Hedges LV, Greenwald R. Have times changed? The relationship between school resources and student performance. In: Burtless G, editor. Does money matter? The effects of school resources on student achievement and adult success. Brookings Institution; Washington, D.C: 1996. pp. 74–92. [Google Scholar]
- Imbens G, Angrist J. Identification and estimation of local average treatment effects. Econometrica. 1994;62:467–76. [Google Scholar]
- Kennan J, Walker JR. The effect of expected income on individual migration decisions. Mimeo: University of Wisconsin-Madison; 2005. [Google Scholar]
- Krueger AB. Economic Policy Review. 1. Vol. 4. Federal Reserve Bank of New York; New York: 1998. Reassessing the view that American schools are broken. [Google Scholar]
- Krueger AB. Experimental estimates of education production functions. Quarterly Journal of Economics. 1999;114 (2):497–532. [Google Scholar]
- Liu H, Mroz T, Van der Klaauw W. Maternal employment, migration, and child development. Conference paper presented at the 2002 SITE meetings; Stanford. 2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu H. Using random sampling to reduce the choice set of a conditional Logit model estimated by the discrete factor method. Mimeo: East Carolina University; 2004. [Google Scholar]
- Liu H. A migration study of mothers work, welfare participation, and child development. Labour: Review of Labour Economics and Industrial Relations. 2008;22 (1):23–71. [Google Scholar]
- Loeb S, Bound J. The effect of measured school inputs on academic achievement: Evidence from the 1920s, 1930s and 1940s birth cohorts. Review of Economics and Statistics. 1996;78 (4):653–664. [Google Scholar]
- Markwardt F. Peabody Individual Achievement Test-Revised. American Guidance Services; Circle Pines, MN: 1989. [Google Scholar]
- McFadden D. Modeling the choice of residential location. In: Karlqvist A, Lundqvist L, Snichars F, Weibull J, editors. Spatial Interaction Theory and Planning Models. North Holland: Amsterdam; 1978. pp. 75–96. [Google Scholar]
- McMillan R. Competition, parental involvement, and public school performance: An empirical analysis. National Tax Association Proceedings; Evanston, IL. 2000. pp. 150–155. [Google Scholar]
- Moore KA, Driscoll AK. Low-wage maternal employment and outcomes for children: A study. The Future of Children. 1997;7 (1):122–127. [PubMed] [Google Scholar]
- Nechyba TJ, Strauss RP. Community choice and local public services: A discrete choice approach. Regional Science and Urban Economics. 1998;28(1):51–74. [Google Scholar]
- Parcel TL, Menaghan E. Early parental work, family social capital and early childhood outcomes. American Journal of Sociology. 1994;99 (4):972–1009. [Google Scholar]
- Todd PE, Wolpin KI. On the specification and estimation of the production function for cognitive achievement. Economic Journal. 2003;113:485–515. [Google Scholar]
