I. Introduction
Considerable research has demonstrated a positive association between early marijuana use and low educational attainment as measured by both years of education and high school dropout status (Chatterji, 2006; Bray et al., 2000; Ellickson et al., 1998; Schulenberg et al, 1994; Mensch and Kendel, 1988; Newcomb and Bentler, 1986). This finding has frequently been interpreted as evidence that marijuana use interferes with learning by impairing memory, attention or other cognitive functioning and/or motivation, any of which could translate into poor schooling outcomes. However, evidence supporting alternative explanations for the negative association challenges this causal interpretation. For example, some studies show that poor schooling outcomes actually precede regular and heavy marijuana use (Fergusson and Horwood, 1997; Hawkins et al, 1992; Newcomb and Bentler, 1988). Other research suggests that the relationship between marijuana use and low educational attainment is explained by a common third variable (Barnes et al., 2005; Kumar et al, 2002; Sander, 1998; Shulenberg et al., 1994; Farrell and Fuchs, 1982) or that marijuana use influences other factors such as peer associations and attitudes toward schooling (Brook et al., 1989; Brook et al., 2002; Friedman, Bransfield, Kreisher, 1994) that serve as an alternative pathway to affecting school completion.
Separating any causal effects of marijuana use on educational attainment from the effects of other factors is complicated by the sheer number of potential confounders and the presence of selection biases in observational data. In addition, adolescence is a period of development and marked change in the child’s environment; hence accounting for third variables at one point in time may miss relevant changes that have occurred earlier or later. Instrumental variables and techniques usually employed to overcome these sorts of problems have been less useful in the case of illicit drugs, as models testing what are typically weak state-level instruments generally reject the assumption of endogeneity (Chatterji, 2006; Bray et al., 2000).
This study revisits the question of how and why marijuana use and schooling are related. In particular, it explores the conjecture that the relationship reflects a causal effect of marijuana on student cognitive functioning and, in so doing, improves upon previous studies in two important ways. First, it minimizes the effects of selection bias by using propensity score methods, which create observationally equivalent groups of marijuana users and nonusers who are matched on a rich set of constructs and observable characteristics at baseline. A new longitudinal data source allows us to control for a wide variety of alternative factors that have been used to explain the association between marijuana and high school dropout, including one’s predisposition toward problem behavior, use of substances other than marijuana, peer and family social influences, attachment to conventional institutions such as family, school and religion, emotional distress and time preference. Second, we evaluate whether changes in these constructs between early and middle adolescence help explain the association rather than assume that the measurement or influence of a particular construct is stable over time.
As in previous studies, we find a positive association between marijuana use and high school dropout status (OR=5.6, RR=3.8), but we show that over half of the association can be explained by prior differences in observational characteristics and behaviors (i.e., selection bias). The remaining association (OR=2.4, RR=1.7) becomes statistically insignificant after measures of cigarette smoking are included in the analysis, a variable that is not systematically included in economic analyses. Because we are aware of no physiological justification for why controlling for cigarette smoking should account for marijuana’s cognitive effects on learning, we interpret this as indicating that the negative relationship between marijuana use and high school completion is unlikely to be due to adverse effects on cognition and more likely to be the result of omitted variable bias or the effects of peer associations or other factors. We then explore the source of this bias using the rich panel data.
The remainder of the paper is organized as follows. Section II provides a brief summary of the literature indicating the importance of alternative constructs frequently ignored in the economics literature. Section III describes our propensity score methods and the data used to construct these weights. Section IV presents the empirical model used to evaluate the impact of persistent and heavy marijuana use on schooling. Results are presented in Section V. Section VI summarizes the implications for future work by economists interested in examining the causal association between marijuana use and high school outcomes.
II. Literature Review
II.1 Literature on Mechanisms Through Which Marijuana Might Influence Schooling
Three explanations have been put forth for the association between marijuana use and low educational attainment: (1) marijuana use causes poor schooling outcomes; (2) marijuana use is a consequence of poor school performance; and (3) marijuana use and low educational attainment are not directly related but share a common underlying cause, such as deviant behavior, peers, family dysfunction, or rates of time preference. Evidence supporting each of these explanations comes from studies informed by a variety of disciplines.
The basic sciences provide the main rationale for believing that marijuana use causes low educational attainment. Neuroscientists have shown that marijuana use interrupts normal cognitive functioning and memory by activating cannabinoid receptor sites in the part of the brain that controls memory (Matsuda et al., 1993; Heyser et al., 1993). What remains debated is whether the detrimental effect on memory and cognitive functioning is short-lived, sustained for a period of time past intoxication, or cumulative in terms of its total detrimental effect on cognitive functioning. Although a few economic studies provide support for the cognitive detriment explanation, none have satisfactorily dealt with the issue of endogeneity (Chatterji, 2006; Roebuck et al., 2004; Bray et al., 2000; Yamada et al., 1996).
Economics and psychology provide explanations for why poor schooling could precede marijuana use. The economic theory of health production (Grossman, 1972) suggests that schooling will be positively associated with healthy behavior and negatively associated with unhealthy behaviors, as an individual’s ability to understand how particular behaviors affect health improves with schooling and hence makes individuals better producers of health. The psychological literature postulates that use of marijuana and other substances is a coping mechanism for students who struggle in school (Newcomb and Bentler, 1986; Wills and Shiffman, 1985). Empirically, a substantial literature outside of economics supports the notion that poor school performance precedes marijuana use (Hawkins et al., 1992; Fergusson et al, 1996; Duncan et al 1998). However, evidence showing that poor school performance may precede marijuana use does not rule out the possibility that marijuana use leads to poor (or worse) schooling outcomes. The association may run both ways.
The third variable explanation is supported by several theories. Problem behavior theory postulates that individuals with a predisposition toward nonconformity and deviance are more likely to engage in multiple unconventional behaviors that reciprocally influence one another (Jessor and Jessor, 1977; Donovan and Jessor, 1985). Social attachment theory argues that it is weak bonds with family, school, religion, or other conventional institutions that lead to general problem behaviors (Hawkins and Weis, 1985; Simmons and Blyth, 1987; Sommer, 1985). Social learning theory (Bandura, 1977; 1985) stresses the influence of exposure to deviant peers or family members who act as role models for specific actions through their approval of them. Finally, health economists postulate that a positive association between poor schooling outcomes and substance use could also result because of their association with a high discount rate or rate of time preference (Fuchs, 1982; Farrell and Fuchs, 1982). Individuals with high rates of time preference place greater value (or ‘utility’) on rewards and punishments that happen immediately and less value on rewards and punishments that happen in the future. Substantial empirical evidence supports each of these “third factor” theories (Roebuck et al., 2004; Brook et al., 1999; Sander, 1998; Fergusson et al., 2002; Schulenberg et al., 1994).
II.2 Prior Studies Attempting to Investigate the Role of Selection Bias and Other Unobservables
Economic analyses have attempted to determine the causal association between marijuana use and educational attainment by using two-stage least squares, instrumental variables techniques, or fixed-effects modeling to disentangle the influence of selection bias and the other unobserved factors mentioned above. The findings from this literature support a negative association between marijuana and schooling outcomes, but the causal interpretation remains elusive due to limitations of each study.
Bray et al. (2000) examine the relationship between marijuana initiation and the age-specific probability of dropping out of high school for a sample of 1392 students who participated in a longitudinal study in a Southeastern U.S. school system. Early marijuana initiation is used as an instrument for current marijuana involvement, which is supported by research identifying a strong association between the two (Perkonigg et al, 2007; van Ours, 2006). Empirical tests were done to evaluate the exogenous treatment of age of initiation and the hypothesis of exogeneity could not be rejected, although the authors do not report information on identifying variables used to generate the test results. In their models they find that early marijuana initiation (prior to the age of dropout) has a positive and statistically significant effect on the probability of dropping out at ages 16 and 18. They estimate that early marijuana initiators are 2.3 times more likely to subsequently drop out of high school than nonusers.
In a similar study using a nationally representative sample, Register, Williams and Grimes (2001) examine the impact of early marijuana use on subsequent educational attainment (i.e., years of schooling) using young males from the 1992 NLSY. The authors defined “early” marijuana use as use prior to age 18. They employed a two-stage regression technique to deal with the structural endogeneity of marijuana use. In the first set of regressions they predicted separate likelihood functions for the probability of using any illicit drug, any hard drug, or any marijuana use prior to age 18, with the primary identifying instrument in the first stage for marijuana use being whether the person lived in a decriminalized state at age 14. They then constructed predicted probabilities for each of these measures and evaluated their effects on subsequent total educational attainment in 1992, when the respondents were between the ages of 27 and 34 years of age. They found that any use of marijuana prior to age 18 had a negative and statistically significant effect on educational attainment in the full sample, but the result was generally driven by their sample of whites, for whom they found an average reduction of 1.187 years of education among early marijuana users.
While studies support the high correlation between early marijuana initiation and subsequent use, Chatterji (2006) and other social scientists point out that it may also be the case that adolescents who are able to initiate at a young age have environmental or personal factors that may make them less likely to complete high school independent of their later marijuana use, such as less parental supervision, bad peer group, ADHD, general antisocial behavior, or other mental health problems. In her study of the effect of marijuana on educational attainment at age 26 using the National Education Longitudinal Survey (NELS), Chatterji (2006) shows that including school environmental factors in an examination of the effect of 10th grade marijuana use on final educational attainment reduces the association between marijuana use and schooling by 69 percent, and the additional inclusion of 8th grade risk factors reduces the association of marijuana use and schooling even further by another 14 percent. The negative association between marijuana use and educational attainment remained statistically significant in the final specification, but it was dramatically smaller than in models that excluded these additional factors, suggesting that selection into marijuana use based on environmental and personal risk factors could be extremely important.
Chatterji (2006) also presents estimates from IV models that attempt to purge the potentially endogenous drug use measure of its correlation with the error term, under the assumption that even with her rich data set unobservable factors may still bias results. What is most compelling in her IV analysis is that standard state policy instruments and individual-level variables that have been used in previous studies to predict marijuana use (e.g., state decriminalization status) did not generate F-test statistics above a value of 4. When additional controls capturing 8th grade risk factors were also included in the first stage, the F-statistics rose but only one test exceeded 8 (reaching a value of 8.51). Chatterji herself concludes from this result and the fact that the estimated effect on marijuana use in the IV model was larger than that for the OLS model, that the instruments used to generate the IV estimates in the case of marijuana were fairly weak.
In light of the results presented by Chatterji (2006) and concerns raised in the alcohol literature regarding the use of weak instruments to assess the causal effect of alcohol on educational attainment and other outcomes (Dee and Evans 2003; Koch and Ribar, 2001), it seems reasonable to question whether current attempts to assess a causal association between marijuana use and schooling using IV or two-stage estimation techniques are indeed reliable. Another method to account for unobserved factors that might influence both the selection to use drugs as well as lower educational attainment is through fixed-effect methods. We are aware of only one study that employs individual fixed-effects methods. Using repeated observational data from the Persistent Effects of Treatment Study – Adolescent (PETS-A), Engberg and Morral (2006) examine the impact of reducing marijuana use (as well as other substance use) on schooling attendance during 3-month intervals over the course of a year. Random and fixed-effects models were estimated and controls for baseline (pre-treatment) differences in the sample were included. The results for marijuana showed that any use was negatively associated with the probability of attending school, whereas the findings for other substances (alcohol, stimulants, hallucinogens, and other drugs) showed that their effects were specific to the quantity consumed and their use was not statistically associated with lower probabilities of attending school.
Because the association between marijuana use and poor schooling outcomes persists even after econometric techniques are applied to deal with selection bias and unobservables, the hypothesis that marijuana use is causally associated with poor schooling outcomes remains viable. The mechanism through which this association operates, however, remains highly questionable.
In this paper, we revisit the question of the causal association between marijuana use and schooling using propensity score methods as an alternative strategy to reduce the influence of selection bias. Taking the analysis a step farther, we also test the assumption of stable unobserved influences. After adjusting for pre-existing differences in user groups, we control for additional variables that change over time when assessing the relationship between marijuana and high school dropout status. Thus, we can assess whether factors shown to influence these associations change over time, as suggested by developmental theorists (LaGrange and White, 1985; Bailey and Hubbard, 1990).
III. Structural Model & Data
III.1 Economic Theory on the Causes of Dropping Out
In his seminal work on the economics of schooling, Gary Becker described education as a form of “human capital” that raises the skill set and hence the productive capacity of a worker (Becker, 1964). Hence, schooling became viewed as a form of investment, with higher wages being the return on that investment. Huge literatures have evolved examining both the returns to schooling (Card, 1999; Willis, 1987) as well as the productive inputs that lead to improved human capital formation in school (Becker 1993; Willis and Rosen, 1979; Shultz, 1961). Based on these literatures, standard economic theory models the decision to leave school early as a function of individual achievement, family resources and personal characteristics. Within this construct, marijuana use could positively influence one’s willingness to leave school by reducing his or her general achievement (through reduced cognitive ability, school attendance, or willingness to work hard and get good grades). Alternatively, marijuana use may simply be highly correlated with specific personal characteristics that make it more likely for the individual to decide to drop out of school, such as anti-social disorder, weak social bonds with formal institutions, high discount rates, and other behavioral factors described in the previous section.
III.2. Causal Model and Propensity Score Approach
Our empirical model is based on the linear latent variable model in which the net benefits of schooling, Y*, are a function of a vector of individual characteristics, xi, and, assuming a causal influence of marijuana on schooling, the decision to be a heavy and persistent marijuana user (HPMU), or:
(1) |
We cannot observe the net benefits of schooling, but we do observe whether an individual continues schooling or not:
Let DO1 denote dropout status (1 if dropout; 0 otherwise) if a chosen student with characteristics x is a HPMU. Similarly, let DO0 denote dropout status if the chosen student was not a HPMU. Assuming a logistic specification, we can then denote the probability of a HPMU dropping out and a non-HPMU dropping out, respectively, as:
(2) |
(3) |
The parameter γ determines the ratio of the odds of dropout for the counterfactual outcomes and is the causal effect parameter of interest. We can observe only a single counterfactual outcome on each student and cannot directly estimate γ. However, as discussed in Bang and Robins (2005), the probability of being a HPMU, also referred to as the propensity score (Rosenbaum and Rubin, 1983), can be used to weight the data and obtain consistent estimates of causal effects provided the model for the propensity score or the structural model is correct and strong ignorability holds.
Strong ignorability requires that there are no unobserved variables that predict both the probability of dropping out and the probability of HPMU by the 10th grade (the point at which school is no longer mandatory). To ensure strong ignorability, we need to construct propensity score weights that can account for an array of constructs that the literature suggests are correlated with both behaviors of interest, including early use of alcohol and cigarettes, deviance, time preference, family influence, peer substance use, religiosity, social bonds, school bonds, parental bonds and emotional distress.
Given a vector of such pre-existing variables, W, we estimate the propensity score pw = Pr(HPMU = 1 | W) using generalized boosting methods (GBM), a flexible nonparametric approach to modeling the log(pw/(1 − pw)). GBM-based propensity scores provide a flexible model for the propensity score equation, handle a large number of variables in an automated and systematic manner, and have been shown to provide estimated propensity scores that yield better estimates of effects than do other approaches (Ridgeway and McCaffrey, 2007).1 Because we are interested in estimating the effects of being a HPMU on users, we weight each HPMU by a weight of one and every student who was not a HPMU by of pw/(1 − pw), the odds of being a HPMU, to obtain an estimate of the average effect of being a HPMU on the users (Wooldridge, 2002).
III.3. ALERT Plus Data
We use data from the field trial evaluation of the ALERT and ALERT Plus drug prevention programs, which were administered to seventh-grade students in 61 middle schools drawn from 48 school clusters in South Dakota in 1997. Schools in the study were randomly assigned to one of the two treatment conditions or a control group condition (see Ellickson et al., 2003 for details on the experimental trial). Students completed annual paper-and-pencil surveys in school at Grades 7 through 11. Study participants also completed a mail or web survey during the summer and fall of 2004, when they were about 14 to 20 months post their expected high school graduation date. The baseline survey, administered to 5,857 students, collected detailed background information including age, gender, race/ethnicity, family structure, and parental education, as well as information regarding school performance, deviant behavior, attitudes toward school and family, family influences, peer influences, and substance use. Many of the same questions were included in follow-up surveys each year. Extensive tracking procedures to survey students who left the study schools were employed. Follow-up response rates in 8th, 9th, and 10th grade were 91, 87, and 83 percent, respectively.
The dependent variable, high school dropout status, was collected through both school staff reports and student self-reports.2 School-reported graduation data are missing for 301 students who moved from South Dakota before completing high school and for another 387 students for whom the South Dakota high schools’ reported graduation status was unknown. Self-reported graduation status is missing for 2560 participants who failed to complete the post-high school follow-up and 29 who completed the survey but failed to report high school graduation status. The two data sources disagree with each other for some students. In particular, of the students the school data reported as dropouts, 40% of those who also provided self-reported status reported graduating. Hence, as discussed in the next section, we develop a model for the joint distribution of student and school-reported graduation status that allows for measurement error in both.
Building on previous work demonstrating the importance of differentiating light and heavy marijuana use (Chatterji, 2006; Roebuck et al., 2004; Ellickson et al., 1998; Yamada et al., 1996), we construct a dichotomous measure of heavy and persistent marijuana use that equals one if the student reports using marijuana in the past month on both the 9th and 10th grade surveys and three or more times in that period on at least one survey and zero otherwise. This measure has the advantage of capturing relatively extreme use, as only about 30% of past year marijuana users are heavy and persistent users. In addition, it predicts drug dependence and alcohol abuse at ages 19 and 21 for our sample3 and explains more of the variance in these outcomes than daily use of marijuana (as defined by use on 20 or more days in the past month at the tenth grade survey administration).4
The following background, attitudinal and behavioral grade 7 control variables capture the constructs represented in W: student demographics; family characteristics; school grades and academic expectations; impulsivity; beliefs about the health consequences of alcohol, cigarette, and marijuana use; parental monitoring; deviance; rates of time preference; religiosity; school bonds; parental bonds; emotional distress; family (parent/sibling) substance use; and alcohol, cigarette, and marijuana use in grade 7. We also control for peer effects for alcohol and cigarette use, school size, region of the state, and the school’s experimental condition for the drug prevention trial. 5 Next, we discuss how these constructs are measured. Unless otherwise noted, measures are created at grades 7 to 10 using parallel items; the grade 7 measures are included in the propensity score weights and measures from later grades are used in sensitivity analyses.
Cigarette use is the logarithm of the 30-day average of one plus an overall frequency index combining lifetime, past year, and past month frequency of use scaled by the student’s quantity of use.6 Alcohol use is represented by categorical indicators measuring the number of days of alcohol use in the past month, with levels of use indicated by none, 1, 2 to 4 days, 5 to 8 days, and 9 or more days (the reference group).7
Deviance is the average frequency of six items tapping various deviant behaviors in each grade (e.g., skipped school or breaking into houses, schools or places of business). To accommodate developmental change in the forms of deviant behavior, we added additional items to the grade 10 deviance index.8 The student’s rate of time preference is measured in grades 7 and 9 as the sum of two items: How much of the time do you do or feel the following things: you do what feels good now without thinking about the future, and you focus on the short run instead of the long run (range = 0 (never) – 5 (almost always)).
Grade-specific measures of family influences on student behavior are based on two items: whether the respondent perceives the adult they are closest to has a drinking habit and the extent to which their parents would disapprove of them smoking or using marijuana. Grade-specific measures of peer effects are tapped through self-reports by the students regarding the frequency with which they are with marijuana-using peers and whether they think their best friend uses marijuana.9
Student attitudes and beliefs about alcohol, cigarettes, and marijuana are based on three positive and three negative items for each substance, with a higher score on either scale indicating more pro-drug attitudes. Indicators of bonds with conventional institutions are captured through a measure of religiosity, school bonds, and parental bonds.10 Details regarding the constructs of each of these can be found in Ellickson et al. (2003). Finally, we measured emotional distress with the six-item mental health inventory index (MHI-6) that equals the MHI-5 (Wells et al., 1996) plus an additional item on the frequency of feeling downhearted and blue in the past month. See Table 1 for descriptive statistics on the baseline (grade 7 or grade 8 when grade 7 data do not exist) values of variables for all students and by HPMU status. As shown in the table, there are large differences between the groups on cigarette use, grades, deviance, attitudes about substance use, and parental approval of substance use; however, these differences are greatly reduced by weighting.
Table 1.
Summary Statistics and Standard Bias for Baseline Variables
All | HPMU | Comparison | Comparison Propensity Score Weighted (f) |
Std Bias | ||||||
---|---|---|---|---|---|---|---|---|---|---|
Mean | Std Dev |
Mean | Std Dev |
Mean | Std Dev |
Std Bias |
Mean | Std Dev |
Weighted | |
HPMU | 0.10 | 0.35 | ||||||||
Age | 12.36 | 0.58 | 12.40 | 0.70 | 12.35 | 0.57 | 0.06 | 12.39 | 0.19 | 0.00 |
Female | 0.50 | 0.57 | 0.42 | 0.63 | 0.51 | 0.56 | −0.14 | 0.47 | 0.18 | −0.08 |
White(a) | 0.87 | 0.38 | 0.78 | 0.53 | 0.88 | 0.36 | −0.20 | 0.83 | 0.14 | −0.09 |
Native American(a) | 0.08 | 0.31 | 0.18 | 0.49 | 0.07 | 0.28 | 0.23 | 0.12 | 0.12 | 0.14 |
Father's Education | 2.02 | 1.17 | 2.27 | 1.41 | 1.99 | 1.14 | 0.20 | 2.23 | 0.38 | 0.03 |
Mother's Education | 1.91 | 1.13 | 2.21 | 1.34 | 1.88 | 1.10 | 0.25 | 2.10 | 0.38 | 0.09 |
Nuclear Family | 0.69 | 0.53 | 0.53 | 0.64 | 0.71 | 0.51 | −0.27 | 0.60 | 0.18 | −0.10 |
Experimental Condition: |
||||||||||
ALERT Only (b) | 0.34 | 0.54 | 0.35 | 0.61 | 0.34 | 0.54 | 0.02 | 0.35 | 0.17 | 0.01 |
Experimental Condition: |
||||||||||
Control (b) | 0.32 | 0.53 | 0.40 | 0.63 | 0.32 | 0.52 | 0.13 | 0.35 | 0.17 | 0.07 |
Days Alone After School |
2.74 | 2.09 | 3.17 | 2.23 | 2.69 | 2.07 | 0.21 | 3.00 | 0.66 | 0.07 |
Hours Alone After School |
2.82 | 1.80 | 3.21 | 2.04 | 2.78 | 1.77 | 0.21 | 3.16 | 0.59 | 0.03 |
Used Alcohol 0 Days in Past Month |
0.98 | 0.15 | 0.97 | 0.21 | 0.98 | 0.15 | −0.05 | 0.94 | 0.08 | 0.15 |
Used Alcohol 1 Day in Past Month (c) |
0.01 | 0.13 | 0.02 | 0.18 | 0.01 | 0.12 | 0.05 | 0.04 | 0.07 | −0.11 |
Used Alcohol 2–4 Days in Past Month (c) |
0.00 | 0.07 | 0.00 | 0.07 | 0.00 | 0.07 | −0.02 | 0.02 | 0.05 | −0.19 |
Used Alcohol 5–8 Days in Past Month (c) |
0.00 | 0.03 | 0.00 | 0.07 | 0.00 | 0.02 | 0.03 | 0.00 | 0.01 | 0.03 |
Freq of Cig Use | −3.18 | 0.89 | −2.69 | 1.64 | −3.24 | 0.77 | 0.33 | −2.81 | 0.45 | 0.07 |
Grades | 1.92 | 1.03 | 2.27 | 1.27 | 1.88 | 0.99 | 0.30 | 2.15 | 0.34 | 0.09 |
Deviance | 0.27 | 0.57 | 0.55 | 0.83 | 0.24 | 0.53 | 0.37 | 0.53 | 0.24 | 0.03 |
Religiosity (d) | 2.40 | 1.18 | 2.85 | 1.33 | 2.35 | 1.15 | 0.38 | 2.83 | 0.39 | 0.01 |
School Bonds (d) | 2.29 | 0.73 | 2.60 | 0.89 | 2.25 | 0.71 | 0.39 | 2.56 | 0.26 | 0.04 |
Family Bonds (d) | 1.84 | 0.86 | 2.09 | 0.99 | 1.81 | 0.84 | 0.28 | 2.03 | 0.29 | 0.06 |
Emotional Distress | 1.24 | 0.94 | 1.40 | 1.13 | 1.22 | 0.92 | 0.16 | 1.40 | 0.31 | 0.00 |
Positive Attitudes, | 1.40 | 0.84 | 1.80 | 1.30 | 1.35 | 0.77 | 0.35 | 1.77 | 0.35 | 0.03 |
Negative Attitudes, | 1.56 | 1.01 | 1.90 | 1.32 | 1.52 | 0.97 | 0.29 | 1.82 | 0.35 | 0.06 |
Adult Use Cig | 1.08 | 1.56 | 1.68 | 1.78 | 1.01 | 1.52 | 0.37 | 1.54 | 0.51 | 0.08 |
Parental Approval Cig (d) |
1.40 | 0.83 | 1.90 | 1.24 | 1.34 | 0.76 | 0.45 | 1.84 | 0.34 | 0.05 |
Parental Approval Marijuana (d) |
1.13 | 0.53 | 1.46 | 1.03 | 1.10 | 0.44 | 0.35 | 1.37 | 0.25 | 0.08 |
Peer Effects | 0.18 | 0.50 | 0.16 | 0.53 | 0.18 | 0.50 | −0.03 | 0.19 | 0.16 | −0.05 |
All values are weighted for nonresponse and where noted the comparison group is weighted by propensity score weights. All covariates are measured at grade 7 unless otherwise noted. Standardized bias equals the difference in HPMU and comparison group means divided by the standard deviation for the HPMU.
Other race is the reference category for ethnicity;
ALERT Plus is the reference group for the Treatment condition;
No use is the reference category for alcohol use;
grade 8 measures because variable was not collected at grade 7.
Attrition and missing data
The data set includes 4,375 students with both marijuana use and high school dropout data, which excludes 718 cases without graduation information from either source, and 656 students who failed to respond to either or both of the grade 9 and 10 surveys, and 108 students who completed both surveys but did not provide information about marijuana on at least one of them. To account for attrition, we weighted the observations using propensity score-based nonresponse weights (Little and Rubin, 2002). In this case the propensity score represents the probability that a student with a given set of baseline covariate values answered the 9th and 10th grade survey questions on past month marijuana use and had data on graduation status. Nonresponse weights equal the inverse of this probability for responding students. Missing data for baseline covariates are imputed using a Bayesian model for the joint distribution of all baseline and 8th grade follow-up variables.11 For data from the 9th and 10th grade follow-ups, missing values of independent variables other than marijuana use are also imputed using a Bayesian model for the joint distribution of all analysis variables.
IV. Empirical Specification
As shown in Table 2, graduation information collected through self-reports did not always match school administrator reports. To account for potential errors in reported graduation status and make efficient use of the two incomplete data sources, we fit a model for the joint distribution of student and school reported graduation status that allows for measurement error in both. Given our structural model, we assumed a logistic regression model for the true but not necessarily observed dropout status, DO = DO1HPMU + DO0 (1-HPMU),
(4) |
Table 2.
Summary of Graduation Status Data as Reported by Schools and Participants.
Student Reports | |||
---|---|---|---|
School Reports | Graduate | Dropout | Missing |
Graduate | 2,657 | 15 | 1,490 |
Dropout | 101 | 149 | 377 |
Don’t Know | 121 | 53 | 213 |
Not Collected | 132 | 40 | 509 |
The model also assumes that both school and student reports of graduation status might contain errors. The error rates were specified through the conditional distribution of self-reported status, U, given the true status, DO, and the conditional distribution of the school reported status, Z, given DO. These were
(5) |
(6) |
(7) |
(8) |
Equation (5) states that dropouts might self-report graduating with an error rate of ε1. Equation (6) assumes that high school graduates never incorrectly reported dropping out. Equations (7) and (8) allow for errors in school reports and assume that false reports of dropout for graduates and false reports of graduation for dropouts are equal.12
Limited write-in data suggest that some students who had completed alternative education programs mistakenly reported graduating from high school. We assumed that students who had truly graduated from high school provided the correct information because there is no obvious potential source of confusion that would result in misreporting graduation when it truly occurred. We also assumed that students who reported dropping out provided the correct information because there is no clear motivation for students to purposely misreport dropping out (such as avoiding stigma or providing a socially desirable outcome) and because many students who reported dropping out supplied reasons. Equations (7) and (8) are consistent with the assumption that errors in school reporting are due to clerical mistakes, so that errors in either direction have an equal probability of occurring.
The model also assumes that U and Z are independent conditional on DO, which implies that the errors made by schools are independent of errors by students. Given that the likely sources of error in student and school reports are distinct this last assumption seems reasonable.
Equations (4) – (8) yield
(9) |
From Equation (9), we derived a pseudo log-likelihood function for the observed pairs of (U, Z), weighted by the propensity score weights, to obtain estimates of (β, ε1, and ε2). We estimated standard errors for the parameter estimates using a cluster-adjusted sandwich estimator (Liang and Zeger, 1986) based on the weighted pseudo log-likelihood function to account for possible correlation among responses from students attending the same school. To account for the imputation of predictor variables, we repeated the estimation of parameters and standard errors with each of the five imputation completed datasets and combined the resulting estimates using standard methods (Schafer, 1997).13
V. Results
Many of the students who were not HPMU differed from the HPMU students and received little weight in the analysis. The average weight was about 0.13 with one percent greater than one and four percent greater then 0.5. The effective sample size of the weighted comparison sample is 615. Table 1 shows the balance in selected variables before and after weighting; Figure 1 demonstrates the balance between variable distributions for the HPMU and other students in terms of the absolute standardized bias for all pre-existing variables controlled in our analysis. For each variable, the absolute standardized bias equals the absolute value of the difference in the mean for the marijuana users and the (weighted) mean for other students divided by the standard deviation for the marijuana users. If the groups are comparable these values should be close to zero. Values of greater than 0.25 are often considered problematic (Ho et al., 2007).
Figure 1.
Comparison of HPMU and other students on covariates measured prior to HPMU measure before and after propensity score weighting. Dots represent the absolute standardized bias, the absolute value of the quantity of the difference in the mean for HPMU and other students divided by the standard deviation for the HPMU students. For each variable considered in the modeling, lines connect dots for the same variable with and without weighting. Solid dots denote statistically significant differences (p < 0.05).
As shown in Figure 1, the groups differed substantially on many variables before weighting for baseline differences in observable characteristics, with the absolute standardized bias being greater than 0.2 for the majority of variables and over 0.5 for 12 percent of the variables. After weighting, the largest absolute standardized bias is just under 0.2 and only six differences are significant. We also explored alternative specifications with additional pre-existing variables included in the logistic regression model for dropout and our results were qualitatively invariant to these changes.
Table 3 presents results from our baseline models that evaluate the impact of persistent marijuana use during grades 9 and 10 on high school dropout status. In Model 1, which accounts only for attrition (using nonresponse weights), the odds of high school dropout are nearly six times higher for persistent marijuana users than nonusers or casual users (10 versus 38 percent or 28 percentage point difference), and the finding is statistically significant. However, Model 2 shows that the effect size of marijuana use on dropout status is cut in half when we also use propensity score weights to account for differences in baseline characteristics between persistent marijuana users and nonusers. The result remains statistically significant, but marijuana use is now associated with just over a 2.4 fold increase in the odds of dropout (15 percentage point difference). If we can assume strong ignorability, then we can conclude that heavy and persistent marijuana use increases the risk of high school dropout.
Table 3.
Results from Logistic Regression Models Examining the Effects of Persistent Marijuana Use on High School Drop Out Status
Dependent Variable: | Model 1 | Model 2 | Model 3 | Model 4 | Model 5 | Model 6 | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
High School Drop Out | Odds Ratio | P-value | Odds Ratio | P-value | Odds Ratio | P-value | Odds Ratio | P-value | Odds Ratio | P-value | Odds Ratio | P-value | ||||||
Persistent MJ use (Gr 9 &10) | 5.595 | *** | 0.000 | 2.406 | *** | 0.000 | 2.797 | *** | 0.000 | 1.273 | 0.343 | 1.883 | *** | 0.014 | 1.204 | 0.491 | ||
Age | 1.112 | 0.557 | 1.088 | 0.640 | 1.086 | 0.677 | 1.305 | 0.212 | 1.265 | 0.296 | ||||||||
Female | 1.201 | 0.390 | 1.156 | 0.527 | 1.011 | 0.963 | 1.676 | ** | 0.021 | 1.453 | 0.123 | |||||||
White(a) | 0.957 | 0.924 | 1.009 | 0.983 | 0.837 | 0.737 | 0.900 | 0.803 | 0.816 | 0.686 | ||||||||
Native American(a) | 2.299 | 0.124 | 2.351 | 0.112 | 2.529 | 0.140 | 1.734 | 0.298 | 1.880 | 0.298 | ||||||||
Father's Education | 1.364 | ** | 0.014 | 1.379 | ** | 0.016 | 1.349 | ** | 0.022 | 1.281 | * | 0.083 | 1.288 | * | 0.078 | |||
Mother's Education | 1.462 | *** | 0.008 | 1.495 | *** | 0.008 | 1.394 | ** | 0.026 | 1.276 | 0.140 | 1.225 | 0.209 | |||||
Nuclear Family | 0.625 | *** | 0.007 | 0.628 | ** | 0.013 | 0.677 | ** | 0.043 | 0.693 | * | 0.089 | 0.734 | 0.190 | ||||
Experimental Condition: ALERT Only(b) | 0.761 | 0.181 | 0.760 | 0.177 | 0.657 | ** | 0.037 | 0.785 | 0.313 | 0.702 | 0.132 | |||||||
Experimental Condition: Control (b) | 0.936 | 0.749 | 0.936 | 0.748 | 0.770 | 0.204 | 0.799 | 0.379 | 0.674 | * | 0.098 | |||||||
Days Alone After School, Gr 7 | 1.015 | 0.824 | 1.018 | 0.804 | 0.989 | 0.871 | 1.005 | 0.950 | 0.983 | 0.829 | ||||||||
Hours Alone After School, Gr 7 | 1.036 | 0.461 | 1.040 | 0.414 | 1.039 | 0.509 | 1.067 | 0.246 | 1.070 | 0.247 | ||||||||
Used Alcohol 0 Days in Past Month, Gr 8(c) | 1.042 | 0.925 | ||||||||||||||||
Used Alcohol 1 Day in Past Month, Gr 8 (c) | 1.016 | 0.973 | ||||||||||||||||
Used Alcohol 2–4 Days in Past Month, Gr 8 (c) | 1.339 | 0.559 | ||||||||||||||||
Used Alcohol 5–8 Days in Past Month, Gr 8 (c) | 1.243 | 0.713 | ||||||||||||||||
Used Alcohol 0 Days in Past Month, Gr 9(c) | 1.897 | 0.304 | ||||||||||||||||
Used Alcohol 1 Day in Past Month, Gr 9 (c) | 1.083 | 0.877 | ||||||||||||||||
Used Alcohol 2–4 Days in Past Month, Gr 9 (c) | 1.615 | 0.428 | ||||||||||||||||
Used Alcohol 5–8 Days in Past Month, Gr 9 (c) | 1.178 | 0.720 | ||||||||||||||||
Used Alcohol 0 Days in Past Month, Gr10 (c) | 1.185 | 0.821 | ||||||||||||||||
Used Alcohol 1 Day in Past Month, Gr10 (c) | 0.896 | 0.877 | ||||||||||||||||
Used Alcohol 2–4 Days in Past Month, Gr10 (c) | 1.245 | 0.733 | ||||||||||||||||
Used Alcohol 5–8 Days in Past Month, Gr10 (c) | 0.953 | 0.937 | ||||||||||||||||
Gr 8: Freq of Cig Use | 1.168 | ** | 0.014 | 1.150 | * | 0.088 | ||||||||||||
Gr 9: Freq of Cig Use | 1.096 | 0.229 | 1.052 | 0.491 | ||||||||||||||
Gr10: Freq of Cig Use | 1.190 | ** | 0.016 | 1.161 | ** | 0.019 | ||||||||||||
Grades in Gr 8 | 1.686 | *** | 0.005 | 1.547 | ** | 0.032 | ||||||||||||
Grades in Gr 9 | 1.545 | *** | 0.004 | 1.553 | ** | 0.011 | ||||||||||||
Grades in Gr10 | 1.653 | *** | 0.000 | 1.663 | *** | 0.000 |
Other race is the reference category for ethnicity.;
ALERT Plus is the reference group for the Treatment conditions’ holdout condition;
Used Alcohol more than 8 days in the last month is the holdout condition.
Indicates odds ratio is significant at th p=0.05 level and
indicates it is significant at p=0.001 level.
To test the strong ignorability assumption, we explore the effect of adding contemporaneous measures of substance use that might share common unobservables with marijuana use (Models 3 and 4). Because students who use marijuana are more likely to drink alcohol (Gfroefer et al., 2002) and drinking can cause cognitive impairments and affect high school graduation (Cook and Moore, 1993), we first consider alcohol. Including days of alcohol use during grades 8 through 10 does not change the marijuana results qualitatively. In fact, the odds ratio on persistent marijuana use actually increases. Next, we consider cigarettes (Model 4). Unlike alcohol, there is no physiological reason why cigarette smoking would affect school performance or dropout status, as smoking is not known to impair cognitive functioning nor is it known to offset the deleterious effects on cognitive functioning associated with marijuana use. However, numerous studies have shown that smoking is a strong predictor of marijuana use (Gfroefer et al., 2002; Ellickson et al., 1998; Sommer, 1985). Model 4 shows that including cigarette smoking in grades 8 through 10 causes a substantial reduction in the magnitude of our marijuana use measure (30 percent increase in the odds or 4 percentage point increase in dropout rate), which is no longer a statistically significant predictor of high school dropout status.
If heavy and persistent marijuana use was only associated with high school dropout because of its impact on cognitive functioning, then including cigarette use in our regression model should have no impact on the coefficient (or odds ratio) on marijuana use. The fact that it does suggests that the association is largely generated by an omitted variable bias. Recall that the propensity score weights are constructed using variables representing a wide range of constructs previously reviewed, but the constructs are all measured at baseline (Grade 7 or 8). If the influence of particular constructs changes over time, then baseline corrections will be insufficient to represent them.
Models 5 and 6 evaluate whether the omitted variable bias is caused by lack of controls for school performance. Economic theory suggests that poor performance will increase the likelihood that someone decides to leave school and hence represents an important factor when examining the decision to drop out of school. However, as described in the previous section, the mechanism through which marijuana might lead to the decision to dropout is by diminishing academic performance. Hence, including a measure of academic performance directly into the regression should significantly reduce if not eliminate the direct effect of marijuana on dropping out of school if this is the primary method through which marijuana influences the decision to drop out. What we see in Model 5 is that including grades in the 8th, 9th and 10th grade has only a very small impact on the estimated effect of marijuana use on dropping out when compared to Model 2. Furthermore, marijuana use remains statistically significant in Model 5, suggesting that a reduction in student achievement at school is not the primary mechanism through which marijuana use is influencing the decision to drop out of school. Interestingly, when we re-introduce cigarette smoking (Model 6), the odds ratio for marijuana use again drops substantially and is no longer significant. The use of cigarettes, therefore, appears to be significantly correlated with some personal characteristics that are the primary driver of the observed association between marijuana use and drop out status. The question remains, what is that personal characteristic?
Table 4 provides the results of sensitivity analyses that explore the impact of probable sources of the relationship between marijuana use and dropout being picked up by measures of cigarette use. In the first row, we include three separate indices of deviance in grades 8, 9 and 10, finding that the odds ratio for marijuana use is reduced but remains statistically significant unless cigarette use is also included (row 2). Indices of the student’s bonds to social institutions (religious and school) in grades 8, 9 and 10 also do not account for the relationship between marijuana use and high school dropout. The odds ratio in this specification (Row 3) remains greater than 2 and statistically significant, unless cigarette use is also added (Row 4). We get the exact same findings when we add rates of time preference (Rows 5 & 6), emotional distress (Rows 7 & 8) and family stress (Rows 9 & 10). Including attitudes toward drugs yields a somewhat larger drop in the odds ratio for marijuana use (Row 11), although the odds ratio is statistically significant at the 6 percent level and only becomes completely insignificant when measures of cigarette use are included (Row 12). In Rows 13 and 15, we finally identify two factors that explain the relationship between marijuana use and dropout: family influence and peer effects in grades 8, 9 or 10. When these indices are included (without cigarette use), marijuana use has a much smaller odds ratio and is no longer a significant predictor of high school dropout status. Cigarette use is significant when it is also included (Rows 14 & 16).
Table 4.
Sensitivity Analyses Examining the Effects of Persistent Marijuana (MJ) Use on High School Dropout Statusa
Model | Model Specification | Odds Ratio Persistent MJ Use |
P-value | Odds Ratio Gr 10 Cig Use |
P-value |
---|---|---|---|---|---|
1 | Deviance (Gr 8 – Gr 10) | 1.968 | 0.002 | ||
2 | Deviance + Cigarettes (Gr 8 – Gr 10) | 1.244 | 0.424 | 1.172 | 0.014 |
3 | Social Bonds (Gr 8 – Gr 10) | 2.229 | 0.001 | ||
4 | Social Bonds + Cigarettes (Gr 8 – Gr 10) | 1.205 | 0.514 | 1.181 | 0.010 |
5 | Time Preference (Gr 9) | 2.231 | 0.001 | ||
6 | Time Preference (Gr 9) + Cigarettes (Gr 8 – Gr 10) | 1.209 | 0.482 | 1.162 | 0.017 |
7 | Emotional Distress (Gr 8 – Gr 10) | 2.476 | 0.000 | ||
8 | Emotional Distress + Cigarettes (Gr 8 – Gr 10) | 1.194 | 0.533 | 1.164 | 0.021 |
9 | Family Stresses + Divorce (Gr 10) | 2.359 | 0.000 | ||
10 | Family Stresses + Divorce (Gr 10) + Cigarettes (Gr 8 – Gr 10) | 1.192 | 0.496 | 1.167 | 0.020 |
11 | Attitudes (Gr 8 – Gr 10) | 1.898 | 0.017 | ||
12 | Attitudes + Cigarettes (Gr 8 – Gr 10) | 1.358 | 0.329 | 1.165 | 0.025 |
13 | Family Influence (Gr 10) | 1.711 | 0.067 | ||
14 | Family Influence (Gr 10) + Cigarettes (Gr 8 – Gr 10) | 1.112 | 0.733 | 1.191 | 0.032 |
15 | Peer Effects (Gr 8 – Gr 10) | 1.338 | 0.392 | ||
16 | Peer Effects + Cigarettes (Gr 8 – Gr 10) | 1.084 | 0.786 | 1.149 | 0.036 |
All models include as additional regressors age, gender, ethnicity, parental education, an indicator of treatment effect, days alone and hours alone, nonresponse weights, and propensity score weights.
VI. Discussion and Conclusions
Several important conclusions can be drawn from these results. First, the Table 1 and 3 results indicate that important differences in baseline characteristics exist between individuals who choose to use marijuana and those who do not, even before most of these individuals ever initiate marijuana. Ignoring the selection bias indicated by these differences can lead to a significant over-estimation of the association between marijuana use and schooling, and is consistent with results presented by Chatterji (2006). Once we accounted for these baseline differences using propensity score weights, the odds ratio on marijuana use was cut in half, although it was still quite large and statistically significant.
Second, Models 2, 5 and 6 of Tables 3 indicate that the primary mechanism through which marijuana might influence the decision to drop out is probably not a reduction in educational achievement. Indeed, including grades in school for 8th, 9th, and 10th grade led to only a 22 percent reduction in the estimated odds ratio of marijuana use on the decision to drop out. The association remained statistically significant and marijuana users were still more than twice as likely to drop out of school even after including measures of school achievement in the model.
Surprisingly, it is the inclusion of indicators of cigarette smoking during 8th, 9th and 10th grade that leads to a reduction in the association between marijuana use and schooling in Table 3. There are at least four potential interpretations of this result. First, marijuana use causes cigarette smoking and controlling for smoking in the model is inducing a negative bias on the coefficient for marijuana use. This interpretation, however, is not supported by scientific evidence. We are not aware of any scientific evidence showing that cigarette use has an independent and direct negative effect on cognition; moreover, controlling for grades has almost no effect on the odds ratio for marijuana use. The second possible interpretation of the cigarette result is that there is common unexplained heterogeneity among students who smoke marijuana and cigarettes and including cigarette use in the model results in negative endogeneity bias in the odds ratio for marijuana use. A negative endogeneity bias requires that, conditional on these unobserved factors for smoking, students with greater unobserved risks for marijuana use have smaller risks for dropout. This seems unlikely. In all observed data from our study and many other studies, youth who use marijuana have greater risk factors for negative behaviors and outcomes including negative peer associations, weak social bonds, low aspirations, and increased deviant behaviors. While we cannot rule out the possibility that unobserved heterogeneity creates a situation where conditioning on smoking reverses the strong relationship between marijuana use and negative risk factors, this sort of endogeneity bias seems much less likely than omitted-variable bias between marijuana use and dropout.
However, there is some evidence for two other explanations of the results: 1) marijuana influences an unobserved set of behaviors that is highly associated with both smoking and dropout; and 2) omitted variable bias. In the former case, one might expect marijuana use to lead students to associate with more deviant peers or to develop opposition to traditional values and institutions which devalues schooling and leads to dropout. Students with such associations and attitudes are also at high risk for smoking (Ellickson, Tucker, Klein, 2008). We test for many of these alternative pathways, finding that peer associations might explain our results.
Omitted variable bias, which arises from a failure to fully account for pre-existing differences between students who use marijuana in grades 9 and 10 and those who do not, is also a defensible explanation. Earlier behavioral problems and risk factors not captured by our measures but known to be associated with marijuana and other problem behaviors in adolescents are one possible source of omitted variable bias. Cigarette smoking is known to be correlated with multiple demographic and psychosocial risk factors for negative outcomes (Collins and Ellickson, 2004; Tucker et al., 2006). In this study, both peer associations and family influences also accounted for the relationship between marijuana use and dropout. Although peer associations could be endogenous and causally influenced by marijuana use, they could also be correlated with omitted variables that better explain baseline differences. Family influences are less likely to be endogenous, which supports the conclusion that the relationships between marijuana use during grades 9 and 10 and dropout is the spurious result of omitted variables. However, it is possible that parental attitudes towards children’s substance use (a component of our family influences) could change in response to use or that children’s perception of their parents’ attitudes may change following use. Hence our data do not allow us to provide a definitive conclusion about whether the effects of marijuana use on dropout reflect the influence of marijuana use on other behaviors such as peer associations or omitted variable bias. We can conclude, however, that the data do not support a causal model in which marijuana results in dropout through cognitive impairment.
One other key finding of this study is that both time-varying and time persistent factors influence the observed relationship between marijuana use and dropping out. Our attempt to control for a very rich set of variables at grade 7 was insufficient for explaining the correlation between marijuana use and dropout status. Instead, variation in many of these constructs over time (in 8th, 9th and 10th grade) had additional influence on the relationship between marijuana use and high school success, suggesting that individual fixed-effects techniques would be insufficient for eliminating the bias caused by unobserved heterogeneity.
When interpreting the above results, two study limitations should be kept in mind. First, we examine student behaviors in only one state, South Dakota. Because South Dakota is very rural, with only two major cities, this sample may not accurately reflect the experience of all U.S. students. In particular, self-reported rates of marijuana use in South Dakota from the Youth Risk Behavioral Surveillance System were about 6 (lifetime) or 4 (current) percentage points lower than the national average of 43 (lifetime) and 24 percent (current) for 1999 to 2003, the years during which our study cohort was in high school. Smoking and alcohol use rates were, in contrast, generally a little higher in South Dakota than across the nation (Centers for Disease Control and Prevention, 2000, 2002, 2004). In addition, recent estimates for the period between 2000 and 2003 found higher graduation rates (by about nine percentage points) in South Dakota than nationally (Education Week, 2009). Thus, marijuana use and dropout might be somewhat more deviant in South Dakota then in the rest of the nation and this could alter the relationship between deviant behaviors and dropout.
Second, inherent measurement error or possible limitations in our measurement of specific baseline constructs could partially explain the insufficiency of baseline measures to remove differences between marijuana users and other students that contribute to differences in dropout rates. In other words, there may still be unobservable factors biasing our results. Finally, the estimate of the effect of heavy and persistent marijuana use during grades 9 and 10 accounts only for the effects of use during this period. Effects from earlier use are not included and may represent one of those unobserved factors still biasing results.
In addition to providing new interpretations of the observational relationship between marijuana use and high school dropout, this study also provides insights regarding appropriate instruments when trying to identify causal associations using instrumental variables techniques. Our results show that measures of cigarette use, peer substance use and parental substance use, which have been used in previous analyses, would be poor instruments as they are heavily correlated with high school completion as well as marijuana use.
Acknowledgement
This work was supported by Grant R01DA11246 from the National Institute on Drug Abuse. Dr. Pacula’s time was also partially supported by Grant R01DA019993 from the National Institute on Drug Abuse.
Footnotes
Author conflicts of interest: none
We also explored models with fixed effects for the high schools the students attended in 9th grade. The models provided qualitatively similar results to models without fixed effects and are not reported here.
The student survey asked “What is the HIGHEST LEVEL of education you completed?” Response choices included: less than high school, GED, high graduate or college attendance. The school survey was administered about a year after the student’s scheduled graduation date and provided schools with a list of students’ names and a box to check if the student was a “High School Graduate.”
To address this question, we used a seven-item drug dependence sum based on items chosen to meet DSM-IV criteria (D'Amico et al., 2005) and the AUDIT for alcohol (Babor et al., 1992; Bohn, Babor, Kranzler, 1995).
Models with daily use also showed the same pattern of results as our models with heavy and persistent use: marijuana use is a significant predictor of dropout out even accounting for baseline variables, but the effect is greatly reduced and not significant when we control for cigarette smoking or smoking and peer effects. Results are available from the authors upon request.
Research suggests that by including all of these variables we risk potentially introducing greater variance into our estimate (and hence are more likely to reject the finding of significance), but the alternative is introducing bias caused by omitting relevant variables and violating the assumption of strong ignorability by failing to control for variables related to HPMU and dropout (Rubin, 1997; Rubin and Thomas, 1996).
The measure can be negative or positive (the minimum value is ln(1/30)=−3.4), because it uses the logarithm of index values that are both less than or greater than 1.
An alternative specification using binge drinking (days of 3 or more drinks in the past month) was also considered for the sensitivity analyses. The results using this measure were qualitatively similar to those presented here and did not change the estimated relationship between marijuana use and drop out status.
Additional items are questions regarding how often during the past year the student: ran away from home for a night or more; stole or tried to steal items worth $50 or more; sold marijuana; sold other drugs; took a vehicle for a joyride without the owner’s permission; drove a car, motorcycle or other vehicle after drinking alcohol; used strong-arm force to get money or things from people; attacked someone with the idea of seriously hurting or killing that person; purposely set fire or tried to set fire to a building, car, or other property; got into trouble with the police because of something s/he did.
Because the original scales for the two items in the peer measures differ, we standardized the items to mean zero and standard deviation one prior to being averaged for the index. Hence the index takes on both positive and negative values, with higher values on this index indicating more exposure to peer-using friends.
Religiosity was not measured at grade 7.
The Bayesian imputation model uses a multivariate Gaussian distribution to approximate the joint distribution for the variables conditional on the unobserved parameter values. The imputed values are a random sample from the posterior distribution of the missing data and conditional on the observed data and the model. Using the NORM software (Schafer, 1999), we sampled five sets of imputed values. Imputed values have been found to be robust to model misspecifications (Schafer, 1997).
The Equations 5–8 yield that Pr{U = 1} = (1-ε1)px and Pr{Z = 1} = (1-ε2)px + ε2(1 − px) which can be solved by averaging over px to identify the error rates, ε1 and ε2.
We tested model specification by allowing the probability of errors in student self-reported data to depend on student variables and using alternative probit link functions. The results were qualitatively invariant to the model; consequently, we report the results from the parsimonious model of Equation (9) rather than alternatives.
References
- Bailey SL, Hubbard RL. Developmental variation in the context of marijuana initiation among adolescents. Journal of Health and Social Behavior. 1990;31:58–70. [PubMed] [Google Scholar]
- Babor TF, Dolinsky ZS, Meyer RE, et al. Types of alcoholics: Concurrent and predictive validity of some common classification schemes. British Journal of Addiction. 1992;87:1415–1431. doi: 10.1111/j.1360-0443.1992.tb01921.x. [DOI] [PubMed] [Google Scholar]
- Bandura A. Social Foundations of Thought and Action. Prentice Hall: Englewood Cliffs, NJ; 1985. [Google Scholar]
- Bandura A. Self-efficacy: toward a unifying theory of behavioral change. Psychological Review. 1977;84:191–215. doi: 10.1037//0033-295x.84.2.191. [DOI] [PubMed] [Google Scholar]
- Bang H, Robins JM. Doubly robust estimation in missing data and causal inference models. Biometrics. 2005;61:962–973. doi: 10.1111/j.1541-0420.2005.00377.x. [DOI] [PubMed] [Google Scholar]
- Barnes GE, Barnes MD, Patton D. Prevalence and predictors of “heavy” marijuana use in a Canadian sample. Journal of Substance Use and Misuse. 2005;40:1849–1863. doi: 10.1080/10826080500318558. [DOI] [PubMed] [Google Scholar]
- Becker GS. Human Capital: A Theoretical Analysis with Special Reference to Education. New York: Columbia University Press for National Bureau of Economic Research; 1964. [Google Scholar]
- Becker GS. Human Capital. 3rd Ed. Chicago: Chicago University Press; 1993. [Google Scholar]
- Bohn MJ, Babor TF, Kranzler HR. The alcohol use disorders identification test (AUDIT): Validation of a screening instrument for use in medical settings. Journal of Studies on Alcohol. 1995;56(4):423–432. doi: 10.15288/jsa.1995.56.423. [DOI] [PubMed] [Google Scholar]
- Bray JW, Zarkin GA, Ringwalt C, Qi J. The Relationship between marijuana initiation and dropping out of high school. Health Economics. 2000;9:9–18. doi: 10.1002/(sici)1099-1050(200001)9:1<9::aid-hec471>3.0.co;2-z. [DOI] [PubMed] [Google Scholar]
- Brook JS, Adams RE, Balka EB, Johnson E. Early adolescent marijuana use: Risks for the transition to young adulthood. Psychological Medicine. 2002;32:79–91. doi: 10.1017/s0033291701004809. [DOI] [PubMed] [Google Scholar]
- Brook JS, Balka EB, Whiteman M. The risks for late adolescence of early adolescent marijuana use. American Journal of Public Health. 1999;89:1549–1554. doi: 10.2105/ajph.89.10.1549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brook JS, Gordon AS, Brook A, Brook DW. The consequences of marijuana use on intrapersonal and interpersonal functioning in black and white adolescents. Genetic, Social, and General Psychology Monographs. 1989;115:349–369. [PubMed] [Google Scholar]
- Card D. The causal effect of education on earnings. In: Handbook of Labor Economics, Vol III, Part A. Ashenfelter O, Card D, editors. Amsterdam: Elsevier Science; 1999. pp. 1801–1863. [Google Scholar]
- Centers for Disease Control and Prevention. Youth risk behavior surveillance-United States, 1999. Morbidity and mortality weekly report, Vol. 49, No. SS-5. [17 June 2009];Atlanta: Centers for Disease Control; 2000 http://www.cdc.gov/mmwr/PDF/SS/SS4905.pdf. [PubMed]
- Centers for Disease Control and Prevention. Youth risk behavior surveillance-United States, 2001. Morbidity and mortality weekly report Vol. 51, No. SS-4. [17 June 2009];Atlanta: Centers for Disease Control; 2002 http://www.cdc.gov/mmwr/PDF/SS/SS5104.pdf. [PubMed]
- Centers for Disease Control and Prevention. Youth risk behavior surveillance-United States, 2003. Morbidity and mortality weekly report Vol. 53, No. SS-2. [17 June 2009];Atlanta: Centers for Disease Control; 2000 http://www.cdc.gov/mmwr/PDF/SS/SS5302.pdf.
- Chatterji P. Illicit drug use and educational attainment. Health Economics. 2006;15:489–511. doi: 10.1002/hec.1085. [DOI] [PubMed] [Google Scholar]
- Collins RL, Ellickson PL. Integrating four theories of adolescent smoking. Substance Use and Misuse. 2004;39:179–209. doi: 10.1081/ja-120028487. [DOI] [PubMed] [Google Scholar]
- Cook PJ, Moore MJ. Drinking and schooling. Journal of Health Economics. 1993;12:411–429. doi: 10.1016/0167-6296(93)90003-w. [DOI] [PubMed] [Google Scholar]
- D'Amico EJ, Ellickson PL, Collins RL, Martino S, Klein DJ. Processes linking adolescent problems to substance-use problems in late young adulthood. Journal of Studies on Alcohol. 2005;66:766–775. doi: 10.15288/jsa.2005.66.766. [DOI] [PubMed] [Google Scholar]
- Dee TS, Evans WN. Teen drinking and educational attainment: evidence from two-sample instrumental variables (TSIV) estimates. Journal of Labor Economics. 2003;21:178–209. [Google Scholar]
- Donovan JE, Jessor R. Structure of problem behavior in adolescence and young adulthood. Journal of Consulting and Clinical Psychology. 1985;53:890–904. doi: 10.1037//0022-006x.53.6.890. [DOI] [PubMed] [Google Scholar]
- Duncan SC, Duncan TE, Biglan A, Ary D. Contributions of the social context to the development of adolescent substance use: a multivariate latent growth modelling approach. Drug and Alcohol Dependence. 1998;50:57–71. doi: 10.1016/s0376-8716(98)00006-4. [DOI] [PubMed] [Google Scholar]
- Education Week. Diplomas Count 2009: Broader Horizons: The Challenge of College Readiness For All Students. Graduation Rate Trends 1996–2006. [17 June 2009];Education Week. 2009 28(34) http://www.edweek.org/ew/dc/2009/gradrate_trend.html. [Google Scholar]
- Ellickson PK, Bui K, Bell R, McGuigan KA. Does early drug use increase the risk of dropping out of high school? Journal of Drug Issues. 1998;28:357–380. [Google Scholar]
- Ellickson PL, McCaffrey DF, Ghosh-Dastidar B, Longshore D. New inroads in preventing adolescent drug use: results from a large-scale trial of project ALERT in middle schools. American Journal of Public Health. 2003;93:1830–1836. doi: 10.2105/ajph.93.11.1830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ellickson PL, Tucker JS, Klein DJ. Reducing Early Smokers’ Risk for Future Smoking and Other Problem Behavior: Insights From a Five-Year Longitudinal Study. Journal of Adolescent Health. 2008;43:394–400. doi: 10.1016/j.jadohealth.2008.03.004. [DOI] [PubMed] [Google Scholar]
- Engberg J, Morral AR. Reducing substance use improves adolescents’ school attendance. Addiction. 2006;101:1741–1751. doi: 10.1111/j.1360-0443.2006.01544.x. [DOI] [PubMed] [Google Scholar]
- Farrell P, Fuchs VR. Schooling and health: the cigarette connection. Journal of Health Economics. 1982;1:217–230. doi: 10.1016/0167-6296(82)90001-7. [DOI] [PubMed] [Google Scholar]
- Fergusson DM, Horwood LJ. Early onset cannabis use and psychosocial adjustment in young adults. Addiction. 1997;92:279–296. [PubMed] [Google Scholar]
- Fergusson DM, Horword LJ, Swain-Campbell N. Cannabis use and psychosocial adjustment in adolescence and young adulthood. Addiction. 2002;97:1123–1135. doi: 10.1046/j.1360-0443.2002.00103.x. [DOI] [PubMed] [Google Scholar]
- Fergusson DM, Lynskey MT, Horwood LJ. The short-term consequences of early onset cannabis use. Journal of Abnormal Child Psychology. 1996;24:499–512. doi: 10.1007/BF01441571. [DOI] [PubMed] [Google Scholar]
- Friedman AS, Bransfield S, Kreisher C. Early teenage substance use as a predictor of educational-vocational failure. American Journal on Addictions. 1994;3:325–336. [Google Scholar]
- Fuchs VR. Time preference and health: An exploratory study. In: Fuchs VR, editor. Economic Aspects of Health. Chicago, IL: University of Chicago Press for National Bureau of Economic Research; 1982. pp. 93–120. [Google Scholar]
- Gfroerer JC, Wu LT, Penne MA. Rockville, MD: Substance Abuse and Mental Health Services Administration, Office of Applied Studies; Initiation of Marijuana Use: Trends, Patterns, and Implications. 2002 Analytic Series: A-17, DHHS Publication No. SMA 02-3711.
- Grossman M. On the concept of health capital and the demand for health. Journal of Political Economy. 1972;80:223–255. [Google Scholar]
- Hawkins JD, Catalano RF, Miller JY. Risk and protective factors for alcohol and other drug problems in adolescence and early adulthood: implications for substance abuse prevention. Psychological Bulletin. 1992;112:64–105. doi: 10.1037/0033-2909.112.1.64. [DOI] [PubMed] [Google Scholar]
- Hawkins JD, Weis JG. The social development model: An integrated approach to delinquency prevention. Journal of Primary Prevention. 1985;6:73–97. doi: 10.1007/BF01325432. [DOI] [PubMed] [Google Scholar]
- Heyser CJ, Hampson RE, Deadwyler SA. Effects of Delta-9-Tetrahydrocannabinol on delayed match to sample performance in rats: alterations in short-term memory associated with changes in task specific firing of hippocampal cells. The Journal of Pharmacology and Experimental Therapeutics. 1993;264:294–307. [PubMed] [Google Scholar]
- Ho D, Imai K, King G, Stuart EA. Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference. Political Analysis. 2007;15:199–236. [Google Scholar]
- Jessor R, Jessor SL. Problem Behavior and Psychosocial Development: A Longitudinal Study of Youth. New York: Academic Press; 1977. [Google Scholar]
- Koch SF, Ribar DC. A siblings analysis of the effects of alcohol consumption onset on educational attainment. Contemporary Economic Policy. 2001;19:162–174. [Google Scholar]
- Kumar R, O’Malley PM, Johnston LD, Schulenberg JE, Bachman JG. Effects of school-level norms on student substance use. Prevention Science. 2002;3:105–124. doi: 10.1023/a:1015431300471. [DOI] [PubMed] [Google Scholar]
- LaGrange RL, White HR. Age differences in delinquency: a test of theory. Criminology. 1985;23:19–45. [Google Scholar]
- Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73:13–22. [Google Scholar]
- Little RJA, Rubin DB. Statistical Analysis with Missing Data. New York: John Wiley & Sons; 2002. [Google Scholar]
- Matsuda LA, Bonner TI, Lolait SJ. Localization of cannabinoid receptor messenger RNA in rat brain. Journal of Comparative Neurology. 1993;327:535–550. doi: 10.1002/cne.903270406. [DOI] [PubMed] [Google Scholar]
- Mensch BS, Kandel DB. Dropping out of high school and drug involvement. Sociology of Education. 1988;61:95–113. [Google Scholar]
- Mendelson J, Rossi M, Meyer R. The Use of Marihuana: A Psychological and Physiological Inquiry. New York: Plenum; 1974. [Google Scholar]
- Newcomb M, Bentler P. Drug use, educational aspirations, and work force involvement: the transition from adolescence to young adulthood. American Journal of Community Psychology. 1986;14:303–321. doi: 10.1007/BF00911177. [DOI] [PubMed] [Google Scholar]
- Newcomb M, Bentler P. Consequences of Adolescent Drug Use: Impact on the Lives of Young Adults. Newbury Park, CA: Sage Publications; 1988. [Google Scholar]
- Perkonigg A, Goodwin RD, Fiedler A, Behrendt S, Beesdo K, Lieb R, Wittchen H. The natural course of cannabis use, abuse and dependence during the first decades of Life. Addiction. 2007;103:439–449. doi: 10.1111/j.1360-0443.2007.02064.x. [DOI] [PubMed] [Google Scholar]
- Register CA, Williams DR, Grimes PW. Adolescent drug use and educational attainment. Education Economics. 2001;9:1–18. [Google Scholar]
- Ridgeway G, McCaffrey DF. Comment: demystifying double robustness: a comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science. 2007;22:540–543. doi: 10.1214/07-STS227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roebuck MC, French MT, Dennis ML. Adolescent marijuana use and school attendance. Economics of Education Review. 2004;23:133–141. [Google Scholar]
- Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70:41–55. [Google Scholar]
- Rubin DB. Estimating causal effects from large data sets using propensity scores. Annals of Internal Medicine. 1997;127:757–176. doi: 10.7326/0003-4819-127-8_part_2-199710151-00064. [DOI] [PubMed] [Google Scholar]
- Rubin DB, Thomas N. Matching using estimated propensity scores: relating theory to practice. Biometrika. 1996;52:249–264. [PubMed] [Google Scholar]
- Sander W. The effects of schooling and cognitive ability on smoking and marijuana use by young adults. Economics of Education Review. 1998;17:317–324. [Google Scholar]
- Schafer JL. Analysis of Incomplete Multivariate Data by Simulation. New York: Chapman & Hall Ltd; 1997. [Google Scholar]
- Schafer JL. Software for multiple imputation. 1999. [May 26, 2008]. www.stat.psu.edu/~jls/misoftwa.html. [Google Scholar]
- Schulenberg J, Bachman JG, O'Malley PM, Johnston LD. High school educational success and subsequent substance use: a panel analysis following adolescents into young adulthood. Journal of Health and Social Behavior. 1994;35:45–62. [PubMed] [Google Scholar]
- Schultz TW. Investment in human capital. American Economic Review. 1961;51:1–17. [Google Scholar]
- Simmons RG, Blyth DA. Moving into Adolescence: The Impact of Pubertal Change and School Context. New York: Aldine de Gruyter; 1987. [Google Scholar]
- Sommer B. Truancy in early adolescence. Journal of Early Adolescence. 1985;5:145–160. [Google Scholar]
- Tucker JS, Ellickson PE, Orlando M, Klein DJ. Cigarette smoking from adolescence to young adulthood: women’s developmental trajectories and associated outcomes. Women’s Health Issues. 2006;16:30–37. doi: 10.1016/j.whi.2005.08.004. [DOI] [PubMed] [Google Scholar]
- Van Ours J. Dynamics in the use of drugs. Health Economics. 2006;15:1283–1294. doi: 10.1002/hec.1128. [DOI] [PubMed] [Google Scholar]
- Wells KB, Sturm R, Sherbourne CD, Meredith LS. Caring for Depression. Cambridge MA: Harvard University Press; 1996. [Google Scholar]
- Willis RJ. Wage determinants: a survey and reinterpretation of human capital earnings functions.”. In: Ashenfelter O, Layard R, editors. Handbook of Labor Economics. Vol I. Amsterdam: Elsevier Science; 1987. pp. 525–602. 1987. [Google Scholar]
- Willis RJ, Rosen S. Education and self-selection. Journal of Political Economy. 1979;87:S7–S38. [Google Scholar]
- Wills TA, Shiffman S. Coping and substance use: a conceptual framework. In: Wills TA, Shiffman S, editors. Coping and Substance Use. Orlando FL: Academic Press; 1985. pp. 3–24. [Google Scholar]
- Wooldridge JM. Econometric Analysis of Cross Section and Panel Data. Cambridge MA: MIT Press; 2002. [Google Scholar]
- Yamada T, Kendix M, Yamada T. The impact of alcohol consumption and marijuana use on high school graduation. Health Economics. 1996;5:77–92. doi: 10.1002/(SICI)1099-1050(199601)5:1<77::AID-HEC184>3.0.CO;2-W. [DOI] [PubMed] [Google Scholar]