Abstract
We analyze family decisions to participate in community-based universal substance-abuse prevention programs through the framework of expected utility theory. Family functioning, which has been shown to be a good indicator of child risk for substance abuse, provides a useful reference point for family decision making. Our results show that well-functioning families (with children at low risk for substance use) should have the lowest incentive to participate, but that high-risk families may also opt out of prevention programs. For programs that are most effective for high-risk youth, this could be a problem. Using data from the Strengthening Families Program and the Washington Healthy Youth Survey, we empirically test the implications of our model and find that at least for one measure of family functioning those families with children most likely to be at risk for substance use are opting out of the program.
Introduction
Early initiation of substance use by adolescents is strongly associated with numerous risk behaviors and negative outcomes, including delinquency, pregnancy, and HIV, poor health and lower educational attainment as well as with long-term substance dependence and abuse (DuRant, et al., 1999; Gil and Molina, 2007). Interventions designed to prevent adolescent substance use are important because of the high personal and societal costs of substance abuse: dollar costs of drug and alcohol abuse and dependence were estimated at $245.7 billion annually in 1992 and have increased by approximately 5% annually since then (Office of National Drug Control Policy, 2001). Over the past two decades, experimental trials have shown that family-based interventions, which aim to minimize adolescent risk behaviors through strengthening parent-child relationships and improving family management, can reduce rates of adolescent substance use initiation and frequency (Foxcroft, et al., 2003; Hawkins, et al., 1999). In some communities and states, such evidence-based programs are now being implemented on a large-scale as public health interventions.
In the lexicon of public health, “universal” prevention programs or strategies are delivered without any prior screening (e.g. anti-drunk driving campaigns) and are open for all who wish to participate; “selective” prevention programs target those at higher-than-average risk for a problem (e.g. diabetes prevention with overweight individuals); and “indicated” prevention programs target those who display early signs of a problem (e.g. dropout prevention with students failing classes) (Mrazek and Haggerty, 1994). Most substance abuse prevention programs are designed as universal interventions, usually for young adolescents, either in schools or in community-based family groups. The systematic study of substance abuse intervention effectiveness is relatively recent, and therefore most studies measuring the potential benefits of such programs are based on randomized clinical trials (RCTs). But when universal programs are implemented in a community, participation may be non-random1, in which case the benefits observed in practice may be appreciably different from those observed in RCTs2. More specifically, evaluating community-based prevention programs may be complicated by selective participation. If individual decisions about participation are based on factors that affect the likelihood that the program is successful3, measuring the program's impact becomes problematic. Most importantly, any measurements of benefits and cost must account for an endogenous self-selection bias. Thus, understanding the cause and impacts of self-selection is important in any study of community-based prevention programs.
In this paper we use expected utility theory (EU) to explain self-selection into community-based programs designed to improve how a family functions. High levels of family functioning, as assessed by measures of family management and parent-child relationship, are associated with a lower likelihood of adolescent substance use and misuse (Dakof, 2000; Schmidt, et.al. 1996). Thus we refer to families with high scores on measures of family functioning as “low risk” and those with low scores as “high risk”. Using a model where family functioning influences the propensity of children towards future substance abuse we show that low-risk families would be less likely to participate. Moreover, we also find some evidence that the highest-risk families, by which we mean those with the lowest levels of functioning, may also be less likely to participate in these programs. Since the youth in these families are most needful of the intervention, this is a costly outcome. We test the conjectures from this model with data from one such evidence-based intervention, the Strengthening Families Program for Parents and Youth 10-14 (SFP), in Washington State and find evidence that both very high-functioning families and very low functioning families are less likely to participate.
In the next section we offer further detail of the literature covering selection issues in community-based prevention programs. Following that we develop an EU model which explains family self-selection into such programs in the context of family functionality. We then introduce the analytical techniques we use to test the predictions of the model. Subsequent sections discuss the data we use and our empirical results. We close the paper with conclusions and implications for further research, including briefly describing how self-selection may impact the apparent as opposed to real costs and benefit from community-based prevention programs.
Selectivity Issues in Community Based Prevention Programs
A large literature on predictors of prevention program participation has developed in recent years. These studies use constructs derived from the health beliefs model, protection motivation theory, theory of reasoned action, and subjective utility theory to model the decision to attend a prevention program (for a review and comparison of these theories see Weinstein, 1993). Spoth and Redmond (1995) note that “all of these models and theories can be characterized as value expectancy approaches, incorporating constructs that address the value placed on a specific, health-related outcome and the estimated likelihood that a specific action will achieve that outcome” (p. 295). In a medical application, research on program participation in health interventions has shown selection effects related to value expectancies and demographic characteristics (Murphy, et al., 2010). Other studies have found that families with lower socioeconomic status are more likely to report privacy concerns and logistical barriers (Haggerty et al., 2002; Heinrichs, Bet al., 2005; Spoth, et al., 1995). Orrell-Valente and colleagues (1999) found that parents were more likely to have positive therapeutic alliance and thus to participate when program leaders were of the same race and of similar socioeconomic background. Possibly most relevant to the issue discussed in this paper, some studies have found that parent perception of child behavior problems increases the likelihood of attendance (Haggerty et al., 2002; Heinrichs, et al., 2005), and parents who see potential benefits of a program are also more likely to attend (Spoth, et al., 1996). In addition, several studies have found that positive family attributes, including clear communication patterns and family organizational skills, are positively related to program participation (Bauman, et al., 2001).
Expected Utility and Self-Selection
Community-based universal programs to improve family relationships or prevent substance abuse are open to all. Families choose to participate if their expected gain, in terms of improved family relationships or a lower likelihood of child substance abuse, exceeds the costs of attending. Psychologists have shown measures of parent-child relationship quality and family management practices are good predictors of youth substance abuse, and many programs, like SFP, are designed to improve family functioning and thereby lessen risk for adolescent substance use. In our model we use family functioning as the target of the intervention and demonstrate that the current status of family functioning may affect the decision to attend a universal family preventive intervention.
Let F be a measure of family functioning and U(F) be a family's valuation of this functioning (especially their valuation of the associated likelihood of future drug abuse by children in the family4), with U(F) > 0, U′(F) ≥ 0 and U″(F) <0.The program is designed to increase family functioning. Thus, participation in the program can be thought of as “successful” if it increases F and unsuccessful if F does not increase. We assume the program does no harm so F cannot decrease from program participation. The benefit of a successful program to a family is therefore approximated by ΔF × U′(F).
Youth who participate in the program incur costs associated with forgone activities, and their willingness to participate is likely to be highly influenced by various social institutions (Leung, 2004). Adults who participate in the program also incur costs, both real dollar costs for such things as transportation, foregone wages5 and the loss of leisure time. In addition families face psychological and social costs related to organizing and cajoling the family into participating in the program. For example, families participating in the program may be required to share with strangers matters traditionally considered private to the family, or parents may disagree with each other or with their child about attending a program (Spoth et al., 1996). Public exposure of how the family works may be especially hard for families with lower functioning levels. Some research has shown that youth with problem behaviors are unwilling to attend family programs despite the fact that parents who perceive that their child has a problem or is susceptible to problems are more willing to attend a program (Spoth, et al., 1997). In contrast, families with adolescents who feel strongly attached to a parent are more likely to attend a program (Bauman et al., 2001). A decision to attend a family strengthening program may imply that the family is in trouble or that parents are doing a poor job; in this case, participants may face social disapproval, which has been shown to discourage program participation (Gensheimer, Ayers, & Roosa, 1993). It is these psychological and social costs which we expect to be higher for lower functioning families, and which form the basis for our theoretical construct on family costs of attending the program.
Assume that the transportation costs, foregone wages, and lost leisure time are the same for all families but the psychological and social costs differ according to family functioning. Based on the arguments discussed in the previous paragraph we assume that the psychological and social costs decrease as family functioning goes up but all families incur some costs related to transportation and foregone wages. Thus we have C(F) > CM > 0, C′(F) < 0 and C″(F) ≥ 0 where CM is the minimum costs incurred if a family participates in the program. Hence, CM is the cost of participation for the most functional families.
The program does not necessarily increase family functioning. We assume initially that the probability of program success, P, is independent of family functioning so P(F) = P for all F. A family participates if the expected value of participation is positive, that is, if
| (1) |
and so families participate only if ΔF × U′(F)*P > C(F). For any given ΔF the limit of the left-hand-side of this inequality as F → ∞ is 0, while the limit of the right-hand-side is CM. Hence there is some F* above which families will choose not to attend. Figure 1 shows the case when lower functioning monotonically increases a family's incentive to participate. However, because the psychological and social costs for low-functioning families may be large, it is not clear that families at the very lowest level of functioning will have a positive expected value. Figure 2 shows the case where the very lowest-functioning and the highest-functioning families opt out and only families with levels of functioning, between F1 and F2, participate.6
Figure 1.

The lower the family functionality the greater the incentive to join SEP.
Figure 2.

Only families in the midrange of functionality join SEP.
It is likely, however, that the probability of success is not independent of (initial) family functioning, so the probability of success is P(F) with 1 ≥ P(F) ≥ 0. At issue is how P(F) relates to F. We see two possibilities. The first is that lower-functioning families are less likely to have a successful program because they are less likely to follow through on program activities and recommendations, so P′(F) > 0. (Spoth et al., 1998). The second is that more functional families have “less to fix” and therefore are less likely to find that F improves and so P′(F) < 0.7 Equation (1) now becomes
| (2) |
and families participate if ΔF × U′(F)*P(F) > C(F).or, more conveniently for our purposes, if ΔF × U′(F) > C(F) / P(F). Again, for any given ΔF taking the limit of both sides of this inequality as F → ∞ we find that our previous results still hold. As F → ∞ the left-hand-side still approaches 0. The right-hand-side approaches CM if P′(F) > 0 (since P(F) →1) and it approaches infinity if P′(F) < 0 (since P(F) →0). Empirically these are observationally equivalent, and the only impact is that families might opt out at lower levels of functionality if P′(F) < 0.
One remaining question is if we could ever see higher-functioning families being more likely to opt in when compared to mid-level and low-level functioning families. The answer is yes if C′(F) ≪ 0 which could be the case if transportation or other costs, especially the opportunity costs of time in terms of foregone wages, is strongly related to family functioning. However, we believe this is unlikely to be sufficiently strong to dominate in the great majority of families.8
Data
We test the implications of our theoretical model by examining participation in the Strengthening Families Program (SFP). SFP is a universal program in which families meet once weekly for seven weeks. In each meeting, parents learn effective means of communicating expectations and setting consequences; youth learn peer resistance skills; and parents and youth engage in exercises designed to promote family closeness. The rigorous clinical efficacy trial of SFP has produced solid evidence of long-term effectiveness in delaying onset and frequency of adolescent substance use (Foxcroft, et al., 2003; Spoth, Redmond, & Shin, 2001). In that trial, families were invited to participate in an experiment and then randomized to treatment or control conditions; a telephone survey demonstrated that the trial sample was representative of the local population. In a community-based implementation, however, families self-select into the program. Recruitment techniques vary from community to community: families may hear about the program through friends and neighbors; through a school principal or guidance counselor; through their faith community, or through public advertising.
We use data from multiple sites in a statewide dissemination of SFP. A survey that program participants complete on the first night of the program included questions that assess dimensions of family functioning targeted by SFP, including parent-child involvement, positive reinforcement of children's good behavior, clear communication of rules and consequences, and children's peer-pressure resistance skills. A total of 294 youth from 42 programs in 10 counties participated in the survey.
To perform our analysis, we require a supplementary data set with identical variables to those measured for the SFP participants. The Washington State Healthy Youth Survey (HYS) provides such a data set (Washington State Department of Health, 2008).9 The Washington Healthy Youth Survey is conducted biennially in schools statewide in an effort to measure health risk behaviors that contribute to the morbidity, mortality and social problems of youth in Washington State. Demographic items on the HYS are administered to all students in participating schools. Other items and scales are included based on age. Of relevance to this study, 6th graders answered questions on parent-child involvement and positive reinforcement, and did not answer questions regarding peer social skills and family management. Older students answered question of the latter type but not the former.
We used HSY data from grades 6, 8, and 10 (corresponding to the age range in the SFP dataset). School response rates ranged from 80% (6th grade) to 86% (10th grade) and individual response rate across grades was 65%.10 Our dataset included 8294 complete observations for younger children (those that responded to questions regarding parent-child involvement and positive reinforcement) and 4413 complete observations for older children (those that responded to questions regarding family management, peer social skills). Summary statistics for both the SFP dataset and the HYS dataset are given in Table 1. We have indicated in the table significant (p<.05) differences in the means for specific variable between the HYS and SFP samples. We note three primary differences. First for both age groups the SFP sample is slightly older than the HYS sample. Second the racial makeups of the samples differ. For the younger age group both whites and Hispanics are over represented compared to the HYS sample. Hispanics are also over represented in the older age group, but whites are underrepresented. Finally, in both age groups there is evidence of lower family functioning on average among the participants in SFP compared to the HYS sample. Younger SFP participants report lower average scores for family involvement compared to the HYS youth of the same age group, while the older SFP participants report lower average peer social skills compared the HYS sample of the same age group.
Table 1. Summary statistics for the younger children (nHYS=8041, nSFP=200) and the older children (nHYS=4413, nSFP=94).
| Variable | Range | Ages (11-12)α | Ages (13-16)β | ||||||
|---|---|---|---|---|---|---|---|---|---|
| HYS | SFP | HYS | SFP | ||||||
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | ||
| Male | 0,1 | .48 | .0056 | .46 | .035 | .47 | .50 | .56 | .50 |
| Age (A,B) | 11-16 | 11.28 | 0050 | 11.42 | .035 | 13.29 | .48 | 13.57 | .74 |
| White (A,B) | 0,1 | .522 | .0056 | .635 | .034 | .645 | .48 | .468 | .50 |
| Black | 0,1 | .020 | .0016 | .025 | .011 | .027 | .16 | .012 | .10 |
| Asian/Pacific (A) | 0,1 | .048 | .0024 | .025 | .011 | .050 | .22 | .032 | .18 |
| Hispanic (B) | 0,1 | .167 | .0042 | .195 | .028 | .169 | .37 | .340 | .47 |
| Native (B) | 0,1 | .096 | .0033 | .065 | .017 | .056 | .23 | .106 | .11 |
| Other (A,B) | 0,1 | .21 | .0045 | .055 | .016 | .093 | .29 | .050 | .20 |
| Reinforcement | 1-4 | 3.42 | .0072 | 3.37 | .045 | n/a | n/a | n/a | n/a |
| Involvement (A) | 1-4 | 3.19 | .0079 | 2.89 | .049 | n/a | n/a | n/a | n/a |
| Family Mgmt Skills | 1-4 | n/a | n/a | n/a | n/a | 3.10 | .73 | 2.86 | .78 |
| Peer Social Skills (B) | 1-4 | n/a | n/a | n/a | n/a | 3.31 | .64 | 3.21 | .61 |
| Substance Use | 0,1 | n/a | n/a | n/a | n/a | .209 | .41 | .234 | .43 |
A implies a significant (p<.05) difference in the mean for the younger group
B implies a significant (p<.05) difference in the mean for the older group
Analytic Approach
The difficulty in analyzing our choice-restricted (SFP) and supplementary (HYS) data lies in the fact that in our supplementary data set, we are unable to determine which individuals participated in SFP and are therefore included in our data twice (Hill, Goates, and Rosenman, 2010). Steinberg and Cardell (1992) address this issue and develop an appropriate weighting procedure for analyzing this data in a pseudo-logistic regression. As this regression procedure is rarely used, we include details on its implementation in Appendix.
Our dependent variable is participation in the program and our explanatory variables of interest are the various scales that measure family functioning. For the younger children our two measures of family functioning are Positive Reinforcement for Prosocial Behavior (which we term “Reinforcement”) and Opportunities for Prosocial Behavior (which we term “Involvement”). These two measures assess the degree to which parents encourage their children's involvement in family activities and decisions and reward them for doing so. For the older children we have measures of Peer Social Skills (which assesses children's ability to resist peer pressure) and Family Management Skills (which assesses parental monitoring of youth activities and clarity of household rules). All scales have strong psychometric properties and are standard measures of risk and protective factors used in national and state surveys such as the Monitoring the Future survey, supported by the National Institute of Drug Abuse (2010). With both age levels we include as covariates gender, age, race, and for the older group we also include current substance use (this last variable is not available for those in the younger group). Specification tests suggest quadratic terms be used for Reinforcement and Involvement when estimating participation by the younger group, while only linear terms are indicated for the variables of interest in the equation explaining participation by the older group.
Results and Discussion
In Table 2 we present results of a logistic regression using the Steinberg-Cardell technique mentioned earlier. These coefficients can be interpreted the same way a logistic regression from a randomized sample might be interpreted, with each coefficient predicting program participation from each of the risk and protective factor scale scores (both linear and curvilinear) and controlling for youth race/ethnicity, sex, county, and age. For older youth current substance use (0 or 1) was also included in the regression. Our results show that in the younger group, families with female children and non-minority families were more likely to attend than families without these characteristics. These results are consistent with previous research showing that minority families and those with male children were less likely to attend a family program (Bauman et al., 2001). In the older group, the opposite was true: families with male children were more likely to attend, as were Latino and American Indian families. One possible explanation for this discrepancy is that Latino and American Indian participants were more likely to attend with both older and younger children. Another possibility is that older male children were more likely to be experiencing behavior problems, and their parents were therefore more likely to attend. However, African-American and Asian-American families, and those from other non-White families, were still less likely to attend.
Table 2. Summary of Steinberg-Cardell Pseudo- Logistic Regression Analysis for Variables Predicting Decisions to participate in the program for children age 11-12 and children age 13-16, Controlling for Background Variables.
| Predictor | Ages 11-12 | Ages 13-16 | ||||
|---|---|---|---|---|---|---|
| B | SE B | OR | B | SE B | OR | |
| Male | -.14 | .15 | .87 | .12 | .22 | 1.12 |
| Hispanic | -.34** | .20 | .71 | .75*** | .27 | 2.11 |
| Native American | -.88*** | .30 | .42 | .57* | .38 | 1.76 |
| Asian/Pacific Islander | -.70* | .46 | .50 | -.10 | .57 | .91 |
| Black | -.29 | .46 | 1.33 | -.63 | .96 | .53 |
| Other | -1.80*** | .33 | .17 | -1.11** | .61 | .33 |
| Reinforcement | .54 | 1.03 | n/a | n/a | n/a | n/a |
| Involvement | 1.36 | .80 | n/a | n/a | n/a | n/a |
| Reinforcement Squared | -.02 | .21 | n/a | n/a | n/a | n/a |
| Involvement Squared | -.45 | .17 | n/a | n/a | n/a | n/a |
| Reinforcement*Involvement | .05 | .28 | n/a | n/a | n/a | n/a |
| Peer Social Skills | n/a | n/a | n/a | -.04 | .18 | .96 |
| Family Management Skills | n/a | n/a | n/a | -.32** | .18 | .73 |
| Substance Use | n/a | n/a | n/a | -.42* | .32 | .65 |
p < .10.
p < .05.
p < .01.
Our main interest here was the effects of family functioning factors on program participation. As noted earlier, preliminary analysis indicated that second-order terms for functioning factors belong in the regression equation for the younger group, but not for the older group. To improve efficiency the reported analysis for the older group reflects this finding and does not include estimates for curvilinear effects. In logit estimation one must be cautious about interpreting the parameter estimates associated with explanatory variables, as they are estimators of the change in the logit caused by a unit change in the independent variable, not the change in the variable itself. The presence of the squared terms in the younger group means this caution is important as one must compute odds ratios using linear, squared and cross-product terms; hence the odds ratios are not constant.
The non-linear terms in the equation for the younger group mean interpretation of the logistic estimates must focus on the total rather than marginal effects The empirical analysis shows that families reach their highest overall odds ratios of participation with scores of Reinforcement = 4.00 (the highest possible value) and Involvement=1.71, and the odds of participating decrease with movement in any direction from this peak within the range of scale scores. The marginal odds ratios peak at the same values, of course, when computed at the mean of the other variable. However, we note that the estimation results for Reinforcement are not statistically significant even with a p-value of 0.15, hence for the younger group we focus on the results for Involvement. The peak value for this variable (1.71) indicates for the younger group those families with a middle level of functioning are most likely to attend, while those at both extremes (low functioning and high functioning) are least likely to attend. This is consistent with figure 2.
Results for the older group are somewhat easier to interpret since the model uses only linear estimation. Odds ratios are much more straightforward; lower scores on Family Management significantly predicted greater participation (OR = 0.725, p < .04), which is consistent with figure 1. Adolescent substance use was also negatively associated with participation (OR = 0.654, p < .10). If, after controlling for the other measures of family functioning, we interpret the presence of adolescent substance abuse as lower family functioning our results for the families with older children are also more consistent with figure 2 than with figure 1.
Conclusions and Implications
Using expected utility we explain how a family's functioning affects selection into a universal program to decrease substance abuse. Our theoretical results, supported by our empirical analysis, indicate that high functioning families are less likely to participate. In our view this is not a problem since indications are that youth coming from those families are less likely to abuse drugs and alcohol in the future. However, also consistent with what our theory showed, our empirical analysis finds that the least functioning families may also decline to participate in programs like SFP. Youth from these families are at relatively high risks of (future) substance abuse, and thus this is a social problem. It may be that social welfare can be increased by targeting lower-functioning families for participation in programs like SFP, perhaps by working to lower the costs of participation, or by subsidizing participation to increase their expected net benefits.
Moreover, our results have implications when analyzing programs using cost-benefit or cost-effectiveness analysis, especially if the probability of program success is correlated with the family functioning of the participants. If a successful program increases functioning equally for all families regardless of the family's initial functioning (i.e., if ΔF is the same for all successful participants) we would still expect that how they value the change (i.e., ΔF × U′(F)) will differ because of the properties of U′(F). The economic valuation of the program then would differ from the valuation coming out of the RCT if the average result of the RCT is used to calculate the program value. Moreover, if the probability of program success and the magnitude of change that results is correlated with family functioning (so P(F) is not constant and ΔF itself is a function of F) then benefit cost estimates for the program will be even more biased if the distribution of family functioning among the participant population is different than that of the RCT population. For example, if lower-functioning families self-select into programs, but the program is more likely to change the risk of substance abuse for these families, then the benefit-cost estimate of the clinical trial would understate the true benefit of application. There is some evidence that the long-term effectiveness of prevention programs does not depend on a family's initial level of functioning (Guyll, et al., 2004) but there is also emerging evidence that prevention efforts are, in fact, disproportionately effective with higher-risk individuals (Griffin, et al., 2003; Spoth, et al., 2008). At the same time, evidence from clinical trials suggests that lower-functioning families are less likely to attend community-based programs (Haggerty et al., 2002), supportive of our finding that the least functioning families are more likely to not participate. Thus, these programs might be missing those families most in need of them, and which would benefit the most from participating, clearly yielding less value than they could.
Appendix: The Steinberg-Cardell Test for Selectivity
Unlike RCTs, real world interventions often have restricted samples that provide information only on participants (i.e. have zero variance in the dependent variable), leaving analysts unable to identify selection using standard techniques such as logit or probit. Ideally, a choice-restricted sample could be augmented with surveys of the general population and estimated as an enriched sample, but such supplementary surveys are often costly and are not always feasible. At the same time, there are many widely available datasets which contain information on the exogenous variables of interest for the general population (e.g. census data, etc.) but fail to contain data on the participation choice of the individual. We will call this second kind of data a supplementary sample.
Several estimators allow us to estimate the parameters of the choice model when appropriate choice restricted and supplementary samples exist (Cosslett, 1981a, 1981b, 2007; Imbens, 1992; Steinberg & Cardell, 1992). The first estimator of this type was developed by Cosslett in 1981. Although Cosslett's 1981 estimator is theoretically appealing, it is extremely difficult to estimate in application. Other estimators in this class are essentially variations of Cosslett's original estimator which allow for easier application.
For reasons of computational simplicity and tractability, we chose to use the Cardell-Steinberg technique. This technique relies on weighting different parts of the classic log-likelihood function for a logistic regression to produce unbiased estimates of parameters. Because this technique is unfamiliar to many practitioners, we provide a brief description, much of which is taken from Steinberg and Cardell (1992).
The classic log likelihood function for a dichotomous outcome is
| (A1) |
Where P is an appropriate probability model, specifying that Pr (Yi = 1) = h(Xi B) for some known function h (in our estimation we use the logit model); β is a column vector of unknown parameters; Xi is a row vector of covariates; Pi = h (Xi, b) and i indexes the observations. Neither the choice-restricted samples nor supplementary samples by themselves can support the estimation of the model in equation (A1). Combined, however, the samples are sufficient to estimate the model. For expository purposes, assume that the entire population is surveyed in the supplementary sample, and all persons that participate in the program are surveyed in the choice restricted sample11. Then the likelihood function in (A1) can be rearranged and rewritten as
| (A2) |
This likelihood function can be broken up into its separate terms to better understand the Cardell-Steinberg technique. The first term is derived from the entire population (that is, the supplementary sample) and is equivalent to treating every observation as if it had a value of 0 for the response variable, Y. The second term comes from the choice restricted sample, and accumulates the correct log P term for observations having a value of 1 for the participation variable, Y. The third term is also calculated from the choice restricted sample, and acts as a correction to the first term by subtracting out precisely those values that were misclassified in the first term.
When the sampling rates are less than one, which is almost always the case, a related pseudo-likelihood can be applied to a pooled sample of supplementary and choice-restricted samples:
| (A3) |
wherer r0 is the sampling rate of the supplementary sample and r1 is the sampling rate of the choice restricted sample. N is the size of the supplementary sample, and I is the size of the combined samples. Steinberg-Cardell (1992) show that maximizing this pseudo-likelihood function results in unbiased, though inefficient, estimates of beta.12
Footnotes
For example, it may be that individuals who are least at risk for the behavior and most receptive to the program goals are most likely to attend, while those who are at greater risk and less receptive to the program goals fail to participate.
Berger and Exner (1999), and Berger and Christophi (2003) discuss how there may be nonrandom participation in RCTs as well.
For an interesting analysis on what may constitute “success” in the context of substance abuse outcomes, see Brent (1998)
As discussed in the introduction, higher levels of family functioning have been found to be associated with a lower likelihood of future substance abuse by the children in the family, hence our calling high functioning families “low risk”.
A study by Carlin and Sandy (1990), suggests that families with a higher opportunity cost of time are less likely to engage in the preventative activity of properly installing car seats.
We show U″(F) = 0 in this graph. The implications do not change if U″(F) <0.
In neither case does the second derivative of P(F) matter.
One alternative theory for decision making under uncertainty is Prospect Theory (PT). A model showing how PT can result in low risk families more likely to opt into the program than high risk families is available from the authors. However, as we show below, our empirical analysis does not support this idea.
To enable comparison of program attendees with non-attendees, the SFP evaluation was specifically designed to include measures assessing risk and protective factors that, besides being targets of the program, are also collected in the Healthy Youth Survey.
Alternative schools were significantly less likely to participate than regular schools (63% versus 82.7%). However, analysis of survey data patterns showed that school self- selection did not appear to bias survey results.
This assumption is note needed in actual application, but is included here to make the sums in (5) exact.
Cosslett provides an unbiased and efficient estimator of the probability model, but their estimator is notoriously difficult to estimate (Imbens, 1992). In several attempts to use the Cosslett estimator with our data, we found that the estimates were sensitive to initial parameter values. We therefore adopt the theoretically less appealing, but more easily applicable Steinberg-Cardell estimator.
Contributor Information
Prof. Robert Rosenman, Email: yamaka@wsu.edu, Washington State University, School of Economic Sciences, Hulbert Hall 101, Pullman, 99164-6210 United States
Mr. Scott Goates, Washington State University, School of Economic Sciences, Pullman, United States
Dr. Laura Hill, Washington State University, Human Development, Pullman, United States
References
- Bauman KE, Ennett ST, Foshee VA, Pemberton M, Hicks K. Correlates of Participation in a Family-Directed Tobacco and Alcohol Prevention Program for Adolescents. Health Education & Behavior. 2001;28(4):440. doi: 10.1177/109019810102800406. [DOI] [PubMed] [Google Scholar]
- Berger VW, Christophi CA. Randomization technique, allocation concealment, masking, and susceptibility of trials to selection bias. Journal of Modern Applied Statistical Methods. 2003;2(1):80–86. [Google Scholar]
- Berger VW, Exner DV. Detecting Selection Bias in Randomized Clinical Trials. Controlled Clinical Trials. 1999;20(4):319–327. doi: 10.1016/S0197-2456(99)00014-8. [DOI] [PubMed] [Google Scholar]
- Brent RJ. Estimating the effectiveness and benefits of alcohol treatment programmes for use in economic evaluations. Applied Economics. 1998;30(2):217–226. [Google Scholar]
- Carlin P, Sandy R. The value of time and the effect of fines on child car safety seat usage. Applied Economics. 1990;22(4):463. [Google Scholar]
- Cosslett SR. Maximum likelihood estimator for choice-based samples. Econometrica. 1981a;49(5):1289–1316. [Google Scholar]
- Cosslett SR. Efficient estimation of discrete choice models. Structural analysis of discrete data with econometric applications. 1981b:51–111. [Google Scholar]
- Cosslett SR. Efficient Estimation of Semi-Parametric Models By Smoothed Maximum Likelihood. International Economic Review. 2007;48(4):1245–1272. [Google Scholar]
- Dakof GA. Understanding gender differences in adolescent drug abuse: Issues of comorbidity and family functioning. Journal of Psychoactive Drugs. 2000;32(1):25–32. doi: 10.1080/02791072.2000.10400209. [DOI] [PubMed] [Google Scholar]
- DuRant RH, Smith JA, Kreiter SR, Krowchuk DP. The Relationship Between Early Age of Onset of Initial Substance Use and Engaging in Multiple Health Risk Behaviors Among Young Adolescents. Arch Pediatr Adolesc Med. 1999;153(3):286–291. doi: 10.1001/archpedi.153.3.286. [DOI] [PubMed] [Google Scholar]
- Foxcroft DR, Ireland D, Lister-Sharp DJ, Lowe G, Breen R. Longer-term primary prevention for alcohol misuse in young people: a systematic review. Addiction. 2003;98(4):397–411. doi: 10.1046/j.1360-0443.2003.00355.x. [DOI] [PubMed] [Google Scholar]
- Gensheimer LK, Ayers T, Roosa M. School-Based Preventive Interventions for At-Risk Populations: Practical and Ethical Issues. Evaluation and Program Planning. 1993;16(2):159–67. [Google Scholar]
- Gil AI, Molina JA. Human development and alcohol abuse in adolescence. Applied Economics. 2007;39(10):1315–1323. [Google Scholar]
- Griffin KW, Botvin GJ, Nichols TR, Doyle MM. Effectiveness of a universal drug abuse prevention approach for youth at high risk for substance use initiation. Preventive Medicine. 2003;36(1):1–7. doi: 10.1006/pmed.2002.1133. [DOI] [PubMed] [Google Scholar]
- Guyll M, Spoth RL, Chao W, Wickrama KAS, Russell D. Family-focused preventive interventions: Evaluating parental risk moderation of substance use trajectories. Journal of Family Psychology. 2004;18(2):293–301. doi: 10.1037/0893-3200.18.2.293. [DOI] [PubMed] [Google Scholar]
- Haggerty K, Fleming CB, Lonczak HS, Oxford ML, Harachi TW, Catalano RF. Predictors of Participation in Parenting Workshops. The Journal of Primary Prevention. 2002;22(4):375–387. doi: 10.1023/A:1015227623145. [DOI] [Google Scholar]
- Heinrichs N, Bertram H, Kuschel A, Hahlweg K. Parent recruitment and retention in a universal prevention program for child behavior and emotional problems: Barriers to research and program participation. Prevention Science. 2005;6(4):275–286. doi: 10.1007/s11121-005-0006-1. [DOI] [PubMed] [Google Scholar]
- Heinrichs N, Bertram H, Kuschel A, Hahlweg K. Parent Recruitment and Retention in a Universal Prevention Program for Child Behavior and Emotional Problems: Barriers to Research and Program Participation. Prevention Science. 2005;6(4):275–286. doi: 10.1007/s11121-005-0006-1. [DOI] [PubMed] [Google Scholar]
- Hill LG, Goates S, Rosenman R. Detecting Selection Bias in Community Disseminations of Universal Family-Based Prevention Programs. American Journal of Public Health. 2010;100(4):623–630. doi: 10.2105/AJPH.2008.154112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Imbens G. An Efficient Method of Moments Estimator for Discrete Choice Modles with Choice-Based Sampling. Econometrica. 1992;60(5):1187–1214. [Google Scholar]
- Leung A. Delinquency, schooling, and work: time allocation decision of youth. Applied Economics. 2004;36(9):987–993. [Google Scholar]
- Monitoring the Future Website. Monitoring the Future. 2010. Retrieved March 14, 2010, from http://www.monitoringthefuture.org/
- Mrazek PB, Haggerty RJ. Reducing risks for mental disorders: frontiers for preventive intervention research. National Academy Press; 1994. [PubMed] [Google Scholar]
- Murphy S, Rosenman R, Yoder JK, Friesner D. Patient's Perceptions and Treatment Effectiveness. Applied Economics. forthcoming. [Google Scholar]
- Office of National Drug Control Policy. The economic costs of drug abuse in the United States 1992-1998. 2001. Retrieved 08/24/2009. from http://www.whitehousedrugpolicy.gov//publications/pdf/economic_costs98.pdf.
- Orrell-Valente JK, Pinderhughes EE, Valente E, Laird RD, Bierman KL, Coie JD, Dodge KA, et al. If It's Offered, Will They Come? Influences on Parents' Participation in a Community-Based Conduct Problems Prevention Program. American Journal of Community Psychology. 1999;27(6):753–783. doi: 10.1023/a:1022258525075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmidt SE, Liddle HA, Dakof GA. Changes in parenting practices and adolescent drug abuse during multidimensional family therapy. Journal of Family Psychology. 1996;10(1):12–27. [Google Scholar]
- Spoth R, Redmond C. Parent motivation to enroll in parenting skills programs: a model of family context and health belief predictors. Journal of Family Psychology. 1995;9(3):294–310. [Google Scholar]
- Spoth R, Redmond C, Haggerty K, Ward T. A controlled parenting skills outcome study examining individual difference and attendance effects. Journal of Marriage and the Family. 1995;57(2):449–464. [Google Scholar]
- Spoth R, Redmond C, Hockaday C, Shin CY. Barriers to Participation in Family Skills Preventive Interventions and Their Evaluations: A Replication and Extension. Family Relations. 1996;45(3):247–254. [Google Scholar]
- Spoth R, Redmond C, Shin C, Lepper H, Haggerty K, Wall M. Risk moderation of parent and child outcomes in a preventive intervention: A test and replication. American Journal of Orthopsychiatry. 1998;68(4):565–579. doi: 10.1037/h0080365. [DOI] [PubMed] [Google Scholar]
- Spoth R, Redmond C, Shin CY. Randomized trial of brief family interventions for general populations: Adolescent substance use outcomes 4 years following baseline. Journal of Consulting and Clinical Psychology. 2001;69(4):627–642. doi: 10.1037//0022-006x.69.4.627. [DOI] [PubMed] [Google Scholar]
- Spoth R, Trudeau L, Shin C, Redmond C. Long-term effects of universal preventive interventions on prescription drug misuse. Addiction. 2008;103(7):1160. doi: 10.1111/j.1360-0443.2008.02160.x. [DOI] [PubMed] [Google Scholar]
- Spoth RL, Redmond C, Kahn JH, Shin C. A prospective validation study of inclination, belief, and context predictors of family-focused prevention involvement. Family Process. 1997;36(4):403–429. doi: 10.1111/j.1545-5300.1997.00403.x. [DOI] [PubMed] [Google Scholar]
- Steinberg D, Cardell NS. Estimating logistic regression models when the dependent variable has no variance. Communication in Statistics Theory and Methods. 1992;21(2):423–450. [Google Scholar]
- Washington State Department of Health. Healthy Youth Survey. 2008. Retrieved from http://www3.doh.wa.gov/hys.
- Weinstein ND. Testing Four Competing Theories of Health-Protective Behavior. Health Psychology. 1993;12:324–324. doi: 10.1037//0278-6133.12.4.324. [DOI] [PubMed] [Google Scholar]
