Abstract
Background
Similar to other fields, a targeted behavioral medicine perspective can aid decision-making related to participant–intervention matching.
Purpose
To present one potentially useful definition of intervention targeting activity; describe potential targeting domains of particular relevance to behavioral medicine; discuss different statistical approaches to aid the targeted intervention development process; and discuss the challenges and opportunities accompanying the incorporation of targeted intervention development methods into behavioral randomized clinical trial (RCT) research.
Methods
Drawing from recent conceptual work by the MacArthur group and other scientists in the field, methods and approaches to undertaking moderator analysis are discussed.
Results
Examples of moderator analyses are provided which reflect the different statistical methods and variable domains that may serve as moderators of intervention success.
Conclusions
The recommended exploratory work can help to make the most efficient use of RCTs to identify the best paths for subsequent RCT development in a resource-constrained era.
Keywords: Targeted intervention, Moderator analysis, RCT, Exploratory analysis
Introduction
The latter half of the twentieth century has brought major advances in the development of effective behavioral interventions for a wide range of health issues and conditions facing our nation [1]. Among the health contributions made by the behavioral and social sciences are advances in methods and interventions for reducing tobacco use [2]; contributions to behavioral risk factor management that have contributed to reductions in cardiovascular disease and enhanced chronic disease management [3, 4]; successes in demonstrating that lifestyle intervention (i.e., regular physical activity and reasonable weight loss) can reduce the risk of developing type 2 diabetes in high-risk adults [5–7]; and continued contributions to the prevention and control of HIV/AIDS [8].
The advent of such advances has brought an increased awareness that, as in other health fields, ‘one size does not fit all’ [9]. In fact, a growing emergence of ‘individualized’ or targeted medicine approaches to intervention has occurred in a number of fields, spearheaded by the latest discoveries in areas such as pharmacogenomics that have begun to identify patient subgroups with particular genotypes that may predispose them to respond favorably or poorly to pharmacological interventions [10]. The strengths of a targeted perspective, whereby subgroups of people are assigned to specific interventions based on certain characteristics [10], include the potential for enhanced intervention efficacy as well as cost savings accrued through matching intervention-relevant resources to participant requirements [11]. Given these strengths, identification of ways to enhance program targeting is currently recognized as a key public health goal [12–16]. The targeted medicine perspective, in essence, modifies the question typically asked in randomized clinical trials (RCTs) and similar intervention studies from ‘does this intervention work?’ to ‘does this intervention work, and for whom and under what conditions?’ [16–18]. The targeted medicine perspective also aids decision-making related to which participants should not receive an intervention (i.e., because of lack of benefits, side effects or other types of adverse events, etc.) [16, 18, 19]. Similar to the long-standing scientific concern related to overgeneralizing RCT results to populations excluded from the RCT [20], the targeting perspective aims to better understand the extent to which the results from a RCT may (or may not) apply to each subpopulation included in the RCT [21].
The purpose of this paper is to:
present one potentially useful definition of targeting within the context of intervention development, adaptation, and delivery;
describe potential targeting domains that are of particular relevance to behavioral medicine;
briefly discuss the use of different statistical approaches and methods to aid the targeted intervention development process; and
discuss the challenges and opportunities accompanying the incorporation of targeted intervention development methods into behavioral RCT research.
We acknowledge that what we are describing is one of a range of perspectives related to targeting and applications of exploratory investigation in the intervention development field. Our hope is that the perspectives represented in this paper may stimulate further discussion of these issues as they relate to the behavioral medicine arena.
The current article focuses specifically on RCT research. Subsequent dismantling of interventions already shown to be effective to better understand their ‘mechanisms of action’ is seen as an appropriate next step after a successful RCT to optimize the cost-effectiveness of such treatments.
Definition of Terms
Whereas the term ‘targeting’ has been used in a variety of ways [22], for purposes of this paper we define targeting as the systematic matching of subpopulations of individuals to specific interventions based on the characteristics of the subpopulation (i.e., moderators) (Table 1). Table 2 includes examples of person-related targeting domains as well as the types of intervention-related contextual variables (i.e., the conditions surrounding intervention delivery) that may be applicable in optimizing intervention success for different subpopulations. Our perspectives build on the lessons learned from previous efforts to enhance intervention efficacy through client targeting [23], including the recognition of the importance of choosing targeting characteristics that have a specific theoretical and empirical basis, pursuing targeting research in a defined population group, exploring empirically derived combinations of stable, easy-to-measure participant characteristics (as opposed to single characteristics), and evaluating the robustness of the targeting variables using several interventions that are distinct with respect to potentially important intervention matching dimensions (e.g., intervention content, delivery source, channel, location, etc.) [11, 15, 23, 24].In addition, we recognize that, because of budgetary constraints, many experimental studies are limited with respect to being adequately powered to evaluate statistically significant moderator (i.e., subgroup×treatment) effects. Approaches to exploring subgroup effects within such constraints are discussed in the statistical methods section below. Our definition of ‘targeting’ is directed toward choosing which of two or more interventions is best for a particular subgroup of individuals [17]. By way of analogy, ‘targeting’ is analogous to selecting which jacket is best suited to the customer’s needs. This paper is focused primarily on the targeting endeavor.
Table 1.
Definitions based on the MacArthur group recommendations | |
---|---|
Targeting | Systematic matching of subpopulations of individuals to specific interventions based on the characteristics of the subpopulation (i.e., moderators) and the intervention. |
Tailoring | Adapting a particular intervention that individuals are receiving to the specific needs of the individual. |
Prediction | Use of information available at a given time (e.g., before an intervention is delivered or after early response to an intervention is observed) to identify which participants may or may not respond to a given intervention. |
Table 2.
Person-related targeting domains | Intervention-relevant contextual factors potentially influencing intervention success |
---|---|
Demographic | Content |
Cultural | Delivery channel |
Behavioral | Delivery source |
Psychosocial | Dose |
Biological | Location |
Genetic | Social context |
Combinations of above | Timing |
We consider the above targeting endeavor (i.e., selecting to whom to give which intervention using baseline information about participants, also referred to by some researchers as ‘matching’) to differ from the more dynamic individual ‘tailoring’ activities occurring within a particular intervention [25]. Whereas there are, similar to definitions of targeting, different definitions of tailoring that have been used in the scientific literature, for purposes of the present discussion we apply the MacArthur group’s definition of ‘tailoring’ with respect to adapting a particular intervention the individual is receiving to the specific needs of that individual [26], i.e., akin to altering the jacket selected to the specific measurements of the customer. Within this context, an example of tailoring is adapting problem-solving activities found in many cognitive–behavioral interventions to the specific barriers experienced by individual participants during treatment.
Both ‘targeting’ and ‘tailoring’ of interventions to individuals are vital to consumer health and safety as well as reducing unnecessary societal health costs. However, it is often the case in randomized clinical trials that both issues tend to be ignored, and the two terms are often confused with one another and with activities focusing on the prediction of intervention response. To aid efforts concerning the use of more uniform definitions of these terms in the health and mental health fields, the MacArthur group has defined ‘prediction’ as an activity that does not necessarily involve comparison of two or more interventions as does ‘targeting’ nor consideration of two or more versions of a particular intervention as does ‘tailoring’. Rather, they define ‘prediction’ as using what information is available at a given time to identify those individuals who may do better or worse with a particular intervention. Whether the participant would have fared as well or more poorly with other interventions (or other versions of an intervention) is not considered. Thus, one might choose to deliver a specific intervention to certain types of individuals defined by predictors, ignoring the fact that those individuals might have done as well or better with no intervention at all or with another intervention, i.e., the comparison with other types of interventions is lost.
Whereas exploring predictors of success (or failure) within a particular intervention does not provide specific information related to targeting subgroups for one type of intervention versus another, it can provide other types of useful information related to intervention refinements. For example, predictor analysis has been used to identify persons at risk for long-term relapse after smoking cessation interventions [27], participant subgroups that may or may not respond to community cardiovascular disease interventions [28], and subgroups that may respond well or poorly to automated health promotion interventions [29]. Exploratory predictor analysis can also be used as part of post-RCT intervention dissemination research (which is typically limited to pre–post designs) to identify subgroups for which intervention refinements may be particularly indicated. For instance, in exploratory analysis of predictors of success in the Robert Wood Johnson Foundation’s Active for Life Physical Activity Dissemination Initiative [30], participant subgroups reporting living in neighborhoods with heightened traffic and crime levels were less successful in increasing physical activity via an evidence-based telephone-assisted intervention relative to subgroups reporting living in more supportive neighborhoods for walking [31]. Such results set the stage for experimental studies evaluating further refinements of the intervention for these less successful subgroups.
The above issues notwithstanding, clinical (or public health) decisions always involve choices between two or more courses of action, which is why randomized clinical trial (RCT) methodology requires a control or comparison group against which the efficacy or effectiveness of an intervention is assessed. As is widely recognized in the scientific community, pre–post changes within one intervention often reach statistical significance, yet such changes may be because of statistical artifacts rather than clinically significant changes in the condition or behavior of the participant. For such reasons, it is considered prudent to base targeting decisions on variations in the effect size of an intervention relative to the comparison condition that are associated with information known about the participant before intervention onset, i.e., on the moderators of intervention response. Targeted intervention development may also involve adapting interventions to accommodate the needs of future target groups.
Whereas tailoring and prediction activities represent important aspects of the intervention endeavor [32, 33], the focus of this article is on the choice of which intervention to deliver to specific participant subpopulations based on initial or baseline information about those participants (i.e., targeting). In the case of targeting activities of this type, the goal is to assign a particular type of intervention to a specific subpopulation of individuals who are homogeneous on one or more (ideally) easy-to-measure characteristics (i.e., moderators) [21].
Exploratory Activities to Aid Intervention Targeting—Background and Rationale
Exploratory data analysis has long been recognized as a key component of scientific inquiry that has been supported for a number of decades by well-respected statisticians and methodologists as well as national scientific organizations [34–37]. The aim of exploratory data analysis is primarily hypothesis generation, as opposed to hypothesis testing. A well-known example of such exploratory analysis is represented in the investigative activities occurring as part of the Human Genome Project. As such, exploratory data analyses are guided by a set of principles and procedures that generally differ from the principles that apply to hypothesis-testing investigations, given the different goals associated with these two avenues of gaining knowledge. Such differences can include relaxing or suspending formal determinations of power and statistical adjustments related to type I error [34–36]. However, the caveats attendant in using p values in the hypothesis-generating context as simple ‘sign posts’ that indicate that further scientific attention may be warranted as opposed to as tests of formal hypotheses need to be made explicit [34–36]. Perhaps because of the growing familiarity and use of RCT methods in the behavioral medicine field as well as other scientific arenas, however, scientists often express discomfort when faced with exploratory analyses that do not apply hypothesis-testing statistical methods, although doing so can severely limit the utility of exploratory (hypothesis-generating) activities [34–37]. We believe that further discussions of the principles and procedures accompanying exploratory research as discussed by Tukey, Behrens, and other luminaries in the statistical field can enhance behavioral medicine inquiry.
Recent discussions of the utility and methods underlying moderator and related forms of subgroup analysis occurring within the context of RCTs have been published in major medical journals [21, 38]. As part of these discussions, it is useful to distinguish between subgroup analyses that divide the total sample into smaller distinct subgroups and test for treatment effects within each subgroup, and moderator analyses (the focus of this article) that use the entire sample and test for differential treatment effects associated with baseline variables.
Some methodologists have recommended formal testing and reporting of a priori moderator effects with appropriate adjustments for multiple testing (including the number of interaction effects one would expect to reach statistical significance through chance alone) in the primary publication of a clinical trial [38]. This is a reasonable and prudent approach when likely moderator effects can be appropriately specified and powered for in the RCT design. There is often, however, insufficient evidence to support a priori hypotheses related to moderator effects in many intervention areas. Given this situation and the potential clinical and public health value of throwing a wider ‘net’ in attempting to better understand combinations of variables that may be linked to intervention success or failure, alternative methods deserve consideration. Hypothesis-generating exploratory research activities, undertaken as part of secondary data articles generated from RCT investigations, can help identify potential moderators of intervention for future hypothesis testing. The results from such hypothesis-generating activities can thus serve as the pilot work to inform the next generation of RCTs aimed at formally testing such moderator effects [21].
The ‘bottom line’ with such exploratory lines of inquiry is that they require subsequent formal testing using standard hypothesis-testing methods before deciding whether they are truly important in advancing interventions or simply ‘noise’. Yet, by foregoing such exploratory data analysis as a standard part of secondary clinical trials analysis, the risk of missing serendipitous findings or unexpected associations that could be used in the subsequent development and testing of refined interventions may be increased. Unfortunately, the testing of ambiguous interventions that were not substantively informed by empirical observations or evidence before being formally evaluated in an RCT design has presented problems in a variety of scientific fields, including behavioral ones [23].
It is unfortunate that hypothesis-generating activities have received the dubious label in many scientific quarters of being ‘fishing expeditions.’ Such negative labels are likely a response to published exploratory analyses that have drawn specific scientific conclusions from the results (i.e., turning hypothesis-generating activities into hypothesis-testing activities). Drawing such conclusions from exploratory analyses is clearly inappropriate. As noted in the guidelines summarized by the Task Force on Statistical Inference of the APA Board of Scientific Affairs, “Each form of research [including exploratory research] has its own strengths, weaknesses, and standards of practice” ([34–37], p. 594).
The prime utility of exploration is to generate hypotheses concerning how to advance or improve interventions that can subsequently be tested using experimental designs. Whereas it is certainly possible that exploratory analyses may yield little constructive information, it is, in our experience, more likely that such analyses will provide useful observations that can be applied subsequently in experimental testing of refined versions of an intervention in the population subgroups that may most benefit from them (i.e., intervention targeting) [39]. An excellent example of exploratory research aimed at better understanding potential subgroup effects related to a multisite RCT is the work published by Schneiderman et al. related to the Enhancing Recovery in Coronary Heart Disease (ENRICHD) trial that suggests that whereas the overall trial showed no differences between treatment and control arms, a moderator analysis indicated that white men, although not other subgroups, may have benefited from the ENRICHD intervention with respect to medical endpoints [40]. Such exploratory work can set the stage for the development of more powerful interventions for other subgroups (e.g., women, ethnic minority men) that were found to receive less benefit from the original intervention [40].
Targeting Dimensions: A Myriad of Choices
Growing awareness of the multidimensional influences impacting health behaviors has led to greater acceptance of more complex conceptual models for understanding health behavior change [1, 41]. A similar understanding has developed concerning the variety of dimensions of potential relevance for intervention targeting. Targeting variables, also known as moderators, have been defined by the MacArthur group as baseline variables that are uncorrelated with the intervention (a result of randomization) and delineate subgroups that may be more or less successful with the intervention to which they have been assigned relative to another intervention [17, 21, 33, 42]. Such variables can be used in subsequently matching target groups to appropriate interventions as well as in adapting interventions to accommodate the needs of a target group.
As shown in Table 2, targeting variables generally consist of person-related variables. Whereas the most typical person-related variables used in intervention matching have been demographic variables, a number of other domains (e.g., psychosocial [including patient preferences], behavioral, cultural, biological, genetic) can also be used in matching persons with interventions that may promote enhanced intervention success [39, 43, 44]. For instance, recent advances in diagnostic methods related to breast cancer have added potential moderators (e.g., estrogen receptor status) to the cancer treatment field [45]. Similarly, potential behavioral moderators (e.g., number of cigarettes smoked per day) of successful smoking cessation intervention have been reported [46], as well as potential biologic moderators (e.g., insulin resistance) of weight loss dieting success in overweight individuals [47–49].
Table 2 additionally summarizes the types of intervention-relevant contextual domains to be considered in modifying interventions to optimize subgroup success. Intervention-related dimensions include delivery source (e.g., health educator, automated delivery agents), channel (e.g., face-to-face, print, internet, electronic devices such as cell phones, etc.), content (e.g., the health behaviors being targeted, instructional orientation), timing (e.g., daily, weekly, during ‘critical periods’ of development), dose (e.g., contact duration, frequency), social context (e.g., individual, group-based, internet chat room), and location (e.g., health setting, home, neighborhood, school, work-place). For instance, in the instance of insulin-resistant persons participating in weight loss interventions referred to earlier [47–49], some studies have suggested the potential importance of dietary content (i.e., low-glycemic load foods) in enhancing weight loss success. Certain intervention-relevant domains (e.g., delivery source, channel, dose) may also hold promise for curbing costs and extending intervention reach into diverse population segments [39, 43, 44].
Finally, with the increased appreciation of the utility of ecological frameworks in influencing health behaviors and other aspects of health [50], physical environment variables (both objective and perceived) could also conceivably be added as a potential targeting domain in future research (for example, persons living in particular neighborhood environments could be treated as a subgroup for intervention targeting purposes relative to persons living in different environments).
Innovative RCT designs have been increasingly applied in some health fields to advance knowledge related to treatment matching and subsequent intervention success [51, 52]. For example, in the Sequenced Treatment to Relieve Depression (STAR*D) trial, the goal was to more accurately mimic actual clinical practice by offering patients who did not achieve initial citalopram-related remission of their depression or who experienced too many side effects a choice related to their subsequent treatment [53, 54]. In this manner, different subgroups of patients could be identified with respect to future matching of specific patients and interventions [55].
In addition to such innovative types of designs, RCTs and other experimental research present an excellent opportunity to collect subgroup information that can inform subsequent targeting activities. A critical time point for collecting such information is often at baseline assessment. Planned exploratory analysis using baseline variables, occurring after the intent-to-treat analysis (i.e., primary outcomes paper) is completed, can in turn suggest hypotheses to be tested in follow-up experimental research aimed specifically at testing those hypotheses [46, 56]. To broaden the range of potential moderators of intervention effects suggested by these secondary exploratory analyses, health areas can benefit from the application of potentially relevant theories that have received less systematic attention in the target area, in addition to including variables identified as potentially important in the empirical literature. An example of this type of cross-disciplinary work is the increasing exploration of self-determination theory—developed and applied originally in nonhealth arenas—in the physical activity [57, 58] and smoking [59] intervention fields. To enhance eventual intervention dissemination, it may be particularly useful to focus on those variables that could be easily assessed in nonresearch settings.
Statistical Approaches to Aid the Targeted Intervention Development Process
There are several readily available statistical methods to explore moderators of intervention. The most common involves linear modeling. With a continuous outcome measure, an appropriate example is linear regression analysis. With an analysis focused on time to an event (e.g., time to remission), an example is Cox proportional hazards model. With a binary outcome, an example is logistic regression methods (which are mathematically related to linear discriminant analysis) [60]. In all of these regression analyses, a ‘risk’ score is developed that can be applied to each individual, higher values of which are associated with less-preferred outcomes [61]. Decisions concerning how many or what types of baseline predictor variables to enter into the regression analyses, including which variable interactions are specified and what type of variable entry method is applied (e.g., simultaneous, stepwise), are typically based on investigator judgment in combination with sample size and variable collinearity considerations [62].
In regression analysis, the researcher can test for moderation by including as independent variables intervention assignment, the baseline variable of interest, and the interaction between intervention assignment and the baseline variable [17, 21]. A significant interaction indicates a possible moderator effect (to be evaluated further through additional evaluation of clinical significance or relevance). An example of this type of moderator analysis occurred in a recently published examination of perceived environmental factors as potential moderators of physical activity intervention effects across three RCT data sets collected as part of the NIH Behavior Change Consortium [63]. Moderator analysis of the three RCTs (located in Atlanta, GA; the San Francisco peninsula region; and Eugene, OR) showed that perceived neighborhood traffic safety issues (e.g., absence of pedestrian crosswalks, speeding drivers) were significant moderators of the intervention–PA relationship across all three RCTs. For each RCT, physical activity intervention subjects endorsing negative neighborhood traffic had significantly less physical activity increases relative to intervention subjects without these negative traffic conditions (differences >90 min of physical activity per week), whereas control subjects showed no such impact on their physical activity levels (i.e., negative traffic conditions appeared to moderate the impact of the physical activity intervention in all three RCTs) [63].
Under the assumptions of the above types of linear models, the effect sizes used are most frequently variations of Cohen’s d (the standardized mean differences between groups). Recently, univariate nonparametric methods have also been developed [64] based on the use of ‘numbers to treat’ (NTT), ‘area under the curve’ (AUC), or ‘success rate difference’ (SRD), effect sizes increasingly recommended [65, 66] for their clinical interpretability. Multivariate nonparametric approaches include various recursive partitioning models [28, 62, 67, 68], such as classification and regression tree analysis (C&RT) [69] and signal detection methods [70]. Such recursive partitioning models have been increasingly used in behavioral medicine RCT research to begin to identify subgroups, defined by a combination of baseline variables, which were more or less successful with one type of intervention versus another. For example, exploratory signal detection moderator analysis of a 2-year physical activity RCT in initially sedentary, healthy 50- to 65-year-old adults indicated that the least successful subgroup (success being defined as achieving at least 66% adherence to the physical activity intervention at 2 years) consisted of those participants who had been randomized to the community-based exercise class intervention and who had baseline body mass index (BMI) values >27 [39]. Only 8% of this subgroup achieved the above level of intervention success. The most successful subgroup identified via this method consisted of participants who at baseline had average or below-average ratings of stress, below-average fitness levels measured via treadmill exercise testing, ≤12 years of education, and who had been randomized to the telephone-assisted home-based exercise intervention (69% of them met the success criteria at 2 years) [39]. Similar recursive partitioning methods have been used to explore moderators of intervention success in other behavioral medicine areas such as weight loss [44].
How Examination of Targeting Variables Fits into the Intervention Development Continuum
After suitable initial intervention development and pilot work, the continuum of research aimed at successful intervention evaluation typically consists of efficacy trials which optimize internal validity, subsequent dismantling of effective interventions to better understand their ‘mechanisms of action,’ effectiveness trials aimed at broadening the operational aspects of the intervention to increase suitability to ‘real-world’ settings, and translation/dissemination efforts and activities to increase uptake of the intervention across relevant segments of the population [71]. RCTs remain the most powerful design for establishing the causal link between an intervention and desirable health outcomes [72, 73]. Given, however, the relatively high costs of RCTs, it has become increasingly important for scientists to obtain the maximum amount of information available from well-run RCTs and similar experimental research to inform subsequent scientific and public health activities, particularly during restricted funding climates [18, 21]. In addition to the exploratory moderator activities described in this article, increased attention in the behavioral medicine field has been focused on including information on intervention reach (i.e., percentage of eligible individuals in the target population who participate) and other aspects reflecting the external validity (generalizability) of RCT research [74]. Collecting both types of information as part of the experimental endeavor can serve to expedite translation and dissemination efforts that are at the core of RCT research aimed at serving the public health [71].
Discussion
Identifying what behavioral interventions work best for whom and under what conditions (i.e., which programs for which subgroups under which environmental contexts to achieve which outcomes) remains a critical challenge for researchers and public health policymakers. This paper discusses some key conceptual and methodological issues in developing targeted behavioral interventions. Measuring a diverse set of theoretically and empirically relevant baseline variables provides relevant information for planned exploratory analyses that can, in turn, set the stage for subsequent systematic, hypothesis-testing investigations. It is likely that if there are moderators ‘lurking’ in our data and we do not evaluate them, we may end up with attenuated effect sizes and nonsignificant results. By the same token, there are challenges attendant in discovering many moderators, as this can mean larger numbers of stratification variables and increased study complexity down the road. However, identification of a large number of moderators is rare in our experience [44, 63].
Once the above types of preliminary results have been obtained, the next phase in this development process involves conducting experimental investigations to verify the causal relationships between potentially relevant subgroups at increased risk for intervention success or failure and appropriately matched interventions. One method of doing this would be to randomize individuals representing the specifically targeted subgroup to either a ‘matched’ or ‘mismatched’ intervention. For example, as suggested in the Behavior Change Consortium moderator analysis of environmental factors and physical activity change [63], one could randomize the subpopulation of individuals residing in traffic-congested neighborhoods to receive either a standard physical activity intervention or the intervention coupled with additional instructional or neighborhood-based activities related to overcoming traffic-related physical activity barriers. If, in fact, the differential physical activity rates suggested from the post hoc exploratory analyses were verified using the experimental design, confidence in the subsequent matching of this specific subpopulation to the environmentally enhanced physical activity intervention would be increased.
It is important to underscore that, with all such exploratory investigations, the utility of the data generated is influenced by the level of theory applied as well as the adequacy of the psychometric properties of the measures being used. This perspective emphasizes the observation that every good study is both theory- and data-driven (i.e., the relationships between theory and data are transactional and iterative in nature).
Challenges of Incorporating Targeted Intervention Development Methods into RCT Research
Among the challenges attendant with the above systematic approach to targeted intervention development are:
Population specificity issues. It is critical that the population under study be defined clearly enough that subsequent research can be undertaken with the same population. Similarly, the sample needs to be sufficiently heterogeneous to allow for the detection of moderators.
Planning issues. The exploratory, hypothesis-generating approach being described will be most useful if researchers carefully consider, up front, a broad range of theoretically and empirically derived variables that may serve as intervention moderators. This requires care, forethought, and a willingness to engage researchers from other disciplines and perspectives as a means of broadening the types of moderator variables under consideration.
Complexity issues. It is becoming increasingly clear that the moderator field will advance most rapidly if the evaluation of higher-order interactions among baseline variables is considered and evaluated from the start. The increasing familiarity with and acceptance of nonparametric risk classification statistical methods as a means of identifying such higher-order interactions in the medical and behavioral sciences fields represents an important methodological advance of which researchers should take advantage [69]. Given the exploratory nature of such approaches, it remains important to replicate and verify the robustness of the identified higher-order interactions before full implementation of a behavioral intervention based on such targeting information [75]. It also bears repeating that the moderator analyses that we are proposing, which directly compare two or more interventions ‘head to head’ in identifying any subgroup effects, are distinct from forms of subgroup analysis that are undertaken within a particular intervention [38].
Relevance issues. It is important that baseline variables be chosen based not only on potential theoretical and empirical relevance, but also with an eye to feasibility of use in ‘real-world’ settings. Baseline variables consisting of short paper-and-pencil measures that are readily scored and feasible for community settings have distinct advantages in this regard.
Resource issues. Although reasonably inexpensive paper-and-pencil measures can be identified for demographic and many behavioral and psychosocial constructs of potential relevance to intervention matching, measures of biological and genetic variables invariably involve additional costs that may pose a problem for researchers and, by extension, community providers who constitute the ultimate intervention delivery source. Continued efforts to measure such biological variables in a manner that is reasonably inexpensive and convenient will aid future intervention translation efforts. Such advances have occurred over the past 20 years, including automated blood pressure units [76], simple finger-stick methods for collecting and analyzing blood samples [77], and portable devices for continuous glucose monitoring and ambulatory skin impedance/body temperature monitoring [78].
Finally, a separate resource issue involves the development of increasingly low-cost and wider-reach intervention approaches. The recent advances being witnessed in the communication technology field provide a means for advancing intervention delivery and reach through use of electronic and other mediated interfaces [79]. Such advances offer a potential expansion of interventions into population segments that heretofore have rarely sought out health interventions. Identifying which types of intervention delivery channels, sources, and messages will most appropriately serve the needs of such subgroups remains a major public health challenge that the targeted intervention field will hopefully be able to ultimately address.
In summary, whereas RCTs remain the bedrock of systematic scientific advances, the exploratory work discussed in this paper can help to make the most efficient use of RCTs to identify the best paths for subsequent RCT development. This approach can help to facilitate the advancement of the behavioral medicine field in a resource-constrained era. As noted by the late Michael J. Mahoney in “Suffering, Philosophy, and Psychotherapy” (2005):
Science is about questions, about quests. The best questions are those that explore the edges or center of our understanding. The best answers are those that lead to better questions and future quests
(pp. 343, 345).
Acknowledgments
This work was funded in part by Public Health Service Grant #R01AG021010 from the National Institute on Aging awarded to Dr. King. The views expressed in this paper represent those of the authors and not of the National Cancer Institute.
Contributor Information
Audie A. Atienza, Division of Cancer Control and Population Sciences, Behavioral Research Program, Health Promotion Research Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
Helena C. Kraemer, Department of Psychiatry and Behavioral Science, Stanford University School of Medicine, Stanford, CA 94305, USA
References
- 1.Office of Behavioral and Social Sciences Research. Healthier Lives through Behavioral and Social Sciences Research. Bethesda, MD: OBSSR, National Institutes of Health; 2006. [Google Scholar]
- 2.Ockene JK, Emmons KM, Mermelstein RJ, et al. Relapse and maintenance issues for smoking cessation. Health Psychol. 2000; 19: 17–31. [DOI] [PubMed] [Google Scholar]
- 3.Haskell WL, Alderman EL, Fair JM, et al. Effects of intensive multiple risk factor reduction on coronary atherosclerosis and clinical cardiac events in men and women with coronary artery disease: The Stanford Coronary Risk Intervention Project (SCRIP). Circulation. 1994; 89: 975–990. [DOI] [PubMed] [Google Scholar]
- 4.Lorig KR, Mazonson PD, Holman HR. Evidence suggesting that health education for self-management in patients with chronic arthritis has sustained health benefits while reducing health costs. Arthritis Rheum. 1993; 36: 439–446. [DOI] [PubMed] [Google Scholar]
- 5.Diabetes Prevention Program Research Group. Reduction in the incidence of type 2 diabetes with lifestyle intervention or metformin. New Engl J Med. 2002; 346: 393–403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tuomilehto J, Lindstrom J, Eriksson JG, et al. Prevention of type 2 diabetes mellitus by changes in lifestyle among subjects with impaired glucose tolerance. N Engl J Med. 2001; 344: 1343–1350. [DOI] [PubMed] [Google Scholar]
- 7.Pan XR, Li GW, Hu YH, et al. Effects of diet and exercise in preventing NIDDM in people with impaired glucose tolerance. The Da Qing IGT and Diabetes Study. Diabetes Care. 1997; 20: 537–544. [DOI] [PubMed] [Google Scholar]
- 8.Durantini MR, Albarracin D, Mitchell AL, Earl AN, Gillette JC. Conceptualizing the influence of social agents of behavior change: A meta-analysis of the effectiveness of HIV-prevention interventionists for different groups. Psychol Bull. 2006; 132: 212–248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Christensen AJ. Patient-by-treatment context interaction in chronic disease: A conceptual framework for the study of patient adherence. Psychosom Med. 2000; 62: 435–443. [DOI] [PubMed] [Google Scholar]
- 10.Evans WE, Relling MV. Moving towards individualized medicine with pharmacogenomics. Nature. 2004; 429: 464–468. [DOI] [PubMed] [Google Scholar]
- 11.Owens EB, Hinshaw SP, Kraemer HC, Arnold LE, Abikoff HB, Cantwell DP, et al. Which treatment for whom for ADHD? Moderators of treatment response in the MTA. J Consult Clin Psychol. 2003; 71: 540–552. [DOI] [PubMed] [Google Scholar]
- 12.Orleans TC, Gruman J, Ulmer C, Emont SL, Hollendonner JK. Rating our progress in population health promotion: Report card on six behaviors. Am J Health Promot. 1999; 14: 75–82. [DOI] [PubMed] [Google Scholar]
- 13.Campbell MK, Tessar O, DeVellis B, et al. Effects of a tailored health promotion program for female blue-collar workers: Health works for women. Prev Med. 2002; 34: 313–323. [DOI] [PubMed] [Google Scholar]
- 14.King AC, Bauman A, Calfas K. Innovative approaches to understanding and influencing physical activity. Am J Prev Med. 2002; 232 Suppl: 56–63. [Google Scholar]
- 15.Mattson ME, Allen JP. Research on matching alcoholic patients to treatments: Findings, issues, and implications. J Addict Dis. 1991; 11: 33–49. [DOI] [PubMed] [Google Scholar]
- 16.Kent D, Hayward R. When averages hide individual differences in clinical trials. Am Sci. 2007; 95: 60–68. [Google Scholar]
- 17.Kraemer HC, Wilson GT, Fairburn CG, Agras WS. Mediators and moderators of treatment effects in randomized clinical trials. Arch Gen Psychiatry. 2002; 59: 887–883. [DOI] [PubMed] [Google Scholar]
- 18.Zerhouni EA. NIH in the postdoubling era: Realities and strategies for the future In: NIMH Council Meeting; September; Bethesda, MD; 2006. [Google Scholar]
- 19.Murphy GM Jr., Hollander SB, Rodrigues C, Krema C, Schatzberg AF. Effects of the serotonic transporter gene promoter polymorphism on mirtazapine and paroxetine efficacy and adverse events in geriatric major depression. Arch Gen Psychiatry. 2004; 61: 1163–1169. [DOI] [PubMed] [Google Scholar]
- 20.Rothwell PM. Treating individuals 1: External validity of randomised controlled trials: “To whom do the results of this trial apply?”. The Lancet. 2005; 365: 82–93. [DOI] [PubMed] [Google Scholar]
- 21.Kraemer HC, Frank E, Kupfer DJ. Moderators of treatment outcomes: Clinical, research, and policy importance. JAMA. 2006; 296: 1286–1289. [DOI] [PubMed] [Google Scholar]
- 22.Napolitano MA, Marcus BH. Targeting and tailoring physical activity information using print and information technologies. Exerc Sport Sci Rev. 2002; 30: 122–128. [DOI] [PubMed] [Google Scholar]
- 23.Moyer A, Finney JW, Elworth JT, Kraemer HC. Can methodological features account for patient–treatment matching findings in the alcohol field? J Stud Alcohol. 2001; 62: 62–73. [DOI] [PubMed] [Google Scholar]
- 24.Skinner HA. Different strokes for different folks: Differential treatment for alcohol abuse In: Meyer RE, Babor TF, Glueck BC, Jaffe JH, O’Brien JE, Stabenau JR, eds. Evaluation of the Alcoholic: Implications for Research, Theory, and Treatment. Washington, DC: Government Printing Office; 1981: 349–367. [Google Scholar]
- 25.Marcus BH, Bock BC, Pinto BM, Forsyth LH, Roberts MB, Traficante RM. Efficacy of an individualized, motivationally-tailored physical activity intervention. Ann Behav Med. 1998; 20: 174–180. [DOI] [PubMed] [Google Scholar]
- 26.Strecher VJ, Marcus A, Bishop K, et al. A randomized controlled trial of multiple tailored messages for smoking cessation among callers to the Cancer Information Service. J Health Commun. 2005; 10: 105–118. [DOI] [PubMed] [Google Scholar]
- 27.Killen JD, Fortmann SP, Kraemer HC, Varady A, Newman B. Who will relapse? Symptoms of nicotine dependence predict long-term relapse after smoking cessation. J Consult Clin Psychol. 1992; 60: 797–801. [DOI] [PubMed] [Google Scholar]
- 28.Winkleby MA, Flora JA, Kraemer HC. A community-based heart disease intervention: Predictors of change. Am J Publ Health. 1994; 84: 767–772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.King AC, Satariano W, Marti J, Zhu W. Multi-level modeling of walking behavior: Advances in understanding the interactions of people, place, and time. Med Sci Sports Exerc, 2008;in press. [DOI] [PubMed] [Google Scholar]
- 30.Wilcox S, Dowda M, Griffin SF, et al. Results of the first year of Active for Life: Translation of two evidence-based physical activity programs for older adults into community settings. Am J Publ Health. 2006; 96: 1201–1209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.King AC, Castro C, Wilcox S. From science to practice and back again: Predictors of successful physical activity participation in a telephone-based dissemination research program. Int J Behav Nutr Phys Act. 2008;5.(abstract).18226268 [Google Scholar]
- 32.Castro FG, Barrera M Jr, Martinez CR Jr. The cultural adaptation of prevention interventions: Resolving tensions between fidelity and fit. Prev Sci. 2004; 5: 41–45. [DOI] [PubMed] [Google Scholar]
- 33.Munoz RF, Mendelson T. Toward evidence-base interventions for diverse populations: The San Francisco General Hospital Prevention and Treatment Manuals. J Consult Clin Psychol. 2005; 73: 790–799. [DOI] [PubMed] [Google Scholar]
- 34.Behrens JT. Principles and procedures of exploratory data analysis. Psychol Methods. 1997; 2: 131–160. [Google Scholar]
- 35.Greenhouse JB, Greenhouse JBW. Exploratory statistical methods, with applications to psychiatric research. Psychoneuroendocrinology. 1992; 17: 423–441. [DOI] [PubMed] [Google Scholar]
- 36.Tukey J Exploratory Data Analysis. Reading, MA: Addison-Wesley; 1977. [Google Scholar]
- 37.Wilkinson L, The Task Force on Statistical Inference of the APA Board of Scientific Affairs. Statistical methods in psychology journals: Guidelines and explanations. Am Psychol. 1999; 54: 594–604. [Google Scholar]
- 38.Wang R, Lagakos SW, Ware JH, Hunter DJ, Drazen JM. Statistics in medicine—Reporting of subgroup analyses in clinical trials. N Engl J Med. 2007; 357: 2189–2194. [DOI] [PubMed] [Google Scholar]
- 39.King AC, Kiernan M, Oman RF, Kraemer HC, Hull M, Ahn D. Can we identify who will adhere to long-term physical activity? Application of signal detection methodology as a potential aid to clinical decision-making. Health Psychol. 1997; 16: 380–389. [DOI] [PubMed] [Google Scholar]
- 40.Schneiderman N, Saab PG, Catellier DJ, et al. Psychosocial treatment within sex by ethnicity subgroups in the Enhancing Recovery in Coronary Heart Disease clinical trial. Psychosom Med. 2004; 66: 475–483. [DOI] [PubMed] [Google Scholar]
- 41.King AC, Stokols D, Talen E, Brassington GS, Killingsworth R. Theoretical approaches to the promotion of physical activity: Forging a transdisciplinary paradigm. Am J Prev Med. 2002; 232S: 15–25. [DOI] [PubMed] [Google Scholar]
- 42.Kraemer HC, Stice E, Kazdin A, Offord D, Kupfer D. How do risk factors work together? Mediators, moderators, and independent, overlapping, and proxy risk factors. Am J Psychiatr. 2001; 158: 848–856. [DOI] [PubMed] [Google Scholar]
- 43.King AC, Marcus BH, Dunn AL, et al. Identifying subgroups who succeed or fail with three physical activity interventions: The Activity Counseling Trial. Health Psychol. 2006; 25: 336–347. [DOI] [PubMed] [Google Scholar]
- 44.Kiernan M, King AC, Stefanick M, Kraemer HC. Characteristics of successful and unsuccessful dieters: An application of signal detection methodology. Ann Behav Med. 1998; 20: 1–6. [DOI] [PubMed] [Google Scholar]
- 45.Berry DA, Cirrincione C, Henderson IC, et al. Estrogen-receptor status and outcomes of modern chemotherapy for patients with node-positive breast cancer. JAMA. 2006; 295: 1658–1667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Killen JD, Fortmann SP, Davis L, Strausberg L, Varady A. Do heavy smokers benefit from higher dose nicotine patch therapy? Exp Clin Psychopharmacol. 1999; 7: 226–233. [DOI] [PubMed] [Google Scholar]
- 47.Pittas AG, Das SK, Hajduk CL, et al. A low-glycemic load diet facilitates greater weight loss in overweight adults with high insulin secretion but not in overweight adults with low insulin secretion in the CALERIE Trial. Diabetes Care. 2005; 28: 2939–2941. [DOI] [PubMed] [Google Scholar]
- 48.Ebbeling CB, Leidig MM, Feldman HA, Lovesky MM, Ludwig DS. Effects of a low-glycemic load vs low-fat diet in obese young adults: A randomized trial. JAMA. 2007; 297: 2092–2102. [DOI] [PubMed] [Google Scholar]
- 49.Cornier MA, Donahoo WT, Pereira R, et al. Insulin sensitivity determines the effectiveness of dietary macronutrient composition on weight loss in obese women. Obes Res. 2005; 13: 703–709. [DOI] [PubMed] [Google Scholar]
- 50.Sallis JF, Owen N. In: Glanz K, Rimer BK, Lewis FM, eds. Health Behavior and Health Education: Theory, Research, and Practice Ecological models of health behavior. 3rd ed San Francisco: Jossey-Bass; 2002: 462–484. [Google Scholar]
- 51.Murphy SA. An experimental design for the development of adaptive treatment strategies. Stat Med. 2005; 24: 1455–1481. [DOI] [PubMed] [Google Scholar]
- 52.Collins LM, Murphy SA, Nair VN, Strecher VJ. A strategy for optimizing and evaluating behavioral interventions. Ann Behav Med. 2005; 30: 65–73. [DOI] [PubMed] [Google Scholar]
- 53.Insel TR. Beyond efficacy: The STAR*D trial. Am J Psychiatry. 2006; 163: 5–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Rush AJ, Trivedi M, Fava M. Depression, IV: STAR*D treatment trial for depression. Am J Psychiatry. 2003; 160: 237–245. [DOI] [PubMed] [Google Scholar]
- 55.Trivedi MH, Fava M, Wisniewski SR, et al. Medication augmentation after the failure of SSRIs for depression. N Engl J Med. 2006; 354: 1243–1252. [DOI] [PubMed] [Google Scholar]
- 56.Killen JD, Fortmann SP, Kraemer HC, Varady AN, Davis L, Newman B. Interactive effects of depression symptoms, nicotine dependence, and weight change on late smoking relapse. J Consult Clin Psychol. 1996; 64: 1060–1067. [DOI] [PubMed] [Google Scholar]
- 57.Biddle SJH, Nigg CR. Theories of exercise behavior. Int J Sport Psychol. 2000; 31: 290–304. [Google Scholar]
- 58.King AC, Friedman R, Marcus B, et al. Harnessing motivational forces in the promotion of physical activity: The Community Health Advice by Telephone (CHAT) Project. Health Educ Res. 2002; 17: 627–636. [DOI] [PubMed] [Google Scholar]
- 59.Williams GG, Gagne M, Ryan RM, Deci EL. Facilitating autonomous motivation for smoking cessation. Health Psychol. 2002; 21: 40–50. [PubMed] [Google Scholar]
- 60.Efron B The efficiency of logistic regression compared to normal discriminant. J Am Stat Assoc. 1975; 70: 892–898. [Google Scholar]
- 61.Kiernan M, Kraemer HC, Winkleby MA, King AC, Taylor CB. Do logistic regression and signal detection identify different subgroups at risk? Implications for the design of tailored interventions. Psychol Methods. 2001; 6: 35–48. [DOI] [PubMed] [Google Scholar]
- 62.Kraemer HC. Evaluating Medical Tests: Objective and Quantitative Guidelines. Newbury Park, CA: Sage; 1992. [Google Scholar]
- 63.King AC, Toobert D, Ahn D, et al. Perceived environments as physical activity correlates and moderators of interventions in five studies. Am J Health Promot. 2006; 21: 24–35. [DOI] [PubMed] [Google Scholar]
- 64.Kraemer HC. Toward non-parametric and clinically meaningful moderators and mediators. Stat Med. 2007; 27(10): 1679–1692. [DOI] [PubMed] [Google Scholar]
- 65.Cook RJ, Sackett DL. The number needed to treat: A clinically useful measure of treatment effect. Br Med J. 1995; 310: 452–454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Grissom RJ, Kim JJ. Effect Sizes for Research. Mahwah, NJ: Erlbaum; 2005. [Google Scholar]
- 67.King AC, Haskell WL, Taylor CB, Kraemer HC, DeBusk RF. Group- versus home-based exercise training in healthy older men and women: A community-based clinical trial. JAMA. 1991; 266: 1535–1542. [PubMed] [Google Scholar]
- 68.Gaudard M, Ramsey P, Stephens M. Interactive data mining and design of experiments: The JMP partition and custom design platforms In: JMP version 5.1.2 (SAS). Cary, NC: SAS; 2006. [Google Scholar]
- 69.Lemon SC, Roy J, Clark MA, Friedmann PD, Rakowski W. Classification and regression tree analysis in public health: Methodological review and comparison with logistic regression. Ann Behav Med. 2003; 26: 172–181. [DOI] [PubMed] [Google Scholar]
- 70.Kraemer HC. Assessment of 2×2 association: Generalization of signal detection methods. Am Stat. 1988; 42: 37–49. [Google Scholar]
- 71.Glasgow RE, Lichtenstein E, Marcus AC. Why don’t we see more translation of health promotion research to practice? Rethinking the efficacy-to-effectiveness transition. Am J Publ Health. 2003; 93: 1261–1267. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Davidson KW, Goldstein M, Kaplan RM, et al. Evidence-based behavioral medicine: What is it and how do we achieve it? Ann Behav Med. 2003; 26: 161–171. [DOI] [PubMed] [Google Scholar]
- 73.Rosen L, Manor O, Engelhard D, Zucker D. In defense of the randomized controlled trial for health promotion research. Am J Publ Health. 2006; 96: 1181–1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Glasgow RE, Vogt TM, Boles SM. Evaluating the public health impact of health promotion interventions: The RE-AIM framework. Am J Publ Health. 1999; 89: 1322–1327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Atienza AA, Yaroch AL, Masse LC, Moser RP, Hesse BW, King AC. Identifying sedentary subgroups: The National Cancer Institute’s Health Information National Trends Survey. Am J Prev Med. 2006; 31: 383–390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Pickering TG, Shimbo D, Haas D. Ambulatory blood pressure monitoring. New Engl J Med. 2006; 354: 2368–2374. [DOI] [PubMed] [Google Scholar]
- 77.Pence C, McErlane K. Anticoagulation self-monitoring. Am J Nurs. 2005; 105: 62–65. [DOI] [PubMed] [Google Scholar]
- 78.Harris ND, Baykouchev SB, Marques JL, et al. A portable system for monitoring physiological responses to hypoglycaemia. J Med Eng Technol. 1996; 20: 196–202. [DOI] [PubMed] [Google Scholar]
- 79.Patrick K, Intille SS, Zabinsky MF. An ecological framework for cancer communication: Implications for research. J Med Internet Res. 2005; 7: 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]