Abstract
The idea that epidemiologic studies should start from first exposure onward has been advocated in the past few years. The study of incident exposures is contrasted with studies of prevalent exposures in which follow-up may commence after first exposure. The former approach is seen as a hallmark of a good study and necessary for causal inference. We argue that studying incident exposures may be necessary in some situations, but it is not always necessary and is not the preferred option in many instances. Conducting a study involves decisions as to which person-time experience should be included. Although studies of prevalent exposures involve left truncation (missingness on the left), studies of incident exposures may involve right censoring (missingness on the right) and therefore may not be able to assess the long-term effects of exposure. These considerations have consequences for studies of dynamic (open) populations that involve a mixture of prevalent and incident exposures. We argue that studies with prevalent exposures will remain a necessity for epidemiology. The purpose of this paper is to restore the balance between the emphasis on first exposure cohorts and the richness of epidemiologic information obtained when studying prevalent exposures.
Keywords: dynamic populations, epidemiologic methods, incidence rate, incident exposure, left truncation, prevalent exposure, right censoring, study design
Editor's note:Counterpoints to this article appear on pages 834 and 840, and a response appears on page 846.
The notion that epidemiologic studies should start with participants that are observed from initiation of exposure onward (1) and that this is necessary for causal inference has recently received wide support (2–6). In contrast, studies that enroll people with “prevalent exposures” are considered to be prone to selection bias (7). The “first exposure” approach received support because of observational research wrongly indicating that combined hormone replacement therapy (cHRT) protects against myocardial infarction. It is also supported on theoretical grounds rooted in counterfactual thinking (see below). Conversely, for over a century epidemiologic studies have involved prevalent exposures, and their results have largely stood the test of time. In this paper, we argue that studies (cohort or case-control) that estimate incidence rates and rate ratios (hazard ratios) in dynamic populations (8, 9) involving a mix of prevalent and incident exposures are necessary, and that universal application of the first exposure principle would be detrimental to epidemiology.
PRELIMINARY CONSIDERATIONS
We will first consider some background issues about types of evidence, populations, exposures, follow-up, and patterns of hazard ratios over exposure time.
Types of evidence used in causal inference
The first exposure principle is supported by theories of causal inference that hold that causality can only be assessed by either a randomized trial or an observational study that closely mimics a randomized trial. In a randomized trial, follow-up usually starts from the initiation of an intervention (it could be conducted in a population with previous/prevalent exposure/treatment, but the focus would be on the effects of the new (randomized) exposure/treatment). By analogy, the effect of an exposure in an observational study can only be fully and validly assessed if follow-up starts from first exposure. The first exposure principle is implicit in the theory of potential outcomes and counterfactual thinking (10, 11). Recently, the principle has been made more explicit (2, 12, 13).
This view of causal inference is different from that in Austin Bradford Hill's causal considerations that may “… help us to make up our minds on the fundamental question—is there any other way of explaining the set of facts before us, is there any other answer equally, or more, likely than cause and effect?” (14, p. 299). The evidence used in such considerations includes analytical studies that may include prevalent exposures (having hypertension, being diabetic), as well as evidence from time trends, ecological analyses, and biological and mechanistic insights. Although some types of evidence may be better than others for a specific hypothesis, there is usually no definitive single study or study design that answers the question of causality on its own. Moreover, randomized controlled trials (or observational studies which mimic such trials) may be regarded as the (hypothetical) “gold standard” for many research questions, but there are other important research questions for which it is difficult to conceive of a definitive randomized trial even in theory (e.g., the health effects of climate change, or macro-level economic policies).
Types of populations
Populations can be closed or open, and the distinction depends in part on the time axis used (15). For example, a population of people who have used a particular medication would be “closed” if time is defined as time since starting medication but would be “open” if time is defined as calendar time. In addition, populations may be regarded as dynamic if people can join, leave, and rejoin (e.g., the population of London during the year 2014) or may be regarded as closed, or fixed, if this is not permitted (e.g., follow-up of everyone living in London on January 1, 2014).
In the current context, the key distinction is whether the population is open to entry after the exposure under study begins and open to exit before the outcome under study occurs (S. Greenland, University of California, Los Angeles, personal communication, 2013). The first part of this criterion involves the distinction between a study of “incident exposures” in which follow-up starts from (or before) the time of first exposure and a study of “prevalent exposures” in which follow-up may start after first exposure. This leads to left truncation (missingness on the left). Almost all studies also involve right censoring (missingness on the right), because there will be a proportion of participants whose length of follow-up is restricted.
Figure 1 shows a population that is dynamic with respect to calendar time: Participants enter at different calendar time points and are either exposed or unexposed, or they switch exposures over time. The population is studied for the 4-year period 2009–2012, and the investigators will have information only on disease incidence during this period (although they may be able to collect information on exposures from previous periods).
Figure 2 shows how this selected period of follow-up from the 4-year calendar time period then matches up with the information that would have been obtained in the corresponding fixed (with respect to start of exposure) cohort study in which all participants would have been followed from first exposure or the corresponding date for the nonexposed. The key feature is that each study participant is at most followed for 4 years. For some, this is the 4 years following the start of follow-up (exposed or nonexposed), but for most it represents a different period (e.g., years 7–11 after first exposure or start of follow-up). There is “left truncation” (delayed study entry at different times during follow-up); if the entry at different follow-up times is not taken into account, the data would represent a mix of the information that would have been obtained from different follow-up periods in the full fixed cohort study, and selection bias may occur (1, 2). However, this phenomenon is well known in the clinical epidemiologic literature (16–18), as well as in the occupational epidemiology literature (19), and standard analytical solutions exist (e.g., by calculating life tables with left-truncation, or by calculating person-years of exposure in different categories of follow-up time) (19–21).
Types of exposure
Irrespective of when people are recruited into a cohort (or identified as cases and controls in a case-control study), information can be collected on previous exposures. Thus, prevalent exposures can be classified by using information on current exposure, past exposure, or cumulative exposure, and incident exposures can also be classified the same way once follow-up has proceeded. For example, for cigarette smoking and lung cancer, cumulative exposure may be most important; this can be assessed at each person-year of follow-up irrespective of when smoking started. On the other hand, for hormone replacement therapy and myocardial infarction, the recently initiated exposures may be most relevant. Finally, there are exposures that are “on-off” phenomena or exposures that are completely transient, which also determines the type of follow-up that is needed.
Types of follow-up that are needed
The relevant follow-up period will differ according to the context. For example, in a study of venous thrombosis, follow-up of a person who was using oral contraceptives at age 25, had used the pill for 6 years, continued to use the pill for another 13 years, and then stopped gives useful information for several time windows of use/follow-up. Follow-up might start with prevalent use and end at most 3 months after stopping. This approach would be adopted mainly because the risk of venous thrombosis with oral contraceptives is expected to be an “on-off” coagulation effect, with perhaps an early peak, but followed by a sustained increased hazard ratio until shortly after cessation of use (22–24). On the other hand, in a study of oral contraceptives and breast cancer, follow-up should continue for many years after stopping use. In the first instance, the effect is due to the “drug in the blood” and stops when the drug is no longer present; in the second instance, the effect is presumably due to lasting cellular alterations.
Exposure-disease associations in case-crossover studies are often an extreme form of expected on-off effects: They are transient, more or less instantaneous, and completely reversible (e.g., coffee drinking and myocardial infarction (25), cell phone use and car accidents (26)).
Patterns of hazard ratio by time since first exposure
Figure 3 shows 4 general patterns of hazard ratios by time since first exposure. They can be viewed as daily hazard ratios from date of first exposure onward (i.e., the ratio of daily hazard rates or incidence rates between exposed and nonexposed, analogous to the theory of the Cox model). In Figure 3A, the hazard ratio is constant from first exposure onward, after an initial latency period. If the time axis was days (or even minutes), this pattern would apply to transient and on-off exposures whose effect is present as long as the exposure is present. Figure 3B shows an initially increased risk followed by a subsequently reduced risk, a pattern that was found for cHRT exposure and myocardial infarction over years of follow-up (2, 27–29). In Figure 3C, the hazard ratio increases with duration of exposure; the findings for cHRT and breast cancer are compatible with this pattern over years of follow-up (27, 30–32). Figure 3D shows an initial peak followed by a sustained increased risk; this pattern was seen for cHRT and venous thrombosis (33, 34). The consequences of left truncation are different for these different exposures.
THE PROBLEM OF LEFT TRUNCATION (MISSINGNESS ON THE LEFT)
Hormone replacement therapy and myocardial infarction
The potential problem with studying prevalent exposures has been highlighted in the cHRT controversy. Observational studies showed a consistent overall decrease of myocardial infarction among users of cHRT. In contrast, in randomized trials cHRT increased the risk of myocardial infarction in the first years of use, but the hazard ratio was subsequently decreased (sketched in Figure 3B) (35). This apparent contradiction was solved by the realization that the observational studies started with current users (35), who were contrasted with never users or with past users. At the start of follow-up, the current users were mostly past the initial period with elevated risk; they showed a decreased hazard ratio of myocardial infarction that corresponded to the findings of the randomized trials after the first few years of use (28, 29, 35, 36).
Two different analytical solutions exist. First, analogous to the reasoning about left-truncation when discussing types of populations, analyses in the observational part of the Women's Health Initiative study included study participants with prevalent exposures, stratified on duration of cHRT use (synonymous with time since first exposure) at enrollment, and showed that for coronary heart disease the observational study findings were consistent with those of the Women's Health Initiative randomized trials (36). Second, an analysis of the observational Nurses' Health Study compared incident users, based on counterfactual thinking (see above): It excluded prevalent users, identified new (incident) users of cHRT from each biannual examination, and analyzed these data as a sequence of imaginary successive randomized trials; this also made the findings for coronary heart disease consistent with those from the randomized trials (29). The first solution was particularly suited for analyzing the Women's Health Initiative observational data, as there were few women who initiated cHRT during the follow-up; the second approach was appropriate for the Nurses’ Health Study as there were ample women initiating cHRT during successive biannual periods (36). The 2 methods were proposed in different contexts, but their underlying approaches are consistent and complementary. There is debate whether a similar biasing effect of studying prevalent users may exist for observational studies of the effects of statins (refer to Web Appendix 1, available at http://aje.oxfordjournals.org/).
Confounders that change after initiation of exposure
Confounders and other covariates (e.g., mediators) may change over time, and this may be caused by previous exposure. This phenomenon can exist in all follow-up studies of a fixed cohort. An “old” cohort (i.e., the latter part of follow-up) may differ from the baseline with respect to these covariates. Analyses that adjust for variables that are affected by exposure may produce bias (7). This may be important in some circumstances (e.g., in clinical and pharmacoepidemiologic studies of beneficial effects) when therapy changes in the course of follow-up and where confounding by indication plays a large role (i.e., the indication for treatment may have changed). It may be less important in studies of adverse effects, in which the risk factors for adverse events are different from the reasons for exposure (37) (refer to the example about nonsteroid antiinflammatory drugs below), or when the exposure is a recurrent transient phenomenon such as cell phone use while driving, or coffee drinking (25, 26).
The exclusive solution: study follow-up from beginning of exposure
It has been argued, on the basis of the cHRT experience, that all “good” exposure-disease studies should start follow-up from the start of exposure (3, 4, 38–40). Moreover, starting from inception of exposure enables us to capture the life-course effects of exposure. Studies of prevalent exposures are weighted more towards participants who started exposure at some time in the past. This is not a major problem—in the sense that it will not put epidemiology completely on the wrong foot—if the hazard rate is reasonably constant across time periods since the first exposure (Figure 3A), is slowly increasing (Figure 3C), or if an early peak is followed by a sustained risk (Figure 3D). However, in the cHRT and myocardial infarction example (Figure 3B), the cohort studies effectively placed greater weight on the later time periods where the rate ratio was less than 1.0 and less weight on the earlier time periods where it was greater than 1.0. Although limiting the analysis to persons followed from first exposure onward solved this problem in 1 study (29), an analysis by duration of exposure (time since first exposure) (28) addressed the problem equally well and showed essentially the same findings (36).
An inclusive solution: the period analysis life table
Methods to account for left truncation are not new (20). In oncology, a period analysis is used because the prognostic information for the near future is better with period analysis than with full cohort analyses (41). Suppose that an investigator wants to know the 20-year survival of persons who have undergone colorectal surgery for colon cancer. In a regional (or country wide) cancer registry, all persons are identified who are alive after such surgery on January 1, 2009, and thereafter all new patients who undergo the same surgery are enrolled; all are followed until December 31, 2012. Follow-up is at most 4 years. However, these 4-year periods represent different time windows since the start of exposure for different patients. For each of these time windows, incidence rates of death are calculated, which are then transformed to cumulative incidences, and a life table can be calculated (analogous to our Figures 1 and 2).
The 20-year cohort experience that is reconstructed is not the same as the cohort experience that is obtained when starting follow-up with everyone who had surgery 20 years ago. The latter life table is of little interest for future patients, since it concerns surgical and medical practices that may not apply in the future. Conversely, the follow-up experience of those in their 16th–20th year of follow-up in the 2009–2012 time window may be of less interest to persons who receive their surgery in 2009–2012 because their survival is most likely going to be different 20 years in the future, but it still has the advantage of representing the recent state of health care. The most recent (in calendar time) follow-up experience may be the most valuable for decisions about future health care (41, 42). The problem of left truncation in a life table was already described and solved several decades ago in the statistical literature (refer to the reports by Turnbull (42, 43), Detels et al. (44), and Hsia et al. (45) for an overview of the older literature), but solutions were sometimes reinvented by shrewd clinical investigators who faced the problem in studies on the prognosis of transient ischemic attacks (18), of colon cancer risk in patients with colitis (16), and of multiple sclerosis (17).
Similar analyses are routinely used in cohort studies and case-control studies based on dynamic populations, wherein hazard ratios or odds ratios from case-control studies are often presented for different strata of duration of exposure. Hernán (2) has proposed that it might be more insightful to present such findings as cumulative risks, rather than as an array of odds ratios or hazard ratios, as one can string together the incidence rate ratios for successive duration of exposure/follow-up windows (as in the period analysis life table). Refer to Web Appendix 2, including Web Figure 1, for an example.
How generalizable is the problem?
How important is the cHRT and myocardial infarction phenomenon for epidemiology? Severely biased findings for the association of cHRT with myocardial infarction were derived in epidemiologic studies, because the hazard ratio completely reversed over the follow-up time (sketched in Figure 3B). However, the same studies with prevalent cHRT users did not produce any grossly wrong estimates for other outcomes: The findings for venous thrombosis or pulmonary embolism, breast and colon cancer, or strokes were all in line with the findings of randomized trials (27, 36). For venous thrombosis, there was an initial peak (as sketched in Figure 3D), followed by a sustained risk that was quite similar in both randomized trials and observational studies (33, 34). For breast cancer, both observationally and in the randomized trials, there was a slow increase in cumulative risk (31), as well as a suggestion for an increasing hazard ratio (as sketched in Figure 3C) over exposure time (27, 30–32). Thus, all these results were in the same direction for randomized trials and observational studies and were numerically similar in their order of magnitude. Moreover, separate analyses for duration of exposure showed the pattern of the hazard ratios over time. None of these other outcomes led to a clear selection bias problem because of the use of prevalent exposures.
The reasons why the hazard ratios of myocardial infarction went below unity in long-term users are not clear. Depletion of susceptibles has been proposed (2), but this may not be a complete explanation (Web Appendix 3).
A follow-up study of different nonsteroid antiinflammatory drugs and the occurrence of myocardial infarction as an adverse effect (46) is an example of the combination of the best of both worlds, as it was possible to do sensitivity analyses by comparing new users with prevalent users and to assess the effects of the potential selection bias from the latter. There was an early peak following first exposure, but the main analyses yielded similar findings just using the overall incidence rate ratios. Thrombotic phenomena with nonsteroid antiinflammatory drugs are an example of a “drug-in-the-blood on-off” phenomenon, where associations are present as long as the drug is used, which allows that parts of the follow-up are started during current exposure and are stopped at the cessation of exposure. In addition, the data were obtained in a period wherein the problem of myocardial infarction occurrence with different types of nonsteroid antiinflammatory drugs was not yet known, so any reason for prescribing or changing prescriptions would have been the result of noncardiovascular factors (e.g., stomach complaints), and bias from confounders changing over time would be unlikely (37).
The iconic follow-up study of smoking and lung cancer by Doll and Hill (47) started with prevalent exposures for smoking. The “current versus never” contrast yielded very large risk ratios, and duration of smoking was a strong risk factor in all analyses. There was no risk reversal during follow-up. On the contrary, risk continued to increase strongly over exposure time. Even after cessation of exposure, continued increased risks existed that may gradually return to background levels (48). This phenomenon is not unusual. In many studies of chronic environmental or occupational exposures and cancer (e.g., nickel and nasal cancer, asbestos and mesothelioma), it is standard to include both prevalent and incident exposures (19). Such studies start with the current employees of the factory/industry on a particular day in the past and then add in new workers who joined subsequently, and it is possible to have separate incidence rate calculations for different time periods since first exposure. This allows us to have all the information that one could obtain from a study of incident exposures—and much more quickly, and in larger numbers—analogous to the period life table analysis. Omitting prevalent users would lead to a considerable reduction in study size and would increase the problem of right censoring because there would be very few participants with long follow-up times since first exposure.
THE NECESSITY OF STUDIES WITH PREVALENT EXPOSURES
Studies of prevalent exposures are not only reasonable but also the best option in many situations. For many exposures, like smoking or other environmental or occupational exposures, it is unfeasible to start from the first exposure onward. For smoking, that would need follow-up from age 11 onward.
In situations where follow-up includes a mix of prevalent and incident exposures, as in many occupational studies, inclusion of prevalently exposed persons may greatly increase the statistical precision of the “late” exposure duration windows and/or the “long” time-since-first-exposure windows. This may produce lesser precision in the early time-since-first-exposure windows (5), but these are often of less interest.
Moreover, the situation is different for exposures that have temporary and current effects (e.g., oral contraceptives and venous thrombosis) versus exposures that have lasting effects (e.g., oral contraceptives and breast cancer). In the case of transient exposures, the problem might also be less important (cellular phone use and car accidents, coffee drinking and myocardial infarction).
Finally, it is often assumed that there are always potentially clearly demarked beginnings of an exposure. However, for many states, such as hypertension, hypercholesterolemia, dementia, diabetes, and so on, the exact date at which these conditions start is unknown. Nevertheless, these are clearly important etiological causes of disease.
THE PROBLEM OF RIGHT CENSORING (MISSINGNESS ON THE RIGHT)
Although the problem of left truncation has received considerable attention, there has been relatively little discussion of the problem of right censoring. Ideally, every study should follow all participants until death, but this is rarely achievable. Unless there are unlimited resources and unlimited time available, any decision to restrict a study to incident users inevitably means restricting its size and restricting it to the earlier years since first exposure.
The problem of such restrictions is evident in the studies of cancer in atomic bomb survivors (49). An initial increase in leukemia and other hematological malignancies occurred, but the hazard ratio declined after about 10 years; in contrast, the increased risks for solid (epithelial) tumors only began to become apparent 10–15 years after exposure. Similar patterns were observed in studies of the development of secondary tumors after chemo- and radiotherapy for cancer, wherein randomized trials found little risk in contrast to observational long-term follow-up studies (50, 51).
Finally, other biases exist in observational studies, as well as randomized trials, that are due to self-selection toward exposures and self-selection in adherence to exposures. They can be important, but generally, they are similar between studies that start from first exposures and studies with prevalent exposures. Refer to Web Appendix 4.
DISCUSSION
We have considered studies that enroll prevalently exposed persons (left truncation). The context of our discussion is the usefulness of dynamic populations with estimation of incidence rates and rate ratios for many environmental, occupational, and also pharmacoepidemiologic problems in cohort studies or case-control studies. It is apparent that the necessity of starting follow-up from first exposure should be decided on the basis of the best subject matter knowledge that is available.
If a more or less ideal cohort exists (large and extensive longitudinal data) with incident exposures and follow-up time in the relevant exposure time windows, this might be preferred. For all other situations, we will continue to need dynamic population studies, because either the ideal cohort does not exist or we cannot wait for it, or exposures are transient. A dynamic population study can be a cohort or a case-control study and is often the most feasible option; it needs information about only a few calendar years to yield data about many time-since-first-exposure windows. Even if a suitable cohort with incident exposures and very long-term follow-up exists, dynamic population studies can give more interesting insights, because the early as well as the late time-since-first-exposure experience has accrued over recent calendar times and has immediate relevance for the next calendar years.
Strong warnings about the inherent biasedness of studies involving prevalent exposures (5, 6) seem inconsistent with the practice and experience of epidemiology. Instead of demanding that studies start with newly exposed persons, it might be more thoughtful to discriminate among 3 situations:
Start of follow-up coincides with the beginning of exposure, which is alike to the randomized trial situation. This might often work in pharmacoepidemiology. A potential problem is that follow-up duration is usually short in such studies and does not allow the study of long-term risks.
There is a suitable mix of prevalent and incident exposures, that is, substantial numbers of new users as well as of prevalent users with different durations of exposure. This might often be the ideal situation, because it allows us to explore the effects of different durations of exposure (and/or time since first exposure) and greatly enhances information on long exposure durations. This is the case in many population-based case-control studies and occupational and environmental follow-up studies.
There are only persons with prevalent exposures. This makes it necessary to assess whether this might lead to completely wrong conclusions if there is a qualitative reversal of outcome between early and later time-since-first-exposure periods.
In situations 2 and 3, there is a need to consider whether the analyses should be stratified on time since first exposure, and whether the findings should be presented with this stratification. In addition, there should be caution in adjusting for confounders that may be consequences of previous exposure. All these considerations will be context specific as they will differ between different applications in epidemiology (e.g., between pharmacoepidemiology and occupational and environmental epidemiology), which often rely on data with different structures and different challenges. In many situations, such as on-off effects and transient exposures, this will not matter. No view from a single experience should become an absolute norm.
Finally, there are implications for teaching. All types of epidemiologic studies should continue to be taught. All designs have their indications and contraindications, and their merits differ in different situations. In particular, dynamic population studies, follow-up studies, and case-control studies will remain important epidemiologic tools and therefore should continue to be explicitly taught in introductory courses. Students of epidemiology should learn that the mere potential of a bias will not invalidate a study, but that in each case they should reason about whether the potential bias is likely to occur, how strong it is likely to be, and whether it can be removed by appropriate data analysis.
Supplementary Material
ACKNOWLEDGMENTS
Author affiliations: Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, the Netherlands (Jan Vandenbroucke); Centre for Global Noncommunicable Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom (Neil Pearce); and Centre for Public Health Research, Massey University, Wellington, New Zealand (Neil Pearce).
The Centre for Global Noncommunicable Diseases is supported by the Wellcome Trust Institutional Strategic Support Fund, 097834/Z/11/B. The Centre for Public Health Research is supported by a program grant from the Health Research Council of New Zealand.
We thank Drs. Miguel Hernán and Sander Greenland for fruitful discussions that preceded the writing of this paper, as well as Drs. Miguel Hernán and Lorenzo Richiardi for comments on a draft manuscript.
Conflict of interest: none declared.
REFERENCES
- 1.Ray WA. Evaluating medication effects outside of clinical trials: new-user designs. Am J Epidemiol. 2003;1589:915–920. [DOI] [PubMed] [Google Scholar]
- 2.Hernán MA. The hazards of hazard ratios. Epidemiology. 2010;211:13–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Schneeweiss S. A basic study design for expedited safety signal evaluation based on electronic healthcare data. Pharmacoepidemiol Drug Saf. 2010;198:858–868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Johnson ES, Bartman BA, Briesacher BA et al. . The incident user design in comparative effectiveness research. Pharmacoepidemiol Drug Saf. 2013;221:1–6. [DOI] [PubMed] [Google Scholar]
- 5.Schisterman EF, Cole SR, Ye A et al. . Accuracy loss due to selection bias in cohort studies with left truncation. Paediatr Perinat Epidemiol. 2013;275:491–502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Institute of Medicine. Ethical and Scientific Issues in Studying the Safety of Approved Drugs. Washington, DC: The National Academies Press; 2012. [PubMed] [Google Scholar]
- 7.Hernán MA, Hernández-Díaz S, Robins JM. A structural approach to selection bias. Epidemiology. 2004;155:615–625. [DOI] [PubMed] [Google Scholar]
- 8.Vandenbroucke JP, Pearce N. Incidence rates in dynamic populations. Int J Epidemiol. 2012;415:1472–1479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Vandenbroucke JP, Pearce N. Case-control studies: basic concepts. Int J Epidemiol. 2012;415:1480–1489. [DOI] [PubMed] [Google Scholar]
- 10.Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. J Educ Psychol. 1974;665:688–701. [Google Scholar]
- 11.Robins JM. The analysis of randomized and non-randomized AIDS treatment trials using a new approach to causal inference in longitudinal studies. In: Sechrest L, Freeman L, Mulley A, eds. Health Services Research Methodology: a Focus on AIDS. Bethesda, MD: US Public Health Service, National Center for Health Services Research; 1989:113–159. [Google Scholar]
- 12.Hernán MA. With great data comes great responsibility: publishing comparative effectiveness research in epidemiology. Epidemiology. 2011;223:290–291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hernán MA, Taubman SL. Does obesity shorten life? The importance of well-defined interventions to answer causal questions. Int J Obes (Lond). 2008;32(suppl 3):S8–S14. [DOI] [PubMed] [Google Scholar]
- 14.Hill AB. The environment and disease: association or causation? Proc R Soc Med. 1965;585:295–300. [PMC free article] [PubMed] [Google Scholar]
- 15.Rothman KJ, Greenland S, Lash TL. Modern Epidemiology. 3rd ed Philadelphia, PA: Wolters Kluwer Health/Lippincott Williams & Wilkins; 2008. [Google Scholar]
- 16.Devroede GJ, Taylor WF, Sauer WG et al. . Cancer risk and life expectancy of children with ulcerative colitis. N Engl J Med. 1971;2851:17–21. [DOI] [PubMed] [Google Scholar]
- 17.Kurtzke JF. On estimating survival; a tale of two censors. J Clin Epidemiol. 1989;422:169–175. [DOI] [PubMed] [Google Scholar]
- 18.Goldner J, Wisnant JP, Taylor WF. Long-term prognosis of transient cerebral ischemic attacks. Stroke. 1971;22:160–167. [DOI] [PubMed] [Google Scholar]
- 19.Checkoway H, Pearce N, Kriebel D. Research Methods in Occupational Epidemiology. 2nd ed New York, NY: Oxford University Press; 2004. [Google Scholar]
- 20.Cleves M. An Introduction to Survival Analysis Using Stata. 2nd ed College Station, TX: Stata Press; 2008. [Google Scholar]
- 21.Lamarca R, Alonso J, Gómez G et al. . Left-truncated data with age as time scale: an alternative for survival analysis in the elderly population. J Gerontol A Biol Sci Med Sci. 1998;535:M337–M343. [DOI] [PubMed] [Google Scholar]
- 22.Bloemenkamp KW, Rosendaal FR, Helmerhorst FM et al. . Higher risk of venous thrombosis during early use of oral contraceptives in women with inherited clotting defects. Arch Intern Med. 2000;1601:49–52. [DOI] [PubMed] [Google Scholar]
- 23.Kemmeren JM, Algra A, Meijers JC et al. . Effect of second- and third-generation oral contraceptives on the protein C system in the absence or presence of the factor V Leiden mutation: a randomized trial. Blood. 2004;1033:927–933. [DOI] [PubMed] [Google Scholar]
- 24.Kemmeren JM, Algra A, Meijers JC et al. . Effects of second and third generation oral contraceptives and their respective progestagens on the coagulation system in the absence or presence of the factor V Leiden mutation. Thromb Haemost. 2002;872:199–205. [PubMed] [Google Scholar]
- 25.Maclure M. The case-crossover design: a method for studying transient effects on the risk of acute events. Am J Epidemiol. 1991;1332:144–153. [DOI] [PubMed] [Google Scholar]
- 26.McEvoy SP, Stevenson MR, McCartt AT et al. . Role of mobile phones in motor vehicle crashes resulting in hospital attendance: a case-crossover study. BMJ. 2005;3317514:428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Grodstein F, Clarkson TB, Manson JE. Understanding the divergent data on postmenopausal hormone therapy. N Engl J Med. 2003;3487:645–650. [DOI] [PubMed] [Google Scholar]
- 28.Prentice RL, Langer R, Stefanick ML et al. . Combined postmenopausal hormone therapy and cardiovascular disease: toward resolving the discrepancy between observational studies and the Women's Health Initiative clinical trial. Am J Epidemiol. 2005;1625:404–414. [DOI] [PubMed] [Google Scholar]
- 29.Hernán MA, Alonso A, Logan R et al. . Observational studies analyzed like randomized experiments: an application to postmenopausal hormone therapy and coronary heart disease. Epidemiology. 2008;196:766–779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Beral V, Reeves G, Bull D et al. . Breast cancer risk in relation to the interval between menopause and starting hormone therapy. J Natl Cancer Inst. 2011;1034:296–305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Rossouw JE, Anderson GL, Prentice RL et al. . Risks and benefits of estrogen plus progestin in healthy postmenopausal women: principal results from the Women's Health Initiative randomized controlled trial. JAMA. 2002;2883:321–333. [DOI] [PubMed] [Google Scholar]
- 32.Prentice RL, Chlebowski RT, Stefanick ML et al. . Estrogen plus progestin therapy and breast cancer in recently postmenopausal women. Am J Epidemiol. 2008;16710:1207–1216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Cushman M, Kuller LH, Prentice R et al. . Estrogen plus progestin and risk of venous thrombosis. JAMA. 2004;29213:1573–1580. [DOI] [PubMed] [Google Scholar]
- 34.Sweetland S, Beral V, Balkwill A et al. . Venous thromboembolism risk in relation to use of different types of postmenopausal hormone therapy in a large prospective study. J Thromb Haemost. 2012;1011:2277–2286. [DOI] [PubMed] [Google Scholar]
- 35.Vandenbroucke JP. The HRT controversy: observational studies and RCTs fall in line. Lancet. 2009;3739671:1233–1235. [DOI] [PubMed] [Google Scholar]
- 36.Prentice RL. Data analysis methods and the reliability of analytic epidemiologic research. Epidemiology. 2008;196:785–788; discussion 789–793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Vandenbroucke JP. When are observational studies as credible as randomised trials? Lancet. 2004;3639422:1728–1731. [DOI] [PubMed] [Google Scholar]
- 38.Schneeweiss S, Patrick AR, Stürmer T et al. . Increasing levels of restriction in pharmacoepidemiologic database studies of elderly and comparison with randomized trial results. Med Care. 2007;45(10 suppl 2):S131–S142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Danaei G, Rodríguez LA, Cantero OF et al. . Observational data for comparative effectiveness research: an emulation of randomised trials of statins and primary prevention of coronary heart disease. Stat Methods Med Res. 2013;221:70–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Danaei G, Tavakkoli M, Hernán MA. Bias in observational studies of prevalent users: lessons for comparative effectiveness research from a meta-analysis of statins. Am J Epidemiol. 2012;1754:250–262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Brenner H, Hakulinen T. Up-to-date long-term survival curves of patients with cancer by period analysis. J Clin Oncol. 2002;203:826–832. [DOI] [PubMed] [Google Scholar]
- 42.Turnbull BW. Nonparametric estimation of a survivorship function with doubly censored data. J Am Stat Assoc. 1974;69345:169–173. [Google Scholar]
- 43.Turnbull BW. The empirical distribution function with arbitrarily grouped, censored and truncated data. J R Stat Soc Ser B Stat Methodol. 1976;383:290–295. [Google Scholar]
- 44.Detels R, Muñoz A, McFarlane G et al. . Effectiveness of potent antiretroviral therapy on time to AIDS and death in men with known HIV infection duration. Multicenter AIDS Cohort Study Investigators. JAMA. 1998;28017:1497–1503. [DOI] [PubMed] [Google Scholar]
- 45.Hsia J, Langer RD, Manson JE et al. . Conjugated equine estrogens and coronary heart disease: the Women's Health Initiative. Arch Intern Med. 2006;1663:357–365. [DOI] [PubMed] [Google Scholar]
- 46.Ray WA, Varas-Lorenzo C, Chung CP et al. . Cardiovascular risks of nonsteroidal antiinflammatory drugs in patients after hospitalization for serious coronary heart disease. Circ Cardiovasc Qual Outcomes. 2009;23:155–163. [DOI] [PubMed] [Google Scholar]
- 47.Doll R, Hill AB. The mortality of doctors in relation to their smoking habits; a preliminary report. Br Med J. 1954;14877:1451–1455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Doll R, Peto R, Boreham J et al. . Mortality in relation to smoking: 50 years’ observations on male British doctors. BMJ. 2004;3287455:1519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Preston DL, Ron E, Tokuoka S et al. . Solid cancer incidence in atomic bomb survivors: 1958–1998. Radiat Res. 2007;1681:1–64. [DOI] [PubMed] [Google Scholar]
- 50.Franklin JG, Paus MD, Pluetschow A et al. . Chemotherapy, radiotherapy and combined modality for Hodgkin's disease, with emphasis on second cancer risk. Cochrane Database Syst Rev. 2005;(4):CD003187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.van Eggermond AM, Schaapveld M, Lugtenburg PJ et al. . Risk of multiple primary malignancies following treatment of Hodgkin lymphoma. Blood. 2014;1243:319–327; quiz 466. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.