Abstract
Many paediatric clinical research studies, whether observational or interventional, have as an eventual aim the identification or quantification of causal relationships. One might ask: does screen time influence childhood obesity? Could overuse of paracetamol in infancy cause wheeze? How does breastfeeding affect later cognitive outcomes? In this review, we present causal Directed Acyclic Graphs (DAGs) to a paediatric audience. DAGs are a graphical tool which provide a way to visually represent and better understand the key concepts of exposure, outcome, causation, confounding, and bias. We use clinical examples, including those outlined above, framed in the language of DAGs, to demonstrate their potential applications. We show how DAGs can be most useful in identifying confounding and sources of bias, demonstrating inappropriate statistical adjustments for presumed biases, and understanding threats to validity in randomised controlled trials. We believe that a familiarity with DAGs, and the concepts underlying them, will be of benefit both to researchers planning studies, and practicing clinicians interpreting them.
Introduction
A wide variety of studies are undertaken with the aim of understanding and improving paediatric health, and in particular to identify the causal processes that lead to the development of health outcomes or disease. To achieve this, it is helpful to define a number of variables (for the benefit of the reader unacquainted with the technical language that follows, we have created a Supplemental Glossary available online). For example, in a study looking at the relationship between screen time (time spent watching television, using computers or games consoles) and childhood obesity (1), the authors hypothesised that more screen time (the exposure) may lead to an increased risk of childhood obesity (the outcome). The exposure may cause the outcome not directly but through an intermediate process – a direct reduction in physical activity (the intermediate variable or mediator). Finally, we want to identify factors which influence both the exposure and the outcome, but are not directly in the causal pathway. An example is low parental education, which is a cause of both increased screen time and an increased risk of obesity/overweight (2,3). In this situation, low parental education acts as what is called a confounder. If not recognised and controlled for, this could lead to a false interpretation of the true relationship between the two variables, for example by falsely attributing obesity solely to increased screen time.
We will not attempt to summarise the history, philosophy and applications of causal inference, but instead in this review focus on the use of a graphical tool, causal Directed Acyclic Graphs (DAGs). DAGs provide a simple way of graphically representing, communicating, and understanding key concepts of relevance to practicing clinicians and researchers, and are particularly helpful in delineating and understanding confounders and potential sources of bias in exposure-outcome relationships. A bias is a systematic, incorrect interpretation of the true relationship between the exposure and the outcome. Biases differ from random error in that they distort our interpretation of true causal relationships in a non-random way: repeating a study, or increasing the sample size, will not lead to the elimination of bias. Confounders and biases may distort our interpretations in a variety of ways. If the researcher is not aware of confounders, and does not appropriately control for them, a variable may erroneously appear to cause the outcome where there is no causal relationship, or the magnitude of this relationship may be distorted. Conversely, if a researcher treats variables associated with both exposure and outcome as confounders when they in fact are not (see below), and inappropriately controls for them, this too may cause bias.
We first discuss how to create and interpret DAGs, using paediatric examples to demonstrate how they can identify, and appropriately correct for, confounders and biases in observational studies that can affect our ability to draw correct conclusions about causal relationships. We then outline how they can be helpful in interpreting interventional studies, and understanding potential threats to validity in these. After outlining some of the limitations of DAGs, we conclude with some thoughts on how they might prove useful for researchers and clinicians.
Creating and interpreting a DAG
Diagrams have been used to represent causal relationships for many years, in a variety of fields ranging from genetics to sociology (4–7). However, in recent years an epidemiological literature outlining a standard terminology and set of rules (8), has grown around DAGs. In a DAG, causal relationships are represented by arrows between the variables, pointing from cause to effect. As seen in Figure 1a, an arrow from screen time to obesity means that we hypothesise that a change in screen time causes a change in adiposity. DAGs must obey two rules. First, they must be acyclic, which means that it is impossible to start at any variable in the DAG, follow the directed arrows forward, and end up at the same variable. In other words, a DAG must not contain a feedback loop where a variable causes itself. A corollary of this is that a causal relationship between two variables must be unidirectional: they cannot cause each other. Bidirectional arrows, often used to depict feedback loops, are in fact a simple graphical expedient to show as a single variable what in reality is a sequence of variables(9). Second, for a DAG to be complete, the shared cause of any two variables in the DAG must be included (10).
In a DAG, two variables can be connected by what is called a “path” between them. Open “paths” represent statistical associations between two variables; closed “paths” the absence of such associations (the correspondence between path “openness” and associations in DAGs derives from mathematics)(8). Variables and arrows can be combined into three main types of paths:
-
1)
Directed Paths: all arrows point in the same direction, and the association between these variables reflects a causal relationship. Such a path is open. Increased screen time leads directly to reduced physical activity, which in turn leads to an increased risk of obesity [Figure 1b].
-
2)
Backdoor Paths: this is where two variables share the same cause. Looking at Figure 1c, increased screen time and childhood obesity are influenced by low parental education. In the terminology of DAGs, screen time and childhood adiposity are said to be connected by a backdoor path through low parental education. This path (one that connects exposure and outcome through a third variable, including an arrow entering rather than emanating from the exposure) is open, and depicts a statistical association between screen time and adiposity, through low parental education. However, the association transmitted by this backdoor path is non-causal, and represents the basic structure of confounding.
-
3)
Closed (or blocked) Paths: this is when two variables have the same effect, called a collider [Figure 1d]. For example, both screen time and obesity have been found to increase the risk of low self-esteem, self-harm and suicidal ideation in adolescents (11–13), that is self-harm is a collider between physical activity and obesity. In this situation, unlike directed and backdoor paths, this path is closed: there is no association between screen time and adiposity transmitted through self-harm.
Researchers can change the status of a path from open to closed, or vice versa, by acting on (conditioning on, or controlling for) a variable, which can occur through study design or statistical adjustments such as restriction, stratification, matching, standardisation, or multivariable regression. Controlling for parental education (a confounder) in our example will close this backdoor path, and lead to a less biased relationship in the directed path between screen time and adiposity. Conversely, conditioning on a variable in a directed path between two variables (a mediator), physical activity in our example, closes this path, and could lead to an incorrect estimate of the true overall association between the variables. Finally, conditioning on a variable in a closed path (a collider) opens this path and leads to transmission of a non-causal association. If we were to mistakenly identify self-harm as a confounder, and condition on it, this would distort the true relationship between the exposure and the outcome. Equally, if a study examined the relationship between screen time and obesity in a group of adolescents selected because they had a history of self-harm, this would again represent conditioning on a collider, in this case by restriction (see “selection bias” below).
Using DAGs to understand confounding: Paracetamol and wheeze
In the language of DAGs, a confounder is defined as a common cause of the exposure and the outcome. This situation occurs in nature; it is not created by the researcher. Confounders, if not identified and appropriately adjusted for (conditioned on), can distort the true causal relationship between an exposure and an outcome. As an example, a number of observational studies (14,15) found an association between receiving paracetamol in the first year of life and later risk of developing wheeze/asthma. DAGs have proven useful in examining this relationship (16).
In fact, the increased risk of later wheezing may not be due to paracetamol, but to confounding. Viral respiratory tract infections – for which paracetamol is prescribed - are common in children, trigger wheezing and might increase the risk of later wheeze (17,18). That is, viral infections act as a confounder in the relationship between paracetamol usage and wheezing [Figure 2a] (so-called confounding by indication)(19,20). Therefore we might expect these two variables (paracetamol use and wheeze) to be statistically associated through a common cause, even if there is no direct causal association. To identify the true causal relationship between paracetamol use and later wheeze, one should condition on viral infections. Conditioning in a DAG is generally shown as a box around the variable, and as described previously changes an open path (in this case a backdoor path) to a closed path [Figure 2].
Once we have closed this backdoor path by making the appropriate statistical adjustments, and assuming there are no other confounders, we should be able to identify the true magnitude -if any - of the relationship between paracetamol use and wheeze. Supporting this hypothesis, studies which have conditioned on respiratory tract infections in early life find a diminished relationship between paracetamol use and later wheeze, suggesting that part of this apparent relationship may be due to confounding (16,21,22).
Risks of inappropriate adjustment: Overadjustment
Whilst failing to identify confounders can threaten the validity of findings, the converse, inappropriately identifying other variables as confounders, can also be problematic (23). Take the relationship between the administration of antenatal steroids (the exposure) and the outcome of bronchopulmonary dysplasia (BPD) [Figure 3a]. A number of studies found that administration of antenatal steroids was not associated with a decreased risk of BPD (24–26), despite reducing several known risk factors for BPD. These studies adjusted for variables such as severity of neonatal disease (24), and need for mechanical ventilation (24–26), the rationale being that these factors are associated with both the administration of antenatal steroids and an increased risk of BPD.
However, these variables do not fulfil the definition of a confounder (they are not causes of both exposure and outcome), but act as mediators between the exposure (antenatal steroids) and the outcome (BPD) [Figure 3b]. Conditioning on a mediator closes one of the causal paths between antenatal steroids and BPD and distorts the overall relationship between the two. This is represented in Figure 3c by the boxes surrounding the two intermediate variables. Adjusting for (conditioning on) an intermediate results in overadjustment. With this process we remove the part of the association between antenatal steroids and BPD mediated through the reduction of severe illness, or the reduced need for mechanical ventilation (27). This adjustment can attenuate the true causal effect of the exposure or even reverse it, leading to counterintuitive results.
Supporting the interpretation that overadjustment might explain the apparent lack of effect of antenatal steroids on the development of BPD, a cohort study (28) found a negative (protective) association between antenatal steroid administration and mediators (severity of neonatal disease and the need for mechanical ventilation), and a positive association between the mediators and the risk of the BPD. The authors found a protective effect of steroids on BPD when intermediate factors were not adjusted for, but not when they adjusted for these intermediate variables [Figure 3c]. That is, inappropriately conditioning on mediators led to a distortion of the true (likely protective) relationship between antenatal steroids and risk of developing BPD. Of note, whilst conditioning on mediators distorts the overall relationship between an exposure and an outcome, Figure 3c also shows that this should reveal the direct effect of steroids on BPD (with the highly simplified assumption that there are no other common causes of steroid administration, BPD, or the mediators); this concept underlies the field of mediation analysis (29,30).
Selection bias in the language of DAGs
In observational or interventional studies, selection bias occurs when both the exposure and the outcome affect whether an individual is included in the analyses. In the language of DAGs, selection bias occurs due to inappropriate conditioning on a collider. An example are studies that examined HLA subtypes (the exposure) as risk factors for the development of acute lymphoblastic leukaemia (ALL, the outcome). Initial cross-sectional studies using prevalent (i.e. already diagnosed) cases and matched controls (31–33) found an increased risk of developing ALL in individuals with the HLA-A2 serotype [Figure 4a].
However, a subsequent study (34) examined patients presenting with a new diagnosis of ALL (incident cases), and found that the frequency of HLA-A2 in these individuals matched that of the general population. When examining survivors at the time of typing, they found that the frequency of HLA-A2 was higher than in the general population, and that length of survival appeared to be associated with the HLA-A2 serotype. That is, HLA-A2 was not associated with an increased risk of developing ALL, but rather with an increased chance of survival. Because previous studies had examined a prevalent population (patients with a previous diagnosis of ALL) rather than an incident one (those presenting with a new diagnosis), they had examined the relationship between HLA-A2 and leukaemia in a sample restricted to survivors, and an incorrect association was inferred between the exposure and the outcome.
Presented as a DAG, the source of this bias, also called “incidence-prevalence” or Neyman’s bias (35,36), can be seen to be due to conditioning on a collider. In this case, both the exposure and the outcome influence a third variable, survival, which acted as a collider [Figure 4b]. If we consider all patients - surviving or not - by including newly diagnosed patients, the two variables are not associated (the path is closed). Conditioning on survival (by restricting the inclusion to patients surviving a certain time, as shown by the box surrounding survival in the figure), opens the path through the collider, and we enrich the sample for individuals with HLA-A2 among ALL patients, creating a (spurious) association.
Overadjustment and selection bias
Overadjustment and selection bias can also coexist. Take the relationship between maternal pre-eclampsia (the exposure) and subsequent cerebral palsy (the outcome): pre-eclampsia is hypothesised to be directly causative of cerebral palsy. However, pre-eclampsia is also associated with a higher risk of medically indicated preterm birth, which in turn is associated with a higher risk of cerebral palsy [Figure 5a]. We might think to examine the effect of pre-eclampsia after adjusting for preterm birth or gestational age (as if this represented confounding) [Figure 5b], shown by the box around this variable. However, here preterm birth is an intermediate between pre-eclampsia and cerebral palsy, and not a common cause of both.
By (over)adjusting, we take away part of the detrimental effect of pre-eclampsia, that mediated through preterm birth. This adjustment can attenuate the true effect of the exposure and even reverse it. An early study found that maternal pre-eclampsia was protective in very preterm infants, but detrimental to those born at a later gestation (37). This was a surprising result – as a pathologic condition, we would expect pre-eclampsia to be detrimental across the entire spectrum of gestations (38,39). Visualised as a DAG, this finding could be due to the conditioning on gestational age at birth. In this case, conditioning does not take place through statistical adjustment, but by stratification (performing separate analyses in two groups) based on the criterion of gestational age at birth (preterm birth). This closes the causal path from pre-eclampsia to cerebral palsy via preterm birth, and could lead to bias.
However, the true situation is probably more complex. Examining a more realistic DAG, to which chorioamnionitis (as another cause of both preterm birth and cerebral palsy) has been added [Figure 5c], one can see that gestational age, as the shared effect of both pre-eclampsia and chorioamnionitis, also acts as a collider. Critically, closing one path between two variables may lead to a change in other potential paths between the two. Conditioning on gestational age opens a previously closed path, from pre-eclampsia to cerebral palsy through preterm birth and chorioamnionitis. This creates a new source of bias, and another reason for the counterintuitive association found in our example. If we analyse the relationship between pre-eclampsia and the outcome within the group of preterm infants, a faulty comparison group and a spurious association will be created. If a preterm baby is born to a mother who has pre-eclampsia, the baby will be less likely to have chorioamnionitis and vice versa. Among preterm infants the effect of pre-eclampsia on cerebral palsy will be compared with the effect of another significant cause of cerebral palsy, chorioamnionitis, and pre-eclampsia will falsely appear to be protective. Thus, the estimated direct causal effect of pre-eclampsia on the outcome will be biased (through the effect of chorioamnionitis). Although widely used, conditioning on gestational at birth in studies of prenatal exposures and their relationship to postnatal outcomes may not reduce but actually lead to bias through overadjustment and faulty comparisons as illustrated above (40–43), and generate counterintuitive results and apparent changes of effect in different groups of patients.
Disentangling confounders from mediators and colliders can prove challenging. Statistical tests reveal only the strength of an association between two variables, not the causal relationship between them, and in this context the researcher must rely on causal reasoning (44). Here, DAGs, supported by subject-matter knowledge, can be helpful as they illustrate a modern definition of confounding (45,46): a common cause of both the exposure and the outcome under study. This demonstrates how older definitions (47,48), focusing on factors associated with the exposure and also related to the risk of disease in the unexposed, and not being an intermediate (i.e. on associations rather than presumed causal relationships), may lead to biased statistical estimates due to inappropriate adjustment for a common effect of two variables (conditioning on a collider).
The language of DAGs as applied to interventional studies
The DAG in Figure 6a shows the causal structure of a randomised controlled trial (RCT) randomising women to an intervention promoting breastfeeding, the Baby Friendly Hospital Initiative (BFHI) (49), to examine cognitive development in childhood (50). Random assignment determines the exposure (BFHI) which in turn influences the outcome (cognitive development) via mediators such as breastfeeding (and probably others, not shown). Even if there are confounders that influence both the chance of breastfeeding and the outcome, they do not bias the causal effect of the random assignment on the outcome, as breastfeeding is a collider in the path between random assignment and cognitive development via potential confounders, and blocks this path. Therefore, as regards confounding, an intention-to-treat analysis (according to how a mother was randomised) is likely to be unbiased, and DAGs demonstrate the critical value of randomisation in inferring unbiased causal relationships. A per-protocol analysis of whether a mother actually breastfed is not immune to confounding, as it resembles an observational study where a backdoor path exists between breastfeeding and the outcome via any confounders (51). Of course, the effect of treatment actually received may be of interest, and a per-protocol analysis, carefully controlled for confounders, may be justified to extract the maximum of information from clinical trials (52).
Whilst RCTs and intention-to-treat analyses minimise threats to validity posed by confounding, they are not immune to other biases, including information bias (see glossary) and bias due to differential loss to follow-up. It is plausible that the BFHI might lead to differences in health awareness in the intervention group, leading to a different likelihood of follow up clinic attendance. In addition, mothers with a child exhibiting signs of impaired cognitive development could also be more likely to attend follow-up to find out whether their child was developing normally. Seen as a DAG [Figure 6b], both the BFHI and the outcome have a causal effect on the chance of follow-up. Examining data only on children who attend follow-up (conditioning on follow-up, represented by the box around clinic attendance), introduces bias into the relationship between the intervention and cognitive development via a faulty comparison, opening an otherwise closed path. Conditioning on a collider leads to what is called collider stratification bias (53,54). This example illustrates that whilst RCTs minimise confounding, they are still susceptible to bias such as that introduced by loss to follow up.
Limitations of DAGs and some caveats on their use
One limitation of DAGs is their non-parametric nature: they neither specify the form of the causal relationships, nor depict the size of the associations, and remain qualitative in nature. A DAG shows that uncontrolled confounding might bias the results, but does not give a quantitative measure of this (10,55). Another is that a DAG can only be as good as the background information used to create it (56); a DAG is complete and therefore has a causal interpretation only if it contains all common causes of any two variables (all confounders), including both measured and unmeasured variables. A further limitation is the inability of DAGs to depict random, as opposed to systematic, error. For instance, randomisation allocates known and unknown confounders equally between groups in the “average” ideal case. This does not always happen in real-world RCTs, where confounding, due to random differences at baseline, can - and indeed often does – occur, but is not shown by DAGs. Ignoring random error also means that when examining misclassification (information) bias, concepts such as non-differential measurement error (where error is randomly distributed across the groups being studied) cannot be incorporated into a DAG.
More subtly, and of relevance not only to DAGs but to any analytical approach, the research question influences how we consider variables and therefore analyse the data. A variable may be simultaneously a mediator, a collider, or a confounder, can be interpreted differently in separate research questions using the same data, and these will dictate different analytical strategies. For example, in the study looking at the relationship between antenatal steroids and BPD, one could ask about the effect of steroids (exposure) on the outcome. Here the need for mechanical ventilation is a mediator and should not be conditioned on. However, if one were to investigate the effect of mechanical ventilation (treating it as an exposure) on the risk of BPD, antenatal steroids are a confounder in the backdoor path between mechanical ventilation and the outcome, and should be conditioned on. In addition, it is possible that two researchers might ask the same research question, using the same variables in their analyses, but choose to condition on different variables because they have different opinions regarding the underlying causal relationship. Representing their analyses as DAGs allows an explicit comparison between the two approaches should their findings differ.
Finally, throughout this article we have of necessity presented simple examples to illustrate our key points. Partly, this is inherent to the approach: any graphical method is likely to over-simplify the complex biological reality being investigated. DAGs have for this reason attracted criticism because they may lead to oversimplification in the field of causal inference (57,58). DAGs however do not lead per se to oversimplified analyses, but only explicitly present their underlying assumptions. There are also a number of theoretical points, such as the exact distinction between selection bias and confounding, that remain contested (59,60). We therefore direct interested readers to more in-depth reviews about the theory and limitations of DAGs (8,10,61,62).
Conclusion
The aim of much clinical research is to elucidate and test causal relationships. In epidemiological terms, we want to establish exposures that might be amenable to modification, and test interventions acting on these leading to an improvement in health outcomes. Critical to a correct interpretation of causal relationships is correctly identifying and appropriately adjusting for confounders and potential sources of bias. In this review we have shown that DAGs can illustrate threats to validity found to greater or lesser extents in virtually all clinical research: confounding, selection (or collider-stratification) bias, and overadjustment. We believe that DAGs are useful for practicing clinicians in interpreting research that deals with proposed causal relationships, by allowing them to frame research questions and findings using the concepts of exposures, outcomes, intermediates, confounders and colliders. They remind those planning observational studies to collect sufficient data to condition on possible confounders, and to appropriately adjust for these in analyses, whilst refraining from inappropriate adjustments. Finally, they show that whilst randomisation does minimise the risks of confounding in interventional studies, possibilities for bias remain, for example through loss to follow-up.
Supplementary Material
Acknowledgements
We thank Professor Mark Klebanoff and the two reviewers for their careful reading of our manuscript and constructive comments.
Statement of financial support: TW is the recipient of a Wellcome Trust Grant [204802/Z/16/Z].
Footnotes
Disclosure statement: None of the authors have any financial ties to products in the study, and there are no potential/perceived conflicts of interest.
Author Contributions: All authors made substantial contributions to the conception, design and drafting of this review, and approved the final manuscript as submitted.
References
- 1.Nightingale CM, Rudnicka AR, Donin AS, et al. Screen time is associated with adiposity and insulin resistance in children. Arch Dis Child. 2017;102:612–6. doi: 10.1136/archdischild-2016-312016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Nagel G, Wabitsch M, Galm C, et al. Determinants of obesity in the Ulm Research on Metabolism, Exercise and Lifestyle in Children (URMEL-ICE) Eur J Pediatr. 2009;168:1259–67. doi: 10.1007/s00431-009-1016-y. [DOI] [PubMed] [Google Scholar]
- 3.Falbe J, Rosner B, Willett WC, Sonneville KR, Hu FB, Field AE. Adiposity and different types of screen time. Pediatrics. 2013;132:e1497–505. doi: 10.1542/peds.2013-0887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wright S. Correlation and Causation. J Agric Res. 1921;20:557–85. [Google Scholar]
- 5.Wright S. The Theory of Path Coefficients a Reply to Niles’s Criticism. Genetics. 1923;8:239–55. doi: 10.1093/genetics/8.3.239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Holland PW. Causal Inference, Path Analysis, and Recursive Structural Equations Models. Sociol Methodol. 1988;18:449. [Google Scholar]
- 7.Freedman DA. As Others See Us: A Case Study in Path Analysis. J Educ Stat. 1987;12:101. [Google Scholar]
- 8.Pearl J. Causal diagrams for empirical research. Biometrika. 1995;82:669–88. [Google Scholar]
- 9.Hernán M, Schisterman EF, Hernández-Díaz S. Invited commentary: composite outcomes as an attempt to escape from selection bias and related paradoxes. Am J Epidemiol. 2014;179:368–70. doi: 10.1093/aje/kwt283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Glymour M, Greenland S. Modern Epidemiology. Philadelphia: Lippincott Williams & Wilkins; 2008. Causal diagrams; pp. 183–212. [Google Scholar]
- 11.Oshima N, Nishida A, Shimodera S, et al. The Suicidal Feelings, Self-Injury, and Mobile Phone Use After Lights Out in Adolescents. J Pediatr Psychol. 2012;37:1023–30. doi: 10.1093/jpepsy/jss072. [DOI] [PubMed] [Google Scholar]
- 12.Liu M, Wu L, Yao S. Dose-response association of screen time-based sedentary behaviour in children and adolescents and depression: a meta-analysis of observational studies. Br J Sports Med. 2016;50:1252–8. doi: 10.1136/bjsports-2015-095084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Maras D, Flament MF, Murray M, et al. Screen time is associated with depression and anxiety in Canadian youth. Prev Med (Baltim) 2015;73:133–8. doi: 10.1016/j.ypmed.2015.01.029. [DOI] [PubMed] [Google Scholar]
- 14.Beasley R, Clayton T, Crane J, et al. Association between paracetamol use in infancy and childhood, and risk of asthma, rhinoconjunctivitis, and eczema in children aged 6–7 years: analysis from Phase Three of the ISAAC programme. Lancet. 2008;372:1039–48. doi: 10.1016/S0140-6736(08)61445-2. [DOI] [PubMed] [Google Scholar]
- 15.Etminan M, Sadatsafavi M, Jafari S, Doyle-Waters M, Aminzadeh K, FitzGerald JM. Acetaminophen Use and the Risk of Asthma in Children and Adults. Chest. 2009;136:1316–23. doi: 10.1378/chest.09-0865. [DOI] [PubMed] [Google Scholar]
- 16.Rusconi F, Gagliardi L, Galassi C, et al. Paracetamol and antibiotics in childhood and subsequent development of wheezing/asthma: association or causation? Int J Epidemiol. 2011;40:662–7. doi: 10.1093/ije/dyq263. [DOI] [PubMed] [Google Scholar]
- 17.Blanken MO, Rovers MM, Molenaar JM, et al. Respiratory Syncytial Virus and Recurrent Wheeze in Healthy Preterm Infants. N Engl J Med. 2013;368:1791–9. doi: 10.1056/NEJMoa1211917. [DOI] [PubMed] [Google Scholar]
- 18.Jartti T, Gern JE. Role of viral infections in the development and exacerbation of asthma in children. J Allergy Clin Immunol. 2017;140:895–906. doi: 10.1016/j.jaci.2017.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Bosco JLF, Silliman RA, Thwin SS, et al. A most stubborn bias: no adjustment method fully resolves confounding by indication in observational studies. J Clin Epidemiol. 2010;63:64–74. doi: 10.1016/j.jclinepi.2009.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kyriacou DN, Lewis RJ. Confounding by Indication in Clinical Research. JAMA. 2016;316:1818. doi: 10.1001/jama.2016.16435. [DOI] [PubMed] [Google Scholar]
- 21.Wickens K, Ingham T, Epton M, et al. The association of early life exposure to antibiotics and the development of asthma, eczema and atopy in a birth cohort: confounding or causality? Clin Exp Allergy. 2008;38:1318–24. doi: 10.1111/j.1365-2222.2008.03024.x. [DOI] [PubMed] [Google Scholar]
- 22.Lowe AJ, Carlin JB, Bennett CM, et al. Paracetamol use in early life and asthma: prospective birth cohort study. BMJ. 2010;341:c4616. doi: 10.1136/bmj.c4616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ananth CV, Schisterman EF. Confounding, causality, and confusion: the role of intermediate variables in interpreting observational studies in obstetrics. Am J Obstet Gynecol. 2017 doi: 10.1016/j.ajog.2017.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Egreteau L, Pauchard JY, Semama DS, et al. Chronic oxygen dependency in infants born at less than 32 weeks’ gestation: incidence and risk factors. Pediatrics. 2001;108:E26. doi: 10.1542/peds.108.2.e26. [DOI] [PubMed] [Google Scholar]
- 25.Van Marter LJ, Allred EN, Leviton A, et al. Antenatal glucocorticoid treatment does not reduce chronic lung disease among surviving preterm infants. J Pediatr. 2001;138:198–204. doi: 10.1067/mpd.2001.110980. [DOI] [PubMed] [Google Scholar]
- 26.Marshall DD, Kotelchuck M, Young TE, Bose CL, Kruyer L, O’Shea TM. Risk factors for chronic lung disease in the surfactant era: a North Carolina population-based study of very low birth weight infants. North Carolina Neonatologists Association. Pediatrics. 1999;104:1345–50. doi: 10.1542/peds.104.6.1345. [DOI] [PubMed] [Google Scholar]
- 27.Roberts D, Brown J, Medley N, Dalziel SR. Antenatal corticosteroids for accelerating fetal lung maturation for women at risk of preterm birth. In: Roberts D, editor. Cochrane Database of Systematic Reviews. Chichester, UK: John Wiley & Sons, Ltd; 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Gagliardi L, Bellù R, Rusconi F, Merazzi D, Mosca F. Antenatal steroids and risk of bronchopulmonary dysplasia: a lack of effect or a case of over-adjustment? Paediatr Perinat Epidemiol. 2007;21:347–53. doi: 10.1111/j.1365-3016.2007.00814.x. [DOI] [PubMed] [Google Scholar]
- 29.MacKinnon DP, Fairchild AJ, Fritz MS. Mediation analysis. Annu Rev Psychol. 2007;58:593–614. doi: 10.1146/annurev.psych.58.110405.085542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Richiardi L, Bellocco R, Zugna D. Mediation analysis in epidemiology: methods, interpretation and bias. Int J Epidemiol. 2013;42:1511–9. doi: 10.1093/ije/dyt127. [DOI] [PubMed] [Google Scholar]
- 31.Rogentine GN, Yankee RA, Gart JJ, Nam J, Trapani RJ. HL-A antigens and disease. J Clin Invest. 1972;51:2420–8. doi: 10.1172/JCI107055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Lawler SD, Klouda PT, Hardisty RM, Till MM. Histocompatibility and acute lymphoblastic leukaemia. Lancet. 1971;1:699. doi: 10.1016/s0140-6736(71)92704-8. [DOI] [PubMed] [Google Scholar]
- 33.Walford RL, Finkelstein S, Neerhout R, Konrad P, Shanbrom E. Acute childhood leukaemia in relation to the HL-A human transplantation genes. Nature. 1970;225:461–2. doi: 10.1038/225461a0. [DOI] [PubMed] [Google Scholar]
- 34.Rogentine GN, Trapani RJ, Yankee RA, Henderson ES. HL-A antigens and acute lymphocytic leukemia: the nature of the HL-A2 association. Tissue Antigens. 1973;3:470–6. doi: 10.1111/j.1399-0039.1973.tb00517.x. [DOI] [PubMed] [Google Scholar]
- 35.Sackett DL. Bias in analytic research. J Chronic Dis. 1979;32:51–63. doi: 10.1016/0021-9681(79)90012-2. [DOI] [PubMed] [Google Scholar]
- 36.Neyman J. Statistics; servant of all sciences. Science (80-) 1955;122:401–6. doi: 10.1126/science.122.3166.401. [DOI] [PubMed] [Google Scholar]
- 37.Murphy DJ, Sellers S, MacKenzie IZ, Yudkin PL, Johnson AM. Case-control study of antenatal and intrapartum risk factors for cerebral palsy in very preterm singleton babies. Lancet (London, England) 1995;346:1449–54. doi: 10.1016/s0140-6736(95)92471-x. [DOI] [PubMed] [Google Scholar]
- 38.Mann JR, McDermott S, Griffith MI, Hardin J, Gregg A. Uncovering the complex relationship between pre-eclampsia, preterm birth and cerebral palsy. Paediatr Perinat Epidemiol. 2011;25:100–10. doi: 10.1111/j.1365-3016.2010.01157.x. [DOI] [PubMed] [Google Scholar]
- 39.Blair E, Watson L. Cerebral palsy and perinatal mortality after pregnancy-induced hypertension across the gestational age spectrum: observations of a reconstructed total population cohort. Dev Med Child Neurol. 2016;58:76–81. doi: 10.1111/dmcn.13014. [DOI] [PubMed] [Google Scholar]
- 40.Wilcox AJ, Weinberg CR, Basso O. On the pitfalls of adjusting for gestational age at birth. Am J Epidemiol. 2011;174:1062–8. doi: 10.1093/aje/kwr230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Kramer MS, Zhang X, Platt RW. Commentary: Yerushalmy, maternal cigarette smoking and the perinatal mortality crossover paradox. Int J Epidemiol. 2014;43:1378–81. doi: 10.1093/ije/dyu164. [DOI] [PubMed] [Google Scholar]
- 42.VanderWeele TJ, Mumford SL, Schisterman EF. Conditioning on Intermediates in Perinatal Epidemiology. Epidemiology. 2012;23:1–9. doi: 10.1097/EDE.0b013e31823aca5d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Snowden JM, Basso O. Causal inference in studies of preterm babies: a simulation study. BJOG. 2017 doi: 10.1111/1471-0528.14942. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Pearl J. Causality: models, reasoning, and inference. Cambridge: Cambridge University Press; 2009. Simpson’s Paradox: An Anatomy; pp. 173–82. [Google Scholar]
- 45.Weinberg CR. Toward a clearer definition of confounding. Am J Epidemiol. 1993;137:1–8. doi: 10.1093/oxfordjournals.aje.a116591. [DOI] [PubMed] [Google Scholar]
- 46.Howards PP, Schisterman EF, Poole C, Kaufman JS, Weinberg CR. Toward a clearer definition of confounding revisited with directed acyclic graphs. Am J Epidemiol. 2012;176:506–11. doi: 10.1093/aje/kws127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Breslow N, Day N. Statistical methods in cancer research Vol I. The analysis of case-control studies. IARC Scien. Lyon: International Agency for Research; 1980. [PubMed] [Google Scholar]
- 48.Kleinbaum D, Kupper L, Morgenstern H. Epidemiologic research: principles and quantitative methods. Belmont, CA: Lifetime Learning Publications; 1982. [Google Scholar]
- 49.World Health Organization, Unicef. Protecting, promoting and supporting breast-feeding: the special role of maternity services. Geneva: World Health Organization; 1989. [Google Scholar]
- 50.Kramer MS, Aboud F, Mironova E, et al. Breastfeeding and Child Cognitive Development. Arch Gen Psychiatry. 2008;65:578. doi: 10.1001/archpsyc.65.5.578. [DOI] [PubMed] [Google Scholar]
- 51.Hernán MA. A definition of causal effect for epidemiological research. J Epidemiol Community Health. 2004;58:265–71. doi: 10.1136/jech.2002.006361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Hernán MA, Robins JM. Per-Protocol Analyses of Pragmatic Trials. N Engl J Med. 2017;377:1391–8. doi: 10.1056/NEJMsm1605385. [DOI] [PubMed] [Google Scholar]
- 53.Greenland S. Quantifying biases in causal models: classical confounding vs colliderstratification bias. Epidemiology. 2003;14:300–6. [PubMed] [Google Scholar]
- 54.Liu W, Brookhart MA, Schneeweiss S, Mi X, Setoguchi S. Implications of M bias in epidemiologic studies: a simulation study. Am J Epidemiol. 2012;176:938–48. doi: 10.1093/aje/kws165. [DOI] [PubMed] [Google Scholar]
- 55.VanderWeele TJ, Robins JM. Directed acyclic graphs, sufficient causes, and the properties of conditioning on a common effect. Am J Epidemiol. 2007;166:1096–104. doi: 10.1093/aje/kwm179. [DOI] [PubMed] [Google Scholar]
- 56.Pearce N, Lawlor DA. Causal inference-so much more than statistics. Int J Epidemiol. 2016;45:1895–903. doi: 10.1093/ije/dyw328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Vandenbroucke JP, Broadbent A, Pearce N. Causality and causal inference in epidemiology: the need for a pluralistic approach. Int J Epidemiol. 2016;45:1776–86. doi: 10.1093/ije/dyv341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Krieger N, Davey Smith G. The tale wagged by the DAG: broadening the scope of causal inference and explanation for epidemiology. Int J Epidemiol. 2016;45 doi: 10.1093/ije/dyw114. dyw114. [DOI] [PubMed] [Google Scholar]
- 59.Haneuse S. Distinguishing Selection Bias and Confounding Bias in Comparative Effectiveness Research. Med Care. 2016;54:e23–9. doi: 10.1097/MLR.0000000000000011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Rothman KJ, Gallacher JE, Hatch EE. Why representativeness should be avoided. Int J Epidemiol. 2013;42:1012–4. doi: 10.1093/ije/dys223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Epidemiology. 1999;10:37–48. [PubMed] [Google Scholar]
- 62.Shrier I, Platt RW. Reducing bias through directed acyclic graphs. BMC Med Res Methodol. 2008;8:70. doi: 10.1186/1471-2288-8-70. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.