Abstract
Understanding the safety of medication use during pregnancy relies on observational studies: However, confounding in observational studies poses a threat to the validity of estimates obtained from observational data. Newer methods, such as marginal structural models and propensity calibration, have emerged to deal with complex confounding problems, but these methods have seen limited uptake in the pregnancy medication literature. In this article, we provide an overview of newer advanced methods for confounding control and show how these methods are relevant for pregnancy medication safety studies.
1. INTRODUCTION
More than half of all pregnant women in Western countries take medication during pregnancy,1, 2, 3 making studies of medication safety a pressing public health concern. Studying medication safety in pregnancy presents particular challenges: Effects of medications on fetal development can be unpredictable, vulnerability to exposure changes during pregnancy, and outcomes may occur early in fetal development but be detected later.4 In the general population, knowledge of medication efficacy and safety is primarily based on randomized controlled trials. However, randomized trials routinely exclude pregnant women due to uncertainties about the effects of medications on fetal development, meaning that studies of medication safety in pregnancy must rely on reproductive toxicity studies in animals and on observational data in humans. Several landmark cases, such as the thalidomide disaster, have taught us that animal models for teratogenicity do not necessarily translate to humans. Observational studies, using data from cohort studies, registries, and administrative databases,5 are opportunities for understanding the risks of medication use in pregnancy, and in 2005, the Food and Drug Administration acknowledged that observational studies are the best method for assessing the maternal and fetal safety of using medication during pregnancy.6 However, confounding is a major source of bias in observational studies. Recent years have seen the rapid development of advanced methods for dealing with confounding, yet uptake of these methods has been slow in the pregnancy medication literature. This is unfortunate, because in this field, it is arguably especially important that researchers use the best methods for confounding control, because the consequences for getting the wrong answer are so profound: Failing to detect true effects of medication exposure can have enormous effects in the population, and falsely raising the alarm for a safe drug can result in women forgoing needed therapies and, in some cases, terminating wanted pregnancies.6
In this paper, we advocate for a greater use of advanced methods for confounding control in the pregnancy medication safety research field and provide an overview of these methods under the following framework:
How does this method help us to make fair comparisons between the exposed and unexposed groups?
How has this method been applied in the pregnancy medication literature?
How is the method used in practice?
What are the important assumptions for this method?
What are the major strengths and limitations of the method?
Table 1 provides an outline of pregnancy medication studies using advanced methods to deal with confounding. This paper gives a useful reference for both students and experienced researchers who wish to gain new skills in advanced methods for confounding control.
Table 1.
Examples of application of advanced confounding control methods in the pregnancy medication safety literature
Medication and Study Reference | Outcome | Confounder(s) | Confounding Problem(s) | Method(s) Used | ||||||
---|---|---|---|---|---|---|---|---|---|---|
Time Varying | Complex/High Dimensional | Unmeasured Confounders | Propensity Scores/Summary Scores | Marginal Structural Models | Propensity Calibration | Sibling/Family Studies | Instrumental Variables | |||
Ondansetron (Pasternak et al14) | Malformations | Nausea/vomiting; maternal characteristics, comorbidities, other medications, pregnancy history. | x | x | ||||||
Lithium (Patorno et al15) | Cardiac malformations | Maternal comorbidities, other medications, maternal characteristics. | x | x | ||||||
Statins (Bateman et al17) | Malformations | Maternal characteristics, obstetric and medical conditions, other medications. | x | x | ||||||
Triptans (Wood et al25) | Neurodevelopment | Other medications (time varying), maternal characteristics; migraine severity. | x | x | ||||||
Iron supplementation (Bodnar et al26) | Anemia | Maternal baseline characteristics; gastric symptoms; serum erritin and hemoglobin concentration. | x | x | ||||||
Triptans (Wood et al35) | Neurodevelopment | Other medications, maternal characteristics; migraine severity, attitudes about medication use. | x | x | x | x | ||||
SSRI (Nezvalová‐Henriksen et al41; Viktorin et al42) | Gestational age, birth weight | Family factors, maternal depression; illnesses, other medications. | x | x | x | |||||
Anti‐epileptic drugs (Bech et al58) | Spontaneous abortion | Severity of maternal epilepsy; maternal characteristics, environmental exposures, comorbidities. | x | x | ||||||
SSRI (Swanson et al46) | Maternal depression relapse | Maternal depression severity; comorbidities, other medications, maternal characteristics, proxies for severity. | x | x | x | x |
Abbreviation: SSRI, selective serotonin reuptake inhibitor.
KEY POINTS.
Studies of the safety of medication use during pregnancy depend mainly on observational studies, which are subject to confounding bias.
Novel methods for confounding control have seen limited uptake in the pregnancy medication safety literature.
Application of novel methods is necessary to appropriately address the complex confounding scenarios found in pregnancy studies.
2. CONFOUNDING IN PREGNANCY MEDICATION STUDIES
Confounding control begins with a review of the literature and consultation with subject‐area experts. Directed acyclic graphs (DAGs) provide a graphical means to represent the causal structure the investigator believes is present7 and guide study design, data collection, and analysis. Figure 1 is an example DAG showing one possible causal model for prenatal antidepressant exposure and childhood neurodevelopment, with potential biasing paths, including confounders (other psychiatric illness, other psychiatric medication use, depression severity, and genetics), which should be controlled as far as possible, as well as a mediator (gestational age), and a collider (live birth). Several nonbiasing paths, including a risk factor for the outcome that is unrelated to the exposure (child gender) and a predictor of exposure that is unrelated to the outcome (prepregnancy antidepressant use), are also shown. Obtaining unbiased effect estimates requires investigators to identify and control confounding, while avoiding bias from inappropriate control for colliders and mediators and loss of precision or confusing interpretation of estimates arising from control for factors only related to the exposure or outcome but not both.8 The Supporting Information contains a more comprehensive review of definitions of confounding, counterfactuals, and causal inference.
Figure 1.
Conceptual model for the effect of prenatal selective serotonin reuptake inhibitor (SSRI) exposure on attention deficit/hyperactivity disorder (ADHD), including a set of important confounders (depression severity, concomitant medication use, and genetics), a potential mediator (gestational age), a collider (live birth), and factors related only to the exposure (prepregnancy SSRI use) or the outcome (child gender)
2.1. Methods for measured confounders
In Box 1 (Supporting Information), we include a simplified illustration of confounding by measured factors and the methods to address confounding.
Confounder summary scores and marginal structural models (MSMs) work by reducing a large amount of information about an individual into a single summary score. Two individuals can have the same summary score but different individual confounder values (eg, a woman with a propensity score [PS] for antidepressant use of 0.5 might be an unemployed smoker with anxiety, or a nonsmoking lawyer with depression), but because their distribution of confounders is equivalent, any differences in outcome will be attributable only to exposure to the drug of interest. Fair comparisons between exposure groups can then be made because within each stratum of exposure, the distribution of common causes of exposure and outcome is the same.
2.1.1. Propensity scores (and other confounder summary scores)
The propensity score, which is the probability of exposure given observed confounders,9 reduces a large set of confounders to a single summary score. Propensity scores are commonly used in the medical literature; however, other summary score methods, including disease risk scores10 (preferred in the case of rare exposures) and polygenic risk scores11 (useful for cases when genetic confounding) are available.
Propensity scores are typically constructed using multivariable logistic regression, where exposure is the dependent variable and confounders are the independent variables. The PS model should include variables that are confounders or predictors of the outcome; inclusion of factors that are only predictors of exposure will increase variance without decreasing bias.12 High‐dimensional PSs, which include thousands of variables identified through computational algorithms, may also be useful for adjusting for unmeasured confounders, if the measured variables are partial proxies for the unmeasured confounders.13 The PS can be used to match, stratify, adjust, or weight the outcome model. Propensity scores, including high‐dimensional PS, have seen increased uptake in the pregnancy literature, ie, safety studies on ondansetron,14 lithium,15 antidepressants,16 and statins17 in pregnancy, but their use is still minimal compared to multivariable regression (Table 1). Box 1, in the Supporting Information, gives a simplified explanation of PS matching and weighting.
Assumptions
Use of PS requires several assumptions, including exchangeability (no unmeasured confounding) and positivity (nonzero probability of treatment). Neither assumption is formally testable. Positivity can be addressed by ensuring that the women in the sample all have the indication for the medication (ie, if assessing safety of antidepressants, all women in the sample should be at risk for treatment) and that no individuals with clear contraindications are included. Exchangeability is never assured; however, sensitivity analyses can yield estimates for how vulnerable an effect estimate may be to unmeasured confounding.
Strengths and limitations
PS is especially useful when working with a common treatment and rare outcome. They also separate the design of the study (modeling confounding) from modeling the outcome.18 However, for rare exposures, summary scores do not perform particularly well.19 In addition, use of PS methods may produce the appearance of effect modification and/or result in residual confounding in case control or case cohort studies20 or in cohort studies where exposure is misclassified.21
2.1.2. Marginal structural models
Marginal structural models address time‐varying exposure and confounding.22, 23 Rules for confounder adjustment state we must adjust for common causes of the exposure and outcome, but should not adjust for factors on the causal pathway. In the case of time‐varying exposure and confounding, we encounter a double bind: Factors that are confounders in one part of the causal structure are mediators in another part (Figure S1A). For example, when studying the safety of antidepressants, we may wish to control for depression severity. However, antidepressant use in earlier pregnancy predicts depressive symptoms in later pregnancy, which will also predict subsequent antidepressant use. Standard adjustments for depression severity will always be biased in this scenario.
Central to the MSM is the inverse probability of treatment weight. At each measurement time t, the investigator uses logistic regression to construct the numerator (probability of exposure) and denominator (probability of exposure, given baseline predictors and history of exposure at time t − 1).24 The total weight is the product of the weights at each time point, and analyses are conducted in the weighted population, or pseudo‐population, in which individuals who are likely to be exposed are downweighted, while those who are unlikely to be exposed are upweighted, producing balance of measured confounders within strata of exposure.
Use of MSMs for pregnancy medication safety studies remains rare,25, 26 despite examples where timing of exposure is of great importance, and exposure is conditional on time‐varying confounders, such as other medication use, or changes in disease severity.
Assumptions
Under assumptions of positivity, exchangeability, and consistency, the MSM will give an unbiased estimate of the effect of the exposure on the outcome. These assumptions are not formally testable, although assessment of the positivity assumption may include evaluation of the inverse probability of treatment weight for extreme weights and progressive truncation of the weights to determine whether extreme weights are highly influential.27 When important confounders are unmeasured or incompletely measured, MSM methods will not provide unbiased effect estimates.
Strengths and limitations
The key strength of the MSM is that it allows consideration of time‐varying exposure and confounding, which is highly relevant in pregnancy research due to the changes in fetal vulnerability through the course of pregnancy and the tendency of women to change their medication use during pregnancy.28, 29 However, when the treatment‐covariate association is very strong, MSMs can produce very wide confidence intervals, which fail to include the true effect.27
2.2. Methods for incomplete confounder data
Failure to adjust for unmeasured confounders results in biased effect estimates (Figure S1B). In some situations, the confounder of interest was not measured in the original dataset, but was measured in a similar sample. In this scenario, confounder adjustment is possible, even if the outcome has not been measured in this sample, using PS calibration.30, 31, 32 Propensity score calibration is a method based on regression calibration33 that offers an additional advantage over other methods of calibration,34 by allowing for adjustment for multiple confounders. For example, in a study of triptan safety, we used a cross‐sectional study to jointly adjust estimates for migraine severity and type.35
In this method, 2 PSs must be calculated: the error‐prone PS (estimated in both the main and validation studies, including only the confounders available in the main study) and the gold‐standard PS (estimated in the validation study, including all confounders). The outcome model is fitted using the difference between the error‐prone and gold‐standard PSs to calibrate effect estimates.
Assumptions
In addition to the assumptions of PS models, outlined previously, PS calibration also assumes that the validation sample is a reasonable stand‐in for the main sample and that the measurement error model is correctly specified.30, 31 Propensity score calibration also assumes surrogacy, meaning that the error‐prone PS is an adequate surrogate for the gold‐standard PS.36 If the outcome is not measured in the validation study, the surrogacy assumption is not testable. Violations of surrogacy occur when the direction of confounding differs between the main and validation studies,30 and bias arising from violations of surrogacy can be predicted.36
Other methods exist for unmeasured confounding, including weighting by the inverse probability of missingness, as well as standard imputation techniques, and a comparison of these methods with PS calibration showed little material difference in bias reduction.37
Strengths and limitations
The main strength of PS calibration allows for adjustment for multiple unmeasured confounders. However, calibration methods fail when unmeasured confounding is strong, and violations of the surrogacy assumption may result in increased bias.
2.3. Methods for unmeasured confounding
Information on confounders may be too difficult to measure (eg, family environment or parenting style) or too costly (eg, deep sequencing genetic data). The methods discussed below exploit aspects of observational data to control for measured and unmeasured confounders.
2.3.1. Sibling comparison designs
If the unmeasured confounders are shared between siblings (see Figure S1C for illustration), then studies examining with discordant exposure allows researchers to remove bias from shared confounders.38, 39, 40 If, for example, we believe that any differences in autism risk between children with and without prenatal exposure to antidepressants is due to inherited genetic risk, then comparing the autism diagnosis between pairs of siblings with different prenatal exposure should be less biased than comparing autism risk between unrelated exposed and unexposed groups.
There has been substantial uptake of sibling study designs in the pregnancy medication safety literature in recent years, particularly in studies examining the safety of antidepressants, where the main concern is separating the underlying genetic and familial components of depression from exposure to antidepressant medications.41, 42
Assumptions
Use of sibling designs is most appropriate when confounders that are shared between siblings are more important than unshared,39 and there are no carryover effects between siblings.43
Strengths and limitations
Sibling designs control measured and unmeasured confounding that is shared between siblings. However, failing to control for unshared confounders increases bias; sibling studies are also more vulnerable to bias from measurement error than nonsibling studies.39
2.3.2. Instrumental variables
Instrumental variable (IV) methods44, 45 require identifying a variable whose effect on the outcome occurs only through the exposure: An example of a perfect instrument is a coin toss assigning an individual to exposure or nonexposure, while commonly used instruments include provider prescription preference and calendar time. One example is a study of antidepressant efficacy during pregnancy using provider preference, calendar time as a function of Food and Drug Administration recommendations, and geographic differences in antidepressant use as instruments; however, these instruments were only weakly associated with the treatment, which may have contributed to the equivocal findings.46 Instrumental variable studies are often conducted using a 2‐stage least squares methods, where in the first stage, the instruments are used as explanatory variables in a model predicting the exposure, and the predicted values from this first stage are used as predictors in the outcome model. Identifying a strong instrument that meets all assumptions is challenging, which has contributed to the slower adoption of this method. Mendelian randomization, which uses a genetic marker as an instrument, is a subtype of IV analysis47; while Mendelian randomization has not yet been used in pregnancy medication studies, studies estimating the effect of alcohol use during pregnancy on later neurocognitive outcomes have used the genetic variants encoding alcohol dehydrogenase, an enzyme that metabolizes alcohol, with some success.48
Assumptions
Instrumental variable analyses allow for unbiased effect estimation under strict assumptions: (1) The instrument has a causal effect on the exposure of interest, (2) the instrument effects the outcome only through the exposure, not through any other pathways, and (3) there are no common causes or confounders of the instrument‐outcome pathway (Figure S1D).
Strengths and limitations
Instrumental variable analyses control measured and unmeasured confounding, and so instruments that meet all the assumptions will mimic the results from a randomized trial. However, estimates are highly sensitive to violations of untestable assumptions, and violations may produce bias amplification.44
Figure 2 guides readers through selecting a method or methods, based on characteristics of confounder data. The most important first step is to draw a DAG or DAGs that represent the proposed causal mechanism, without regard to availability of data on confounders: If a confounder is important, it should be included in the DAG, even if the study did not collect data on it. Next, determine which confounders are available in your study and whether the data support the analytic method. For example, if your DAG shows that medication use and confounders vary over time, but your data shows no such variation, an MSM approach should not be used; if the data cannot identify siblings, this method cannot be used. Most importantly, we urge researchers to consider potential sources of confounding regardless of whether they were measured in the data and to choose the methods most suited to the data they have available: Figure 2 suggests a systematic way of approaching this process.
Figure 2.
Choosing methods for confounding control
A reference to selected software for the methods discussed in this paper is included as part of the Supporting Information.
3. DISCUSSION
Studies of medication use during pregnancy use observational data to answer critical questions of safety and efficacy. More traditional methods for confounding control, such as stratification, restriction, matching, and adjustment have been described in great detail elsewhere, and because of this, we have not discussed them here. These older methods have their place in observational research, but as our understanding of the complexities of bias has progressed, so has our understanding of the limitations of these methods. The methods described in this paper were developed to address specific confounding problems and are necessary to reduce bias and, ultimately, to produce the best information possible to health care providers and pregnant women. Using these methods can produce substantially different results from traditional methods, such as when we compare the cohort and sibling studies of antidepressant safety,41, 42, 49 the regression‐adjusted estimates to the MSM estimates. Using multiple methods can also help to researchers triangulate, and it is reassuring when multiple methods, e.g. standard regression, PS methods, sibling controls, and negative paternal controls all produce similar estimates.50
With few exceptions, these methods have seen slow uptake in the pregnancy medication literature. This may be due to a sense of caution about methods that can seem opaque upon first encounter with the methods paper describing the technique. Caution is necessary when applying novel methods. However, it is also true that the standard regression methods require similar assumptions to the methods discussed in this paper. If readers find that their research question fits well with one of the scenarios described in this paper, we suggest approaching the problem by tackling the citations given for the technique. The techniques we describe in this paper have their roots in standard regression techniques and can be implemented with standard software.
While this paper focuses on bias due to confounding, other sources of bias such as exposure and/or outcome misclassification51 and selection bias,52 as well as seasonal effects,53 can also distort associations. This paper is not intended to be an exhaustive discussion of all possible methods for confounding control. New techniques are being developed all the time, and many of these, such as g‐estimation54, 55 and targeted maximum likelihood estimation,56 have not yet been implemented in the pregnancy medication literature. Quantitative bias analysis can help researchers account for bias from systematic errors in their data.57Further, the methods discussed herein are not mutually exclusive and can be used in combination with each other: Combining PSs with IVs46 or MSMs with quantitative bias analysis25 gives more information about the probable range of effect estimates than any single method.
Observational studies are vital to our understanding of medication safety in pregnancy, but great care must be taken in the analysis and interpretation of data to minimize confounding and bias. In all pharmacoepidemiological studies, sources of bias should be acknowledged and discussed and preferably quantified by performing sensitivity analysis of estimates under an array of assumptions about possible bias directions and magnitudes.
ETHICS STATEMENT
The authors state that no ethical approval was needed.
ACKNOWLEDGEMENT
The authors would like to thank Professor Sonia Hernandez‐Diaz, who contributed to early discussions about this article.
CONFLICT OF INTEREST
The authors declare no conflict of interest.
Supporting information
Directed acyclic graphs (DAGs) for (a)Time varying confounding: time‐varying exposure A, outcome Y, baseline confounders C and time‐varying confounders TVC at times 0, 1, and 2; (b)Unmeasured confounding: exposure A, outcome Y, and measured C and unmeasured U confounders; (c)Sibling study design, for siblings (1 and 2), with exposure A, outcome Y, and confounders C of AY, and shared unmeasured factors which cause C, A, and Y; (d) Instrumental variable (IV) which affects the outcome Y only through the exposure A and therefor controls both measured confounders C and unmeasured confounders U.
Wood ME, Lapane KL, van Gelder MMHJ, Rai D, Nordeng HME. Making fair comparisons in pregnancy medication safety studies: An overview of advanced methods for confounding control. Pharmacoepidemiol Drug Saf. 2018;27:140–147. 10.1002/pds.4336
The copyright line for this article was changed on 11 July 2019 after original online publication.
REFERENCES
- 1. Mitchell AA, Gilboa SM, Werler MM, Kelley KE, Louik C, Hernández‐Díaz S. Medication use during pregnancy, with particular focus on prescription drugs: 1976‐2008. Am J Obstet Gynecol. 2011;205(1): 51.e1‐8. doi: 10.1016/j.ajog.2011.02.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Headley J, Northstone K, Simmons H, Golding J. Medication use during pregnancy: Data from the Avon Longitudinal Study of Parents and Children. Eur J Clin Pharmacol. 2004;60(5):355‐361. 10.1007/s00228-004-0775-7 [DOI] [PubMed] [Google Scholar]
- 3. Lupattelli A, Spigset O, Twigg MJ, et al. Medication use in pregnancy: A cross‐sectional, multinational web‐based study. BMJ Open. 2014;4(2): e004365. doi: 10.1136/bmjopen-2013-004365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Savitz DA, Hertz‐Picciotto I, Poole C, Olshan AF. Epidemiologic measures of the course and outcome of pregnancy. Epidemiol Rev. 2002;24(2):91‐101. 10.1093/epirev/mxf006 [DOI] [PubMed] [Google Scholar]
- 5. Andrade SE, Bérard A, Nordeng HME, Wood ME, van Gelder MMHJ, Toh S. Administrative claims data versus augmented pregnancy data for the study of pharmaceutical treatments in pregnancy. Curr Epidemiol Reports. 2017;1‐11. 10.1007/s40471-017-0104-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Reviewer Guidance Evaluating the Risks of Drug Exposure in Human Pregnancies .; 2005. http://www.fda.gov/cber/guidelines.htm.
- 7. Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Epidemiology. 1999;10(1):37‐48. 10.1097/00001648-199901000-00008 [DOI] [PubMed] [Google Scholar]
- 8. Bandoli G, Palmsten K, Flores KF, Chambers CD. Constructing causal diagrams for common perinatal outcomes: Benefits, limitations and motivating examples with maternal antidepressant use in pregnancy. Paediatr Perinat Epidemiol. 2016;30(5):521‐528. 10.1111/ppe.12302 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41‐55. [Google Scholar]
- 10. Arbogast PG, Ray WA. Use of disease risk scores in pharmacoepidemiologic studies. Stat Methods Med Res. 2009;18(1):67‐80. 10.1177/0962280208092347 [DOI] [PubMed] [Google Scholar]
- 11. Chatterjee N, Shi J, García‐Closas M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nat Rev Genet. 2016;17(7):392‐406. 10.1038/nrg.2016.27 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Brookhart MA, Schneeweiss S, Rothman KJ, Glynn RJ, Avorn J, Stürmer T. Variable selection for propensity score models. Am J Epidemiol. 2006;163(12):1149‐1156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Schneeweiss S, Rassen J, Glynn RJ, Avorn J, Mogun H, Brookhart MA. High‐dimensional propensity score adjustment in studies of treatment effects using health care claims data. Epidemiology. 2009;20(4):512‐522. 10.1097/EDE.0b013e3181a663cc.High-dimensional [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Pasternak B, Svanström H, Hviid A. Ondansetron in pregnancy and risk of adverse fetal outcomes. N Engl J Med. 2013;368(9):814‐823. 10.1056/NEJMoa1211035 [DOI] [PubMed] [Google Scholar]
- 15. Patorno E, Huybrechts KF, Bateman BT, et al. Lithium use in pregnancy and the risk of cardiac malformations. N Engl J Med. 2017;376(23):2245‐2254. 10.1056/NEJMoa1612222 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Huybrechts KF, Palmsten K, Avorn J, et al. Antidepressant use in pregnancy and the risk of cardiac defects. N Engl J Med. 2014;370(25):2397‐2407. 10.1056/NEJMoa1312828 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Bateman BT, Hernandez‐diaz S, Fischer MA, et al. Statins and congenital malformations: Cohort study. Br Med J. 2015;350 10.1136/bmj.h1035 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Austin PC. An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivar Behav Res. 2011;46(3):399‐424. 10.1080/00273171.2011.568786 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Glynn RJ, Schneeweiss S, Stürmer T. Indications for propensity scores and review of their use in pharmacoepidemiology. Basic Clin Pharmacol Toxicol. 2006;98(3):253‐259. 10.1111/j.1742-7843.2006.pto_293.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Maansson R, Joffe MM, Sun W, Hennessy S. On the estimation and use of propensity scores in case‐control and case‐ cohort studies. Am J Epidemiol. 2007;166(3):332‐339. 10.1093/aje/kwm069 [DOI] [PubMed] [Google Scholar]
- 21. Wood ME, Chrysanthopoulou SA, Nordeng HME, Lapane KL. The impact of nondifferential exposure misclassification on the performance of propensity scores for continuous and binary outcomes: A simulation study [ACCEPTED, IN PRESS]. Med Care. 2017; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Robins JM, Hernan M, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11(5):550‐560. [DOI] [PubMed] [Google Scholar]
- 23. Robins JM. Marginal structural models versus structural nested models as tools for causal inference. 1999;(1986):1–30.
- 24. Hernán MA, Brumback BA, Robins JM. Estimating the causal effect of zidovudine on CD4 count with a marginal structural model for repeated measures. Stat Med. 2002;21(12):1689‐1709. 10.1002/sim.1144 [DOI] [PubMed] [Google Scholar]
- 25. Wood ME, Lapane K, Frazier JA, Ystrom E, Mick EO, Nordeng H. Prenatal triptan exposure and internalising and externalising behaviour problems in 3‐year‐old children: Results from the Norwegian mother and child cohort study. Paediatr Perinat Epidemiol. November 2015. 10.1111/ppe.12253 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Bodnar L, Davidian M, Siega‐Riz AM, Tsiatis A. Marginal structural models for analyzing causal effects of time‐dependent treatments: An application in perinatal epidemiology. Am J Epidemiol. 2004;159(10):926‐934. 10.1093/aje/kwh131 [DOI] [PubMed] [Google Scholar]
- 27. Cole SR, Hernán MA. Constructing inverse probability weights for marginal structural models. Am J Epidemiol. 2008;168(6):656‐664. 10.1093/aje/kwn164 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Lupattelli A, Spigset O, Björnsdóttir I, et al. Patterns and factors associated with low adherence to psychotropic medications during pregnancy—a cross‐sectional, multinational web‐based study. Depress Anxiety. 2015; 11:n/a‐n/a. doi: 10.1002/da.22352 [DOI] [PubMed] [Google Scholar]
- 29. Harris G‐ME, Wood M, Eberhard‐Gran M, Lundqvist C, Nordeng H. Patterns and predictors of analgesic use in pregnancy: A longitudinal drug utilization study with special focus on women with migraine. BMC Pregnancy Childbirth. 2017;17(1). 10.1186/s12884-017-1399-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Stürmer T, Schneeweiss S, Rothman KJ, Avorn J, Glynn RJ. Performance of propensity score calibration—a simulation study. Am J Epidemiol. 2007;165(10):1110‐1118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Stürmer T, Schneeweiss S, Avorn J, Glynn RJ. Adjusting effect estimates for unmeasured confounding with validation data using propensity score calibration. Am J Epidemiol. 2005;162(3):279‐289. 10.1093/aje/kwi192 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Sturmer T, Glynn RJ, Rothman KJ, Avorn J, Schneeweiss S. Adjustments for unmeasured confounders in pharmacoepidemiologic database studies using external information. Med Care. 2007;45(10(S)):1‐15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Spiegelman D, Mcdermott A, Rosner B. Regression calibration methods for correcting measurement error bias in nutritional epidemiology. Am J Clin Nutr. 1997;65:1179S‐1186S. [DOI] [PubMed] [Google Scholar]
- 34. Schneeweiss S. Sensitivity analysis and external adjustment for unmeasured confounders in epidemiologic database studies of therapeutics. Pharmacoepidemiol Drug Saf. 2006;15(5):291‐303. 10.1002/pds.1200 [DOI] [PubMed] [Google Scholar]
- 35. Wood ME, Frazier JA, Nordeng HME, Lapane KL. Prenatal triptan exposure and parent‐reported early childhood neurodevelopmental outcomes: An application of propensity score calibration to adjust for unmeasured confounding by migraine severity. Pharmacoepidemiol Drug Saf. 25(5):493–502. 10.1002/pds.3902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Lunt M, Glynn RJ, Rothman KJ, Avorn J, Stürmer T. Propensity score calibration in the absence of surrogacy. Am J Epidemiol. 2012;175(12):1294‐1302. 10.1093/aje/kwr463 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Toh S, Garcia Rodriguez LA, Hernan M. Analyzing partially missing confounding information in comparative effectiveness and safety research of therapeutics. Pharmacoepidemiol Drug Saf. 2012;21(0 2):13‐20. 10.1002/pds.3248.Analyzing [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Carlin JB, Gurrin LC, Sterne JAC, Morley R, Dwyer T. Regression models for twin studies: A critical review. Int J Epidemiol. 2005;34(5):1089‐1099. 10.1093/ije/dyi153 [DOI] [PubMed] [Google Scholar]
- 39. Frisell T, Öberg S, Kuja‐Halkola R, Sjölander A. Sibling comparison designs: Bias from non‐shared confounders and measurement error. Epidemiology. 2012;23(5):713‐720. 10.1097/EDE.0b013e31825fa230 [DOI] [PubMed] [Google Scholar]
- 40. Keyes KM, Smith GD, Susser E. On sibling designs. Epidemiology. 2013;24:473‐474. 10.1097/EDE.0b013e31828c7381 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Nezvalová‐Henriksen K, Spigset O, Brandlistuen RE, Ystrom E, Koren G, Nordeng H. Effect of prenatal selective serotonin reuptake inhibitor (SSRI) exposure on birthweight and gestational age: A sibling‐controlled cohort study. Int J Epidemiol. 2016;(Ci): dyw049. doi: 10.1093/ije/dyw049 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Viktorin A, Lichtenstein P, Lundholm C, et al. Selective serotonin re‐uptake inhibitor use during pregnancy: Association with offspring birth size and gestational age. Int J Epidemiol. 2017;45(1):170‐177. 10.1093/ije/dyv351 [DOI] [PubMed] [Google Scholar]
- 43. Sjölander A, Frisell T, Kuja‐Halkola R, Öberg S, Zetterqvist J. Carry‐over effects in sibling comparison designs. Epidemiology. 2016;27(6). 10.1097/EDE.0000000000000541 [DOI] [PubMed] [Google Scholar]
- 44. Hernan MA, Robins JM. Instruments for causal inference: An epidemiologist's dream? Epidemiology. 2006;17(4):360‐372. 10.1097/01.ede.0000222409.00878.37 [DOI] [PubMed] [Google Scholar]
- 45. Angrist JD, Imbens GW, Rubin DB. Identification of causal effects using instrumental variables. J Am Stat Assoc. 1996;91(434):444‐455. [Google Scholar]
- 46. Swanson SA, Hernandez‐Diaz S, Palmsten K, Mogun H, Olfson M, Huybrechts KF. Methodological considerations in assessing the effectiveness of antidepressant medication continuation during pregnancy using administrative data. Pharmacoepidemiol Drug Saf. 2015;24(9):934‐942. 10.1002/pds [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Smith GD, Ebrahim S. “Mendelian randomization”: Can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol. 2003;32(1):1‐22. 10.1093/ije/dyg070 [DOI] [PubMed] [Google Scholar]
- 48. Zuccolo L, Lewis SJ, Smith GD, et al. Prenatal alcohol exposure and offspring cognition and school performance. A Mendelian randomization natural experiment. Int J Epidemiol. 2013;42(5):1358‐1370. 10.1093/ije/dyt172 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Brandlistuen RE, Ystrom E, Eberhard‐Gran M, Nulman I, Koren G, Nordeng H. Behavioural effects of fetal antidepressant exposure in a Norwegian cohort of discordant siblings. Int J Epidemiol. 2015;44(4):1397‐1407. 10.1093/ije/dyv030 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Rai D, Lee BK, Dalman C, Newschaffer C, Lewis G, Magnusson C. Antidepressants during pregnancy and autism in offspring: Population based cohort study. BMJ. 2017; 358:j2811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Grzeskowiak LE, Gilbert AL, Morrison JL. Exposed or not exposed? Exploring exposure classification in studies using administrative data to investigate outcomes following medication use during pregnancy. Eur J Clin Pharmacol. 2012;68(5):459‐467. 10.1007/s00228-011-1154-9 [DOI] [PubMed] [Google Scholar]
- 52. Liew Z, Olsen J, Cui X, Ritz B, Arah OA. Bias from conditioning on live birth in pregnancy cohorts: An illustration based on neurodevelopment in children after prenatal exposure to organic pollutants. Int J Epidemiol. 2015;44(1):345‐354. 10.1093/ije/dyu249 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Hutcheon JA, Fell DB, Jackson ML, et al. Detectable risks in studies of the fetal benefits of maternal influenza vaccination. Am J Epidemiol. 2016;184(3):227‐232. 10.1093/aje/kww048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Hernán MA, Lanoy E, Costagliola D, Robins JM. Comparison of dynamic treatment regimes via inverse probability weighting. Basic Clin Pharmacol Toxicol. 2006;98(3):237‐242. 10.1111/j.1742-7843.2006.pto_329.x [DOI] [PubMed] [Google Scholar]
- 55. Robins J. A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect. Math Model. 1986;7(9–12):1393‐1512. 10.1016/0270-0255(86)90088-6 [DOI] [Google Scholar]
- 56. van der Laan MJ. Targeted maximum likelihood based causal inference: Part I. Int J Biostat. 2010;6(2): Article 3. doi: 10.2202/1557-4679.1241 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Lash TL, Fox MP, MacLehose RF, Maldonado G, McCandless LC, Greenland S. Good practices for quantitative bias analysis. Int J Epidemiol. 2014;(July):1‐17. 10.1093/ije/dyu149 [DOI] [PubMed] [Google Scholar]
- 58. Bech BH, Kjaersgaard MIS, Pedersen HS, et al. Use of antiepileptic drugs during pregnancy and risk of spontaneous abortion and stillbirth: Population based cohort study. Br Med J. 2014;349 10.1136/bmj.g5159 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Directed acyclic graphs (DAGs) for (a)Time varying confounding: time‐varying exposure A, outcome Y, baseline confounders C and time‐varying confounders TVC at times 0, 1, and 2; (b)Unmeasured confounding: exposure A, outcome Y, and measured C and unmeasured U confounders; (c)Sibling study design, for siblings (1 and 2), with exposure A, outcome Y, and confounders C of AY, and shared unmeasured factors which cause C, A, and Y; (d) Instrumental variable (IV) which affects the outcome Y only through the exposure A and therefor controls both measured confounders C and unmeasured confounders U.