Skip to main content
PLOS Medicine logoLink to PLOS Medicine
. 2023 Jan 24;20(1):e1004148. doi: 10.1371/journal.pmed.1004148

School-age outcomes among IVF-conceived children: A population-wide cohort study

Amber L Kennedy 1,2, Beverley J Vollenhoven 3,4,5, Richard J Hiscock 1,2, Catharyn J Stern 1,6,7, Susan P Walker 1,2, Jeanie L Y Cheong 1,8,9, Jon L Quach 1,8, Roxanne Hastie 1,2, David Wilkinson 2,10, John McBain 1,6,7, Lyle C Gurrin 11, Vivien MacLachlan 5, Franca Agresta 7, Susan P Baohm 5,10, Stephen Tong 1,2,, Anthea C Lindquist 1,‡,*
PMCID: PMC9873192  PMID: 36693021

Abstract

Background

In vitro fertilisation (IVF) is a common mode of conception. Understanding the long-term implications for these children is important. The aim of this study was to determine the causal effect of IVF conception on primary school-age childhood developmental and educational outcomes, compared with outcomes following spontaneous conception.

Methods and findings

Causal inference methods were used to analyse observational data in a way that emulates a target randomised clinical trial. The study cohort comprised statewide linked maternal and childhood administrative data. Participants included singleton infants conceived spontaneously or via IVF, born in Victoria, Australia between 2005 and 2014 and who had school-age developmental and educational outcomes assessed. The exposure examined was conception via IVF, with spontaneous conception the control condition. Two outcome measures were assessed. The first, childhood developmental vulnerability at school entry (age 4 to 6), was assessed using the Australian Early Developmental Census (AEDC) (n = 173,200) and defined as scoring <10th percentile in ≥2/5 developmental domains (physical health and wellbeing, social competence, emotional maturity, language and cognitive skills, communication skills, and general knowledge). The second, educational outcome at age 7 to 9, was assessed using National Assessment Program–Literacy and Numeracy (NAPLAN) data (n = 342,311) and defined by overall z-score across 5 domains (grammar and punctuation, reading, writing, spelling, and numeracy). Inverse probability weighting with regression adjustment was used to estimate population average causal effects.

The study included 412,713 children across the 2 outcome cohorts. Linked records were available for 4,697 IVF-conceived cases and 168,503 controls for AEDC, and 8,976 cases and 333,335 controls for NAPLAN. There was no causal effect of IVF-conception on the risk of developmental vulnerability at school-entry compared with spontaneously conceived children (AEDC metrics), with an adjusted risk difference of −0.3% (95% CI −3.7% to 3.1%) and an adjusted risk ratio of 0.97 (95% CI 0.77 to 1.25). At age 7 to 9 years, there was no causal effect of IVF-conception on the NAPLAN overall z-score, with an adjusted mean difference of 0.030 (95% CI −0.018 to 0.077) between IVF- and spontaneously conceived children. The models were adjusted for sex at birth, age at assessment, language background other than English, socioeconomic status, maternal age, parity, and education. Study limitations included the use of observational data, the potential for unmeasured confounding, the presence of missing data, and the necessary restriction of the cohort to children attending school.

Conclusions

In this analysis, under the given causal assumptions, the school-age developmental and educational outcomes for children conceived by IVF are equivalent to those of spontaneously conceived children. These findings provide important reassurance for current and prospective parents and for clinicians.


In a population-wide cohort study, Dr Anthea C Lindquist and colleagues, examine school-age outcomes among IVF-conceived children in Australia.

Author summary

Why was this study done?

  • More than 8 million children have been conceived globally with the assistance of in vitro fertilisation (IVF).

  • Some studies suggest these children have an increased risk of congenital abnormalities, autism spectrum disorder, developmental delay, and intellectual disability.

  • Educational and school-age developmental outcomes following IVF conception have not yet been adequately characterised.

What did the researchers do and find?

  • Using statewide, linked population data from Victoria, Australia, we investigated the school-age developmental and educational outcomes for children born following IVF-assisted conception.

  • The study examined 2 separate assessments of school-age development and educational outcomes among 585,659 children, including 11,059 children who were conceived via IVF.

  • This study was designed and performed within a causal framework, in order to produce the best possible estimate of exposure effect using observational data.

  • We found no difference in school-age childhood developmental and educational outcomes between IVF- and spontaneously conceived children.

What do these findings mean?

  • These findings provide reassurance for current and prospective parents, as well as clinicians who are involved in IVF.

  • This information may be useful in providing informed consent and education to those considering IVF and those with children conceived via IVF.

Introduction

In vitro fertilisation (IVF) is a common mode of conception worldwide [1]. Since the first successful IVF birth in 1978, more than 8 million babies have been born globally following IVF conception [2,3]. In Australia, it is now estimated that 1 in 20 babies are born following IVF conception [4,5].

As the number of children born following IVF conception continues to rise, a deeper understanding of the long-term implications for these children is important. It is well established that there are increased risks of maternal and perinatal complications following IVF conception [68]. Large cohort studies have suggested an increase in the frequency of congenital abnormalities, autism spectrum disorder, developmental delay, and intellectual disability in children conceived via IVF or intracytoplasmic sperm injection (ICSI) techniques [913]. However, reports detailing longer term outcomes after IVF beyond the neonatal period remain sparse.

Educational and cognitive outcomes following IVF conception have not yet been thoroughly investigated. Several small cohort studies [1417] have reported conflicting results. One large population-based study suggested a small difference in school performance in favour of spontaneous conception [18]. Another population study recently concluded that school performance was not adversely affected by the process of IVF but, rather, the condition of subfertility [19].

Parents of IVF- and spontaneously conceived children possess inherently different health and sociodemographic characteristics [20,21]. Factors such as increased maternal age and higher education are known to be associated with both the use of fertility treatment and better early childhood outcomes [2224]. It is thus critical that such factors are appropriately acknowledged when examining the association between mode of conception and childhood outcomes. Proper adjustment in any statistical analysis is required before any association can be given a causal interpretation.

Our study aimed to overcome some of the limitations of the analysis of observational (non-randomised) data by using a causal inference approach that seeks to emulate the results of a randomised comparison in a clinical trial (Table 1) [25,26]. This analytical approach attempts to simulate a randomised trial by (1) requiring an a priori statistical analysis protocol; (2) addressing a causal question reflecting the effect of an intervention at a specific clinical decision point on a prespecified outcome; and (3) using inverse probability weighting via propensity score (PS) models to balance the outcome propensity differences between exposed and control populations, with the aim of producing exchangeable comparison groups [27]. This allowed us to estimate the population-average effect of mode of conception (IVF versus spontaneous conception) on childhood developmental and educational outcomes with a causal interpretation. Our study aims to estimate the total causal effect of IVF conception on school-age childhood developmental and educational outcomes using a causal inference approach and employing the necessary assumptions.

Table 1. Target trial emulation.

Protocol component Research question: What is the effect of mode of conception (IVF/non-IVF) on childhood development?
Target trial Emulation
(i) Eligibility criteria Inclusion criteria:
All couples wanting to conceive and with the ability to conceive by either method
Exclusion criteria:
Couples either without the ability to conceive or can only conceive using IVF.
E.g., maternal age >45 not compatible with spontaneous conception.
Inclusion criteria: Same as target trial.
Limitation: It is not possible to collect data on failed attempts, miscarriages, or stillbirths = Livebirth bias. Livebirth bias introduces potential selection bias, collider bias, and “depletion of susceptible cases” (see below).
Exclusion criteria: Same as target trial, same exclusion criteria can be applied—important to ensure positivity is maintained between exposure groups.
(ii) Treatment strategies (A) Spontaneous (“in vivo”) conception leading to a live birth
(B) Conception aided by IVF leading to a live birth
(A) Same as target trial
(B) Same as target trial
Positivity assumption: Every subject could potentially be included in either exposure group. This is both a design feature as well as one of statistical adjustment where the positivity assumption is addressed by dataset trimming to ensure overlapping of inverse probability weights.
(iii) Assignment procedures Randomised at entry—decision to conceive
vs.
Modified intention to treat (ITT)—randomised at decision to conceive and included in trial after subsequent live birth.
Modified ITT—time commences from livebirth.
DAG used to identify prespecified covariates to be included in estimator models for both confounding control and outcome adjustment. Estimator used a doubly robust inverse-probability-weighted regression adjustment model to achieve adherence to ignorability and positivity assumptions, such that observed groups can be considered balanced or exchangeable.
(iv) Follow-up period AEDC–school entry, 4–6 years of age
NAPLAN–Grade 3, 7–9 years of age
Some trial participants would be lost to follow-up.
Same as target trial.
Considerations and limitations:
Post-exposure and pre-selection: live birth bias
Post-exposure and post-selection: follow-up divided into (1) postnatal loss (akin to loss to follow-up), and (2) unobserved confounding.
(1) Postnatal: This is distinct from livebirth bias inclusion loss to follow-up post “live birth” due to neonatal death, infant death, childhood death, non-school attendance, missing outcome data.
(i) Dataset does not contain child death data
(iia) Missing data due to disability to be analysed as sensitivity analyses (AEDC: special needs, NAPLAN: exempt) (see below under outcome)
(iib) Dataset does not contain data on children who do not attend school, i.e., severe disability—limitation for discussion.
(2) Unobserved confounding: mediators (post-exposure) vs. unmeasured (pre-exposure) confounders. Sensitivity analysis for unmeasured confounders: E values were calculated.
(v) Outcome: AEDC–Binary
• Primary outcome: DV2 –developmental vulnerability in ≥2 domains
• Secondary outcome: individual domains
NAPLAN—Continuous
• Primary outcome: overall score (z-score)
• Secondary outcome: individual domains
NAPLAN—Binary
• Below national minimum standard in individual domains
Same outcome measures as target trial.
Considerations and sources of bias:
AEDC
• Children commence school at different ages at the start of calendar year and the assessment is not standardised for age range (4–6); age of assessment will be included in model for standardisation.
• Year of assessment is not included in model (i.e., 2012, 2015, 2018), as the metric is the same every year.
Children defined as having special needs are not awarded a valid result for the AEDC domains, however, by definition, are developmentally vulnerable, thus “missing not at random.” Methods for addressing “special needs status”:
(1) Primary analysis—“special needs missing” coded as “vulnerable.” Therefore, no missing outcome data or exposure data and imputation of missing covariate data only performed.
(2) Sensitivity analyses: (a) excluding special needs cases completely and (b) imputing their outcomes (biased); this data is MNAR and thus this is performed for illustration.
NAPLAN
Same NAPLAN outcome measure as target trial.
• Children commence school at difference ages at the start of calendar year and the assessment is not adjusted for age range (7–9), thus age of assessment will be included in model for standardisation.
• The NAPLAN paper is different each year, thus year of assessment is included in model.
Children exempt from sitting NAPLAN are, by definition, below the NAPLAN national minimum standard for each domain from which they have been excluded, thus “missing not at random.” Methods for addressing NAPLAN “exempt” status:
(1) Primary analysis—exempt coded as either the lowest possible test z-score (continuous) or below the national minimum standard (binary) NAPLAN domain outcomes. Therefore, no missing outcome data or exposure data and imputation of missing covariate data performed.
(2) Sensitivity analyses: (a) excluding exempt cases completely and (b) imputing their outcomes (biased); these data are MNAR and thus this was performed for illustration only.
(vi) Causal contrasts of interest and analysis plan ITT or modified ITT:
AEDC:
RD of being developmentally vulnerable (point estimate RD 95% CI)
RR of being developmentally vulnerable (point estimate RR 95% CI)
NAPLAN (continuous):
Difference in mean (MD) standardised NAPLAN total score (point estimate MD and 95% CI)
NAPLAN (binary)
RR of being below national minimum standard (point estimate RR 95% CI)
Causal comparisons:
(1) Is the risk of being developmentally vulnerable different in children born to mothers who conceived via IVF compared with mothers who conceived spontaneously?
(2) Is the mean overall NAPLAN score different for children born to mothers who conceived via IVF compared with mothers who conceived spontaneously?
Estimand = ATE
AEDC:
RD of being developmentally vulnerability (ATE RD 95% CI)
RR of being developmentally vulnerability (ATE RR 95% CI)
NAPLAN (continuous):
Difference in mean standardised NAPLAN total score; (ATE MD and 95% CI)
NAPLAN (binary):
RR of being below national minimum standard (ATE RR 95% CI)
Estimator model—Doubly robust IPW (with regression adjustment) with robust SE for maternal clustering—some mothers represented by several children (as detailed in Methods).

AEDC, Australian Early Development Census; ATE, average treatment effect; DAG, directed acyclic graph; IPW, inverse probability weight; IVF, in vitro fertilisation; NAPLAN, National Assessment Program for Literacy and Numeracy; RD, risk difference; RR, relative risk; SE, standard error.

Methods

Study design

This study is reported as per the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guideline (Checklist in S13 File).

Population

The study population included all singleton livebirths in Victoria between 2005 and 2014. Twins and higher order multiple births were excluded. Perinatal information was collected from audited birth outcome data through the Victorian Perinatal Data Collection (VPDC) [28,29]. The 3 largest IVF units in Victoria provided maternal records from all cycles that resulted in a birth during the study period. Creation of linked maternal/child data pairs required matching of the VPDC data with birth records, which were obtained from the Victorian Births, Deaths and Marriage registry.

Exposure

The exposure was conception via IVF compared with spontaneous conception. The term “IVF” is used collectively to include both conventional IVF, IVF with ICSI, and associated laboratory techniques. IVF cases were identified through the IVF database. Victorian births not identified in the IVF database were allocated to the control group. Pregnancies recorded as “IVF conception” in the VPDC but not identified within the IVF database were excluded, ensuring the control group did not contain any IVF conceptions. These cases likely represent overseas or interstate IVF conceptions, Victorian IVF conceptions not captured by our database, or failed linkages between the IVF database and state birth records.

Main outcome measures

Childhood educational and developmental outcomes were assessed using 2 standardised, national assessments. The Australian Early Development Census (AEDC) [30] and The National Assessment Program–Literacy and Numeracy (NAPLAN) [31]. See Supporting information file (Methods in S1 File), for a detailed description of each measure.

Australian Early Developmental Census (AEDC)

The AEDC assesses broad childhood functional development at school entry (age 4 to 6) across 5 domains: physical health and wellbeing, social competence, emotional maturity, language and cognitive skills (school-based) and communication skills, and general knowledge. The primary AEDC outcome for this study was a global measure, developmental vulnerability, defined as scoring <10th percentile in ≥ 2 of the 5 developmental domains. The secondary outcomes included developmental vulnerability in each of the 5 domains.

The National Assessment Program–Literacy and Numeracy (NAPLAN)

NAPLAN is a school-based psychometric assessment, assessing 5 educational domains: grammar and punctuation, reading, writing, spelling, and numeracy [32]. The study cohort’s grade 3 NAPLAN (fourth year of primary school) results were investigated. For this study, an overall z-score was calculated and used as the primary outcome, with the individual domain z-scores examined as secondary outcomes. By a priori consensus, a mean z-score difference of 0.2 standard deviations was considered to be clinically relevant. Individual domain scores below the published national minimum standard (NMS) NAPLAN scores for each year and for each domain were analysed as secondary (binary) outcomes.

Covariates

Covariates to be considered for inclusion in the statistical analysis models were decided a priori by the authorship team whose expertise included epidemiology, perinatology, reproductive endocrinology, and education. These covariates included child’s sex (as assigned at birth), child’s age in years at assessment, language background other than English (LBOTE), maternal age (at birth of the child), parity and both maternal and paternal highest obtained level of education, and socioeconomic status [33]. Gestational age at birth, mode of delivery, and birthweight were considered to be mediators on the causal pathways of interest and therefore not adjusted for in this analysis. A directed acyclic graph (DAG) was created to describe the structure of the relationships between all variables and identify the adjustment variable set, in line with the methodology recommendations of Tennant and colleagues [34]. Our prespecified statistical analysis plan (SAP) and the DAG were agreed upon and signed off by all authors in May 2020, prior to the commencement of data analysis (Protocol in S2 File).

Linkage

Administrative record linkage techniques were employed to match cases with the exposure (conception via IVF) through to childhood outcome data. Data linkage was performed by the Centre for Victorian Data Linkage (CVDL), a third-party government-funded data linkage unit [35]. Probabilistic linkage was performed between the 5 individual databases—birth records, birthing outcomes, IVF records, AEDC, and NAPLAN. Post-linkage data were manually screened for false matches using secondary variables (e.g., residential postcode). False matches and duplicates were removed (Table A in S3 File outlines number and percentage of successful linkages).

Two separate, linked study populations were identified, children with a linked AEDC record and children with a linked NAPLAN record. These 2 cohorts were analysed and are reported separately. Some children were included in both cohorts.

Causal assumptions

The ATE (average treatment effect) estimand used in this study is based upon the Potential Outcomes Framework. If a set of assumptions is met, then causal interpretation can be made. The causal assumptions are counterfactual consistency, ignorability (conditional exchangeability), and positivity. Counterfactual consistency means that the definition of exposure is consistent for all individuals. Ignorability states that treatment assignment can be considered random after controlling for, conditioning on, a set of covariates [36]. By identifying confounding variables, and importantly, the structure of the relationships between variables, via a DAG (S1 Fig) and by performing appropriate statistical modelling to balance the population (for example, inverse probability weighting), observed populations can be considered exchangeable or “unconfounded.” Exchangeability requires that there are no important unmeasured confounders; this assertion is untestable. The positivity assumption means that for all observations, the conditional probability of being exposed (receiving treatment/no treatment) is greater than zero. This is likely violated if overlap of the control and exposed populations is poor [26].

To best emulate a target trial, it must be possible for all participants to potentially receive both treatments. To ensure the assumptions underlying our causal approach were as robust as possible, we considered our observational data in direct comparison with the conditions of a target trial (Table 1)

Handling of missing data

The proportions of missing data are described in Table 2. Data were missing for outcomes and covariates; there were no missing exposure data. Missing covariates and outcomes that were considered to either be missing completely at random or missing at random were imputed. Children identified as having special needs are not allocated an AEDC domain category and thus their outcome data is missing. Likewise, children who attend school but have a disability that precludes them from being able to appropriately participate in the NAPLAN are exempt from sitting the test; by definition, these children are below the NAPLAN national minimum standard for each domain from which they have been excluded. Outcome data for these children for both AEDC domain categories and NAPLAN z-scores was considered missing not at random. In the analysis of all primary and secondary AEDC outcomes, children with special needs have been included and assumed to be “developmentally vulnerable.” In the analysis of NAPLAN outcomes, “exempt” children have also been included and allocated either the lowest possible test z-score or deemed to be below the national minimum standard for binary NAPLAN domain outcomes.

Table 2. Baseline characteristics of study cohort (including cases with missing outcome and covariate data).

Baseline characteristic AEDC NAPLAN
TOTAL cohort
N = 173,200
Controls
N = 168,503
IVF conceptions
N = 4,697
TOTAL cohort
N = 342,311
Controls
N = 333,335
IVF conceptions
N = 8,976
Child baseline data
Sex (% female)
Missing (%)
48.7
0.0
48.7
0.0
50.8
0.0
49.0
0.0
49.0
0.0
49.9
0.0
Age of assessment in years (decimal) (Median [IQR])
Missing (%)
5.4 [5.2, 5.7]
<0.1
5.4 [5.2, 5.7]
<0.1
5.4 [5.2, 5.7]
<0.1
8.3 [8.1, 8.6]
0.0
8.3 [8.3, 8.6]
0.0
8.4 [8.1, 8.6]
0.0
Language background other than English (%)
Missing (%)
18.6
0.0
18.8
0.0
13.5
0.0
23.5
0.0
23.6
0.0
20.1
0.0
ATSI (%)
Missing (%)
2.3
0.0
2.3
0.0
0.6
0.0
1.0
0.0
1.0
0.0
0.2
0.0
Mother born overseas (%)
Missing (%)
27.3
0.0
27.4
0.0
23.7
0.0
25.1
0.0
25.1
0.0
21.3
0.0
Birthweight (grams) (mean (SD))
Missing (%)
3,416 (544)
<0.1
3,417 (543)
<0.1
3,359 (571)
<0.1
3,422.7 (570)
<0.1
3,424.6 (569)
<0.1
3,352 (618)
<0.1
Small for gestational age (<10th centile)a (%) 9.3 9.3 8.60 9.3 9.3 8.60
Gestational age of delivery in weeks—decimal
(Median and [IQR]) Missing (%)
39.6 [38.6, 40.6]
<0.1
39.7 [38.7, 40.6]
<0.1
39.1 [38.3, 40.3]
0.2
39.5 [38.7, 40.7]
<0.1
39.5 [38.7, 40.7]
<0.1
39.0 [38.3, 40.3]
<0.1
Mode of birth (% of each category)
• Unassisted vaginal birth
• Instrumental vaginal birth
• Cesarean section
Missing (%)

54.9
14.0
31.1
<0.1

55.5
13.8
30.7
<0.1

32.7
19.6
47.7
<0.1

55.7
13.7
30.6
<0.1

56.3
13.6
30.1
<0.1

33.9
19.1
46.9
<0.1
Congenital anomaly (%)
Missing (%)
3.7
0.1
3.7
0.1
3.8
0.1
3.2
<0.1
3.2
<0.1
3.4
<0.1
Special needs status (AEDC)/exempt from NAPLAN (%) 5.2 5.2 5.2 2.1 2.1 1.4
Maternal baseline data
Age (Median and [IQR])
Missing (%)
31.4 [27.6, 35.0]
0.0
31.3 [27.5, 34.8]
0.0
35.7 [32.8, 38.6]
0.0
31.6 [27.7, 35.1]
0.0
31.5 [27.7, 35.1]
0.0
35.8 [32.9, 38.8]
0.0
Mother born overseas (%)
Missing (%)
27.3
0.0
27.4
0.0
23.7
0.0
25.1
0.0
25.1
0.0
21.3
0.0
High school level of education (% for each):
• Year 9 or below
• Year 10Year
• 11Year 12 and above
Missing (%)
2.8
6.4
6.4
53.8
30.5
2.9
6.5
6.5
53.6
30.6
0.7
3.0
4.3
62.6
29.3
4.2
9.3
10.6
74.1
1.9
4.2
9.4
10.7
73.8
1.9
1.6
5.1
8.9
81.7
2.8
Post-school level of education (% for each):
• No post-school education
• Certificate (including trade)
• Advanced diploma
• Bachelor degree or above
Missing (%)
12.9
16.9
10.9
27.7
31.6
13.0
17.1
10.9
27.4
31.6
8.2
11.1
10.9
39.9
30.5
21.4
23.4
15.6
36.5
3.1
21.6
23.6
15.6
36.2
3.1
15.2
15.6
15.9
49.3
4.0
Parity (% for each category):
0
1
2
3
4
5+
Missing (%)
43.2
35.1
14.5
4.6
1.5
1.1
0.02
42.6
35.2
14.8
4.7
1.5
1.2
0.02
64.3
29.6
4.8
0.9
0.2
0.1
0.00
42.8
34.9
14.9
4.7
1.5
1.1
0.06
42.2
35.1
15.2
4.8
1.6
1.1
0.01
63.7
28.0
4.8
1.1
0.2
0.2
2.04
SEIFA quintile:
1 (most disadvantaged)
2
3
4
5 (least disadvantaged)
Missing (%)
16.4
18.0
21.94
23.4
20.2
0.1
16.3
18.2
22.0
23.2
19.9
0.1
6.0
11.2
20.2
28.3
34.2
0.1
17.1
14.2
20.2
23.5
25.0
0.1
17.4
14.3
20.4
23.4
24.5
0.1
7.0
9.8
14.2
26.4
42.6
0.1
Occupation (% for each category):
• Senior management
• Other business manager
• Tradesman/woman, clerks, sales, and service staff
• Machine operators, hospitality staff, assistants, labourers, and related workers
• Not in paid work
Missing (%)
Not in database Not in database Not in database 17.3
17.3
18.0
12.5
32.6
2.2
17.1
17.2
18.1
12.7
32.9
2.2
26.9
22.1
17.7
7.8
22.8
2.7
Second parent baseline data
Level of education (% for each category):
High school: - Year 9 or below
• Year 10
yYear 11
• Year 12 and above
Missing (%)
3.1
7.5
6.3
40.9
42.2
3.2
7.6
6.3
40.6
42.3
1.4
4.9
5.8
51.3
36.7
5.0
11.8
11.6
57.8
13.8
5.1
11.9
11.6
57.6
13.9
2.9
8.4
11.1
67.2
10.4
Level of education (% for each category):
• Post-school: - No post-school education
• Certificate (including trade)
• Advanced diploma
• Bachelor degree or above
Missing (%)
Not in database Not in database Not in database 18.1
27.8
11.1
27.7
15.4
18.2
27.9
11.1
27.4
15.5
14.4
22.2
12.7
38.8
11.9
Occupation (% for each category):
• Senior management
• Other business manager
• Tradesman/woman, clerks, sales, and service staff
• Machine operators, hospitality staff, assistants, labourers, and related workers
• Not in paid work
Missing (%)
Not in database Not in database Not in database 18.5
21.7
22.2
16.6
8.1
12.9
18.2
21.6
22.3
16.7
8.1
13.0
29.7
26.7
19.4
9.9
5.0
9.3

AEDC, Australian Early Development Census; ATSI, Aboriginal and Torres Strait Islander; IVF, in vitro fertilisation (cases); IQR, interquartile range; NAPLAN, National Assessment Program for Literacy and Numeracy; SD, standard deviation; SEIFA, Socio-Economic Indexes for Areas by residential postcode.

a Birth weight centile derived from Dobbin’s growth chart.

All covariates in the analysis model that had missing data were imputed, even if missing was very low. For the AEDC analysis, imputed covariates included parity, age at assessment, maternal education, Socio-Economic Indexes for Areas (SEIFA), and outcome score. For the NAPLAN analysis, imputed covariates included parity, age at assessment, maternal education, paternal education, SEIFA, and outcome score. Maternal body mass index (BMI) was excluded from imputation and analysis because the missingness was too high (>50%). For AEDC imputation models, second parent education level (42% missing) was excluded due to non-convergence when included in the imputation model.

Multiple imputation of missing data was performed under a fully conditional specification using a predictive mean model for continuous and unordered categorical covariates and a logistic model for binary covariates, with standard errors (SEs) accounting for maternal clustering (Methods, Table 2 and Figs A–C in S5 File). The model contained outcome, exposure, model covariates, and auxiliary variables (AEDC: remote locality, Aboriginal and Torres Strait Islander (ATSI) status, and maternal country of origin; NAPLAN: ATSI status and maternal country of origin) along with interaction terms (exposure-parity, exposure-maternal age, exposure-age at assessment, gender-age at testing) and 1 higher order term (test age2). At 20 imputations, the Monte Carlo errors were less than 10% of corresponding SE for all covariates. Each imputation model was subjected to the recommended diagnostic tests [37].

Statistical analysis

Descriptive statistics were calculated and are reported for each cohort by IVF exposure status, according to type and distribution of data.

Treatment effect size modelling

All multivariate models were adjusted for the listed covariates identified in the prespecified SAP, except for (1) maternal BMI; and (2) second parent education level, for AEDC outcome models only.

For each of the imputed datasets, the predicted probability of exposure or PS and associated inverse probability weight (IPW = 1/PS) were estimated using a logistic regression model, conditional on all analysis model covariates [38]. These weights were then stabilised by including as a factor in the numerator the proportion of each treatment group within the population, i.e., the prevalence of IVF and spontaneous conception [39]. Diagnostic tests performed after planned treatment effect modelling (Figs A–D in S6 File) demonstrated poor overlap of exposed and non-exposed cohorts. We therefore restricted our analysis to the IVF population whose weights overlapped with the control group to ensure that the overlap (“positivity”) assumption was not violated. This reduced the IVF cases to 31.6% of the original AEDC cohort and 22.3% of the NAPLAN cohort. For each covariate, the standardised mean difference between the exposure arms was calculated to assess if balance between weighted pseudo-populations was achieved (Figs A and B in S8 File).

For each imputed dataset, a doubly robust inverse-probability-weighted regression adjustment (IPWRA) model [38,39] was then used to estimate the respective potential outcome means (POM) followed by (1) the risk difference (RD) and relative risk (RR) for binary outcomes (AEDC and NAPLAN); and (2) mean differences (MD) for continuous outcomes (NAPLAN z-score).

Finally, estimates for each imputed dataset were pooled to provide overall ATE with associated 95% confidence limits using Rubin’s method.

Provided the assumptions outlined above are satisfied, the estimates generated from these analyses can be interpreted as the population average causal effect, that is, the mean effect on the outcome if the treatment was applied to the entire population and contrasted with the outcome if the entire population received the control condition.

Clustering

Clustering of data within mothers due to more than 1 singleton birth during the study period was accounted for in the imputation models, the calculation of inverse probability weights and estimation of the treatment effect by using robust SEs.

Sensitivity analyses

Sensitivity analyses were also performed to address identified sources of potential bias. For both AEDC (special needs status) and NAPLAN (exempt status) cohorts, sensitivity analyses were performed: (1) by excluding these cases completely; and (2) by imputing their outcomes (Fig A in S4 File). Targeted maximum likelihood estimation (TMLE) modelling (a machine learning ensemble that is less sensitive to violations of positivity and does not require data distribution assumptions) was undertaken for comparison [40]. Additionally, calculation of E-values for our 2 primary outcomes was performed to quantify the magnitude of unobserved bias required to alter our findings.

The analysis for this study was performed using STATA MP Version 17.0 [41], EMTLE package [40], and R [42].

Ethics/Governance

Ethical approval for the project was obtained from Mercy, Monash Health and Melbourne IVF Health Human Research Ethics Committees. Each data custodian provided contractual approval for data access and data linkage. The CVDL approved the project and performed the linkage.

Results

The total cohort included 585,659 singleton births in Victoria between 2005 and 2014. Among this cohort, 173,200 children, including 4,697 IVF births, were linked to AEDC outcome data. Additionally, 342,331 children, including 8,976 IVF births, were linked to NAPLAN data (Fig 1). Overall, a total of 11,059 IVF-conceived children and 401,654 spontaneously conceived children were included in the study (2,614 IVF cases and 100,184 controls were in both study arms). We estimate that our study cohort includes >95% of IVF conceptions during the study timeframe (Tables A and B in S3 File). Analysis of the linked and non-linked cases showed little evidence of association between linkage and exposure status (Chi2 p = 0.80); that is, IVF cases were just as likely to be included in the final linked cohort as controls. There were no births from 2014 that linked to outcome data.

Fig 1. Participant flow chart.

Fig 1

IVF, in vitro fertilisation; IVF cases, pregnancies and children identified with conception assisted by IVF; Controls, pregnancies and children not identified as IVF assisted conception; ART, assisted reproductive technology; non-IVF ART, ovulation induction and intrauterine insemination; VPDC, Victorian Perinatal Data Collection; BDM, Victorian Births Death Marriages Registry; IVF, combined Victorian IVF pregnancy record database.

Baseline population characteristics differed considerably between the 2 exposure groups (Table 2). Compared with spontaneously conceived controls, children conceived via IVF had older, more highly educated parents and mothers with lower parity. IVF-conceived children resided in postal areas with higher socioeconomic ranking and were less likely to be from non-English speaking backgrounds. Age at assessment was similar between the exposure groups.

Global developmental vulnerability at school entry (The Australian Early Development Census, AEDC)

Primary outcome

Our findings suggest no causal effect of IVF conception on developmental vulnerability, with 13.6% of IVF-conceived children predicted to be developmentally vulnerable (<10th percentile in 2 or more domains of the 5 AEDC domains) compared with 13.9% of spontaneously conceived children. The adjusted RD was at −0.3%, indicating that 0.3% fewer children who were conceived by IVF were developmentally vulnerable compared with those conceived spontaneously. However, the 95% CI (−3.7% to 3.1%), indicates this result is indistinguishable from zero. Similarly, the adjusted relative risk showed no detectable difference in risk of developmental vulnerability, where IVF-conceived children were 3.0% less likely to be developmentally vulnerable than spontaneously conceived children (RR 0.97, 95% CI: 0.77 to 1.25) (Table 3).

Table 3. Results of final causal model.

Non-imputed crude data Imputed data–causal modela
Proportions Predicted proportions Regression co-efficient:
Controls (N = 168,503) IVF
(N = 4,697)
Unadjusted risk difference (95% CI)b Controls (N = 168,503) IVF
(N = 1,506–1,560)c
ATE risk difference
(95% CI)
ATE risk ratio
(95% CI)
Primary outcome (developmentally at risk in 2 or more domains)
0.140 0.104 −0.036 (−0.045 to −0.027) 0.139 0.136 −0.003 (−0.037 to 0.031) 0.97 (0.77 to 1.25)
Secondary outcomes (developmentally at risk for individual domains)
Physical health and wellbeing 0.126 0.105 −0.021 (−0.030 to −0.012) 0.125 0.124 −0.0003 (−0.034 to 0.033) 0.996 (0.76 to 1.31)
Social competence 0.130 0.105 −0.026 (−0.035 to −0.017) 0.130 0.136 0.006 (−0.028 to 0.040) 1.05 (0.80 to 1.37)
Emotional maturity 0.125 0.103 −0.022 (−0.031 to −0.013) 0.125 0.114 −0.011 (−0.042 to 0.021) 0.91 (0.70 to 1.20)
Language and cognitive skills (school-based) 0.107 0.076 −0.031 (−0.039 to −0.023) 0.106 0.118 0.012 (−0.020 to 0.045) 1.12 (0.85 to 1.46)
Communication and general knowledge 0.114 0.080 −0.034 (−0.042 to −0.026) 0.113 0.113 0.000 (−0.031 to −0.031) 1.00 (0.76 to 1.31)

Australian Early Development Census (children aged 4–6 years).

Children with missing outcome data identified as having special needs (5.2%) are included—their outcome category assumed to be “developmentally vulnerable.”

a Causal model: multiply imputed data (imputation of covariates and outcomes) pooled estimates—doubly robust method: regression adjustment model with stabilised inverse probability weighting, plus trimmed for complete weight overlap. Variables included in model: sex at birth, age at assessment, language background other than English, socioeconomic status, maternal age, parity, and education.

b Small variation in case number for each of the 20 imputation datasets.

AEDC, Australian Early Development Census; ATE, Average treatment effect; CI, confidence interval; IVF, in vitro fertilisation (cases).

Secondary outcomes

For secondary outcomes, we examined each of the 5 AEDC domains individually. The unadjusted observed results and causal model results for each individual domain are reported in Table 3. There were no differences between IVF- and spontaneously conceived children in adjusted risk difference for any of the individual AEDC domains.

Missing data

Outcome data were missing for 5.6% of the AEDC-linked cohort. The vast majority (92%) of these missing cases were children with special needs (5.2% of overall cohort). There was no evidence of an association between the presence of missing outcome and exposure status (Chi2 p = 0.68). Sensitivity analysis was performed by (1) excluding children with special needs; and (2) including these children, with multiple imputation of their missing outcomes (Tables A and B in S8 File). Most covariates had minimal or no missing data (<1.0%). Maternal education level was missing for 30.5% and maternal post-school education was missing for 31.6%.

Psychometric assessment of 5 educational domains at primary school (The National Assessment Program–Literacy and Numeracy, NAPLAN)

Primary outcome

Our findings indicate the causal effect of IVF conception on overall NAPLAN z-score was indistinguishable from zero. The predicted outcome mean z-score and was 0.013 (SE 0.024) for IVF-conceived children and −0.016 (SE 0.002) for spontaneously conceived controls, with an adjusted mean difference of 0.030 (95% CI −0.018 to 0.077) (Table 4).

Table 4. Results of final causal model.

Non-imputed crude data Imputed data–causal modela
Mean (SE) Potential outcome mean (SE) Regression co-efficient:
Control (N = 333,335) IVF
(N = 8,976)
Unadjusted mean difference
(95% CI)b
Controls
(N = 333,335)
IVF
(N = 1,985–2,010)c
ATE mean difference
(95% CI)
Primary outcome—overall Z-score
−0.006 (0.002) 0.232 (0.011) 0.238 (0.217 to 0.260) −0.016 (0.002) 0.013 (0.024) 0.030 (−0.018 to 0.077)
Secondary outcomes–individual domains Z-score
Grammar and punctuation −0.107 (0.002) 0.209 (0.015) 0.316 (0.286 to 0.345) −0.113 (0.003) −0.085 (0.033) 0.028 (−0.036 to 0.092)
Numeracy −0.113 (0.002) 0.174 (0.015) 0.286 (0.258 to 0.316) −0.118 (0.003) −0.070 (0.033) 0.048 (−0.018 to 0.113)
Reading −0.108 (0.002) 0.226 (0.015) 0.334 (0.305 to 0.263) −0.113 (0.003) −0.092 (0.033) 0.021 (−0.042 to 0.085)
Spelling −0.085 (0.002) 0.123 (0.012) 0.208 (0.184 to 0.231) −0.091 (0.002) −0.083 (0.029) 0.008 (−0.048 to 0.064)
Writing −0.115 (0.002) 0.137 (0.014) 0.252 (0.223 to 0.280) −0.124 (0.002) −0.056 (0.033) 0.068 (0.004 to 0.132)d

Grade 3 (fourth year of schooling) NAPLAN z-scores (children aged 7–9 years).

Children with missing outcome data exempt from NAPLAN (2.1%) are included—their z-score set at lowest possible score.

a Causal model: imputed data pooled estimates—regression adjustment model with stabilised inverse probability weighting, trimmed for complete weight overlap. Variables included in model: sex at birth, age at assessment, language background other than English, socioeconomic status, maternal age, parity, education, and second parent education.

b Small variation in case number for each of the 20 imputation datasets.

c 95% confidence interval does not cross the null.

d Linear regression/unpaired t test.

ATE, average treatment effect; IVF, in vitro fertilisation (cases); NAPLAN, National Assessment Program for Literacy and Numeracy; SE, standard error.

Secondary outcomes

For secondary outcomes, we examined individual NAPLAN domain z-scores (Table 4). IVF-conceived children performed better on average in measures of writing than their spontaneously conceived peers with a z-score mean difference of 0.068 (95% CI 0.004 to 0.132), but this is unlikely to be a clinically important difference. The estimated effect is less than 0.07 of a standard deviation, and a difference of 0.2 standard deviations or greater was determined a priori as representing a finding of importance.

Additionally, for each domain, a binary outcome (domain scores above or below the national minimum standard) was examined. In 4 of 5 domains (numeracy, reading, spelling, and writing), IVF-conceived children were less likely to be below the national minimum standard compared with their spontaneously conceived peers (Table 5). For these 4 domains, the RD was between −0.7% and −1.25%. In absolute terms, this equates to approximately 1 additional IVF-conceived child, for every 100, predicted to score above the national minimum standard compared with their spontaneously conceived peers.

Table 5. Results of final causal model.

Non-imputed crude data Imputed data–causal modela
Proportions Predicted outcome proportions Regression co-efficient:
Control (N = 333,335) IVF
(N = 8,976)
Unadjusted risk difference
(95% CI)b
Control
(N = 333,335)
IVF
(N = 1,985–2,010)c
ATE risk difference
(95% CI)
ATE risk ratio
(95% CI)
Secondary outcomes–individual domains below “National Minimum Standard”
Grammar and punctuation 0.051 0.029 −0.022 (−0.026 to −0.018)
0.052 0.044 −0.008 (−0.020 to 0.003)
0.84 (0.57 to 1.24)
Numeracy 0.038 0.024 -0.015 (-0.018 to -0.011)
0.039 0.028 −0.011 (−0.021 to −0.002)d
0.72 (0.48 to 1.06)
Reading 0.044 0.026 −0.018 (−0.022 to −0.015)
0.046 0.035 −0.011 (−0.022 to −0.001)d
0.75 (0.51 to 1.12)
Spelling 0.049 0.028 −0.022 (−0.025 to −0.018)
0.051 0.039 −0.012 (−0.023 to −0.001)d
0.77 (0.52 to 1.14)
Writing 0.033 0.020 −0.012 (−0.015 to −0.009)
0.034 0.021 −0.012 (−0.021 to −0.004)d
0.63 (0.43 to 0.94)d

Grade 3 (fourth year of schooling) NAPLAN, scores below “National Minimum Standard.”

Children with missing outcome data exempt from NAPLAN (2.1%) are included—their outcome is “below national minimum standard.”

a Causal model: imputed data pooled estimates—regression adjustment model with stabilised inverse probability weighting, trimmed for complete weight overlap. Variables included in model: sex at birth, age at assessment, language background other than English, socioeconomic status, maternal age, parity, education, and second parent education.

b Small variation in case number for each of the 20 imputation datasets.

c 95% confidence interval does not cross the null.

d Chi2 test.

ATE, average treatment effect; IVF, in vitro fertilisation (cases); NAPLAN, National Assessment Program for Literacy and Numeracy; SE, standard error.

Missing data

Spontaneously conceived children were more likely to have missing NAPLAN data (7.6%) than IVF-conceived children (5.9%, Chi2 p < 0.001). During the primary analysis, missing outcomes related to a child being absent or withdrawing from the test were imputed. The results presented include 7,222 children who were exempt from sitting the NAPLAN, with their results set to the lowest possible outcome score. Sensitivity analysis was performed by (1) excluding these children; and (2) including the exempt cases, with multiple imputation of their missing outcomes. There was no meaningful difference in the results (Tables A and B in S9 File). Most covariates had minimal or no missing data (<4.0%). Second parent school education level was missing in 13.8% of cases and post-school education missing in 15.4% of cases.

Sensitivity analyses

To validate our analysis model, we re-examined our AEDC primary outcome and the NAPLAN binary domain outcomes using TMLE modelling. Results from the TMLE model did not meaningfully differ from the findings of the primary analysis (Table A in S10 File).

An E-value was estimated for both primary outcomes and was found to be 1.90 and 1.77 for AEDC and NAPLAN outcomes, respectively, suggesting that an unknown bias of sufficient magnitude to change the study findings is unlikely (Figs A and B in S11 File).

Discussion

Using a causal inference approach, we found no effect of IVF conception on developmental vulnerability at school entry in Victorian children born between 2005 and 2014. Additionally, IVF-conceived children performed as well as their spontaneously conceived peers in school-based psychometric testing at age 7 to 9 years.

For the first time, our study has estimated the causal effect of IVF conception on global childhood development at school entry and educational outcomes at primary school, under the assumptions of causal inference. Using an updated epidemiological approach [25], this study provides robust evidence about the longer term implications of IVF conception. The findings of this study offer timely reassurance about the impact of IVF conception on the developmental and educational outcomes at primary school age of the children conceived. Neither the outcomes of developmental vulnerability at school entry nor educational achievement at age 7 to 9 differed substantially between IVF- and spontaneously conceived children. Among 4 out of 5 NAPLAN individual domain national minimum standard results, there was a trend towards better performance in the IVF cases, but the clinical and social implications of these findings are difficult to quantify.

Two large Scandinavian studies have reported on childhood outcomes following IVF conception. Norrman and colleagues found that IVF-conceived children perform worse on school-based assessment in year 9 [18], among their cohort of just over 8,000 IVF-conceived children. Wienecke and colleagues reported that IVF-conceived children had poorer school performance than controls and that spontaneously conceived children of subfertile parents also had poorer outcomes [19]. By examining a subpopulation of spontaneously conceived children of subfertile parents, the authors of this Danish study concluded that the IVF process itself was not responsible for the differences demonstrated [19].

These past studies are limited by examining historical birth cohorts dating back prior to the year 2001. Our study examines a more contemporary birth cohort (2005 to 2014), which is important given the advances in artificial reproductive techniques that have occurred since the turn of the century. IVF technologies that have evolved since this time include the introduction of blastocyst culture, vitrification, and single-embryo transfer [4,43]. Thus, our study findings are more generalisable to contemporary fertility practice. Importantly, our use of updated epidemiological and statistical methods ensures that we have estimated effects that have a causal interpretation. It is important that our methods are replicated in future studies to strengthen the existing evidence base.

Given the use of observational data, there were missing data and inherent differences in the covariate profile of the exposure cohorts. An a priori SAP was developed to overcome these limitations. First, inverse probability weighting with regression adjustment was used to mimic exchangeable treatment and control comparison groups, similar to those that would be generated by randomisation in a controlled trial. The success of this procedure is demonstrated by achieving adequate covariate balance and thus sufficient overlap of covariate distributions between exposure groups after inverse probability weighting (Figs A–D in S6 File). Second, we sought to mitigate the potential biases resulting from missing data. In order to do this, we performed multiple imputation of covariates included in our model and then compared the results of analyses that were based on complete cases with those of multiply imputed datasets (Tables A and B in S12 File).

It is possible that unmeasured common cause confounders may have led to bias in estimating the ATEs. Many important factors (socioeconomic status, maternal age, and education) were identified a priori, measured, and included in the estimation procedure. Potential known but unmeasured sources of bias include subfertility and maternal BMI. Current evidence suggests that subfertility is likely to be associated with poorer childhood outcomes [19]. Consequently, if this variable were able to be measured and included in our causal model, correcting for it is likely to have favoured IVF-conceived children in our analysis. Maternal BMI is also likely to have followed the same trend with higher average BMI among the IVF group (after accounting for socioeconomic position) and high BMI being associated with poorer perinatal and childhood outcomes [44].

Unmeasured variables may have had an impact on the outcome. Factors such as childcare attendance or grandparent involvement will be preceded on causal pathways by covariates that were measured and included in the model, such as maternal age and socioeconomic status [4547]. These factors were therefore considered to mediate rather than confound the relationship between these covariates and the outcome. Sensitivity analyses were performed to further evaluate unmeasured confounding, with E-values calculated for AEDC and NAPLAN primary outcomes. Within the limitation of E-values, these analyses indicate that it is unlikely an unknown bias exists without our knowledge and with the necessary magnitude of effect and prevalence to change our conclusions (Figs A and B in S11 File) [48].

Generalisation of our findings to all IVF births is a potential study limitation. As described in our Methods, observations with non-overlapping PSs were excluded from analysis in order to meet the assumption of positivity, required for causal inference under the potential outcomes framework. Generalisation of our findings to all IVF births therefore requires the consideration that the baseline characteristics of the population of interest are comparable to the IVF cases analysed in our final cohort.

Due to the use of school-based outcome assessments, our cohort was limited to children attending school. AEDC, as a triennial assessment, limited our sample to children captured during assessment years, and the later years of our birth cohort had not yet reached the assessment age for NAPLAN outcomes to be captured. However, our study included 70% of the relevant birth cohort for the study timeframe and in the years where both AEDC and NAPLAN data were available, over 95% of the Victorian birth cohort was sampled (Table A in S3 File). The remaining approximately 5% of children not sampled represent failed linkages as well as excluded IVF conceptions (due to non-Victorian IVF or non-IVF-assisted reproduction). A small percentage will also represent children with a disability significant enough not to attend mainstream school, introducing potential selection bias. Importantly, however, our study was not designed to assess severe disability or developmental delay, but rather an overall measure of global development and school achievement.

Furthermore, through the examination of school-based outcomes, our study was inherently designed to examine outcomes for liveborn children. “Live birth bias” as it is known, is a recognised limitation of observational studies that investigate periconception and antenatal exposures [49]. For the purposes of this research question, the outcomes of failed conception, miscarriage, stillbirth are considered alternative endpoints and less relevant to the research question that aims to compare the school-age outcomes of children born following IVF conception with those who were conceived without assistance.

Under the specified assumptions, this analysis has demonstrated that there is no causal effect within the population studied of IVF conception on early childhood developmental vulnerability and school-age educational outcomes. Compared with spontaneously conceived children, children conceived by IVF were no more likely to be developmentally vulnerable at school entry and had equivalent numeracy and literacy performance by age 7 to 9 years. These findings provide important reassurance for current and prospective parents and their treating clinicians.

Supporting information

S1 Fig. Direct acyclic graph.

(DOCX)

S1 File. Methods: Description of outcome metrics.

(DOCX)

S2 File. Study protocol.

(DOCX)

S3 File

Tables A and B. Table A. Successful linkages by birth year. Table B. Annual cycle summaries from major Victorian IVF providers 2010–2014.

(DOCX)

S4 File

Fig A. Analysis flow chart (NAPLAN).

(DOCX)

S5 File. Methods: Multiple imputation model summary and model diagnostics.

Table A. Missing data summary (NAPLAN). Fig A. NAPLAN convergence. Fig B. NAPLAN density plots of observed and imputed data. Fig C. NAPLAN distribution of outcome and covariates after imputation in m = 1 dataset.

(DOCX)

S6 File

Figs A–D: Distribution and overlap of manually calculated stabilised weights. Fig A. AEDC imputation #1. Fig B. AEDC imputation #13. Fig C. NAPLAN imputation #1. Fig D. NAPLAN imputation #13.

(DOCX)

S7 File

Figs A–C: Variable standardised mean differences. Fig A. NAPLAN imputation #1 variable standardised mean differences. Fig B. NAPLAN imputation #13 variable standardised mean differences. Fig C. AEDC imputation #7 variable standardised mean differences.

(DOCX)

S8 File

Tables A and B. Table A. Sensitivity analysis–AEDC (special needs multiply imputed). Table B. Sensitivity analysis–AEDC (special needs excluded).

(DOCX)

S9 File

Tables A and B. Table A. Sensitivity analysis–NAPLAN (exempt multiply imputed). Table B. Sensitivity analysis–NAPLAN (exempt excluded).

(DOCX)

S10 File

Table A–Sensitivity analysis–binary outcomes TMLE.

(DOCX)

S11 File

Figs A and B. Fig A. Sensitivity analysis AEDC primary outcome–E-value estimation. Fig B. Sensitivity analysis NAPLAN primary outcome–E-value estimation.

(DOCX)

S12 File

Tables A and B. Table A. Traditional regression and treatment effect models (AEDC). Table B. Traditional regression and treatment effect models (NAPLAN).

(DOCX)

S13 File. STROBE guideline checklist.

(DOCX)

Acknowledgments

This paper uses data from the Australian Early Development Census (AEDC). The AEDC is funded by the Australian Government Department of Education, Skills and Employment. The findings and views reported are those of the author(s) and should not be attributed to the Department or the Australian Government. We are grateful for the provision of data by the AEDC.

We are grateful to CCOPMM for providing access to the data used for this project and for the assistance of the staff at Safer Care Victoria. The conclusions, findings, opinions and views, or recommendations expressed in this paper are strictly those of the author(s). They do not necessarily reflect those of CCOPMM.

We are thankful for contribution of Victorian IVF providers, Melbourne IVF, Monash IVF, and City Fertility Centre to this research. We acknowledge the significant amount of work undertaken on behalf of this project and appreciate the opportunity to work with staff from each unit.

Finally, we are grateful to the Australian Curriculum Assessment and Reporting Authority (ACARA) for their assistance, collaboration, and for providing the National Assessment Program for Literacy and Numeracy (NAPLAN) data.

Abbreviations

AEDC

Australian Early Development Census

ATE

average treatment effect

ATSI

Aboriginal and Torres Strait Islander

BMI

body mass index

CI

confidence interval

CVDL

Centre for Victorian Data Linkage

DAG

directed acyclic graph

ICSI

intracytoplasmic sperm injection

IPW

inverse probability weight

IPWRA

inverse-probability-weighted regression adjustment

IVF

in vitro fertilisation

MD

mean differences

NAPLAN

National Assessment Program–Literacy and Numeracy

NMS

national minimum standard

POM

potential outcome means

PS

propensity score

RD

risk difference

RR

relative risk

SAP

statistical analysis plan

SE

standard error

SEIFA

Socio-Economic Indexes for Areas

TMLE

targeted maximum likelihood estimation

VPDC

Victorian Perinatal Data Collection

Data Availability

Data for this study was provided by various data custodians and linked by the Centre for Victorian Data linkage (https://www.health.vic.gov.au/reporting-planning-data/the-centre-for-victorian-data-linkage). With relevant ethical approval, data are available upon request to the governing data custodians.

Funding Statement

This work was supported by the National Health and Medical Research Council through the Australian Federal Government Graduate Research Scheme (AK) and Mercy Foundation, through Mercy Perinatal (AK). Ferring Pharmaceutics supported this work through an unconditional research grant (AK). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Adamson GD, de Mouzon J, Chambers G, Zegers-Hochschild F, Ishihara O, Banker M, et al. International Committee for Monitoring Assisted Reproductive Technology: world report on assisted reproductive technology. 2017. Available from: https://www.icmartivf.org/reports-publications/. [Google Scholar]
  • 2.European Society of Human Reproduction and Embryology. More than 8 million babies born from IVF since the world’s first in 1978: European IVF pregnancy rates now steady at around 36 percent, according to ESHRE monitoring. ScienceDaily. 2018. [cited 2021 Nov 8]. Available from: www.sciencedaily.com/releases/2018/07/180703084127.htm [Internet]. [Google Scholar]
  • 3.European Society of Human Reproduction and Embryology. World’s number of IVF and ICSI babies has now reached a calculated total of 5 million. ScienceDaily. 2012. [cited 2021 Nov 8]. Available from www.sciencedaily.com/releases/2012/07/120702134746.htm>. [Google Scholar]
  • 4.Newman JE, Paul RC, Chambers GM. Assisted reproductive technology in Australia and New Zealand 2018. National Perinatal Epidemiology and Statistics Unit, the University of New South Wales, Sydney, 2020. Available from: https://npesu.unsw.edu.au/surveillance/assisted-reproductive-technology-australia-and-new-zealand-2018. [Google Scholar]
  • 5.Australian Bureau of Statistics. Births, Australia. Canberra, Australia: Australian Bureau of Statistics, 2018. Available from: https://www.abs.gov.au/statistics/people/population/births-australia/2018. [Google Scholar]
  • 6.Pandey S, Shetty A, Hamilton M, Bhattacharya S, Maheshwari A. Obstetric and perinatal outcomes in singleton pregnancies resulting from IVF/ICSI: a systematic review and meta-analysis. Hum Reprod Update. 2012;18(5):485–503. doi: 10.1093/humupd/dms018 . [DOI] [PubMed] [Google Scholar]
  • 7.Qin J, Liu X, Sheng X, Wang H, Gao S. Assisted reproductive technology and the risk of pregnancy-related complications and adverse pregnancy outcomes in singleton pregnancies: a meta-analysis of cohort studies. Fertil Steril. 2016;105(1):73–85.e6. doi: 10.1016/j.fertnstert.2015.09.007 [DOI] [PubMed] [Google Scholar]
  • 8.Marino JL, Moore VM, Willson KJ, Rumbold A, Whitrow MJ, Giles LC, et al. Perinatal outcomes by mode of assisted conception and sub-fertility in an Australian data linkage cohort. PLoS ONE. 2014;9(1):e80398. Epub 2014/01/15. doi: 10.1371/journal.pone.0080398 ; PubMed Central PMCID: PMC3885393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Stromberg B, Dahlquist G, Ericson A, Finnstrom O, Koster M, Stjernqvist K. Neurological sequelae in children born after in-vitro fertilisation: a population-based study. Lancet. 2002;359(9305):461–5. Epub 2002/02/21. doi: 10.1016/S0140-6736(02)07674-2 . [DOI] [PubMed] [Google Scholar]
  • 10.Bay B, Mortensen EL, Hvidtjorn D, Kesmodel US. Fertility treatment and risk of childhood and adolescent mental disorders: register based cohort study. BMJ. 2013;347:f3978. Epub 2013/07/09. doi: 10.1136/bmj.f3978 ; PubMed Central PMCID: PMC3702157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Pinborg A, Loft A, Schmidt L, Andersen AN. Morbidity in a Danish national cohort of 472 IVF/ICSI twins, 1132 non-IVF/ICSI twins and 634 IVF/ICSI singletons: health-related and social implications for the children and their families. Hum Reprod. 2003;18(6):1234–43. Epub 2003/05/30. doi: 10.1093/humrep/deg257 . [DOI] [PubMed] [Google Scholar]
  • 12.Sandin S, Nygren KG, Iliadou A, Hultman CM, Reichenberg A. Autism and mental retardation among offspring born after in vitro fertilization. JAMA. 2013;310(1):75–84. Epub 2013/07/04. doi: 10.1001/jama.2013.7222 . [DOI] [PubMed] [Google Scholar]
  • 13.Davies MJ, Moore VM, Willson KJ, Van Essen P, Priest K, Scott H, et al. Reproductive technologies and the risk of birth defects. N Engl J Med. 2012;366(19):1803–13. Epub 2012/05/09. doi: 10.1056/NEJMoa1008095 . [DOI] [PubMed] [Google Scholar]
  • 14.Leslie GI, Gibson FL, McMahon C, Cohen J, Saunders DM, Tennant C. Children conceived using ICSI do not have an increased risk of delayed mental development at 5 years of age. Hum Reprod. 2003;18(10):2067–72. Epub 2003/09/26. doi: 10.1093/humrep/deg408 . [DOI] [PubMed] [Google Scholar]
  • 15.Knoester M, Helmerhorst FM, Vandenbroucke JP, van der Westerlaken LA, Walther FJ, Veen S, et al. Cognitive development of singletons born after intracytoplasmic sperm injection compared with in vitro fertilization and natural conception. Fertil Steril. 2008;90(2):289–96. Epub 2007/11/06. doi: 10.1016/j.fertnstert.2007.06.090 . [DOI] [PubMed] [Google Scholar]
  • 16.Mains L, Zimmerman M, Blaine J, Stegmann B, Sparks A, Ansley T, et al. Achievement test performance in children conceived by IVF. Hum Reprod. 2010;25(10):2605–11. Epub 2010/08/19. doi: 10.1093/humrep/deq218 . [DOI] [PubMed] [Google Scholar]
  • 17.Barbuscia A, Mills MC. Cognitive development in children up to age 11 years born after ART-a longitudinal cohort study. Hum Reprod. 2017;32(7):1482–8. Epub 2017/05/26. doi: 10.1093/humrep/dex102 ; PubMed Central PMCID: PMC5850752. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Norrman E, Petzold M, Bergh C, Wennerholm UB. School performance in singletons born after assisted reproductive technology. Hum Reprod. 2018;33(10):1948–59. Epub 2018/08/31. doi: 10.1093/humrep/dey273 . [DOI] [PubMed] [Google Scholar]
  • 19.Wienecke LS, Kjaer SK, Frederiksen K, Hargreave M, Dalton SO, Jensen A. Ninth-grade school achievement in Danish children conceived following fertility treatment: a population-based cohort study. Fertil Steril. 2020;113(5):1014–23. Epub 2020/05/11. doi: 10.1016/j.fertnstert.2020.01.012 . [DOI] [PubMed] [Google Scholar]
  • 20.Ronfani L, Vecchi Brumatti L, Mariuz M, Tognin V, Bin M, Ferluga V, et al. The Complex Interaction between Home Environment, Socioeconomic Status, Maternal IQ and Early Child Neurocognitive Development: A Multivariate Analysis of Data Collected in a Newborn Cohort Study. PLoS ONE. 2015;10(5):e0127052. Epub 2015/05/23. doi: 10.1371/journal.pone.0127052 ; PubMed Central PMCID: PMC4440732. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ferreira L, Godinez I, Gabbard C, Vieira JLL, Cacola P. Motor development in school-age children is associated with the home environment including socioeconomic status. Child Care Health Dev. 2018;44(6):801–6. Epub 2018/08/02. doi: 10.1111/cch.12606 . [DOI] [PubMed] [Google Scholar]
  • 22.Falster K, Hanly M, Banks E, Lynch J, Chambers G, Brownell M, et al. Maternal age and offspring developmental vulnerability at age five: A population-based cohort study of Australian children. PLoS Med. 2018;15(4):e1002558. Epub 2018/04/25. doi: 10.1371/journal.pmed.1002558 ; PubMed Central PMCID: PMC5915778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hanly M, Falster K, Banks E, Lynch J, Chambers GM, Brownell M, et al. Role of maternal age at birth in child development among Indigenous and non-Indigenous Australian children in their first school year: a population-based cohort study. Lancet Child Adolesc Health. 2020;4(1):46–57. Epub 2019/11/24. doi: 10.1016/S2352-4642(19)30334-7 . [DOI] [PubMed] [Google Scholar]
  • 24.Jeong J, Kim R, Subramanian SV. How consistent are associations between maternal and paternal education and child growth and development outcomes across 39 low-income and middle-income countries? J Epidemiol Community Health. 2018;72(5):434–41. Epub 2018/02/14. doi: 10.1136/jech-2017-210102 . [DOI] [PubMed] [Google Scholar]
  • 25.Hernan MA. Methods of Public Health Research—Strengthening Causal Inference from Observational Data. N Engl J Med. 2021;385(15):1345–8. Epub 2021/10/02. doi: 10.1056/NEJMp2113319 . [DOI] [PubMed] [Google Scholar]
  • 26.Hernán MA, Robins JM. Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available. Am J Epidemiol. 2016;183(8):758–64. Epub 20160318. doi: 10.1093/aje/kwv254 ; PubMed Central PMCID: PMC4832051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Hernán MA. Methods of Public Health Research—Strengthening Causal Inference from Observational Data. N Engl J Med. 2021;385(15):1345–1348. doi: 10.1056/NEJMp2113319 [DOI] [PubMed] [Google Scholar]
  • 28.Flood MM, McDonald SJ, Pollock WE, Davey MA. Data accuracy in the Victorian Perinatal Data Collection: Results of a validation study of 2011 data. Health Inf Manag. 2017;46(3):113–26. Epub 20170127. doi: 10.1177/1833358316689688 . [DOI] [PubMed] [Google Scholar]
  • 29.Davey MA, Sloan ML, Palma S, Riley M, King J. Methodological processes in validating and analysing the quality of population-based data: a case study using the Victorian Perinatal Data Collection. Health Inf Manag. 2013;42(3):12–9. doi: 10.1177/183335831304200301 . [DOI] [PubMed] [Google Scholar]
  • 30.The Australian Early Development Census. ABOUT THE AEDC 2022. 2020. [cited 2022 Feb 18]. Available from: https://www.aedc.gov.au/about-the-aedc. [Google Scholar]
  • 31.The Australian Curriculum Assessment and Reporting Aurthority (ACARA). The National Assessment Program for Literacy and Numeracy. Available from: https://nap.edu.au.
  • 32.The Australian Curriculum Assessment and Reporting Aurthority (ACARA). Test development. 2018. Available from: https://www.nap.edu.au/about/test-development. [Google Scholar]
  • 33.Australian Bureau of Statistics. Socio-Economic Indexes for Areas. 2016. Available from: https://www.abs.gov.au/websitedbs/censushome.nsf/home/seifa. [Google Scholar]
  • 34.Tennant PWG, Murray EJ, Arnold KF, Berrie L, Fox MP, Gadd SC, et al. Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations. Int J Epidemiol. 2021;50(2):620–632. doi: 10.1093/ije/dyaa213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.A Guide for Data Integration Projects Involving Commonwealth Data for Statistcial and Research Purposes Australian Government—National Statistcial Service [23 July 2021]. Available from: https://statisticaldataintegration.abs.gov.au/topics/applying-the-separation-principle. [Google Scholar]
  • 36.Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. doi: 10.1093/biomet/70.1.41 [DOI] [Google Scholar]
  • 37.Eddings W, Marchenko Y. Diagnostics for multiple imputation in Stata. Stata J. 2012;12(3):353–367. [Google Scholar]
  • 38.Williamson EJ, Forbes A, Wolfe R. Doubly robust estimators of causal exposure effects with missing data in the outcome, exposure or a confounder. Stat Med. 2012;31:4382–400. doi: 10.1002/sim.5643 [DOI] [PubMed] [Google Scholar]
  • 39.Desai RJ, Franklin JM. Alternative approaches for confounding adjustment in observational studies using weighting based on the propensity score: a primer for practitioners. BMJ. 2019;367:l5657. Epub 2019/10/28. doi: 10.1136/bmj.l5657 . [DOI] [PubMed] [Google Scholar]
  • 40.Luque-Fernandez MA. “ELTMLE: Stata module to provide Ensemble Learning Targeted Maximum Likelihood Estimation”. Statistical Software Components S458337. revised 04 Jul 2021 ed: Boston College Department of Economics; 2017. [Google Scholar]
  • 41.StataCorp. Stata Statistical Software, Release 17. College Station, Texas. 2021. [Google Scholar]
  • 42.R Core Team. R Foundation for Statistical Computing. R: A language and environment for statistical computing. Vienna, Austria; 2021. [Google Scholar]
  • 43.Chambers GM, Paul RC, Harris K, Fitzgerald O, Boothroyd CV, Rombauts L, et al. Assisted reproductive technology in Australia and New Zealand: cumulative live birth rates as measures of success. Med J Aust. 2017;207(3):114–8. Epub 2017/08/03. doi: 10.5694/mja16.01435 . [DOI] [PubMed] [Google Scholar]
  • 44.Pearce A, Scalzi D, Lynch J, Smithers LG. Do thin, overweight and obese children have poorer development than their healthy-weight peers at the start of school? Findings from a South Australian data linkage study. Early Child Res Q. 2016;35:85–94. Epub 2016/05/10. doi: 10.1016/j.ecresq.2015.10.007 ; PubMed Central PMCID: PMC4850238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Sadruddin AFA, Ponguta LA, Zonderman AL, Wiley KS, Grimshaw A, Panter-Brick C. How do grandparents influence child health and development? A systematic review. Soc Sci Med. 2019;239:112476. Epub 2019/09/21. doi: 10.1016/j.socscimed.2019.112476 . [DOI] [PubMed] [Google Scholar]
  • 46.Chung EO, Hagaman A, LeMasters K, Andrabi N, Baranov V, Bates LM, et al. The contribution of grandmother involvement to child growth and development: an observational study in rural Pakistan. BMJ Global Health. 2020;5(8). Epub 2020/08/14. doi: 10.1136/bmjgh-2019-002181 ; PubMed Central PMCID: PMC7418670. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Allen L, Kelly B. Committee on the Science of Children Birth to Age 8: Deepening and Broadening the Foundation for Success. Transforming the Workforce for Children Birth Through Age 8: A Unifying Foundation. editors. Washington (DC); 2015. [PubMed] [Google Scholar]
  • 48.VanderWeele TJ, Ding P. Sensitivity Analysis in Observational Research: Introducing the E-Value. Ann Intern Med. 2017;167(4):268–74. Epub 20170711. doi: 10.7326/M16-2607 . [DOI] [PubMed] [Google Scholar]
  • 49.Goin DE, Casey JA, Kioumourtzoglou MA, Cushing LJ, Morello-Frosch R. Environmental hazards, social inequality, and fetal loss: Implications of live-birth bias for estimation of disparities in birth outcomes. Environ Epidemiol. 2021;5(2):e131. Epub 20210226. doi: 10.1097/EE9.0000000000000131 ; PubMed Central PMCID: PMC8043739. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Philippa Dodd

4 Jul 2022

Dear Dr Lindquist,

Thank you for submitting your manuscript entitled "School-age outcomes among IVF-conceived children: a causal inference analysis using linked population-wide data" for consideration by PLOS Medicine.

Your manuscript has now been evaluated by the PLOS Medicine editorial staff as well as by an academic editor with relevant expertise and I am writing to let you know that we would like to send your submission out for external peer review.

However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Please re-submit your manuscript within two working days, i.e. by Jul 06 2022 11:59PM.

Login to Editorial Manager here: https://www.editorialmanager.com/pmedicine

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. Once your manuscript has passed all checks it will be sent out for review.

Feel free to email us at plosmedicine@plos.org if you have any queries relating to your submission.

Kind regards,

Philippa

Dr Philippa C Dodd, MBBS MRCP PhD

Senior Editor

PLOS Medicine

Decision Letter 1

Philippa Dodd

6 Sep 2022

Dear Dr. Lindquist,

Thank you very much for submitting your manuscript "School-age outcomes among IVF-conceived children: a causal inference analysis using linked population-wide data" (PMEDICINE-D-22-02257R1) for consideration at PLOS Medicine.

Your paper was evaluated by a senior editor and discussed among all the editors here. It was also discussed with an academic editor with relevant expertise, and sent to independent reviewers, including a statistical reviewer. The reviews are appended at the bottom of this email and any accompanying reviewer attachments can be seen via the link below:

[LINK]

In light of these reviews, I am afraid that we will not be able to accept the manuscript for publication in the journal in its current form, but we would like to consider a revised version that addresses the reviewers' and editors' comments. Obviously we cannot make any decision about publication until we have seen the revised manuscript and your response, and we plan to seek re-review by one or more of the reviewers.

In revising the manuscript for further consideration, your revisions should address the specific points made by each reviewer and the editors. Please also check the guidelines for revised papers at http://journals.plos.org/plosmedicine/s/revising-your-manuscript for any that apply to your paper. In your rebuttal letter you should indicate your response to the reviewers' and editors' comments, the changes you have made in the manuscript, and include either an excerpt of the revised text or the location (eg: page and line number) where each change can be found. Please submit a clean version of the paper as the main article file; a version with changes marked should be uploaded as a marked up manuscript.

In addition, we request that you upload any figures associated with your paper as individual TIF or EPS files with 300dpi resolution at resubmission; please read our figure guidelines for more information on our requirements: http://journals.plos.org/plosmedicine/s/figures. While revising your submission, please upload your figure files to the PACE digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at PLOSMedicine@plos.org.

We expect to receive your revised manuscript by Sep 27 2022 11:59PM. Please email us (plosmedicine@plos.org) if you have any questions or concerns.

***Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.***

We ask every co-author listed on the manuscript to fill in a contributing author statement, making sure to declare all competing interests. If any of the co-authors have not filled in the statement, we will remind them to do so when the paper is revised. If all statements are not completed in a timely fashion this could hold up the re-review process. If new competing interests are declared later in the revision process, this may also hold up the submission. Should there be a problem getting one of your co-authors to fill in a statement we will be in contact. YOU MUST NOT ADD OR REMOVE AUTHORS UNLESS YOU HAVE ALERTED THE EDITOR HANDLING THE MANUSCRIPT TO THE CHANGE AND THEY SPECIFICALLY HAVE AGREED TO IT. You can see our competing interests policy here: http://journals.plos.org/plosmedicine/s/competing-interests.

Please use the following link to submit the revised manuscript:

https://www.editorialmanager.com/pmedicine/

Your article can be found in the "Submissions Needing Revision" folder.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please ensure that the paper adheres to the PLOS Data Availability Policy (see http://journals.plos.org/plosmedicine/s/data-availability), which requires that all data underlying the study's findings be provided in a repository or as Supporting Information. For data residing with a third party, authors are required to provide instructions with contact information for obtaining the data. PLOS journals do not allow statements supported by "data not shown" or "unpublished results." For such statements, authors must provide supporting data or cite public sources that include it.

We look forward to receiving your revised manuscript.

Sincerely,

Philippa Dodd, MBBS MRCP PhD

PLOS Medicine

plosmedicine.org

-----------------------------------------------------------

Requests from the editors:

ABSTRACT

Abstract Methods and Findings:

Please quantify the main results with 95% CIs and p values.

Please include the important dependent variables that are adjusted for in the analyses.

In the last sentence of the Abstract Methods and Findings section, please describe the main limitation(s) of the study's methodology.

At this stage, we ask that you include a short, non-technical Author Summary of your research to make findings accessible to a wide audience that includes both scientists and non-scientists. The Author Summary should immediately follow the Abstract in your revised manuscript. This text is subject to editorial change and should be distinct from the scientific abstract. Please see our author guidelines for more information: https://journals.plos.org/plosmedicine/s/revising-your-manuscript#loc-author-summary

INTRODUCTION

Please conclude the Introduction with a clear description of the study question or hypothesis.

METHODS and RESULTS

Please provide 95% CIs and p values where relevant

When a p value is given, please specify the statistical test used to determine it.

GENERAL

in the context of the reviewer comments below, it might be necessary to revise your title

Please do so according to PLOS Medicine's style. Your title must be nondeclarative and not a question. It should begin with main concept if possible. "Effect of" should be used only if causality can be inferred, i.e., for an RCT. Please place the study design ("A randomized controlled trial," "A retrospective study," "A modelling study," etc.) in the subtitle (ie, after a colon)

Please ensure that the study is reported according to the STROBE guideline and include the completed STROBE checklist as Supporting Information. Please add the following statement, or similar, to the Methods: "This study is reported as per the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guideline (S1 Checklist)." The STROBE guideline can be found here: http://www.equator-network.org/reporting-guidelines/strobe/ When completing the checklist, please use section and paragraph numbers, rather than page numbers.

Thank you for including a prospective analysis plan, please include any changes made to the analyses-- including those made in response to peer review comments-- in the Methods section of the paper, with rationale.

In the manuscript text, please indicate: (1) the specific hypotheses you intended to test, (2) the analytical methods by which you planned to test them, (3) the analyses you actually performed, and (4) when reported analyses differ from those that were planned, transparent explanations for differences that affect the reliability of the study's results. If a reported analysis was performed based on an interesting but unanticipated pattern in the data, please be clear that the analysis was data-driven.

DISCUSSION

Please remove sub-headings such that the discussion is presented follows: a short, clear summary of the article's findings; what the study adds to existing research and where and why the results may differ from previous research; strengths and limitations of the study; implications and next steps for research, clinical practice, and/or public policy; one-paragraph conclusion.

Comments from the academic editor:

1. I do not buy the causal language. All observational studies have potential for unmeasured confounders. I think the use and emphasis of causal language could actually distract attention from the main message, which I think is important. In fact, the unadjusted data indicate an apparent protective effect of IVF which disappears on adjustment (Table 2). It would be good to add 95% CI to the unadjusted analysis to see if this was beyond the play of chance. But it is apparent that measured parental characteristics are associated with an increased chance of using IVF and a decreased risk of poor educational outcome. Hence, it follows that unmeasured parental characteristics associated with an increased chance of using IVF and a decreased risk of poor educational outcome could be masking an adverse effect of IVF and the use of causal language is wrong.

2. It would be good to have a positive control, i.e. data on a factor which we know is associated with educational outcome. That way we know their methods can detect an association if there is one there. Preterm birth is an obvious one. Perhaps an easy way to do this is to provide coefficients for all the covariables that they included in adjustment.

3. It would actually be quite interesting to see what the results are if they applied a more commonly used method of multivariate adjustment, such as logistic regression or propensity score analysis just to see how different (if at all) the results are with the method they employ. Comments 2 and 3 could be addressed by providing a supplementary table of unadjusted and adjusted ORs from logistic regression to allow comparison.

4. I agree with the comments of the statistician. But it is reassuring that their overall assessment is positive.

Comments from the reviewers:

Reviewer #1 (methodological reviewer): This aim of this study was to determine the causal effect of in-vitro fertilization (IVF) conception on primary schoolage childhood developmental and educational outcomes, compared with outcomes following spontaneous conception.

Comments:

"Covariates to be considered for inclusion in the statistical analysis models were decided a priori by the authorship team whose expertise included epidemiology, perinatology, reproductive endocrinology and education"

The authors have satisfactorily adjusted for potential confounding in the analysis.

"For the pre-specified statistical analysis plan (SAP) and the directed acyclic graph (DAG), see supplementary file (eFigure 1 in S3)."

The authors have suitably provided the DAG in the supplementary material.

Can the authors also please provide a copy of the SAP in the supplementary material (it is not currently clear if or where this is attached to the file)?

"A doubly robust inverse-probability-weighted regression adjustment (IPWRA) model43,44 was used..."

The authors have applied technically appropriate and rigorous statistical methods within the context of this research.

"Provided assumptions are satisfied, the estimates generated from these routines can be interpreted as the population average causal effect"

Can the authors please include a transparent and thorough discussion on the assumptions that are required for causal inferences to be drawn?

"Analysis involved detailed examination of missing data, consideration of missing outcome data, multiple imputation of missing covariate data, consideration of clustering within mothers and adjustments to effect size modelling. Finally, sensitivity analysis was also performed to address identified sources of potential bias."

Can the authors please provide more detail on these important analyses within the main article here?

"Overall, a total of 11,059 IVF-conceived children and 401,654 spontaneously conceived children were included in the study (2,614 IVF cases and 100,184 controls were in both study arms)."

Whilst the authors have examined that "Analysis of the linked and nonlinked cases showed little evidence of association between linkage and exposure status (p = 0.80); that is, IVF cases were just as likely to be included in the final linked cohort as controls", can the authors please further comment on whether the final included samples analysed in this study can be considered to be representative of the wider populations of interest within the context of this research?

"Outcome data were missing for 5.6% of the AEDC-linked cohort. The vast majority (92%) of these missing cases were children with special needs (5.2% of overall cohort). There was no evidence of an association between the presence of missing outcome and exposure status (p = 0.68). Sensitivity analysis was performed by 1) excluding children with special needs and 2) including these children, with multiple imputation of their missing outcomes (see supplementary file: eTable 4 and 5 - Sensitivity Analysis (AEDC) in S11 and s12). "

and

"Spontaneously conceived children were more likely to have missing NAPLAN data (7.6%) than IVF-conceived children (5.9%, p<0.001). During the primary analysis, missing outcomes related to a child being absent or withdrawing from the test, were imputed. The results presented include 7,222 children who were exempt from sitting the NAPLAN, with their results set to the lowest possible outcome score. Sensitivity analysis was performed by 1) excluding these children and 2) including the exempt cases, with multiple imputation of their missing outcomes. There was no meaningful difference in the results (see supplementary file: eTable 6 and 7 - Sensitivity Analysis (NAPLAN) in S13 and S14). "

The authors have appropriately communicated and handled missing outcome data within the analysis.

"Maternal education level was missing for 30.5% and maternal post-school education was missing for 31.6%."

and

"Second parent school education level was missing in 13.8% of cases and post-school education missing in 15.4% of cases. "

Whilst the authors discuss missing data in the discussion of the study limitations, can the authors please clarify and expand on how they dealt with missing covariate data in the Methods and Results sections?

Overall, the results are presented clearly and the main study limitations have been thoroughly addressed.

Reviewer #2: The authors report the lack of an association of IVF with later childhood educational outcomes using routinely collected data from Australia. This is an extremely well conducted and well written study, with robust and appropriate sensitivity analyses which adds substantially to the field and will reassures many parents.

Minor comments:

The authors make a big deal of the causal analysis, and I would suggest statistical review to confirm the appropriateness of the methodology and causal claims. I am conscious that the same used to be said regarding matching and that has now shown to be incorrect. I was not convicned that teh paper would materially suffer by removing the "casual anlaysis" claims in the title or intorudction etc.

I wondered if graphical representation of the results would further enhance the paper.

Reviewer #3: This study examined associations of ART conception with School-age outcomes using register data from Australia. The authors should be commended for trying to provide more reliable causal evidence from observational data, something we should all be doing. I have only a few minor comments

eTable 1 should be brought into the main paper and / or a separate section in the methods should be added on the target trial emulation procedure

Where is the study protocol published / date it was published?

In line with the glossary, the exposure of interest should be referred to as ART rather than IVF

The aim of the study was to emulate a target trial however, baseline characteristics differed considerably between the treatment groups. Can you describe the implications of this. Does it mean that trial emulation failed? Is there a way you can try to fix this?

If data are available, it would be of interest to present results separately for fresh and frozen embryo transfers. Please consider this.

Any attachments provided with reviews can be seen via the following link:

[LINK]

Decision Letter 2

Philippa Dodd

18 Oct 2022

Dear Dr. Lindquist,

Thank you very much for re-submitting your manuscript "School-age outcomes among IVF-conceived children: a causal inference analysis using linked population-wide data" (PMEDICINE-D-22-02257R2) for review by PLOS Medicine.

I have discussed the paper with my colleagues and the academic editor and it was also seen again by 4 reviewers. I am pleased to say that provided the remaining editorial and production issues are dealt with we are planning to accept the paper for publication in the journal.

The remaining issues that need to be addressed are listed at the end of this email. Any accompanying reviewer attachments can be seen via the link below. Please take these into account before resubmitting your manuscript:

[LINK]

***Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.***

In revising the manuscript for further consideration here, please ensure you address the specific points made by each reviewer and the editors. In your rebuttal letter you should indicate your response to the reviewers' and editors' comments and the changes you have made in the manuscript. Please submit a clean version of the paper as the main article file. A version with changes marked must also be uploaded as a marked up manuscript file.

Please also check the guidelines for revised papers at http://journals.plos.org/plosmedicine/s/revising-your-manuscript for any that apply to your paper. If you haven't already, we ask that you provide a short, non-technical Author Summary of your research to make findings accessible to a wide audience that includes both scientists and non-scientists. The Author Summary should immediately follow the Abstract in your revised manuscript. This text is subject to editorial change and should be distinct from the scientific abstract.

We expect to receive your revised manuscript within 1 week. Please email us (plosmedicine@plos.org) if you have any questions or concerns.

We ask every co-author listed on the manuscript to fill in a contributing author statement. If any of the co-authors have not filled in the statement, we will remind them to do so when the paper is revised. If all statements are not completed in a timely fashion this could hold up the re-review process. Should there be a problem getting one of your co-authors to fill in a statement we will be in contact. YOU MUST NOT ADD OR REMOVE AUTHORS UNLESS YOU HAVE ALERTED THE EDITOR HANDLING THE MANUSCRIPT TO THE CHANGE AND THEY SPECIFICALLY HAVE AGREED TO IT.

Please ensure that the paper adheres to the PLOS Data Availability Policy (see http://journals.plos.org/plosmedicine/s/data-availability), which requires that all data underlying the study's findings be provided in a repository or as Supporting Information. For data residing with a third party, authors are required to provide instructions with contact information for obtaining the data. PLOS journals do not allow statements supported by "data not shown" or "unpublished results." For such statements, authors must provide supporting data or cite public sources that include it.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

Please note, when your manuscript is accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you've already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosmedicine@plos.org.

If you have any questions in the meantime, please contact me or the journal staff on plosmedicine@plos.org.  

We look forward to receiving the revised manuscript by Oct 25 2022 11:59PM.   

Sincerely,

Philippa Dodd, MBBS MRCP PhD

Senior Editor 

PLOS Medicine

plosmedicine.org

------------------------------------------------------------

Requests from Editors:

GENERAL

Please address all editor and reviewer comments detailed below, in full

Please remove the funding/financial disclosure/

DATA AVAILABILITY STATEMENT

Please provide a URL for The Centre for Victorian Data Linkage

AUTHOR SUMMARY

Thank you for including an author summary.

Line 74: Consider an alternative term to “defect” perhaps “congenital anomalies” in place of “…birth defects, congenital abnormalities…” to account for both structural and functional anomalies, or something similar

REFERENCES

For in-text reference call-outs, citations should be placed within square brackets and preceding punctuation, as follows: “…asymptomatically [2,4].”

Please check to ensure that the bibliography is listed in line with our guidance which can be found here: https://journals.plos.org/plosmedicine/s/submission-guidelines#loc-references

TABLE 1

To improve accessibility to the reader, please adjust row width/text spacing to ensure IQRs are reported in a single row or place below the relevant data point

SUPPORTING INFORMATION

Please ensure all figures and tables in the supporting files have abbreviations defined

S2 TABLE 1 - Please define IVF, ITT, AEDC, NAPLAN

S4 TABLE 2b – please define “#" and "ICSI"

S5 eMETHODS - please ensure referencing Is formatted as detailed above including in-text reference call-outs

S6 FIGURE 2 - please define NAPLAN - please check and amend throughout all supplementary files

S10 ALL FIGURES - please define the abbreviations - LBOTE, SEIFA, NAPLAN

S16 - please define AEDC, NAPLAN, DV2

S17 TABLE 9 and 10 - please define AEDC and NAPLAN

SOCIAL MEDIA

To help us extend the reach of your research, please provide any Twitter handle(s) that would be appropriate to tag, including your own, your coauthors’, your institution, funder, or lab. Please respond to this email with any handles you wish to be included when we tweet this paper.

Comments from Reviewers:

Reviewer #1: Many thanks to the authors for satisfactorily considering and responding to each comment in turn, suitably amending the manuscript as requested.

Reviewer #2: I would stand by my comments that removal of the causal claim in the title would improve the paper, I am not a fan of playing different referees of each other several of us have highlighted that the causal language may be overstated.

"School-age outcomes among IVF-conceived children" is snappier and cuts to the chase.

Reviewer #4: I am grateful for the opportunity to review this interesting and innovative study that seeks to estimate the total causal effect of conception by assisted reproductive technology on educational outcomes in school-age children. I was approached to provide a methodological review of the appropriateness and accuracy of the causal inference methods as well as the reporting and use of language. My review will focus primarily on these points.

Overall, I believe the study is has been well conducted and the research team should be commended for their hard work and diligence. I believe the research is scientifically sound but I believe the quality of the manuscript could be improved with some language and presentational changes as detailed below.

1) TITLE: 'a causal inference analysis' is a vague term. I think it would be better for the study to be described more precisely as something like, 'a target trial emulation study using augmented inverse probability of treatment weighting'.

2) INTRODUCTION, METHODS, AND RESULTS: A causal inference approach encourages a focus on interval estimation rather than (null) hypothesis testing. I therefore find it somewhat strange that the study states a null hypothesis at the end of the introduction and goes on to report null-distribution p-values. I think it would be less jarring if the authors instead described their aim/s as something more like 'to estimate the total causal effect of conception by IVF on the risk of developmental vulnerability'. Similar language would apply in the methods and results, e.g. rather than saying that 'The null hypothesis of no causal effect of IVF conception on developmental vulnerability was supported by our findings…' the study would simply report something like 'the estimated total causal effect of conception by IVF on the risk of developmental vulnerability was indistinguishable from zero (risk difference = -0.3%, 95% CI: -3.7% to 3.1%)'.

3) INTRODUCTION (lines 127-140): Although a target trial emulation study is indeed designed to 'mimic' a randomized controlled trial, I think this wording is likely to raise unfortunate associations among readers familiar with poorly-conducted propensity score analyses as 'mimicking randomised controlled trials'. I would therefore recommend toning down the language a little and simply stating that the study aimed to emulate a target trial by using the method introduced by Hernan et al with the augmented inverse probability of treatment weighting estimator. Similarly, I don't particularly like the language of achieving 'balance' in the distribution of baseline characteristic, in part because this is not what propensity score methods do (they aim to balance the outcome propensities, not the covariates). Comments about how this is 'similar to what occurs when participants are randomised' may be theoretically true, but this is unlikely to ever be achieved in practice. That said, I commend the authors for being clear about their causal aims. As outlined by Haber et al 2022 (Causal and Associational Language in Observational Health Research: A Systematic Evaluation) it is important for researchers to be clear where they seek to estimate a causal effect. It is just not necessary to 'over egg' this ambition or the methods used to achieve this aim. Thus, it is good to state an aim to 'estimate the total causal effect of conception by IVF on developmental vulnerability', and interpret the results as 'estimates of' this total causal effect, but perhaps not so good to start talking about 'mimicking randomised controlled trials'.

4) METHODS: I would prefer the target trial emulation table to be presented in the main manuscript. I believe Plos Medicine is an online-only journal, so I see no reason why this should not be possible.

5) METHODS: There is a gap between the allocation and completion of the treatment because the conception must survive until live birth to be counted as belonging to either exposure group. In the target trial, what would happen to the pregnancies that don't result in live birth? I expect there will be an unequal chance of surviving to live birth between the in-vivo and in-vitro conceptions, creating the chance of selection bias. This ought to be acknowledged as a potential source of bias or, ideally, accounted for as competing events.

6) METHODS: In terms of exclusion criteria, what happened to the twins and other multiple pregnancies? I cannot see them described in the exclusion criteria.

7) METHODS: Regarding the target trial table, I think the authors could be more explicit in both the target trial and (in particular) the emulation study columns. I know it seems strange, but where the details of the emulation study are identical to the target trial, this really ought to be explicitly stated (either by repeating the details or simply saying 'same as for the target trial'). For example, if the inclusion criteria for the trial are 'all couples wanting to conceive and weight the ability to conceive' then the emulation column should repeat the same. In general, the emulation column does not really seem to state the expected criteria. E.g. the assignment procedures box should summarise your method for achieving conditional exchangeability and your causal contrasts box should start the target estimands. The recent paper by Matthews et al in the BMJ (Target trial emulation: applying principles of randomised trials to observational studies) offers a little help here about what should be mentioned in the table.

8) METHODS: I think that the supplementary methods should be included in the main manuscript, which is currently rather light on details.

9) METHODS, RESULTS, and throughout: Since this study uses a DAG, I believe it would benefit from following the reporting recommendations outlined in Tennant et al 2021 (Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations).

10) METHODS and RESULTS: The authors present 'crude' associations in the non-weighted sample. I believe these results may have been added on the instruction of a previous reviewer, as per traditional. Alas, these associations should not be reported as they are likely to be extremely biased and prone to misinterpretation. Only the best estimate of the target estimands should be reported in the main manuscript. All sub-analyses involving alternative estimators (whether unweighted logistic regression, TLME, or - if absolutely necessary - the unweighted and unconditional analyses) should be reserved for the supplementary materials.

11) RESULTS: Null hypothesis significance testing is strongly discouraged in epidemiological analyses. The null-distribution p-values are therefore unnecessary and do not need to be reported. The authors may wish to present their E-values alongside the point estimates and confidence intervals instead.

Any attachments provided with reviews can be seen via the following link:

[LINK]

Decision Letter 3

Philippa Dodd

14 Nov 2022

Dear Dr. Lindquist,

Thank you very much for re-submitting your manuscript "School-age outcomes among IVF-conceived children." (PMEDICINE-D-22-02257R3) for review by PLOS Medicine.

I have discussed the paper with my colleagues and the academic editor and it was also seen again by 1 reviewer. I am pleased to say that provided the remaining editorial and production issues are dealt with we are planning to accept the paper for publication in the journal.

The remaining issues that need to be addressed are listed at the end of this email. Any accompanying reviewer attachments can be seen via the link below. Please take these into account before resubmitting your manuscript:

[LINK]

***Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.***

In revising the manuscript for further consideration here, please ensure you address the specific points made by each reviewer and the editors. In your rebuttal letter you should indicate your response to the reviewers' and editors' comments and the changes you have made in the manuscript. Please submit a clean version of the paper as the main article file. A version with changes marked must also be uploaded as a marked up manuscript file.

Please also check the guidelines for revised papers at http://journals.plos.org/plosmedicine/s/revising-your-manuscript for any that apply to your paper. If you haven't already, we ask that you provide a short, non-technical Author Summary of your research to make findings accessible to a wide audience that includes both scientists and non-scientists. The Author Summary should immediately follow the Abstract in your revised manuscript. This text is subject to editorial change and should be distinct from the scientific abstract.

We expect to receive your revised manuscript within 1 week. Please email us (plosmedicine@plos.org) if you have any questions or concerns.

We ask every co-author listed on the manuscript to fill in a contributing author statement. If any of the co-authors have not filled in the statement, we will remind them to do so when the paper is revised. If all statements are not completed in a timely fashion this could hold up the re-review process. Should there be a problem getting one of your co-authors to fill in a statement we will be in contact. YOU MUST NOT ADD OR REMOVE AUTHORS UNLESS YOU HAVE ALERTED THE EDITOR HANDLING THE MANUSCRIPT TO THE CHANGE AND THEY SPECIFICALLY HAVE AGREED TO IT.

Please ensure that the paper adheres to the PLOS Data Availability Policy (see http://journals.plos.org/plosmedicine/s/data-availability), which requires that all data underlying the study's findings be provided in a repository or as Supporting Information. For data residing with a third party, authors are required to provide instructions with contact information for obtaining the data. PLOS journals do not allow statements supported by "data not shown" or "unpublished results." For such statements, authors must provide supporting data or cite public sources that include it.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

Please note, when your manuscript is accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you've already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosmedicine@plos.org.

If you have any questions in the meantime, please contact me or the journal staff on plosmedicine@plos.org.  

We look forward to receiving the revised manuscript by Nov 17 2022 11:59PM.   

Sincerely,

Pippa

Philippa Dodd, MBBS MRCP PhD

PLOS Medicine

plosmedicine.org

------------------------------------------------------------

Requests from Editors:

1) Thank you for revising your title in context of previous reviewer comments. With these in mind, please revise your title according to PLOS Medicine's style. Your title must be nondeclarative and not a question. It should begin with main concept if possible. "Effect of" should be used only if causality can be inferred, i.e., for an RCT. Please place the study design ("A randomized controlled trial," "A retrospective study," "A modelling study," etc.) in the subtitle (ie, after a colon).

2) Where you report adjusted analyses in data tables 3, 4 and 5 of the main manuscript, please also include the unadjusted analyses, either within an additional column in the existing tables or as additional supplementary files.

3) Thank you for updating your data availability statement. Please ensure that the URL is placed in the manuscript submission form when you re-submit your manuscript.

4) Thank you for including twitter handles. Please also ensure that these are placed in the manuscript submission form when you re-submit your manuscript.

5) Please remove the COI and data availability statements from the end of the main manuscript and include only in the manuscript submission form

6) Throughout, please replace "Fig" with "Figure", including in the supplementary files

7) Please define DV2 in supplementary figures 13 and 14 - please check throughout and ensure all abbreviations are clearly defined within relevant capitions.

Comments from the Academic Editor:

The authors have done a nice job with revisions and happy to move to accept

Any attachments provided with reviews can be seen via the following link:

[LINK]

Decision Letter 4

Philippa Dodd

23 Nov 2022

Dear Dr Lindquist, 

On behalf of my colleagues and the Academic Editor, Dr Sarah Stock, I am pleased to inform you that we have agreed to publish your manuscript "School-age outcomes among IVF-conceived children: a population-wide cohort study" (PMEDICINE-D-22-02257R4) in PLOS Medicine.

Before your manuscript can be published you will need to address the following which we asked for but could not locate:

1) The completed STROBE checklist was not available in this version of your manuscript, please include AND

2) Please add the following statement, or similar, to the Methods: "This study is reported as per the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guideline (S1 Checklist)." When you re-submit please indicate where in the manuscript this sentence has been placed.

3) In addition please remove spaces between in text reference callouts (e.g. line 99 "...conception [2, 3]." should read, "...conception [2,3]."

4) Please define all abbreviations in figure 1 in an appropriate figure caption

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Once you have received these formatting requests, please note that your manuscript will not be scheduled for publication until you have made the required changes.

In the meantime, please log into Editorial Manager at http://www.editorialmanager.com/pmedicine/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process. 

PRESS

We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with medicinepress@plos.org. If you have not yet opted out of the early version process, we ask that you notify us immediately of any press plans so that we may do so on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Thank you again for submitting to PLOS Medicine. We look forward to publishing your paper. 

Best wishes, 

Pippa

Philippa Dodd, MBBS MRCP PhD 

PLOS Medicine

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Direct acyclic graph.

    (DOCX)

    S1 File. Methods: Description of outcome metrics.

    (DOCX)

    S2 File. Study protocol.

    (DOCX)

    S3 File

    Tables A and B. Table A. Successful linkages by birth year. Table B. Annual cycle summaries from major Victorian IVF providers 2010–2014.

    (DOCX)

    S4 File

    Fig A. Analysis flow chart (NAPLAN).

    (DOCX)

    S5 File. Methods: Multiple imputation model summary and model diagnostics.

    Table A. Missing data summary (NAPLAN). Fig A. NAPLAN convergence. Fig B. NAPLAN density plots of observed and imputed data. Fig C. NAPLAN distribution of outcome and covariates after imputation in m = 1 dataset.

    (DOCX)

    S6 File

    Figs A–D: Distribution and overlap of manually calculated stabilised weights. Fig A. AEDC imputation #1. Fig B. AEDC imputation #13. Fig C. NAPLAN imputation #1. Fig D. NAPLAN imputation #13.

    (DOCX)

    S7 File

    Figs A–C: Variable standardised mean differences. Fig A. NAPLAN imputation #1 variable standardised mean differences. Fig B. NAPLAN imputation #13 variable standardised mean differences. Fig C. AEDC imputation #7 variable standardised mean differences.

    (DOCX)

    S8 File

    Tables A and B. Table A. Sensitivity analysis–AEDC (special needs multiply imputed). Table B. Sensitivity analysis–AEDC (special needs excluded).

    (DOCX)

    S9 File

    Tables A and B. Table A. Sensitivity analysis–NAPLAN (exempt multiply imputed). Table B. Sensitivity analysis–NAPLAN (exempt excluded).

    (DOCX)

    S10 File

    Table A–Sensitivity analysis–binary outcomes TMLE.

    (DOCX)

    S11 File

    Figs A and B. Fig A. Sensitivity analysis AEDC primary outcome–E-value estimation. Fig B. Sensitivity analysis NAPLAN primary outcome–E-value estimation.

    (DOCX)

    S12 File

    Tables A and B. Table A. Traditional regression and treatment effect models (AEDC). Table B. Traditional regression and treatment effect models (NAPLAN).

    (DOCX)

    S13 File. STROBE guideline checklist.

    (DOCX)

    Attachment

    Submitted filename: PLOSMed_Response_IVF_2022_FINAL.docx

    Attachment

    Submitted filename: PlosMED_Revisions2.0.docx

    Data Availability Statement

    Data for this study was provided by various data custodians and linked by the Centre for Victorian Data linkage (https://www.health.vic.gov.au/reporting-planning-data/the-centre-for-victorian-data-linkage). With relevant ethical approval, data are available upon request to the governing data custodians.


    Articles from PLOS Medicine are provided here courtesy of PLOS

    RESOURCES