Skip to main content
Biological Psychiatry Global Open Science logoLink to Biological Psychiatry Global Open Science
. 2021 May 5;1(2):156–164. doi: 10.1016/j.bpsgos.2021.04.007

Prediction of Autism Risk From Family Medical History Data Using Machine Learning: A National Cohort Study From Denmark

Linda Ejlskov a,d, Jesper N Wulff b, Amy Kalkbrenner e, Christine Ladd-Acosta f, M Danielle Fallin g,h, Esben Agerbo a,c,d, Preben Bo Mortensen a,c,d, Brian K Lee i,j, Diana Schendel a,d,j,
PMCID: PMC9616292  PMID: 36324994

Abstract

Background

A family history of specific disorders (e.g., autism, depression, epilepsy) has been linked to risk for autism spectrum disorder (ASD). This study examines whether family history data could be used for ASD risk prediction.

Methods

We followed all Danish live births, from 1980 to 2012, of Denmark-born parents for an ASD diagnosis through April 10, 2017 (N = 1,697,231 births; 26,840 ASD cases). Linking each birth to three-generation family members, we identified 438 morbidity indicators, comprising 73 disorders reported prospectively for each family member. We tested various models using a machine learning approach. From the best-performing model, we calculated a family history risk score and estimated odds ratios and 95% confidence intervals for the risk of ASD.

Results

The best-performing model comprised 41 indicators: eight mental conditions (e.g., ASD, attention-deficit/hyperactivity disorder, neurotic/stress disorders) and nine nonmental conditions (e.g., obesity, hypertension, asthma) across six family member types; model performance was similar in training and test subsamples. The highest risk score group had 17.0% ASD prevalence and a 15.3-fold (95% confidence interval, 14.0–17.1) increased ASD risk compared with the lowest score group, which had 0.6% ASD prevalence. In contrast, individuals with a full sibling with ASD had 9.5% ASD prevalence and a 6.1-fold (95% confidence interval, 5.9–6.4) higher risk than individuals without an affected sibling.

Conclusions

Family history of multiple mental and nonmental conditions can identify more individuals at highest risk for ASD than only considering the immediate family history of ASD. A comprehensive family history may be critical for a clinically relevant ASD risk prediction framework in the future.

Keywords: Autism, Epidemiology, Family history, Machine learning, Population based, Risk score


Autism spectrum disorder (ASD) is a severe neurodevelopmental condition affecting 1%–3% of the population (1). It has lifelong impacts and is associated with considerable personal and societal costs (2, 3, 4). With increasing incidence of ASD (5) and accompanying societal expense (6), better understanding of ASD etiology and better capability to identify groups at a high risk are of critical importance.

While the causes of ASD are not completely understood, it is suspected to be multifactorial, involving polygenic inheritance as well as environmental and behavioral risk factors (1,7). One of the most well-established ASD risk factors is a family history of autism (8,9). Such a history is believed to reflect especially genetic factors but also shared environmental, social, nutritional, and other potentially modifiable risk factors among relatives (10). Traditionally, research into family morbidity in ASD has focused on mental disorders within the immediate family (11,12). Other studies have shown elevated risks associated with immediate family members diagnosed with autoimmune disorders (13,14), congenital defects (15,16), neurologic disorders (8,17,18), cardiometabolic disorders (19,20), and asthma and allergies (21) as well as with relatives of different degrees of relatedness (8). It is hypothesized that these associations are based on shared pathogenic mechanisms, especially genetic factors, between ASD and these other conditions (22). Although some previous studies demonstrate that extended family history is associated with ASD, they typically consider single or few disorders or limited types of family members such as parents, siblings, or cousins in 1-by-1 analyses (13,23). For this reason, the structure of family morbidity that underlies ASD occurrence is not well understood, and furthermore, it is unclear as to which components of family morbidity are the most important in predicting autism occurrence.

This study was designed to rigorously and systematically identify whether family history of mental and nonmental conditions could be used to predict ASD risk. To achieve this aim, we gathered family history data on a nationwide Danish cohort and their three-generation family members. We used state-of-the-art machine learning techniques to explore competing models. From the optimal model, we calculated a continuous risk score and tested its performance, including comparison with common measures of psychiatric family history in ASD.

Methods and Materials

Study Cohort

The study cohort consisted of all people born in Denmark from January 1, 1980, through December 2012 from Denmark-born parents who could be identified in the Danish Civil Registration System (24) and the Danish Medical Birth Register (25). Using the unique personal identification number given to all Danish residents, we linked cohort members to their mother, father, full siblings, grandparents, aunts, uncles, and cousins via the Danish Civil Registration System (Figure S1).

Outcome

The study cohort was followed from birth through April 10, 2017, in the Psychiatric Central Research Register (26). We defined the ASD outcome as the cohort member receiving one of the following ICD codes: ICD-8 299.00, 299.01, 299.02, 299.03; ICD-10 F84.0, F84.1, F84.5, F84.8, F84.9 (27). In Denmark, persons suspected of having ASD or other mental or behavioral disorders are referred, e.g., by general practitioners or school psychologists, to a psychiatric department for evaluation and are assigned a diagnosis by a psychiatrist; Danish health care is universal and free of charge. All diagnoses are reported to the Psychiatric Central Research Register without regard to the need for treatment or educational provisions.

Family History

We followed all cohort family members for mental disorder diagnoses reported to the Psychiatric Central Research Register (beginning in 1969, when it was established) and for nonmental disorder diagnoses reported to the National Patient Register (28) (beginning in 1977, when it was established) through April 10, 2017. Reporting of diagnoses to these registers is done only by medical specialists. Based on literature and our previous work, we selected 73 candidate disorders spanning mental, cardiometabolic, neurologic, congenital defects, autoimmune, asthma, and allergy conditions (Table S1). The first time a family member received one of the 73 diagnoses during follow-up (1969 to April 10, 2017, for mental diagnoses; 1977 to April 10, 2017, for nonmental diagnoses), we registered the family member type (6 possible: mother, father, full sibling, grandparent, aunt/uncle, cousin) (Figure S1) and diagnosis to create a morbidity indicator (1 = occurred, 0 = not occurred), for a total of 438 family member type–disorder indicators (Figures S2 and S3 for the prevalence of each morbidity indicator).

Covariates

We obtained information from the Danish Medical Birth Register, Danish Civil Registration System, and Statistics Denmark on each cohort member’s birth weight, gestational age, birth year, maternal and paternal age, and parental educational attainment at birth of the child (Table 1).

Table 1.

Descriptive Characteristics of the Study Cohort

Characteristics No ASD, n = 1,670,391, ASD, n = 26,840,
n (%) n (%)
Sex, Male 851,349 (51.0%) 19,900 (74.1%)
Intellectual Disability 7697 (0.5%) 3342 (12.5%)
Gestational Weeks
 ≤36 96,874 (5.9%) 2040 (7.7%)
 37–40 1,111,458 (67.6%) 17,467 (65.6%)
 ≥41 436,989 (26.6%) 7112 (26.7%)
Maternal Age >35 Years 211,979 (12.7%) 3797 (14.1%)
Paternal Age >35 Years 430,612 (25.8%) 7654 (28.5%)
Birth Weight, g
 <2501 88,542 (5.3%) 1810 (6.8%)
 2501–3000 210,534 (12.7%) 3448 (12.9%)
 3001–4000 1,089,957 (65.6%) 16,506 (61.8%)
 ≥4001 273,317 (16.4%) 4929 (18.5%)
Birth Year
 1980 51,048 (3.1%) 153 (0.6%)
 1981 47,172 (2.8%) 172 (0.6%)
 1982 46,860 (2.8%) 236 (0.9%)
 1983 45,259 (2.7%) 248 (0.9%)
 1984 45,873 (2.7%) 292 (1.1%)
 1985 47,435 (2.8%) 342 (1.3%)
 1986 48,334 (2.9%) 353 (1.3%)
 1987 48,578 (2.9%) 399 (1.5%)
 1988 50,616 (3.0%) 500 (1.9%)
 1989 52,444 (3.1%) 583 (2.2%)
 1990 53,925 (3.2%) 692 (2.6%)
 1991 53,912 (3.2%) 788 (2.9%)
 1992 56,527 (3.4%) 877 (3.3%)
 1993 55,769 (3.3%) 970 (3.6%)
 1994 57,302 (3.4%) 1213 (4.5%)
 1995 57,101 (3.4%) 1189 (4.4%)
 1996 54,272 (3.2%) 1279 (4.8%)
 1997 53,666 (3.2%) 1416 (5.3%)
 1998 52,138 (3.1%) 1470 (5.5%)
 1999 51,757 (3.1%) 1546 (5.8%)
 2000 51,834 (3.1%) 1668 (6.2%)
 2001 50,275 (3.0%) 1496 (5.6%)
 2002 49,391 (3.0%) 1361 (5.1%)
 2003 50,024 (3.0%) 1277 (4.8%)
 2004 50,572 (3.0%) 1158 (4.3%)
 2005 50,565 (3.0%) 1055 (3.9%)
 2006 51,207 (3.1%) 985 (3.7%)
 2007 50,321 (3.0%) 857 (3.2%)
 2008 50,788 (3.0%) 656 (2.4%)
 2009 48,470 (2.9%) 561 (2.1%)
 2010 48,593 (2.9%) 503 (1.9%)
 2011 44,783 (2.7%) 345 (1.3%)
 2012 43,579 (2.6%) 200 (0.7%)
Highest Parental Educational Attainment at Child’s Birth
 Early childhood education, ISCED 0–2 212,329 (12.7%) 4183 (15.6%)
 Upper secondary education, ISCED 3 794,704 (47.7%) 12,859 (48.0%)
 Short, bachelor’s or equivalent, ISCED 5–6 469,097 (28.1%) 6940 (25.9%)
A Part of the Test Subsample 334,055 (20.0%) 5392 (20.1%)
Have Information on at Least One of Each Family Member Type 1,178,351 (70.5%) 18,115 (67.5%)
Only Child 273,021 (16.3%) 6202 (23.1%)

Mean (SD) Mean (SD)

Number of Siblings 1.3 (0.9) 1.1 (0.9)
Number of Grandparents 3.6 (1.1) 3.8 (0.8)
Number of Aunts/Uncles 2.6 (1.8) 2.7 (1.7)
Number of Cousins 4.6 (4.1) 4.6 (4.0)

ASD, autism spectrum disorder; ISCED, International Standard Classification of Education.

Statistical Approach

Our goal was to use machine learning to find an optimal model predicting ASD diagnosis. The phases of analysis are detailed in Figure 1 and Table S2. A brief description follows.

Figure 1.

Figure 1

Overview of prediction model development. ASD, autism spectrum disorder; AUC, area under the curve.

Model Development Phase

Stage 1: To avoid overfitting, we randomly split the cohort into test (20%) and training (80%) subsamples.

Stage 2: We developed and tuned multiple machine learning algorithms on the training subsample using 10-fold cross-validation. Algorithms included (29) random forest (RF), extreme gradient boosting (EGB), the traditional generalized linear model, elastic net, neural networks, support vector machines, K-nearest neighbors, and three different ensemble learning methods (30)—averaging, stacking, and hill climbing. Because of class imbalance (a ratio of 1 ASD case to 63 noncases) (31), we used undersampling to balance the class distribution at a ratio of 1:1. We optimized each model using the F measure, the harmonic mean of positive predictive value, and sensitivity (32,33).

Stage 3: We then used the F score, area under the curve, sensitivity (true positive rate), positive predictive value, deviance, kappa, and specificity (negative predictive value) to compare performance across different algorithms (34,35).

Stage 4: We repeated stages 2 and 3 for different candidate morbidity indicators. First, we trained a model using all indicators. Then, we restricted them to the best indicators as defined by variable importance (Figure S4) using the RF and EGB algorithms: the top 41 most important (i.e., present among the top 30 in the RF or the EGB), the top 21 (i.e., among the top 15 in the RF or the EGB), and the top 3. We compared each model against a basic model with one indicator for any parental psychiatric history before birth of the index person (Yes/No). The latter representation of family psychiatric history is widely used in ASD research (11,12).

To detect effects of sex or family size on model, we conducted several sensitivity analyses replicating stages 2 and 3. First, we trained models for males and females, separately. Second, we trained a model with the candidate morbidity indicators as counts, denoting how many times a particular diagnosis had occurred within each family member type, rather than the bivariate Yes/No indicator.

Stage 5: The resulting best algorithm was evaluated on the test subsample to obtain an out-of-sample estimate of performance (36).

Model Deployment Phase

Stage 6: We calculated the family morbidity risk score (FMRS) using predicted probabilities based on the best-performing model and divided the FMRS range into 10 segments of equal length. To evaluate the association between the FMRS and ASD, we modeled a logistic regression with ASD occurrence as a function of the FMRS adjusted for sex, birth weight, gestational age, birth year, maternal, paternal age, and highest parental educational attainment at child’s birth, estimating odds ratios (ORs) with 95% confidence intervals (CIs). We evaluated potential interactions between the FMRS and sex, birth weight, and parental socioeconomic position, respectively. We also estimated associations considering parental psychiatric history (Yes/No) or full sibling ASD history (Yes/No) as the family morbidity measure in each model. There were little missing data on covariates (a total of 2%), and for this reason, we choose not to perform multiple imputation (37).

Effects of Family Size and Missing Family Linkages

We adjusted the association between ASD and FMRS for family size, including the number of aunts and uncles, cousins, siblings, and grandparents in the same model. We further assessed the association in different subpopulations: cohort members who were an only child, cohort members who had at least one missing link to a family member type, and cohort members who had at least one of each family member type (including at least 1 full sibling). Finally, we assessed whether the FMRS performed similarly in those who had been a part of the test subsample and not included in model development. All analyses were done using the mlr package (38) with R version 3.5.2 (39).

Ethical Approval

This study was approved by the Danish Data Protection Agency. Informed consent is not required for purely register-based research of preexisting personal data in Denmark.

Results

Of the 1,697,231 cohort members, 26,840 (1.6%) received an ASD diagnosis during follow-up. Compared with persons without ASD, cohort members with ASD were more likely to be either below or above normal birth weight, be born prematurely, be an only child, have parents older than 35 years and with lower educational attainment, and have a slightly lower mean number of siblings but similar mean numbers of linked grandparents, aunts and uncles, and cousins (Table 1).

Model Development

After initial assessment, we excluded 85 indicators with fewer than 40 exposed ASD cases, because these rare indicators increased computation time without improving model performance, leaving 353 candidate morbidity indicators for further analysis (Table S3). We chose the EGB algorithm to estimate the FMRS because EGB has the best balance of a high relative performance on all measures compared with the other algorithms combined with a low standard deviation (see Table S4 and Figure S5). As can be seen in Figure 2, the four EGB competing models (using all, top 41, top 21, or top 3 indicators) performed better on all investigated performance measures compared with parental psychiatric history alone. However, the models were still far from being clinically relevant for ASD prediction (e.g., test subsample F score = 0.054, area under the curve = 0.64, true positive rate = 0.44, positive predictive value = 0.029; virtually identical performance measures of optimal model in training subsample and test subsample). For the algorithm based on the top 41 indicators, we observed a 12% increase in the F score, a 15% increase in area under the curve and F score, a 50% increase in true positive rate, and a 9% increase in positive predictive value (see Table S5). Similar performance measures were observed for the model developed when using the count indicators (Table S6). There was little indication of a difference in the optimal model depending on the sex of the cohort member (Figures S6 and S7).

Figure 2.

Figure 2

Fit statistics for the extreme gradient boosting algorithm when including all 353 indicators, the top 41, the top 21, and the top 3 estimated on the training (top left) and the test (top right) set. Smoothed density plot (bottom) of family morbidity risk score (FMRS) based on the top 41 indicators stratified by autism spectrum disorder (ASD)/no ASD. AUC, area under the curve; CV, cross-validation; PPV, positive predictive value; TPR, true positive rate.

Deployment Phase: FMRS

The distribution of the FMRS based on the EGB algorithm can be seen in Figure S8. Regardless of the number of morbidity indicators included, most cohort members had an FMRS on the lower end of the spectrum. Cohort members with an ASD diagnosis had a slightly higher average FMRS (0.53 vs. 0.48, p < .0001) and were much more likely to be in the uppermost end of the FMRS spectrum compared with non-ASD cohort members. Because of the similar model performance measures and distributions of the top 41 indicators compared with the FMRS calculated using all 353 indicators (Figure 2 and Tables S4 and S5) and somewhat improved FMRS distribution compared with the model using the top 21 indicators (Figure S8), we view the FMRS based on the top 41 indicators as the overall best and simplest representation of family risk.

Table 2 shows an overview of the 41 morbidity indicators, comprising all family member types and both mental (8) and nonmental (9) disorders, used to calculate the FMRS, and Figure 3 shows the importance of each indicator. Among the psychiatric disorders were stress/neurotic disorders for all family member types; ASD or attention-deficit/hyperactivity disorder for mother, sibling, and cousins; other psychological development disorder for sibling and cousins; personality disorder for mother, father, grandparents, and aunts/uncles; and parental depression. Sibling ASD was the most important of all indicators. Among the 57 nonmental disorders, obesity was most common and present for 5 of 6 family member types (save fathers); hypertension for mother, sibling, grandparents, and aunts/uncles; and asthma for mother, grandparents, and cousins.

Table 2.

An Overview of the 41 Morbidity Indicators Included in the Family Morbidity Risk Score by Diagnosis and Family Member Type

Diagnosisa Family Member Type
Mother Father Full Sibling Grandparent Aunt/Uncle Cousin
Neurotic/Stress/Somatoform Disorder (Not OCD) x x x x x x
Obesity x x x x x
Hypertension x x x x
Personality Disorder x x x x
ASD x x x
ADHD x x x
Asthma x x x
Depression x x
Psychological Development Disorder (Not ASD) x x
Psychoactive Substance Abuse x x
Diabetes x
Episodic and Paroxysmal Disorder (Not Epilepsy) x
Nerve, Nerve Root, and Plexus Disorders x x
Other Neurologic Disorders x
Musculoskeletal Defects x
Allergies x
Behavioral/Emotional Disorder of Juvenile Onset (Not ADHD or Tic) x

See Table S1 for more information on the specific ICD codes.

ADHD, attention-deficit/hyperactivity disorder; ASD, autism spectrum disorder; OCD, obsessive-compulsive disorder.

a

Sorted by number of family member types.

Figure 3.

Figure 3

Importance ranking of 30 most important predictors in the family morbidity risk score. ADHD, attention-deficit/hyperactivity disorder; AS, autism spectrum disorder; Au, aunt; Behav. dis child onset, behavioral disorder childhood onset; Cou, cousin; dis., disorder; Gp, grandparent; Psych dev. dis, psychological developmental disorder; Sib, sibling; sub. abuse, substance abuse.

There was a dose-response relationship, with higher levels of family morbidity corresponding to higher ASD risk, reaching a 15.4-fold increase in the highest versus the lowest FMRS group (OR, 15.3; 95% CI, 14.0–17.1) (Figure 4 and Table 3; Table S7). The risk for ASD was increased twofold when considering parental psychiatric history alone (OR, 1.98; 95% CI, 1.97–2.08) and 6.1-fold (OR, 6.1; 95% CI, 5.86–6.38) considering full sibling ASD history alone. The ratio of cases to noncases decreased steadily with increasing family morbidity, reaching a prevalence of 17% (corresponding to 1 ASD case per 6 people) in the highest FMRS group. Table S8 provides the adjusted ORs for the three other family morbidity models (all indicators, top 21, top 3).

Figure 4.

Figure 4

Adjusted odds ratios with 95% confidence interval (CI) for the association between family morbidity history and autism spectrum disorder (ASD), adjusted for sex, birth weight, gestational age, birth year, maternal, paternal age, and highest parental educational attainment. psych., psychiatric.

Table 3.

Number of ASD Cases, Prevalence, OR, and Expected Frequency of ASD for Different Levels of the Three Different Family History Models

No ASD ASD % ASD ORa 1 ASD Case Per No. of Cohort Members Levels Family History Model
1,353,858 18,686 1.4% Ref 1/73 No psych. Parental psych.
285,964 7832 2.7% 1.9 1/38 Psych.
1,614,787 23,896 1.5% Ref 1/69 No ASD Sibling ASD
25,035 2622 9.5% 6.1 1/11 ASD
13,483 88 0.6% Ref 1/154 1 (lowest) FMRS
961,695 10,409 1.1% 1.1 1/93 2
346,825 5447 1.5% 1.4 1/65 3
163,238 3474 2.1% 1.8 1/48 4
84,439 2410 2.8% 2.4 1/36 5
31,178 1185 3.7% 3.0 1/27 6
12,303 649 5.0% 3.9 1/20 7
11,787 893 7.0% 5.9 1/14 8
12,443 1465 10.5% 9.0 1/9 9
2431 498 17.0% 15.3 1/6 10 (highest)

ASD, autism spectrum disorder; FMRS, family morbidity risk score; OR, odds ratio; Ref, reference; Psych., psychiatric.

a

Adjusted for sex, birth weight, gestational age, birth year, maternal, paternal age, and highest parental educational attainment.

The FMRS performed similarly for only-children, for cohort members with complete linkages, and when restricted to the test subsample (Table S9). For the cohort members with missing information on at least one family member type (n = 485,814), the highest adjusted ORs for each FMRS level were attenuated slightly, but the dose-response relationship remained. Adjusting for the number of aunts and uncles, cousins, siblings, and grandparents also attenuated the associations slightly (Table S10).

Discussion

We applied state-of-the-art machine learning techniques to a total population sample to derive an optimal ASD risk prediction model based on disorders diagnosed in family members across three generations. The FMRS derived from the optimal model demonstrated better predictive performance and provided a more fine-grained differentiation between levels of family risk for ASD compared with dichotomous measures of any parental psychiatric history or full sibling ASD. For example, the FMRS levels 4–10 were associated with a gradient of elevated ASD risk (80% to 15-fold increased risk) and comprised 20% of the entire cohort and 40% of all ASD cases, while having a sibling with ASD (Yes/No) is associated with a quite high risk (sixfold) but comprises only 1.7% of the cohort and only 9.9% of ASD cases. Furthermore, the FMRS had stronger associations with ASD than many other well-studied population-based risk factors (1) such as fetal growth, which has about a 50% increased ASD risk (40).

Notably, the most influential indicators spanned all family member types and both mental and nonmental disorders. While previous studies have shown 1-by-1 associations between a wide range of different disorders and family member types and ASD (8), to our knowledge, this is the first study to demonstrate which combination of morbidities and family member types may be the most important to consider and further elucidates the complex underlying family morbidity structure in ASD risk. For example, across all models, sibling ASD and attention-deficit/hyperactivity disorder were the most important predictors, which is consistent with the literature (41); however, stress-based and personality disorders were also influential psychiatric indicators across family member types. These results are consistent with recent findings showing significant genetic (22) and nongenetic (42) correlations between ASD and these diagnoses.

Cardiometabolic disorders (specifically obesity and hypertension) emerged as the most influential indicators among 57 nonmental disorders. Notably, 5 of 6 obesity indicators across family member types were present among the top 41 morbidity indicators. This finding is supported by several reports of maternal obesity risk for ASD and other neurodevelopmental disorders (20, 43, 44, 45), to which our study adds obesity in the extended family to the list of potentially relevant risk factors. The predictive model provides no information regarding any potential causal link between familial obesity and ASD, such as whether the extended family obesity risk reflects shared detrimental lifestyles and other social and environmental factors that may influence epigenetic and nongenetic prenatal ASD risks (instead of, or in addition to, genetic connections). Whether both psychiatric and cardiometabolic disorders are the best indicators of the underlying risk of ASD or, as predictors, provide clues as to ASD pathogenesis would need to be further investigated.

Strengths and Limitations

The FMRS was developed on an entire population, using a machine learning approach, and is estimable for all members of a population, regardless of missing information on one more of the family member types, similar to a polygenic score. We performed a wide variety of sensitivity analyses, which indicate that the FMRS is robust to size and structure of family, showing similar results in only-child, small families, and big families, as well as by sex of the cohort member. The optimal model performance measures and FMRS results were virtually the same in both the training and test subsamples. However, this study also has limitations. While we have adjusted for birth year in the regressions, the presence of certain morbidities in certain family member types included in the risk score may be influenced by changes in diagnostic criteria and practice over time, leading to possible diagnosis misclassification (measurement error) and reduced model predictive accuracy. We included diagnoses that had been reported in the literature regarding conditions in the family and co-occurring conditions that were associated with ASD, but future work could consider a more comprehensive array of conditions. While we considered possible model differences by sex, future work could also consider whether predictive models might also differ among ASD subgroups, such as ASD with or without intellectual disability. Because we are using the models for prediction and not causal inference, confounding is not a concern. There is some tendency for more common diagnoses to be included in the FMRS, which may partly reflect structural tendencies in the data or specific underlying morbidity structures. The timing of diagnoses has not been taken into account; therefore, future studies investigating the nature of the association between family history and ASD risk should consider the potential for reverse causation. The extent to which the results from this study can be replicated in other study populations is unknown; however, identification of the most influential indicators being consistent with existing literature supports generalizability of the FMRS. From a clinical feasibility perspective, future work should also consider the predictive ability of models limited to mental and nonmental disorders in the immediate family. We will make our FMRS model available to investigators upon request to promote replication and further study.

Conclusions

In conclusion, family history of multiple mental and nonmental conditions can identify more individuals at highest risk for ASD than only considering the immediate family history of ASD. Results, which yielded a family history risk score estimable for every person, underscore the complexity of family history underlying ASD occurrence and the potential importance of conditions beyond immediate family history of mental disorders in identification of at-risk groups. A comprehensive family history may be a critical component in a clinically relevant ASD risk prediction framework in the future.

Acknowledgments and Disclosures

The work was supported by the National Institute of Environmental Health Sciences of the National Institutes of Health (NIH) (Grant No. R01ES026993 [to DS, AK]) and by the Lundbeck Foundation (iPSYCH; Grant Nos. R102-A9118 and R155-2014-1724 [to PBM]).

The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

The authors report no biomedical financial interests or potential conflicts of interest.

Footnotes

Supplementary material cited in this article is available online at https://doi.org/10.1016/j.bpsgos.2021.04.007.

Supplementary Material

Supplementary Materials
mmc1.pdf (2.8MB, pdf)

References

  • 1.Lyall K., Croen L., Daniels J., Fallin M.D., Ladd-Acosta C., Lee B.K., et al. The changing epidemiology of autism spectrum disorders. Annu Rev Public Health. 2017;38:81–102. doi: 10.1146/annurev-publhealth-031816-044318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Xu G., Strathearn L., Liu B., Bao W. Prevalence of autism spectrum disorder among US children and adolescents, 2014–2016. JAMA. 2018;319:81–82. doi: 10.1001/jama.2017.17812. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Buescher A.V.S., Cidav Z., Knapp M., Mandell D.S. Costs of autism spectrum disorders in the United Kingdom and the United States. JAMA Pediatr. 2014;168:721–728. doi: 10.1001/jamapediatrics.2014.210. [DOI] [PubMed] [Google Scholar]
  • 4.Van Heijst B.F.C., Geurts H.M. Quality of life in autism across the lifespan: A meta-analysis. Autism. 2015;19:158–167. doi: 10.1177/1362361313517053. [DOI] [PubMed] [Google Scholar]
  • 5.Schendel D.E., Thorsteinsson E. Cumulative incidence of autism into adulthood for birth cohorts in Denmark, 1980–2012. JAMA. 2018;320:1811–1813. doi: 10.1001/jama.2018.11328. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Leigh J.P., Du J. Brief report: Forecasting the economic burden of autism in 2015 and 2025 in the United States. J Autism Dev Disord. 2015;45:4135–4139. doi: 10.1007/s10803-015-2521-7. [DOI] [PubMed] [Google Scholar]
  • 7.Schmidt R.J., Iosif A.M., Guerrero Angel E., Ozonoff S. Association of maternal prenatal vitamin use with risk for autism spectrum disorder recurrence in young siblings. JAMA Psychiatry. 2019;76:391–398. doi: 10.1001/jamapsychiatry.2018.3901. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Xie S., Karlsson H., Dalman C., Widman L., Rai D., Gardner R.M., et al. Family history of mental and neurological disorders and risk of autism. JAMA Netw Open. 2019;2 doi: 10.1001/jamanetworkopen.2019.0154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hansen S.N., Schendel D.E., Francis R.W., Windham G.C., Bresnahan M., Levine S.Z., et al. Recurrence risk of autism in siblings and cousins: A multinational, population-based study. J Am Acad Child Adolesc Psychiatry. 2019;58:866–875. doi: 10.1016/j.jaac.2018.11.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Bai D., Yip B.H.K., Windham G.C., Sourander A., Francis R., Yoffe R., et al. Association of genetic and environmental factors with autism in a 5-country cohort. JAMA Psychiatry. 2019;76:1035–1043. doi: 10.1001/jamapsychiatry.2019.1411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Daniels J.L., Forssen U., Hultman C.M., Cnattingius S., Savitz D.A., Feychting M., Sparen P. Parental psychiatric disorders associated with autism spectrum disorders in the offspring. Pediatrics. 2008;121:e1357–e1362. doi: 10.1542/peds.2007-2296. [DOI] [PubMed] [Google Scholar]
  • 12.Jokiranta E., Brown A.S., Heinimaa M., Cheslack-Postava K., Suominen A., Sourander A. Parental psychiatric disorders and autism spectrum disorders. Psychiatry Res. 2013;207:203–211. doi: 10.1016/j.psychres.2013.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wu S., Ding Y., Wu F., Li R., Xie G., Hou J., Mao P. Family history of autoimmune diseases is associated with an increased risk of autism in children: A systematic review and meta-analysis. Neurosci Biobehav Rev. 2015;55:322–332. doi: 10.1016/j.neubiorev.2015.05.004. [DOI] [PubMed] [Google Scholar]
  • 14.Atladóttir H.O., Pedersen M.G., Thorsen P., Mortensen P.B., Deleuran B., Eaton W.W., Parner E.T. Association of family history of autoimmune diseases and autism spectrum disorders. Pediatrics. 2009;124:687–694. doi: 10.1542/peds.2008-2445. [DOI] [PubMed] [Google Scholar]
  • 15.Wier M.L., Yoshida C.K., Odouli R., Grether J.K., Croen L.A. Congenital anomalies associated with autism spectrum disorders. Dev Med Child Neurol. 2006;48:500–507. doi: 10.1017/S001216220600106X. [DOI] [PubMed] [Google Scholar]
  • 16.Schendel D.E., Autry A., Wines R., Moore C. The co-occurrence of autism and birth defects: Prevalence and risk in a population-based cohort. Dev Med Child Neurol. 2009;51:779–786. doi: 10.1111/j.1469-8749.2009.03310.x. [DOI] [PubMed] [Google Scholar]
  • 17.Christensen J., Overgaard M., Parner E.T., Vestergaard M., Schendel D. Risk of epilepsy and autism in full and half siblings—A population-based cohort study. Epilepsia. 2016;57:2011–2018. doi: 10.1111/epi.13595. [DOI] [PubMed] [Google Scholar]
  • 18.Christensen D., Van Naarden Braun K., Doernberg N.S., Maenner M.J., Arneson C.L., Durkin M.S., et al. Prevalence of cerebral palsy, co-occurring autism spectrum disorders, and motor functioning—Autism and Developmental Disabilities Monitoring Network, USA, 2008. Dev Med Child Neurol. 2014;56:59–65. doi: 10.1111/dmcn.12268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Xiang A.H., Wang X., Martinez M.P., Walthall J.C., Curry E.S., Page K., et al. Association of maternal diabetes with autism in offspring. JAMA. 2015;313:1425–1434. doi: 10.1001/jama.2015.2707. [DOI] [PubMed] [Google Scholar]
  • 20.Li M., Fallin M.D., Riley A., Landa R., Walker S.O., Silverstein M., et al. The association of maternal obesity and diabetes with autism and other developmental disabilities. Pediatrics. 2016;137 doi: 10.1542/peds.2015-2206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Croen L.A., Grether J.K., Yoshida C.K., Odouli R., Van de Water J. Maternal autoimmune diseases, asthma and allergies, and childhood autism spectrum disorders: A case-control study. Arch Pediatr Adolesc Med. 2005;159:151–157. doi: 10.1001/archpedi.159.2.151. [DOI] [PubMed] [Google Scholar]
  • 22.Grove J., Ripke S., Als T.D., Mattheisen M., Walters R.K., Won H., et al. Identification of common genetic risk variants for autism spectrum disorder. Nat Genet. 2019;51:431–444. doi: 10.1038/s41588-019-0344-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sullivan P.F., Magnusson C., Reichenberg A., Boman M., Dalman C., Davidson M., et al. Family history of schizophrenia and bipolar disorder as risk factors for autism. Arch Gen Psychiatry. 2012;69:1099–1103. doi: 10.1001/archgenpsychiatry.2012.730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Pedersen C.B. The Danish Civil Registration System. Scand J Public Health. 2011;39(suppl 7):22–25. doi: 10.1177/1403494810387965. [DOI] [PubMed] [Google Scholar]
  • 25.Bliddal M., Broe A., Pottegård A., Olsen J., Langhoff-Roos J. The Danish Medical Birth Register. Eur J Epidemiol. 2018;33:27–36. doi: 10.1007/s10654-018-0356-1. [DOI] [PubMed] [Google Scholar]
  • 26.Mors O., Perto G.P., Mortensen P.B. The Danish Psychiatric Central Research Register. Scand J Public Health. 2011;39(suppl 7):54–57. doi: 10.1177/1403494810395825. [DOI] [PubMed] [Google Scholar]
  • 27.Pedersen C.B., Mors O., Bertelsen A., Waltoft B.L., Agerbo E., McGrath J.J., et al. A comprehensive nationwide study of the incidence rate and lifetime risk for treated mental disorders. JAMA Psychiatry. 2014;71:573–581. doi: 10.1001/jamapsychiatry.2014.16. [DOI] [PubMed] [Google Scholar]
  • 28.Lynge E., Sandegaard J.L., Rebolj M. The Danish National Patient Register. Scand J Public Health. 2011;39(suppl 7):30–33. doi: 10.1177/1403494811401482. [DOI] [PubMed] [Google Scholar]
  • 29.Hastie T., Tibshirani R., Friedman J. 2nd ed. Springer; Switzerland: 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. [Google Scholar]
  • 30.Dietterich T.G. Ensemble Methods in Machine Learning. 2000. https://web.engr.oregonstate.edu/∼tgd/publications/mcs-ensembles.pdf Available at: Accessed April 20, 2021.
  • 31.Krawczyk B. Learning from imbalanced data: Open challenges and future directions. Prog Artif Intell. 2016;5:221–232. [Google Scholar]
  • 32.Poldrack R.A., Huckins G., Varoquaux G. Establishment of best practices for evidence for prediction: A review. JAMA Psychiatry. 2020;77:534–540. doi: 10.1001/jamapsychiatry.2019.3671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Sasaki Y. The truth of the F-measure. 2007. https://www.cs.odu.edu/∼mukka/cs795sum09dm/Lecturenotes/Day3/F-measure-YS-26Oct07.pdf Available at. Accessed May 15, 2021.
  • 34.Sun Y., Wong A.K.C., Kamel M.S. Classification of imbalanced data: A review. Int J Patt Recognit Artif Intell. 2009;23:687–719. [Google Scholar]
  • 35.Phung S., Bouzerdoum A., Nguyen G. In: Pattern Recognition. Yin P., editor. InTechOpen; Vukovar, Croatia: 2009. Learning pattern classification tasks with imbalanced data sets; pp. 193–208. [Google Scholar]
  • 36.James G., Witten D., Hastie T., Tibshirani R. 6th ed. Springer-Verlag New York; New York: 2014. An Introduction to Statistical Learning: With Applications in R. [Google Scholar]
  • 37.Wulff J., Ejlskov L. Multiple imputation by chained equations in praxis: Guidelines and review. Electron J Bus Res Methods. 2017;15:41–56. [Google Scholar]
  • 38.Bischl B., Lang M., Kotthoff L., Schiffner J., Richter J., Studerus E., et al. Mlr: Machine learning in R. J Mach Learn Res. 2016;17:1–5. [Google Scholar]
  • 39.R Core Team R: A language and environment for statistical computing. 2015. https://www.r-project.org/ Available at: Accessed January 6, 2021.
  • 40.Abel K.M., Dalman C., Svensson A.C., Susser E., Dal H., Idring S., et al. Deviance in fetal growth and risk of autism spectrum disorder. Am J Psychiatry. 2013;170:391–398. doi: 10.1176/appi.ajp.2012.12040543. [DOI] [PubMed] [Google Scholar]
  • 41.Sandin S., Lichtenstein P., Kuja-Halkola R., Larsson H., Hultman C.M., Reichenberg A. The familial risk of autism. JAMA. 2014;311:1770–1777. doi: 10.1001/jama.2014.4144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Schnabel A., Youssef G.J., Hallford D.J., Hartley E.J., McGillivray J.A., Stewart M., et al. Psychopathology in parents of children with autism spectrum disorder: A systematic review and meta-analysis of prevalence. Autism. 2020;24:26–40. doi: 10.1177/1362361319844636. [DOI] [PubMed] [Google Scholar]
  • 43.Kong L., Norstedt G., Schalling M., Gissler M., Lavebratt C. The risk of offspring psychiatric disorders in the setting of maternal obesity and diabetes. Pediatrics. 2018;142 doi: 10.1542/peds.2018-0776. [DOI] [PubMed] [Google Scholar]
  • 44.Sanchez C.E., Barry C., Sabhlok A., Russell K., Majors A., Kollins S.H., Fuemmeler B.F. Maternal pre-pregnancy obesity and child neurodevelopmental outcomes: A meta-analysis. Obes Rev. 2018;19:464–484. doi: 10.1111/obr.12643. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Rahman R., Kodesh A., Levine S.Z., Sandin S., Reichenberg A., Schlessinger A. Identification of newborns at risk for autism using electronic medical records and machine learning. Eur Psychiatry. 2020;63:e22. doi: 10.1192/j.eurpsy.2020.17. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Materials
mmc1.pdf (2.8MB, pdf)

Articles from Biological Psychiatry Global Open Science are provided here courtesy of Elsevier

RESOURCES