Abstract
Identification of genetic biomarkers associated with autism spectrum disorders (ASDs) could improve recurrence prediction for families with a child with ASD. Here, we describe clinical microarray findings for 253 longitudinally phenotyped ASD families from the Baby Siblings Research Consortium (BSRC), encompassing 288 infant siblings. By age 3, 103 siblings (35.8%) were diagnosed with ASD and 54 (18.8%) were developing atypically. Thirteen siblings have copy number variants (CNVs) involving ASD-relevant genes: 6 with ASD, 5 atypically developing, and 2 typically developing. Within these families, an ASD-related CNV in a sibling has a positive predictive value (PPV) for ASD or atypical development of 0.83; the Simons Simplex Collection of ASD families shows similar PPVs. Polygenic risk analyses suggest that common genetic variants may also contribute to ASD. CNV findings would have been pre-symptomatically predictive of ASD or atypical development in 11 (7%) of the 157 BSRC siblings who were eventually diagnosed clinically.
Subject terms: Rare variants, Structural variation, Predictive markers, Autism spectrum disorders
Siblings of those with autism spectrum disorder (ASD) have increased likelihood of ASD or related subclinical traits. Here, studying 253 ASD families, D’Abate et al. test the predictive value of genomic copy number variation involving ASD-associated loci, with confirmation in a second cohort.
Introduction
Behavioral assessments remain the gold standard for autism spectrum disorder (ASD) diagnosis1, and prospective analysis permits objective and longitudinal assessment for the earliest symptoms. Published estimates of sibling recurrence for ASD range from 6.9 to 19.5%2–5. Moreover, of younger siblings of autistic probands, herein referred to simply as “infant siblings”, who are not diagnosed with ASD, up to 30–40% have subclinical ASD traits and/or suboptimal developmental functioning6.
ASD and related subclinical traits show familial clustering, with a substantial portion of familial liability attributed to genetic factors7,8. Subclinical symptoms in first- and second-degree relatives support an important role for genetic factors in producing an autistic phenotype9. The genetic architecture of ASD is being resolved by studying families with different characteristics10–21, and dozens of copy number variant (CNV) loci and ASD-relevant genes and loci are known, many of which overlap those associated with other neurodevelopmental disorders14,17,18. De novo and inherited rare (<1% in a population) CNVs and other pathogenic variants are found in ~ 5–40% of individuals with ASD, depending on the cohort examined10,14,17,20. Chromosomal microarray to detect CNVs is the first-tier laboratory test for clinical genetic evaluation following an ASD diagnosis22.
The most effective way to reduce symptoms of ASD is with early intervention, targeting behavior, and skills development23. In search of biomarkers for early indentification, we investigated whether CNVs affecting ASD-related loci correlate (pre- and post-symptomatically) with phenotypic outcomes in the Baby Siblings Research Consortium (BSRC) cohort of infant siblings whose family history is associated with a higher probability of developing ASD (Fig. 1). We analyze CNVs from 253 families registered in the BSRC24 while blinded to the infant siblings’ phenotype status. At enrollment, each family included a proband diagnosed with ASD, and at least 1 younger sibling (Supplementary Table 1). The BSRC longitudinal phenotyping design enables a predictive study of CNVs in this infant cohort (see Methods). Our analyses reveal that the detection of ASD-relevant CNVs is indeed predictive of ASD or atypical development in this sibling population and can be used to inform risk estimates for individuals and their families, with potential impact on their therapeutic trajectory.
Results
Clinical evaluation of infant siblings within families
The study group included the 253 probands, 288 later-born infant siblings comprising the primary diagnostic sample (Fig. 1), and 34 siblings who did not meet cohort enrollment criteria (i.e., >3 years, or status determined only through parental reports), but for whom we had phenotype designation and genotype data. Refer to “Subject recruitment and clinical evaluations” in Methods section for criteria for various atypical outcomes listed below. At age 3, 103/288 siblings (35.8%) in 94/253 families (37.2%) had a formal diagnosis of ASD; 54 were “atypically developing” (18.8%), and 131 were developing typically (45.5%) (Supplementary Table 2). Among the 54 infant siblings with atypical development (but not an ASD diagnosis) were the following phenotypic constellations: ASD symptoms (n = 5), ASD symptoms with developmental, language and/or adaptive behavior delay (n = 9); developmental delay (n = 15), language delay (n = 10), language and adaptive behavior delays (n = 5), deficits in adaptive behavior alone (n = 3), and attention deficit hyperactivity disorder (ADHD) symptoms or externalizing behaviors (n = 7). Of the 30 families with 2 or more infant siblings, 23 had 1 or more assessed as having ASD and/or atypical development and 7 had all sibs typically developing at age 3. The male:female ratio was 3.7:1 for ASD-affected infant siblings, 1.6:1 for atypically developing siblings, and 1.2:1 for non-ASD infant siblings.
Genomic findings in probands and infant siblings
From 253 probands, we identified 15 CNVs (in 13 individuals; 5.1%) that were deemed ASD-relevant (see Methods; Supplementary Tables 3 and 4). Where inheritance could be ascertained, 6 CNVs (in 5 probands) were de novo (3 deletions, 3 duplications) and 8 CNVs (5 deletions, 3 duplications) were inherited (7 maternally, 1 paternally). For proband 14-0152-001, inheritance was unknown, as parental samples were unavailable. All 8 inherited variants were shared with an infant sibling (Fig. 2), of whom 4 were diagnosed with ASD (Fig. 3a) and 3 showed atypical development without a formal ASD diagnosis (Fig. 3b).
Among the 288 infant siblings of the probands, 13 carried ASD-relevant CNVs (Fig. 2), of which 6 were considered pathogenic and 7 were clinically defined as variants of unknown significance overlapping genes implicated in ASD. Five infant siblings had ASD-relevant CNVs that were not shared with the related probands; 2 had ASD, 2 were atypically developing, and 1 was typically developing (Fig. 3c, d). Of the 2 typically developing children with ASD-relevant CNVs, 14-0376-004 (Fig. 3d) had a 610 kb deletion at 16p11.2 and 12-8115-004 (Fig. 3b) had a (shared) 1.7 Mb duplication at 16p13.11. Their evaluations at age 36 months did not indicate any developmental delays or ASD symptoms. Both variants are associated with recurrent and variably expressed syndromes, often involving ASD12,25,26.
Overall, among siblings of ASD probands, we found ASD-related CNVs in 6 of 103 with ASD at age 3 years, in 5 of 54 with atypical development, and in 2 of 131 with neurotypical development. Four children (Fig. 3) who did not meet study inclusion criteria had ASD-related CNVs deemed to be pathogenic. Using only CNVs considered to be pathogenic or likely pathogenic18,27, among infant siblings the positive predictive values (PPVs) of these variants were 0.50 for ASD and 0.83 for combined ASD/atypical development. Other predictive statistics are shown in Table 1.
Table 1.
ASD (all ASD-relevant CNVs) | ASD or atypical development (all ASD-relevant CNVs) | ASD (excluding VUS) | ASD or atypical development (excluding VUS) | |
---|---|---|---|---|
Statistic | ||||
Sensitivity (0.95 CI) | 0.06 (0.02–0.12) | 0.07 (0.04–0.12) | 0.03 (0.01–0.08) | 0.03 (0.01–0.07) |
Specificity (0.95 CI) | 0.96 (0.92–0.98) | 0.98 (0.95–1.00) | 0.98 (0.95–1.00) | 0.99 (0.96–1.00) |
Positive predictive value (0.95 CI) | 0.46 (0.19–0.75) | 0.85 (0.55–0.98) | 0.50 (0.12–0.88) | 0.83 (0.36–1.00) |
Negative predictive value (0.95 CI) | 0.65 (0.59–0.70) | 0.47 (0.41–0.53) | 0.65 (0.59–0.70) | 0.46 (0.40–0.52) |
Genotype/phenotype | ||||
Condition + | 103a | 157b | 103a | 157b |
CNV +/condition + | 6 | 11 | 3 | 5 |
CNV +/condition – | 7 | 2 | 3 | 1 |
CNV –/condition – | 178 | 129 | 182 | 130 |
CNV –/condition + | 97 | 146 | 100 | 152 |
Total infant siblings | 288 | 288 | 288 | 288 |
VUS variant of unknown significance
Genotype refers to the identification of ASD-relevant CNVs observed in infant siblings
aAffected sibs (n = 103)
bAffected sibs (n = 103) + non-ASD sibs with atypical development (n = 54) (see “Subject recruitment and clinical evaluations” in Methods)
Whole-genome sequences (WGS) were available from 91 families of our cohort. We identified no additional ASD-relevant CNVs, and of the potentially interesting sequence-level variants (Supplementary Table 5), none were found in families with previously recognized ASD-relevant CNVs (Supplementary Fig. 1).
This study adds to growing evidence that specific biomarkers might contribute to pre-symptomatic detection of infants likely to develop ASD or other developmental disorders, at least from sibships that include an ASD proband. Eleven of 157 siblings who developed ASD or were atypically developing at age 3 carried a CNV worthy of clinical follow-up (either inherited or de novo), 5 (3.2%) carried a pathogenic/likely pathogenic CNV and 6 (3.8%) carried a variant of unknown significance (VUS).
Of the 7 total VUSs overlapping a gene associated with a neuropsychiatric disorder, 6 were in siblings with ASD or atypical development, with none in neurotypical children, despite nearly equal numbers in each subgroup. Although not reported as pathogenic in a clinical setting, VUSs involving ASD-relevant genes may nonetheless be contributing factors for ASD or atypical development, and of interest to families14,22. Two of 131 typically developing siblings had CNVs considered as VUS or likely pathogenic. These CNVs at 16p11.2 or 16p13.11 have frequencies in the general population that range from 0.03 to 0.04% and 0.15 to 0.25%28–30 and have been associated with reduced cognitive and general functioning28,31 and adult-onset phenotypes32, respectively.
To further assess the predictive impact of ASD-relevant CNVs in a larger cohort, we analyzed published data from 2110 families from the Simons Simplex Collection (SSC). This cohort differs from the BRSC in being a quartet design with 4214 parents, 2124 ASD-affected probands and 2423 ASD-unaffected children, assembled to study de novo variants (single-point phenotyping; average age of diagnosis 8.9 (±3.5) years for males and 9.11 (±3.7) for females; 6.6:1 male to female ratio for probands)33. Of the SSC “unaffected” sibs, applying similar criteria as for the BSRC cohort revealed 288 (11.9%) with atypical behavioral and developmental profiles: 33 were suspected to have elevated ASD traits (Social Responsiveness Scale-Parent Report Total T-Score > 6034); 139 had emotional and behavioral problems (Child Behavioral Checklist-Parent Report Total Problems T-Score > 6035); 65 displayed mild-moderate adaptive behavior deficits (Vineland Adaptive Behavior Scales, Adaptive Behavior Composite36 < 85); 51 met criteria in more than 1 category. While blinded to proband and sibling status, we searched for ASD-relevant CNVs with the same classification criteria used for the BSRC (see Methods).
We found 118 ASD-relevant CNVs in 116 of 2124 probands (5.5%; 72.0% (85) pathogenic/likely pathogenic and 28.0% (33) VUS) and 64 CNVs in 63 of 2423 unaffected sibs (2.6%; 34.4% (22) pathogenic/likely pathogenic and 65.6% (42) VUS) (Supplementary Data File 1 and Supplementary Table 6). These results are similar to those found in probands and unaffected sibs in the BSRC cohort (5.1% and 3.8%, respectively). Seven (11.1%) of these sibs with ASD-relevant CNVs were deemed atypically developing. When considering pathogenic and likely pathogenic variants across the SSC, the PPV for ASD was 0.79, and for ASD or atypical development was 0.83 (Supplementary Table 7). From the SSC cohort, we identified 47 CNVs impacting 8 of the genes/loci harboring ASD-relevant CNVs in the siblings of the BSRC. Thirty of these CNVs were carried by probands, 2 by atypically developing sibs and 15 by unaffected sibs (Supplementary Table 6). The percentage of ASD-relevant CNVs in unaffected sibs in the SSC is similar to that observed in unaffected sibs in the BSRC. Furthermore, data analyzed from both cohorts yielded lower percentages of ASD-relevant variants in non-ASD sibs compared with probands, while noting that select variants are found in symptomatic carriers in the general population.
Polygenic transmission disequilibrium test
Recognizing emerging evidence for the role of combinations of common genetic variants in ASD susceptibility13 and their potential efficacy in predicting ASD risk, we determined the contribution of polygenic risk scores (PRS) to the phenotype in the BSRC cohort using the polygenic transmission disequilibrium test13. In families for which genotype data were available for both parents, probands and at least 1 infant sibling, we observed a statistically significant over-transmission of risk variants from parents to probands (n = 189, mean difference (cohort z-score based) = 0.13, p = 0.01). There were non-significant differences of risk transmission for unaffected siblings (n = 112, mean = −0.003, p = 0.97), atypically developing siblings (n = 44, mean = −0.004, p = 0.97) and ASD-affected siblings (n = 93, mean = 0.07, p = 0.27) (Supplementary Fig. 2). Power to reject the null hypothesis (assuming the PRS explained 2.45% of phenotypic variance37) was ~98% for the proband (n = 189) and 56% for the affected sibling (n = 93). The significant over-transmission of risk from parents to probands suggested that common genetic variants may contribute to the phenotype in the BSRC sample. This PRS analysis in the SSC revealed a similar statistical trend. Using a paired Student’s t-test, we observed a significant difference in risk of transmission for SSC probands (n = 2118, mean = 0.547, p < 1e-10) and a non-significant risk of transmission for unaffected sibs (n = 2130, mean = −0.021, p = 0.116) and atypically developing sibs (n = 286, mean = −0.012, p = 0.744).
Discussion
Early identification of the CNVs described in this study could be used to tailor recurrence risk estimates to individual families, with potential for intensified surveillance for infants at increased likelihood due to positive CNV findings. There is evidence that infants as young as 7 months with subtle features of ASD could benefit from tailored interventions23,38,39, but there are also also recent negative trial data40,41.
This study had possible limitations. First, given the 37% recurrence of ASD in our BSRC families, compared with 6.9–19.5% found previously2–5, this cohort had selective over-recruitment of infant siblings with ASD. A similar over-recruitment was noted for atypically developing sibs, which partially accounts for the difference in rate between cohorts (29.2% in BSRC compared to 11.9% in SSC). This oversampling, not uncommon in genetic studies, may bias estimates of positive and negative predictive value of biomarkers. As well, PPV estimates related to CNV detection must be considered relative to overall ASD rates among participants. Further impacting the predictive value statistics, we only considered diagnoses made by 3 years of age. Despite the stability of an ASD diagnosis at 36 months, clinical phenotypes are known to be fluid throughout development and apparent non-ASD siblings might later demonstrate ASD or another disorder as they age42,43. Likewise, longitudinal studies following high-probability siblings from 3 years of age to middle childhood observed that an ASD diagnosis was revoked in 5.6–20% of children44,45. Second, considering the massive effort in phenotyping, the BSRC sample would be considered comparably large, but participants with ASD-relevant CNVs were still few, limiting power of the analysis. The primary BSRC study involved a cohort with increased probability of ASD, and while the findings are generally consistent with those from the SSC, other family structures may yield different findings.
The recurrent CNVs described here often display variable expressivity and reduced penetrance for ASD, particularly when sex is considered46. For these reasons, we explored inclusion of siblings with atypical forms of ASD when determining PPV. We attribute the low sensitivity (0.03 in BSRC) of these markers to the vast heterogeneity in ASD etiology, but this parameter may improve with clearer genotype-phenotype correlations17,47. We can also consider coupling these already useful CNV indicators of ASD with metrics captured by other genetic approaches such as WGS and PRS discussed here, as well as other techniques, such as brain imaging48 and other early phenotypic assessment49.
Families must receive adequate counseling when such variants are found, at any stage of life, including the prenatal setting, so that they fully understand the potential ramifications of these findings. This sibling cohort also provided evidence for certain CNVs as potential early biological predictors of ASD or other developmental difficulties, whether or not they shared these with their respective probands (Fig. 2), which will also influence interpretation.
Methods
Subject recruitment and clinical evaluations
From 9 research sites registered with the BSRC (The Center for Applied Genomics, Genetics, and Genome Biology, The Hospital for Sick Children; Autism Research Center, Bloorview Research Institute; Autism Research Center, IWK Health Center and Dalhousie University; Department of Psychology, UC San Diego; Center for Autism and Related Disorders, Kennedy Krieger Institute; Department of Psychology, University of Miami; MIND Institute, Department of Psychiatry, UC Davis; Department of Psychology, University of Washington; Vanderbilt Kennedy Center Treatment and Research Institute for Autism Spectrum Disorders, Vanderbilt Kennedy Center; Autism Research Center, University of Alberta), participants from 253 families enrolled (134 from USA sites and 119 from Canada). Recruitment was through organizations serving individuals with ASD and their families, referrals from medical professionals, web-based media, or word-of-mouth, as previously described2. This study was approved by the Research Ethics Board (REB) at The Hospital for Sick Children (REB # 0019980189) and informed consent was obtained from participants or their legal guardians, when appropriate. A total of 253 probands, 322 siblings (including 288 infant siblings), and 447 parents (242 mothers; 205 fathers) constituted the final cohort and were analyzed.
All families had at least 1 child (i.e., the proband) diagnosed with ASD according to the Diagnostic and Statistical Manual for Mental Disorders, 4th Edition (DSM-IV). We excluded probands with a genetic syndrome that could account for their ASD (e.g., Fragile-X, Rett Syndrome)50. Younger siblings were recruited at a mean age of 10.2 ± 8.4 months; we followed them, and determined their clinical outcomes with respect to ASD at a mean age of 37.4 ± 2.3 months. An expert clinician determined diagnostic status by clinical best-estimate, informed by the Autism Diagnostic Observation Schedule (ADOS) (calibrated severity score (CSS) ≥ 4 as threshold)51, DSM-IV criteria, psychometric assessment of language and cognitive development (Mullen Scales of Early Learning (MSEL)52), adaptive functioning (Vineland Adaptive Behavioral Scales (VABS)36), and overall clinical impression.
Adapting criteria from published reports of this cohort6, from among infant siblings not diagnosed with ASD, we defined another subgroup as “atypically developing”. This classification was based on the presence of elevated ASD symptoms (CSS ≥ 3, requisite scores on the Autism Diagnostic Index-Revised and clinical impression), developmental delay (MSEL composite score > 1 SD below the mean), language delay (MSEL expressive and/or receptive language subscale score > 1 SD below the mean), deficits in adaptive behavior (VABS composite score or subscale score > 1 SD below the mean) or other atypical behavior patterns (e.g., externalizing behaviors) or disorders (e.g., ADHD), as indicated by the clinician best-estimate diagnoses and/or scores above cutoff.
From 253 families, 144 mothers and 127 fathers self-reported the Broader Autism Phenotype Questionnaire (BAPQ) (Supplementary Table 8). The BAPQ includes items mapping onto 3 broader scales (Aloof, Rigid, and Pragmatic Language), which comprise ASD-related subclinical traits previously reported in a subset of parents. A cutoff of 3.15 was shown to optimize agreement with expert clinical assessment of the Broader Autism Phenotype (BAP)53.
Baby Siblings Research Consortium (BSRC) sample collection
For microarray analysis, biological samples were obtained from 251 probands, 321 siblings and 444 parents (241 mothers; 203 fathers) at their respective recruiting sites (total samples = 1016). Blood samples from U.S. sites (n = 517) were submitted to the DNA and Cell Repository at Rutgers University, and extracted DNA was subsequently sent to The Center for Applied Genomics (TCAG). Genomic DNA used for genotyping on microarray was extracted from whole blood (86.0%; 874/1016 samples), saliva (0.4%; 4/1016), lymphoblastoid cell lines (10.4%;106/1016), or source undocumented (3.1%; 32/1016).
Microarray genotyping quality control metrics
Genomic DNA samples were processed on the high-density Affymetrix CytoScanTM HD microarray platform at TCAG (2.67 million copy number markers) following protocols in our other published studies10,17,18,54. Quality control thresholds were imposed, as specified by the manufacturer: Waviness standard deviation (Waviness SD) ≤ 0.12; median absolute pairwise difference (MAPD) ≤ 0.25; and single-nucleotide polymorphism (SNP) quality control (SNP QC) ≥ 15.0. One father’s sample did not meet these criteria, but the sample was retained in the analysis to help identify CNV segregation. We used PLINK55 software to identify loss of heterozygosity (LOH) and Mendelian inconsistencies using the 750,000 informative SNPs available on the array. We observed Mendelian inconsistencies in 3 families, and eliminated them from the study.
Variant detection and characterization
We detected CNVs using 4 algorithms: Chromosome Analysis Suite (ChAS) (Affymetrix Inc., USA), iPattern56, Nexus57, and Partek58. We sequentially applied a series of data constraints to ascertain high-confidence, rare CNVs27. We considered only stringent CNVs, called by ≥2 algorithms, at least 1 of which was ChAS or iPattern, for downstream analyses. CNVs on the X chromosome were called uniquely by ChAS and iPattern. We eliminated all calls on the Y chromosome. We retained CNVs ≥ 15 kb in length and overlapping ≥10 consecutive probes to reduce the detection of false-positive calls. They were then restricted to those in which ≤70% spanned a segmental duplication and ≥75% of the variant was present in a copy number stable region as previously defined59. We used a platform-matched control dataset consisting of 873 individuals with no reported psychiatric history, from the Ontario Populations Genomics Platform (OPGP)60, to classify CNVs as rare. We considered a CNV as rare if it did not exceed 50% reciprocal overlap with a CNV found in <0.1% of the control dataset. To corroborate a CNV’s status as rare, we compared to an additional 4 unrelated control populations totaling 9978 individuals: the Collaborative Genetic Study of Nicotine Dependence (COGEND)61 and KORA62, genotyped on the Illumina Omni 2.5 M; the SAGE consortium controls63, Ontario Colorectal Cancer case-control study cohort64,65 and the Health, Aging, and Body Composition (Health ABC) Study66, genotyped on the Illumina 1 M; the Ottawa Heart Institute controls67, and POPGEN68, both genotyped on the Affymetrix 6.0 microarray. We sequenced whole genomes of 84 probands, 118 infant siblings and 158 parents (86 mothers; 72 fathers), from 91 families, using DNA from whole blood17.
We used criteria adapted from the American College of Medical Genetics classification69 and an established annotation strategy14,18,70 to classify CNVs as ASD-relevant and pathogenic, likely pathogenic or variant of unknown significance (VUS). A CNV was considered pathogenic or likely pathogenic if (a) it was associated with an established genomic disorder of which ASD is a characteristic (e.g., 16p11.2 microdeletion), or (b) it overlapped a coding exon of a high-confidence ASD-susceptibility gene (e.g., SHANK3; Supplementary Data File 3). We considered whether the CNV overlapping the gene was de novo (pathogenic) or inherited (likely pathogenic). Variants overlapping exons of long noncoding RNA PTCHD1-AS71, and specific noncoding exons of the MBD5 gene72, which constitute the critical region of 2q23.1 microdeletion syndrome, were retained in the analysis. We further defined a class of VUS as ASD-relevant if they overlapped exons of candidate ASD-susceptibility genes or related neuropsychiatric disorder genes (Supplementary Data File 3) and had a frequency ~0.1% in the Database of Genomic Variants. We assessed the efficacy of the variants so identified, as markers for ASD or atypical phenotype status, using the epiR package in R73 to calculate positive predictive value, negative predictive value, sensitivity and specificity. All CNVs in infant siblings were classified while blinded to the phenotype status of the individual.
Whole-genome sequence data were processed as previously described17. We defined rare loss of function and de novo damaging missense variants as in Yuen et al.10. We prioritized single-nucleotide variants and indels that overlapped genes associated with ASD and other related neurodevelopment disorders, and considered whether variants with similar transcriptional consequences were found in other ASD cases.
Molecular validation and characterization of CNVs
The presence of de novo and ASD-relevant CNVs was confirmed via real-time quantitative PCR (qPCR) using the TaqMan© Copy Number Assay and SYBR® Green methods; all experiments were conducted in triplicate. For SYBR® Green assays, an amplicon 90–140 base pairs in length was amplified using 2 sets of primers positioned ≥ 500 bp from both reported breakpoints. A similar amplicon designed within the FOXP2 locus served as a 2-copy control74. All TaqMan© Copy Number Assays involved predesigned probes located in the gene of interest and RNaseP, which served as an endogenous control. All experiments included both male and female control samples (HapMap samples: NA10851 (male) and NA15510 (female)).
Simons Simplex Collection (SSC) to assess CNV false discovery
In order to assess the possible false discovery for CNVs associated with ASD, we performed a separate CNV analysis on 2110 families from the SSC33. This included 2107 mothers, 2107 fathers, 2124 ASD probands, and 2,425 siblings. Families were recruited as previously described (Fischbach and Lord, 2010). Of the 2425 sibs, 2093 are designated unaffected. Of the remaining 332 siblings, 2 have ASD and have thus been excluded from the analysis, leaving 2423 sibs unaffected by ASD available for CNV analysis.
As we did for unaffected sibs in the BSRC cohort of infant siblings, we looked for atypical developmental outcomes in these non-ASD sibs using the psychometric tools available. We considered the potential presence of ASD traits using the Social Responsiveness Scale (SRS)-Parent Report (n = 2298)34; mild-moderate developmental delay by identifying adaptive behavior deficits using the VABS (n = 2368) and other emotional and behavioral concerns as identified through the Child Behavioral Checklist (CBCL)-Parent Report35 (n = 448 for ages 1.5–5 test; n = 1833 for ages 6–18 test). To define atypical development, we applied a cutoff of >1 SD below the mean for the total scores for the SRS and the CBCL, expressed as T-scores (i.e., T-scores > 60), and the Adaptive Behavior Composite from the VABS, expressed as a standard score (i.e., scores < 85). Sibs had to meet this cutoff on at least 1 of the 3 tests to qualify as atypically developing.
We received microarray intensity data in the form of .IDAT files for 8761 individuals genotyped on 3 different microarray platforms: 1246 on the Illumina Human1Mv1; 3826 on the Illumina Human1M-Duov3; and 3689 on the Illumina HumanOmni2.5–4v1. We called CNVs using the same pipeline as for the infant sibling cohort, with the exception that the following 3 algorithms were employed, as previously described:18 PennCNV75, QuantiSNP76, and iPattern. Stringent CNVs were called by a minimum of 2 of these algorithms, with at least one being iPattern. We applied criteria for filtering and prioritizing variants as described above. CNVs were not molecularly characterized, as no DNA for this cohort was available.
Polygenic transmission disequilibrium test
We analyzed the contribution of common genetic variants to ASD risk for families from the infant sibling cohort for which microarray data were available for the proband (n = 189), at least 1 infant sib (n = 193) and both parents. We generated PRS using PRSice77 (clump-kb 250, clump-p 1.000000, clump-r2 0.100000, info-base 0.9), using a p-value threshold of 0.1 as suggested by Grove et al.78 in Supplemental Fig. 4.4.1. In total, 246,607 SNPs were included in both the SNP genotypes from the Affymetrix CytoScanTM HD array and the iPSYCH-PGC_ASD_Nov2017 GWAS summary statistic file, with 9,112,386 variants, of which 17,362 were removed due to having an INFO score less than 0.90. For the study genotypes, we began with 749,157 SNPs from the Affymetrix CytoScanTM HD array, removing 83,849 SNPs with a call rate >90% and 97,770 SNPs with minor allele frequency < 5%. Using PRSice, 87,215 genotype array variants were removed (e.g., A- > T), with 228,151 intersecting the ASD GWAS variants. A total of 52,923 SNPs remained after linkage disequlibrium-based clumping, of which 11,401 met the specified p-value threshold, and were thus used for PRS computation.
We performed the same analysis for 2104 families from the Simons Simplex Collection. All families consisted of 2x parents (n = 2104 mothers; n = 2104 fathers), a proband (n = 2118) and at least 1 unaffected sibling (n = 2416). Using identical criteria for inclusion of SNPs, 53,803 SNPs remained after linkage disequilibrium-based clumping, and 11,161 SNPs were enriched for in cases versus controls as per a p-value of 0.1, and therefore used for the generation of PRS.
Previous estimates for narrow-sense heritability (h2g) for ASD are varied, depending on phenotype definition, study design, genotyping method, and other factors. Estimates include 12%37, 52%79, and 83%80; we used h2g = 60% in the power calculation. Power was calculated under the liability threshold model assuming prevalence of ASD of 1.5%81.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We would like to thank the families who took part in this study, and the affiliated clinicians and research staff who assisted in recruitment and clinical evaluations of the participants. The Center for Applied Genomics provided technical and conceptual support in the form of genotyping, sequencing, CNV calling and data processing. This study was funded jointly by Autism Speaks, Autism Speaks Canada, the Simons Foundation Autism Research Initiative, Canadian Institutes of Health Research (CIHR), Canada Foundation for Innovation (CFI), The Hospital for Sick Children Foundation, Genome Canada/Ontario Genomics, Kids Brain Health Network, Canadian Institutes for Advanced Research (CIFAR), Ontario Brain Institute, Women and Children’s Health Research Institute at the University of Alberta, the Government of Ontario, and the University of Toronto McLaughlin Center at the University of Toronto. The authors wish to acknowledge the resources of Autism Speaks, MSSNG (www.mss.ng), as well as the generosity of the donors who supported this program. We also thank the participating families for their time and contributions to this database. Data collection was supported in part in the USA by National Institutes of Health (NIH) R01 HD047417 (D.M.), NIH R01 HD057284 (W.L.S., D.M.) and NIMH R01 MH059630 (R.L.). L.D.A. is funded by the Research Training Competition Doctoral Scholarships awarded by the Hospital for Sick Children. R.K.C.Y. is funded by the CIHR Postdoctoral Fellowship, NARSAD Young Investigator Award and Thrasher Early Career Award. L.Z holds the Stollery Children’s Foundation Chair in Autism. S.W.S. holds the GlaxoSmithKline-CIHR Chair in Genome Sciences at the University of Toronto and The Hospital for Sick Children.
Source data
Author contributions
L.D.A., L.Z. and S.W.S. conceived the project design and methodology. L.D.A. analyzed the data with S.W., R.K.C.Y., K.T., J.W., B.T. and J.H. from The Center for Applied Genomics. R.D. performed the polygenic transmission disequilibrium test. J.B., S.B., K.D., R.L., J.L., D.M., S.O., I.M.S., W.L.S., Z.E.W. and L.Z. recruited families from their respective clinical sites and shared all phenotype information on study subjects. DNA samples were processed by J.H. G.Y. compiled the phenotype information and created the comprehensive BSRC clinical database. The manuscript was written by L.D.A, J.A.B. and S.W.S. All authors approved the manuscript.
Data availability
All raw microarray data for 1016 individuals that were genotyped on the Affymetrix CytoScanTM HD Array were submitted to dbGaP and can be accessed via a data access committee using dbGaP accession number phs001876.v1.p1. Whole-genome sequence data are accessible through the Autism Speaks MSSNG database (https://research.mss.ng).
Competing interests
S.W.S. is on Scientific Advisory Committees for Population Bio and Deep Genomics, and his institution, The Hospital for Sick Children, has licensed to Lineagen software code he co-developed: “Method of determining disease causality of genome mutations”.
Footnotes
Peer review information Nature Communications thanks David Dimmock and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary information is available for this paper at 10.1038/s41467-019-13380-2.
References
- 1.Falkmer T, Anderson K, Falkmer M, Horlin C. Diagnostic procedures in autism spectrum disorders: a systematic literature review. Eur. Child Adolesc. Psychiatry. 2013;22:329–340. doi: 10.1007/s00787-013-0375-0. [DOI] [PubMed] [Google Scholar]
- 2.Ozonoff S, et al. Recurrence risk for autism spectrum disorders: a Baby Siblings Research Consortium study. Pediatrics. 2011;128:e488–e495. doi: 10.1542/peds.2010-2825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Grønborg TK, Schendel DE, Parner ET. Recurrence of autism spectrum disorders in full- and half-siblings and trends over time: a population-based cohort study. JAMA Pediatr. 2013;167:947–953. doi: 10.1001/jamapediatrics.2013.2259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Risch N, et al. Familial recurrence of autism spectrum disorder: evaluating genetic and environmental contributions. Am. J. Psychiatry. 2014;171:1206–1213. doi: 10.1176/appi.ajp.2014.13101359. [DOI] [PubMed] [Google Scholar]
- 5.Messinger DS, et al. Early sex differences are not autism-specific: A Baby Siblings Research Consortium (BSRC) study. Mol. Autism. 2015;6:32. doi: 10.1186/s13229-015-0027-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Charman T, et al. Non-ASD outcomes at 36 months in siblings at familial risk for autism spectrum disorder (ASD): a baby siblings research consortium (BSRC) study. Autism Res. J. Int. Soc. Autism Res. 2017;10:169–178. doi: 10.1002/aur.1669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Colvert E, et al. Heritability of autism spectrum disorder in a UK population-based twin sample. JAMA Psychiatry. 2015;72:415–423. doi: 10.1001/jamapsychiatry.2014.3028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bai Dan, Yip Benjamin Hon Kei, Windham Gayle C., Sourander Andre, Francis Richard, Yoffe Rinat, Glasson Emma, Mahjani Behrang, Suominen Auli, Leonard Helen, Gissler Mika, Buxbaum Joseph D., Wong Kingsley, Schendel Diana, Kodesh Arad, Breshnahan Michaeline, Levine Stephen Z., Parner Erik T., Hansen Stefan N., Hultman Christina, Reichenberg Abraham, Sandin Sven. Association of Genetic and Environmental Factors With Autism in a 5-Country Cohort. JAMA Psychiatry. 2019;76(10):1035. doi: 10.1001/jamapsychiatry.2019.1411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sandin S, et al. The familial risk of autism. JAMA. 2014;311:1770–1777. doi: 10.1001/jama.2014.4144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Yuen RKC, et al. Whole-genome sequencing of quartet families with autism spectrum disorder. Nat. Med. 2015;21:185–191. doi: 10.1038/nm.3792. [DOI] [PubMed] [Google Scholar]
- 11.Ye K, et al. Measuring shared variants in cohorts of discordant siblings with applications to autism. Proc. Natl Acad. Sci. U.S.A. 2017;114:7073–7076. doi: 10.1073/pnas.1700439114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Weiss LA, et al. Association between microdeletion and microduplication at 16p11.2 and autism. N. Engl. J. Med. 2008;358:667–675. doi: 10.1056/NEJMoa075974. [DOI] [PubMed] [Google Scholar]
- 13.Weiner DJ, et al. Polygenic transmission disequilibrium confirms that common and rare variation act additively to create risk for autism spectrum disorders. Nat. Genet. 2017;49:978–985. doi: 10.1038/ng.3863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tammimies K, et al. Molecular diagnostic yield of chromosomal microarray analysis and whole-exome sequencing in children with autism spectrum disorder. JAMA. 2015;314:895–903. doi: 10.1001/jama.2015.10078. [DOI] [PubMed] [Google Scholar]
- 15.Sebat J, et al. Strong association of de novo copy number mutations with autism. Science. 2007;316:445–449. doi: 10.1126/science.1138659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Sanders SJ, et al. Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams syndrome region, are strongly associated with autism. Neuron. 2011;70:863–885. doi: 10.1016/j.neuron.2011.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.C Yuen RK, et al. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat. Neurosci. 2017;20:602–611. doi: 10.1038/nn.4524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Pinto D, et al. Convergence of genes and cellular pathways dysregulated in autism spectrum disorders. Am. J. Hum. Genet. 2014;94:677–694. doi: 10.1016/j.ajhg.2014.03.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Marshall CR, et al. Structural variation of chromosomes in autism spectrum disorder. Am. J. Hum. Genet. 2008;82:477–488. doi: 10.1016/j.ajhg.2007.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Levy D, et al. Rare de novo and transmitted copy-number variation in autistic spectrum disorders. Neuron. 2011;70:886–897. doi: 10.1016/j.neuron.2011.05.015. [DOI] [PubMed] [Google Scholar]
- 21.Leppa VM, et al. Rare inherited and de novo CNVs reveal complex contributions to ASD risk in multiplex families. Am. J. Hum. Genet. 2016;99:540–554. doi: 10.1016/j.ajhg.2016.06.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Miller DT, et al. Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies. Am. J. Hum. Genet. 2010;86:749–764. doi: 10.1016/j.ajhg.2010.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zwaigenbaum L, et al. Early intervention for children with autism spectrum disorder under 3 years of age: recommendations for practice and research. Pediatrics. 2015;136(Suppl 1):S60–S81. doi: 10.1542/peds.2014-3667E. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Messinger D, et al. Beyond autism: a baby siblings research consortium study of high-risk children at three years of age. J. Am. Acad. Child Adolesc. Psychiatry. 2013;52:300–308.e1. doi: 10.1016/j.jaac.2012.12.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Ramalingam A, et al. 16p13.11 duplication is a risk factor for a wide spectrum of neuropsychiatric disorders. J. Hum. Genet. 2011;56:541–544. doi: 10.1038/jhg.2011.42. [DOI] [PubMed] [Google Scholar]
- 26.Hanson E, et al. The cognitive and behavioral phenotype of the 16p11.2 deletion in a clinically ascertained population. Biol. Psychiatry. 2015;77:785–793. doi: 10.1016/j.biopsych.2014.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Marshall CR, Scherer SW. Detection and characterization of copy number variation in autism spectrum disorder. Methods Mol. Biol. Clifton NJ. 2012;838:115–135. doi: 10.1007/978-1-61779-507-7_5. [DOI] [PubMed] [Google Scholar]
- 28.Stefansson H, et al. CNVs conferring risk of autism or schizophrenia affect cognition in controls. Nature. 2014;505:361–366. doi: 10.1038/nature12818. [DOI] [PubMed] [Google Scholar]
- 29.Männik K, et al. Copy number variations and cognitive phenotypes in unselected populations. JAMA. 2015;313:2044–2054. doi: 10.1001/jama.2015.4845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kendall KM, et al. Cognitive performance among carriers of pathogenic copy number variants: analysis of 152,000 UK biobank subjects. Biol. Psychiatry. 2017;82:103–110. doi: 10.1016/j.biopsych.2016.08.014. [DOI] [PubMed] [Google Scholar]
- 31.Kendall KM, et al. Cognitive performance and functional outcomes of carriers of pathogenic copy number variants: analysis of the UK Biobank. Br. J. Psychiatry J. Ment. Sci. 2019;214:297–304. doi: 10.1192/bjp.2018.301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ingason A, et al. Copy number variations of chromosome 16p13.1 region associated with schizophrenia. Mol. Psychiatry. 2011;16:17–25. doi: 10.1038/mp.2009.101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Fischbach GD, Lord C. The Simons Simplex Collection: a resource for identification of autism genetic risk factors. Neuron. 2010;68:192–195. doi: 10.1016/j.neuron.2010.10.006. [DOI] [PubMed] [Google Scholar]
- 34.Constantino, J. N. & Gruber, C. P. Social Responsiveness Scale: Manual. Los Angeles, CA: Western Psychological Services (2005).
- 35.Achenback, T. M. in The Use of Psychological Testing for Treatment Planning and Outcomes Assessment. (ed. Maruish, M. E.) 429–466 (Lawrence Erlbaum Associates Publishers, 1999).
- 36.Sparrow, S. S. et al. Vineland adaptive behavior scales. Circle Pines, MN: American Guidance Service Inc (1984).
- 37.Autism Spectrum Disorder Working Group of the Psychiatric Genomics Consortium. et al. Identification of common genetic risk variants for autism spectrum disorder. Nat. Genet. 2019;51:431–444. doi: 10.1038/s41588-019-0344-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Dawson G, et al. Randomized, controlled trial of an intervention for toddlers with autism: the Early Start Denver Model. Pediatrics. 2010;125:e17–e23. doi: 10.1542/peds.2009-0958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Rogers SJ, et al. Autism treatment in the first year of life: a pilot study of infant start, a parent-implemented intervention for symptomatic infants. J. Autism Dev. Disord. 2014;44:2981–2995. doi: 10.1007/s10803-014-2202-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Kasari Connie. Time to rethink pre-emptive interventions for infants with early signs of autism spectrum disorder. The Lancet Child & Adolescent Health. 2019;3(9):586–587. doi: 10.1016/S2352-4642(19)30234-2. [DOI] [PubMed] [Google Scholar]
- 41.Whitehouse Andrew J O, Varcin Kandice J, Alvares Gail A, Barbaro Josephine, Bent Catherine, Boutrus Maryam, Chetcuti Lacey, Cooper Matthew N, Clark Alena, Davidson Emma, Dimov Stefanie, Dissanayake Cheryl, Doyle Jane, Grant Megan, Iacono Teresa, Maybery Murray, Pillar Sarah, Renton Michelle, Rowbottam Catherine, Sadka Nancy, Segal Leonie, Slonims Vicky, Taylor Carol, Wakeling Scott, Wan Ming Wai, Wray John, Green Jonathan, Hudry Kristelle. Pre-emptive intervention versus treatment as usual for infants showing early behavioural risk signs of autism spectrum disorder: a single-blind, randomised controlled trial. The Lancet Child & Adolescent Health. 2019;3(9):605–615. doi: 10.1016/S2352-4642(19)30184-1. [DOI] [PubMed] [Google Scholar]
- 42.Ozonoff S, et al. Diagnosis of autism spectrum disorder after age 5 in children evaluated longitudinally since infancy. J. Am. Acad. Child Adolesc. Psychiatry. 2018;57:849–857.e2. doi: 10.1016/j.jaac.2018.06.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Miller M, et al. School-age outcomes of infants at risk for autism spectrum disorder. Autism Res. J. Int. Soc. Autism Res. 2016;9:632–642. doi: 10.1002/aur.1572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Brian J, et al. Stability and change in autism spectrum disorder diagnosis from age 3 to middle childhood in a high-risk sibling cohort. Autism Int. J. Res. Pract. 2016;20:888–892. doi: 10.1177/1362361315614979. [DOI] [PubMed] [Google Scholar]
- 45.Shephard E, et al. Mid-childhood outcomes of infant siblings at familial high-risk of autism spectrum disorder. Autism Res. J. Int. Soc. Autism Res. 2017;10:546–557. doi: 10.1002/aur.1733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Kirov G, et al. The penetrance of copy number variations for schizophrenia and developmental delay. Biol. Psychiatry. 2014;75:378–385. doi: 10.1016/j.biopsych.2013.07.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Kosmicki JA, et al. Refining the role of de novo protein-truncating variants in neurodevelopmental disorders by using population reference samples. Nat. Genet. 2017;49:504–510. doi: 10.1038/ng.3789. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Hazlett HC, et al. Early brain development in infants at high risk for autism spectrum disorder. Nature. 2017;542:348–351. doi: 10.1038/nature21369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Zwaigenbaum L, Penner M. Autism spectrum disorder: advances in diagnosis and evaluation. BMJ. 2018;361:k1674. doi: 10.1136/bmj.k1674. [DOI] [PubMed] [Google Scholar]
- 50.Autism Genome Project Consortium. et al. Mapping autism risk loci using genetic linkage and chromosomal rearrangements. Nat. Genet. 2007;39:319–328. doi: 10.1038/ng1985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Gotham K, Pickles A, Lord C. Standardizing ADOS scores for a measure of severity in autism spectrum disorders. J. Autism Dev. Disord. 2009;39:693–705. doi: 10.1007/s10803-008-0674-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Mullen, E. M. Mullen scales of early learning. Circle Pines, MN: American Guidance Service Inc (1995). (AGS ed.).
- 53.Hurley RSE, Losh M, Parlier M, Reznick JS, Piven J. The broad autism phenotype questionnaire. J. Autism Dev. Disord. 2007;37:1679–1690. doi: 10.1007/s10803-006-0299-3. [DOI] [PubMed] [Google Scholar]
- 54.Zarrei M, et al. De novo and rare inherited copy-number variations in the hemiplegic form of cerebral palsy. Genet. Med. J. Am. Coll. Med. Genet. 2018;20:172–180. doi: 10.1038/gim.2017.83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Pinto D, et al. Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants. Nat. Biotechnol. 2011;29:512–520. doi: 10.1038/nbt.1852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Darvishi K. Application of Nexus copy number software for CNV detection and analysis. Curr. Protoc. Hum. Genet. 2010;Chapter 4:Unit 4.14.1–28. doi: 10.1002/0471142905.hg0414s65. [DOI] [PubMed] [Google Scholar]
- 58.Downey T. Analysis of a multifactor microarray study using Partek genomics solution. Methods Enzymol. 2006;411:256–270. doi: 10.1016/S0076-6879(06)11013-7. [DOI] [PubMed] [Google Scholar]
- 59.Zarrei M, MacDonald JR, Merico D, Scherer SW. A copy number variation map of the human genome. Nat. Rev. Genet. 2015;16:172–183. doi: 10.1038/nrg3871. [DOI] [PubMed] [Google Scholar]
- 60.Uddin M, et al. A high-resolution copy-number variation resource for clinical and population genetics. Genet. Med. J. Am. Coll. Med. Genet. 2015;17:747–752. doi: 10.1038/gim.2014.178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Bierut LJ, et al. Variants in nicotinic receptors and risk for nicotine dependence. Am. J. Psychiatry. 2008;165:1163–1171. doi: 10.1176/appi.ajp.2008.07111711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Verhoeven VJM, et al. Genome-wide meta-analyses of multiancestry cohorts identify multiple new susceptibility loci for refractive error and myopia. Nat. Genet. 2013;45:314–318. doi: 10.1038/ng.2554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Bierut LJ, et al. A genome-wide association study of alcohol dependence. Proc. Natl. Acad. Sci. U. S. A. 2010;107:5082–5087. doi: 10.1073/pnas.0911109107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Figueiredo JC, et al. Genotype-environment interactions in microsatellite stable/microsatellite instability-low colorectal cancer: results from a genome-wide association study. Cancer Epidemiol. Biomark. Prev. Publ. Am. Assoc. Cancer Res. Cosponsored Am. Soc. Prev. Oncol. 2011;20:758–766. doi: 10.1158/1055-9965.EPI-10-0675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Newcomb PA, et al. Colon Cancer Family Registry: an international resource for studies of the genetic epidemiology of colon cancer. Cancer Epidemiol. Biomark. Prev. Publ. Am. Assoc. Cancer Res. Cosponsored Am. Soc. Prev. Oncol. 2007;16:2331–2343. doi: 10.1158/1055-9965.EPI-07-0648. [DOI] [PubMed] [Google Scholar]
- 66.Goodpaster BH, et al. The loss of skeletal muscle strength, mass, and quality in older adults: the health, aging and body composition study. J. Gerontol. A. Biol. Sci. Med. Sci. 2006;61:1059–1064. doi: 10.1093/gerona/61.10.1059. [DOI] [PubMed] [Google Scholar]
- 67.Stewart AFR, et al. Kinesin family member 6 variant Trp719Arg does not associate with angiographically defined coronary artery disease in the Ottawa Heart Genomics Study. J. Am. Coll. Cardiol. 2009;53:1471–1472. doi: 10.1016/j.jacc.2008.12.051. [DOI] [PubMed] [Google Scholar]
- 68.Krawczak M, et al. PopGen: population-based recruitment of patients and controls for the analysis of complex genotype-phenotype relationships. Community Genet. 2006;9:55–61. doi: 10.1159/000090694. [DOI] [PubMed] [Google Scholar]
- 69.Kearney HM, et al. American College of Medical Genetics standards and guidelines for interpretation and reporting of postnatal constitutional copy number variants. Genet. Med. J. Am. Coll. Med. Genet. 2011;13:680–685. doi: 10.1097/GIM.0b013e3182217a3a. [DOI] [PubMed] [Google Scholar]
- 70.Yuen RKC, et al. Genome-wide characteristics of de novo mutations in autism. NPJ Genom. Med. 2016;1:160271–1602710. doi: 10.1038/npjgenmed.2016.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Ross, P. J. et al. Synaptic dysfunction in human neurons with autism-associated deletions in PTCHD1-AS. Biol. Psychiatry S0006322319315471 (2019). 10.1016/j.biopsych.2019.07.014. [DOI] [PMC free article] [PubMed]
- 72.Talkowski ME, et al. Assessment of 2q23.1 microdeletion syndrome implicates MBD5 as a single causal locus of intellectual disability, epilepsy, and autism spectrum disorder. Am. J. Hum. Genet. 2011;89:551–563. doi: 10.1016/j.ajhg.2011.09.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Team, R. C. R: A language and environment for statistical computing. (Team, R. C, 2013).
- 74.Iafrate AJ, et al. Detection of large-scale variation in the human genome. Nat. Genet. 2004;36:949–951. doi: 10.1038/ng1416. [DOI] [PubMed] [Google Scholar]
- 75.Wang K, et al. PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res. 2007;17:1665–1674. doi: 10.1101/gr.6861907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Colella S, et al. QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data. Nucl. Acids Res. 2007;35:2013–2025. doi: 10.1093/nar/gkm076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Euesden J, Lewis CM, O’Reilly PF. PRSice: polygenic risk score software. Bioinforma. Oxf. Engl. 2015;31:1466–1468. doi: 10.1093/bioinformatics/btu848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Grove J, et al. Identification of common genetic risk variants for autism spectrum disorder. Nat. Genet. 2019;51:431–444. doi: 10.1038/s41588-019-0344-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Gaugler T, et al. Most genetic risk for autism resides with common variation. Nat. Genet. 2014;46:881–885. doi: 10.1038/ng.3039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Sandin S, et al. The heritability of autism spectrum disorder. JAMA. 2017;318:1182–1184. doi: 10.1001/jama.2017.12141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Christensen DL, et al. Prevalence and characteristics of autism spectrum disorder among 4-year-old children in the autism and developmental disabilities monitoring network. J. Dev. Behav. Pediatr. JDBP. 2016;37:1–8. doi: 10.1097/DBP.0000000000000235. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All raw microarray data for 1016 individuals that were genotyped on the Affymetrix CytoScanTM HD Array were submitted to dbGaP and can be accessed via a data access committee using dbGaP accession number phs001876.v1.p1. Whole-genome sequence data are accessible through the Autism Speaks MSSNG database (https://research.mss.ng).