Abstract
Purpose.
Pediatric attention deficit hyperactivity disorder (ADHD, MIM: 143465) is highly heritable, yet the genetic architecture of the condition remains poorly understood. The current study tested the hypothesis that rare and common genetic variants reflect distinct genetic pathways to ADHD.
Methods.
Genome sequencing was completed for 150 pediatric ADHD cases and 370 controls. ADHD polygenic scores were derived and compared across five methods, including two published GWAS and two publicly available catalogs. Likely pathogenic rare variants were identified with a previously published customized annotation and classification pipeline followed by manual curation using established ACMGG variant interpretation guidelines.
Results.
ADHD cases had higher ADHD polygenic scores and lower IQ polygenic scores. Likely pathogenic variants for ADHD were identified in 13% of cases and 0.5% of controls. ADHD polygenic scores among cases without rare variants were higher than cases carrying rare variants. ADHD cases were predicted by ADHD and IQ PGS, ancestry, and rare variant status with 70% area under the curve.
Conclusions.
The genetic etiology of ADHD is likely multifactorial, with independent contributions from common and rare variants. Genome wide association studies of ADHD may have increased power to detect common genetic loci if individuals with rare variants are excluded.
Keywords: ADHD, Polygenic Risk, Rare Variants, Genetics, Genomics
Introduction
Pediatric attention deficit hyperactivity disorder (ADHD, MIM: 143465) is a prevalent and highly heritable disorder with approximately 75% broad based heritability1. Despite this, the genetic architecture of ADHD remains largely unknown. Both rare2,3 and common4,5 genomic risk factors have been identified, but these account for only a minority of diagnostic variance. The single nucleotide polymorphism (SNP) liability-based heritability is only 28% and common genetic loci for ADHD overlap considerably with other psychiatric and cognitive traits6. Recently, our group reported that up to 50% of individuals with ADHD may have rare disruptive variants to genes implicated in neurodevelopment2. In the current study, we examine whether the combination of rare and common genetic risk factors explains additional variance in pediatric ADHD.
Polygenic scores (PGS), which estimate the degree of common genetic liability for a disorder within a particular individual, have been proposed as a potential clinical tool to aid behavioral diagnoses and identify affected individuals prior to their enrollment in formal schooling7. PGS for ADHD have been associated with categorical ADHD diagnoses, continuously measured ADHD traits8,9, and neurobiological correlates of ADHD8. However, effect sizes for published ADHD PGS are modest, typically explaining < 4% of the phenotypic variance in the disorder8,10. Moreover, specificity of the ADHD PGS is weak, with strong associations between ADHD PGS and comorbid psychiatric symptoms, such as substance use disorders, bipolar disorder, and anxiety8,9,11.
The known polygenic contributions to ADHD are highly heterogeneous. To date, only two genome wide association studies (GWAS) have identified statistically significant loci associated with ADHD12,13. These investigations found a total of 33 statistically significant loci, only six of which overlapped across studies. Lack of polygenic overlap across GWAS points to methodology as another likely source of variability in PGS estimates for ADHD. PGS scores may be calculated from a single GWAS or from aggregate data published in publicly available catalogs14,15. Different PGS calculations for the same disorder often do not correlate with one another, as was demonstrated by an investigation of polygenic risk for psychiatric and medical diagnoses in the UK Biobank16. Moreover, the significance thresholds used to identify variants of interest from existing ADHD GWAS range from p = 5 × 10−8 to p = 1.0 8. A common approach in the extant literature is exploratory testing of multiple significance thresholds, with selection of a final cutoff based on optimized case-control differentiation8.
Recent work suggests that an additional source of heterogeneity in ADHD PGS analyses may be the contribution of rare genetic variants. Rare variants associated with well-characterized pediatric ADHD cases (absent autism or intellectual disabilities) include both de novo and inherited variants, particularly to genes involved in histone methylation2. One study found that children with ADHD who had large, rare copy number variants (CNVs) had lower ADHD PGS than those without17, which would support distinct contributions of common and rare variants to the disorder. By contrast, another study found that among healthy controls, individuals with CNVs had lower ADHD PGS than those without, while there were no differences in PGS by CNV status among ADHD cases18. Lastly, a study of individuals with ADHD that found an increased burden of rare missense and disruptive variants to genes implicated by GWAS19 suggests that common and rare variants for ADHD are overlapping
Individual variability in the phenotypic presentations of ADHD, including the timing of diagnosis and degree of functional impairment, can be explained at least in part by other risk and protective factors. For example, high effortful control, a temperament-based measure of self-regulation and goal directedness, was shown to mitigate the effects of family risk for ADHD on ADHD outcomes in youth enrolled in the longitudinal ABCD study20. Similarly, higher IQ was associated with improved pharmacological treatment response in a large sample of children with ADHD21. As IQ scores are likewise a highly heritable trait, it stands to reason that polygenic load for IQ may attenuate the association between polygenic risk for ADHD and child diagnostic outcomes.
In the current study, we investigated moderators of ADHD PGS in a sample of 150 children with ADHD and 370 comparison controls on whom we had completed genetic sequencing. First, we identified rare, likely disruptive variants in this cohort. Second, we compared ADHD PGS effect sizes using PGS derived from 4 distinct variant lists: GWAS published in 201913 and 202312, the publicly available PGS catalog, and the NHGRI-EBI GWAS Catalog22. Third, we tested our hypothesis that PGS would be lower in ADHD cases with rare disruptive variants, indicating two distinct etiological pathways to ADHD (common versus rare genetic liability). Finally, we examined whether a genetic protective factor, IQ PGS (derived from a publicly available dataset) moderated the effect of ADHD genetic liability on diagnostic outcomes.
Methods
Participants
ADHD participants were 4–17 year old children enrolled in the PrecisionLink Biobank for Health Discovery at Boston Children’s Hospital23. Participants at Boston Children’s Hospital were invited to enroll, which included providing research access to electronic health records and linking biospecimens, when available. Participants completed informed consent with a research coordinator prior to contributing samples. Participants with available genetic data as of 2022 whose medical record indicated an ADHD diagnosis (ICD-10 codes F90.0, F90.1, F90.0, or F90.0) and did not indicate autism spectrum disorder or intellectual disability (ICD-10 codes F70, F84) were included in the study. Three participants with known large copy number variants (CNVs) were excluded.
Control participants were unaffected family members of individuals enrolled in the Children’s Rare Disease Collaborative at Boston Children’s Hospital24 who had complete GS as of November 2022. The control cohort included family members who had available clinical diagnostic records that did not indicate any of the ADHD diagnostic codes listed above. Cohorts in this collaborative include neurological and medical diseases, typically with dominant genetic causes25; thus, unaffected family members are not expected to carry genetic burden.
All procedures were approved by the Boston Children’s Hospital Institutional Review Board. Sex and ancestry were inferred through genetic analyses (Table 1). Eight control participants did not have sufficient X chromosome data to infer sex; these individuals were excluded from sex comparison analyses.
Table 1.
Participant Demographics
| ADHD | Controls | |
|---|---|---|
|
|
||
| N | 150 | 370 |
| Female | 48 (32%) | 196 (53%) |
| Ancestry: African | 12 (8%) | 5 (1%) |
| Ancestry: Admixed American | 3 (2%) | 9 (2%) |
| Ancestry: East Asian | 2 (1%) | 9 (2%) |
| Ancestry: European | 133 (89%) | 347 (94%) |
| Heterozygote Variant | 19 (13%) | 2 (0.5%) |
Genome sequencing and variant calling
Blood derived DNA collected from participants underwent short read genome sequencing (GS) to an average coverage of 40X as part of the Children’s Rare Disease Collaborative at Boston Children’s Hospital24. Using the Illumina DRAGEN Bio-IT Platform using on-premises servers and version 3.9.3 of the Germline DNA pipeline, raw sequencing data were processed under read mapping, QC, and variant calling. Read alignment, read trimming, sorting, duplicate marking, and small variant calling were done with default parameters and using reference genome build hg38. All individuals included in the analysis (150 cases and 370 controls) were jointly variant called and genotyped25, to ensure consistency of genotype calling across all individuals.
Rare Variant Identification
All VCFs were processed through our semi-automated custom analytical pipeline (previously described2,26), including the following steps. First, VCFs were merged and filtered for high-quality calls (GQ >30, PASS filter, AAF > 0.15) using BCFtools. Candidate variants, defined as those with the greatest likelihood of being damaging met the following criteria: a max population allele frequency (maxAF) across all gnomAD v2.1.1 populations of 0.002%, AD>2, DP>7, AC gnomAD exomes <5, AC gnomAD genomes < 5, number of homozygous occurrences in gnomAD 0, damaging prediction (LOF1, LOF2, LOF3, NsynD3). Additional filtration was applied based on the variant type: LOF alleles required the gene to be constrained in the gnomAD 2.1.1 database with a pLI>0.8 or loss-of-function o/e < 0.3; NsynD3 alleles required the impacted genes to have missense constrain with either a misZ>=3.08 or o/e < 0.3. Candidate variants that could explain the ADHD diagnoses in the individuals were identified from these filtered variants through manual curation by applying the ACMGG criteria, along with previous reports and studies of the variants extracted from HGMD, ClinVar, OMIM, and SFARI gene databases, resulting in a set of variants of uncertain significance including both LOF and likely damaging missense variants in genes with established and putative roles in NDDs.
Gene functional enrichment analyses were performed using the 19 genes with rare variants possibly explaining the ADHD using the Enrichr27 tool which allows for assessment of enriched processes and pathways from databases including the 2023 WikiPathway database and Gene Ontology molecular processes. The p-value provided by Enrichr is corrected for multiple hypotheses using the Benjamini-Hochberg procedure. The combined score integrates the p-value and the odd ratio by multiply the scores as a compromise between the methods and was reported to provide an improved ranking of enrichments.
ADHD Polygenic Scores
For comparison and completeness, five methods were used to compute ADHD PGS. In the first two approaches, effect sizes were obtained from the top results reported by Demontis et al., in either 201913 or 202312. From the 2019 table, we were able to match 9 out of 12 variants to those called from our GS sequencing; we found high linkage disequilibrium (LD; r2 ≥ 0.7) proxy variants for the other three. From the 2023 table, we matched 25 of the 27 variants and did not find any proxies for the remaining two. In the third approach, we downloaded SNP effect sizes for ADHD from the PGS Catalog score ID PGS002746 (source publication PGP00035828) and found 458,934 matching variants of 513,659. Then PLINK29 was used to generate polygenic risk scores for each sample in this study. For the fourth and fifth approaches, we downloaded summary statistics from the NHGRI-EBI GWAS Catalog for study GCST9027513630 and found 4,092,115 matching variants in our data, out of 8,785,478 possible variants. For this set of GWAS results, we first used PRSice-231 to calculate PGS for each sample in the study. PRSice-2 calculated the model fit using 10,002 p-value thresholds ranging from 5e-08 to 1 and selected 9.25e-03 as the threshold providing the best model fit (p-value = 4.08e-04). For another approach, we used PRS-CSx with a single European reference panel to calculate weights for each variant in the summary statistics. We acquired weights for 1,067,688 variants overlapping with the reference panel and used PLINK to calculate PGS for each sample in the study.
IQ Polygenic Scores
PGS for intelligence were computed using effect sizes of 25,424 matching variants (out of 26,145) from the PGS Catalog score ID PGS001919 (source publication PGP00026332) and 4,577,641 matching variants (out of 9,295,118) from the GWAS Catalog study GCST00625033.
Analytic Approach
Statistical analyses were done in R v4.1.0. ADHD and IQ PGS demonstrated normal distributions with skew and kurtosis < |2.0|. Diagnostic, sex and ancestry group differences in PGS were computed with independent-samples t-tests and one-way analyses of variance (ANOVA). Effect sizes were estimated with Cohen’s d. The optimal ADHD PGS was selected as that with the largest effect size and was used in subsequent analyses. Hierarchical logistic regression was estimated to evaluate the additive and interactive contributions of IQ PGS to ADHD outcomes. Power analysis indicated 85% power to detect a small effect (d = 0.29) between cases and controls. However, we were underpowered to compare PGS between rare variant heterozygote cases and non-heterozygous cases and controls (85% power to detect a large effect, d = 0.71). Thus, we focused on effect sizes rather than p-values for these exploratory comparisons.
Next, we estimated receiver operating characteristic (ROC) analysis (R package ROCR) using the formula in the optimal step of the hierarchical logistic regression. The sample was randomly split into 1,000 test (70%) and train (30%) datasets. Mean area under the curve (AUC) and 95% confidence interval estimates were estimated from the results.
Results
Rare Variant Discovery
Rare SNV and short insertion and deletion variants possibly explaining the ADHD diagnoses were identified in 13% (19/150) of the cases (Supplemental Table 1), including 7% with likely pathogenic variants (11/150), 4% (6/150) with LOF variants in neurodevelopment-related genes with less established disease associations, and 1% (2/150) with possibly damaging missense variants in disease-associated genes. Interestingly, genes impacted by these variants were enriched for histone modification (p = 0.03) pathways in the 2023 WikiPathway database and the Histone H3 Methyltransferase Activity (GO:0140938, p=0.013) molecular process, consistent with prior findings in ADHD2,34, see Figure 1. In the control group, two participants were identified with likely pathogenic splicing variants (Supplemental Table 2). One individual carried a variant in the PLCB1 gene (HGNC:15917, NM_015192.4:c.1250+1G>A: NC_000020.11:g.8708753G>A), which is associated with epileptic encephalopathy. The second individual harbored a previously classified likely pathogenic variant (ClinVar ID: 3019100) in the KCNQ5 gene (HGNC:6299, NM_001160132.2:c.1277+2T>C: NC_000006.12:g.73129851T>C), linked to encephalopathy and ID. Consequently, these two control individuals were excluded from subsequent analyses.
Figure 1.

Rare variant gene enrichment analyses. Enrichment of A) molecular function gene ontology terms and B) functional pathways from Wikipathway 2023 genes with rare variants in our ADHD cohort. Combined score is calculated as the log product of Fisher’s exact test p-value and rank deviation z-score.
ADHD PGS Source Identification
ADHD PGS was significantly higher in cases compared to controls using the Demontis et al. 2023 and GWAS Catalog weights, with both the PRSice-2 and PRScsx methods (Table 2). The largest effect was found with the GWAS Catalog computed with PRSice-2; thus, this calculation was used in the subsequent analyses.
Table 2.
Case Control ADHD-PGS Effect Size By Source
| ADHD PGS Source | t(df) | p | Cohen’s d |
|---|---|---|---|
|
| |||
| Demontis 201914 | −0.35 (208) | .72542 | −0.04 |
| Demontis 202313 | 2.13 (218) | .03418 | 0.23 |
| PGS Catalog16 | 1.76 (223) | .08031 | 0.18 |
| GWAS Catalog22, PRSice-232 | 3.24 (274) | .00132 | 0.32 |
| GWAS Catalog22, PRS-CSx33 | 2.69 (270) | .00761 | 0.27 |
Note: Independent samples t-tests were computed with unequal variances assumed.
Demographic and Diagnostic Associations with ADHD and IQ PGS
ADHD PGS did not differ by sex (t[508] = 0.62, p = .5337, d = 0.06). As expected, the omnibus test revealed a significant effect of ancestry on ADHD PGS: F(3,514) = 24.70, p < .00001. Post-hoc pairwise comparisons using Tukey’s HSD revealed lower ADHD PGS among those with European or Admixed American descent, compared to those with East Asian or African descent (d range = −1.64 to −1.65, ps < .002). Considering the 2% of control cases with likely pathogenic variants, the Relative Risk = 26.
ADHD cases had significantly lower IQ PGS compared to controls, using the GWAS Catalog: t(516) = 4.92, p = .00001, Cohen’s d = −0.50. IQ PGS did not differ by sex (t[508] = 1.19, p = .236; d = 0.11). IQ PGS varied significantly across ancestry (F[3,514] = 69.32, p < .00001). IQ PGS was higher among participants with European or Admixed American descent compared to East Asian or African descent; and higher among individuals with Eastern Asian compared to African descent (d range = 2.27 to 3.37, ps < .001).
Rare Variant Sensitivity Analyses
ADHD PGS
ADHD PGS was compared across rare variant heterozygote cases, non-heterozygote cases, and controls (Figure 2). The omnibus effect was significant: F(2,515) = 6.14, p = .0023. Post-hoc pairwise comparisons revealed that non-heterozygote cases had higher ADHD PGS compared to controls (d = 0.36, p = .0015). Non-heterozygote cases had higher ADHD PGS than heterozygote cases, with a similar effect size, although as expected this did not reach statistical significance (d = 0.29, p = .4028). In contrast, ADHD PGS among heterozygote cases was comparable to that of controls (d = −0.036, p = .9845).
Figure 2.

Distributions of ADHD polygenic scores across Controls (left), cases without rare variants (ADHD−; middle), and cases carrying rare variants (ADHD+; right). ADHD PGS are reported as z-scores (mean=0, standard deviation = 1). Cohen’s d effect sizes are reported to indicate differences between two groups. **p = .001.
IQ PGS
There was a significant omnibus effect of the three groups on IQ PGS: F(2,515) = 12.3451, p < .00001 (Figure 3). Post-hoc pairwise comparisons revealed a significantly lower IQ PGS in non-heterozygote cases compared to controls (d = −0.50, p < .0001). Non-heterozygote cases had a lower IQ PGS compared to heterozygote cases, but the effect size was small (d = −0.17, p = .6737). IQ PGS was lower among heterozygote cases than controls, with a small effect size (d = −0.26, p = .4227).
Figure 3.

Distributions of IQ polygenic scores across Controls (left), cases without rare variants (ADHD−; middle), and cases carrying rare variants (ADHD+; right). IQ PGS are reported as z-scores (mean=0, standard deviation = 1). Cohen’s d effect sizes are reported to indicate differences between two groups. ***p < .0001.
Hierarchical Logistic Regressions
A series of hierarchical logistic regressions with ADHD case/control status as the dependent variable was computed to examine additive and interactive effects of ADHD and IQ PGS on diagnostic outcomes (Table 3). Ancestry and rare variant heterozygote status were included as covariates. In the first step, ADHD PGS was associated with case-control status over and above ancestry. In the second step, IQ PGS was added; IQ PGS was independently associated with case control status, and the effect of ADHD PGS now only approached significance. In the third step, the interaction between ADHD and IQ PGS was not statistically significant.
Table 3.
Hierarchical Logistic Regressions
| B | SE | p | z | R 2 | |
|---|---|---|---|---|---|
|
|
|||||
| Step 1 | 0.14 | ||||
| African Ancestry | 1.76 | 0.97 | .0691 | 1.82 | |
| Eastern Ancestry | −1.17 | 1.33 | .3756 | −0.89 | |
| European Ancestry | 0.50 | 0.80 | .5318 | 0.63 | |
| Heterozygote | 17.74 | 515.48 | .9725 | 0.03 | |
| ADHD PGS | 0.34 | 0.11 | .0027 | 3.00 | |
| Step 2 | 0.15 | ||||
| African Ancestry | 0.88 | 1.02 | .3890 | 0.86 | |
| Eastern Ancestry | −1.49 | 1.33 | .2635 | −1.12 | |
| European Ancestry | 0.56 | 0.80 | .4897 | 0.69 | |
| Heterozygote | 17.78 | 513.86 | .9724 | 0.04 | |
| ADHD PGS | 0.22 | 0.12 | .0739 | 1.79 | |
| IQ PGS | −0.43 | 0.14 | .0015 | −3.17 | |
| Step 3 | 0.15 | ||||
| African Ancestry | 1.00 | 1.05 | .3430 | 0.95 | |
| Eastern Ancestry | −1.44 | 1.34 | .2827 | −1.074 | |
| European Ancestry | 0.54 | 0.80 | .4985 | 0.68 | |
| Heterozygote | 17.77 | 513.75 | .9724 | 0.04 | |
| ADHD PGS | 0.22 | 0.12 | .0684 | 1.82 | |
| IQ PGS | −0.45 | 0.14 | .0016 | −3.15 | |
| ADHD PGS x IQ PGS |
0.05 | 0.12 | .6621 | 0.44 | |
Note: PGS=polygenic score. ADHD = attention deficit hyperactivity disorder. Heterozygote = individual with a rare variant. Polygenic scores were standardized within the sample prior to analyses to improve interpretation.
The second step of the hierarchical regression was identified as the optimal model, because the R2 was larger than that of step 1 and comparable to that of step 3, and the interaction effect in step 3 was not significant. The mean AUC was 0.695 (95% CI: 0.693 – 0.698).
Sensitivity Analyses
The majority of participants in the sample were of European ancestry, and the ADHD GWAS derived from European datasets. Thus, we conducted sensitivity analyses using the subsample of participants with European ancestry (N = 118 non-heterozygote cases; N = 15 heterozygote cases; N = 345 controls). The results were consistent with the full sample. ADHD PGS was higher in non- heterozygote cases compared to controls (d = 0.29, p = .0212). Non-heterozygote cases had a higher ADHD PGS than heterozygote cases, with a medium effect that did not reach statistical significance (d = 0.44, p = .2097). Controls and heterozygote cases did not differ on ADHD PGS (d = −0.16, p = 0.7769). In the logistic regression models, IQ PGS was significantly negatively associated with case status, over and above heterozygote status and ADHD PGS; the interaction term between ADHD and IQ PGS was not significant. See Supplementary Materials for full results.
Discussion
As expected, a significant minority (13%) of the ADHD sample had identifiable rare variants that were likely associated with their ADHD diagnoses. Analysis of the rare variants revealed enrichment for genes involved in histone methylation, consistent with prior work2,34. Results of the PGS analyses indicated that the NHGRI-EBI GWAS Catalog22 provided the most sensitive ADHD PGS for distinguishing cases and controls. Contrary to our expectations, the interaction between IQ PGS and ADHD PGS was not significantly predictive of case-control status. Rather, IQ and ADHD PGS each explained unique variance in case-control status. However, the variance explained by ADHD PGS was significantly reduced when IQ PGS was included in the model, indicating some overlap between these two genetic measures. Our results were consistent when the analyses were limited to participants with European ancestry.
Our exploratory analyses suggested that cases with a rare heterozygous variant had lower ADHD PGS than cases without a known variant. These results, although preliminary, point to an equifinal etiology for ADHD diagnosis, with distinct polygenic and rare variant genetic pathways to the disorder. This contrasts with a diathesis-stress model that would suggest that the combination of high PGS and rare variants lead to ADHD. Nonetheless, we were not able to rule out the possibility that PGS contributes to variable expression of ADHD symptom severity and psychiatric comorbidity profile among affected children with a rare variant, as has been suggested for other genetic disorders35. Nor can we exclude the possibility that low ADHD PGS protects against the effect of more mild rare variants, as we only included variants meeting a classification of at least likely pathogenic by the established ACMGG criteria in our cases.
We found reduced ADHD PGS burden among rare variant heterozygote cases; if replicated in a larger sample, this result has implications for future GWAS. Specifically, individuals with candidate rare variants may dilute or alter the results of common variant associations. More refined, sensitive PGS may be obtained by excluding individuals with rare variants from GWAS samples. Conversely, our results suggest that individuals with ADHD with high PGS scores are less likely to carry pathogenic rare variants, consistent with previous reports for other traits36.
With logistic regressions, we tested the hypothesis that polygenic load for a protective trait, such as high IQ, could mitigate the association between ADHD PGS and diagnostic outcome. We found partial support for this hypothesis with confirmation of the additive model, wherein the effect of ADHD PGS on diagnostic status was reduced when IQ PGS was included. However, there was no interaction between the two polygenic scores, indicating that PGS for high IQ does not protect against increased polygenic vulnerability for ADHD. While a prior report found an association between ADHD PGS and performance on cognitive tests12, our analysis confirmed the complementary association, i.e., between IQ PGS and ADHD diagnosis. Altogether, our results suggest that the combination of moderately elevated ADHD PGS and moderately decreased IQ PGS may increase risk for ADHD diagnosis. Thus, this line of inquiry presents a third possible genetic pathway to ADHD, namely polygenic vulnerability for ADHD in combination with polygenic vulnerability for lower cognition.
Previous studies have established a link between DNA methylation patterns and ADHD, with significant methylation changes being identified between individuals with ADHD and healthy controls37–39. Our results further support this connection, demonstrating an enrichment of rare damaging variants in histone methyltransferase genes in ADHD. Importantly, many histone methyltransferase genes are implicated in other neurodevelopmental disorders and exhibit variable expressivity and incomplete penetrance40–42. Even more, other histone methyltransferase genes are known to display unique DNA methylation signatures, suggesting that underlaying rare variation within these genes can impact the transcriptional regulation of many other genes, possibly explaining the phenotypic variability amongst individuals43. Therefore, rare methylation-related variants could lead to altered expression of genes identified by GWAS, without the need for an elevated PGS, though studies with larger family-based genomic sequencing cohorts will be required to fully assess the possible link between PGS and rare variation in methylation genes.
We compared five common approaches to deriving ADHD PGS. There was significant variability in effect sizes across these methods, all of which have been used in prior publications. We hypothesize that the GWAS published in 2019 did not confer a large effect size in our sample due to the small number of alleles that were reported5. However, we were surprised by the lack of significant difference found for ADHD PGS derived from the PRS Catalog, although the effect was in the expected direction. The largest effect sizes were found with the GWAS catalog. The effect was likely larger when using PRSice-2 because this method allowed us to include more variants than with PRS-CSx. PRS-CSx requires rsIDs which are not always available, particularly for more recently identified variants. Altogether, the variability in effect sizes across methods likely reflects the heterogeneity among and within ADHD GWAS samples, and underscores that currently, PGS measurement lacks sufficient reliability and validity for use as a standalone instrument in ADHD clinical practice7.
Our results should be interpreted in the context of the study limitations. First, our sample size was modest, limiting power to conduct replication analyses or detect small effects. This is in part a result of our careful phenotyping and genotyping of the samples, which did not include any children with ASD or ID diagnoses. Additionally, detection of rare variants in the ADHD and control samples was carried out in a comprehensive manner. As expected, the number of rare variant cases in the sample was relatively small, which limited our interpretation of comparisons between cases with and without heterozygous variants. Yet, we did identify a large relative risk for rare variants in the ADHD sample (RR = 26), underscoring the importance of accounting for this etiology in genomic studies of the disorder. In future work, we will seek to replicate our results in larger, publicly available GS datasets.
In conclusion, pediatric ADHD is a phenotypically and genetically heterogeneous disorder. Polygenic risk and rare variants may represent distinct etiological pathways to ADHD, and PGS for IQ may increase predictive value for diagnostic likelihood. Rare variant discovery analysis supported our previous report that methylation genes play an important role in ADHD.
Supplementary Material
Acknowledgements
Thank you to the participating families for their contributions to this research. The authors acknowledge material and/or data support from the PrecisionLink Biobank for Health Discovery at Boston Children’s Hospital.
Funding Statement
This study was supported by funding from the National Institute of Mental Health to A.B.A. (R00MH116064–04S1) and to R.D. and A.B.A. (R01MH137118) and by the Children’s Rare Disease Collaborative at Boston Children’s Hospital. This work was supported in part by Cooperative Agreement U01TR002623 from the National Center for Advancing Translational Sciences/NIH and the PrecisionLink Project at Boston Children’s Hospital.
Footnotes
Conflict of Interest
None of the authors have conflicts of interest to disclose.
Ethics Declaration
All procedures were approved by the Boston Children’s Hospital Institutional Review Board. Adult participants completed informed consent and provided permission for their minor children to participate. Children ages 7 and older completed assent.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Data Availability
The data are available internally to all BCH researchers and clinicians through the genomics analysis platform. To facilitate external access, the CRDC data has also been made available to the Genomic Information Commons project (https://www.genomicinformationcommons.org/). Code available upon request. Clinical and genomic data are available to BCH researchers and their collaborators via request to the Biobank Sample and Data Access Committee. Through the Genomic Information Commons (GIC) (https://www.genomicinformationcommons.org/), PrecisionLink Biobank genomic data can be queried and requested by researchers at any of the established member sites.
References
- 1.Faraone SV & Larsson H Genetics of attention deficit hyperactivity disorder. Molecular psychiatry 24, 562–575 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Arnett AB et al. Rare De Novo and Inherited Genes in Familial and Nonfamilial Pediatric Attention-Deficit/Hyperactivity Disorder. JAMA pediatrics 178, 81–84 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Rajagopal VM et al. Differences in the genetic architecture of common and rare variants in childhood, persistent and late-diagnosed attention-deficit hyperactivity disorder. Nature genetics 54, 1117–1124 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Demontis D et al. Genome-wide analyses of ADHD identify 27 risk loci, refine the genetic architecture and implicate several cognitive domains. Nature Genetics, 1–11 (2023). [Google Scholar]
- 5.Demontis D et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nature genetics 51, 63 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Chang S, Yang L, Wang Y & Faraone SV Shared polygenic risk for ADHD, executive dysfunction and other psychiatric disorders. Translational Psychiatry 10, 1–9 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wray NR et al. From basic science to clinical application of polygenic risk scores: a primer. JAMA psychiatry 78, 101–109 (2021). [DOI] [PubMed] [Google Scholar]
- 8.Ronald A, de Bode N & Polderman TJ Systematic review: how the attention-deficit/hyperactivity disorder polygenic risk score adds to our understanding of ADHD and associated traits. Journal of the American Academy of Child & Adolescent Psychiatry 60, 1234–1277 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Martin J et al. Sex-specific manifestation of genetic risk for attention deficit hyperactivity disorder in the general population. Journal of Child Psychology and Psychiatry 59, 908–916 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Li JJ & He Q Polygenic scores for ADHD: a meta-analysis. Research on Child and Adolescent Psychopathology 49, 297–310 (2021). [DOI] [PubMed] [Google Scholar]
- 11.LaBianca S et al. Polygenic profiles define aspects of clinical heterogeneity in attention deficit hyperactivity disorder. Nature Genetics, 1–11 (2023). [Google Scholar]
- 12.Demontis D et al. Genome-wide analyses of ADHD identify 27 risk loci, refine the genetic architecture and implicate several cognitive domains. Nature genetics 55, 198–208 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Demontis D et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nature genetics 51, 63–75 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Welter D et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic acids research 42, D1001–D1006 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lambert SA et al. The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nature Genetics 53, 420–425 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Mollon J et al. Low Stability and Specificity of Polygenic Risk Scores for Major Psychiatric Disorders Limit their Clinical Utility. Biological Psychiatry (2025). [Google Scholar]
- 17.Martin J, O’donovan M, Thapar A, Langley K & Williams N The relative contribution of common and rare genetic variants to ADHD. Translational psychiatry 5, e506–e506 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Chang X, Qu H, Liu Y, Glessner J & Hakonarson H A Protective Role of Low Polygenic Risk Score in Healthy Individuals Carrying Attention-Deficit/Hyperactivity Disorder-Associated Copy Number Variations. Biol Psychiatry 95, 881–887 (2024). [DOI] [PubMed] [Google Scholar]
- 19.Demontis D et al. Risk variants and polygenic architecture of disruptive behavior disorders in the context of attention-deficit/hyperactivity disorder. Nature communications 12, 576 (2021). [Google Scholar]
- 20.Peisch V, Li V & Arnett AB Effortful Control Protects Against Familial Liability for ADHD: Longitudinal Results from the ABCD Study in the United States. Research on Child and Adolescent Psychopathology, 1–10 (2023). [Google Scholar]
- 21.Owens EB et al. Which treatment for whom for ADHD? Moderators of treatment response in the MTA. Journal of consulting and clinical psychology 71, 540 (2003). [DOI] [PubMed] [Google Scholar]
- 22.Sollis E et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic acids research 51, D977–D985 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bourgeois FT et al. Development of the precision link biobank at Boston Children’s Hospital: challenges and opportunities. Journal of Personalized Medicine 7, 21 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Rockowitz S et al. Children’s rare disease cohorts: an integrative research and clinical genomics initiative. NPJ genomic medicine 5, 29 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.French CE et al. Hospital-wide access to genomic data advanced pediatric rare disease research and clinical outcomes. NPJ Genomic Medicine 9, 1–11 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.O’Connell M et al. Diverse clinical presentation of SPTBN1 variants: Complex versus primary attention-deficit/hyperactivity disorder. American Journal of Medical Genetics Part A 197, e63851 (2025). [DOI] [PubMed] [Google Scholar]
- 27.Chen EY et al. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC bioinformatics 14, 1–14 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lahey BB et al. Associations of polygenic risk for attention-deficit/hyperactivity disorder with general and specific dimensions of childhood psychological problems and facets of impulsivity. Journal of psychiatric research 152, 187–193 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Chang CC et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, s13742–015-0047–8 (2015). [Google Scholar]
- 30.Pedersen EM et al. ADuLT: An efficient and robust time-to-event GWAS. Nature Communications 14, 5553 (2023). [Google Scholar]
- 31.Choi SW & O’Reilly PF PRSice-2: Polygenic Risk Score software for biobank-scale data. Gigascience 8, giz082 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Privé F et al. Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort. The American Journal of Human Genetics 109, 12–23 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Savage JE et al. Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nature genetics 50, 912–919 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Olfson E et al. Ultra-rare de novo damaging coding variants are enriched in attention-deficit/hyperactivity disorder and identify risk genes. (2023). [Google Scholar]
- 35.Oetjens M, Kelly M, Sturm A, Martin C & Ledbetter D Quantifying the polygenic contribution to variable expressivity in eleven rare genetic disorders. Nature communications 10, 4897 (2019). [Google Scholar]
- 36.Jurgens SJ et al. Adjusting for common variant polygenic scores improves yield in rare variant association analyses. Nat Genet 55, 544–548 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Neumann A et al. Association between DNA methylation and ADHD symptoms from birth to school age: a prospective meta-analysis. Transl Psychiatry 10, 398 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.van Dongen J et al. Epigenome-wide Association Study of Attention-Deficit/Hyperactivity Disorder Symptoms in Adults. Biol Psychiatry 86, 599–607 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.van Mil NH et al. DNA methylation profiles at birth and child ADHD symptoms. J Psychiatr Res 49, 51–9 (2014). [DOI] [PubMed] [Google Scholar]
- 40.Weerts MJA et al. Delineating the molecular and phenotypic spectrum of the SETD1B-related syndrome. Genet Med 23, 2122–2137 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.O’Donnell-Luria AH et al. Heterozygous Variants in KMT2E Cause a Spectrum of Neurodevelopmental Disorders and Epilepsy. Am J Hum Genet 104, 1210–1222 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Cif L et al. KMT2B-related disorders: expansion of the phenotypic spectrum and long-term efficacy of deep brain stimulation. Brain 143, 3242–3261 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lee S et al. Comparison of methylation episignatures in KMT2B- and KMT2D-related human disorders. Epigenomics 14, 537–547 (2022). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data are available internally to all BCH researchers and clinicians through the genomics analysis platform. To facilitate external access, the CRDC data has also been made available to the Genomic Information Commons project (https://www.genomicinformationcommons.org/). Code available upon request. Clinical and genomic data are available to BCH researchers and their collaborators via request to the Biobank Sample and Data Access Committee. Through the Genomic Information Commons (GIC) (https://www.genomicinformationcommons.org/), PrecisionLink Biobank genomic data can be queried and requested by researchers at any of the established member sites.
