Skip to main content
Human Genetics and Genomics Advances logoLink to Human Genetics and Genomics Advances
. 2024 Aug 12;5(4):100340. doi: 10.1016/j.xhgg.2024.100340

Copy-number variants differ in frequency across genetic ancestry groups

Laura M Schultz 1,8,, Alexys Knighton 2, Guillaume Huguet 3, Zohra Saci 3, Martineau Jean-Louis 3, Josephine Mollon 4,5, Emma EM Knowles 4,5, David C Glahn 4,5, Sébastien Jacquemont 3,6, Laura Almasy 1,7,∗∗
PMCID: PMC11401192  PMID: 39138864

Summary

Copy-number variants (CNVs) have been implicated in a variety of neuropsychiatric and cognitive phenotypes. We found that deleterious CNVs are less prevalent in non-European ancestry groups than they are in European ancestry groups of both the UK Biobank (UKBB) and a US replication cohort (SPARK). We also identified specific recurrent CNVs that consistently differ in frequency across ancestry groups in both the UKBB and SPARK. These ancestry-related differences in CNV prevalence present in both an unselected community population and a family cohort enriched with individuals diagnosed with autism spectrum disorder (ASD) strongly suggest that genetic ancestry should be considered when probing associations between CNVs and health outcomes.

Keywords: SPARK, UK Biobank, duplication, deletion, autism, recurrent CNV, LOEUF, neuropsychiatric disorder


The authors report that CNVs annotated as recurrent or deleterious vary in frequency across genetic similarity groups in both the UK Biobank and SPARK. They also identify individual recurrent CNVs previously associated with neuropsychiatric phenotypes that consistently differ in frequency across genetic similarity groups in both cohorts.

Main text

Copy-number variants (CNVs), which are duplicated or deleted genomic segments larger than 1,000 bp,1 are associated with a wide range of human complex traits2,3,4,5,6,7 and diseases.8,9,10,11 They are especially well studied in neurodevelopmental12,13,14,15 and psychiatric disorders,4,15,16,17,18 most notably autism spectrum disorder (ASD)19,20,21,22 and schizophrenia.23,24,25 Unfortunately, most CNV association studies have been limited to European (EUR) ancestry groups2,5,6,7,8,9,11 or pooled across ancestry groups.10,13,14,15,17,19,20,21,23 While some studies have characterized CNVs in African (AFR)26,27,28,29 and other non-EUR ancestry groups,15,30,31,32 efforts to compare CNV frequencies across ancestry groups have been inconclusive due to limited sample sizes.14,33,34,35,36,37,38,39,40 Since genetic variation among human populations41,42 contributes to differential disease risk,43,44,45,46 immune response,47,48 and pharmacogenomics,49,50 it is plausible that ancestry-related differences in CNV frequency could impact precision medicine. Hence, we took advantage of previously overlooked diversity within the UK Biobank (UKBB) to compare CNV frequency across four genetic ancestry groups: individuals with inferred EUR (n = 51,334), south Asian (SAS; n = 8,848), or AFR (n = 8,447) ancestry plus a subset of EUR-ancestry individuals who self-identified as “white British” (WB; n = 385,636) (Figure S1). Even though the SAS and AFR groups each represent about 2% of the UKBB, they are orders of magnitude larger than the non-EUR groups included in previous CNV studies.

We called autosomal CNVs consisting of at least 50,000 bp from microarray data using PennCNV and QuantiSNP (supplemental methods). When we considered all CNVs, we found that the carrier frequency was generally similar across ancestry groups. Nonetheless, there were more deletion (DEL) carriers in the EUR and SAS groups and duplication (DUP) carriers in the WB group than expected given the group size (Figure 1A). For all ancestry groups, DEL carriers were more common than DUP carriers.

Figure 1.

Figure 1

CNV carrier prevalence by ancestry group in the UK Biobank

(A) When CNVs are not filtered based on recurrence or burden, deletion (DEL) carrier prevalence is higher than expected for the South Asian ancestry group (SAS), and duplication (DUP) carrier prevalence is lower than expected for the African ancestry group (AFR) (χ2 = 146.7).

(B) When considering only recurrent CNVs, there were fewer AFR DEL and DUP carriers than expected (χ2 = 48.145).

(C) Filtering instead on burden (total 1/LOEUF ≥ 5.7), there were fewer DEL and DUP carriers than expected for both AFR and SAS (χ2 = 126.89).

(D) Limiting to carriers of recurrent CNVs with total 1/LOEUF ≥ 5.7, DEL carrier prevalence was lower than expected for both AFR and SAS, but DUP carrier prevalence was only lower than expected for AFR (χ2 = 61.397). Plus signs indicate significantly higher than expected carrier prevalence, and minus signs indicate significantly lower than expected carrier prevalence. Ancestry-specific carrier prevalence was computed as the number of carriers of at least one DEL (or DUP) divided by the total number of individuals in that ancestry group. Simulated p values (2,000 replicates) were <0.0005 for all four χ2 tests of independence. ∗∗False discovery rate (FDR)-corrected p < 0.005; ∗∗∗FDR-corrected p < 0.0005. An additional category (“neither”) that was included in each χ2 test is not shown in the bar charts.

Given that CNVs have a range of effect sizes, as judged by their association with health outcomes or by their degree of evolutionary constraint, we considered several subsets of CNV carriers. First, we limited our analyses to carriers of a pre-selected set of recurrent CNVs with common breakpoints that were previously observed in multiple individuals and found to be associated with neuropsychiatric phenotypes (Table S1). We observed 50 unique recurrent DEL (Table S2) and 60 unique recurrent DUP (Table S3) at these loci in the UKBB. Of the 110 unique recurrent CNVs observed in the combined sample, 106 were present in the WB. The four CNVs not observed in the WB group were present in the EUR group, where we observed 77 unique recurrent CNVs. The 40 unique recurrent CNVs observed in the AFR group and the 36 in the SAS group were subsets of those observed in the WB and EUR groups; none of the 110 recurrent CNVs were unique to the non-EUR ancestry groups in the UKBB. There were significantly fewer AFR-ancestry recurrent DEL and DUP carriers than expected under the null hypothesis that recurrent CNV carrier frequency is independent of ancestry group (Figure 1B).

Second, we filtered CNV carriers using the loss-of-function observed/expected upper bound fraction (LOEUF)51 constraint metric. When we limited our analyses to carriers of CNVs with constraint scores equivalent to disruption of at least two predicted loss-intolerant genes (i.e., total 1/LOEUF ≥ 5.7 summed across the genes within the CNVs carried by an individual), ancestry-related differences persist and become even stronger. Consistent with previous evidence that DELs in coding sequences are under stronger purifying selection than DUPs,52 the burden-filtered carrier frequency for DELs was substantially lower than that for DUPs for all four ancestry groups. The deleterious DEL carrier frequency for WB individuals was nearly twice that of AFR or SAS individuals, both when we filtered by 1/LOEUF summed across all CNVs (Figure 1C) and when we limited the summation to recurrent CNVs (Figure 1D). Likewise, carriers of deleterious DUPs were more prevalent in the WB group and less prevalent in the AFR group than expected. We obtained similar results when we excluded all individuals related at the third-degree or closer (Figure S2; Table S4).

We also examined the prevalence of individual recurrent CNVs. To maximize power, we limited our analysis to the 11 recurrent CNVs (5 DELs and 6 DUPs) that each had a total of ≥275 observations across the four ancestry groups. We found ancestry-related differences in the prevalence of six of these CNVs (Figure 2), with some CNVs being more prevalent in WB individuals but others occurring at higher rates in the AFR and SAS groups. While there were some related individuals among the carriers of some of the CNVs, the pattern of frequency differences was essentially unchanged when we excluded all individuals with third-degree or closer relatives from the dataset (Figure S3; Table S4). Hence, the observed differences cannot be explained by related individuals carrying the same CNVs.

Figure 2.

Figure 2

Prevalence of individual recurrent CNVs across ancestry groups in the UK Biobank

There were significant differences between expected and observed counts for 6 out of the 11 recurrent CNVs selected for analysis, χ2 = 425.3, simulated p < 0.0005 (2,000 replicates). Carrier prevalence was calculated as the number of carriers of a given recurrent CNV divided by the total number of individuals in that ancestry group. Plus and minus signs indicate that the standardized residuals were statistically significantly higher or lower than zero, respectively, after FDR correction. ∗FDR-corrected p < 0.05; ∗∗FDR-corrected p < 0.005; ∗∗∗FDR-corrected p < 0.0005. An additional category (“none of the above”) that was included in the χ2 test is not shown in the bar chart.

Ascertainment bias is another potential explanation for the lower rate of deleterious CNVs in some ancestry groups within the UKBB. Immigrants may be less likely to participate in the UKBB, especially if they are carriers of deleterious CNVs. While EUR, AFR, and SAS individuals were more likely than WB individuals to have been born outside the UK or Ireland, the proportions of SAS and AFR individuals who were immigrants did not significantly differ for recurrent CNV carriers compared to non-carriers (Figure S4), suggesting that differential immigration rates cannot explain the differences in CNV prevalence across ancestry groups.

Demographic differences between the UKBB ancestry groups could also contribute to the observed differences in CNV prevalence. On the whole, UKBB participants are more likely to be female, healthier, wealthier, and older than the general population.53 However, individuals in the SAS and AFR ancestry groups tend to be younger than those in the WB and EUR groups, and the female-to-male ratios differed across the ancestry groups (Tables S5 and S6). Furthermore, median Townsend deprivation index scores suggest that there are socioeconomic differences that could be associated with ancestry, sex, and recurrent CNV carrier status (Table S7). Given that pathogenic CNVs have been linked to socioeconomic status,54 sex,55 and age at death,56 we used propensity score matching to balance the sample sizes and control for the potential effects of these variables on CNV carrier rates by downsampling the WB group to create two subgroups matched to the AFR and SAS groups (Figure S5; Tables S8 and S9). The AFR-ancestry group had significantly lower odds of carrying both unfiltered and 1/LOEUF-filtered recurrent DELs and DUPs than its matched WB group (all two-sided Fisher’s exact test p <0.0000005; Figure 3). Indeed, the matched comparisons yielded more pronounced differences than the unmatched ones despite having smaller sample sizes. In contrast, differences between the SAS and matched WB group were somewhat attenuated. Nonetheless, the SAS group showed significantly lower odds of carrying 1/LOEUF-filtered recurrent DELs when compared to its matched WB group (p < 0.00001).

Figure 3.

Figure 3

Odds of carrying deleterious CNVs in AFR- and SAS-ancestry individuals compared to WB individuals in the UK Biobank after propensity-score matching on Townsend deprivation index, age, and sex

Odds were computed for each ancestry group as the number of carriers divided by the number of non-carriers of a given class of recurrent CNV. ORs were computed as AFR odds divided by WB odds (purple dots) or SAS odds divided by WB odds (blue dots). Error bars indicate 95% Fisher confidence limits for the OR computed using the matched data, and the p value is for the corresponding two-sided Fisher’s exact test for that OR. The red dashed lines indicate the expected OR of 1 (i.e., equal odds). AFR odds were lower than the WB odds at each level of filtering, but SAS odds were not consistently lower than the WB odds. For comparison purposes, ORs were also calculated using the data from all 385,636 white British (WB), 8,447 African (AFR), and 8,848 South Asian (SAS) individuals and are included as open circles.

We also used the matched datasets to compare the odds of carrying our 11 recurrent CNVs of interest (Figure 4) and found that most of the observed ancestry-related differences in CNV prevalence remained significant. Using two-sided Fisher’s exact tests of the carrier odds ratios (ORs), we found that the AFR group had significantly lower odds of carrying 2q13 NPHP1 DEL, CRYL1 DEL, 15q11.2 DEL, 15q13.3 BP4.5-BP5 CHRNA7 DUP, 16p13.11 DUP, and 22q11.2 proximal (with low-copy-number repeat A [LCR-A]) DUP, as well as higher odds of carrying 2q13 NPHP1 DUP compared to its matched WB group. The SAS group also had significantly lower odds of carrying 2q13 NPHP1 DEL, CRYL1 DEL, 15q11.2 DEL, 15q13.3 BP4.5-BP5 CHRNA7 DUP, and 22q11.2 proximal (with LCR-A) DUP, and higher odds of carrying 2q13 NPHP1 DUP compared to its matched WB group. Additionally, the SAS group had higher odds of carrying ZNF92 DEL and 15q11.2 DUP compared to its matched WB group.

Figure 4.

Figure 4

Odds of carrying individual recurrent CNVs for non-EUR compared to WB-ancestry groups in the UK Biobank

Odds were computed for each ancestry group as the number of carriers divided by the number of non-carriers of a given recurrent CNV. ORs were computed as AFR odds divided by WB odds (purple dots) or SAS odds divided by WB odds (blue dots). Error bars indicate 95% Fisher confidence limits for the OR computed using matched data, and the p value is for the corresponding two-sided Fisher’s exact test for that OR. The red dashed lines indicate the expected OR of 1 (i.e., equal odds). Compared to the WB odds, AFR odds were significantly lower for 6 and significantly higher for 1 of the 11 selected recurrent CNVs. SAS odds were significantly lower than WB odds for 5 of the same 6 recurrent CNVs and significantly higher than WB for 3 recurrent CNVs, including the same one observed to be higher for AFR. The two blue half circles correspond to ORs of zero; neither of those recurrent CNVs were observed in the SAS-ancestry group. For comparison purposes, ORs were also calculated using the data from all 385,636 WB, 8,447 AFR, and 8,848 SAS individuals and are included as open circles.

Finally, we replicated our study using SPARK, a younger US cohort enriched with individuals diagnosed with ASD and intellectual disability (ID) (Tables S10–S12). We observed 51 unique recurrent DEL (Table S13) and 51 unique recurrent DUP (Table S14) across the three largest inferred ancestry groups (Figure S6) comprising 46,869 EUR, 7,870 admixed American (AMR), and 3,680 AFR individuals. We used propensity score matching to balance the sample sizes and control for potentially confounding effects of ASD and ID status, age, and sex on recurrent CNV carrier rates by downsampling the EUR group to create two subgroups matched to the AFR and AMR groups (Figure S7; Tables S15 and S16). Replicating our UKBB results, we found that the SPARK AFR group had significantly lower odds of carrying both unfiltered and 1/LOEUF-filtered recurrent DELs and DUPs than its matched EUR group (all two-sided Fisher’s exact test p < 0.005; Figure 5). The AMR group had significantly lower odds of carrying both unfiltered and 1/LOEUF-filtered recurrent DUPs (two-sided Fisher’s exact test p < 0.005; Figure 5).

Figure 5.

Figure 5

Odds of carrying deleterious CNVs in AFR- and AMR-ancestry individuals compared to EUR individuals in SPARK after propensity-score matching on ASD and ID status, age, and sex

Odds were computed for each ancestry group as the number of carriers divided by the number of non-carriers of a given class of recurrent CNV. ORs were computed as AFR odds divided by EUR odds (purple dots) or AMR odds divided by WB odds (blue dots). Error bars indicate 95% Fisher confidence limits for the OR computed using matched data, and the p value is for the corresponding two-sided Fisher’s exact test for that OR. The red dashed lines indicate the expected OR of 1 (i.e., equal odds). AFR carrier odds were lower than the EUR odds at each level of filtering, but AMR odds were significantly lower only for recurrent DUP carriers. For comparison purposes, ORs were also calculated using the data from all 46,869 EUR, 3,680 AFR, and 7,870 admixed AMR individuals and are included as open circles.

We also used the matched datasets to compare the odds of carrying the same 11 recurrent CNVs we analyzed for the UKBB along with 2 additional recurrent CNVs that had at least 75 copies each in SPARK despite not meeting our inclusion criteria for the UKBB (Figure 6). Replicating our UKBB results, the SPARK AFR group showed significant differences for 2q13 NPHP1 DEL, 15q11 DEL, 2q13 NPHP1 DUP, 15q13.3 BP4.5-BP5 CHRNA7 DUP, and 16p13.11 DUP carrier odds relative to its matched EUR group (two-sided Fisher’s exact test p < 0.05). Additionally, the SPARK AFR group had significantly lower odds of carrying NRXN1 DEL and higher odds of carrying 16p12.1 DEL relative to its matched EUR group, and the SPARK AMR group had significantly lower odds of carrying NRXN1 DEL, 15q11.2 DEL, and 1q21 TAR DUP relative to its matched EUR group (two-sided Fisher’s exact test p < 0.05).

Figure 6.

Figure 6

Odds of carrying individual recurrent CNVs for non-EUR- compared to EUR-ancestry groups in SPARK

Odds were computed for each ancestry group as the number of carriers divided by the number of non-carriers of a given recurrent CNV. ORs were computed as AFR odds divided by EUR odds (purple dots) or AMR odds divided by EUR odds (blue dots). Error bars indicate 95% Fisher confidence limits for the OR computed using matched data, and the p value is for the corresponding two-sided Fisher’s exact test for that OR. The red dashed lines indicate the expected OR of 1 (i.e., equal odds). The purple half circle corresponds to an OR of zero; this recurrent CNV was not observed in the AFR-ancestry group. For comparison purposes, ORs were also calculated using the data from all 46,869 EUR, 3,680 AFR, and 7,870 admixed AMR individuals and are included as open circles.

We demonstrated that ancestry-related differences in CNV carrier prevalence are present in both unselected community populations (UKBB) and cohorts enriched with ASD-diagnosed individuals (SPARK). We replicated the observed differences between the AFR and WB cohorts in the UKBB by comparing the AFR and EUR cohorts in SPARK, which is notable given the ascertainment differences, differing genotyping platforms, and presumed genetic differences between the homogeneous WB subset of the UKBB and an EUR-ancestry cohort from the United States.57 Furthermore, SAS (UKBB) and AMR (SPARK) ancestry groups also exhibited unique patterns of CNV prevalence, demonstrating that differences in CNV carrier prevalence cannot be generalized as EUR vs. non-EUR differences.

One potential explanation for the difference in prevalence of specific CNVs across ancestry groups is that population variation in the flanking sequence may affect the probability of deletions and duplications in a region. For example, chromosomal regions with polymorphic inversions have been shown to be enriched for recurrent CNVs associated with developmental delay and neuropsychiatric disorders.58

Given that AFR populations have been shown to have greater genetic diversity,59 the finding of fewer rare CNVs in the AFR groups of UKBB and SPARK is somewhat surprising. One possible explanation for our ancestry-divergent results could be the Eurocentric focus of previous studies of genetic variants.43 To focus on deleterious CNVs, we limited some of our analyses to recurrent CNVs that had been previously implicated in neuropsychiatric disorders and/or filtered our results based on the LOEUF metric. Our targeted recurrent CNVs were originally identified in cohorts dominated by EUR-ancestry individuals, so it is conceivable that there are as-yet undiscovered CNVs with medical relevance that are more common in other ancestry groups. Also, the LOEUF constraint metric was derived from the gnomAD version 2 reference database, which includes sequences from 64,603 individuals from EUR populations and only 12,487 individuals from AFR (or African American) and 15,308 individuals from SAS populations.51 We expect that the discovery of additional recurrent CNVs, especially those smaller than 50 kb, will be facilitated by the increasing availability of long-read whole-genome sequence data collected from diverse populations, such as All of Us.40,60 In any case, it is likely that differing linkage disequilibrium patterns35 and flanking sequences61 also contributed to the ancestry-related differences in CNV frequency that we found. Future CNV studies utilizing long-read sequences should clarify the extent to which polymorphic differences in the flanking sequence across populations contribute to the observed differences in CNV frequency.

These findings are limited by the sample sizes of the AFR, SAS, and AMR ancestry groups in the UKBB and SPARK, which are much smaller than those of the WB and EUR ancestry groups. Larger CNV studies of diverse samples are needed. Additional limitations include our focus on only CNVs >50 kb, a by-product of using genotype array data to call CNVs, and the choice of LOEUF as a metric of the deleteriousness of a CNV. Given that LOEUF is focused on coding genes, non-coding CNVs that impact important regulatory elements may have been missed in our filtered set of deleterious CNVs with 1/LOEUF ≥ 5.7. Also, while the recurrent CNVs we selected for analysis (Table S1) had all been associated previously with neuropsychiatric disorders,13,23,62,63,64,65,66,67,68,69,70,71 their ClinGen dosage sensitivity scores72 do not universally implicate them as being clinically relevant. Of the recurrent CNVs that we identified as having significant ancestry-related frequency differences (Figures 4 and 6), ClinGen currently classifies only the NRXN1 and 15q11.2 DEL as having sufficient evidence for haploinsufficiency, the 16p12.1 DEL as having emerging evidence for haploinsufficiency, the 22q11.2 proximal (with LCR-A) DUP as having sufficient evidence for triplosensitivity, and the 16.p13.11 DUP as having emerging evidence for triplosensitivity (Table S17). The strengths of the study include the use of propensity score matching to address potential sources of bias and the consistency of results across samples with very different demographic profiles and ascertainment schemes.

Although classifying individuals into continental ancestry groups imposes discrete categories onto what is actually a continuum of human genetic variation,73,74 it has nonetheless been a useful approach for considering the potential effects of population structure on genome-wide association analyses.75,76 To the extent that there is admixture between the groups we have defined, it would be expected to reduce the genetic differences between groups. Thus, the differences we have described should be viewed as conservative estimates. We recommend that future CNV association studies also be adjusted for population structure, ideally in a quantitative way that reflects the continuous spectrum of genetic variation, to limit the risk of spurious discoveries.

Data and code availability

This paper solely uses publicly accessible, deidentified data. UK Biobank and SPARK data are available to qualified researchers upon application. We provide our custom Python code and a detailed description of our CNV calling pipeline at https://martineaujeanlouis.github.io/MIND-GENESPARALLELCNV/.

Acknowledgments

This work used the UK Biobank, a major biomedical database, under application number 40980. We are grateful to all the families in SPARK, the SPARK clinical sites, and the SPARK staff, and we appreciate obtaining access to phenotypic and genetic data on SFARI Base. Our research was funded by the National Institute of Mental Health (U01MH119690).

Declaration of interests

The authors declare no competing interests.

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.xhgg.2024.100340.

Contributor Information

Laura M. Schultz, Email: schultzl1@chop.edu.

Laura Almasy, Email: almasyl@chop.edu.

Web resources

Clinical Genome Resource (ClinGen), https://search.clinicalgenome.org/

CNV calling pipeline, https://martineaujeanlouis.github.io/MIND-GENESPARALLELCNV/

KING: Kinship-based Inference for GWAS, https://www.kingrelatedness.com/

The R Project for Statistical Computing, https://www.r-project.org

R package e1071, https://cran.r-project.org/web/packages/e1071/e1071.pdf

Simons Powering Autism Research (SPARK), https://www.sfari.org/resource/spark/

UK Biobank, https://www.ukbiobank.ac.uk/

Supplemental information

Document S1. Supplemental methods, Figures S1‒S10, and Tables S4‒S12 and S15‒S18
mmc1.pdf (2.9MB, pdf)
Data S1. Tables S1‒S3, S13, and S14
mmc2.xlsx (32.4KB, xlsx)
Document S2. Article plus supplemental information
mmc3.pdf (5.4MB, pdf)

References

  • 1.Pös O., Radvanszky J., Buglyó G., Pös Z., Rusnakova D., Nagy B., Szemes T. DNA copy number variation: Main characteristics, evolutionary significance, and pathological aspects. Biomed. J. 2021;44:548–559. doi: 10.1016/j.bj.2021.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Kendall K.M., Bracher-Smith M., Fitzpatrick H., Lynham A., Rees E., Escott-Price V., Owen M.J., O'Donovan M.C., Walters J.T.R., Kirov G. Cognitive performance and functional outcomes of carriers of pathogenic copy number variants: Analysis of the UK Biobank. Br. J. Psychiatry. 2019;214:297–304. doi: 10.1192/bjp.2018.301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Alexander-Bloch A., Huguet G., Schultz L.M., Huffnagle N., Jacquemont S., Seidlitz J., Saci Z., Moore T.M., Bethlehem R.A.I., Mollon J., et al. Copy number variant risk scores associated with cognition, psychopathology, and brain structure in youths in the Philadelphia Neurodevelopmental Cohort. JAMA Psychiatr. 2022;79:699–709. doi: 10.1001/jamapsychiatry.2022.1017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Mollon J., Almasy L., Jacquemont S., Glahn D.C. The contribution of copy number variants to psychiatric symptoms and cognitive ability. Mol. Psychiatry. 2023;28:1480–1493. doi: 10.1038/s41380-023-01978-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Owen D., Bracher-Smith M., Kendall K.M., Rees E., Einon M., Escott-Price V., Owen M.J., O’Donovan M.C., Kirov G. Effects of pathogenic CNVs on physical traits in participants of the UK Biobank. BMC Genom. 2018;19:867. doi: 10.1186/s12864-018-5292-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Hujoel M.L.A., Sherman M.A., Barton A.R., Mukamel R.E., Sankaran V.G., Terao C., Loh P.-R. Influences of rare copy-number variation on human complex traits. Cell. 2022;185:4233–4248.e27. doi: 10.1016/j.cell.2022.09.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Auwerx C., Lepamets M., Sadler M.C., Patxot M., Stojanov M., Baud D., Mägi R., Estonian Biobank Research Team. Porcu E., Reymond A., Kutalik Z. The individual and global impact of copy-number variants on complex human traits. Am. J. Hum. Genet. 2022;109:647–668. doi: 10.1016/j.ajhg.2022.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Crawford K., Bracher-Smith M., Owen D., Kendall K.M., Rees E., Pardiñas A.F., Einon M., Escott-Price V., Walters J.T.R., O’Donovan M.C., et al. Medical consequences of pathogenic CNVs in adults: Analysis of the UK Biobank. J. Med. Genet. 2019;56:131–138. doi: 10.1136/jmedgenet-2018-105477. [DOI] [PubMed] [Google Scholar]
  • 9.Aguirre M., Rivas M.A., Priest J. Phenome-wide burden of copy-number variation in the UK Biobank. Am. J. Hum. Genet. 2019;105:373–383. doi: 10.1016/j.ajhg.2019.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Collins R.L., Glessner J.T., Porcu E., Lepamets M., Brandon R., Lauricella C., Han L., Morley T., Niestroj L.-M., Ulirsch J., et al. A cross-disorder dosage sensitivity map of the human genome. Cell. 2022;185:3041–3055.e25. doi: 10.1016/j.cell.2022.06.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Auwerx C., Jõeloo M., Sadler M.C., Tesio N., Ojavee S., Clark C.J., Mägi R., Estonian Biobank Research Team. Reymond A., Kutalik Z., et al. Rare copy-number variants as modulators of common disease susceptibility. Genome Med. 2024;16:5. doi: 10.1186/s13073-023-01265-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Merikangas A.K., Corvin A.P., Gallagher L. Copy-number variants in neurodevelopmental disorders: Promises and challenges. Trends Genet. 2009;25:536–544. doi: 10.1016/j.tig.2009.10.006. [DOI] [PubMed] [Google Scholar]
  • 13.Coe B.P., Witherspoon K., Rosenfeld J.A., van Bon B.W.M., Vulto-van Silfhout A.T., Bosco P., Friend K.L., Baker C., Buono S., Vissers L.E.L.M., et al. Refining analyses of copy number variation identifies specific genes associated with developmental delay. Nat. Genet. 2014;46:1063–1071. doi: 10.1038/ng.3092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Birnbaum R., Mahjani B., Loos R.J.F., Sharp A.J. Clinical characterization of copy number variants associated with neurodevelopmental disorders in a large-scale multiancestry biobank. JAMA Psychiatr. 2022;79:250–259. doi: 10.1001/jamapsychiatry.2021.4080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Zarrei M., Burton C.L., Engchuan W., Higginbotham E.J., Wei J., Shaikh S., Roslin N.M., MacDonald J.R., Pellecchia G., Nalpathamkalam T., et al. Gene copy number variation and pediatric mental health/neurodevelopment in a general population. Hum. Mol. Genet. 2023;32:2411–2421. doi: 10.1093/hmg/ddad074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Malhotra D., Sebat J. CNVs: Harbingers of a rare variant revolution in psychiatric genetics. Cell. 2012;148:1223–1241. doi: 10.1016/j.cell.2012.02.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Calle Sánchez X., Helenius D., Bybjerg-Grauholm J., Pedersen C., Hougaard D.M., Børglum A.D., Nordentoft M., Mors O., Mortensen P.B., Geschwind D.H., et al. Comparing copy number variations in a Danish case cohort of individuals with psychiatric disorders. JAMA Psychiatr. 2022;79:59–69. doi: 10.1001/jamapsychiatry.2021.3392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Rees E., Kirov G. Copy number variation and neuropsychiatric illness. Curr. Opin. Genet. Dev. 2021;68:57–63. doi: 10.1016/j.gde.2021.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Sebat J., Lakshmi B., Malhotra D., Troge J., Lese-Martin C., Walsh T., Yamrom B., Yoon S., Krasnitz A., Kendall J., et al. Strong association of de novo copy number mutations with autism. Science. 2007;316:445–449. doi: 10.1126/science.1138659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Leppa V.M., Kravitz S.N., Martin C.L., Andrieux J., Le Caignec C., Martin-Coignard D., DyBuncio C., Sanders S.J., Lowe J.K., Cantor R.M., Geschwind D.H. Rare inherited and de novo CNVs reveal complex contributions to ASD risk in multiplex families. Am. J. Hum. Genet. 2016;99:540–554. doi: 10.1016/j.ajhg.2016.06.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Fu J.M., Satterstrom F.K., Peng M., Brand H., Collins R.L., Dong S., Wamsley B., Klei L., Wang L., Hao S.P., et al. Rare coding variation provides insight into the genetic architecture and phenotypic context of autism. Nat. Genet. 2022;54:1320–1331. doi: 10.1038/s41588-022-01104-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Vicari S., Napoli E., Cordeddu V., Menghini D., Alesi V., Loddo S., Novelli A., Tartaglia M. Copy number variants in autism spectrum disorders. Prog. Neuro-Psychopharmacol. Biol. Psychiatry. 2019;92:421–427. doi: 10.1016/j.pnpbp.2019.02.012. [DOI] [PubMed] [Google Scholar]
  • 23.Marshall C.R., Howrigan D.P., Merico D., Thiruvahindrapuram B., Wu W., Greer D.S., Antaki D., Shetty A., Holmans P.A., Pinto D., et al. Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects. Nat. Genet. 2017;49:27–35. doi: 10.1038/ng.3725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Bassett A.S., Scherer S.W., Brzustowicz L.M. Copy number variations in schizophrenia: Critical review and new perspectives on concepts of genetics and disease. Am. J. Psychiatry. 2010;167:899–914. doi: 10.1176/appi.ajp.2009.09071016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Maury E.A., Sherman M.A., Genovese G., Gilgenast T.G., Kamath T., Burris S.J., Rajarajan P., Flaherty E., Akbarian S., Chess A., et al. Schizophrenia-associated somatic copy-number variants from 12,834 cases reveal recurrent NRXN1 and ABCB11 disruptions. Cell Genom. 2023;3:100356. doi: 10.1016/j.xgen.2023.100356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.McElroy J.P., Nelson M.R., Caillier S.J., Oksenberg J.R. Copy number variation in African Americans. BMC Genet. 2009;10:15. doi: 10.1186/1471-2156-10-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Nyangiri O.A., Noyes H., Mulindwa J., Ilboudo H., Kabore J.W., Ahouty B., Koffi M., Asina O.F., Mumba D., Ofon E., et al. Copy number variation in human genomes from three major ethno-linguistic groups in Africa. BMC Genom. 2020;21:289. doi: 10.1186/s12864-020-6669-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Wineinger N.E., Pajewski N.M., Kennedy R.E., Wojczynski M.K., Vaughan L.K., Hunt S.C., Gu C.C., Rao D.C., Lorier R., Broeckel U., et al. Characterization of autosomal copy-number variation in African Americans: The HyperGEN study. Eur. J. Hum. Genet. 2011;19:1271–1275. doi: 10.1038/ejhg.2011.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Yilmaz F., Null M., Astling D., Yu H.-C., Cole J., Santorico S.A., Hallgrimsson B., Manyama M., Spritz R.A., Hendricks A.E., Shaikh T.H. Genome-wide copy number variations in a large cohort of Bantu African children. BMC Med. Genomics. 2021;14:129. doi: 10.1186/s12920-021-00978-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Gautam P., Jha P., Kumar D., Tyagi S., Varma B., Dash D., Mukhopadhyay A., Indian Genome Variation Consortium. Mukerji M. Spectrum of large copy number variations in 26 diverse Indian populations: Potential involvement in phenotypic diversity. Hum. Genet. 2012;131:131–143. doi: 10.1007/s00439-011-1050-5. [DOI] [PubMed] [Google Scholar]
  • 31.Gao Y., Yang X., Chen H., Tan X., Yang Z., Deng L., Wang B., Kong S., Li S., Cui Y., et al. A pangenome reference of 36 Chinese populations. Nature. 2023;619:112–121. doi: 10.1038/s41586-023-06173-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Sánchez S., Juárez U., Domínguez J., Molina B., Barrientos R., Martínez-Hernández A., Carnevale A., Grether-González P., Mayen D.G., Villarroel C., et al. Frequent copy number variants in a cohort of Mexican-Mestizo individuals. Mol. Cytogenet. 2023;16:2. doi: 10.1186/s13039-022-00631-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Redon R., Ishikawa S., Fitch K.R., Feuk L., Perry G.H., Andrews T.D., Fiegler H., Shapero M.H., Carson A.R., Chen W., et al. Global variation in copy number in the human genome. Nature. 2006;444:444–454. doi: 10.1038/nature05329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Wong K.K., deLeeuw R.J., Dosanjh N.S., Kimm L.R., Cheng Z., Horsman D.E., MacAulay C., Ng R.T., Brown C.J., Eichler E.E., Lam W.L. A comprehensive analysis of common copy-number variations in the human genome. Am. J. Hum. Genet. 2007;80:91–104. doi: 10.1086/510560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Armengol L., Villatoro S., González J.R., Pantano L., García-Aragonés M., Rabionet R., Cáceres M., Estivill X. Identification of copy number variants defining genomic differences among major human groups. PLoS One. 2009;4:e7230. doi: 10.1371/journal.pone.0007230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Itsara A., Cooper G.M., Baker C., Girirajan S., Li J., Absher D., Krauss R.M., Myers R.M., Ridker P.M., Chasman D.I., et al. Population analysis of large copy number variants and hotspots of human genetic disease. Am. J. Hum. Genet. 2009;84:148–161. doi: 10.1016/j.ajhg.2008.12.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Sudmant P.H., Mallick S., Nelson B.J., Hormozdiari F., Krumm N., Huddleston J., Coe B.P., Baker C., Nordenfelt S., Bamshad M., et al. Global diversity, population stratification, and selection of human copy-number variation. Science. 2015;349:aab3761. doi: 10.1126/science.aab3761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Sudmant P.H., Rausch T., Gardner E.J., Handsaker R.E., Abyzov A., Huddleston J., Zhang Y., Ye K., Jun G., Fritz M.H.Y., et al. An integrated map of structural variation in 2,504 human genomes. Nature. 2015;526:75–81. doi: 10.1038/nature15394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Collins R.L., Brand H., Karczewski K.J., Zhao X., Alföldi J., Francioli L.C., Khera A.V., Lowther C., Gauthier L.D., Wang H., et al. A structural variation reference for medical and population genetics. Nature. 2020;581:444–451. doi: 10.1038/s41586-020-2287-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Ebert P., Audano P.A., Zhu Q., Rodriguez-Martin B., Porubsky D., Bonder M.J., Sulovari A., Ebler J., Zhou W., Serra Mari R., et al. Haplotype-resolved diverse human genomes and integrated analysis of structural variation. Science. 2021;372:eabf7117. doi: 10.1126/science.abf7117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Bamshad M., Wooding S., Salisbury B.A., Stephens J.C. Deconstructing the relationship between genetics and race. Nat. Rev. Genet. 2004;5:598–609. doi: 10.1038/nrg1401. [DOI] [PubMed] [Google Scholar]
  • 42.Edwards A.W.F. Human genetic diversity: Lewontin's fallacy. Bioessays. 2003;25:798–801. doi: 10.1002/bies.10315. [DOI] [PubMed] [Google Scholar]
  • 43.Sirugo G., Williams S.M., Tishkoff S.A. The missing diversity in human genetic studies. Cell. 2019;177:26–31. doi: 10.1016/j.cell.2019.02.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Szpiech Z.A., Mak A.C.Y., White M.J., Hu D., Eng C., Burchard E.G., Hernandez R.D. Ancestry-dependent enrichment of deleterious homozygotes in runs of homozygosity. Am. J. Hum. Genet. 2019;105:747–762. doi: 10.1016/j.ajhg.2019.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Esoh K., Wonkam A. Evolutionary history of sickle-cell mutation: Implications for global genetic medicine. Hum. Mol. Genet. 2021;30:R119–R128. doi: 10.1093/hmg/ddab004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Daneshpajouhnejad P., Kopp J.B., Winkler C.A., Rosenberg A.Z. The evolving story of apolipoprotein l1 nephropathy: The end of the beginning. Nat. Rev. Nephrol. 2022;18:307–320. doi: 10.1038/s41581-022-00538-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Nédélec Y., Sanz J., Baharian G., Szpiech Z.A., Pacis A., Dumaine A., Grenier J.-C., Freiman A., Sams A.J., Hebert S., et al. Genetic ancestry and natural selection drive population differences in immune responses to pathogens. Cell. 2016;167:657–669.e621. doi: 10.1016/j.cell.2016.09.025. [DOI] [PubMed] [Google Scholar]
  • 48.Randolph H.E., Fiege J.K., Thielen B.K., Mickelson C.K., Shiratori M., Barroso-Batista J., Langlois R.A., Barreiro L.B. Genetic ancestry effects on the response to viral infection are pervasive but cell type specific. Science. 2021;374:1127–1133. doi: 10.1126/science.abg0928. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Yang H.-C., Chen C.-W., Lin Y.-T., Chu S.-K. Genetic ancestry plays a central role in population pharmacogenomics. Commun. Biol. 2021;4:171. doi: 10.1038/s42003-021-01681-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Corpas M., Siddiqui M.K., Soremekun O., Mathur R., Gill D., Fatumo S. Addressing ancestry and sex bias in pharmacogenomics. Annu. Rev. Pharmacol. Toxicol. 2024;64:53–64. doi: 10.1146/annurev-pharmtox-030823-111731. [DOI] [PubMed] [Google Scholar]
  • 51.Karczewski K.J., Francioli L.C., Tiao G., Cummings B.B., Alföldi J., Wang Q., Collins R.L., Laricchia K.M., Ganna A., Birnbaum D.P., et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–443. doi: 10.1038/s41586-020-2308-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Schrider D.R., Hahn M.W. Gene copy-number polymorphism in nature. Proc. Biol. Sci. 2010;277:3213–3221. doi: 10.1098/rspb.2010.1180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Fry A., Littlejohns T.J., Sudlow C., Doherty N., Adamska L., Sprosen T., Collins R., Allen N.E. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am. J. Epidemiol. 2017;186:1026–1034. doi: 10.1093/aje/kwx246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Burghel G.J., Khan U., Lin W.-Y., Whittaker W., Banka S. Presence of pathogenic copy number variants (CNVs) is correlated with socioeconomic status. J. Med. Genet. 2020;57:70–72. doi: 10.1136/jmedgenet-2019-106292. [DOI] [PubMed] [Google Scholar]
  • 55.Jacquemont S., Coe B.P., Hersch M., Duyzend M.H., Krumm N., Bergmann S., Beckmann J.S., Rosenfeld J.A., Eichler E.E. A higher mutational burden in females supports a “female protective model” in neurodevelopmental disorders. Am. J. Hum. Genet. 2014;94:415–425. doi: 10.1016/j.ajhg.2014.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Kuningas M., Estrada K., Hsu Y.-H., Nandakumar K., Uitterlinden A.G., Lunetta K.L., van Duijn C.M., Karasik D., Hofman A., Murabito J., et al. Large common deletions associate with mortality at old age. Hum. Mol. Genet. 2011;20:4290–4296. doi: 10.1093/hmg/ddr340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Gouveia M.H., Bentley A.R., Leal T.P., Tarazona-Santos E., Bustamante C.D., Adeyemo A.A., Rotimi C.N., Shriner D. Unappreciated subcontinental admixture in Europeans and European Americans and implications for genetic epidemiology studies. Nat. Commun. 2023;14:6802. doi: 10.1038/s41467-023-42491-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Porubsky D., Höps W., Ashraf H., Hsieh P., Rodriguez-Martin B., Yilmaz F., Ebler J., Hallast P., Maria Maggiolini F.A., Harvey W.T., et al. Recurrent inversion polymorphisms in humans associate with genetic instability and genomic disorders. Cell. 2022;185:1986–2005.e1926. doi: 10.1016/j.cell.2022.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Auton A., Abecasis G.R., Altshuler D.M., Durbin R.M., Abecasis G.R., Bentley D.R., Chakravarti A., Clark A.G., Donnelly P., Eichler E.E., et al. A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Bick A.G., Metcalf G.A., Mayo K.R., Lichtenstein L., Rura S., Carroll R.J., Musick A., Linder J.E., Jordan I.K., Nagar S.D., et al. Genomic data in the All of Us research program. Nature. 2024;627:340–346. doi: 10.1038/s41586-023-06957-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Hayeck T.J., Stong N., Baugh E., Dhindsa R., Turner T.N., Malakar A., Mosbruger T.L., Shaw G.T.-W., Duan Y., Ionita-Laza I., et al. Ancestry adjustment improves genome-wide estimates of regional intolerance. Genetics. 2022;221:iyac050. doi: 10.1093/genetics/iyac050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Kendall K.M., Rees E., Escott-Price V., Einon M., Thomas R., Hewitt J., O’Donovan M.C., Owen M.J., Walters J.T.R., Kirov G. Cognitive performance among carriers of pathogenic copy number variants: Analysis of 152,000 UK Biobank subjects. Biol. Psychiatry. 2017;82:103–110. doi: 10.1016/j.biopsych.2016.08.014. [DOI] [PubMed] [Google Scholar]
  • 63.Lindstrand A., Davis E.E., Carvalho C.M.B., Pehlivan D., Willer J.R., Tsai I.C., Ramanathan S., Zuppan C., Sabo A., Muzny D., et al. Recurrent CNVs and snvs at the NPHP1 locus contribute pathogenic alleles to Bardet-Biedl syndrome. Am. J. Hum. Genet. 2014;94:745–754. doi: 10.1016/j.ajhg.2014.03.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Matsunami N., Hadley D., Hensel C.H., Christensen G.B., Kim C., Frackelton E., Thomas K., da Silva R.P., Stevens J., Baird L., et al. Identification of rare recurrent copy number variants in high-risk autism families and their prevalence in a large ASD population. PLoS One. 2013;8:e52239. doi: 10.1371/journal.pone.0052239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Rosenfeld J.A., Traylor R.N., Schaefer G.B., McPherson E.W., Ballif B.C., Klopocki E., Mundlos S., Shaffer L.G., Aylsworth A.S., 1q21.1 Study Group Proximal microdeletions and microduplications of 1q21.1 contribute to variable abnormal phenotypes. Eur. J. Hum. Genet. 2012;20:754–761. doi: 10.1038/ejhg.2012.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Sanders S.J., Ercan-Sencicek A.G., Hus V., Luo R., Murtha M.T., Moreno-De-Luca D., Chu S.H., Moreau M.P., Gupta A.R., Thomson S.A., et al. Multiple recurrent de novo CNVs, including duplications of the 7q11.23 Williams Syndrome region, are strongly associated with autism. Neuron. 2011;70:863–885. doi: 10.1016/j.neuron.2011.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Sanders S.J., He X., Willsey A.J., Ercan-Sencicek A.G., Samocha K.E., Cicek A.E., Murtha M.T., Bal V.H., Bishop S.L., Dong S., et al. Insights into autism spectrum disorder genomic architecture and biology from 71 risk loci. Neuron. 2015;87:1215–1233. doi: 10.1016/j.neuron.2015.09.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Sanders S.J., Sahin M., Hostyk J., Thurm A., Jacquemont S., Avillach P., Douard E., Martin C.L., Modi M.E., Moreno-De-Luca A., et al. A framework for the investigation of rare genetic disorders in neuropsychiatry. Nat. Med. 2019;25:1477–1487. doi: 10.1038/s41591-019-0581-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Satterstrom F.K., Kosmicki J.A., Wang J., Breen M.S., De Rubeis S., An J.-Y., Peng M., Collins R., Grove J., Klei L., et al. Large-scale exome sequencing study implicates both developmental and functional changes in the neurobiology of autism. Cell. 2020;180:568–584.e23. doi: 10.1016/j.cell.2019.12.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Stefansson H., Meyer-Lindenberg A., Steinberg S., Magnusdottir B., Morgen K., Arnarsdottir S., Bjornsdottir G., Walters G.B., Jonsdottir G.A., Doyle O.M., et al. CNVs conferring risk of autism or schizophrenia affect cognition in controls. Nature. 2014;505:361–366. doi: 10.1038/nature12818. [DOI] [PubMed] [Google Scholar]
  • 71.Vacic V., McCarthy S., Malhotra D., Murray F., Chou H.-H., Peoples A., Makarov V., Yoon S., Bhandari A., Corominas R., et al. Duplications of the neuropeptide receptor gene vipr2 confer significant risk for schizophrenia. Nature. 2011;471:499–503. doi: 10.1038/nature09884. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Thaxton C., Good M.E., DiStefano M.T., Luo X., Andersen E.F., Thorland E., Berg J., Martin C.L., Rehm H.L., Riggs E.R., et al. Utilizing ClinGen gene-disease validity and dosage sensitivity curations to inform variant classification. Hum. Mutat. 2022;43:1031–1040. doi: 10.1002/humu.24291. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Lewis A.C.F., Molina S.J., Appelbaum P.S., Dauda B., Di Rienzo A., Fuentes A., Fullerton S.M., Garrison N.A., Ghosh N., Hammonds E.M., et al. Getting genetic ancestry right for science and society. Science. 2022;376:250–252. doi: 10.1126/science.abm7530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.National Academies of Sciences, Engineering, & Medicine . The National Academies Press; 2023. Using Population Descriptors in Genetics and Genomics Research: A New Framework for an Evolving Field. [DOI] [PubMed] [Google Scholar]
  • 75.Marchini J., Cardon L.R., Phillips M.S., Donnelly P. The effects of human population structure on large genetic association studies. Nat. Genet. 2004;36:512–517. doi: 10.1038/ng1337. [DOI] [PubMed] [Google Scholar]
  • 76.Tian C., Gregersen P.K., Seldin M.F. Accounting for ancestry: Population substructure and genome-wide association studies. Hum. Mol. Genet. 2008;17:R143–R150. doi: 10.1093/hmg/ddn268. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Supplemental methods, Figures S1‒S10, and Tables S4‒S12 and S15‒S18
mmc1.pdf (2.9MB, pdf)
Data S1. Tables S1‒S3, S13, and S14
mmc2.xlsx (32.4KB, xlsx)
Document S2. Article plus supplemental information
mmc3.pdf (5.4MB, pdf)

Data Availability Statement

This paper solely uses publicly accessible, deidentified data. UK Biobank and SPARK data are available to qualified researchers upon application. We provide our custom Python code and a detailed description of our CNV calling pipeline at https://martineaujeanlouis.github.io/MIND-GENESPARALLELCNV/.


Articles from Human Genetics and Genomics Advances are provided here courtesy of Elsevier

RESOURCES