Abstract
Transposable Elements (TEs) are mobile genetic elements that constitute half of the human genome. Recent studies suggest that polymorphic non-reference TEs (nrTEs) may contribute to cognitive diseases, such as schizophrenia, through a cis-regulatory effect. The aim of this work is to identify sets of nrTEs putatively linked to an increased risk of developing schizophrenia. To do so, we inspected the nrTE content of genomes from the Dorsolateral Prefrontal Cortex of schizophrenic and control individuals, and identified 38 nrTEs which possibly contribute to the emergence of this psychiatric disorder. Furthermore, we performed in silico functional inferences and found, for instance, that 9 of the 38 nrTEs act as expression/alternative splicing quantitative trait loci (eQTLs/sQTLs) in the brain, suggesting a possible role in shaping the human cognitive genome structure. Therefore, to our knowledge, this is the first attempt at identifying polymorphic nrTEs that can contribute to the functionality of the brain. Finally, we suggest that a neurodevelopmental genetic mechanism, which involves evolutionarily young nrTEs, can be the key to understanding the ethiopathogenesis of this complex disorder.
Keywords: schizophrenia, retrotransposons, eQTL, evolution, polymorphic TEs
Introduction
Transposable Elements are DNA sequences which have the ability to move around in the genome. TEs constitute 53–60% of the human DNA1,2 and are essential elements in driving genome evolution3. Among non-LTR retrotransposons (Alu, LINE and SVA), only LINE-1 (L1) can actively transpose, while Alu and SVA rely on L1’s machinery to mobilize themselves4. While the vast majority of TEs are no longer transpositionally active, they can still play a functional role as exapted enhancers or transcriptional start sites5–8, by inserting Transcription Factor Binding Sites (TFBS)9,10 or by acting as novel RNA genes such as long non-coding RNAs (lnc-RNAs)11. Therefore, TEs participate in regulating the expression of nearby genes, at transcriptional and post-transcriptional levels, providing a crucial role as both cis- and trans-regulatory RNA sequences12. Baillie and colleagues13 also found that protein-coding loci are disproportionately affected by TEs, with over-representation of L1s in introns and Alus in exons. Overall, TEs seem to predominantly affect neurogenesis and synaptic function, with studies suggesting a putative regulatory role of TEs in the neural genome14–18. Nonetheless, only initial research work has been systematically performed on this issue and the TE-controlled regulatory architecture of the human genome still needs to be better explored and investigated. According to recent studies, TEs’ insertion polymoprhisms “can be mapped as cis- expression quantitative trait loci with substantial effects on gene expression, especially at loci involved in immune response and cognitive function”19,20. A polymorphic TE insertion can be exapted as a functional element depending on its site of insertion within the genome, or it can disrupt an already existing enhancer21. Additionally, polymorphic TEs have been shown to be closely associated with complex phenotypes in GWAS investigations suggesting that polymorphic non-reference TEs (nrTEs) may contribute to disease phenotypes through cis- regulatory effects22–24. Interestingly, nrTEs are relatively young compared to fixed TEs, therefore they are likely to have had a role in the most recent phases of the evolution of our species, which particularly involved the brain and superior cognitive abilities.
In recent years, mounting data provided evidence that epigenetic mechanisms and TEs are playing a key role in schizophrenia and other neurological disorders17, 25–27. For example, Bundo and colleagues15 described an increased number of somatic L1 retrotransposition in the Dorso-Lateral PreFrontal Cortex (DLPFC, Brodmann’s area 46) of people affected by schizophrenia, observing that the total number of brain-specific L1 insertions tended to be higher in schizophrenia patients, an observation confirmed by Doyle et al.18. In our own previous work17 and in a recent review28, L1 insertion sites were also reported to be preferentially localized to synapse- and schizophrenia-related genes. Guffanti et al.25 developed a method to quantify the tissue-specific expression of TEs (as well as other ncRNAs), and found more than 650,000 expressed TEs in the DLPFC of post-mortem human brains: about 114,000 expressed TEs are differentially expressed between schizophrenia cases and healthy controls and mostly represented by primate- or human-specific elements.
A recent study suggests another potential key role for TEs in rewiring the local functional architecture of Human Accelerated Regions (HARs) in Schizophrenia and Bipolar Disorder29. Indeed, HARs have been implicated in neurodevelopmental and neuropsychiatric disorders30–32, and most HARs are known to act as developmental enhancers that are involved in controlling and regulating human cognition33–37.
In this study, our goal was to identify polymorphic TEs that can potentially contribute to schizophrenia. Inspecting the non-reference TE content of DLFPC genomes of schizophrenic individuals (SCZ), we will 1) compare SCZ with control (CTRL) genomes; 2) check the population-specific/geographic distribution of the identified variants; 3) perform haplotype-based association tests; 4) in silico explore the nrTEs possible functional roles as cis-regulatory elements of protein-coding genes and as putative modifiers of known HARs.
Subjects And Methods
DNA from the DLPFC of ten schizophrenic patients and ten psychiatrically healthy controls has been obtained from the UCI Brain Bank, following an UCI/IRB approved protocol.
DNA from the 20 DLPFC samples has been aligned to the human reference genome hs37d5 (http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/) with BWA-mem38. After sorting and merging with Samtools39, the GATK best practices were applied to generate VCF files that include SNPs as well as Indels (https://gatk.broadinstitute.org/hc/en-us/articles/360035894711-About-the-GATK-Best-Practices).
We searched for non-reference TEs (nrTEs: Alu, LINE1 and SVA) with the Mobile Element Locator Tool (MELT) v.2.1.540, using MELT-Split with default parameters on our twenty high-coverage genomes (Supplementary Table 1). To analyze the geographic variability of the putative schizophrenia-related nrTEs, we additionally selected 125 samples from the “1000 Genomes Project” phase 341. These 125 samples were analyzed with MELT jointly with SCZ and CTRL samples. Only “PASS” sites were included in a single final VCF file and only nrTEs mapping in genic or regulatory regions (introns, exons, promoters, terminators and UnTranslated Regions, UTRs) on autosomal chromosomes were considered for further analyses. Fisher tests of independence were performed to identify which nrTEs show significantly different frequencies in SCZ and CTRL. Tests were performed with one and two degrees of freedom, respectively, for allelic and genotype frequencies. nrTEs that yielded nominally significant tests (pval < 0.05) at least for allele and/or genotype frequencies were considered as putatively related with schizophrenia.
To assess the genetic relationships among the individuals included in our dataset, as well as their ancestry, we implemented a principal component analysis (PCA) and ADMIXTURE analysis42, both on the whole variant dataset (single nucleotide polymorphisms, SNPs, and nrTEs) and the nrTE-based only. Quality control (QC) was performed with the PLINK software43.
We also performed a haplotype reconstruction procedure with SHAPEIT v.1.944 on the whole variant dataset to contextualize the genotyped nrTEs into their local genetic environment and evaluate the frequency of the corresponding haplotypes within the DLPFC cohort. We then performed an association test using Beagle v.3.3.245 on the nrTEs with significantly different allelic and/or genotype frequencies between SCZ and CTRL.
As highlighted in the introduction, TEs can act as cis regulatory elements by, for example, modifying the expression of nearby genes and inducing alternative splicing. Therefore, we checked if non-reference TEs may act as eQTLs and/or sQTLs, by comparing our significant results with those from Cao et al.46, based on the GTEx dataset47.
Moreover, we verified whether the statistically significant non-reference TEs (i.e., nrTEs with significantly different allele/genotype frequencies between cases and controls) are located close to genes previously studied in the context of schizophrenia.
We also compared our 38 nrTEs with the lists of HARs as originally identified by Pollard et al.48,49, Prabhakar et al.50, Bird et al.51, Capra et al.52 and Gittelman et al.53, to check if some of the identified TEs are located in those regions.
Results
non-reference retrotransposon insertions
We identified 7 952 nrTEs in genic/regulatory regions: 6 542 Alu (82.3%), 1 065 LINE-1 (13.4%) and 345 SVA (4.3%), as shown in Table 1, using MELT-Split on our 145 samples (10 SCZ, 10 CTRL and 125 normal individuals from 1000 Genomes Project, 1KGP).
We checked the chromosomal distribution of the 7 952 nrTEs and found no significant difference (Fisher pval > 0.63) between the expected and observed content of the different families of TEs (SVA, LINE-1 and Alu) (Supplementary Figs. 1–3).
Population Structure Of The Dataset
To contextualize our 20 DLPFC samples in the worldwide genomic landscape, we performed PCA and ADMIXTURE analyses on nrTEs genotypes and found that the best estimate for K in the latter is 3, with CV error = 0.37350, including 125 1KGP samples from 5 populations. Our results (Figs. 1 and 2) show that nrTEs are useful predictors of the genomic structure of the different human populations, as confirmed by the intermediate position of Indians (ITU) between Europeans (CEU) and Chinese (CHB), as well as by the clear differentiation between Eurasian and African samples.
Figure 1.
Principal Component Analysis (PCA) of the DLPFC and 1KGP samples based only on non-reference TEs. Pink: Europeans (CEU); green: Han Chinese in Beijing (CHB); brown: controls from the DLPFC (CTRL); red: Indian Telugus (ITU); yellow: Luhya in Kenya (LWK); blue: schizophrenic individuals from the DLPFC (SCZ); violet: Yoruba in Nigeria (YRI).
Figure 2.
ADMIXTURE plot based only on nrTEs. K=3 is shown (CV error = 0.37350).
The ADMIXTURE plot shows that CTRL and SCZ share the same ancestral components of CEU, (violet). Han Chinese (orange) have their own ancestral component, as well as the two African populations (colored). ITUs show a mixture of European and Asian components. These results, as a whole, are coherent with those obtained using SNPs41,54.
Accordingly, nrTEs show systematic differences in allele frequencies across populations: 3 131 of 7 952 non-reference TEs, 2 711 Alu (41.4%), 332 LINE-1 (31.1%) and 88 SVA (25.5%), have a significant geographic stratification (Fisher pval < 0.05) (Supplementary Table 2), with 2 263 (28%) presenting with an allele frequency > 5%. Among these, 1 501 nrTEs are found only in African populations, 833 are exclusive of non-African populations (Europeans, Indian Telugus and Chinese) and 955 are common to all the five groups (see Supplementary Table 2 and Supplementary Fig. 6). Both methods (PCA and ADMIXTURE) highlight that DLPFC samples overlap with CEU, with the partial exception of a single SCZ sample, which presents signs of admixture with a Sub-Saharan African source (as represented by YRI and LWK).
These findings suggest that SCZ and CTRL are genetically homogeneous; consequently, variants associated with the disease condition do not depend on underlying population structure.
Comparison Between SCZ And CTRL Subjects
We then compared the distribution of allele and genotype insertion frequencies of nrTEs (herein, ‘counts’) between SCZ cases and normal CTRL. We detected 38 non-reference TEs with significantly different allele/genotype counts between cases and controls: three LINE1s, three SVAs and 32 Alus, considering only those TEs that yielded significant Fisher tests for allele and/or genotype counts55,56 at the nominal p-value = 0.05 (Table 2). Given the limited size of our sample, no correction for multiple testing was performed. All significant nrTEs belong to evolutionarily recent elements (L1Ta and AluY), with two exceptions: the L1 on chr12:126802943 (undetermined subfamily) and the Alu on chr7:141748320 (which belongs to the subfamily Sz, older than Y).
Of these 38 nrTEs, 11 show a significant difference in allele counts only, 14 in genotype counts only and 13 in both allele and genotype counts. Interestingly, most of these TEs also show evidence of differential segregation (Fisher test, pval < 0.05) in human populations (27 for allele counts, 24 for genotype counts, 23 for both).
The most significant allele-wise results (pval < 0.01) among the 38 significant nrTEs include three Alus and one SVA, whose insertion can be found on: chr4:17150918 and chr4:23511024 (both AluY and only observed in SCZ), chr11:40727097 (AluYa5, only observed in CTRL) and chr20:5268423 (SVA, more frequent in SCZ). The three Alus show a statistically significant geographical distribution (Fig. 3 and Supplementary Table 3), presenting variable insertion frequencies across populations, while the SVA on chr20:5268423 has similar allele frequencies in all the considered populations.
Figure 3.
Allele frequencies of Alus on chr4:17150918 (A), chr4:23511024 (B) and chr11:40727097 (C), compared to SVA on chromosome chr20:5268423 (D). Darker colors indicate the presence of the TE (+), while lighter colors indicate the absence (−). Following populations are displayed: Europeans (blue), Indian Telugus (red), Chinese (green) and Africans (yellow), represented by Luhya in Kenya and Yoruba in Nigeria. Allele frequencies for schizophrenic individuals and healthy controls are shown in violet and pink, respectively.
Haplotype-based Association Analysis
We then performed haplotype-based analysis using Beagle on the previously detected 38 variants (Table 2) and obtained two significant results. The first one was for a 188bp haplotype including the AluYb on chr5:100497396, in the promoter of the gene ST8SIA4. This haplotype is characterized by 4 polymorphisms: T + TG (where “+” points out to the presence of the nrTE and the other letters represent single nucleotide variants). It is present with 15 copies in CTRL and 2 in SCZ, suggesting a strong association (pval = 6.86 × 10−5) between the presence of the haplotype and the absence of the disease.
The second significant result was for a 1 172 bp haplotype including the locus of the AluYb7 on chr4:23511024 in the promoter of MIR548AJ2. Interestingly, such haplotype is characterized by the absence of the insertion, with polymorphisms GC-TTI (where “I” stands for InDel and “-” indicates the absence of the nrTE); accordingly, it was found with 19 copies in CTRL and 6 copies in SCZ (pval = 3.93 × 10−5), while the mentioned Alu is completely absent in CTRL samples (Table 2) and present in 7 SCZ, only in an heterozygous condition.
In-silico Functional Inferences For Non-reference TEs
Notably, at least seven genes putatively mapped by our significant non-reference TEs have been already associated with schizophrenia: LRRC4C57, LRRC758–60, ST8SIA461–63, MGAM64, ADAMTS165, MIR548AJ266 and SCN5A, which is linked to the Brugada syndrome67–69.
We then compared our set of 38 significant nrTEs with the eQTLs and sQTLs TEs lists produced by Cao et al.46 using the GTEx dataset47 (Supplementary Table 4). Indeed, 27 TEs (3 LINE-1 and 24 Alu) (68.4%) were detected as potential eQTLs acting in different tissues, 7 of which are expressed in the brain. As for sQTLs, 13 (34.21%) of our TEs (2 LINE-1 and 11 Alu) were detected as potentially contributing to alternative splicing in different tissues, 2 of them supposedly acting in the brain (chr11:76990585 and chr2:36476695). All sQTLs TEs were also eQTLs, with the single exception of an Alu (chr5:159122155, in the promoter of ADRA1B) which works only as sQTL.
We also compared our set of 38 nrTEs with the lists of HARs produced by other Authors (see Materials and Methods) and found that none of these nrTEs are located into HARs.
Discussion
Far from being “junk”, it has been shown that Transposable Elements, such as non-Long Terminal Repeats retrotransposons, can contribute to human genomic diversity in various ways. Mounting data suggest both a positive and a detrimental role of retrotransposons in shaping the human cognitive traits17 and in the development of brain and Central Nervous System (CNS) structures70,71. Several authors also suggest that retrotransposons have an important role in some neurological and psychiatric disorders, such as schizophrenia15,16,18,25,26.
However, the full impact of TEs on the human genome is still unclear, both for technological/methodological limitations as well as for our current lack of knowledge of their precise effects and interactions with other genetic/epigenetic elements. Since a relationship between cognitive disorders and reference TEs has been the subject of several recent studies15,25,26 with this work, we aimed to provide a first overview of non-reference TEs that can potentially contribute to increasing the risk of developing schizophrenia.
In a first step, to evaluate whether nrTEs distribution can contribute to population substructure (= nrTEs frequency differences due to specific populations origin/evolutionary trajectory) and lead to specific population patterns, we inspected the genetic distribution of traditional DNA variants (SNPs, Indels, CNVs) and of nrTEs in our 20 DLPFC subjects together with 125 worldwide individuals that we collected from the 1KGP. Our PCA and Admixture results show that nrTEs are present with mostly population-specific frequencies within our world-wide dataset, similarly to the well-known patterns previously detected in SNP-based studies and we ultimately confirm that nrTEs too contribute to the higher genomic diversity of African populations compared to non-African’s41,54,72.
Our results (Figs. 1 and 2) also show that our SCZ and CTRL individuals fall within the European genomic variability (represented by CEU in both PCA and Admixture analyses) and share a predominant European ancestral component, except for a single individual showing signs of admixture with a Sub-Saharan African source.
Therefore, our results confirm both that polymorphic nrTEs can be used as reliable markers for reconstructing the genomic structure (and potentially the history) of the considered samples/populations, as previously suggested by others40,73,74 and that our SCZ and CTRL sample presents a clear nrTEs genetic homogeneity, which allows to exclude spurious associations due to hidden population substructure.
To our knowledge ours is the first attempt at detecting non-reference (polymorphic) TEs as possibly associated with schizophrenia. Even though our results are based on a relatively low number of subjects, spurious associations are unlikely, given the homogeneity and absence of population substructure of our SCZ/CTRL sample. Indeed, we identified 38 nrTEs whose frequencies are significantly different between Schizophrenics and Controls. Allele frequency differences for these TEs are similar or even higher than those observed between the most different ‘control’ populations for the same insertions, suggesting that it is highly improbable that the observed differences emerged by chance. Allele-wise, the most significant results (pval < 0.01) are three Alus and one SVA, which respectively fall on chr4:17150918, chr4:23511024, chr11:40727097 and chr20:5268423. The first two Alus are found only in SCZ, the third only in CTRL and the SVA on chromosome 20 is more frequent in SCZ subjects. The Alu on chr11:40727097 is completely absent in SCZ (but present in 27.4% of the other individuals) (Fig. 4C) and is located in the second intron of the Leucine Rich Repeat Containing 4C (LRRC4C) gene, which is highly expressed in the frontal cortex and has been associated with a positive response to antipsychotic therapy with lurasidone in SCZ patients57. In our case, the absence of this Alu insertion is preferentially associated with the schizophrenic condition (see Tables 2 and 3).
After inferring the haplotypic context surrounding the 38 nrTEs of interest, we performed an association test with Beagle, which returned two significant results: one is the AluYb on chr5:100497396, located in the promoter of the gene ST8 Alpha-N-Acetyl-Neuraminide Alpha-2,8-Sialyltransferase 4 (ST8SIA4). The haplotype-based analysis revealed that the haplotype (T + TG) with the presence of the nrTE is found with 15 copies in CTRL and 2 in SCZ; indeed, there was a strong association (pval = 6.86 × 10−5) between the presence of the nrTE and the absence of the considered trait (schizophrenia). Therefore, we could hypothesize that the haplotype with the nrTE has a protective role against the disease. Further in vitro or in vivo experiments could elucidate this relationship.
The other haplotype (GC-TTI) is characterized by the absence of the AluYb7 on chr4:23511024: indeed, this nrTE is completely absent in CTRL samples and present, only in heterozygous condition, in 7 SCZ patients. Therefore, our hypothesis is that the presence of the element is putatively related with an increased risk of developing schizophrenia. Moreover, the Alu is located in the promoter of MIR548AJ2, one of the 108 genome-wide significant loci for schizophrenia reported by Ripke and colleagues66.
As highlighted in the introduction, TEs can act in cis, for example, by altering the expression of a gene or by having an impact on its alternative splicing. Therefore, we also looked at the potential role of our 38 significantly different nrTEs by comparing them with the lists produced by Cao and colleagues46 based on the GTEx dataset. We found that 27 nrTEs act as eQTLs in different tissues, with 7 showing a putative eQTL effect in the brain. For instance, the AluYb3a1 on chr9:91099740 acts as eQTL in the frontal cortex and is located in the terminator of SPIN1 (Spindlin 1). Moreover, 13 nrTEs act as sQTLs, two of them in the brain: the AluYg6 chr11:76990585 and the AluYb7 on chr2:36476695. These two Alus are located in the third intron of GDPD4 (Glycerophosphodiester Phosphodiesterase Domain-Containing Protein 4) and in the terminator of the CRIM1 gene, respectively. Interestingly, the AluYb7 on chr2:36476695 acts as sQTL in the frontal cortex, and CRIM1 encodes for the cysteine-rich neuron motor 1 protein, which is developmentally regulated and involved in CNS development and organogenesis75.
Finally, recent research suggests that TEs could change the local functional architecture of HARs in schizophrenia and bipolar disorder29. Therefore, we checked whether our 38 nrTEs are located near (within 1,000bp) or into known HARs (see Materials and Methods), however no match was detected.
In conclusion, our analysis provides the first overview of nrTEs possibly related to an increased risk of developing schizophrenia. We have identified 38 nrTEs of interest and highlighted that several of these elements can have a remarkable impact on the expression, alternative splicing and functionality of nearby genes by cross-checking our results with those available in recently published studies. We also defined two haplotypes in which the presence of the nrTE is either protective against the disease or associated with the schizophrenic condition. Our results, as well as those from other papers dealing with nrTEs, are based on presence/absence of TEs, i.e considering them as biallelic markers; future research based on long-read sequencing will also need to include TEs sequence variability. Moreover, we expect that a similar framework applied to a larger cohort of subjects could confirm and possibly extend our results, and experimental validation of the identified nrTEs will elucidate their effective impact on the cognitive genome.
Having identified both reference25 and non-reference TEs associated with an increased risk to develop schizophrenia suggests that a neurodevelopmental genetic mechanism is at play in the etiopathogenesis of this complex disorder. Under this hypothesis, and given that TEs controlling for the functional architecture of the neural genome are mostly evolutionarily recent (either human-only or primate-specific), schizophrenia can emerge as a trade-off between our ongoing cognitive evolution and possible molecular flaws of a not yet completed evolutionary process.
Acknowledgements
The Authors would like to thank Melanie Oakes and Valentina Ciobanu for their valuable technical support, Dr. Maria Giulia Bacalini and Dr. Cristian Taccioli for their useful suggestions about the in-silico analyses. This study was funded by the NIH grant NIMH R21-MH115327-01.
Abbreviations
- 1KGP
1000 Genomes Project
- CEU
Europeans
- CHB
Han Chinese in Beijing
- CNS
central nervous system
- CTRL
control individuals
- DLPFC
dorsolateral prefrontal cortex
- eQTL
expression quantitative trait loci
- HAR
human accelerated region
- ITU
Indian Telugus
- LWK
Luhya in Kenya
- nrTE
non-reference transposable element
- PCA
principal component analysis
- SCZ
schizophrenic individuals
- sQTL
alternative splicing quantitative trait loci
- TE
transposable element
- YRI
Yoruba in Nigeria
Footnotes
Conflict of interest
The Authors declare no competing interests.
Supplementary Tables
Supplementary Tables 1–4 are not available with this version.
Tables 1 to 3 are available in the Supplementary Files section.
Supplementary Files
References
- 1.de Koning A. P. J., Gu W., Castoe T. A., Batzer M. A. & Pollock D. D. Repetitive Elements May Comprise Over Two-Thirds of the Human Genome. PLoS Genet. 7, e1002384 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hoyt S. J. et al. From telomere to telomere: The transcriptional and epigenetic state of human repeat elements. Science 376, eabk3112 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Guio L. & González J. New Insights on the Evolution of Genome Content: Population Dynamics of Transposable Elements in Flies and Humans. in Evolutionary Genomics (ed. Anisimova M.) vol. 1910 505–530 (Springer; New York, 2019). [DOI] [PubMed] [Google Scholar]
- 4.Beck C. R., Garcia-Perez J. L., Badge R. M. & Moran J. V. LINE-1 elements in structural variation and disease. Annu. Rev. Genomics Hum. Genet. 12, 187–215 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rangwala S. H., Zhang L. & Kazazian H. H. Many LINE1 elements contribute to the transcriptome of human somatic cells. Genome Biol. 10, R100 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Deininger P. Alu elements: know the SINEs. Genome Biol. 12, 236 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Su M., Han D., Boyd-Kirkup J., Yu X. & Han J.-D. J. Evolution of Alu elements toward enhancers. Cell Rep. 7, 376–385 (2014). [DOI] [PubMed] [Google Scholar]
- 8.Babaian A. & Mager D. L. Endogenous retroviral promoter exaptation in human cancer. Mob. DNA 7, 24 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Emera D. & Wagner G. P. Transformation of a transposon into a derived prolactin promoter with function during human pregnancy. Proc. Natl. Acad. Sci. U. S. A. 109, 11246–11251 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lynch V. J. et al. Ancient transposable elements transformed the uterine regulatory landscape and transcriptome during the evolution of mammalian pregnancy. Cell Rep. 10, 551–561 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hezroni H. et al. Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species. Cell Rep. 11, 1110–1122 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ali A., Han K. & Liang P. Role of Transposable Elements in Gene Regulation in the Human Genome. Life 11, 118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Baillie J. K. et al. Somatic retrotransposition alters the genetic landscape of the human brain. Nature 479, 534–537 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Coufal N. G. et al. L1 retrotransposition in human neural progenitor cells. Nature 460, 1127–1131 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bundo M. et al. Increased L1 Retrotransposition in the Neuronal Genome in Schizophrenia. Neuron 81, 306–313 (2014). [DOI] [PubMed] [Google Scholar]
- 16.Erwin J. A., Marchetto M. C. & Gage F. H. Mobile DNA elements in the generation of diversity and complexity in the brain. Nat. Rev. Neurosci. 15, 497–506 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Guffanti G. et al. LINE1 insertions as a genomic risk factor for schizophrenia: Preliminary evidence from an affected family. Am. J. Med. Genet. Part B Neuropsychiatr. Genet. Off. Publ. Int. Soc. Psychiatr. Genet. 171, 534–545 (2016). [DOI] [PubMed] [Google Scholar]
- 18.Doyle G. A. et al. Analysis of LINE-1 Elements in DNA from Postmortem Brains of Individuals with Schizophrenia. Neuropsychopharmacol. Off. Publ. Am. Coll. Neuropsychopharmacol. 42, 2602–2611 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Fueyo R., Judd J., Feschotte C. & Wysocka J. Roles of transposable elements in the regulation of mammalian transcription. Nat. Rev. Mol. Cell Biol. 23, 481–497 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Goubert C., Zevallos N. A. & Feschotte C. Contribution of unfixed transposable element insertions to human regulatory variation. Philos. Trans. R. Soc. B Biol. Sci. 375, 20190331 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wang L., Norris E. T. & Jordan I. K. Human Retrotransposon Insertion Polymorphisms Are Associated with Health and Disease via Gene Regulatory Phenotypes. Front. Microbiol. 8, 1418 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Payer L. M. et al. Alu insertion variants alter gene transcript levels. Genome Res. 31, 2236–2248 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Payer L. M. et al. Structural variants caused by Alu insertions are associated with risks for many human diseases. Proc. Natl. Acad. Sci. 114, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sekar A. et al. Schizophrenia risk from complex variation of complement component 4. Nature 530, 177–183 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Guffanti G. et al. Novel Bioinformatics Approach Identifies Transcriptional Profiles of Lineage-Specific Transposable Elements at Distinct Loci in the Human Dorsolateral Prefrontal Cortex. Mol. Biol. Evol. 35, 2435–2453 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Misiak B., Ricceri L. & Sąsiadek M. M. Transposable Elements and Their Epigenetic Regulation in Mental Disorders: Current Evidence in the Field. Front. Genet. 10, 580 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Macciardi F. et al. A retrotransposon storm marks clinical phenoconversion to late-onset Alzheimer’s disease. GeroScience 44, 1525–1550 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Terry D. M. & Devine S. E. Aberrantly High Levels of Somatic LINE-1 Expression and Retrotransposition in Human Neurological Disorders. Front. Genet. 10, 1244 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Erady C. et al. Novel open reading frames in human accelerated regions and transposable elements reveal new leads to understand schizophrenia and bipolar disorder. Mol. Psychiatry 27, 1455–1468 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Cheung J. P., Tubbs J. D. & Sham P. C. Extended gene set analysis of human neuro-psychiatric traits shows enrichment in brain-expressed human accelerated regions across development. Schizophr. Res. 246, 148–155 (2022). [DOI] [PubMed] [Google Scholar]
- 31.Doan R. N. et al. Mutations in Human Accelerated Regions Disrupt Cognition and Social Behavior. Cell 167, 341–354.e12 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hubisz M. J. & Pollard K. S. Exploring the genesis and functions of Human Accelerated Regions sheds light on their role in human evolution. Curr. Opin. Genet. Dev. 29, 15–21 (2014). [DOI] [PubMed] [Google Scholar]
- 33.Boyd J. L. et al. Human-chimpanzee differences in a FZD8 enhancer alter cell-cycle dynamics in the developing neocortex. Curr. Biol. CB 25, 772–779 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Franchini L. F. & Pollard K. S. Human evolution: the non-coding revolution. BMC Biol. 15, 89 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Girskis K. M. et al. Rewiring of human neurodevelopmental gene regulatory programs by human accelerated regions. Neuron 109, 3239–3251.e7 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Levchenko A., Kanapin A., Samsonova A. & Gainetdinov R. R. Human Accelerated Regions and Other Human-Specific Sequence Variations in the Context of Evolution and Their Relevance for Brain Development. Genome Biol. Evol. 10, 166–188 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ziffra R. S. et al. Single-cell epigenomics reveals mechanisms of human cortical development. Nature 598, 205–213 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Li H. & Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform.Bioinforma. Oxf. Engl. 25, 1754–1760 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Li H. et al. The Sequence Alignment/Map format and SAMtools. Bioinforma. Oxf. Engl. 25, 2078–2079 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Gardner E. J. et al. The Mobile Element Locator Tool (MELT): population-scale mobile element discovery and biology. Genome Res. 27, 1916–1929 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.1000 Genomes Project Consortium et al. A global reference for human genetic variation. Nature 526, 68–74 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Alexander D. H., Novembre J. & Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Purcell S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Delaneau O., Zagury J.-F. & Marchini J. Improved whole-chromosome phasing for disease and population genetic studies. Nat. Methods 10, 5–6 (2013). [DOI] [PubMed] [Google Scholar]
- 45.Browning S. R. & Browning B. L. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am. J. Hum. Genet. 81, 1084–1097 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Cao X. et al. Polymorphic mobile element insertions contribute to gene expression and alternative splicing in human tissues. Genome Biol. 21, 185 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Pollard K. S. et al. An RNA gene expressed during cortical development evolved rapidly in humans. Nature 443, 167–172 (2006). [DOI] [PubMed] [Google Scholar]
- 49.Pollard K. S. et al. Forces shaping the fastest evolving regions in the human genome. PLoS Genet. 2, e168 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Prabhakar S., Noonan J. P., Pääbo S. & Rubin E. M. Accelerated evolution of conserved noncoding sequences in humans. Science 314, 786 (2006). [DOI] [PubMed] [Google Scholar]
- 51.Bird C. P. et al. Fast-evolving noncoding sequences in the human genome. Genome Biol. 8, R118 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Capra J. A., Erwin G. D., McKinsey G., Rubenstein J. L. R. & Pollard K. S. Many human accelerated regions are developmental enhancers. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 368, 20130025 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Gittelman R. M. et al. Comprehensive identification and analysis of human accelerated regulatory DNA. Genome Res. 25, 1245–1255 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Mallick S. et al. The Simons Genome Diversity Project: 300 genomes from 142 diverse populations. Nature 538, 201–206 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Lewis C. M. Genetic association studies: design, analysis and interpretation. Brief. Bioinform. 3, 146–153 (2002). [DOI] [PubMed] [Google Scholar]
- 56.Crespi B., Summers K. & Dorus S. Adaptive evolution of genes underlying schizophrenia. Proc. Biol. Sci. 274, 2801–2810 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Li J., Yoshikawa A., Brennan M. D., Ramsey T. L. & Meltzer H. Y. Genetic predictors of antipsychotic response to lurasidone identified in a genome wide association study and by schizophrenia risk genes. Schizophr. Res. 192, 194–204 (2018). [DOI] [PubMed] [Google Scholar]
- 58.Hathy E. et al. Generation of multiple iPSC clones from a male schizophrenia patient carrying de novo mutations in genes KHSRP, LRRC7, and KIR2DL1, and his parents. Stem Cell Res. 51, 102140 (2021). [DOI] [PubMed] [Google Scholar]
- 59.Hathy E. et al. Investigation of de novo mutations in a schizophrenia case-parent trio by induced pluripotent stem cell-based in vitro disease modeling: convergence of schizophrenia- and autism-related cellular phenotypes. Stem Cell Res. Ther. 11, 504 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Carlisle H. J. et al. Deletion of Densin-180 Results in Abnormal Behaviors Associated with Mental Illness and Reduces mGluR5 and DISC1 in the Postsynaptic Density Fraction. J. Neurosci. 31, 16194–16207 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Curto Y., Alcaide J., Röckle I., Hildebrandt H. & Nacher J. Effects of the Genetic Depletion of Polysialyltransferases on the Structure and Connectivity of Interneurons in the Adult Prefrontal Cortex. Front. Neuroanat. 13, 6 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Volk D. W., Edelson J. R. & Lewis D. A. Altered expression of developmental regulators of parvalbumin and somatostatin neurons in the prefrontal cortex in schizophrenia. Schizophr. Res. 177, 3–9 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Kröcher T. et al. A crucial role for polysialic acid in developmental interneuron migration and the establishment of interneuron densities in the mouse prefrontal cortex. Dev. Camb. Engl. 141, 3022–3032 (2014). [DOI] [PubMed] [Google Scholar]
- 64.Wu X. et al. Genome-wide study of copy number variation implicates multiple novel loci for schizophrenia risk in Han Chinese family trios. iScience 24, 102894 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Pantazopoulos H. et al. Molecular signature of extracellular matrix pathology in schizophrenia. Eur.J. Neurosci. 53, 3960–3987 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Schizophrenia Working Group of the Psychiatric Genomics Consortium. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Spellmann I. et al. QTc prolongation in short-term treatment of schizophrenia patients: effects of different antipsychotics and genetic factors. Eur. Arch. Psychiatry Clin. Neurosci. 268, 383–390 (2018). [DOI] [PubMed] [Google Scholar]
- 68.Roden D. M. The Brugada ECG and Schizophrenia. Circ. Arrhythm. Electrophysiol. 7, 365–367 (2014). [DOI] [PubMed] [Google Scholar]
- 69.Roberts E. GABAergic malfunction in the limbic system resulting from an aboriginal genetic defect in voltage-gated Na+-channel SCN5A is proposed to give rise to susceptibility to schizophrenia. Adv. Pharmacol. San Diego Calif 54, 119–145 (2006). [DOI] [PubMed] [Google Scholar]
- 70.Suarez N. A., Macia A. & Muotri A. R. LINE-1 retrotransposons in healthy and diseased human brain. Dev. Neurobiol. 78, 434–455 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Ferrari R., Grandi N., Tramontano E. & Dieci G. Retrotransposons as Drivers of Mammalian Brain Evolution. Life Basel Switz. 11, 376 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Byrska-Bishop M. et al. High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios. Cell 185, 3426–3440.e19 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Rishishwar L., Tellez Villa C. E. & Jordan I. K. Transposable element polymorphisms recapitulate human evolution. Mob. DNA 6, 21 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Watkins W. S. et al. The Simons Genome Diversity Project: A Global Analysis of Mobile Element Diversity. Genome Biol. Evol. 12, 779–794 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Kolle G., Georgas K., Holmes G. P., Little M. H. & Yamada T. CRIM1, a novel gene encoding a cysteine-rich repeat protein, is developmentally regulated and implicated in vertebrate CNS development and organogenesis. Mech. Dev. 90, 181–193 (2000). [DOI] [PubMed] [Google Scholar]



