Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2024 Oct 27:2024.10.24.619909. [Version 1] doi: 10.1101/2024.10.24.619909

African origin haplotype protective for Alzheimer’s disease in APOEε4 carriers: exploring potential mechanisms

Luciana Bertholim-Nasciben 1, Karen Nuytemans 1,2, Derek Van Booven 1, Farid Rajabli 1,2, Sofia Moura 1, Aura M Ramirez 1, Derek M Dykxhoorn 1,2, Liyong Wang 1,2, William K Scott 1,2, David A Davis 3, Regina T Vontell 3, Katalina F McInerney 4, Michael L Cuccaro 1,2, Goldie S Byrd 5, Jonathan L Haines 6, Marla Gearing 7, Larry D Adams 1, Margaret A Pericak-Vance 1,2; ADSP8, Juan I Young 1,2, Anthony J Griswold 1,2,*,#, Jeffery M Vance 1,2,*,#
PMCID: PMC11527192  PMID: 39484566

Abstract

APOEε4 is the strongest genetic risk factor for Alzheimer’s disease (AD) with approximately 50% of AD patients carrying at least one APOEε4 allele. Our group identified a protective interaction between APOEε4 with the African-specific A allele of rs10423769, which reduces the AD risk effect of APOEε4 homozygotes by approximately 75%. The protective variant lies 2Mb from APOE in a region of segmental duplications (SD) of chromosome 19 containing a cluster of pregnancy specific beta-1 glycoprotein genes (PSGs) and a long non-coding RNA. Using both short and long read sequencing, we demonstrate that rs10423769_A allele lies within a unique single haplotype inside this region of segmental duplication. We identified the protective haplotype in all African ancestry populations studied, including both West and East Africans, suggesting the variant has an old origin. Long-read sequencing identified both structural and DNA methylation differences between the protective rs10423769_A allele and non-protective haplotypes. An expanded variable number tandem repeat (VNTR) containing multiple MEF2 family transcription factor binding motifs was found associated with the protective haplotype (p-value = 2.9e-10). These findings provide novel insights into the mechanisms of this African-origin protective variant for AD in APOEε4 carriers and supports the importance of including all ancestries in AD research.

Introduction

Individuals with local African (AFR) ancestral genetic background surrounding APOEε4 have decreased AD risk associated with the APOEε4 allele compared to individuals of European ancestry, thereby demonstrating some natural protective effect in the AFR background. 1,2 We recently identified a strong protective interaction between APOEε4 and the A allele of rs10423769, which reduces the AD risk effect of APOEε4 homozygotes by approximately 75% in African Americans (AA), AFR individuals from Ibadan (Nigeria), and genetically admixed Puerto Rican individuals. 3 The frequency of the minor rs10423769 A allele is ~12% in AFR but only ~0.03% in EUR populations. This protective locus is located 2Mb upstream of APOE and lies within a highly segmentally duplicated region of chr19 containing a cluster of pregnancy-specific β-1 glycoprotein (PSG) genes as well a long non-coding RNA (ENSG00000282943) (Figure 1). Interestingly, the PSG genes are primarily expressed in the placenta with little, if any, expression in the brain. 4,5

Figure 1.

Figure 1.

UCSC browser view of surrounding region of rs10423769_A alelle (marked by vertical red line). The annotation of segmental duplication includes all non-allelic intrachromosomal and interchromosomal alignments greater than 1 kb and with more than 90% of sequence identity, excluding common repeats or satellite sequences.

The mechanisms involved in the protective interaction of this locus with APOEε4 are unknown. However, molecular investigation of the protective locus is complicated by its genomic context. The PSG region is rich in segmental duplications (SDs) that are difficult to resolve using short-read sequencing data, as traditional sequencing reads are not long enough to be precisely aligned to a specific locus in these repetitive regions. 6 In addition, SDs constitute hotspots of recurrent rearrangement by nonallelic homologous recombination, resulting in high occurrence of copy number variations (CNVs), gene conversion, and structural variants (SVs) 7,8. SVs could affect chromatin structure, which we have previously described as differing between AFR and EUR brain astrocytes. 9,10 In addition, the SD region of the PSGs has been previously reported to have many deletions and duplications that vary between ancestries. 4

To gain insight into its mechanisms of protection, we performed an initial genetic characterization of this protective locus. First, we refined the haplotype harboring the protective locus using a large, short read-sequencing database, including individuals from diverse ancestries from the Alzheimer’s Disease Sequencing Project (ADSP). Second, we used long-read sequencing to confirm the location of the haplotype in this highly segmentally duplicated region of chromosome 19. Third, we used the long-read sequencing data to identify the SVs and DNA methylation profiles in the PSG region. Finally, we explored an expanded variable number tandem repeat (VNTR) enriched in multiple MEF2 family transcription factors binding motifs present in this genomic region. These findings provide novel insights into the potential mechanisms underlying this AFR-origin protective variant for AD in APOEε4 carriers. Although biologically complex, a better understanding of the genetic and molecular factors involved in the protection against risk of APOEε4 driven by this region presents a therapeutic opportunity for all ancestries.

Methods

Haplotype block analysis

We extracted phased genotype data from rs10423769_A allele carriers from the Alzheimer’s Disease Sequencing Project Release 4 (ADSP R4: ng00067.v10) of short-read whole genome sequencing (Web resources). In the region of 70 kb surrounding rs10423769, we selected all SNPs with a MAF ≥ 0.05 and Hardy–Weinberg equilibrium (HWE) exact test p-value > 0.001 in the African population of the 1000 Genomes project (1000G) dataset for analysis (n=301). 11 Haploview 4.2 software was used to define the haplotype blocks.12 Plink v1.90 13 (Web resources) was used to calculate Linkage Disequilibrium (LD) using ADSP R4. LDhap from the LDlink (5.6.5 Release) web-based application 14 (Web resources) was used to map and calculate frequencies of the haplotypes harboring the rs10423769_A allele across population groups from the 1000G Phase 3.

Long read whole genome sequencing (LRWGS)

LRWGS was generated from DNA extracted from either cerebellum samples excised from frozen brain or peripheral blood samples from 38 individuals that were heterozygous or homozygous for rs10423769_A, or non-carriers, as described in Supplementary Table 1. Brain samples were obtained from the biorepository of the John P. Hussman Institute for Human Genomics (HIHG) and Brain Endowment Bank at the University of Miami, as well as Emory University Goizueta Alzheimer’s Disease Research Center (GADRC) Brain Bank. Blood samples were obtained from participants as part of ongoing research projects in studying Alzheimer’s disease in individuals of African ancestries (AG052410 and AG072547, P.I. M. Pericak-Vance).

DNA was extracted in the HIHG biorepository using the AutoGen FlexSTAR using standard procedures without further size selection. Libraries were constructed using the SQK-LSK109 ligation kit from Oxford Nanopore Technology (ONT). Samples were loaded onto PromethION R9.4.1 flow cells and sequenced in 72-hour data acquisition runs on the PromethION24 device. Base calling was performed with Guppy version 3.3.2 which simultaneously produces MM and ML methylation tags in the unaligned bam file. Resulting bam files were then converted to fastq files (samtools v1.2) preserving these tags in the meta data for each read). Resulting FASTQs were aligned to GRCh38 using minimap2 v2.17-r941 where the methylation tags were preserved from the previous step. Small variant calling was performed with Clair3 (v1.0.3). Sniffles2 (v2.0.7) was used for structural variant calling (default parameters were used individually on each sample, then a joint call was performed with default parameters with sniffles2). Aligned BAM files were examined using the Integrative Genome Viewer v.2.4.10 (Web resources) in Third Gen quick consensus mode.

Local assembly and motif finding

TREAT (Tandem REpeat Annotation Toolkit) assembly tool with Otter 15 (Web resources) was used for local assembly of tandem repeat regions in chr19 and annotation of VNTRs. The association between rs10423769 allele and the tandem repeat region length measured in bp was evaluated using Wilcoxon rank sum test. MEME Suite 16 (Web resources) was used for de novo motif finding and the algorithm FIMO (version 5.5.5) 17 within MEME Suite was used to identify known transcription factors (TF) binding motifs in the region. We screened defined transcription factors binding site databases JASPAR CORE 2022 (vertebrates non-redundant) 18 and selected those binding sites with q-value < 0.001.

Allele-specific differential methylation analysis

BAM files with methylation tags were phased by Longshot v0.4.5 19 using a region of +/− 40 kb from rs10423769 with a minimum coverage of 10 and minimum alternative allele fraction of 0.35. Allele-specific bedmethyl files with aggregation of modified bases were obtained with the tool modkit v0.1.12 (Web resources) with the options --partition-tag HP, --combine-strands, --cpg, and --ignore h. The DMA module from Nanomethphase v 1.2.0 20 were used for differential methylation analysis with -smf set to FALSE.

Results

Haplotype block analysis (based on short-read ADSP data)

We determined the composition of the haplotype harboring the rs10423769_A using the ADSP R4 set of short read whole genome sequencing. We identified 1,962 individuals carrying at least one rs10423769_A allele. Six haplotype blocks were identified in the region surrounding rs10423769_A (Figure 2). No recombination is observed between the most frequent haplotypes (~53%) in block 1, block 2 (harboring the rs10423769_A) and the remaining blocks 3-6. Supporting this, rs10423769 has a D' > 0.95 (LOD ≥ 2) with all SNPs from these six blocks, with the exception of two positions (rs8107144 and rs7250796) with D' < 0.3. Thus, we determined that the minimum shared haplotype harboring rs10423769_A allele was approximately 21kb (chr19:43099521-43120243) spanning six haplotype blocks. An extended haplotype including blocks 7-9 (chr19:43121359- 43132912) was also identified with a D' = 0.76 between blocks 6 and 7.

Figure 2.

Figure 2.

Haplotype blocks identified in 1,962 individuals carrying at least one rs10423769_A allele using Haploview 4.2 and ADSP. Lines between blocks indicate co-occurrence of adjacent haplotypes in individuals with line thickness representing frequency of co-occurrence across individuals. Haplotype block frequencies are shown in the right of each block (≥ 0.05). Multiallelic D' is shown on the bottom of crossing areas, which represents the level of recombination between blocks. Blocks with D' > 0.8 were considered the same haplotype. Rs10423769 is marked by an orange box. The 21kb (chr19:43099521-43120243) minimum shared haplotype is marked by a red box and the 11 kb extended haplotype (chr19:43121359- 43132912) is marked by blue boxes.

In order to support these findings, we performed LD analysis using r2 statistics and the entire ADSP R4 dataset of ~36,000 individuals, which showed almost perfect linkage (r2 >0.95) between rs10423769_A and 13 markers distributed along the six blocks of the 21 kb minimum shared haplotype (Supplementary Table 2). Eleven markers distributed along the 11 kb extended haplotype showed r2 >0.75.

Frequency of the haplotype across populations

Using the allele frequency information from 1000G to verify the population-specific frequency of the 21 kb minimum shared haplotype, we found the same haplotype across multiple admixed populations containing African ancestry (African American, Afro-Caribbean, and Puerto Ricans admixed populations) (Figure 3), but not in individuals of Mexican, Peruvian, East Asian, South Asian or European populations. Interestingly, the haplotype frequency was similar between West (Nigeria) and East (Kenya) populations.

Figure 3.

Figure 3.

Population-specific frequency of the minimum shared haplotype harboring the rs10423769_A allele in 1000G. LD pruned haplotype across 10 blocks determined by Haploview are shown. Frequencies were calculated with the tool LDhap from LDlink web-based application.

Validation of minimum shared haplotype with LRWGS

Since rs10423769_A allele is in an area of segmental duplication, LRWGS was used to validate whether rs10423769 is a true variant or an artifact of incorrect mapping in the SD region. LRWGS of 38 brain and blood samples including rs10423769_A homozygous, heterozygous and non-carriers (Supplementary Table 1) was performed with ONT yielding an average genome coverage of 21.5 ± 5.3X and average read length of 10.4 ± 3.0 kb. The average depth of high-quality reads (MAPQ ≥ 60) for the rs10423769_A position (chr19:43100929) was 16.0 ± 4.9X and all reads mapped uniquely for this specific position. In addition, analysis of potential secondary alignments of reads spanning the 21 kb minimum shared haplotype confirmed that the variant pattern identified along the haplotype block are specific to this region and not found contiguously in any other region of the genome (Figure 4). The frequency of variants in almost perfect linkage disequilibrium (r2 >0.95) were confirmed in the LRWGS variant calling (Supplementary Table 2).

Figure 4.

Figure 4.

Read the coverage and map of the 21 kb minimum shared haplotype from two representative individuals homozygous for either the rs10423769_A or rs10423769_G alleles. The green star marker represents the rs10423769_A allele, and all other listed markers are in LD>0.95 with rs10423769_A. Colored lines represent IGV consensus SNPs: A, green; C, blue; T, red; G, orange.

Analysis of structural variation

After mapping the haplotype associated with rs10423769_A, we explored other genomic and epigenomic features that could contribute to the functional mechanisms involved in the protection afforded by this locus. The LRWGS data were used to detect germline SVs in the 2 Mb region (chr19: 43000000-45000000) spanning from the protective locus to APOE. Sniffles2 identified 87 SVs in the region, including 42 deletions, 43 insertions, and two breakends. One insertion located at chr19:43132126 (32 kb from rs10423769) was identified with a frequency of 0.80 in rs10423769_A homozygotes and 0.07 in non-carriers. This insertion is located in an annotated repetitive region (772bp, chr19:43,131,850-43,132,621) (Web resources) approximately 32 kb from the protective locus, on block 9 of the expanded haplotype (Figure 2). Further investigation of the 772bp region of the reference genome indicates the presence of a VNTR with repetitions of a 29 bp tandem repeat pattern. Local assembly of this region revealed that the A allele is associated with expanded VNTR alleles (p-value = 2.944e-10, Figure 5) containing a higher number of the 29 bp tandem repeat. We analyzed the region for the presence of motifs and known TF binding sites and identified that the 29 bp repetitive sequence carries predicted binding sites motifs for the MEF2 family of transcription factors. MEF2D had the highest score by FIMO motif finding tool, 17,18 followed by MEF2B and MEF2A (Figure 6).

Figure 5.

Figure 5.

VNTR length correlation with rs10423769 haplotype (p-value = 2.944e-10).

Figure 6.

Figure 6.

Motif analysis of the expanded VNTR allele associated with rs10423769_A. A. 29 bp Repetitive sequence identified by MEME Suite 16. B. MEF2 family of TF binding motifs with FIMO motif finding tool q-value 17,18.

Differential methylation analysis

We also hypothesized that the mechanism of protection could be related to differential DNA methylation at the haplotype. Thus, we performed allele-specific methylation analysis of five heterozygote rs10423769_A/G brain samples to evaluate methylation differences in the 21 kb minimum shared haplotype and surrounding region. We identified 17 differentially methylated positions (DMP) (FDR < 0.01) comparing rs10423769_A to rs10423769_G haplotypes. Further analysis indicated that differences in methylation on those positions occurred due to base changes in the haplotype sequences between rs10423769_A to rs10423769_G haplotypes, with gain or loss of CpG sites (Figure 7).

Figure 7.

Figure 7.

Allele-specific differential methylation analysis in the rs10423769_A allele 21 kb minimum shared haplotype and surrounding region using 5 brain samples heterozygous for rs10423769. The dotted line indicates FDR <0.01. “Methylation diff” refers to the difference in mean methylation levels between the rs10423769_A and rs10423769_G haplotypes. “Methylation diff” is shown for DMP with FDR < 0.01.

Discussion

Herein we report the initial efforts to characterize the protective locus tagged by the A allele of rs10423769 which reduces the AD risk effect in AFR ancestry APOEε4 homozygotes by approximately 75%. Understanding the mechanisms involved in the protective effect is challenging, since the variant is located 2 Mb away from APOE, in a large area of SD containing a PSG gene cluster composed of 10 genes (PSG1-9, PSG11). Given the SD in the region, one of the first questions to answer was whether the locus was unique or duplicated. We demonstrate that it is a unique feature lying in a large area of segmental duplications. We confirmed that the minimum shared haplotype is from AFR origin and is found in all AFR populations represented in 1000 Genomes and admixed AFR populations, but not in Mexican ancestry, Peruvian, East Asian, South Asian and European populations. The presence of the haplotype in both Western and Eastern African populations suggests it is likely old in its AFR origin.

The haplotype of ~21kb shared by all rs10423769_A carriers and the extended haplotype of another 11 kb in high LD with rs10423769_A (r2 = 0.75) overlap with the lncRNA (PSG11-AS1/ENSG00000282943) (Supplementary figure 1). Overall, this lncRNA is expressed in all tissues at very low levels in the cerebral cortex according to The Genotype-Tissue Expression (GTEx) Project Release V8 (Web resources), with cerebellum, cerebellar hemisphere, and cultured fibroblasts having higher levels of expression. 21 In contrast, the PSG genes belong to family of glycoproteins that are primarily expressed in the human placenta. 4,5 Interestingly, data from the Allen Institute for Brain Science (Web resources) suggest that APOE has its highest overall expression just before and after birth. More studies will be needed to elucidate any potential involvement of the PSG genes or lncRNA directly in the protective effect on APOEε4.

The PSG locus was previously reported to have a higher frequency of copy number variations, deletions and duplications, compared to the genome average, with differences in frequency and distinct breakpoints between AFR and non-AFR haplotypes.4,22 However, given the duplication pattern in this region, the reliability of such reports is uncertain. Thus, we used LRWGS around the SD to characterize the region and the protective and non-protective haplotypes. We did not see obvious evidence of the SV patterns previously reported. This may be because of the technological limitations of these previous reports that cannot account for potential changes in segmental duplication patterns between individuals that may be reported as SVs. Gaining a better understanding of the genomic structure of this locus is critical since it is possible that differences in SVs or pattern of SDs between the protective and the risk haplotypes could influence chromatin interactions or other regulatory mechanisms in the area. In fact, it has been shown that chromatin reorganization happening during cellular aging leads to the re-expression of PSG genes, 23 suggesting a possibility of differences in SV affecting PSG gene expression. While the depth of reads allowed us to validate the structure of the protective haplotype, it was not enough to allow assembly of the entire 0.55 Mb SD region. This will require a much higher read depth, perhaps as high as 100x.

Several neurodegenerative diseases have been associated with VNTRs 24 in general and AD specifically. For example, increased length of a 25 bp repeat unit located in intron 18 of ABCA7 was associated with increased AD risk. 25 Interestingly, the protective haplotype was associated with expanded VNTR alleles which are enriched for a 29 bp motif with multiple MEF2 binding motifs. These types of clusters and VNTRs can be found in many areas of the genome, but what is compelling here is the significantly larger VNTR associated with the protective haplotype vs. the non-protective haplotype. The MEF2 family of TF have an important role during both development and adulthood, participating in neuronal development, synaptic plasticity, cognitive reserve and neurodegenerative diseases, by controlling the expression of several genes and miRNAs 26. The expression of all MEF2 isoforms is high in the brain, with the expression of MEF2A and MEF2D increasing with neuronal differentiation and maturation, whereas the expression of MEF2C remains relatively stable throughout. 27 Barker et al. (2022) found that MEF2 transcriptional network demonstrated the strongest association with predictive good cognition towards the end of life. Overexpression of MEF2A/C in a mouse model of tauopathy had positive effects on cognitive flexibility. 28 However, other authors suggest MEF2 has a negative effect on memory function. 29 Thus, depending on the interaction with other co-factors, such as chromatin-modifying enzymes or polymerase complex, and the cell type, MEF2s can either activate or suppress gene expression 27. It was reported that a MEF2A variant (p.Pro279Leu), which decreases MEF2A’s function in transcriptional activity, was significantly enriched in LOAD patients. 30 In addition, elevated methylation at an enhancer region of MEF2A that reduced MEF2A expression has been reported in AD, 31 further linking MEF2 decreased activity with AD. Further studies are needed to investigate if and how this VNTR may contribute to lowering APOEε4 risk.

Differences in methylation status between AD cases and controls have been noted, including in the APOE region. 32-34 We identified several DMP in the protective haplotype, 2 Mb away from APOE. Long-range effects of methylation as far as 10 Mb from promoter regions have been documented to play a role in regulation of gene expression. 35 Therefore, further follow-up of these DMPs is needed to determine the role of these alterations on APOE expression specifically or other genes in the region, and on AD risk in general.

One important question to be addressed in the future is whether the protective association of rs10423769_A with APOEε4 involves lowering of APOEε4 expression. Single nuclei RNA-sequencing data from our group suggests that the expression of APOE is much lower in one individual who is an APOE ε4/ε4 carrier and homozygote for rs10423769_A when compared to rs10423769_G carriers. 9 Efforts are underway to identify brain material carrying both the rs10423769_A allele and APOE4, but the low availability of tissue from the African ancestry population makes this more challenging.

Overall, while dozens of risk variants for AD have been described over the last decade, protective variants for AD have received less attention. Through evolution, protective mechanisms, usually with minimal side effects, have been established naturally. Increasing the number of studies on these protective variants to understand their processes is essential for the advancement of therapeutics in AD. In addition, our study illustrates the importance of including diverse populations in genetics studies to ensure broad representation and open opportunities to uncover and understand Alzheimer disease and biological mechanisms from a wider perspective.

Supplementary Material

Supplement 1
media-1.pdf (190.2KB, pdf)

Acknowledgments

This research was supported by the National Institute on Aging through the following grant numbers: U01AG072579, RF1AG059018, R01AG070864, R01AG072547, U01AG066767, P30AG066511, and U01AG052410. Additional funding was received by a Zenith Award (Alzheimer’s Association, J.M.V), the BrightFocus foundation (A2018425S, J.M.V) and Alzheimer's Association Research Fellowship to Promote Diversity (AARFD-24-1309441, L.B.N.). We are grateful to the many participants, researchers, and staff who contributed significantly to this study.

Footnotes

Data and Code Availability

Short read WGS from the ADSP is available via NIAGADS (ADSP R4: ng00067.v10) (Web resources). Long read sequencing is available upon request from the corresponding author. No custom code was created for this manuscript. All data processing was performed using publicly available software as referred to in the Methods.

Declaration of interests

The authors declare no competing interests.

Web resources

Alzheimer’s Disease Sequencing Project Release 4, https://adsp.niagads.org/

Plink v1.90, www.cog-genomics.org/plink/1.9/

LDhap and LDlink, https://ldlink.nih.gov/

Integrative Genome Viewer (IGV), http://software.broadinstitute.org/software/igv/

TREAT (Tandem REpeat Annotation Toolkit), https://github.com/holstegelab/treat/

MEME Suite, https://meme-suite.org/meme/tools/meme/

FIMO, https://meme-suite.org/meme/tools/fimo/

Modkit, https://github.com/nanoporetech/modkit/

Tandem repeat annotation, https://github.com/PacificBiosciences/pbsv/tree/master/annotations/

BrainSpan Atlas of the Developing Human Brain (Allen Institute for Brain Science), https://www.brainspan.org/rnaseq/search/index.html

Genotype-Tissue Expression (GTEx), https://www.gtexportal.org/

References

  • 1.Rajabli F., Feliciano B.E., Celis K., Hamilton-Nelson K.L., Whitehead P.L., Adams L.D., Bussies P.L., Manrique C P., Rodriguez A., Rodriguez V., et al. (2018). Ancestral origin of ApoE ε4 Alzheimer disease risk in Puerto Rican and African American populations. PLoS Genet 14, 1–13. 10.1371/journal.pgen.1007791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Blue E.E., Horimoto A.R.V.R., Mukherjee S., Wijsman E.M., and Thornton T.A. (2019). Local ancestry at APOE modifies Alzheimer’s disease risk in Caribbean Hispanics. Alzheimer’s & Dementia 15, 1524–1532. 10.1016/j.jalz.2019.07.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Rajabli F., Beecham G.W., Hendrie H.C., Baiyewu O., Ogunniyi A., Gao S., Kushch N.A., Lipkin-Vasquez M., Hamilton-Nelson K.L., Young J.I., et al. (2022). A locus at 19q13.31 significantly reduces the ApoE ε4 risk for Alzheimer’s Disease in African Ancestry. PLoS Genet 18, e1009977. 10.1371/journal.pgen.1009977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Chang C.L., Semyonov J., Cheng P.J., Huang S.Y., Park J. Il, Tsai H.-J., Lin C.-Y., Grützner F., Soong Y.K., Cai J.J., et al. (2013). Widespread Divergence of the CEACAM/PSG Genes in Vertebrates and Humans Suggests Sensitivity to Selection. PLoS One 8, e61701. 10.1371/journal.pone.0061701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Moore T., and Dveksler G.S. (2014). Pregnancy-specific glycoproteins: complex gene families regulating maternal-fetal interactions. Int J Dev Biol 58, 273–280. 10.1387/ijdb.130329gd. [DOI] [PubMed] [Google Scholar]
  • 6.Vollger M.R., Dishuck P.C., Harvey W.T., DeWitt W.S., Guitart X., Goldberg M.E., Rozanski A.N., Lucas J., Asri M., Abel H.J., et al. (2023). Increased mutation and gene conversion within human segmental duplications. Nature 2023 617:7960 617, 325–334. 10.1038/s41586-023-05895-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Bailey J.A., Gu Z., Clark R.A., Reinert K., Samonte R. V., Schwartz S., Adams M.D., Myers E.W., Li P.W., and Eichler E.E. (2002). Recent segmental duplications in the human genome. Science (1979) 297, 1003–1007. 10.1126/science.1072047. [DOI] [PubMed] [Google Scholar]
  • 8.Sharp A.J., Locke D.P., McGrath S.D., Cheng Z., Bailey J.A., Vallente R.U., Pertz L.M., Clark R.A., Schwartz S., Segraves R., et al. (2005). Segmental duplications and copy-number variation in the human genome. Am J Hum Genet 77, 78–88. 10.1086/431652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Celis K., Moreno M.D.M.M., Rajabli F., Whitehead P., Hamilton-Nelson K., Dykxhoorn D.M., Nuytemans K., Wang L., Flanagan M., Weintraub S., et al. (2023). Ancestry-related differences in chromatin accessibility and gene expression of APOE ε4 are associated with Alzheimer’s disease risk. Alzheimer’s & Dementia. 10.1002/ALZ.13075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Griswold A.J., Celis K., Bussies P.L., Rajabli F., Whitehead P.L., Hamilton-Nelson K.L., Beecham G.W., Dykxhoorn D M., Nuytemans K., Wang L., et al. (2021). Increased APOE ε4 expression is associated with the difference in Alzheimer’s disease risk from diverse ancestral backgrounds. Alzheimer’s and Dementia 17, 1179–1188. 10.1002/alz.12287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Auton A., Abecasis G.R., Altshuler D.M., Durbin R.M., Bentley D.R., Chakravarti A., Clark A.G., Donnelly P., Eichler E.E., Flicek P., et al. (2015). A global reference for human genetic variation. Nature 526, 68–74. 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wang N., Akey J.M., Zhang K., Chakraborty R., and Jin L. (2002). Distribution of recombination crossovers and the origin of haplotype blocks: The interplay of population history, recombination, and mutation. Am J Hum Genet 71, 1227–1234. 10.1086/344398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Purcell S., Neale B., Todd-Brown K., Thomas L., Ferreira M.A.R., Bender D., Maller J., Sklar P., De Bakker P.I.W., Daly M.J., et al. (2007). PLINK: A tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81, 559–575. 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Machiela M.J., and Chanock S.J. (2015). LDlink: A web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics 31, 3555–3557. 10.1093/bioinformatics/btv402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Tesi N., Salazar A., Zhang Y., van der Lee S., Hulsman M., Knoop L., Wijesekera S., Krizova J., Schneider A.-F., Pennings M., et al. (2024). Characterising tandem repeat complexities across long-read sequencing platforms with TREAT and otter. bioRxiv, 2024.03.15.585288. 10.1101/2024.03.15.585288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Bailey T.L., Johnson J., Grant C.E., and Noble W.S. (2015). The MEME Suite. Nucleic Acids Res 43, W39–W49. 10.1093/nar/gkv416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Grant C.E., Bailey T.L., and Noble W.S. (2011). FIMO: scanning for occurrences of a given motif. Bioinformatics 27, 1017–1018. 10.1093/bioinformatics/btr064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Castro-Mondragon J.A., Riudavets-Puig R., Rauluseviciute I., Berhanu Lemma R., Turchi L., Blanc-Mathieu R., Lucas J., Boddie P., Khan A., Manosalva Pérez N., et al. (2022). JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles. Nucleic Acids Res 50, D165–D173. 10.1093/nar/gkab1113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Edge P., and Bansal V. (2019). Longshot enables accurate variant calling in diploid genomes from single-molecule long read sequencing. Nat Commun 10. 10.1038/s41467-019-12493-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Akbari V., Garant J.M., O’Neill K., Pandoh P., Moore R., Marra M.A., Hirst M., and Jones S.J.M. (2021). Megabase-scale methylation phasing using nanopore long reads and NanoMethPhase. Genome Biol 22, 1–21. 10.1186/s13059-021-02283-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Carithers L.J., Ardlie K., Barcus M., Branton P.A., Britton A., Buia S.A., Compton C.C., DeLuca D.S., Peter-Demchok J., Gelfand E.T., et al. (2015). A Novel Approach to High-Quality Postmortem Tissue Procurement: The GTEx Project. Biopreserv Biobank 13, 311–319. 10.1089/bio.2015.0032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Sudmant P.H., Rausch T., Gardner E.J., Handsaker R.E., Abyzov A., Huddleston J., Zhang Y., Ye K., Jun G., Fritz M.H.Y., et al. (2015). An integrated map of structural variation in 2,504 human genomes. Nature 526, 75–81. 10.1038/nature15394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Liu Z., Ji Q., Ren J., Yan P., Wu Z., Wang S., Sun L., Wang Z., Li J., Sun G., et al. (2022). Large-scale chromatin reorganization reactivates placenta-specific genes that drive cellular aging. Dev Cell 57, 1347–1368.e12. 10.1016/j.devcel.2022.05.004. [DOI] [PubMed] [Google Scholar]
  • 24.Wang H., Wang L.-S., Schellenberg G., and Lee W.-P. (2023). The role of structural variations in Alzheimer’s disease and other neurodegenerative diseases. Front Aging Neurosci 14. 10.3389/fnagi.2022.1073905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.De Roeck A., Duchateau L., Van Dongen J., Cacace R., Bjerke M., Van den Bossche T., Cras P., Vandenberghe R., De Deyn P.P., Engelborghs S., et al. (2018). An intronic VNTR affects splicing of ABCA7 and increases risk of Alzheimer’s disease. Acta Neuropathol 135, 827–837. 10.1007/s00401-018-1841-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Lisek M., Przybyszewski O., Zylinska L., Guo F., and Boczek T. (2023). The Role of MEF2 Transcription Factor Family in Neuronal Survival and Degeneration. Int J Mol Sci 24. 10.3390/ijms24043120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Assali A., Harrington A.J., and Cowan C.W. (2019). Emerging roles for MEF2 in brain development and mental disorders. Curr Opin Neurobiol 59, 49–58. 10.1016/j.conb.2019.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Barker S.J., Raju R.M., Wang J., Davila-velderrain J., Abdurrob F., Milman N.E.P., Abdelaal K., Ashley L., Yu L., Bennett D.A., et al. (2022). MEF2s are key regulators of cognitive function and confer resilience to neurodegeneration. 13. 10.1126/scitranslmed.abd7695.MEF2s. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Rashid A.J., Cole C.J., and Josselyn S.A. (2014). Emerging roles for MEF2 transcription factors in memory. Genes Brain Behav 13, 118–125. 10.1111/gbb.12058. [DOI] [PubMed] [Google Scholar]
  • 30.González P., Álvarez V., Menéndez M., Lahoz C.H., Martínez C., Corao A.I., Calatayud M.T., Peña J., García-Castro M., and Coto E. (2007). Myocyte enhancing factor-2A in Alzheimer’s disease: Genetic analysis and association with MEF2A-polymorphisms. Neurosci Lett 411, 47–51. 10.1016/j.neulet.2006.09.055. [DOI] [PubMed] [Google Scholar]
  • 31.Li H., Wang F., Guo X., and Jiang Y. (2021). Decreased MEF2A Expression Regulated by Its Enhancer Methylation Inhibits Autophagy and May Play an Important Role in the Progression of Alzheimer’s Disease. Front Neurosci 15, 1–15. 10.3389/fnins.2021.682247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Shireby G., Dempster E.L., Policicchio S., Smith R.G., Pishva E., Chioza B., Davies J.P., Burrage J., Lunnon K., Seiler Vellame D., et al. (2022). DNA methylation signatures of Alzheimer’s disease neuropathology in the cortex are primarily driven by variation in non-neuronal cell-types. Nat Commun 13, 1–14. 10.1038/s41467-022-33394-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Foraker J., Millard S.P., Leong L., Thomson Z., Chen S., Keene C.D., Bekris L.M., Yu C.E., and Fischer A. (2015). The APOE Gene is Differentially Methylated in Alzheimer’s Disease. Journal of Alzheimer’s Disease 48, 745–755. 10.3233/JAD-143060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Liu J., Zhao W., Ware E.B., Turner S.T., Mosley T.H., and Smith J.A. (2018). DNA methylation in the APOE genomic region is associated with cognitive function in African Americans. BMC Med Genomics 11, 1–13. 10.1186/s12920-018-0363-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kim S., Park H.J., Cui X., and Zhi D. (2020). Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer. Sci Rep 10, 1–12. 10.1038/s41598-020-60845-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1
media-1.pdf (190.2KB, pdf)

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES