Abstract
Transposable elements (TEs) have played a significant role in the evolution of host genome by triggering genomic rearrangements. TEs have been studied in various research fields, ranging from population genomics to personalized medicines. Human-specific TEs and TEs existing in the human genome have been well studied. Unlike them, non-human primate-specific TEs remain shrouded in mystery. However, the study of TE-mediated genomic or genetic variations through comparative genomics is essential to understand mechanisms which TEs utilize to modify species-specific genome architecture and to cause species-specific diseases, Therefore, we have studied chimpanzee-specific TEs as well as human-specific TEs. At first, we identified human-specific HERV-K integrated into the human genome after the divergence of human and chimpanzee. Then, for a comparative study of HERV-Ks and non-human ERVs, we extracted chimpanzee-specific endogenous retroviruses (PtERVs) from the chimpanzee genome. We identified 256 chimpanzee-specific PtERVs and characterized them, focusing on their estimated evolutionary age, polymorphism level in chimpanzee populations, and potential impact on the difference between the human and chimpanzee genomes.
Keywords: chimpanzee genome, chimpanzee-specific endogenous retrovirus (PtERV), polymorphism, retrotransposition, transcriptome diversification
Identification of the Chimpanzee-Specific PtERVs
After the completion of human whole-genome sequencing, the whole-genome sequences of various primates including chimpanzee, gorilla, orangutan, and rhesus monkey were also been sequenced and released, which leads to comparative genomics.1-5 The first version of human whole-genome sequence had a lot of unsequenced regions and assembly errors. Enormous efforts have been put to improve the sequence data, and most of the errors are corrected in the latest version of human reference genome sequence. Human and the non-human primate genome sequences are available with UCSC genome browser (http://genome.ucsc.edu), and species-specific endogenous retrovirus can be retrieved through the UCSC utility. However, the quality of the latest version of the chimpanzee and other primate genome sequences is not good as much as that of the human genome sequence, and the reference genome sequence data still contain unsequenced genomic regions and assembly errors. The relatively poor quality of non-human primate genome sequences is somewhat troublesome because it leads to a high proportion of false positives of TEs. Thus, after computational mining of TEs from the sequence database, the extracted elements have to be experimentally validated, which limits the study of TEs in non-human primates. Despite the difficulty, we tried to study ERVs which exist in the chimpanzee genome. Based on chimpanzee reference genome sequence (PanTro3; October 2010 released), we computationally mined a total 693 of PtERV candidates using UCSC table browser and annotated these elements using RepeatMasker. And then, these elements including their 2kb of flanking sequences were aligned with orthologous sequences of human (hg19; February 2009 released), gorilla (gorGor3; May 2011 released), and orangutan (ponAbe2; July 2007 released) reference genomes. Through these approaches, we identified 256 chimpanzee-specific PtERVs which were not present in the human, gorilla and orangutan genomes. Chimpanzee is the closest species to human and ∼95% of its whole-genome sequence is shared with the human genome sequence.6 TEs are partially responsible for the 5% genomic difference between humans and chimpanzees. Thus, we expected that the study of chimpanzee ERVs would shed a light in not only chimpanzee-specific ERV evolution but also the genomic difference caused by species-specific ERVs between humans and chimpanzees. We previously published an article about human-specific HERV-K.7 Twenty nine human-specific HERV-K insertions were identified in the study (Fig. 1). Later, we analyzed chimpanzee-specific PtERVs and their roles in triggering genomic variations.8
Figure 1.

The proportion of human-specific HERV-Ks and chimpanzee-specific PtERVs in the human and chimpanzee genomes, respectively. The left diagram regarding HERV-Ks in the human genome was drawn based on the Buzdin et al. (2003) and Shin et al. (2013). The right diagram regarding PtERVs in the chimpanzee genome was drawn based on the Mun et al. (2014). The number of black lollipops means the number of retrotransposition-competent ERVs in each host genome.
Characterization of Chimpanzee-Specific PtERVs
Chimpanzee-specific endogenous retrovirus (PtERV) is fossilized in the chimpanzee genome through infection into germ line and retrotransposition after divergence of the human and chimpanzee genomes.9 Similar to the human endogenous retrovirus (HERV), PtERV has a classical structure, composing of 3 ORFs (gag, pol, and env) encoding essential viral machineries and long terminal repeat (LTR) existing in both ends of the internal ORF region.10 We computationally extracted 256 chimpanzee-specific PtERV candidates.8 They exist in 3 different types in the chimpanzee genome: full-length PtERV, solitary LTR, and truncated PtERV. Among them, 14 full-length PtERVs and 23 solitary LTRs were new in that they had not been reported in any of previous studies (Fig. 1).9,10 The chimpanzee-specific PtERVs located on chimpanzee chromosomes 7, 13, and Y; Y chromosome had 15 full-length PtERVs.8 It was suggested that the host genome has adapted various mechanisms to eliminate TEs or weaken their effects during evolution.11,12 One of the mechanisms is a homologous recombination. Homologous recombination occurs between PtERVs due to a high level of sequence identity between the elements, leading to the elimination of the element. Considering that recombination rate is relatively low on the chromosome Y, compared with other chromosomes, the accumulation of full-length PtERVs on the chromosome Y is well explained.13,14 Given that there was no preferential insertion site for PtERV insertions, it is likely that PtERVs randomly insert into the chimpanzee genome, and then are fixed or eliminated from the host genome via the host defense mechanism.15
Unlike human-specific HERV-Ks whose majority is solitary LTRs, 121 out of 256 chimpanzee-specific PtERVs are full-length (Fig. 1). It is a remarkable difference between human-specific HERV-K and chimpanzee-specific ERV.7,8 Compared with human LTRs, there was a high level of sequence divergence between PtERV LTRs. The divergent sequence reduces the homologous recombination between PtERV LTRs. One of the mechanisms to generate solitary LTR is the homologous recombination between LTRs.16 Taken together, it can be suggested that the relatively low rate of recombination between PtERV LTRs is responsible for the higher copy number of full-length ERVs in the chimpanzee than human genome. In fact, human-specific HERV-Ks have propagated in a single lineage while chimpanzee-specific ERVs have propagated in 2 different lineages, CERV1 and CERV2 which are distinguished from one another based on their LTR sequence. However, we could not rule out other possibilities to explain the high copy number of chimpanzee-specific ERVs.
Recent Integrations of PtERVs and Their Polymorphism in the Chimpanzee Genome
PtERV retaining act ORFs and LTRs could be retrotranspositionally competent because it is still capable to produce its viral machineries which lead to retrotransposition through “copy and paste” mechanism or reinfection by making their particle.17 We closely examined chimpanzee-specific full-length PtERVs computationally and experimentally, and found 8 retrotransposition-competent PtERVs (rcPtERVs) (Fig. 1). The eight elements contained intact ORFs which are not disrupted by frameshift or nonsense mutation. Based on their LTR sequences, 3 and 5 rcPtERVs belonged to CERV1 and CERV2 families, respectively. We estimated the age of the 2 families and the result showed that CERV2 family (2.43-3.16 myrs) is younger than CERV1 family (8.80-11.45 myrs). PtERVs recently integrated into a new genomic region are likely to be polymorphic in the chimpanzee genome: presence/absence polymorphism.18 The polymorphism level of chimpanzee-specific PtERVs was examined using a DNA panel composed of 12 chimpanzee individuals. The result showed that 11 chimpanzee-specific PtERVs are polymorphic in the chimpanzee genome and the polymorphic rate of CERV2 family (22.2%) is higher than that of CERV1 family (20.9%). After full-length PtERV insertion, PtERV is likely to accumulate mutations over time or be removed by host defense mechanisms. Thus, it may as well that younger PtERV family has more rcPtERVs and higher polymorphic rate in the chimpanzee genome.
PtERVs Causing Chimpanzee-Specific Genomic Deletions
The chimpanzee genome has undergone species-specific genomic and genetic changes, since the divergence from human, its closest species. Retrotransposons are one of major sources to cause the changes in various primate genomes including the chimpanzee genome. They have increased their copy numbers in the chimpanzee genome via various mechanisms such as retrotransposition, reinfection, and recombination-mediated duplication events.3,6-8,19 Retrotransposons are able to cause host genomic variations using various mechanisms which include insertion, gene conversion, insertion-mediated deletion, recombination and transduction.20 Most of PtERVs have propagated via a classical insertion whose hallmark is target side duplication (TSD). However, some PtERVs resulted from non-classical insertions, called NCPI. The elements produced by NCPI event are lack of TSD and instead exhibit a short junctional microhomology that is shared with the host genomic region neighboring the element. NCPI accompanies host genomic deletions.8 We found that 5 chimpanzee-specific PtERVs were integrated into the chimpanzee genome via NCPI and deleted 51,366 bp of the chimpanzee genome.
Transcript Variants Caused by PtERVs
ERVs are able to insert into the genic region of the host genome. They can destroy host genes through the insertion into the exonic region. In addition, ERVs residing in the intronic region are also able to modify the expression of host genes by causing alternative splicing via their cis-elements, which could lead to abnormal proteins with deleterious function and genetic diseases, ultimately.21,22 It was reported that some mobile genetic elements could provide cis-element altering the cellular environments such as chromatin structure and epigenetic modification.22-25 Thus, ERVs are vulnerable to host defense mechanism which could delete the elements from the host genome. Due to the host defense mechanism, only small portion of the elements go to be fixed in the host genome. The host defense mechanism is probably more efficient against ERVs inserting into the genic region. In fact, most ERVs are located in intergenic and heterochromatin regions. Nonetheless, some elements are fixed in the genic region although they locate in UTR (untranslated region) or intronic region.26-28 Some of chimpanzee-specific PtERVs exist in the intronic region of known coding genes. PtERV residing inside PNRC2 gene causes chimpanzee-specific transcripts whose transcription starts from different transcription start site (TSS). PNRC2 transcripts from non-chimpanzee primates including human have a long 5’ UTR while its chimpanzee counterpart has a relatively short UTR. It suggests that a potential alternative PNRC2 promoter was created by PtERV insertion in the chimpanzee genome.8 Thus, PtERVs locating either in or nearby the genic region have a potential to influence the expression of the respective genes, which could lead to phenotypical differences between chimpanzee and other primates.22,30-33
Concluding Remarks
In this commentary, we characterize chimpanzee-specific PtERVs and describe their potential role in causing genomic changes in the chimpanzee genome. Since the divergence of human and chimpanzee, PtERVs have contributed to chimpanzee genomic fluidity and chimpanzee-specific genomic variations via insertion-mediated deletion and the homologous recombination between them. In addition, PtERV is able to lead to alternative gene transcripts by its insertion into the genic region. Further studies with chimpanzee RNAs or cell-lines will elucidate more about the impact of the chimpanzee-specific PtERVs on phenotypical divergence in chimpanzees.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
References
- 1. Locke DP, Hillier LW, Warren WC, Worley KC, Nazareth LV, Muzny DM, Yang SP, Wang Z, Chinwalla AT, Minx P, et al. Comparative and demographic analysis of orang-utan genomes. Nature 2011; 469:529-33; PMID:21270892; http://dx.doi.org/ 10.1038/nature09687 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Rhesus Macaque Genome S, Analysis C, Gibbs RA, Rogers J, Katze MG, Bumgarner R, Weinstock GM, Mardis ER, Remington KA, Strausberg RL, et al. Evolutionary and biomedical insights from the rhesus macaque genome. Science 2007; 316:222-34; PMID:17431167; http://dx.doi.org/ 10.1126/science.1139247 [DOI] [PubMed] [Google Scholar]
- 3. Chimpanzee S, Analysis C. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 2005; 437:69-87; PMID:16136131; http://dx.doi.org/ 10.1038/nature04072 [DOI] [PubMed] [Google Scholar]
- 4. Scally A, Dutheil JY, Hillier LW, Jordan GE, Goodhead I, Herrero J, Hobolth A, Lappalainen T, Mailund T, Marques-Bonet T, et al. Insights into hominid evolution from the gorilla genome sequence. Nature 2012; 483:169-75; PMID:22398555; http://dx.doi.org/ 10.1038/nature10842 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Lander ES LL, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al. International Human Genome Sequencing Consortium. Nature 2001; 412:565; PMID:11237011; http://dx.doi.org/ 10.1038/3508762711237011 [DOI] [Google Scholar]
- 6. Varki A, Altheide TK. Comparing the human and chimpanzee genomes: searching for needles in a haystack. Genome Res 2005; 15:1746-58; PMID:16339373; http://dx.doi.org/ 10.1101/gr.3737405 [DOI] [PubMed] [Google Scholar]
- 7. Shin W, Lee J, Son SY, Ahn K, Kim HS, Han K. Human-specific HERV-K insertion causes genomic variations in the human genome. PLoS One 2013; 8:e60605; PMID:23593260; http://dx.doi.org/ 10.1371/journal.pone.0060605 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Mun S, Lee J, Kim YJ, Kim HS, Han K. Chimpanzee-specific endogenous retrovirus generates genomic variations in the chimpanzee genome. PLoS One 2014; 9:e101195; PMID:24987855; http://dx.doi.org/ 10.1371/journal.pone.0101195 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Yohn CT, Jiang Z, McGrath SD, Hayden KE, Khaitovich P, Johnson ME, Eichler MY, McPherson JD, Zhao S, Paabo S, et al. Lineage-specific expansions of retroviral insertions within the genomes of African great apes but not humans and orangutans. PLoS Biol 2005; 3:e110; PMID:15737067; http://dx.doi.org/ 10.1371/journal.pbio.0030110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Polavarapu N, Bowen NJ, McDonald JF. Identification, characterization and comparative genomics of chimpanzee endogenous retroviruses. Genome Biol 2006; 7:R51; PMID:16805923; http://dx.doi.org/ 10.1186/gb-2006-7-6-r51 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Smit AF. Interspersed repeats and other mementos of transposable elements in mammalian genomes. Curr Opin Genet Dev 1999; 9:657-63; PMID:10607616; http://dx.doi.org/ 10.1016/S0959-437X(99)00031-3 [DOI] [PubMed] [Google Scholar]
- 12. Maksakova IA, Romanish MT, Gagnier L, Dunn CA, van de Lagemaat LN, Mager DL. Retroviral elements and their hosts: insertional mutagenesis in the mouse germ line. PLoS genetics 2006; 2:e2; PMID:16440055 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Kjellman C, Sjogren HO, Widegren B. The Y chromosome: a graveyard for endogenous retroviruses. Gene 1995; 161:163-70; PMID:7665072; http://dx.doi.org/ 10.1016/0378-1119(95)00248-5 [DOI] [PubMed] [Google Scholar]
- 14. Kuroki Y, Toyoda A, Noguchi H, Taylor TD, Itoh T, Kim DS, Kim DW, Choi SH, Kim IC, Choi HH, et al. Comparative analysis of chimpanzee and human Y chromosomes unveils complex evolutionary pathway. Nat Genet 2006; 38:158-67; PMID:16388311; http://dx.doi.org/ 10.1038/ng1729 [DOI] [PubMed] [Google Scholar]
- 15. Linheiro RS, Bergman CM. Whole genome resequencing reveals natural target site preferences of transposable elements in Drosophila melanogaster. PLoS One 2012; 7:e30008; PMID:22347367; http://dx.doi.org/ 10.1371/journal.pone.0030008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Vinogradova TV, Leppik LP, Nikolaev LG, Akopov SB, Kleiman AM, Senyuta NB, Sverdlov ED. Solitary human endogenous retroviruses-K LTRs retain transcriptional activity in vivo, the mode of which is different in different cell types. Virology 2001; 290:83-90; PMID:11883008; http://dx.doi.org/ 10.1006/viro.2001.1134 [DOI] [PubMed] [Google Scholar]
- 17. Locke DP, Hillier LW, Warren WC, Worley KC, Nazareth LV, Muzny DM, Yang SP, Wang Z, Chinwalla AT, Minx P, et al. Comparative and demographic analysis of orang-utan genomes. Nature 2011; 469:529–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Rhesus Macaque Genome S, Analysis C, Gibbs RA, Rogers J, Katze MG, Bumgarner R, Weinstock GM, Mardis ER, Remington KA, Strausberg RL, et al. Evolutionary and biomedical insights from the rhesus macaque genome. Science 2007; 316:222–34. [DOI] [PubMed] [Google Scholar]
- 19. Chimpanzee S, Analysis C. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 2005; 437:69–87. [DOI] [PubMed] [Google Scholar]
- 20. Scally A, Dutheil JY, Hillier LW, Jordan GE, Goodhead I, Herrero J, Hobolth A, Lappalainen T, Mailund T, Marques-Bonet T, et al. Insights into hominid evolution from the gorilla genome sequence. Nature 2012; 483:169–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Lander ES LL, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al. International Human Genome Sequencing Consortium. Nature 2001; 412:565. [Google Scholar]
- 22. Varki A, Altheide TK. Comparing the human and chimpanzee genomes: searching for needles in a haystack. Genome Res 2005; 15:1746–58. [DOI] [PubMed] [Google Scholar]
- 23. Shin W, Lee J, Son SY, Ahn K, Kim HS, Han K. Human-specific HERV-K insertion causes genomic variations in the human genome. PLoS One 2013; 8:e60605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Mun S, Lee J, Kim YJ, Kim HS, Han K. Chimpanzee-specific endogenous retrovirus generates genomic variations in the chimpanzee genome. PLoS One 2014; 9:e101195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Yohn CT, Jiang Z, McGrath SD, Hayden KE, Khaitovich P, Johnson ME, Eichler MY, McPherson JD, Zhao S, Paabo S, et al. Lineage-specific expansions of retroviral insertions within the genomes of African great apes but not humans and orangutans. PLoS Biol 2005; 3:e110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Polavarapu N, Bowen NJ, McDonald JF. Identification, characterization and comparative genomics of chimpanzee endogenous retroviruses. Genome Biol 2006; 7:R51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Smit AF. Interspersed repeats and other mementos of transposable elements in mammalian genomes. Curr Opin Genet Dev 1999; 9:657–63. [DOI] [PubMed] [Google Scholar]
- 28. Maksakova IA, Romanish MT, Gagnier L, Dunn CA, van de Lagemaat LN, Mager DL. Retroviral elements and their hosts: insertional mutagenesis in the mouse germ line. PLoS genetics 2006; 2:e2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Kjellman C, Sjogren HO, Widegren B. The Y chromosome: a graveyard for endogenous retroviruses. Gene 1995; 161:163–70. [DOI] [PubMed] [Google Scholar]
- 30. Kuroki Y, Toyoda A, Noguchi H, Taylor TD, Itoh T, Kim DS, Kim DW, Choi SH, Kim IC, Choi HH, et al. Comparative analysis of chimpanzee and human Y chromosomes unveils complex evolutionary pathway. Nat Genet 2006; 38:158–67. [DOI] [PubMed] [Google Scholar]
- 31. Linheiro RS, Bergman CM. Whole genome resequencing reveals natural target site preferences of transposable elements in Drosophila melanogaster. PLoS One 2012; 7:e30008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Vinogradova TV, Leppik LP, Nikolaev LG, Akopov SB, Kleiman AM, Senyuta NB, Sverdlov ED. Solitary human endogenous retroviruses-K LTRs retain transcriptional activity in vivo, the mode of which is different in different cell types. Virology 2001; 290:83–90. [DOI] [PubMed] [Google Scholar]
- 33. Mills RE, Bennett EA, Iskow RC, Devine SE. Which transposable elements are active in the human genome? Trends Genet 2007; 23:183–91. [DOI] [PubMed] [Google Scholar]
- 34. O’Donnell KA, Burns KH. Mobilizing diversity: transposable element insertions in genetic variation and disease. Mobile DNA 2010; 1:21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Watanabe H, Fujiyama A, Hattori M, Taylor TD, Toyoda A, Kuroki Y, Noguchi H, BenKahla A, Lehrach H, Sudbrak R, et al. DNA sequence and comparative analysis of chimpanzee chromosome 22. Nature 2004; 429:382–8. [DOI] [PubMed] [Google Scholar]
- 36. Cordaux R, Batzer MA. The impact of retrotransposons on human genome evolution. Nature reviews Genetics 2009; 10:691–703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Bourque G. Transposable elements in gene regulation and in the evolution of vertebrate genomes. Curr Opin Genet Dev 2009; 19:607–12 [DOI] [PubMed] [Google Scholar]
- 38. Warnefors M, Pereira V, Eyre-Walker A. Transposable elements: insertion pattern and impact on gene expression evolution in hominids. Mol Biol Evol 2010; 27:1955–62. [DOI] [PubMed] [Google Scholar]
- 39. Kondo Y, Issa JP. Enrichment for histone H3 lysine 9 methylation at Alu repeats in human cells. J Biol Chem 2003; 278:27658–62. [DOI] [PubMed] [Google Scholar]
- 40. Huda A, Bowen NJ, Conley AB, Jordan IK. Epigenetic regulation of transposable element derived human gene promoters. Gene 2011; 475:39–48. [DOI] [PubMed] [Google Scholar]
- 41. Sela N, Kim E, Ast G. The role of transposable elements in the evolution of non-mammalian vertebrates and invertebrates. Genome Biol 2010; 11:R59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Yoder JA, Walsh CP, Bestor TH. Cytosine methylation and the ecology of intragenomic parasites. Trends Genet 1997; 13:335–40. [DOI] [PubMed] [Google Scholar]
- 43. McDonald JF, Matzke MA, Matzke AJ. Host defenses to transposable elements and the evolution of genomic imprinting. Cytogenetic and genome research 2005; 110:242–9. [DOI] [PubMed] [Google Scholar]
- 44. Lippman Z, Gendrel AV, Black M, Vaughn MW, Dedhia N, McCombie WR, Lavine K, Mittal V, May B, Kasschau KD, et al. Role of transposable elements in heterochromatin and epigenetic control. Nature 2004; 430:471–6. [DOI] [PubMed] [Google Scholar]
- 45. Buzdin A, Ustyugova S, Khodosevich K, Mamedov I, Lebedev Y, Hunsmann G, Sverdlov E. Human-specific subfamilies of HERV-K (HML-2) long terminal repeats: three master genes wer [DOI] [PubMed] [Google Scholar]
