Abstract
Endogenous retroviruses (ERVs) have been integrated into vertebrate genomes and have momentously affected host organisms. Horses (Equus caballus) have been domesticated and selected for elite racing ability over centuries. ERVs played an important role in the evolutionary diversification of the horse genome. In the present study, we identified six equine ERV families (EqERVs-E1, I1, M2, P1, S1, and Y4), their full-length viral open reading frames (ORFs), and elucidated their phylogenetic relationships. The divergence time of EqERV families assuming an evolutionary rate of 0.2%/Myr indicated that EqERV-S3 (75.4 million years ago; mya) on chromosome 10 is an old EqERV family and EqERV-P5 (1.2 Mya) on chromosome 12 is a young member. During the evolutionary diversification of horses, the EqERV-I family diverged 1.7 Mya to 38.7 Mya. Reverse transcription quantitative real-time PCR (RT-qPCR) amplification of EqERV pol genes showed greater expression in the cerebellum of the Jeju horse than the Thoroughbred horse. These results could contribute further dynamic studies for horse genome in relation to EqERV gene function.
Keywords: EqERV, Jeju horse, pol gene, quantitative real-time RT-PCR, thoroughbred horse
INTRODUCTION
Modern horses (Equus caballus) have evolved for over 50 million years from the genus Eohippus. Since 4000 B.C., horses have been selected by humans on the basis of strength, speed, and endurance (Gu et al., 2009). A large number of horse subspecies have been domesticated and bred in regard to improvement of improve speed, endurance, and agility. Only one wild subspecies (Przewalski’s horse) has survived to the present day (Lau et al., 2009; Orlando et al., 2013). Thoroughbred horses have been specifically bred for racing ability; therefore, many studies have examined genetic traits related to athletic performance (Bower et al., 2012; Hill et al., 2010; Park et al., 2012). The Jeju breed of horses has been developed over centuries by interbreeding the native horses of Jeju, Korea with the 160 Mongolian horses introduced by the invading Mongolian army in 1276. These horses have been bred for farming, as a food source, and for racing in the Jeju province in South Korea. Jeju horses are small; however, they are robust and possess a high level of endurance. Many studies have focused on elucidating the phylogenetic relationships as well as genetic characteristics of Jeju horses (Cho, 2007; Shin et al., 2002). Research on Thoroughbred horses has focused on improving racing ability. Therefore, genetic research on the Jeju horse may help predicting of phylogenetic relationships and identifying traits specific to the Jeju horse. New approaches are required for the investigation of racing ability, subspecies-specific traits, evolutionary traits, as well as the phylogenetic diversity of horse subspecies.
Retroviral infections resulting in germline integration have an important role in vertebrate evolutionary diversification. Endogenous retrovirus (ERV) sequences account for about 5–10% of host genome and have been identified in all mammals and most vertebrates. Following integration into the host genome, retroviruses are transmitted from ancestors to progeny according to Mendelian inheritance principles and eventually form repeat elements (long terminal repeat (LTR) elements) (Blikstad et al., 2008; Jern and Coffin, 2008). The full-length ERV genome comprises gag (group-specific antigen), prt (protease), pol (polymerase), and env (envelope) genes as well as LTR elements at both ends. Primer binding sites (PBS) also exist between the 5′-LTR and the gag gene. Most ERVs display a truncated LTR, although some ERVs display a full-length sequence. These ERV sequences induce genomic instability via mutational events, such as retrotranspositions and genomic rearrangements (Esnault et al., 2005; Hughes and Coffin, 2001). When two LTR elements are integrated into the host genome, they initially display identical sequences that subsequently diverge at a rate of 0.13–0.21% over a period of a million years. Therefore, it is possible to estimate the point in time of integration by assessing the divergence between two LTR sequences (Tristem, 2000). ERVs are classified according to their PBS that are highly homologous to host tRNA complement sequences (Tristem, 2000).
Human endogenous retroviruses (HERVs) have many well-known functions, e.g., they are potential pathogens, alternative LTR promoters, and are involved in syncytin production in humans (Dunn et al., 2006; Malassine et al., 2005; Ruprecht et al., 2008). Horses also have ERVs, referred to as equine ERVs (EqERVs) (Brown et al., 2012; Garcia-Etxebarria and Jugo, 2012; Jern and Coffin, 2008; van der Kuyl, 2011). In the equine genome, full-length beta-EqERV genomes have been identified (van der Kuyl, 2011). Fifteen EqERV types belonging to three classes were detected by in silico methods (Garcia-Etxebarria and Jugo, 2012). ERVs have the potential to alter the phenotype of different species, and EqERVs may also provide evolutionary information related to the traits of horses (Gifford and Tristem, 2003; Jern and Coffin, 2008). According to transcriptome analyses, EqERV-mRNAs are expressed in various horse tissues (Brown et al., 2012). The expression of ERVs has previously been identified by the detection of ERV-derived transcripts (Ahn and Kim, 2009; Perron et al., 2008). Some ERV coding regions (gag, pol, and env) have complete open reading frames (ORFs); interrupted ERV ORFs have also been detected. ERV genes are transcribed at high levels, and corresponding proteins are expressed more frequently in cancer than in normal tissues (Ahn and Kim, 2009; Kang et al., 2014; Ruprecht et al., 2008). Some ERV families tend to exhibit ubiquitous expression in various organs, while others show tissue-specific expression patterns (Ahn and Kim, 2009; Ahn et al., 2011b). We identified EqERVs in the horse genome and characterized their genomic organization. To date, no studies have been undertaken to evaluate the expression patterns of EqERVs in horse samples. Thus, the EqERV expression patterns were examined in the cerebrum and cerebellum of a Thoroughbred as well as a Jeju horse.
MATERIALS AND METHODS
Ethics statement
All animal trials approved by the Pusan National University-Institutional Animal Care and Use Committee (PNU-IACUC) (Approval Number for horse tissue samples: PNU-2013-0411). All biopsies were taken under the guidance of a veterinarian. Efforts were made to minimize animal stress and pain.
Horse samples
Cerebrum and Cerebellum tissue of individual Thoroughbred and Jeju horses obtained from the Subtropical Animal Experiment Station of the Rural Development Administration, South Korea. Total RNA was extracted using a standard TRIzol®-based purification protocol (Invitrogen, USA) and quantified using NanoDrop technology (Thermo Fisher Scientific Inc., USA). Then, RNA was adjusted to concentrations of 500 ng per samples. M-MLV (Moloney-Murine Leukemia Virus) reverse transcriptase (Promega, USA) was used for cDNA synthesis according to instructions of the manufacturer (Promega).
Identification of EqERV families
The Equus caballus (equCab2) genome assembly of a Thoroughbred horse was retrieved from NCBI Assembly (http://www.ncbi.nlm.nih.gov/assembly/286598/). In UCSC Genome Browser, BLAT was used to search for ERVs in the horse genome, and well-known full-length retrovirus DNA sequences were used as search queries. The resulting sequences were expanded to include upstream and downstream regions for total lengths of 1.5×, then retrieved ERV-sequences were queried using RetroTector (http://retrotector.neuro.uu.se/). EqERV was used to refer to the ERVs described in this work based on the previously described nomenclature (Garcia-Etxebarria and Jugo, 2012).
Bioinformatic analysis
EqERVs were classified according to the PBS and identified by length. Each LTR sequence was aligned using ClustalW in MEGA7 (http://www.megasoftware.net/). The point in time of integration of EqERVs was assessed based on the divergence of each LTR (Johnson and Coffin, 1999). The tRNA sequences were downloaded from the genomic tRNA database (http://gtrnadb.ucsc.edu) and the PBS sites were aligned with tRNA sequences in BioEdit (Hall, 1999). The PBS sites not matching with tRNA sequences were realigned using the RetroTector PBS site (Sperber et al., 2009). Sequence divergences between two LTR sequences were assessed by the Kimura two-parameter method in MEGA7 (Kimura, 1980). Neighbor-joining trees were constructed using MEGA7, with 100 bootstrap replicates (Kimura, 1980). The percentage of bootstrap replicates supporting the branch is revealed at each node. The ORFs of gag, pol, env region were retrieved by a BLASTx search (Johnson et al., 2008).
RT-qPCR amplification
EqERV pol transcripts were detected by reverse transcription quantitative real-time PCR (RT-qPCR) using the primers shown in Table 1. We designed primer sets using the Primer3 program (http://bioinfo.ut.ee/primer3-0.4.0/). The RT regions of the EqERV pol gene were amplified. Each reaction only yielded one amplification product as determined by melting curve analyses. No-template controls were consistently tested negative. For normalization, the beta-2-microglobulin (B2M) gene was amplified as previously described (Ahn et al., 2011a). RT-qPCR amplification was performed as follows: 30 cycles of 95°C for 30 s, 55°C for 30 s, 72°C for 30 s, and a final elongation at 72°C for 7 min using a Mastercycler® Pro S Thermocycler (Effendorf, Hamburg, Germany). The conditions for the RT-qPCR amplification of EqERV elements and the B2M gene were 45 cycles of 95°C for 10 s, 58°C for 15 s, and 72°C for 15 s. A melting curve analysis was performed for 30 s at 55°C to 99°C using a Rotor-Gene Q (Qiagen, Hilden, Germany). All samples were amplified in triplicate to guarantee reproducibility.
Table 1.
List of RT-qPCR primers used for the amplification of the EqERV pol gene in Thoroughbred and Jeju horses
| EqERV family | Direction | Primer sequences | Product size (bp) | Location in genome |
|---|---|---|---|---|
| EqERV-E1 pol | Forward | GGTACAGAGAGGGAGGCACA | 132 | chr1:183,840,430-183,840,561 |
| Reverse | TTGAGGCTGATTGCTTTTGA | |||
| EqERV-I1 pol | Forward | ACCCCATCTGCACTGAAATC | 140 | chrX:74,309,270-74,309,383 |
| Reverse | CCCGAGTTGAGGGTTAGGG | |||
| EqERV-P1 pol | Forward | TGTGGGTCCTTCTAGTTGTGG | 134 | chr7:49,472,758-49,472,891 |
| Reverse | CGAGGGGTTAAGTTCACACG | |||
| EqERV-M2 pol | Forward | TGGAAAAAGGCAAAGACAAA | 125 | chr5:16,128,363-16,128,487 |
| Reverse | CTGAAGTGGCCACCAACTTT | |||
| EqERV-S1 pol | Forward | CATGGCACTGCTCATCAAAC | 114 | chrX:95,866,328-95,866,441 |
| Reverse | TTCCTTTTTCTCCCCAAAGC | |||
| EqERV-Y4 pol | Forward | GGGAGGTCAGAGCCTTGTTT | 142 | chr1:29,481,664-29,481,805 |
| Reverse | ACGTCCTTTTCTTGGCATCC | |||
| B2M (for normalization) | Forward | CCTGCTCGGGCTACTCTC | 89 | chr1:144,494,425-144,497,779 |
| Reverse | CATTCTCTGCTGGGTGACG |
RESULTS AND DISCUSSION
Identification and point of divergence of EqERVs in the horse genome
We analyzed EqERVs in the horse genome by an in silico method using the RetroTector. We confirmed the locations in the genome, retroviral genes, PBS regions, and LTR pairs of each EqERV family. In the horse whole genome, a total of 22 EqERV types were detected by PBS analysis (EqERV-E, I, M, P, S, and Y) (Fig. 1 and Table 2). Based on a comparative analysis of the reverse complement of PBS and tRNA sequences, we determined the first amino acid combined with the tRNA, and assigned the following EqERV family names: EqERV-E1, EqERV-I1, EqERV-M2, EqERV-P1, EqERV-S1, and EqERV-Y4 (Fig. 2). In all EqERV families, the surface (SU) component of the env gene was interrupted. The matrix (MA) domain of the gag gene was only detected, and full-length ORFs of viral genes (gag and env) were identified in EqERV-E1. The prt gene was truncated in EqERV-P1, EqERV-S1, and EqERV-Y4. EqERV-P1 and EqERV-S1 showed a long capsid (CA) domain compared to viruses of other families, however full-length ORFs of the gag gene were not detected. Additionally, long integrase (IN) and transmembrane (TM) domains were detected in EqERV-P1, while a long reverse transcriptase (RT) domain in the pol region was observed for EqERV-I1. The EqERV-S1 has full-length ORFs of three viral genes (gag, pol, and env), but other EqERVs has at least one interrupted ORFs in these sequences (Fig. 1). In the host, the activation of restriction factors was not need to full-length ORFs, and interrupted ORFs could regulate immune systems or other biological roles (Malfavon-Borja and Feschotte, 2015). Thus, all these EqERV domains appear to have important biological roles in horses.
Fig. 1. Structural analysis of full-length EqERVs located at specific loci on horse chromosomes.

The depicted structures were analyzed using EqERV sequences from the horse genome equCab2 assembly with the RetroTector10 program. The interrupted regions and full-length ORFs were indicated as dot lines and solid lines with bold type, respectively. These EqERVs harbor a 5′ LTR-gag-(prt)-pol-env-3′ LTR structure.
Table 2.
Divergence times and evolutionary rates of EqERV families
| Evolutionary time | |||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|||||||||
| EqERV Family | *PBS | Location (equCab2) | Strand | Length (kb) | **Structure | Divergence (%) | r=0.3%/Myr | r=0.2%/Myr | r=0.15%/Myr |
| EqERV-E1 | Glu | chr1:183,833,213-183,842,637 | (+) | 9.4 | Full | 16.6 | 27.7 | 41.6 | 55.5 |
| EqERV-I1 | Ile | chrX:74,300,522-74,309,530 | (+) | 9.0 | Full | 2.9 | 4.8 | 7.2 | 9.5 |
| EqERV-I2 | Ile | chr11:46,777,889-46,786,573 | (−) | 8.7 | Full | 1.2 | 1.9 | 2.9 | 3.9 |
| EqERV-I3 | Ile | chr5:27,325,566-27,334,031 | (−) | 8.5 | Full | 0.7 | 1.1 | 1.7 | 2.2 |
| EqERV-I4 | Ile | chr7:47,466,052-47,474,526 | (−) | 8.5 | Full | 9.4 | 15.6 | 23.5 | 31.3 |
| EqERV-I5 | Ile | chr11:55,535,777-55,544,146 | (−) | 8.3 | Full | 15.5 | 25.8 | 38.7 | 51.6 |
| EqERV-I6 | Ile | chr1:111,609,131-111,617,366 | (+) | 8.2 | Full | 8.0 | 13.3 | 20.0 | 26.7 |
| EqERV-I7 | Ile | chr4:105,090,068-105,096,011 | (−) | 5.9 | Δ3′LTR | Not detected | - | - | - |
| EqERV-M1 | Met | chr7:43,693,479-43,702,540 | (−) | 9.1 | Full | 2.7 | 4.5 | 6.8 | 9.1 |
| EqERV-M2 | Met | chr5:16,127,484-16,135,151 | (−) | 7.7 | Δ3′LTR | Not detected | - | - | - |
| EqERV-P1 | Pro | chr11:49,468,622-49,477,753 | (+) | 9.1 | Full | 1.7 | 2.9 | 4.4 | 5.8 |
| EqERV-P2 | Pro | chr2:11,712,296-11,721,196 | (+) | 8.9 | Full | 2.4 | 4.1 | 6.1 | 8.1 |
| EqERV-P3 | Pro | chr4:26,026,629-26,034,772 | (+) | 8.1 | Δ3′LTR | Not detected | - | - | - |
| EqERV-P4 | Pro | chr10:26,653,364-26,660,774 | (−) | 7.4 | Δ3′LTR | Not detected | - | - | - |
| EqERV-P5 | Pro | chr12:15,551,778-15,559,089 | (−) | 7.3 | Full | 0.5 | 0.8 | 1.2 | 1.5 |
| EqERV-S1 | Ser | chrX:44,328,238-44,336,882 | (+) | 8.6 | Full | 1.3 | 2.1 | 3.1 | 4.2 |
| EqERV-S2 | Ser | chr20:32,762,746-32,770,917 | (−) | 8.2 | Full | 30.2 | 50.3 | 75.4 | 100.5 |
| EqERV-S3 | Ser | chr10:12,943,738-12,950,180 | (+) | 6.4 | Δenv | 10.6 | 17.7 | 26.6 | 35.4 |
| EqERV-Y1 | Tyr | chrX:53,124,155-53,133,636 | (+) | 9.5 | Full | 14.4 | 23.9 | 35.9 | 47.9 |
| EqERV-Y2 | Tyr | chrX:60,642,587-60,651,514 | (+) | 8.9 | Full | 19.6 | 32.7 | 49.1 | 65.5 |
| EqERV-Y3 | Tyr | chr1:27,393,645-27,402,113 | (+) | 8.5 | Full | 1.1 | 1.9 | 2.8 | 3.7 |
| EqERV-Y4 | Tyr | chr1:29,478,335-29,485,604 | (−) | 7.2 | Full | 2.2 | 3.7 | 5.6 | 7.4 |
PBS: Primer binding site,
Full: full-length, Δdeletion. Myr: million years
Divergence was estimated by the Kimura-2-parameter method. Evolutionary time was calculated following as T=d/2r (T: evolutionary time; d: divergence; r: evolutionary rate).
Fig. 2. Alignment of LTR sequences and PBS with tRNA sequences of six novel EqERV families.

EqERV LTR lengths differ depending on the family.
Both LTRs located in the 5′ and 3′ flanking regions are important for retroviral replication. We analyzed the LTR sequences to estimate the point in time of integration based on divergence values, because both LTR sequences can help to estimate their insertion time. The same LTR sequences were attached to each 5′- and 3′-end during retroviral replication, and differences in these LTR sequences arise during evolution. Therefore, it could be possible to estimate the moment of integration time of ERVs based on the divergence values (Johnson and Coffin, 1999). Garcia-Etxebarria and Jugo estimated the point in time of integration at 45–21 million years ago (Mya) and 6-3 Mya (Garcia-Etxebarria and Jugo, 2012). Thus, these two types are the youngest members of class I. Our estimated LTR divergence values indicated that EqERV-Y1 and Y3 proliferated approximately 35.9 Mya and 2.8 Mya based on average evolutionary rate of 0.2% per million years (Myr) (Table 2). This evolutionary rate was also applied to ERV-H divergence in primates (Anderssen et al., 1997). Depend on the species, different evolutionary rates (0.12%/Myr, 0.2%/Myr, and 0.26%/Myr) could be used for ERV studies in humans and apes (Lebedev et al., 2000). In our previous studies, the average evolutionary rate of 0.2%/Myr was used for humans and primates (Yi and Kim, 2006; Yi et al., 2007). After that, integration time based on ERV was estimated in novel methods (Martins and Villesen, 2011), and various species in mammals (Lee et al., 2013). In mammalian genome, insertion time is estimated by the divergence between 5′LTR and 3′LTR (Lee et al., 2013). As shown in Table 2, we estimated the divergence time of EqERV families assuming an evolutionary rate of 0.2%/Myr. Divergence time was estimated as 49.1 Mya in EqERV-Y2 and 5.6 Mya in EqERV-Y4. As an old EqERV family, EqERV-S3 (75.4 Mya) was detected on chromosome 10, and EqERV-P5 (1.2 Mya) was identified as a young member on chromosome 12. During the evolutionary diversification of horses, the diversification of the EqERV-I family took 1.7 Mya to 38.7 Mya (Table 2). The evolution of the horse species occurred over a period of 55 million years (Macfadden, 2005). According to our data and a previous report (Garcia-Etxebarria and Jugo, 2012), some EqERVs integrated into the horse genome during the evolution of the horse lineage. The Equus genus consists of three clades. One clade is the domesticated horse, including Thoroughbred, Arab, Jeju, and Mongolian horse (E. caballus) breeds. The other two clades comprise zebras and donkeys. These clades split 3 million years ago (Macfadden, 2005; Oakenfull et al., 2000). Most EqERV integrations occurred earlier, as indicated by herein presented and other published data (Garcia-Etxebarria and Jugo, 2012). The HERV-S family has been detected in hominoids, Old World monkeys, and New World monkeys. However, it does not exist in prosimians, indicating that HERV-S integrated into the primate genome approximately 43 million years ago (Yi et al., 2004). Taken together, EqERV families integrated at different points in time, and therefore variant evolutionary patterns are observed in the equine genome.
Phylogenetic analysis of identified EqERVs
We compared the relationship of the identified EqERV families with the EqERVs derived from previous studies using phylogenetic analysis (Fig. 3). Firstly, EqERV-beta1 was firstly identified in chromosome 5 (van der Kuyl, 2011). Then, Brown et al. identified the EqERV pol genes of beta-, gamma-, and epsilon-retroviruses (Brown et al., 2012), as well as Garcia-Etxebarria and Jugo identified the three classes of EqERVs in horse genome (Garcia-Etxebarria and Jugo, 2012). The phylogenetic tree was generated by the neighbor-joining method, with the EqERV RT regions, because the same families of ERV have similar RT regions (Tristem, 2000). EqERV-S3 was excluded from the phylogenetic analysis because of its short RT region. HERV-Y is recently identified (Gim et al., 2015), and EqERV-Y is also identified in the horse genome (Garcia-Etxebarria and Jugo, 2012). In the previous study, EqERV7, EqERV8 were class I, EqERV12 was class II, and EqERV10 was class III, respectively (Garcia-Etxebarria and Jugo, 2012). Class I EqERVs were included with epsilon-retrovirus, and our EqERV-Y families were included with gamma-retrovirus. Beta-retrovirus, EqERV-beta1, and class II EqERV12 were included in same group, and class III EqERV10 was out-grouped (Fig. 3). In the previous studies, the phylogenetic relationships of EqERVs were compared with other ERVs (Brown et al., 2012; Garcia-Etxebarria and Jugo, 2012; van der Kuyl, 2011). In this study, we revealed the phylogenetic relationship of six EqERV families, which can provide insights into the family-specific functions.
Fig. 3. Phylogenetic trees based on the pol RT regions of identified EqERVs.
The tree was generated by the neighbor-joining method. Bootstrap evaluation of the branching patterns was executed with 100 replicates. Branch lengths are matched to genetic distance. EqERVs identified in previous studies were indicated as asterisks. *(Brown et al., 2012), **(Garcia-Etxebarria and Jugo, 2012), ***(van der Kuyl, 2011).
Expression analysis of EqERV pol in horse tissues of two subspecies
Many researchers have proposed ERV-derived transcripts or proteins as potential pathological factors. ERV gag, pol, and env gene transcripts and proteins are expressed in various cancer tissues at higher levels than in normal tissues (Ahn and Kim, 2009; Reis et al., 2013). The expression of ERV genes has many effects in host organisms. In a previous study, 978 EqERV-derived gag, pol, and env transcripts were detected in horse tissues based on a whole transcriptome analysis (Brown et al., 2012). According to a phylogenetic analysis of EqERV transcripts, EqERVs are closely related to bovine ERVs. Therefore, it is necessary to elucidate their biological functions and expression patterns. In general, viral genes have potentially detrimental effects, and integrated viral genes in the host genome may also have deleterious effects on the host. The functions of HERV transcripts, HERV proteins, and even HERV-derived particles are extensively in various human tissues, most notably in cancer tissues. The best example of an ERV function is that of HERV-W env, which has an important role in placenta formation. In order to characterize ERV transcripts, it is important to identify tissue-specific expression patterns. Likewise, HERV-W env also presents expression patterns specific for brain diseases or brain cells, which suggests that the expression of ERVs can have immunopathogenic and neuroinflammatory roles in the host (Frank et al., 2005; Perron et al., 2012). Accordingly, the elucidation of ERV expression patterns in the brain is important to determine the exact role of the virus in this area. Therefore, we assessed the expression of EqERV in two brain specimens obtained from a Thoroughbred horse and a Jeju horse. Primers were designed to amplify the pol RT regions in the whole transcriptome using the Tablet graphical viewer (Milne et al., 2013). As shown in Fig. 4, we examined the expression patterns of pol by RT-qPCR. EqERV-E1 pol was expressed at higher levels in the Jeju horse than in the Thoroughbred, particularly in the cerebellum (Fig. 4A). Each pol gene of EqERV-I1, M2, P1, S1, and Y4 showed higher expression in the cerebellum of the Jeju horse than the Thoroughbred (Fig. 4). In humans, HERV-P pol expression was detected in brain and testis tissues (Yi et al., 2007), while HERV-S pol is specifically expressed patterns in the brain and thymus (Yi et al., 2004). The viral ERV genes can induce the proinflammatory cytokines in brain and immune system, and have crucial roles in the development and regulates central nervous system (CNS) (Kremer et al., 2013; Mortelmans et al., 2016). Then, pharmacological-induction of ERV genes could regulate the function of the brain or CNS. These expression data may provide the clues of Thoroughbred and Jeju horse specific pharmacological-induction of ERVs, and improve the breeding system of each horses. Thus, various expression patterns of EqERV pol in brain tissues could be of great use for further functional studies in horse research.
Fig. 4. Quantitative expression of six novel EqERV families in Thoroughbred and Jeju horse tissues by RT-qPCR.

Relative expression levels of EqERV genes were normalized to the expression levels of horse B2M genes. TH1 indicates the Thoroughbred horse cerebrum and TH2 indicates the cerebellum. JH1 indicates the Jeju horse cerebrum and JH2 indicates the cerebellum. Compared between Thoroughbred and Jeju horse in each tissues, significantly different expression patterns were indicated as asterisks (*p < 0.05, **p < 0.005).
ACKNOWLEDGMENTS
This work was supported by a grant from the ministry of Education, Republic of Korea (NRF-2015R1D1A1A01056650).
REFERENCES
- Ahn K., Bae J.-H., Nam K.-H., Lee C.-E., Park K.-D., Lee H.-K., Cho B.-W., Kim H.-S. Identification of reference genes for normalization of gene expression in thoroughbred and Jeju native horse (Jeju pony) tissues. Genes Genom. 2011a;33:245–250. [Google Scholar]
- Ahn K., Han K., Kim H.S. Quantitative analysis of the HERV pol gene in human tissues. Genes Genom. 2011b;33:439–443. [Google Scholar]
- Ahn K., Kim H.S. Structural and quantitative expression analyses of HERV gene family in human tissues. Mol Cells. 2009;28:99–103. doi: 10.1007/s10059-009-0107-y. [DOI] [PubMed] [Google Scholar]
- Anderssen S., Sj⊘ttem E., Svineng G., Johansen T. Comparative analyses of LTRs of the ERV-H family of primate-specific retrovirus-like elements isolated from marmoset, African green monkey, and man. Virology. 1997;234:14–30. doi: 10.1006/viro.1997.8590. [DOI] [PubMed] [Google Scholar]
- Blikstad V., Benachenhou F., Sperber G.O., Blomberg J. Evolution of human endogenous retroviral sequences: a conceptual account. Cell Mol Life Sci. 2008;65:3348–3365. doi: 10.1007/s00018-008-8495-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bower M.A., McGivney B.A., Campana M.G., Gu J., Andersson L.S., Barrett E., Davis C.R., Mikko S., Stock F., Voronkova V., et al. The genetic origin and history of speed in the Thoroughbred racehorse. Nat Commun. 2012;3:643. doi: 10.1038/ncomms1644. [DOI] [PubMed] [Google Scholar]
- Brown K., Moreton J., Malla S., Aboobaker A.A., Emes R.D., Tarlinton R.E. Characterisation of retroviruses in the horse genome and their transcriptional activity via transcriptome sequencing. Virology. 2012;433:55–63. doi: 10.1016/j.virol.2012.07.010. [DOI] [PubMed] [Google Scholar]
- Cho G.J. Genetic relationship and characteristics using microsatellite. J Life Sci. 2007;17:699–705. [Google Scholar]
- Dunn C.A., Romanish M.T., Gutierrez L.E., van de Lagemaat L.N., Mager D.L. Transcription of two human genes from a bidirectional endogenous retrovirus promoter. Gene. 2006;366:335–342. doi: 10.1016/j.gene.2005.09.003. [DOI] [PubMed] [Google Scholar]
- Esnault C., Heidmann O., Delebecque F., Dewannieux M., Ribet D., Hance A.J., Heidmann T., Schwartz O. APOBEC3G cytidine deaminase inhibits retrotransposition of endogenous retroviruses. Nature. 2005;433:430–433. doi: 10.1038/nature03238. [DOI] [PubMed] [Google Scholar]
- Frank O., Giehl M., Zheng C., Hehlmann R., Leib-Mösch C., Seifarth W. Human endogenous retrovirus expression profiles in samples from brains of patients with schizophrenia and bipolar disorders. J Virol. 2005;79:10890–10901. doi: 10.1128/JVI.79.17.10890-10901.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garcia-Etxebarria K., Jugo B.M. Detection and characterization of endogenous retroviruses in the horse genome by in silico analysis. Virology. 2012;434:59–67. doi: 10.1016/j.virol.2012.08.047. [DOI] [PubMed] [Google Scholar]
- Gifford R., Tristem M. The evolution, distribution and diversity of endogenous retroviruses. Virus Genes. 2003;26:291–315. doi: 10.1023/a:1024455415443. [DOI] [PubMed] [Google Scholar]
- Gim J.-A., Han K., Kim H.-S. Identification and expression analysis of human endogenous retrovirus Y (HERV-Y) in various human tissues. Arch Virol. 2015;160:2161–2168. doi: 10.1007/s00705-015-2486-z. [DOI] [PubMed] [Google Scholar]
- Gu J., Orr N., Park S.D., Katz L.M., Sulimova G., MacHugh D.E., Hill E.W. A genome scan for positive selection in thoroughbred horses. PLoS One. 2009;4:e5767. doi: 10.1371/journal.pone.0005767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hall T.A. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999;41:95–98. [Google Scholar]
- Hill E.W., Gu J., Eivers S.S., Fonseca R.G., McGivney B.A., Govindarajan P., Orr N., Katz L.M., MacHugh D.E. A sequence polymorphism in MSTN predicts sprinting ability and racing stamina in thoroughbred horses. PLoS One. 2010;5:e8645. doi: 10.1371/journal.pone.0008645. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hughes J.F., Coffin J.M. Evidence for genomic rearrangements mediated by human endogenous retroviruses during primate evolution. Nat Genet. 2001;29:487–489. doi: 10.1038/ng775. [DOI] [PubMed] [Google Scholar]
- Jern P., Coffin J.M. Effects of retroviruses on host genome function. Annu Rev Genet. 2008;42:709–732. doi: 10.1146/annurev.genet.42.110807.091501. [DOI] [PubMed] [Google Scholar]
- Johnson M., Zaretskaya I., Raytselis Y., Merezhuk Y., McGinnis S., Madden T.L. NCBI BLAST: a better web interface. Nucleic Acids Res. 2008;36:W5–W9. doi: 10.1093/nar/gkn201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson W.E., Coffin J.M. Constructing primate phylogenies from ancient retrovirus sequences. Proc Natl Acad Sci USA. 1999;96:10254–10260. doi: 10.1073/pnas.96.18.10254. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kang Y.J., Jo J.O., Ock M.S., Chang H.K., Baek K.W., Lee J.R., Choi Y.H., Kim W.J., Leem S.H., Kim H.S., et al. Human ERV3-1 env protein expression in various human tissues and tumours. J Clin Pathol. 2014;67:86–90. doi: 10.1136/jclinpath-2013-201841. [DOI] [PubMed] [Google Scholar]
- Kimura M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980;16:111–120. doi: 10.1007/BF01731581. [DOI] [PubMed] [Google Scholar]
- Kremer D., Schichel T., Förster M., Tzekova N., Bernard C., Valk P., Horssen J., Hartung H.P., Perron H., Küry P. Human endogenous retrovirus type W envelope protein inhibits oligodendroglial precursor cell differentiation. Ann Neurol. 2013;74:721–732. doi: 10.1002/ana.23970. [DOI] [PubMed] [Google Scholar]
- Lau A.N., Peng L., Goto H., Chemnick L., Ryder O.A., Makova K.D. Horse domestication and conservation genetics of Przewalski’s horse inferred from sex chromosomal and autosomal sequences. Mol Biol Evol. 2009;26:199–208. doi: 10.1093/molbev/msn239. [DOI] [PubMed] [Google Scholar]
- Lebedev Y.B., Belonovitch O.S., Zybrova N.V., Khil P.P., Kurdyukov S.G., Vinogradova T.V., Hunsmann G., Sverdlov E.D. Differences in HERV-K LTR insertions in orthologous loci of humans and great apes. Gene. 2000;247:265–277. doi: 10.1016/s0378-1119(00)00062-7. [DOI] [PubMed] [Google Scholar]
- Lee A., Nolan A., Watson J., Tristem M. Identification of an ancient endogenous retrovirus, predating the divergence of the placental mammals. Phil Trans R Soc B. 2013;368:20120503. doi: 10.1098/rstb.2012.0503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Macfadden B.J. Evolution. Fossil horses--evidence for evolution. Science. 2005;307:1728–1730. doi: 10.1126/science.1105458. [DOI] [PubMed] [Google Scholar]
- Malassine A., Handschuh K., Tsatsaris V., Gerbaud P., Cheynet V., Oriol G., Mallet F., Evain-Brion D. Expression of HERV-W Env glycoprotein (syncytin) in the extravillous trophoblast of first trimester human placenta. Placenta. 2005;26:556–562. doi: 10.1016/j.placenta.2004.09.002. [DOI] [PubMed] [Google Scholar]
- Malfavon-Borja R., Feschotte C. Fighting fire with fire: endogenous retrovirus envelopes as restriction factors. J Virol. 2015;89:4047–4050. doi: 10.1128/JVI.03653-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martins H., Villesen P. Improved integration time estimation of endogenous retroviruses with phylogenetic data. PLoS One. 2011;6:e14745. doi: 10.1371/journal.pone.0014745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Milne I., Stephen G., Bayer M., Cock P.J., Pritchard L., Cardle L., Shaw P.D., Marshall D. Using Tablet for visual exploration of second-generation sequencing data. Brief Bioniform. 2013;14:193–202. doi: 10.1093/bib/bbs012. [DOI] [PubMed] [Google Scholar]
- Mortelmans K., Wang-Johanning F., Johanning G.L. The role of human endogenous retroviruses in brain development and function. Apmis. 2016;124:105–115. doi: 10.1111/apm.12495. [DOI] [PubMed] [Google Scholar]
- Oakenfull E.A., Lim H.N., Ryder O.A. A survey of equid mitochondrial DNA: Implications for the evolution, genetic diversity and conservation of Equus. Conserv Genet. 2000;1:341–355. [Google Scholar]
- Orlando L., Ginolhac A., Zhang G., Froese D., Albrechtsen A., Stiller M., Schubert M., Cappellini E., Petersen B., Moltke I., et al. Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature. 2013;499:74–78. doi: 10.1038/nature12323. [DOI] [PubMed] [Google Scholar]
- Park K.D., Park J., Ko J., Kim B.C., Kim H.S., Ahn K., Do K.T., Choi H., Kim H.M., Song S., et al. Whole transcriptome analyses of six thoroughbred horses before and after exercise using RNA-Seq. BMC Genomics. 2012;13:473. doi: 10.1186/1471-2164-13-473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perron H., Mekaoui L., Bernard C., Veas F., Stefas I., Leboyer M. Endogenous retrovirus type W GAG and envelope protein antigenemia in serum of schizophrenic patients. Biol Psychiatry. 2008;64:1019–1023. doi: 10.1016/j.biopsych.2008.06.028. [DOI] [PubMed] [Google Scholar]
- Perron H., Germi R., Bernard C., Garcia-Montojo M., Deluen C., Farinelli L., Faucard R., Veas F., Stefas I., Fabriek B.O. Human endogenous retrovirus type W envelope expression in blood and brain cells provides new insights into multiple sclerosis disease. Mult Scler J. 2012;18:1721–1736. doi: 10.1177/1352458512441381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reis B.S., Jungbluth A.A., Frosina D., Holz M., Ritter E., Nakayama E., Ishida T., Obata Y., Carver B., Scher H., et al. Prostate cancer progression correlates with increased humoral immune response to a human endogenous retrovirus GAG protein. Clin Cancer Res. 2013;19:6112–6125. doi: 10.1158/1078-0432.CCR-12-3580. [DOI] [PubMed] [Google Scholar]
- Ruprecht K., Mayer J., Sauter M., Roemer K., Mueller-Lantzsch N. Endogenous retroviruses and cancer. Cell Mol Life Sci. 2008;65:3366–3382. doi: 10.1007/s00018-008-8496-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shin J.A., Yang Y.H., Kim H.S., Yun Y.M., Lee K.K. Genetic polymorphism of the serum proteins of horses in Jeju. J Vet Sci. 2002;3:255–263. [PubMed] [Google Scholar]
- Sperber G., Lovgren A., Eriksson N.E., Benachenhou F., Blomberg J. RetroTector online, a rational tool for analysis of retroviral elements in small and medium size vertebrate genomic sequences. BMC Bioinformatics. 2009;10(Suppl 6):S4. doi: 10.1186/1471-2105-10-S6-S4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tristem M. Identification and characterization of novel human endogenous retrovirus families by phylogenetic screening of the human genome mapping project database. J Virol. 2000;74:3715–3730. doi: 10.1128/jvi.74.8.3715-3730.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van der Kuyl A.C. Characterization of a full-length endogenous beta-retrovirus, EqERV-beta1, in the genome of the horse (Equus caballus) Viruses. 2011;3:620–628. doi: 10.3390/v3060620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yi J.M., Kim H.S. Molecular evolution of the HERV-E family in primates. Arch Virol. 2006;151:1107–1116. doi: 10.1007/s00705-005-0701-z. [DOI] [PubMed] [Google Scholar]
- Yi J.M., Kim T.H., Huh J.W., Park K.S., Jang S.B., Kim H.M., Kim H.S. Human endogenous retroviral elements belonging to the HERV-S family from human tissues, cancer cells, and primates: expression, structure, phylogeny and evolution. Gene. 2004;342:283–292. doi: 10.1016/j.gene.2004.08.007. [DOI] [PubMed] [Google Scholar]
- Yi J.M., Schuebel K., Kim H.S. Molecular genetic analyses of human endogenous retroviral elements belonging to the HERV-P family in primates, human tissues, and cancer cells. Genomics. 2007;89:1–9. doi: 10.1016/j.ygeno.2006.08.010. [DOI] [PubMed] [Google Scholar]

