Abstract
The mobilome, portion of the genome composed of transposable elements (TEs), of Anopheles darlingi was described together with the genome of this species. Here, this mobilome was revised using similarity and de novo search approaches. A total of 5.6% of the A. darlingi genome is derived of TEs. Class I gypsy and copia were the most abundant superfamilies, corresponding to 22.36% of the mobilome. Non-LTR elements of the R1 and Jockey superfamilies account for 11% of the TEs. Among Class II TEs, the mariner superfamily is the most abundant (16.01%). Approximately 87% of the A. darlingi mobilome consist of short, truncated and/or degenerated copies of TEs. Only three retrotransposons, two belonging to gypsy and one to copia superfamilies, are putatively active elements. Only one Class II element, belonging to the mariner superfamily, is putatively active, having 12 copies in the genome. The TE landscape of A. darlingi is formed primarily by degenerated elements and, therefore, somewhat stable. Future applications of TE-based vectors for genetic transformation of A. darlingi should take into consideration mariner and piggyBac transposons, because full length and putatively active copies of these elements are present in its genome.
Keywords: Transposable elements, transposons, mosquitoes, evolvability.
Introduction
The mobilome is the complete set of mobile genetic elements in a genome. In eukaryotes, it is constituted mainly by transposable elements (TEs) (Siefert, 2009), comprising about 45% of the human genome, 20% of D. melanogaster, and more than 50% of the maize genome (SanMiguel et al., 1996; Lander et al., 2001; Kaminker et al., 2002). Transposable elements are drivers of evolution, as a source of genetic variability, generally by promoting chromosome rearrangements, mutations in the coding or regulatory regions of genes, domestication and epigenetic alterations (reviewed in Hua-Van et al., 2011).
TEs are classified as autonomous elements when they are able to produce the enzymes necessary for their own mobilization, or as non-autonomous when they require enzymes produced by related autonomous elements for that activity. TEs have also been classified into two classes, namely RNA-mediated (Class-I) and DNA-mediated (Class-II) elements, according to their transposition mode. TEs often occur as remains or relics of old elements, which are not mobilizable any more. The combination of active, mobilizable, and remnants of TEs constitutes the TE landscape of a genome, which is characteristic of a species. For example, in humans, the LINE-1/L1-element is the only element that is presently active, while in Drosophila melanogaster 30% of the TEs are full length and potentially active (Kaminker et al., 2002). Closely related species can display distinct TE contents. For instance, TE contents vary from 2.7 to 23% among the genomes of 12 Drosophila species (Clark et al., 2007), and among Anopheles species it varies from 1.98 to 17.78% (Neafsey et al., 2015). Yet, a full, integral insight into the mobilome is not provided just by the proportion of a genome occupied by TEs and their classification. Also important is the identification of full length, putatively active elements. This aspect is particularly important for organisms that are potential candidates for genetic manipulation using transposon-based transgenic technologies. In these cases, the characterization of full length and active transposable elements is fundamental to estimate genomic stability and biosafety of the proposed products (Terenius et al., 2008). In genetically transformed organisms, the presence of active endogenous TEs similar to the one(s) used in the transformation vector(s) might interfere with the efficiency of transgene integration and transgene stability due to cross mobilization (Arensburger et al., 2011).
The classification and annotation of TEs is always a challenging task due to their remarkable diversity within and among genomes. TE copies recently inserted into a genome show low sequence variability, though with time passing, copies accumulate mutations, deletions, and/or insertions, becoming decayed TE remnants (Hua-Van et al., 2011; Hoen et al., 2015). Two main approaches are currently used for TEs identification and annotation. Homology-based methods search for sequences similar to known TEs compiled in databases. The de novo approach is based on the search for repetitiveness and structural signatures normally found in TEs (Hoen et al., 2015). New tools for mobilome scrutiny, exploration, and annotation warrant the re-analysis of previously described genomes (Kaminker et al., 2002). Fernández-Medina et al. (2011) who re-analyzed the mobilome of A. gambiae, found new TEs, described complete and potentially active elements, and characterized additional deleted, mutated, and probably inactive copies.
Anopheles darlingi is the principal Neotropical malaria vector, responsible for more than a million malaria cases per year (Oliveira-Ferreira et al., 2010). The genome of this mosquito was sequenced, annotated, and its mobilome described (Marinotti et al., 2013). In that study, TEs were annotated applying a homology-based method, using a “home-made” TE database. De novo search was used only to find MITEs and SINEs. In the present study, the Repbase database was used for a homology-based search, and the Repeatscout program was used for de novo searches. These improved approaches allowed us to advance our knowledge of the A. darlingi mobilome, and to revise the number and annotation of the identified TEs in its genome.
Material and Methods
A. darlingi transposable elements were identified following the pipeline shown in Figure 1. Blastn and tblastx (Altschul et al., 1997) were used to find similarities of A. darlingi genome sequences (GenBank accession number ADMH02000000) (Marinotti et al., 2013) with the TEs references of the Repbase database (version 18.01) (Jurka et al., 2005), considering e-values < 10e-10 as a cutoff to define a Blast “hit”. Redundancies representing hits at overlapping genomic positions, for different TEs, were considered as one hit for further analyses. Each target was expanded 5 kb on each side and searched for TIRs, LTR, TSD and conserved ORFs with the UGENE platform (Okonechnikov et al., 2012). The Censor software (Kohany et al., 2006) implemented with the Repbase database was used for the classification and annotation of TEs. For de novo searches, Repeatscout version 1.0.0 software (Price et al., 2005) was used. The obtained sequences were analyzed using UGENE to look for characteristics described previously (TIRs, LTRs, etc.) and to classify these transposable elements. After obtaining a full library by similarities and de novo searches, the genome was masked to determine the number of transposable elements using RepeatMasker (with -no_is -nolow options) (Smit et al., 2016). TEs were classified using Repbase (DNA, ERV, LTR, Non-LTR), and their number of copies and % of genome were calculated.
Many short retrieved sequences show similarities with known TEs sequences. These short sequences are mentioned in the text as “hits” and classified as belonging to the superfamily corresponding to the best blast hit (lowest e-value). Sequences longer than 200 bp were manually curated with the UGENE platform for the annotation of ORFs, TIR, LTRs, and TSDs. All ORFs were analyzed by Blastp, and those showing similarities with TEs-encoded proteins were manually annotated. The sequences with typical TE characteristics were designated as identifiable TEs (ITEs) and classified as: (1) complete elements - containing TIR (or LTRs) and complete ORFs; (2) degenerated – most often contain TIRs (or LTRs), however, the ORFs present mutations and deletions; (3) truncated - these elements show large deletions; (4) MITEs – short elements, having TIRs. When possible, sequences were assigned into families using Repbase; otherwise they were described as Unknown.
The repeat landscape of TEs found in the A. darlingi genome was constructed with the full TE dataset obtained using the RepeatMasker tool.
Results
The TEs identified in this study correspond to 5.6% of the A. darlingi genome (Table 1). Class I elements correspond to 2.64% and Class II to 2.44% of the genome. For Class I, the superfamilies gypsy and copia were the most abundant. Together, these superfamilies account for 22.36% of all TEs. Non-LTR elements of R1 and Jockey superfamilies compound, together, 11% of all TEs. The mariner superfamily is the most abundant among the Class II elements (16.01%). Endogenous retroviruses (ERVs) correspond to 8.23% of the TEs.
Table 1. Sequences showing significant hits with TEs identified in Anopheles darlingi genome.
Class | Order | Superfamily | Hit number | Occupied size (bp) | TE % | Genome % |
---|---|---|---|---|---|---|
I (2.64%) | LTR (1.40%) | Gypsy | 18007 | 1304425 | 11.27 | 0.75 |
Copia | 17890 | 1283868 | 11.09 | 0.74 | ||
BEL | 1262 | 114186 | 0.98 | 0.06 | ||
DIRS | 740 | 47004 | 0.41 | 0.03 | ||
Others | 2589 | 159289 | 1.37 | 0.09 | ||
Non-LTR (1.25%) | R1 | 7744 | 654295 | 5.65 | 0.37 | |
Jockey | 7685 | 614572 | 5.31 | 0.35 | ||
Tx1 | 4074 | 302983 | 2.62 | 0.17 | ||
L1 | 3944 | 256751 | 2.22 | 0.15 | ||
SINE2/tRNA | 2645 | 199096 | 1.72 | 0.11 | ||
Others | 7100 | 589270 | 5.09 | 0.34 | ||
II (2.44%) | Mariner/Tc1 | 11871 | 1852562 | 16.01 | 1.06 | |
hAT | 11167 | 723088 | 6.23 | 0.41 | ||
EnSpm/CACTA | 6961 | 474890 | 4.10 | 0.27 | ||
Polinton | 4207 | 268339 | 2.32 | 0.15 | ||
Helitron | 2552 | 186142 | 1.61 | 0.11 | ||
Others | 21620 | 1580047 | 13.65 | 0.91 | ||
ERVs (0.53%) | ERV1 | 5835 | 441984 | 3.82 | 0.25 | |
ERV2 | 5562 | 383990 | 3.32 | 0.22 | ||
ERV3 | 670 | 48140 | 0.41 | 0.03 | ||
ERV4 | 68 | 4117 | 0.03 | 0.01 | ||
Others | 1279 | 83874 | 0.72 | 0.04 | ||
TOTAL | 145472 | 11572912 | 100 | 6.65 | ||
TOTAL* | 5.67 |
without redundancy
A remarkable aspect of the A. darlingi mobilome is that it is composed mainly of very short sequences displaying significant similarities with TEs present in the used database. These hits are likely derived from extensively mutated and/or deleted TEs, lacking clearly identifiable TE structural features such as TIRs, LTRs, or TSD, etc.. The sequences containing identifiable TE structural features, ITEs, correspond to only 0.87% of the A. darlingi genome (Table 2). This value was obtained by multiplying the size of each element by copy number, which corresponds to 1.5 Mb. The entire A. darlingi genome was estimated to be 173.9 Mb (Marinotti et al., 2013). Thirty-six ITE elements were found, 26 belonging to the mariner family, one to the piggyBac family, one to kolobok family, two elements are from the gypsy family, one from the copia family, and four DNA/Unknown elements (Table 2). Short descriptions and sequences of each element are presented in List S1 (1.3MB, pdf) of the Supplementary Material.
Table 2. Identifiable TEs found in the An. darlingi genome.
Seq | TE Name | Superfamily | Copies | Size | Censor Hit | ID(%) | Score | Status | TIRs (bp) | TSD |
---|---|---|---|---|---|---|---|---|---|---|
1 | Mariner1-Andl | DNA/Mariner | 190 | 907 | Mariner-N2_SIn | 71 | 1769 | Degen. | - | TA |
2 | Mariner2-Andl | DNA/Mariner | 32 | 941 | ITmD37D_Ele1 | 68 | 1581 | Degen. | - | TA |
3 | Mariner3-Andl | DNA/Mariner | 28 | 1265 | Mariner-6_PBa | 75 | 3657 | Degen. | - | TA |
4 | Mariner4-Andl | DNA/Mariner | 44 | 1194 | Mariner-2_ACe | 79 | 5637 | Truncated | - | TA |
5 | Mariner5-Andl | DNA/Mariner | 91 | 890 | Mariner-2_AEc | 71 | 1376 | Degen. | - | TA |
6 | Mariner6-Andl | DNA/Mariner | 30 | 905 | Mariner-30_SIn | 77 | 2180 | Degen. | - | TA |
7 | Mariner7-Andl | DNA/Mariner | 99 | 912 | Mariner-1_DF | 65 | 274 | Degen. | - | TA |
8 | Mariner8-Andl | DNA/Mariner | 20 | 1673 | AeTango2 | 65 | 2023 | Truncated | - | TA |
9 | Mariner9-Andl | DNA/Mariner | 10 | 738 | CRMAR | 71 | 1749 | MITE | 223 | TA |
10 | Mariner10-Andl | DNA/Mariner | 2 | 489 | MARINER_CA | 72 | 1371 | MITE | 22 | TA |
11 | Mariner11-Andl | DNA/Mariner | 12 | 1664 | Mariner-8-Dan | 68 | 1317 | Put. Active | 230 | TA |
12 | Mariner12-Andl | DNA/Mariner | 46 | 1285 | ITmD37D_Ele3 | 65 | 894 | Degen. | 30 | TA |
13 | Mariner13-Andl | DNA/Mariner | 64 | 906 | Mariner_3_DF | 69 | 1521 | Degen. | 64 | TA |
14 | Mariner14-Andl | DNA/Mariner | 3 | 1321 | Mariner_16_Dan | 70 | 2061 | Degen. | 33 | TA |
15 | Mariner15-Andl | DNA/Mariner | 31 | 1220 | Mariner-16_DAn | 70 | 2390 | Degen. | 23 | TA |
16 | Mariner16-Andl | DNA/Mariner | 3 | 1471 | Mariner-3_DF | 70 | 376 | Degen. | 25 | TA |
17 | Mariner17-Andl | DNA/Mariner | 3 | 886 | Mariner-6_BM | 66 | 597 | Degen. | 29 | TA |
18 | Mariner18-Andl | DNA/Mariner | 42 | 1268 | Tc1-1_TCa | 68 | 331 | Degen. | 215/225 | TA |
19 | Mariner19-Andl | DNA/Mariner | 9 | 792 | Mariner-6_BM | 65 | 539 | MITE | 17 | TA |
20 | Mariner20-Andl | DNA/Mariner | 49 | 759 | Tx_mos | 65 | 577 | MITE | 25 | TA |
21 | Mariner21-Andl | DNA/Mariner | 1 | 796 | MARINER_CA | 71 | 894 | MITE | 30 | TA |
22 | Mariner22-Andl | DNA/Mariner | 2 | 1755 | MARINER_CA | 68 | 1673 | Degen. | 25 | TA |
23 | Mariner23-Andl | DNA/Mariner | 200 | 1212 | Tx_mos | 63 | 532 | Degen. | - | TA |
24 | Mariner24-Andl | DNA/Mariner | 100 | 394 | - | - | - | Degen. | - | TA |
25 | Mariner25-Andl | DNA/Mariner | 100 | 328 | - | - | - | Degen. | - | TA |
26 | Mariner26-Andl | DNA/Mariner | 21 | 702 | Mariner-58_Ccri | - | - | MITE | 28 | TA |
27 | DNAUnknown-Andl1 | DNA/Unknown | 100 | 555 | Degen. | |||||
28 | DNAUnknown-Andl2 | DNA/Unknown | 100 | 234 | - | - | - | Degen. | - | - |
29 | DNAUnknown-Andl3 | DNA/Unknown | 60 | 514 | - | - | - | Degen. | - | - |
30 | DNAUnknown-Andl4 | DNA/Unknown | 40 | 478 | - | - | - | Degen. | - | TTAA |
31 | PiggyBac1_Andl | DNA/piggyBac | 27 | 2954 | piggyBac-1_DBi | 67 | 2143 | Truncated | 19 | TTAA |
32 | Kolobok1-Andl | DNA/Kolobok | 47 | 751 | Kolobok-N1_Dan | 76 | 349 | Degen. | 14 | TTAA |
33 | Helitron1_Andl | DNA/Helitron | 85 | 1327 | Helitron-2_DBp | 76 | 275 | Degen. | - | A/T |
34 | Gypsy1_Andl | LTR/Gypsy | 1 | 5366 | Gypsy-625_AA-I | 67 | 7288 | Put. Active | 210/208 | - |
35 | Gypsy2_Andl | LTR/Gypsy | 1 | 4325 | GYPSY36-I_AG | 68 | 6928 | Put. Active | 167/166 | - |
36 | Copia1_Andl | LTR/Copia | 1 | 4294 | Copia-70_AA-I | 63 | 2624 | Put. Active | 184/202 | - |
mariner elements are predominantly degenerated, and 17 elements were classified with this status. The copy number of these degenerated mariner elements range from 2 to 200 copies, and their sizes vary from 1755 to 889 bp. Six mariner elements were classified as MITEs, ranging from 489 to 796 bp, and are represented by 1 to 49 copies. Two mariner elements were classified as truncated. One putatively active mariner element was identified. It is represented by 12 copies in the genome, has a length of 1664 bp, and long TIRs with 230 bp.
Four degenerated elements were classified as DNA/Unknown because they have characteristics of class II elements, but their similarities with known TEs are not high enough for their classification into known families. The copy number of these elements is generally high, ranging from 40 to more than 100 copies in the genome. Their sizes range from 234 to 555 bp, suggesting they are truncated elements.
A truncated piggyBac element of 2954 bp with a 19 bp TIR was found, with a total of 27 copies. Degenerated kolobok and helitron elements were also found, with 47 and 85 copies, respectively.
Only three Class I putatively active ITEs were found. Two elements are from the gypsy family, having lengths of 5366 and 4325 bp, with LTRs of 210 and 167 bp, respectively. The third element, belonging to the copia family, is 4294 bp in length and has LTRs with 184/202 bp. Only one copy of each of these retrotransposons was found in the A. darlingi genome.
Analysis of nucleotide divergence among the different copies of the found elements allowed to depict a general landscape of the mobilome. The level of Kimura substitution observed among the analyzed sequences is generally high (Figure 2). Few copies of the elements are well conserved, indicating that only few elements are active or have been recently mobilized or duplicated. So the mobilome of A. darlingi is constituted mainly of remains of degraded and ancient elements. In comparison, the proportion of copies showing high similarities is greater in A. gambiae than in A. darlingi (Figure 2), suggesting the presence of active or more recently mobilized elements in A. gambiae.
Discussion
The results of this study show that a higher proportion of the A. darlingi genome (5.61%) is occupied by TEs than the previously reported 2.29% (Marinotti et al., 2013). For other organisms, re-analyses of mobilomes using updated tools and databases also resulted in distinct results. For example, the genomes of Drosophila species and A. gambiae have been re-analyzed resulting in improved descriptions of their mobilomes (Kaminker et al., 2002; Fernández-Medina et al., 2011, Neafsey et al., 2015). Neafsey et al. (2015) compared the genomes of 16 Anopheles species and found that A. albimanus was the mosquito with the genome occupied by the lowest proportion of TEs (1.98%), followed by A. cristyi (2.81%). The Anopheline species with the highest content of TEs in its genome is A. gambiae (17.78%). The authors also reported a direct correlation between genome size and TE content, with species showing smaller genomes tending to have lower TE contents. Similar correlations between genome size and TE content have also been found for other taxa (Sessegolo et al., 2016). The genome size of A. darlingi is among the smallest among the sequenced Anopheles species, and similar to those found in A. albimanus and A. cristyi [≈ 180 Mb] (Table S1 (129.3KB, pdf) ). While the TE content found in those species is around 2%, our re-analyses increased the A. darlingi TE content from 2.29% to 5.61%, a value comparable to those of Anopheline mosquitoes with larger genome sizes (≈ 220 Mb). However, the direct correlation between TE content and genome size has outliers. For example, although A. quadriannulatus and A. gambiae have genomes with similar sizes, the TE content of the former is less than half of that of the latter (Neafsey et al., 2015, see also Table S1 (129.3KB, pdf) ). Although there is variation, in Anopheline mosquitoes, Class I TEs are generally more abundant then those of Class II, as observed also in the present analysis for An. darlingi (Table S2 (120KB, pdf) ).
The increased A. darlingi TE content reported in this study, compared to the previous description (Marinotti et al., 2013; Neafsey et al., 2015) is the result of an approach that utilized improved tools and databases. The database used for homology searches in the present study is larger than the one applied in the previous analysis. Also, it is due to the inclusion, in the present report, of short fragments derived from degenerated TEs. The overall proportional representations of the different TE superfamilies in A. darlingi were maintained between the present and previous analyses (Marinotti et al., 2013), with gypsy as the most abundant LTR element and mariner elements as the most abundant ones among the DNA transposons. However, some differences were seen for other superfamilies. For example, copia was the second most abundant superfamily observed in this study, but only 0.9% of copia elements were registered in the Marinotti et al. (2013) study.
Active or mobilizable elements are associated with evolvability of species and their capacity to environmental adaptation (Fablet and Vieira, 2011; Casacuberta and González, 2013). The TE landscape of A. darlingi is predominantly formed by degenerated elements, contrasting with species such as A. gambiae and D. melanogaster, which harbor a larger number of potentially active elements. In Drosophila melanogaster, 80% of spontaneous mutations are promoted by TE mobilizations, making it an important source of genetic variability (García Guerreiro, 2012). In contrast, only four putatively active elements were found in the assembled A. darlingi genome. It is also remarkable that the copy number found for putative active retrotransposon is very low; only one copy of each element has been identified. Only the putative active mariner element (Mariner11-Andl) has a higher copy number (12 copies). Judging from this landscape structure, the genome of A. darlingi is likely to be stable. Few TEs are capable of, or prone to respond to environmental stressors and likely to promote an increase in mutability.
As a recommendation for future uses of transposable elements as vectors for genetic transformation of A. darlingi, attention should be given to the use of mariner elements, as it was the only putatively active DNA TE found in the sequenced genome. piggyBac elements also deserve certain attention, because a truncated element is present in the genome. The use of other Class II transposable elements is suggested as being safer, as functional elements are not present to promote cross mobilization.
In conclusion, the mobilome of A. darlingi is primarily occupied by degenerated elements, showing a minute number of active elements with small copy number, characteristic of a genome that is rather stable.
Acknowledgments
This study was supported by research grants and fellowships from CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico).
Supplementary material
The following online material is available for this article:
Footnotes
Associate Editor: Carlos F.M. Menck
Conflict of Intererest
The authors have declared that no competing interests exist.
Author contributions
ELSL, MFO, ATRV and OM conceived and designed the study; JFD and MFO conducted the bioinformatics analysis; ELSL, JDD, MFO, ATRV and OM wrote the manuscript. All authors read and approved the final version.
References
- Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]; Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W and Lipman DJ (1997) Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res 25:3389-3402. [DOI] [PMC free article] [PubMed]
- Arensburger P, Hice RH, Wright JA, Craig NL, Atkinson PW. The mosquito Aedes aegypti has a large genome size and high transposable element load but contains a low proportion of transposon-specific piRNAs. BMC Genomics. 2011;12:606. doi: 10.1186/1471-2164-12-606. [DOI] [PMC free article] [PubMed] [Google Scholar]; Arensburger P, Hice RH, Wright JA, Craig NL and Atkinson PW (2011) The mosquito Aedes aegypti has a large genome size and high transposable element load but contains a low proportion of transposon-specific piRNAs. BMC Genomics 12:606. [DOI] [PMC free article] [PubMed]
- Casacuberta E, González J. The impact of transposable elements in environmental adaptation. Mol Ecol. 2013;22:1503–1517. doi: 10.1111/mec.12170. [DOI] [PubMed] [Google Scholar]; Casacuberta E and González J (2013) The impact of transposable elements in environmental adaptation. Mol Ecol 22:1503-1517. [DOI] [PubMed]
- Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, Markow TA, Kaufman T, Kellis M, Gelbart W, Iye VN, et al. Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007;450:203–218. doi: 10.1038/nature06341. [DOI] [PubMed] [Google Scholar]; Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, Markow TA, Kaufman T, Kellis M, Gelbart W, Iye VN et al. (2007) Evolution of genes and genomes on the Drosophila phylogeny. Nature 450:203-218. [DOI] [PubMed]
- Fablet M, Vieira C. Evolvability, epigenetics and transposable elements. BioMol Concepts. 2011;2:333–341. doi: 10.1515/BMC.2011.035. [DOI] [PubMed] [Google Scholar]; Fablet M and Vieira C (2011) Evolvability, epigenetics and transposable elements. BioMol Concepts 2:333-341. [DOI] [PubMed]
- Fernández-Medina RD, Struchiner CJ, Ribeiro JM. Novel transposable elements from Anopheles gambiae. BMC Genomics. 2011;12:260. doi: 10.1186/1471-2164-12-260. [DOI] [PMC free article] [PubMed] [Google Scholar]; Fernández-Medina RD, Struchiner CJ and Ribeiro JM (2011) Novel transposable elements from Anopheles gambiae. BMC Genomics 12:260. [DOI] [PMC free article] [PubMed]
- García-Guerreiro MP. What makes transposable elements move in the Drosophila genome? Heredity. 2012;108:461–468. doi: 10.1038/hdy.2011.89. [DOI] [PMC free article] [PubMed] [Google Scholar]; García-Guerreiro MP (2012) What makes transposable elements move in the Drosophila genome? Heredity 108:461-468. [DOI] [PMC free article] [PubMed]
- Hoen DR, Hickey G, Bourque G, Casacuberta J, Cordaux R, Feschotte C, Fiston-Lavier AS, Hua-Van A, Hubley R, Kapusta A, et al. A call for benchmarking transposable element annotation methods. Mobile DNA. 2015;6:13. doi: 10.1186/s13100-015-0044-6. [DOI] [PMC free article] [PubMed] [Google Scholar]; Hoen DR, Hickey G, Bourque G, Casacuberta J, Cordaux R, Feschotte C, Fiston-Lavier AS, Hua-Van A, Hubley R and Kapusta A et al. (2015) A call for benchmarking transposable element annotation methods. Mobile DNA 6:13. [DOI] [PMC free article] [PubMed]
- Hua-Van A, Le Rouzic A, Boutin TS, Fillee J, Capy P. The struggle for life of the genome’s selfish architects. Biol Direct. 2011;6:19. doi: 10.1186/1745-6150-6-19. [DOI] [PMC free article] [PubMed] [Google Scholar]; Hua-Van A, Le Rouzic A, Boutin TS, Fillee J and Capy P (2011) The struggle for life of the genome’s selfish architects. Biol Direct 6:19. [DOI] [PMC free article] [PubMed]
- Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Wali-Chiewicz J. Repbase update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 2005;110:462–467. doi: 10.1159/000084979. [DOI] [PubMed] [Google Scholar]; Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O and Wali-Chiewicz J (2005) Repbase update, a database of eukaryotic repetitive elements. Cytogenet Genome Res 110:462–467. [DOI] [PubMed]
- Kaminker JS, Bergman CM, Kronmiller B, Carlson J, Svirskas R, Patel S, Frise E, Wheeler DA, Lewis SE, Rubin GM, et al. The transposable elements of the Drosophila melanogaster euchromatin: A genomics perspective. Genome Biol. 2002;3:1–20. doi: 10.1186/gb-2002-3-12-research0084. [DOI] [PMC free article] [PubMed] [Google Scholar]; Kaminker JS, Bergman CM, Kronmiller B, Carlson J, Svirskas R, Patel S, Frise E, Wheeler DA, Lewis SE, Rubin GM et al. (2002) The transposable elements of the Drosophila melanogaster euchromatin: A genomics perspective. Genome Biol 3:1-20. [DOI] [PMC free article] [PubMed]
- Kohany O, Gentles AJ, Hankus L, Jurka J. Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor. BMC Bioinformatics. 2006;25:474. doi: 10.1186/1471-2105-7-474. [DOI] [PMC free article] [PubMed] [Google Scholar]; Kohany O, Gentles AJ, Hankus L and Jurka J (2006) Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor. BMC Bioinformatics 25:474. [DOI] [PMC free article] [PubMed]
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]; Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W et al. (2001) Initial sequencing and analysis of the human genome. Nature. 409:860-921. [DOI] [PubMed]
- Marinotti O, Cerqueira GC, de Almeida LG, Ferro MI, Loreto EL, Zaha A, Teixeira SM, Wespiser AR, Almeida E, Silva A, et al. The genome of Anopheles darlingi, the main Neotropical malaria vector. Nucleic Acids Research. 2013;41:7387–7400. doi: 10.1093/nar/gkt484. [DOI] [PMC free article] [PubMed] [Google Scholar]; Marinotti O, Cerqueira GC, de Almeida LG, Ferro MI, Loreto EL, Zaha A, Teixeira SM, Wespiser AR, Almeida E, Silva A et al. (2013) The genome of Anopheles darlingi, the main Neotropical malaria vector. Nucleic Acids Research 41:7387-7400. [DOI] [PMC free article] [PubMed]
- Neafsey DE, Waterhouse RM, Abai MR, Aganezov SS, Alekseyev MA, Allen JE, Amon J, Arcá B, Arensburger P, Artemov G, et al. Mosquito genomics. Highly evolvable malaria vectors: The genomes of 16 Anopheles mosquitoes. Science. 2015;347:1258522. doi: 10.1126/science.1258522. [DOI] [PMC free article] [PubMed] [Google Scholar]; Neafsey DE, Waterhouse RM, Abai MR, Aganezov SS, Alekseyev MA, Allen JE, Amon J, Arcá B, Arensburger P, Artemov G et al. (2015) Mosquito genomics. Highly evolvable malaria vectors: The genomes of 16 Anopheles mosquitoes. Science 347:1258522. [DOI] [PMC free article] [PubMed]
- Okonechnikov K, Golosova O, Fursov M. The UGENE team. Unipro UGENE: A unified bioinformatics toolkit. Bioinformatics. 2012;28:1166–1167. doi: 10.1093/bioinformatics/bts091. [DOI] [PubMed] [Google Scholar]; Okonechnikov K, Golosova O and Fursov M (2012) The UGENE team. Unipro UGENE: A unified bioinformatics toolkit. Bioinformatics 28:1166-1167. [DOI] [PubMed]
- Oliveira-Ferreira J, Lacerda MV, Brasil P, Ladislau JL, Tauil PL, Daniel-Ribeiro CT. Malaria in Brazil: An overview. Malar J. 2010;9:115. doi: 10.1186/1475-2875-9-115. [DOI] [PMC free article] [PubMed] [Google Scholar]; Oliveira-Ferreira J, Lacerda MV, Brasil P, Ladislau JL, Tauil PL and Daniel-Ribeiro CT (2010) Malaria in Brazil: An overview. Malar J 9:115. [DOI] [PMC free article] [PubMed]
- Price AL, Jones NC, Pevzner PA. De novo identification of repeat families in large genomes. Bioinformatics Suppl. 2005;1:i351–8. doi: 10.1093/bioinformatics/bti1018. [DOI] [PubMed] [Google Scholar]; Price AL, Jones NC and Pevzner PA (2005) De novo identification of repeat families in large genomes. Bioinformatics Suppl 1:i351-8. [DOI] [PubMed]
- SanMiguel P, Tikhonov A, Jin YK, Motchoulskaia N, Zakharov D, Melake-Berhan A, Springer PS, Edwards KJ, Lee M, Avramova Z, et al. Nested retrotransposons in the intergenic regions of the maize genome. Science. 1996;274:765–768. doi: 10.1126/science.274.5288.765. [DOI] [PubMed] [Google Scholar]; SanMiguel P, Tikhonov A, Jin YK, Motchoulskaia N, Zakharov D, Melake-Berhan A, Springer PS, EdwardsKJ, Lee M, Avramova Z et al. (1996) Nested retrotransposons in the intergenic regions of the maize genome. Science 274:765-768. [DOI] [PubMed]
- Sessegolo C, Burlet N, Haudry A. Strong phylogenetic inertia on genome size and transposable element content among 26 species of flies. Biol Lett. 2016;12:20160407. doi: 10.1098/rsbl.2016.0407. [DOI] [PMC free article] [PubMed] [Google Scholar]; Sessegolo C, Burlet N and Haudry A (2016) Strong phylogenetic inertia on genome size and transposable element content among 26 species of flies. Biol Lett 12:20160407. [DOI] [PMC free article] [PubMed]
- Siefert JL. Defining the mobilome. Methods Mol Biol. 2009;532:13–27. doi: 10.1007/978-1-60327-853-9_2. [DOI] [PubMed] [Google Scholar]; Siefert JL (2009) Defining the mobilome. Methods Mol Biol 532:13-27. [DOI] [PubMed]
- Smit AFA, Hubley R, Green P. 2016. RepeatMasker Open-4.0, http://www.repeatmasker.org.; Smit AFA, Hubley R and Green P (2016) RepeatMasker Open-4.0, http://www.repeatmasker.org.
- Terenius O, Marinotti O, Sieglaff D, James AA. Molecular genetic manipulation of vector mosquitoes. Cell Host Microbe. 2008;4:418–423. doi: 10.1016/j.chom.2008.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]; Terenius O, Marinotti O, Sieglaff D and James AA (2008) Molecular genetic manipulation of vector mosquitoes. Cell Host Microbe 4:418- 423. [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.