Skip to main content
PLOS Biology logoLink to PLOS Biology
. 2017 Feb 15;15(2):e2001536. doi: 10.1371/journal.pbio.2001536

Ecological networks to unravel the routes to horizontal transposon transfers

Samuel Venner 1,2,*, Vincent Miele 1, Christophe Terzian 2,3,4, Christian Biémont 1, Vincent Daubin 1,2, Cédric Feschotte 5, Dominique Pontier 1,2
PMCID: PMC5331948  PMID: 28199335

Abstract

Transposable elements (TEs) represent the single largest component of numerous eukaryotic genomes, and their activity and dispersal constitute an important force fostering evolutionary innovation. The horizontal transfer of TEs (HTT) between eukaryotic species is a common and widespread phenomenon that has had a profound impact on TE dynamics and, consequently, on the evolutionary trajectory of many species' lineages. However, the mechanisms promoting HTT remain largely unknown. In this article, we argue that network theory combined with functional ecology provides a robust conceptual framework and tools to delineate how complex interactions between diverse organisms may act in synergy to promote HTTs.

Introduction

Horizontal DNA transfer, or the passage of genetic material between organisms by means other than reproduction, while commonly observed in bacteria [1,2], has long been considered rare between multicellular eukaryotic species, with negligible impact on their evolution [3]. However, an increasing number of recent studies, in part fueled by the exponential growth of genome sequencing, have revealed that the transfer of genetic material between multicellular eukaryotes has occurred more commonly than previously appreciated (e.g., [49]; for recent reviews, see [2,10,11]). Among the well-documented cases of horizontal transfer between multicellular eukaryotes, those involving transposable elements (TEs) are by far the most common (for reviews, [5,12,13]). The propensity for TEs relative to non-TE sequences to undergo horizontal transfer may in part be attributed to their inherent mobility [5,14] and to their capacity for rapid genomic amplification following their introduction; this would facilitate the spread of these elements in populations even in the absence of an immediate fitness advantage to the host [15]. In fact, the horizontal transfer of TEs (hereafter “HTT”) can be viewed as a crucial process for the maintenance and propagation of TEs in eukaryotic genomes [5,16]. Consistent with this view, examples of HTT have rapidly accumulated in the literature over the past decade. Notably, a considerable number of HTT events have been reported among Drosophila species, in part because Drosophila represents a prominent model in evolutionary genetics [17] and also because the large number of genomic data available makes them particularly amenable to the development and application of robust statistical approaches to detect HTTs [18]. However, recent studies have uncovered solid cases of HTTs in an increasingly wide range of eukaryotic species as reviewed previously [5,12,13] and illustrated more recently by a flurry of new cases involving a variety of invertebrates, vertebrates, plants, and some of their parasites [1925]. Hence, a wide range of species and all major types of TEs are known to be implicated, regardless of the diversity of their structures and transposition mechanisms [5,12,13,24,25]. Since TEs represent a major component of the nuclear genome of multicellular eukaryotes and an important source of genetic variation catalyzing evolutionary innovation, HTT should be regarded as a pivotal force in eukaryotic genome evolution (Box 1).

Box 1. HTT as a fundamental step in the life cycle of transposable elements

Transposable elements (TEs) represent the single largest component of many large eukaryotic genomes, accounting for at least half of the human genome and an even greater fraction of other complex genomes [2628]. While these elements are best described as genomic parasites [29,30], their accumulation and movement are now recognized as a prolific source of mutation and genetic rearrangements, greatly influencing the evolutionary trajectory of their host species and organismal evolution (for reviews, see [14,26,3135]). As nearly ubiquitous components of eukaryotic chromosomes, TEs are transmitted vertically, i.e., from parents to offspring. However, it has long been appreciated that their persistence over vast evolutionary eons implies an ability to cross species boundaries and invade new genomes through horizontal transmission [5,16]. Following a single horizontal transfer event, a TE copy may rapidly spread through the recipient host population by means of high transposition activity combined with vertical transmission [36]. This initial expanding phase in a new population seems to be crucial because it sets the number of TE copies and their location in the host genome; this provides the breeding ground for future genetic alterations that can impact the evolutionary trajectory of the recipient TE-host species. Constantly high TE transposition activity and/or an ever-increasing abundance of active TE copies in the recipient species is expected, however, to result in excessive genomic instability, which ultimately will be incompatible with the survival of the host individual [15]. Consequently, TE activity is silenced by a variety of host-encoded strategies (such as RNA interference and other small RNA-based mechanisms, DNA methylation, histone modifications, and chromatin changes) as well as self-regulatory mechanisms [3739]. In the face of these mechanisms, empirical and theoretical studies have shown that, in the absence of natural selection acting at the host level to maintain transposition activity, the frequency of active TE copies is bound to decrease in the population and that of defective copies to increase, eventually leading to the extinction of the entire TE family [15,40,41]. Horizontal transfer represents one mechanism by which TEs can escape such extinction by providing an opportunity to colonize new host genomes and repeat the cycle [5,16].

TEs are classified into different families according to their transposition mode or genetic structure (see glossary), and some of them seem to have greater aptitude for HTT than others [42]. Some long-terminal-repeat (LTR) retrotransposons (e.g., the gypsy element in Drosophila), like retroviruses, are capable of producing a functional envelope protein [43,44] that gives them an intrinsic ability to infect new cells [44]. Nevertheless, HTTs have been shown to occur for all major TE types, irrespective of their ability to encode envelope proteins [5,13].

The biological factors and cellular mechanisms promoting HTT in eukaryotes remain poorly understood. There is a growing body of evidence pointing at the role of parasites and pathogens (e.g., viruses, bacteria, or macroparasites [ecto- or endoparasites]) in facilitating HTT [17,22,24,4548]. Yet, to our knowledge, no published attempt has been made to provide a robust framework to synthesize and integrate genomic and ecological data in order to illuminate how complex biological interactions between organisms may promote HTT. Here, we argue that network theory is a powerful approach to characterize the dynamics and disentangle the forces underlying HTT. Network theory delivers a set of tools to effectively model complex systems (i.e., composed of interacting entities) and to analyze their emergent properties, as utilized in physics, social sciences, ecology, and, more recently, in cell biology [4951]. In the area of genomics, we argue that complex systems formed by organisms having complementary properties and working synergistically to support HTT can be formalized using ecological networks, which represent complex interactions between organisms within ecological communities. The emergent property of the network is the shared presence of TE acquired through HTT in otherwise unrelated eukaryotic genomes. Network analysis would thus allow deciphering which organisms and which of their interactions are prone to promote HTT and thereby play a key role in the evolutionary dynamics and maintenance of TEs.

Current approaches to study HTT

Successful HTTs between multicellular and sexual eukaryotic species generally require that (1) one copy of a TE from a donor species reaches the germ line of an individual of the recipient species and is integrated into its genome, (2) germ cells integrating new TE copies produce fully functional gametes, and (3) TE spreads within the population of the recipient species through further transposition into the host genome and vertical transmission of newly formed copies. The successful fixation of the TE in the novel host genome site depends on both genetic drift and selective processes [52]. Note that a TE need not be fixed in the population to reveal HTT. In fact, polymorphic TE insertions are likely to indicate more recent HTT and hence be most relevant for illuminating extent ecological links. Intuitively, the probability for all these steps to be achieved must be extremely low, yet unequivocal cases of HTTs have now been well documented (see references above). The participation of intermediate biological vectors is often evoked to explain HTT across widely diverged species, but we note that the direct transfer of nucleic acids or nucleoprotein complexes, either as free molecules or packaged in extracellular vesicles, is also conceivable (see Box 1) [5,13].

The great majority of studies investigating HTT have been “species-centric”, focusing either on the identification of organisms that carry TEs or on the type of species interactions promoting HTTs. Many investigations have concentrated on parasitic or symbiotic microorganisms (viruses, bacteria, endosymbiotic bacteria, and unicellular eukaryotes) that are most commonly evoked as TE vectors because of their established propensity to transduce and recombine genetic material from their host [7,4351]. Among these potential TE vectors, viruses appear particularly suitable because of their defining ability to enter and exit cells, their propensity to capture and deliver genetic material from and to their host genome, and their capacity to infect germ cell lineages or their precursors [43,5254]. Similarly, bacteriophages are considered a major source of gene transfer in bacteria [2,53]. Large double-stranded DNA viruses represent the most outstanding candidates for facilitating HTTs among eukaryotes, as suggested by numerous reports of TEs clearly derived from a eukaryotic host integrated in their genomes [5,12,13,42,46,5460]. RNA viruses might also promote HTT when TE RNAs are encapsidated and copackaged along with viral genomic RNA [61]. Likewise, “virus-like particles” created by long-terminal-repeat (LTR) retrotransposons and endogenous retroviruses, which are well characterized in vertebrates and have been also detected in insects [62], can enter recipient cells and be transmitted to other organisms (see S1 Text for an expanded discussion).

Other investigations have aimed at identifying the routes for HTT among phylogenetically distant species by building comparative analyses of genomic composition of TEs across species. These studies have implicated macroparasites (e.g., flatworms, filarial nematodes, strepsipteran insects, and blood-feeding triatomine bugs, ticks, and lampreys) as facilitating the passage of various TEs between distantly related hosts [20,22,45,47,59,6365], but predator–prey interactions may also establish a route for HTT [20].

Taken together, these studies suggest that HTT is promoted by various ecological interactions between a wide diversity of organisms. Faced with this complexity and the ever-growing amount of genome sequence data for a wide range of organisms, it has become a necessity to develop a conceptual framework to disentangle the relative importance of the factors and processes underlying HTT. In complement to ecological theories of biodiversity already considered in previous studies (see [66] for a review), we propose that network theory combined with functional ecology provides an adequate conceptual framework and a toolbox to formalize and analyze, from large datasets, the multiplicity of mechanisms and routes underlying HTTs.

A functional ecology perspective on HTT

Functional ecology concentrates on the functional roles of species in the community by focusing on their traits and by analyzing their impact on community dynamics or ecosystem processes. Using networks in a functional ecology approach to understand HTT requires defining the functions necessary to ensure HTT and identifying different classes of organisms that might fulfill these complementary functions (based on their traits) and act synergistically.

Which requirements for HTT?

We identify three complementary functions that must be fulfilled for successful HTT (Fig 1):

Fig 1. Requirements for HTTs.

Fig 1

The figure presents the three complementary functions (defined in the main text) expected to modulate HTTs. The frames near the vertices of the triangle specify the properties required for organisms to ensure those functions and indicate the subdisciplines of biology for identifying them. The triangle allows viewing different gradients along which could be positioned different organisms involved in HTTs. Some eukaryotic species, like bats, would be particularly good TE reservoirs. Other species, like triatomine bugs, would be efficient both in the role of TE reservoir and as ecological connectors and might consequently operate as large hubs in TE dynamics. Other organisms, like DNA or RNA viruses, seem to have the necessary requirements for being efficient “TE molecular vehicles.” Poxviruses, in some circumstances, seem able to play the three functions, alone ensuring the HTT between ecologically close eukaryotic species.

  1. “Molecular vehicle function” reflects the ability to capture TEs in the genome of a donor species and transmit them to the genome of a recipient species. Different types of viruses and intracellular microparasites of eukaryotic cells exhibit highly suitable characteristics to act as molecular vehicles (see S1 Text).

  2. “Reservoir function” corresponds to the acquisition and storage of TE copies within a given population or species over evolutionary time; this function would determine how long a species might act as a “launching platform” for new HTT events. High reservoir ability should be associated with high rate of TE acquisition (e.g., low capacity of the immune system to control the entry and replication of molecular vehicles and/or large population size [67]) and high TE proliferation/maintenance within the genome (e.g., large genome size, vast areas of “dispensable” DNA, and/or slow rates of substitution and deletion, which could help to preserve its intact copies). For example, mammalian genomes exhibit many of these characteristics.

  3. “Ecological connection function” represents the ecological link between different eukaryotic species. The intensity of each link and its direction reflect the frequency of interactions between individuals of the two species, the frequency and direction of transfers of “TE molecular vehicles” during those interactions. Recent studies suggest that host–parasite and prey–predator interactions are the most probable ecological connections involved in HTT [20,45,47,59].

Synergy between organisms promoting HTT

A wide variety of organisms (micro- and macro-organisms) may be involved in HTT. It is, however, highly unlikely that any single organism could efficiently fulfill the three functions defined above. On the one hand, intracellular microparasites such as viruses constitute excellent molecular vehicles for TEs (see S1 Text). However, TEs acquired by viruses from their host, like other nonessential DNA, may be expected to be rapidly removed from viral genomes owing to their fitness cost and the large effective population size of viruses [68]. Consequently, while viruses are likely involved in HTTs among diverse species (e.g., poxviruses, which are known to have a broad host range or to switch host frequently [56]), they seem inadequate to act as long-term TE reservoirs [60], that is, to “store” TEs in their genome for a long enough time to promote their transfer on a wide scale—i.e., between multiple, phylogenetically distant eukaryotic species. On the other hand, some macro-organisms (e.g., those with large genomes, slow mutation rates, and small effective population size, such as vertebrates) are good candidates to store active TEs over a long period and thus act as TE reservoirs. Furthermore, the ecological relationships between macro-organisms (between generalist macroparasites and their hosts or between generalist predators and their prey) can create links to establish a route for HTTs. However, macro-organisms alone (e.g., macroparasites, hosts, prey, or predators) would only be capable of delivering nonreplicative transposition intermediates (i.e., nucleoprotein complexes, partially degraded DNA, or DNA encapsulated in vesicles of cellular origins, such as exosomes) into the circulating fluid (e.g., blood, hemolymph, and sap) of a recipient species (their hosts, macroparasites, predators, or prey, respectively). Such intermediates are not typically self-replicative and therefore are unlikely to reach and infiltrate the germline of the recipient species by themselves. Consequently, macro-organisms alone may not be sufficient for successful HTT. Altogether, these limitations suggest that the most optimal path for HTT might require a complex interplay between very diverse organisms (e.g., both micro- and macro-organisms) acting synergistically to facilitate the process.

An example illustrating the intermingled action of a virus and a macroparasite in HTT is given by the peculiar case of the tripartite system composed of parasitoid braconid wasps, their symbiotic polydnaviruses, and their lepidopteran hosts. Polydnaviruses are integrated as proviruses in the wasp genome and produce viral particles in wasp ovaries that are injected into the lepidopteran host at the same time as wasp eggs, allowing wasp larvae to evade the immune response of their lepidopteran hosts [69]. Sequences representing diverse TEs (mariner-like, gypsy-like, Maverick-like, and DIRS-like elements) were detected in the genomes of polydnaviruses [54,70], suggesting that TEs originating from the wasp genome may be frequently copackaged and delivered to the lepidopteran cells via the virus-like particles produced by polydnaviruses [61,71].

More generally, vector-borne viruses (or microparasites) and their biological vectors could constitute ideal pairs of organisms to connect genomes of diversified host eukaryotic species. Here, the viruses, their intermediate vectors (e.g., ecto- or endomacroparasites), and their final host species would act in synergy by playing the respective functions of molecular vehicles, ecological connectors, and reservoirs. In a context where the viruses multiply within their vector and where the virus vector is in contact with numerous and diverse host species during its evolutionary trajectory (e.g., generalist), the virus vectors can simultaneously act as TE reservoirs and ecological connectors linking a wide range of eukaryotic species. Such macroparasites might consequently operate as large hubs for HTT. Such species may include blood-feeding parasites such as triatomine bugs, ticks, or lampreys, which are also common vectors of microparasites and have been found to share TEs nearly identical to those of some of their known vertebrate hosts [45,47,59]. Similarly, predator–prey relationships, combined with virus transmission, could play an important role in HTT. Like generalist vectors, generalist predators could accumulate TEs of various origins according to the diversity of consumed prey and to their capacity for infection and replication of viruses acquired from prey [58,7275].

In sum, we envision the process of HTT as supported by a complex system of interacting organisms, which act in synergy at many different time and spatial scales ranging from cellular processes to community dynamics. Next, we argue that network theory offers a powerful way of representing and characterizing such a complex system to better comprehend the process of HTT.

Network theory to unravel HTT

Fundamentally, a network provides a framework to model the pairwise links among a set of objects having contrasted properties and to explore the emergent properties at the scale of the whole system. We detail below how networks can be used to model HTTs (hereafter “HTT networks”), and we illustrate, from simulation, their emergent properties, i.e., the similarity of genomes in TE composition. Then, we identify near-term prospects for the construction of HTT networks from empirical data to improve our understanding of the dynamics of TE movements between eukaryotic species.

HTT network characteristics

Using network approach in the context of HTT involves defining three characteristics: (1) the network topology, which captures the diversity of organisms potentially involved in HTT as well as their functional roles and their links; (2) the flow within the network, i.e., the dynamic of TE propagation among the species, which is based on their synergistic action; and (3) the network's emergent properties (or the network state resulting from that dynamic process), which correspond here to the degree of similarity of genomes of distinct species in terms of TE composition, resulting from HTT.

Topology of HTT networks and properties of their elements

An HTT network can be represented as a set of nodes, each node corresponding to a TE reservoir, and edges, which map the connectivity between the nodes. Both nodes and edges can have multiple characteristics and might not be all weighted equally. Nodes can represent eukaryotic species or potentially prokaryote species as soon as they have abilities to maintain TEs in their genome for sufficient time to give future opportunities of HTT. Each node is characterized by its reservoir ability, i.e., the ability to maintain/amplify TEs, which is taken into account in our basic model by a parameter indicative of the maximal number of TE copies that the genome can carry (see S2 Text). The node connectivity depends on the number of pairwise links with other nodes and on the strength and direction of each link and therefore depends on the ecological relationships among reservoir organisms. These relationships can be asymmetric (e.g., a predator eats a prey) or symmetric (e.g., a virus infecting multiple host species). Furthermore, the strength of a link represents the density of “TE molecular vehicles” and the facility with which they transit along the link, capture, and deliver TEs. Finally, multiple types of HTT networks can be built to represent the variety of ecological contexts that can be encountered (see Box 2 for details).

Box 2. Various topologies for HTT networks

Network topology, which reflects species relationships and captures the properties of nodes and links, provides the foundation for modeling HTT. Networks with contrasting topologies can be built (see S1 Fig) [76]. Although they are presented separately here, they can be partially combined according to ecological contexts, to form more complex networks.

Fully connected (or complete) networks

Within these networks, all nodes are directly interconnected (i.e., any possible edge is present). In the context of HTT, this implies that the movement of TEs is not constrained by any particular ecological relationship (i.e., all species interact with each other in the same way). This type of network, although not realistic, reflects optimal conditions for TE spread and can be used to evaluate how ecological interactions impede and channel TE flows.

Random (or Erdös-Rényi) networks

Within these networks, the basic connectivity (number of links per node) follows a Poisson distribution. Node connectivities that strongly deviate from the average connectivity are extremely rare, and consequently, there are typically no hubs, i.e., nodes with very high connectivity relative to other ones. Such networks would be considered as null/neutral HTT networks reflecting no ecological structure.

Scale-free networks

The node connectivity follows a power-law distribution that is characterized by a relatively small number of highly connected nodes corresponding to hubs. A similar pattern may also be obtained from exponential connectivity distribution [77]. Such networks may reflect situations in which HTT is ensured by viruses that would play the dual role of ecological connectors between eukaryotes and TE molecular vehicles (e.g., airborne viruses). Hubs would correspond to eukaryotic species with a great capacity of “TE reservoir” particularly exposed to viruses coming from numerous species. These hubs would therefore play the major role in centralizing and disseminating TEs. For instance, some bats might act as a hub—in particular, the species that (1) can air-travel great distances and are in contact with multiple species (bats of various species prey heavily on insects that transmit viruses, while others eat vertebrates like frogs, rodents, birds, fish, or other bats or feed on the blood of other vertebrates), (2) harbor large loads of a variety of viruses [78] which are potentially molecular vehicles of TEs, (3) have a high TE content in their genome and thus a capacity to be a TE reservoir [79], and (4) share TEs with many other eukaryotic species [20,45,57,72,75].

Modular networks

These networks are composed by groups of nodes (“modules”) that have more connections within than between groups. These networks capture ecological interactions characterized by a partial isolation of species groups, with species strongly interconnected within each group. Such networks could account for spatial isolation of groups of species that evolve on different continents and between which occasional exchanges may occur (e.g., via migratory animals and their macroparasites/viruses) [22,45]. Modular networks might also be useful to model partial isolation between ecosystems and communities (e.g., marine and terrestrial communities/ecosystems). Indeed, a recent study [23] suggests that the horizontal spread of TEs is more likely to occur between aquatic species.

Bipartite networks

These networks model asymmetry of interactions between nodes and are often used in ecology in the context of plant–pollinator, prey–predator, or host–parasite interactions. They are defined by two disjoint sets of nodes with direct interactions between, but not within, sets. Such bipartite networks would capture the synergistic actions of the different organisms potentially involved in HTT. For example, one set of nodes may correspond to a macroparasite species, another one to their potential hosts, and the link between sets would be established by vector-borne viruses. Macroparasites may include blood-feeding triatomine bugs, ticks, or lampreys, which are common vectors of microparasites and have been found to share TEs nearly identical to those found in their vertebrate hosts [45,47,59].

Dynamic of TE flows within the HTT networks

Once defined, the network topology provides the foundation for modeling TE flows between nodes (i.e., organisms’ reservoirs of TEs). This step requires taking into account both the intragenomic dynamics of TEs (amplification dynamics and persistence of TEs within genomes, which constitute the source for further HTT) within each node and the dynamics of transfer along the network links. To explore the interplay between network topology and TE dynamics, we developed a basic model (detailed in S2 Text) based on probabilistic simulations of flows of different TE families in a network of eukaryotic species. The simulations provide some avenues for HTT network analysis.

Emergent properties of networks: Similarities in genomic TE composition

We explore the emergent property of TE flows from simulations. The simulation results can be synthetized by building a matrix crossing all pairs of genomes (network nodes), in which we indicate the similarity in TE composition between each pair of genomes/nodes based on the presence/absence of TE families within those genomes/nodes (Fig 2). The emergent property of TE flows within a network is therefore the expected degree of similarity between genomes (nodes) in terms of shared TEs acquired from horizontal transfers. Here, we choose the Jaccard similarity coefficient [80], which is equivalent to β-diversity, an index widely used in community ecology that was introduced by Whittaker [81], to measure the degree to which species composition differs (from the presence/absence of species) between communities, i.e., between different localities of a same region. By analogy, the “TE β-diversity” would quantify the degree of differentiation in TE composition between a pair of species belonging to the HTT network and thus will be used to build the “simulated TE β-diversity matrix,” hereafter called “simulated β-matrix” (the modeling procedure of TE flows implying several families are detailed in S2 Text).

Fig 2. The simulated β-matrix representative of the HTT network.

Fig 2

Panels A, C, and E represent a random, a scale-free, and a modular HTT network, respectively; panels B, D, and F represent the corresponding simulated β-matrices obtained after simulation of TE dynamics along the given random HTT network, with 20 species, 30 TE families, and 150 HTTs. Species (numbered 1 to 20) are ordered with a hierarchical clustering based on TE β-diversity. The heatmap scale is indicated from the grey gradient shown in panel B. The simulated β-matrices exhibit blocks of species of similar TE content (panels B, D, and F) that can be retrieved by an appropriate cut of the dendrogram (different colors are used for the leaves of the different subtrees induced by this cut). Interestingly, these blocks are topologically coherent in the HTT network (panels A, C, and E). Parameters of the model are given in S2 Text. Networks were represented with the R igraph package with the "nicely" layout.

The β-Matrix, a powerful tool for discriminating HTT networks

We show that the β-matrix constitutes a powerful tool for characterizing and discriminating HTT networks. First, using a characteristic case based on a random sparse, scale-free, and modular network (see Box 2), we show that the simulated β-matrices should allow the groups of connected species as defined by the structure of the original network to be recovered (Fig 2). Second, we performed a systematic simulation analysis in which we show that simulated β-matrices are stable for a given HTT network (Fig 3 shows a strong correlation between simulated β-matrices when there is no shuffled edge in the HTT network, i.e., when this network remains unchanged). Conversely, an increasing level of perturbation in network connectivity (i.e., an increasing number of shuffled edges in the network of reference) leads to a decreased correlation between simulated β-matrices (Fig 3). Different HTT networks give rise to different β-matrices, which provide powerful tools to discriminate even among HTT networks close in their topology. Together, these results strongly suggest that it will be possible to reconstruct the topology of a HTT network from genomic data collected in the different species included in the network.

Fig 3. The simulated β-matrix as an efficient tool to discriminate among HTT networks.

Fig 3

Distribution of Mantel correlation coefficient (mean +/− standard deviation [SD]) between a reference β-matrix obtained for a given HTT network and a β-matrix obtained on a disturbed HTT network (perturbation is expressed by the number of edge shuffles [x-axis] in the given HTT network; no shuffled edge indicates that the network is unchanged). We used the same simulation settings as in Fig 2. The procedure was repeated for 50 random scale-free HTT networks and 10 replicates of each number of edge shuffles. Similar results were obtained by reducing the HTT rate to transposition rate ratio (see S1 Fig). The pattern is similar when using lower numbers of successful HTT; however, the level of correlation increases with the number of HTTs within the network (see S2 Fig).

Perspectives to promote the development of HTT network approaches

Reconstruction of an HTT network from genomic data

Reconstructing an HTT network from genomic data should allow the identification of key components, species, and/or links characterizing the propagation dynamics of TEs. The determination of the “empirical β-matrix” will be the first step for reconstruction of one (or more) plausible HTT networks. As for the simulated TE β-diversity, the empirical TE β-diversity between pairs of species will be calculated from the presence/absence of various TE families. Typically, this can be done based on genome sequences acquired from next-generation sequencing approaches. Because shared TE families can arise from vertical descent, especially when closely related species are under consideration, one will need to choose a similarity criterion for the grouping of TEs in a family that accounts for the expected divergence among the genomes considered [5,12]. In general, more stringent similarity criteria will reveal more recent HTTs and hence be more readily interpretable in terms of the extant network. Relaxing the similarity criterion for detecting HTTs bears the risk of including more ancient links or links to unknown related species rather than those under consideration and hence would introduce spurious links in the network.

Once the empirical β-matrix is determined, the reconstruction of the topology of the corresponding most likely HTT networks would greatly benefit from the many methodological advances in other disciplines (e.g., physics, social science, neurobiology, and community ecology) (see Box 3 for details). One major difficulty of the HTT network reconstruction can be related to the fact that some key species are missing (e.g., species not sampled or extinct). We note, however, that specific methods exist to detect hidden nodes [82]. In our context, the hidden nodes would correspond to reservoir organisms whose incorporation in the network topology would help to recover the empirical β-matrix. Their detection through the network analysis would provide a useful guide for future effort to uncover novel players (e.g., virus and bacterium) with a major role in HTTs.

Box 3. Network reconstruction methods

Knowing the presence/absence of each TE family and the empirical β-matrix, the challenge remains intact regarding the reconstruction (or inference) of the HTT network. Meanwhile, there is room for new methodological developments that could take their inspiration from methods developed in other domains. Indeed, this “reverse-engineering” problem could benefit from the cross-fertilization of ideas developed to reconstruct different kinds of networks (neuroscience with brain networks [85], bioinformatics with gene regulatory networks [86], ecology with food webs [87], or network science [88]). We propose here to pave the way for future developments by mentioning some ideas:

  • Simple measures of correlation between nodes are often used to infer an initial version of a network, which can be completed or modified by expert knowledge. Here, since any element of the β-matrix contains a measure of β-diversity in TE, the matrix can be used as it is to infer putative edges (see [89] for a discussion on this approach for social networks and [90] for gene networks).

  • Numerous available methods are based on mutual information (MI). The MI measures the amount of information that one node contains about another—in other words, the point to which the TE contents of two nodes are redundant. Some implementations are available (e.g., R package Minet [91]; see also [90]) and could help in the first attempt at HTT network reconstruction.

  • Simulation-based methods can also face the challenge of deciphering the original HTT network among the huge quantity of possible networks. Indeed, approximate Bayesian computation methods (ABC [92]) can be applied in our context: it consists in simulating the TE dynamics—as presented here—on numerous candidate networks (possibly preselected by the previous methods, in order to restrict the space of possibility) and selecting the candidate for which the TE β-matrix is the closest to the original one (in the sense of the Mantel correlation).

  • Other popular methods based on probabilistic graphical models (for instance, the Markov random field, but not Bayesian networks, since HTT networks are not directed) could be adapted to our problem, as well as penalization techniques that allow for sparse reconstruction [93].

  • The HTT matrix reconstruction problem could also be formulated into a combinatorial optimization problem: knowing the presence/absence of each TE family in the genomes, the reconstruction eventually ends up as a multiweighted Steiner tree problem [94]; here, there is one weight function per TE family.

As perspective, the performance of the different methods could be evaluated by comparing their ability to reconstruct HTT networks, for example, according to the number of species, the number of TE families, the number of HTT events in the network, or whether or not known ecological relationships between species are introduced.

Once the network topology has been reconstructed (qualitative characterization of the connectivity between nodes), it will then be possible to weight the different components of the network (quantitative characterization of the network elements, i.e., the reservoir capacity of the different nodes and the intensity of the links) in order to maximize the efficiency of the HTT network to recover the similarity of the genomes in their TE composition. Network reconstruction drawn from the quantitative and qualitative approaches would permit major interactions and keystone species—i.e., those that are expected to largely impact the dynamics of HTTs because of their expected high TE reservoir abilities and/or because they constitute hubs and are thus expected to build many links with other species—to be pinpointed. The network should then stimulate further research to identify the peculiar properties of those species and their links (e.g., study of their virome, molecular, cellular, and physiological properties, and their ecological interactions).

HTT networks link ecology and genomic properties

In addition, HTT networks would provide powerful tools for testing formally the relative importance of certain identified ecological interactions, ecological isolation, or the involvement of certain categories of molecular vehicles (see S1 Text) in the dynamics of HTTs.

The reconstruction process can indeed introduce some a priori structural constraints, which are derived from ecological knowledge and capture key structural properties of the most complex and comprehensive food webs [83,84]. To illustrate simple cases, we can impose a bipartite network structure to capture host–parasite or prey–predator interactions (with potential constraints on the direction of HTT along links) or a modular network to capture geographically isolated species or groups of species living in ecosystems that are partially isolated (see Box 2). We illustrate that different topological networks can generate different dynamics of TE propagation (see S3 Fig), which as a result would ultimately affect the distribution of TE families in the genomes. It will then be possible to assess the ability of such networks to recover similarity or divergence of genomes in their TE composition and thus to infer the relative importance of ecological interactions on TE dynamics.

As a complementary example, network analysis could also be used to identify which molecular vehicles, among all potential molecular vehicles, could play a key role in HTTs (e.g., RNA versus DNA viruses; see S1 Text). Such an approach would consist in reconstructing the networks that underlie the transfer of molecular vehicles (by building the “molecular vehicle β-diversity matrix” from the similarity of species of the network in their composition in those vehicles) and testing the capacity of these networks to generate the similarity of genomes in their TE composition.

In summary, while the application of network theory in the context of HTT will require substantive methodological developments, the approach is bound to deliver powerful tools to unravel the complex mechanisms governing the dynamics of propagation of TEs in eukaryotic species.

Conclusion

Here we argue that the conceptual framework and methodological tools provided by network theory can shed new light on the process of HTT. Applying this approach is becoming increasingly feasible thanks to the affordability of genome sequencing and the exponential accumulation of genome sequence data for a wide range of organisms (outside of model species). This outpouring of genome sequence data, together with new analytical tools to systematically detect HTT [12,18,95], should soon enable the placement of a large number of HTT events across a dense network of species, as well as the assembly of an empirical HTT matrix. The increasing availability of public databases on relationships between eukaryotic species as well as their relationships with their microbial communities (e.g., DNA-based diet analysis, host–vector relationship, virome, and microbiome) will allow further exploration of the role of these factors in shaping the HTT network and ultimately the evolutionary dynamics of TEs.

The application of network theory in ecology has yielded profound new insights into the dynamics of communities and ecosystem processes from the properties of interacting organisms [96,97]. We argue that this approach can be adapted to provide a new conceptual framework and methodology to unravel the dynamics of TE movements between eukaryotic species, with TEs being virtually ubiquitous throughout the tree of life. The development of HTT networks will promote cross-disciplinary insights and the merging of concepts and knowledge borrowed from a vast array of biological areas, including ecology, genetics, genomics, cell biology, virology, bacteriology, and parasitology. Such an integrative approach will open up new avenues to perform and interpret large-scale analyses of genome composition resulting from HTT and, consequently, to better understand a pivotal process in the evolution of multicellular eukaryotes.

Supporting information

S1 Fig. Simulated β-matrix: Effect of the ratio between HTT rate and transposition rate on the distribution of Mantel correlation coefficient.

Same legend as in Fig 3 (main text). We used the same simulation settings as in Fig 3 except for the ratio between the HTT rate and the within-genome transposition, which is one per 1000 (panel A) versus one per 100 (this figure, panel B and Fig 3). The result is insensitive to that ratio because (i) the intra-genomic dynamic of TEs is always much faster than their inter-genomic dynamic and (ii) the criterion used to end the simulation remains the number of successful HTTs (here it equals 150).

(PDF)

S2 Fig. Simulated β-matrix: Effect of the number of successful HTTs on the distribution of Mantel correlation coefficient.

Same legend as in Fig 3 (main text). We used the same simulation settings as in Fig 3. We tested the effect of the number of HTTs on the distribution of Mantel correlation coefficient (n = 50, 100, and 150 for panels A, B, and C, respectively). We show similar trends for all the tested situations (an increasing level of perturbation in network connectivity leads to a decreased correlation between simulated β-matrices). However, the level of correlation increases with the number of successful HTTs within the network. This result means that the network reconstruction will be all the easier when the number of HTTs is high within the species group considered.

(PDF)

S3 Fig. Dynamics of propagation of a single TE family within contrasted networks.

Panel A shows different structures of networks in which the nodes (species) have the same average degree of connectivity: random (A1), scale-free (A2), bipartite (A3), and modular (A4) networks (see Box 3 for a detailed description). Panel C represents random networks with different link densities, with the extreme case corresponding to the complete network. Panels B and D describe the diversity of dynamic of propagation of a single TE family in the network. The x-axis represents the number of species infected by the TE. The y-axis corresponds to time (number of iterations) needed to contamination of x species (x-axis) in the network. The box-plots represent the distribution of the time required for the contamination of x species. The width of the box-plot represents the proportion of trajectories in which the TE has contaminated x species. The figure shows networks with distinct topologies (panel A and C) and the dynamics of spread of a unique family of TE in these networks (panel B and D). For each category of networks (panel A and C), 400 simulations were performed. At the beginning of each simulation, a single copy of TE is placed in one of the species of the network, all the 20 network species being tested (20 different initial conditions/modality were therefore tested; 20 repetitions were performed per modality). Panels A and B show that the network structure affects the distribution of the TE propagation dynamics. The scale-free and modular networks generate dynamics different from those obtained in the case of random networks. The scale-free networks generate a very large diversity of TE propagation speeds: TE can spread either very rapidly when hubs are quickly contaminated or very slowly when hubs are slow to be contaminated. In modular networks, trajectories leading to the contamination of a large number of species are rare (strong decrease in the proportion of trajectories with more than 12 contaminated species) because of the presence of partially isolated groups of species. The dynamics of TE propagation in bipartite networks (e.g., host-macroparasite type) seem very close to those obtained from random networks, and therefore further analyses will be needed to detect the impact of bipartite networks on TE propagation. For example, the similarity in TE composition between very distantly related species (e.g., bug and their hosts) in a bipartite network is expected to be much greater than the similarity expected between those species in a random network. Panels C and D display the dynamics of TE propagation in different types of random networks differing by their average degree of connectivity (equal to 2, 3, 4, and the complete network having a maximum link density). The results (Panel D) show that the increase of link density greatly increases the speed of TE propagation and reduces the variability of propagation speed between the simulated TE trajectories. All these results emphasize the importance of ecological network structure in the dynamics of propagation of a single TE family, which should significantly alter the TE composition/similarity of genomes.

(PDF)

S1 Text. Which molecular vehicles for HTT?

(DOCX)

S2 Text. Methods for modeling TE dynamic within the HTT network.

(DOCX)

Acknowledgments

This work was achieved using the computing facilities of the Computing Center of Laboratoire de Biométrie et Biologie Evolutive / Pôle Rhône-Alpes de Bioinformatique (CC LBBE/PRABI). It was performed within the framework of the Laboratoire d’excellence “Dynamiques éco-évolutives des maladies infectieurses” (LABEX ECOFECT, ANR‐ 11‐ LABX‐ 0048) of Université de Lyon, within the program “Investissements d’Avenir” (ANR‐ 11-IDEX‐ 0007) operated by the French National Research Agency (ANR). We would like to thank Sylvain Charlat and Neo Martinez for their valuable comments. This manuscript is dedicated to the memory of Christophe Terzian, whose open mind, expertise, and friendship were essential to this work.

Abbreviations

ABC

approximate Bayesian computation methods

HTT

horizontal transfer of transposable elements

LTR

long terminal repeat

MI

mutual information

SD

standard deviation

TE

transposable element

Data availability

Data are available from the Biometry and Evolutionary Biology laboratory public server (http://pbil.univ-lyon1.fr/datasets/venner2017/).

Funding Statement

This work was supported by a grant from the National Institutes of Health R01GM077582 (to C.F.)

Footnotes

Provenance: Not commissioned; externally peer reviewed

References

  • 1.Rocha EPC. With a little help from prokaryotes. Science. 2013;339(6124):1154–5. 10.1126/science.1234938 [DOI] [PubMed] [Google Scholar]
  • 2.Soucy SM, Huang J, Gogarten JP. Horizontal gene transfer: building the web of life. Nat Rev Genet. 2015;16(8):472–82. 10.1038/nrg3962 [DOI] [PubMed] [Google Scholar]
  • 3.Keeling PJ, Palmer JD. Horizontal gene transfer in eukaryotic evolution. Nat Rev Genet. 2008;9(8):605–18. 10.1038/nrg2386 [DOI] [PubMed] [Google Scholar]
  • 4.Roulin A, Piegu B, Fortune PM, Sabot F, D’Hont A, Manicacci D, et al. Whole genome surveys of rice, maize and sorghum reveal multiple horizontal transfers of the LTR-retrotransposon Route66 in Poaceae. BMC Evol Biol. 2009;9:58 10.1186/1471-2148-9-58 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Schaack S, Gilbert C, Feschotte C. Promiscuous DNA: horizontal transfer of transposable elements and why it matters for eukaryotic evolution. Trends Ecol Evol. 2010;25:537–46. 10.1016/j.tree.2010.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Dunning Hotopp JC. Horizontal gene transfer between bacteria and animals. Trends Genet. 2011;27(4):157–63. 10.1016/j.tig.2011.01.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Dupeyron M, Leclercq S, Cerveau N, Bouchon D, Gilbert C. Horizontal transfer of transposons between and within crustaceans and insects. Mob DNA. 2014;5(1):4 10.1186/1759-8753-5-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Baidouri M El, Carpentier MC, Cooke R, Gao D, Lasserre E, Llauro C, et al. Widespread and frequent horizontal transfers of transposable elements in plants. Genome Res. 2014;24(5):831–8. 10.1101/gr.164400.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Sarilar V, Bleykasten-Grosshans C, Neuvéglise C. Evolutionary dynamics of hAT DNA transposon families in Saccharomycetaceae. Genome Biol Evol. 2015;7(1):172–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Syvanen M. Evolutionary implications of horizontal gene transfer. Annu Rev Genet. 2012;46(1):341–58. [DOI] [PubMed] [Google Scholar]
  • 11.Boto L. Horizontal gene transfer in the acquisition of novel traits by metazoans. Proc R Soc London B Biol Sci. 2014;281:20132450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wallau GL, Ortiz MF, Loreto ELS. Horizontal transposon transfer in eukarya: detection, bias, and perspectives. Genome Biol Evol. 2012;4(8):689–99. 10.1093/gbe/evs055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Ivancevic AM, Walsh AM, Kortschak RD, Adelson DL. Jumping the fine LINE between species: horizontal transfer of transposable elements in animals catalyses genome evolution. BioEssays. 2013;35(12):1071–82. 10.1002/bies.201300072 [DOI] [PubMed] [Google Scholar]
  • 14.Feschotte C, Pritham EJ. DNA transposons and the evolution of eukaryotic genomes. Annu Rev Genet. 2007;41:331–68. 10.1146/annurev.genet.40.110405.090448 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Le Rouzic A, Boutin TS, Capy P. Long-term evolution of transposable elements. Proc Natl Acad Sci U S A. 2007;104(49):19375–80. 10.1073/pnas.0705238104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hartl DL, Lohe AR, Lozovskaya ER. Modern thoughts on an ancient marinere: function, evolution, regulation. Annu Rev Genet. 1997;31(1):337–58. [DOI] [PubMed] [Google Scholar]
  • 17.Ortiz MF, Wallau GL, Graichen DÂS, Loreto ELS. An evaluation of the ecological relationship between Drosophila species and their parasitoid wasps as an opportunity for horizontal transposon transfer. Mol Genet Genomics. 2015;290(1):67–78. 10.1007/s00438-014-0900-y [DOI] [PubMed] [Google Scholar]
  • 18.Modolo L, Picard F, Lerat E. A new genome-wide method to track horizontally transferred sequences: application to Drosophila. Genome Biol Evol. 2014;6(2):416–32. 10.1093/gbe/evu026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Drezen J-M, Gauthier J, Josse T, Bézier A, Herniou E, Huguet E. Foreign DNA acquisition by invertebrate genomes. J Invertebr Pathol. 2016; [DOI] [PubMed] [Google Scholar]
  • 20.Tang Z, Zhang H-H, Huang K, Zhang X-G, Han M-J, Zhang Z. Repeated horizontal transfers of four DNA transposons in invertebrates and bats. Mob DNA. 2015;6:3 10.1186/s13100-014-0033-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zhang H-H, Li G-Y, Xiong X-M, Han M-J, Zhang X-G, Dai F-Y. TRT, a vertebrate and protozoan Tc1-like transposon: current activity and horizontal transfer. Genome Biol Evol. 2016;8(9):2994–3005. 10.1093/gbe/evw213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Suh A, Witt CC, Menger J, Sadanandan KR, Podsiadlowski L, Gerth M, et al. Ancient horizontal transfers of retrotransposons between birds and ancestors of human pathogenic nematodes. Nat Commun. 2016;7:11396 10.1038/ncomms11396 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Wang X, Liu X. Close ecological relationship among species facilitated horizontal transfer of retrotransposons. BMC Evol Biol. 2016;16(1):201 10.1186/s12862-016-0767-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Sun T, Renner SS, Xu Y, Qin Y, Wu J, Sun G. Two hAT transposon genes were transferred from Brassicaceae to broomrapes and are actively expressed in some recipients. Sci Rep. 2016. July 25;6:30192 10.1038/srep30192 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lin X, Faridi N, Casola C. An ancient trans-kingdom horizontal transfer of Penelope -like retroelements from arthropods to conifers. Genome Biol Evol. 2016;8(4):evw076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Biémont C, Vieira C. Junk DNA as an evolutionary force. Nature. 2006;443(5):521–4. [DOI] [PubMed] [Google Scholar]
  • 27.Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, et al. The B73 maize genome: complexity, diversity, and dynamics. Science. 2009;326(5956):1112–5. 10.1126/science.1178534 [DOI] [PubMed] [Google Scholar]
  • 28.Baucom RS, Estill JC, Chaparro C, Upshaw N, Jogi A, Deragon JM, et al. Exceptional diversity, non-random distribution, and rapid evolution of retroelements in the B73 maize genome. PLoS Genet. 2009;5(11). e1000732 10.1371/journal.pgen.1000732 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Doolittle WF, Sapienza C. Selfish genes, the phenotype paradigm and genome evolution. Nature. 1980;284:601–3. [DOI] [PubMed] [Google Scholar]
  • 30.Orgel LE, Crick FHC. Selfish DNA: the ultimate parasite. Nature. 1980;284:604–7. [DOI] [PubMed] [Google Scholar]
  • 31.Biémont C. A brief history of the status of transposable elements: From junk DNA to major players in evolution. Genetics. 2010;186(4):1085–93. 10.1534/genetics.110.124180 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Oliver KR, Greene WK. Transposable elements: powerful facilitators of evolution. BioEssays. 2009;31(7):703–14. 10.1002/bies.200800219 [DOI] [PubMed] [Google Scholar]
  • 33.Hua-Van A, Le Rouzic A, Boutin TS, Filée J, Capy P. The struggle for life of the genome’s selfish architects. Biol Direct. 2011;6(1):19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Cordaux R, Batzer MA. The impact of retrotransposons on human genome evolution. Nat Rev Genet. 2009;10(10):691–703. 10.1038/nrg2640 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Chuong EB, Elde NC, Feschotte C. Regulatory activities of transposable elements: from conflicts to benefits. Nat Rev Genet. 2017; 18(2):71–86. 10.1038/nrg.2016.139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Brookfield JFY. The ecology of the genome—mobile DNA elements and their hosts. Nat Rev Genet. 2005;6(2):128–36. 10.1038/nrg1524 [DOI] [PubMed] [Google Scholar]
  • 37.Charlesworth B, Langley CH. The evolution of self-regulated transposition of transposable elements. Genetics. 1986;112(2):359–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Brookfield JFY, Badge RM. Population genetics models of transposable elements In: Evolution and Impact of Transposable Elements. Springer; 1997. p. 281–94. [PubMed] [Google Scholar]
  • 39.Lohe AR, Hartl DL. Autoregulation of mariner transposase activity by overproduction and dominant-negative complementation. Mol Biol Evol. 1996;13(4):549–55. [DOI] [PubMed] [Google Scholar]
  • 40.Le Rouzic A, Capy P. Population genetics models of competition between transposable element subfamilies. Genetics. 2006;174(2):785–93. 10.1534/genetics.105.052241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Robillard É, Le Rouzic A, Zhang Z, Capy P, Hua-Van A. Experimental evolution reveals hyperparasitic interactions among transposable elements. Proc Natl Acad Sci. 2016;113(51):14763–8. 10.1073/pnas.1524143113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Loreto ELS, Carareto CMA, Capy P. Revisiting horizontal transfer of transposable elements in Drosophila. Heredity. 2008;100(6):545–54. 10.1038/sj.hdy.6801094 [DOI] [PubMed] [Google Scholar]
  • 43.Pélisson A, Song SU, Prud’homme N, Smith PA, Bucheton A, Corces VG. Gypsy transposition correlates with the production of a retroviral envelope-like protein under the tissue-specific control of the Drosophila flamenco gene. EMBO J. 1994;13(18):4401–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Malik HS, Henikoff S, Eickbush TH. Poised for contagion: Evolutionary origins of the infectious abilities of invertebrate retroviruses. Genome Res. 2000;10(9):1307–18. [DOI] [PubMed] [Google Scholar]
  • 45.Gilbert C, Schaack S, Pace JK, Brindley PJ, Feschotte C. A role for host-parasite interactions in the horizontal transfer of transposons across phyla. Nature. 2010;464(7293):1347–50. 10.1038/nature08939 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Gilbert C, Chateigner A, Ernenwein L, Barbe V, Bézier A, Herniou EA, et al. Population genomics supports baculoviruses as vectors of horizontal transfer of insect transposons. Nat Commun. 2014;5:3348 10.1038/ncomms4348 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Walsh AM, Kortschak RD, Gardner MG, Bertozzi T, Adelson DL. Widespread horizontal transfer of retrotransposons. Proc Natl Acad Sci U S A. 2013;110(3):1012–6. 10.1073/pnas.1205856110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Panaud O. Horizontal transfers of transposable elements in eukaryotes: The flying genes. C R Biol. 2016;339(7–8):296–9. 10.1016/j.crvi.2016.04.013 [DOI] [PubMed] [Google Scholar]
  • 49.Heleno R, Garcia C, Jordano P, Traveset A, Gómez JM, Blüthgen N, et al. Ecological networks: delving into the architecture of biodiversity. Biol Lett. 2014;10(1):20131000 10.1098/rsbl.2013.1000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Gonzalez-Gaitan M, Roux A. When cell biology meets theory. J Cell Biol. 2015;210(7):1041–5. 10.1083/jcb.201504025 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Espagne E, Dupuy C, Huguet E, Cattolico L, Provost B, Martins N, et al. Genome sequence of a polydnavirus: insights into symbiotic virus evolution. Science. 2004;306(5694):286–9. 10.1126/science.1103066 [DOI] [PubMed] [Google Scholar]
  • 52.Szitenberg A, Cha S, Opperman CH, Bird DM, Blaxter ML, Lunt DH. Genetic drift, not life history or RNAi, determine long-term evolution of transposable elements. Genome Biol Evol. 2016;8(9):2964–78. 10.1093/gbe/evw208 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Daubin V, Szöllősi GJ. Horizontal gene transfer and the history of life. Cold Spring Harb Perspect Biol. Cold Spring Harbor Lab; 2016;8(4):a018036 10.1101/cshperspect.a018036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Desjardins CA, Gundersen-Rindal DE, Hostetler JB, Tallon LJ, Fadrosh DW, Fuester RW, et al. Comparative genomics of mutualistic viruses of Glyptapanteles parasitic wasps. Genome Biol. 2008;9(12):R183 10.1186/gb-2008-9-12-r183 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Marquez CP, Pritham EJ. Phantom, a new subclass of Mutator DNA transposons found in insect viruses and widely distributed in animals. Genetics. 2010;185(4):1507–17. 10.1534/genetics.110.116673 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Piskurek O, Okada N. Poxviruses as possible vectors for horizontal transfer of retroposons from reptiles to mammals. Proc Natl Acad Sci U S A. 2007;104(29):12046–51. 10.1073/pnas.0700531104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Thomas J, Schaack S, Pritham EJ. Pervasive horizontal transfer of rolling-circle transposons among animals. Genome Biol Evol. 2010;2(1):656–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Coates BS. Horizontal transfer of a non-autonomous Helitron among insect and viral genomes. BMC Genomics. 2015;16(1):1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Zhang HH, Feschotte C, Han MJ, Zhang Z. Recurrent horizontal transfers of Chapaev transposons in diverse invertebrate and vertebrate animals. Genome Biol Evol. 2014;6(6):1375–86. 10.1093/gbe/evu112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Gilbert C, Peccoud J, Chateigner A, Moumen B, Cordaux R, Herniou EA. Continuous influx of genetic material from host to virus populations. PLoS Genet. 2016;12(2):e1005838 10.1371/journal.pgen.1005838 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Routh A, Domitrovic T, Johnson JE. Host RNAs, including transposons, are encapsidated by a eukaryotic single-stranded RNA virus. Proc Natl Acad Sci U S A. 2012;109(6):1907–12. 10.1073/pnas.1116168109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Kim A, Terzian C, Santamaria P, Pelisson A, Purd’homme N, Bucheton A. Retroviruses in invertebrates: the gypsy retrotransposon is apparently an infectious retrovirus of Drosophila melanogaster. Proc Natl Acad Sci U S A. 1994;91(4):1285–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Gilbert C, Hernandez SS, Flores-Benabib J, Smith EN, Feschotte C. Rampant horizontal transfer of SPIN transposons in squamate reptiles. Mol Biol Evol. 2012;29(2):503–15. 10.1093/molbev/msr181 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Filée J, Rouault J-D, Harry M, Hua-Van A. Mariner transposons are sailing in the genome of the blood-sucking bug Rhodnius prolixus. BMC Genomics. 2015;16(1):1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Wijayawardena BK, Minchella DJ, DeWoody JA. Hosts, parasites, and horizontal gene transfer. Trends Parasitol. 2013;29(7):329–38. 10.1016/j.pt.2013.05.001 [DOI] [PubMed] [Google Scholar]
  • 66.Venner S, Feschotte C, Biémont C. Dynamics of transposable elements: towards a community ecology of the genome. Trends Genet. 2009;25(7):317–323. 10.1016/j.tig.2009.05.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Groth SB, Blumenstiel JP. Horizontal transfer can drive a greater transposable element load in large populations. J Hered. 2017;108(1):36–44. 10.1093/jhered/esw050 [DOI] [PubMed] [Google Scholar]
  • 68.Mira A, Ochman H, Moran NA. Deletional bias and the evolution of bacterial genomes. Trends Genet. 2001;17(10):589–96. [DOI] [PubMed] [Google Scholar]
  • 69.Yoshiyama M, Tu Z, Kainoh Y, Honda H, Shono T, Kimura K. Possible horizontal transfer of a transposable element from host to parasitoid. Mol Biol Evol. 2001;18(10):1952–8. [DOI] [PubMed] [Google Scholar]
  • 70.Herniou EA, Huguet E, Thézé J, Bézier A, Periquet G, Drezen J-M. When parasitic wasps hijacked viruses: genomic and functional evolution of polydnaviruses. Philos Trans R Soc Lond B Biol Sci. 2013;12(1626):20130051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Rizki RM, Rizki TM. Parasitoid virus-like particles destroy Drosophila cellular immunity. Proc Natl Acad Sci U S A. 1990;87(21):8388–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Pace JK, Gilbert C, Clark MS, Feschotte C. Repeated horizontal transfer of a DNA transposon in mammals and other tetrapods. Proc Natl Acad Sci U S A. 2008;105(44):17023–8. 10.1073/pnas.0806548105 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Pagan HJT, Smith JD, Hubley RM, Ray DA. PiggyBac-ing on a primate Genome: novel elements, recent activity and horizontal transfer. Genome Biol Evol. 2010;2:293–303. 10.1093/gbe/evq021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Thomas J, Sorourian M, Ray D, Baker RJ, Pritham EJ. The limited distribution of Helitrons to vesper bats supports horizontal transfer. Gene. 2011;474(1):52–8. [DOI] [PubMed] [Google Scholar]
  • 75.Zhuo X, Feschotte C. Cross-species transmission and differential fate of an endogenous retrovirus in three mammal lineages. PLoS Pathog. 2015;11(11):e1005279 10.1371/journal.ppat.1005279 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Newman M. Networks: an introduction. Oxford university press; 2010. [Google Scholar]
  • 77.Dunne JA, Williams RJ, Martinez ND. Food-web structure and network theory: the role of connectance and size. Proc Natl Acad Sci U S A. 2002;99(20):12917–22. 10.1073/pnas.192407699 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Calisher CH, Childs JE, Field HE, Holmes K V, Schountz T. Bats: important reservoir hosts of emerging viruses. Clin Microbiol Rev. 2006;19(3):531–45. 10.1128/CMR.00017-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Ray DA, Feschotte C, Pagan HJT, Smith JD, Pritham EJ, Arensburger P, et al. Multiple waves of recent DNA transposon activity in the bat, Myotis lucifugus. Genome Res. 2008;18:717–28. 10.1101/gr.071886.107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Legendre P, Legendre LFJ. Numerical ecology. Elsevier; 2012. [Google Scholar]
  • 81.Whittaker RH. Vegetation of the Siskiyou mountains, Oregon and California. Ecol Monogr. 1960;30(3):279–338. [Google Scholar]
  • 82.Su R-Q, Lai Y-C, Wang X, Do Y. Uncovering hidden nodes in complex networks in the presence of noise. Sci Rep. 2014;4:3944 10.1038/srep03944 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Williams RJ, Martinez ND. Simple rules yield complex food webs. Nature. 2000;404(6774):180–3. 10.1038/35004572 [DOI] [PubMed] [Google Scholar]
  • 84.Williams RJ, Martinez ND. Success and its limits among structural models of complex food webs. J Anim Ecol. 2008;77(3):512–9. 10.1111/j.1365-2656.2008.01362.x [DOI] [PubMed] [Google Scholar]
  • 85.Sporns O. The human connectome: origins and challenges. Neuroimage. 2013;80:53–61. 10.1016/j.neuroimage.2013.03.023 [DOI] [PubMed] [Google Scholar]
  • 86.Marbach D, Costello JC, Küffner R, Vega NM, Prill RJ, Camacho DM, et al. Wisdom of crowds for robust gene network inference. Nat Methods. 2012;9(8):796–804. 10.1038/nmeth.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.García-Robledo C, Erickson DL, Staines CL, Erwin TL, Kress WJ. Tropical plant–herbivore networks: reconstructing species interactions using DNA barcodes. PLoS ONE. 2013;8(1):e52967 10.1371/journal.pone.0052967 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Guimerà R, Sales-Pardo M. Missing and spurious interactions and the reconstruction of complex networks. Proc Natl Acad Sci U S A. 2009;106(52):22073–8. 10.1073/pnas.0908366106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Liben‐Nowell D, Kleinberg J. The link‐prediction problem for social networks. J Am Soc Inf Sci Technol. 2007;58(7):1019–31. [Google Scholar]
  • 90.Wang YXR, Huang H. Review on statistical methods for gene network reconstruction using expression data. J Theor Biol. 2014;362:53–61. 10.1016/j.jtbi.2014.03.040 [DOI] [PubMed] [Google Scholar]
  • 91.Meyer PE, Lafitte F, Bontempi G. Minet: an open source R/Bioconductor package for mutual information based network inference. BMC Bioinformatics. 2008;9:461 10.1186/1471-2105-9-461 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Marjoram P. Approximation Bayesian Computation. OA Genet. 2013;1(3):853 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Friedman J, Hastie T, Tibshirani R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics. 2008;9(3):432–41. 10.1093/biostatistics/kxm045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Duin C, Volgenant T. The multi-weighted Steiner tree problem. Ann Oper Res. 1991;33(6):451–69. [Google Scholar]
  • 95.Wallau GL, Capy P, Loreto E, Le Rouzic A, Hua-Van A. VHICA, a new method to discriminate between vertical and horizontal transposon transfer: application to the mariner family within Drosophila. Mol Biol Evol. 2016;33(4):1094–109. 10.1093/molbev/msv341 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Proulx SR, Promislow DEL, Phillips PC. Network thinking in ecology and evolution. Trends Ecol Evol. 2005;20(6):345–53. 10.1016/j.tree.2005.04.004 [DOI] [PubMed] [Google Scholar]
  • 97.Thompson RM, Brose U, Dunne JA, Hall RO, Hladyz S, Kitching RL, et al. Food webs: reconciling the structure and function of biodiversity. Trends Ecol Evol. 2012;27(12):689–97. 10.1016/j.tree.2012.08.005 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. Simulated β-matrix: Effect of the ratio between HTT rate and transposition rate on the distribution of Mantel correlation coefficient.

Same legend as in Fig 3 (main text). We used the same simulation settings as in Fig 3 except for the ratio between the HTT rate and the within-genome transposition, which is one per 1000 (panel A) versus one per 100 (this figure, panel B and Fig 3). The result is insensitive to that ratio because (i) the intra-genomic dynamic of TEs is always much faster than their inter-genomic dynamic and (ii) the criterion used to end the simulation remains the number of successful HTTs (here it equals 150).

(PDF)

S2 Fig. Simulated β-matrix: Effect of the number of successful HTTs on the distribution of Mantel correlation coefficient.

Same legend as in Fig 3 (main text). We used the same simulation settings as in Fig 3. We tested the effect of the number of HTTs on the distribution of Mantel correlation coefficient (n = 50, 100, and 150 for panels A, B, and C, respectively). We show similar trends for all the tested situations (an increasing level of perturbation in network connectivity leads to a decreased correlation between simulated β-matrices). However, the level of correlation increases with the number of successful HTTs within the network. This result means that the network reconstruction will be all the easier when the number of HTTs is high within the species group considered.

(PDF)

S3 Fig. Dynamics of propagation of a single TE family within contrasted networks.

Panel A shows different structures of networks in which the nodes (species) have the same average degree of connectivity: random (A1), scale-free (A2), bipartite (A3), and modular (A4) networks (see Box 3 for a detailed description). Panel C represents random networks with different link densities, with the extreme case corresponding to the complete network. Panels B and D describe the diversity of dynamic of propagation of a single TE family in the network. The x-axis represents the number of species infected by the TE. The y-axis corresponds to time (number of iterations) needed to contamination of x species (x-axis) in the network. The box-plots represent the distribution of the time required for the contamination of x species. The width of the box-plot represents the proportion of trajectories in which the TE has contaminated x species. The figure shows networks with distinct topologies (panel A and C) and the dynamics of spread of a unique family of TE in these networks (panel B and D). For each category of networks (panel A and C), 400 simulations were performed. At the beginning of each simulation, a single copy of TE is placed in one of the species of the network, all the 20 network species being tested (20 different initial conditions/modality were therefore tested; 20 repetitions were performed per modality). Panels A and B show that the network structure affects the distribution of the TE propagation dynamics. The scale-free and modular networks generate dynamics different from those obtained in the case of random networks. The scale-free networks generate a very large diversity of TE propagation speeds: TE can spread either very rapidly when hubs are quickly contaminated or very slowly when hubs are slow to be contaminated. In modular networks, trajectories leading to the contamination of a large number of species are rare (strong decrease in the proportion of trajectories with more than 12 contaminated species) because of the presence of partially isolated groups of species. The dynamics of TE propagation in bipartite networks (e.g., host-macroparasite type) seem very close to those obtained from random networks, and therefore further analyses will be needed to detect the impact of bipartite networks on TE propagation. For example, the similarity in TE composition between very distantly related species (e.g., bug and their hosts) in a bipartite network is expected to be much greater than the similarity expected between those species in a random network. Panels C and D display the dynamics of TE propagation in different types of random networks differing by their average degree of connectivity (equal to 2, 3, 4, and the complete network having a maximum link density). The results (Panel D) show that the increase of link density greatly increases the speed of TE propagation and reduces the variability of propagation speed between the simulated TE trajectories. All these results emphasize the importance of ecological network structure in the dynamics of propagation of a single TE family, which should significantly alter the TE composition/similarity of genomes.

(PDF)

S1 Text. Which molecular vehicles for HTT?

(DOCX)

S2 Text. Methods for modeling TE dynamic within the HTT network.

(DOCX)

Data Availability Statement

Data are available from the Biometry and Evolutionary Biology laboratory public server (http://pbil.univ-lyon1.fr/datasets/venner2017/).


Articles from PLoS Biology are provided here courtesy of PLOS

RESOURCES