Skip to main content
Virus Evolution logoLink to Virus Evolution
. 2015 Jun 1;1(1):vev004. doi: 10.1093/ve/vev004

The effect of ecosystem biodiversity on virus genetic diversity depends on virus species: A study of chiltepin-infecting begomoviruses in Mexico

Manuel Rodelo-Urrego 1, Fernando García-Arenal 1,, Israel Pagán 1,*
PMCID: PMC5014474  PMID: 27774278

Abstract

Current declines in biodiversity put at risk ecosystem services that are fundamental for human welfare. Increasing evidence indicates that one such service is the ability to reduce virus emergence. It has been proposed that the reduction of virus emergence occurs at two levels: through a reduction of virus prevalence/transmission and, as a result of these epidemiological changes, through a limitation of virus genetic diversity. Although the former mechanism has been studied in a few host-virus interactions, very little is known about the association between ecosystem biodiversity and virus genetic diversity. To address this subject, we estimated genetic diversity, synonymous and non-synonymous nucleotide substitution rates, selection pressures, and frequency of recombinants and re-assortants in populations of Pepper golden mosaic virus (PepGMV) and Pepper huasteco yellow vein virus (PHYVV) that infect chiltepin plants in Mexico. We then analyzed how these parameters varied according to the level of habitat anthropization, which is the major cause of biodiversity loss. Our results indicated that genetic diversity of PepGMV (but not of PHYVV) populations increased with the loss of biodiversity at higher levels of habitat anthropization. This was mostly the consequence of higher rates of synonymous nucleotide substitutions, rather than of adaptive selection. The frequency of recombinants and re-assortants was higher in PepGMV populations infecting wild chiltepin than in those infecting cultivated ones, suggesting that genetic exchange is not the main mechanism for generating genetic diversity in PepGMV populations. These findings provide evidence that biodiversity may modulate the genetic diversity of plant viruses, but it may differentially affect even two closely related viruses. Our analyses may contribute to understanding the factors involved in virus emergence.

Keywords: begomoviruses, Capsicum annuum glabriusculum, biodiversity, population genetic diversity, plant–virus interactions, recombination

1 Introduction

One of the defining characteristics of viruses is their high capacity to generate genetic diversity (Holmes 2009). This capacity is directly linked to the continuous appearance of drug-resistant strains and of new viruses that overcome the host immune system or the genetic resistance of crops (García-Arenal and McDonald 2003; Holmes 2009; Pennings 2012; Elena et al. 2014). Indeed, around 150 viral diseases have emerged in humans, plants, and wildlife in the last 60 years at accelerating rates (Dobson and Foufopoulos 2001; Anderson et al. 2004; Jones et al. 2008). A fundamental step to reduce the number of emergent virus diseases, and a long-standing goal for evolutionary virologists, is to understand which factors modulate the genetic diversity of virus populations. Because, by definition, infectious diseases require the interaction between various species, at least the host and the parasite, ecosystem biodiversity may be one of such factors. Indeed, theoretical elaborations hypothesize that biodiversity may determine the genetic diversity of parasite populations, including viruses, through changes in the parasite’s epidemiology. According to this hypothesis, the loss of biodiversity would increase host abundance and density, which in turn would lead to higher transmission rates and a higher prevalence of parasites (Keesing, Holt, and Ostfeld 2006; Burdon and Thrall 2008; Keesing et al. 2010; Ostfeld and Keesing 2012). As a consequence of these epidemiological changes, parasite populations would increase in size, accelerating their evolutionary rates (Scholle et al. 2013; Lanfear, Kokko, and Eyre-Walker 2014), which could ultimately lead to higher genetic diversity.

In the past decade, increasing evidence has linked the loss of ecosystem biodiversity to higher transmission rates and prevalence of viruses and other parasites (reviewed in Keesing et al. 2010; Ostfeld and Keesing 2012; Alexander et al. 2014). However, our understanding on how the loss of biodiversity modulates the genetic diversity of virus populations is scarce and relies mostly on indirect evidence (Holmes 2009; Scholle et al. 2013; Lanfear, Kokko, and Eyre-Walker 2014). This is especially so in the case of plant viruses (Alexander et al. 2014), for which most of the existing evidence comes from analyses of plant-virus interactions under different levels of ecosystem anthropization, which is also the major cause of biodiversity loss (Jones 2009; Keesing et al. 2010). At odds with the hypothesis above, comparative genomics analyses of geminivirus species showed a reduction of virus genetic diversity and of recombination rates with increased ecosystem anthropization (Silva et al. 2011, 2012; Lima et al. 2013). On the other hand, inferences of divergence times for several groups of plant viruses have traced their radiation to the origin or the intensification of agriculture (Fargette et al. 2008; Duffy and Holmes 2008; Gibbs et al. 2008; Pagán and Holmes 2010, but see Lefeuvre et al. 2010; Yasaka et al. 2014). These works proposed that such bursts of genetic diversity could have co-occurred with the intrinsic loss of biodiversity characteristic of agroecosystems. Altogether, these results suggest that the effect of ecosystem anthropization, and the concurrent biodiversity loss, on the genetic diversity of plant viruses may depend on the relative importance of mutation and recombination. These apparently contradictory results derive from analyses that combine viral sequences obtained from different hosts, and from different geographic locations. Hence, it is difficult to disentangle whether the reported changes in plant virus genetic diversity are indeed due to ecosystem biodiversity or the result of host and/or local adaptation processes. As a consequence, it is still unclear whether and how habitat anthropization and biodiversity loss affects the genetic diversity of plant virus populations.

In this work, we analyzed the effect of ecosystem biodiversity on the genetic diversity of virus populations using sequence data of two plant begomovirus species that infect wild pepper or ‘chiltepin’, Capsicum annuum var. glabriusculum (Dunal) Heiser and Pickersgill, in Mexico. To avoid the limitations of previous works, virus isolates were collected from populations of the same host (chiltepin), within the natural distribution area of the host in Mexico, and from habitats under three levels of anthropization: populations where human intervention is limited to occasional harvesting of chiltepin fruits were considered as wild (Votava, Nabham, and Bosland 2002). Chiltepin populations either tolerated or protected within anthropic habitats were considered as an intermediate level of anthropization (i.e., ‘let-standing’ plants sensu Casas et al. (2007)). Cultivated populations, where chiltepin is under incipient domestication, were considered as the highest level of anthropization. Importantly, we have recently reported that increasing level of anthropization of chiltepin populations is associated with lower biodiversity (plant species richness of the habitat and host species genetic diversity) and with higher host plant density. Also, we showed that biodiversity is the main predictor of virus infection risk (Pagán et al. 2012). Hence, the currently coexisting three levels of anthropization and the association between level of anthropization, biodiversity, and risk of virus infection make chiltepin a unique host to analyze the effect of biodiversity in the genetic diversity of plant virus populations.

In the Mexican chiltepin populations, the most prevalent plant viruses are two species of the genus Begomovirus (Family Geminiviridae): Pepper golden mosaic virus (PepGMV) and Pepper huasteco yellow vein virus (PHYVV) (Rodelo-Urrego et al. 2013). Both viruses have a two-segmented genome (DNA-A and DNA-B) of circular single-stranded DNA. DNA-A encodes viral proteins required for replication, control of gene expression, overcoming of host defenses, and encapsidation (replication protein, Rep; replication enhancer, REn; transcriptional activator protein, TrAP; coat protein, CP), whereas DNA-B encodes two proteins involved in intra- and inter-cellular movement (nuclear shuttle protein, NSP; movement protein, MP) (Torres-Pacheco et al. 1993, 1996). PepGMV and PHYVV are horizontally transmitted by the whitefly Bemisia tabaci Gennadius (Homoptera, Aleyrodidae) in a persistent manner and are not seed transmitted (Rojas et al. 2005; Navas-Castillo, Fiallo-Olive, and Sanchez-Campos 2011). These viruses were first described in Mexico in the early 1990s infecting pepper crops, and available data strongly suggest that they have a narrow natural host range, as they have been reported infecting only Capsicum and some related genera within the Solanaceae (Brown and Poulos 1990; Garzón-Tiznado et al. 1993; Torres-Pacheco et al. 1996; Navas-Castillo, Fiallo-Olive, and Sanchez-Campos 2011). We have previously shown that the higher PepGMV and PHYVV prevalence in chiltepin populations at increasing levels of habitat anthropization is explained by biodiversity loss (Pagán et al. 2012; Rodelo-Urrego et al. 2013). Our data also suggested that higher virus prevalence could be associated with increased transmission rates, as virus infection was more frequent in environmental conditions that favored the presence of the vector B. tabaci (Rodelo-Urrego et al. 2013). This observation is compatible with higher prevalence and transmission rate at decreasing biodiversity, i.e., those epidemiological changes predicted to affect virus genetic diversity, occurring in chiltepin-infecting PepGMV and PHYVV populations.

We obtained the full-length sequences of eighty-nine PepGMV and PHYVV isolates from host populations with different levels of biodiversity/ anthropization. We estimated the synonymous and non-synonymous substitution rates and the selection pressures in the corresponding virus populations. Because both viruses belong to the family Geminiviridae, for which recombination is thought to play a key role in generating genetic diversity, we also determined the frequency of recombinants (and re-assortment of genomic segments) rates in PepGMV and PHYVV populations. Using these data, we explored the role of biodiversity, plant density, and virus prevalence in the intra-specific genetic diversity of virus populations. Our results indicate that genetic diversity of PepGMV (but not of PHYVV) populations increases with the loss of biodiversity at higher levels of habitat anthropization. This is primarily due to a higher rate of synonymous nucleotide substitutions, rather than to mutations that lead to amino acid changes. Interestingly, the frequency of recombinants and re-assortants was higher in PepGMV populations infecting wild chiltepin than in those infecting cultivated ones, which suggests that genetic exchange plays a secondary role in generating genetic diversity of this virus. Hence, our results provide evidence that biodiversity may be relevant in determining the genetic diversity of plant viruses, as is the case of PepGMV, but that this finding cannot be extended even to closely related viruses such as PHYVV.

2 Materials and methods

2.1 Collection of PepGMV and PHYVV isolates and sequencing of full-length genomic DNAs

Field surveys are described in Pagán et al. (2012), and detection of PepGMV- and PHYVV-infected plants was as described in Rodelo-Urrego et al. (2013). Data on species richness, chiltepin genetic diversity, and plant density, and PepGMV and PHYVV prevalence in the surveyed chiltepin populations are available in Pagán et al. (2012) and Rodelo-Urrego et al. (2013). Plants positive for PepGMV and PHYVV infection were enriched in circular DNAs with TempliPhi® kit (GE Healthcare Life Sciences, Chalfont St Giles, UK), following manufacturer’s protocol. Utilizing the product of this reaction as template, the complete nucleotide sequence of both genomic DNAs was obtained using different sets of primers. Primers were designed to produce five fragments in such a way that adjacent fragments overlapped in at least 120 nt. Primer sequences and their locations in the genome of a reference strain are summarized in Supplementary Table S1. Overlapping regions of adjacent fragments presented 99–100 per cent nucleotide identity. Thus, the complete nucleotide sequence of each genomic DNA was determined by assembling the five fragments using Lasergene’s SeqMan Pro (DNASTAR, Madison, WI). To avoid artifactual recombination profiles and sequencing of low frequency variants non-representative of the virus population, no cloning step was performed prior to genome sequencing.

Sequencing resulted in two datasets containing the complete genomic sequence of forty-seven PepGMV isolates (nineteen from wild, ten from let-standing, and eighteen from cultivated habitats) and forty-two PHYVV isolates (nineteen from wild, eleven from let-standing, and twelve from cultivated habitats). Sequences were aligned beginning at the nicking site in the invariant non-anucleotide located at the origin of replication. In all cases, alignments were constructed using MUSCLE 3.7 (Edgar 2004) and adjusted manually according to the amino acid sequences using Se-Al (Rambaut 1996). The locations and years of sampling are presented in Fig. 1 and Supplementary Table S2.

Figure 1.

Figure 1.

Location of sampled chiltepin populations. Map shows the location of populations from wild (W), let-standing (L), and cultivated (C) populations within six biogeographical provinces in Mexico. Number of PepGMV and PHYVV sequences from each location is shown between parentheses. Full names of each location can be found in Supplementary Table S2.

2.2 Association analyses

Association between habitat and PepGMV and PHYVV phylogenies was assessed using Parsimony Score, Association Index, and Monophyletic Clade Size (MC) statistics (Parker, Rambaut, and Pybus 2008). Parsimony Score and Association Index statistics indicate the phylogenetic clustering of viruses that come from the same habitat across entire trees. The MC statistic assesses the association between habitat and virus phylogeny by estimating the size of the largest cluster of sequences coming from the same habitat. These analyses were performed using BaTS (Parker, Rambaut, and Pybus 2008). Real-tree distributions were obtained using BEAST v1.7.5 (Drummond and Rambaut 2007). Null distributions for the three statistics were obtained from 1,000 data replications.

2.3 Estimation of genetic and haplotype diversity, and of selection pressures

Genetic diversity (π) was estimated for each genomic DNA molecule and for each gene, as average pairwise nucleotide differences, using the Tamura-Nei nucleotide substitution model as implemented in MEGA 5.2.2 (Tamura et al. 2011). Standard errors (SE) of each measure were based on 1,000 replicates bootstrap. Haplotype diversity indexes were calculated in DnaSP v.5 (Rozas et al. 2003).

Selection pressures for each gene were measured as the averaged site-specific ratio between the mean number of non-synonymous (dN) and synonymous (dS) nucleotide substitutions per site (dN/dS) using the single-likelihood ancestor counting, the fixed effect likelihood, and the random effects likelihood methods implemented in the HyPhy package (Kosakovsky-Pond and Frost 2005). Because the three methods led to the same conclusions, only the single-likelihood ancestor counting results are shown. In all cases, dN/dS estimates were based on input neighbor-joining trees inferred using the MG94 nucleotide substitution model. Individual values of dN and dS were also obtained. To test for the robustness of these estimates against the effects of recombination, we estimated selection pressures in sequence datasets where (1) recombinant sequences and (2) recombinant fragments were removed. Analyses in these two types of sequence datasets led to similar conclusions than using all the sequences.

Because the number of sequences used to obtain estimates of genetic diversity and selection pressures was generally small, we tested the robustness of our estimates against effect of a single sequence performing a jackknife analysis. Since real and jackknifed estimates did not significantly differ in any case, for simplicity only estimates of real datasets are presented in the text.

2.4 Detection of recombination and pseudorecombination

We determined the occurrence of recombination within and between PepGMV and PHYVV populations for each genomic DNA molecule. To do so, we utilized forty-seven PepGMV and forty-two PHYVV fully sequenced isolates. Recombination breakpoints were detected using four methods, based on different assumptions (Posada 2002), and available in RDP4 (http://darwin.uvigo.es/rdp/rdp.html): RDP, BOOTSCAN/RECSCAN, Siscan, and Chimaera, with default parameters (Martin et al. 2010). Statistical significance was obtained using a Bonferroni-corrected cutoff of P ≤ 0.05. To minimize false positives, only recombination signals detected by the four methods (P < 0.05) were accounted as true events (RDP project files are available upon request). Two recombinant sequences were considered to belong to the same recombination profile when: (1) they shared the same major and minor parentals and (2) a recombination breakpoint in one sequence was <50 nt apart (window size in the recombination detection analysis) of the equivalent recombination breakpoint in the other sequence. The frequency of recombination profiles was used for statistical analyses.

The frequency of pseudorecombination in PepGMV and PHYVV populations was explored by analyzing the topological congruence between the maximum likelihood phylogenies of DNA-A and DNA-B of each virus. We used the CopyCat program (Meier-Kolthoff et al. 2007), which incorporates a wrapper for the program ParaFit (Legendre, Desdevises, and Bazin 2002). ParaFit is a distance-based approach that assesses the fit between DNA-A and DNA-B phylogenetic distance matrices by transforming them into principal coordinates. Statistical significance was evaluated by performing 9,999 permutations to test the null hypothesis of no congruence between DNA-A and DNA-B trees. The same analyses were performed using the program Jane 4 (http://www.cs.hmc.edu/∼hadas/jane/), which uses a topology-based method (Conow et al. 2010), with comparable results (available upon request). Pseudorecombination was also analyzed by concatenating the sequences of the DNA-A and DNA-B of each virus isolate and scanning these sequences for recombination breakpoints near the artificial joint between the two genomic DNAs as described above.

2.5 Statistical analyses

Differences in genetic diversity, synonymous and non-synonymous substitution rates, and selection pressures were calculated using parametric (general linear models) and non-parametric (permutation tests) analyses. Since both approaches led to similar conclusions, for simplicity, only general linear model analyses are shown. Variation in the frequency of recombinants between PepGMV and PHYVV, between genomic DNAs, and between levels of anthropization of the host population were analyzed by Fisher’s exact test, with a Yates correction for small sample size when necessary (Sokal and Rohlf 1995). Bivariate tests were used to analyze the association between ecological factors of the chiltepin populations (species diversity as number of plant species, genetic diversity as expected heterozygosity, and plant density as plants/m2) and evolutionary parameters of the virus populations (genetic diversity, selection pressures, rate of synonymous and non-synonymous mutations, and frequency of recombinant profiles). Evolutionary parameters used in these bivariate tests were estimated grouping sequences according to population, and eight populations with at least four sequences each were retained for the analysis. Linear and non-linear models were considered in each bivariate analysis, and the model with the highest r-value was chosen as the best explaining the relationship between each pair of variables. In addition, for each bivariate analysis model, selection analyses were performed (Burnham and Anderson 2002). Linear and non-linear model were ranked according to Akaike Information criteria (AIC) scores (R library: AICcmodavg), and the model with the lowest AIC was selected as the best explanatory model. In all cases, both approaches led to the same results. Statistical analyses were performed using the software package SPSS 20 (SPSS Inc., Chicago, IL).

3 Results

3.1 Genetic diversity of chiltepin-infecting PepGMV and PHYVV populations

Overall, haplotype diversity indexes calculated for each DNA component of PepGMV and PHYVV were equal to 1.0, indicating that each isolate had a different haplotype. Genetic diversity (π) was significantly lower in DNA-A than in DNA-B for both PepGMV (0.049 ± 0.003 vs. 0.097 ± 0.009; F1,2161 = 273.1; P < 1 × 104) and PHYVV (0.053 ± 0.004 vs. 0.125 ± 0.006; F1,1721 = 191.7; P < 1 × 104). Both viruses showed similar genetic diversity in the DNA-A (F1,1941 = 1.23; P = 0.060), but π was lower in the DNA-B of PepGMV than in that of PHYVV (F1,1941 = 41.92; P < 1 × 104) (Table 1). Genetic diversities were also obtained for each gene within the PepGMV and the PHYVV genomes (Supplementary Table S3). In both viruses, genes from DNA-A presented lower genetic diversities than those in DNA-B (F > 586.57; P < 1 × 104). For PepGMV, all genes encoded by the DNA-A showed similar π values (π: 0.047–0.050; F3,4323 = 2.42; P = 0.172), whereas in DNA-B, the MP gene presented lower genetic diversity than the NSP gene (π: 0.083 ± 0.008 vs. 0.111 ± 0.009; F1,2161 = 15.06; P < 1 × 104). Similarly, for PHYVV, all genes contained in the DNA-A showed similar π values (π: 0.046–0.069; F3,3443 = 1.05; P = 0.278); and in the DNA-B, the MP had a lower genetic diversity than the NSP gene (π: 0.079 ± 0.006 vs. 0.170 ± 0.014; F1,1721 = 110.0; P < 1 × 104) (Supplementary Table S3).

Table 1.

Genetic diversity (π), dN/dS, dN, and dS values in the PepGMV and PHYVV genomic DNAs.

Virus Habitata πb
dN/dSc
dNc
dSc
DNA-A DNA-B DNA-A DNA-B DNA-A DNA-B DNA-A DNA-B
PepGMV Wild 0.025 ± 0.003 0.091 ± 0.010 0.217 ± 0.018 0.116 ± 0.008 0.017 ± 0.001 0.040 ± 0.002 0.078 ± 0.004 0.281 ± 0.009
Let-standing 0.043 ± 0.002 0.098 ± 0.010 0.168 ± 0.012 0.090 ± 0.035 0.018 ± 0.001 0.024 ± 0.002 0.107 ± 0.006 0.270 ± 0.009
Cultivated 0.068 ± 0.002 0.105 ± 0.008 0.157 ± 0.012 0.112 ± 0.005 0.022 ± 0.001 0.033 ± 0.002 0.140 ± 0.006 0.349 ± 0.013
Alld 0.049 ± 0.003 0.097 ± 0.009 0.167 ± 0.004 0.109 ± 0.002 0.021 ± 0.001 0.034 ± 0.001 0.124 ± 0.002 0.316 ± 0.004
PHYVV Wild 0.059 ± 0.005 0.121 ± 0.006 0.139 ± 0.016 0.107 ± 0.008 0.022 ± 0.001 0.037 ± 0.001 0.159 ± 0.005 0.346 ± 0.011
Let-standing 0.057 ± 0.004 0.124 ± 0.006 0.129 ± 0.051 0.115 ± 0.005 0.021 ± 0.001 0.039 ± 0.002 0.139 ± 0.012 0.344 ± 0.017
Cultivated 0.053 ± 0.006 0.129 ± 0.007 0.141 ± 0.008 0.111 ± 0.003 0.020 ± 0.001 0.040 ± 0.002 0.159 ± 0.009 0.363 ± 0.019
Alld 0.053 ± 0.004 0.125 ± 0.006 0.135 ± 0.005 0.113 ± 0.002 0.021 ± 0.001 0.039 ± 0.001 0.157 ± 0.002 0.349 ± 0.005

aLevel of anthropization (wild; let-standing; cultivated; ALL: wild + let-standing + cultivated).

bValues are mean ± standard errors based in 1,000 replicates bootstrap.

cValues are mean ± standard error based on pairwise determination of dN/dS, dN, and dS.

dValues are mean ± standard error based on values for the four genes of DNA-A concatenated, and on the two genes of DNA-B.

To analyze how anthropization of the host population affected the genetic diversity of PepGMV and PHYVV, we first performed association analyses between habitat and virus phylogenies, which indicated that sequences of PepGMV and PHYVV clustered significantly according to habitat, either wild or cultivated (P < 0.038), with the exception of DNA-B sequences of PHYVV from cultivated habitats (P = 0.487) (Supplementary Table S4). Hence, PepGMV and PHYVV populations are, in general, genetically structured according to habitat. Thus, we calculated π in populations of both viruses belonging to wild, let-standing, and cultivated habitats (Table 1). Genetic diversity in the DNA-A of PepGMV differed among habitats (F2,368 = 82.66; P < 1 × 104). The π value was lower in wild than in let-standing populations, and both were lower than in cultivated populations (0.025 ± 0.001, 0.043 ± 0.002, and 0.068 ± 0.002, respectively; P < 0.014). Also, π values in the DNA-B of PepGMV depended on the habitat (F2,368 = 7.68; P = 1 × 103), being higher in cultivated than in wild habitats (π: 0.105 ± 0.008 vs. 0.091 ± 0.010; P = 3 × 103) and intermediate in let-standing populations (π: 0.098 ± 0.010). For PHYVV, differences according to habitat anthropization were not found either in the DNA-A (π: 0.053–0.059; F2,291 = 1.92; P = 0.149) or in the DNA-B (π: 0.121–0.129; F2,291 = 0.96; P = 0.386). The same patterns were observed when each gene was analyzed separately (Supplementary Table S3). Thus, higher levels of habitat anthropization were associated with higher genetic diversity of PepGMV but did not affect PHYVV.

3.2 Selection pressures in chiltepin-infecting PepGMV and PHYVV populations

We explored whether the observed changes in the genetic diversity of PepGMV populations at increasing habitat anthropization were the consequence of adaptive and/or neutral evolution. To do so, we estimated selection pressures as dN/dS ratio (ω), as well as dN and dS values, for each genomic component and gene of PepGMV isolates in each habitat. The same data were also estimated for PHYVV isolates (Table 1 and Supplementary Table S3).

For PepGMV, average dN/dS in the DNA-A was higher in wild than in let-standing and cultivated populations (ω: 0.217 ± 0.018 vs. 0.0168–0.157 ± 0.012; F2,368 = 13.07; P < 1 × 104) (Table 1). Similar results were obtained for the CP and TrAP genes (F2,368 > 4.89; P < 9 × 103), but dN/dS values of the REn and Rep genes did not differ across habitats (F2,368 < 2.52; P > 0.114) (Supplementary Table S3). In accordance with these results, the frequency of codons under purifying selection in the four genes of the DNA-A of PepGMV was higher in cultivated than in wild habitats (5 per cent vs. 2 per cent; χ21 < 7.81, P > 0.005), the rest of codons being under neutral evolution. In the DNA-B, dN/dS was similar in the three habitats (ω: 0.090–0.112; F2,368 = 1.15; P = 0.319), with the same trend in both the MP and the NSP genes (F2,368 < 2.48; P > 0.116) (Table 1 and Supplementary Table S3). The frequency of codons under purifying selection did not vary across habitats (14 per cent vs. 18 per cent; χ21 < 2.32, P > 0.128). No codons under positive selection were found. For PHYVV, dN/dS in the DNA-A did not depend on the level of habitat anthropization, either considering all genes together (ω: 0.129–0.141; F2,291 = 0.411; P = 0.663) or each gene individually (F2,291 < 2.26; P > 0.106). Selection pressures in the DNA-B also did not differ between habitats (ω: 0.107–0.115; F2,291 = 0.726; P = 0.485), and similar results were obtained for the MP and the NSP genes (F2,291 < 1.65; P > 0.195) (Table 1 and Supplementary Table S3). Accordingly, the frequency of codons under purifying selection did not vary across habitats in neither DNA (2 per cent vs. 3 per cent and 11 per cent vs. 13 per cent, for DNA-A and DNA-B, respectively; χ21 < 0.54, P > 0.462). No codons under positive selection were found in the PHYVV genome.

Average dN in the DNA-A of PepGMV was higher in cultivated than let-standing and wild populations (dN: 0.022 ± 0.001 vs. 0.018–0.017 ± 0.001; F2,368 = 10.09; P < 1 × 104). The same result was obtained for the REn and Rep genes (F2,368 > 43.30; P < 1 × 104), but dN of the CP and TrAP was not affected by the level of habitat anthropization (F2,368 > 0.189; P < 0.828). In the DNA-B, dN was higher in cultivated than in let-standing and wild populations, either considering the complete genomic segment (dN: 0.040 ± 0.002 vs. 0.024–0.032 ± 0.002; F2,368 = 12.82; P < 1 × 104) or the MP and NSP genes separately (F2,368 > 7.85; P < 1 × 104) (Table 1 and Supplementary Table S3). For PHYVV, habitat anthropization did not affect dN either in the DNA-A (dN: 0.020–0.022; F2,291 = 2.72; P = 0.067) or in the DNA-B (dN: 0.039–0.043; F2,291 = 1.13; P = 0.324) (Table 1), and the same was found when each gene was analyzed separately (F2,291 < 2.20; P > 0.060) (Supplementary Table S3).

Average dS in the DNA-A of PepGMV was higher in cultivated than in let-standing and wild habitats (dS: 0.140 ± 0.005 vs. 0.107–0.078 ± 0.006; F2,368 = 50.79; P < 1 × 104) (Table 1). Similar results were obtained when each gene was analyzed separately (F2,368 > 45.82; P < 1 × 104) (Supplementary Table S3). For the DNA-B, dS was again higher in cultivated than in let-standing and wild populations (dS: 0.349 ± 0.013 vs. 0.270–0.281 ± 0.009; F2,368 = 12.15; P < 1 × 104) (Table 1), and the same trend was observed for each DNA-B gene separately (F2,368 > 12.00; P < 1 × 104) (Supplementary Table S3). For PHYVV, dS in the DNA-A did not depend on the level of habitat anthropization, either when all genes were considered together (dS: 0.139–0.159; F2,291 = 2.07; P = 0.128) or each of them separately (F2,291 < 1.57; P > 0.210) (Supplementary Table S3). Similarly, habitat anthropization did not affect dS values of the DNA-B considering all genes together (dS: 0.344–0.363; F2,291 = 0.37; P = 0.695), and the MP and NSP genes individually (F2,291 < 1.17; P > 0.313) (Table 1 and Supplementary Table S3).

In summary, genes of PepGMV and PHYVV are under purifying selection, which is stronger in the DNA-B. Both dN and dS are generally smaller in DNA-A than in DNA-B genes, which is compatible with the higher genetic diversity that we observed in the DNA-B of the two studied begomoviruses. Finally, habitat anthropization is associated with a smaller increase of dN (adaptive mutations), and with a much larger increase of dS (neutral mutations), in the DNA-A and DNA-B of PepGMV. This could explain the observed changes in genetic diversity and selection pressures across habitats.

3.3 Frequency of recombinants in chiltepin-infecting PepGMV and PHYVV populations

As recombination is thought to play a key role in generating genetic diversity of begomoviruses (Lefeuvre and Moriones 2015), we analyzed the frequency of recombinants in PepGMV and PHYVV populations and their distribution across levels of anthropization.

The DNA-A and DNA-B full-length sequences of forty-seven PepGMV isolates and of forty-two PHYVV isolates were used to determine the frequency of inter- and intra-specific recombinants in the virus populations. Only six DNA-B sequences were detected as inter-specific recombinants between PepGMV and PHYVV, and the rest of recombinant isolates were intra-specific (Supplementary Figs S1 and S2). For PepGMV, recombination signal was detected in eighteen DNA-A and twenty DNA-B sequences, with 57 per cent (27/47) of isolates being recombinants in at least one of the DNA components (Table 2 and Supplementary Fig. S1). For PHYVV, six recombinant sequences in the DNA-A and nine in the DNA-B were detected, with 31 per cent (13/42) of isolates showing recombination signal in at least one DNA component (Table 2 and Supplementary Fig. S2). For both viruses, frequency of recombinants in the DNA-A was similar to that in the DNA-B of the same virus (χ21 < 0.73, P > 0.55). However, PepGMV showed a higher frequency of recombinants than PHYVV either considering recombinants in at least one DNA component or analyzing recombination in each genomic component separately (χ21 > 4.51, P < 0.04). Recombinant isolates were found in populations under the three levels of human management and in all the years sampled (Supplementary Figs S1 and S2).

Table 2.

Number of intra-specific recombinants and of breakpoints in the genome of chiltepin-infecting PepGMV and PHYVV isolates.

Virus All
Wild
Let-standing
Cultivated
Recombinantsa Breakpointsb Recombinantsa Breakpointsb Recombinantsa Breakpointsb Recombinantsa Breakpointsb
PepGMV DNA-A 38% (18/47) 20 63% (12/19) 16 40% (4/10) 5 11% (2/18) 4
DNA-B 43% (20/47) 21 42% (8/19) 19 50% (5/10) 14 39% (7/18) 8
Bothc 57% (27/47) 41 68% (13/19) 35 70% (7/10) 19 39% (7/18) 12
PHYVV DNA-A 14% (6/42) 5 5% (1/19) 2 18% (2/11) 4 25% (3/12) 3
DNA-B 21% (9/42) 19 21% (4/19) 10 18% (2/11) 4 25% (3/12) 6
Bothc 31% (13/42) 24 21% (4/19) 12 27% (3/11) 8 50% (6/12) 9

Virus isolates may have more than one recombinant fragment in the same genomic DNA. The same recombination breakpoint may appear in isolates from different habitats.

aPercentage of recombinant sequences (number of recombinants out of the total number of isolates sequenced).

bNumber of different breakpoints in the recombinant sequences.

cNumber of recombinants in at least one of the genomic DNAs out of the total number of isolates sequenced and the corresponding number of breakpoints.

Recombination breakpoints were also mapped. In PepGMV, twenty recombination breakpoints were detected in the DNA-A and twenty-one in the DNA-B, whereas in PHYVV, five and nineteen breakpoints were detected in DNA-A and DNA-B, respectively (Fig. 2). Localization of these recombination breakpoints in the virus genome components indicated the presence of several recombination ‘hotspots’ (Fig. 2). Two ‘hotspots’ in the DNA-B were detected associated with inter-specific recombination: regions between nucleotides 1646–1722 and 2106–2122 (both in the MP gene), with 29 per cent (4/14) of the recombination breakpoints each (Fig. 2). All the inter-specific recombinants had a PepGMV isolate as a major parental. Intra-specific recombination ‘hotspots’ were also detected in both PepGMV and PHYVV genomes. In PepGMV, two recombination ‘hotspots’ were found in DNA-A. One between nucleotides 845–923 (3’-end of the CP gene), which included 20 per cent (4/20) of recombination breakpoints, with one of these present in eight isolates. The second was located between nucleotides 2066–2169 (central region of the Rep gene) and included 30 per cent (6/20) of the breakpoints, with one of them present in six isolates and other present in three (Fig. 2). Also, two recombination ‘hotspots’ were found in the DNA-B: the region between nucleotides 865–933 (3’-end of the NSP gene), which contained 53 per cent (11/21) of recombination breakpoints, with one present in eight isolates; and the region between nucleotides 1443–1490 (3’-end of the MP gene), which contained 24 per cent (5/21) of the recombination breakpoints with one of them present in eight isolates. In PHYVV, two recombination ‘hotspots’ were detected in the DNA-A: regions between nucleotides 341–364 (5’-end of the CP gene) and 2471–2543 (intergenic region), each of them containing half of the recombination breakpoints. No recombination ‘hotspots’ were found in DNA-B, where recombination breakpoints were evenly distributed across this genomic segment.

Figure 2.

Figure 2.

Distribution and abundance of recombination breakpoints in both the DNA-A and the DNA-B of PepGMV and PHYVV. x axis indicates the position (nt) of each intra-specific (blue line) and inter-specific (red line) recombination breakpoint in each component of the viral genome. Colored arrows denote the position and orientation of each gene. CP, blue; REn, green; TrAP, yellow; Rep, red; NSP, purple; MP, orange. y axis indicates the number of virus isolates in which each recombination breakpoint was detected.

The effect of habitat anthropization in the frequency of recombinants was also analyzed (Table 2). For such analyses, the frequency of recombinant profiles, rather than the frequency of recombinant sequences, was used (see Supplementary Figs S1 and S2). Inter-specific recombinants were more frequent in wild than in let-standing and cultivated habitats (χ21 = 5.94, P = 0.051). Indeed, 83 per cent (4/5) of the inter-specific recombinants came from wild habitats. Frequency of intra-specific recombinants in the PepGMV DNA-A was higher in virus populations from wild than from let-standing and cultivated habitats (χ21 > 4.67, P < 0.010). The level of anthropization of the host population did not affect the frequency of intra-specific recombinants in the DNA-B of PepGMV or in either of the PHYVV genomic DNAs (χ21 < 2.53, P > 0.283).

Thus, the frequency of intra-specific recombinants is high in PepGMV and PHYVV, being higher in the first virus species. In addition, the frequency of PepGMV recombinants decreases in cultivated populations when compared with let-standing and wild ones, but the frequency of PHYVV recombinants is similar across habitats.

3.4 Frequency of re-assortants in chiltepin-infecting PepGMV and PHYVV populations

We also analyzed the frequency of PepGMV and PHYVV re-assortants, as re-assortants are also a potential source of genetic diversity for begomoviruses (Briddon et al. 2010). To do so, we followed two approaches: (1) exploring the extent of congruence between the DNA-A and the DNA-B phylogenies and (2) concatenating the DNA-A and DNA-B sequences, and analyzing the presence of recombination breakpoints near the artificial boundaries of both genomic components.

For PepGMV, the position of 43 per cent (20/47) of the sequences was incongruent between the DNA-A and the DNA-B phylogenies, which was considered as indicative of re-assortants (Fig. 3). The frequency of sequence pairs that showed incongruent phylogenetic positions was higher in wild than in let-standing and cultivated populations (68 per cent, 13/19; 50 per cent, 5/10; and 28 per cent, 5/18, respectively) (χ22 = 9.36, P = 0.009). For PHYVV, 26 per cent (11/42) of the sequences showed incongruent positions between the DNA-A and DNA-B phylogenies, their frequency being similar in the three habitats (21 per cent, 4/19; 30 per cent, 3/10, and 22 per cent, 4/18; in wild, let-standing, and cultivated habitats) (χ22 = 0.32, P = 0.854) (Fig. 3).

Figure 3.

Figure 3.

Composite phylogenies of the DNA-A and DNA-B of PepGMV and PHYVV. Lines denote isolates with significant (green) and non-significant (red) phylogenetic congruence between DNA-A and DNA-B as detected by CopyCat. Isolates are identified by a three-letter code indicating the chiltepin population (see Supplementary Table S2) followed by the number of the sample, a one-letter code indicating the level of human management (C, cultivated; L, let-standing; W, wild) and the year of collection.

Results of the tests for phylogenetic congruence were confirmed by the analysis of the presence of recombination breakpoints in concatenated sequences of the DNA-A and DNA-B. For PepGMV, 53 per cent (25/47) of the sequences had a recombination breakpoint near the boundary of the two genomic DNAs and therefore were considered as re-assortants. These were more frequent in wild than in let-standing and cultivated populations (74 per cent, 14/19; 50 per cent, 5/10; and 33 per cent, 6/18, respectively) (χ22 = 6.10, P = 0.047). For PHYVV, 40 per cent (17/42) of the sequences were re-assortants, their frequency not being affected by the level of anthropization (8/19, 4/11, and 5/12, for wild, let-standing, and cultivated populations, respectively) (χ22 = 0.11, P = 0.949).

These results indicate that re-assortment occurs more frequently in PepGMV than in PHYVV populations. PepGMV re-assortants are more frequent in wild than in let-standing and cultivated habitats, whereas the frequency of PHYVV re-assortants is similar in all habitats.

3.5 Association of ecological and epidemiological factors in the genetic diversity of PepGMV populations

To test whether changes in habitat species richness, host genetic diversity, host plant density, and virus prevalence were associated with the genetic diversity of PepGMV populations, we calculated π, dN/dS, dN, dS, and the frequency of recombinants in the DNA-A and in the DNA-B of isolates from eight PepGMV populations (see Section 2). Using this dataset, we performed bivariate analyses of these evolutionary parameters onto the chiltepin expected heterozygosity (He), the habitat species richness (SR), the chiltepin plant density, and the PepGMV prevalence in the corresponding host populations considering linear and non-linear models. Results for the best-ranked model explaining each bivariate relationship are shown in Fig. 4. Bivariate analyses for which none of the considered models fitted the data are not shown.

Figure 4.

Figure 4.

Bivariate relationships between ecological and epidemiological factors of chiltepin populations and evolutionary parameters of PepGMV populations. Significant regressions of plant species richness, chiltepin genetic diversity and plant density, and PepGMV prevalence onto the genetic diversity, the rate of non-synonymous and synonymous substitutions, the frequency of recombinants, and the frequency of different recombinant profiles in PepGMV populations are represented for each genomic component: DNA-A (blue) and DNA-B (red). Species richness (SR) is expressed as number of species; host genetic diversity (He) is expressed as expected heterozygosity; host plant density is expressed as plants/m2, and virus prevalence as percentage of infected plants over the total of plants sampled. Note the different scales in the x- and y axis depending on the factor-parameter combination. Data correspond to eight populations with at least four fully sequenced viral isolates.

The genetic diversity of the PepGMV DNA-A was negatively associated with SR (R2 = 0.28; P = 0.042) and positively associated with plant density (R2 = 0.56; P = 0.050). The dN and dS values were negatively associated with SR (R2 > 0.22; P < 0.055) and positively associated with plant density and PepGMV prevalence (R2 > 0.25; P > 0.062). Similarly, the percentage of recombinants and of different recombination profiles in the DNA-A was positively associated with SR (R2 = 0.84; P = 0.004 and R2 = 0.73; P = 0.014, respectively) and negatively associated with plant density and PepGMV prevalence (R2 > 0.52; P < 0.057 and R2 > 0.47; P < 0.049) (Fig. 4).

In the DNA-B of PepGMV, genetic diversity and dN/dS was not significantly associated with any of the four variables considered (r < 0.24; P > 0.263). The dN and dS values were negatively associated with He (R2 > 0.58; P > 0.046) and positively associated with plant density (R2 > 0.27; P > 0.043). The percentage of recombinants, and of different recombination profiles, in the DNA-B was positively associated with SR (R2 = 0.84; P = 0.004 and R2 = 0.50; P = 0.055, respectively) and negatively correlated with PepGMV prevalence (R2 = 0.79; P = 0.007 and R2 = 0.63; P = 0.034) (Fig. 4). No other bivariate analyses yielded a significant correlation, and analyses are not shown. Finally, parallel bivariate analyses for six PHYVV populations yielded non-significant correlations in all cases (R2 < 0.28; P > 0.097).

Thus, when having an effect, lower chiltepin genetic diversity and habitat species richness increase genetic diversity, dN and dS, and reduce the frequency of recombinants in PepGMV populations; whereas lower plant density and virus prevalence have the opposite effect on the four evolutionary parameters.

4 Discussion

Ecosystem biodiversity provides fundamental services that greatly impact human welfare (Naeem et al. 2009). Growing evidence indicates that one such service is the ability to reduce disease risk (Keesing et al. 2010; Ostfeld and Keesing 2012; Joseph et al. 2013). Understanding the mechanisms by which biodiversity loss influences the appearance of disease outbreaks becomes critical to maintain this ecosystem service, even more in the current situation of accelerating biodiversity declines (Mace, Masundire, and Baillie 2005; WWF Living Planet Report 2012). It has been proposed that the reduction of disease risk is achieved by two mechanisms: first, by reducing parasite prevalence/transmission (Keesing, Holt, and Ostfeld 2006, Ostfeld and Keesing 2012) and, as a result of these epidemiological changes, by reducing parasite genetic diversification (Scholle et al. 2013; Lanfear, Kokko, and Eyre-Walker 2014). Although the former mechanism has been studied in a variety of parasites and hosts (Ostfeld and Keesing 2012; Pagán et al. 2012; Laporta et al. 2013; Alexander et al. 2014), the latter has seldom been analyzed (Burdon and Thrall 2008; Scholle et al. 2013; Alexander et al. 2014). Here, we provide evidence of the role that biodiversity has in determining the genetic diversity of two plant begomoviruses—PepGMV and PHYVV—infecting host populations under different levels of anthropization.

The patterns of genetic diversity in chiltepin-infecting PepGMV and PHYVV populations share similarities between them and with those described for other begomoviruses. First, values of genetic diversity are in the same order of magnitude than in other begomoviruses that infect non-cultivated hosts and are generally higher than for crop-infecting begomoviruses (Silva et al. 2011; Lima et al. 2013). Second, PepGMV and PHYVV proteins are under purifying selection, as generally described for most begomoviruses (Sanz et al. 1999; Silva et al. 2011; González-Aguilera et al. 2012; Lima et al. 2013; Rocha et al. 2013). Third, PepGMV and PHYVV populations had relatively high frequency of recombinants and re-assortants, which has been widely documented to have a key role in the evolution of begomoviruses (e.g., Sanz et al. 2000; García-Andrés et al. 2006; Briddon et al. 2010; Martin et al. 2011; Lima et al. 2013; Silva et al. 2014). Fourth, our analysis revealed recombination ‘hotspots’ in the PepGMV and PHYVV genomes already described for other begomoviruses (Fig. 2): A recombination hotspot in the 3’-end of the CP gene is present in Tomato yellow leaf curl virus, resulting in biologically functional recombinants (García-Andrés et al. 2007). The central region of the Rep gene has been also described as highly tolerant to recombination in a number of begomoviruses (Lefeuvre et al. 2007a), and the 5’-end of this gene seems to be a recombination ‘hotspots’ in mono- and bipartite begomoviruses (Lefeuvre et al. 2007a,b).

Although these common features could suggest that both viruses would evolve similarly under conditions of habitat anthropization and biodiversity loss, our results indicate that this is not the case. Habitat anthropization was associated with increased genetic diversity of PepGMV but not of PHYVV. We can only speculate on the causes of this differential effect. For instance, in cultivated chiltepin populations, 80 per cent of plants infected with PHYVV are also infected with PepGMV, whereas this proportions is significantly smaller in let-standing and wild populations (62 per cent and 56 per cent, respectively; χ22 = 10.10, P = 6 × 103) (Rodelo-Urrego et al. 2013). Because co-infection has been shown to reduce virus genetic diversity (Dennehy et al. 2013), it may counteract the effect of reduced biodiversity in PHYVV populations. Alternatively, analyses of association between habitat and PepGMV and PHYVV phylogenies indicated a much stronger association in PepGMV than in PHYVV populations (see MC P values in Supplementary Table S4). These analyses suggest that across-habitat migration is higher in PHYVV than in PepGMV populations, which may mask the effects of habitat anthropization. The positive relationship between PepGMV genetic diversity and habitat anthropization corroborates previous data of our group based on analyses of a fragment of the CP gene (Rodelo-Urrego et al. 2013). Work of our group has also shown that increased level of anthropization in chiltepin populations is associated with lower biodiversity and higher plant density (Pagán et al. 2012) and with higher PepGMV prevalence (Rodelo-Urrego et al. 2013). Accordingly, here we observed that PepGMV genetic diversity was negatively correlated with habitat species richness and positively correlated with plant density and virus prevalence in chiltepin populations. These results support theoretical predictions on the effect of biodiversity in the genetic diversity of virus populations (Burdon and Thrall 2008; Scholle et al. 2013; Lanfear, Kokko, and Eyre-Walker 2014). Also, our results are compatible with previous reports of faster diversification rates in animal virus populations with higher transmission rates and prevalence (Kurath et al. 2003; Einer-Jensen et al. 2004; Volk et al. 2010). Remarkably, our observations are in apparent contradiction with recent works pointing to a higher genetic diversity of begomovirus populations in non-cultivated than in cultivated hosts (Silva et al. 2011, 2012; Lima et al. 2013; Rocha et al. 2013). However, these works compare the genetic diversity of different viruses, isolated from different habitats. Hence, results derived from such works are difficult to compare with ours as they might be reflecting virus species-specific dynamics, rather than the effect of habitat anthropization. To our knowledge, our work is the first one using an approach that allows a direct association between changes in virus genetic diversity and level of habitat anthropization.

The larger PepGMV genetic diversity at higher levels of habitat anthropization was associated with a small increase of non-synonymous nucleotide substitution rates and with a much larger increase of synonymous substitution rates (Table 1). Accordingly, most sites in the PepGMV genome were under neutral evolution, just a few under purifying selection and none under diversifying selection. Hence, neutral evolution, rather than adaptive selection, might be responsible for the higher PepGMV variability in cultivated chiltepin populations, in agreement with observations in other begomoviruses (Lima et al. 2013). In the absence of adaptive selection, virus populations increase their genetic diversity by accelerating mutation rates or enlarging population sizes (Moya et al. 2000). Analyses of the PepGMV CP gene indicated that nucleotide substitution rates did not vary across levels of habitat anthropization (Rodelo-Urrego et al. 2013). In turn, here we showed that increasing host plant density and virus prevalence, conditions that favor higher virus population sizes, were associated with higher dN and dS values. Since plant density and virus prevalence are higher in cultivated habitats, higher genetic diversity of PepGMV populations in these habitats might be explained by larger virus population sizes.

Because recombination plays a key role in the emergence and evolution of begomoviruses (Lefeuvre and Moriones 2015), we also analyzed the effect of habitat anthropization in the frequency of PepGMV recombinants. This frequency was higher in wild host populations, where species richness is higher, and both chiltepin density and PepGMV prevalence are lower. Such conditions reduce the probability of virus encounter with susceptible individuals, which in turn reduce virus transmission rate and population size, favoring genetic drift. Theory predicts that genetic drift may suffice to promote recombination (Barton 2010), which could explain our results. Indeed, the frequency of PepGMV recombinants was positively correlated with habitat species richness and negatively correlated with plant density and virus prevalence. The effect of habitat anthropization and biodiversity loss in the frequency of PepGMV recombinants is in marked contrast to trends observed for the genetic diversity of PepGMV and suggest that mutation, rather than recombination, accounts for most of the variation in the genetic diversity of PepGMV across habitats. Indeed, such primary role of mutation over recombination has been shown for other begomoviruses (Lima et al. 2013). It could be argued that the frequency of recombinants is not necessarily representative of the contribution of recombination to the genetic diversity of the PepGMV populations when, for instance, lower recombination rates involve: (1) exchange of larger genomic fragments or (2) more genetically divergent parentals. However, the length of the recombinant fragments (DNA-A: 1,000–1,200 nt; DNA-B: 500–700 nt on average), and the genetic diversity between recombination parentals (DNA-A: 0.056–0.057; DNA-B: 0.155–0.169), was similar regardless of the level of habitat anthropization. Therefore, it is likely that our results reflect a smaller contribution of recombination to the genetic diversity of PepGMV populations.

Some cautionary comments, however, are called upon our results. First, we did not attempt to obtain a direct estimate of the relative contribution of mutation and recombination to the genetic diversity PepGMV and PHYVV populations, as it has been done for other begomoviruses (Lima et al. 2013). Hence, our conclusion that mutation accounts for most of the variability in PepGMV genetic diversity across habitats is derived from indirect evidence. Second, the analyses of association between PepGMV evolutionary parameters and ecological/epidemiological factors are based on data from eight chiltepin populations. We are aware that this might be a small sample size. However, it was enough to detect significant, and in many cases strong, correlations between the studied parameters. Third, for these association analyses, estimates of PepGMV evolutionary parameters are based on datasets of four to ten sequences per virus population. This is again a small figure, which may not be representative of the genetic diversity of the virus populations. Nevertheless, rarefaction analyses indicated that four to ten sequences should be enough to capture a significant proportion of the chiltepin-infecting PepGMV genetic diversity (Supplementary Fig. S3). Fourth, it can be argued that rarefaction curves for PHYVV populations are not saturated, which may result in underestimates of PHYVV genetic diversity (Supplementary Fig. S3). However, for the same sample size, our estimates of genetic diversity were still higher in PHYVV than in PepGMV populations. Moreover, the estimates of PHYVV genetic diversity presented here are comparable to those obtained in Rodelo-Urrego et al. (2013) for the same virus populations but using up to seventeen CP sequences per population. Hence, such bias in the estimate of virus genetic diversity should not invalidate our results. Fifth, because virus sequences were sampled in different years, our results may be biased if the virus populations were genetically structured according to sampling date. However, root-to-tip regressions and NST values (calculated as described in Rodelo-Urrego et al. (2013)) indicated that there is no such genetic structure (data available upon request). Analyses in other host-pathogen system would help to tests the generality of our observations.

With the above-mentioned caveats, our results provide evidence of the relevant role that biodiversity and habitat anthropization may have in shaping the genetic diversity of plant virus populations. The reported increase in PepGMV genetic diversity associated with habitat anthropization, and biodiversity loss supports theoretical predictions. Because genetic diversification has been proposed to be involved in the appearance of new viral diseases (Holmes 2009), our results may contribute to understand the factors driving virus emergence. Such factors may have differential effects even in two closely related viruses as PepGMV and PHYVV, which highlights the complexity of developing generally applicable predictive models of virus emergence.

Data availability

Sequences determined in this work can be found in the EMBL database under accession numbers: LN848751 to LN848928.

Supplementary data

Supplementary data is available at VEVOLU online.

Supplementary Table S1
ve_vev004_index.html (702B, html)

Acknowledgements

Antolín López-Quirós and Miguel Ángel Mora provided excellent technical assistance. This work was, in part, supported by grants BIOCON05/101 (Fundación BBVA, Spain) and CGL2013-44952-R (Plan Estatal de I+D+i, Spain). I.P. was supported by a Ramón y Cajal contract from the Ministerio de Economía y Competitividad (RYC-2011-08574) and M.R.-U. by a UPM PhD fellowship. We thank two anonymous reviewers for their very useful comments.

Conflict of interest. None declared.

References

  1. Alexander H. M., et al. (2014) ‘Plant-Virus Interactions and the Agro-Ecological Interface’, European Journal of Plant Pathology, 138: 529–47. [Google Scholar]
  2. Anderson P. K., et al. (2004) ‘Emerging Infectious Diseases of Plants: Pathogen Pollution, Climate Change and Agrotechnology Drivers’, Trends in Ecology and Evolution, 19: 535–44. [DOI] [PubMed] [Google Scholar]
  3. Barton N. H. (2010) ‘Mutation and the Evolution of Recombination’, Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences, 365: 1281–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Briddon R. W., et al. (2010) ‘Distinct Evolutionary Histories of the DNA-A and DNA-B Components of Bipartite Begomoviruses’, BMC Evolutionary Biology, 10: 97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Brown J. K., Poulos B. T. (1990) ‘Serrano Golden Mosaic Virus: A Newly Identified Whitefly-Transmitted Geminivirus of Pepper and Tomato in the United States and Mexico’, Plant Disease, 74: 720. [Google Scholar]
  6. Burdon J. J., Thrall P. H. (2008) ‘Pathogen Evolution across the Agro-Ecological Interface: Implications for Disease Management’, Evolutionary Applications, 1: 57–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Burnham K. P., Anderson D. R. (2002) Model Selection and Multi- model Inference: A Practical Information Theoretic Approach, 2nd edn. New York: Springer-Verlag. [Google Scholar]
  8. Casas A., et al. (2007) ‘In Situ Management and Domestication of Plants in Mesoamerica’, Annals of Botany, 100: 1101–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Conow C., et al. (2010) ‘Jane: A new tool for the co-phylogeny reconstruction problem’, Algorithms for Molecular Biology, 5: 16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Dennehy J. J., et al. (2013) ‘Frequent Coinfection Reduces RNA Virus Population Genetic Diversity’, Journal of Heredity, 104: 704–12. [DOI] [PubMed] [Google Scholar]
  11. Dobson A., Foufopoulos J. (2001) ‘Emerging Infectious Pathogens of Wildlife’, Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences, 356: 1001–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Drummond A. J., Rambaut A. (2007) ‘BEAST: Bayesian Evolutionary Analysis by Sampling Trees’, BMC Evolutionary Biology, 7: 214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Duffy S., Holmes E. C. (2008) ‘Phylogenetic Evidence for Rapid Rates of Molecular Evolution in the Single-Stranded DNA Begomovirus Tomato Yellow Leaf Curl Virus’, Journal of Virology, 82: 957–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Edgar R. C. (2004) ‘MUSCLE: Multiple Sequence Alignment with High Accuracy and High Throughput’, Nucleic Acids Research, 32: 1792–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Einer-Jensen K., et al. (2004) ‘Evolution of the Fish Rhabdovirus Viral Haemorrhagic Septicaemia Virus’, Journal of General Virology, 85: 1167–79. [DOI] [PubMed] [Google Scholar]
  16. Elena S. F., Fraile A., García-Arenal F. (2014) ‘Evolution and Emergence of Plant Viruses’, Advances in Virus Research, 88: 161–91. [DOI] [PubMed] [Google Scholar]
  17. Fargette D., et al. (2008) ‘Diversification of Rice Yellow Mottle Virus and Related Viruses Spans the History of Agriculture from the Neolithic to the Present’, PLoS Pathogens, 4: e1000125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. García-Arenal F., McDonald B. A. (2003) ‘An Analysis of the Durability of Resistance to Plant Viruses’, Phytopathology, 93: 941–52. [DOI] [PubMed] [Google Scholar]
  19. García-Andrés S., et al. (2006) ‘Begomovirus Genetic Diversity in the Native Plant Reservoir Solanum nigrum: Evidence for the Presence of a New Virus Species of Recombinant Nature’, Virology, 350: 433–42. [DOI] [PubMed] [Google Scholar]
  20. García-Andrés S., et al. (2007) ‘Frequent Occurrence of Recombinants in Mixed Infections of Tomato Yellow Leaf Curl Disease-Associated Begomoviruses’, Virology, 365: 210–19. [DOI] [PubMed] [Google Scholar]
  21. Garzón-Tiznado J. A., et al. (1993) ‘Inoculation of Peppers with Infectious Clones of a New Geminivirus by Biolistic Procedure’, Phytopathology, 83: 514–21. [Google Scholar]
  22. Gibbs A. J., et al. (2008) ‘The Prehistory of Potyviruses: Their Initial Radiation was during the Dawn of Agriculture’, PLoS One, 3: e2523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. González-Aguilera J., et al. (2012) ‘Genetic Structure of a Brazilian Population of the Begomovirus Tomato Severe Rugose Virus (ToSRV)’, Tropical Plant Pathology, 37: 346–53. [Google Scholar]
  24. Holmes E. C. (2009) ‘The Evolutionary Genetics of Emerging Viruses’, Annual Review of Ecology, Evolution, and Systematics, 40: 353–72. [Google Scholar]
  25. Jones K. E., et al. (2008) ‘Global Trends in Emerging Infectious Diseases’, Nature, 451: 990–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Jones R. A. C. (2009) ‘Plant Virus Emergence and Evolution: Origins, New Encounter Scenarios, Factors Driving Emergence, Effects of Changing World Conditions, and Prospects for Control’, Virus Research, 141: 113–30. [DOI] [PubMed] [Google Scholar]
  27. Joseph M. B., et al. (2013) ‘Does Life History Mediates Changing Disease Risk When Communities Disassemble?’, Ecology Letters, 16: 1405–12. [DOI] [PubMed] [Google Scholar]
  28. Keesing F., Holt R. D., Ostfeld R. S. (2006) ‘Effects of Species Diversity on Disease Risk’, Ecology Letters, 9: 485–98. [DOI] [PubMed] [Google Scholar]
  29. Keesing F., et al. (2010) ‘Impacts of Biodiversity on the Emergence and Transmission of Infectious Diseases’, Nature, 468: 647–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Kosakovsky-Pond S. L., Frost S. D. (2005) ‘Datamonkey: Rapid Detection of Selective Pressure on Individual Sites of Codon Alignments’, Bioinformatics, 21: 2531–3. [DOI] [PubMed] [Google Scholar]
  31. Kurath G., et al. (2003) ‘Phylogeography of Infectious Haematopoietic Necrosis Virus in North America’, Journal of General Virology, 84: 803–14. [DOI] [PubMed] [Google Scholar]
  32. Lanfear R., Kokko H., Eyre-Walker A. (2014) ‘Population Size and the Rate of Evolution’, Trends in Ecology and Evolution, 29: 33–41. [DOI] [PubMed] [Google Scholar]
  33. Laporta G. Z., et al. (2013) ‘Biodiversity Can Help Prevent Malaria Outbreaks in Tropical Forests’, PLoS Neglected Tropical Diseases, 7: e2139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Lefeuvre P., Moriones E. (2015) ‘Recombination as a Motor of Host Switches and Virus Emergence: Geminiviruses as Case Studies’, Current Opinion in Virology, 10: 14–9. [DOI] [PubMed] [Google Scholar]
  35. Lefeuvre P., et al. (2007a) ‘Begomovirus ‘Melting Pot’ in the South-West Indian Ocean Islands: Molecular Diversity and Evolution through Recombination’, Journal of General Virology, 88: 3458–68. [DOI] [PubMed] [Google Scholar]
  36. Lefeuvre P., et al. (2007b) ‘Avoidance of Protein Fold Disruption in Natural Virus Recombinants’, PLoS Pathogens, 3: e181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Lefeuvre P., et al. (2010) ‘The Spread of Tomato Yellow Leaf Curl Virus from the Middle East to the World’, PLoS Pathogens, 6: e1001164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Legendre P., Desdevises Y., Bazin E. (2002) ‘A Statistical Test for Host-Parasite Coevolution’, Systematic Biology, 51: 217–34. [DOI] [PubMed] [Google Scholar]
  39. Lima A. T. M., et al. (2013) ‘Synonymous Site Variation due to Recombination Explains Higher Genetic Variability in Begomovirus Populations Infecting Non-Cultivated Hosts’, Journal of General Virology, 94: 418–31. [DOI] [PubMed] [Google Scholar]
  40. Mace G. M., Masundire H., Baillie J. E. M. (2005) ‘Biodiversity’. In: Hassan R., Scholes R., Ash N. (eds) Ecosystems and Human Well-Being: Current State and Trends , Vol. 1, pp. 77–122. Washington: Island Press. [Google Scholar]
  41. Martin D. P., et al. (2010) ‘RDP3: A Flexible and Fast Computer Program for Analyzing Recombination’, Bioinformatics, 26: 2462–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Martin D. P., et al. (2011) ‘Recombination in Eukaryotic Single Stranded DNA Viruses’, Viruses, 3: 1699–738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Meier-Kolthoff J. P., et al. (2007) ‘COPYCAT: Cophylogenetic Analysis Tool’, Bioinformatics, 23: 898–900. [DOI] [PubMed] [Google Scholar]
  44. Moya A., et al. (2000) ‘The Evolution of RNA Viruses: A Population Genetics View’, Proceedings of the National Academy of Sciences of the United States of America, 97: 6967–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Naeem S., et al. (2009) Biodiversity, Ecosystem Functioning, and Human Wellbeing: An Ecological and Economic Perspective . Oxford: Oxford University Press. [Google Scholar]
  46. Navas-Castillo J., Fiallo-Olive E., Sanchez-Campos S. (2011) ‘Emerging Virus Diseases Transmitted by Whiteflies’, Annual Review of Phytopathology, 49: 219–48. [DOI] [PubMed] [Google Scholar]
  47. Ostfeld R. S., Keesing F. (2012) ‘Effects of Host Diversity on Infectious Disease’, Annual Review of Ecology, Evolution, and Systematics, 43: 157–82. [Google Scholar]
  48. Pagán I., Holmes E. C. (2010) ‘Long-Term Evolution of the Luteoviridae: Time Scale and Mode of Virus Speciation’, Journal of Virology, 84: 6177–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Pagán I., et al. (2012) ‘Effect of Biodiversity Changes in Disease Risk: Exploring Disease Emergence in a Plant-Virus System’, PLoS Pathogens, 8: e1002796. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Parker J., Rambaut A., Pybus O. G. (2008) ‘Correlating Viral Phenotypes with Phylogeny: Accounting for Phylogenetic Uncertainty’, Infection, Genetics and Evolution, 8: 239–46. [DOI] [PubMed] [Google Scholar]
  51. Pennings P. S. (2012) ‘Standing Genetic Variation and the Evolution of Drug Resistance in HIV’, PLoS Computational Biology, 8: e1002527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Posada D. (2002) ‘Evaluation of Methods for Detecting Recombination from DNA Sequences: Empirical Data’, Bioinformatics, 14: 817–8. [DOI] [PubMed] [Google Scholar]
  53. Rambaut A. (1996) Se-Al: Sequence Alignment Editor. <http://evolve.zoo.ox.ac.uk/> accessed 18 May 2015. [Google Scholar]
  54. Rocha C. S., et al. (2013) ‘Brazilian Begomovirus Populations are Highly Recombinant, Rapidly Evolving, and Segregated Based on Geographical Location’, Journal of Virology, 87: 5784–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Rodelo-Urrego M., et al. (2013) ‘Landscape Heterogeneity Shapes Host-Parasite Interactions and Results in Apparent Plant–Virus Codivergence’, Molecular Ecology, 22: 2325–40. [DOI] [PubMed] [Google Scholar]
  56. Rojas M., et al. (2005) ‘Exploiting Chinks in the Plant’s Armor: Evolution and Emergence of Geminiviruses’, Annual Review of Phytopathology, 43: 361–94. [DOI] [PubMed] [Google Scholar]
  57. Rozas J., et al. (2003) ‘DNA Polymorphism Analyses by the Coalescent and Other Methods’, Bioinformatics, 19:2496–7. [DOI] [PubMed] [Google Scholar]
  58. Sanz A. I., et al. (1999) ‘Genetic Variability of Natural Populations of Cotton Leaf Curl Geminivirus, a Single-Stranded DNA Virus’, Journal of Molecular Evolution, 49: 672–81. [DOI] [PubMed] [Google Scholar]
  59. Sanz A. I., et al. (2000) ‘Multiple Infection, Recombination and Genome Relationships among Begomovirus Isolates Found in Cotton and Other Plants in Pakistan’, Journal of General Virology, 81: 1839–49. [DOI] [PubMed] [Google Scholar]
  60. Scholle S. O., et al. (2013) ‘Viral Substitution Rate Variation can Arise from the Interplay between Within-Host and Epidemiological Dynamics’, The American Naturalist, 182: 494–513. [DOI] [PubMed] [Google Scholar]
  61. Silva F. N., et al. (2014) ‘Recombination and Pseudorecombination Driving the Evolution of the Begomoviruses Tomato Severe Rugose Virus (ToSRV) and Tomato Rugose Mosaic Virus (ToRMV): Two Recombinant DNA-A Components Sharing the Same DNA-B’, Virology Journal, 11: 66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Silva S. J. C., et al. (2011) ‘High Genetic Variability and Recombination in a Begomovirus Population Infecting the Ubiquitous Weed Cleome affinis in Northeastern Brazil’, Archives of Virology, 156: 2205–13. [DOI] [PubMed] [Google Scholar]
  63. Silva S. J. C., et al. (2012) ‘Species Diversity, Phylogeny and Genetic Variability of Begomovirus Populations Infecting Leguminous Weeds in Northeastern Brazil’, Plant Pathology, 61: 457–67. [Google Scholar]
  64. Sokal R. R., Rohlf F. J. (1995) Biometry, 3rd edn W.H. Freeman and Company: New York. [Google Scholar]
  65. Tamura K., et al. (2011) ‘MEGA5: Molecular Evolutionary Genetics Analysis Using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods’, Molecular Biology and Evolution, 28: 2731–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Torres-Pacheco I., et al. (1993) ‘Complete Nucleotide Sequence of Pepper huasteco virus: Analysis and Comparison with Bipartite Geminiviruses’, Journal of General Virology, 74: 2225–31. [DOI] [PubMed] [Google Scholar]
  67. Torres-Pacheco I., et al. (1996) ‘Detection and Distribution of Geminiviruses in Mexico and the Southern United States’, Phytopathology, 86: 1186–92. [Google Scholar]
  68. Volk S., et al. (2010) ‘Genome-Scale Phylogenetic Analyses of Chikungunya Virus Reveal Independent Emergences of Recent Epidemics and Various Evolutionary Rates’, Journal of Virology, 84: 6497–504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Votava E. J., Nabham G. P., Bosland P. W. (2002) ‘Genetic Diversity and Similarity Revealed via Molecular Analysis among and within an In Situ Population and ex situ Accessions of Chiltepin (Capsicum annuum var. glabriusculum)’, Conservation Genetics, 3: 123–9. [Google Scholar]
  70. WWF Living Planet Report (2012) Biodiversity, Biocapacity and Better Choices. <wwf.panda.org> accessed 18 May 2015. [Google Scholar]
  71. Yasaka R., et al. (2014) ‘The Temporal Evolution and Global Spread of Cauliflower Mosaic Virus, a Plant Pararetrovirus’, PLoS One, 9: e85641. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Table S1
ve_vev004_index.html (702B, html)

Data Availability Statement

Sequences determined in this work can be found in the EMBL database under accession numbers: LN848751 to LN848928.


Articles from Virus Evolution are provided here courtesy of Oxford University Press

RESOURCES