Abstract
Reassortment between different species or strains plays a key role in the evolution of multipartite plant viruses and can have important epidemiological implications. Identifying geographic locations where reassortant lineages are most likely to emerge could be a valuable strategy for informing disease management and surveillance efforts. We developed a predictive framework to identify potential geographic hot spots of reassortment based upon spatially explicit analyses of genome constellation diversity. To demonstrate the utility of this approach, we examined spatial variation in the potential for reassortment among Cardamom bushy dwarf virus (CBDV; Nanoviridae, Babuvirus) isolates in Northeast India. Using sequence data corresponding to six discrete genome components for 163 CBDV isolates, a quantitative measure of genome constellation diversity was obtained for locations across the sampling region. Two key areas were identified where viruses with highly distinct genome constellations cocirculate, and these locations were designated as possible geographic hot spots of reassortment, where novel reassortant lineages could emerge. Our study demonstrates that the potential for reassortment can be spatially dependent in multipartite plant viruses and highlights the use of evolutionary analyses to identify locations which could be actively managed to facilitate the prevention of outbreaks involving novel reassortant strains.
Keywords: disease emergence, geographic hot spot, multipartite virus, reassortment
Introduction
Plant pathogens present a major challenge to food security and local economies as an estimated 10–16% of global food production is annually lost to disease (Strange and Scott 2005; Chakraborty and Newton 2011). Using evolutionary analyses to understand the processes that contribute to pathogen diversity and population structure could have important applications in plant disease management (Burdon and Thrall 2008; Prasanna et al. 2010; Acosta-Leal et al. 2011). For example, spatially explicit analyses which reveal how genetic variation is partitioned within and among plant pathogen populations can allow us to make predictions about future disease dynamics and to identify locations where novel pathogens may emerge (Burdon and Thrall 2008; Thrall et al. 2011). Here, we demonstrate how spatially explicit analyses of genetic diversity can be utilized to study geographic differences in the potential for reassortment in multipartite plant viruses.
Reassortment, or pseudo-recombination, occurs only in viruses with segmented genomes and involves the exchange of discrete genome components between different species or genetically distinct strains which coreplicate within the same host cell. This process generates hybrid progeny with novel combinations of genome components inherited from different parental viruses and may lead to the emergence of highly virulent strains (Hou and Gilbertson 1996; Pita et al. 2001; Gu et al. 2007; Chakraborty et al. 2008; Nelson et al. 2008; Chen et al. 2009) or facilitate adaptation to alternative hosts (Idris et al. 2008; Ince et al. 2013). Identifying geographic locations where reassortant lineages are most likely to emerge could be an important strategy to inform disease management and surveillance efforts (Pearce et al. 2009). However, relevant research emphasis has focused almost exclusively on viruses with segmented genomes that are important in the context of global health, such as influenza viruses (Koehler et al. 2008; Pearce et al. 2009; O'Keefe et al. 2010; Ramey et al. 2010; Wille et al. 2011; Barton et al. 2013; Fuller et al. 2013). There have been no explicit attempts to assess spatial variation in the potential for reassortment in multipartite plant viruses, such as begomoviruses (Geminiviridae), bromoviruses (Bromoviridae), nanoviruses (Nanoviridae), and tospoviruses (Bunyaviridae), which pose serious threats to the production of staple food crops and other economically important crops. Agricultural landscapes are often highly fragmented, containing a mosaic of patches of host, reservoir, and nonhost species, and this can lead to spatial genetic structure of plant pathogen populations (Plantegenest et al. 2007). Indeed, several multipartite plant viruses exhibit high levels of spatial population structure (Karan et al. 1994; Tsompana et al. 2005; Prasanna et al. 2010). As this creates geographic constraints on the ability of viruses to overlap and exchange genetic material (Prasanna et al. 2010; Martin et al. 2011a), population structure may lead to geographic differences in the potential for reassortment between genetically distinct strains.
Given that reassortment generates novel combinations of genome components derived from different species or strains, the diversity of combinations that are observed in a population (hereafter referred to as genome constellation diversity) can provide an indication of the frequency of past reassortment events (Dugan et al. 2008). Analyses of genome constellation diversity could also be used to make predictions about the occurrence of future reassortment events. This is because the probability that coinfections will involve viruses with the potential to reassort is determined by the diversity of genotypes with distinct genome constellations which cocirculate in a given location (Barton et al. 2013). Assuming that disease incidence is sufficiently high for coinfections to occur, reassortment is more likely to occur in locations where viruses exhibit a variety of genome constellations, than in locations where the majority of viruses are genetically identical or highly homogeneous in all genome components. Identifying geographic locations where genome constellation diversity is conducive to reassortment could thus be an informative approach used to target areas for increased surveillance and control of multipartite plant viruses or animal viruses with segmented genomes.
For reassortment to be detectable and relevant in an evolutionary and epidemiological context, hybrid progeny must be viable and exhibit little or no fitness deficit relative to other genotypes which cocirculate in a population. Therefore, when genome components from different parental viruses are reassembled, they must be able to function as efficiently as they did within the genomic backgrounds in which they evolved (Martin et al. 2011a). High levels of nucleotide sequence dissimilarity between parental viruses can constrain the fitness of hybrid progeny by disrupting coevolved intragenome interactions (Martin et al. 2005, 2011b; Escriu et al. 2007; Lefeuvre et al. 2007; Rokyta and Wichman 2009). Although the extent to which hybrid fitness declines with increasing genetic distance between parental strains can vary according to the genomic region which is exchanged, genetic exchange among plant viruses rarely yields viable progeny when levels of parental nucleotide sequence identity are lower than 90% (Martin et al. 2005; Escriu et al. 2007; Lefeuvre et al. 2007). However, reassortment between viruses which belong to the same species could often generate viable hybrids if the genome components which are inherited from different parental genotypes are functionally compatible (Grigoras et al. 2014). Surveillance and control efforts should thus be focused in locations where sequence divergence among potential parental viruses is sufficiently high for reassortment to yield hybrid progeny with novel phenotypes, yet sufficiently low for intragenome interactions to be preserved.
We developed a predictive framework to identify potential geographic hot spots of reassortment among Cardamom bushy dwarf virus (CBDV, Nanoviridae, Babuvirus) genotypes in Northeast India. CBDV is an aphid-borne nanovirus within the Babuvirus genus (Mandal et al. 2004, 2013). Nanoviruses have multipartite genomes, consisting of up to 12 individually encapsidated, circular, single-stranded DNA (ssDNA) genome components (Mandal 2010), and are known to undergo reassortment (Timchenko et al. 2000; Hu et al. 2007; Fu et al. 2009; Hyder et al. 2011; Stainton et al. 2012; Grigoras et al. 2014; Savory and Ramakrishnan 2014). CBDV infects large cardamom, Amomum subulatum, a crop of considerable economic importance in sub-Himalayan regions of Northeast India, Nepal, and Bhutan. It is the causal agent of ‘foorkey’ disease, which is characterized by excessive sprouting of dwarf tillers, reduced yield and mortality, and severely constrains large cardamom production (Mandal et al. 2013). Importantly, CBDV isolates from Northeast India exhibit moderately low levels of genetic diversity (Savory and Ramakrishnan 2014). This suggests that reassortment could generate viable hybrid lineages, as levels of nucleotide sequence identity between potential parental viruses may be sufficiently high for DNA–protein and protein–protein interaction networks to be preserved. Indeed, reassortment appears to have played an important role in the evolutionary dynamics of CBDV in the past (Savory and Ramakrishnan 2014). Using a phylogenetic approach to assign viruses to specific genome constellations, we first assessed the potential for reassortment by investigating whether isolates with distinct genome constellations cocirculate in the same geographic localities. We then made predictions about where reassortment is most likely to occur based upon geographic differences in genome constellation diversity.
Methods and materials
Sample collection and study area
Leaf samples were collected in 2011 and 2012 from CBDV-infected plants at a range of locations and altitudes throughout the state of Sikkim and the Darjeeling district of West Bengal, Northeast India (Table S1). Samples were dried with silica gel and were maintained in Ziplock bags until DNA extraction. Large cardamom plants are cultivated across an elevation gradient ranging from approximately 500 to 2000 m above sea level. Due to the rugged topology of the landscape, there are limited continuous tracts in which climatic conditions are suitable for large cardamom cultivation, and large-scale tea plantations dominate the landscape in Darjeeling. These factors have resulted in a highly patchy distribution of large cardamom plantations. As it was not possible to sample with equal intensity across space, this is reflected in our sampling distribution.
DNA extraction and PCR amplification of CBDV genome components
Approximately 100 mg of leaf tissue from 163 infected plant samples was ground to a fine paste with a mortar and pestle using liquid nitrogen. Total genomic plant DNA and viral DNA were then simultaneously extracted using a Nucleospin Kit (Genetix Biotech Asia Pvt. Ltd., Bangalore, Karnataka, India). Full-length sequences corresponding to six discrete CBDV genome components (DNA-R, DNA-U3, DNA-S, DNA-M, DNA-C, and DNA-N; each approximately 1.1 kb in length) were directly obtained from these DNA extractions using previously described primers (Savory and Ramakrishnan 2014). We refer to the virus sample obtained from a single infected plant as an isolate. However, in the event that any of the plants were coinfected by multiple CBDV genotypes, the sequences obtained for the corresponding isolate would represent consensus sequences. PCRs were performed in 10 μL reaction volumes, containing 3.4 μL H2O, 5 μL Multiplex PCR Master Mix (Qiagen), 0.3 μL of each primer, and 1 μL DNA template. PCR protocols were identical for all six genome components and involved initial denaturation for 15 min at 95°C, 34 cycles of denaturation for 30 s at 94°C, annealing for 1 min at 55°C, and extension for 1 min at 72°C, and then a final extension period of 10 min at 72°C. Sequencing was performed using a 3130xl Genetic Analyzer from Applied Biosystems (Life Technologies, Carlsbad, CA, USA). Sequences were aligned and edited using MEGA version 5.05 (Tamura et al. 2011). GenBank accession numbers are as follows: KF710463 – KF710625 (DNA-R), KF710626 – KF710788 (DNA-U3), KF710789 – KF710951 (DNA-S), KF10952 – KF11114 (DNA-M), KF711115 – KF711277 (DNA-C), and KF11278 – KF711440 (DNA-N; Table S1).
Phylogenetic analyses
Bayesian phylogenies were reconstructed for each genome component using a Markov chain Monte Carlo (MCMC) method implemented in BEAST version 1.7.1 (Drummond et al. 2012). Prior to the analyses, a region of 46 bp in length was removed from the 3′ end of the DNA-R sequences of 12 isolates (GenBank accession numbers: KF10478, KF10542, KF10563, KF10577 – KF10583, KF10593, KF10605), and a region of approximately 120 bp in length was removed from the 3′ end of the DNA-U3 sequences of 16 isolates (GenBank accession numbers: KF10630, KF10632, KF10672, KF10673, KF10693, KF10698, KF10712, KF10714, KF10716, KF10718, KF10719, KF10722, KF10733, KF10760, KF10775, KF10781) because intercomponent recombination was considered to confound the phylogenetic inference. The recombinants were identified by applying recombination detection tests implemented in RDP4 (Martin et al. 2010) to an alignment of the sequences obtained for all six genome components (Savory and Ramakrishnan 2014). For all genome components, we applied a general time reversible (GTR) model of nucleotide substitution with gamma-distributed rate heterogeneity across sites and an uncorrelated, relaxed, lognormal molecular clock model (Drummond et al. 2006). The GTR model of nucleotide substitution was selected using jModelTest 2 (Darriba et al. 2012). Phylogenetic analyses were run twice for each component. For the DNA-R, DNA-S, and DNA-C datasets, each analysis was run for 20 000 000 MCMC generations, with trees and parameters being sampled every 1000 generations. For the DNA-U3, DNA-M, and DNA-N datasets, each analysis was run for 100 000 000, 40 000 000, and 50 000 000 MCMC generations, respectively, and trees and parameters were sampled every 5000, 2000, and 2500 generations, respectively. Convergence of the MCMC chains for each genome component was confirmed using TRACER version 1.5 (Rambaut and Drummond 2007). The posterior distribution of trees from the two independent runs for each genome component was combined using LogCombiner version 1.7.1 (Drummond et al. 2012) after removal of 20% burn-in. Maximum clade credibility (MCC) trees were obtained from the posterior distribution of trees for each genome component using TreeAnnotator version 1.7.1 (Drummond et al. 2012), and trees were visualized using FigTree version 1.3.1 (Rambaut 2009).
Phylogenetic assignment and nonmetric multidimensional scaling
Major and minor clades that were supported by Bayesian posterior probabilities of ≥0.8 were identified following visualization of the phylogenetic tree for each genome component. For all genome components, isolates were assigned to one of two major clades, and for three components (DNA-S, DNA-M, and DNA-N), isolates were further divided into minor clades (Figures S1–S6). Genetic distances within and between clades were calculated according to the mean number of pairwise nucleotide substitutions per site (Nei 1987) using DnaSP version 5 (Rozas and Rozas 1999; Table S2). Genome constellations were characterized for each isolate based upon the specific combination of clades to which they were inferred to belong (Fig. 1). Viruses which were assigned to the same genome constellations thus had shared evolutionary histories and high levels of sequence identity for all genome components (Figures S1–S6; Table S2). Nonmetric multidimensional scaling (NMDS) with Bray–Curtis dissimilarities was used to collapse the clade membership information to a two-dimensional dataset, and the genome constellation of each isolate was subsequently described using NMDS scores 1 and 2 (Fig. 1). This allowed us to obtain an index of dissimilarity for each pair of isolates based upon the distance between their genome constellations in two-dimensional ordination space. NMDS was implemented using the Vegan package (Oksanen et al. 2008) in R version 2.15 (R Development Core Team 2013) and was repeated iteratively to minimize the stress (a stress value of 0.062 was achieved), the degree of mismatch between the calculated distances between each pair of isolates and their pairwise distances in ordination space.
Figure 1.
Flow chart of methods.
Spatial variation in the potential for reassortment
A regular grid of points of 2 km increments was generated to cover the extent of the study area using Python version 2.7.3 (http://www.python.org; Fig. 1). Geographic distances from each grid point to each sampling location were calculated, and then, grid points which had <2 CBDV isolates collected within a 5 km radius were discarded from the analysis (Fig. 1). For each remaining grid point, the median pairwise ordination distance among all CBDV isolates collected within a 5 km radius was calculated to obtain a quantitative measure of genome constellation diversity for that location (Fig. 1). We considered 5 km to be an appropriate spatial scale at which to measure genome constellation diversity based on an assumption that vector-mediated and/or human-mediated dispersal of CBDV isolates could conceivably occur across this distance. Two approaches were used to account for unequal sampling intensity across the landscape (Fig. 1). Firstly, grid points were assigned a low confidence score if 2–4 samples had been collected within a 5 km radius and a high confidence score if five or more samples had been collected within a 5 km radius. Secondly, a resampling approach was implemented to recalculate the median pairwise ordination distances for all grid points for which more than five isolates had been collected within a 5 km radius. Median pairwise ordination distances were recalculated 500 times for each of these grid points by repeatedly resampling a subset of five isolates without replacement. The overall median of the median pairwise ordination distances for a given grid point was then used as the final measure of genome constellation diversity for that location. Spatial variation in genome constellation diversity was visually represented using QGIS version 2.2 (Quantum GIS Development Team 2014; http://www.qgis.org).
Analysis of population structure
Population structure was assessed using STRUCTURE version 2.3.3 (Pritchard et al. 2000), which implements a model-based Bayesian clustering algorithm to identify groups of genetically similar individuals and divergent populations. The analysis was performed on a single nucleotide polymorphism (SNP) dataset containing 967 polymorphic sites from across the six genome components that were observed to vary in at least two isolates (singleton SNPs were not considered). To estimate the number of CBDV subpopulations, K values (the number of populations) were allowed to vary from 1 to 5, and five independent runs for each K value were implemented. Initial runs with different burn-in and parameter estimation periods were compared to ensure the reliability of posterior probability estimates. The final run included an initial burn-in period of 100 000 iterations and a parameter estimation period of 100 000 iterations. The admixture model with correlated allele frequencies between populations was selected to account for individuals with mixed ancestry and to allow for similar allele frequencies between populations. This model is appropriate for viruses which exhibit high rates of genetic exchange and migration (Prasanna et al. 2010). The optimum number of subpopulations was determined using the ΔK method (Evanno et al. 2005), which was implemented in STRUCTURE HARVESTER (Earl and vonHoldt 2012). Q values (indicating the proportion of ancestry from each of K clusters) from the five independent runs for the optimal K value were compiled using CLUMPP (Jakobsson and Rosenberg 2007) and visualized using Distruct (Rosenberg 2004) and then were used to assign individuals to inferred subpopulations. Individuals were considered to be admixed if they could be assigned to two or more subpopulations with Q values of 0.15 or above. The clustering algorithm which is implemented in STRUCTURE assumes that polymorphic sites in the data being assessed do not display strong levels of linkage disequilibrium. Given that both recombination and reassortment occur in nanoviruses (Hu et al. 2007; Fu et al. 2009; Hyder et al. 2011; Stainton et al. 2012; Wang et al. 2013; Grigoras et al. 2014; Savory and Ramakrishnan 2014) and that these processes are likely to break down linkage between polymorphic sites on the same genome component and on different genome components respectively, we expected levels of linkage disequilibrium to be low.
To verify the population divisions that were inferred by STRUCTURE, an analysis of molecular variance (amova) was implemented in ARLEQUIN version 3.5 (Excoffier and Lischer 2010) after removal of admixed individuals. amova tests the partitioning of molecular variation among predefined subpopulations and yields fixation indices (FST values) which describe the extent of genetic differentiation for a given level of population subdivision. The significance of the FST value was assessed using a nonparametric permutation test with 1000 iterations (Weir and Cockerham 1984).
Results
Genome constellation diversity
Six discrete genome components were sequenced for 163 CBDV isolates collected throughout Sikkim and the Darjeeling district of West Bengal, Northeast India. These components included DNA-R, which encodes the CBDV master replication protein, DNA-U3, which has an unknown function, DNA-S, which encodes the CBDV coat protein, DNA-M, which encodes the CBDV movement protein, DNA-C, which encodes the CBDV cell cycle link protein, and DNA-N, which encodes the CBDV nuclear shuttle protein (GenBank accession numbers are listed in Table S1). Following phylogenetic assignment of each isolate to a specific combination of major and minor clades, 34 unique genome constellations were observed. The frequencies of these different constellations were highly skewed (Figure S7), suggesting that there may be fitness advantages associated with specific genome constellations that are perhaps environment-dependent. Importantly, the apparent stability of certain genome constellations implies that at least in some locations, a high proportion of viral isolates are likely to share the same constellation.
Geographic overlap of isolates with distinct genome constellations
To ascertain whether epidemiologically relevant reassortment events are likely to occur among CBDV isolates, we assessed whether isolates with distinct genome constellations cocirculate in the same geographic localities. Pairwise ordination distances (reflecting the level of dissimilarity between the genome constellations of each pair of isolates) were positively associated with pairwise geographic distances (R2 = 0.29, P < 0.001), demonstrating that genome constellations get increasingly dissimilar with greater geographic distances between isolates (Fig. 2). Nevertheless, isolates with distinct genome constellations were observed at relatively small spatial scales at which dispersal may readily occur. For instance, at the scale of a single plantation or adjacent small plantations (0–0.5 km), pairwise ordination distances ranged from 0 (identical genome constellations) to 1.25, and at slightly larger spatial scales (0–5.0 km), pairwise ordination distances ranged from 0 to 1.6. Notably, the maximum pairwise ordination distances observed at these scales exceed the median (0.99) and upper quartile (1.39) pairwise ordination distances, respectively, when pairwise comparisons among all 163 CBDV isolates were considered (Fig. 2). This suggests that a considerable proportion of the total genome constellation diversity that is observed across the entire sampling region can be present in the same geographic locality. Assuming that dispersal readily occurs across short distances (e.g. ≤5.0 km) and that disease incidence is sufficiently high for coinfections to occur, these results indicate that there is high potential for reassortment between CBDV isolates with distinct genome constellations.
Figure 2.

Pairwise ordination distances versus pairwise geographic distances. The red dashed line corresponds to the median pairwise ordination distance, and the blue dashed line corresponds to the upper quartile pairwise ordination distance when pairwise comparisons for all Cardamom bushy dwarf virus isolates were considered.
Identifying geographic hot spots of reassortment
To examine geographic differences in the potential for reassortment, we considered how genome constellation diversity changes across space, using the median pairwise ordination distance among all isolates collected within a 5 km radius as the measure of genome constellation diversity for a given location. Considerable variation in genome constellation diversity was observed across the sampling region (Fig. 3). This variation does not reflect geographic differences in sampling intensity because no clear relationship was observed between median pairwise ordination distances and the number of viral isolates considered (correlation coefficient < 0.1; Figure S8). Therefore, our results imply that the potential for reassortment is spatially dependent on CBDV.
Figure 3.

Spatial variation in genome constellation diversity in the North (N), East (E), South (S), and West (W) districts of Sikkim and the Darjeeling district of West Bengal (W.B). Genome constellation diversity was calculated using the median pairwise ordination distance for all isolates collected within a 5 km radius of each grid point. Shaded areas represent locations where grid points were assigned a low confidence score (<5 samples).
The majority of locations, including those which had been assigned a high confidence score (five or more samples collected within a 5 km radius), had relatively low levels of genome constellation diversity (Fig. 3). Although distinct genome constellations could be introduced into these locations by migration, the current lack of diversity suggests that novel reassortant lineages are unlikely to emerge in these areas. Locations which harbor isolates with a relatively high diversity of genome constellations were observed in specific areas of West Sikkim and East Sikkim (Fig. 3). These locations had been assigned a high confidence score and had median pairwise ordination distances ranging between 0.6 and 1.0. The high levels of genome constellation diversity observed among isolates in these areas are broadly comparable to the overall level of genome constellation diversity observed when pairwise comparisons among all 163 CBDV isolates were considered (median pairwise ordination distances for all isolates = 0.99). The latter represents a level of genome constellation dissimilarity that could be expected between two isolates that are selected at random from any locations across the sampling region. Therefore, the high levels of diversity observed in specific areas of West Sikkim and East Sikkim are analogous to those which may occur under a scenario in which CBDV isolates are not spatially structured. As there is a high chance that two isolates which coinfect the same host plant will have distinct genome constellations under such a scenario, we designated the locations as possible geographic hot spots of reassortment.
We expected that the areas which exhibit the highest levels of genome constellation diversity should be situated in locations where geographically defined subpopulations overlap. An analysis of population structure which was implemented using STRUCTURE version 2.3.3 (Pritchard et al. 2000) supported the existence of two geographically stratified subpopulations of CBDV isolates (Fig. 4A,B). Population subdivision was verified by an amova test which yielded a highly significant FST statistic (FST = 0.31, P < 0.001). While 116 of the isolates could be assigned to one subpopulation, 39 were assigned to the other. The remaining eight isolates were inferred to be admixed, and, of these, six had been collected in the vicinity of the predicted hot spots of reassortment in East Sikkim (four isolates) and West Sikkim (two isolates). The distributions of the two CBDV subpopulations supported our expectation that spatial overlap underlies the high genome constellation diversity observed in the East Sikkim reassortment hot spot (Fig. 4B). However, the high genome constellation diversity observed in the West Sikkim reassortment hot spot appears to have arisen due to past migration and subsequent admixture (Fig. 4B).
Figure 4.

(A) Clustering of isolates based on a Bayesian analysis of population structure implemented in STRUCTURE version 2.3.3 (optimal number of populations: K = 2). Vertical bars correspond to individual Cardamom bushy dwarf virus isolates. Isolates were grouped according to their districts of origin. The two inferred populations are indicated by different colors, and when both colors are present within an individual bar, this indicates admixture. (B) Population membership and admixture of Cardamom bushy dwarf virus isolates collected in the North (N), East (E), South (S), and West (W) districts of Sikkim and the Darjeeling district of West Bengal (W.B). The geographic coordinates of isolates in the area of the predicted reassortment hot spot in West Sikkim have been jittered to facilitate visualization of admixed isolates and isolates which were assigned to different populations.
Discussion
Targeted management strategies which reduce the potential for reassortment in specific locations by controlling disease incidence could facilitate the prevention of outbreaks involving novel reassortant strains. However, few studies have considered how the potential for future reassortment events can vary across space (Pearce et al. 2009; Fuller et al. 2013). We have demonstrated that the potential for reassortment can be spatially dependent in multipartite plant viruses and have introduced a framework which can be used to make predictions about where novel reassortant lineages are most likely to emerge. Additional information could be incorporated to refine these predictions if available. In particular, data regarding disease incidence, viral load, and/or the duration of infectivity could be highly informative, as these factors influence the frequencies at which genetically distinct strains coreplicate within the same host cells (Martin et al. 2011a).
The majority of locations throughout the sampling region appear to contain CBDV isolates with a relatively low level of genome constellation diversity, indicating that a high proportion of isolates in these locations share the same genome constellation. Although we did not account for reassortment among closely related isolates which were assigned to the same clades for all genome components, genetic exchange between such viruses may be irrelevant in an evolutionary and epidemiological context because the phenotypes of hybrid progeny are likely to be the same as, or highly similar to, those of the parental genotypes. Our analysis highlighted two key areas which harbor CBDV isolates with a relatively high diversity of genome constellations. As these represent locations where novel reassortant lineages may emerge if coinfections occur, the information which we have generated in this study could yield guidelines for policymakers and landowners involved in the formulation of large cardamom disease management strategies.
Our assessment of spatial population genetic structure indicated that the high levels of genome constellation diversity can be attributed to the overlap of isolates from distinct subpopulations (East Sikkim) and to one or more past migration events from one subpopulation to another (West Sikkim). CBDV is naturally transmitted by the aphids Pentalonia nigronervosa (Varma and Capoor 1964) and Micromyzus kalimpongensis (Basu and Ganguly 1968). Topological features of the landscape and/or environmental variables associated with elevation may indirectly influence patterns of CBDV movement and lead to the observed patterns of population structure by affecting aphid dispersal. However, it has been suggested that human transport of infected tillers is the primary mechanism underlying CBDV movement over long distances (Nair 2011). Therefore, it is perhaps more likely that socioeconomic factors and local transport networks underlie the observed patterns of spatial population genetic structure. Given that the reassortment hot spots that were predicted by our analysis could also be detected as high-risk areas using well-established population genetic analyses, such as implemented in STRUCTURE, these approaches could feasibly be used to identify areas where control strategies should be implemented. However, such analyses can be problematic as they do not always yield consistent results across multiple runs and are sensitive to variation in sample size (Kalinowksi 2011). The approach that we have developed does not rely on accurate inference of populations or subpopulations and explicitly quantifies a measure of genome constellation diversity for each location while accounting for differences in sampling intensity across space.
As hybrid viability will largely be determined by the genomic locations and types of nucleotide differences that exist between parental viruses, one cannot make accurate predictions about the fitness of reassortant lineages simply based upon overall levels of parental sequence divergence. Indeed, in laboratory-constructed Maize streak virus (MSV, Geminiviridae, Mastrevirus) recombinants, the level of parental sequence dissimilarity that can be tolerated varies according to the number of intragenome interactions in which the exchanged genomic regions are known to be involved (Martin et al. 2005). Patterns of reassortment are nonrandom in CBDV (Savory and Ramakrishnan 2014) and other nanoviruses (Stainton et al. 2012; Grigoras et al. 2014), suggesting that different genome components vary in their abilities to function efficiently when introduced into foreign genomic backgrounds. Infectious cloned genomes have recently been produced for several nanovirus species, and these may allow the fitness consequences of exchanging specific genome components to be experimentally assessed (Grigoras et al. 2014). Such information could be used to refine spatial predictions of reassortment hot spots, for instance by incorporating a weighting parameter within the framework which penalizes parental sequence divergence above a threshold level that can be specified independently for each genome component.
Focusing surveillance and control efforts in locations where cocirculating viruses exhibit a variety of genome constellations may be effective even in the absence of detailed information regarding possible fitness costs associated with reassortment. This is because reassortment between parental viruses with highly distinct genome constellations would enable expansive regions of sequence space to be explored. For a multipartite virus with six discrete genome components, reassortment between two parents which differ in two components could yield only four (22) possible genome constellations, but reassortment between parents which differ in all six components could yield 64 (26) possible genome constellations (including the parental genomes and chimeric genomes containing components derived from each parent). Although many of the reassortant lineages may be selected against, some may be viable and some may even confer a selective advantage over other genotypes in the population.
The framework that we developed in this study is based upon current levels of genome constellation diversity. Several studies have demonstrated that genetic diversity can remain stable over time in plant virus populations (García-Arenal et al. 2001). However, genetic diversity can be highly dynamic in some virus populations (Ghedin et al. 2005; Rambaut et al. 2008). Transient events, such as selective sweeps or population bottlenecks, can temporarily reduce genetic diversity (Li and Roossinck 2004; Rambaut et al. 2008), and migration can introduce variation into a subpopulation that was previously genetically homogeneous. Nevertheless, we believe that the approach we have introduced could have applications in plant disease management as it may allow surveillance and control efforts to be implemented effectively for a variety of multipartite plant viruses in strategic locations, such as areas where subpopulations overlap. These may represent high-risk locations, where reassortment could facilitate evolutionary innovation and lead to the emergence of novel lineages with the capacity to evade host immunity, expand host ranges, or overcome treatment interventions.
Acknowledgments
We thank the Spice Board (Government of India) for allowing us to access their large cardamom research stations in Pangthang (East Sikkim) and Kabi (North Sikkim), and Dr. Suhel Quader for helpful comments regarding methodology. This work was funded by a Department of Biotechnology (DBT, Government of India) grant entitled ‘Technological Innovations and Ecological Research for the Sustainable Use of Bioresources in Sikkim’.
Data archiving statement
Sequences that were used in this study are available from GenBank, and accession numbers are provided in the supplementary materials.
Supporting Information
Additional Supporting Information may be found in the online version of this article:
Figure S1. DNA-R phylogeny.
Figure S2. DNA-U3 phylogeny.
Figure S3. DNA-S phylogeny.
Figure S4. DNA-M phylogeny.
Figure S5. DNA-C phylogeny.
Figure S6. DNA-N phylogeny.
Figure S7. Frequency distribution of unique genome constellations.
Figure S8. Median pairwise ordination distance (genome constellation diversity) versus number of samples collected within a 5 km radius.
Table S1. List of CBDV isolates with corresponding districts, geographic coordinates, elevations, and GenBank accession numbers for each genome component.
Table S2. Genetic distances within and between clades for each genome component.
Literature cited
- Acosta-Leal R, Duffy S, Xiong Z, Hammond RW, Elena SF. Advances in plant virus evolution: translating evolutionary insights into better disease management. Phytopathology. 2011;101:1136–1148. doi: 10.1094/PHYTO-01-11-0017. [DOI] [PubMed] [Google Scholar]
- Barton HD, Rohani P, Stallknecht DE, Brown B, Drake JM. Subtype diversity and reassortment potential for co-circulating avian influenza viruses at a diversity hot spot. Journal of Animal Ecology. 2013 doi: 10.1111/1365-2656.12167. doi: 10.1111/1365-2656.12167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Basu AN, Ganguly B. A note on the transmission of foorkey disease of large cardamom by the aphid, Micromyzus kalimpongensis Basu. Indian Phytopathology. 1968;21:127. [Google Scholar]
- Burdon JJ, Thrall PH. Pathogen evolution across the agro-ecological interface: implications for disease management. Evolutionary Applications. 2008;1:57–65. doi: 10.1111/j.1752-4571.2007.00005.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chakraborty S, Newton AC. Climate change, plant diseases and food security: an overview. Plant Pathology. 2011;60:2–14. [Google Scholar]
- Chakraborty S, Vanitharani R, Chattopadhyay B, Fauquet CM. Supervirulent pseudorecombination and asymmetric synergism between genomic components of two distinct species of begomovirus associated with severe tomato leaf curl disease in India. Journal of General Virology. 2008;89:818–828. doi: 10.1099/vir.0.82873-0. [DOI] [PubMed] [Google Scholar]
- Chen LF, Rojas M, Kon T, Gamby K, Xoconostle-Cazares B, Gilbertson RL. A severe symptom phenotype in tomato in Mali is caused by a reassortant between a novel recombinant begomovirus (Tomato yellow leaf curl Mali virus) and a betasatellite. Molecular Plant Pathology. 2009;10:415–430. doi: 10.1111/j.1364-3703.2009.00541.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nature Methods. 2012;9:772. doi: 10.1038/nmeth.2109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drummond AJ, Ho SYW, Phillips MJ, Rambaut A. Relaxed phylogenetics and dating with confidence. PLoS Biology. 2006;4:e88. doi: 10.1371/journal.pbio.0040088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Molecular Biology and Evolution. 2012;29:1969–1973. doi: 10.1093/molbev/mss075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dugan VG, Chen R, Spiro DJ, Sengamalay N, Zaborsky J, Ghedin E, Nolting J, et al. The evolutionary genetics and emergence of avian influenza viruses in wild birds. PLoS Pathogens. 2008;4:e1000076. doi: 10.1371/journal.ppat.1000076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Earl DA, vonHoldt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conservation Genetics Resources. 2012;4:359–361. [Google Scholar]
- Escriu F, Fraile A, García-Arenal F. Constraints to genetic exchange support gene coadaptation in a tripartite RNA virus. PLoS Pathogens. 2007;3:e8. doi: 10.1371/journal.ppat.0030008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular Ecology. 2005;14:2611–2620. doi: 10.1111/j.1365-294X.2005.02553.x. [DOI] [PubMed] [Google Scholar]
- Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Molecular Ecology Resources. 2010;10:564–567. doi: 10.1111/j.1755-0998.2010.02847.x. [DOI] [PubMed] [Google Scholar]
- Fu HC, Hu JM, Hung TH, Su HJ, Yeh HH. Unusual events in Banana bunchy top virus strain evolution. Phytopathology. 2009;99:812–822. doi: 10.1094/PHYTO-99-7-0812. [DOI] [PubMed] [Google Scholar]
- Fuller TL, Gilbert M, Martin V, Cappelle J, Hosseini P, Njabo KJ, Aziz SA, et al. Predicting hotspots for influenza virus reassortment. Emerging Infectious Diseases. 2013;19:55–63. doi: 10.3201/eid1904.120903. [DOI] [PMC free article] [PubMed] [Google Scholar]
- García-Arenal F, Fraile A, Malpica MJM. Variability and genetic structure of plant virus populations. Annual Review of Phytopathology. 2001;39:157–186. doi: 10.1146/annurev.phyto.39.1.157. [DOI] [PubMed] [Google Scholar]
- Ghedin E, Sengamalay NA, Shumway M, Zaborsky J, Feldblyum T, Subbu V, Spiro DJ, et al. Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution. Nature. 2005;437:1162–1166. doi: 10.1038/nature04239. [DOI] [PubMed] [Google Scholar]
- Grigoras I, Del Cueto Ginzo AI, Martin DP, Varsani A, Romero J, Mammadov AC, Huseynova IM, et al. Genome diversity and evidence of recombination and reassortment in Nanoviruses from Europe. Journal of General Virology. 2014;95:1178–1191. doi: 10.1099/vir.0.063115-0. [DOI] [PubMed] [Google Scholar]
- Gu H, Zhang C, Ghabrial SA. Novel naturally occurring Bean pod mottle virus reassortants with mixed heterologous RNA1 genomes. Phytopathology. 2007;97:79–86. doi: 10.1094/PHYTO-97-0079. [DOI] [PubMed] [Google Scholar]
- Hou YM, Gilbertson RL. Increased pathogenicity in a pseudorecombinant bipartite geminivirus correlates with intermolecular recombination. Journal of Virology. 1996;70:5430–5436. doi: 10.1128/jvi.70.8.5430-5436.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu JM, Fu HC, Lin CH, Su HJ, Yeh HH. Reassortment and concerted evolution in banana bunchy top virus genomes. Journal of Virology. 2007;81:1746–1761. doi: 10.1128/JVI.01390-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hyder MZ, Shah SH, Hameed S, Naqvi SM. Evidence of recombination in the banana bunchy top virus genome. Infection, Genetics and Evolution. 2011;11:1293–1300. doi: 10.1016/j.meegid.2011.04.015. [DOI] [PubMed] [Google Scholar]
- Idris AM, Mills-Lujan K, Martin K, Brown JK. Melon chlorotic leaf curl virus: characterisation and differential reassortment with closest relatives reveal adaptive virulence in the Squash leaf curl virus clade and host shifting by the host-restricted Bean calico mosaic virus. Journal of Virology. 2008;82:1959–1967. doi: 10.1128/JVI.01992-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ince WL, Gueye-Mbaye A, Bennink JR, Yewdell JW. Reassortment complements spontaneous mutation in influenza A virus NP and M1 genes to accelerate adaptation to a new host. Journal of Virology. 2013;87:4330–4338. doi: 10.1128/JVI.02749-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jakobsson M, Rosenberg NA. CLUMP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23:1801–1806. doi: 10.1093/bioinformatics/btm233. [DOI] [PubMed] [Google Scholar]
- Kalinowksi ST. The computer program STRUCTURE does not reliably identify the main genetic clusters within species: simulations and implications for human population structure. Heredity. 2011;106:625–632. doi: 10.1038/hdy.2010.95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karan M, Harding RM, Dale JL. Evidence for two groups of banana bunchy top virus isolates. Journal of General Virology. 1994;75:3541–3546. doi: 10.1099/0022-1317-75-12-3541. [DOI] [PubMed] [Google Scholar]
- Koehler AV, Pearce JM, Flint PL, Franson JC, Ip HS. Genetic evidence of intercontinental movement of avian influenza in a migratory bird: the northern pintail (Anas acuta. Molecular Ecology. 2008;21:4754–4762. doi: 10.1111/j.1365-294X.2008.03953.x. [DOI] [PubMed] [Google Scholar]
- Lefeuvre P, Lett J-M, Reynaud B, Martin DP. Avoidance of protein fold disruption in natural virus recombinants. PLoS Pathogens. 2007;3:e181. doi: 10.1371/journal.ppat.0030181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H, Roossinck MJ. Genetic bottlenecks reduce population variation in an experimental RNA virus population. Journal of Virology. 2004;78:10582–10587. doi: 10.1128/JVI.78.19.10582-10587.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mandal B. Advances in small isometric multicomponent ssDNA viruses infecting plants. Indian Journal of Virology. 2010;21:18–30. doi: 10.1007/s13337-010-0010-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mandal B, Mandal S, Pun KB, Varma A. First report of the association of a nanovirus with foorkey disease of large cardamom in India. Plant Disease. 2004;88:428. doi: 10.1094/PDIS.2004.88.4.428A. [DOI] [PubMed] [Google Scholar]
- Mandal B, Shilpi S, Barman AR, Mandal S, Varma A. Nine novel DNA components associated with the foorkey disease of large cardamom: evidence of a distinct babuvirus species in Nanoviridae. Virus Research. 2013;178:297–305. doi: 10.1016/j.virusres.2013.09.027. [DOI] [PubMed] [Google Scholar]
- Martin DP, van der Walt E, Posada D, Rybicki EP. The evolutionary value of recombination is constrained by genome modularity. PLoS Genetics. 2005;1:e51. doi: 10.1371/journal.pgen.0010051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin DP, Lemey P, Lott M, Moulton V, Posada D, Lefeuvre P. RDP3: a flexible and fast computer program for analyzing recombination. Bioinformatics. 2010;26:2462–2463. doi: 10.1093/bioinformatics/btq467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin DP, Biagini P, Lefeuvre P, Golden M, Roumagnac P, Varsani A. Recombination in eukaryotic single stranded DNA viruses. Viruses. 2011a;3:1699–1738. doi: 10.3390/v3091699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin DP, Lefeuvre P, Varsani A, Hoareau M, Semegni J-V, Dijoux B, Vincent C, et al. Complex recombination patterns arising during geminivirus coinfections preserve and demarcate biologically important intragenome interaction networks. PLoS Pathogens. 2011b;7:e1002203. doi: 10.1371/journal.ppat.1002203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nair KPP. Agronomy and Economy of Black Pepper and Cardamom: The King and Queen of Spices. New York, NY: Elsevier; 2011. [Google Scholar]
- Nei M. Molecular Evolutionary Genetics. New York: Columbia University Press; 1987. [Google Scholar]
- Nelson MI, Viboud C, Simonsen L, Bennett RT, Griesemer SB, St George K, Taylor J, et al. Multiple reassortment events in the evolutionary history of H1N1 influenza A virus since 1918. PLoS Pathogens. 2008;4:e1000012. doi: 10.1371/journal.ppat.1000012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O'Keefe KJ, Silander OK, McCreery H, Weinreich DM, Wright KM, Chao L, Edwards SV, et al. Geographic differences in sexual reassortment in RNA phage. Evolution. 2010;64:3010–3023. doi: 10.1111/j.1558-5646.2010.01040.x. [DOI] [PubMed] [Google Scholar]
- Oksanen J, Kindt R, Legendre P, O'Hara B, Simpson GL, Sólymos P, Stevens MHH, et al. Vegan: Community Ecology Package. 2008. http://cran.r-project.org/web/packages/vegan/ (accessed on 29 January 2013) [Google Scholar]
- Pearce JM, Ramey AM, Flint PL, Koehler AV, Fleskes JP, Franson JC, Hall JS, et al. Avian influenza at both ends of a migratory flyway: characterising viral genomic diversity to optimise surveillance plans for North America. Evolutionary Applications. 2009;2:457–468. doi: 10.1111/j.1752-4571.2009.00071.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pita JS, Fondong VN, Sangare A, Otim-Nape GW, Ogwal S, Fauquet CM. Recombination, pseudorecombination and synergism of geminiviruses are determinant keys to the epidemic of severe cassava mosaic disease in Uganda. Journal of General Virology. 2001;82:655–665. doi: 10.1099/0022-1317-82-3-655. [DOI] [PubMed] [Google Scholar]
- Plantegenest M, Le May C, Fabre F. Landscape epidemiology of plant diseases. Journal of the Royal Society Interface. 2007;4:963–972. doi: 10.1098/rsif.2007.1114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prasanna HC, Sinha DP, Verma A, Singh M, Singh B, Rai M, Martin DP. The population genomics of begomoviruses: global scale population structure and gene flow. Virology Journal. 2010;7:220. doi: 10.1186/1743-422X-7-220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotypic data. Genetics. 2000;155:945–959. doi: 10.1093/genetics/155.2.945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quantum GIS Development Team. 2014. Quantum GIS geographic information system. Open source geospatial foundation project http://www.qgis.org (accessed on 25 February 2014)
- R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2013. http://www.R-project.org/ (accessed on 30 April 2012) [Google Scholar]
- Rambaut A. 2009. FigTree v. 1.3.1. Computer program and documentation distributed by the author at http://tree.bio.ed.ac.uk/software/ (accessed on 7 September 2012)
- Rambaut A, Drummond AJ. 2007. Tracer v.1.5. Computer program and documentation distributed by the authors at http://beast.bio.ed.ac.uk/Tracer (accessed on 7 September 2012)
- Rambaut A, Pybus OG, Nelson MI, Viboud C, Taubenberger JK, Holmes EC. The genomic and epidemiological dynamics of human influenza A virus. Nature. 2008;453:615–619. doi: 10.1038/nature06945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramey AM, Pearce JM, Ely CE, Guy LM, Irons DB, Derksen DV, Ip HS. Transmission and reassortment of avian influenza viruses at the Asian-North American interface. Virology. 2010;406:352–359. doi: 10.1016/j.virol.2010.07.031. [DOI] [PubMed] [Google Scholar]
- Rokyta DR, Wichman HA. Genic incompatibilities in two hybrid bacteriophages. Molecular Biology and Evolution. 2009;26:2831–2839. doi: 10.1093/molbev/msp199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosenberg NA. Distruct: a program for the graphical display of population structure. Molecular Ecology Notes. 2004;4:137–138. [Google Scholar]
- Rozas J, Rozas R. DnaSP version 3: an integrated program for molecular population genetics and molecular evolution analysis. Bioinformatics. 1999;15:174–175. doi: 10.1093/bioinformatics/15.2.174. [DOI] [PubMed] [Google Scholar]
- Savory FR, Ramakrishnan U. Asymmetric patterns of reassortment and concerted evolution in Cardamom bushy dwarf virus. Infection, Genetics and Evolution. 2014;24:15–24. doi: 10.1016/j.meegid.2014.02.012. [DOI] [PubMed] [Google Scholar]
- Stainton D, Kraberger S, Walters M, Wiltshire EJ, Rosario K, Halafihi M, Lolohea S, et al. Evidence of inter-component recombination, intra-component recombination and reassortment in Banana bunchy top virus. Journal of General Virology. 2012;93:1103–1119. doi: 10.1099/vir.0.040337-0. [DOI] [PubMed] [Google Scholar]
- Strange RN, Scott PR. Plant disease: a threat to global food security. Annual Review of Phytopathology. 2005;43:83–116. doi: 10.1146/annurev.phyto.43.113004.133839. [DOI] [PubMed] [Google Scholar]
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance and maximum parsimony methods. Molecular Biology and Evolution. 2011;28:2731–2739. doi: 10.1093/molbev/msr121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thrall PH, Oakeshott JG, Fitt G, Southerton S, Burdon JJ, Sheppard A, Russell RJ, et al. Evolution in agriculture: the application of evolutionary approaches to the management of biotic interactions in agro-ecosystems. Evolutionary Applications. 2011;4:200–215. doi: 10.1111/j.1752-4571.2010.00179.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Timchenko T, Katul L, Sano Y, de Kouchkovsky F, Vetten HJ, Gronenborn B. The master rep concept in nanovirus replication: identification of missing genome components and potential for natural genetic reassortment. Virology. 2000;274:189–195. doi: 10.1006/viro.2000.0439. [DOI] [PubMed] [Google Scholar]
- Tsompana M, Abad J, Purugganan M, Moyer JW. The molecular population genetics of the tomato spotted wilt virus (TSWV) genome. Molecular Ecology. 2005;14:53–66. doi: 10.1111/j.1365-294X.2004.02392.x. [DOI] [PubMed] [Google Scholar]
- Varma PM, Capoor SP. ‘Foorkey’ disease of large cardamom. Indian Journal of Agricultural Science. 1964;34:56–62. [Google Scholar]
- Wang H-I, Chang C-H, Lin P-H, Fu H-C, Tang C, Yeh H-H. Application of motif-based tools on evolutionary analysis of multipartite single-stranded DNA viruses. PLoS ONE. 2013;8:e71565. doi: 10.1371/journal.pone.0071565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38:1358–1370. doi: 10.1111/j.1558-5646.1984.tb05657.x. [DOI] [PubMed] [Google Scholar]
- Wille M, Robertson GJ, Whitney H, Bishop MA, Runstadler JA, Lang AS. Extensive geographic mosaicism in avian influenza viruses from gulls in the northern hemisphere. PLoS ONE. 2011;6:e20664. doi: 10.1371/journal.pone.0020664. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.

