Abstract
Yams (Dioscorea spp.) host a diverse range of badnaviruses (genus Badnavirus, family Caulimoviridae). The first complete genome sequence of Dioscorea bacilliform RT virus 3 (DBRTV3), which belongs to the monophyletic species group K5, is described. This virus is most closely related to Dioscorea bacilliform SN virus (DBSNV, group K4) based on a comparison of genome sequences. Recombination analysis identified a unique recombination event in DBRTV3, with DBSNV likely to be the major parent and Dioscorea bacilliform AL virus (DBALV) the minor parent, providing the first evidence for recombination in yam badnaviruses. This has important implications for yam breeding programmes globally.
Electronic supplementary material
The online version of this article (doi:10.1007/s00705-017-3605-9) contains supplementary material, which is available to authorized users.
Keywords: Yam, Dioscorea spp., Dioscorea rotundata, Episomal badnavirus, Endogenous Dioscorea bacilliform virus, Complete genome, Recombination, Phylogeny, West Africa
Yams (Dioscorea spp.) are an important staple food worldwide and play a major role in food security and income generation, particularly in West Africa [1]. Cultivated yam plants are propagated vegetatively through their tubers, resulting in the accumulation of viruses in yam germplasm and leading to an urgent need for a sustainable supply of virus-free planting material. The combination of vegetative propagation and the lack of strategic control measures promotes the spread of viruses and the occurrence of multiple infections. As a result, the association of symptoms with individual viruses, and the yield impact attributable to each, have not yet been determined accurately.
Dioscorea bacilliform viruses (DBVs) (family Caulimoviridae, genus Badnavirus) are a concern for the safe movement of yam germplasm because of their high prevalence [2, 3]. Yam plants are hosts to a diverse range of badnaviruses, with recent findings suggesting the frequent occurrence of mixed infections in West African yam germplasm [4] as well as the presence of endogenous Dioscorea bacilliform viruses (eDBVs) as integrated forms of these viruses in D. cayenensis-rotundata genomes [5–7]. To date, seven distinct DBV genomes have been completely sequenced: Dioscorea bacilliform AL virus (DBALV), Dioscorea bacilliform AL virus 2 (DBALV2), Dioscorea bacilliform ES virus (DBESV), Dioscorea bacilliform RT virus 1 (DBRTV1), Dioscorea bacilliform RT virus 2 (DBRTV2), Dioscorea bacilliform TR virus (DBTRV), and Dioscorea bacilliform SN virus (DBSNV) [4, 8–11]. Several hundred partial badnavirus nucleotide sequences have also been generated by PCR using the badnavirus-specific primer pair Badna-FP/-RP [12], which amplifies a 579-bp fragment of the reverse transcriptase (RT)-ribonuclease H (RNaseH) domain used for taxonomic assessment of badnaviruses [13]. Phylogenetic analysis of these sequences led to the proposal of 15 badnavirus species whose members are associated with Dioscorea spp. According to the International Committee on Taxonomy of Viruses (ICTV), the demarcation criterion for species within the genus Badnavirus is sequence divergence of > 20% in a partial RT-RNaseH coding region [2–4, 6, 13].
A routine screening for episomal DBV infections of yam leaves showing viral symptoms was carried out by rolling circle amplification (RCA) in D. rotundata accessions maintained in the yam plant collection at the Natural Resources Institute (NRI, Chatham, UK), grown under the conditions described by Mumford and Seal [14]. For this, total nucleic acids were extracted from fresh yam leaf tissue using a modified CTAB method, as described by Kenyon et al. [2], and analysed by RCA following conditions described previously [4]. The yam breeding line TDr 89/02475 showed viral symptoms (mottling and chlorotic spots) associated with DBV and Yam mosaic virus (YMV) infections (Fig. 1A). This line was previously found to be infected with DBRTV1 [4]. Restriction digestion of the RCA product of TDr 89/02475 using the endonuclease BamHI (NEB, UK) yielded fragments of the sizes expected for DBRTV1, 6.4 and 1.2 kbp (data not shown). To confirm the DBRTV1 infection in TDr 89/02475, we sequenced the partial RT-RNaseH domain used for classification of members of the genus Badnavirus. This was done by excision and purification of the RCA fragments, followed by badnavirus-specific PCR using the Badna-FP/-RP primers and the RCA fragments as templates [4, 13]. The expected amplification product of 579 bp was obtained in a PCR using the 6.4-kbp fragment as template. Direct sequencing of the purified PCR product resulted in a mixture of sequences, and hence the PCR products were cloned into pGEM-T Easy Vector (Promega, UK). Five transformants were selected at random and sequenced. Three clones confirmed the expected DBRTV1 infection. However, the remaining two clones (A1-2 and A1-4) contained sequences that were 99% identical to NGl3841Dc (GenBank accession number KX008585), which was identified by RCA in our previous study [4]. The sequence NGl3841Dc was found to belong to the yam badnavirus monophyletic species group K5 defined by Kenyon et al. [2].
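The expected fragment sizes for a restriction digest of a circular RCA product can be predicted in silico. The following is a minimal sketch only: the function name `circular_digest` and the toy sequence are hypothetical illustrations, not part of the workflow described above, and the real DBRTV1 genome would be needed to reproduce the 6.4- and 1.2-kbp fragments. The only sequence fact assumed is the BamHI recognition site GGATCC.

```python
from typing import List

BAMHI_SITE = "GGATCC"  # BamHI recognition sequence


def circular_digest(genome: str, site: str = BAMHI_SITE) -> List[int]:
    """Return fragment lengths (largest first) after cutting a circular
    sequence at every occurrence of the recognition site."""
    n = len(genome)
    # Append the first few bases so that a site spanning the origin is found.
    doubled = genome + genome[: len(site) - 1]
    cuts = [i for i in range(n) if doubled.startswith(site, i)]
    if not cuts:
        return [n]  # no site: the circle stays intact
    fragments = []
    for k, cut in enumerate(cuts):
        next_cut = cuts[(k + 1) % len(cuts)]
        # Distance to the next site, wrapping around the circle;
        # a single site linearises the circle into one full-length fragment.
        fragments.append((next_cut - cut) % n or n)
    return sorted(fragments, reverse=True)


# Hypothetical 26-nt circle with two BamHI sites (illustration only).
toy_genome = "GGATCC" + "A" * 10 + "GGATCC" + "T" * 4
print(circular_digest(toy_genome))  # [16, 10]
```

Because every site is cut at the same offset within the recognition sequence, the fragment lengths do not depend on where exactly inside GGATCC the enzyme cleaves, so the sketch ignores the cut position.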
Sequencing of the complete episomal genome of this K5 yam badnavirus was undertaken. Outward-facing primers (DBRTV3-F/DBRTV3-R; see Fig. 1B and Table S1) were designed based on the partial RT-RNaseH sequences, and genomic TDr 89/02475 DNA was used as template for long PCR. The 50-µl PCR reaction mixture contained 1 µl of DNA template (~ 250 ng), 0.5 µM each primer, 0.25 mM each dNTP, 2.5 U of DreamTaq DNA polymerase and 1× DreamTaq Green buffer (Thermo Scientific, UK) containing 2 mM MgCl2. The cycle conditions for the long-PCR amplification were 95 °C for 5 min, followed by 30 cycles of 94 °C for 20 s, 58 °C for 30 s and 72 °C for 7 min, and a final extension at 72 °C for 7 min. These conditions generated a single PCR product of the expected size of 7–8 kbp (data not shown), which was subsequently cloned using a TOPO® XL Cloning Kit (Invitrogen, UK). The recombinant clone A9-6 was selected and fully sequenced twice using specific sequencing primers designed for genome walking (Table S1). A 7097-bp sequence was assembled using Geneious R10 (Biomatters, New Zealand). This sequence overlapped (115 bp at the 5′ end, 55 bp at the 3′ end) with the partial RT-RNaseH sequence present in clones A1-2 and A1-4. Combining these sequences resulted in a 7506-bp sequence (GenBank accession number MF476845) representing a consensus sequence of the entire viral genome of a new yam badnavirus member belonging to DBV species group K5.
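The reported genome length can be checked with simple arithmetic. The sketch below assumes that the partial RT-RNaseH sequence in clones A1-2 and A1-4 corresponds to the full 579-bp Badna-FP/-RP amplicon; under that assumption, the two sequences close the circle once the overlaps at both ends are counted only once. The function name is hypothetical.

```python
def circular_genome_length(long_pcr_bp: int, amplicon_bp: int,
                           overlap_5prime_bp: int, overlap_3prime_bp: int) -> int:
    """Length of a circular genome assembled from two fragments that
    overlap each other at both ends (each overlap is counted once)."""
    return long_pcr_bp + amplicon_bp - overlap_5prime_bp - overlap_3prime_bp


# Values taken from the text; 579 bp for the RT-RNaseH clone is an assumption.
genome_bp = circular_genome_length(7097, 579, 115, 55)
print(genome_bp)  # 7506
```

The result agrees with the 7506-bp consensus sequence deposited as MF476845.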
The consensus genome sequence (Fig. 1B) displayed all of the hallmarks of a typical badnavirus [13], and we propose the name “Dioscorea bacilliform RT virus 3” (DBRTV3) for this virus. DBRTV3 (7506 bp long) has a GC content of 43.3% and contains the expected putative host cytoplasmic initiator methionine tRNA (tRNAMet)-binding site (5′-TGGTATCAGAGCTTGGTT-3′), located within the intergenic region (IGR) at positions 1–18 and designating the beginning of the viral genome [15]. A potential TATA-box and a putative poly(A) tail were found within the IGR of DBRTV3 (Fig. 1B). Sequence analysis revealed three ORFs. The stop codon of ORF1 overlaps the start codon of ORF2, and the stop codon of ORF2 overlaps the start codon of ORF3, in each case within an ATGA motif that places the downstream ORF in a -1 translational frame relative to the preceding ORF. No internal AUG codons were identified in ORF1 or 2, which agrees with the leaky scanning model of translation typical of members of the genus Badnavirus [13].
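Two of the features described above, GC content and the ATGA stop/start overlap, are easy to illustrate in code. This is a hedged sketch: the helper names and the 12-nt toy ORF are hypothetical, and only the tRNAMet-binding site sequence is taken from the text. Running it on the actual genome (MF476845) would be needed to reproduce the reported 43.3% GC content.

```python
from typing import Optional

TRNA_MET_SITE = "TGGTATCAGAGCTTGGTT"  # binding site quoted in the text
STOP_CODONS = {"TAA", "TAG", "TGA"}


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def next_orf_start_via_atga(seq: str, orf_start: int) -> Optional[int]:
    """Read codons from orf_start to the first in-frame stop codon.
    If that stop is the TGA of an ATGA motif, return the index of the
    overlapping ATG (the putative start of the next ORF); else None."""
    for i in range(orf_start, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            if i >= 1 and seq[i - 1:i + 3] == "ATGA":
                return i - 1
            return None
    return None


# Hypothetical toy ORF: ATG GCT GCA TGA, with ATG overlapping the stop.
toy_orf = "ATGGCTGCATGA"
start2 = next_orf_start_via_atga(toy_orf, 0)
print(start2, (start2 - 0) % 3)  # 8 2  (frame 2 is equivalent to -1)
print(round(gc_content(TRNA_MET_SITE), 1))  # 44.4
```

The downstream ATG begins one nucleotide before the upstream stop codon, which is why the next ORF sits in the -1 frame.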
Analysis of deduced amino acid sequences identified proteins with molecular weights of 16.9, 14.3 and 215.7 kDa encoded by ORFs 1, 2 and 3, respectively. The ORF3 polyprotein of DBRTV3 has the characteristic features of members of the family Caulimoviridae, including the zinc knuckle (Zn knuckle), pepsin-like aspartate protease (PR), reverse transcriptase (RT), and ribonuclease H (RNaseH) (Fig. 1B) [13]. The coat protein (CP) and movement protein (MP) described by Xu et al. [16] were also located.
Molecular phylogenetic analysis based on 579-bp-long partial nucleotide sequences of the badnavirus RT-RNaseH domain of DBRTV3, DBALV, DBALV2, DBESV, DBRTV1, DBRTV2, DBTRV, DBSNV and all 19 yam badnavirus sequences available in the GenBank database with nucleotide identity values > 80% in similarity searches with the NCBI Basic Local Alignment Search Tool (BLAST) showed that DBRTV3 belongs to the monophyletic species group K5 described by Kenyon et al. [2] and is 99% identical to the sequence NGl3841Dc (Fig. 1C, left panel).
A phylogenetic tree was constructed from full-length DBV genome sequences and badnavirus type members of host plants other than yam (Fig. 1C, right panel). The resulting tree shows that (1) yam badnaviruses form a well-supported clade in which (2) DBALV2 and DBESV as well as DBTRV and DBRTV1 group closely together, as previously reported by Sukal et al. [9], and that (3) DBRTV3 and DBSNV represent sister taxa in the genus Badnavirus.
Sequence comparisons of DBRTV3 and all other fully sequenced episomal DBVs were performed (Table 1). The nucleotide sequence of the RT-RNaseH domain displayed 65.7% to 76.1% sequence identity to the corresponding region of the other DBV genomes, which is below the species demarcation criterion for the genus Badnavirus of 80% identity in this domain [13]. This confirms that the DBRTV3 sequence is the first complete genome sequence of a virus belonging to the previously described species group K5 [2]. Additional sequence comparisons confirmed that DBRTV3 is a distinct yam-infecting badnavirus, with the genome sequence of DBSNV (group K4) being the most similar.
Table 1.
Virus | Complete genome | RT-RNaseH domain | ORF1 | ORF2 | ORF3 |
---|---|---|---|---|---|
DBALV | 60.2% | 66.3% (73.3%) | 76.9% (84.7%) | 55.4% (47.9%) | 60.0% (61.3%) |
DBALV2 | 51.0% | 71.8% (72.2%) | 53.1% (45.5%) | 45.2% (35.9%) | 55.5% (51.4%) |
DBESV | 48.7% | 65.7% (68.8%) | 53.4% (44.1%) | 45.7% (32.8%) | 51.8% (48.7%) |
DBRTV1 | 59.2% | 69.7% (75.6%) | 66.9% (72.0%) | 58.0% (53.2%) | 63.0% (64.3%) |
DBRTV2 | 55.4% | 69.1% (70.5%) | 61.2% (59.4%) | 55.5% (50.4%) | 59.0% (59.2%) |
DBSNV | 73.4% | 76.1% (81.8%) | 83.6% (91.0%) | 72.7% (71.3%) | 75.8% (82.2%) |
DBTRV | 60.4% | 66.3% (75.0%) | 71.3% (79.2%) | 54.7% (54.9%) | 61.8% (64.4%) |
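The species demarcation check described above can be reproduced directly from Table 1. In this sketch the dictionaries hold the first value of each RT-RNaseH cell, which the text identifies as nucleotide identity (65.7% to 76.1%), and the complete-genome column; the variable names are illustrative.

```python
# Nucleotide identity (%) of DBRTV3 to each sequenced DBV, from Table 1.
rt_rnaseh_nt_identity = {
    "DBALV": 66.3, "DBALV2": 71.8, "DBESV": 65.7, "DBRTV1": 69.7,
    "DBRTV2": 69.1, "DBSNV": 76.1, "DBTRV": 66.3,
}
complete_genome_identity = {
    "DBALV": 60.2, "DBALV2": 51.0, "DBESV": 48.7, "DBRTV1": 59.2,
    "DBRTV2": 55.4, "DBSNV": 73.4, "DBTRV": 60.4,
}

SPECIES_THRESHOLD = 80.0  # ICTV criterion: > 20% divergence in RT-RNaseH

# DBRTV3 is a distinct species if it falls below the threshold for all DBVs.
is_distinct_species = all(
    identity < SPECIES_THRESHOLD for identity in rt_rnaseh_nt_identity.values()
)
closest_relative = max(complete_genome_identity, key=complete_genome_identity.get)

print(is_distinct_species)  # True
print(closest_relative)     # DBSNV
```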
Taking advantage of the growing number of complete yam badnavirus genome sequences falling into distinct DBV species groups, we performed recombination analysis using full-length DBV genome sequences, which were aligned using MAFFT [17] and then analysed in the RDP4 software package using default settings [18]. Of a total of 14 possible recombination events (Table S2), only a single event (Fig. 2) was detected with a very high degree of confidence by all seven recombination detection methods (RDP, GENECONV, BootScan, MaxChi, Chimaera, SiScan and 3Seq) available in RDP4 [18], all showing significant p-values (Table S2). The putative recombination site was located in the IGR of DBRTV3 and extended into the 5′ end of ORF1. Significant differences in tree topologies were revealed by phylogenetic analysis of the recombined and non-recombined regions of the DBV genomes. A tree constructed using only the non-recombined region showed that DBRTV3 clustered together with DBSNV (Fig. 2, bottom panel), whereas DBRTV3 clustered with DBALV in a tree constructed using only the recombined region (Fig. 2, top panel). Therefore, DBRTV3 was identified as the recombinant, with DBSNV and DBALV as the viruses most closely related to the major and minor parent, respectively (Table S2). DBSNV was originally isolated from a wild Dioscorea sansibarensis plant in Benin [11], whereas DBALV was identified in a D. alata plant sampled in Nigeria [8]. The recombinant DBRTV3 originated from a D. rotundata breeding line maintained at the International Institute of Tropical Agriculture (IITA, Ibadan, Nigeria). Therefore, the opportunity for recombination between DBSNV and DBALV is not clear, but the literature suggests at least the latter is common throughout West Africa [3, 8].
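The intuition behind the conflicting tree topologies can be illustrated with a sliding-window identity scan over aligned sequences. This is not the RDP4 implementation, and it carries none of the statistical testing performed by the seven methods listed above; it is a toy sketch with hypothetical sequences and function names, showing only how a recombinant resembles one parent in one region and the other parent elsewhere.

```python
from typing import List, Tuple


def window_identity(a: str, b: str, start: int, size: int) -> float:
    """Fraction of identical positions between two aligned sequences
    within the window [start, start + size)."""
    pairs = zip(a[start:start + size], b[start:start + size])
    return sum(x == y for x, y in pairs) / size


def closer_parent_by_window(query: str, major: str, minor: str,
                            size: int = 10, step: int = 10) -> List[Tuple[int, str]]:
    """For each window, report which parent the query resembles more
    (ties are assigned to the major parent)."""
    calls = []
    for start in range(0, len(query) - size + 1, step):
        to_major = window_identity(query, major, start, size)
        to_minor = window_identity(query, minor, start, size)
        calls.append((start, "minor" if to_minor > to_major else "major"))
    return calls


# Hypothetical aligned toy sequences (40 nt each), not real DBV data.
major_parent = "ACGT" * 10
minor_parent = "TTGA" * 10
# Toy recombinant: first 10 nt from the minor parent, the rest from the major.
recombinant = minor_parent[:10] + major_parent[10:]

calls = closer_parent_by_window(recombinant, major_parent, minor_parent)
print(calls)  # [(0, 'minor'), (10, 'major'), (20, 'major'), (30, 'major')]
```

In the toy output, the switch from 'minor' to 'major' between the first and second windows marks the approximate breakpoint, analogous to DBRTV3 grouping with DBALV in the recombined region and with DBSNV elsewhere.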
Recombination is an important driving force in viral evolution, and this study provides the first evidence for potentially extensive recombination in yam badnaviruses. It is interesting to note that four out of 14 possible recombination events were detected using parent-like sequences inferring unknown parents (Table S2), which suggests that the full genetic diversity of yam badnaviruses (complete genomes) is underestimated and unknown at present. The extent of recombination among DBV genomes will become clearer once more-extensive sequencing of episomal badnaviruses from West Africa and other yam-growing regions of the world has been performed.
Recombination in geminiviruses has previously been shown to originate from mixed infections [19]. Naturally occurring mixed infections of yam with more than one DBV isolate have recently been reported to be the norm [4, 10], and further studies of recombination among DBVs can be expected to provide more detail about recombinant isolates in the future. Propagation of a wide assortment of yam germplasm at yam research centres and in breeding programmes may facilitate recombination: because badnaviruses by themselves generally do not cause marked symptoms, infected plants may be cultivated in conditions that facilitate virus transmission between germplasm, leading to recombination events and the emergence of more-virulent isolates. Therefore, there is an urgent need to develop reliable diagnostic tools for DBVs to help make rapid decisions on the health status of yam planting material, particularly in yam germplasm and seed yam distribution centres. For this, we plan to develop DBRTV3-specific diagnostic primers to be used in virus indexing assays. It remains remotely possible that DBRTV3 is an endogenous sequence that was inserted without rearrangement, a phenomenon that is occasionally found [20, 21]. Future work will be performed to test for the potential existence of eDBV forms of the DBRTV3 sequence in yam germplasm using Southern hybridization techniques similar to those described by Seal et al. [5] and Umber et al. [6].
In conclusion, the first complete genome sequence belonging to a member of yam badnavirus monophyletic species group K5 isolated from a Dioscorea rotundata breeding line is described. We propose this new member of the genus Badnavirus to be designated “Dioscorea bacilliform RT virus 3” (DBRTV3). Based on the comparison of full-length genome sequences, DBSNV was identified as the closest relative of DBRTV3. DBSNV was also found to be the major parent in a unique recombination event identified in DBRTV3, with DBALV likely to be the minor parent. The results provide the first evidence for recombination among yam badnavirus genomes. This finding implies that breeding programmes should introduce strict control measures to prevent the transmission of badnaviruses from one yam breeding line to another, to avoid the creation of mixed infections that could lead to recombinant badnaviruses with increased virulence.
Acknowledgements
The authors gratefully acknowledge the support of this work by the Bill & Melinda Gates Foundation (BMGF) under the “Development of On-Farm Robust Diagnostic Toolkits for Yam Viruses” grant to the Natural Resources Institute (NRI). Ajith Rathnayake was supported by a Vice-Chancellor PhD studentship awarded by the University of Greenwich. Funding to support open access is provided by the Bill & Melinda Gates Foundation. We would like to thank Roman Zipaj for assistance with design of the figures. Yam germplasm material was kindly provided by Dr. Lava Kumar at the International Institute of Tropical Agriculture (IITA) in Ibadan, Nigeria.
Compliance with ethical standards
Ethical standard statement
This study contained no experiments involving human participants or animals.
Conflict of interest
The authors declare that they have no conflict of interest.
Author consent
The authors declare their consent to the content of the manuscript.
References
- 1. Asiedu R, Sartie A. Crops that feed the World 1. Yams. Food Secur. 2010;2:305–315. doi: 10.1007/s12571-010-0085-0.
- 2. Kenyon L, Lebas BSM, Seal SE. Yams (Dioscorea spp.) from the South Pacific Islands contain many novel badnaviruses: implications for international movement of yam germplasm. Arch Virol. 2008;153:877–889. doi: 10.1007/s00705-008-0062-5.
- 3. Bousalem M, Durand O, Scarcelli N, et al. Dilemmas caused by endogenous pararetroviruses regarding the taxonomy and diagnosis of yam (Dioscorea spp.) badnaviruses: analyses to support safe germplasm movement. Arch Virol. 2009;154:297–314. doi: 10.1007/s00705-009-0311-2.
- 4. Bömer M, Turaki A, Silva G, et al. A sequence-independent strategy for amplification and characterisation of episomal badnavirus sequences reveals three previously uncharacterised yam Badnaviruses. Viruses. 2016;8:188. doi: 10.3390/v8070188.
- 5. Seal S, Turaki A, Muller E, et al. The prevalence of badnaviruses in West African yams (Dioscorea cayenensis-rotundata) and evidence of endogenous pararetrovirus sequences in their genomes. Virus Res. 2014;186:144–154. doi: 10.1016/j.virusres.2014.01.007.
- 6. Umber M, Filloux D, Muller E, et al. The genome of African yam (Dioscorea cayenensis-rotundata complex) hosts endogenous sequences from four distinct badnavirus species. Mol Plant Pathol. 2014;15:790–801. doi: 10.1111/mpp.12137.
- 7. Turaki AA, Bömer M, Silva G, et al. PCR-DGGE analysis: unravelling complex mixtures of badnavirus sequences present in yam germplasm. Viruses. 2017 doi: 10.3390/v9070181.
- 8. Briddon RW, Phillips S, Brunt A, Hull R. Analysis of the sequence of Dioscorea alata bacilliform virus; comparison to other members of the badnavirus group. Virus Genes. 1999;18:277–283. doi: 10.1023/A:1008076420783.
- 9. Sukal A, Kidanemariam D, Dale J, et al. Characterization of badnaviruses infecting Dioscorea spp. in the Pacific reveals two putative novel species and the first report of Dioscorea bacilliform RT virus 2. Virus Res. 2017;238:29–34. doi: 10.1016/j.virusres.2017.05.027.
- 10. Umber M, Gomez R-M, Gélabale S, et al. The genome sequence of Dioscorea bacilliform TR virus, a member of the genus Badnavirus infecting Dioscorea spp., sheds light on the possible function of endogenous Dioscorea bacilliform viruses. Arch Virol. 2017;162:517–521. doi: 10.1007/s00705-016-3113-3.
- 11. Seal S, Muller E. Molecular analysis of a full-length sequence of a new yam badnavirus from Dioscorea sansibarensis. Arch Virol. 2007;152:819–825. doi: 10.1007/s00705-006-0888-7.
- 12. Yang IC, Hafner GJ, Revill PA, et al. Sequence diversity of South Pacific isolates of Taro bacilliform virus and the development of a PCR-based diagnostic test. Arch Virol. 2003;148:1957–1968. doi: 10.1007/s00705-003-0163-0.
- 13. King AMQ, Adams MJ, Carstens EB, Lefkowitz EJ. Virus Taxon. San Diego: Elsevier; 2012. Family—Caulimoviridae; pp. 429–443.
- 14. Mumford RA, Seal SE. Rapid single-tube immunocapture RT-PCR for the detection of two yam potyviruses. J Virol Methods. 1997;69:73–79. doi: 10.1016/S0166-0934(97)00141-9.
- 15. Medberry SL, Lockhart BEL, Olszewski NL. Properties of Commelina yellow mottle virus’s complete DNA sequence, genomic discontinuities and transcript suggest that it is a pararetrovirus. Nucl Acids Res. 1990;18:5505–5513. doi: 10.1093/nar/18.18.5505.
- 16. Xu D, Mock R, Kinard G, Li R. Molecular analysis of the complete genomic sequences of four isolates of Gooseberry vein banding associated virus. Virus Genes. 2011;43:130–137. doi: 10.1007/s11262-011-0614-8.
- 17. Katoh K, Toh H. Recent developments in the MAFFT multiple sequence alignment program. Brief Bioinform. 2008;9:286–298. doi: 10.1093/bib/bbn013.
- 18. Martin DP, Murrell B, Golden M, et al. RDP4: detection and analysis of recombination patterns in virus genomes. Virus Evol. 2015;1:vev003. doi: 10.1093/ve/vev003.
- 19. García-Andrés S, Tomás DM, Sánchez-Campos S, et al. Frequent occurrence of recombinants in mixed infections of tomato yellow leaf curl disease-associated begomoviruses. Virology. 2007;365:210–219. doi: 10.1016/j.virol.2007.03.045.
- 20. Richert-Pöggeler KR, Noreen F, Schwarzacher T, et al. Induction of infectious petunia vein clearing (pararetro) virus from endogenous provirus in petunia. EMBO J. 2003;22:4836–4845. doi: 10.1093/emboj/cdg443.
- 21. Chabannes M, Baurens FC, Duroy PO, et al. Three infectious viral species lying in wait in the banana genome. J Virol. 2013;87:8624–8637. doi: 10.1128/JVI.00899-13.
- 22. Hasegawa M, Kishino H, Yano T. Dating the human-ape split by a molecular clock of mitochondrial DNA. J Mol Evol. 1985;22:160–174. doi: 10.1007/BF02101694.
- 23. Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33:1870–1874. doi: 10.1093/molbev/msw054.