Abstract
We report here the complete genome sequence of an emerging genotype of tobacco streak virus (TSV) infecting zucchini squash in Florida (TSV_FL13-07), obtained using deep sequencing of short RNAs (sRNAs) and validation by Sanger sequencing. TSV_FL13-07 shares only <90% sequence identity in all three genomic RNAs to several known U.S. isolates.
GENOME ANNOUNCEMENT
Ilarviruses belong to the family Bromoviridae and have a nonenveloped quasispherical virion with a segmented positive single-stranded RNA (ssRNA) genome, a 3′ tRNA-like structure, and a 5′ cap. Tobacco streak virus (TSV), a type member of Ilarvirus, has a wide host range infecting many plant species in >30 families worldwide. TSV is transmitted by thrips and can be seed transmissible. In April 2013, a zucchini type of summer squash, “Senator,” grown in an 80-acre field in Homestead, FL, displayed unusual severe mosaic and epinasty symptoms, with an approximately 5% infection rate, and with one corner of the field near 100% infection. Screening with serological tests to 13 commonly known viruses in cucurbits (Agdia, USA) was negative. To determine the causal agent, we applied small RNA sequencing technology (1–3). Small RNA libraries were prepared as described previously (4) and sequenced using the Illumina HiSeq 2000. The sRNA sequences were assembled using the bioinformatics pipeline (3) without presubtraction of the host RNAs due to the unavailability of the zucchini squash genome sequence. Based on their strong sequence identities to the TSV okra Indian isolate (FJ561302, FJ561303, and FJ561301), the contigs were assembled into three genomic RNAs. Complete TSV genomic sequences were obtained after sequencing additional reverse transcription PCR (RT-PCR) products covering sequence gaps. The 5′ and 3′ terminal sequences in three TSV genomic RNAs were determined using rapid amplification of cDNA ends (RACE) (Ambion, USA). To facilitate 5′ adaptor ligation in RACE, the 5′ cap was removed by tobacco acid phosphatase (Epicentre, USA) to generate a 5′-monophosphate. The RT-PCR products were cloned into the TOPO-TA vector (Life Technologies, USA) and sequenced with Sanger sequencing. The complete TSV genome (isolate FL13-07) comprises three RNAs with 3,525, 2,898, and 2,211 nucleotides (nt) for RNA1, RNA2, and RNA3, respectively. RNA1 contains a single open reading frame (ORF) encoding a replicase (1,092 amino acids [aa]). RNA2 has two ORFs, with 2a encoding RNA-dependent RNA polymerase (790 aa) and 2b encoding a protein (199 aa) with an unknown function. RNA3 contains two ORFs encoding a movement protein (290 aa) and a coat protein (238 aa). BLASTn searches to the NCBI databases revealed that TSV_FL13-07 shares the highest sequence identities (99%) in RNA1 to several TSV isolates from India (pumpkin, GenBank accession no. FJ561299; okra, FJ561302; okra, KF264471; sunflower, KF264472; sunflower, KF264473; sunflower, KF264474; and soybean, KJ825823) but only 87 to 88% identities to several isolates in the United States (tobacco, JX073656; tobacco, U80934; and soybean, FJ403375). Similarly, RNA2 shares the highest sequence identity (98%) to two Indian isolates (okra, FJ561303; and pumpkin, FJ561300) but only 83 to 84% identity to several known U.S. isolates (soybean, FJ403376; tobacco, JX073657; and Nicotiana, U77738) and 80 to 88% to other isolates in Australia (Verbesina encelioides, JX463338; and Helianthus annuus, JX463335). For RNA3, a higher sequence identity (98% to 99%) was again shared with those isolates from India (squash, FJ655170; and Vigna unguiculata, FJ655171), with only 91% to an Australian isolate (Verbesina encelioides, JX463339) and 90% to two U.S. isolates (tobacco, JX073658; and soybean, FJ403377). In summary, sequence analyses indicated that TSV_FL13-07 represents an emerging genotype in the United States that shares only <90% sequence identities to other known U.S. isolates.
Nucleotide sequence accession numbers.
The nucleotide sequences of TSV_FL13-07 (RNA1, RNA2, and RNA3) have been deposited in GenBank under the accession numbers KM504246, KM504247, and KM504248, respectively.
ACKNOWLEDGMENTS
The sRNA deep sequencing was conducted by the Genomics Resources Core Facility at the Weill Cornell Medical College in New York, NY. Sanger sequencing was carried out by Functional Biosciences, Inc. (Madison, WI).
We thank Andrea Gilliard and Alan Wilder for their excellent technical assistance.
This work was supported in part by the USDA, National Institute of Food and Agriculture SCRI 2012-01507-229756 (to K.-S.L. and Z.F.).
Footnotes
Citation Padmanabhan C, Gao S, Li R, Zhang S, Fei Z, Ling K-S. 2014. Complete genome sequence of an emerging genotype of tobacco streak virus in the United States. Genome Announc. 2(6):e01138-14. doi:10.1128/genomeA.01138-14.
REFERENCES
- 1. Kreuze JF, Perez A, Untiveros M, Quispe D, Fuentes S, Baker I, Simon R. 2009. Complete viral genome sequence and discovery of novel viruses by deep sequencing of small RNAs: a generic method for diagnosis, discovery and sequencing of viruses. Virology 388:1–7. 10.1016/j.virol.2009.03.024. [DOI] [PubMed] [Google Scholar]
- 2. Li R, Gao S, Hernandez AG, Wechter WP, Fei Z, Ling K-S. 2012. Deep sequencing of small RNAs in tomato for virus and viroid identification and strain differentiation. PLoS One 7:e37127. 10.1371/journal.pone.0037127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Li R, Gao S, Fei Z, Ling K-S. 2013. Complete genome sequence of a new tobamovirus naturally infecting tomatoes in Mexico. Genome Announc. 1(5):e00794-13. 10.1128/genomeA.00794-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Chen Y-R, Zheng Y, Liu B, Zhong S, Giovannoni J, Fei Z. 2012. A cost-effective method for Illumina small RNA-Seq library preparation using T4 RNA ligase 1 adenylated adapters. Plants Methods 8:41. 10.1186/1746-4811-8-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
