Abstract
We report here the first complete sequences of two Keystone virus (KEYV) genomes isolated from Florida in 2005, which include the first two publicly available complete large (L) gene sequences. The sequences of the KEYV L segments show 75.99 to 83.86% nucleotide similarity with those of other viruses in the California (CAL) serogroup of bunyaviruses.
GENOME ANNOUNCEMENT
Keystone virus (KEYV) belongs to the California (CAL) serogroup of arboviruses in the family Bunyaviridae (1). The name Keystone was derived from the geographic location in Florida where it was first isolated (2). Bunyaviruses are enveloped viruses with negative-sense single-stranded RNA (ssRNA) and tripartite genomes consisting of large (L), medium (M), and small (S) RNA segments (3). The CAL serogroup contains 16 arboviruses (1) that have been associated with human cases of encephalitis (4) and influenza-like or febrile illnesses in Europe (4–6). This study generated the first two complete genome sequences for the KEYV isolated in 2005 from the Tampa Bay area, FL.
The KEYVs were amplified in Vero cells, and viral RNA was extracted from cell culture supernatant using TRIzol and chloroform. Viral cDNA was synthesized using random hexamers and oligo(dT), randomly amplified, and prepared for next-generation sequencing using a sequence-independent single-primer amplification (SISPA) method, as described previously (7). The SISPA products were normalized and pooled into a single reaction mixture that was size selected (300 to 500 bp) using the SPRIselect reagent kit (Beckman Coulter, USA) for Illumina MiSeq version 2 with 300-bp paired-end reads.
Sequencing reads were binned by bar code and trimmed to remove SISPA bar codes, random hexamer sequences, and low-quality bases, and de novo assembly of the reads was performed using CLC bio’s clc_novo_assemble. De novo contigs were searched against the GenBank nonredundant nucleotide database (NT) using BLASTN, which identified the longest contigs with high similarity matches to the KEYV M and S segments. Since no KEYV L segment sequence was available in GenBank at the time, Jamestown Canyon virus sequence was used to identify the largest de novo contig of the KEYV L segment in our samples. Once initial partial references were identified for each segment, these were used as scaffolds to build new reference sequences using an iterative cycle of mapping assembly employing CLC bio’s clc_ref_assemble_long program. These were then analyzed by custom software built using the J. Craig Venter Institute (JCVI)’s Jillion Framework to identify read sequences that extended beyond the reference. Multiple-sequence alignments of these sequences were then used to generate a consensus beyond the 5′ and 3′ edges of the reference. Concatenating these new consensus sequences to the previous reference created a longer sequence that was then input as a new reference in a new mapping assembly that incorporated more reads. This iterative process continued until the reference for each segment contig reached the 5′ and 3′ termini. For these genomes, all segments shared an 11-bp conserved 5′ terminal sequence, in coding sense (5′-AGTAGTGTACT-3′) and a 3′ terminal sequence, in coding sense (5′-GAGCACACTAC-3′). The final assemblies were annotated using VIGOR (8, 9) and submitted to GenBank.
Comparing these two KEYV genomes with other CAL serogroup virus genomes shows a high degree of sequence diversity; the S, M, and L segments show nucleotide similarities ranging from 67.73% to 86.15%. Although the two new KEYVs do not show a large amount of diversity (98.3% similar), the first publically available KEY L segment sequences might aid in the phylogenetic analysis of CAL serogroup viruses.
Nucleotide sequence accession numbers.
The full genomic sequence data are deposited in NCBI’s GenBank database under accession numbers KT630288 through KT630293.
ACKNOWLEDGMENTS
We thank Susmita Shrivastava in the JCVI Bioinformatics Group for submitting the genome to NCBI, as well as the members of the Joint Technology Center at the J. Craig Venter Institute (JCVI) for performing the Illumina sequencing.
This project was funded with federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services under grant no. U19-AI-110819 (Genomic Center for Infectious Diseases U19).
The content of this article is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Footnotes
Citation Stockwell TB, Heberlein-Larson LA, Tan Y, Halpin RA, Fedorova N, Katzel DA, Smole S, Unnasch TR, Kramer LD, Das SR. 2015. First complete genome sequences of two Keystone viruses from Florida. Genome Announc 3(5):e01255-15. doi:10.1128/genomeA.01255-15.
REFERENCES
- 1.Hammon WM, Reeves WC, Sather G. 1952. California encephalitis virus, a newly described agent. II. Isolations and attempts to identify and characterize the agent. J Immunol 69:493–510. [PubMed] [Google Scholar]
- 2.Bond JO, Hammon WM, Lewis AL, Sather GE, Taylor DJ. 1966. California group arboviruses in Florida and report of a new strain, keystone virus. Public Health Rep 81:607–613. [PMC free article] [PubMed] [Google Scholar]
- 3.Elliott RM. 1990. Molecular biology of the Bunyaviridae. J Gen Virol 71:501–522. doi: 10.1099/0022-1317-71-3-501. [DOI] [PubMed] [Google Scholar]
- 4.LeDuc JW. 1987. Epidemiology and ecology of the California serogroup viruses. Am J Trop Med Hyg 37:60S–68S. [DOI] [PubMed] [Google Scholar]
- 5.Bowen MD, Jackson AO, Bruns TD, Hacker DL, Hardy JL. 1995. Determination and comparative analysis of the small RNA genomic sequences of California encephalitis, Jamestown Canyon, Jerry Slough, Melao, Keystone and Trivittatus viruses (Bunyaviridae, genus Bunyavirus, California serogroup). J Gen Virol 76:559–572. doi: 10.1099/0022-1317-76-3-559. [DOI] [PubMed] [Google Scholar]
- 6.Grimstad PR, Calisher CH, Harroff RN, Wentworth BB. 1986. Jamestown Canyon virus (California serogroup) is the etiologic agent of widespread infection in Michigan humans. Am J Trop Med Hyg 35:376–386. [DOI] [PubMed] [Google Scholar]
- 7.Djikeng A, Halpin R, Kuzmickas R, Depasse J, Feldblyum J, Sengamalay N, Afonso C, Zhang X, Anderson NG, Ghedin E, Spiro DJ. 2008. Viral genome sequencing by random priming methods. BMC Genomics 9:5. doi: 10.1186/1471-2164-9-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wang S, Sundaram JP, Spiro D. 2010. VIGOR, an annotation program for small viral genomes. BMC Bioinformatics 11:451. doi: 10.1186/1471-2105-11-451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wang S, Sundaram JP, Stockwell TB. 2012. VIGOR extended to annotate genomes for additional 12 different viruses. Nucleic Acids Res 40:W186–W192. doi: 10.1093/nar/gks528. [DOI] [PMC free article] [PubMed] [Google Scholar]
