The genome sequence of a baculovirus isolated from Choristoneura diversana is 122,827 bp long and contains 150 putative open reading frames (ORFs). The virus is closely related to alphabaculoviruses isolated from insect species of the genus Choristoneura.
ABSTRACT
The genome sequence of a baculovirus isolated from Choristoneura diversana is 122,827 bp long and contains 150 putative open reading frames (ORFs). The virus is closely related to alphabaculoviruses isolated from insect species of the genus Choristoneura.
ANNOUNCEMENT
Choristoneura diversana is a univoltine species belonging to the Tortricidae family, which is widely distributed in the Palearctic ecozone. Symptoms of nuclear polyhedrosis have been observed in larvae collected from C. diversana outbreak populations (1). A causal agent of nuclear polyhedrosis in C. diversana has not been reported; however, nucleopolyhedroviruses (NPVs) isolated from Choristoneura species such as C. muriana (ChmuNPV), C. occidentalis (ChocNPV), and C. rosaceana (ChroNPV) and two distinct NPVs from C. fumiferana (CfMNPV and CfDEFNPV) have been characterized (2–5). To characterize the nuclear polyhedrosis of C. diversana, an NPV was isolated from the insect, and the genome sequence was determined.
As per a previously described method (6), virions were purified from viral occlusion bodies prepared from three larval samples of C. diversana collected at a Todomatsu fir stand in Hokkaido, Japan, which had been preserved at the Forestry and Forest Products Research Institute since 1972. Viral genomic DNA was prepared as described previously (6). A paired-end sequence library with a 350-bp insert was prepared using a TruSeq DNA PCR-free kit (Illumina), and 101-base paired-end reads were generated on the HiSeq 2500 platform (Illumina) using a commercial sequencing service (Macrogen, Japan). Trimming of low-quality ends (Phred quality scores of <Q20) and adaptors was performed, and reads less than 40 bases long were filtered out using BBDuk2 (https://github.com/BioInfoTools/BBMap/blob/master/sh/bbduk2.sh) with the parameter settings ktrimright = t, k = 27, hdist = 1, qtrim = rl, trimq = 20, minlength = 40, trimoverlap = t, and minoverlap = 24. Assembly was conducted first using 5% of the total processed reads (1,052,696 of 21,053,934 reads) using de novo assembly in Geneious Prime 2019.2.3 (Biomatters) with default parameters of low sensitivity/fastest setting except that the “only use paired hits during assembly” option was used. This generated a contig with overlapping sequences at both ends, from which a circular contig was created. All processed reads were then used to assemble the circular contig, permitting 20,940,902 reads to be assembled and resulting in a mean coverage of 16,856× (standard deviation, 2,051×). An ambiguous region with lower coverage inside the orf1629 gene was amplified using PCR, and the DNA was sequenced using Sanger sequencing. The PCR generated two amplicons (482 and 656 bp) containing repeat sequences. The genome sequence reported here used the sequence of the longer amplicon. Putative open reading frames (ORFs) encoding more than 50 amino acids were identified using the Glimmer 3 version 1.5 (7) algorithm trained for ChmuNPV ORF annotations using Geneious Prime 2019.2.3 (Biomatters) and were manually edited.
The genome of the C. diversana NPV (ChdiNPV) was 122,827 bp long with a G+C content of 50.2% and was estimated to have 150 ORFs, including 6 ORFs unique to ChdiNPV. The remaining 144 ORFs had similarities to other baculovirus genes, including all baculovirus core genes. Global alignment using EMBOSS Stretcher (8) showed that the ChdiNPV genome had 95.4%, 81.6%, 81.2%, and 75.8% nucleotide identities to ChmuNPV, ChocNPV, CfMNPV, and ChroNPV, respectively. A blastp search with a cutoff E value of 1 × 10−5 using blast+ (9) found that ChdiNPV shared 140, 136, 136, and 136 ORFs with ChmuNPV, ChocNPV, CfMNPV, and ChroNPV, respectively. The mean amino acid identities of ChdiNPV putative proteins with orthologous proteins of the above-mentioned 4 viruses were 98.3%, 87.7%, 87.4%, and 84.4%, respectively. Phylogenic analysis using concatenated sequences of LEF-8 and PIF-2 indicates that ChdiNPV belongs to the group I alphabaculoviruses and is the closest to ChmuNPV in lineage, including ChocNPV and CfMNPV. ChroNPV was aligned to this lineage (Fig. 1A). Although the closest relationship identified was between ChdiNPV and ChmuNPV, several divergent regions were detected in an alignment conducted using progressiveMauve (10) implemented in Geneious Prime 2019.2.3 (Biomatters) using the default settings (Fig. 1B), which contributed to the discrepancy of the presence or absence of ORFs in each viral genome.
FIG 1.
(A) Phylogenetic tree of alphabaculovirus. Amino acid sequences of lef-8 and pif-2 gene products were aligned with MUSCLE (11) using the default settings and concatenated after removing ambiguously aligned residues using the -gappyout command in trimAl version 1.2 (12). A maximum likelihood tree was inferred by employing the LG + G + I + F model of amino acid substitution (number of substitution rate categories, 4; gamma shape parameter, 0.796; proportion of invariable sites, 0.222) in the PhyML 3.0 Web server (13). The numbers at the nodes indicate Shimodaira-Hasegawa-like local support values (>50%). Branch termini are labeled according to virus names and GenBank accession numbers. ChdiNPV is shown on a green background. The group II alphabaculovirus cluster is compressed. (B) Divergent regions of the ChdiNPV and ChmuNPV genomes. In each panel, green bars in the identity graph show identical bases in the alignment columns between the viral genome sequences. A scale in the alignment, represented by the positions from A of the polyhedrin ATG start codon, is shown on the top of the identity graph. A schematic diagram of the alignment in each divergent region is shown by gray boxes and black lines below the identity graph. The gray boxes and black lines indicate nucleotide sequences and alignment gaps, respectively. Protein-coding sequences on the ChdiNPV and ChmuNPV genomes are indicated by yellow arrows above and below the alignment diagram, respectively.
Data availability.
The genome sequence of ChdiNPV has been deposited in DDBJ/EMBL/GenBank under the accession number LC516821, with raw sequence read data available at the DDBJ Sequence Read Archive under the accession number DRA009434.
ACKNOWLEDGMENT
This study was supported by JSPS KAKENHI grant numbers JP17K19432 and JP16H05052.
REFERENCES
- 1.Katagiri K. 1973. A newly discovered entomopoxvirus of Choristoneura diversana (Lepidoptera: Tortricidae). J Invertebr Pathol 22:300–302. doi: 10.1016/0022-2011(73)90152-3. [DOI] [Google Scholar]
- 2.Rohrmann GF, Erlandson MA, Theilmann DA. 2014. Genome sequence of an alphabaculovirus isolated from Choristoneura murinana. Genome Announc 2:e01135-13. doi: 10.1128/genomeA.01135-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Thumbi DK, Béliveau C, Cusson M, Lapointe R, Lucarotti CJ. 2013. Comparative genome sequence analysis of Choristoneura occidentalis Freeman and C. rosaceana Harris (Lepidoptera: Tortricidae) alphabaculoviruses. Plos One 8:e68968. doi: 10.1371/journal.pone.0068968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lauzon HAM, Jamieson PB, Krell PJ, Arif BM. 2005. Gene organization and sequencing of the Choristoneura fumiferana defective nucleopolyhedrovirus genome. J Gen Virol 86:945–961. doi: 10.1099/vir.0.80489-0. [DOI] [PubMed] [Google Scholar]
- 5.de Jong JG, Lauzon HAM, Dominy C, Poloumienko A, Carstens EB, Arif BM, Krell PJ. 2005. Analysis of the Choristoneura fumiferana nucleopolyhedrovirus genome. J Gen Virol 86:929–943. doi: 10.1099/vir.0.80490-0. [DOI] [PubMed] [Google Scholar]
- 6.Takatsuka J, Okuno S, Nakai M, Kunimi Y. 2016. Genetic and phenotypic comparisons of viral genotypes from two nucleopolyhedroviruses interacting with a common host species, Spodoptera litura (Lepidoptera: Noctuidae). J Invertebr Pathol 139:42–49. doi: 10.1016/j.jip.2016.07.009. [DOI] [PubMed] [Google Scholar]
- 7.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Madeira F, Park YM, Lee J, Buso N, Gur T, Madhusoodanan N, Basutkar P, Tivey ARN, Potter SC, Finn RD, Lopez R. 2019. The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res 47:W636–W641. doi: 10.1093/nar/gkz268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Darling AE, Mau B, Perna NT. 2010. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. Plos One 5:e11147. doi: 10.1371/journal.pone.0011147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Edgar RC. 2004. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. 2009. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, Gascuel O. 2010. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol 59:307–321. doi: 10.1093/sysbio/syq010. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome sequence of ChdiNPV has been deposited in DDBJ/EMBL/GenBank under the accession number LC516821, with raw sequence read data available at the DDBJ Sequence Read Archive under the accession number DRA009434.

