Skip to main content
Genome Announcements logoLink to Genome Announcements
. 2014 May 15;2(3):e00398-14. doi: 10.1128/genomeA.00398-14

Genome Sequence of a Novel Iflavirus from mRNA Sequencing of the Butterfly Heliconius erato

Gilbert Smith 1,, Aide Macias-Muñoz 1, Adriana D Briscoe 1
PMCID: PMC4022809  PMID: 24831145

Abstract

Here, we report the genome sequence of a novel iflavirus strain recovered from the neotropical butterfly Heliconius erato. The coding DNA sequence (CDS) of the iflavirus genome was 8,895 nucleotides in length, encoding a polyprotein that was 2,965 amino acids long.

GENOME ANNOUNCEMENT

Heliconius butterflies are a diverse group of around 40 neotropical species that are widely studied for their wing color pattern diversity and Müllerian mimicry, and display a striking pattern of both divergent and convergent evolution in wing phenotypes (1, 2). Heliconius erato is native to Central and South America where a huge diversity of wing phenotypes are expressed across races from different geographic locations (3). Iflaviruses are single-stranded RNA viruses that infect insect hosts, often leading to developmental problems and the death of the host (4). However, little is known about the viral (and bacterial) assemblages infecting Heliconius butterflies.

We sequenced the mRNA of one male and one female of H. erato, obtained from a Costa Rican butterfly farm, Suministros Entomológicos Costarricenses, S.A. RNA was extracted from two individuals for each of four tissue types (the antenna, mouth-parts, head, and legs) using TRIzol (Life Technologies). RNA sequencing libraries were prepared using the mRNA-Seq sample preparation kit (Illumina) and sequenced using an Illumina HiSeq 2000 sequencer by the UCI Genomics High-Throughput Facility. In total, eight libraries were sequenced, producing an average library size of approximately 24 million 100-bp paired-end reads, after quality control filters were applied. Paired-end libraries were then de novo assembled into putative mRNA transcripts using Trinity (5). The Trinity-assembled nucleotide (nt) contigs were translated into amino acid (aa) sequences, and a BLASTp comparison to the NCBI NR protein database (6) revealed the presence of a viral polyprotein.

The best BLASTp hit for the discovered viral polyprotein matched a strain of iflavirus previously described from the Chinese oak silkmoth, Antheraea pernyi. The H. erato iflavirus polyprotein amino acid sequence demonstrated a 63% pairwise similarity to the Chinese oak moth iflavirus polyprotein. The coding DNA sequence (CDS) of the H. erato iflavirus was 8,895 nucleotides in length, encoding a 2,965-aa polyprotein. The entire iflavirus genome was 9,910 nucleotides long, which included a 906-nt 5′ untranslated region (UTR) and 110-nt 3′ UTR, and had a G+C content of 35.3%.

Iflavirus genomes typically encode a single polyprotein that is comprised of conserved domains for structural proteins in the N-terminal region, and non-structural proteins, RNA helicase, protease, and RNA-dependent RNA polymerase (RdRp) proteins in the C-terminal region (4). The 2,965-aa polyprotein of the H. erato iflavirus strain contained an arrangement of conserved domains common to that of previously described insect iflavirus genomes (7). This included two rhv-like picornavirus capsid protein domain-like motifs (Pfam entry, cd00205), an RdRp motif (Pfam entry, cd01699), RNA helicase (Pfam entry, PF00910), a cricket paralysis virus (CRPV) capsid protein-like motif (Pfam entry, PF08762), and a 3-C cysteine protease motif (Pfam entry, PF00548).

We present the complete genome of a novel strain of iflavirus isolated from H. erato. This H. erato iflavirus genomic sequence encodes a full and complete polyprotein when compared to other iflavirus genomes. The increasing availability of mRNA sequencing data from a wide range of insect species promises to aid in the understanding of the ecology and evolution of insect viral infection.

Nucleotide sequence accession number.

The genome sequence was deposited at DDBJ/EMBL/GenBank under accession no. KJ679438.

ACKNOWLEDGMENTS

Thanks to Furong Yuan for providing advice on the preparation of RNA-Seq libraries.

This material was based in part upon work supported by the National Science Foundation under cooperative agreement no. DBI-0939454 and grant IOS-1257627 to A.D.B.

Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Footnotes

Citation Smith G, Macias-Muñoz A, Briscoe AD. 2014. Genome sequence of a novel iflavirus from mRNA sequencing of the butterfly Heliconius erato. Genome Announc. 2(3):e00398-14. doi:10.1128/genomeA.00398-14.

REFERENCES

  • 1. Jiggins CD, Naisbit RE, Coe RL, Mallet J. 2001. Reproductive isolation caused by colour pattern mimicry. Nature 411:302–305. 10.1038/35077075 [DOI] [PubMed] [Google Scholar]
  • 2. Mallet J, Joron M. 1999. Evolution of diversity in warning color and mimicry: Polymorphisms, shifting balance, and speciation. Annu. Rev. Ecol. Syst. 30:201–233. 10.1146/annurev.ecolsys.30.1.201 [DOI] [Google Scholar]
  • 3. Supple M, Papa R, Counterman B, McMillan WO. 2014. The genomics of an adaptive radiation—insights across the Heliconius speciation continuum, p 249–271 In Landry CR, Aubin-Horth N. (ed), Advances in experimental medicine and biology, vol 781 Ecological genomics: ecology and the evolution of genes and genomes; Springer, Dordrecht, Netherlands: [DOI] [PubMed] [Google Scholar]
  • 4. van Oers MM. 2010. Genomics and biology of iflaviruses, p 231–250 In Asgari S, Johnson K. (ed), Insect virology. Caister Academic Press, Norfolk, United Kingdom [Google Scholar]
  • 5. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, Chen Z, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A. 2011. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29:644–652. 10.1038/nbt.1883 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden TL. 2008. NCBI BLAST: a better web interface. Nucleic Acids Res. 36:W5–W9. 10.1093/nar/gkn201 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Sparks ME, Gundersen-Rindal DE, Harrison RL. 2013. Complete genome sequence of a novel iflavirus from the transcriptome of Halyomorpha halys, the brown marmorated stink bug. Genome Announc. 1(6):e00910-13. 10.1128/genomeA.00910-13 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genome Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES