ABSTRACT
Here, we announce the complete genome of a previously undescribed papillomavirus from a betta fish, Betta splendens. The genome is 5,671 bp with a GC content of 38.2%. Variants were detected in public databases. This genome is most similar to papillomaviruses that infect sea bass (52.9 % nucleotide identity).
KEYWORDS: Papillomavirus, Siamese fighting fish, cenote-taker 2, deep sequencing, viruses
ANNOUNCEMENT
Papillomaviruses (PVs) are double-stranded circular DNA viruses that belong to the consortium of small DNA tumor viruses (1). Papillomavirus genomes range in size from 5 to 8.6 kb. Virions are non-enveloped, icosahedral particles approximately 55 nm in diameter and have been identified in most classes of vertebrates (1–3). To date, 16 PVs that infect fishes or have a putative fish host have been identified. The genomes of these viruses are notably smaller (5–6 kb) than mammalian PVs, possess a subset of the open reading frames (ORFs) named in that lineage, and constitute the exclusive members of the subfamily, Secondpapillomavirinae (family, Papillomaviridae; genus, Alefpapillomavirus) (4). The core ORFs in this group of viruses that are shared with mammalian PVs include E1, E2, L1, and L2. An E4 homolog has been identified in some genomes. Notably, homologs of the E6/E7 proteins associated with neoplasia are encoded in some of these fish PV genomes. In cases where PVs are associated with clinical disease in fishes, often multiple small to large, raised nodules that appear gray or black in color are observed. They are typically benign (1, 3, 5).
Betta fish (Betta splendens; Siamese fighting fish) are freshwater fish native to southern Asia and commercially important in the ornamental fish trade (6–8). Although bacterial and viral diseases have been described in this species, the extent of putative microbial pathogens is unknown (9–11). Here, a moribund betta fish that presented with lethargy and later died (October 2018) at The University of the West Indies, St. Augustine, School of Veterinary Medicine, was examined. No raised nodules were observed. DNA was extracted from pooled liver and splenic tissues using a DNeasy Blood & Tissue DNA extraction kit (Qiagen). DNA was subjected to Repli-G (Qiagen) preamplification and submitted to GENEWIZ, Inc. A DNA library was created using a NEB NextUltra DNA Library Preparation kit, and sequencing was performed using a 2 × 150 paired-end configuration on an Illumina HiSeq instrument. Reads were quality trimmed and assembled using Megahit (v1.2.9) and screened for viral sequences in Cenote Taker 2 (12, 13). The genome of a novel papillomavirus consisting of 5,671-bp genome with a GC content of 38.2% was recovered. Reads were mapped to the draft genome using CLC Genomics Workbench (v21.0.2) and represented 2.84% (3,116,785) of the total reads (109,600,338). Average coverage was 81,380×.
Six partially overlapping ORFs including E6/E7 homologs (OncN and OncC), replicative helicase (E1), E2 protein, and the capsid proteins L1 and L2 were found. The inferred intron-containing L1 sequence is reminiscent of fish papillomaviruses MZ570863 and KX643372. Highest amino acid identity was observed for seabass PV (QYW06029) E1 protein (54.3%). Lower sequence identity (23.9%–54.3%) was observed for the E1 proteins of other fish-associated PVs (Fig. 1).
Fig 1.
Genome map and phylogenetic analyses of fish papillomavirus E1 proteins. (A) Genome map depicting predicted coding sequences (CDS). Black arrows illustrate splice donor/ acceptor sites. MacVector (v18.6.1) was used to circularize the genome and annotate genome features. Gene names were assigned based on BLASTP search results. (B) Midpoint rooted maximum likelihood phylogeny of E1 proteins of fish papillomaviruses. E1 proteins from papillomaviruses with fish hosts were recovered from the NCBI database via BLASTP query (9/2023). Sequences were aligned with the Geneious Prime (v2023.2.1) plugin MUSCLE v.5.1 using the PPP algorithm (14). The alignment was used to infer a maximum likelihood phylogenetic tree in IQ-Tree 1.6.12 using model GTR + F + I + G4 (15). The phylogenetic tree was visualized using iTOL (V6) (16). Branch tips are labeled with sample accession number, and the betta papillomavirus is highlighted in bold. (C) Pairwise comparisons of fish-associated PV E1 amino acid sequences using the sequence demarcation tool (v1.3) (17). The betta papillomavirus is highlighted in bold.
While this virus was identified in a moribund fish, the association with morbidity is unclear. The complete genome presented here supports future development of molecular tools to further our understanding of the prevalence of this virus in ornamental fish and its role as a potential infectious agent.
ACKNOWLEDGMENTS
Any use of trade, product, or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government. The USDA is an equal opportunity employer and provider.
Contributor Information
Luke R. Iwanowicz, Email: luke.iwanowicz@usda.gov.
John J. Dennehy, Queens College Department of Biology, USA
DATA AVAILABILITY
The genome sequence has been deposited in GenBank under accession number OQ633419. The raw reads were deposited in BioProject, PRJNA880623; BioSample, SAMN30874153; SRA, SRR21609248. Variant genomes were assembled from SRA DRR320248 and SRR18231402.
REFERENCES
- 1. Gaynor AM, Fish S, Duerr RS, Cruz Jr FD, Pesavento P. 2015. Identification of a novel papillomavirus in a northern fulmar (Fulmarus glacialis) with viral production in cartilage. Vet Pathol 52:553–561. doi: 10.1177/0300985814542812 [DOI] [PubMed] [Google Scholar]
- 2. Iwanowicz LR, Young KT, Adams CR, Blazer VS, Smith GD, Cornman RS. 2020. Draft genome sequence of an adomavirus associated with raised mucoid skin lesions on smallmouth bass (Micropterus dolomieu). Microbiol Resour Announc 9:e01479-19. doi: 10.1128/MRA.01479-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Surján A, Fónagy E, Eszterbauer E, Harrach B, Doszpoly A. 2021. Complete genome sequence of a novel fish papillomavirus detected in farmed wels catfish (Silurus glanis). Arch Virol 166:1–4. doi: 10.1007/s00705-021-05123-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Kraberger S, Austin C, Farkas K, Desvignes T, Postlethwait JH, Fontenele RS, Schmidlin K, Bradley RW, Warzybok P, Van Doorslaer K, Davison W, Buck CB, Varsani A. 2022. Discovery of novel fish papillomaviruses: from the Antarctic to the commercial fish market. Virology 565:65–72. doi: 10.1016/j.virol.2021.10.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. López-Bueno A, Mavian C, Labella AM, Castro D, Borrego JJ, Alcami A, Alejo A. 2016. Concurrence of iridovirus, polyomavirus, and a unique member of a new group of fish papillomaviruses in lymphocystis disease-affected gilthead sea bream. J Virol 90:8768–8779. doi: 10.1128/JVI.01369-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Monvises A, Nuangsaeng B, Sriwattanarothai N, Panijpan B. 2009. The Siamese fighting fish: well-known generally but little-known scientifically. ScienceAsia 35:8–16. doi: 10.2306/scienceasia1513-1874.2009.35.008 [DOI] [Google Scholar]
- 7. Tamba SP, Purba A, Kusuma YE, Vidyastuti MAS, Dharma S. 2021. Implementation of the rank order centroid (ROC) method to determine the favorite betta fish. Infokum 9:381–386. [Google Scholar]
- 8. Fisheries Do . 2019. Fisheries single window data system. Department of Fisheries Bangkok, Thailand. [Google Scholar]
- 9. Gibson-Kueh S, Netto P, Ngoh-Lim GH, Chang SF, Ho LL, Qin QW, Chua FHC, Ng ML, Ferguson HW. 2003. The pathology of systemic Iridoviral disease in fish. J Comp Pathol 129:111–119. doi: 10.1016/s0021-9975(03)00010-0 [DOI] [PubMed] [Google Scholar]
- 10. Novotny L, Halouzka R, Matlova L, Vavra O, Bartosova L, Slany M, Pavlik I. 2010. Morphology and distribution of granulomatous inflammation in freshwater ornamental fish infected with mycobacteria. J Fish Dis 33:947–955. doi: 10.1111/j.1365-2761.2010.01202.x [DOI] [PubMed] [Google Scholar]
- 11. Orós J, Priestnall SL, Suárez-Bonnet A. 2022. Histopathological description of iridophoromas resembling skin nodule syndrome in Siamese fighting fish Betta splendens. Dis Aquat Organ 151:23–27. doi: 10.3354/dao03686 [DOI] [PubMed] [Google Scholar]
- 12. Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. 2015. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31:1674–1676. doi: 10.1093/bioinformatics/btv033 [DOI] [PubMed] [Google Scholar]
- 13. Tisza MJ, Belford AK, Domínguez-Huerta G, Bolduc B, Buck CB. 2021. Cenote-taker 2 democratizes virus discovery and sequence annotation. Virus Evol 7:veaa100. doi: 10.1093/ve/veaa100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Edgar RC. 2004. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797. doi: 10.1093/nar/gkh340 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Trifinopoulos J, Nguyen L-T, von Haeseler A, Minh BQ. 2016. W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res 44:W232–W235. doi: 10.1093/nar/gkw256 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Letunic I, Bork P. 2019. Interactive tree of life (iTOL) v4: recent updates and new developments. Nucleic Acids Res 47:W256–W259. doi: 10.1093/nar/gkz239 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Muhire BM, Varsani A, Martin DP. 2014. SDT: a virus classification tool based on pairwise sequence alignment and identity calculation. PLoS One 9:e108277. doi: 10.1371/journal.pone.0108277 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome sequence has been deposited in GenBank under accession number OQ633419. The raw reads were deposited in BioProject, PRJNA880623; BioSample, SAMN30874153; SRA, SRR21609248. Variant genomes were assembled from SRA DRR320248 and SRR18231402.

