The complete genome sequence of molluscum contagiosum virus 1 (MOCV1) isolate NT2017 was sequenced from a tissue sample from an Australian woman. The genome consisted of 185,655 bp encoding 169 predicted open reading frames. Phylogenetically, isolate NT2017 was most closely related to an MOCV1 strain from Slovenia.
ABSTRACT
The complete genome sequence of molluscum contagiosum virus 1 (MOCV1) isolate NT2017 was sequenced from a tissue sample from an Australian woman. The genome consisted of 185,655 bp encoding 169 predicted open reading frames. Phylogenetically, isolate NT2017 was most closely related to an MOCV1 strain from Slovenia.
ANNOUNCEMENT
Molluscum contagiosum virus (MOCV) belongs to the genus Molluscipoxvirus, in the family Poxviridae, and possesses a large double-stranded DNA genome of approximately 190 kbp and a G+C content of >60% (1). Early genomic studies suggested the existence of four major MOCV genotypes designated MOCV1 to MOCV4 (2) and several genotypic variants (3, 4). MOCV1 is the most prevalent genotype worldwide and causes cutaneous infections in the pediatric population, with only rare occurrences in adults (5). MOCV infection with mild symptoms has previously been reported in Australia (6, 7). Except for a single unpublished genome of MOCV (GenBank accession no. MH646551), there are no other genomes available for MOCVs detected in Australia. Here, we report the identification and genomic characterization of MOCV1 isolate NT2017.
The cutaneous tissue sample was collected from the left-hand side of the nose of a 39-year-old woman in 2017. Histologically MOCV-diagnosed formalin-fixed paraffin-embedded (FFPE) cutaneous tissue was aseptically sectioned, and total genomic DNA (gDNA) was extracted using a QIAamp DNA-FFPE tissue kit (Qiagen, USA). The library preparation was conducted with 1 ng of total gDNA using the Illumina Nextera XT DNA library prep kit, as reported previously (8, 9). The quality and quantity of the prepared library were assessed using an Agilent TapeStation system by the Genomic Platform, La Trobe University. Cluster generation and sequencing were performed using a MiSeq reagent kit version 3, generating 301-bp paired-end reads, as described previously (10, 11). The sequence data were processed using Geneious version 10.2.2 and CLC Genomics Workbench version 9.5.4 as previously described (12, 13). A total of 4,695,968 raw reads were preprocessed to remove adapters, ambiguous base calls, and poor-quality reads, followed by mapping against the human genome (Homo sapiens GRCh38, GenBank accession no. GCF_000001405.26) to remove host DNA contamination. Unmapped reads were used as input data for de novo assembly using the SPAdes assembler version 3.10.1 (14). This yielded a single contig of 185,602 bp. The resulting contig was further checked and validated by mapping back the trimmed reads. The draft genome sequence was further checked by mapping with a reference MOCV1 genome (GenBank accession no. U60315). This resulted in the generation of a 185,655-bp genome, including two inverted terminal repeats, each 2,505 bp long with a G+C content of 62%, and an average coverage of 404.20×. The MOCV1 genome was used as a reference genome for the annotation of the newly sequenced isolate using Geneious version 10.2.2, and further verification of the predicted open reading frames (ORFs) longer than 50 amino acids was performed using CLC Genomics Workbench version 9.5.4. Pairwise identity of the representative Molluscipoxvirus species against the MOCV1 isolate on the basis of complete genome nucleotide sequences was calculated using Base-by-Base software (15).
The MOCV1 isolate NT2017 genome contained 169 predicted methionine-initiated ORFs encoding proteins, which have been annotated as putative genes and were numbered from left to right. Comparative analysis of the protein sequences encoded by the predicted ORFs using BLASTX and BLASTP (16) identified homologs with significant protein sequence similarity to other MOCVs (E value, ≤e-5) for 168 ORFs, and only an open reading frame A (ORF-A) was found to be unique.
Phylogenetic analysis was conducted using selected full-genome sequences under the genus Molluscipoxvirus; NT2017 clustered in the same subclade as MOCV1 strains isolated from Slovenia (with nucleotide identities ranging from 99.86 to 99.87) with high bootstrap support (∼100%) (Fig. 1).
FIG 1.
Phylogenetic relationship between MOCV isolated from Australia and other selected MOCV strains under the genus Molluscipoxvirus. The selected genomes were aligned with MAFTT version 7.450 using the G-INS-i (gap open penalty, 1.53; offset value, 0.123) algorithm implemented in Geneious version 10.2.2 (17). A maximum likelihood tree was constructed from the selected molluscipoxviruses using Geneious version 10.2.2. The numbers on the left show bootstrap values as percentages, and the MOCV1 isolate NT2017 is highlighted in blue. The labels at the branch tips refer to the original Molluscipoxvirus GenBank accession number followed by the abbreviated species name, isolate name, and country of isolation.
Data availability.
This whole-genome sequence of MOCV1 isolate NT2017 has been deposited in DDBJ/ENA/GenBank under the accession no. MN202187. The version described in this paper is the first version, MN202187.1. The raw sequencing data from this study have been deposited in the NCBI Sequence Read Archive (SRA) under the accession no. SRR12569148 (BioProject no. PRJNA660887; BioSample accession no. SAMN15963002).
ACKNOWLEDGMENT
Subir Sarker is the recipient of an Australian Research Council Discovery Early Career Researcher Award (grant no. DE200100367) funded by the Australian government. The funder had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
REFERENCES
- 1.Zorec TM, Kutnjak D, Hosnjak L, Kusar B, Trcko K, Kocjan BJ, Li Y, Krizmaric M, Miljkovic J, Ravnikar M, Poljak M. 2018. New insights into the evolutionary and genomic landscape of molluscum contagiosum virus (MCV) based on Nine MCV1 and six MCV2 complete genome sequences. Viruses 10:586. doi: 10.3390/v10110586. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Darai G, Reisner H, Scholz J, Schnitzler P, Lorbacher de Ruiz H. 1986. Analysis of the genome of molluscum contagiosum virus by restriction endonuclease analysis and molecular cloning. J Med Virol 18:29–39. doi: 10.1002/jmv.1890180105. [DOI] [PubMed] [Google Scholar]
- 3.Nakamura J, Muraki Y, Yamada M, Hatano Y, Nii S. 1995. Analysis of molluscum contagiosum virus genomes isolated in Japan. J Med Virol 46:339–348. doi: 10.1002/jmv.1890460409. [DOI] [PubMed] [Google Scholar]
- 4.Yamashita H, Uemura T, Kawashima M. 1996. Molecular epidemiologic analysis of Japanese patients with molluscum contagiosum. Int J Dermatol 35:99–105. doi: 10.1111/j.1365-4362.1996.tb03270.x. [DOI] [PubMed] [Google Scholar]
- 5.Leung AKC, Barankin B, Hon KLE. 2017. Molluscum contagiosum: an update. Recent Pat Inflamm Allergy Drug Discov 11:22–31. doi: 10.2174/1872213X11666170518114456. [DOI] [PubMed] [Google Scholar]
- 6.Thompson CH, De Zwart-Steffe RT, Biggs IM. 1990. Molecular epidemiology of Australian isolates of molluscum contagiosum. J Med Virol 32:1–9. doi: 10.1002/jmv.1890320102. [DOI] [PubMed] [Google Scholar]
- 7.Konya J, Thompson CH. 1999. Molluscum contagiosum virus: antibody responses in persons with clinical lesions and seroepidemiology in a representative Australian population. J Infect Dis 179:701–704. doi: 10.1086/314620. [DOI] [PubMed] [Google Scholar]
- 8.Sarker S, Isberg RS, Moran LJ, Araujo DR, Elliott N, Melville L, Beddoe T, Helbig JK. 2019. Crocodilepox virus evolutionary genomics supports observed poxvirus infection dynamics on saltwater crocodile (Crocodylus porosus). Viruses 11:1116. doi: 10.3390/v11121116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sarker S, Isberg SR, Milic NL, Lock P, Helbig KJ. 2018. Molecular characterization of the first saltwater crocodilepox virus genome sequences from the world’s largest living member of the Crocodylia. Sci Rep 8:5623. doi: 10.1038/s41598-018-23955-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sarker S, Roberts HK, Tidd N, Ault S, Ladmore G, Peters A, Forwood JK, Helbig K, Raidal SR. 2017. Molecular and microscopic characterization of a novel Eastern grey kangaroopox virus genome directly from a clinical sample. Sci Rep 7:16472. doi: 10.1038/s41598-017-16775-7. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 11.Sutherland M, Sarker S, Vaz PK, Legione AR, Devlin JM, Macwhirter PL, Whiteley PL, Raidal SR. 2019. Disease surveillance in wild Victorian cacatuids reveals co-infection with multiple agents and detection of novel avian viruses. Vet Microbiol 235:257–264. doi: 10.1016/j.vetmic.2019.07.012. [DOI] [PubMed] [Google Scholar]
- 12.Sarker S, Batinovic S, Talukder S, Das S, Park F, Petrovski S, Forwood JK, Helbig KJ, Raidal SR. 2020. Molecular characterisation of a novel pathogenic avipoxvirus from the Australian magpie (Gymnorhina tibicen). Virology 540:1–16. doi: 10.1016/j.virol.2019.11.005. [DOI] [PubMed] [Google Scholar]
- 13.Sutherland M, Sarker S, Raidal SR. 2019. Molecular and microscopic characterisation of a novel pathogenic herpesvirus from Indian ringneck parrots (Psittacula krameri). Vet Microbiol 239:108428. doi: 10.1016/j.vetmic.2019.108428. [DOI] [PubMed] [Google Scholar]
- 14.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hillary W, Lin S-H, Upton C. 2011. Base-By-Base version 2: single nucleotide-level analysis of whole viral genome alignments. Microb Inform Exp 1:2. doi: 10.1186/2042-5783-1-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Benson DA, Cavanaugh M, Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW. 2013. GenBank. Nucleic Acids Res 41:D36–D42. doi: 10.1093/nar/gks1195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Katoh K, Standley DM. 2013. MAFFT Multiple Sequence Alignment Software version 7: improvements in performance and usability. Mol Biol Evol 30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
This whole-genome sequence of MOCV1 isolate NT2017 has been deposited in DDBJ/ENA/GenBank under the accession no. MN202187. The version described in this paper is the first version, MN202187.1. The raw sequencing data from this study have been deposited in the NCBI Sequence Read Archive (SRA) under the accession no. SRR12569148 (BioProject no. PRJNA660887; BioSample accession no. SAMN15963002).

