Skip to main content
Microbiology Spectrum logoLink to Microbiology Spectrum
. 2022 Aug 30;10(5):e02285-22. doi: 10.1128/spectrum.02285-22

Capturing Structural Variants of Herpes Simplex Virus Genome in Full Length by Oxford Nanopore Sequencing

Rajagopalan Saranathan a, Emmanuel Asare a, Lawrence Leung a, Anna Paula de Oliveira a, Katherine E Kaugars a, Claire V Mulholland a, Regy Lukose a, Michael Berney a, William R Jacobs Jr a,
Editor: JJ Mirandab
PMCID: PMC9602439  PMID: 36040163

ABSTRACT

Genome sequencing and assembly of viral genomes within the Herpesviridae family, particularly herpes simplex virus (HSV), have been challenging due to the large size (~154 Kb), high GC content (68%), and nucleotide variations arising during replication. Oxford Nanopore Technology (ONT) has been successful in obtaining read lengths ranging from 100 Kb up to 2.3 Mb. We have optimized DNA extraction and sequencing with ONT to capture the whole genome of HSV-1 as a single read. Although previous studies described the presence of four different genome isomers of HSV, we provided the first report on capturing all four variants’ full-length genome as single reads. These isomers were found to be present in almost equal proportion in the sequenced DNA preparation.

IMPORTANCE With the advent of next-generation sequencing platforms, genome sequencing of viruses can be performed in a relatively shorter time frame in even the most austere conditions. Ultralong read sequencing platforms, such as Oxford Nanopore Technology (ONT), have made it possible to capture the full-length genome of DNA viruses as a single read. By optimizing ONT for this purpose, we captured the genome (~154 Kb) of a clinical strain of herpes simplex virus 1 (HSV-1). Additionally, we captured full-length sequences of the four isomers of lab-grown HSV-1 virus and were able to determine the frequency of each within the isogenic population. This method will open new directions in studying the significance of these isomers and their clinical relevance to HSV-1 infections. It will also improve basic studies on the recombination and replication of this virus.

KEYWORDS: genome isomers, Oxford Nanopore sequencing, herpes simplex virus

OBSERVATION

Herpes simplex virus 1 (HSV-1) is a large double-stranded DNA virus that belongs to the Alphaherpesvirinae subfamily within the Herpesviridae family. HSV-1 is widespread in the human population, causing mild to severe skin and neurological infections (13). The genome of HSV-1 is complex and comprised of unique long (UL) and unique short (US) regions flanked by repeat elements on both ends. The termini of unique long (UL) and short (US) regions are flanked by the terminal and internal repeat of the long region (TRL and IRL) and the terminal and internal repeat of the short region (IRS and TRS) (Fig. 1A) (4, 5). Although HSV-1 has been reported to form genome isomers because of recombination events during replication, the significance of these isomers in virulence or viral fitness is not completely understood (6). The presence of these isomers was identified earlier by restriction digestion, southern blotting, and probe-based methods (711). Capturing HSV variants with their nucleotide sequence has been a challenge using short-read shotgun sequencing platforms due to extremely high GC content (70 to 85%) in some regions, such as the inverted repeats of the genome; genetic variations; and complexities associated with isolating intact viral genomic DNA (1216). In addition, short reads would need extensive coverage and computational processing and thus, cannot be reliably assembled to generate a full-length HSV-1 genome. Recently, Chang et al. (17) identified the DNA isomers of HSV-2 strain G by ONT but reported them as partial genome fragments. Because some of the replication- and virulence-related genes are present in repeat regions, capturing both copies of repeat elements is important to understand the process, but this is possible only with the full-length genome sequence (8). Furthermore, obtaining a full-length accurate HSV genome will be valuable for studying determinants of drug resistance, virulence, pathogenesis, and viral evolution (16). In this observation, we performed a hybrid assembly to obtain the genome sequence of an HSV-1 clinical strain using both ONT long reads and Illumina short reads.

FIG 1.

FIG 1

HSV-1 B3x 1.1 genome variants identified by Nanopore sequencing. (A) Schematic representation of HSV-1 genome with the repeat elements (TRL, IRL, TRS, IRS) on both ends of unique long (UL) and unique short (US) regions. (B) All four isomers, such as prototype (P), inverted short (IS), inverted long and short (ILS), and inverted long (IL) were identified by performing dot plot. Ultralong reads ranging from 145 kb to 155 kb were mapped against a reference genome (accession no. NC_001806) to identify the genome isomers using http://dgenies.toulouse.inra.fr/run.

We recently developed a method to extract intact full-length genomic DNA from HSV-1 and HSV-2, followed by sequencing and capturing their genome as a single read by ONT. Genomic DNA from HSV-1 clinical strain (B3x 1.1) (18) was isolated following the protocol optimized earlier (19). Briefly, Vero cells were infected at a multiplicity of infection (MOI) of 10−2 PFU/cell for 1 h at 37°C. When 90 to 100% cytopathic effect was observed, the supernatant and cells were collected and centrifuged at 300 × g for 10 min; the cell pellet was washed and resuspended in 10 mM Tris (pH 7.5) with protease inhibitor cocktail, homogenized, and centrifuged at 3000 × g. The pellet was resuspended in 10 mM Tris (pH 7.5) containing 5 mM MgCl2 and 180 units of benzonase and incubated for 30 min at room temperature. This was followed by centrifugation. The pellet was resuspended in TNE (50 mM Tris pH 7.5, 0.1 M NaCl, 10 mM EDTA), centrifuged 10,000 × g, followed by resuspension in TNE containing 0.5% SDS and 0.5 mg/mL pronase and incubated for 1 h at 37°C. The sample was then extracted with phenol:chloroform:isoamyl alcohol (25:24:1). The DNA was centrifuged at 15,000 rpm, washed with 70% ethanol, resuspended in a precipitation solution (5 M NaCl and 13% PEG 8000 in distilled water), and placed on ice. The sample was then centrifuged at 15,000 rpm, washed with 70% ethanol, and dissolved in TE buffer. The genomic library was prepared with the extracted DNA using the Ligation Sequencing kit (SQK-LSK109) following the manufacturer’s instructions and sequenced using a MinION sequencing unit. Wide orifice pipette tips were used throughout the extraction and library preparation procedure. We also performed shotgun sequencing on B3x 1.1 genomic DNA using a benchtop Miseq sequencer (Illumina, USA). The genomic library was prepared using the Nextera XT library preparation kit and sequenced with a 150-cycle (2 × 75 bp) v3 reagent kit following the manufacturer’s instructions.

With ONT sequencing, we obtained 104,524 reads with an average length of 1294 bp. We also obtained 15 ultralong reads >145 Kb, among which six reads were >150 Kb in length, and captured the whole genome of the virus. To identify the genome isomers of the virus and their proportion, we performed dot plot analysis using the HSV-1 prototype (accession no. KU310657.1) as a reference genome. We observed four different forms of plots with different sequence files signifying the presence of all four isomers in our DNA preparation (Fig. 1B). We also annotated the 15 ultralong reads of which six were prototype (P), five were inverted long type (IL), one was inverted short type (Is) and three were inverted long and short type (ILS). Due to the relatively high error rate of Oxford Nanopore sequencing, we used Illumina MiSeq data to polish ONT genomes for each of the four isomers. Around 3 million short reads (75 bp) were obtained providing a mean mapped read depth of ~900 to 1000×. Following trimming with Trim Galore we used BWA-MEM (https://arxiv.org/abs/1303.3997) to map short reads to ONT sequences and corrected base errors (SNPs, indels, and ambiguous calls) using Pilon v1.24 (20). We performed four iterative cycles of genome polishing with BWA-MEM and Pilon after which no further gains were observed.

We quantified the percentage of isomers from the ONT sequence data by retaining reads which are >17 Kb in length using Geneious Prime software. The filtered sequences were annotated and segregated by picking sequences that had both inverted repeat sequence elements (IRL and IRS) with at least 1 Kb sequence on both sides to identify the US and UL genes on both sides. We then counted the annotated reads fitting to each isomer and quantified them. Among the 630 sorted quantifiable reads, prototype (P), inverted long (IL), inverted short (IS), and inverted long and short (ILS) isomers were found to be 27%, 21%, 23%, and 29%, respectively (Fig. 2). Similar to previous observations on isomer distribution, we detected the isomers to be present in nearly equal percentage in the sequenced genomic DNA preparation.

FIG 2.

FIG 2

Quantification of HSV-1 B3x 1.1 genome isomers. Quantification of HSV-1 B3x 1.1 genome isomers was performed using Geneious prime (version 2022.0.1). Reads that were more than 17 kb were retained and among which the sequences with both inverted repeat sequence elements (IRL and IRS) with at least 1 kb sequence on both ends were selected. Among the selected quantifiable reads prototype (P), inverted long (IL), inverted short (IS), and inverted long and short (ILS) isomers were found to be 27%, 21%, 23%, and 29%, respectively.

In conclusion, the manuscript presents an optimized method of DNA extraction and sequencing to capture the whole genome of the HSV-1 virus and its isomers as a single read. Although earlier studies identified these isomers, their significance is not completely understood. One study found that for the HSV-1 KOS strain, the 4 isomers were in equivalent ratios when grown in vitro, which is consistent with previous studies (8). However, when the same strain was isolated from the trigeminal ganglia of infected mice, the ILS isomer was predominant, suggesting that some isomers may have a fitness advantage in a living organism (8). The method in the manuscript was optimized to digest host cell DNA and any HSV DNA that is not packaged in virions, leaving only the DNA from encapsulated virions. Therefore, this analysis is likely comprised of DNA from intact HSV virions and would give a more accurate picture of the relative amounts of each isomer in the infective population of viruses. However, even DNA encapsulated in HSV virus particles may be noninfectious, and the relationship between isomers and infectivity is yet to be fully elucidated. The ultralong read sequencing technologies such as ONT have opened up new directions to study these isomers. In the future, sequencing clinical specimens from patients might provide more useful information about specific isomers and their virulence, or their role in causing infections in humans. Furthermore, this method could also be used to screen mutants of HSV for defects in isomerization, thereby contributing to the understanding of the basic mechanisms of isomer formation.

Data availability.

Full-length genome of HSV-1_B3x 1.1 isomers B3x 1.1A (P), B3x 1.1B (IS), B3x 1.1C (IL), and B3x 1.1D (ILS) were submitted in GenBank and their accession numbers are prototype (ON783214), inverted short (ON783215), inverted long (ON783216) and Inverted Long and short (ON783217). ONT Sequencing data and Illumina paired-end reads were deposited in the SRA database with the accession numbers SRR20796393 and SRR20796392 under the Bioproject ID PRJNA846339.

Supplementary Material

Reviewer comments
reviewer-comments.pdf (207KB, pdf)

ACKNOWLEDGMENTS

We acknowledge the NIH for our funding from grant R21AI156853-01 and grant R01AI026170-33A1.

WRJ is a co-inventor of Recombinant Herpes Vaccine Vector technologies, which was developed in the laboratory of WRJ and Betsy Herold, is described in two U.S. patent applications (10,980,874 and 9,999,665), and is licensed to X-Vax Technology, Inc. WRJ is a consultant and stockholder of X-Vax Technology, Inc., and WRJ receives financial support from X-Vax Technology, Inc. for research.

Footnotes

Supplemental material is available online only.

Supplemental file 1
Supplemental material. Download spectrum.02285-22-s0001.pdf, PDF file, 1.7 MB (1.7MB, pdf)

[This article was published on 30 August 2022 without a conflict of interest statement. The statement was added to the revised version, posted on 8 September 2022.]

Contributor Information

William R. Jacobs, Jr., Email: william.jacobs@einsteinmed.edu.

JJ Miranda, Barnard College, Columbia University.

REFERENCES

  • 1.Miranda-Saksena M, Denes CE, Diefenbach RJ, Cunningham AL. 2018. Infection and transport of herpes simplex virus type 1 in neurons: role of the cytoskeleton. Viruses 10:92. doi: 10.3390/v10020092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Arduino PG, Porter SR. 2008. Herpes simplex virus type 1 infection: overview on relevant clinico-pathological features. J Oral Pathol Med 37:107–121. doi: 10.1111/j.1600-0714.2007.00586.x. [DOI] [PubMed] [Google Scholar]
  • 3.James C, Harfouche M, Welton NJ, Turner KM, Abu-Raddad LJ, Gottlieb SL, Looker KJ. 2020. Herpes simplex virus: global infection prevalence and incidence estimates, 2016. Bull World Health Organ 98:315–329. doi: 10.2471/BLT.19.237149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.McGeoch DJ. 1987. The genome of herpes simplex virus: structure, replication and evolution. J Cell Sci Suppl 7:67–94. doi: 10.1242/jcs.1987.supplement_7.6. [DOI] [PubMed] [Google Scholar]
  • 5.Slobedman B, Zhang X, Simmons A. 1999. Herpes simplex virus genome isomerization: origins of adjacent long segments in concatemeric viral DNA. J Virol 73:810–813. doi: 10.1128/JVI.73.1.810-813.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Jenkins FJ, Roizman B. 1986. Herpes simplex virus 1 recombinants with noninverting genomes frozen in different isomeric arrangements are capable of independent replication. J Virol 59:494–499. doi: 10.1128/JVI.59.2.494-499.1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Bataille D, Epstein AL. 1997. Equimolar generation of the four possible arrangements of adjacent L components in herpes simplex virus type 1 replicative intermediates. J Virol 71:7736–7743. doi: 10.1128/JVI.71.10.7736-7743.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Mahiet C, Ergani A, Huot N, Alende N, Azough A, Salvaire F, Bensimon A, Conseiller E, Wain-Hobson S, Labetoulle M, Barradeau S. 2012. Structural variability of the herpes simplex virus 1 genome in vitro and in vivo. J Virol 86:8592–8601. doi: 10.1128/JVI.00223-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Jacob RJ, Morse LS, Roizman B. 1979. Anatomy of herpes simplex virus DNA. XII. Accumulation of head-to-tail concatemers in nuclei of infected cells and their role in the generation of the four isomeric arrangements of viral DNA. J Virol 29:448–457. doi: 10.1128/JVI.29.2.448-457.1979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hayward GS, Frenkel N, Roizman B. 1975. Anatomy of herpes simplex virus DNA: strain differences and heterogeneity in the locations of restriction endonuclease cleavage sites. Proc Natl Acad Sci USA 72:1768–1772. doi: 10.1073/pnas.72.5.1768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.McGeoch DJ. 1989. The genomes of the human herpesviruses: contents, relationships, and evolution. Annu Rev Microbiol 43:235–265. doi: 10.1146/annurev.mi.43.100189.001315. [DOI] [PubMed] [Google Scholar]
  • 12.Lee K, Kolb AW, Sverchkov Y, Cuellar JA, Craven M, Brandt CR. 2015. Recombination analysis of herpes simplex virus 1 reveals a bias toward GC content and the inverted repeat regions. J Virol 89:7214–7223. doi: 10.1128/JVI.00880-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Morse AM, Calabro KR, Fear JM, Bloom DC, McIntyre LM. 2017. Reliable detection of herpes simplex virus sequence variation by high-throughput resequencing. Viruses 9:226. doi: 10.3390/v9080226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Szpara ML, Parsons L, Enquist LW. 2010. Sequence variability in clinical and laboratory isolates of herpes simplex virus 1 reveals new mutations. J Virol 84:5303–5313. doi: 10.1128/JVI.00312-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Lopez-Munoz AD, Rastrojo A, Kropp KA, Viejo-Borbolla A, Alcami A. 2021. Combination of long- and short-read sequencing fully resolves complex repeats of herpes simplex virus 2 strain MS complete genome. Microb Genom 7:000586. doi: 10.1099/mgen.0.000586. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Karamitros T, Harrison I, Piorkowska R, Katzourakis A, Magiorkinis G, Mbisa JL. 2016. De Novo assembly of human herpes virus type 1 (HHV-1) genome, mining of non-canonical structures and detection of novel drug-resistance mutations using short- and long-read next generation sequencing technologies. PLoS One 11:e0157600. doi: 10.1371/journal.pone.0157600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chang W, Jiao X, Sui H, Goswami S, Sherman BT, Fromont C, Caravaca JM, Tran B, Imamichi T. 2022. Complete genome sequence of herpes simplex virus 2 strain G. Viruses 14:536. doi: 10.3390/v14030536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Petro CD, Weinrick B, Khajoueinejad N, Burn C, Sellers R, Jacobs WR, Jr, Herold BC. 2016. HSV-2 ΔgD elicits FcγR-effector antibodies that protect against clinical isolates. JCI Insight 1:e88529. doi: 10.1172/jci.insight.88529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kaugars K, Dardick J, de Oliveira AP, Weiss KA, Lukose R, Kim J, Leung L, Rajagopalan S, Wolin S, Akabas L, Knipe DM, Bajic G, Jacobs WR, Jr. 2021. A recombinant herpes virus expressing influenza hemagglutinin confers protection and induces antibody-dependent cellular cytotoxicity. Proc Natl Acad Sci USA 118:e2110714118. doi: 10.1073/pnas.2110714118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reviewer comments
reviewer-comments.pdf (207KB, pdf)
Supplemental file 1

Supplemental material. Download spectrum.02285-22-s0001.pdf, PDF file, 1.7 MB (1.7MB, pdf)

Data Availability Statement

Full-length genome of HSV-1_B3x 1.1 isomers B3x 1.1A (P), B3x 1.1B (IS), B3x 1.1C (IL), and B3x 1.1D (ILS) were submitted in GenBank and their accession numbers are prototype (ON783214), inverted short (ON783215), inverted long (ON783216) and Inverted Long and short (ON783217). ONT Sequencing data and Illumina paired-end reads were deposited in the SRA database with the accession numbers SRR20796393 and SRR20796392 under the Bioproject ID PRJNA846339.


Articles from Microbiology Spectrum are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES