Skip to main content
PLOS Pathogens logoLink to PLOS Pathogens
. 2021 Jan 19;17(1):e1008594. doi: 10.1371/journal.ppat.1008594

Intra-host changes in Kaposi sarcoma-associated herpesvirus genomes in Ugandan adults with Kaposi sarcoma

Jan Clement Santiago 1,#, Jason D Goldman 2,3,#, Hong Zhao 1, Alec P Pankow 1, Fred Okuku 4, Michael W Schmitt 2, Lennie H Chen 1, C Alexander Hill 1, Corey Casper 2,3,¤, Warren T Phipps 2,3, James I Mullins 1,2,5,6,*
Editor: Ashlee V Moses7
PMCID: PMC7845968  PMID: 33465147

Abstract

Intra-host tumor virus variants may influence the pathogenesis and treatment responses of some virally-associated cancers. However, the intra-host variability of Kaposi sarcoma-associated herpesvirus (KSHV), the etiologic agent of Kaposi sarcoma (KS), has to date been explored with sequencing technologies that possibly introduce more errors than that which occurs in the viral population, and these studies have only studied variable regions. Here, full-length KSHV genomes in tumors and/or oral swabs from 9 Ugandan adults with HIV-associated KS were characterized. Furthermore, we used deep, short-read sequencing using duplex unique molecular identifiers (dUMI)–random double-stranded oligonucleotides that barcode individual DNA molecules before library amplification. This allowed suppression of PCR and sequencing errors to ~10−9/base as well as afforded accurate determination of KSHV genome numbers sequenced in each sample. KSHV genomes were assembled de novo, and rearrangements observed were confirmed by PCR and Sanger sequencing. 131-kb KSHV genome sequences, excluding major repeat regions, were successfully obtained from 23 clinical specimens, averaging 2.3x104 reads/base. Strikingly, KSHV genomes were virtually identical within individuals at the point mutational level. The intra-host heterogeneity that was observed was confined to tumor-associated KSHV mutations and genome rearrangements, all impacting protein-coding sequences. Although it is unclear whether these changes were important to tumorigenesis or occurred as a result of genomic instability in tumors, similar changes were observed across individuals. These included inactivation of the K8.1 gene in tumors of 3 individuals and retention of a region around the first major internal repeat (IR1) in all instances of genomic deletions and rearrangements. Notably, the same breakpoint junctions were found in distinct tumors within single individuals, suggesting metastatic spread of rearranged KSHV genomes. These findings define KSHV intra-host heterogeneity in vivo with greater precision than has been possible in the past and suggest the possibility that aberrant KSHV genomes may contribute to aspects of KS tumorigenesis. Furthermore, study of KSHV with use of dUMI provides a proof of concept for utilizing this technique for detailed study of other virus populations in vivo.

Author summary

Kaposi sarcoma (KS) is a leading cancer in sub-Saharan Africa and in persons with HIV co-infection. Kaposi sarcoma-associated herpesvirus (KSHV, also referred to as human herpesvirus-8, or HHV-8) is the etiologic agent of KS, but the factors that contribute to the development of KS, which occurs in only a small subset of infected individuals, remain largely unknown. While strain differences or mutations in other tumor viruses are known to affect the risk and progression of their associated cancers, whether genetic variation in KSHV is important to the natural history of KS is unclear. Most studies of KSHV diversity have only characterized ~4% of its 165-kb genome, and the observed variation in some studies is likely to have been impacted by PCR or cloning artifacts. To precisely define genomic diversity of KSHV in vivo, we evaluated full-length viral genomes (except the internal repeat regions) using a technique that greatly lowers sequencing error rates and thus measures genomic diversity much more accurately than previous studies. In addition, we extended our analyses to look for potential tumor-specific changes in the KSHV genomes by examining viruses in both tumor and non-tumor tissues. To these ends, we performed highly sensitive, single-molecule sequencing of whole KSHV genomes in paired KS tumors and oral swabs from 9 individuals with KS. We found that KSHV genomes were virtually identical within the 9 individuals, with no evidence of quasispecies formation or multi-strain infection. However, KSHV genome aberrations and gene-inactivating mutations were found to be common in KS tumors, often impacting the same genes and genomic regions across individuals. Whether theses mutations influence KS tumorigenesis or result from genomic instability commonly found in tumors warrants further study. Lastly, aberrant KSHV genomes were found to be shared by distinct tumors within individuals, suggesting the capacity of KS tumor cells to metastasize and seed new lesions.

Introduction

Kaposi Sarcoma (KS) is one of the most common cancers of HIV-infected individuals [1,2], with the burden of disease disproportionately borne by people in sub-Saharan Africa [3]. A gamma herpesvirus, Kaposi sarcoma-associated herpesvirus (KSHV), is the etiologic agent of KS and is consistently detected in tumor tissues [4,5]. KSHV is shed in saliva, which is thought to be a primary mode of transmission [68]. Only a small fraction of KSHV infections progress to KS, and the factors contributing to this progression are poorly understood. The development of KS is often associated with HIV infection and immunosuppression [9], but others factors, including KSHV genome variation, may contribute to differential outcomes of KSHV infection.

Studies of other human oncogenic viruses have revealed that viral genetic variation or de novo mutations may be important to their pathogenicity, as is the case for cancers associated with human papilloma viruses and Merkel cell polyomavirus [10]. Epstein Bar virus (EBV), another gamma-herpesvirus like KSHV, is associated with a variety of neoplasms. In EBV infections, intra-host selection and compartmentalization of viruses may have a role in pathogenesis of EBV-associated cancers [1113]. Additionally, EBV strains isolated from nasopharyngeal carcinomas (NPC) have unique genomic [14,15] and phenotypic [1618] variations compared to other strains and isolates from geographically clustered individuals without cancer [19]. NPC-associated strains were also found to have increased tropism for epithelial- versus B-cells [16,20]. These findings suggest that viral genetic heterogeneity can affect EBV virulence. Whether KSHV genetic variation can similarly influence KS pathogenesis or manifestation is unknown.

KSHV encodes oncogenes that dysregulate cell cycle, cell-to-cell adhesion, inflammation and angiogenesis [2123], and it is therefore plausible that heterogeneity in these genes might result in differences in KSHV infection outcomes and clinical manifestations. For example, polymorphisms in the microRNA (miRNA) region of KSHV have been correlated with the development of multicentric Castleman disease and KSHV-associated inflammatory cytokine syndrome with and without KS [24,25]. K1, the most variable KSHV gene, is conventionally used for KSHV subtyping. There are reports of certain K1 subtypes being associated with more aggressive KS [2628], although correlations of KSHV genetic subtypes with worse KS clinical presentation or outcomes has not been consistently observed [2932].

Studying whole KSHV genomes provides a far more comprehensive picture of diversity than the variable regions alone. Most publicly available KSHV genome sequences have been reported in only the last 5 years [3337], and they demonstrate that polymorphisms in the ~130-kb KSHV non-repeat genomic region outside of the K1 gene contribute much more to KSHV diversity than the 0.9 kb hypervariable K1 gene by itself [33,34]. KSHV genomes have signatures of ancient recombination [36,38], resulting in mosaic genomes that could complicate disease risk association solely with K1 subtypes. It is possible that infecting KSHV strains may have similar pathogenicity, as principal component analysis of 70 KSHV whole genomes from saliva of different individuals with and without KS did not reveal strain specificity for KS [36]. However, further studies are needed to evaluate the possible impact of KSHV strain variation on KS development and clinical outcomes.

Whether KSHV infection commonly displays intra-host diversity is also unclear. Some studies examining virus in different anatomic sites or multiple isolates from an individual have reported detection of KSHV quasispecies, multi-strain infections [3944] and intra-host KSHV viral evolution [37,42,45], while some studies of individuals with AIDS-associated KS have only reported a single persisting strain [4649]. Apparent recombination events in the evolutionary history of KSHV [36,38,46,50,51] do suggest that co-infection of divergent KSHV strains occur, at least sporadically. However, limitations of commonly employed PCR technologies can undermine reliable interpretation of observed intra-host KSHV variation. For example, assessment of intra-host diversity can be easily biased by artifacts introduced during sample preparation and when sequencing PCR products [47]. Short read, next generation sequencing can also have high error rates due to PCR misincorporation, end-repair artifacts, insufficient sequencing depth, and DNA damage from long, repeated high-temperature incubations during PCR and enrichment reactions [5255]. Application of newer and more sensitive sequencing technologies is needed to further reliably assess whether intra-host viral diversity is a common feature of KS.

To much more accurately assess minor intra-host KSHV sequence variation as well as tumor specific changes than has been done previously, we determined viral genome sequences in distinct anatomic compartments (oral and tumor sites), using a highly sensitive short-read sequencing method termed “duplex sequencing” [56]. This method incorporates duplex unique molecular identifiers (dUMI), which are double-stranded strings of random base pairs used to barcode individual DNA molecules before PCR amplification and enrichment [56]. By utilizing dUMI-consensus reads of each DNA molecule in a sample library, PCR-associated errors are reduced to ~10−9, revealing the original sequence variation within a sample [56]. In the present study, we report the results of successfully enriching and duplex sequencing whole KSHV genomes from tumors and oral swabs from 9 Ugandan adults with HIV-associated KS.

Methods

Ethics statement

All participants provided written informed consent. This protocol was approved by the Fred Hutchinson Cancer Research Center Institutional Review Board, the Makerere University School of Medicine Research and Ethics Committee (SOMREC), and the Uganda National Council on Science and Technology (UNCST).

Study cohort and specimen collection

Specimens were obtained from participants enrolled in the “HIPPOS” Study, an ongoing prospective cohort study, begun in 2012, of KS patients initiating treatment at the Uganda Cancer Institute (UCI) in Kampala, Uganda. Participants were eligible for the HIPPOS study if they were >18 years of age with biopsy-proven KS, and naïve to antiretroviral therapy (ART) and chemotherapy at enrollment. Participants attended 12 study visits over a one-year period and received treatment for KS consisting of ART and chemotherapy (a combination of bleomycin and vincristine or paclitaxel) per standard protocols by UCI physicians. At each visit, participants received a detailed physical exam to assess clinical response using the ACTG KS response criteria [57].

Participants provided plasma samples at each visit for KSHV, CD4 and HIV viral load testing, and in addition, provided up to 12 biopsies of KS lesions before, during, and after KS treatment. KS tumor biopsies were obtained using 4mm punch biopsy tools after cleaning the skin with alcohol, and either snap-frozen at the clinic site and stored in liquid nitrogen (LN2) or placed in RNAlater (Sigma-Aldrich, Cat. # R0901) and stored at -80°C. Study clinicians collected swabs of the oral mucosa at each study visit and participants self-collected oral swabs at home for 1 week after the visit after education on the sample collection technique, as previously validated by our group in Uganda [58]. Briefly, a Dacron swab is inserted into the mouth and vigorously rubbed along the buccal mucosa, gums, and hard palate. The swab is then placed in 1 mL of filter-sterilized digestion buffer [59] and stored at ambient temperature [60] before being placed at -20°C for storage.

DNA preparation

DNA was extracted from 300μL homogenized tumor lysates using the AllPrep DNA/RNA Mini Kit (QIAGEN, Cat. # 80204) and eluted into 100μL EB Buffer. For oral swab specimens, DNA was extracted from the swab tip eluate using the QIAamp Mini Kit (Qiagen, Cat. # 51304) following the manufacturer’s protocol. Purification of DNA from saliva stabilized in RNAprotect Saliva Reagent (Qiagen) was performed following the manufacturer’s protocol with the following modifications: There was no initial pelleting or PBS wash, 20 μL proteinase K was used per 200 μL specimen, and DNA was eluted in 50 μL water. DNA was quantified using a NanoDrop 1000 Spectrophotometer (ThermoScientific).

PCR

All PCR preparations were done in a PCR-clean room, except for the addition of control templates. PCR was conducted using the PrimeSTAR GXL kit (Takara, Cat. # R050B) with ThermaStop (Thermagenix) added. Cycling conditions were: 98°C for 2 mins; 35 cycles of 98°C for 10 secs, 60–65°C (depending on primer) for 15 secs, 68°C for 1min/kb; 68°C for 3 mins and then hold at 4°C. Primer sequences are listed in S1 Table.

Copy number quantification

KSHV genome copy numbers were quantified by droplet digital PCR (ddPCR) using the QX200 Droplet Generator and Reader (Bio-Rad), with ddPCR SuperMix for Probes (No dUTP) (Bio-Rad, Cat. # 186–3024). Primers and probes (S1 Table) were designed to detect 4 KSHV-unique genes K2/vIL-6, ORF16/vBCL-2, ORF50/RTA and ORF73/LANA (Fig 1A), with KSHV genome copy number reported as the average of the 4 measures. 420 ng BCBL-1 cell line DNA diluted 1:10,000 (~475 genome copies) was used as positive control, 1 ng human genomic DNA (Bioline, Cat. # BIO-35025) as negative control, and water as no template control. Cycling conditions were: 95°C for 10 mins; 40 cycles of: 94°C for 30 secs, 56°C for 30 secs, 60°C for 1 min; one cycle at 98°C 10 for mins, and then hold at 12°C. The KSHV on-target percent was calculated using the copy number quantification by ddPCR normalized to the total nucleic acid concentration.

Fig 1. KSHV genomes in BCBL-1 cells have low point mutational diversity.

Fig 1

(A) Schematic representation of a linear KSHV genome, with genes colored in green and the major repeat regions in orange. The locations of the K1, vIL-6, vBCL-2, RTA, LANA and K15 genes used for genome quantitation are indicated in green type. (B) Raw (light blue), sUMI-consensus (blue) and dUMI-consensus (dark blue) read coverage along the de novo assembled, BCBL-1 KSHV genome. (C) Bubble plot of minor sequence variants. Each bubble represents a position within the genome at which a variant base or indel was detected, colored by whether they were predicted to be silent or protein-altering mutations. Mutations likely to be silent included synonymous and intergenic point mutations, while protein-altering mutations included non-synonymous, nonsense and frameshift mutations. Bubble height represents variant base frequency among dUMI-consensus reads at that position. Vertical grey columns represent masked repeat regions in which no reliable alignments were possible.

UMI-addition and library preparation (Fig 2)

Fig 2. Workflow for analyzing intra-host KSHV genome diversity from clinical samples.

Fig 2

Each study participant contributed multiple KS tumors and oral swabs, but only those samples with the highest viral loads were reported here. Sequencing libraries were prepared from DNA extracts from each sample with adaptors containing duplex Unique Molecular Identifiers (dUMIs, see S1 Fig). Adaptor-labelled DNA libraries were enriched using biotinylated RNA baits homologous to KSHV sequences. Captured DNA was PCR-amplified to levels sufficient for Illumina HiSeq sequencing. For most samples, libraries were subjected to a second round of enrichment and PCR amplification. Upon sequencing, whole KSHV genomes were first assembled de novo from each sample without the use of dUMIs. The sample-specific genomes generated (sample-consensus) were then used as reference to map the consensus of reads with identical dUMI-tags (i.e., dUMI-consensus reads).

To obtain ~500-bp DNA fragments, 10–20 ng/μL of DNA extract in 100 μL chilled TLE buffer (10mM Tris, pH8.0, 0.1mM EDTA) was sheared using a Bioruptor (Diagenode) on high power for up to 15 min. Fragment sizes were assessed on 1.5% agarose gels. Sheared DNA was bead-purified using 1.2X volume of Agencourt AMPure XP Beads (Beckman Coulter Cat. # A63880) and eluted in 50 μL water. Library preparation (end repair, A-tailing and adapter-ligation) was performed using the KAPA HyperPrep Library Preparation Kit (Cat. # KR0961/KK8503). Double-stranded DNA adapters contained a random 12-bp dUMI sequence and a defined 5-bp spacer sequence added to Illumina TruSeq adaptor sequences [61] (S1 Fig). Subsequently, DNA was bead-purified with 1X volume of beads and eluted in 50 μL water.

DNA libraries were subjected to pre-enrichment amplification with primers mws13 & mws20 (S1 Table and S1 Fig). PCR conditions were: 95°C 4 mins; 5–8 cycles of 98°C 20 sec, 60°C 45 sec, 72°C 45 sec; 72°C 3 mins, 4°C hold. If the bead-purified elution from the end repair and adapter step had more than 240 ng total, it was divided into 50 μL PCR reactions of ≤240 ng and pooled after amplification. PCR products were then bead-purified as above with 1.2X volume beads and elution in 100 μL water, quantified with Nanodrop, and their sizes assessed using a Bio-analyzer (Agilent DNA 7500) or Qiaxcel (QIAGEN AM420).

Library enrichment & sequencing

Biotinylated RNA baits for enriching KSHV sequences in the library were those designed in [62] and were obtained from Agilent, Inc. (Santa Clara, CA). The design was a 120-bp, 12X tiling of the genome of KSHV isolate GK18 (Genbank ID: AF148805.2). The diversity of the bait library was further increased by including K1, ORF75, K15, ORF26 and TR sequences of strains JSC-1 (Genbank ID: GQ994935.1), DG1 (Genbank ID: JQ619843.1), BC-1 (Genbank ID: U75698.1), BCBL-1 (Genbank ID: HQ404500.1), Sau3A (Genbank ID: U93872.2), and of all Western and African isolates in [29,33] (Genbank ID: AF130259, AF130266, AF130267, AF130281, AF130305, AF133039, AF133040, AF133043, AF133044, AF151687, AF171057, AF178780, AF178810, AF220292, AF220293, AY329032, KT271453-KT271468).

Target enrichment was performed using SureSelect Target Enrichment Kit v1.7 (Agilent) with all suggested volumes reduced by half. DNA hybridized to biotinylated-RNA baits was captured with streptavidin beads (Dynabeads MyOne Streptavidin T1, Invitrogen) and resuspended in 20μL water. The DNA-streptavidin bead mixture was used directly in post-enrichment PCR amplification with primers mws13 and mws21, the latter of which includes a sample index sequence (S1 Table and S1 Fig). The PCR cycle number ranged from 10–16, with products monitored every 2 to 3 cycles on a TapeStation (Agilent) to ensure correct fragment sizes (~500bp). When over-amplification resulted in library fragment sizes much larger than expected, a single “reconditioning”PCR cycle with fresh reagents was done [63]. PCR products were cleaned using 1.2X volume AMPure XP beads and the eluted DNA library was sequenced using Illumina HiSeq 2500 with 100-bp paired end reads. For some tumor samples with low KSHV copy numbers and all oral swab samples, a second library enrichment was performed.

De novo assembly of sample-consensus genomes

Initially, a sample-consensus KSHV genome (Fig 2) was generated de novo from paired-end reads of each sample using custom scripts (S2 Fig, https://github.com/MullinsLab/HHV8-assembly-SPAdes). At this stage, the first 17-bp from read ends were trimmed to remove dUMI sequences. Next, reads were subjected to windowed quality filtering using sickle pe [64] with a quality threshold of 30 and a window size 10% of read length. Filtered reads were aligned to a human genome (GRCh38 p12, GenBank GCA_000001405.27) using bwa mem [65]. Unmapped reads were used as input into the de novo assembler SPAdes v3.11.1 [66], with the setting -k 21,35,55,71,81. This often yielded 3 to 4 scaffolds that together encompassed the entire 131-kb unique sequence regions of KSHV, bounded by the major repeat regions: Internal Repeat 1 (IR1), Internal Repeat 2 (IR2), LANA central repeat and Terminal Repeats (TR) (Fig 1A). Next, all scaffolds over 500 bp were aligned using bwa mem to the genome of reference KSHV isolate GK18. From the aligned scaffolds a draft genome was generated in Geneious (Biomatters, Ltd) with manual correction as needed. To finish the assembly, GapFiller v1.1 [67] was used, setting bwa as the aligner and filtered paired-end reads as the input library. The genomes were annotated in Geneious based on the GK18 reference, also adding the annotation for long non-coding RNA T1.4 based on [68]. The major repeat regions were masked with Ns since they were poorly resolved by assembly of short reads that can map to multiple locations within the repeat regions.

Variant identification from dUMI-consensus reads

Paired-end reads, including their dUMI sequence tags, were mapped by bwa to sample-consensus genomes (Fig 2) using a Makefile adapted from [61] (https://github.com/MullinsLab/Duplex-Sequencing). Briefly, all reads mapping to the same genomic position were collapsed by single strand UMIs (sUMI) to make sUMI-consensus reads (S2 Fig). Complementary UMI tags from opposing strands were matched to create dUMI-consensus reads, thus removing nearly all PCR polymerase misincorporation and chimera artifacts. Nine bases from both read ends were then trimmed to minimize read end artifacts. Discrepancies between mapped dUMI-consensus reads and the sample-consensus genomes were manually inspected in Geneious and misalignments around homopolymer runs were corrected. Only the remaining discrepancies were considered to be sequence variants that existed prior to PCR amplification.

All genome and subgenome sequence alignments were generated using MAFFT [69] [algorithm FFT-NS-i x1000, scoring matrix 1PAM/k = 2], and all phylogenetic trees were inferred using RAxML [70] (-f d, GTR gamma, N = 100 starting trees), using a representative KSHV genome from each individual. The NeighborNet phylogenetic network was generated using SplitsTree5, excluding gap sites [71]. Consensus genome sequences were deposited in GenBank (Accession numbers MT510648MT510670, MT936340, see Table 1) annotated with the genomic rearrangements, when present.

Table 1. Origin and processing results from specimens for KSHV genome analysis.

Pt ID Age Sex Plasma HIV RNA (copies/mL) CD4 T cells /μL Sample ID Sample Type % on-target Median read coverage Standard Deviation Median dUMI-consensus read coverage Standard Deviation Genome length (excluding masked repeats) # positions with variants Mean frequency of variant base Genbank Accession
Pre-enrichment Post-enrichment
n/a n/a n/a n/a n/a BCBL-1 cell line 0.0009% 15.61% 16,805 1,858 302 52.49 132,676 81 1.35% MT936340
U003 25 M 759,635 45 U003-C Tumor 0.0037% 7.15% 16,265 1,715 270 30.47 131,292 21 1.80% MT510648
U003-o1 Oral swab 0.0000% 35.20% 49,598 26,446 7 6.22 131,102 12 - MT510649
U003-o2 Oral swab 0.0003% 31.60% 66,527 13,995 310 68.48 131,129 218 0.51% MT510650
U003-o3 Oral swab 0.0000% 41.90% 65,984 21,979 47 21.31 131,143 24 - MT510651
U004 37 M 277,655 85 U004-C Tumor 0.0016% 17.20% 19,582 3,078 34 7.41 131,277 12 - MT510663
U004-D Tumor 0.0017% 75.10% 60,340 10,253 1,291 280.27 131,237 73 0.73% MT510665
U004-o1 Oral swab 0.0001% 18.90% 37,794 6,280 56 11.70 131,277 31 - MT510664
U007 26 M 91,096 136 U007-B Tumor 0.0006% 86.90% 75,001 19,771 1,172 268.44 131,352 156 0.17% MT510654
U007-o1 Oral swab 0.0001% 27.60% 43,707 6,258 206 25.54 131,126 61 1.93% MT510655
U008 56 M 860,937 488 U008-B Tumor 0.0049% 16.97% 19,309 4,168 637 148.42 131,142 31 0.93% MT510656
U008-D Tumor 0.0040% 29.98% 26,209 4,674 195 42.95 131,102 24 0.70% MT510657
U008-o1 Oral swab 0.0000% 23.50% 48,339 6,869 114 16.62 131,116 76 2.11% MT510658
U020 27 M 118,191 370 U020-B Tumor 0.0070% 1.47% 2,913 14,971 24 196.60 131,102 10 2.40% MT510666
U020-C Tumor 0.0003% 34.38% 420 3,674 3 2.51 131,471 30 - MT510667
U020-o1 Oral swab 0.0000% 77.30% 73,287 22,429 80 21.66 131,115 22 - MT510668
U023 33 F 338,285 191 U023-o1 Oral 0.0001% 2.70% 19,889 6,683 2 1.30 131,122 2 - MT510669
U030 40 M 100,184 70 U030-C Tumor 0.0135% 15.68% 38,935 6,664 490 62.59 131,282 17 1.08% MT510670
U032 23 F 587,149 274 U032-B Tumor 0.0003% 43.50% 69,815 9,269 890 67.12 131,266 45 0.41% MT510652
U032-o1 Oral 0.0000% 7.70% 7,311 3,218 1 0.80 130,842 - - MT510653
U034 47 F 130,375 237 U034-B Tumor 0.0014% 84.40% 74,927 13,969 1,747 189.51 131,248 107 0.11% MT510659
U034-C Tumor 0.0013% 30.70% 17,968 2,505 133 24.36 131,088 11 1.46% MT510660
U034-o1 Oral swab 0.0000% 7.30% 7,851 2,818 2 1.22 130,754 2 - MT510661
U034-o2 Oral swab 0.0000% 6.00% 4,085 1,735 1 0.62 130,884 - - MT510662

Integration analysis

Systematic searches for KSHV integration into the human genome were done in two ways. First, each library was searched using local BLASTN against both human and KSHV sequences and then using the Perl script SummonChimera [72] to extract coordinates of potential integration sites. Second, a sample-consensus KSHV genome was appended as an extra chromosome to the human genome reference GRCh38 p12. The appended human genome reference was used to map sUMI consensus reads via Speedseq [73] to generate alignment files with only discordant or split reads. These were input into LUMPY for structural variant analysis [74]. Human chromosome sequences linked to KSHV sequences were taken to be putative integration sites.

Results

Assessment of the dUMI sequencing protocol with a KSHV infected cell line

As part of the optimization of the dUMI-sequencing protocol, KSHV genome sequences were first obtained from an early passage of BCBL-1, a KSHV-infected PEL cell line [75]. BCBL-1 cells were grown as previously described [76]. After DNA extraction, KSHV sequences corresponded to ~0.16% of the total DNA using a ddPCR assay for ORF73 and T0.7-K12, and normalized by comparison to the human gene POLG (DNA polymerase subunit gamma). Following a single round of bait capture, the fraction of sequence reads corresponding to KSHV from BCBL-1 DNA extracts (i.e., the “on-target” level), was 15.6%, corresponding to 173-fold enrichment.

Sequencing of the BCBL-1 KSHV genome produced a median coverage of 16,805 reads per base excluding the repeat regions. Collapsing raw reads by identical sUMI to generate sUMI-consensus reads resulted in a median of 2,677 sUMI reads per base. When collapsed further into consensus sequences derived from both strands, a median of 302 dUMI reads per base was obtained that were essentially free of PCR errors (Table 1 and Fig 1B). Since each dUMI tags a unique DNA molecule before PCR, the number of unique dUMI tags indicates the number of unique viral templates sequenced [56,77]. Using this measure, 302 also approximated the number of KSHV genomes sampled from BCBL-1.

Eighty-one base positions (0.06%) in the BCBL1 consensus KSHV genome had detectable variants in dUMI-consensus reads, and the average frequency of minor variants was 1.35%. No variant exceeded 14% of the total dUMI-consensus reads at any position (Fig 1C). No doubling of read coverage was found within the 9-kb genomic region previously reported in the BCBL-1-derived KSHV recombinant clone BAC-36 [78].

The consensus, de novo-assembled KSHV genome in BCBL-1 had 3 differences from the published BAC-36 sequence: a C➔A change in the noncoding sequence before ORF K5 (BAC-36 position 24,630), 2 additional Gs in a homopolymer run at BAC-36 position 25,210), and a synonymous T➔C change in the K7 gene (BAC-36 position 28,409). No variant bases were found in dUMI-consensus reads at the equivalent positions of this BCBL-1 KSHV sequence, indicating that the 3 BAC-36 sequence variants were not present in this passage of the BCBL-1 line at detectable levels (i.e., <1 copy per 302 genomes).

KSHV sequence derivation from tumor tissues and oral swabs

KSHV genome sequences were successfully obtained from samples provided by 9 participants with HIV-associated KS, including 12 KS tumors and 11 oral swabs. (Table 1). The representation of KSHV DNA in a sample was determined by ddPCR analysis of KSHV genes vIL-6, vBCL-2, RTA and LANA (Fig 1A) and provided as the percentage “on-target” KSHV DNA. These levels ranged from 0.03% to 1.35% (median 0.17%) in tumors, while most oral swab samples were below 0.01% on-target (Table 1). Following one enrichment with RNA baits, KSHV DNA corresponded to a median of 1.3% on-target, and after a second enrichment a median of 24.2% on-target, for a median final enrichment of 123,955-fold.

Median read coverage across KSHV genomes, excluding the major repeat regions, was 22,896 for tumors and 37,794 for oral swab samples. After collapsing mapped reads by dUMI, the median dUMI-consensus read coverage was 380 for tumors and 27 for oral swabs (Table 1 and S3A and S3B Fig). The lower dUMI-consensus read coverage of oral swab KSHV sequences, despite having higher raw read coverage than in tumors, was due to oral swab sample libraries having lower amounts of KSHV DNA and higher proportions of PCR duplicates. This resulted from low viral genome input requiring more rounds of enrichment and PCR cycles, and more significant DNA degradation during storage. Indeed, only 11 of 43 oral specimens attempted yielded acceptable library quality for whole KSHV genome sequencing, compared to 12 of 13 tumor samples. Since the median dUMI-consensus read coverage corresponds to the number of viral genomes sampled, tumor U032-B had the highest number of genomes analyzed at 1,653. We set the lowest number of genomes accepted for confident assignment of variant frequencies to be 100 (S4A Fig); below this number dUMI-consensus read coverage was judged to be too sparse. U020-B was an exception due to most genomes having a large deletion, to be discussed below. For other samples with sequenced genomes below 80, dUMI-consensus reads generated were insufficient to cover the entire KSHV genome, even if whole KSHV genomes could be assembled from raw reads. Overall, read coverage was relatively uniform along the KSHV genome for most tumors (S3A Fig) and all adequately sampled oral swabs (S3B Fig).

Very few point mutations were found in dUMI-consensus reads from either tumors (Fig 3A) or oral swabs (Fig 3B). Excluding the major repeat regions, the number of genome positions with a detectable intrasample variant base ranged from 2–218 (<0.01–0.17%) (Table 1). These frequencies were lower or comparable to those in the BCBL-1 cell line, although clinical samples had detectable variation in long homopolymer runs not observed in the BCBL-1 viruses. The sample-consensus genome was generally the only KSHV sequence present in each sample, hence, there was no evidence for the existence of quasispecies [79].

Fig 3. Point mutational diversity in KSHV genomes from tumors and oral swabs.

Fig 3

Bubble plots of minor sequence variants remaining after removal of PCR errors, in KSHV genomes from tumors (A) and oral swabs (B). Each bubble represents a variant base or indel, colored by whether they were predicted to be silent or protein-altering mutations. Silent mutations include synonymous and intergenic point mutations, while protein-altering mutations included non-synonymous, nonsense and frameshift mutations. Hollow circles represent mutations occurring in homopolymer runs. Bubble heights represent the frequency of the variant base among dUMI-consensus reads at that position. Vertical gray columns represent the masked repeat regions. The region containing the K1 gene is indicated with arrows at the bottom of the figure.

Artifacts resulting from the end-repair step in DNA library preparation, which precedes the application of dUMI tags, cannot be corrected by duplex sequencing [56,61,80]. Hence, 9 bases were trimmed from ends of dUMI-consensus reads before analyses, substantially reducing the variation observed in the raw reads. The minor base variants remaining in all samples revealed a preponderance of C➔A and G➔T substitutions (S4B Fig) as well as differences in homopolymer run lengths (Fig 3A and 3B). However, most minor variants were supported by only one dUMI-consensus read. Overall, mean variant frequency and median dUMI-consensus read coverage were inversely correlated (S4A Fig). Since the remaining variation cannot be distinguished from artifacts, true minor variant frequencies could be even lower than reported here.

KSHV genomes were virtually identical at the point mutational level between tumors and oral swabs from the same individual

Intra-individual single nucleotide differences between tumor and oral swabs ranged in number from 0–2 across entire ~131-kb genomes, not counting the major repeat regions. Notably, there were almost no intra-individual polymorphisms in the KSHV hypervariable gene K1 (Fig 3A and 3B). Hence, no evidence for minor KSHV variants or multi-strain infections was found in these individuals.

KSHV genomes were distinct across the 9 participants, with sequence differences ranging from 3.06–4.85%. They included K1 subtypes A5, B1 and C3 (Fig 4A) and K15 alleles P and M (Fig 4B). While K1 and K15 are the most variable KSHV genes, polymorphisms along the rest of the genome have been reported to contribute more in aggregate to the total diversity of KSHV [33,34,36]. Consistent with this, maximum-likelihood phylogenetic trees using entire KSHV genomes (Fig 4C) were topologically distinct from those of K1 or K15. Moreover, due to signatures of recombination in the evolutionary history of KSHV [36,38], differing phylogenies along sections of the KSHV genome may be better represented by a phylogenetic network (Fig 4D), in which higher degrees of conflict result in a more web-like structure rather than a tree.

Fig 4. KSHV phylogenetic relationships in variable regions K1 and K15 and whole genomes.

Fig 4

Phylogenetic trees of (A) K1 genes, (B) K15 genes and (C) whole genomes from this study and of select genomes from other publications. K1 and K15 subtypes are indicated in the K1 (A) and K15 (B) trees. (D) A neighbor-net phylogenetic network of all published KSHV genomes to date, color-coded by genome types proposed in [36]: P1 in green, P2 in blue, N in purple, M1 in brown and M2 in red. All de novo-assembled genomes from this study are in bolded italics.

Aberrant KSHV genome structures in tumors

Among the 12 tumor-derived KSHV genomes examined, 7 had anomalous read coverage that shifted abruptly once or twice along the viral genome (S3A Fig). In contrast, oral swab KSHV genomes from the same individuals had uniform read coverage while being identical in sequence. This argues against preferential target capture by RNA baits or sequencing biases. Repeating the enrichment and sequencing, conducted on some samples, reproduced their distinctive read coverages. Split reads accumulated at the points of abrupt shifts in read coverage, even after collapsing reads by their dUMI consensus, indicating that these were not PCR template switching artifacts. Specific viral genomic anomalies observed are detailed below, along with any additional evidence showing that they represented real structural aberrations in viral genomes:

Tumor U003-C

Read coverage in U003-C was high (average of 15,635) and uniform across the KSHV genome except for a 6-bp gap within the K8.1 gene intron up to the first base of the second K8.1 exon (Fig 5A). No read indicated a deletion, nor was any read found with its mate pair located across the 6-bp gap. This region was PCR amplified from unsheared U003-C tumor DNA using conserved primers flanking the gap (Fig 5B), and no PCR product was detected. In contrast, an intact K8.1 intron sequence was amplified and sequenced from a tumor from another person (Fig 5C).

Fig 5. KSHV genomes in the U003-C tumor harbor a deletion within the K8.1 gene.

Fig 5

(A) Read coverage of the U003-C KSHV genome, showing a 6-bp gap (red arrow) where no read pairs were mapped. (B) Cartoon of the de novo-assembly sequences generated at either side of the gap, both ended within the K8.1 gene intron and continued into terminal repeat (TR) sequences. Green and yellow arrows show the directions of the K8.1 gene and terminal repeat sequences, respectively. Blue arrows show the position of PCR primers used to confirm breakpoint junctions, with the expected PCR product sizes. (C) PCR products generated from U003-C tumor DNA using primers flanking the gap. The 443-bp PCR product expected if the K8.1 gene intron was intact was not detected from U003-C (left column), and was detected in tumor U007-B (right column) from another person. (D) Hemi-nested PCR of U003-C tumor DNA for the K8.1-TR (left) and TR-K8.1 (right) junctions produced products of the predicted sizes. These structures were confirmed by Sanger sequencing. No K8.1-TR or TR-K8.1 junction fragment was produced from BCBL-1 DNA. The light bands at the TR-K8.1 lane under BCBL-1 were determined from Sanger sequencing to be amplicons generated from the forward primer sequence (indicated with * in panel B) overlapping with K8.1; this primer was used since the rest of the connected TR sequence assembled was GC-rich and unsuitable for primer design.

De novo assembly revealed that the reverse complement of TR sequences continued from the gap position (Fig 5B). K8.1-TR junctions were confirmed by PCR with primers flanking the junctions (Fig 5D) and Sanger sequencing. Inversion of the 60-kb 3’ end of the U003-C genome, starting inside K8.1, is a parsimonious explanation for the breakpoints.

Tumor U004-D

The first 3kb, from K1 to the end of gene ORF4, had 1.5X read coverage compared to the rest of the KSHV genome (S3A Fig). However, no split reads or chimeric read pairs were found to explain this result from a genome rearrangement or deletion.

Tumor U008-B and D

U008-B had 1.7X greater read coverage over a 14.8-kb segment from inside K3 to inside ORF19 (GK18 reference positions 19,168 to 33,980, Fig 6A), including IR1 (masked). This was corroborated by ddPCR quantitation of vBCL-2, inside the 1.7X coverage region, with 1.7–1.9-fold higher gene copy number in the tumor compared to the vIL-6, RTA and LANA regions (Table 2).

Fig 6. KSHV genomes in two tumors from participant U008 had a 14.8-kb region flanking Internal Repeat 1 (IR1) duplicated and translocated to Internal Repeat 2 (IR2).

Fig 6

Total, sUMI and dUMI-consensus read coverage of tumor B (A) and D (B) genomes from individual U008. (C) Annotations of the region with 1.5-2X read coverage, with genes in green, repeat regions in orange, and long non-coding RNAs in red. Many reads on the edge of this region continue into IR2 (red arrows). Annotations are from the KSHV reference isolate GK18. (D) Cartoon showing the duplication of the 14.8 kb region into IR2 and the PCR primers used to examine the genomic rearrangement in unsheared tumor DNA extracts from tumors U008-B and U008-D. PCR products produced from primer pairs numbered in D from U008-B and BCBL-1 (E) and in U008-D (F). All visible bands were excised from the agarose gel and sequenced, confirming the indicated junction sequences. Primer pairs # 9–12 produced no PCR products discernible on an agarose gel.

Table 2. Gene copy numbers in tumor or oral swab DNA.
Sample vIL-6 vBCL-2 RTA LANA
U003-C N/A N/A N/A 9,664
U003-o1 127 100 120 135
U003-o2 1,298 1,238 1,452 1,628
U003-o3 191 184 169 199
U004-C 1,243 1,274 1,205 998
U004-D 4,433 4,466 4,543 4,290
U004-o1 117 128 120 112
U007-B 1,842 1,864 2,040 1,925
U007-o1 376 356 322 344
U008-B 19,140 33,629 19,910 19,195
U008-D 24,360 34,755 12,737 17,189
U008-o1 129 136 119 138
U020-B 49,600 55,850 4,550 5,500
U020-C 3,658 5,033 145 139
U020-o1 254 231 234 248
U020-o2 100 183 123 225
U023-o1 62 57 50 66
U030-C N/A N/A N/A 59,730
U030-o1 2 5 5 7
U032-B 476 520 494 466
U032-o1 9 0 2 0
U034-B 1,920 1,887 1,870 1,793
U034-C 13,083 13,335 8,148 6,552
U034-o1 9 6 9 7
U034-o2 6 11 0 1

N/A, not quantified.

Inferring from split reads, the 14.8-kb segment was translocated to inside IR2 (to GK18 position 119,496, Fig 6D). This was confirmed in the unsheared tumor DNA by PCR and Sanger sequencing using primers spanning the breakpoint (Fig 6D and 6E, lanes 4 and 5). Other primer pair combinations were tested to see if there were DNA species with the 14.8-kb segment inverted, deleted in place, duplicated in tandem or rearranged in other ways. None generate detectable PCR products except for primer pairs showing that the 14.8-kb segment also exists in the native configuration (Fig 6E). Thus, the 14.8-kb segment was copied into IR2 but had not been deleted from its original location.

In a parallel study of viral transcriptomes [81], abundant expression of a chimeric Kaposin transcript fused to the same 14.8-kb segment was found in tumor U008-B, consistent with the viral genome structure we observed. Another tumor from the same participant, U008-D (Fig 6B), had 100% nucleotide identity and was confirmed to have the same duplication and breakpoint junctions (Fig 6F).

Tumor U020-B

Read coverage abruptly dropped 12.8-fold over the last ~90 kb of the KSHV genome in this tumor (Fig 7A). This was consistent with ddPCR quantitation, with vIL-6 and vBCL-2 gene amplicons having 9.0–12.3-fold higher levels than RTA and LANA (Table 2). The coverage shifted before the end of ORF25 (GK18 position 46,615) and reads at this breakpoint continue into TR sequences ~90 kb downstream (Fig 7C). Thus, U020-B appeared to have KSHV genome variants with a ~90-kb deletion, or formally, a 12.8X amplification of a 46-kb subgenomic region. No U020-B tumor DNA remained to allow confirmation of this breakpoint.

Fig 7. KSHV genomes in U020-B and U020-C have large, distinct deletions.

Fig 7

Total, sUMI and dUMI-consensus read coverages of U020-B (A) and U020-C (B) KSHV genomes. GK18 reference annotations in the high-coverage regions of U020-B (C) and U020-C (D). (E) Cartoon showing the region encompassing the high coverage region of U020-C viral genomes, leaving a 16.5-kb region connected to TR. (F) PCR primers used to examine genome structure. Primers to unique genomic sequences are in blue, and primers to repeat sequences are in orange. (G) PCR products produced from primer pairs numbered in E, with DNA from U020-C and BCBL-1 as templates. Bands from lanes 1, 2 and 5 were excised from the agarose gel and sequenced. Faint bands in lanes 7 (five bands) and 9 (one) under U020-C were extracted from the gel but did not yield enough DNA for Sanger sequencing, except for the light 1-kb band in lane 7. Top BLAST hits to this sequence were human phosphatidylserine synthase 2 gene sequences, likely amplified due of spurious primer homology. The ~16 kb band in lane 3 under BCBL-1 was confirmed by Sanger sequencing to be ORF11 and ORF18 sequences from its two ends, and lane 10 under BCBL-1 corresponded to sequences common to many cloning vectors such as pCMV-VEE-GFP. Row 11 primers in (F), in which the forward primer binds to unique genomic sequences preceding the TR, yielded no discernible product.

Tumor U020-C

This tumor had a 30-fold difference in read coverage and breakpoints inside ORF11 and ORF18 (Fig 7B). ddPCR quantitation demonstrated gene copy numbers of vIL-6 and vBCL-2 amplicons to be 25–36-fold higher than for RTA and LANA (Table 2). The spike in read coverage occurred over a 16.2 kb region (GK18 positions 16,942 to 33,011). Again, chimeric reads were found at either ends of this region continuing into TR sequences (Fig 7D), indicating fusion with the TR (Fig 7E). Junction fragment-specific PCR and Sanger sequencing confirmed the 3’ junction (Fig 7G, lane 5). No PCR product was produced from the other putative breakpoint junction, TR-ORF11 (Fig 7F, lane 4 primers in K2 and TR). However, the latter result is likely due to failure of PCR priming to the extremely GC-rich TR sequences, which was largely unsuitable for PCR priming [82]. Three forward primer designs paired with the reverse TR primer that had worked for the 3' junction (Fig 7G, lane 5), produced no PCR products even from BCBL-1 DNA. Finally, no product was produced from primer pairs outside and flanking the 16.5kb region (Fig 7G, lane 3), as well as with a primer binding upstream of TR (Fig 7G, lane 11), but these were most likely due to these primers being complementary to the deleted region (<3 reads per base). Determining the presence of >16 kb DNA from very low template amounts requires very long PCR extension times and high template numbers and were beyond the limit of our PCR assays.

Tumor U030-C

Greater than 30,000 reads/position were uniformly observed throughout most of the KSHV genome. However, coverage dropped or was missing within the K15 gene (S3A Fig). The remaining K15 sequences corresponded to the K15 M-allele, which is less common than the P allele but was included in our RNA bait design (GenBank U75698). PCR amplification and Sanger sequencing of this region showed that the U030-C tumor did contain some copies of the entire M-allele K15 sequence. The U030-C sample-consensus genome was finished with this Sanger sequence result, since no dUMI-consensus reads mapped to the gaps in K15. In the parallel RNAseq study of tumors of this same participant, U030-B and C, transcripts of K15 were also lacking, unlike tumors from all other participants [81].

The same aberrant KSHV genomes are found in multiple lesions from the same individual

In the case of U008-B and U008-D, two tumors biopsied from distinct lesions on the left leg (S5 Fig and S2 Table), full-length genome sequencing showed that they had the same 14.8 kb KSHV subgenomic sequence duplicated in IR2 (Fig 6F). PCR primers across those breakpoints were used to screen for the same structures in 6 other distinct lesions (S2 Table) from this individual, and none had this duplication. In contrast, four additional tumors tested from participant U003 had the same inversion breakpoints, detected by PCR, as tumor U003-C (Fig 8). Moreover, no intact K8.1 sequences were detected in 2 of these 4 tumors by nested PCR of the region spanning the K8.1 intron gap (Fig 8). These biopsies came from distinct lesions in the left leg (S2 Table). Lastly, in participant U020, the ORF18-TR junction sequences found spanning the U020-C genomic deletion was not detected in the 2 other tumors tested.

Fig 8. KSHV genome structures in participant U003.

Fig 8

Junction sequences marking the genomic aberration in U003-C were detected in all 4 other tumors tested, while intact K8.1 sequences were detected in only 2. The cartoon shows the breakpoints in the K8.1 intron of U003-C extending into TR sequences, along with PCR primers used to confirm the genome structure. PCR products from other tumors of participant U003 and from the BCBL-1 cell line are shown at the bottom of the figure. All visible bands were excised from the agarose gel and their structures confirmed by Sanger sequencing.

Mutations in sample-consensus KSHV genomes from tumors impacted protein coding sequences

Among the 7 participants with KSHV sequences from at least one oral swab and one tumor, sample-consensus KSHV genomes were identical in the oral and tumor samples of 2 participants and had a few differences in 4 others. In the remaining participant, U004, the sample-consensus KSHV genome in one tumor was identical to that in oral but the second tumor had mutations. The mutations unique to tumors were typically nonsynonymous point mutations resulting in highly dissimilar residues or were mutations that might disrupt their expression (Table 3).

Table 3. Unique KSHV mutations observed in tumors compared to oral swabs from the same individual.

Sample ID Tumor-specific differences
U003-C K12 synonymous mutation within miR-K10
genomic inversion starting at K8.1
U004-C NONE
U004-D ORF32 nonsynonymous mutation R56Q
K15 nonsynonymous mutation A290P
28-nt deletion within the K8.1 promoter
3-kb segment duplication from before K1 to after ORF4
U007-B NONE
U008-B duplication of 14.8 kb segment around IR1 into IR2
- breakpoints inside K3 & ORF19
- genes duplicated: ORF70, K4.1, K4.2, K5, K6, K7, ORF16, ORF17, ORF17.5, ORF18
U008-D same as U008-B
U020-B ORF25 nonsynonymous mutation Q594K
genomic deletion connecting end of ORF25 coding sequence to TR sequences, 47 kb remaining
U020-C ORF11 nonsynonymous mutation T396P
K3 nonsynonymous mutation F88L in transmembrane domain
K8.1 nonsense mutation at start of 2nd exon
genomic deletion leaving only 16 kb segment surrounding IR1 connected to TR sequences
- breakpoints inside ORF11 and ORF18
- ~30X coverage for: K2, ORF2, K3, ORF70, K4, K4.1, K4.2, K5, K6, K7, ORF16, ORF7, ORF17.5
U032-B ORF63 nonsynonymous mutation T848A
U034-B NONE
U034-C NONE

Several of the mutations or genome aberrations observed in tumors occurred in structural genes (S3 Table), and frequently involved the K8.1 gene, which encodes an envelope glycoprotein: The U003 inversion breakpoint cleaved the K8.1 gene; U004-D had an R56Q mutation in its ORF32 tegument protein coding sequence, as well as a 28-nt deletion in the promoter region of K8.1 (S6A Fig). The deletion was after the K8.1 core promoter sequence [83], but encompassed the K8.1 transcription start site [84]; the ORF25 major capsid protein in U020-B had a Q594K mutation, in addition to the U020-B genomic deletion that started further downstream inside ORF25; U020-C had a nonsense mutation at the beginning of the second K8.1 exon; and U032-B had a T848A mutation in ORF63, a tegument protein.

The only intra-host synonymous point mutation observed was in ORF K12 of U003-C (GK18 position 118,082). This C to T change occurred within the oncogenic microRNA K10 (miR-K10a-3p) sequence embedded in the K12 transcript. The three oral swab samples from this participant maintained the consensus C at this position (S6B Fig), whereas the 4 other tumors from this participant examined had T at this position, with tumor U003-G having a minor population of viruses with the consensus C (S6C Fig). The change was outside the seed sequence of miR-K10a-3p and may have resulted in a slightly more stable stem loop precursor (ΔG -32.40 ➔ -32.70, S7 Fig).

Lack of evidence for integration of KSHV sequences into human chromosomes

No de novo-assembled scaffolds, split reads or improperly-paired read mappings suggested any instance of KSHV sequences fused to human DNA. Nevertheless, attempts were made to systematically search for human-KSHV chimeric sequences. The methods employed were the same as those used to screen for all integrated herpesviruses sequences in public databases [85] and EBV integration sites in in primary gastric and nasopharyngeal carcinomas [86]. The KSHV genome inversions, duplications and deletions described above were detected with high confidence values. In contrast, putative breakpoints joining human and KSHV sequences were supported by only tens of reads, about 2 orders of magnitude lower in number, and often involved LANA repeats into low-complexity human repeat sequences. These were judged to be artifacts.

Co-infection with EBV detected predominantly in oral swabs

Some scaffolds during de novo assembly correspond to EBV sequences. Nearly all oral swab samples yielded multiple EBV-mapping scaffolds up to 73 kb, with no region of the EBV genome over-represented. In contrast, EBV-sequences were detected in only 5 of 12 tumors, and in all cases were sequences flanking the EBNA-1 repeat. The proportion of reads mapping to EBV in oral swabs ranged from 0–33%, median 1.8%, whereas in tumors the range was from 0–0.5%, median 0.002% (S4 Table). No other eukaryote viruses were identified.

Discussion

This study is the first to explore KSHV intra-host diversity at the whole genome level, across tumor and non-tumor sites, and provides an unprecedented level of precision to KSHV genome sequence analysis definition in clinical specimens. KSHV genome sequences were obtained essentially error-free from 11 oral swabs and 12 KS tumor samples provided by 9 Ugandan adults with KS. More than 100 genome copies per sample were sequenced from most tumors, while fewer copies were sequenced from oral swabs due to lower input and sample quality. By incorporating dUMI, PCR misincorporation errors and template switching artifacts were substantially eliminated, permitting detection of variants as infrequent as 0.01% and a theoretical error rate of 1/10−9, approximately the DNA replication error rate in eukaryotic cells [56]. Given that herpesviruses have among the lowest replication error rates among viruses [87], next-generation sequencing errors would have corresponded to a greater proportion of suspected mutations and confounded discernment of any rare variants in a potentially heterogeneous mixture, regardless of read depth [88].

We did not observe evidence of KSHV quasispecies within individuals [79], consistent with the low mutation rates of large dsDNA viruses [87]. Less than 0.01% of all base positions in the 131-kb KSHV genomes (excluding the major repeat regions), or <1 site per genome, were found to have a detectable variant, and these were typically supported by only one dUMI-consensus read. While the possibility of artifacts cannot be totally excluded, the exceedingly low intra-sample variation observed is within the published resolution of dUMI sequencing [56]. There are reports of intra-host KSHV variability in certain KSHV-endemic populations [39], in children [45], in iatrogenic settings [4143] and in blood of AIDS-KS patients [44], but these findings were arrived at by Sanger sequencing of PCR amplicon clones of hypervariable regions in K1 or other genes. Such protocols are more likely to detect errors that occurred during PCR. Our study found virtually no intra-sample or intra-host diversity even at K1 in the 9 individuals examined. While second enrichment could conceivably bias sequencing results, comparing doubly-enriched oral swabs libraries to their matching, singly-enriched tumor libraries resulted in identical sequences. Nevertheless, since recombination is evident in the evolutionary history of KSHV [36,38,46,50], co-infections by multiple KSHV strains have occurred, but this does not appear to be a common feature of KSHV infection.

At the point mutational level, sample-consensus KSHV genomes in tumors and oral swabs within the same individuals were nearly identical, differing by at most two point mutations. None of the differences occurred in introns and intergenic regions, and nearly all were nonsynonymous changes into dissimilar amino acids. Notably, the sole intra-host synonymous mutation observed occurred inside K12/Kaposin A of tumor U003-C (GK18 position 118,082), within the tumorigenic microRNA miR-K10a-3p [89]. The single nucleotide change, outside the seed sequence [90], is predicted to result in a slightly more stable stem loop precursor (ΔG -32.40➔-32.70, S7 Fig). The oral swab counterpart maintained the database consensus, and among published KSHV genomes, only tumor-derived ZM106 (GenBank KT271458) also had T at this position. The biologic implications of this change are not clear, but polymorphisms within this region have been reported to affect viral microRNA levels in KS lesions [91].

A striking finding in our study was the frequency of aberrant KSHV genomes in KS tumors, summarized in Fig 9. At least 4 of the 7 participants whose samples were examined had KSHV with major inversions, deletions or duplications in their tumors. In contrast, no aberrant genome structures were found in oral swabs from the same individuals. Moreover, aberrant genomes comprised the majority of the KSHV genomes in the tumors in which they were detected, from 1.7-fold (U008-B) to 30-fold (U020-C) more aberrant genomes than full-length intact KSHV genomes (Table 2 and Fig 7A and 7B). Further, no intact viral genomes were detected in 3 of 5 examined tumors from participant U003 (Figs 5 and 8). We did not find evidence of integration of KSHV into human chromosomal DNA; all breakpoints detected in KSHV sequence reads, when mapped, connected to other regions of the KSHV genome.

Fig 9. Schematic representation of the 5 aberrant KSHV genomes discovered in KS tumors.

Fig 9

Specific details and evidence for each are referred to in the text and in Table 3. The nonsense mutation in K8.1 of U020-C were found in the remaining full-length genomes detectable in this tumor.

The fact that aberrant KSHV genomes were only observed in tumor samples is intriguing yet their significance is unclear, since it is well known that tumor cells suffer substantial genomic instability [92]. KSHV genome aberrations have been reported previously: The first whole genome sequence of KSHV published reported a 33-kb portion of the KSHV unique central region duplicated into the TR region [93]. A study of 16 tumor-derived KSHV whole genomes from Zambia reported 4 that had regions with as much as 3-fold more coverage than sample average, although the regions were not specified [33]. A PCR screen for some KSHV genes showed that some KS tumors and KSHV-infected B-cell lines can harbor deleted KSHV genomes [94], and one such B-cell line proliferated faster than the parental BCBL-1 line [94]. The infecting KSHV virus in this line had an 82-kb deletion from the 5’ end of its genome, was lytic replication-incompetent, and could be packaged by a helper virus.

The genomic locations and character of tumor-associated mutations we observed suggest that mutations that propagated to high copy numbers may not have been random. However, again, this could represent regions of particular susceptibility to genome instability or they may contribute to tumorigenesis. All point and genomic mutations we observed impacted protein coding sequences, and most rearrangement breakpoints truncated lytic gene products (Tables 3 and S3). Given that the KSHV genome densely encodes many immunomodulatory, angiogenic and anti-apoptotic factors [2123], it is therefore possible that some mutations observed here could contribute to KS disease, or could involve viral genes that are unnecessary within established KS tumors, or contribute to immune evasion of transformed host cells.

The genomic region around IR1 featured prominently in genomic rearrangements in 4 tumors, potentially leading to their over-expression relative to other KSHV genes. For example, tumors U008-B and U008-D had a 14.8-kb portion of their genomes, from inside K3 to ORF19, duplicated into within IR2 (Fig 6). In a parallel RNAseq study, tumor U008-B had been found to abundantly express a chimeric transcript of the 14.8 kb section fused to IR2 sequences transcribed from a strong latency-associated promoter [81]. Distinct deletions were observed in tumors U020-B and U020-C from another participant, but the genomic regions retained, aside from TR sequences, again included the IR1 region (Fig 7).

IR1 is one of the origins of lytic replication, and transcripts around IR1 are among the most highly expressed in KS tumors [81]. These include two long non-coding RNAs that have indispensable roles during lytic reactivation of KSHV, T1.4 [68,95,96] and PAN [9799]. PAN has been shown to interact with promoters of cellular genes involved in inflammation, cell cycle regulation and metabolism, and exogenous expression of PAN alone enhanced cell growth phenotypes [100]. Recently, virally-encoded circular RNAs encoded within PAN were discovered to be abundant in clinical samples and were inducible in KSHV-infected cell lines [101103]. Other non-coding transcripts are potentially expressed from this region but their biological significance is unknown [99,104]. Finally, most ORFs encoded in the 14.8-kb retained region are lytic genes that have functions in subverting adaptive (K5/MIR2) [105,106] or innate immunity (K4/vCCL-2, K4.1/vCCL-3, K4.2 and K6/vCCL-1)[106109], and apoptosis (K7 and ORF16/vBCL-2) [106,109], all of which might enhance the survivability of a host cell. There is evidence to suggest that KSHV lytic gene expression is crucial to KS pathogenesis [110], and that residual lytic gene expression plays a role in latent KSHV persistence in vivo [111], so it is possible that viral mutations in lytic genes could impact viral persistence or KS disease.

In addition to mutations with the IR1 region, the late lytic gene K8.1 was found to be mutated in KS tumors from 3 individuals (Fig 9). U003-C had an inversion breakpoint at the K8.1 intron, U004-D had a 28-bp deletion ending at 4 bases upstream of the first K8.1 exon, and the U020-C full-length minor genome had a nonsense mutation at the start of the second K8.1 exon. Furthermore, no intact K8.1 gene sequence was detectable by hemi-nested PCR in most tumors tested from participant 003 (Figs 5C and 8). Interestingly, truncations in K8.1 had been reported previously, and all were from KS tumor isolates. The original GK18 isolate has a 74-bp deletion at the 3’ end (GenBank ID AF148805 K8.1 annotation); the Zambian isolate ZM124 (GenBank ID: KT271466) has a 25-nt deletion resulting in a frameshift and premature stop [33], and; Japanese isolate Miyako1 has a stop codon early in its first exon (GenBank ID LC200586 miscellaneous annotation). Gene K8.1 encodes an envelope glycoprotein that interacts with heparin sulfate for attachment [112115]. It is not required for entry into endothelial [113] or 293 cells [116], although it had recently been shown to be necessary for infection of primary and cultured B-cells [117]. The K8.1 protein is among the most immunogenic KSHV proteins [118120] and can be targeted by cellular immunity [121]. It is therefore conceivable that the preponderance of K8.1 mutations might be due to potent immune targeting of cells expressing K8.1 glycoproteins. Evading immune responses by omission of K8.1 expression may confer better survival of the host tumor cell. Altogether these findings suggest there could be selection pressure against K8.1 expression, which warrants further study.

That the same viral mutational signatures were found in separate KS lesions provides strong evidence that tumor-associated and likely defective KSHV genomes can spread by metastasis, or with a helper virus. Identical KSHV genome rearrangements in distinct tumors implies that the viruses were clonally related, since independently acquiring the same breakpoint coordinates is highly improbable. Participant U008 had 2 distinct tumors with the same 14.8-kb sequence duplicated into IR2 (Figs 6 and S5). Participant U003 had a KSHV genome inversion that would likely disable production of infectious virions, yet the signature K8.1-TR breakpoint junction was present in all his 5 examined tumors over 3 months (Figs 5 and 8 and S2 Table). Only 3 of the 5 had intact K8.1 sequence detectable by PCR (Fig 8). This suggests that the genomic inversion and K8.1 loss of function was not too detrimental to KS tumor persistence and implies a capacity for tumor-associated KSHV to continually seed tumors in other anatomic sites. To our knowledge, evidence of possible metastatic spread of KSHV genomes has not been previously reported, and these findings may have significant implications for understanding the progression of KS. However, consistent with the helper virus hypothesis, all deleted genomes retained the terminal repeat sequences, which contain the putative virion packaging (pac1) and cleavage sites.

In summary, highly accurate deep sequencing revealed that whole KSHV genomes in paired oral swab and KS tumors from individuals with advanced KS were virtually identical at the point mutational level. The use of dUMI provides a proof of concept for utilizing this technique to study other DNA viruses such as human cytomegalovirus, which can exhibit substantial intra-host genome heterogeneity [122]. Where there were differences, the viruses detected in saliva had the database consensus genotype while viruses detected in some tumors had novel mutations. We observed that KS tumors can harbor KSHV with genomic aberrations affecting coding regions, which raises the possibility that these mutations could alter the function of these genes and were of biologic significance. Moreover, our study demonstrated that specific KSHV genome aberrations can be found in distinct tumors from the same individuals, suggesting that infected tumor cells can seed other KS lesions. Future studies characterizing KSHV genomes in multiple tumors within individuals among a larger cohort of KS studies will help determine if tumor-associated KSHV mutations similar to those observed in our study result from tumor-associated genome instability, or contribute to KS tumorigenesis, or are associated with KS clinical presentation or clinical outcomes.

Supporting information

S1 Fig. dUMI-adaptors and primers for duplex sequencing.

During library preparation, sheared DNA fragments were A-tailed and ligated with forked, double-stranded oligonucleotides containing Illumina TruSeq universal adaptor sequences, 12-random base pairs as dUMI and spacer sequences. The adapted DNA libraries were PCR amplified before enrichment with primers mws13 and mws20, which bind to Illumina Truseq adaptors. Primer mws21 containing sample index ID for multiplex sequencing was used for PCR following enrichment. DNA libraries post processing are shown at the bottom.

(TIF)

S2 Fig. Workflow of genome assembly and variant analysis.

KSHV genomes were first assembled de novo from sequence reads of each sample, before being used as reference for mapping their respective dUMI-consensus reads (adapted from [61], see details in the Methods section). Discrepancies in bases between the sample-consensus genome and mapped dUMI-consensus reads were taken to be real intra-sample variants.

(TIF)

S3 Fig. Read coverage in all reported samples.

Raw (light blue), sUMI (blue) and dUMI-consensus (dark blue) read coverage in log scale along the de novo assembled, sample-consensus KSHV genomes in tumors (A) and oral swabs (B) examined in this study. Major repeat regions were masked and seen here as no coverage regions.

(TIF)

S4 Fig. Potential sequencing artifacts.

(A) KSHV intrasample variant frequency as a function of read coverage. Sample variant frequencies were shown in Table 1 when at least 100 viral genomes were sampled, since below that level, minor variant frequencies were judged to be unreliable. (B) Intra-sample minor variants detected in dUMI-consensus reads of all samples by type of base substitution.

(TIF)

S5 Fig. U008-B and U008-D were from distinct lesions on the left leg.

U008-B biopsy was obtained from lesions in the upper thigh, while U008-D was biopsied from a large lesion on the knee.

(TIF)

S6 Fig. Mutations of KSHV genomes in tumors from participants 004 and 003.

(A) Alignment of KSHV genomes from Participant 004, showing a 28-bp deletion in the K8.1 promoter in U004-D. U004-D and U004-C are from tumors while U004-o1 is from an oral swab. (B) The only intra-host synonymous mutation found in this study, within miR-K10 in participant 003. (C) Sequence chromatograms of miR-K10 in other tumors of participant U003, with a T in all tumors and a mixture of T and the database consensus C in a minority of viruses in U003-G.

(TIF)

S7 Fig. Predicted secondary structure of the stem loop precursor of miR-K10a-3p.

The structure of pre-miR-K12-10a was predicted using mfold (http://unafold.rna.albany.edu/?q=mfold/RNA-Folding-Form), indicating the mature miRNA, seed sequence (grey) and the intra-host polymorphism (red) found in participant U003. The G➔A change in RNA sequence resulted in a slightly more stable stem loop (ΔG -32.40 ➔ -32.70).

(TIF)

S1 Table. Adapter, primer and probe sequences used for library prep and ddPCR.

(XLSX)

S2 Table. Anatomic sites, biopsy times and tumor morphologies of the KS lesions.

(XLSX)

S3 Table. Tumor KSHV-unique mutations observed, in gene order.

(XLSX)

S4 Table. Number of dUMI-consensus reads from BCBL-1, tumor and oral swab samples mapping to EBV genome (strain B-95/AG876).

(XLSX)

Acknowledgments

We thank the study participants who contributed invaluable specimens that make this study possible. We also thank D. Depledge and J. Breuer for sharing their RNA bait sequences for capturing KSHV genomic sequences and M. Lagunoff for kindly providing us with an early passage BCBL-1 cell line and critical reading of this paper.

Data Availability

All whole KSHV genome sequence files are available from the NCBI, with GenBank Accession numbers: MT510648, MT510670, MT936340. All other relevant data are within the manuscript and its Supporting Information files.

Funding Statement

This work was supported by National Institutes of Health (https://www.nih.gov) grants to U54 CA190146 (WTP), K23 CA 150931 (WTP) and the University of Washington Centers for AIDS Research Retroviruses and Molecular Data Sciences Core (P30 AI027757; JIM). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Engels EA, Pfeiffer RM, Goedert JJ, Virgo P, McNeel TS, Scoppa SM, et al. Trends in cancer risk among people with AIDS in the United States 1980–2002. AIDS. 2006;20:1645–54. 10.1097/01.aids.0000238411.75324.59 . [DOI] [PubMed] [Google Scholar]
  • 2.Mbulaiteye SM, Katabira ET, Wabinga H, Parkin DM, Virgo P, Ochai R, et al. Spectrum of cancers among HIV-infected persons in Africa: the Uganda AIDS-Cancer Registry Match Study. Int J Cancer. 2006;118:985–90. 10.1002/ijc.21443 . [DOI] [PubMed] [Google Scholar]
  • 3.Jemal A, Bray F, Center MM, Ferlay J, Ward E, Forman D. Global cancer statistics. CA Cancer J Clin. 2011;61:69–90. 10.3322/caac.20107 . [DOI] [PubMed] [Google Scholar]
  • 4.Sarid R, Gao S-J. Viruses and human cancer: from detection to causality. Cancer Lett. 2011;305:218–27. 10.1016/j.canlet.2010.09.011 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Moore PS, Chang Y. The conundrum of causality in tumor virology: the cases of KSHV and MCV. Semin Cancer Biol. 2014;26:4–12. 10.1016/j.semcancer.2013.11.001 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Koelle DM, Huang M, Chandran B, Vieira J, Piepkorn M, Corey L. Frequent Detection of Kaposi’s Sarcoma—Associated Herpesvirus (Human Herpesvirus 8) DNA in Saliva of Human Immunodeficiency Virus-Infected Men: Clinical and Immunologic Correlates. J Infect Dis. 1997. 10.1086/514045 . [DOI] [PubMed] [Google Scholar]
  • 7.Casper C, Lampinen TM, Morrow RA, Corey L, Wald A. HIV Infection and Human Herpesvirus-8 Oral Shedding. 2004;35: 233–238. [DOI] [PubMed] [Google Scholar]
  • 8.Phipps W, Saracino M, Selke S, Huang M-L, Jaoko W, Mandaliya K, et al. Oral HHV-8 replication among women in Mombasa. Kenya J Med Virol. 2014;86:1759–65. 10.1002/jmv.23941 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Mesri EA, Cesarman E, Boshoff C. Kaposi’s sarcoma and its associated herpesvirus. Nature Reviews Cancer. NIH Public Access; 2010. pp. 707–719. 10.1038/nrc2888 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Haley CT, Mui UN, Vangipuram R, Rady PL, Tyring SK. Human Oncoviruses: Mucocutaneous Manifestations, Pathogenesis, Therapeutics, and Prevention (Part I: Papillomaviruses and Merkel cell polyomavirus). J Am Acad Dermatol. 2018. [DOI] [PubMed] [Google Scholar]
  • 11.Renzette N, Somasundaran M, Brewster F, Coderre J, Weiss ER, McManus M, et al. Epstein-Barr virus latent membrane protein 1 genetic variability in peripheral blood B cells and oropharyngeal fluids. J Virol. 2014;88:3744–55. 10.1128/JVI.03378-13 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Weiss ER, Lamers SL, Henderson JL, Melnikov A, Somasundaran M, Garber M, et al. Early Epstein-Barr Virus Genomic Diversity and Convergence toward the B95.8 Genome in Primary Infection. J Virol. 2017;92: JVI.01466–17. 10.1128/JVI.01466-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Shair KHY, Reddy A, Cooper VS. New Insights from Elucidating the Role of LMP1 in Nasopharyngeal Carcinoma. Cancers (Basel). 2018;10 10.3390/cancers10040086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kwok H, Tong AHY, Lin CH, Lok S, Farrell PJ, Kwong DLW, et al. Genomic sequencing and comparative analysis of Epstein-Barr virus genome isolated from primary nasopharyngeal carcinoma biopsy. Lo K-W, editor. PLoS One. 2012;7: e36939 10.1371/journal.pone.0036939 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Kwok H, Wu CW, Palser AL, Kellam P, Sham PC, Kwong DLW, et al. Genomic diversity of Epstein-Barr virus genomes isolated from primary nasopharyngeal carcinoma biopsy samples. J Virol. 2014;88:10662–72. 10.1128/JVI.01665-14 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Tsai M-H, Lin X, Shumilov A, Bernhardt K, Feederle R, Poirey R, et al. The biological properties of different Epstein-Barr virus strains explain their association with various types of cancers. Oncotarget. 2017;8:10238–54. 10.18632/oncotarget.14380 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Dheekollu J, Malecka K, Wiedmer A, Delecluse H-J, Chiang AKS, Altieri DC, et al. Carcinoma-risk variant of EBNA1 deregulates Epstein-Barr Virus episomal latency. Oncotarget. 2017;8:7248–64. 10.18632/oncotarget.14540 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Church TM, Verma D, Thompson J, Swaminathan S. Efficient Translation of Epstein-Barr Virus (EBV) DNA Polymerase Contributes to the Enhanced Lytic Replication Phenotype of M81 EBV. J Virol. 2018;92 10.1128/JVI.01794-17 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Liu Q, Han A, You S, Yang Q, Liang Y, Dong Y. The association of genomic variation of Epstein-Barr virus BamHI F fragment with the proliferation of nasopharyngeal carcinoma. APMIS. 2010;118:657–64. 10.1111/j.1600-0463.2010.02642.x . [DOI] [PubMed] [Google Scholar]
  • 20.Tsai M-H, Raykova A, Klinke O, Bernhardt K, Gärtner K, Leung CS, et al. Spontaneous lytic replication and epitheliotropism define an Epstein-Barr virus strain found in carcinomas. Cell Rep. 2013;5:458–70. 10.1016/j.celrep.2013.09.012 . [DOI] [PubMed] [Google Scholar]
  • 21.Abere B, Schulz TF. KSHV non-structural membrane proteins involved in the activation of intracellular signaling pathways and the pathogenesis of Kaposi’s sarcoma. Curr Opin Virol. 2016;20:11–9. 10.1016/j.coviro.2016.07.008 . [DOI] [PubMed] [Google Scholar]
  • 22.Yan L, Majerciak V, Zheng ZM, Lan K. Towards Better Understanding of KSHV Life Cycle: from Transcription and Posttranscriptional Regulations to Pathogenesis Virologica Sinica. Science Press; 2019. pp. 135–161. 10.1007/s12250-019-00114-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Dittmer DP, Damania B, Silverberg M, Robbins H, Pfeiffer R, Shiels M, et al. Kaposi sarcoma–associated herpesvirus: immunobiology, oncogenesis, and therapy. J Clin Invest. 2016;126:3165–75. 10.1172/JCI84418 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Marshall V, Parks T, Bagni R, Wang CD, Samols MA, Hu J, et al. Conservation of Virally Encoded MicroRNAs in Kaposi Sarcoma–Associated Herpesvirus in Primary Effusion Lymphoma Cell Lines and in Patients with Kaposi Sarcoma or Multicentric Castleman Disease. J Infect Dis. 2007;195:645–59. 10.1086/511434 . [DOI] [PubMed] [Google Scholar]
  • 25.Ray A, Marshall V, Uldrick T, Leighty R, Labo N, Wyvill K, et al. Sequence analysis of Kaposi sarcoma-associated herpesvirus (KSHV) microRNAs in patients with multicentric Castleman disease and KSHV-associated inflammatory cytokine syndrome. J Infect Dis. 2012;205:1665–76. 10.1093/infdis/jis249 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Boralevi F, Masquelier B, Denayrolles M, Dupon M, Pellegrin JL, Ragnaud JM, et al. Study of human herpesvirus 8 (HHV-8) variants from Kaposi’s sarcoma in France: is HHV-8 subtype A responsible for more agressive tumors? J Infect Dis. 1998;178:1546–7. 10.1086/314471 . [DOI] [PubMed] [Google Scholar]
  • 27.Mancuso R, Biffi R, Valli M, Bellinvia M, Athanasia T, Ferrucci S, et al. HHV8 a subtype is associated with rapidly evolving classic Kaposi’s sarcoma. J Med Virol. 2008;80:2153–60. 10.1002/jmv.21322 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Isaacs T, Abera AB, Muloiwa R, Katz AA, Todd G. Genetic diversity of HHV8 subtypes in South Africa: A5 subtype is associated with extensive disease in AIDS-KS. J Med Virol. 2016;88:292–303. 10.1002/jmv.24328 . [DOI] [PubMed] [Google Scholar]
  • 29.Cook PM, Whitby D, Calabro ML, Luppi M, Kakoola DN, Hjalgrim H, et al. Variability and evolution of Kaposi’s sarcoma-associated herpesvirus in Europe and Africa. AIDS. 1999;13:1165–76. 10.1097/00002030-199907090-00004 . [DOI] [PubMed] [Google Scholar]
  • 30.Lacoste V, Judde JG, Brière J, Tulliez M, Garin B, Kassa-Kelembho E, et al. Molecular epidemiology of human herpesvirus 8 in Africa: Both B and A5 K1 genotypes, as well as the M and P genotypes of K14.1/K15 loci, are frequent and widespread. Virology. 2000;278:60–74. 10.1006/viro.2000.0629 . [DOI] [PubMed] [Google Scholar]
  • 31.Kadyrova E, Lacoste V, Duprez R, Pozharissky K, Molochkov V, Huerre M, et al. Molecular epidemiology of Kaposi’s sarcoma-associated herpesvirus/human herpesvirus 8 strains from Russian patients with classic, posttransplant, and AIDS-associated Kaposi’s sarcoma. J Med Virol. 2003;71:548–56. 10.1002/jmv.10530 . [DOI] [PubMed] [Google Scholar]
  • 32.Cordiali-Fei P, Trento E, Giovanetti M, Lo Presti A, Latini A, Giuliani M, et al. Analysis of the ORFK1 hypervariable regions reveal distinct HHV-8 clustering in Kaposi’s sarcoma and non-Kaposi’s cases. J Exp Clin Cancer Res. 2015;34:1 10.1186/s13046-014-0119-0 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Olp LN, Jeanniard A, Marimo C, West JT, Wood C. Whole-Genome Sequencing of Kaposi’s Sarcoma-Associated Herpesvirus from Zambian Kaposi’s Sarcoma Biopsy Specimens Reveals Unique Viral Diversity. J Virol. 2015;89:12299–308. 10.1128/JVI.01712-15 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Awazawa R, Utsumi D, Katano H, Awazawa T, Miyagi T, Hayashi K, et al. High Prevalence of Distinct Human Herpesvirus 8 Contributes to the High Incidence of Non-acquired Immune Deficiency Syndrome-Associated Kaposi’s Sarcoma in Isolated Japanese Islands. J Infect Dis. 2017;216:850–8. 10.1093/infdis/jix424 [DOI] [PubMed] [Google Scholar]
  • 35.Osawa M, Mine S, Ota S, Kato K, Sekizuka T, Kuroda M, et al. Establishing and characterizing a new primary effusion lymphoma cell line harboring Kaposi’s sarcoma–associated herpesvirus. Infect Agent Cancer. 2016;11:37 10.1186/s13027-016-0086-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Sallah N, Palser AL, Watson SJ, Labo N, Asiki G, Marshall V, et al. Genome-Wide Sequence Analysis of Kaposi Sarcoma-Associated Herpesvirus Shows Diversification Driven by Recombination. J Infect Dis. 2018;218:1700–10. 10.1093/infdis/jiy427 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Caro-Vegas C, Sellers S, Host KM, Seltzer J, Landis J, Fischer WA, et al. Runaway Kaposi Sarcoma-associated herpesvirus replication correlates with systemic IL-10 levels. Virology. 2020;539:18–25. 10.1016/j.virol.2019.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Hayward GS, Zong JC. Modern evolutionary history of the human KSHV genome. Curr Top Microbiol Immunol. 2007;312:1–42. 10.1007/978-3-540-34344-8_1 . [DOI] [PubMed] [Google Scholar]
  • 39.Beyari MM, Hodgson TA, Cook RD, Kondowe W, Molyneux EM, Scully CM, et al. Multiple Human Herpesvirus–8 Infection. J Infect Dis. 2003;188:678–89. 10.1086/377504 . [DOI] [PubMed] [Google Scholar]
  • 40.Beyari MM, Hodgson TA, Kondowe W, Molyneux EM, Scully CM, Porter SR, et al. Genotypic profile of human herpesvirus 8 (Kaposi’s sarcoma-associated herpesvirus) in urine. J Clin Microbiol. 2004;42:3313–6. 10.1128/JCM.42.7.3313-3316.2004 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Al-Otaibi LM, Ngui SL, Scully CM, Porter SR, Teo CG. Salivary human herpesvirus 8 shedding in renal allograft recipients with Kaposi’s sarcoma. J Med Virol. 2007;79:1357–65. 10.1002/jmv.20929 . [DOI] [PubMed] [Google Scholar]
  • 42.Al-Otaibi LM, Al-Sulaiman MH, Teo CG, Porter SR. Extensive oral shedding of human herpesvirus 8 in a renal allograft recipient. Oral Microbiol Immunol. 2009;24:109–15. 10.1111/j.1399-302X.2008.00481.x . [DOI] [PubMed] [Google Scholar]
  • 43.Al-Otaibi LM, Moles DR, Porter SR, Teo CG. Human herpesvirus 8 shedding in the mouth and blood of hemodialysis patients. J Med Virol. 2012;84:792–7. 10.1002/jmv.23245 . [DOI] [PubMed] [Google Scholar]
  • 44.Leao JC, de Faria ABS, Fonseca DDD, Gueiros LAM, Silva IHM, Porter SR. Intrahost genetic variability of human herpes virus-8. J Med Virol. 2013;85:636–45. 10.1002/jmv.23491 . [DOI] [PubMed] [Google Scholar]
  • 45.Mbulaiteye S, Marshall V, Bagni RK, Wang C, Mbisa G, Bakaki PM, et al. Molecular Evidence for Mother-to-Child Transmission of Kaposi Sarcoma–Associated Herpesvirus in Uganda and K1 Gene Evolution within the Host. J Infect Dis. 2006;193:1250–7. 10.1086/503052 . [DOI] [PubMed] [Google Scholar]
  • 46.Zong J, Ciufo DM, Viscidi R, Alagiozoglou L, Tyring S, Rady P, et al. Genotypic analysis at multiple loci across Kaposi’s sarcoma herpesvirus (KSHV) DNA molecules: Clustering patterns, novel variants and chimerism. J Clin Virol. 2002;23:119–48. 10.1016/s1386-6532(01)00205-0 . [DOI] [PubMed] [Google Scholar]
  • 47.Zong J-C, Arav-Boger R, Alcendor DJ, Hayward GS. Reflections on the interpretation of heterogeneity and strain differences based on very limited PCR sequence data from Kaposi’s sarcoma-associated herpesvirus genomes. J Clin Virol. 2007;40:1–8. 10.1016/j.jcv.2007.06.012 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Stebbing J, Wilder N, Ariad S, Abu-Shakra M. Lack of intra-patient strain variability during infection with Kaposi’s sarcoma-associated herpesvirus. Am J Hematol. 2001;68:133–4. 10.1002/ajh.1165 . [DOI] [PubMed] [Google Scholar]
  • 49.Meng YX, Spira TJ, Bhat GJ, Birch CJ, Druce JD, Edlin BR, et al. Individuals from North America, Australasia, and Africa are infected with four different genotypes of human herpesvirus 8. Virology. 1999;261:106–19. 10.1006/viro.1999.9853 . [DOI] [PubMed] [Google Scholar]
  • 50.Poole LJ, Zong JC, Ciufo DM, Alcendor DJ, Cannon JS, Ambinder R, et al. Comparison of genetic variability at multiple loci across the genomes of the major subtypes of Kaposi’s sarcoma-associated herpesvirus reveals evidence for recombination and for two distinct types of open reading frame K15 alleles at the right-hand end. J Virol. 1999;73:6646–60. 10.1128/JVI.73.8.6646-6660.1999 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Kakoola DN, Sheldon J, Byabazaire N, Bowden RJ, Katongole-Mbidde E, Schulz TF, et al. Recombination in human herpesvirus-8 strains from Uganda and evolution of the K15 gene. J Gen Virol. 2001;82:2393–404. 10.1099/0022-1317-82-10-2393 . [DOI] [PubMed] [Google Scholar]
  • 52.Costello M, Pugh TJ, Fennell TJ, Stewart C, Lichtenstein L, Meldrim JC, et al. Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation. Nucleic Acids Res. 2013;41:e67 10.1093/nar/gks1443 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Chen G, Mosier S, Gocke CD, Lin M-T, Eshleman JR. Cytosine Deamination Is a Major Cause of Baseline Noise in Next-Generation Sequencing. Mol Diagn Ther. 2014;18:587–93. 10.1007/s40291-014-0115-2 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Ma X, Shao Y, Tian L, Flasch DA, Mulder HL, Edmonson MN, et al. Analysis of error profiles in deep next-generation sequencing data. Genome Biol. 2019;20:50 10.1186/s13059-019-1659-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Park G, Park JK, Shin S-H, Jeon H-J, Kim NKD, Kim YJ, et al. Characterization of background noise in capture-based targeted sequencing data. Genome Biol. 2017;18:136 10.1186/s13059-017-1275-2 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Schmitt MW, Kennedy SR, Salk JJ, Fox EJ, Hiatt JB, Loeb LA. Detection of ultra-rare mutations by next-generation sequencing. Proc Natl Acad Sci U S A. 2012;109:14508–13. 10.1073/pnas.1208715109 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Krown SE, Metroka C, Wernz JC. Kaposi’s sarcoma in the acquired immune deficiency syndrome: a proposal for uniform evaluation, response, and staging criteria. AIDS Clinical Trials Group Oncology Committee. J Clin Oncol. 1989;7:1201–7. 10.1200/JCO.1989.7.9.1201 . [DOI] [PubMed] [Google Scholar]
  • 58.Johnston C, Orem J, Okuku F, Kalinaki M, Saracino M, Katongole-Mbidde E, et al. Impact of HIV infection and Kaposi sarcoma on human herpesvirus-8 mucosal replication and dissemination in Uganda. PLoS One. 2009;4:e4222 10.1371/journal.pone.0004222 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Ryncarz AJ, Goddard J, Wald A, Huang ML, Roizman B, Corey L. Development of a high-throughput quantitative assay for detecting herpes simplex virus DNA in clinical samples. J Clin Microbiol. 1999;37:1941–7. 10.1128/JCM.37.6.1941-1947.1999 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Zaniello B, Huang M-L, Cheng A, Selke S, Wald A, Jerome KR, et al. Consistent viral DNA quantification after prolonged storage at ambient temperature. J Virol Methods. 2016;228:91–4. 10.1016/j.jviromet.2015.11.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Kennedy SR, Schmitt MW, Fox EJ, Kohrn BF, Salk JJ, Ahn EH, et al. Detecting ultralow-frequency mutations by Duplex Sequencing. Nat Protoc. 2014;9:2586–606. 10.1038/nprot.2014.170 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Depledge DP, Palser AL, Watson SJ, Lai IY-C, Gray ER, Grant P, et al. Specific Capture and Whole-Genome Sequencing of Viruses from Clinical Samples. PLoS One. 2011;6. 10.1371/journal.pone.0027805 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Thompson JR, Marcelino LA, Polz MF. Heteroduplexes in mixed-template amplifications: formation, consequence and elimination by “reconditioning PCR”. Nucleic Acids Res. 2002;30:2083–8. 10.1093/nar/30.9.2083 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Joshi N, Fass J. Sickle: A sliding-window, adaptive, quality-based trimming tool for FastQ files (Version 1.33) [Software]. Available at https://github.com/najoshi/sickle. 2011; 2011. [Google Scholar]
  • 65.Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. 2013. [Google Scholar]
  • 66.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–77. 10.1089/cmb.2012.0021 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Boetzer M, Pirovano W. Toward almost closed genomes with GapFiller. Genome Biol. 2012;13:R56 10.1186/gb-2012-13-6-r56 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Wang Y, Li H, Chan MY, Zhu FX, Lukac DM, Yuan Y. Kaposi’s Sarcoma-Associated Herpesvirus ori-Lyt-Dependent DNA Replication: cis-Acting Requirements for Replication and ori-Lyt-Associated RNA Transcription. J Virol. 2004;78:8615–29. 10.1128/JVI.78.16.8615-8629.2004 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80. 10.1093/molbev/mst010 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3. 10.1093/bioinformatics/btu033 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies Molecular Biology and Evolution. Oxford University Press; 2006. pp. 254–267. 10.1093/molbev/msj030 [DOI] [PubMed] [Google Scholar]
  • 72.Katz JP, Pipas JM. SummonChimera infers integrated viral genomes with nucleotide precision from NGS data. BMC Bioinformatics. 2014;15:348 10.1186/s12859-014-0348-4 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Chiang C, Layer RM, Faust GG, Lindberg MR, Rose DB, Garrison EP, et al. SpeedSeq: Ultra-fast personal genome analysis and interpretation. Nat Methods. 2015;12:966–8. 10.1038/nmeth.3505 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Layer RM, Chiang C, Quinlan AR, Hall IM. LUMPY: a probabilistic framework for structural variant discovery. Genome Biol. 2014;15:R84 10.1186/gb-2014-15-6-r84 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Zhou F-C, Zhang Y-J, Deng J-H, Wang X-P, Pan H-Y, Hettler E, et al. Efficient infection by a recombinant Kaposi’s sarcoma-associated herpesvirus cloned in a bacterial artificial chromosome: application for genetic analysis. J Virol. 2002;76:6185–96. 10.1128/jvi.76.12.6185-6196.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Renne R, Zhong W, Herndier B, McGrath M, Abbey N, Kedes D, et al. Lytic growth of Kaposi’s sarcoma-associated herpesvirus (human herpesvirus 8) in culture. Nat Med. 1996;2:342–6. 10.1038/nm0396-342 [DOI] [PubMed] [Google Scholar]
  • 77.Kivioja T, Vähärautio A, Karlsson K, Bonke M, Enge M, Linnarsson S, et al. Counting absolute numbers of molecules using unique molecular identifiers. Nat Methods. 2012;9:72–4. 10.1038/nmeth.1778 . [DOI] [PubMed] [Google Scholar]
  • 78.Yakushko Y, Hackmann C, Günther T, Rückert J, Henke M, Koste L, et al. Kaposi’s sarcoma-associated herpesvirus bacterial artificial chromosome contains a duplication of a long unique-region fragment within the terminal repeat region. J Virol. 2011;85:4612–7. 10.1128/JVI.02068-10 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Eigen M. On the nature of virus quasispecies. Trends Microbiol. 1996;4:216–8. 10.1016/0966-842X(96)20011-3 . [DOI] [PubMed] [Google Scholar]
  • 80.Peng Q, Xu C, Kim D, Lewis M, DiCarlo J, Wang Y. Targeted Single Primer Enrichment Sequencing with Single End Duplex-UMI. Sci Rep. 2019;9:4810 10.1038/s41598-019-41215-z . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Rose TM, Bruce AG, Barcy S, Fitzgibbon M, Matsumoto LR, Ikoma M, et al. Quantitative RNAseq analysis of Ugandan KS tumors reveals KSHV gene expression dominated by transcription from the LTd downstream latency promoter. Schulz TF, editor. PLoS Pathog. 2018;14: e1007441 10.1371/journal.ppat.1007441 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.McDowell DG, Burns NA, Parkes HC. Localised sequence regions possessing high melting temperatures prevent the amplification of a DNA mimic in competitive PCR. Nucleic Acids Res. 1998;26:3340–7. 10.1093/nar/26.14.3340 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Tang S, Yamanegi K, Zheng Z-M. Requirement of a 12-base-pair TATT-containing sequence and viral lytic DNA replication in activation of the Kaposi’s sarcoma-associated herpesvirus K8.1 late promoter. J Virol. 2004;78: 2609–14. 10.1128/jvi.78.5.2609-2614.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Tang S, Zheng Z-M. Kaposi’s sarcoma-associated herpesvirus K8 exon 3 contains three 5’-splice sites and harbors a K8.1 transcription start site. J Biol Chem. 2002;277: 14547–56. 10.1074/jbc.M111308200 [DOI] [PubMed] [Google Scholar]
  • 85.Cantalupo PG, Katz JP, Pipas JM. Viral sequences in human cancer. Virology. 2018;513:208–16. 10.1016/j.virol.2017.10.017 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Xu M, Zhang W-L, Zhu Q, Zhang S, Yao Y-Y, Xiang T, et al. Genome-wide profiling of Epstein-Barr virus integration by targeted sequencing in Epstein-Barr virus associated malignancies. Theranostics. 2019;9:1115–24. 10.7150/thno.29622 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Sanjuán R, Domingo-Calap P. Mechanisms of viral mutation. Cell Mol Life Sci. 2016;73:4433–48. 10.1007/s00018-016-2299-6 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Salk JJ, Schmitt MW, Loeb LA. Enhancing the accuracy of next-generation sequencing for detecting rare and subclonal mutations Nature Reviews Genetics. Nature Publishing Group; 2018. pp. 269–285. 10.1038/nrg.2017.117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Forte E, Raja AN, Shamulailatpam P, Manzano M, Schipma MJ, Casey JL, et al. MicroRNA-Mediated Transformation by the Kaposi’s Sarcoma-Associated Herpesvirus Kaposin Locus. Hutt-Fletcher L, editor. J Virol. 2015;89:2333–41. 10.1128/JVI.03317-14 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Gottwein E, Corcoran DL, Mukherjee N, Skalsky RL, Hafner M, Nusbaum JD, et al. Viral microRNA targetome of KSHV-infected primary effusion lymphoma cell lines. Cell Host Microbe. 2011;10:515–26. 10.1016/j.chom.2011.09.012 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Marshall VA, Labo N, Sztuba-Solinska J, Castro EMC, Aleman K, Wyvill KM, et al. Polymorphisms in KSHV-encoded microRNA sequences affect levels of mature viral microRNA in Kaposi Sarcoma lesions. Oncotarget. 2018;9:35856–69. 10.18632/oncotarget.26321 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Rangel N, Forero-Castro M, Rondón-Lagos M. New insights in the cytogenetic practice: Karyotypic chaos, non-clonal chromosomal alterations and chromosomal instability in human cancer and therapy response Genes. MDPI AG; 2017. pp. 2–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Russo JJ, Bohenzky RA, Chien MC, Chen J, Yan M, Maddalena D, et al. Nucleotide sequence of the Kaposi sarcoma-associated herpesvirus (HHV8). Proc Natl Acad Sci U S A. 1996;93:14862–7. 10.1073/pnas.93.25.14862 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Deng J-H, Zhang Y-J, Wang X-P, Gao S-J. Lytic replication-defective Kaposi’s sarcoma-associated herpesvirus: potential role in infection and malignant transformation. J Virol. 2004;78:11108–20. 10.1128/JVI.78.20.11108-11120.2004 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Wang Y, Tang Q, Maul GG, Yuan Y. Kaposi’s sarcoma-associated herpesvirus ori-Lyt-dependent DNA replication: dual role of replication and transcription activator. J Virol. 2006;80:12171–86. 10.1128/JVI.00990-06 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Liu D, Wang Y, Yuan Y. Kaposi’s Sarcoma-Associated Herpesvirus K8 Is an RNA Binding Protein That Regulates Viral DNA Replication in Coordination with a Noncoding RNA. J Virol. 2018;92 10.1128/JVI.02177-17 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Campbell M, Kung H-J, Izumiya Y. Long non-coding RNA and epigenetic gene regulation of KSHV. Viruses. 2014;6:4165–77. 10.3390/v6114165 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Conrad NK. New insights into the expression and functions of the Kaposi’s sarcoma-associated herpesvirus long noncoding PAN RNA. Virus Res. 2016;212:53–63. 10.1016/j.virusres.2015.06.012 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Chavez-Calvillo G, Martin S, Hamm C, Sztuba-Solinska J. The Structure-To-Function Relationships of Gammaherpesvirus-Encoded Long Non-Coding RNAs and Their Contributions to Viral Pathogenesis. Non-coding RNA. 2018;4 10.3390/ncrna4040024 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Rossetto CC, Tarrant-Elorza M, Verma S, Purushothaman P, Pari GS. Regulation of Viral and Cellular Gene Expression by Kaposi’s Sarcoma-Associated Herpesvirus Polyadenylated Nuclear RNA. J Virol. 2013;87:5540–53. 10.1128/JVI.03111-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Tagawa T, Gao S, Koparde VN, Gonzalez M, Spouge JL, Serquiña AP, et al. Discovery of Kaposi’s sarcoma herpesvirus-encoded circular RNAs and a human antiviral circular RNA. Proc Natl Acad Sci U S A. 2018;115:12805–10. 10.1073/pnas.1816183115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Abere B, Li J, Zhou H, Toptan T, Moore PS, Chang Y. Kaposi’s Sarcoma-Associated Herpesvirus-Encoded circRNAs Are Expressed in Infected Tumor Tissues and Are Incorporated into Virions. MBio. 2020;11 10.1128/mBio.03027-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Toptan T, Abere B, Nalesnik MA, Swerdlow SH, Ranganathan S, Lee N, et al. Circular DNA tumor viruses make circular RNAs. Proc Natl Acad Sci U S A. 2018;115:E8737–45. 10.1073/pnas.1811728115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Taylor JL, Bennett HN, Snyder BA, Moore PS, Chang Y. Transcriptional analysis of latent and inducible Kaposi’s sarcoma-associated herpesvirus transcripts in the K4 to K7 region. J Virol. 2005;79:15099–106. 10.1128/JVI.79.24.15099-15106.2005 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Hu Z, Usherwood EJ. Immune escape of γ-herpesviruses from adaptive immunity. Rev Med Virol. 2014;24:365–78. 10.1002/rmv.1791 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Lee H-R, Lee S, Chaudhary PM, Gill P, Jung JU. Immune evasion by Kaposi’s sarcoma-associated herpesvirus. Future Microbiol. 2010;5:1349–65. 10.2217/fmb.10.105 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Wei F, Zhu Q, Ding L, Liang Q, Cai Q. Manipulation of the host cell membrane by human γ-herpesviruses EBV and KSHV for pathogenesis. Virol Sin. 2016;31:395–405. 10.1007/s12250-016-3817-2 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Wei X, Lan K. Activation and counteraction of antiviral innate immunity by KSHV: an Update. Sci Bull. 2018;63:1223–34. 10.1016/j.scib.2018.07.009 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Manners O, Murphy JC, Coleman A, Hughes DJ, Whitehouse A. Contribution of the KSHV and EBV lytic cycles to tumourigenesis. Curr Opin Virol. 2018;32:60–70. 10.1016/j.coviro.2018.08.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Andrei G, Snoeck R. Kaposi’s sarcoma-associated herpesvirus: The role of lytic replication in targeted therapy. Current Opinion in Infectious Diseases. 2015. pp. 611–624. 10.1097/QCO.0000000000000213 [DOI] [PubMed] [Google Scholar]
  • 111.Grundhoff A, Ganem D. Inefficient establishment of KSHV latency suggests an additional role for continued lytic replication in Kaposi sarcoma pathogenesis. J Clin Invest. 2004;113:124–36. 10.1172/JCI17803 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Wang FZ, Akula SM, Pramod NP, Zeng L, Chandran B. Human herpesvirus 8 envelope glycoprotein K8.1A interaction with the target cells involves heparan sulfate. J Virol. 2001;75:7517–27. 10.1128/JVI.75.16.7517-7527.2001 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Birkmann A, Mahr K, Ensser A, Yağuboğlu S, Titgemeyer F, Fleckenstein B, et al. Cell surface heparan sulfate is a receptor for human herpesvirus 8 and interacts with envelope glycoprotein K8.1. J Virol. 2001;75:11583–93. 10.1128/JVI.75.23.11583-11593.2001 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Akula SM, Pramod NP, Wang F-Z, Chandran B. Human Herpesvirus 8 Envelope-Associated Glycoprotein B Interacts with Heparan Sulfate-like Moieties. Virology. 2001;284:235–49. 10.1006/viro.2001.0921 . [DOI] [PubMed] [Google Scholar]
  • 115.Akula SM, Wang F-Z, Vieira J, Chandran B. Human Herpesvirus 8 Interaction with Target Cells Involves Heparan Sulfate. Virology. 2001;282:245–55. 10.1006/viro.2000.0851 [DOI] [PubMed] [Google Scholar]
  • 116.Luna RE, Zhou F, Baghian A, Chouljenko V, Forghani B, Gao S-J, et al. Kaposi’s Sarcoma-Associated Herpesvirus Glycoprotein K8.1 Is Dispensable for Virus Entry. J Virol. 2004;78: 6389–6398. 10.1128/JVI.78.12.6389-6398.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Dollery SJ, Santiago-Crespo RJ, Chatterjee D, Berger EA. Glycoprotein K8.1A of Kaposi’s Sarcoma-Associated Herpesvirus Is a Critical B Cell Tropism Determinant Independent of Its Heparan Sulfate Binding Activity. Longnecker RM, editor. J Virol. 2018;93 10.1128/JVI.01876-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Li M, MacKey J, Czajak SC, Desrosiers RC, Lackner AA, Jung JU. Identification and characterization of Kaposi’s sarcoma-associated herpesvirus K8.1 virion glycoprotein. J Virol. 1999;73: 1341–9. 10.1128/JVI.73.2.1341-1349.1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Raab MS, Albrecht JC, Birkmann A, Yağuboğlu S, Lang D, Fleckenstein B, et al. The immunogenic glycoprotein gp35-37 of human herpesvirus 8 is encoded by open reading frame K8.1. J Virol. 1998;72: 6725–31. 10.1128/JVI.72.8.6725-6731.1998 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Chandran B, Bloomer C, Chan SR, Zhu L, Goldstein E, Horvat R. Human Herpesvirus-8 ORF K8.1 Gene Encodes Immunogenic Glycoproteins Generated by Spliced Transcripts. Virology. 1998;249: 140–149. 10.1006/viro.1998.9316 [DOI] [PubMed] [Google Scholar]
  • 121.Hislop AD, Sabbah S. CD8+ T cell immunity to Epstein-Barr virus and Kaposi’s sarcoma-associated herpes virus Seminars in Cancer Biology. Academic Press; 2008. pp. 416–422. 10.1016/j.semcancer.2008.10.005 [DOI] [PubMed] [Google Scholar]
  • 122.Martí-Carreras J, Maes P. Human cytomegalovirus genomics and transcriptomics through the lens of next-generation sequencing: revision and future challenges Virus Genes. Springer; 2019. pp. 138–164. 10.1007/s11262-018-1627-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Shou-Jiang Gao, Ashlee V Moses

1 Jun 2020

Dear Dr. Mullins,

Thank you very much for submitting your manuscript "Tumor-specific changes in Kaposi sarcoma-associated herpesvirus genomes in Ugandan adults with Kaposi sarcoma" for consideration at PLOS Pathogens. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by three independent reviewers. The reviewers appreciated the novelty and technical proficiency of the study, particularly the use of the high depth dUMI technique and the within-individual comparison of KS tumor samples with those in the oral cavity. However, each reviewer raised significant concerns about the manuscript, including specific aspects of the study methodology and the way in which some of the results were interpreted and discussed. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments. All of the issues deemed as major by the reviewers are, in my opinion, key issues that must be addressed in a revised manuscript, either with a more critical interpretation and discussion of the data and/or through performance of additional experiments. Several minor issues requiring attention were also highlighted. Reviewer suggestions/concerns that are recommended but not essential include modifying the title to more accurately reflect the study design, discussing the question of tumor metastasis in more detail, and revising the speculation about the role of K8.1 in tumorigenesis. 

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Ashlee V. Moses

Associate Editor

PLOS Pathogens

Shou-Jiang Gao

Section Editor

PLOS Pathogens

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************

Reviewer's Responses to Questions

Part I - Summary

Please use this section to discuss strengths/weaknesses of study, novelty/significance, general execution and scholarship.

Reviewer #1: The study describes the duplex sequencing using unique molecular identifies (dUMI) of 12 matching oral and tumor samples were obtained from 9 Ugandan study participants, with HIV –associated KS. Libraries were enriched for KSHV using RNA baits of 120bp 12x tiling of the KSHV genome, previously described. Briefly, the RNA baits generate 5X coverage of the positive strand of the KSHV genome. The authors further enriched for K1, ORF75, K15, ORF26, and the TR sequences of several KSHV cell lines and western and African strain sequences. Several genomes had the presence of split reads, indicating aberrant genome structures, which were described in further detail and confirmed by PCR. Gene copy numbers of the K2, ORF16, ORF50/RTA, and ORF73/LANA genes were determined and reported to correlate with the observed shifts in read coverage. Variants calling was performed and reported to be non-synonymous, which were mostly inconsistent between the matching oral and tumor samples.

Reviewer #2: This is a technically adept report of the variability of KSHV whole genomes in comparison of human KS tumor samples to those in the oral cavity from the same individuals (in most cases). The novelty of the manuscript lies in the application of the high-depth dUMI method and the resulting high-accuracy base calls and description of intra-host variation that has previously only focused on tumor samples from the same subject and never been reported at this depth. The findings of genomic aberrrations in tumor-derived but not in the oral samples are quite interesting but the results are perhaps over-interpreted as being tumor drivers (lines 51-52 and elsewhere) as opposed to a consequence of the genomic instability inherent in tumor microenvironments that may adversely impact the stability of KSHV episomes in the course of their replication and partitioning to daughter cells. Moreover, the finding that a signature aberration was found in more than one tumor is strong evidence for metastasis of a KS tumor, in the likely absenece of the capacity for the etiological agent to spread via an extracellualr virological mechanism. This reviewer felt this was an important finding that was curiously not highlighted to any significant degree in the manuscript. Because genomic aberration would not necessarily impact the entirety of the collection of KSHV episomes that typically persist in each infected cell, the results are consist with some of the viral genomes being altered and others being intact and the authors did mention that expression from intact genomes might complement the proagation of the defective genomes.

In light of previous findings, the justification for doing the work and for doing the work at the incredible level of sophistication was somewhat lacking given that the outcomes largely showed what might have been anticipated from the literature, with the exception of the genomic duplications, inversions and deletions. But as stated above, the significance of these in pathogenesis remains debatable.

There was some confusion in the presentation of the number of samples being analyzed versus those actually being being compared tissue to tissue. See Line 31 versus 107-108

In the PCR experiments designed to interrogate the genomic aberrations and identify the breakpoints by Sanger sequencing there are instances where one side of the junction is clearly identified and the other is only evident from the deep sequencing method. While this reviewers interpretation of the findings is largely in agreement with the authors', there is not real explanation offered as to why one junction is readily detect and another is not since they should be equimolar in the sample amplification reactions.

Reviewer #3: The manuscript described the whole KSHV genome analysis from KS tumor lesions of 9 Ugandan patients and 7 with paired saliva KSHV sequences for a total of 23 specimens analyzed by next generation sequencing. The results showed that there are very little intra-host sequence variabilities and there is no evidence of KSHV quasispecies or super infection within the same patient. This is in contrast to some earlier studies with analysis of specific KSHV genes, showing KSHV sequence variability and tissue compartmentalization, and will be important to document such findings. The authors also showed that there are however point mutations, sequence deletions, and gene rearrangements in the KS tissues in some of the patients, and in some cases in more than one lesion but not all the lesions analyzed within the same patients. This study in general was quite well done; the sequencing was carefully performed using the duplex sequencing with dUM1 tag, with very low sequence errors, and only unique viral template was sequenced to avoid duplications. This paper is a follow up of the RNAseq of the KSHV mRNA obtained from the same patients by the same group published recently. The study however, focused more on the technical details of sequencing and sequence analysis but less on the biology and significance of the findings. There are no major new findings but this sequencing method validates some of the previous findings like single KSHV subtype per person by Rose et al. Further discussion with emphasis on whether these mutations in KSHV caused tumorigenesis, or whether the rapid replication of cancer cells caused mutations in KSHV genomes, will be helpful.

**********

Part II – Major Issues: Key Experiments Required for Acceptance

Please use this section to detail the key new experiments or modifications of existing experiments that should be absolutely required to validate study conclusions.

Generally, there should be no more than 3 such required experiments or major modifications for a "Major Revision" recommendation. If more than 3 experiments are necessary to validate the study conclusions, then you are encouraged to recommend "Reject".

Reviewer #1: Major comments

1. The study could benefit enormously from including an expert in KSHV and herpesviruses. For instance, the premise that different subtypes of KSHV are associated with different disease manifestations is far from certain (line 79). The EBV data cited in the introduction, likewise, are still widely debated. The field has been down to many false paths due to overblown claims of that nature. The authors say as much on line 88.

Likewise, it is not at all clear if genome recombination is a consistent phenotype of KSHV or an artifact of lab culture. Recombination is rare and not normally associated with disease scores in any of the herpesviruses.

1. The authors overstate the significance of unique molecular identified for DNA sequencing and variation discovery of DNA material. The myriad of FDA-approved tests in cancer as well as other studies show that in the case of DNA, sufficient read depth can overcome many of the concerns raised in the introduction. Errors due to NGS methods play a much larger role in sequencing RNA viruses, as reverse transcriptase has a much higher error rate than modern DNA polymerase used for NGS.

2. It would have been better if in a manuscript about KSHV more relevant references about KSHV were provided and less for EBV and HIV. The dUMI technique and related approaches are sufficiently developed and now commercially available. There is no need to go into extensive detail about it.

3. Sequences need to be submitted to GenBank before acceptance.

4. The use of dUMI is novel and truly instills confidence in the analysis. It suggests that in the control ~ 5 reads give rise to one dUMI and above further collapse 50 raw reads per dUMI (line 251). How do the authors explain the difference in tumor vs oral dUMI coverage? The tumor samples all seem to work very well, but the majority of oral samples failed to yield satisfactory dUMI coverage, even though raw read coverage was similar to tumor coverage?

5. There is an alternative explanation to breakpoints not supported by junction reads, at least theoretical. Any KS tumor includes full-length latent genomes, perhaps defective genomes, perhaps integrated genomes, but also encapsulated virion DNA and, defected, replication intermediates. The proportion of these varies depending on the number of lytic cells in the lesions.

6. It is in the discussion where this paper veers off course. The authors did not generate ">100 error-free genomes" line 444, but only 12 for 12 tumors. The majority of oral samples have less than sufficient mean coverage.

7. The conclusion that the observed intra-tumor rearrangements were not random (line 465) is premature. First, the limited number of samples does not allow for a statistically significant answer to this hypothesis. Second, during latency any mutation outside the genes required for latent persistence is neutral. While mutations that interfere with latent replication are selected against at every cell division. It is premature to speculate on the oncogenic benefit of any of the observed rearrangements (line 470) or specific selection pressures against K8.1. (line 509).

Reviewer #2: The authors should consider moderating the implications of the genomic variation or genomic aberration of viral pathogenesis since it is not clear that either variation or aberration is driving disease, as opposed to a consequence of it.

In lines 92 - 94 the authors mention that intertypic recombinants exist suggesting that co-infection of divergent strains must occur, but in fact it would seem to require co-infection of single cells with differential genetic subtypes. it is unclear whether the sequencing data presented provides evidence of such intracellualr KSHV diversity from a single cell to support the potential for forming intertypic recombinants. This comes up again in the Discussion Lines 458-459- is it co-infection of a single cell or leading to recombination, or accumulation of KSHV episomes leading to some sort of replication error by the cellular replication machinery trying perhaps to resolve the repeat structures in the KSHV episomes? It would seem that the probability of such, otherwise unlikely, events would increase with the episomal burden in cells.

Line 460-462- This reviewer agrees that the genomic aberrations are profoundly interesting and COULD be important causally, but it was unclear (perhaps my fault) what the frequency of such aberrations were of the total KSHV genomes from within a given tumor harboring those mutations. Is this frequency variable and does it associate with any other parameters of KSHV or HIV-1 co-infection- for example tumor morphotype.

Figures 5 and 6; Fig 5-Are the anticipated product sizes for amplicons (4 and 5) deemed to be diagnostic of the rearrangement and duplication correctly calculated because none seem close to those on the Agarose gels accept 1, 2, and 3.

Fig 6- The absence of seemingly critical confirmatory PCR products confounds the appreciation and absolute confirmation of the HTS data. For example Reaction 3 and 4, no product? So it is all based on the interpretation of reaction 5. The figure legend suggest the bands were excised and Sanger sequenced, but there don't appear to be diagnostic bands. So it would seem that one junction is a 'known and the other is implied from but incompletely define

Reviewer #3: 1. Figure 4 C and D, it is not clear why 4D used BCBL and not U0007B as control, as in 4C?

2. For U003 there are a number of protein altering changes in the saliva sequence as compared to the tumor sequences, where are these changes, how many of these changes are common and where do they differ, and how do they affect the proteins’ functions?

3. Why did lane 4 of Fig 6G lane 4 did not show the 1kb PCR product if K2 is linked to TR as expected, while lane 5 did show a strong band that flanks the 3’ insertion site?

4. Fig 5S phylogenetic analysis is important since it compares the 9 KS sequences analyzed with the currently existing whole KSHV genome sequences, thus it should be shown as a regular figure, Moreover, the authors should comment on the UG156, why it is more closely related to the KSHV sequences from the western countries.

5. The KSHV sequences obtained are from homogenized tumor lysates, so how can the authors be certain that the final assembled KSHV sequence represent the actual viral genome in the tumor cells, whether multiple genotypes are within a single cell or in different tumor cell populations, or even are from extracellular viral particles.

6. The authors has focused the discussion on the significance of the role of alternation of the KSHV sequences in KS tumorigenesis and potential immune escape mechanisms by the virus. However, more emphasis should be placed on tumor DNA sequences instability as a result of rapid tumor cell growth and tumorigenesis, and may not have anything to do with viral pathogenesis and tumorigenesis. Moreover, it has been shown that KS tumor cell cultures without detectable KSHV sequences can be established. Thus, changes of KSHV sequences could occur after tumor establishments.

7. The authors have suggested that since K8.1 can generate an effective immune response and thus alternations in K8.1 gene could provide an advantage for viral escape, pathogenesis and tumor growth. However, it has been shown that KS patients still mount a robust humoral immune response against KS, including against K8.1. Thus, it is unlikely that changes in K8.1 paly an important with KS tumorigenesis.

8. The result showing that in some cases more than one tumor lesion in the same patient harbors the same KSHV alternations is interesting. This suggests that KS lesions could be clonal origin due to metastases, but could also arise independently. Such points should be emphasized in the discussion.

**********

Part III – Minor Issues: Editorial and Data Presentation Modifications

Please use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity.

Reviewer #1: 1. Minor comments

2. On line 253, the authors reference Figure 2B, which does not exist.

3. The author’s reference a previously described duplication in BAC-36 in line 261, and have incorrectly reported the duplication size as 19kb, instead of 9kb.

4. Table 1, page 13. The table is not formatted properly and indicates a missing field on the left-hand side. The mean read coverage seems to represent the number of reads and maybe misleading when looked at with supplementary Figure 3, which reports read coverage. The authors don’t describe if data was normalized. Please use the median instead of mean coverage and provide 95%CI or sd in table 1.

5. The Authors describe the inversion of the K8.1 exon in Tumor U003C, Line 330. The legend in Figure 4, describes the presence of the faint band seen in the BCBL-1 control TR-K8.1, Fig. 4D, as the presence of the forward primer amplicon. Perhaps different primers can be designed to confirm the band/or lack of a band in the control sample. It does not clearly state that the bands found in the control sample were confirmed by Sanger sequencing to be the forward primer amplicons. The text describing the data in Figure 4C in line 335 is different from the figure legend. The Figure and its legend indicate that the samples being compared are from different individuals as opposed to the same patient, as described in the text. There is a Typo in line 330, A ‘U’ is missing in the tumor sample number.

6. Line 368, the authors describe the breakpoints inside ORF11 and ORF18 and fusion into the TR. PCR primers are designed and confirm the 3’ fusion of ORF18 and the TR but not the fusion of ORF11 and the TR at the 5’ end, Fig. 6F, row 4. This is also shown in Fig. 6G, lane 4. The authors did not explain the presence of faint bands in Fig 6G, lane 7, 8, and 9, which may indicate the presence of these PCR products. Figure 6F, lane 5 referenced in line 377 is the same PCR product as the 3’ junction product reported in line 374, Figure 6G, lane 5. The wild type product of lane 2 in Figure 6F and G is much larger than the expected band size. In conclusion, no PCR confirmation was provided to verify this finding.

7. It could be argued that mutations are called when a low number of reads obtained, despite selecting mutations with a frequency of 100. As mentioned in their references, these mutations could be a consequence of DNA damage and other cellular events.

8. The resolution of the Figures could be improved on.

9. (line 301) If variants were supported by only one read, where these excluded from the consensus sequences submitted to GenBank?

10. line 417: Did this SNV disrupt the target sit of miR-K12-10? Please provide a mfold structure comparison.

11. line 439: Would this technique detect HIV, since it does not use reverse transcriptase or was HIV assayed by viral load assay?

12. Please remove line 521 "or with a helper virus" as there is no evidence for reinfection.

Reviewer #2: Some discussion of why a secondary library enrichment of low copy# samples does not bias the sequencing outcomes.

LINE 247- POLG not defined

LINE 275- How does going from a median of 0.17% to 1.3% equal a 6000-fold increase?

LINE 466-469- Brings up a good concept but sufferer from the lack of comparison to data from immunological studies to suggest the immune response to the gene products indicated are associated with protection against KSHV disease. The authors should discuss since Whitby and others have conducted whole proteome B and T cell repertoire screening without defining patterns of immunodominance or correlates of protection.

LINE 480- RNASeq or transcriptomic/array data, in this reviewer's experience, seems to suggest that there is no single pattern of KSHV expression associated with tumors. Almost exclusively lytic exclusviely latent, and mixed patterns have been detected in association with similarly staged tumorigenesis, so I not clear as to the accuracy of the statement.

LINE 489-491-This seems like a list of genes and functions without a clear linkage to the discussion above it. Is the suggestion from the RNASeq data from the same group demonstrating up or down regulation of these genes in concert with the IR1 aberration? The conclusion is not evident and could be clarified.

LINE 932-933- first statement is inaccurate since not all subject provided both samples.

Reviewer #3: Table 2 title needs to be changed. It titled as gene copy in tumor DNA but the table showed oral KSHV sequences which are not from tumors.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here on PLOS Biology: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/plospathogens/s/submission-guidelines#loc-materials-and-methods

Decision Letter 1

Shou-Jiang Gao, Ashlee V Moses

26 Oct 2020

Dear Dr Mullins,

Thank you very much for submitting your manuscript "Intra-host changes in Kaposi sarcoma-associated herpesvirus genomes in Ugandan adults with Kaposi sarcoma" for consideration at PLOS Pathogens. Your revised manuscript was reviewed both by members of the editorial board and two of the reviewers who considered the initial submission. The reviewers felt that the revised manuscript had successfully addressed most of the prior concerns, but felt that a more substantive discussion about the findings of your study and their implications was warranted. Most notably, please discuss in more depth the possibility that the deletions, duplications and changes detected in the tumor cells could be due to the genome instability of the cells, rather than being the cause of tumorigenesis. In addition, please address the query about the figure designation on line 265. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. 

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. 

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Ashlee V. Moses

Associate Editor

PLOS Pathogens

Shou-Jiang Gao

Section Editor

PLOS Pathogens

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************

Reviewer Comments (if any, and for reference):

Reviewer's Responses to Questions

Part I - Summary

Please use this section to discuss strengths/weaknesses of study, novelty/significance, general execution and scholarship.

Reviewer #1: The authors have addressed the reviewers' concerns.

Reviewer #3: This is a highly technical but interesting study on a unique set of KS patients’ specimens, which include matching saliva and tumor tissues from the same patients. Some with multiple tissue biopsies. There were 7 matched saliva and tumor tissues analyzed, and 4 individuals have more than 1 tumor lesions characterized. The authors showed that there were essentially no intra-patient variations among the sequences analyzed within the same individual, in contrast to some previous studies which have analyzed part of the KSHV genomes and demonstrated intra-patients’ variations. The authors here found that there were gene duplications, inversions and mutations in the tumors from 4 out of the 7 cases. The strength is that the whole KSHV genome sequencing used the molecular identifiers (dUMI) to barcode individual DNA molecules before library amplification so minimize PCR generated mutations. The study was also meticulously conducted and analyzed. The indications of reversions and duplications based on the number of reads were carefully confirmed and analyzed. The revised manuscript has also addressed most of the previous reviewers concerns. The concerns were mostly on the over interpretation of the findings based on the limited cases analyzed. This revision has tempered down most of the discussion on the mechanistic implications of the findings and the authors have made changes to the text. However, there are a couple of points that the authors could further consider.

1. This reviewer agrees with the previous reviewers that the findings on the deletions, duplications and changes are most likely due to the result of genome instability of the tumor cells rather than the cause of tumorigenesis. Similar changes in multiple lesions can be the results of the metastases of tumor cells harboring the mutations in the KSHV genome. This was demonstrated by case U003 that only 3 out of 5 lesions have the mutations and the other two did not, and were similar to the saliva KSHV sequences. This suggests that KS can develop in the absence of changes in the KSHV genome, and changes have likely occurred after tumorigenesis, and thus KSHV genome changes may not have substantial influence on pathogenesis. Further discussing this possibility will be helpful.

2. As one of the reviewer has pointed out “the justification for doing the work and for doing the work at the incredible level of sophistication was somewhat lacking ….” Further strengthening the rationale will be useful.

3. Line 265, are the authors referring to Fig. 1B rather than Fig. 3B?

**********

Part II – Major Issues: Key Experiments Required for Acceptance

Please use this section to detail the key new experiments or modifications of existing experiments that should be absolutely required to validate study conclusions.

Generally, there should be no more than 3 such required experiments or major modifications for a "Major Revision" recommendation. If more than 3 experiments are necessary to validate the study conclusions, then you are encouraged to recommend "Reject".

Reviewer #1: NA

Reviewer #3: (No Response)

**********

Part III – Minor Issues: Editorial and Data Presentation Modifications

Please use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity.

Reviewer #1: The conflict of interested statement should be expanded upon to explain that the authors company expressly markets the technology used in this manuscript.

Reviewer #3: (No Response)

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/plospathogens/s/submission-guidelines#loc-materials-and-methods

Decision Letter 2

Shou-Jiang Gao, Ashlee V Moses

24 Nov 2020

Dear Dr. Mullins,

We are pleased to inform you that your manuscript 'Intra-host changes in Kaposi sarcoma-associated herpesvirus genomes in Ugandan adults with Kaposi sarcoma' has been provisionally accepted for publication in PLOS Pathogens.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

Ashlee V. Moses

Associate Editor

PLOS Pathogens

Shou-Jiang Gao

Section Editor

PLOS Pathogens

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************************************************

Reviewer Comments (if any, and for reference):

Acceptance letter

Shou-Jiang Gao, Ashlee V Moses

13 Jan 2021

Dear Mullins,

We are delighted to inform you that your manuscript, "Intra-host changes in Kaposi sarcoma-associated herpesvirus genomes in Ugandan adults with Kaposi sarcoma," has been formally accepted for publication in PLOS Pathogens.

We have now passed your article onto the PLOS Production Department who will complete the rest of the pre-publication process. All authors will receive a confirmation email upon publication.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Pearls, Reviews, Opinions, etc...) are generated on a different schedule and may not be made available as quickly.

Soon after your final files are uploaded, the early version of your manuscript, if you opted to have an early version of your article, will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. dUMI-adaptors and primers for duplex sequencing.

    During library preparation, sheared DNA fragments were A-tailed and ligated with forked, double-stranded oligonucleotides containing Illumina TruSeq universal adaptor sequences, 12-random base pairs as dUMI and spacer sequences. The adapted DNA libraries were PCR amplified before enrichment with primers mws13 and mws20, which bind to Illumina Truseq adaptors. Primer mws21 containing sample index ID for multiplex sequencing was used for PCR following enrichment. DNA libraries post processing are shown at the bottom.

    (TIF)

    S2 Fig. Workflow of genome assembly and variant analysis.

    KSHV genomes were first assembled de novo from sequence reads of each sample, before being used as reference for mapping their respective dUMI-consensus reads (adapted from [61], see details in the Methods section). Discrepancies in bases between the sample-consensus genome and mapped dUMI-consensus reads were taken to be real intra-sample variants.

    (TIF)

    S3 Fig. Read coverage in all reported samples.

    Raw (light blue), sUMI (blue) and dUMI-consensus (dark blue) read coverage in log scale along the de novo assembled, sample-consensus KSHV genomes in tumors (A) and oral swabs (B) examined in this study. Major repeat regions were masked and seen here as no coverage regions.

    (TIF)

    S4 Fig. Potential sequencing artifacts.

    (A) KSHV intrasample variant frequency as a function of read coverage. Sample variant frequencies were shown in Table 1 when at least 100 viral genomes were sampled, since below that level, minor variant frequencies were judged to be unreliable. (B) Intra-sample minor variants detected in dUMI-consensus reads of all samples by type of base substitution.

    (TIF)

    S5 Fig. U008-B and U008-D were from distinct lesions on the left leg.

    U008-B biopsy was obtained from lesions in the upper thigh, while U008-D was biopsied from a large lesion on the knee.

    (TIF)

    S6 Fig. Mutations of KSHV genomes in tumors from participants 004 and 003.

    (A) Alignment of KSHV genomes from Participant 004, showing a 28-bp deletion in the K8.1 promoter in U004-D. U004-D and U004-C are from tumors while U004-o1 is from an oral swab. (B) The only intra-host synonymous mutation found in this study, within miR-K10 in participant 003. (C) Sequence chromatograms of miR-K10 in other tumors of participant U003, with a T in all tumors and a mixture of T and the database consensus C in a minority of viruses in U003-G.

    (TIF)

    S7 Fig. Predicted secondary structure of the stem loop precursor of miR-K10a-3p.

    The structure of pre-miR-K12-10a was predicted using mfold (http://unafold.rna.albany.edu/?q=mfold/RNA-Folding-Form), indicating the mature miRNA, seed sequence (grey) and the intra-host polymorphism (red) found in participant U003. The G➔A change in RNA sequence resulted in a slightly more stable stem loop (ΔG -32.40 ➔ -32.70).

    (TIF)

    S1 Table. Adapter, primer and probe sequences used for library prep and ddPCR.

    (XLSX)

    S2 Table. Anatomic sites, biopsy times and tumor morphologies of the KS lesions.

    (XLSX)

    S3 Table. Tumor KSHV-unique mutations observed, in gene order.

    (XLSX)

    S4 Table. Number of dUMI-consensus reads from BCBL-1, tumor and oral swab samples mapping to EBV genome (strain B-95/AG876).

    (XLSX)

    Attachment

    Submitted filename: Reviewer responses_final.docx

    Attachment

    Submitted filename: Reviewer_response2.docx

    Data Availability Statement

    All whole KSHV genome sequence files are available from the NCBI, with GenBank Accession numbers: MT510648, MT510670, MT936340. All other relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS Pathogens are provided here courtesy of PLOS

    RESOURCES