Abstract
A prevalence of around 26% of human papillomavirus (HPV) in head and neck squamous cell carcinoma (HNSCC) has been previously reported. HPV induced oncogenesis mainly involving E6 and E7 viral oncoproteins. In some cases, HPV viral DNA has been detected to integrate with the host genome and possibly contributes to carcinogenesis by affecting the gene expression. We retrospectively assessed HPV integration sites and signatures in 80 HPV positive patients with HNSCC, by using a double capture‐HPV method followed by next‐generation Sequencing. We detected HPV16 in 90% of the analyzed cohort and confirmed five previously described mechanistic signatures of HPV integration [episomal (EPI), integrated in a truncated form revealing two HPV‐chromosomal junctions colinear (2J‐COL) or nonlinear (2J‐NL), multiple hybrid junctions clustering in a single chromosomal region (MJ‐CL) or scattered over different chromosomal regions (MJ‐SC) of the human genome]. Our results suggested that HPV remained episomal in 38.8% of the cases or was integrated/mixed in the remaining 61.2% of patients with HNSCC. We showed a lack of association of HPV genomic signatures to tumour and patient characteristics, as well as patient survival. Similar to other HPV associated cancers, low HPV copy number was associated with worse prognosis. We identified 267 HPV‐human junctions scattered on most chromosomes. Remarkably, we observed four recurrent integration regions: PDL1/PDL2/PLGRKT (8.2%), MYC/PVT1 (6.1%), MACROD2 (4.1%) and KLF5/KLF12 regions (4.1%). We detected the overexpression of PDL1 and MYC upon integration by gene expression analysis. In conclusion, we identified recurrent targeting of several cancer genes such as PDL1 and MYC upon HPV integration, suggesting a role of altered gene expression by HPV integration during HNSCC carcinogenesis.
Keywords: carcinogenesis, head and neck squamous cell carcinoma, HPV copy number, HPV integration, MYC, PDL1
Five previously described mechanistic signatures of human papillomavirus (HPV) integration [episomal, integrated in a truncated form revealing two HPV‐chromosomal junctions colinear (2J‐COL) or nonlinear (2J‐NL), multiple hybrid junctions clustering in a single chromosomal region (MJ‐CL) or scattered over different chromosomal regions (MJ‐SC) of the human genome] are confirmed in our head and neck squamous cell carcinoma cohort. Two hundred sixty‐seven HPV‐human junctions scattered on most chromosomes are reported.
Abbreviations
- 2J‐COL
two colinear junctions
- 2J‐NL
two nonlinear junctions
- 2J‐UN
unclassified two junctions
- ASCC
anal squamous cell carcinoma
- CC
cervical cancer
- DFS
disease free survival
- ecDNA
hybrid human–virus extra chromosomal DNA
- EPI
episomal
- HNSCC
head and neck squamous cell carcinoma
- HPV
human papillomavirus
- KLF
Krüppel‐like factors
- M2
pro‐tumoural polarized M2 macrophages
- MJ‐CL
multiple hybrid junctions clustered in a single chromosomal region
- MJ‐SC
multiple hybrid junctions scattered over different chromosomes
- nf‐VIF
nextflow‐based Virus Insertion Finder
- NGS
next generation sequencing
- OPSCC
oropharynx squamous cell carcinoma
- OS
overall Survival
- R1
cancer cells present microscopically on surgically resected specimen
- SCC
squamous cell carcinoma
- TNM
international staging system tumour‐node‐metastasis
1. Introduction
More than 90% of head and neck cancers are squamous cell carcinomas (HNSCC), predominantly affecting the oropharynx, but also the oral cavity, hypopharynx and larynx. HNSCC is the seventh most common cancer worldwide [1]. Treatment options for non‐metastatic HNSCC include surgery or radiation therapy (+/− induction or concomitant chemotherapy) [2]. Relapse is frequent: 10–20% of cases for stage I‐II and up to 50% of cases for stage III. Recurrent/metastatic HNSCC are associated with poor prognosis with an overall survival (OS) inferior to one year. New standards of care for recurrent and/or metastatic HNSCC now include PD1 targeting therapies [3, 4, 5].
HNSCC main risk factors are alcohol and tobacco use and human papillomavirus (HPV) infection. The overall HPV prevalence in HNSCC is 26% and reaches up to 80–90% for oropharyngeal squamous cell carcinomas (OSCC) [6]. A recent review highlighted that the incidence of HPV‐related HNSCC cancers increased in the last decades regardless of age group [7]. HPV role in tumour initiation and progression is clearly established in cervical, anal canal, oropharynx, vulva and penile carcinomas [8].
HPV is a small double‐strand DNA virus of approximately 8000 base pairs. More than 200 genotypes have been identified, among which 13 high risk HPVs are recognized as carcinogens by the International Agency for Research on Cancer. During the oncogenic process, the viral genome can be maintained as extrachromosomal nuclear episomes and/or it can be integrated into the host genome [9]. Carcinogenesis is primarily driven by E6 and E7 viral oncogenes when HPV remains as an episome or if integration happens outside a gene. Then, E6 and E7 oncoproteins expression respectively disrupts p53 and pRb tumour suppressors. Whereas, integration into or near a human cancer related gene can affect the gene expression and directly contribute to carcinogenesis via chromosome instability, gene disruption, or regional amplifications within the cellular genome [10]. HPV integration has been reported to cause E1/E2 disruption and expression of alternate E6 transcripts [10, 11, 12, 13].
We previously developed the HPV double capture method followed by next‐generation sequencing (NGS) to: (a) precisely determine HPV genotypes, (b) characterize the HPV sequence (full‐length when HPV remained episomal or truncated when integrated), (c) analyze the viral‐host DNA junctions, (d) quantify the HPV copy number per sample and (e) assess the sites and signatures of HPV integration into the human genome [14]. We previously identified five mechanistic signatures in the small cohort of cervical cancer. The signatures were: episomal (EPI), integrated in a truncated form revealing two HPV‐chromosomal junctions colinear (2J‐COL) or non‐linear (2J‐NL), multiple hybrid junctions clustering in a single chromosomal region (MJ‐CL) or scattered over different chromosomal regions (MJ‐SC) of the human genome. Our team recently applied this technique in large cohorts of HPV‐positive anal squamous cell carcinomas (ASCC) [15] and cervical cancer (CC) patients [16] to assess prognostic and predictive value of these HPV genomic signatures.
Several studies across different tumour localizations focused on HPV integration and its impact on gene expression and immune microenvironment modulation, but much remains to be confirmed [10, 17].
Nonintegrated HPV has been previously reported to exert oncogenic effects while maintaining status as viral circular DNA in oropharynx squamous cell carcinoma (OPSCC) [21] Viral integrations were also reported to induce human–viral hybrid amplicons containing both viral DNA and segments from the human genome [22], suggesting three categories of HPV DNA including, episomal, integrated and human–viral episomal hybrids [23]. Hybrid human–virus extra chromosomal DNA (ecDNA) was shown to induce an increase in HPV oncogenes E6 and E7 copy number. More recently, the interaction of the HPV genome with cancer‐associated ecDNA was shown to induce high expression of hybrid viral–human oncogene transcripts in OPSCC [24].
Previous reports in the literature identified HPV integration sites in different types of squamous cell carcinomas and particularly in cervical cancer. Frequent sites were reported in the MYC, TMEM49, FANCC and RAD51B genes. Conversely to cervical cancer, few studies documented in the literature report HPV integration sites in HNSCC. Parfenov et al. revealed some similarities in HNSCC, with HPV integration events targeting RAD51B, NR4A2 and TP63 [18, 21, 25, 26].
Studies in cervical cancers showed that episomal HPV confers a relatively favorable survival as compared to integrated HPV [27, 28, 29]. In HNSCC, high HPV copy number was correlated with the episomal status associated with a significantly better prognosis. On the other hand, low HPV copy numbers were correlated with HPV integration and a poor prognosis [29, 30]. Another study suggests that an intact E2 gene is correlated to good prognosis [31].
In the current manuscript, we will assess HPV integration sites and signatures and their prognostic value in HNSCC patients. We aimed to identify integration hotspots implicated in oncogenesis. We also evaluated the impact of HPV integration on the expression of major cancer and immune‐related genes.
2. Methods
2.1. Patients and samples
Eighty‐two HPV positive frozen tumour samples from initial biopsies or surgical specimens from 80 patients were retrospectively retrieved from the tissue bank of four French hospitals: Institut Curie, Paris (BB‐0033‐00048), IUCT‐Oncopole, Toulouse (BB‐0033‐00014), Centre Léon Bérard, Lyon (BB‐0033‐00050), and Centre Antoine Lacassagne, Nice (CRB‐CAL/ BB‐0033‐00098). All patients were treated for localized or locally advanced HPV positive HNSCC between 1997 and 2017. Samples were obtained prior to treatment. Matched initial tumour sample and synchronous node sample from cervical node dissection were available for two patients. According to the French regulations and ethical requirements, patients were informed about the research performed on tissue specimens and did not express opposition. The study methodologies conformed to the standards set by the Declaration of Helsinki. The analyses were approved by the internal review board.
2.2. Genomic DNA and total RNA extractions
Tumour samples were frozen in liquid nitrogen in a cryotube immediately after surgery and stored at −80 °C. Tumoural cellularity was verified systematically by performing cryosections and by macro‐dissecting tumoural zones. Total genomic DNA was extracted with phenol‐chloroform after proteinase K digestion, followed by the precipitation of nucleic acids in ethanol [32]. Total RNA was extracted with miRNeasy Mini kit Qiagen following supplier’s recommendations. The quality of RNA was verified by migration on agarose gel. Nucleic acids were quantified using Nanodrop spectrophotometer ND‐1000 (ThermoScientific, Wilmington, DE, USA).
2.3. HPV genotyping
HPV detection and genotyping were first performed using real‐time PCR. The techniques and primers have been previously described [33, 34].
2.4. DNA library preparation
The DNA libraries were prepared using 500 ng of genomic DNA. The fragmentation used ultrasonication (Covaris) to produce double‐strand DNA fragments of 290 bp average length. The Kapa Hyper Prep kit Roche was used for the next steps, according to the manufacturer’s instructions: (a) End‐Repair and A‐tailing, (b) ligation of adapters, containing unique barcodes for each sample, specific to the Illumina (San Diego, CA, USA) technology for amplification and sequencing, (c) purification and size selection with Agencourt AMPure XP, (d) pre‐capture PCR and (e) purification.
2.5. HPV double capture method and NGS sequencing
The double capture was carried out using the SeqCap EZ Rapid Library Small Target Capture method, developed by Roche NimbleGen, which is adapted to capture small DNA targets [14]. The DNA libraries were multiplexed by 12 samples and hybridized with the biotinylated HPV oligonucleotide probes that recognize all HPV subtypes. The DNA sequences were then captured by streptavidin beads and amplified by PCR. We performed two rounds of hybridization and capture to improve the specificity and to enrich the fraction of relevant reads for subsequent bioinformatics analysis. Post‐capture libraries were sequenced using Illumina MiSeq system, in 150 bp paired‐end reads, with 24 samples multiplexed on a V2 micro flow‐cell.
2.6. Bioinformatics pipeline for analysis of HPV genotypes and viral‐cellular junctions
To simplify the post‐sequencing analyses, raw sequencing reads were analyzed with the nf‐VIF (nextflow‐based Virus Insertion Finder) bioinformatics pipeline (v1.0.1) [16]. The pipeline is available for free public access: https://github.com/bioinfo‐pf‐curie/nf‐VIF. Briefly, nf‐VIF was developed to detect and genotype the HPV strain(s) in the samples and to precisely map the integration sites on the Human genome. The different steps of the analysis are; (a) reads cleaning and quality controls, (b) HPV genotyping, (c) local alignment on detected HPV strain(s), (d) detection of putative HPV breakpoints using soft‐clipped reads, (e) soft‐clipped reads alignment on Human genome reference, (f) detection of integration loci and filtering of the results and (g) presentation of results in a dynamic report. nf‐VIF reports the number of reads and the positions of the HPV/human junctions with their genomic positions. We used the GRCh37/hg19 on the UCSC Genome Browser (https://genome.ucsc.edu/) to identify the corresponding genes. When the breakpoints were intergenic, we recorded the closest 3’ and 5’ genes.
Additionally, viral integration status was determined using three parameters: (a) human‐virus DNA junction detection as described above, (b) ratio of viral genes E2 and E6 read depth and (c) E2 deleted fraction (i.e., fraction of the gene locus without coverage). Using strain specific coordinates, the depth of coverage was computed from the aligned BAM files using the R–Bioconductor package GenomicAlignments1 (package version 1.28.0, R version 4.1). Samples with no junction detected were considered to exclusively host episomal HPV while samples with at least one junction detected and E2 gene loss (i.e., E2/E6 mean read depth ratio of less than 0.1 or E2 deleted fraction higher than 10%) were classified as hosting fully integrated HPV. Remaining cases were labelled as mixed, having both integrated viral DNA and episomal forms [35] (Tables S1 and S2, Fig S1A and B).
2.7. HPV copy number
HPV copy number was calculated as the ratio of the number of reads mapped to HPV genome over the number of reads mapped to the human KLK3 gene used as a diploid control gene. KLK3 was chosen as a reference since its size is similar to HPV genome and is not altered in HNSCC. Similar results were obtained with two other reference genes (GAPDH and RAB7A). To determine the optimal cutoff value for HPV copy number, we used CutoffFinder.R. This tool allows cutoff optimization for biomarkers that are investigated in research [36].
2.8. Description of HPV genomic signatures
The absence of HPV‐human chromosomal junction reads indicated that the viral genome was maintained in its episomal form: this signature was named EPI. In the presence of HPV integration, the viral genome and its breakpoints as well as the coordinates of the integration sites in the human genome were identified by pure HPV reads and HPV‐human chromosomal junction reads. Integration in a truncated form revealing two HPV‐chromosomal colinear junctions is called 2J‐COL or non‐linear named 2J‐NL. Integration with multiple hybrid junctions clustering in a single chromosomal region is referred as MJ‐CL or MJ‐SC when junctions are scattered over different chromosomal regions (MJ‐SC) of the human genome. In some cases, we identified a single HPV‐chromosomal junction corresponding to two junctions with a lost junction; it was named 2J‐UN.
2.9. Real‐time quantitative reverse transcription PCR
To characterize the consequences of HPV integration on gene expression we performed real‐time quantitative PCR. We designed primers for our integration hotspot genes: PLGRKT, PDL1 (also known as CD274), PDL2 (also known as PDCD1LG2 or CD273), MYC and PVT1. Total RNA was available for 44 patients including three among the four patients with integration in the PDL1/PDL2/PLGRKT region, and two among the three patients with integration in the PVT1/MYC region. No RNA was available for the patients with integration near MACROD2 and KLF5/KLF12 regions. We selected 51 additional genes mainly involved in the immune process, in particular 17 coding for immune checkpoints, 14 for immune cell populations and 8 for chemokines.
cDNA synthesis and PCR‐reaction conditions have been previously described elsewhere [37]. Target gene expression levels were normalized on the basis of TBP contents (Genbank accession number NM_003194) used as an endogenous RNA control. The expression values of the tumour samples were normalized such that the median of the expression values of 13 normal head and neck tissues was one.
Primers for target genes were designed with the assistance of Oligo 6.0 computer program (National Biosciences, Plymouth, MN). To avoid amplification of contaminating genomic DNA, one of the two primers was placed at the junction between two exons or on two different exons. Agarose gel electrophoresis was used to verify the specificity of PCR amplicons. The sequences of the oligonucleotide hybridization primers are available upon request.
2.10. Copy number variant analysis using 500 gene panel
Sequencing was performed using an in‐house NGS panel of 571 genes, called DRAGON Dx (Detection of Relevant Alterations in Genes involved in Oncogenetics). Indexed paired‐end libraries of tumour DNA were performed using the Agilent Sureselect XT2 library prep kit. The kit supports sequencing targeted regions of the genome spanning 2.7 Mb. About 50 ng of input DNA were used to build the libraries according to the manufacturer’s protocol. The pool was finally sequenced on a NovaSeq 6000 (Illumina) S2 × 150 bp flow cell.
Targeted chromosomal regions were annotated according to UCSC database [38]. For each region, reads were counted and a double normalization was performed: (a) normalization by the total number of reads in each sample and (b) normalization by median coverage of all samples. After normalization, the reference value was 1.
2.11. Statistical analysis
To compare signatures’ repartition between HNSCC, ASCC and CC we used Chi‐square test. Correlations between HPV genomic signatures and clinical, histological and molecular features were assessed using a Chi‐square test (with Yates correction if appropriate). Given the small sample size, we pooled the 2J‐UN, 2J‐COL and 2J‐NL together and the MJ‐CL and MJ‐SC together. The three groups were EPI, 2J and MJ. Overall survival (OS) was defined as the time interval from the date of diagnosis to death. Disease free survival (DFS) was defined as the time interval from diagnosis to the date of relapse. Survival data were censored at the date of last follow‐up. We draw the survival curves using the Kaplan–Meier method and compared them using the log‐rank test. The optimal cutoff value for the HPV copy number was established using CutoffFinder.R. A low HPV copy number was <9 and a high HPV copy number was ≥9. To compare gene expression levels according to the signature group, we used non‐parametric tests (Mann–Whitney or Kruskal–Wallis depending on effectives). For all statistical tests, the limit of significance was defined as P < 0.05. All analyses were performed using Prism 6.0 software (GraphPad Inc., La Jolla, CA, USA).
3. Results
3.1. Patient characteristics and association with survival
Eighty patients with HPV positive HNSCC were included. Tumours were squamous cell carcinoma arising from the oropharynx (n = 73/80, 91.2%) or other head and neck anatomical sites (n = 7/80, 8.8%). Patients’ characteristics, tumour’s characteristics and treatments are presented in Table 1. Eight patients (n = 8/79, 10.1%) were treated by surgery (alone or post‐induction chemotherapy), 15 patients (n = 15/79, 19%) had exclusive radiation therapy as a local treatment (with or without concomitant chemotherapy) and 54 patients (n = 54/79, 68.4%) had surgery and additional radiation therapy indicated for a R1 resection and/or invasion of the nodal capsule. Two patients did not receive any curative treatment because of impaired general status. As expected age (P = 0.02) and tumour stage (P = 0.002) were associated with OS.
Table 1.
Heading | Patients (%) | Events (%) | OS (P‐value) h |
---|---|---|---|
Total | 80 (100) | 26 (32.5) | |
Age (years) | |||
<65 | 45 (56.2) | 10 (22.2) | |
≥65 | 35 (43.8) | 16 (45.7) | 0.02 |
Gender | |||
Male | 60 (75) | 21 (35) | |
Female | 20 (25) | 5 (25) | 0.44 (NS) |
Tobacco a | |||
Yes | 48 (60.8) | 18 (37.5) | |
No | 31 (39.2) | 7 (22.6) | 0.32 (NS) |
Alcohol b | |||
Yes | 19 (25) | 7 (36.8) | |
No | 57 (75) | 18 (31.6) | 0.7 (NS) |
Localization | |||
Oropharynx | 73 (91.2) | 22 (30.1) | 0.12 (NS) |
Non‐oropharyngeal | 7 (8.8) | 4 (57.1) | |
Tumour stage c | |||
I | 20 (26.7) | 4 (20) | 0.002 |
II | 32 (42.7) | 7 (21.9) | |
III | 22 (29.3) | 12 (54.5) | |
IV | 1 (1.3) | 1 (100) | |
Lymph node invasion c | |||
Yes | 64 (85.3) | 21 (32.8) | 0.68 (NS) |
No | 11 (14.7) | 3 (27.3) | |
HPV genotype | |||
Genotype 16 | 72 (90) | 22 (30.5) | 0.16 (NS) |
Other genotypes | 8 (10) | 4 (50) | |
Tumour differentiation d | |||
Well/moderate | 32 (56.1) | 14 (43.8) | 0.36 (NS) |
Poor | 25 (43.9) | 8 (32) | |
Mitotic index e | |||
Low/moderate | 8 (20) | 3 (37.5) | 0.93 (NS) |
High | 32 (80) | 11 (34.4) | |
Perineural invasion f | |||
Yes | 12 (28.6) | 4 (33.3) | 0.97 (NS) |
No | 30 (71.4) | 8 (26.7) | |
Lymphovascular invasion g | |||
Yes | 17 (37) | 6 (35.3) | 0.80 (NS) |
No | 29 (63) | 10 (34.5) | |
Initial therapy a | |||
Surgery with or without induction CT | 8 (10.1) | 1 (12.5) | 0.55 (NS) |
Exclusive (chemo)RT | 15 (19) | 5 (33.3) | |
Surgery followed by (chemo)radiation | 54 (68.4) | 19 (35.2) | |
None | 2 (2.5) | 1 (50) |
Data available for 79 patients.
Data available for 76 patients.
Data available for 75 patients.
Data available for 57 patients
Data available for 40.
Data available for 42.
Data available for 46.
Univariate analysis (Kaplan–Meier method).
3.2. HPV genotypes
HPV genotype, initially determined by PCR, was confirmed by the double Capture‐HPV method followed by NGS. HPV16 was the most common genotype (n = 72/80, 90%) (Table 1). The other HPV genotypes were HPV 33 (n = 4/80, 5%), HPV 35 (n = 2/80, 2.5%), HPV 26 (n = 1/80, 1.2%) and HPV 56 (n = 1/80, 1.2%). We did not find patients co‐infected with several HPV genotypes.
3.3. Integrations signatures
When determining viral integration status by comparing the mean depth of coverage at the E2 and E6 genes loci, we identified 3 groups of patients with episomal, integrated and mixed (Tables S1 and S2; Fig. S1). Thirty‐one patients harbored episomal HPV (n = 31/80, 38.8%), whereas 49 (n = 49/80, 61.2%) displayed HPV integration among whom only 6 patients had a pure integration (Tables S1 and S2). The ratio of E2/E6 expression was differential among episomal and integrated/mixed confirming that E2 expression level is lower in HPV integrated samples (Table S2).
In addition to the 31/80 episomal HPV, the mechanistic HPV genomic signatures of the 49 integrated and mixed HPV tumours, were MJ‐SC (n = 25/80, 31.3%), MJ‐CL (n = 7/80, 8.7%), 2J‐COL (n = 7/80, 8.7%), 2J‐NL (n = 3/80, 3.8%) and seven 2J‐UN (n = 7/80, 8.7%) (Fig. 1). The distribution of HPV signatures was statistically close to the one observed in our cohort of 93 ASCC (P = 0.94) (Fig. S2A) [15] but markedly different from that observed in our cohort of 272 CC (P < 0.0001) (Fig. S2B) [16].
3.4. Viral genes deletion
All patients displaying HPV integration exhibited deletion in some viral genes, albeit systematically retaining E6 and E7 oncogenes. Deletions were observed in E1, E2, E5, L1 or L2 viral genes (data not shown).
3.5. Patients with synchronous tumour samples
Both tumour and lymph node samples at diagnosis were available for two patients. HPV16 was the viral genotype in both cases. The signatures were the same at both sites (one patient with EPI and one with MJ‐SC). The Table S3 shows the breakpoints of the MJ‐SC patient. The four same breakpoints were found in the primary tumour and cervical nodes. However, four additional breakpoints were present in the primary tumour. Of note, two additional breakpoints (supported by respectively eight and five reads) were detected on the primary tumour and not on the lymph nodes samples.
3.6. Association between HPV genomic signatures and clinico‐biological characteristics
Table 2 presents the distribution of HPV genomic signatures (grouped into EPI, 2J and MJ) according to the patients’ clinical and pathological characteristics. We observed no significant association between HPV genomic signatures and characteristics (Table S4). However, we noted a trend towards an association between the EPI signature and absence of lymph node invasion (n = 7/29, 23.3% in the EPI population vs. n = 4/46, 8.7% in the HPV integrated population, P = 0.07) and for the absence of lymphovascular invasion (n = 11/14, 80% in the EPI vs. n = 18/32, 56.2% in the HPV integrated population, P = 0.14). We also found a trend for an association between the HPV signature and HPV copy number (P = 0.06). The association is statistically significant between 2J signature and low HPV copy number (n = 12, 70.6%), as compared to the “not 2J” population (n = 24, 38%) (P = 0.020).
Table 2.
Heading | Patients (%) | Number of patients (%) | P‐value h | ||
---|---|---|---|---|---|
EPI | 2J | MJ | |||
Total | 80 (100) | 31 (38.8) | 17 (21.2) | 32 (40) | |
Age (years) | 0.51 (NS) | ||||
<65 | 45 (56.2) | 15 (50) | 10 (58.8) | 20 (62.5) | |
≥65 | 35 (43.8) | 16 (50) | 7 (41.2) | 12 (37.5) | |
Gender | 0.44 (NS) | ||||
Male | 60 (75) | 23 (71.9) | 11 (64.7) | 26 (81.2) | |
Female | 20 (25) | 8 (28.1) | 6 (35.3) | 6 (18.8) | |
Tobacco a | 0.76 (NS) | ||||
Yes | 48 (60.8) | 17 (58.1) | 10 (58.8) | 21 (65.6) | |
No | 31 (39.2) | 13 (41.9) | 7 (41.2) | 11 (34.4) | |
Alcohol b | 0.44 (NS) | ||||
Yes | 19 (25) | 5 (20.7) | 4 (23.5) | 10 (33.3) | |
No | 57 (75) | 23 (79.3) | 13 (76.5) | 21 (67.7) | |
Localization | 0.62 (NS) | ||||
Oropharynx | 73 (91.2) | 29 (93.8) | 16 (94.1) | 28 (87.5) | |
Non‐oropharyngeal | 7 (8.8) | 2 (6.2) | 1 (5.9) | 4 (12.5) | |
Tumour stage c | 0.40 (NS) | ||||
I | 20 (26.7) | 9 (29.0) | 4 (26.7) | 7 (23.3) | |
II | 32 (42.7) | 11 (35.5) | 5 (33.3) | 16 (53.3) | |
III | 22 (28.3) | 10 (35.5) | 5 (33.3) | 7 (23.3) | |
IV | 1 (1.3) | 0 (0) | 1 (6.7) | 0 (0) | |
Lymph node invasion c | 0.18 (NS) | ||||
Yes | 64 (85.3) | 22 (76.7) | 14 (93.3) | 28 (90.3) | |
No | 11 (14.7) | 7 (23.3) | 1 (6.7) | 3 (9.7) | |
HPV genotype | 0.96 (NS) | ||||
Genotype 16 | 72 (90) | 28 (90.6) | 15 (88.2) | 29 (90.6) | |
Other genotypes | 8 (10) | 3 (9.4) | 2 (11.8) | 3 (9.4) | |
Tumour differentiation d | 0.33 (NS) | ||||
Well/moderate | 32 (56.1) | 14 (51.9) | 5 (41.7) | 13 (68.4) | |
Poor | 25 (43.9) | 12 (48.1) | 7 (58.3) | 6 (31.6) | |
Mitotic index e | 0.41 (NS) | ||||
Low/moderate | 8 (20) | 5 (22.7) | 0 (0) | 3 (23.1) | |
High | 32 (80) | 16(77.3) | 6 (100) | 10 (76.9) | |
Perineural invasion f | 0.32 (NS) | ||||
Yes | 12 (28.6) | 6 (37.5) | 1 (11.1) | 5 (27.8) | |
No | 30 (71.4) | 9 (62.5) | 8 (88.9) | 13 (72.2) | |
Lymphovascular invasion g | 0.34 (NS) | ||||
Yes | 17 (37) | 3 (20) | 4 (40) | 10 (45.5) | |
No | 29 (63) | 11 (80) | 6 (60) | 12 (54.5) | |
Initial therapy a | 0.32 (NS) | ||||
Surgery with or without induction CT | 8 (10.1) | 3 (9.7) | 1 (5.9) | 4 (12.5) | |
Exclusive (chemo) RT | 15 (19) | 4 (12.9) | 2 (11.8) | 9 (28.1) | |
Surgery followed by (chemo) radiation | 54 (68.4) | 21 (70.9) | 14 (82.3) | 19 (59.4) | |
None | 2 (2.5) | 2 (6.5) | 0 | 0 | |
HPV copy number | 0.06 (NS) | ||||
Low (<9) | 36 (45) | 12 (40.5) | 12 (70.6) | 12 (37.5) | |
High (≥9) | 44 (55) | 19 (59.5) | 5 (29.4) | 20 (62.5) |
Data available for 79 patients.
Data available for 76 patients.
Data available for 75 patients.
Data available for 57 patients.
Data available for 40.
Data available for 42.
Data available for 46.
Chi‐square test P (with Yates correction if appropriate) values for comparison of the EPI group vs. 2J group vs. the MJ group for each parameter; tumour stage UICC 8e classification CT: chemotherapy; RT: radiotherapy.
3.7. Integration sites
To validate suspected HPV integration into the human genome, we designed primers with one of them being derived from the human genome at the potential site of integration and the other against HPV sequences suspected of being near the site of integration within the HPV genome. The primers were designed 200–400 bp away from the detected integration sites based on the NGS reads.
Integration in the human genome was observed scattered on all chromosomes except on the small chromosome 21 as represented in Figure 2. We identified 267 HPV‐human chromosome junction sequences (Table S5) among the 49 tumours with HPV integration. We assessed the proportion of cancer‐related genes in the 267 HPV genome insertion sites (located ≤ 106 bp from the integration site). The protein coding genes represent 141/267 genes and are enriched in genes related to cancer (based on 685 OncoKB genes) as compared to the total number of genes in a human genome (based on 19 982 GENCODE protein coding genes).
In fact, OncoKB cancer related genes represent 11.3% (16/141) of the protein coding genes found near insertion sites, as compared to 3.4% of the entire human genome (685/19 982) (Chi‐square test, P = 1.06 × 10‐6).
The number of HPV integration sites per tumour corresponds to 16.8 (267/49) in our series of HNSCC, while it is 5.5 for anal [15] and cervical cancers [16]. All integration sites were unique at the nucleotide level. Patients had at least one intragenic integration in 77.6% (n = 38/49, 77.6%), whereas 22.4% of patients only harbored intergenic integrations (n = 11/49, 22.4%). HPV‐Human DNA recombinations within a single sample (≥3 junctions) were detected for 15/49 HPV integrated patients in our cohort. PCR and sequence analyses confirmed the 7 junctions observed in sample R14 (Data not shown).
Remarkably, we found four recurrent integration regions. The first chromosomal region with frequent integration (n = 4/49, 8.2%) was the 9p24.1 region. It contains two important immune checkpoint genes: PDL1, PDL2 and PLGRKT (Fig. 3). Most breakpoints happened into introns or in the 3’ or 5’UTR. One breakpoint was in the 6th exon of PDL1. This exon is located at the 3’ end and after the sequence coding for the functional domain of PDL1 protein. Three patients (n = 3/49, 6.1%) had integration near MYC and PVT1 in the 8q24.21 region (Fig. S3). MACROD2 integration was found in two patients (n = 2/49, 4%). Finally, two patients (n = 2/49, 4%) had integration in the KLF5/KLF12 containing region.
3.8. Signatures’ association with survival, copy number or integration region
No statistical difference was observed in terms of DFS (Fig. S4A–C) or OS (Fig. S4D–F) in patients with episomal, integrated and mixed HPV statuses. These results were confirmed in the oropharynx patients’ subpopulation (Fig. S4G–J). Additionally, no association was found between HPV genomic signatures and OS (P = 0.65) or DFS (P = 0.87) (Fig. S5A and C). There was a trend towards an association between HPV copy number and survival: patients with a high HPV copy number had longer OS (P = 0.08) and DFS (P = 0.07) (Fig. S5B and D). No association was found between survival and the recurrent integration hotspots (data not shown).
3.9. mRNA expression of HPV targeted genes
Total RNA was available for three (R295, R299 and R650) of the four patients with HPV integration in the PDL1/PDL2/PLGRKT region, and for two (R654 and R661) of the three patients with HPV integration in the MYC/PVT1 region. Figure 4 represents the targeted gene expressions of the 44 HNSCC patients with available RNA. One patient (R295) among the three patient with integration in the PDL1, PDL2 and PLGRKT containing region showed overexpression of PDL1 (but not of PDL2 and PLGRKT) (Fig. 4A). R295 is the only tumour with HPV integration in the promoter region of PDL1 (Fig. 3 and Fig. 4A). MYC (but also PVT1 at a lower level) is overexpressed in the two patients with HPV integration in the MYC/PVT1 region, as compared to the expression in the 42 other HNSCC tumours (Fig. 4B).
3.10. Relationship between HPV genomic signatures and mRNA expression of immune‐related genes
We tested the possible association between HPV genomic signatures and the expression of various immune‐related genes in our cohort of 44 HNSCC patients with available RNA (Table S6). We observed a significant association between HPV genomic signature and the fibroblast marker PDGFRß (P = 0.04). There were trends towards an association between 2J signature and higher expression (near two fold as compared to the two other groups) of several genes CD20 (marker of B lymphocytes), CD163 (marker of M2 macrophages), ITGAM (marker of myeloid‐derived suppressor cells), as well as IL8 (neutrophils chemoattractant cytokine), OX40L (a major immune checkpoint), VIM (marker of EMT) and APOBEC3H but not APOBEC3A and APOBEC3B. Of note, there was no association between HPV genomic signature and well‐known markers of cell proliferation such as MKI6 or CCND1.
4. Discussion
In the present study, we used the robust and sensitive double capture method, followed by NGS on 80 patients with HPV positive HNSCC. HPV16 was the main detected genotype. No HPV co‐infection was noted. We did not find any HPV18 infection. We confirmed the five previously described mechanistic HPV signatures with distribution close to the one observed in ASCC but different from the one described in CC. No correlation was established between HPV genomic signature s and known clinico‐biologic prognostic factors. A low HPV copy number tended to be associated with worse prognostic and with the 2J signature. We identified 267 HPV‐human chromosome junction sequences scattered on all chromosomes with four integration hotspot regions (PDL1 region, MYC and PVT1 region, MACROD2 region and in KLF5/KLF12 region). PDL1 and MYC were overexpressed upon integration.
The HPV16 predominance and the absence of HPV18 could be explained by the high proportion of oropharyngeal tumour in our samples. HPV18 is known to be rare in HPV‐positive oropharyngeal SCCs compared with other sites (34% of oral and 17% of laryngeal SCCs) [6]. In most cases, the HPV genomic signature was EPI then it was MJ‐SC, followed by MJ‐CL, 2J‐COL and other 2J‐UN. The distribution of HPV signatures was statistically consistent with the one observed in ASCC but very different from the one described in CC [15, 16]. HPV remained episomal in 38.8% of patients with HNSCC, 45% in ASCC, whereas only 12% in CC. One explanation might be HPV genotypes repartition between cancer types. HPV16 is the major genotype in ASCC and HNSCC, and this particular genotype displays more EPI signature. Holmes et al. [14] reported HPV18 to display more integration whereas HPV16 genotypes exhibited more EPI signature. The second explanation could be that HPV integration plays a bigger role in the carcinogenesis of CC than in HNSCC and ASCC [16]. The integration rate increases with the degree of transformation from normal epithelium to cervical intraepithelial neoplasia to invasive cancers with an integration rate up to 90% [17] (p). Further studies using long read sequencing such as Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) are needed to (a) assess the mechanistic of HPV insertion and junction‐related signatures and (b) to differentiate between hybrid human–virus ecDNA and chromosomal HPV DNA [24, 29] in our 61.2% integrated/mixed HPV patients.
HPV genomic signatures seem to have no impact on outcome or response to treatment. Previous studies found that HNSCC harbouring HPV integration had worse survival than those with HPV remaining in an episomic state [18]. We did not confirm this result, possibly due to our small sample size. In ASCC and CC, HPV copy number (also called “viral load” in other articles) was an independent prognostic factor [15, 16]. In our HNSCC cohort, patients with a high HPV copy number had a better prognosis. The small size of our cohort did not allow reaching statistical significance (P = 0.07). A recent study on HPV‐positive oropharyngeal tumours used digital droplet PCR to assess the copy number and confirmed that high viral copy number is associated with better clinical outcomes [39]. A possible explanation is that carcinogenesis is primarily virus driven in samples with high copy number, whereas cancers with low copy number are driven by other oncogenic events. We found a trend to significance for an association between HPV genomic signatures and HPV copy number (P = 0.06). The association is statistically significant between 2J signature and low HPV copy number when compared to the “not 2J” population (P = 0.020).
Two patients with matched samples (primary tumour and synchronous lymph nodes) displayed the same signatures at both sites. This finding supports the data on HPV clonal evolution during HPV induced carcinogenesis [40]. In the MJ‐SC patient, we found the four same breakpoints in the primary tumour and cervical nodes. However, four additional breakpoints were only present in the primary tumour sample. We explain these results by the difference of sequencing coverage between the two samples (5‐fold variation in the number of reads).
Conversely to cervical cancer, few studies documented in the literature report HPV integration sites in HNSCC. We observed 267 HPV‐human chromosome junction sequences scattered on human chromosomes. We identified four chromosomal regions with recurrent HPV integration. The first hotspot integration region, observed in 8.2% of our patients, is the 9p24.1 chromosomal region. It includes three genes: PDL1, PDL2 and PLGRKT. PDL1 and PDL2 encode for the two ligands for the PD‐1 immune inhibiting checkpoint, which plays a role in the negative regulation of the adaptive immune response. PD‐L1 or PD‐L2 are upregulated in many human tumours, including HNSCCs [41]. High level of PD‐L1 favors immune escape. We observed PDL1 (CD274) amplification (Fig. S6A) and overexpression upon integration. The patient with the highest PDL1 expression has a breakpoint in the promotor region leading to gene amplification and overexpression. HPV integration in this hotspot region is described in other studies, also resulting in an overexpression of PDL1 [18]. Immune checkpoint inhibitors targeting the PD1‐PDL1 axis are approved for the treatment of recurrent and/or metastatic HNSCC [3, 4, 5]. Exploratory analyses of Checkmate‐141 (nivolumab) and Keynote‐012 (pembolizumab) clinical trials, suggest a greater benefit of immune checkpoints inhibitors in patients with PDL1 status ≥1% as well as in p16 positive patients [3, 42]. Further studies are needed to verify if HPV disruption of PDL1 or PDL2 is a predictive marker of response to PD1‐PDL1 targeting therapy in HNSCC.
The second HPV integration hotspot is the region containing MYC and PVT1, 6.1% of patients had integration in the region. MYC is a well‐known oncogene that code for a transcription factor regulating the expression of 15% of human genes [16]. PVT1 is a long non‐coding RNA (lncRNA) showing aberrant expression in multiple human cancer types. PVT1 is implicated in the malignant progression of HNSCC and represents a potential biomarker and therapeutic target in HNSCC [43] (p1). It is known to interact with its neighbor MYC, resulting in MYC upregulation [44]. Several studies aiming to discover viral integration sites in the genome of host cells have demonstrated frequent integrations in the MYC gene in CC [14, 17, 25]. In our study, the patients with integration in the MYC region display the highest MYC expression, which may be explained by DNA focal gain (Fig. S6B). New therapeutic strategies are emerging in malignancies displaying MYC activation, for instance, with BET bromodomain inhibition [45].
The third HPV integration hotspot is MACROD2, with two patients displaying intragenic HPV integration. The protein encoded by this gene is a deacetylase involved in removing ADP‐ribose from mono‐ADP‐ribosylated proteins; it has a key role in DNA repair. Our team previously identified MACROD2 to be a hotspot of HPV integration in CC [16]. MACROD2 loss has already been described in colorectal and hepatocellular carcinogenesis [46, 47]. MACROD2 is a caretaker tumour suppressor gene. Deletions alter DNA repair and sensitivity to DNA damage thought impaired PARP1 activity resulting in chromosome instability [47, 48].
The last recurrent HPV integration hotspot was the chromosomal region 13q22.1 where are located KLF5 and KLF12 genes, identified in two patients. These genes along with MYC are already described as HPV integration hotspot in cervical cancer [14, 25]. We observed an additional tumour with integration in KLF14 located in 7q32.3, one in KLF4 located in 9q31.2, and one in KLF6 located in 10p15.1 (Table S5). The Krüppel Like Factors (KLF) refers to a family of seventeen members of transcription factors with key functions in many cellular processes: proliferation, differentiation, migration, inflammation and pluripotency [49]. Deregulation of the KLF has been shown in HNSCC [50].
Several studies suggested HPV modulation of local immunity in various cancers [20]; therefore, we performed gene expression analysis on key immune related genes. We observed a significant association between HPV genomic signature and the fibroblast marker PDGFRß. No other genes were associated with HPV genomic signatures. This might be explained by our small sample size and would require a more dedicated study, that in addition was based on selected genes, which are not fully specific of each immune cell subtypes.
This work provides new insights into the role of HPV integration in HNSCC and opens new avenues regarding the biological interplay between the virus and HNSCC immune microenvironment. Our study has several limitations: First of all, the small sample size might have hindered reaching statistical significance when correlating to clinical outcome or when assessing the relationship between HPV genomic signatures and immune‐related gene expressions analyses. A second limitation is the long inclusion period (1997–2017) when cancer treatments have consistently evolved over the past twenty years. Dedicated functional analyses are needed to better evaluate HPV integration consequences on immune microenvironment. Overall, prospective clinical studies on HPV‐positive HNSCC patients need to be conducted to demonstrate the antitumour efficacy of targeting agents in light of the molecular alterations identified as companion biomarkers (PDL1 and MYC) in the present study.
5. Conclusion
For many years, HPV oncogenic potential was only attributed to the viral oncoproteins E6 and E7, but recent studies highlights that HPV integration is an oncogenic event per se. Here, we described recurrent HPV integration in four chromosomal regions (the PDL1 region, MYC and PVT1 region, MACROD2 region and KLF5 and KLF12 region). PDL1 and MYC genes were overexpressed upon HPV integration. Our hotspot genes have a direct role in carcinogenesis with different mechanisms: immunomodulation, loss of tumor suppressor, activation of oncogenes and upregulation of transcription factors. HPV induced HNSCC carcinogenesis is more complex than first described. It relies on three actors: the virus, the host and the immune system.
Conflict of interest
All other authors report no conflict of interest.
Author contributions
Conception and design: IB, CLT and, MK. Development of methodology: IB, CLT, NS, SL and AN. Acquisition of data: CC, JPD, MK, JM, AM, ES, PS, AS, SV, JK, SI, SL, JMP and ADV. Analysis and interpretation of data (e.g., statistical analysis, biostatistics and computational analysis): MD, JM, NS, SV, EG, AH and JMP. Writing, review and/or revision of the manuscript: JM, IB, CLT, MK, CD, LLC, CL and RR. Study supervision: IB and MK.
Peer review
The peer review history for this article is available at https://publons.com/publon/10.1002/1878‐0261.13219.
Supporting information
Acknowledgements
High‐throughput sequencing has been performed on the ICGex NGS platform of the Institut Curie supported by the grants ANR‐10‐EQPX‐03 (Equipex) and ANR‐10‐INBS‐09‐08 (France Génomique Consortium) from the Agence Nationale de la Recherche (“Investissements d’Avenir” program), by the Cancéropole Ile‐de‐France and by the SiRIC‐Curie program—SiRIC Grant «INCa‐DGOS‐4654». This work was supported by the Association pour la Recherche sur le Cancer (ARC) and the Association d’Aide à la Recherche en Cancérologie de Saint‐Cloud (ARCS).
Juliette Mainguené and Sophie Vacher contributed equally to this work and serve as co‐first authors.
Nicolas Servant, Christophe Le Tourneau, and Ivan Bièche contributed equally to this work and serve as co‐last authors.
Data availability statement
Nextflow‐based Virus Insertion Finder bioinformatics pipeline (v1.0.1) is available on free public access: https://github.com/bioinfo‐pf‐curie/nf‐VIF.
References
- 1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394–424. 10.3322/caac.21492 [DOI] [PubMed] [Google Scholar]
- 2. Grégoire V, Lefebvre J‐l, Licitra L, Felip E; EHNS‐ESMO‐ESTRO Guidelines Working Group . Squamous cell carcinoma of the head and neck: EHNS‐ESMO‐ESTRO clinical practice guidelines for diagnosis, treatment and follow‐up. Ann Oncol. 2010;21:v184–86. 10.1093/annonc/mdq185 [DOI] [PubMed] [Google Scholar]
- 3. Ferris RL, Blumenschein G, Fayette J, Guigay J, Colevas AD, Licitra L, et al. Nivolumab for recurrent squamous‐cell carcinoma of the head and neck. N Engl J Med. 2016;375:1856–67. 10.1056/NEJMoa1602252 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Cohen EEW, Soulières D, Le Tourneau C, Dinis J, Licitra L, Ahn M‐J, et al. Pembrolizumab versus methotrexate, docetaxel, or cetuximab for recurrent or metastatic head‐and‐neck squamous cell carcinoma (KEYNOTE‐040): a randomised, open‐label, phase 3 study. Lancet. 2019;393(10167):156–67. 10.1016/S0140-6736(18)31999-8 [DOI] [PubMed] [Google Scholar]
- 5. Burtness B, Harrington KJ, Greil R, Soulières D, Tahara M, de Castro G, et al. Pembrolizumab alone or with chemotherapy versus cetuximab with chemotherapy for recurrent or metastatic squamous cell carcinoma of the head and neck (KEYNOTE‐048): a randomised, open‐label, phase 3 study. Lancet. 2019;394(10212):1915–28. 10.1016/S0140-6736(19)32591-7 [DOI] [PubMed] [Google Scholar]
- 6. Kreimer AR, Clifford GM, Boyle P, Franceschi S. Human papillomavirus types in head and neck squamous cell carcinomas worldwide: a systematic review. Cancer Epidemiol Biomarkers Prev. 2005;14:467–75. 10.1158/1055-9965.EPI-04-0551 [DOI] [PubMed] [Google Scholar]
- 7. Menezes FDS, Fernandes GA, Antunes JLF, Villa LL, Toporcov TN. Global incidence trends in head and neck cancer for HPV‐related and ‐unrelated subsites: a systematic review of population‐based studies. Oral Oncol. 2021;115:105177. 10.1016/j.oraloncology.2020.105177 [DOI] [PubMed] [Google Scholar]
- 8. Otter S, Whitaker S, Chatterjee J, Stewart A. The human papillomavirus as a common pathogen in oropharyngeal, anal and cervical cancers. Clin Oncol. 2019;31:81–90. 10.1016/j.clon.2018.10.004 [DOI] [PubMed] [Google Scholar]
- 9. Harden ME, Munger K. Human papillomavirus molecular biology. Mutat Res Rev Mutat Res. 2017;772:3–12. 10.1016/j.mrrev.2016.07.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Pinatti LM, Walline HM, Carey TE. Human papillomavirus genome integration and head and neck cancer. J Dent Res. 2018;97:691–700. 10.1177/0022034517744213 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Choo KB, Pan CC, Han SH. Integration of human papillomavirus type 16 into cellular DNA of cervical carcinoma: preferential deletion of the E2 gene and invariable retention of the long control region and the E6/E7 open reading frames. Virology. 1987;161:259–61. 10.1016/0042-6822(87)90195-4 [DOI] [PubMed] [Google Scholar]
- 12. Cricca M, Venturoli S, Leo E, Costa S, Musiani M, Zerbini M. Disruption of HPV 16 E1 and E2 genes in precancerous cervical lesions. J Virol Methods. 2009;158(1‐2):180–3. 10.1016/j.jviromet.2009.01.005 [DOI] [PubMed] [Google Scholar]
- 13. Xue Y, Lim D, Zhi L, He P, Abastado JP, Thierry F. Loss of HPV16 E2 protein expression without disruption of the E2 ORF correlates with carcinogenic progression. Open Virol J. 2012;6:163–72. 10.2174/1874357901206010163 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Holmes A, Lameiras S, Jeannot E, Marie Y, Castera L, Sastre‐Garau X, et al. Mechanistic signatures of HPV insertions in cervical carcinomas. NPJ Genom Med. 2016;1. 10.1038/npjgenmed.2016.4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Morel A, Neuzillet C, Wack M, Lameiras S, Vacher S, Deloger M, et al. Mechanistic signatures of human papillomavirus insertions in anal squamous cell carcinomas. Cancers. 2019;11:1846. 10.3390/cancers11121846 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Kamal M, Lameiras S, Deloger M, Morel A, Vacher S, Lecerf C, et al. Human papilloma virus (HPV) integration signature in cervical cancer: identification of MACROD2 gene as HPV hot spot integration site. Br J Cancer. 2021;124:777–85. 10.1038/s41416-020-01153-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Hu Z, Zhu D, Wang W, Li W, Jia W, Zeng X, et al. Genome‐wide profiling of HPV integration in cervical cancer identifies clustered genomic hot spots and a potential microhomology‐mediated integration mechanism. Nat Genet. 2015;47:158–63. 10.1038/ng.3178 [DOI] [PubMed] [Google Scholar]
- 18. Koneva LA, Zhang Y, Virani S, Hall PB, McHugh JB, Chepeha DB, et al. HPV integration in HNSCC correlates with survival outcomes, immune response signatures, and candidate drivers. Mol Cancer Res. 2018;16:90–102. 10.1158/1541-7786.MCR-17-0153 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Pett M, Coleman N. Integration of high‐risk human papillomavirus: a key event in cervical carcinogenesis? J Pathol. 2007;212:356–67. 10.1002/path.2192 [DOI] [PubMed] [Google Scholar]
- 20. Barros MR, de Melo CML, Barros MLCMG, de Cássia Pereira de Lima R, de Freitas AC, Venuti A. Activities of stromal and immune cells in HPV‐related cancers. J Exp Clin Cancer Res. 2018;37:137. 10.1186/s13046-018-0802-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Parfenov M, Pedamallu CS, Gehlenborg N, Freeman SS, Danilova L, Bristow CA, et al. Characterization of HPV and host genome interactions in primary head and neck cancers. Proc Natl Acad Sci U S A. 2014;111:15544–9. 10.1073/pnas.1416074111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Deshpande V, Luebeck J, Nguyen N‐P, Bakhtiari M, Turner KM, Schwab R, et al. Exploring the landscape of focal amplifications in cancer using AmpliconArchitect. Nat Commun. 2019;10:392. 10.1038/s41467-018-08200-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Nulton TJ, Olex AL, Dozmorov M, Morgan IM, Windle B. Analysis of the cancer genome atlas sequencing data reveals novel properties of the human papillomavirus 16 genome in head and neck squamous cell carcinoma. Oncotarget. 2017;8:17684–99. 10.18632/oncotarget.15179 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Pang J, Nguyen N, Luebeck J, Ball L, Finegersh A, Ren S, et al. Extrachromosomal DNA in HPV‐mediated oropharyngeal cancer drives diverse oncogene transcription. Clin Cancer Res. 2021;27:6772–86. 10.1158/1078-0432.CCR-21-2484 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Zhang R, Shen C, Zhao L, Wang J, McCrae M, Chen X, et al. Dysregulation of host cellular genes targeted by human papillomavirus (HPV) integration contributes to HPV‐related cervical carcinogenesis. Int J Cancer. 2016;138:1163–74. 10.1002/ijc.29872 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Zhang Y, Koneva LA, Virani S, Arthur AE, Virani A, Hall PB, et al. Subtypes of HPV‐positive head and neck cancers are associated with HPV characteristics, copy number alterations, PIK3CA mutation, and pathway signatures. Clin Cancer Res. 2016;22:4735–45. 10.1158/1078-0432.CCR-16-0323 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Deng T, Feng Y, Zheng J, Huang Q, Liu J. Low initial human papillomavirus viral load may indicate worse prognosis in patients with cervical carcinoma treated with surgery. J Gynecol Oncol. 2015;26:111–7. 10.3802/jgo.2015.26.2.111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Lei J, Ploner A, Lagheden C, Eklund C, Nordqvist Kleppe S, Andrae B, et al. High‐risk human papillomavirus status and prognosis in invasive cervical cancer: a nationwide cohort study. PLoS Medicine. 2018;15:e1002666. 10.1371/journal.pmed.1002666 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Nulton TJ, Kim NK, DiNardo LJ, Morgan IM, Windle B. Patients with integrated HPV16 in head and neck cancer show poor survival. Oral Oncol. 2018;80:52–5. 10.1016/j.oraloncology.2018.03.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Veitía D, Liuzzi J, Ávila M, Rodriguez I, Toro F, Correnti M. Association of viral load and physical status of HPV‐16 with survival of patients with head and neck cancer. Ecancermedicalscience. 2020;14:1082. 10.3332/ecancer.2020.1082 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Anayannis NV, Schlecht NF, Ben‐Dayan M, Smith RV, Belbin TJ, Ow TJ, et al. Association of an intact E2 gene with higher HPV viral load, higher viral oncogene expression, and improved clinical outcome in HPV16 positive head and neck squamous cell carcinoma. PLoS One. 2018;13:e0191581. 10.1371/journal.pone.0191581 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Ferrari A, Vincent‐Salomon A, Pivot X, Sertier A‐S, Thomas E, Tonon L, et al. A whole‐genome sequence and transcriptome perspective on HER2‐positive breast cancers. Nat Commun. 2016;7:12222. 10.1038/ncomms12222 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Cacheux W, Rouleau E, Briaux A, Tsantoulis P, Mariani P, Richard‐Molard M, et al. Mutational analysis of anal cancers demonstrates frequent PIK3CA mutations associated with poor outcome after salvage abdominoperineal resection. Br J Cancer. 2016;114:1387–94. 10.1038/bjc.2016.144 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Lombard I, Vincent‐Salomon A, Validire P, Zafrani B, de la Rochefordière A, Clough K, et al. Human papillomavirus genotype as a major determinant of the course of cervical cancer. J Clin Oncol. 1998;16:2613–9. 10.1200/JCO.1998.16.8.2613 [DOI] [PubMed] [Google Scholar]
- 35. Lawrence M, Huber W, Pagès H, Aboyoun P, Carlson M, Gentleman R, et al. Software for computing and annotating genomic ranges. PLoS Comput Biol. 2013;9:e1003118. 10.1371/journal.pcbi.1003118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Budczies J, Klauschen F, Sinn BV, Győrffy B, Schmitt WD, Darb‐Esfahani S, et al. Cutoff Finder: a comprehensive and straightforward web application enabling rapid biomarker cutoff optimization. PLoS One. 2012;7:e51862. 10.1371/journal.pone.0051862 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Bièche I, Onody P, Tozlu S, Driouch K, Vidaud M, Lidereau R. Prognostic value of ERBB family mRNA expression in breast carcinomas. Int J Cancer. 2003;106:758–65. 10.1002/ijc.11273 [DOI] [PubMed] [Google Scholar]
- 38. Rosenbloom KR, Armstrong J, Barber GP, Casper J, Clawson H, Diekhans M, et al. The UCSC genome browser database: 2015 update. Nucleic Acids Res. 2015;43(D1):D670–81. 10.1093/nar/gku1177 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Stevenson A, Wakeham K, Pan J, Kavanagh K, Millan D, Bell S, et al. Droplet digital PCR quantification suggests that higher viral load correlates with improved survival in HPV‐positive oropharyngeal tumours. J Clin Virol. 2020;129:104505. 10.1016/j.jcv.2020.104505 [DOI] [PubMed] [Google Scholar]
- 40. Chae J, Park WS, Kim MJ, Jang SS, Hong D, Ryu J, et al. Genomic characterization of clonal evolution during oropharyngeal carcinogenesis driven by human papillomavirus 16. BMB Rep. 2018;51:584–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Pai SI, Zandberg DP, Strome SE. The role of antagonists of the PD‐1:PD‐L1/PD‐L2 axis in head and neck cancer treatment. Oral Oncol. 2016;61:152–8. 10.1016/j.oraloncology.2016.08.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Chow LQM, Haddad R, Gupta S, Mahipal A, Mehra R, Tahara M, et al. Antitumor activity of pembrolizumab in biomarker‐unselected patients with recurrent and/or metastatic head and neck squamous cell carcinoma: results from the phase Ib KEYNOTE‐012 expansion cohort. J Clin Oncol. 2016;34:3838–45. 10.1200/JCO.2016.68.1478 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Yu C, Wang Y, Li G, She LI, Zhang D, Chen X, et al. LncRNA PVT1 promotes malignant progression in squamous cell carcinoma of the head and neck. J Cancer. 2018;9:3593–602. 10.7150/jca.26465 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Jin KE, Wang S, Zhang Y, Xia M, Mo Y, Li X, et al. Long non‐coding RNA PVT1 interacts with MYC and its downstream molecules to synergistically promote tumorigenesis. Cell Mol Life Sci. 2019;76:4275–89. 10.1007/s00018-019-03222-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Delmore JE, Issa GC, Lemieux ME, Rahl PB, Shi J, Jacobs HM, et al. BET bromodomain inhibition as a therapeutic strategy to target c‐Myc. Cell. 2011;146:904–17. 10.1016/j.cell.2011.08.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Zhou Z‐J, Luo C‐B, Xin H‐Y, Hu Z‐Q, Zhu G‐Q, Li J, et al. MACROD2 deficiency promotes hepatocellular carcinoma growth and metastasis by activating GSK‐3β/β‐catenin signaling. NPJ Genom Med. 2020;5:15. 10.1038/s41525-020-0122-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Sakthianandeswaren A, Parsons MJ, Mouradov D, Sieber OM. MACROD2 deletions cause impaired PARP1 activity and chromosome instability in colorectal cancer. Oncotarget. 2018;9:33056–8. 10.18632/oncotarget.25887 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Jin N, Burkard ME. MACROD2, an original cause of CIN? Cancer Discov. 2018;8:921–3. 10.1158/2159-8290.CD-18-0674 [DOI] [PubMed] [Google Scholar]
- 49. McConnell BB, Yang VW. Mammalian Krüppel‐like factors in health and diseases. Physiol Rev. 2010;90:1337–81. 10.1152/physrev.00058.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Tsompana M, Gluck C, Sethi I, Joshi I, Bard J, Nowak NJ, et al. Reactivation of super‐enhancers by KLF4 in human head and neck squamous cell carcinoma. Oncogene. 2020;39:262–77. 10.1038/s41388-019-0990-4 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Nextflow‐based Virus Insertion Finder bioinformatics pipeline (v1.0.1) is available on free public access: https://github.com/bioinfo‐pf‐curie/nf‐VIF.