Skip to main content
PLOS One logoLink to PLOS One
. 2020 Sep 28;15(9):e0238497. doi: 10.1371/journal.pone.0238497

Signatures of somatic mutations and gene expression from p16INK4A positive head and neck squamous cell carcinomas (HNSCC)

Nabil F Saba 1,*,#, Ashok R Dinasarapu 2,#, Kelly R Magliocca 1, Bhakti Dwivedi 1, Sandra Seby 3, Zhaohui S Qin 4, Mihir Patel 1, Christopher C Griffith 5, Xu Wang 1, Mark El-Deiry 1, Conor Ernst Steuer 1, Jeanne Kowalski 6, Dong Moon Shin 1, Michael E Zwick 2, Zhuo Georgia Chen 1,7,*
Editor: Hyunseok Kang8
PMCID: PMC7521680  PMID: 32986729

Abstract

Human papilloma virus (HPV) causes a subset of head and neck squamous cell carcinomas (HNSCC) of the oropharynx. We combined targeted DNA- and genome-wide RNA-sequencing to identify genetic variants and gene expression signatures respectively from patients with HNSCC including oropharyngeal squamous cell carcinomas (OPSCC). DNA and RNA were purified from 35- formalin fixed and paraffin embedded (FFPE) HNSCC tumor samples. Immuno-histochemical evaluation of tumors was performed to determine the expression levels of p16INK4A and classified tumor samples either p16+ or p16-. Using ClearSeq Comprehensive Cancer panel, we examined the distribution of somatic mutations. Somatic single-nucleotide variants (SNV) were called using GATK-Mutect2 (“tumor-only” mode) approach. Using RNA-seq, we identified a catalog of 1,044 and 8 genes as significantly expressed between p16+ and p16-, respectively at FDR 0.05 (5%) and 0.1 (10%). The clinicopathological characteristics of the patients including anatomical site, smoking and survival were analyzed when comparing p16+ and p16- tumors. The majority of tumors (65%) were p16+. Population sequence variant databases, including gnomAD, ExAC, COSMIC and dbSNP, were used to identify the mutational landscape of somatic sequence variants within sequenced genes. Hierarchical clustering of The Cancer Genome Atlas (TCGA) samples based on HPV-status was observed using differentially expressed genes. Using RNA-seq in parallel with targeted DNA-seq, we identified mutational and gene expression signatures characteristic of p16+ and p16- HNSCC. Our gene signatures are consistent with previously published data including TCGA and support the need to further explore the biologic relevance of these alterations in HNSCC.

Introduction

Recent reports have noted an increase in the incidence of head and neck squamous cell carcinomas (HNSCC), which account for approximately 3% of all cancers in the US (https://seer.cancer.gov/). These cancers are more than twice as common among men as among women. Tobacco and alcohol use are the major risk factors for HNSCC (~75%), while infection with high-risk types of human papilloma virus (HPV) accounts for the remainder of HNSCC [13]. HNSCC, particularly oropharyngeal squamous cell carcinomas (OPSCC), that are caused by infection with cancer causing types of HPV, especially HPV type 16, show distinct clinical, pathological, and molecular features compared with non-HPV related cancer [1,4]. OPSCC and non-OPSCC are two subsites with in HNSCC. Increased expression of p16INK4A, referred to hereafter as p16+, has been reported to strongly correlate with HPV infection in HNSCC; however, p16-positivity is not limited to HPV-positive tumors and therefore, is not a perfect surrogate for HPV positivity [5,6]. Previous studies have indicated that in HNSCC, HPV-positive patients have better overall survival cure rates than their HPV-negative counterparts [7].

Multiple studies have identified significant enrichment of somatic mutations in genes such as PIK3CA, TP53, CDKN2A, FGFR3, PTEN and RB1 in HPV-associated HNSCC [4,810]. Other distinctions between HPV-positive and HPV-negative cancers have been identified by combined analysis of somatic mutations and gene expression profiles [11,12]. Although the ClearSeq Comprehensive Cancer panel can identify driver mutations in the coding region of 151 cancer genes, it may fail to detect structural changes, such as gene fusions, that may be therapeutically relevant. The possibility of therapeutically actionable gene fusions driving HNSCC has not been fully explored. We therefore proposed that a combination of RNA-Seq and targeted DNA-seq to identify a high yield diagnostic tailored for HNSCC. In addition, we aimed to correlate our methods with those used in other published studies in HNSCC. A prior proof-of-principle study using a small sample size confirmed the feasibility of using FFPE samples in HNSCC to that end [13]. Here, we report an analysis of genetic alterations in HNSCC arising in diverse anatomical sites by use of targeted DNA-seq and RNA-seq of FFPE tumor samples mainly from patients with OPSCC.

Results

Somatic mutation analysis and confirmation of recurrent gene mutations among HNSCC individuals

We conducted targeted sequencing of 151 disease-associated genes that have been implicated in studies of a wide range of cancers in 22 p16+ (HPV+), 4 p16- (HPV-), and one p16-unknown (HPV unknown status) tumor samples using Agilent’s ClearSeq Comprehensive Cancer panel. Clinical and demographic information are provided in Table 1 and S1 Table. A summary of somatic mutations in p16+ and p16- tumor samples is reported in Figs 1 and 2, and S2 Table. We restricted our analysis to predicted protein coding gene regions and excluded synonymous and noncoding variants from our analysis (S3 Table). We observed C>T substitutions were increased in p16+ as compared to p16- cancers (Fig 2; [4]). The most common mutations were in PIK3CA, KMT2A, and PTEN. We also identified that PIK3CA mutations were localized to E542K, E545K, and H1047L hotspots known to promote activation, with the remaining mutations of uncertain function (Fig 1C and 1D; [14]). Many canonical pathways known to be involved in oncogenic signaling were mutated, including RTK-RAS and PI3K ([15]; Fig 3). Further, the “druggability” of the 25 genes specifically mutated in the RTK-RAS and PI3K pathways was searched using the Drug Gene Interaction database (DGIdb) [16] and the majority of the genes were found to be in druggable categories (S1 Fig). Among these were 5 genes for which “clinically actionable” compounds are available including the EGFR and PIK3CA related genes.

Table 1. Demographic and clinical characteristics of study population.

  p16+ p16- p16 unknown
N (% sample) 23 (66) 5 (14) 7 (20)
Gender
Male 22 2 4
Female 1 3 3
Age
Range 41–72 50–71 29–74
Mean 58.3 62.4 58.9
Tobacco
Current 3 2 1
Former 11 2 5
Never 8 1 1
Other (Cannabis) 1 0 0
Therapy
Radiation (Yes; No; Unknown) 18; 4; 1 1; 4; 0 4; 3; 0
Chemo (Yes; No; Unknown) 18; 4; 1 3; 2; 0 3; 4; 0
Anatomy
Tongue 0 1 5
FOM 0 0 1
BOT 11 3 0
Tonsil 10 1 0
Larynx 0 0 1
Unknown 2 0 0

BOT–Base of Tongue; FOM–Floor of Mouth; p16+—Over expression of p16INK4A.

Fig 1. Summary of somatic mutations identified.

Fig 1

A) Barplots showing numbers of mutations detected in the HNSCC cohort using GATK tumor-only method after each filtering step (see Methods section). MAFtools was used to generate the list of top mutated genes from 27 subjects, including p16+, p16- and p16-unknown status. B) Top 15 mutated genes. C) PIK3CA mutations identified in p16+ samples. Schematic of protein domains, displaying sites of mutations identified in the most frequently mutated gene PIK3CA in p16+ tumor samples.

Fig 2. Summary of SNV identified.

Fig 2

A) Individual number of SNVs identified from all 3 classes of HNSCC samples. For example, 72, 3 and 1 C>T substitutions were identified from 22 p16+, 4 p16- and 1 p16-unknown samples, respectively B) Percent contribution of individual SNVs from all 3 classes of HNSCC samples.

Fig 3. Enrichment of known oncogenic pathways based on 26 HNSCC samples including p16+ (N = 22) and p16- (N = 4) groups.

Fig 3

Oncogenic signaling pathways are derived from TCGA cohorts [17]. A-B) Mutation affected pathways C) 25 mutated genes of the RTK-RAS and PI3K signaling pathways (22 p16+ and 4 p16-). See S1 Fig for their known/reported drug-gene interactions and druggable categories. All gene-drug interactions and drug claims are compiled from the Drug Gene Interaction Database [16].

Gene fusion detection

In HNSCC, TCGA provided the first report on gene fusions [14]. Among these fusion events, a known gene fusion, FGFR3-TACC3, was identified in two p16+ tumors i.e GHN-66 and GHN-80 (S4 Table). Other studies also showed that gene fusions are associated with significant upregulation of genes including EGFR [18]. However, we did not see the upregulation of EGFR gene expression.

Differential expression of genes among HNSCC individuals

We conducted differential expression analysis between p16+ (N = 12) and p16- (N = 5), or p16+ (N = 12) and p16-unknown (N = 7) groups and created a catalog of gene expression alterations (Fig 4A–4C). The number of differentially expressed (DE) genes in p16+ vs p16- comparison was 1,044 and 8, respectively at FDR < 0.05 and 0.01 (total of 16,178 genes assayed). Among a total of 16,253 tested genes, the number of differentially expressed (DE) genes in p16+ vs p16-unknown group comparison was 574 and 58, respectively at FDR < 0.05 and 0.01. To confirm the findings that we obtained using our dataset, we used the TCGA dataset for validation. Specifically, we used comparable tissues of origin including the base of tongue (BOT), tonsil etc (S5 Table). The DEGs identified in our analyses were used to create a heatmap with TCGA expression data (Fig 4D). With this abundance of gene expression data, we have identified a misclassified TCGA sample. The sample TCGA-CV-5971 was identified by TCGA as HPV+ but our gene expression data show that its expression is similar to that of the HPV- group (Fig 4D & S6 Table). Canonical pathway analysis on the differentially expressed genes (p16+ vs p16-) using ingenuity pathway analysis (IPA) revealed most significant enrichment in pathways related to cell cycle (Fig 5).

Fig 4. Differentially Expressed Genes (DEGs) from a cohort of HNSCC.

Fig 4

A) Number of DEGs in p16+ vs p16- and p16+ vs p16-unknown comparisons B-C) Heatmaps of 8 and 58 DEGs with FDR<0.01, respectively, between p16+ vs p16- and p16+ vs p16-unknown D) Heatmap generated based on TCGA [14] HNSCC data confirming the gene expression consistency. Yellow indicates reduced expression while red indicates increased expression.

Fig 5. Significantly affected pathways (p < = 0.05) based on Ingenuity Pathway Analysis (IPA).

Fig 5

p16 expression and survival

Survival analyses were performed using the Kaplan-Meier method with the log-rank test for statistical significance. From the curves, it is evident that the patients, who have negative (or unknown) status for the p16, have more death rate as compared to the patients, who are positive for p16. For all 3 groups, the rate of decrease in survival rate is fairly constant (Fig 6). Moreover, a clear association of Kaplan-Meier survival curve of p16-based molecular subtype p16-negative with p16-unknown group supports clustering of RNA-seq based gene expression profiles (S2 Fig). As the p16 unknown group consisted of non-oropharyngeal tumors, these results were not surprising as the significance of p16 status in non-oropharyngeal primaries is unclear.

Fig 6. Kaplan-Meier curves of the molecular subtypes of HNSCC cohort based on p16 status.

Fig 6

Discussion

Using RNA-seq data in parallel with targeted sequencing of a panel of 151 genes, we demonstrate that gene expression data from FFPE samples can identify gene signatures characteristic of p16 + versus p16- OPSCC which is a subsite of HNSCC. These signatures need to be further explored for their biologic relevance in OPSCC. One caveat is the relatively limited size of the p16-negative group, which poses challenges in achieving accurate estimates of biological variability and statistical robustness in data analysis. Although matched tumor-normal analysis is preferred due to higher precision, we demonstrate that mutation detection without matched normal samples is possible for certain applications. We were also able to confirm the reliability of using routine FFPE specimens to accurately identify possible targeted pathways that were confirmed relevant in p16-positive versus p16-negative tumors in prior reports, and that could have wider future applicability and inform clinical applications in HNSCC.

Materials and methods

Research subjects

The study was approved by Emory University Institutional Review Board (IRB). Study participants included individuals with a pathologically documented diagnosis of HNSCC/OPSCC and who had undergone surgical resection of their primary disease or had adequate core biopsies of their primary tumors. A summary of clinical data of the study participants is shown in Table 1 and S1 Table. There were 35 samples total; 27 were used for DNA-Seq and 26 for RNA-Seq with 16 samples overlapping. All tissue samples were collected from subjects who gave written and signed informed consent using the Head and Neck Cancer Winship-Emory University IRB-approved consent form for tissue collection. Tissue samples were collected and stored at the Emory University Winship Cancer Institute, in Atlanta, Georgia. Clinical data collected included gender, smoking history, radiation treatment status, chemotherapy treatment status and stage (AJCC 7). A total of 35 formalin-fixed paraffin embedded (FFPE) OPSCC tumor specimens, 10 of which were oral cavity specimens were included in the study. As part of the routine pathology diagnosis of OPSCC, the tumors were evaluated for p16 immunoreactivity. Some non-oropharyngeal tumors had exhausted tissue and could not be checked for p16 (p16 unknown). The FFPE tumor blocks were de-identified according to the IRB-approved protocol. 5μM serial sections of FFPE samples were obtained from each block for DNA and RNA isolation.

p16 immunohistochemical (IHC) staining

HPV infection status of OPSCC samples was identified using p16 expression in the tumor cells. In brief, p16 ink4a immunohistochemical staining was performed in a standard manner per supplier's instructions (CINtec 9517, MTM Laboratories, Westborough, MA) on FFPE tissue sections using the automated, open system immunostainer (DAKO AutoStainer Link 48, Copenhagen, Denmark). The slides were processed using the DAB reagent to visualize the antibody-antigen complex, then counterstained with hematoxylin and subsequently washed and cover slipped. Both positive and negative control slides were prepared. The proportion of tumor cells demonstrating nuclear and cytoplasmic p16 staining were categorized dichotomously as either p16INK4A-positive (> 70% tumor cells exhibiting strong and diffuse nuclear and cytoplasmic staining) p16INK4A-negative (<70% tumor cells exhibiting strong and diffuse nuclear and cytoplasmic The IHC staining for expressed p16 was performed on five-micron sections of FFPE tissue sections. A pathologist (K.M) verified clinical diagnosis and HPV status of each specimen.

DNA and RNA extraction

Genomic DNA and total RNA were isolated using Omega BioTek chemistries according to the manufacturer’s protocol from 5 μM sections for each isolate. DNA was quantitated using NanoDrop and Qubit, and RNA was quantitated using NanoDrop and Agilent BioAnalyzer.

RNA-seq library preparation

The RNA from FFPE tumor samples was used to prepare libraries for RNA sequencing. 200 ng of genomic DNA from each specimen was used to generate uniquely barcoded sequencing libraries. Briefly, libraries were prepared using the Ion DNA Barcoding and Ion Xpress Template kits (Life Technologies, Grand Island, NY, USA) according to the manufacturer’s protocols. Fragment sizes were assessed on an Agilent Bioanalyzer 2100 using the High Sensitivity kit (Agilent, Santa Clara, CA, USA). All 8 uniquely barcoded specimen libraries were run on a single 318 chip and sequenced using an Ion PGM 200 sequencing kit.

Next-generation comprehensive cancer profiling

Agilent’s targeted sequencing ClearSeq Comprehensive Cancer Panel was used. We extracted DNA from selected samples (N = 27) and sequenced each sample for 151 disease-associated genes that have been implicated in studies of a wide range of cancers. All coding exons, exon-intron boundaries and selected introns of these genes are targeted. Libraries were paired-end sequenced using the Illumina MiSeq platform with the 75bp paired-end read mode. On average 10 million reads per sample were generated.

Short-read mapping, and somatic variant calling

Reads were trimmed for adaptors and paired-end mapped to the reference human genome (hg19) using BWA-MEM algorithm (version 0.7.12) from GenAligners v3.0 (SureCall 4.0, analysis software from Agilent Technologies). For the detection of somatic single-nucleotide polymorphism (SNP) and insertion and deletion (indel), we used Mutect2 (GATK v4) on tumor samples (“tumor-only” mode) on each sample. Resulting BAM files were sorted using Samtools v1.3 and PCR duplicates were marked using Picard v2.6.0. Realignment was performed following the Genome Analysis Toolkit (GATK) best practices. A base quality recalibration table was generated using GATK BaseRecalibrator with one or more databases of known polymorphic sites (dbSNP138 (hg19) and HAPMAP 3.3 (hg19) from the GATK resource bundle). The appropriate liftover chain file from GATK resource bundle also downloaded (b37tohg19.chain) to convert the genomic coordinates from b17 to hg19 build. To restrict a subset of genomic regions in variant calling while using GATK tools, we have provided a.bed file of exome intervals obtained from Agilent website. VCF containing population allele frequencies (AF > 0.05) of common germline variants from ExAC and an exome intervals (.bed) file were provided to GATK GetPileupSummaries to get a summary of read counts from recalibrated tumor BAM that support a set number of known variant sites. This summary of read counts was used to calculate cross-sample contamination. To call somatic variants from tumor (recalibrated) BAM via local assembly of haplotypes (using Mutect2), a VCF containing population allele frequencies of common and rare alleles from gnomAD to avoid calling any germline variants and a.bed file of exome intervals were used. The Mutect2 called variants were then filtered based on the contamination estimate using GATK FilterMutectCalls and only passed somatic variants were included in the further analysis. Calls that are likely true positives get the PASS label in the FILTER field. This step seemingly applies 14 filters, including contamination.

Comparison of mutations with COSMIC database

To determine if the called variants have been previously detected, we have annotated the derived list of somatic variants using Catalog of Somatic Mutations in Cancer (COSMIC) version 87 [19] (hg19, Coding and Non Coding vcf files combined) with ANNOVAR tool [20]. Mutations not detected by the above methods were considered novel (S3 Table).

Analysis of significantly mutated pathways

We examined the distribution of mutations in known oncogenic signaling pathways derived from TCGA cohorts] 17]. The mutation profiles in p16+ and p16- were shown with the R package “MAFtools” [21]. We also used MAFtools to calculate the mutation rate of each gene.

Data pre-processing and alignment of sequenced reads

Quality assessment of raw FASTQ reads was performed using the FASTQC program. Paired end RNA-Seq samples were mapped to the human genome reference assembly (hg38) with STAR 2.4.2a. Transcript expressions as counts were estimated with HTSeq. Count data were subsequently normalized using TMM (weighted trimmed mean of M-values) with the EdgeR package [22], and converted to counts per million (CPM) and log2-transformed. A filtering process was also performed to exclude the genes without at least 10 counts in 33% of the samples. For gene-fusion detection, we use STAR-Fusion (https://github.com/STAR-Fusion/STAR-Fusion66). It is a method that accurately identifies fusion transcripts from RNA-seq data and outputs all supporting data discovered during alignment.

Data visualization and clustering analysis

The similarity of the relative gene transcript abundances (using log2-transformed values of CPM) for each of the samples was compared using an unsupervised hierarchical clustering and heatmap analysis in R. Unsupervised hierarchical clustering of the differentially expressed genes was performed using Pearson correlation distance and average clustering.

Differential gene expression analysis

To assess the significance in the difference between p16-positive OPSCC samples and p16-negative OPSCC samples in terms of gene expression, we used the two-sample t-test. Transcripts were considered to be differentially expressed if their FDR < 0.05. Volcano plots of -log10(p-value) vs. log2 (CPM) fold-change were made to examine these associations in each tissue pair within each individual.

Analysis of significantly enriched pathways

WEB-based Ingenuity Pathway Analysis (IPA) (QIAGEN) was used for pathway analysis to identify pathways that were enriched in all significant gene lists by each of the HPV pairs (p16-positive vs. p16-negative). Only genes that were differentially expressed in sample comparisons (significance at p-value ≤ 0.05) were included in the analysis. Pathways were considered to be significant if the pathway’s p-value of enrichment was ≤0.01.

The Cancer Genome Atlas (TCGA)

TCGA RNA-seq data including 49 HPV-negative and 18 HPV-positive tissue samples in the form of raw gene count (disease =“HNSC” and data.type =“RNASeq2”) was downloaded using TCGA2STAT package for R [23] and used to find overlap between TCGA gene expression and our HNSCC data. Both expression data and clinical data were available for 67 HNSCC tumors (from different locations: oral tongue, BOT and tonsil). We used the dataset abbreviations as defined by the TCGA consortium, a public resource that catalogues clinical data and molecular characterizations of many cancer types (as defined in https://tcga-data.nci.nih.gov/docs/publications/tcga/).

In order to validate the differentially expressed gene signature identified using our OPSCC patient samples, we queried transcriptome data for HNSCC downloaded from the TCGA cancer program. Only the patient tumor samples with available clinical and RNA-seq gene expression data were obtained. As our study focuses only on oropharyngeal samples, patient samples obtained from anatomic sites—“Base of Tongue” and “Tonsil” with p16 status annotation were only used in this analysis, resulting in a total of 67 samples (p16-positive OPSCC samples (n = 18) and p16-negative OPSCC samples (n = 49)). The remaining samples were excluded as they were from a different anatomic site or p16 status annotation was not available. Unsupervised hierarchical clustering of the OPSCC TCGA patient samples was performed based on the expression of our differentially expressed gene signature using Pearson correlation distance and average clustering.

Gene fusion analysis

A fusion gene refers to two genes (either in whole or in part) that undergo fusion resulting in a chimeric gene, which is usually caused by reasons such as chromosome translocation and associated problems. STAR-fusion software analysis and the detection of fusion genes were used to identify fusion genes. Fusion gene lists were filtered with STAR-fusion with the default filter method and other parameters.

Statistical analysis

Statistical analyses were conducted using R (version 4.0.1; https://www.r-project.org/). Survival analysis was performed via “survival” and “survminer” R packages [https://github.com/therneau/survival and https://github.com/kassambara/survminer/]. The overall survival (OS) was evaluated by Kaplan-Meier curves and the statistical difference was estimated using log-rank test.

Supporting information

S1 Table. Tumor DNA-seq (ClearSeq, N = 27) and RNA-seq (N = 24) samples.

(DOCX)

S2 Table. Summary of somatic variant annotations from Annovar.

(DOCX)

S3 Table. Summary of somatic variant annotations from COSMIC database.

(DOCX)

S4 Table. Fusion genes found in HNSCC samples.

(XLSX)

S5 Table. RNA-seq dataset downloaded from TCGA database.

(DOCX)

S6 Table. Samples were classified as HPV-positive using an empiric definition of > 1,000 mapped RNA-seq reads.

(DOCX)

S1 Fig. Following plot shows potential druggable gene categories along with up to top 5 genes involved in them.

(TIF)

S2 Fig. Heatmap of DEGs identified between p16+ vs p16- and p16+ vs p16-unknown groups.

(TIF)

Acknowledgments

We acknowledge the contribution of Dr Michael Rossi to this work.

Abbreviations

HNSCC

Head and Neck squamous cell carcinomas

COSMIC

Catalog of Somatic Mutations in Cancer

dbSNP

database of single nucleotide polymorphisms

HAPMAP

The haplotype map database

FFPE

stands for formalin-fixed, parafin-embedded

ExAC

Exome Aggregation Consortium

gnomAD

The Genome Aggregation Database

GATK

Genome Analysis Toolkit

TCGA

The Cancer Genome Atlas project

OPSCC

Oropharyngeal Squamous Cell Carcinoma

p16+

p16INK4A overexpression

HPV

Human Papilloma Virus

Data Availability

The sequencing data has been deposited at the Sequence Read Archive (SRA) under the BioProject ID PRJNA635454. All other relevant data are within the paper and its Supporting Information files.

Funding Statement

This research was supported by a grant NCI R21 CA182661-01A1 to NFS (5R21CA182661-02) and GZC (GZC 5R21CA182661-02). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Saba NF, Goodman M, Ward K, Flowers C, Ramalingam S, Owonikoko T, et al. Gender and ethnic disparities in incidence and survival of squamous cell carcinoma of the oral tongue, base of tongue, and tonsils: a surveillance, epidemiology and end results program-based analysis. Oncology 2011;81(1):12–20 10.1159/000330807 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Jethwa AR, Khariwala SS. Tobacco-related carcinogenesis in head and neck cancer. Cancer Metastasis Rev 2017;36(3):411–23 10.1007/s10555-017-9689-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Argiris A, Karamouzis MV, Raben D, Ferris RL. Head and neck cancer. Lancet 2008;371(9625):1695–709 10.1016/S0140-6736(08)60728-X [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Gillison ML, Akagi K, Xiao W, Jiang B, Pickard RKL, Li J, et al. Human papillomavirus and the landscape of secondary genetic alterations in oral cancers. Genome Res 2019;29(1):1–17 10.1101/gr.241141.118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Reimers N, Kasper HU, Weissenborn SJ, Stutzer H, Preuss SF, Hoffmann TK, et al. Combined analysis of HPV-DNA, p16 and EGFR expression to predict prognosis in oropharyngeal cancer. Int J Cancer 2007;120(8):1731–8 10.1002/ijc.22355 [DOI] [PubMed] [Google Scholar]
  • 6.Stephen JK, Divine G, Chen KM, Chitale D, Havard S, Worsham MJ. Significance of p16 in Site-specific HPV Positive and HPV Negative Head and Neck Squamous Cell Carcinoma. Cancer Clin Oncol 2013;2(1):51–61 10.5539/cco.v2n1p51 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ang KK, Harris J, Wheeler R, Weber R, Rosenthal DI, Nguyen-Tan PF, et al. Human papillomavirus and survival of patients with oropharyngeal cancer. N Engl J Med 2010;363(1):24–35 10.1056/NEJMoa0912217 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Stransky N, Egloff AM, Tward AD, Kostic AD, Cibulskis K, Sivachenko A, et al. The mutational landscape of head and neck squamous cell carcinoma. Science 2011;333(6046):1157–60 10.1126/science.1208130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Agrawal N, Frederick MJ, Pickering CR, Bettegowda C, Chang K, Li RJ, et al. Exome sequencing of head and neck squamous cell carcinoma reveals inactivating mutations in NOTCH1. Science 2011;333(6046):1154–7 10.1126/science.1206923 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.McBride SM, Rothenberg SM, Faquin WC, Chan AW, Clark JR, Ellisen LW, et al. Mutation frequency in 15 common cancer genes in high-risk head and neck squamous cell carcinoma. Head Neck 2014;36(8):1181–8 10.1002/hed.23430 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Slebos RJ, Yi Y, Ely K, Carter J, Evjen A, Zhang X, et al. Gene expression differences associated with human papillomavirus status in head and neck squamous cell carcinoma. Clin Cancer Res 2006;12(3 Pt 1):701–9 10.1158/1078-0432.CCR-05-2017 [DOI] [PubMed] [Google Scholar]
  • 12.Seiwert TY, Zuo Z, Keck MK, Khattri A, Pedamallu CS, Stricker T, et al. Integrative and comparative genomic analysis of HPV-positive and HPV-negative head and neck squamous cell carcinomas. Clin Cancer Res 2015;21(3):632–41 10.1158/1078-0432.CCR-13-3310 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Saba NF, Wilson M, Doho G, DaSilva J, Benjamin Isett R, Newman S, et al. Mutation and Transcriptional Profiling of Formalin-Fixed Paraffin Embedded Specimens as Companion Methods to Immunohistochemistry for Determining Therapeutic Targets in Oropharyngeal Squamous Cell Carcinoma (OPSCC): A Pilot of Proof of Principle. Head Neck Pathol 2015;9(2):223–35 10.1007/s12105-014-0566-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Cancer Genome Atlas N. Comprehensive genomic characterization of head and neck squamous cell carcinomas. Nature 2015;517(7536):576–82 10.1038/nature14129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.McCubrey JA, Steelman LS, Chappell WH, Abrams SL, Montalto G, Cervello M, et al. Mutations and deregulation of Ras/Raf/MEK/ERK and PI3K/PTEN/Akt/mTOR cascades which alter therapy response. Oncotarget 2012;3(9):954–87 10.18632/oncotarget.652 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Griffith M, Griffith OL, Coffman AC, Weible JV, McMichael JF, Spies NC, et al. DGIdb: mining the druggable genome. Nat Methods 2013;10(12):1209–10 10.1038/nmeth.2689 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Sanchez-Vega F, Mina M, Armenia J, Chatila WK, Luna A, La KC, et al. Oncogenic Signaling Pathways in The Cancer Genome Atlas. Cell 2018;173(2):321–37 e10 10.1016/j.cell.2018.03.035 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Guo T, Gaykalova DA, Considine M, Wheelan S, Pallavajjala A, Bishop JA, et al. Characterization of functionally active gene fusions in human papillomavirus related oropharyngeal squamous cell carcinoma. Int J Cancer 2016;139(2):373–82 10.1002/ijc.30081 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Forbes SA, Beare D, Boutselakis H, Bamford S, Bindal N, Tate J, et al. COSMIC: somatic cancer genetics at high-resolution. Nucleic Acids Res 2017;45(D1):D777–D83 10.1093/nar/gkw1121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 2010;38(16):e164 10.1093/nar/gkq603 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Mayakonda A, Koeffler P. 2016. Maftools: Efficient analysis, visualization and summarization of MAF files from large-scale cohort based cancer studies [Google Scholar]
  • 22.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010;26(1):139–40 10.1093/bioinformatics/btp616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Wan YW, Allen GI, Liu Z. TCGA2STAT: simple TCGA data access for integrated statistical analysis in R. Bioinformatics 2016;32(6):952–4 10.1093/bioinformatics/btv677 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Hyunseok Kang

21 May 2020

PONE-D-20-10368

Signatures of somatic mutations and gene expression from p16INK4A positive head and neck squamous cell carcinomas (HNSCC)

PLOS ONE

Dear Dr. Saba,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jul 05 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Hyunseok Kang, MD, MPH

Academic Editor

PLOS ONE

Additional Editor Comments (if provided):

Thank you for submitting this work to PLOS ONE. Please consider revising the manuscript as suggested by the reviewers.

Journal requirements:

When submitting your revision, we need you to address these additional requirements:

1.    Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified what type of consent you obtained from patients for tissue collection (for instance, written or verbal, and if verbal, how it was documented and witnessed).

In addition, please provide the source of the tissue samples used in this work (e.g. hospital, institution or medical center name).

3. Please provide the accession number of specific URL weblink of the specific RNA-Seq dataset downloaded from the TCGA for this study.

4. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

5. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

6. Please upload a new copy of Figure 3 and 6 as the detail is not clear. Please follow the link for more information: https://blogs.plos.org/plos/2019/06/looking-good-tips-for-creating-your-plos-figures-graphics/

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This work presents the signatures of somatic mutation and gene expression from p16ink4a positive HNSCC. Thank you for your valuable research on genetic variants and gene expression signatures of significant HPV infection in head and neck cancer. However, while reviewing this manuscript, it was difficult to check all the details because all the resolutions of the figures were low. Attaching high resolution figures will be helpful in the next review.

In this study, p16 of head and neck cancer FFPE was classified by IHC staining. In case of ‘p16 unknown’, didn't you perform IHC? Or did you stain IHC and the staining result wasn't clear? I recommend you need to define p16? group. I am still not sure why you compared p16 separately in this study. Therefore, the reason for comparing the p16 unknown group with the p16 positive group in this study should be clearly described in the result section.

In addition, in the various experiments conducted in this study, some were targeted only to OPSCC, and some were confirmed in HNSCC, including cancers of a few other origins. Personally, I think it might have been clearer for this study to focus only on OPSCC. Or, in some experiments, please add the reason why only the OPSCC group was tested separately.

Reviewer #2: comments:

1. figures are fuzzy, hardly to read

2. Please explain what you found in details in figure 1. you could not see the results is summarized in figure..'

3.the manuscript is disorganized and need to be re-written. ie. figure legends were inserted in the Results part. Please be careful. Legends for fig 4 and 5 were separated. Hardly to read.

4. Figure 2 A. What is the percentage means? what is the figure 2B telling us?

5. Figure 4A, is confusing, looks mismatch what depicted in the manuscript. What is' p16?' in the figure? is it unknown HPV? confused

6. Are those aberrant genetic features correlating with any patient survival?

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Sep 28;15(9):e0238497. doi: 10.1371/journal.pone.0238497.r002

Author response to Decision Letter 0


21 Jul 2020

Response to Editor and Reviewer:

Dear Editor,

We thank you very much for the comments and suggestions. The comments and suggestions are valuable and very helpful for revising and improving our manuscript. We have made revisions according to the referees’ comments and suggestions, as described in the authors’ response.

Additional Editor Comments (if provided):

Thank you for submitting this work to PLOS ONE. Please consider revising the manuscript as suggested by the reviewers.

Journal requirements:

When submitting your revision, we need you to address these additional requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

Manuscript formatted according to PLOS ONE requirements.

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified what type of consent you obtained from patients for tissue collection (for instance, written or verbal, and if verbal, how it was documented and witnessed).

In addition, please provide the source of the tissue samples used in this work (e.g. hospital, institution or medical center name).

“All tissue samples were collected from subjects who gave informed consent using the Winship-Emory University IRB-approved consent form for tissue collection. Clinical data collected included gender, smoking history, radiation treatment status, chemotherapy treatment status and stage (AJCC 7)”.

3. Please provide the accession number of specific URL weblink of the specific RNA-Seq dataset downloaded from the TCGA for this study.

The following text added in Methods section

“TCGA RNA-seq data including 49 HPV-negative and 18 HPV-positive tissue samples in the form of raw gene count data was downloaded using TCGA2STAT package for R (disease=”HNSC” and data.type=”RNASeq2”) and used to find overlap between TCGA gene expression and our HNSCC data. Both expression data and clinical data were available for 67 HNSCC tumors (from different locations: oral tongue, base of tongue, and tonsil). We used the dataset abbreviations as defined by the TCGA consortium, a public resource that catalogues clinical data and molecular characterizations of many cancer types (as defined in https://tcga-data.nci.nih.gov/docs/publications/tcga/)”

4. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

Both RNA and DNA-Seq raw datasets were uploaded at NCBI SRA database (BioProject: PRJNA635454).

https://dataview.ncbi.nlm.nih.gov/object/PRJNA635454?reviewer=m305l851tkjcfjg7voodtmd3ma

Included the following text in the main manuscript

“Data Availability: The sequencing data has been deposited at the Sequence Read Archive (SRA) under the BioProject ID PRJNA635454. All other relevant data are within the paper and its Supporting Information files.”

5. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

As you suggested by the editor we have removed the phrase “data not shown”.

6. Please upload a new copy of Figure 3 and 6 as the detail is not clear. Please follow the link for more information: https://blogs.plos.org/plos/2019/06/looking-good-tips-for-creating-your-plos-figures-graphics/

All 6 main and two supplementary figures were uploaded.

[Note: HTML markup is below. Please do not edit.]

Authors’ response

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #1: This work presents the signatures of somatic mutation and gene expression from p16ink4a positive HNSCC. Thank you for your valuable research on genetic variants and gene expression signatures of significant HPV infection in head and neck cancer. However, while reviewing this manuscript, it was difficult to check all the details because all the resolutions of the figures were low. Attaching high resolution figures will be helpful in the next review.

We have recreated all 6 main and two supplementary figure with high-resolution.

In this study, p16 of head and neck cancer FFPE was classified by IHC staining. In case of ‘p16 unknown’, didn't you perform IHC? Or did you stain IHC and the staining result wasn't clear? I recommend you need to define p16? group. I am still not sure why you compared p16 separately in this study. Therefore, the reason for comparing the p16 unknown group with the p16 positive group in this study should be clearly described in the result section.

We appreciate the author’s comments; we have attempted to check the p16 status on all samples; for some of the oral cavity samples residual tissue was exhausted; the lack of p16 status in these tumors even though a limitation does not alter the overall findings in this report given that these were oral cavity tumors. The fact that our results did not reveal notable differences between the p16 unknown and p16 negative is confirming of this as the clinical significance of a p16 positive status in oral cavity cancers is unclear.

In addition, in the various experiments conducted in this study, some were targeted only to OPSCC, and some were confirmed in HNSCC, including cancers of a few other origins. Personally, I think it might have been clearer for this study to focus only on OPSCC. Or, in some experiments, please add the reason why only the OPSCC group was tested separately.

OPSCC and non-OPSCC are two subsites with in HNSCC.

Reviewer #2: comments:

1. figures are fuzzy, hardly to read

We have recreated all 6 main and two supplementary figure with high-resolution.

2. Please explain what you found in details in figure 1. you could not see the results is summarized in figure..'

Figure 1 summarizes the somatic mutation analysis with out using matching normal samples. Our results are consistent with previously analyses with matching normal samples (Genome Res 2019 (Ref 4); Nature 2015 (Ref 14)).

3.the manuscript is disorganized and need to be re-written. ie. figure legends were inserted in the Results part. Please be careful. Legends for fig 4 and 5 were separated. Hardly to read.

We have reorganized according to PLOS One guidelines.

Figure captions are inserted immediately after the first paragraph in which the figure is cited.

Tables are inserted immediately after the first paragraph in which they are cited.

4. Figure 2 A. What is the percentage means? what is the figure 2B telling us?

Percent sample size of total substitutions in a group (eg. p16-negative) from 151 target genes. Previous analysis of HPV-negative HNSCC data identified T>C substitutions were correlated with tobacco exposure (Genome Res 2019; Ref 4). HPV-negative samples were over represented compared to HPV-positive samples in both Genome Res 2019 (Ref 4) and Nature 2015 (Ref 14) papers. In our case, we have more HPV-positive samples compared to HPV-negative.

5. Figure 4A, is confusing, looks mismatch what depicted in the manuscript. What is' p16?' in the figure? is it unknown HPV? confused

Figure 4A describes the differential expression analysis between p16-negative (Control) and p16-positive (Case) and, p16-unknown (Control) and p16-positive (Case) groups. Heatmap of commonly regulated/expressed genes suggested p16-unknown (Control) and p16-negative (Control) are similar in nature (added a new figure as supplementary file, Figure S2).

As noted above, the p16 unknown cases were oral cavity cancers where the tissue was exhausted; we included these cases in our analysis as we think the behavior and profile of these tumors was likely the same as our p16 negative cohort.

6. Are those aberrant genetic features correlating with any patient survival?

We have included patient survival analysis. From the curves, it is evident that the patients, who have negative (or unknown) status for the p16, have more death rate as compared to the patients, who are positive for p16. Moreover, a clear association of Kaplan-Meier survival curve of p16-based molecular subtype p16-negative with p16-unknown group supports RNA-seq based gene expression profiles (Figure. S2).

Nabil F Saba MD

Georgia Z Chen PhD

(Corresponding authors)

Attachment

Submitted filename: PONE-D-20-10368 - Response to Reviewers.GCns.docx

Decision Letter 1

Hyunseok Kang

19 Aug 2020

Signatures of somatic mutations and gene expression from p16INK4A positive head and neck squamous cell carcinomas (HNSCC)

PONE-D-20-10368R1

Dear Dr. Saba,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Hyunseok Kang, MD, MPH

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: I Don't Know

Reviewer #2: I Don't Know

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: All comments have been addressed. On discussion section you mentioned that 'Using RNA-seq data in parallel with targeted sequencing of a panel of 151 genes, we demonstrate that gene expression data from FFPE samples can identify gene signatures characteristic of p16 + versus p16- OPSCC which is a subsite of HNSCC.' but you used some of p16 non OPSCC samples (GHN-25). This paper analyzes data not only from OSCC but also from all HNSCC, and since data is mixed(p16+ all OSCC, p16- some OSCC and non-OSCC, and p16? all non-OSCC), accuracy of expression is considered to be important. Overall, if you care about this part, I think the rest of all is a well designed study.

Reviewer #2: The authors basically answered the question, although limitation for this research exist, it is qualified to be published.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Acceptance letter

Hyunseok Kang

14 Sep 2020

PONE-D-20-10368R1

Signatures of somatic mutations and gene expression from p16INK4A positive head and neck squamous cell carcinomas (HNSCC)

Dear Dr. Saba:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Hyunseok Kang

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Tumor DNA-seq (ClearSeq, N = 27) and RNA-seq (N = 24) samples.

    (DOCX)

    S2 Table. Summary of somatic variant annotations from Annovar.

    (DOCX)

    S3 Table. Summary of somatic variant annotations from COSMIC database.

    (DOCX)

    S4 Table. Fusion genes found in HNSCC samples.

    (XLSX)

    S5 Table. RNA-seq dataset downloaded from TCGA database.

    (DOCX)

    S6 Table. Samples were classified as HPV-positive using an empiric definition of > 1,000 mapped RNA-seq reads.

    (DOCX)

    S1 Fig. Following plot shows potential druggable gene categories along with up to top 5 genes involved in them.

    (TIF)

    S2 Fig. Heatmap of DEGs identified between p16+ vs p16- and p16+ vs p16-unknown groups.

    (TIF)

    Attachment

    Submitted filename: PONE-D-20-10368 - Response to Reviewers.GCns.docx

    Data Availability Statement

    The sequencing data has been deposited at the Sequence Read Archive (SRA) under the BioProject ID PRJNA635454. All other relevant data are within the paper and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES