Abstract
Lynch syndrome (LS) arises in patients with pathogenic germline variants in DNA mismatch repair genes. LS is the most common inherited cancer predisposition condition and confers an elevated lifetime risk of multiple cancers notably colorectal and endometrial carcinomas. A distinguishing feature of LS associated tumors is accumulation of variants targeting microsatellite repeats and the potential for high tumor specific neoepitope levels. Recurrent somatic variants targeting a small subset of genes have been identified in tumors with microsatellite instability. Notably these include frameshifts that can activate immune responses and provide vaccine targets to affect the lifetime cancer risk associated with LS. However the presence and persistence of targeted neoepitopes across multiple tumors in single LS patients has not been rigorously studied. Here we profiled the genomic landscapes of five distinct treatment naïve tumors, a papillary transitional cell renal cell carcinoma, a duodenal carcinoma, two metachronous colorectal carcinomas, and multi-regional sampling in a triple-negative breast tumor, arising in a LS patient over 10 years. Our analyses suggest each tumor evolves a unique complement of variants and that vaccines based on potential neoepitopes from one tissue may not be effective across all tumors that can arise during the lifetime of LS patients.
Subject terms: Cancer genomics, Genome informatics, Cancer
Introduction
Lynch syndrome (LS) is the most common human cancer predisposition syndrome1. It arises as a result of defective DNA mismatch repair (MMR) with loss of the post-replicative proofreading and editing system that ensures genome integrity. As a result, patients with LS are predisposed to a spectrum of cancers, notably carcinomas of the colorectum and endometrium. LS is associated with pathogenic germline variants in one of the four key MMR genes, mutL homologue 1 (MLH1), mutS homologue 2 (MSH2), mutS homologue 6 (MSH6) or postmeiotic segregation increased 2 (PMS2). Tumors arising in LS patients typically undergo loss of heterozygosity (LOH) of the wild type allele of the gene of interest resulting in complete loss of function of the protein. The genomic instability associated with MMR targets microsatellite repeats resulting in an accumulation of frameshift and insertion/deletion (indels) variants2,3. This microsatellite instability (MSI) signature is associated with responses to immune checkpoint inhibition (ICI) likely as a result of variant derived neoepitopes that may trigger the host immune system4,5. In contrast to other forms of genomic instability, notably associated with loss of BRCA function, MSI + tumors tend to be diploid with fewer chromosomal aberrations in their genomes6–8.
Clinical testing for LS includes immunohistochemistry (IHC) assays for expression of the four primary MMR genes and a panel of PCR markers for the presence of shifts in the number of dinucleotide and trinucleotide microsatellite repeats at selected loci9. The absence of expression for one or more of the 4 MMR genes and the presence of MSI defined by shifts in at least 2 of 5 microsatellite loci provides molecular confirmation for a diagnosis of LS. Despite a common molecular pathogenic basis the responses of MSI + tumors to ICI varies suggesting that either distinct variants or a threshold for elevated mutational burden trigger more pronounced immune responses10. Studies of mutational burdens and neoepitopes have primarily focused on colorectal cancer (CRC), the most common tumor associated with MSI + status including LS. Reports have suggested that MSI + CRCs evolve a set of recurring somatic variants targeting both coding and noncoding loci that may be exploited for vaccine development11. However, unlike patients with sporadic MSI + tumors LS patients have a lifetime risk of multiple different types of cancers. Despite this, little is known about the mutational landscape and neoepitope profiles of tumors arising at multiple sites over time in individual LS patients.
To investigate the longitudinal mutational patterns arising in LS associated tumors we interrogated the genomes of five different cancers that arose over a period of 10 years in a patient who underwent resection in the absence of chemotherapy and radiation for each cancer. These included a papillary transitional cell carcinoma (PTCC) in the renal pelvis, a duodenal carcinoma (DC), two separate CRCs that arose 3 years apart, and multiple regions of a triple negative breast cancer (TNBC). In each case we flow sorted tumor fractions from archived formalin fixed paraffin embedded (FFPE) tissue and profiled the tumor genomes with whole genome copy number variant (CNV) arrays and whole exome sequencing. These data were then used to identify the pathogenic variant underlying the diagnosis of LS, and to compare and contrast the CNV, mutational and neoepitope patterns across these divergent tumors that arose over a 10 year period. These results provide a unique analysis of distinct MSI + tumors arising in a single LS patient (Fig. 1).
Results
Identification of pathogenic LS variant
A common feature of tumors arising in LS patients is the loss of wild type alleles for the affected MMR gene. Although IHC and PCR testing in 2011 confirmed the diagnosis of LS the pathogenic MMR gene variant remained to be determined. We used the exome data from each tumor and a patient matched normal tissue to screen all known MMR genes for germline pathogenic variants that included LOH of the wild type allele in the tumors. Given previous IHC results we focused on variants in MSH2 and MSH6 as candidates. We identified the pathogenic variant NM_000251.2 (MSH2):c.942 + 3A > T that was heterozygous in the germ line but had converted to homozygosity in each tumor profiled (Fig. 2). This is a common pathogenic MSH2 variant associated with LS12. In contrast, none of the variants detected in MMR genes had allele patterns consistent with a pathogenic role in the predisposition to cancer (Supplementary Table S1). Strikingly there was a somatic MSH6M875I variant in the TNBC samples. The lack of MSH6 IHC staining is consistent with the presence of this likely pathogenic variant giving rise to a double negative MSI + tumor13,14. In contrast to MSH2 this variant was heterozygous suggesting that additional events such as epigenetic silencing may have contributed to the lack of MSH6 expression. However there was an absence of MSH2 and MSH6 expression in both the TNBC from 2017 and the CRC from 2018, the latter lacking the somatic MSH6 variant. Notably lack of MSH6 staining is known to occur due to disruption of MSH2-MSH6 heterodimer complexes and the degradation of MSH6 in the presence of a homozygous pathogenic MSH2 variant15,16.
Distinct driver mutations, CNV profiles and ploidies in each LS cancer
Tumor genomes that were diploid, duodenal carcinoma (2008) and the two CRCs (2015, 2018), or near diploid, PTCC (2008) by DNA content contained low levels of CNVs (Fig. 3). The absence of chromosomal instability is frequently seen in MSI + tumors6,8. In contrast three biopsies from different regions within the TNBC (2017) had 2.8 N and 3.2 N ploidies (Fig. 4). In addition the genomes of each of the three sorted TNBC fractions had high levels of CNVs that included every chromosome consistent with a BRCA like phenotype. Despite this additional chromosomal instability and variation in ploidies the CNV patterns were identical across the three sorted populations.
A significant feature of this study is that all biopsies were from chemoradiation naive surgically resected tissues. Thus the pattern of variants and the predicted neoepitopes recapitulate the natural history of the tumors. Whole exome sequences were obtained from the PTCC, the two CRCs, and the TNBC sorted samples. Each tumor had unique variants that reflected the tissue of origin (Fig. 1, Table 1). For example the PTCC from 2008 had a FGFR3R248C pathogenic variant that is enriched in upper urothelial tract cancers arising in LS patients17; the CRC from 2015 had variants of unknown significance (VUS) in PIK3CAH510N and SMAD4G230V whereas the CRC from 2018 had somatic indels in TGFBR2 and RNF43, all of which are frequent somatic targets in CRCs6, while the TNBC from 2017 had a frame shift indel in PTEN and a FANCMS789X nonsense variant in each of the three sorted biopsies. These variant patterns are consistent with the independent nature of each tumor.
Table 1.
PTCC (2008) | CRC (2015) | TNBCC (2017) | CRC (2018) |
---|---|---|---|
FGFR3R248C | PIK3CAH510N | RANCMS789X | SETD1BH5fs-delC |
ARID1AR1276X | SMAD4G230V | PTENc.955_958delACTT | TGFBR2E150fs-delA |
BRAFR671X | ERBB4P241H | TP53R114C | RNF43G532fs-delG |
KMT2DR1756Q | TP53A74S | PALB2c.839delA | SEC31AI463fs-delA |
AIM2c.1026_1027delAA | |||
MSH6M573l |
Heterogeneity in different cancer types from the same LS patient
We observed limited overlap in somatic single nucleotide variants (SNVs) and frameshift variants (Fig. 5). These observations suggest that these cancers arise independently and the mutational landscape, with rare exceptions, differs across these cancers. Even though these four cancers arise from the same patient at different time points, it is highly likely that independent somatic variants give rise to each cancer, in the genetic background of LS. However we found four somatic likely benign SNVs that are shared across the four different cancers (Supplementary Tables S2, S3). These variants targeted four genes: CROCCP2 (upstream gene variant), PTPRD (intronic variant), ACOT2 (synonymous variant), and RCAN1 (intronic variant). In addition we found one shared frameshift VUS targeting the epigenetic regulator KMT2C (Supplementary Fig. S1). The latter is frequently mutated in a variety of cancers18,19.
Consistent with very little overlap in SNVs and frameshift variants across these cancers, we found no overlap in potential neoepitopes (Fig. 6). These results suggest that vaccines that are developed based on potential neoepitopes of one tissue may not work well across all tissues in LS patients.
Multiregional sequencing and heterogeneity within the triple negative breast cancer from the LS patient
We observed a large degree of heterogeneity across multiregional sequencing from three TNBC biopsies for the same breast tissue. Out of 5,193 total SNVs found across three biopsies, 2,052 (~ 40%) are shared (Fig. 7A). Out of 492 total frameshift variants found across three biopsies, 261 (~ 53%) are shared (Fig. 7B). The proportion of potential neoepitopes that are shared across three biopsies for HLA-A*24:02, HLA-B*38:01, HLA-B*27:05, and HLA-C*12:03 are 53%, 54%, 46%, and 50%, respectively (Fig. 7C). These observations suggest that this subset of neoepitopes could potentially work as therapeutics for targeting the TNBC.
Similarities and differences between two metachronous LS colon cancers
The initial CRC was resected in 2015 and was an adenocarcinoma with mucinous features that included 3/30 positive lymph nodes. The second CRC was resected in 2018 and was also a low grade adenocarcinoma with mucinous features but without any lymph node involvement. In contrast to the three breast biopsies sharing about 50% of SNVs, frameshift variants, and potential neoepitopes, we observed significantly less overlap between the two CRCs: only 0.5% of SNVs and 3% of frameshift variants are shared (Fig. 8A,B). Further, in only one of the HLA, HLA-C*12:03, we found one potential neoepitope that is shared between these two colorectal cancers (Fig. 8C). These results indicate that these two colorectal cancers are separate cancers and they arose independently of each other.
Discussion
Our study provides a unique portrait of the genomic and neoepitope landscapes across four distinct chemoradiation naive tumors arising over a ten year period in a LS patient. These include a PTCC, two metachronous CRCs, and multi-regional analysis of a TNBC. In each case we flow sorted tumor nuclei from FFPE tissue to increase the resolution of our genomic analyses including whole genome CNV and whole exome profiles. Each tumor contained genomic features that are specific to the tissue of origin and the subtype of tumor. Notably the PTCC had a FGFR3R248C pathogenic variant that is a recurring driver lesion for this tumor, while the TNBC had BRCA-like CNV features including high levels of interstitial aberrations and a somatic nonsense variant in FANCM. The latter has been identified as a breast cancer predisposition gene that confers an increased risk of TNBC20,21.
The tumor samples were sequenced at different times during our study. Furthermore FFPE tissues may harbor various artifacts that can interfere with genomic analyses. However we applied a rigorous variant calling pipeline with identical filters across all samples for variant calling. This included remapping all reads for each flow sorted tumor sample to the most recent 1,000 genomes HG38 build of the human genome. The PTCC and the multiple biopsies from the TNBC had aneuploid DNA contents that were exploited to flow sort tumor populations for genomic analyses. In contrast the duodenal and two metachronous CRCs were diploid by flow cytometry. In these cases we gated on the 4N(G2/M) fractions to enrich tumor nuclei for our analyses. There were variations in tumor purity in the later samples based on allele frequencies of somatic variants. Notably the 2015 CRC sample retained non-tumor nuclei in the sorted fractions that affected the exome results. Nevertheless we did not detect any shared somatic variants including those that were homozygous in the PTCC and TNBC sorted biopsies.
We examined the genes that contain at least one missense variant, the specific missense variations, and the potential neoepitopes that are shared across four different metachronous cancers arising over 10 years within the same LS patient (PTCC, two CRCs, and TNBC). We found little to no overlap in the genomes of these chemoradiation naive tumors, suggesting that tissue specific independent variants contribute to each of these cancers, instead of a common set of driver events. The lack of overlap in the predicted neoepitopes across these tumors suggests that a vaccine strategy for the lifetime risk of cancer in LS patients will be challenged by the diversity of each tumor including those arising in the same tissue. The one exception was a single base pair frame shift deletion at codon 2171 in KMT2C. Recurring frameshift deletions targeting this codon (COSV51277546) have been reported in various cancers22. Notably mutations and loss of expression of KMT2C and other members of this lysine methyltransferase family are associated with increased survival in pancreatic cancers23,24. Given the lengthy clinical history of this patient it will be of interest in future studies to explore the potential role of epigenetic dysregulation in LS patients.
Methods
Clinical samples
Tissue samples were obtained under a Mayo Clinic protocol 2130-00 Cancer Tissue Study (Principal Investigator Dr. B. A. Pockaj). This study was approved by Mayo Clinic IRB protocol 08-006579-08 Breast Cancer Clinical Genomics Project. The patient gave informed consent for the collection and use of the samples. All tumor samples were histopathologically evaluated prior to genomic analysis. All research conformed to the Helsinki Declaration (https://www.wma.net/policies-post/wma-declaration-of-helsinki-ethical-principles-for-medical-research-involving-human-subjects/). The LS patient’s oncologic history started in 1988 at age 53 with uterine cancer (Fig. 1). The patient underwent surgery with no additional therapy. She then started on hormonal replacement therapy and 9 months later developed a right breast palpable mass that was found to be cancer. She was treated with mastectomy and axillary lymph node dissection. Zero out of 20 lymph nodes was involved and she denied any tamoxifen therapy. She did well until 2005 when she was noted to have a kidney mass and until November 2007 when she had urinary tract infections and the same kidney mass that had not progressed. In 2008 she underwent a right nephroureterectomy for a grade 2, stage IA transitional cell carcinoma of the renal pelvis, with negative ureteral margins. In September of 2008 she was found to have a duodenal cancer and underwent a Whipple procedure which had a 10 × 8.5 cm infiltrating poorly differentiated carcinoma with medullary features of the duodenum. Four lymph nodes were removed and they were negative. The margins were also negative.
She was then followed and eventually was found to have bladder cancer in 2011. The patient’s tumor was tested for defective DNA mismatch repair in May 2011 using IHC for the four microsatellite DNA repair proteins (MSH2, MSH6, MLH1, PMS2) and PCR assays for five microsatellite loci (BAT25, BAT26, Mono27, NR24, and NR21). These clinical diagnostic tests and additional IHC analysis of MSH2 and MSH6 in TNBC (2017) and CRC (2018) tissues were done by the Department of Laboratory Medicine and Pathology, Mayo Clinic. All additional genomic profiling described in our manuscript was done in the setting of a research study. There was an absence of MSH2 and MSH6 staining as well as microsatellite instability (MSI) noted at 5 of 5 informative PCR markers confirming the MSI + nature of the tumor. In 2015 she was feeling poorly and workup found that she had a CRC. She underwent a right hemicolectomy which showed 3 colon cancers, one 6.5 cm in size, one 5.4 cm in size, the other 1.2 cm in size. These were low grade adenocarcinomas with partial mucinous features. The margins were negative. There was no lymphovascular invasion, however 3/30 lymph nodes were positive for metastatic disease. She was seen in consultation and declined any chemotherapy. In 2017 she presented with a contralateral or left breast cancer. Estrogen receptor (ER) and progesterone receptor (PR) were evaluated by standard ASCO/CAP guidelines with < 1% of the cells staining for the receptors respectively25. HER2 negative was defined by ASCO/CAP guidelines as staining by IHC of 0 or 1 + 26. HER2 IHC of 2 + was further evaluated by FISH and deemed negative by standard ASCO/CAP guidelines. Her most recent tumor in 2018 was an invasive mucinous CRC arising in a tubulovillus adenoma. The invasion involved submucosa (pT1) with negative (0/13) lymph nodes.
Flow cytometry
Excess paraffin was removed from each FFPE sample with a scalpel from either side of 40–60 μm scrolls then processed according to our published methods27,28. We used one to three 50 µm scrolls from each FFPE tissue block to obtain sufficient numbers of intact nuclei for subsequent sorting and molecular assays. Nuclei from each sample were disaggregated then filtered through a 40 μm mesh prior to flow sorting with an Influx cytometer (Becton–Dickinson, San Jose, CA) with ultraviolet excitation and DAPI emission collected at > 450 nm. DNA content and cell cycle were analyzed using the software program MultiCycle (Phoenix Flow Systems, San Diego, CA).
Copy number analysis
DNAs from flow sorted FFPE tissue were treated for one minute with DNAse 1 prior to Klenow-based labeling. In each case 1 ul of 10 × DNase 1 reaction buffer and 2 μl of DNase 1 dilution buffer were added to 7 μl of DNA sample and incubated at room temperature then transferred to 70 °C for 30 min to deactivate DNase 1. Sample and reference templates were then labeled with Cy-5 dUTP and Cy-3 dUTP respectively using a BioPrime labeling kit (Invitrogen, Carlsbad, CA) according to our published protocols29. All labeling reactions were assessed using a Nanodrop assay (Nanodrop, Wilmington, DE) prior to mixing and hybridization to CGH arrays (Agilent Technologies, Santa Clara, CA) for 40 h in a rotating 65 °C oven. All microarray slides were scanned using an Agilent 2565C DNA scanner and the images were analyzed with Agilent Feature Extraction version 11.0 using default settings. The aCGH data was assessed with a series of QC metrics then analyzed using an aberration detection algorithm (ADM2)30. The latter identifies all aberrant intervals in a given sample with consistently high or low log ratios based on the statistical score derived from the average normalized log ratios of all probes in the genomic interval multiplied by the square root of the number of these probes. This score represents the deviation of the average of the normalized log ratios from its expected value of zero and is proportional to the height h (absolute average log ratio) of the genomic interval, and to the square root of the number of probes in the interval.
Sequence filtering, QC and alignment
DNAs from each sorted tumor population and a patient matched control sample were sequenced within the Mayo Clinic Medical Genome Facility (MGF) using established protocols for whole exome analysis. Briefly, whole exon capture was carried out with Agilent’s SureSelect Human All Exon 71 MB v6 kit. 500 ng of the prepped library is incubated with whole exon biotinylated RNA capture baits supplied in the kit for 24 h at 65 °C. The captured DNA:RNA hybrids were recovered using Dynabeads MyOne Streptavidin T1 (Dynal). The DNA was eluted from the beads and desalted using purified using Ampure XP beads (Agencourt).The purified capture products were then amplified using the SureSelect Post-Capture Indexing forward and Index PCR reverse primers (Agilent) for 12 cycles. Libraries were loaded onto paired end flow cells at concentrations of 4–5 pM to generate cluster densities of 600,000–800,000/mm2 using the Illumina cBot and HiSeq Paired end cluster kit version 3.The flow cells were sequenced to a mean depth of 80X as 101 X 2 paired end reads on an Illumina HiSeq 4,000 using TruSeq SBS sequencing kit version 3 and HiSeq data collection version 1.4.8 software. Base-calling was performed using Illumina’s RTA version 1.12.4.2. Reads from the BAM files were stripped using XYalign version 1.1.531. We then mapped stripped reads to the 1,000 genomes version of GRCh38 using bwa-mem version 0.7.1732,33.
Neoepitope prediction
We applied EpitopeHunter to predict neoepitopes as previously described34. The steps are briefly as follows: We used VarScan version 2.3.9 to call variant with the following thresholds: minimum coverage of 10, minimum variant allele frequency of 0.08, and somatic p value of 0.0535. Variants were annotated using variant effect predictor (VEP) version 8636. We generated peptides consisting of 8 amino acids (8-mers), 9 amino acids (9-mers), 10 amino acids (10-mers), and 11 amino acids (11-mers) using pvacseq version 3.0.537. We used HLA-LA to perform HLA typing38. Binding affinity between each HLA and each peptide was calculated using the Immune Epitope Database, IEDB39. Individual peptides were called a potential neoepitope if their binding affinity was less than 500 nM.
Supplementary information
Acknowledgements
Funding for this work was provided by the Marley Foundation, the Ziccarelli Foundation, and the Breast Cancer Research Foundation (BCRF). Mayo Clinic Cancer Center is supported in part by an NCI Cancer Center Support Grant 5P30 CA15083-36.
Author contributions
T.N.P., A.S. and M.A.W. performed mutation and neoepitope prediction analyses. E.L. and S.M. processed tissue and DNA samples, and did the flow sorting and CNV analyses. B.A.P. reviewed all clinical data and samples. K.S.A., B.A.P. and M.T.B. designed the study and reviewed all data. T.N.P., M.A.W. and M.T.B. wrote the paper.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
is available for this paper at 10.1038/s41598-020-68939-7.
References
- 1.Lynch HT, Snyder CL, Shaw TG, Heinen CD, Hitchins MP. Milestones of lynch syndrome: 1895–2015. Nat. Rev. Cancer. 2015;15:181–194. doi: 10.1038/nrc3878. [DOI] [PubMed] [Google Scholar]
- 2.Kondelin J, et al. Comprehensive evaluation of protein coding mononucleotide microsatellites in microsatellite-unstable colorectal cancer. Cancer Res. 2017;77:4078–4088. doi: 10.1158/0008-5472.CAN-17-0682. [DOI] [PubMed] [Google Scholar]
- 3.Liu B, et al. hMSH2 mutations in hereditary nonpolyposis colorectal cancer kindreds. Cancer Res. 1994;54:4590–4594. [PubMed] [Google Scholar]
- 4.Le DT, et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science. 2017;357:409–413. doi: 10.1126/science.aan6733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Llosa NJ, et al. The vigorous immune microenvironment of microsatellite instable colon cancer is balanced by multiple counter-inhibitory checkpoints. Cancer Discov. 2015;5:43–51. doi: 10.1158/2159-8290.CD-14-0863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Muzny DM, et al. Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012;487:330–337. doi: 10.1038/nature11252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Cancer Genome Atlas Research Network et al. Integrated genomic characterization of endometrial carcinoma. Nature. 2013;497:67–73. doi: 10.1038/nature12113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Berg KCG, et al. Multi-omics of 34 colorectal cancer cell lines—A resource for biomedical studies. Mol. Cancer. 2017;16:116. doi: 10.1186/s12943-017-0691-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Kawakami H, Zaanan A, Sinicrope FA. Microsatellite instability testing and its role in the management of colorectal cancer. Curr. Treat. Options Oncol. 2015;16:30. doi: 10.1007/s11864-015-0348-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Mandal R, et al. Genetic diversity of tumors with mismatch repair deficiency influences anti-PD-1 immunotherapy response. Science. 2019;364:485–491. doi: 10.1126/science.aau0447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cortes-Ciriano I, Lee S, Park WY, Kim TM, Park PJ. A molecular portrait of microsatellite instability across multiple cancers. Nat. Commun. 2017;8:15180. doi: 10.1038/ncomms15180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.National Center for Biotechnology Information. ClinVar; [VCV000036580.4].
- 13.Vargas-Parra GM, et al. Elucidating the molecular basis of MSH2-deficient tumors by combined germline and somatic analysis. Int. J. Cancer. 2017;141:1365–1380. doi: 10.1002/ijc.30820. [DOI] [PubMed] [Google Scholar]
- 14.Ashktorab H, et al. Targeted exome sequencing reveals distinct pathogenic variants in Iranians with colorectal cancer. Oncotarget. 2017;8:7852–7866. doi: 10.18632/oncotarget.13977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Richman S. Deficient mismatch repair: Read all about it (review) Int. J. Oncol. 2015;47:1189–1202. doi: 10.3892/ijo.2015.3119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hall G, et al. Immunohistochemistry for PMS2 and MSH6 alone can replace a four antibody panel for mismatch repair deficiency screening in colorectal adenocarcinoma. Pathology. 2010;42:409–413. doi: 10.3109/00313025.2010.493871. [DOI] [PubMed] [Google Scholar]
- 17.Donahu TF, et al. Genomic characterization of upper-tract urothelial carcinoma in patients with lynch syndrome. JCO Precis. Oncol. 2018 doi: 10.1200/PO.17.00143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Gao YB, et al. Genetic landscape of esophageal squamous cell carcinoma. Nat. Genet. 2014;46:1097–1102. doi: 10.1038/ng.3076. [DOI] [PubMed] [Google Scholar]
- 19.Rao RC, Dou Y. Hijacked in cancer: The KMT2 (MLL) family of methyltransferases. Nat. Rev. Cancer. 2015;15:334–346. doi: 10.1038/nrc3929. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kiiski JI, et al. Exome sequencing identifies FANCM as a susceptibility gene for triple-negative breast cancer. Proc Natl Acad Sci U S A. 2014;111:15172–15177. doi: 10.1073/pnas.1407909111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Neidhardt G, et al. Association between loss-of-function mutations within the FANCM gene and early-onset familial breast cancer. JAMA Oncol. 2017;3:1245–1248. doi: 10.1001/jamaoncol.2016.5592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cancer, C. C. O. S. M. I. https://cancer.sanger.ac.uk/cosmic/signatures.
- 23.Dawkins JB, et al. Reduced expression of histone methyltransferases KMT2C and KMT2D correlates with improved outcome in pancreatic ductal adenocarcinoma. Cancer Res. 2016;76:4861–4871. doi: 10.1158/0008-5472.CAN-16-0481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sausen M, et al. Clinical implications of genomic alterations in the tumour and circulation of pancreatic cancer patients. Nat. Commun. 2015;6:7686. doi: 10.1038/ncomms8686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fitzgibbons PL, Murphy DA, Hammond ME, Allred DC, Valenstein PN. Recommendations for validating estrogen and progesterone receptor immunohistochemistry assays. Arch. Pathol. Lab. Med. 2010;134:930–935. doi: 10.1043/1543-2165-134.6.930. [DOI] [PubMed] [Google Scholar]
- 26.Wolff AC, et al. Recommendations for human epidermal growth factor receptor 2 testing in breast cancer: American Society of Clinical Oncology/College of American Pathologists clinical practice guideline update. Arch. Pathol. Lab. Med. 2014;138:241–256. doi: 10.5858/arpa.2013-0953-SA. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Holley T, et al. Deep clonal profiling of formalin fixed paraffin embedded clinical samples. PLoS ONE. 2012;7:e50586. doi: 10.1371/journal.pone.0050586. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Barrett MT, et al. Genomic amplification of 9p24.1 targeting JAK2, PD-L1, and PD-L2 is enriched in high-risk triple negative breast cancer. Oncotarget. 2015;6:26483–26493. doi: 10.18632/oncotarget.4494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ruiz C, et al. Advancing a clinically relevant perspective of the clonal nature of cancer. Proc. Natl. Acad. Sci. U.S.A. 2011;108:12054–12059. doi: 10.1073/pnas.1104009108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lipson D, Aumann Y, Ben-Dor A, Linial N, Yakhini Z. Efficient calculation of interval scores for DNA copy number data analysis. J. Comput. Biol. 2006;13:215–228. doi: 10.1089/cmb.2006.13.215. [DOI] [PubMed] [Google Scholar]
- 31.Webster TH, et al. Identifying, understanding, and correcting technical artifacts on the sex chromosomes in next-generation sequencing data. Gigascience. 2019 doi: 10.1093/gigascience/giz074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Clarke L, et al. The 1000 genomes project: Data management and community access. Nat. Methods. 2012;9:459–462. doi: 10.1038/nmeth.1974. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.33Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv [q-bio.GN].https://arxiv.org/abs/1303.3997 (2013).
- 34.Narang P, Chen M, Sharma AA, Anderson KS, Wilson MA. The neoepitope landscape of breast cancer: Implications for immunotherapy. BMC Cancer. 2019;19:200. doi: 10.1186/s12885-019-5402-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Koboldt DC, et al. VarScan 2: Somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res. 2012;22:568–576. doi: 10.1101/gr.129684.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.McLaren W, et al. The Ensembl variant effect predictor. Genome Biol. 2016;17:122. doi: 10.1186/s13059-016-0974-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Hundal J, et al. pVAC-Seq: A genome-guided in silico approach to identifying tumor neoantigens. Genome Med. 2016;8:11. doi: 10.1186/s13073-016-0264-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Dilthey AT, et al. HLA*LA-HLA typing from linearly projected graph alignments. Bioinformatics. 2019;35:4394–4396. doi: 10.1093/bioinformatics/btz235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Vita R, et al. The immune epitope database 2.0. Nucleic Acids Res. 2010;38:D854–862. doi: 10.1093/nar/gkp1004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.