Abstract
NUT midline carcinoma is an aggressive neoplasm defined by chromosomal rearrangements of the nuclear protein in testis (NUT) gene (NUTM1). In this article, we present a strategy to detect this rare tumor through a standard DNA methylation array analysis even when occurring in unusual anatomic sites. We illustrate our approach through a case study in which we detected metastatic spread of a NUT midline carcinoma within a bone marrow biopsy that exhibited histological features of a blastoid, undifferentiated neoplasm. Our strategy builds on molecular data derived from The Cancer Genome Atlas and Gene Expression Omnibus as well as computational strategies adopted from the Brain Tumor Methylation Classifier. It is a combined approach that detects the unusual cell lineage of NUT midline carcinomas and makes diagnostic use of the entity-specific copy number alterations.
Electronic supplementary material
The online version of this article (10.1007/s00428-020-02869-7) contains supplementary material, which is available to authorized users.
Keywords: NUT midline carcinoma, DNA methylation profile, Copy number variation profile, Machine learning
Introduction
NUT midline carcinoma (NMC) is an aggressive neoplasm defined by chromosomal rearrangements of the nuclear protein in testis (NUT) gene (NUTM1), which most commonly is fused with genes of the bromodomain and extra-terminal domain family, like bromodomain-containing protein 4 (BRD4) and bromodomain-containing protein 3 (BRD3) [1, 2]. NMC is characterized by a very poor prognosis with a median survival time of 6.7 months and can occur anywhere along the trunk, most commonly along the midline, with typical sites being the head, neck, and mediastinum [2, 3]. Classical methods to diagnose NMC rely on morphology, immunohistochemistry, and in situ hybridization. Here, we present a novel strategy to detect this rare tumor directly through a standard DNA methylation array analysis. We illustrate our approach through a case study of a NUT midline carcinoma within a bone marrow biopsy, exhibiting histological features of a blastoid, undifferentiated neoplasm.
Materials and methods
Illustrative case
A 71-year-old male presented with B-symptoms (night sweats for the last 6 months and weight loss of 10 kg within the last 3 months) and lower back pain lasting for more than 6 months. Magnetic resonance imaging (MRI) revealed multiple lesions in the lungs, mediastinum, kidney, liver, and bones. An externally performed biopsy of the pulmonary mass failed to render a conclusive diagnosis. Subsequently, suspecting a hematolymphoid neoplasm because of accompanying cytopenia, a BM biopsy was performed and showed a diffuse infiltration by cohesive, blastoid appearing medium- to large-sized tumor cells with high mitotic activity (Fig. 1a–c). No squamous or glandular differentiation was found. Suspected differential diagnoses included, among others, melanoma, carcinoma, or (pleomorphic) rhabdomyosarcoma. On immunohistochemistry, the neoplastic cells were positive for CD99 and p63 (Fig. 1d), while remaining negative for ALK, CD3, CD5, CD11c, CD20, CD34, CD79a, CD57, CD117, chromogranin, CK19, CK22, desmin, ERG, Melan-A, HMB45, mast cell tryptase, MPO, MyoD1, PSA, PSAP, S100, TLE1, and WT1 and showed a retained expression of INI-1. Ewing’s sarcoma, mesenchymal chondrosarcoma, and NMC were then considered as differentials. Immunohistochemical NUT protein staining showed granular positivity for NUT in more than 50% of the tumor cells (Fig. 1e). Subsequent fluorescence in situ hybridization (FISH) detected a classical NUTM1 gene break by (Fig. 1f). The patient died 2 months after receiving the diagnosis of NMC.
DNA methylation array-based computational diagnostic approach
Upon request of the treating oncologists, a DNA methylation array was performed. Approximately 1 μg of DNA, determined by absorption (Nanodrop method) and representing a tumor area of approximately 0.5 mm2, was used for the analysis. The tumor cell content was 90%. The DNA methylation profile was then obtained on an Infinium Human Methylation EPIC BeadChip array (Illumina, USA). The resulting IDAT files were processed using the in-house diagnostic toolchain EpiDiP (publicly available at http://www.epidip.org) that allows comparison of a given dataset against currently more than 15,500 reference datasets mostly derived from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). In addition, raw data of diagnostically worked up cases from within our institution and collaborators have been injected into this data lake. The analysis considers both the DNA methylation profile and the copy number variations. This software is primarily written in R (version 3.6.2 as of the time of writing this manuscript) and solely relies on open-source packages. IDAT data, the format provided by Illumina Methylation Array scanners, are parsed through minfi (available from Bioconductor) and normalized using the SWAN algorithm (Bioconductor). Since most reference datasets are from the 450 K methylation array era, we convert all datasets, both from 850 K/EPIC arrays as well as 450 K arrays to an approximately 400 K probe set present in both arrays. Sex chromosomes and cross-reactive probes [4] are excluded. Filtering for the top differentially methylated probes is performed through a standard deviation of all probes across all datasets. Only the top 25,000 probes of this ranking, adopted from the data preparation method used to build the brain tumor methylation classifier, are considered for dimension reduction by uniform manifold approximation and projection (UMAP, R implementation, available from CRAN) [5]. Specimens sharing epigenomic and hence lineage similarities cluster together as demonstrated previously with t-distributed stochastic neighbor embedding (t-SNE) [6].
Genome-wide copy number alterations are read out through conumee (available on Bioconductor) [7]. Copy number plots are calculated for all cases in the reference data lake and made accessible for each case through EpiDiP.
The pan-data-lake UMAP plot is overlaid with tissue type annotations curated in an in-house database and linked to copy number plots for each sample. These data are presented to the pathologist through a Shiny-based web application that we made available for free online [8] (http://www.epidip.org).
Upon highlighting a particular diagnostic case through searching for its identifier, neighboring cases can be interrogated for their original annotation and their copy number plots can visually be compared. A user-defined set of genomic loci, typically proto-oncogenes and tumor suppressor genes, is annotated through conumee for improved legibility (Fig. 2a).
Results
Dimension reduction by UMAP indicated a high similarity of the tumor of our illustrative case to squamous carcinomas of the lung (Fig. 2c, d) when compared to over 15,500 reference methylome datasets comprising tumors of diverse differentiation lineages and cells-of-origin. In addition, the chromosomal copy number profile, extracted by conumee, showed a circumscribed loss of heterozygosity (LOH) on chromosome 15q, adjacent to the NUTM1 locus. The raw array data are available in Supplementary File 1. When invoking the copy number profiles of the five closest neighboring cases, they show the rather disrupted genomes typical of mutagenesis-induced squamous cell cancers (Fig. 3), making this case rather different from common pulmonary squamous cell carcinoma.
Discussion
The histology of NMC corresponds to an undifferentiated tumor that may or may not exhibit squamous cell carcinoma features [2]. The presence of so-called abrupt foci of keratinization is considered a hallmark of NMC [9]. Another distinctive feature is the cellular monomorphism of the poorly differentiated component, contrasting with the highly atypical, polymorphic tumor cells seen in other dedifferentiated carcinomas [10]. The appearance of NMC overlaps with other poorly differentiated small blue round cell tumors such as Ewing’s sarcoma or acute leukemia [11, 12]. Classical diagnostic methods rely on morphology, immunohistochemistry, and FISH. Classical squamous cell carcinomas often have complex karyotypes and a high mutation load presumably caused by long-term exposure to mutagens such as ultraviolet light, tobacco, or alcohol [2]. In this regard, NMC is clearly different, as this tumor is likely driven by its typical single-gene rearrangement as suggested by the otherwise minor cytogenetic aberrations. Indeed, our analyses revealed a circumscribed deletion involving parts of chromosome 15q, adjacent to the NUTM1 locus (Fig. 2a, b), and a NUTM1 break by FISH (Fig. 1f).
Even without any prior knowledge of NUT midline carcinoma, the epigenomic and copy number data from our illustrative case are in line with the hypothesis that such tumors feature a squamous cell-like differentiation and a “flat genome” typical of fusion-driven neoplasms with distinct numeric changes found in the vicinity of a chromosomal break [7]. We are aware that our strategy we routinely apply to a wide range of neoplasms has so far been tested with only a single NNC case; however, we make available our data and analysis platform for public use with the intention to facilitate the collection of further samples of NUT midline carcinoma. The ultimate aim is to include NMC in future machine learning training datasets making it detectable by same-day DNA methylation/CNV diagnostics based on Oxford Nanopore sequencers, too (ONT, UK) [13]. Meanwhile, we propose our strategy to recognize this tumor type based on methylation array data by manual inspection of methylation data through dimension reduction and concurrent copy number profile visualization. Previously published work such as EPICUP [14] would in such cases report “squamous cell cancer of the lung” which is somewhat correct from an epigenomic point of view but does not provide a helpful hint in the given scenario, rather being misleading. EPICUP never provided the pathologist with accompanying CNV profiles. This diagnostically helpful strategy was originally implemented alongside the Brain Tumor Methylation Classifier [15] but is currently restricted to brain tumors. Hence, we also integrated CNV profiles in our UMAP-based tool.
Methylation analysis readily covers all differential diagnoses brought up in our illustrative case. Our UMAP/CNV tool [8] is, in addition, aware of a broad spectrum of hematolymphoid, epithelial, melanocytic, neuroendocrine, cerebral, and mesenchymal neoplasms all of which are represented through publicly available raw data. In summary, we present an alternative, modern, and timely way to detect NMC by a single microarray-based test.
Electronic supplementary material
Acknowledgments
We thank Martin Sill (DKFZ) and David Capper (Charité Berlin) for their excellent primer on how to analyze methylation data through R.
Author contributions
S.H., A.T., and J. H. conceived and designed the project. J. H., S.F., and M.B. analyzed the molecular data. S.H, A.T., S.F., M.B., A.V., J.S., and J.H. wrote the manuscript. All the authors agreed to the content of the manuscript.
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
Informed consent
Written informed consent was obtained from the patient.
Footnotes
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Simon Haefliger and Alexandar Tzankov contributed equally to this work.
References
- 1.French CA, Ramirez CL, Kolmakova J, et al. BRD-NUT oncoproteins: a family of closely related nuclear proteins that block epithelial differentiation and maintain the growth of carcinoma cells. Oncogene. 2008;27:2237–2242. doi: 10.1038/sj.onc.1210852. [DOI] [PubMed] [Google Scholar]
- 2.French C. NUT midline carcinoma. Nat Rev Cancer. 2014;14:149–150. doi: 10.1038/nrc3659. [DOI] [PubMed] [Google Scholar]
- 3.Bauer DE, Mitchell CM, Strait KM, et al. Clinicopathologic features and long-term outcomes of NUT midline carcinoma. Clin Cancer Res. 2012;18:5773–5779. doi: 10.1158/1078-0432.CCR-12-1153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Chen Y, Lemire M, Choufani S, et al. Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray. Epigenetics. 2013;8:203–209. doi: 10.4161/epi.23470. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.McInnes L, Healy J, Melville J (2018) UMAP: uniform manifold approximation and projection for dimension reduction. ArXiv180203426 Cs Stat.
- 6.van der Maaten L, Hinton G. Visualizing data using t-SNE. J Mach Learn Res. 2008;9:2579–2605. [Google Scholar]
- 7.Capper D, Stichel D, Sahm F, et al. Practical implementation of DNA methylation and copy number–based CNS tumor diagnostics: the Heidelberg experience. Acta Neuropathol (Berl) 2018;136:181–210. doi: 10.1007/s00401-018-1879-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hench J, Frank S (2020) EpiDiP Server. http://www.epidip.org
- 9.Travis WD, World Health Organization, International Agency for Research on Cancer (2015) WHO classification of tumours of lung, pleura, thymus and heart: ... reflects the views of a working group that convened for a consensus and editorial meeting at the International Agency for Research on Cancer, Lyon, April 24–26, 2014, 4. ed. International Agency for Research on Cancer, Lyon.
- 10.French CA. NUT carcinoma: clinicopathologic features, pathogenesis, and treatment: NUT Carcinoma. Pathol Int. 2018;68:583–595. doi: 10.1111/pin.12727. [DOI] [PubMed] [Google Scholar]
- 11.Mertens F, Wiebe T, Adlercreutz C, et al. Successful treatment of a child with t(15;19)-positive tumor: t(15;19)-Positive Tumor. Pediatr Blood Cancer. 2007;49:1015–1017. doi: 10.1002/pbc.20755. [DOI] [PubMed] [Google Scholar]
- 12.Li W, Chastain K. NUT midline carcinoma with leukemic presentation mimicking CD34-positive acute leukemia. Blood. 2018;132:456–456. doi: 10.1182/blood-2017-07-796268. [DOI] [PubMed] [Google Scholar]
- 13.Euskirchen P, Bielle F, Labreche K, et al. Same-day genomic and epigenomic diagnosis of brain tumors using real-time nanopore sequencing. Acta Neuropathol (Berl) 2017;134:691–703. doi: 10.1007/s00401-017-1743-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Moran S, Martínez-Cardús A, Sayols S, et al. Epigenetic profiling to classify cancer of unknown primary: a multicentre, retrospective analysis. Lancet Oncol. 2016;17:1386–1395. doi: 10.1016/S1470-2045(16)30297-2. [DOI] [PubMed] [Google Scholar]
- 15.Capper D, Jones DTW, Sill M, et al. DNA methylation-based classification of central nervous system tumours. Nature. 2018;555:469–474. doi: 10.1038/nature26000. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.