Skip to main content
Nature Communications logoLink to Nature Communications
. 2021 Mar 4;12:1434. doi: 10.1038/s41467-021-21576-8

Evolution of late-stage metastatic melanoma is dominated by aneuploidy and whole genome doubling

Ismael A Vergara 1,2,3, Christopher P Mintoff 2, Shahneen Sandhu 2, Lachlan McIntosh 1,4, Richard J Young 2, Stephen Q Wong 2, Andrew Colebatch 2, Daniel L Cameron 1,5, Julia Lai Kwon 2, Rory Wolfe 6, Angela Peng 2,7, Jason Ellul 2, Xuelin Dou 2, Clare Fedele 2, Samantha Boyle 2, Gisela Mir Arnau 2, Jeanette Raleigh 2, Athena Hatzimihalis 2, Pacman Szeto 2,7, Jennifer Mooi 2, Daniel S Widmer 8, Phil F Cheng 8, Valerie Amann 8, Reinhard Dummer 8, Nicholas Hayward 3,9, James Wilmott 3, Richard A Scolyer 3,10,11, Raymond J Cho 12, David Bowtell 2,13, Heather Thorne 2, Kathryn Alsop 2, Stephen Cordner 14, Noel Woodford 14, Jodie Leditschke 14, Patricia O’Brien 14, Sarah-Jane Dawson 2,15, Grant A McArthur 2,13, Graham J Mann 3,16, Mitchell P Levesque 8, Anthony T Papenfuss 1,2,4,5,13,✉,#, Mark Shackleton 2,7,13,17,✉,#
PMCID: PMC7933255  PMID: 33664264

Abstract

Although melanoma is initiated by acquisition of point mutations and limited focal copy number alterations in melanocytes-of-origin, the nature of genetic changes that characterise lethal metastatic disease is poorly understood. Here, we analyze the evolution of human melanoma progressing from early to late disease in 13 patients by sampling their tumours at multiple sites and times. Whole exome and genome sequencing data from 88 tumour samples reveals only limited gain of point mutations generally, with net mutational loss in some metastases. In contrast, melanoma evolution is dominated by whole genome doubling and large-scale aneuploidy, in which widespread loss of heterozygosity sculpts the burden of point mutations, neoantigens and structural variants even in treatment-naïve and primary cutaneous melanomas in some patients. These results imply that dysregulation of genomic integrity is a key driver of selective clonal advantage during melanoma progression.

Subject terms: Cancer, Computational biology and bioinformatics, Evolution, Oncology


The genetic changes that occur in late stage metastatic melanoma are not well delineated. Here, the authors use rapid autopsy samples from metastatic melanoma patients and show that the late stage in the disease is characterised by whole genome doubling and aneuploidy.

Introduction

Malignant transformation in cutaneous melanocytes is typically initiated by acquisition of single nucleotide variants (SNVs) resulting from ultra-violet (UV) light-induced DNA damage. The most common SNVs, which are even found in pre-neoplastic nevi13, constitutively activate the BRAF kinase or NRAS GTPase, although deleterious SNVs that impair tumour suppression by NF1 are also frequently detected4,5. For each of these major genotypic variants, concurrent point mutation affecting other genes is often thought to be required for the full invasive phenotype of melanoma611.

The role of structural variants (SVs; see Methods for definition) in melanoma is less well defined, although a subset of melanomas appears driven by oncogenic fusions12,13. Copy number alterations (CNAs), including deletions of the CDKN2A tumour suppressor and amplifications of BRAF, arise infrequently in nevi but are more abundant in invasive melanoma6,14. This suggests that the development of genomic instability is a key event in melanomagenesis, potentially facilitating allelic imbalances that support maintenance and progression of disease. However, understanding is limited of the mutational events that facilitate metastasis and the evolution of typically fatal late disease.

Previous studies have identified continued acquisition of both SNVs and SVs during cancer progression1518. Consistent with this, in relatively limited sampling, an array of genetic changes was observed to develop over time within treatment-naïve melanomas in individual patients6,14,19, as well as during evolution of therapy resistance20,21. These included new SNVs as well as focal allelic amplifications and deletions, suggesting that melanoma progression is fuelled by mutations other than those classically induced by UV mutagenesis.

To address this and to overcome the challenges of obtaining multiple metastatic tissues from the same patients, the CASCADE (CAncer tiSsue aCquisition After Death) rapid autopsy program22 was established, enabling sampling of end-stage cancers. In this study, using tumour and blood samples from 7 melanoma patients enroled in CASCADE, as well as data from 6 patients from previous studies19,21, we characterised somatic mutational processes linked to end-stage melanoma. We found that although gain of SNVs is generally limited in melanoma progression, disease evolution is dominated by large-scale copy number changes, including universal tetraploidization and the acquisition of remarkable degrees of allelic imbalance highlighted by extensive loss of heterozygosity (LOH) in some patients. These findings implicate dysregulation of genomic integrity as a key driver of biological advantage during competitive subclonal growth in human melanoma.

Results

Patient enrolment and sequencing

Tumour samples from seven CASCADE melanoma patients with DNA available from normal tissues, matched primary melanomas and/or regional metastases, were chosen for genomic sequencing (Fig. 1A, Supplementary Figs. 1, 2A; EGA accession: EGAS00001004950). Survival times from melanoma diagnosis to death ranged from 8 months to 12 years (Supplementary Fig. 3). All CASCADE patients had BRAF mutant melanomas and received BRAF and/or MEK inhibitors during their disease course, with some also receiving immunotherapy (Supplementary Figs. 2A, 3). For each patient, 3–6 end-stage tumours (Supplementary Table 1) underwent whole exome sequencing (WES) (5 patients; mean coverage 156×, range 108–256×) or whole-genome sequencing (WGS) (2 patients, mean coverage 67×, range 40–81×). Multi-regional sampling of primary and/or regionally metastatic disease was performed in 4 patients to assess intra-tumoural heterogeneity (Fig. 1A, Supplementary Fig. 1). WES was also performed on plasma obtained prior to death from 4 patients (Supplementary Fig. 3).

Fig. 1. Temporal and multi-regional spatial sampling reveals limited acquisition of point mutations during melanoma progression to end-stage disease.

Fig. 1

A Spatial distribution of pre-mortem and autopsy samples for CASCADE patient CAS-D. Created with BioRender.com. B Unique SNVs and small indels detected across tumours from each patient. The identifier for each sample is described in Tables S1 and S2. Y-axis indicates the number of mutations in each patient. C Mutational load of SNVs and small indels across tumours from each patient. Y-axis is shown in a logarithmic scale.

With permission, published WES data from matched primary and metastatic disease from 6 additional patients were added to the cohort (Figs. S2B and S3, Supplementary Table 2)19,21. Of these, one had a BRAF mutant melanoma (ETH-E), two had NRAS mutant melanomas (SK-G & ETH-J), and three were BRAF/NRAS/NF1 wild-type (MI-F, SK-H, ETH-F; Supplementary Fig. 2B).

In total, 88 tumour samples from 13 patients were analysed. These spanned primaries, regional metastases, distant metastases and circulating tumour DNA and included multiple samples from individual tumours in some cases. We defined early mutations as those detected in the earliest available tumour tissue of each patient. This constituted primary melanomas for 11 patients and regional metastases for two patients. Mutations first observed in tumours recurring subsequent to this were termed late events. End-stage mutational events were those first observed in metastases obtained at autopsy. Patients were first diagnosed with disease at different stages: CAS-A, B and C presented with Stage I disease; CAS-F, CAS-G, ETH-E, ETH-F, ETH-J and MI-F had Stage III disease at the time of diagnosis; and CAS-E and D had Stage IV disease at diagnosis.

Sequencing data were first surveyed for SNVs and small insertion/deletions (indels) using an ensemble variant calling pipeline designed to take advantage of the multiple samples per patient (see “Methods”). Validation by targeted amplicon sequencing of a subset of predicted SNVs and small indels showed an accuracy of 90% (sensitivity 86%, specificity 98%) (Supplementary Notes, Supplementary Tables 3 and 4). Moreover, highly concordant variant calling was noted from DNA extracted from fresh frozen or formalin-fixed paraffin-embedded (FFPE) tissue from a visceral lung metastasis from patient CAS-G (Supplementary Notes, Supplementary Figs. 4, 5, Supplementary Table 5). This suggested only minimal FFPE artefacts in variant calling.

Gain and loss of somatic SNVs and small indels during melanoma progression

Cutaneous melanomas typically have high numbers of SNVs23. Consistent with this, mutational loads from 3.3 (ETH-J) to 150.1 (MI-F) mutations/Mbp were observed in early melanomas in our cohort (Fig. 1B, C). Metastatic disease was usually dominated by these early mutations (Fig. 1B). In each late metastasis, loss of early mutations was apparent (median 2 mutations/Mbp) along with the emergence of new SNVs and indels (median 2.2 mutations/Mbp). Some of the new mutations were shared amongst other sites of late disease and some were unique to specific sites. Shared mutational losses and gains suggested evolutionary relationships between tumours.

In order to understand mutational processes in melanoma evolution, we estimated mutational signatures in early and late disease in each patient (see “Methods”). Early mutations were predictably dominated by UV signatures (signatures 7a and 7b) (Fig. 2A). In contrast, mutations first observed in late disease mostly possessed non-UV signatures, with signatures 31, 38, 39 and 58 each apparent in at least 3 patients. While the aetiology of signature 38 is unknown, it is described as found exclusively in UV-associated melanomas. UV signature mutations also emerged in late disease in 8 patients, likely derived from low abundance subpopulations in early disease that seeded distant metastases.

Fig. 2. Timing of mutational processes and candidate drivers shows UV damage and DNA repair dysfunction dominate in chronologically distinct phases.

Fig. 2

A Mutational signatures in early and late disease. Shown signatures, represented by COSMIC identifiers, were identified in at least 2 patients. Vertical lines indicate the normalised inferred weight per patient with the indicated signature in early (upper) and late (lower) disease. B Recurrently mutated melanoma or COSMIC cancer genes in early or late disease. Shown genes were mutated in at least 2 patients and are presented in decreasing number of patients. Dotted mutations indicate predicted deleterious or COSMIC mutations. C Distribution of hotspot mutations27 across the cohort. Mutations surrounded by black indicate those occurring in late disease.

In two patients (CAS-G and SK-H), metastatic disease was dominated by signature 11 (sometimes referred to as the temozolomide signature). Interestingly, although patient CAS-G had previously been treated with temozolomide, signature 11 mutations were evident in this patient’s lung and bowel but not liver metastases. Patient SK-H did not receive systemic therapy prior to tumour sampling19. In these patients, the signature 11 mutation loads in metastatic disease were coincident with predicted deleterious mutations in DNA-mismatch repair genes, including MSH6 (CAS-G and SK-H), MLH1 (CAS-G), MLH3 (CAS-G and SK-H) and MSH3 (SK-H); mutations in CAS-G were validated by targeted amplicon sequencing (Supplementary Table 3). Signatures 6, 15 and 21—all associated with DNA mismatch repair deficiency—were each present in two patients (signature 6: patients ETH-E and ETH-F; signature 15: patients ETH-J, CAS-C; signature 21: patients SK-G, CAS-B). These findings implicate DNA repair deficiency as a major contributor to gain of SNVs in late melanoma.

We interrogated the data from early disease for mutations in genes recurrently mutated in melanoma5 or in the COSMIC database (Fig. 2B and Supplementary Table 6), identifying BRAF (n = 8 patients), FAT4 (n = 6), TP53 (n = 4), KMT2C (n = 4), GRIN2A (n = 6), NRAS (n = 2), WRN (n = 4) and ARID2 (n = 4) amongst the genes most commonly affected (Fig. 2B). The vast majority of SNVs in early disease were present in all metastatic sites.

We also identified cancer-associated genes that were mutated only in late disease (Fig. 2B), particularly in the two patients with impaired DNA repair, CAS-G and SK-H, whose rates of gained mutation across all metastases were 108 and 202 mutations/Mbp respectively. These included two independent mutations in PIK3CA24 in the lung (DM6, E542K mutation) and small bowel (DM4, E545K mutation) metastases of patient CAS-G (Fig. 2C). In patient CAS-C, mutations in CTNNB1, which is linked to melanoma metastasis19,25, were found exclusively in brain metastases at residues P44 and S45, both of which regulate beta-catenin26. Residues E542 and E545 in PIK3CA, and S45 in CTNNB1, are recognised mutational hotspots in cancer27. PIK3CA, which is associated with parallel subclonal evolution in melanoma14 and centrosome amplification28, was the gene that most commonly (n = 3 patients) acquired a hotspot mutation in late disease (Fig. 2C, Supplementary Table 7).

Apart from these examples, SNVs in cancer-associated genes appeared de novo in metastases only infrequently. Indeed, loss in late disease was seen in all patients of SNVs and indels present in early disease, such that no consistent net acquisition of these mutations was observed in the 11 patients without defective DNA mismatch repair.

Acquisition of allelic imbalance and loss of heterozygosity

Apparent loss of SNVs and indels can be caused by a number of processes: deletion of one or more copies of an allele including LOH, reversion of mutations, dominance in metastases by sub-clones without mutations, and genetic heterogeneity in antecedent tumours not inherited by later metastases or acquired at the site of origin after dissemination of metastatic cells29. To better discern processes underlying loss of mutations, allele-specific copy number (CN) was estimated in our dataset.

Acquisition of allelic imbalance (AI), including LOH, was evident in all sites of disease in all patients (Fig. 3A, left and middle panels) and typically involved whole chromosomes or chromosomal arms. In early disease, the extent of AI varied widely from 5.4% (patient CAS-G) to 83.6% (SK-H) of the genome (Fig. 3A, left panel). In all patients, melanoma evolution was characterised by increasing AI either subclonally in early disease (e.g. multiple primary cores in patients CAS-D, ETH-J) or in subsets of metastatic samples (e.g. lesions in CAS-C, ETH-F). In some patients (CAS-D, ETH-E and SK-H), about 50% of the genome in late disease was affected by LOH.

Fig. 3. Widespread allelic imbalance (AI) and loss of heterozygosity (LOH) in metastatic melanoma.

Fig. 3

A Left: genomic proportions of each tissue sample affected by allelic imbalance. Horizontal bars indicate the fraction of each genome with the indicated allelic state. Middle: allelic imbalances in each sample at the cytogenetic band level. Regions of allelic balance have the same paternal and maternal allele copy number; regions of AI have different allele-specific copy numbers. AIs including an LOH event are shown separately. Undetermined regions are indicated in grey. Right: relationship between LOH and mutations lost during disease progression. Shown are proportions of all lost mutations that co-occur with acquisition of LOH events. B B-allele frequencies of single nucleotide polymorphisms in regional and distant metastases in patient CAS-B. A mirrored imbalance of heterozygous SNPs on chromosome 10 was evident between these lesions. C The impact of LOH events on neoantigenic mutations. Analysis of predicted neoantigenic mutations lost via LOH events across the cohort. X-axis indicates the number of neoantigenic mutations lost per LOH event. D Impact of LOH events on structural variants (SVs). Shown are Circos plots of large SVs detected in two liver metastases in patient CAS-D. Inner track indicates the allelic state for each chromosome.

Co-occurrence of SNV/indel loss and deletions was identified between early to late disease. LOH explained a significant proportion of lost SNVs and indels in late disease (Fig. 3A, right panel). Mutations lost in late disease and located in regions of retained heterozygosity had lower cancer cell fractions (in early disease) compared to those lost in regions of acquired LOH (Supplementary Fig. 6). This supports the hypothesis that these mutations were not present in the subclone that seeded the metastasis. This group could also have included mutations acquired in regions of copy number gain, where the mutated allele was subsequently lost. While mutations lost in regions of acquired LOH were mostly associated with the UV-induced mutational process with no other recurrent signatures (Supplementary Fig. 7), those lost in other regions were less well explained by this mutagen. Instead, multiple other mutational processes were observed in this group. For example, the presence of signatures 6 and 11 in patient CAS-G, signature 11 in MI-F and signature 14 in CAS-E suggested DNA-mismatch repair defects may have also occurred in early disease in these patients (Supplementary Fig. 8). This suggests that acquisition of somatic mutations via DNA repair deficiency may not only be common in late disease (Fig. 2A) but also occur subclonally in early melanoma.

Within each patient, allele-specific copy number alterations (CNAs) typically involved the same alleles, suggesting common origins of these clones. However, akin to observations in lung cancer30, different alleles in distinct metastases were occasionally involved. In two liver metastases from patient CAS-G, chromosome 15 acquired LOH by independent deletion events affecting different alleles (Supplementary Fig. 9). Similarly, in patient CAS-B, chromosome 10 acquired LOH by deletion of different alleles in regional and visceral metastases (Fig. 3B). Mirrored AIs such as these might reflect convergent evolution driven by continued chromosomal instability and selection for gene dosage30, effects of varied selective pressures in distinct environments, or selective deletion of different subclonal mutations.

As LOH events accounted for a major proportion of lost mutations in late disease (Fig. 3A, right panel), we wondered whether deletion of disadvantageous non-synonymous mutations or neoantigens might favour selection of LOH. Indeed, LOH might provide an efficient mechanism to improve the fitness of melanoma cells through the removal of multiple deleterious mutations and/or by eliminating multiple neoantigenic SNVs in single mutational events.

Of the 506 LOH segments acquired across all patients, 157 (31%) deleted at least one non-synonymous mutation (Supplementary Fig. 10A). As LOH segments that did not delete non-synonymous variants tended to be smaller (Supplementary Fig. 10B), possibly resulting from over-segmentation in copy number estimation (Supplementary Fig. 10C), we considered only segments greater than 10 Mb (n = 194).

We reasoned that if LOH provides a fitness advantage by selective deletion of disadvantageous non-synonymous mutations, we might observe the preferential loss of regions with a higher than the expected number of non-synonymous mutations, which can be estimated from the number of synonymous mutations and human codon usage frequencies (see Supplementary Material). Across all patients, only 67/194 large acquired LOH segments deleted more non-synonymous mutations than expected, which is not more frequent than expected by chance (p = 0.8, one-sided binomial test; see Supplementary Methods). This does not support a dominant role for removal of highly mutated regions, but does not rule out selective loss of individual mutations impacting fitness.

We next considered the potential role of neoantigens resulting from SNVs in early disease by predicting neoantigens for patient-specific HLA-A, -B and -C alleles using NetMHC4.031,32 (see Supplementary Methods) and examining the loss of these predicted neoantigens due to LOH in late disease. We found between 43 and 2109 predicted neoepitopes with weak or strong HLA class I binding affinity in each patient (median 217) in early disease. Of these, 18–985 (median 107) occurred in genes commonly expressed in melanoma (see Supplementary Methods). To evaluate the extent to which acquired LOH events delete predicted expressed neoantigens, we collected overlapping LOH segments into events by comparing late disease samples within each patient. Patients lost from 2 (CAS-A) to 18 (CAS-D) predicted expressed neoantigens due to LOH. Thirty-nine LOH events removed from 1 to 10 (CAS-D) predicted expressed neoantigens, with 19 removing from 2 or more (Fig. 3C, Supplementary Table 8). Patients lost from 2 (CAS-A) to 18 (CAS-D) predicted expressed neoantigens due to LOH. Thirty-nine LOH events removed from 1 to 10 (CAS-D) predicted expressed neoantigens, with 19 removing from 2 or more (Fig. 3C, Supplementary Table 8). This highlights that large-scale LOH may provide an efficient mechanism to remove expressed neoantigens, or regions with high neo-antigenicity, in single mutational events. However, it is important to note that acquired LOH may reduce only a small proportion of the overall neoantigen load, as we observed.

These observations led us to consider if some regions in individual patients might demonstrate a preference for deletion of neoantigens over non-neoantigenic non-synonymous mutations, or vice versa. To test this, we looked for regions of LOH where high predicted neo-antigenicity and high rates of non-synonymous mutations were mutually exclusive, finding no such inverse association in 7 of 8 patients tested (Supplementary Table 9), although several patients could not be tested due to low numbers of mutations. Of these, 2 (CAS-F and ETH-J) only lost regions with high predicted neo-antigenicity. Interestingly, one patient, CAS-D, showed strong mutual exclusivity (p = 0.02, two-sided Barnard’s test) with 22 LOH regions with high predicted neoantigenity and low rates of non-synonymous mutations, and 13 LOH regions with low predicted neo-antigenicity and high numbers of non-synonymous mutations.

These results fall short of demonstrating a dominant role for the sculpting by LOH of non-synonymous mutations or neoantigens, but leave open the possibility that selection for deletion of specific mutations may account for large-scale LOH in some patients. Certainly, other factors may influence the selection of LOH events, such as consequent changes in the dosage of affected transcripts and the presence of advantageous mutations whose deletion may be selected against. Additionally, reduction of the allelic load of non-synonymous or neoantigenic mutations through the acquisition of allelic imbalance was not considered.

Despite the above, mutations lost in regions of acquired LOH possessed other properties that might suggest enhanced neo-antigenicity. In some patients, deleted predicted strong binding neoantigenic mutations occurred in genes with higher expression (Supplementary Fig. 11). Also, predicted neoepitopes lost due to LOH frequently had higher estimated cancer cell fraction in early disease than those lost through other mechanisms (Supplementary Fig. 12: CAS-A, CAS-B, CAS-C, CAS-D, CAS-E, CAS-F, SK-G, SK-H) and possessed stronger predicted binding affinity to specific HLA alleles in some cases (Supplementary Fig. 13: CAS-D HLA-C:06:02, CAS-F HLA-B:44:03, SK-G HLA-B:07:02 SK-G). Although neoantigen prediction has poor specificity for predicting immunogenicity33,34, these data support the idea that LOH events might provide a more efficient means than individual SNV reversions for clonal immuno-editing35.

As structural variants (SVs), such as gene fusions are increasingly recognised oncogenic drivers in melanoma12,13, we also examined the impact of LOH on SVs by analysing WGS data obtained from patient CAS-D. Several SVs observed in chromosomes 2 and 4 in the left liver metastasis were absent from the central liver lesion with coincident LOH. Similarly, the extension of LOH in chromosome 10 of the central liver lesion coincided with loss of SVs (Fig. 3D). This was not detectable in ctDNA, which nevertheless represented the most abundant SVs across late disease sites (Supplementary Fig. 14). LOH can thus shape not only the burden of SNVs and indels, but also SVs. While the selective subclonal advantage of individual LOH events is hard to predict, these observations highlight the possibility that such changes sculpt the broad mutational landscape of cancer genomes.

Increased ploidy dominates the landscape of end-stage melanoma

We next considered total copy number (Fig. 4A). Consistent with observations of LOH and AI, CNAs in late disease were often large-scale and generally detectable in ctDNA. They were also frequently manifest as amplifications (Fig. 4A, left panel), such that the mean ploidy across late disease was 2.9 (range 1.6–4.2).

Fig. 4. Ubiquitous hyperploidy in metastatic melanoma.

Fig. 4

A Left: genomic proportions of each tissue sample affected by copy number alterations (CNAs). Horizontal bars indicate the fraction of each genome with the indicated allelic copy number. Regions with total CN > 5 are represented by the same colour. Grey regions indicate undetermined CN. Right: CNAs in each sample at the cytogenetic band-level. B Validation of CN in 6 patients using Fluorescence In Situ Hybridisation (FISH). X-axis corresponds to the total CN of the segment overlapping the FISH probe predicted from sequencing data. Y-axis corresponds to the average CN across cells in each sample, determined by manual counting, for the indicated FISH probe. Centromeric probes corresponded to chromosomes 8, 9 and 11 as controls for probes for MYC, CDKN2A and CCND1, respectively, in those chromosomes. Cen: Centromeric probe. C FISH showing MYC amplification in two metastases from patient CAS-B, compared to the primary tumour (left). Arrow heads indicate cells with red (Myc) and green (centromere) signals. Scale bar = 10um. CTR: Centromere probes for chromosome 8. D Widespread genome doubling in melanoma. For each sample, the bar length indicates the Akaike Information Criteria (AIC) score obtained via branching process analyses of aneuploidy with genome doubling (left) versus aneuploidy alone (right). Solid circles indicate the most likely scenario for each sample.

Largely diploid (mean 2, range 1.6 – 3.3) early disease was observed in patients CAS-B, CAS-C, CAS-D, CAS-G, MI-F, SK-G and ETH-E. Across these cases, increased ploidy was not usually seen until the development of distant metastases. However, in patients CAS-B and CAS-D, increased ploidy occurred subclonally in regional metastases and primary sites of earlier disease, respectively. Increased ploidy (mean 3.4, range 2.7–3.8) was observed in the early disease of all other patients, in which it was at least maintained in all but one metastasis (patient ETH-F, tumour DM5), which may have been seeded independently from a lower ploidy ancestor. Across this cohort, chromosomes 1q, 6p, 7, 8q, 13q, 15q, 20, 21q, and 22q were recurrently amplified (Fig. 4A, right panel), suggesting net selective advantage of increased expression of genes in those regions, including BRAF, RAC1 and MET on chromosome 7, AKT3 on chromosome 1q and MYC on chromosome 8q. The presence of large CNAs in treatment-naïve melanomas indicates that such changes can occur independently of treatment.

Predicted ploidy changes were validated in 6 patients by fluorescence in situ hybridisation (FISH) using probes against MYC, CDKN2A, and CCND1, and against centromeres in chromosomes 8, 9 and 11 (r = 0.76, Spearman correlation; Fig. 4B). In patient CAS-B, the increased copy number in visceral metastases of the brain and the lung was confirmed by probes for MYC and the chromosome 8 centromere (Fig. 4C). Further, FISH to CDKN2A confirmed a hemizygous deletion affecting 9p21.3 in the primary melanoma and diploidy of the same region in metastases (not shown). For patient CAS-D, the prediction of ploidy differences between spatially distinct cores of the primary melanoma was supported by FISH to MYC and the chromosome 8 centromere (Supplementary Fig. 15). Increased ploidy was also observed in a distant liver metastasis of CAS-D, based on FISH to MYC and CCND1 (Supplementary Figs. 15, 16). Indeed, probe counts for CCND1 suggested intra-tumoural heterogeneity in this metastasis, with a dominant clone carrying 5 copies of CCND1 and a minor clone (10% of cells) carrying 8 copies (Supplementary Fig. 16).

A striking case of disagreement in copy number between predicted and FISH-observed copy number was noted in a brain metastasis of CAS-C (Fig. 4B and Supplementary Fig. 17), in which average probe counts for MYC and CCND1 were 7.2 and 12.4 respectively. Assessment of this metastasis revealed extensive heterogeneity, with sub-clones that carried either no copies or a mode of 8 copies of MYC, and a separate population of very large cells (~27% of total cells) each with >30 copies of MYC (Supplementary Fig. 17). Amplifications of this magnitude have been reported in melanoma36 and may be due to extra-chromosomal DNA37.

Near-ubiquitous tetraploidization in late melanoma

Tumours with increased ploidy were typically characterised by large-scale chromosomal CNAs ranging from 2–4 copies. Although this might be explained by the stochastic acquisition of multiple amplifications, we wondered whether such ploidy change more likely resulted from whole-genome doubling (WGD) events that occurred against a background of smaller-scale stochastic amplifications and deletions. To test this, we developed two branching process-based models (see Methods): one modelled aneuploidy only and the other modelled aneuploidy together with a single WGD event. We used the Akaike Information Criteria (AIC) to penalise for the extra parameter in the genome doubling model and determine which model better explained the data.

In this analysis, increased ploidy was better explained by the combination of WGD together with chromosomal or arm-level CNAs (Fig. 4D). Indeed, every patient showed evidence of WGD. Multi-regional analysis of two primary melanomas (samples P2 in CAS-D and P1 in ETH-J) indicated that WGD can be an early subclonal event in melanoma (Fig. 4A, D). In contrast, CAS-C possessed only one tumour at post-mortem, a brain metastasis, which had undergone WGD. Although WGD was reflected in ctDNA from three patients, WGD in patient CAS-C was private to the brain metastasis.

The observation of near-universal genome doubling together with aneuploidy across patients supports the hypothesis that these mechanisms provide an advantage to tumour cells. Tetraploidization may act as an efficient mechanism to achieve high levels of aneuploidy that increase the biological tolerance of deletion events38,39 or help reduce the probability of large-scale homozygous losses, which can lead to subpopulations with decreased fitness. Interestingly, multiple oncogenes were found amplified within LOH regions at a copy number beyond that expected given the ploidy of the lesion (Supplementary Fig. 18). These include genes that drive oncogenicity via amplification40 such as LMO1 (4 patients), CCND1 (3 patients), MYCN (2 patients) as well as MYC, MDM2, MDM4 and NKX2-1 (1 patient). Oncogenes linked to melanoma progression were also amplified, including NRAS, KIT and RAC1, the latter highly amplified in four patients in regions with LOH at copy numbers of 5 (patient ETH-F), 6 (patients CAS-E and SK-H) and 7 (patient CAS-F). This suggests amplification of the remaining allele in regions of LOH may confer selective advantage to the tumour via over-expression of driver genes.

Mutations in genes that regulate genomic integrity

Due to the high level of CNAs in late melanoma, we also considered mutations in genes that regulate genomic integrity4144, cell cycle and DNA repair (Supplementary Table 10, Fig. 5A). Deleterious TP53 mutations, which are associated with tolerance of aneuploidy45, were acquired early in four patients (Fig. 5A) and in late disease in two others. Interestingly, amplification of wild-type MYC was also observed in two of these patients, CAS-D and CAS-B, providing a mutational combination proposed to facilitate tetraploidization46. CLTC (mitotic spindle instability), APC (checkpoint defects, merotely) and BUB1B (checkpoint defects) were also amongst genes involved in regulation of genomic integrity recurrently affected by deleterious mutation42,43 (Fig. 5A). Mutations in genes that regulate the cell cycle and DNA repair further suggested mechanisms of increased CNAs: a truncating insertion in FGFR1OP, which anchors microtubules47 (patient CAS-D) and deleterious missense mutations in WRN, associated with homology-directed repair and stability of fragile sites48, in patients CAS-C, CAS-F and MI-F. Notably, CDKN2A underwent an in-frame deletion (COSMIC variant COSM24418) in patient CAS-E and a hotspot P114L substitution (COSMIC variant COSM753743) of reported functional significance49 in patient CAS-F.

Fig. 5. Potential drivers of aneuploidy and whole genome doubling in early-stage and metastatic melanoma.

Fig. 5

A Recurrent mutations across the cohort in genes that regulate chromosomal structural integrity, cell cycle and DNA repair. Dotted mutations indicate predicted deleterious or COSMIC mutations. Genes included correspond to significantly mutated genes in the TCGA cohort or COSMIC cancer census genes (see Supplementary Methods). The identifier for each sample is indicated in Tables S1 and S2. B Mutant allele fraction (MAF) of mutations gained or increased in allelic dose with increased ploidy, in association with genome doubling, in patients ETH-E, CAS-G, CAS-D and CAS-C. Each point per gene represents a sample. MAF: estimated fraction of mutated alleles in the tumour cells of the sample. *predicted deleterious or COSMIC mutations.

To identify mutations associated with hyperploidy using an unbiased approach, we further evaluated patients for whom both low and high ploidy samples were available, identifying SNVs present in tetraploid but not matched diploid samples, or increased in relative dose after tetraploidization (Fig. 5B, Supplementary Table 11). The 63 genes identified by these criteria across patients CAS-C, CAS-G, CAS-D, and ETH-E regulate multiple biological processes, including signal transduction and cell differentiation (Supplementary Table 12). Mutations were also gained in genes linked to cell cycle progression and cytoskeletal and microtubule organisation, including SIPA150 (n = 2 patients), TACC151, SPICE152 and FRY53, amongst others (Supplementary Table 12). While 88 samples were available, shortcomings of our study were that the number of patients was relatively low (13) and samples possessed a complex dependence structure, confounding hypothesis testing. Nevertheless, these findings suggest diverse potential drivers of disrupted genomic integrity in melanoma progression.

Tumour phylogenies

As melanoma evolution was found to be dominated by CNAs, we estimated allele-specific CNA-based phylogenetic trees of spatiotemporally separated tumours and late disease ctDNA from each of 4 patients (CAS-D, CAS-B, CAS-G, CAS-C) whose melanomas progressed from near-diploid in early disease to hyperploid in late disease (Fig. 6A) and in 2 patients (SK-H, MI-F) with previously published trees19 (Supplementary Figs. 19, 20). We also estimated phylogenetic trees from SNVs and jointly from SNVs and copy number (Supplementary Figs. 1924). This provided insight to disease evolution in these cases, including evolutionary branches on which genome doubling was likely to have occurred and patterns of aneuploidy before and after genome doubling.

Fig. 6. Evolution of cutaneous melanoma.

Fig. 6

A Copy number-based phylogenetic trees for patients CAS-D, CAS-B, CAS-G and CAS-C. Evolutionary relationships between samples were derived using MEDICC, based on allele-specific copy numbers. Support values for each node were obtained by resampling of the distance matrix. MEDICC treated genome doubling events as independent chromosomal amplifications, resulting in long branch lengths. S.B.: Small Bowel. B Summarised genetic changes in the progression to end-stage melanoma. From top to bottom: evolution of somatic mutational load (mutations/Mbps), allelic imbalance without LOH (% genome affected), allelic imbalance with LOH (% genome affected), average ploidy, and evidence for genome doubling (ΔAIC = AICnon-WGD-AICWGD, where AICnon-WGD is the Akaike Information Criterion (AIC) for the model with aneuploidy only, and AICWGD is the AIC for the model that includes WGD; ΔAIC > 0 supports genome doubling) across primary melanomas and regional and distant metastases for each patient. Germline estimates are depicted as comparative references. Yellow diamonds joining lines represent mean values for each time of disease. Each dot represents an individual tissue sample.

Based on these analyses, we observed that metastatic disease in patient CAS-D (Fig. 6A, left) most likely arose from the sub-clone in the primary melanoma that underwent genome doubling (P2, Supplementary Fig. 15). The left liver lesion (DM4) probably developed from an early seeding from this sub-clone, as it displayed similar allelic balance in chromosome 1q, which was affected by LOH in all other metastatic sites (Fig. 3, Supplementary Fig. 25). The central liver lesion (DM3) in this patient clustered closest with brain metastases, suggesting a shared subclonal origin. A SNV-based phylogeny for this patient (Supplementary Fig. 21) showed similar disease evolution patterns, but reflected that the vast majority of the SNV burden was acquired in earlier disease progression compared to CNAs. We assumed that widespread metastases would contribute to ctDNA mutations, however, the CNA profile of late disease ctDNA in patient CAS-D very closely resembled the caudate lobe liver lesion (DM2) (Fig. 6A, left). This was also observed in the SNV phylogeny, in addition to resemblance to axillary regional metastasis and the Pouch of Douglas lesion (DM1). This suggests that some more than other metastases may shed tumour-associated DNA into the circulation despite their co-location in the same organs. A joint evidence tree based on both SNVs and CNAs (Supplementary Fig. 21) confirmed patterns of progression and the similarity of ctDNA.

Although genome doubling was observed universally in almost all distant metastases (Fig. 4D and 6A), continued acquisition of aneuploidy occurred after genome doubling in patients CAS-G, CAS-D and CAS-B (Fig. 6A and Supplementary Figs. 22, 23). Additionally, the timing of genome doubling varied, occurring in the primary tumours of CAS-D (subclonally), CAS-E, SK-H, ETH-F and ETH-J (Fig. 4D), but only observed in one brain metastasis of CAS-C (Fig. 6A and Supplementary Fig. 24). Thus, genome doubling is not absolutely required to permit distant metastases, and CNAs can occur both before and after genome doubling.

The joint SNV and CNA phylogenies for these patients (Supplementary Figs. 2124) retained the consistent clustering of early versus late mutations compared to germline, as well as the preferential grouping of ctDNA samples with specific mutations, with differences likely the result of enhanced phylogenetic signal when combining these data. For patients MI-F and SK-H, previously reported phylogenies19 were in agreement with those generated here with the same event types (Supplementary Figs. 19, 20, Supplementary Notes).

The phylogenetic reconstruction provided by MEDICC CNA-based trees was inspected for further instances of convergent evolution, revealing evidence of parallel evolution and reversals of mutations by LOH, as well as mirrored allelic imbalances. For example, we examined allele-specific copy number variants in patient CAS-D and sought CNAs that occurred more than once in different parts of the evolutionary tree (Supplementary Fig. 26). Chromosome 12q was initially diploid (in P1 and P3) prior to genome doubling and lost a copy to reach allele-specific copy number (2, 1) within the primary (P2). This was followed by a further gain to copy number (2, 2) and then multiple independent losses in distant metastases to (2, 1). Similarly, the region 11q23–11q25 was also initially diploid. Following WGD (P2), it underwent arm-level deletion to allele-specific copy number (2, 1), then independent gains in two distant metastases of the liver to return to allele-specific copy number (2, 2).

The evidence for convergent evolution presented here is likely an underestimate, as previous studies have illustrated parallel acquisition of mutations through disease progression in genetically identical inter-patient cancers54 as well as in intra-patient single-cell resolution characterisation of melanoma29. Phylogenies like the ones presented here will become increasingly more accurate with temporal-spatial sampling coupled with single-cell resolution data and methodologies for subclonal evolutionary history reconstruction that do not rely on the infinite sites model assumption55.

Comparison of the timing of CNA and SNV acquisition during melanoma development confirmed that the burden of SNVs in late disease in each patient was mostly acquired in early disease and did not change substantially during disease progression (Fig. 6B), with exceptions in the two patients who developed DNA mismatch repair defects (SK-H and CAS-G). In contrast, the kinetics of CNA acquisition varied markedly. All patients had at least low levels of aneuploidy in early disease with highly recurrent full or partial loss of chromosomes 9 and 10. While WGD occurred in at least one tumour in every patient, it was first observed in the early disease of 5 patients (CAS-E, SK-H, ETH-F, ETH-J, and subclonally in CAS-D), in the regional metastases of CAS-B (subclonally), and not until late disease in 5 patients (CAS-C, CAS-G, MI-F, SK-G, ETH-E).

Of the 11 patients for whom we analysed one or more samples of primary cutaneous melanoma, 5 showed >15% of the genome affected by allelic imbalance, 7 showed >15% of the genome affected by LOH, and 5 showed ploidy >2.5. Three patients had multi-regional sampling of their primary melanomas and all of these showed genomic instability in at least one region. It is possible that subclonal chromosomal instability in the primary melanomas we evaluated was more frequent than observed due to sampling.

Interestingly, degrees of AI and hyperploidy varied amongst distant metastases of individual patients in the latter cohort, with some metastatic sites showing decreased AI and reduced ploidy compared to early disease. In contrast, the extent of LOH increased in almost every distant metastasis (Fig. 6B), suggesting common clonal origins of cells carrying these events. This suggests that genomic sculpting by deletion events confers biological advantage in melanoma.

Discussion

We performed multi-regional sampling of end-stage melanoma at autopsy21,22. This enabled us to characterise genetic changes associated with progressive disease, finding that tetraploidization and acquisition of aneuploidy are dominant mutational mechanisms across evolution to end-stage, fatal melanoma. This contrasts with the key roles of UV-induced SNVs in melanocytic transformation to early melanoma4,6. Although SNVs or indels affecting genes associated with cancer were abundant in the early melanomas of our cohort, their continued acquisition was minimal during progression to late disease. In contrast, acquisition of CNAs was abundant. Although at least low-level aneuploidy was consistently found in early disease, with recurrent LOH in chromosomes 9 and 10, most sites of metastatic and end-stage disease were affected by aneuploidy due to both WGD and gain and loss of individual chromosomes or chromosome arms. Our findings extend previous studies6,14,19,20 to illuminate the breadth of genetic changes that drive melanoma progression through treatment to end-stage disease.

Two patients (CAS-G and SK-H) were exceptions to the above pattern, displaying markedly increased SNV burdens in metastatic disease in association with acquisition of mutations in known DNA mismatch repair genes. Indeed, across all patients, mutational signatures associated with aberrant DNA repair were observed in SNVs that first appeared in sites of metastatic disease. This highlights the importance of defective DNA repair as a mechanism that permits cancer cell adaptation in the face of selective pressures such as anti-cancer therapy, and is consistent with known SNV-based mechanisms of acquired resistance in melanoma to targeted therapy56,57 and immunotherapy58.

However, loss of SNVs was also observed, often as a result of LOH. The proportions of tumour genomes affected by LOH were increased in almost all patients in late distant metastases, in some cases to >50% compared to earlier disease. One potential role of LOH may be as an efficient means for providing biological advantage by eliminating multiple deleterious mutations and reducing immunogenicity through deletion of neoantigens in single mutational events. Although only limited support was found for the possibility that LOH is a strongly selected driver of loss of such mutations, LOH could also be detrimental if it reduced critical gene products. In this sense, pervasive WGD, which we observed in the vast majority of metastases in our cohort, may protect sub-clones by providing adequate essential proteins while at the same time permitting selective advantage from focal deletion events.

Previous studies showed emergence of aneuploidy in early melanocytic neoplasia that was associated with disease evolution to the invasive and metastatic state6,14. Our data demonstrate that WGD can also occur in untreated primary cutaneous melanomas, in which it may develop subclonally. These findings indicate that tetraploidization and acquisition of aneuploidy are not necessarily consequences of the stresses to cancer cells imparted by anti-cancer therapy or the metastatic process. Nor are they required for metastasis and therapy resistance; one patient in our cohort (CAS-C) harboured several late metastases that had not undergone WGD. Rather, such genomic derangements appear to be widespread events in intra-tumoural evolution of melanoma that confer competitive subclonal advantage, including via increasing metastatic potential, presumably due to the changes they confer in allelic doses of wild-type (WT) genes and/or genes affected by SNVs and indels.

Pan-cancer studies5961 have shown that hyperploidy is prevalent in cancer and associated with WGD in an estimated 37% of cases61. In in silico modelling38, an apparently optimal state of triploidy in cancer was achieved more efficiently by initially tetraploid cells than by diploid cells, presumably as a result of the ability of the former to buffer lethal events such as nullisomy. This is consistent with the mode of 72 chromosomes per cell observed in the Mitelman Database38 and the mean ploidies in our cohort of 2.9 in late metastases and 3.4 in high ploidy early primary melanomas.

While tetraploidization is a genome-wide event, AIs acquired through disease progression in our cohort targeted specific chromosomes. We hypothesise that selection of region-specific AIs is favoured because mutations in those regions modulate a competitive growth advantage. Accordingly, an allele carrying a mutation that is beneficial would be selected for amplification, and/or the corresponding WT allele might be reduced or lost (e.g. via LOH). Both events increase proportional dosage of the advantageous mutation. In contrast, mutations that are biologically deleterious and/or immunogenic would be diluted by amplification of the complementary WT allele or loss of the mutated allele.

This model also predicts that mutations enriched in tumours as a consequence of AI will support the malignant phenotype. Consistent with this, we observed gain or increased dosage of numerous mutated genes in metastatic tumours in transition from diploid to hyperploid states. These genes notably included regulators of cell cycle progression and the maintenance of genomic integrity, which are manifestly perturbed in a high proportion of progressing melanomas. It will be important to validate these genes functionally as potential regulators of aneuploidy in melanoma that might inform therapeutic opportunities. Interestingly, stochastic deregulation of wild-type regulators of genomic integrity, such as the MYC amplifications we observed with concurrent TP53 mutations, might compound enrichment of mutations that fuel aneuploidy.

Our data also raise the possibility that WGD and/or the enhanced tolerance it confers to continued acquisition of AI, might have prognostic significance in melanoma. Aneuploidy has been proposed by some to increase the probability of recurrence after treatment for early melanoma62,63. Moreover, the rate of acquisition of aneuploidy, as represented by subclonality of CNAs, was linked to survival outcomes in non-small cell lung cancer30. In this case, the reliable detectability we observed by ctDNA of dominant patterns of tetraploidization and aneuploidy in metastatic melanoma could indicate a useful, minimally invasive and prognostically important means of monitoring for key stages of disease evolution in individual melanoma patients.

Methods

Sample collection and DNA extraction

Recruitment to the CASCADE (CAncer tiSsue aCquisition After DEath rapid autopsy programme at the Peter MacCallum Cancer Centre followed approved protocols (PMCC Human Research Ethics Committee approval number 11/102). Informed written consent was obtained from all patients and families. Rapid autopsies were performed as previously described22. See Supplementary Methods for further details.

Formalin-fixed paraffin-embedded (FFPE) tissues were sectioned 20 times and the 1st, 10th and 20th sections underwent haematoxylin & eosin (H&E), Melan-A and S100 staining, respectively, for identification by an expert histopathologist of intra-tumoural regions of viable melanoma. Only samples that contained >70% tumour content were used for DNA extraction and sequencing. After macro-dissection and DNA extraction (QIAamp DNA FFPE Tissue Kit, Qiagen), each sample was assessed for DNA integrity64. In cases where adjacent fresh frozen samples were available for DNA extraction (DNeasy blood and tissue kit, Qiagen), tumour content was verified using digital PCR for BRAF mutations65. Genomic DNA from buffy coats (used as matching germline DNA) was extracted using the DNeasy blood and tissue kit (Qiagen). Plasma DNA was extracted from 1–2 ml of plasma (QIAamp Circulating Nucleic Acid Kit, Qiagen).

Whole exome sequencing

100 ng to 1 ug of input DNA was fragmented using a Covaris S2 focal acoustic device to an average fragment size of 180–220 bp (duty cycle: 10%; intensity: 5; cycles per burst: 200; time: 3 min). Libraries were prepared using KAPA Hyper Prep Kits for Illumina, (Kapa Biosystems) with SureSelect XT adaptors and primers (Agilent Technologies). 6 or 9 PCR cycles were used for good quality or FFPE-derived DNA, respectively. Hybridisation capture was performed with SureSelect Human All Exon V5 or V6 (Agilent). Three indexed libraries were run per lane on an Illumina HiSeq2500 platform (paired-end 100 bp).

Whole-genome sequencing

100 ng of input DNA was fragmented as above to an average size of 350 bp (duty cycle: 5%; intensity: 5; cycles per burst: 200; time: 50 seconds). Libraries were prepared using the TruSeq Nano DNA Library Preparation Kit (Illumina). Libraries were sequenced at The Kinghorn Centre for Clinical Genomics (Sydney, Australia) on a HiSeq X Ten (Illumina). One sample was sequenced per lane or over several lanes (paired-end 150 bp) to a depth of 30X or 60X for germline or tumour, respectively.

Calling and validation of somatic SNVs and small indels

After trimming and quality filtering, sequencing reads were aligned to the human reference genome (hg19) using bwa mem, deduplicated, realigned and scores recalibrated. Reads with a mapping quality below 30 were removed. SNVs and small insertions/deletions (indels) were called using an ensemble approach with multiSNV66 (minimum average read coverage across samples of 10X), VarScan67 (minimum allele frequency of 10%), muTect68 and IndelGenotyper69. SNVs flagged as ‘pass’ and ‘normal contamination’ by multiSNV were kept for further processing. SNVs with an allele fraction >5% in the germline DNA were discarded. Variants in regions of low coverage (<10x), repetitive/artefactual regions, off-target regions (for WES), and that overlapped with healthy population databases were filtered out. Variants detected by at least two callers and passing all filters were kept for downstream analysis.

Given that the set of variants called for each patient was usually represented in multiple samples, the following steps were applied to optimise the reliability of variant calling. First, to exclude the possibility of absent variants due to insufficient depth of sequencing, variants were filtered out if the region was regarded as low coverage (<10x) in any sample of that patient, including the germline DNA. Second, to leverage the sensitivity of muTect in samples of low cellularity, variants absent in samples based on the procedure above but called only by muTect were included. Third, to leverage the sensitivity of multiSNV as a multi-sample variant caller, the presence of variants detected as above was confirmed in other samples from the same patient via a lenient execution of VarScan (i.e. allowing variant allele frequencies <10%, minimum coverage of 10x and variant present in ≥2 reads).

Mutational signatures were estimated using deconstructSigs70 with the COSMIC v3 signatures set71. Validation of SNVs and small indels was performed using the 48.48 Access Array system (Fluidigm). See Supplementary Methods for further details.

Somatic copy number alteration and structural variant calling

We defined structural variants (SVs) as mutations that involved a segment of DNA ( > 50nt) differing between a tumour sample and the normal sample, caused by one or more double stranded DNA breaks. These included insertions, deletions, tandem or inverted duplications, inversions, translocations or copy number alterations (CNAs). When the evidence of an SV was derived from CNA data alone, we used the term CNA.

Estimates of allele-specific copy number, tumour cellularity and ploidy within each sample were obtained using Sequenza72 (v2.1.0), followed by manual verification. Large SVs (>1000nt) were detected using the GRIDSS (Genomic Rearrangement Identification Software Suite; v1.3.2)73. See Supplementary Methods for further details.

Phylogenic analysis

MEDICC74 (downloaded July 2014) was used to infer phylogenies from allele-specific copy number calls summarised at the cytogenetic band level. To infer phylogenies from SNVs, a binary matrix of mutations excluding regions of shared LOH was constructed for each patient. To infer phylogenies from all mutations, a character-weighted binary supermatrix was constructed. Maximum parsimony phylogenies were constructed using TNT (v1.5) from these matrices75. See Supplementary Methods for further details.

Copy number validation

Fluorescence in situ hybridisation (FISH) was performed for six CASCADE patients on FFPE slides to validate copy number results. Myc/CEP8 and CCND1/CEP11 probes were used to validate allelic amplifications and CDKN2A/CEP9 probes were used to validate allelic losses. For each slide, the area to be scored was identified at low (10x) magnification using a DAPI filter. A minimum of 100 cells from at least three random fields was scored for the number of probe-specific alleles (red) and centromere (green) signals at high (100x) magnification.

Inference of genome doubling

To test if the patterns of increased ploidy were due to stochastic aneuploidy alone (stochastic gains of individual chromosomes) or if WGD also played a role, two branching process-based models were fitted to the data. The first modelled aneuploidy as the stochastic breakage and mis-segregation of chromosomes at the arm-level. The second modelled aneuploidy in the same way, together with a single WGD event.

The evolutionary history of each tumour genome was divided into N arbitrary time periods. Each copy of each arm was assigned a fixed probability α of undergoing a deletion, β of not changing, and γ of being duplicated within each time period, and was subject to the constraint that α+β+γ=1. The WGD model included a single WGD event after M time intervals (M ≤ N). We used probability generating functions to algebraically construct the probabilities of every possible copy number state as polynomials depending on α, β, γ and N for the model with no WGD and α, β, γ, N and M for the model with WGD. The generating functions grow exponentially and are prohibitively large for N and M much greater than about 5. To construct the generating function, we developed an efficient program in the sage language (http://www.sagemath.org/) and utilised memoization to reduce computational time and memory consumption. This program only needed to be run once. We selected N = 6, permitting arm-level copy numbers up to 256 to be modelled.

The parameters α, β, γ, and M for the two models were estimated for each patient using maximum likelihood. To determine which model provided the most likely explanation of the observed data, while correcting for the additional parameter in the WGD model, we used the Akaike Information Criterion (AIC)76. Unlike other information criteria, AIC does not rely upon any assumptions contradicted by the model. In general, the model with WGD has 3 free parameters, while the model without WGD has 2. In this case we also imposed α = γ, leading to 2 and 1 free parameters respectively. We applied the more general formula, leading to a small constant offset of +2 to both AICWGD and AICnon-WGD, which does not affect the relative likelihood or choice of best-fitting model.

Notably, our approach makes several assumptions that may limit its applicability: the only events modelled are WGD and arm-level copy number change (which are treated as independent events); a whole chromosome gain or loss is modelled as a pair of independent arm-level events; focal amplification and complex events are not included in the model; and resolution of event order is limited to the N = 6 arbitrary time periods.

Code and additional details regarding the test are available at https://github.com/PapenfussLab/Genome_doubling_test.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Supplementary information

Reporting Summary (93.7KB, pdf)

Acknowledgements

A.T.P. was supported by an Australian National Health and Medical Research Council (NHMRC) Program Grant (1054618) and Senior Research Fellowship (1116955), and by the Lorenzo and Pamela Galli Charitable Trust. M.S. was supported by fellowships from Pfizer Australia, NHMRC, veski and the Victorian Cancer Agency. P.F.C. was supported by EU Horizon 2020 PHC grant No. 633974 (SOUND – Statistical multi-Omics UNDerstanding of Patient Samples). R.A.S. and N.K.H. were supported by the Australian National Health and Medical Research Council (NHMRC). This research was supported by the Victorian Institute of Forensic Medicine, Tobin Brothers Funerals, the Peter MacCallum Cancer Foundation, Bioplatforms Australia, the Melanoma Institute of Australia, the Cancer Council of Victoria, the Victorian Cancer Biobank, the Melbourne Melanoma Project, the Victorian State Government Operational Infrastructure Support scheme and the Australian Government NHMRC Independent Research Institute Infrastructure Support scheme. We acknowledge the CASCADE Management Committee, staff who cared for CASCADE participants, staff at Tobin Brothers Funerals and the Victorian Institute of Forensic Medicine, and the patients and their families. We thank Jocelyn Sietsma Penington and Maria Macheda for support publishing the CASCADE data.

Author contributions

A.T.P., M.S. and I.A.V. conceived the study. I.A.V. and A.T.P. analysed the data. L.M. and A.T.P. conceived the WGD test, which L.M. implemented. R.J.Y. performed the FISH experiments. S.S. and M.S. consented patients. S.S. provided clinical advice and input into the manuscript, and supported access to clinical data. C.P.M., J.L.K. and J.M. collated data. C.P.M., A.C., C.F., S.B. and P.S. extracted DNA and/or RNA. SQW & SJD generated the ctDNA data and the SNV validation data. C.P.M., A.P., X.D., J.R., and A.H. processed tissues. C.P.M. and S.S. retrieved samples. GMA sequenced the CASCADE samples. J.E. provided computing support and code for the pipeline framework, which I.A.V. adapted. D.L.C. provided input to the SV analysis. R.W. provided input on the clinical data analysis. D.S.W., P.F.C., V.A., R.D., R.J.C., M.P.L., N.H., R.A.S., S.J.D., and G.J.M. provided data. A.T.P., M.S., N.H., R.A.S., S.J.D. and G.J.M. contributed funding. D.B., M.S., H.T., K.A., S.C., N.W., J.L.K., P.O.B. and G.A.Mc. supported the establishment and operation of the CASCADE program. I.A.V., A.T.P. and M.S. wrote the manuscript. All authors read and approved the manuscript.

Data availability

Whole exome and genome sequencing data for CASCADE patients have been deposited in the European Genome-Phenome Archive (EGA) under the study ID EGAS00001004950. These data are available on request to the Peter MacCallum Cancer Centre Data Access Committee (dac@petermac.org). Whole exome sequencing data from patients MI-F, SK-G and SK-H19 are available via the National Center for Biotechnology Information database of Genotypes and Phenotypes (NCBI dbGaP accession p). Whole exome sequencing data from patients ETH-E, ETH-F and ETH-J21 are available on request to Mitchell Levesque (Mitchell.Levesque@usz.ch). These patients did not give their consent for the public availability of their raw sequencing data. The public release of the data was not permitted at the time of collection under Swiss law. Repeat annotations from Repeatmasker (http://www.repeatmasker.org/), known cancer genes from COSMIC (https://cancer.sanger.ac.uk/cosmic), variants from EVS (http://evs.gs.washington.edu/EVS) and 1000 Genomes (http://ftp.1000genomes.ebi.ac.uk/), and pathway information from Reactome (https://reactome.org/) were also referenced in the study. All other data are available within the Article, Supplementary Information or from the authors upon request.

Code availability

All code developed for this in this study is publicly available: analysis pipeline (https://github.com/PapenfussLab/CascadePipe), genome doubling test (https://github.com/PapenfussLab/Genome_doubling_test), and neoantigen analysis pipeline (https://github.com/PapenfussLab/CascadeNeoPipe).

Competing interests

R.D. and M.S. have intermittent, project focused consulting and/or advisory relationships with Novartis, Merck Sharp & Dhome (MSD), Bristol-Myers Squibb (BMS), Merck-Serono, Roche, Amgen, Takeda, Pierre Fabre, and/or Sun Pharma that are unrelated to the submitted work.

Footnotes

Peer review information Nature Communications thanks Peter Campbell and Christoph Klein for their contributions to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Anthony T Papenfuss, Mark Shackleton

Contributor Information

Anthony T. Papenfuss, Email: papenfuss@wehi.edu.au

Mark Shackleton, Email: mark.shackleton@monash.edu.

Supplementary information

The online version contains supplementary material available at 10.1038/s41467-021-21576-8.

References

  • 1.Pollock PM, et al. High frequency of BRAF mutations in nevi. Nat. Genet. 2003;33:19–20. doi: 10.1038/ng1054. [DOI] [PubMed] [Google Scholar]
  • 2.Poynter JN, et al. BRAF and NRAS mutations in melanoma and melanocytic nevi. Melanoma Res. 2006;16:267–273. doi: 10.1097/01.cmr.0000222600.73179.f3. [DOI] [PubMed] [Google Scholar]
  • 3.Roh MR, Eliades P, Gupta S, Tsao H. Genetics of melanocytic nevi. Pigment Cell Melanoma Res. 2015;28:661–672. doi: 10.1111/pcmr.12412. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Hayward NK, et al. Whole-genome landscapes of major melanoma subtypes. Nature. 2017;545:175–180. doi: 10.1038/nature22071. [DOI] [PubMed] [Google Scholar]
  • 5.Cancer Genome Atlas, N. Genomic classification of cutaneous melanoma. Cell. 2015;161:1681–1696. doi: 10.1016/j.cell.2015.05.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Shain AH, et al. The genetic evolution of melanoma from precursor lesions. N. Engl. J. Med. 2015;373:1926–1936. doi: 10.1056/NEJMoa1502583. [DOI] [PubMed] [Google Scholar]
  • 7.Horn S, et al. TERT promoter mutations in familial and sporadic melanoma. Science. 2013;339:959–961. doi: 10.1126/science.1230062. [DOI] [PubMed] [Google Scholar]
  • 8.Huang FW, et al. Highly recurrent TERT promoter mutations in human melanoma. Science. 2013;339:957–959. doi: 10.1126/science.1229259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Berger MF, et al. Melanoma genome sequencing reveals frequent PREX2 mutations. Nature. 2012;485:502–506. doi: 10.1038/nature11071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Krauthammer M, et al. Exome sequencing identifies recurrent somatic RAC1 mutations in melanoma. Nat. Genet. 2012;44:1006–1014. doi: 10.1038/ng.2359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hodis E, et al. A landscape of driver mutations in melanoma. Cell. 2012;150:251–263. doi: 10.1016/j.cell.2012.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Turner J, et al. Kinase gene fusions in defined subsets of melanoma. Pigment Cell Melanoma Res. 2017;30:53–62. doi: 10.1111/pcmr.12560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Stransky N, Cerami E, Schalm S, Kim JL, Lengauer C. The landscape of kinase fusions in cancer. Nat. Commun. 2014;5:4846. doi: 10.1038/ncomms5846. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Harbst K, et al. Multiregion whole-exome sequencing uncovers the genetic evolution and mutational heterogeneity of early-stage metastatic melanoma. Cancer Res. 2016;76:4765–4774. doi: 10.1158/0008-5472.CAN-15-3476. [DOI] [PubMed] [Google Scholar]
  • 15.Gerlinger M, et al. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N. Engl. J. Med. 2012;366:883–892. doi: 10.1056/NEJMoa1113205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Patch AM, et al. Whole-genome characterization of chemoresistant ovarian cancer. Nature. 2015;521:489–494. doi: 10.1038/nature14410. [DOI] [PubMed] [Google Scholar]
  • 17.Notta F, et al. A renewed model of pancreatic cancer evolution based on genomic rearrangement patterns. Nature. 2016;538:378–382. doi: 10.1038/nature19823. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Savas P, et al. The subclonal architecture of metastatic breast cancer: results from a prospective community-based rapid autopsy program “CASCADE”. PLoS Med. 2016;13:e1002204. doi: 10.1371/journal.pmed.1002204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Sanborn JZ, et al. Phylogenetic analyses of melanoma reveal complex patterns of metastatic dissemination. Proc. Natl Acad. Sci. USA. 2015;112:10995–11000. doi: 10.1073/pnas.1508074112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Kemper K, et al. Intra- and inter-tumor heterogeneity in a vemurafenib-resistant melanoma patient and derived xenografts. EMBO Mol. Med. 2015;7:1104–1118. doi: 10.15252/emmm.201404914. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Raaijmakers MI, et al. Co-existence of BRAF and NRAS driver mutations in the same melanoma cells results in heterogeneity of targeted therapy resistance. Oncotarget. 2016;7:77163–77174. doi: 10.18632/oncotarget.12848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Alsop K, et al. A community-based model of rapid autopsy in end-stage cancer patients. Nat. Biotechnol. 2016;34:1010–1014. doi: 10.1038/nbt.3674. [DOI] [PubMed] [Google Scholar]
  • 23.Alexandrov LB, et al. Signatures of mutational processes in human cancer. Nature. 2013;500:415–421. doi: 10.1038/nature12477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Karakas B, Bachman KE, Park BH. Mutation of the PIK3CA oncogene in human cancers. Br. J. Cancer. 2006;94:455–459. doi: 10.1038/sj.bjc.6602970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Damsky WE, et al. beta-catenin signaling controls metastasis in Braf-activated Pten-deficient melanomas. Cancer Cell. 2011;20:741–754. doi: 10.1016/j.ccr.2011.10.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Marin O, et al. A noncanonical sequence phosphorylated by casein kinase 1 in beta-catenin may play a role in casein kinase 1 targeting of important signaling proteins. Proc. Natl Acad. Sci. USA. 2003;100:10193–10200. doi: 10.1073/pnas.1733909100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Chang MT, et al. Identifying recurrent mutations in cancer reveals widespread lineage diversity and mutational specificity. Nat. Biotechnol. 2016;34:155–163. doi: 10.1038/nbt.3391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Berenjeno IM, et al. Oncogenic PIK3CA induces centrosome amplification and tolerance to genome doubling. Nat. Commun. 2017;8:1773. doi: 10.1038/s41467-017-02002-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Werner-Klein M, et al. Genetic alterations driving metastatic colony formation are acquired outside of the primary tumour in melanoma. Nat. Commun. 2018;9:595. doi: 10.1038/s41467-017-02674-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Jamal-Hanjani M, et al. Tracking the evolution of non-small-cell lung cancer. N. Engl. J. Med. 2017;376:2109–2121. doi: 10.1056/NEJMoa1616288. [DOI] [PubMed] [Google Scholar]
  • 31.Andreatta M, Nielsen M. Gapped sequence alignment using artificial neural networks: application to the MHC class I system. Bioinformatics. 2016;32:511–517. doi: 10.1093/bioinformatics/btv639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Nielsen M, et al. Reliable prediction of T-cell epitopes using neural networks with novel sequence representations. Protein Sci. 2003;12:1007–1017. doi: 10.1110/ps.0239403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Linnemann C, et al. High-throughput epitope discovery reveals frequent recognition of neo-antigens by CD4+ T cells in human melanoma. Nat. Med. 2015;21:81–85. doi: 10.1038/nm.3773. [DOI] [PubMed] [Google Scholar]
  • 34.Verdegaal EM, et al. Neoantigen landscape dynamics during human melanoma-T cell interactions. Nature. 2016;536:91–95. doi: 10.1038/nature18945. [DOI] [PubMed] [Google Scholar]
  • 35.Mittal D, Gubin MM, Schreiber RD, Smyth MJ. New insights into cancer immunoediting and its three component phases-elimination, equilibrium and escape. Curr. Opin. Immunol. 2014;27:16–25. doi: 10.1016/j.coi.2014.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Pouryazdanparast P, Newman M, Mafee M, Guitart J, Gerami P. Malignant melanoma with monster cells showing massive cyclin D1 amplification. Am. J. Dermatopathol. 2009;31:402–403. doi: 10.1097/DAD.0b013e31819f8316. [DOI] [PubMed] [Google Scholar]
  • 37.L’Abbate A, et al. Genomic organization and evolution of double minutes/homogeneously staining regions with MYC amplification in human cancer. Nucleic Acids Res. 2014;42:9131–9145. doi: 10.1093/nar/gku590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Laughney AM, Elizalde S, Genovese G, Bakhoum SF. Dynamics of tumor heterogeneity derived from clonal karyotypic evolution. Cell Rep. 2015;12:809–820. doi: 10.1016/j.celrep.2015.06.065. [DOI] [PubMed] [Google Scholar]
  • 39.Dewhurst SM, et al. Tolerance of whole-genome doubling propagates chromosomal instability and accelerates cancer genome evolution. Cancer Discov. 2014;4:175–185. doi: 10.1158/2159-8290.CD-13-0285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Vogelstein B, et al. Cancer genome landscapes. Science. 2013;339:1546–1558. doi: 10.1126/science.1235122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Nigg EA. Centrosome aberrations: cause or consequence of cancer progression? Nat. Rev. Cancer. 2002;2:815–825. doi: 10.1038/nrc924. [DOI] [PubMed] [Google Scholar]
  • 42.Thompson SL, Bakhoum SF, Compton DA. Mechanisms of chromosomal instability. Curr. Biol. 2010;20:R285–R295. doi: 10.1016/j.cub.2010.01.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Maiato H, Logarinho E. Mitotic spindle multipolarity without centrosome amplification. Nat. Cell Biol. 2014;16:386–394. doi: 10.1038/ncb2958. [DOI] [PubMed] [Google Scholar]
  • 44.Valente D, et al. HIPK2 deficiency causes chromosomal instability by cytokinesis failure and increases tumorigenicity. Oncotarget. 2015;6:10320–10334. doi: 10.18632/oncotarget.3583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Aylon Y, Oren M. p53: guardian of ploidy. Mol. Oncol. 2011;5:315–323. doi: 10.1016/j.molonc.2011.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Yin XY, Grove L, Datta NS, Long MW, Prochownik EV. C-myc overexpression and p53 loss cooperate to promote genomic instability. Oncogene. 1999;18:1177–1184. doi: 10.1038/sj.onc.1202410. [DOI] [PubMed] [Google Scholar]
  • 47.Yan X, Habedanck R, Nigg EA. A complex of two centrosomal proteins, CAP350 and FOP, cooperates with EB1 in microtubule anchoring. Mol. Biol. Cell. 2006;17:634–644. doi: 10.1091/mbc.e05-08-0810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Pirzio LM, Pichierri P, Bignami M, Franchitto A. Werner syndrome helicase activity is essential in maintaining fragile site stability. J. Cell Biol. 2008;180:305–314. doi: 10.1083/jcb.200705126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Koh J, Enders GH, Dynlacht BD, Harlow E. Tumour-derived p16 alleles encoding proteins defective in cell-cycle inhibition. Nature. 1995;375:506–510. doi: 10.1038/375506a0. [DOI] [PubMed] [Google Scholar]
  • 50.Hattori M, et al. Molecular cloning of a novel mitogen-inducible nuclear protein with a Ran GTPase-activating domain that affects cell cycle progression. Mol. Cell Biol. 1995;15:552–560. doi: 10.1128/MCB.15.1.552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Delaval B, et al. Aurora B -TACC1 protein complex in cytokinesis. Oncogene. 2004;23:4516–4522. doi: 10.1038/sj.onc.1207593. [DOI] [PubMed] [Google Scholar]
  • 52.Archinti M, Lacasa C, Teixido-Travesa N, Luders J. SPICE-a previously uncharacterized protein required for centriole duplication and mitotic chromosome congression. J. Cell Sci. 2010;123:3039–3046. doi: 10.1242/jcs.069963. [DOI] [PubMed] [Google Scholar]
  • 53.Ikeda M, Chiba S, Ohashi K, Mizuno K. Furry protein promotes aurora A-mediated Polo-like kinase 1 activation. J. Biol. Chem. 2012;287:27670–27681. doi: 10.1074/jbc.M112.378968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Wistuba II, et al. Two identical triplet sisters carrying a germline BRCA1 gene mutation acquire very similar breast cancer somatic mutations at multiple other sites throughout the genome. Genes Chromosomes Cancer. 2000;28:359–369. doi: 10.1002/1098-2264(200008)28:4&#x0003c;359::AID-GCC1&#x0003e;3.0.CO;2-N. [DOI] [PubMed] [Google Scholar]
  • 55.Kuipers J, Jahn K, Raphael BJ, Beerenwinkel N. Single-cell sequencing data reveal widespread recurrence and loss of mutational hits in the life histories of tumors. Genome Res. 2017;27:1885–1894. doi: 10.1101/gr.220707.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Van Allen EM, et al. The genetic landscape of clinical resistance to RAF inhibition in metastatic melanoma. Cancer Discov. 2014;4:94–109. doi: 10.1158/2159-8290.CD-13-0617. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Shi H, et al. Acquired resistance and clonal evolution in melanoma during BRAF inhibitor therapy. Cancer Discov. 2014;4:80–93. doi: 10.1158/2159-8290.CD-13-0642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Sharma P, Hu-Lieskovan S, Wargo JA, Ribas A. Primary, adaptive, and acquired resistance to cancer immunotherapy. Cell. 2017;168:707–723. doi: 10.1016/j.cell.2017.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Storchova Z, Kuffer C. The consequences of tetraploidy and aneuploidy. J. Cell Sci. 2008;121:3859–3866. doi: 10.1242/jcs.039537. [DOI] [PubMed] [Google Scholar]
  • 60.Carter SL, et al. Absolute quantification of somatic DNA alterations in human cancer. Nat. Biotechnol. 2012;30:413–421. doi: 10.1038/nbt.2203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Zack TI, et al. Pan-cancer patterns of somatic copy number alteration. Nat. Genet. 2013;45:1134–1140. doi: 10.1038/ng.2760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Kheir SM, et al. Prognostic significance of DNA aneuploidy in stage I cutaneous melanoma. Ann. Surg. 1988;207:455–461. doi: 10.1097/00000658-198804000-00014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Reddy VB, Gattuso P, Aranha G, Carson HJ. Cell proliferation markers in predicting metastases in malignant melanoma. J. Cutan. Pathol. 1995;22:248–251. doi: 10.1111/j.1600-0560.1995.tb00746.x. [DOI] [PubMed] [Google Scholar]
  • 64.Wong SQ, et al. Targeted-capture massively-parallel sequencing enables robust detection of clinically informative mutations from formalin-fixed tumours. Sci. Rep. 2013;3:3494. doi: 10.1038/srep03494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Reid AL, Freeman JB, Millward M, Ziman M, Gray ES. Detection of BRAF-V600E and V600K in melanoma circulating tumour cells by droplet digital PCR. Clin. Biochem. 2015;48:999–1002. doi: 10.1016/j.clinbiochem.2014.12.007. [DOI] [PubMed] [Google Scholar]
  • 66.Josephidou M, Lynch AG, Tavare S. multiSNV: a probabilistic approach for improving detection of somatic point mutations from multiple related tumour samples. Nucleic Acids Res. 2015;43,:e61. doi: 10.1093/nar/gkv135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Koboldt DC, et al. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res. 2012;22:568–576. doi: 10.1101/gr.129684.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Cibulskis K, et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 2013;31:213–219. doi: 10.1038/nbt.2514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.McKenna A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Rosenthal R, McGranahan N, Herrero J, Taylor BS, Swanton C. DeconstructSigs: delineating mutational processes in single tumors distinguishes DNA repair deficiencies and patterns of carcinoma evolution. Genome Biol. 2016;17:31. doi: 10.1186/s13059-016-0893-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Alexandrov LB, et al. The repertoire of mutational signatures in human cancer. Nature. 2020;578:94–101. doi: 10.1038/s41586-020-1943-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Favero F, et al. Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data. Ann. Oncol. 2015;26:64–70. doi: 10.1093/annonc/mdu479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Cameron, D. L. et al. GRIDSS: sensitive and specific genomic rearrangement detection using positional de Bruijn graph assembly. Genome Res.10.1101/gr.222109.117 (2017). [DOI] [PMC free article] [PubMed]
  • 74.Schwarz RF, et al. Phylogenetic quantification of intra-tumour heterogeneity. PLoS Comput Biol. 2014;10:e1003535. doi: 10.1371/journal.pcbi.1003535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Goloboff PA, Farris JS, Nixon KC. TNT, a free program for phylogenetic analysis. Cladistics. 2008;24:774–786. doi: 10.1111/j.1096-0031.2008.00217.x. [DOI] [Google Scholar]
  • 76.Akaike H. A new look at the statistical model identification. IEEE Trans. Autom. Control. 1974;19:716–723. doi: 10.1109/TAC.1974.1100705. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reporting Summary (93.7KB, pdf)

Data Availability Statement

Whole exome and genome sequencing data for CASCADE patients have been deposited in the European Genome-Phenome Archive (EGA) under the study ID EGAS00001004950. These data are available on request to the Peter MacCallum Cancer Centre Data Access Committee (dac@petermac.org). Whole exome sequencing data from patients MI-F, SK-G and SK-H19 are available via the National Center for Biotechnology Information database of Genotypes and Phenotypes (NCBI dbGaP accession p). Whole exome sequencing data from patients ETH-E, ETH-F and ETH-J21 are available on request to Mitchell Levesque (Mitchell.Levesque@usz.ch). These patients did not give their consent for the public availability of their raw sequencing data. The public release of the data was not permitted at the time of collection under Swiss law. Repeat annotations from Repeatmasker (http://www.repeatmasker.org/), known cancer genes from COSMIC (https://cancer.sanger.ac.uk/cosmic), variants from EVS (http://evs.gs.washington.edu/EVS) and 1000 Genomes (http://ftp.1000genomes.ebi.ac.uk/), and pathway information from Reactome (https://reactome.org/) were also referenced in the study. All other data are available within the Article, Supplementary Information or from the authors upon request.

All code developed for this in this study is publicly available: analysis pipeline (https://github.com/PapenfussLab/CascadePipe), genome doubling test (https://github.com/PapenfussLab/Genome_doubling_test), and neoantigen analysis pipeline (https://github.com/PapenfussLab/CascadeNeoPipe).


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES