Abstract
Although p53 inactivation promotes genomic instability1 and presents a route to malignancy for more than half of all human cancers2,3, the patterns through which heterogenous TP53 (encoding human p53) mutant genomes emerge and influence tumorigenesis remain poorly understood. Here, in a mouse model of pancreatic ductal adenocarcinoma that reports sporadic p53 loss of heterozygosity before cancer onset, we find that malignant properties enabled by p53 inactivation are acquired through a predictable pattern of genome evolution. Single-cell sequencing and in situ genotyping of cells from the point of p53 inactivation through progression to frank cancer reveal that this deterministic behaviour involves four sequential phases—Trp53 (encoding mouse p53) loss of heterozygosity, accumulation of deletions, genome doubling, and the emergence of gains and amplifications—each associated with specific histological stages across the premalignant and malignant spectrum. Despite rampant heterogeneity, the deletion events that follow p53 inactivation target functionally relevant pathways that can shape genomic evolution and remain fixed as homogenous events in diverse malignant populations. Thus, loss of p53—the ‘guardian of the genome’—is not merely a gateway to genetic chaos but, rather, can enable deterministic patterns of genome evolution that may point to new strategies for the treatment of TP53-mutant tumours.
Subject terms: Cancer genetics, Cancer genomics
Malignant evolution enabled by p53 inactivation in mice proceeds through an ordered and predictable pattern of Trp53 loss of heterozygosity, accumulation of deletions, genome doubling and the emergence of gains and amplifications.
Main
Inactivating mutations in the TP53 tumour suppressor gene are associated with cancers that are particularly aggressive and refractory to therapy2,4. Some of the earliest insights into p53 action and the consequences of TP53 mutation linked p53 to a DNA-damage-induced cell cycle checkpoint of which the inactivation enables genomic instability5–8, implying that p53 acts as a guardian of the genome to prevent the emergence of cells containing potentially tumour-promoting mutations9. Subsequent research has demonstrated that p53 transcriptionally coordinates an expansive set of cell fate programs that actively limit tumorigenesis and the disruption of which remains critical for tumour maintenance5,10–14. However, owing to the absence of markers that discretely define cells after p53 inactivation, precisely how genomic instability manifests and shapes the transformation of TP53-mutant lineages as a consequence of the loss of these programs has not been defined.
Next-generation sequencing studies of human tumours associate TP53 mutations with features of genomic instability, including rampant copy-number alterations (CNAs)15, chromothripsis16 and whole-genome doubling (polyploidy)17,18, and TP53-mutant tumours often display substantial intratumoral heterogeneity19,20. Consistent with a role for p53 inactivation as an enabler of genomic chaos, inferential reconstructions of the order of events from bulk sequencing data often places TP53 mutations early in evolutionary time, preceding other genomic rearrangements3. However, the timing and order with which these features arise after TP53 inactivation and their relationship with the biological transitions in stepwise cancer development have not been established, in part because human cancers are examined at the end point and not as they progress through the benign-to-malignant transition. Such information is important to understand the relationship between TP53 mutations, genomic instability and cancer progression, and may ultimately inform therapeutic interventions.
TP53 mutations are a prominent feature of pancreatic ductal adenocarcinoma (PDAC), a lethal disease also dominated by frequent mutations in other well-established driver genes including oncogenic mutations in KRAS and inactivating mutations in the cell cycle inhibitor CDKN2A and/or the TGF-β pathway effector SMAD421. The loss of TP53 represents a key inflection point in the progression of PDAC, as TP53 loss of heterozygosity (LOH) resulting in biallelic TP53 inactivation is strongly associated with progression to invasive and genomically heterogeneous disease22,23 with recent studies linking acquired CNAs to disease progression and phenotypic heterogeneity24,25. Nevertheless, given the limited availability of patient tissue before and after tumour development26,27, it has been impossible to gain a temporal picture of how p53 inactivation leads to the evolution of PDAC genomes during malignant progression. Here we used a dual-fluorescence lineage-tracing model of PDAC that reports selection for Trp53 LOH in vivo, thereby permitting the direct observation of the evolutionary dynamics of cells undergoing stepwise progression to malignancy at the single-cell resolution from the point of p53 inactivation. Our results demonstrate that tumour evolution after p53 inactivation in the setting of pancreatic transformation is not random but is subject to deterministic features that contribute to the genomic and biological hallmarks of Trp53-mutant tumours.
Lineage tracing of sporadic Trp53 LOH
Mouse pancreas cancer models driven by conditional activation of an oncogenic KrasG12D allele and a single conditional inactivating allele of Trp53 (hereafter, the KPC model) result in pathophysiologically accurate PDAC that develops with near invariable loss of the remaining wild-type (WT) Trp53 allele (hereafter p53)28–30. To expedite the production of experimental cohorts and enable stage-specific genetic perturbations, we developed a series of PDAC models that are produced directly from multiallelic embryonic stem cells (GEMM-ES cells)31. By integrating alleles that facilitate inducible expression of short hairpin RNAs (shRNAs) in pancreatic cells expressing mutant Kras, we studied the role of tumour-suppressor loss in PDAC maintenance and chromatin regulation in early-stage neoplasia10,32,33. To extend this approach to PDAC initiated by mutant Kras and mono-allelic inactivation of p53, we generated a GEMM-ES cell platform containing the following alleles: a pancreas-specific cre, a lox-stop-lox KrasG12D allele, a conditional knockout p53 allele (p53flox), a lox-stop-lox rtTA-IRES-mKate allele knocked into the Rosa26 locus34 and a collagen homing cassette (CHC)35 introduced into the Col1a1 gene that facilitates targeting of various genetic elements using recombination-mediated cassette exchange (Extended Data Fig. 1a and Methods).
In developing this platform, we produced a model that did not function as initially intended but instead offered the ability to trace the lineage of cells that have lost p53 function (Fig. 1a and Extended Data Fig. 1). Owing to the linkage of the CHC to the WT p53 allele on the opposite chromosome as the conditional p53 allele, the GFP-coupled shRNA cassette is lost as PDAC develops after p53 LOH (Fig. 1a). Thus, in a setting in which a neutral GFP-linked shRNA (here, for example, targeting Renilla luciferase (shRenilla)) is incorporated into the system and the mice are fed doxycycline chow, premalignant tissue is double-positive (DP) for mKate (that is, lineage tracing of cells upon mutant Kras activation) and GFP (shRNA) fluorescence, whereas the resulting PDAC, which acquires p53 LOH, is single positive (SP) for mKate fluorescence (Fig. 1a and Extended Data Fig. 1b). Incorporation of a GFP-coupled p53 shRNA into the CHC locus to suppress p53 in trans prevented the loss of GFP fluorescence in PDAC, demonstrating that the transition from DP to SP cells results from selection for p53 inactivation (Extended Data Fig. 1b–e). Genotyping and digital PCR of sorted DP versus SP cells, along with sparse whole-genome sequencing (WGS) and immunohistochemistry (IHC) analysis of pancreatic tissue after PDAC development confirmed the genomic and functional link between the loss of the GFP-positive CHC locus and WT p53 (Extended Data Fig. 1f–j). GFP retention in PDAC was also observed using a complementary model in which a hot-spot p53 mutant (p53R172H) was engineered in cis to the CHC and not lost after p53 LOH (KPCcis-shRNA; Extended Data Fig. 2a–d). Thus, the physical linkage between the p53 locus and a fluorescent reporter in this model (designated KPCLOH) acts as a lineage-tracing mark of p53 LOH, permitting phenotypic analysis of cells after selection for biallelic p53 inactivation.
Analysis of KPCLOH pancreata after PDAC development confirmed that SP cells (acquiring p53 LOH) have malignant properties, whereas DP cells (retaining WT p53) do not. Thus, SP tumour cells displayed a malignant histology, whereas DP cells were confined to lesions with premalignant histopathology irrespective of their location within the tumour mass or adjacent premalignant tissue (Fig. 1b–d). In agreement, SP cells sorted from PDAC tissue had a much greater tumour-initiating potential compared with tumour-associated DP cells after orthotopic transplantation into immunocompromised mice (Extended Data Fig. 2e,f), and the few tumours that arose from DP cells had no GFP fluorescence (for example, representing cells that underwent a p53 LOH event) or focal p53 alteration events that maintained the GFP targeted locus (Extended Data Fig. 2g,h).
We envisioned that the lineage-tracing abilities of the above model would enable in vivo genotyping of cells throughout PDAC progression. Notably, an analysis of tissues derived from KPCLOH mice lacking detectable PDAC (that is, pre-tumour) revealed that SP cells were present as cells emerging within DP structures, or as variably sized lesions histologically consistent with premalignant cell fate: acinar to ductal metaplasia (ADM), atypical flat lesions (AFL), and low- and high-grade pancreatic intraepithelial neoplasia (PanIN)36–38 (Fig. 1e–g and Extended Data Fig. 2i). In contrast to the high proliferative fraction and tumour-initiating potential of SP cells present in frank PDAC, SP cells within premalignant lesions showed a low proliferative fraction that increased with lesion size and, when isolated from mice without gross PDAC, displayed poor colony-forming and tumour-initiating abilities (Fig. 1h–k). Thus, these results imply that selection for p53 loss in pancreatic cells expressing oncogenic Kras is not sufficient in and of itself to confer malignant fitness, but that these properties are acquired over time, facilitated by the absence of p53 function. As such, the KPCLOH model enables the isolation of p53-deficient cells at distinct stages of transformation (Extended Data Fig. 2i,j), including an initial, intermediate evolutionary phase that connects p53 inactivation to the acquisition of cancer-initiating potential.
CNAs follow p53 LOH
The genomes of TP53-mutant cancers are already highly rearranged and genomically heterogenous at diagnosis15,19. To determine the specific consequence of p53 inactivation on genomic evolution during PDAC development, we performed sparse WGS comparing 38 flow-sorted SP and DP populations isolated from PDAC-bearing pancreata, including 17 matched DP and SP pairs. Whereas DP genomes were invariably diploid and rarely displayed CNAs, SP cell genomes were highly rearranged and frequently polyploid (Fig. 2a and Extended Data Fig. 3a). Notably, consistent with biological selection for additional genomic driver events, SP cells sorted after PDAC development acquired recurrent losses on chromosomes 4, 7, 9, 11 and 13, as well as gains on chromosomes 3, 5, 6, 8 and 15 (Fig. 2b). These alterations were also observed in sequencing profiles obtained from PDAC produced by shRNA-mediated p53 suppression or after p53 LOH in cells containing a recurrent p53 hotspot mutation (such as KPCR172H), implying that copy-number evolution in p53 altered PDAC results from p53 inactivation and does not require gain-of-function effects of mutant p53 (Extended Data Fig. 3b,c).
A meta-analysis of recurrent gains and losses in sorted SP populations from PDAC-bearing mice (n = 24) revealed conservation with CNAs that are frequently observed in human PDAC. These conserved CNAs possess known PDAC drivers, such as deletions on chromosomes 4 and 11 (encompassing the Cdkn2a and p53 loci, respectively) as well as gains of chromosome 6, including Kras39 (Fig. 2b and Extended Data Fig. 3d). Other recurrent CNAs that occurred in both species include deletions of chromosome 9 (corresponding to regions on human chromosomes 3 and 6) and gains of chromosome 5 (corresponding to human chromosomes 7 and 13) (Fig. 2b,c and Extended Data Fig. 3d). These regions contained genes implicated in processes that are linked to PDAC development, including chromatin remodelling (Mll2 and Setd2), axon guidance (Sema3a and Sema3b), PDAC proliferation or progression (Il6, Shh and Cdk8) and TGF-β signalling (TgfbrII and Bmp5)40–43.
The presence of recurrent copy-number events encompassing known PDAC drivers and their synteny to those present in TP53-mutant human tumours implies that the selective forces driving genome evolution in TP53-mutant cancers are similar across species. To test whether these acquired events target functionally relevant pathways, we enforced one predicted consequence of chromosome 9 deletions—that is, TGF-β pathway disruption—using shRNA-mediated knockdown of Smad4, the transcriptional effector of TGF-β signalling, in the KPCcis-shRNA model described above (Methods and Extended Data Fig. 2). Smad4 suppression not only accelerated the development of PDAC with inactivated p53, but also alleviated the pressure to lose chromosome 9, effects that were not observed in otherwise identical cohorts with a neutral (shRenilla) shRNA (Fig. 2d and Extended Data Fig. 3e–h). Notably, the effects of Smad4 suppression may alter selection for copy-number changes beyond chromosome 9 loss, as the deletion landscape in KPCcis-shSmad4 tumours also exhibits altered frequencies of losses on chromosomes 4, 12 and 13 (Extended Data Fig. 3f–h). Thus, recurrent deletions can target critical pathways that contribute to the phenotypic and genomic evolution of p53-mutant cancers.
Ordered phases of genome evolution
The above data validate the KPCLOH model as a powerful platform to link the acquisition of genomic rearrangements to the phenotypic progression to malignancy after p53 inactivation. As our genomic and functional analyses nominate recurrent CNAs as such selected events, we performed single-cell genome sequencing of lineages defined by p53 inactivation isolated from KPCLOH mice both after cancer development and during the benign-to-malignant transition (namely, PDAC versus pre-tumour mice; Extended Data Fig. 2j). We reasoned that such an approach would enable us to visualize the accumulation of CNA events over time and leverage the nature of the acquired CNAs and their associated breakpoints as an additional lineage-tracing dimension to establish detailed phylogenetic relationships during distinct phases of tumour evolution after p53 loss (Methods).
Single-cell sequencing of DP and SP cells from six PDAC-bearing pancreata (designated T1–T6) corroborated the bulk sequencing data of flow-sorted populations and permitted the analysis of intratumoural genetic heterogeneity and the clonal relationships between p53-intact and p53-LOH lineages (Fig. 3a and Extended Data Fig. 4a–e). As expected, DP cells were largely euploid without recurrent CNAs. Although in two cases a subset of DP cells had gains on chromosomes 2 and 6, matched SP cell populations lacked these gains (Extended Data Fig. 4b). By contrast, SP cells from PDAC-bearing mice (hereafter, PDAC-SP cells) carried a large number of CNAs, were genomically heterogeneous and mostly polyploid (Fig. 3a and Extended Data Fig. 4a,c). Breakpoint-based phylogenetic analysis (Methods) revealed that this intratumoural heterogeneity was often associated with a clonal sweep of related polyploid PDAC-SP cells (for example, PDAC samples T1 and T4) that lacked a definable relationship with matched diploid DP cells (Fig. 3a and Extended Data Fig. 4d). Thus, single-cell sequencing of PDAC tissue from the KPCLOH model reveals two discrete genomic states defined by p53 status without an apparent evolutionary medium to connect them (namely, DP, diploid and non-rearranged; SP, polyploid and highly rearranged).
Analysis of SP cells from pre-tumour mice provided a bridge. Single-cell analysis of SP cells isolated from seven age-matched non-tumour-bearing mice (hereafter, pre-tumour SP cells, P1–P7) revealed that pre-tumour SP populations were distinct from PDAC SP and associated DP cells in that they remained largely diploid but had acquired a wide-range of CNAs (Fig. 3a and Extended Data Fig. 5a). Moreover, a small subpopulation of polyploid cells was detected in 6 out of the 7 pre-tumour SP samples analysed and, in most cases, could be related to a diploid precursor (Fig. 3a and Extended Data Figs. 5b–d and 6). Thus, single-cell sequencing establishes distinct phases of genome evolution after p53 LOH in which CNAs are first acquired in diploid cells, with polyploidy emerging as a relatively late event. Consistent with this evolutionary continuum, two PDAC-SP populations presented as a mixture of rearranged diploid and related polyploid cells (Extended Data Fig. 4b,e).
The above results were confirmed by in situ genomic analysis of pathologically defined benign and malignant lesions. Specifically, we performed fluorescence guided laser microdissection (LMD) followed by sequencing of DP and SP lesions isolated from tumour and pre-tumour mice (Methods). Consistent with bulk and single-cell sequencing data, DP cells were largely euploid containing occasional gains on chromosome 6, whereas PDAC-SP lesions invariably acquired widespread copy-number changes and were frequently polyploid (Fig. 3b and Extended Data Fig. 7a–c). Pre-SP lesions with premalignant morphology were diploid and contained few CNAs (mainly deletions), with the loss of chromosome 11 (where p53 resides) being the dominant event (Fig. 3b and Extended Data Fig. 7b,c). DNA FISH analysis confirmed that prominent chromosome 9 deletions and polyploidy were restricted to SP cells with PDAC histopathology (Extended Data Fig. 7d,e). Consistent with these histological findings, the majority of rare pre-tumour SP cells capable of colony formation in vitro displayed rearranged polyploid genomes (Extended Data Fig. 7f). These results illustrate phases of genomic evolution that directly couple the degree of CNA acquisition and ploidy state after p53 LOH to malignant pancreatic transformation.
Determinism governs evolutionary paths
Further examination of the single-cell data revealed non-random patterns through which copy-number changes are selected during discrete phases of genome evolution after p53 inactivation. Pre-tumour SP cells displayed distinct breakpoint patterns on chromosome 11, indicative of independent, competing, p53 LOH lineages emerging during the benign-to-malignant switch (Fig. 4a,b). A single-cell census genotyping approach confirmed that these resulted in loss of the WT p53 haplotype (Fig. 4a,b and Extended Data Fig. 8a–c). Although some pre-tumour SP cells had only chromosome 11 deletions, evolving populations gradually acquired additional deletions, including recurrent events conserved in mouse and human PDAC. The one exception to this deletion-centric pattern involved occasional interstitial gains of chromosome 6 encompassing Kras that, owing to their distinct structural features compared with those occurring in DP cells, were most likely acquired after p53 LOH (Figs. 2a–c and 4c–e and Extended Data Fig. 8d,e). These results add granularity to a previous report implying that Kras gains contribute to tumorigenesis after p53 inactivation39. Thus, the most proximal events to p53 inactivation involve the accumulation of deletions in diploid cells, including functionally validated deletions on chromosome 9 (for example, TGF-β signalling; Fig. 2).
Of the 10–20% of pre-tumour SP cells that were polyploid, a breakpoint-based phylogenetic analysis demonstrated that most could be traced to highly rearranged diploid precursors (Figs. 3a and 4f,g and Extended Data Figs. 5c,d and 6). In some instances, different genome doubling events arising from distinct diploid precursors were detected while, in others, a single event was followed by rapid genomic diversification resulting in a heterogenous expanding polyploid lineage (Fig. 4g and Extended Data Fig. 8f). These results are consistent with genome doubling as an active process (for example, occurring multiple times during the evolution of p53 LOH lineages) and, when giving rise to expanding polyploid clones, arising from highly rearranged diploid precursors (Figs. 3a and 4f,g and Extended Data Fig. 5).
Although polyploid pre-tumour SP cells continued to acquire deletion events, they also began to accumulate widespread chromosomal gains and focal amplifications that were largely absent in the diploid state (Fig. 4g,h). In agreement, polyploid PDAC-SP cells also displayed more gains and amplifications compared with diploid pre-tumour SP and PDAC-SP cells (Fig. 4h and Extended Data Fig. 8g). Although many of these gains and amplifications encompassed validated oncogenic drivers, they were invariably subclonal and displayed three layers of heterogeneity: (1) presence or absence in a subclone; (2) variation in copy-number state between related cells; and (3) single focal events that target validated drivers such as MYC44 (Fig. 4i and Extended Data Fig. 9a–d). By contrast, deletion events maintained a higher degree of homogeneity compared with gains both at recurrent deletion events (for example, chromosome 9) as well as at the genome-wide level (Fig. 4i,j and Extended Data Fig. 9e). These results establish a deterministic pattern of genome evolution during pancreatic neoplasia: p53 LOH, followed by the accumulation of deletions, polyploidy and then gains and amplifications. They also imply that polyploidy enables the accumulation of a broader repertoire of CNAs that, in the case of chromosomal gains, are generally not tolerated in cells with diploid genomes.
Conservation of patterns in human PDAC
Analysis of human PDAC using whole-genome sequencing25, targeted capture sequencing (MSKCC-IMPACT)45 and single-cell sequencing datasets confirmed that the genomes of TP53-mutant PDAC display patterns predicted from our lineage-tracing model (Fig. 5 and Extended Data Figs. 10 and 11). Consistent with an initial deletion-centric route to genome evolution, diploid PDAC sustaining biallelic TP53 mutations contained more recurrent deletions (for example, 9p, 17p, 18q) compared with those retaining one or two copies of WT TP53 and showed a relative paucity of gains and amplifications (Fig. 5a and Extended Data Fig. 10a). Furthermore, while tumours retaining WT TP53 were invariably diploid, the majority of those harbouring biallelic TP53 mutations were polyploid and had acquired substantially more gains and amplifications (Fig. 5b,c and Extended Data Fig. 10b,c).
Analysis of bulk tumour samples indicated that deletion events exhibited a higher degree of homogeneity compared with gains in both diploid and polyploid genomes (Fig. 5d and Extended Data Fig. 10d,e), a result confirmed by single-cell sequencing of a series of diploid and polyploid tumours (Extended Data Fig. 11a,b). Thus, regions containing known tumour suppressors on chromosome 9p, 17p and 18q were found homogenously, whereas amplifications targeting MYC, KRAS and GATA6—oncogenic events that drive metastatic progression and/or influence PDAC subtypes25,39,44—were heterogenous (Extended Data Fig. 11c–e). Accordingly, patients with polyploid versus diploid PDAC with biallelic TP53 mutations were significantly more likely to present with metastatic disease (Fig. 5e), which was associated with the worst survival in patients (Extended Data Fig. 10f). These results reinforce the deterministic evolutionary patterns observed in our mouse model, with early and homogenous deletion events dominating the diploid state and the emergence of polyploidy, gains and amplifications linked to more aggressive disease (Extended Data Fig. 12).
Discussion
Despite the well-established association between p53 inactivation and genomic instability1, the trajectories by which genomic instability arises and shapes tumour progression after TP53 loss have remained obscure owing to challenges in simultaneously monitoring TP53 status, genome evolution and cellular phenotype during the stepwise process of malignant progression. Here we took advantage of a mouse pancreatic cancer model containing a unique reporter system to trace genome evolution after sporadic p53 inactivation during a previously inaccessible phase of cancer evolution initiated by p53 loss of function—at the benign-to-malignant transition. By pinpointing and tracing cells after p53 LOH in premalignancy, we demonstrate that the evolution of malignant pancreatic genomes enabled by p53 disruption is not random; instead, it occurs through distinct, ordered phases that operate with predictive principles that can be linked to specific histological stages and contribute functionally to tumour progression. Thus, although p53 inactivation unleashes rampant intratumoral heterogeneity, selective forces lead to a surprisingly reproducible pattern of genome evolution that can be observed in the corresponding human disease.
The lineage-tracing approach provides granularity that reveals a notable degree of determinism in the sequence of events, with deletions dominating early evolution and gains and amplifications being acquired later on. The preponderance of deletions as the earliest events after p53 inactivation in mouse and human PDAC implies that reduced activity of certain pathways (for example, TGF-β) may be essential for malignant initiation, whereas the increased activity of other pathways (such as MYC, axon guidance) may be more important during progression46. This ordered evolution is consistent with the ability of acute p53 inactivation to facilitate the acquisition of chromosome losses47 and suggests that deletions inferred as occurring early in sequencing and phylogenetic studies of cancer evolution3,48,49 may represent events involved in establishing fitness that drives premalignant-to-malignant transitions.
Our results also provide unanticipated insights into the emergence of polyploidy during tumorigenesis unleashed by p53 inactivation. Although associations between polyploidy and genome evolution with TP53 mutation have been noted previously18,48,49, lineage tracing of pancreas cancer evolution from premalignancy through frank cancer development suggests that genome doubling neither precedes p53 inactivation nor is the first selected event after p53 loss. Instead, p53-deficient cells accrue an excess of genomic deletions before genome doubling, after which newly polyploid cells diversify, enabling selection for chromosomal gains and amplifications. Thus, the initial selection for polyploidy may reflect an adaptive mechanism to compensate for rampant loss of gene dosage created by an excess of deletions, only then accelerating genome diversification, and the accumulation of oncogenic events linked to elevated gene dosage that apparently fuel tumour progression25. Indeed, we note that patients with TP53-mutant polyploid PDAC show a greater incidence of metastasis at diagnosis, a property that has recently been associated with KRAS and MYC amplifications in functional studies and in patients25,39,44.
Overwhelming evidence indicates that p53 suppresses tumorigenesis by inducing a set of transcriptionally regulated effector programs that collectively limit the proliferation of oncogene-expressing cells and of which the ongoing inactivation is needed to sustain disease5,10–14. However, by capturing p53-deficient cells well before the emergence of frank PDAC, we observed that p53 loss is not sufficient in and of itself for malignancy; instead, the acquisition of recurrent CNAs is also required. Interestingly, although p53-deficient cells gain proliferative potential during the benign-to-malignant switch, the fact that this was not immediately detected may reflect the limitations of our assay or that other niche-specific factors (such as immune surveillance) create an initial selective advantage for p53-deficient cells. Regardless, our results provide compelling evidence that both the loss of canonical tumour suppressor functions and the ensuing genomic instability each functionally contribute to the emergence of aggressive cancer in our PDAC model.
Patients with TP53-mutant tumours have a poor prognosis and often respond poorly to cancer therapy4. The fact that TP53 mutations are associated with polyploid tumours littered with CNAs and other rearrangements helps to explain the rampant heterogeneity that is a hallmark of TP53 mutant cancers, undoubtedly contributing to their aggressive tumour behaviour15,18,19. At the same time, the predictable pattern of genome evolution enabled by p53 inactivation may have therapeutic ramifications. For example, although much focus has been placed on targeting oncogenes in amplified/gained regions, their subclonal nature in TP53-mutant polyploid tumours suggests that such strategies will eventually fail owing to a reservoir of resistant non-altered cells within the tumour mass. Instead, therapeutic approaches that exploit tumour-cell vulnerabilities created by homogeneous deletion events50 that arise before genome doubling may be a more effective (although challenging) approach to create durable responses in this patient population.
Methods
Development of KPC PDAC GEMM-ES cell models
p48cre;LSL-KrasG12D;p53flox/WT;CHC;Rosa26-CAGGS-LSL-rtta-IRES-mKate2 (RIK) embryonic stem (ES) cells were derived from embryonic day 3.5 (E3.5) blastocysts collected from superovulated, 3–6-week-old female p48cre;p53flox/flox mice bred with 6–10 week-old male LSL-KrasG12D/+;CHC/CHC;RIK/RIK mice as previously described31. This breeding scheme results in segregation of the CHC and p53flox loci in trans configuration on separate alleles of mouse chromosome 11. p48cre;LSL-KrasG12D;p53LSL-R172H/WT;CHC;RIK ES cells were derived similarly from E3.5 blastocysts collected from superovulated, 3–6-week-old female p48cre;LSL-KrasG12D mice bred with 6–10-week-old male p53LSL-R172H/LSLR172H;CHC/CHC;RIK/RIK mice. This breeding scheme results in linkage of the LSL-p53R172H and CHC loci in cis on mouse chromosome 11. In brief, blastocysts were incubated in an 80 μl drop of KSOM + AA (Millipore) under mineral oil for 7–8 h, washed briefly in M2 medium (Millipore) and cultured and passaged in KOSR + 2i medium using ESGRO Complete Accutase (Millipore) on irradiated DR4 mouse embryonic fibroblast feeder layers until expanded for cryopreservation and genotyping. Male p48cre;LSL-KrasG12D;p53flox/WT;CHC;RIK and p48cre;LSL-KrasG12D;p53LSL-R172H/WT;CHC;RIK ES cells were further expanded for targeting in M15 + LIF medium.
The KPCshRenilla/LOH GEMM-ES cell mouse model was developed through FLP-mediated recombination of an shRNA targeting Renilla luciferase (guide strand: TAGATAAGCATTATAATTCCT, cloned into the cTGM vector32 into the CHC of KPCTRE p48cre;LSL-KrasG12D;p53flox/WT;CHC;RIK ES cells. The KPCshp53 model was generated by targeting p48cre;LSL-KrasG12D;p53flox/WT;CHC;RIK embryonic stem cells with a shRNA targeting mouse Trp53 (guide strand: TTACACATGTACTTGTAGTGG, cloned into cTGM). CHC targeting of KPCcis-shRNA ES cells was performed as described above using FLP-mediated recombination with cTGM vectors encoding shRNAs targeting Renilla luciferase to generate KPCcis-shRenilla or shSmad4 (guide sequence: CAAAGATGAATTGGATTCTTT) to generate KPCcis-shSmad4 ES cells.
Targeting and validation was performed as described previously31 in M15 + LIF medium through co-electroporation (Lonza nucleofector) with shRNA encoding cTGM vectors and FLP-recombinase (CMV-flpe). Targeted cells were selected in hygromycin and resistant clones were isolated and expanded. Correct integration of targeted shRNAs was verified by the genotyping PCRs described below along with two additional tests to determine the number of integrants and conditional function. (1) The TaqMan copy-number assay was performed for shRNA-linked GFP (Invitrogen) according to the manufacturer’s instructions on a ViiA7 RT–PCR machine (Life Technologies). (2) Functionally, clones displaying single integration were treated with adenoviral Cre recombinase (University of Iowa) and grown in doxycycline-containing medium (1 μg ml−1) for 3 days followed by flow cytometry (Guava cytometer, Millipore; Guavasoft v.4.0) to ensure GFP expression was achieved only after Cre-mediated expression of mKate and rtta. All ES cells were confirmed to be free of mycoplasma and other microorganisms before blastocyst injection. Blastocyst injection into albino Bl6 hosts and subsequent implantation into surrogate mothers was performed as described previously31.
In vivo animal studies
All of the animal experiments were performed in accordance with protocol 11-06-018 approved by the Memorial Sloan-Kettering Institutional Animal Care and Use Committee. All of the mouse strains have been previously described: the p48-cre51, LSL-KrasG12D52, p53flox53, LSL-p53R172H54, CHC35 and CAGs-LSL-RIK34 strains were maintained on mixed Bl6/129J backgrounds. Sample sizes for animal experiments were not predetermined, but all of the experiments represent a comparison of at least three different mice or the result of injection of cells from at least three different donors. Sample sizes of each experiment are noted in the figure legends. KPCLOH mice were randomly distributed into groups based on sporadic tumour development. Female athymic nude mice between 6 and 8 weeks of age (Envigo) were used for orthotopic transplant experiments. BL6N mice were crossed for the generation of primary mouse embryonic fibroblasts. As described above, male ES cells were used for the generation of engineered cohorts and therefore male GEMM-ES cell mice were analysed. These male KPCLOH/shRenilla, KPCshp53, KPCcis-shRenilla and KPCcis-shSmad4 mice were maintained on doxycycline chow (625 mg kg−1, replenished twice weekly, Harlan Laboratories) at 4 weeks of age until euthanasia. Athymic nude hosts were placed on doxycycline chow a week before orthotopic transplant and maintained on doxycycline chow until euthanasia. For both orthotopic or autochthonous tumour development studies, mice were immediately euthanized (end point) when the earlier of two conditions were observed: (1) the tumour size reached the limits defined in the approved animal protocol (specifically, tumours did not exceed a maximum diameter of 15 mm) or (2) if the mice presented with body condition indicative of cachexia, signs of discomfort, blood loss, abdominal distension indicative of secondary cancer stigmata like ascites, weight loss of >20% of their initial weight, severe infection, blood loss or difficulty breathing. No tumours exceeded the size limit defined in the approved animal protocol. Animals were housed on a 12 h–12 h light–dark cycle under standard temperature and humidity, at around 18–24 °C and 40–60%, respectively.
Genotyping PCR
Genomic DNA was extracted from sorted cells and primary cultures using the QIAGEN AllPrep DNA/RNA mini kit according to the manufacturer’s instructions. Genomic DNA (10 ng) was amplified using the primers listed below with Herculase polymerase (Agilent) according to the manufacturer’s instructions:
LSL-KrasG12D-WT, LSL cassette, recombined: (1) 5′-GTCTTTCCCCAGCACAGTGC-3′; (2) 5′-CTCTTGCCTACGCCACCAGCTC-3′; and (3) 5′-AGCTAGCCACCATGGCTTGAGTAAGTCTGCA-3′.
Rosa26-LSL-rtta-IRES-mKate2 (RIK): (1) 5′-GGTGAGCGAGCTGATTAAGG-3′; and (2) 5′-TTTTGCTGCCGTACATGAAG-3′.
p53flox WT, flox, recombined: (1) 5′-CACAAAAACAGGTTAAACCCAG-3′; (2) 5′-AGCACATAGGAGGCAGAGAC-3′; and (3) 5′-GAAGACAGAAAAGGGGAGGG-3′.
CHC WT, Col1a1 gene, targeted CHC: (1) 5′-AATCATCCCAGGTGCACAGCATTGCGG-3′; (2) 5′-GGATGTGGAATGTGTGCGAG-3′; (3) 5′-ATCAAGGAAACCCTGGACTACTGCG-3′; and (4) 5′-CTTTGAGGGCTCATGAACCTCCCAGG-3′.
Original images of genotyping PCR gels are provided in Supplementary Fig. 1.
Digital droplet PCR
DNA was extracted from flow-sorted DP and SP cells isolated from PDAC bearing KPCLOH pancreata using AllPrep DNA/RNA Mini kits (Qiagen). Digital droplet PCR was performed on DNA to detect probes targeting WT p53, recombined p53flox, CHC, WT Kras and KrasG12D using the T100 thermal cycler (Bio-Rad) according to the manufacturer’s instructions.
Trp53 WT amplicon sequence and mm10 genome coordinates: TGGGAGCCGTGTCCGCGCCATGGCCATCTACAAGAAGTCACAGCACATGACGGAGGTCGTGAGACGCTGCCCCCACCTGAGCGCTGCTCCGATGGTGATGGTAAGCCCTCAACACCGCCTGT
mm10, chromosome 11: 69588447–69588569:+
KrasG12D (WT/mutant) amplicon sequencing, mm10 genome coordinates: TTATTTTTATTGTAAGGCCTGCTGAAAATGACTGAGTATAAACTTGTGGTGGTTGGAGCTG[G/A]TGGCGTAGGCAAGAGCGCCTTGACGATACAGCTAATTCAGAATCACTTTGTGGATGAGTAT
mm10, chromosome 6: 145246710–145246832:−
Primary cultures
Primary cancer cultures and cell lines were generated from the pancreas of male KPCshRenilla/LOH, KPCshp53, KPCcis-shRenilla and KPCcis-shSmad4 mice after tumour development. Pancreas tissue was diced with scissors and digested with 1 mg ml−1 collagenase V (Sigma-Aldrich) diluted in Hanks buffered saline solution followed by 0.25% trypsin. Digested tissues were washed with complete DMEM (DMEM, 10% FBS (GIBCO), 1× penicillin–streptomycin) and grown in complete DMEM on collagen-coated plates (PurCol, Advanced Biomatrix, 0.1 mg ml−1) supplemented with 1 μg ml−1 doxycycline at 37 °C. Authentication of primary cultures was performed by flow cytometry of engineered fluorescent alleles and all cultures were routinely tested for mycoplasma.
Cell sorting and flow cytometry
Mouse pancreatic tissue for sorting or flow cytometry (as well as from orthotopic transplant) was processed as described previously55. In brief, pancreatic tissue was gently minced with scissors washed with Hanks Buffered Saline solution, and then dissociated by incubation with collagenase V (1 mg ml−1; Thermo Fisher Scientific; in Hanks buffered saline solution), then trypsin (0.05%) and finally dispase (2 U ml−1, Invitrogen). DNase I (100 μg ml−1, Sigma-Aldrich) was added during all enzyme incubations. Cells were washed with PBS between the collagenase and trypsin steps, and with FACS buffer (2% FBS, 10 mM EGTA, in PBS) between the trypsin and dispase steps. Suspensions were then filtered through a 40 μm mesh and resuspended in FACS buffer with 300 nM DAPI for bulk or single-cell sorting into 96-well plates on Aria3 sorters (BD, maintained by the MSKCC Flow Cytometry Facility; FACS Diva v.8.0). Flow cytometry from dissociated pancreata after growth of tumours following orthotopic transplant and from primary cultures at the indicated timepoints was performed on Fortessa instruments (BD, maintained by the MSKCC Flow Cytometry Facility; FACS Diva v.8.0). Ploidy profiling was performed using NST-DAPI buffer as described previously56 on an Attune NxT flow cytometer (Invitrogen; Attune NxT v.3.1). Human PDAC flow cytometry and ploidy profiling for single-cell sequencing is described in more detail below. For calculating the median ploidy of tumour polyploid distributions, the ratio of the median DAPI-area measurements for diploid and polyploids gates was calculated and multiplied by 2 (for example, assuming the diploid distribution has a median ploidy of 2). Supplementary Fig. 2 illustrates the gating strategy based on mKate and GFP and DAPI fluorescence.
Histology, immunofluorescence and immunohistochemistry
Tissues were fixed overnight at 4 °C in 10% formalin before paraffin embedding and sectioning performed by IDEXX/RADIL. H&E staining was performed using conventional protocols by IDEXX/RADIL. For immunofluorescence staining, 5 μm sections were deparaffinized and rehydrated with a histoclear/alcohol series and antigen retrieval was performed by boiling in 1× citrate antigen retrieval buffer (Vector). Slides were blocked in PBS with 5% BSA and primary antibody staining was performed overnight in blocking buffer at 4 °C. For IHC staining endogenous peroxidases were blocked after antigen retrieval by treating slides with 3% hydrogen peroxide for 15 min at room temperature. The following primary antibodies were used: chicken anti-GFP (1:500, Abcam, 13970), rabbit anti-mKate2/Turbo RFP (1:1,000, Evrogen, AB233), mouse anti-Ki-67 (1:500, BD, 550609), rabbit anti-p53 (1:100, NCL-L-p53-CM5p, Leica Biosystems). Primary antibodies were detected using the following fluorescently conjugated secondary antibodies at a 1:500 dilution: goat-anti-chicken AF488 (Invitrogen, A32931), goat anti-rabbit 555 (Invitrogen, A32732), goat anti-mouse AF633 (Invitrogen, A-21052). All secondary antibodies were diluted in blocking buffer and incubated for 1 h at room temperature. Stained slides were washed and nuclei were counterstained with PBS containing DAPI and mounted under cover slips with ProLong Gold (LifeTechnologies). Labelling of primary antibodies for DAB staining was performed using the IHC ImmPRESS HRP Goat Anti-Rabbit IgG Polymer Peroxidase Detection Kit (VectorLabs, MP-7451) for 1 h at room temperature and developed with DAB reagent (VectorLabs, SK-4100). IHC sections were counterstained with haematoxylin and mounted with VectorMount (VectorLabs, H-5000-60) after dehydration. Images were acquired using the Zeiss AxioImager microscope using Zeiss ZEN 3.3 software. Scanning of slides for matched histology (H&E) and immunofluorescence was performed using the Aperio Versa system (Leica) with the Aperio VERSA Application (v.1.0.4) software.
Low-density-sorted growth assay
A total of 500 DP or SP cells sorted from tumour or non-tumour bearing mice were sorted as described into 500 μl of complete DMEM. Cells were transferred to PurCol-coated (as described above) 12-well plates and the cell number was counted after 2 weeks of growth using a Guava flow cytometer (Millipore software, GuavaSoft v.4.0).
Orthotopic transplant assays
DP and SP cells were sorted from tumour- and non-tumour-bearing mice as described. For tumour-bearing mice, 10,000 or 25,000 cells were sorted into FACS buffer (PBS, 2% FBS); for non-tumour bearing mice, 100–1,000 SP cells were sorted into FACS buffer (PBS, 2% FBS). For comparison with SP cells sorted from non-tumour-bearing mice, 100 SP cells from tumour-bearing mice were sorted. Cells were washed with PBS, resuspended in 25 μl of a 1:1 mix of serum-free DMEM and growth-factor-reduced Matrigel (Corning), and injected into the exposed pancreas of athymic nude mice using a Hamilton syringe fitted with a 26-gauge needle. Recipient mice were enrolled on doxycycline chow (625 mg kg−1, Harlan Laboratories) 2 days before surgery and maintained on doxycycline chow until euthanasia due to tumours reaching the IACUC approved humane end points for tumour burden (see above). If tumours did not form, the mice were monitored until they were euthanized due to age-related decline in health.
DNA-FISH analysis
DNA fluorescence in situ hybridization (DNA-FISH) analysis was performed on paraffin sections using a three-colour probe designed to detect copy-number changes of chromosome 2, 9 and 10. The bacterial artificial chromosome clones used in the probe mix were based on sequencing data (minimal region of gain/loss) and are as follows: 9qC (RP23-248H6, RP23-340D4, RP23-60M16; labelled with Spectrum Red) and 2qC (RP23-332C13, RP23-186P20, RP23-435A5; labelled with Spectrum Green). All RP11 clones were purchased from the Roswell Park Cancer Institute Genomics Shared Resource. Probe labelling, hybridization, post-hybridization washing and fluorescence detection were performed according to procedures established at the Molecular Cytogenetics Core Facility. Slides were scanned using a Zeiss Axioplan 2i epifluorescence microscope (Carl Zeiss Microscopy) equipped with Isis imaging software (MetaSystems). The entire section was first scanned through 63× to assess signal pattern. Corresponding H&E and/or immunostained slides were used to identify regions of premalignant or cancer morphology (foci of adenocarcinoma). Regions selected for analysis were imaged through the depth of the tissue (compressed stack of 12 z-sections at 0.5 μm intervals) and, for each case, within each representative region at least 50 discrete nuclei were scored.
Matched immunofluorescence and histology quantification
For quantification of immunofluorescence staining of Ki-67 in mKate/GFP DP and mKate SP cells in PDAC-bearing mice, 5 random ×10 images were collected in grossly premalignant and PDAC tissue areas. For quantification of immunofluorescence staining of Ki-67 in mKate/GFP DP and mKate SP cells in mice before PDAC development, ×20 fields containing all SP cells without PDAC morphology observed in one pancreatic cross-section were collected from nine mice. Three of these mice also displayed focal areas of incipient PDAC where up to 5 random ×20 fields were collected. Tissue sections from the pancreas of tumour- and non-tumour-bearing mice were simultaneously stained for GFP, mKate2, Ki-67 and DAPI. Quantification of the Ki-67 fraction in DP and SP cells was conducted using a semi-automated strategy. First, images were background-subtracted and processed using a Gaussian filter using MATLAB. Second, nuclei (DAPI+) and epithelial (mKate2+ and/or GFP+) cells were identified using Otsu-based segmentation in MATLAB. Third, epithelial cells were classified as either SP or DP on the basis of mKate2 and GFP status. Fourth, cells were nominated as Ki-67+ by applying a fixed threshold, determined by aggregating data from all of the fields of view. Finally, assignment of Ki-67 status was manually corrected by inspecting each image individually. In the case of fields containing rare pre-tumour SP lesions in which DP cells vastly outnumbered SP cells, DP cells were randomly subsampled to match the number of SP cells that were found in the same field of view. Fields of view in which there were no DP or SP cells were excluded from the analysis. For both of these analyses Wilcoxon’s rank-sum test was used to compare the groups.
For determination of the frequency of mKate/GFP DP and mKate SP cells in histopathologically defined premalignant and PDAC cells, lesions were identified from up to 20 random fields on H&E-stained slides from 6 PDAC-bearing pancreata. Lesions of interest were classified by pathologists blinded to immunofluorescence data. Lesions defined as ADM were identified in fields of 4 out of 6 mice analysed and lesions defined as PanIN were identified in fields of 5 out of 6 mice.
For histological classification of mKate SP cells in mice before PDAC development (Fig. 1g), 43 SP-cell-containing lesions were identified in cross-sections of seven pancreata without frank PDAC using immunofluorescence for mKate and GFP as described. These lesions were then identified on sequential, H&E-stained sections and classified by pathologists who were blinded to the immunofluorescence data.
For DNA-FISH quantification based on morphology and p53 genotype, first FISH for chromosomes 2, 9 and 10 was performed on 5 pre-tumour KPCLOH pancreata as described above. FISH images were collected of fields containing either SP cells with PDAC or premalignant morphology based on H&E and fluorescence staining of sequential sections. Next, cover slips were removed from FISH-stained slides and, after washing with PBS for 1 h at 4 °C to remove mounting medium, slides were processed for immunofluorescence staining for mKate and GFP as described. FISH foci were then scored in 50 discrete cells in verified SP cells with PDAC or premalignant morphology.
LMD analysis
Regions for LMD were identified from sequential sections of formalin-fixed paraffin-embedded tissues stained for mKate, GFP and DAPI as described above. Sections for LMD were placed on membrane slides (PPS-Membrane FrameSlides, 11600294, Leica) and subjected to rehydration and light haematoxylin staining. Epithelial regions corresponding to mKate/GFP DP or mKate SP immunofluorescence signal were cut using hand-drawn regions of interest and collected in PCR tubes with an LMD7000 laser microdissection microscope (Leica) using the Leica micro-dissection software (v.7.5.1). DNA was isolated using the Qiagen QIAamp DNA FFPE Advanced Kit (56604) according to the manufacturer’s recommended protocol.
TCGA PDAC SNP array data and human/mouse synteny analysis
Processed, segmented single-nucleotide polymorphism (SNP) array data were download from the TCGA Genomics Data Commons Data Portal (https://portal.gdc.cancer.gov/). To filter out spurious single-probe segments, array probe coordinates were converted to genomic bin coordinates with a 5 bin threshold used to accept/reject a segment. Frequency plot analysis was performed on the transformed dataset (n = 186 patient samples) by aggregating bin values and accepting a threshold values of ±0.1 for gain/deletion designation, respectively, in each bin. Approximately 66% of samples contained TP53 point mutations that were classified as missense compared to truncating, splice site or structural variant somatic variants. Of the missense classified mutations, approximately 30% affected residues designated as hotspots (such as Arg175, Arg248 and Arg273—the three most frequent sites). Human–mouse synteny analysis was performed as described previously57. Human reference human build hg19 and mouse reference genome build mm9 were used for the analysis while selecting a resolution of 400 kb of contiguous sequence.
KPCLOH bulk, LMD and single-cell sequencing
DP and SP cells were disaggregated from PDAC and non-tumour bearing pancreas and processed for flow cytometry as described above. For bulk SP/DP sorting from PDAC samples, the instrument was set in high-purity mode with cells pelleted after cytometry and processed for DNA/RNA extraction using the AllPrep DNA/RNA Mini kit (Qiagen) according to the manufacturer’s instructions. Subsequently, 500 ng of genomic DNA was processed for library generation as previously described58. For single-cell deposition and whole-genome amplification (WGA) from tumour as well as pre-tumour samples, single cells were sorted using the instrument set on single-cell mode with realignment/calibration of the automated cell deposition unit and a wash step performed before processing each unique sample. Single cells (DP and SP) were deposited into wells of a 96-well plate prepared with 9 μl of lysis buffer as described previously56. The number of single cells sequenced per sample category (for example, tumour or pre-tumour) is summarized in Supplementary Table 1. For single-cell WGA, we used a modified version of the DOP-PCR WGA, a WGA approach that has been empirically determined to yield highly accurate single-cell copy-number data59–61. The modified approach introduces inline barcode sequences during the second step of the WGA amplification reaction. After WGA, single-cell WGA DNA products were purified using the QIAquick 96 PCR purification kit according to the manufacturer’s recommendations. Purified DNA was subsequently quantified with equal amounts of WGA DNA (100 ng) per cell pooled and processed for Illumina library sequencing preparation using the NEBNext kit (NEB). For sequencing of LMD material, eluted DNA was subject to the WGA protocol and downstream sequence library preparation as described above. Sequencing libraries were sequenced on the HiSeq2500 instrument using single-end 101 bp reads while targeting an average coverage of 1 million reads per single-cell, a coverage that was previously determined to be sufficient for accurate genome-wide copy-number determination at a bin resolution of 600 kb58. A minimum of 250,000 reads per cell was required for inclusion in downstream analyses.
KPCLOH bulk, LMD and single-cell sequencing data analysis
Bulk, LMD, and single-cell sequencing data were mapped to mouse reference genome build mm9 while skipping the first 50 bp containing inline barcoding sequences as well as the DOP-PCR quasi-degenerate sequence. Uniquely mapped reads were indexed, sorted and PCR duplicates were subsequently removed. Uniquely mapped sequencing reads were counted in genomic bins/intervals that were computed using a previously developed algorithm (Varbin)62. The genome was partitioned into 5,000 bins of approximately 600 kb in length. Read counts were subsequently corrected for GC content using the LOWESS smoothing algorithm63, normalized and segmented using circular binary segmentation64. Ploidy was inferred using a least-squares fitting algorithm while factoring integer bin value assignment distribution58. For downstream single-cell analysis, absolute copy number calls below or above a reference normal (copy number = 2 diploid single cells and copy number = 4 for polyploid single cells) were defined as deletions and gains, respectively, using the algorithm CORE65. Datasets were similarly processed using an orthogonal copy number calling algorithm (HMMcopy47). Results were highly concordant between the two analysis pipelines for both human and mouse datasets and are illustrated in Supplementary Fig. 3. For bulk frequency plot analysis, normalized segments for all of the samples were centred around a mean value of 1 with thresholds of ±0.1 used for defining gained/deleted bins. For illustration of single-cell copy-number profiles across sequence samples, heat maps of PDAC (SP and DP) and pre-tumour SP single cells were constructed using absolute copy-number values imputed from the single-cell data and clustered using the Ward method with Manhattan disease.
Aggregate single-cell genotyping for p53 LOH haplotype inference
A hybrid genome of mm10 with the sequence elements for eGFP and mKate was made and then indexed for BWA. Raw fastq data were first mapped to the hybrid genome using bwa and then counts of reads mapping to the two trans-elements (eGFP and mKate) along with genes Clp1 and Trp53 were collected using bedtools coverage. Reads were filtered to remove supplementary alignments, those marked with alternative hits (XA:Z: bwa flag) and those with a MAPQ < 30. The resulting coverage files were then processed with R scripts to normalize the counts to RPKM values and then plot the distribution of normalized coverage by gene and sample type.
Phylogenetic analysis of KPCLOH single-cell sequencing data
Phylogenetic and clonal relationships between DP and SP cells from PDAC and pre-tumour mice were investigated using two orthogonal methods: a change-point/breakpoint-based analytical method66,67 as implemented using the SCC lust software package (available at GitHub/KrasnitzLab) and a minimum-event distance (MED) method that models whole-genome duplication events in reconstructing phylogenies and ancestral genomes (MEDICC268). In brief, SCClust identifies change points throughout a set of integer-valued copy-number profiles of single-cell genomes. For all pairs of cells, dissimilarity is quantified using Fisher’s exact test for change-point coincidence and used to cluster the cell genomes hierarchically. Nodes of the tree with a statistically significant high degree of similarity among the leaves are identified as sampled from clonal cell populations. For this purpose, statistical significance of dissimilarity within pairs of cells is quantified by the FDR, in comparison to the null distribution of dissimilarities following a random interchange of change points among cells. SCClust was used with the default parameters except for the ‘keepboundaries’ parameter, which was set to True. Figure 3a was prepared using a previously described visualization tool (Single-Cell Genome Viewer)66 available from GitHub/KrasnitzLab. In MEDCICC2, copy-number segments are presented as positive integer vectors with the algorithm solving for MED between pairs of copy-number profiles (for example, profile = aggregate segments of a single cell). The algorithm also models whole-genome duplication events using a set of constraints, including accounting for chromosome boundaries as described in ref. 68. MEDICC2 constructs phylogenetic trees on the basis of imputed MED values over single cells while minimizing the total number of events that explains a transformation of one profile into another. A total of 200 iterations of resampling bootstrap were used in assigning confidence in branching relationships between diploid and polyploid single cells. MEDICC2 provided support tree panels with bootstrap confidence statistics on branch/node relationships are available in Supplementary Figs. 4 and 5 for pre-tumours 1 and 3, respectively. Owing to the lack of heterozygous SNPs in the genome of inbred laboratory mice of C57BL/6NJ strain background (from which the KPCLOH model was constructed) compared to humans, MEDICC2 was run using total copy number without phase information using flag --total-copy-numbers. Compared to an average human individual with around 1.5 million heterozygous SNPs, the average number of heterozygous SNPs of an inbred C57BL/6NJ mouse is 16,000. This statistic was retrieved from the Sanger Mouse Genomes Project (ftp://ftp-mouse.sanger.ac.uk/REL-1303-SNPs_Indels-GRCm38/mgp_v3_stats.txt).
Human PDAC whole-genome sequencing data analysis
WGS data of microdissected pancreatic tumours were downloaded from EGA (accession EGAD00001006152)23. WGS data were subsequently downsampled to a depth of around 10 million sequencing reads with the data processed for copy-number analysis as previously described56. In brief, sequencing reads were mapped to human reference genome hg19 with uniquely mapped reads sorted and indexed. Uniquely mapped reads were subsequently counted in genomic bins while partitioning the genome in 20,000 bins using a previously described algorithm (Varbin)62. Read bin counts were normalized genome-wide with subsequent processing using circular binary segmentation64. For absolute copy-number quantification, segmented data were processed using a least-squares fitting algorithm58 with the parameters of the algorithm constrained by ploidy values derived from a previously published algorithm (CELLULOID)23. As performed for mouse single-cell data, sequencing reads were similarly processed using HMMcopy with the results largely concordant as shown in Supplementary Fig. 6. TP53 mutation type statistics were similar to those described for the TCGA cohort (discussed above). Absolute copy-number data were subsequently partitioned into three categories for downstream analysis: diploid/TP53 monoallelic/WT, diploid/TP53-biallelic and polyploid/TP53-biallelic. For TP53-mutation allelic status associative analysis with ploidy, Fisher exact tests were used for significance. For TP53 allelic and ploidy categories, we derived significant recurrent CNAs using the algorithm CORE65, which computes scores and genomic coordinates for significantly recurrent CNAs with scores proportional to their occurrence in a given dataset (that is, patient cohort). Copy-number calls of less or greater than a reference normal/pseudonormal (2 for diploid; 4 for polyploid) were classified as deletion or gain events, respectively. Statistical tests for the enrichment of event types were based on t-sample t-tests for significance. For analysis of the level of heterogeneity/homogeneity of deletions and gains, FACETS69 (https://github.com/mskcc/facets/; version Oct 8, 2021; commit 3058bba) was used to process whole-genome sequencing data. To generate the SNP count files, a snp-pileup command was run using a cleaned version of dbSnp v.137 (filter to contain only SNV and with duplicates removed). The resulting tumour/normal pileups were then processed using the standard FACETS method as described in the manual. Specifically: mat = readSnpMatrix(countFile), xx = preProcSample(mat,snp.nbhd = 1000), oo = procSample(xx,cval = 500), fit = emcncf(oo). The inferred heterogeneity/homogeneity of chromosomal segments was computed by taking the estimated cell fraction (cf) statistic of each segment relative to the computed purity of the sequenced sample (for example, cf/purity) and analysing the distribution of the resulting statistic using a Kolmogorov–Smirnov test. A two-sample Kolmogorov–Smirnov test was used to determine the significance of the difference between the distributions.
MSK-IMPACT target capture data analysis and clinical correlations
A total of 1,077 PDAC cases sequenced using MSKCC institutional targeted sequencing panel (MSK-IMPACT) were analysed. Zygosity determination, genome-wide total and allele-specific DNA copy number, purity and ploidy were calculated using FACETS (v.0.5.13)69. The expected number of copies for each mutation, used in designating TP53 allelic state, was generated on the basis of the observed variant allele fraction and local ploidy70. Cancer cell fractions were calculated using a binomial distribution and maximum likelihood estimation normalized to produce posterior probabilities71. Inferred heterogeneity/homogeneity of chromosomal segments was performed as described above and similarly tested for statistical significance using a two-sided Kolmogorov–Smirnov test. Clinical annotations of disease spread and patient outcome were performed manually and subsequently used in genomic correlative analysis. For significant of correlation between ploidy status and disease spread, Fisher exact tests were implemented. Univariate Cox regression analysis was used to ascertain prognostic predictive value of disease spread.
Human single-cell sequencing and data analysis
Frozen tissue from nine resected PDACs (four diploid and five polyploid) that were biallelically mutant for TP53 were processed for single-nucleus isolation as previously described56. Research involving resected samples was approved by the MSKCC institutional review board (MSKCC IRB). In brief, two 50 μm tissue sections were cut and mechanically dissociated using a pair of scalpels in the presence of 1 ml of nuclei isolation buffer (NST-DAPI). DAPI-stained nuclei were subsequently sorted on the basis of DNA content with single-nuclei collected from the diploid and polyploid gate (when present) and deposited into 96-well plates preloaded with nucleus lysis buffer as described above. Single-nucleus WGA amplification was performed as described above for mouse SP and DP single cells with downstream sequence library performed using the NEBNext Library preparation kit (NEB). Libraries were sequenced on the HiSeq2500 instrument for single-end 101 bp sequencing while targeting a coverage of around 1–2 million reads per single cell. Single-cell demultiplexed sequencing data were processed for absolute copy-number determination as previously described58. The absolute copy number was converted to the nearest integer, and homogeneity score was inferred as the frequency of the major integer copy-number state for each bin across the genome.
Materials availability
Plasmids, mouse ES cells and primary cancer cell lines generated and used in this work are available from the corresponding author.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Research reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41586-022-05082-5.
Supplementary information
Acknowledgements
We thank A. Levine, M. Wigler, J. Hicks, V. Seshan and all of the members of the Lowe laboratory for discussions throughout the study; F. Fang, K. Daniels and M. Tween for help with flow cytometry; P. Belleau for help in the implementation of the CORE algorithm; S. Goodwin of the CSHL Sequencing Core for sequencing; A. Polydorides for assistance with histopathological examinations of KPCLOH sections; and the staff at the UNC Pathology Services Core for help with tissue processing and imaging. This work was supported by grants from the MSKCC Center for Pancreatic Cancer Research (CPCR; to S.D.L., C.A.I.-D. and S.W.L.), a grant from the Simons Foundation, Life Sciences Founders Directed Giving-Research (award number 519054 to M. Wigler), an NIH/NCI Cancer Center Support grant (P30 CA008748, to N.D.S.) and grants from HHMI and the NIH (P01CA087497 and P01CA13106, to S.W.L.). Funding support was also provided by Philips Research North America to S.W.L. T.B. is supported by the William C. and Joyce C. O’Neil Charitable Trust, Memorial Sloan Kettering Cancer Center. J.P.M.IV is supported by a Pancreatic Cancer Action Network Career Development Award. J.R. is a Howard Hughes Medical Institute Fellow of the Damon Runyon Cancer Research Foundation, DRG-2382-19. K.M.T. was supported by the Jane Coffin Childs Fund for Medical Research and a Shulamit Katzman Endowed Postdoctoral Research Fellowship.
Extended data figures and tables
Source data
Author contributions
T.B., J.P.M.IV and Z.Z. conceived and performed experiments and analysed data with guidance from S.W.L. J.R. performed image analysis and quantification. K.M.T. generated and analysed the KPCcis/shRNA models. J.B., S.T., S.Z., E.G. and N.L. provided technical assistance with sequencing. G.A. and A.Y. performed pathological analysis. K.C. performed and analysed DNA-FISH. T.B., Y.-J.H., A.Z., J.K., L.C., J.W., C.B., Y.G. and N.D.S. analysed sequencing data. A.E. and A.M.V. provided clinical annotations. N.D., G.J.N., M.T.A.D., A.K., F.N., S.D.L., C.A.I.-D. and S.W.L. provided supervision. T.B., J.P.M.IV and S.W.L. wrote the manuscript with input from all of the authors.
Peer review
Peer review information
Nature thanks the anonymous reviewers for their contribution to the peer review of this work.
Data availability
All human and mouse sequencing data generated in this study are publicly available at the Sequence Read Archive (SRA; https://www.ncbi.nlm.nih.gov/sra) under accession number PRJNA718334. Human bulk PDAC sequencing data are available at the European Genome–Phenome Archive (EGA; https://www.ebi.ac.uk/ega/) under accession code EGAD00001006152. EGA data are accessible for research purposes by registration for an EGA account and contacting the Data Access Committee. Source data are provided with this paper.
Code availability
All core code used in this study have been previously published with downloadable and runnable source codes available at GitHub (https://github.com/robertaboukhalil/ginkgo; https://github.com/KrasnitzLab/SCGV; https://github.com/KrasnitzLab/SCclust). The full breakpoint-based phylogenetic analyses including data and code is available online (https://github.com/KrasnitzLab/p53-LOH-figures; https://hub.docker.com/repository/docker/krasnitzlab/p53-loh-figures). For Docker, free registration and installation is required (https://www.docker.com). Analysis and plotting scripts relating to heat maps and heterogeneity/homogeneity metrics are available at GitHub (https://github.com/naikai/p53-LOH-figures).
Competing interests
S.W.L. is a founder and member of the scientific advisory board of Blueprint Medicines, Mirimus, ORIC Pharmaceuticals and Faeth Therapeutics, and is on the scientific advisory board of Constellation Pharmaceuticals and PMV Pharmaceuticals. S.D.L. is on the scientific advisory board of Nybo Therapeutics and Episteme Prognostics. The other authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Timour Baslan, John P. Morris IV, Zhen Zhao
Extended data
is available for this paper at 10.1038/s41586-022-05082-5.
Supplementary information
The online version contains supplementary material available at 10.1038/s41586-022-05082-5.
References
- 1.Eischen CM. Genome stability requires p53. Cold Spring Harb. Perspect. Med. 2016;6:a026096. doi: 10.1101/cshperspect.a026096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.The ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium Pan-cancer analysis of whole genomes. Nature. 2020;578:82–93. doi: 10.1038/s41586-020-1969-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Gerstung M, et al. The evolutionary history of 2,658 cancers. Nature. 2020;578:122–128. doi: 10.1038/s41586-019-1907-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kastenhuber ER, Lowe SW. Putting p53 in context. Cell. 2017;170:1062–1078. doi: 10.1016/j.cell.2017.08.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kastan MB. Wild-type p53: tumors can't stand it. Cell. 2007;128:837–840. doi: 10.1016/j.cell.2007.02.022. [DOI] [PubMed] [Google Scholar]
- 6.Wahl G, Vafa O. Genetic instability, oncogenes, and the p53 pathway. Cold Spring Harb. Symp. Quant. Biol. 2000;65:511–520. doi: 10.1101/sqb.2000.65.511. [DOI] [PubMed] [Google Scholar]
- 7.Livingstone LR, et al. Altered cell cycle arrest and gene amplification potential accompany loss of wild-type p53. Cell. 1992;70:923–935. doi: 10.1016/0092-8674(92)90243-6. [DOI] [PubMed] [Google Scholar]
- 8.Thompson SL, Compton DA. Proliferation of aneuploid human cells is limited by a p53-dependent mechanism. J. Cell Biol. 2010;188:369–381. doi: 10.1083/jcb.200905057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lane DP. Cancer. p53, guardian of the genome. Nature. 1992;358:15–16. doi: 10.1038/358015a0. [DOI] [PubMed] [Google Scholar]
- 10.Morris IV JP, et al. α-Ketoglutarate links p53 to cell fate during tumour suppression. Nature. 2019;573:595–599. doi: 10.1038/s41586-019-1577-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ventura A, et al. Restoration of p53 function leads to tumour regression in vivo. Nature. 2007;445:661–665. doi: 10.1038/nature05541. [DOI] [PubMed] [Google Scholar]
- 12.Feldser DM, et al. Stage-specific sensitivity to p53 restoration during lung cancer progression. Nature. 2010;468:572–575. doi: 10.1038/nature09535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Martins CP, Brown-Swigart L, Evan GI. Modeling the therapeutic efficacy of p53 restoration in tumors. Cell. 2006;127:1323–1334. doi: 10.1016/j.cell.2006.12.007. [DOI] [PubMed] [Google Scholar]
- 14.Xue W, et al. Senescence and tumour clearance is triggered by p53 restoration in murine liver carcinomas. Nature. 2007;445:656–660. doi: 10.1038/nature05529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Donehower LA, et al. Integrated analysis of TP53 gene and pathway alterations in The Cancer Genome Atlas. Cell Rep. 2019;28:1370–1384. doi: 10.1016/j.celrep.2019.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Rausch T, et al. Genome sequencing of pediatric medulloblastoma links catastrophic DNA rearrangements with TP53 mutations. Cell. 2012;148:59–71. doi: 10.1016/j.cell.2011.12.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Fujiwara T, et al. Cytokinesis failure generating tetraploids promotes tumorigenesis in p53-null cells. Nature. 2005;437:1043–1047. doi: 10.1038/nature04217. [DOI] [PubMed] [Google Scholar]
- 18.Bielski CM, et al. Genome doubling shapes the evolution and prognosis of advanced cancers. Nat. Genet. 2018;50:1189–1195. doi: 10.1038/s41588-018-0165-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Raynaud F, Mina M, Tavernari D, Ciriello G. Pan-cancer inference of intra-tumor heterogeneity reveals associations with different forms of genomic instability. PLoS Genet. 2018;14:e1007669. doi: 10.1371/journal.pgen.1007669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Shah SP, et al. The clonal and mutational evolution spectrum of primary triple-negative breast cancers. Nature. 2012;486:395–399. doi: 10.1038/nature10933. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Cancer Genome Atlas Research Network Integrated genomic characterization of pancreatic ductal adenocarcinoma. Cancer Cell. 2017;32:185–203. doi: 10.1016/j.ccell.2017.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Rozenblum E, et al. Tumor-suppressive pathways in pancreatic carcinoma. Cancer Res. 1997;57:1731–1734. [PubMed] [Google Scholar]
- 23.Notta F, et al. A renewed model of pancreatic cancer evolution based on genomic rearrangement patterns. Nature. 2016;538:378–382. doi: 10.1038/nature19823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Waddell N, et al. Whole genomes redefine the mutational landscape of pancreatic cancer. Nature. 2015;518:495–501. doi: 10.1038/nature14169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Chan-Seng-Yue M, et al. Transcription phenotypes of pancreatic cancer are driven by genomic events during tumor evolution. Nat. Genet. 2020;52:231–240. doi: 10.1038/s41588-019-0566-9. [DOI] [PubMed] [Google Scholar]
- 26.Litchfield K, et al. Representative sequencing: unbiased sampling of solid tumor tissue. Cell Rep. 2020;31:107550. doi: 10.1016/j.celrep.2020.107550. [DOI] [PubMed] [Google Scholar]
- 27.Baslan T, Hicks J. Unravelling biology and shifting paradigms in cancer with single-cell sequencing. Nat. Rev. Cancer. 2017;17:557–569. doi: 10.1038/nrc.2017.58. [DOI] [PubMed] [Google Scholar]
- 28.Hingorani SR, et al. Trp53R172H and KrasG12D cooperate to promote chromosomal instability and widely metastatic pancreatic ductal adenocarcinoma in mice. Cancer Cell. 2005;7:469–483. doi: 10.1016/j.ccr.2005.04.023. [DOI] [PubMed] [Google Scholar]
- 29.Maddipati R, Stanger BZ. Pancreatic cancer metastases harbor evidence of polyclonality. Cancer Discov. 2015;5:1086–1097. doi: 10.1158/2159-8290.CD-15-0120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bardeesy N, et al. Both p16Ink4a and the p19Arf-p53 pathway constrain progression of pancreatic adenocarcinoma in the mouse. Proc. Natl Acad. Sci. USA. 2006;103:5947–5952. doi: 10.1073/pnas.0601273103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Saborowski M, et al. A modular and flexible ESC-based mouse model of pancreatic cancer. Genes Dev. 2014;28:85–97. doi: 10.1101/gad.232082.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Livshits G, et al. Arid1a restrains Kras-dependent changes in acinar cell identity. eLife. 2018;7:e35216. doi: 10.7554/eLife.35216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Alonso-Curbelo D, et al. A gene-environment-induced epigenetic program initiates tumorigenesis. Nature. 2021;590:642–648. doi: 10.1038/s41586-020-03147-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Dow LE, et al. Conditional reverse tet-transactivator mouse strains for the efficient induction of TRE-regulated transgenes in mice. PLoS ONE. 2014;9:e95236. doi: 10.1371/journal.pone.0095236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Beard C, Hochedlinger K, Plath K, Wutz A, Jaenisch R. Efficient method to generate single-copy transgenic mice by site-specific integration in embryonic stem cells. Genesis. 2006;44:23–28. doi: 10.1002/gene.20180. [DOI] [PubMed] [Google Scholar]
- 36.Morris JPT, Wang SC, Hebrok M. KRAS, Hedgehog, Wnt and the twisted developmental biology of pancreatic ductal adenocarcinoma. Nat. Rev. Cancer. 2010;10:683–695. doi: 10.1038/nrc2899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Basturk O, et al. A revised classification system and recommendations from the baltimore consensus meeting for neoplastic precursor lesions in the pancreas. Am. J. Surg. Pathol. 2015;39:1730–1741. doi: 10.1097/PAS.0000000000000533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Aichler M, et al. Origin of pancreatic ductal adenocarcinoma from atypical flat lesions: a comparative study in transgenic mice and human tissues. J. Pathol. 2012;226:723–734. doi: 10.1002/path.3017. [DOI] [PubMed] [Google Scholar]
- 39.Mueller S, et al. Evolutionary routes and KRAS dosage define pancreatic cancer phenotypes. Nature. 2018;554:62–68. doi: 10.1038/nature25459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Jones S, et al. Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science. 2008;321:1801–1806. doi: 10.1126/science.1164368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Biankin AV, et al. Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes. Nature. 2012;491:399–405. doi: 10.1038/nature11547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Lomberk G, et al. Distinct epigenetic landscapes underlie the pathobiology of pancreatic cancer subtypes. Nat. Commun. 2018;9:1978. doi: 10.1038/s41467-018-04383-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Huang YH, et al. ID1 mediates escape from TGFβ tumor suppression in pancreatic cancer. Cancer Discov. 2020;10:142–157. doi: 10.1158/2159-8290.CD-19-0529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Maddipati R, et al. MYC levels regulate metastatic heterogeneity in pancreatic adenocarcinoma. Cancer Discov. 2022;12:542–561. doi: 10.1158/2159-8290.CD-20-1826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Cheng DT, et al. Memorial Sloan Kettering—integrated mutation profiling of actionable cancer targets (MSK-IMPACT): a hybridization capture-based next-generation sequencing clinical assay for solid tumor molecular oncology. J. Mol. Diagn. 2015;17:251–264. doi: 10.1016/j.jmoldx.2014.12.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Solimini NL, et al. Recurrent hemizygous deletions in cancers may optimize proliferative potential. Science. 2012;337:104–109. doi: 10.1126/science.1219580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Laks E, et al. Clonal decomposition and DNA replication states defined by scaled single-cell genome sequencing. Cell. 2019;179:1207–1221. doi: 10.1016/j.cell.2019.10.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Watkins TBK, et al. Pervasive chromosomal instability and karyotype order in tumour evolution. Nature. 2020;587:126–132. doi: 10.1038/s41586-020-2698-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Minussi DC, et al. Breast tumours maintain a reservoir of subclonal diversity during expansion. Nature. 2021;592:302–308. doi: 10.1038/s41586-021-03357-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.List A, et al. Lenalidomide in the myelodysplastic syndrome with chromosome 5q deletion. N. Engl. J. Med. 2006;355:1456–1465. doi: 10.1056/NEJMoa061292. [DOI] [PubMed] [Google Scholar]
- 51.Kawaguchi Y, et al. The role of the transcriptional regulator Ptf1a in converting intestinal to pancreatic progenitors. Nat. Genet. 2002;32:128–134. doi: 10.1038/ng959. [DOI] [PubMed] [Google Scholar]
- 52.Jackson EL, et al. Analysis of lung tumor initiation and progression using conditional expression of oncogenic K-ras. Genes Dev. 2001;15:3243–3248. doi: 10.1101/gad.943001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Marino, S. & Vooijs, M. van Der Gulden, H., Jonkers, J. & Berns, A. Induction of medulloblastomas in p53-null mutant mice by somatic inactivation of Rb in the external granular layer cells of the cerebellum. Genes Dev.14, 994–1004 (2000). [PMC free article] [PubMed]
- 54.Olive KP, et al. Mutant p53 gain of function in two mouse models of Li-Fraumeni syndrome. Cell. 2004;119:847–860. doi: 10.1016/j.cell.2004.11.004. [DOI] [PubMed] [Google Scholar]
- 55.Morris JPT, et al. Dicer regulates differentiation and viability during mouse pancreatic cancer initiation. PLoS ONE. 2014;9:e95486. doi: 10.1371/journal.pone.0095486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Baslan T, et al. Genome-wide copy number analysis of single cells. Nat. Protoc. 2012;7:1024–1041. doi: 10.1038/nprot.2012.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Lee J, et al. Synteny Portal: a web-based application portal for synteny block analysis. Nucleic Acids Res. 2016;44:W35–W40. doi: 10.1093/nar/gkw310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Baslan T, et al. Optimizing sparse sequencing of single cells for highly multiplex copy number profiling. Genome Res. 2015;25:714–724. doi: 10.1101/gr.188060.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Garvin T, et al. Interactive analysis and assessment of single-cell copy-number variations. Nat. Methods. 2015;12:1058–1060. doi: 10.1038/nmeth.3578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.de Bourcy CF, et al. A quantitative comparison of single-cell whole genome amplification methods. PLoS ONE. 2014;9:e105585. doi: 10.1371/journal.pone.0105585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Cai X, et al. Single-cell, genome-wide sequencing identifies clonal somatic copy-number variation in the human brain. Cell Rep. 2014;8:1280–1289. doi: 10.1016/j.celrep.2014.07.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Navin N, et al. Tumour evolution inferred by single-cell sequencing. Nature. 2011;472:90–94. doi: 10.1038/nature09807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Cleveland WS. Lowess—a program for smoothing scatterplots by robust locally weighted regression. Am. Stat. 1981;35:54. doi: 10.2307/2683591. [DOI] [Google Scholar]
- 64.Venkatraman ES, Olshen AB. A faster circular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics. 2007;23:657–663. doi: 10.1093/bioinformatics/btl646. [DOI] [PubMed] [Google Scholar]
- 65.Krasnitz A, Sun G, Andrews P, Wigler M. Target inference from collections of genomic intervals. Proc. Natl Acad. Sci. USA. 2013;110:E2271–E2278. doi: 10.1073/pnas.1306909110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Chorbadjiev L, et al. Integrated computational pipeline for single-cell genomic profiling. JCO Clin. Cancer Inform. 2020;4:464–471. doi: 10.1200/CCI.19.00171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Alexander J, et al. Utility of single-cell genomics in diagnostic evaluation of prostate cancer. Cancer Res. 2018;78:348–358. doi: 10.1158/0008-5472.CAN-17-1138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Kaufmann, T. L. et al. MEDICC2: whole-genome doubling aware copy-number phylogenies for cancer evolution. Preprint at bioRxiv10.1101/2021.02.28.433227 (2021). [DOI] [PMC free article] [PubMed]
- 69.Shen R, Seshan VE. FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing. Nucleic Acids Res. 2016;44:e131. doi: 10.1093/nar/gkw520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Dentro SC, Wedge DC, Van Loo P. Principles of reconstructing the subclonal architecture of cancers. Cold Spring Harb. Perspect. Med. 2017;7:a026625. doi: 10.1101/cshperspect.a026625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.McGranahan N, et al. Clonal status of actionable driver events and the timing of mutational processes in cancer evolution. Sci. Transl. Med. 2015;7:283ra254. doi: 10.1126/scitranslmed.aaa1408. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All human and mouse sequencing data generated in this study are publicly available at the Sequence Read Archive (SRA; https://www.ncbi.nlm.nih.gov/sra) under accession number PRJNA718334. Human bulk PDAC sequencing data are available at the European Genome–Phenome Archive (EGA; https://www.ebi.ac.uk/ega/) under accession code EGAD00001006152. EGA data are accessible for research purposes by registration for an EGA account and contacting the Data Access Committee. Source data are provided with this paper.
All core code used in this study have been previously published with downloadable and runnable source codes available at GitHub (https://github.com/robertaboukhalil/ginkgo; https://github.com/KrasnitzLab/SCGV; https://github.com/KrasnitzLab/SCclust). The full breakpoint-based phylogenetic analyses including data and code is available online (https://github.com/KrasnitzLab/p53-LOH-figures; https://hub.docker.com/repository/docker/krasnitzlab/p53-loh-figures). For Docker, free registration and installation is required (https://www.docker.com). Analysis and plotting scripts relating to heat maps and heterogeneity/homogeneity metrics are available at GitHub (https://github.com/naikai/p53-LOH-figures).