Abstract
The chromosomal theory of inheritance dictates that genes on the same chromosome segregate together while genes on different chromosomes assort independently1. Extrachromosomal DNAs (ecDNAs) are common in cancer and drive oncogene amplification, dysregulated gene expression and intratumoural heterogeneity through random segregation during cell division2,3. Distinct ecDNA sequences, termed ecDNA species, can co-exist to facilitate intermolecular cooperation in cancer cells4. How multiple ecDNA species within a tumour cell are assorted and maintained across somatic cell generations is unclear. Here we show that cooperative ecDNA species are coordinately inherited through mitotic co-segregation. Imaging and single-cell analyses show that multiple ecDNAs encoding distinct oncogenes co-occur and are correlated in copy number in human cancer cells. ecDNA species are coordinately segregated asymmetrically during mitosis, resulting in daughter cells with simultaneous copy-number gains in multiple ecDNA species before any selection. Intermolecular proximity and active transcription at the start of mitosis facilitate the coordinated segregation of ecDNA species, and transcription inhibition reduces co-segregation. Computational modelling reveals the quantitative principles of ecDNA co-segregation and co-selection, predicting their observed distributions in cancer cells. Coordinated inheritance of ecDNAs enables co-amplification of specialized ecDNAs containing only enhancer elements and guides therapeutic strategies to jointly deplete cooperating ecDNA oncogenes. Coordinated inheritance of ecDNAs confers stability to oncogene cooperation and novel gene regulatory circuits, allowing winning combinations of epigenetic states to be transmitted across cell generations.
Subject terms: Tumour heterogeneity, Cancer genetics, Genome, Cytogenetics, Cell division
Cooperative species of extrachromosomal DNAs are coordinately inherited through mitotic co-segregation.
Main
Oncogene amplification drives cancer development by increasing the copies of genetic sequences that encode oncogene products. Oncogenes are frequently amplified on megabase-sized circular ecDNA, which is detected in half of human cancer types5. First reported in 1965 (ref. 6), ecDNA amplifications (also known as double minutes7) have been shown to promote cancer development by driving copy-number heterogeneity5,8 and rapid adaptation to selective pressure in cancer9–11. This heterogeneity and adaptability can be attributed to the fact that, although ecDNA is replicated in each cell cycle and transmitted through cell division, owing to their lack of centromeres, ecDNA molecules are inherited unevenly among daughter cells during mitosis12–14.
ecDNAs exhibit a substantial level of genetic sequence diversity. First, multiple ecDNAs originally derived from different chromosomal loci can co-exist in the same cancer cell, often congregating in micrometre-sized hubs in the nucleus that enable intermolecular gene activation between distinct ecDNAs4,15. Second, ecDNAs contain clustered somatic mutations that suggest APOBEC3-mediated mutagenesis16, increasing the diversity of ecDNA sequence and function16–18. Third, ecDNAs can contain complex structural rearrangements of sequences originating from various genomic sites4,11,18–21. DNA damage can cause ecDNAs to cluster and sometimes become incorporated into micronuclei7,22–24, where DNA can further fragment and recombine25–27. These rearrangement events can give rise to diverse, co-existing ecDNA species in a cell population, including ecDNAs with distinct oncogene loci4,11,18,20,28 or encompassing only enhancers or oncogene coding sequences18.
Observations of diverse ecDNA species co-occurring in the same cell containing distinct oncogenes4,11,18,20 suggest that ecDNAs may represent specialized, cooperative molecules. For example, it has been reported that new ecDNA species can form in cells after recurrence or drug treatment of ecDNA-carrying cancers while the original ecDNA amplicons are retained11,29, suggesting that multiple ecDNA species may arise independently and that their interaction provides fitness advantages to cancer cells. Concordantly, ecDNAs carrying oncogenes alongside non-coding regulatory elements can interact with each other and with chromosomes in a combinatorial manner to promote gene expression3,4,30. These observations lend support to the hypothesis that the co-occurrence of multiple ecDNA sequences in a cell may have combinatorial and synergistic effects on transcriptional programs.
The diversity of ecDNA genetic sequences and importance of intermolecular interactions between ecDNAs in a cancer cell population raises the questions of (1) how heterogeneous ecDNA species are distributed in a cell population; (2) as ecDNAs are segregated unequally during mitosis, how these mixtures of ecDNAs are inherited by daughter cells; and (3) how the dynamics of multiple ecDNA species affect cancer evolution under selective pressure. Using a combination of image analysis, single-cell and bulk sequencing, and computational modelling, we set out to elucidate the principles and consequences of ecDNA co-evolution in cancer.
Distinct ecDNAs co-occur in cancer cells
To examine how frequently ecDNA molecules with distinct sequences co-exist in the same tumours, we first analysed ecDNA structures predicted from whole-genome sequencing (WGS) data in The Cancer Genome Atlas19 (TCGA; Methods). This analysis revealed that 289 out of 1,513 patient tumours contained ecDNA, carrying coding sequences of well-characterized oncogenes such as EGFR, MDM2 and CDK4 (refs. 5,19) (Fig. 1a,b). Among tumours that contained ecDNA, more than 25% (81 samples) contained two or more ecDNA species in the same tumour (Fig. 1a and Extended Data Fig. 1a). Many of these ecDNA species were highly amplified and contained canonical oncogenes (Fig. 1b), supporting the idea that heterogeneous ecDNA sequences can be found in the same tumour and their co-occurrence may provide distinct selective advantages (such as CCND2, EGFR and MDM4 in a glioblastoma sample, and MYC and KRAS in a urothelial bladder carcinoma sample; Extended Data Fig. 1b). As we considered only highly abundant and genomically non-overlapping ecDNA sequences as distinct species, this analysis probably underestimates the true diversity of ecDNA species.
The frequent co-amplification of distinct ecDNA species in tumours raised the question of whether multiple ecDNA species can co-occur in the same cells. We examined a panel of cancer cell line and neurosphere models that were previously characterized to contain multiple ecDNA species4,5,9 (Fig. 1c). After validating each cell line using DNA fluorescence in situ hybridization (FISH) analysis of metaphase chromosome spreads (Fig. 1d,e), we found that the vast majority of individual cells had very little overlap in FISH signals from distinct oncogenes on chromosome spreads (ranging from 2–7%; Fig. 1e and Extended Data Fig. 2a–i). These data confirmed that distinct ecDNAs are not covalently linked on the same ecDNA molecule and are therefore expected to be inherited independently from one another in dividing cancer cells.
We next examined the distributions of ecDNA copy numbers in single cells using three orthogonal methods (Fig. 1d): (1) metaphase chromosome spreading followed by DNA-FISH; (2) isolation of single nuclei followed by droplet-based single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) and RNA sequencing (RNA-seq); and (3) enrichment and sequencing of ecDNAs in individual cells through exonuclease digestion and rolling circle amplification31 (scCircle-seq; Methods). Notably, in cell lines with distinct ecDNA species, FISH imaging revealed that pairs of ecDNA species had significantly correlated copy numbers (Spearman correlation R = 0.24–0.52, P < 0.05 in all cases; Fig. 1f and Extended Data Fig. 2j–n). We next assessed the significance of these correlations in a larger population of 71,804 cells from a subpanel of cell lines by adapting a copy-number quantification method for genomic background coverage from scATAC-seq data4,32,33 to calculate ecDNA copy numbers (Fig. 1d,g, Methods and Extended Data Fig. 3a). Notably, we observed positive correlations between distinct ecDNA species in each of the three cell lines with multiple ecDNA species (Fig. 1h–j and Extended Data Fig. 3b,c; Pearson correlation, R = 0.26–0.46, P < 1 × 10−15 in all cases). As expected, genic sequences that are covalently linked on the same ecDNA molecule (as demonstrated by isolation from the same molecular size fractions using CRISPR–CATCH18; Extended Data Fig. 3d) showed strong copy-number correlation in this analysis, validating this approach for measuring the distributions of ecDNA molecules in a cell population (Fig. 1i,j and Extended Data Fig. 3b,c). ecDNA copy numbers were positively correlated with RNA expression of the correspondingly amplified oncogenes, supporting the idea that the copies of ecDNA species drive transcriptional outcomes (Extended Data Fig. 3e). Importantly, we did not observe copy-number correlations between gene pairs located on different chromosomes, suggesting that this relationship between different ecDNA species cannot simply be explained by sequencing quality (Fig. 1i,j and Extended Data Fig. 3b,c). Finally, single-cell Circle-seq confirmed co-enrichment of the MYCN, MDM2 and CDK4 ecDNA species in individual TR14 neuroblastoma cells (Extended Data Fig. 3f).
To investigate whether patient tumours with variable oncogene copy numbers exhibit a similar signature of copy-number correlation in single cells, we curated a dataset of 41 tumour samples from publicly available scATAC-seq or single-cell DNA-seq data of triple-negative breast cancer, high-grade serous ovarian cancer and glioblastoma34–36. We devised a statistical approach for identifying focal amplifications using single-cell copy-number profiles and validated our ability to identify ecDNA amplifications in well-characterized cell lines (Methods and Extended Data Fig. 4a). Applying this approach to patient tumours, we found that 15 out of 41 (37%) cases had focal amplifications matching the signature of ecDNA. We further predicted 7 cases (17% of all samples) with focal amplification of two or more oncogenes with significantly correlated copy numbers in single cells, suggestive of co-amplified distinct ecDNA species (Extended Data Fig. 4b,c).
Together, these results show that distinct ecDNA species tend to co-occur with correlated copy numbers far more than expected by chance both in cancer cell lines and patient samples.
Distinct ecDNA species co-segregate
In principle, our observations of co-occurrence and correlation of two distinct ecDNA species can be the result of (1) hyper-replication of ecDNAs in a subpopulation of cells; (2) co-selection of both species, given that both species provide fitness advantages and/or engage in synergistic intermolecular interactions; or (3) co-segregation of both species into daughter cells during cell division. To investigate whether hyper-replication contributes to the observed ecDNA correlation, we evaluated copy-number correlations in cells across different phases of the cell cycle using the single-cell multi-omics data (Methods). We observed no additional co-enrichment of ecDNA in cells that have replicated their DNA (Extended Data Fig. 5a–c), which is consistent with previous literature reporting that ecDNA is replicated once per cell cycle, along with genomic DNA, during S phase37,38. Conversely, as different ecDNA species can carry different oncogenes and mixed ecDNAs can interact with each other to increase gene expression4,30, co-selection can reasonably explain co-occurrence of ecDNA species. However, given their stochastic segregation into daughter cells12–14, it is unclear how a collective of ecDNA species and their cooperative interactions are preserved over successive cell divisions (Fig. 2a).
To address this question, we assessed the distribution of multiple ecDNA species during a single cell division. Using DNA-FISH combined with immunofluorescence staining for Aurora kinase B, a component of the mitotic midbody, we quantified the copy numbers of ecDNA inherited among daughter cell pairs undergoing mitosis12,39 (Fig. 2b). Notably, in all five cancer cell lines containing multiple distinct ecDNA species (Fig. 1c,e and Extended Data Fig. 2), we observed significant co-segregation of distinct ecDNA species to daughter cells as measured by the correlated proportions of ecDNAs inherited (R = 0.4–0.71, P < 1 × 10−4 in each case; Fig. 2c,d, Methods and Extended Data Fig. 5d). In other words, the daughter cell that inherits more copies of ecDNA species 1 tends to inherit more copies of species 2, and vice versa. Simulations of segregating ecDNAs showed that this correlation of ecDNA species in daughter cells is far greater than expected from random segregation, or the levels of co-inheritance contributed by rare covalent fusions of ecDNAs, and scales linearly with the level of co-segregation of ecDNAs (Methods and Extended Data Fig. 5e–g). It is unlikely that this result would be driven by cellular volumetric differences as ecDNA segregate by colocalizing with mitotic chromosomes rather than spreading by diffusion4,13,40. Together, these data show that, while individual ecDNAs segregate into daughter cells following a binomial distribution12,14, collectives of ecDNA species may co-segregate during mitosis.
Transcription promotes co-segregation
We next investigated the molecular mechanism of ecDNA co-segregation. Previous studies have shown that ecDNAs aggregate in response to artificially induced DNA damage22,23; more recent reports showed that damaged DNA fragments are tethered together in mitosis by the CIP2A–TOPBP1 complex and co-segregate25,41. However, CIP2A localizes to DNA breaks and does not to bind to intact ecDNAs41. Consistent with this report, we found that genetic knockout of CIP2A had no significant effect on co-segregation of ecDNA species (Extended Data Fig. 5h–j).
As we and others have previously reported that different ecDNA species interact with one another through intermolecular contacts at transcriptionally active sites in ecDNA hubs during interphase4,15, we examined whether their co-segregation may be related to intermolecular proximity in the nucleus. To visualize ecDNA hubs during mitosis using live-cell imaging, we used the colorectal cancer COLO 320DM cell line with a Tet-operator (TetO) array inserted into MYC ecDNAs and fluorescently labelled ecDNA molecules using TetR-mNeonGreen (Methods). We observed in many cases that hubs of ecDNA molecules remained as a unit throughout mitosis, with many ecDNA molecules co-segregating into the same daughter nucleus (Extended Data Fig. 6a). Clusters of ecDNAs in G2 phase remained spatially proximal as cells entered mitosis, attached to the condensing chromosomes, and therefore co-segregated into the same daughter nucleus as a unit (Extended Data Fig. 6b). Inhibition of the bromodomain and extraterminal domain (BET) family of proteins has previously been shown to reduce ecDNA clustering4; while the level of ecDNA co-segregation showed a downward trend with BRD4 degradation (Methods), the effect was not significant, potentially due to incomplete degradation and compensatory effects by other members of the BET protein family (Extended Data Fig. 7a–c). To investigate the idea that intermolecular contacts at transcriptionally active sites may promote coordinated inheritance of ecDNA species, we next examined whether transcription inhibition can disrupt ecDNA co-segregation. We tested three different transcription inhibitors—triptolide, 5,6-dichlorobenzimidazole 1-β-d-ribofuranoside (DRB) and actinomycin D—targeting various steps of transcription initiation and elongation by RNA polymerase II42–45 (Fig. 2e and Extended Data Fig. 7d–g). We found that triptolide uniquely reduced ecDNA co-segregation in five cancer cell line models as measured by DNA-FISH and Aurora kinase B immunofluorescence imaging of late mitotic cells (P = 0.00399 for paired comparisons of all cell lines with triptolide treatments; in individual cell line comparisons, P < 0.05 in SNU16m1, CA718 and H716, and not significant in GBM39KT-D10 and TR14; DRB and actinomycin D had no effect on co-segregation in SNU16m1; Fig. 2f and Extended Data Fig. 7h,i). To further exclude potential off-target effects from triptolide, we pretreated cells with an antagonist of triptolide, ZL-12A, which induces the degradation of the transcription factor IIH (TFIIH) helicase ERCC3 by reacting with the same cysteine (Cys342) as triptolide, thereby attenuating triptolide-triggered degradation of RNA polymerase II46 (Extended Data Fig. 7d). Pretreatment with ZL-12A blocked the effects of triptolide on active RNA polymerase II as well as co-segregation of ecDNA species (Extended Data Fig. 7j–m), confirming the specific effect of transcription initiation on ecDNA co-segregation. As triptolide acts on transcription initiation through the TFIIH complex rather than elongation of RNA transcripts45 (Extended Data Fig. 7d), these results suggested that transcription initiation, but not transcription elongation, promotes ecDNA co-segregation. We observed this reduction of ecDNA co-segregation after only 3.5 h of triptolide treatment, suggesting that transcription inhibition very shortly before or during mitosis can disrupt ecDNA co-segregation. Consistent with this result, ecDNA remains transcriptionally active at the onset of mitosis, as shown by nascent oncogene RNA-FISH signal in ecDNA-containing cells at prometaphase but not when the same oncogene is located on chromosomes (Fig. 2g). Together, our live-cell imaging and chemical perturbation experiments support the idea that intermolecular proximity and active transcription before and at the start of mitosis facilitate the coordinated inheritance of ecDNA species into daughter cells.
Modelling of ecDNA co-assortment
With the observation of co-segregation of ecDNAs, we next assessed the respective contributions of co-selection and co-segregation in shaping the patterns of ecDNA co-assortment using evolutionary modelling. Similar to previous work12, we implemented an individual-based, forward-time evolutionary framework to study ecDNA dynamics in a growing tumour population (Fig. 3a and Methods). This model is instantiated with a single founding cell carrying two distinct ecDNA species with the same copy number. Cells divide or die according to a ‘fitness’ function that determines their birth rate based on the presence of each ecDNA species. During cell division, ecDNA copies are inherited among daughter cells according to a ‘co-segregation’ parameter: a value of 0 indicates independent random segregation and a value of 1 indicates perfectly correlated segregation. By simulating 1 million cancer cells under fixed selection for two individual ecDNA species (Fig. 3b–e and Extended Data Fig. 8a,b), we found that (1) co-occurrence of ecDNA species is mainly driven by co-selection pressure acting over multiple generations with modest synergy from co-segregation (Fig. 3b,c); and (2) copy-number correlation of ecDNAs in cells is mainly driven by co-segregation alone, in which proportional copies of ecDNAs are inherited during cell division (Fig. 3d,e). Once a cancer cell population reaches high copy numbers, ecDNA co-occurrence becomes relatively stable (Extended Data Fig. 8b). We further validated these trends using an alternative model of ecDNA evolution (Methods and Extended Data Fig. 8c–e).
As co-selection and co-occurrence leave distinct signatures on the joint distributions of ecDNAs (Fig. 3f and Extended Data Fig. 8a), we sought to infer the levels of ecDNA co-selection and co-segregation based on experimentally observed ecDNA copy-number distributions in cells. Pairing our evolutionary model with ecDNA copy-number distributions obtained with scATAC-seq, we used approximate Bayesian computation (ABC)47,48 to infer posterior distributions for individual selection, co-selection and co-segregation of ecDNA species (Fig. 3g, Methods and Extended Data Fig. 8f–h). As validation, the inferred levels of co-segregation closely matched those experimentally observed in dividing cells using DNA-FISH (Fig. 2c,d and Fig. 3g and Extended Data Fig. 5d). This analysis inferred high levels of co-selection of ecDNA species relative to their individual selection in cancer cells (Extended Data Fig. 8g,h). Co-selection becomes less critical at higher initial copy numbers for our inference procedure (in effect widening the 95% credible interval) while the co-segregation parameter remains stable across copy numbers (Extended Data Fig. 8h), consistent with the idea that co-segregation of ecDNA species maintains their correlated distributions in cells even at high ecDNA abundance. Together, these results show that co-selection and co-segregation underpin the co-assortment of ecDNAs in cancer cell populations (Fig. 3h).
An altruistic enhancer-only ecDNA
We next assessed how co-selection and co-segregation contribute to the distributions of ecDNAs that do not themselves encode oncogenes but interact with other ecDNA molecules. We recently identified an ecDNA species in the parental SNU16 gastric cancer cell line that contains no oncogene-coding sequences but, instead, originated from a non-coding genomic region between WDR11 and FGFR2. This region has accessible chromatin, is marked by histone H3 lysine 27 acetylation (H3K27ac) and contacts the FGFR2 promoter, suggesting the presence of active enhancers18 (Fig. 4a and Extended Data Fig. 9a). At least one of these enhancer regions is required for oncogene activation on ecDNA, as evidenced by the reduced expression of FGFR2 after targeting the enhancer region by CRISPR interference4 (Extended Data Fig. 9a,b). Long-read sequencing revealed that this enhancer ecDNA resulted from two inverted DNA segments joining together to create a circular molecule (Extended Data Fig. 9a). As intermolecular interactions of regulatory elements between different ecDNA molecules can drive oncogene expression3,4, the presence of amplified enhancer elements in the pool of ecDNA molecules may support enhancer–promoter interactions in trans and further upregulate oncogene expression—that is, an ‘altruistic’ ecDNA. An enhancer-only ecDNA may be especially sensitive to the co-occurrence of oncogene-coding ecDNAs in the same ecDNA hubs to exert its regulatory effect. Simulations under our model of ecDNA co-evolution suggested that co-segregation and co-selection synergize to maintain enhancer-only ecDNAs with oncogene-encoding ecDNAs in a majority of cancer cells (Fig. 4b,c) and that co-selection is particularly important to maintain enhancer-only ecDNAs (Extended Data Fig. 9c,d).
To quantify the frequency of enhancer-only ecDNA species, we performed metaphase DNA-FISH with separate, non-overlapping probes targeting the MYC and FGFR2 coding sequences, as well as the enhancer sequence (Methods). This analysis showed that approximately 20% of ecDNA molecules in SNU16 cells contained this enhancer sequence without either oncogene (consistent with CRISPR–CATCH enrichment in the parental SNU16 line; Fig. 4a) and that the vast majority of individual cells (98%, 63 out of 64 cells examined) contained the enhancer-only ecDNA species (Fig. 4d,e). Analysis of pairs of daughter cells undergoing mitosis further showed co-segregation of the enhancer sequence with both MYC and FGFR2 ecDNA molecules significantly above levels that can be explained by covalent linkages alone (R > 0.80, P < 1 × 10−6 for each comparison; Fig. 4f,g and Methods). These results support the theory that specialized ecDNAs without oncogenes can arise and be stably maintained by virtue of synergistic interaction with oncogene-carrying ecDNA.
Pharmacological effects on ecDNA species
ecDNAs can drive rapid genome evolution in response to pharmacological treatment, including through modulation of copy number29 and generation of new ecDNAs containing resistance-promoting genes11,18. We hypothesized that co-segregation and co-selection of ecDNA species that interact in trans could lead to coupled copy-number dynamics in response to targeted drug treatment. To test this hypothesis, we performed drug treatment with pemigatinib, an FGFR2 inhibitor49, using the SNU16m1 gastric cancer monoclonal cell line (containing MYC and FGFR2 ecDNAs that engage in intermolecular enhancer–promoter interactions4; Fig. 4h). Despite the clonal nature of the SNU16m1 cells, there is a high level of ecDNA copy-number heterogeneity among cells (5–300 copies of MYC ecDNA and 100–500 copies of FGFR2 ecDNA in individual cells; Extended Data Fig. 2j). The MYC and FGFR2 ecDNA species are correlated in copy number among these clonal cells (Extended Data Fig. 2j), consistent with the idea that a single-cell clone can establish heterogeneous yet correlated copy numbers of ecDNA species in progeny cells through asymmetric co-segregation during cell division. Pemigatinib was predicted to reduce the selective advantage of cells with amplified FGFR2 expression, leading to loss of FGFR2 ecDNAs in the cell population over time. When cells are treated with a drug that targets a single ecDNA species (such as pemigatinib targeting the gene product of FGFR2 ecDNAs), our simulations predicted coordinated copy-number dynamics of co-existing ecDNAs only if they co-segregate (Fig. 4h,i, Methods and Extended Data Fig. 10a). Simulations further predicted that drug removal would allow steady recovery of the copy number of the targeted ecDNA species (Fig. 4i).
To test these predictions experimentally, we treated SNU16m1 cells with 5 μM pemigatinib over 6 weeks (Fig. 4h,j). As predicted by simulations of co-segregating ecDNAs, this targeted FGFR2 inhibition led to an initial coordinated depletion of both FGFR2 and MYC ecDNAs (Fig. 4h,j and Extended Data Fig. 10b), supporting the idea that the two ecDNA species are coordinately inherited despite not being covalently linked (separate ecDNA species were validated by metaphase DNA-FISH after the first 3 weeks of drug treatment; Extended Data Fig. 10c,e). However, while cells that were continuously treated with pemigatinib maintained low FGFR2 copy numbers, MYC ecDNA copy numbers recovered after week 3 and became further amplified, suggesting that MYC ecDNAs may eventually be selected in cells resistant to drug treatment (Fig. 4j (dark orange)). We further found that, while MYC had been selected on ecDNAs at high copy numbers, the remaining FGFR2 copies increasingly integrated into chromosomes by week 6 (Fig. 4k). Importantly, while previous studies have reported that ecDNA can integrate into chromosomes5,50,51, our results suggest that its chromosomal integration can promote drug resistance by the evasion of co-inheritance (Fig. 4l and Extended Data Fig. 10f). A 2-week temporary removal of pemigatinib in the middle of the experiment resulted in recovery of FGFR2 and MYC ecDNA copy numbers and re-established sensitivity to co-depletion of both ecDNA species once the drug was re-added, showing that the coordinated copy-number dynamics can be rapidly re-established within a few cell generations (Fig. 4j (light orange)). Finally, pemigatinib did not result in MYC ecDNA loss in the COLO 320DM colorectal cancer cell line, which does not contain FGFR2 ecDNAs (Fig. 1c and Extended Data Fig. 10d), showing that the loss of MYC ecDNAs in SNU16m1 cells is specifically due to the coupling with FGFR2 ecDNAs.
To further demonstrate the generality of these coordinated dynamics of ecDNA species under selective pressure, we treated the neuroblastoma TR14 cells with nutlin-3a, a targeted inhibitor of MDM2. MDM2 inhibition led to concomitant depletion of co-segregating MDM2 and MYCN ecDNAs in a TP53-dependent manner, demonstrating molecular specificity of ecDNA co-depletion to MDM2 activity through the TP53 pathway (Methods and Extended Data Fig. 10g–j). Conversely, the coordinated depletion of ecDNAs under targeted inhibition cannot be explained by a general cytotoxic effect on rapidly dividing cells, as general cytotoxic drugs did not always reduce ecDNA contents (etoposide or fluorouracil; Extended Data Fig. 10k–l; low-dose hydroxyurea reduced ecDNA contents as reported previously52,53).
Together, these results demonstrate that pharmacological targeting of an oncogene carried by one ecDNA species can coordinately regulate co-existing ecDNA species, driven by both reduced selective advantage for a particular oncogene (for example, pemigatinib targeting FGFR2) and indirect effects on additional ecDNA species through physical co-segregation. However, resistance can emerge when ecDNA co-inheritance is uncoupled through chromosomal integration of the drug-targeted oncogene.
Discussion
ecDNA amplifications in cancer are highly heterogeneous and dynamic, involving mixtures of DNA species that evolve and increase in complexity over time and in response to selective pressures such as drug treatments31,54. Through single-cell sequencing, imaging, evolutionary modelling and chemical perturbations across multiple cancer types, we have shown that diverse ecDNA species co-occur in cancer cells, that they co-segregate during mitosis, and that these evolutionary associations contribute to ecDNA specialization and response to targeted therapy. We have also shown that intermolecular interactions and active transcription promote co-segregation of ecDNA species. We provide evidence that ecDNA co-segregation is distinct from the damage-induced clustering of DNA fragments by the CIP2A–TOPBP1 complex25,41 (Extended Data Fig. 7), probably because the majority of ecDNAs lack double-stranded breaks (as shown by pulsed-field gel electrophoresis18), which are required for CIP2A recruitment41.
While individual ecDNAs are stochastically inherited during mitosis12,14, co-segregation and co-selection of distinct ecDNAs synergistically maintain a collective of cooperating ecDNAs across cell generations. This coordinated behaviour of ecDNA collectives presents implications for our understanding of cancer evolution and development of cancer therapies. First, co-selection of structurally diverse ecDNAs can lead to functional specialization (such as enhancer-only ecDNAs), suggesting that interactive modules of ecDNAs may exist, for example, within intermolecular ecDNA hubs3,4,15. Second, our pharmacological experiments show that therapeutic interventions targeting the gene product of an ecDNA species may impact co-existing ecDNAs and further underscore that co-segregation of ecDNA species gives rise to highly dynamic and complex behaviours under selective pressure. However, the eventual uncoupling of ecDNA species suggests that therapies naively exploiting co-segregation are not guaranteed to ‘cure’ tumour cells of ecDNA. Rather, acute targeted therapy can induce rapid, potentially therapeutically advantageous, genome remodelling as a consequence of ecDNA co-segregation. Third, our computational framework can assess ecDNA co-segregation and co-selection from single-cell genomic or imaging data, therefore offering opportunities to understand how ecDNAs co-evolve in tumours.
ecDNAs exhibit aggressive behaviour in cancer cells as they can rapidly shift in copy number and evolve novel gene regulatory relationships4,12. This accelerated evolution and ability to explore genetic and epigenetic space is challenged by its potentially transient nature—a winning combination of ecDNAs may not be present in the next daughter cell generation if they are randomly transmitted. ecDNA co-inheritance enables cancer cells to balance accelerated evolution with a measure of genetic and epigenetic memory across cell generations, increasing the probability that combinations of ecDNA species will be transmitted together to daughter cells (Fig. 3e). The consequence is a jackpot effect that supports cooperation among heterogeneous ecDNAs, enabling the co-amplification of multiple oncogenes and continued diversification of cancer genomes. Beyond cancer evolution, our general framework for coordinated asymmetric inheritance may be applicable to viral episomes, subcellular organelles or biomolecular condensates that control cell fates.
Methods
Cell culture
The TR14 neuroblastoma cell line was a gift from J. J. Molenaar (Princess Máxima Center for Pediatric Oncology). Cell line identity for the master stock was verified by STR genotyping (IDEXX BioResearch). The GBM39-KT cell line was derived from a patient with glioblastoma undergoing surgery at Mayo Clinic, Rochester, Minnesota as described previously55. Monoclonal spheroids were isolated from GBM39-KT cells by limiting dilution to generate GBM39-KT-D10. The CA718 cell line was derived from a patient with glioblastoma as described previously5 and was obtained from the University of California San Diego Moores Cancer Center. Parental SNU16, COLO 320DM, H716 and HCT116 cells were obtained from ATCC. The monoclonal SNU16m1 was a sub-line of the parental SNU16 cells generated from a single cell after lentiviral transduction and stable expression of dCas9-KRAB as we previously described4. SNU16 and SNU16m1 cells were maintained in Dulbecco’s modified Eagle’s medium/nutrient mixture F-12 (DMEM/F12 1:1; Gibco, 11320-082), 10% fetal bovine serum (FBS; Hyclone, SH30396.03) and 1% penicillin–streptomycin (Thermo Fisher Scientific, 15140-122). COLO 320DM cells were maintained in DMEM (Thermo Fisher Scientific, 11995073) supplemented with 10% FBS and 1% penicillin–streptomycin. GBM39-KT cells were maintained in DMEM/F12 1:1, B-27 supplement (Gibco, 17504044), 1% penicillin–streptomycin, GlutaMAX (Gibco, 35050061), human epidermal growth factor (EGF, 20 ng ml−1; Sigma-Aldrich, E9644), human fibroblast growth factor (FGF, 20 ng ml−1; Peprotech) and heparin (5 μg ml−1; Sigma-Aldrich, H3149-500KU). TR14 cells were grown in RPMI 1640 with 20% FBS and 1% penicillin–streptomycin. For the mitotic cell imaging experiments in Fig. 2, SNU16m1 cells were grown in RPMI 1640 with 10% FBS. H716 cells were grown in ATCC formulated RPMI 1640 (Gibco, A1049101) with 10% FBS and 1% penicillin–streptomycin–glutamine. COLO 320DM cells used for live-cell imaging, PC3 and HCT116 were cultured in DMEM (Corning, 10-013-CV) with 10% FBS and 1% penicillin–streptomycin–glutamine. All cells were cultured at 37 °C with 5% CO2. All cell lines tested negative for mycoplasma contamination.
Chemicals
BRD4 bivalent degrader was a gift from M. M. Hassan and N. S. Gray, and was resuspended in DMSO as 10 mM stock56. Triptolide (Millipore, 645900) was resuspended with DMSO as 55 mM stocks and were used at a final concentration of 10 µM. Actinomycin D (Millipore Sigma, SBR00013) was used at a final concentration of 5 µg ml−1. DRB (Sigma-Aldrich, D1916) was resuspended with DMSO as 70 mM stocks and was used at a final concentration of 200 µg ml−1. ZL-12A was synthesized as reported previously46 and resuspended in DMSO as 20 mM stock, and was used at a final concentration of 50 µM for 3 h. In the pretreatment assay with triptolide, ZL-12A was added for 3 h, followed by a wash-off with 1× PBS and the addition of DMSO or triptolide (10 µM) for 3.5 h.
Genetic knockout of CIP2A
CIP2A-knockout cells were created using the SNU16m1 cells as follows. We designed a guide RNA sequence targeting the protein-coding region of CIP2A using CHOPCHOP57 (https://chopchop.cbu.uib.no), as well as a non-targeting control sgRNA (guide sequences are provided in Supplementary Table 1). To deliver each guide with CRISPR–Cas9 into cells, we mixed purified S. pyogenes Cas9 nuclease (Alt-R S.p. Cas9 Nuclease V3; IDT, 1081058) with each single-guide RNA (sgRNA; diluted to 30 μM in 1× TE buffer; Synthego) at a 1:6 molar ratio in Neon Resuspension Buffer R (Thermo Fisher Scientific) and incubated it at room temperature for 10 min to form Cas9 ribonucleoprotein (RNP) complexes. SNU16m1 cells were collected and washed twice with 1× PBS before being resuspended in Buffer R with Cas9 RNPs for a final concentration of 300,000 cells per 10 μl Neon reaction with 0.71 μM Cas9 complexes. Transfection was performed using the Neon Transfection System (Thermo Fisher Scientific, MPK5000) according to the manufacturer’s protocol using 10 μl tips with the following parameters: 1,400 V, 20 m s−1, 2 pulses. Three Neon reactions per guide condition were combined, resulting in 900,000 cells for either the control or CIP2A-knockout genotype.
WGS
WGS libraries were prepared by DNA tagmentation. We first transposed genomic DNA with Tn5 transposase produced as previously described58, in a 50 µl reaction with TD buffer59, 50 ng DNA and 1 µl transposase. The reaction was performed at 50 °C for 5 min, and transposed DNA was purified using the MinElute PCR Purification Kit (Qiagen, 28006). Libraries were generated by 5–7 rounds of PCR amplification using the NEBNext High-Fidelity 2× PCR Master Mix (NEB, M0541L), purified using SPRIselect reagent kit (Beckman Coulter, B23317) with double size selection (0.8× right, 1.2× left) and sequenced on the Illumina NextSeq 550 or the Illumina NovaSeq 6000 platform. Reads were trimmed of adapter content with Trimmomatic60 (v.0.39), aligned to the hg19 genome using BWA MEM61 (0.7.17-r1188) and PCR duplicates were removed using Picard’s MarkDuplicates (v.2.25.3). WGS data from bulk SNU16 cells were previously generated (SRR530826, Genome Research Foundation).
Analysis of ecDNA sequences in TCGA patient tumours
We performed ecDNA detection based on bulk WGS data from TCGA using the AmpliconArchitect (AA) method for genomic focal amplification analysis. The outputs of this method were previously published19. In brief, this approach for detecting ecDNA uses three general steps which are wrapped into a workflow we call AmpliconSuite-pipeline (https://github.com/AmpliconSuite/AmpliconSuite-pipeline, v.1.1.1). First, given a BAM file, the analysis pipeline performs detection of seed regions where copy-number amplifications exist (CN > 4.5 and size between 10 kb and 10 Mb). Second, AA performs joint analysis of copy number and breakpoint detection in the focally amplified regions, forming a copy-number aware local genome graph. AA extracts paths representing genome structures and substructures from this graph that explains the changes in copy number. Last, a rule-based classification is performed using AmpliconClassifier (AC)62, based on the paths extracted by AA to predict the mode of focal amplification. This includes assessing structural variant types, segment copy numbers and the structure of the genome paths extracted by AA. Moreover, AC identifies ecDNA cycles based on criteria such as cyclic path length and copy number, providing a comprehensive classification system for amplicons on the basis of their structural characteristics. For example, if the changes in copy number are explained predominantly by one or more circular genome paths featuring a structural variant enclosing them with a head-to-tail circularization, this is consistent with an ecDNA mode of amplification, whereas a breakage-fusion-bridge genome structure contains multiple foldbacks and multiple genomic segments arranged in a palindrome. The complete classification criteria and description of the AC tool are available in the supplementary information of ref. 62.
We used AA (v.1.0) outputs from a previous study19, and classified focal amplifications types present in these outputs using AC (v.0.4.14) with the ‘--filter_similar’ flag set and otherwise the default settings. The ‘--filter_similar’ option removes probable false-positive focal amplification calls that contain far greater-than-expected levels of overlapping structural variants and shared genomic boundaries between ecDNAs of unrelated samples. In brief, AC scores the structural similarity of focal amplifications. These scores consider both genomic interval overlap and shared breakpoint junctions, with breakpoints deemed to be shared if their total distance is less than a specified threshold (default = 250 bp). Moreover, AC computes similarity scores for amplicons from unrelated origins, establishing a background null distribution for comparison. The tool uses a β-distribution model to fit the empirical null distribution, providing estimation of statistical significance of the similarity score. Out of 8,810 AA amplicons in the ref. 19 TCGA dataset, 45 candidate focal amplifications were removed by this filter.
To predict the distinct number of ecDNA species present in a sample, we used the genome intervals reported by AC for each focal amplification. AC determines the number of distinct, genomically non-overlapping ecDNA species present by clustering ecDNA genome intervals if those regions are connected by structural variants or the boundaries of the regions are within 500 kb. If intervals do not meet this criteria, AC predicts them as being unconnected and reports them as separate ecDNA species. AC uses a list of oncogenes that combines genes in the ONGene database (https://pubmed.ncbi.nlm.nih.gov/28162959/) and COSMIC (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6450507/).
Paired scATAC-seq and scRNA-seq library generation
Single-cell paired RNA-seq and ATAC-seq libraries were generated on the 10x Chromium Single-Cell Multiome ATAC + Gene Expression platform according to the manufacturer’s protocol and sequenced on an Illumina NovaSeq 6000 system. Data for COLO 320DM were generated previously4 and published under Gene Expression Omnibus (GEO) accession GSE159986.
Paired scATAC-seq and scRNA-seq analysis
A custom reference package for hg19 was created using cellranger-arc mkref (10x Genomics, v.1.0.0). The single-cell paired RNA-seq and ATAC-seq reads were aligned to the hg19 reference genome using cellranger-arc count (10x Genomics, v.1.0.0).
Subsequent analyses on RNA were performed using Seurat (v.3.2.3)63, and those on ATAC-seq were performed using ArchR (v.1.0.1)64. Cells with more than 200 unique RNA features, less than 20% mitochondrial RNA reads and less than 50,000 total RNA reads were retained for further analyses. Doublets were removed using ArchR. Raw RNA counts were log-normalized using Seurat’s NormalizeData function and scaled using the ScaleData function. Dimensionality reduction for the ATAC-seq data was performed using Iterative Latent Semantic Indexing (LSI) with the addIterativeLSI function in ArchR.
We next calculated amplicon copy numbers based on background ATAC-seq signals as we previously described and validated4,32. In brief, we determined read counts in large intervals across the genome using a sliding window of 3 Mb moving in 1 Mb increments across the reference genome. Genomic regions with known mapping artifacts were filtered out using the ENCODE hg19 blacklist. For each interval, insertions per bp were calculated and compared to 100 of its nearest neighbours with matched GC nucleotide content. The mean log2[fold change] was computed for each interval. On the basis of a diploid genome, copy numbers were calculated using the formula ), where CN denotes copy number and FC denotes mean fold change compared with neighbouring intervals. To query the copy numbers of a gene, we obtained all genomic intervals that overlapped with the annotated gene sequence and computed the mean copy number of those intervals.
For analyses presented in Extended Data Fig. 5a–c, we inferred cell cycle stage from each cell’s RNA-seq data using the CellCycleScoring function in Seurat and the gene sets for S and G2M phases included in the Seurat package. Copy-number correlations were then evaluated for cells grouped by their inferred cell cycle phase: G1, S, or G2M.
scCircle-seq analysis
TR14 scCircle-seq data were previously generated65 and deposited at the European Genome-Phenome Archive (EGA) under accession number EGAS00001007026. A detailed description of the single-cell extrachromosomal circular DNA and transcriptome sequencing (scEC&T-seq) protocol is available at Nature Protocol Exchange (10.21203/rs.3.pex-2180/v1)66. Single cells were sorted, separation of genomic DNA and mRNA was performed by G&T-seq67 and genomic DNA of single cells was subjected to exonuclease digestion and rolling-circle amplification as described previously65.
The processing of scCircle-seq reads is described in detail previously65. In brief, scCircle-seq sequencing reads were 3′ trimmed for quality using Trim Galore (v.0.6.4)68, and adapter sequences with reads shorter than 20 nucleotides were removed. The alignment of reads to the human reference assembly hg19 was performed using BWA MEM (v.0.7.15) with the default parameters69. PCR and optical duplicates were removed using Picard (v.2.16.0). Sequencing coverage across mitochondrial DNA was used as an internal control to evaluate circular DNA enrichment. Cells that exhibited less than 10 reads per bp sequence-read depth over mitochondrial DNA or less than 85% genomic bases captured in mitochondrial DNA were excluded from further analyses65.
Read counts from scCircle-seq BAM files were quantified in 1 kb bins across TR14 ecDNA regions (MYNC, CDK4, MDM2) as defined by ecDNA reconstruction analyses in TR14 bulk populations described previously4. To account for differences in sequencing depth among cells, read counts were normalized to library size.
Analysis of copy-number correlations of amplified oncogenes in human tumour samples
Copy numbers computed for single cells using scATAC-seq as described above (see the ‘Paired scATAC-seq and scRNA-seq analysis’ section) were used to devise a statistical approach for predicting ecDNA. We reasoned that, due to the random segregation of individual ecDNA molecules, ecDNA focal amplifications would be characterized by not only elevated mean copy number but also inflated copy-number variance. Indeed, classifying amplifications with a mean copy number of ≥4 and variance/mean ratio of ≥2.5 specifically classified only known ecDNAs in validated cell lines (Extended Data Fig. 4a).
We applied this statistical approach to a curated dataset of 41 tumours (from triple-negative breast cancer (TNBC), high-grade serous ovarian cancer (HGSC) and glioblastoma) with publicly available scATAC-seq or scDNA-seq data34–36. For TNBC and HGSC tumours profiled with scDNA-seq data in ref. 35, we used the author-provided single-cell copy numbers available on Zenodo (10.5281/zenodo.6998936). Processed scATAC-seq data for glioblastoma samples were obtained from ref. 34 and ref. 36 (GEO accession number GSE163655), and copy numbers were computed as described above (see the ‘Paired scATAC-seq and scRNA-seq analysis’ section) in 3 Mb genomic windows. Putative ecDNAs were predicted using the decision rule determined from validated cell lines, and copy numbers were determined for oncogenes by averaging copy numbers of windows overlapping with the oncogene of interest. Copy-number correlations were computed across oncogenes, only considering cells where the oncogene was amplified with a copy-number ≥4.
ecDNA isolation by CRISPR–CATCH
Molecular isolation of ecDNA by CRISPR–CATCH was performed as previously described18. In brief, molten 1% certified low-melting-point agarose (Bio-Rad, 1613112) in PBS was equilibrated to 45 °C. In total, 1 million cells were pelleted per condition, washed twice with cold 1× PBS, resuspended in 30 µl PBS and briefly heated to 37 °C. Then, 30 µl agarose solution was added to cells, mixed, transferred to a plug mould (Bio-Rad, 1703713) and incubated on ice for 10 min. Solid agarose plugs containing cells were ejected into 1.5 ml Eppendorf tubes, suspended in buffer SDE (1% SDS, 25 mM EDTA at pH 8.0) and placed onto a shaker for 10 min. The buffer was removed and buffer ES (1% N-laurolsarcosine sodium salt solution, 25 mM EDTA at pH 8.0, 50 µg ml−1 proteinase K) was added. Agarose plugs were incubated in buffer ES at 50 °C overnight. The next day, proteinase K was inactivated with 25 mM EDTA with 1 mM PMSF for 1 h at room temperature with shaking. Plugs were then treated with RNase A (1 mg ml−1) in 25 mM EDTA for 30 min at 37 °C and washed with 25 mM EDTA with a 5 min incubation. Plugs not directly used for ecDNA enrichment were stored in 25 mM EDTA at 4 °C.
To perform in vitro Cas9 digestion, agarose plugs containing DNA were washed three times with 1× NEBuffer 3.1 (New England BioLabs) with 5 min incubations. Next, DNA was digested in a reaction with 30 nM sgRNA (Synthego) and 30 nM spCas9 (New England BioLabs, M0386S) after pre-incubation of the reaction mix at room temperature for 10 min. Cas9 digestion was performed at 37 °C for 4 h, followed by overnight digestion with 3 µl proteinase K (20 mg ml−1) in a 200 µl reaction. The next day, proteinase K was inactivated with 1 mM PMSF for 1 h with shaking. The plugs were then washed with 0.5× TAE buffer three times with 5 min incubations. The plugs were loaded into a 1% certified low-melting-point agarose gel (Bio-Rad, 1613112) in 0.5× TAE buffer with ladders (CHEF DNA Size Marker, 0.2–2.2 Mb; Saccharomyces cerevisiae ladder, Bio-Rad, 1703605; CHEF DNA size marker, 1–3.1 Mb; Hansenula wingei ladder, Bio-Rad, 1703667) and pulsed-field gel electrophoresis was performed using the CHEF Mapper XA System (Bio-Rad) according to the manufacturer’s instructions and using the following settings: 0.5× TAE running buffer, 14 °C, two-state mode, run time duration of 16 h 39 min, initial switch time of 20.16 s, final switch time of 2 min 55.12 s, gradient of 6 V cm−1, included angle of 120° and linear ramping. The gel was stained with 3× Gelred (Biotium) with 0.1 M NaCl on a rocker for 30 min covered from light and imaged. The bands were then extracted and DNA was isolated from agarose blocks using beta-Agarase I (New England BioLabs, M0392L) according to the manufacturer’s instructions. All guide sequences are provided in Supplementary Table 1.
Short-read sequencing of ecDNA isolated by CRISPR–CATCH
Sequencing of ecDNA isolated by CRISPR–CATCH was performed as previously described18. In brief, we transposed DNA with Tn5 transposase produced as previously described58 in a 50 µl reaction with TD buffer59, 10 ng DNA and 1 µl transposase. The reaction was performed at 50 °C for 5 min, and transposed DNA was purified using the MinElute PCR Purification Kit (Qiagen, 28006). The libraries were generated by 7–9 rounds of PCR amplification using NEBNext High-Fidelity 2× PCR Master Mix (NEB, M0541L), purified using SPRIselect reagent kit (Beckman Coulter, B23317) with double size selection (0.8× right, 1.2× left) and sequenced on the Illumina NextSeq 550 or the Illumina NovaSeq 6000 platform. Sequencing data were processed as described above for WGS. CRISPR–CATCH sequencing data for SNU16m1 (bands 30–34) and COLO 320DM (bands a–m) used in Extended Data Fig. 3 were generated previously4 and deposited at the NCBI Sequence Read Archive (SRA) under BioProject accession PRJNA670737; CRISPR–CATCH sequencing data for SNU16 (MYC, FGFR2 and enhancer ecDNAs) used in Fig. 4 were generated previously18 and deposited at the NCBI SRA under BioProject accession PRJNA777710.
Metaphase DNA-FISH
TR14 neuroblastoma cells were grown to 70% confluency in a 15 cm dish and treated with KaryoMAX Colcemid (Gibco) for 4 h. A mitotic shake off was performed and the medium of the cells was collected. The remaining cells were washed with PBS and treated with trypsin-EDTA 0.05% (Gibco) for 2 min. The cells were washed again with the collected medium and centrifuged at 300g for 10 min. The pellet was resuspended at 0.075 M KCl and left at 37 °C for 20 min. The sample was centrifuged at 300g for 5 min. The cell pellet was resuspended carefully in 10 ml Carnoy’s solution and centrifuged at 300g for 5 min. This wash step was repeated four times using 5 ml of Carnoy’s solution. The remaining pellet was resuspended in 400 µl of Carnoy’s solution. Then, 12 µl of the suspension was dropped on preheated slides from a height of approximately 15 cm. The slides were held over a heated water bath (55 °C) for 1 min. The slides were aged overnight at room temperature. Slides were prepared for staining according to the probe manufacturer’s protocol (DNA-FISH metaphase chromosome spreads, Arbor Biosciences). Before staining, the slides were first washed in PBS, followed by a wash in 65 °C SSCT (5 ml 20× SSC, 500 µl 10% Tween-20, and brought up to 50 ml with molecular-grade H2O) for 15 min. The slides were next washed twice for 2 min with room temperature SSCT. Dehydration of the slides was performed in 70% and 90% ethanol for 5 min each. After air-drying, the slides were transferred into 0.07 N NaOH for 3 min for chemical denaturation. After two washes for 5 min in SSCT, the dehydration step was repeated, and the slides were air-dried. The probes used for staining were designed to target the MYCN, MDM2 and CDK4 gene using myTags (Arbor), conjugated as following: CDK4-Alexa 488, MYCN-Atto 550, MDM2-Atto 633. Then, 10 µl of the hybridization buffer (in SSCT: 50 % formamide, 10% dextran sulphate, 40 ng µl−1 RNase A) was mixed with 1.5 µl of each resuspended probe. This mixture was headed to 70 °C for 5 min and stored on ice. Then, 14.5 µl of this mixture was added to the slide, which was covered by a cover glass and sealed with rubber cement. The slides were incubated in a hybridization chamber (Abbott Molecular) overnight at 37 °C. The next day, the rubber cement and cover glass were removed, and the sample was washed in prewarmed (37 °C) SSCT for 30 min. The slides were then washed at room temperature with 2× SSCT for 5 min each followed by a 5 min wash with PBS. The air-dried slide was stained with Hoechst (1: 4,000 for 2 min) and washed with PBS for another 5 min. After drying, the slides were mounted using ProLong Glass Antifade Mountant (Thermo Fisher Scientific) and sealed with a coverglass. Imaging of TR14 metaphase spreads was done on the Leica Stellaris 8 system (Advanced Light Microscopy Facility, Max-Delbrück Center for Molecular Medicine) using a ×63 oil objective with a ×2 zoom. Excitation was done using the 405 nm, 488 nm, 561 nm and 538 nm lasers and detection was done using two HyD S and one HyD X and HyD R detectors. 4× line averaging was applied to each channel.
For the GBM39-KT, GBM39-KT-D10, SNU16, SNU16m1, CA718 and H716 cell lines, cells were treated with KaryoMAX Colcemid (Gibco) at 100 ng ml−1 for 3 h, and single-cell suspensions were then collected by centrifugation and washed once in 1× PBS. The cells were treated with 0.75 M KCl hypotonic buffer for 20 min at 37 °C, and fixed with Carnoy’s fixative (3:1 methanol:glacial acetic acid) followed by three additional washes with the same fixative. The samples were then dropped onto humidified glass slides and air-dried. The glass slides were then briefly equilibrated in 2× SSC buffer, dehydrated in ascending ethanol concentrations of 70%, 85% and 100% for 2 min each. FISH probes (Empire Genomics) were diluted in hybridization buffer in 1:6 ratio and covered with a coverslip. The samples were denatured at 75 °C for 3 min and hybridized at 37 °C overnight in a humidified slide moat. The samples were washed with 0.4× SSC for 2 min, and 2× SSC 0.1% Tween-20 for another 2 min. The nuclei were stained with 4,6-diamidino-2-phenylindole (DAPI) (50 ng ml−1) diluted in 2× SSC for about a minute, and washed once briefly in double-distilled H2O. Air-dried samples were mounted with ProLong Diamond. Images were acquired on a Leica DMi8 widefield microscope using a 63× oil objective.
Metaphase DNA-FISH image analysis
Colocalization analysis for two- and three-colour metaphase FISH described in Fig. 1 and Extended Data Fig. 2 was performed using Fiji (v.2.1.0/1.53c)70. Images were split into the individual FISH colours + DAPI channels, and the signal threshold was set manually to remove background fluorescence. Overlapping FISH signals were segmented using watershed segmentation. FISH signals were counted using particle analysis. xy coordinates of pixels containing FISH signals were saved along with image dimensions and coordinates of regions of interest (ROIs) as distinct particle identities (for example, distinct ecDNA molecules). Colocalization was then quantified in R. Each pixel containing FISH signal was assigned to the nearest overlapping ROI using xy coordinates. Unique ROIs in all colour channels were summarized such that ROIs in different channels that overlap with one another by one pixel or more in the same image were considered as colocalized.
Colocalization analysis for two-colour metaphase FISH data for ecDNAs in SNU16m1 cells described in Extended Data Fig. 10 was performed using Fiji (v.2.1.0/1.53c)70. Images were split into the two FISH colours + DAPI channels, and signal threshold set manually to remove background fluorescence. Overlapping FISH signals were segmented using watershed segmentation. Colocalization was quantified using the ImageJ-Colocalization Threshold program and individual and colocalized FISH signals were counted using particle analysis.
Immunofluorescence staining and DNA-FISH in mitotic cells
For assessing mitotic segregation of ecDNA in GBM39-KT, GBM39KT-D10, TR14, SNU16m1, CA718 and H716 cells shown in Fig. 2 and Extended Data Figs. 5 and 7, asynchronous cells were grown on coverslips coated with either poly-l-lysine or poly-d-lysine (laminin for GBM39-KT and GBM39KT-D10). Cells were washed once with PBS and fixed with cold 4% paraformaldehyde (PFA) at room temperature for 10−15 min. The samples were permeabilized with 0.5% Triton X-100 in PBS for 10 min at room temperature and then washed with PBS. The samples were then blocked with 3% BSA in PBS 0.05% Triton X-100 for 30 min at room temperature. The samples were incubated in primary antibody (Aurora kinase B polyclonal antibody, 1:200 dilution, A300-431A, Thermo Fisher Scientific; BRD4 antibody, 1:200, ab245285, Abcam; RNA polymerase II CTD repeat YSPTSPS (phosphorylated Ser2) antibody (3E10), ab252855, Abcam; CIP2A antibody, 1:400 dilution, NBP2-48710, Novus Biologicals; all diluted in 3% BSA) for either 1 h at room temperature or overnight at 4 °C. The samples were washed three times in PBS 0.05% Triton X-100. The samples were incubated in fluorophore-conjugated secondary antibody (1:500 in 3% BSA) for 1 h at room temperature (with all of the subsequent steps in the dark) and then washed three times in PBS 0.05% Triton X-100. Cells were washed once with PBS and refixed with cold 4% PFA for 20 min at room temperature. The coverslips were then washed once in 1× PBS, and incubated with freshly prepared 0.7% Triton X-100 in 1× PBS with 0.1 M HCl for 10 min on ice, followed by acid denaturation of DNA strands with 1.9 M HCl for 30 min at room temperature. They were then dehydrated in ascending ethanol concentrations of 70%, 85% and 100% for approximately 2 min each. FISH probes (Empire Genomics) were diluted 1:4 in hybridization buffer (Empire Genomics) and added to the sample with the addition of a slide. The samples were denatured at 75 °C for 3 min and then hybridized at 37 °C overnight in a humid and dark chamber. The samples were then washed with 0.4× SSC then 2× SSC 0.1% Tween-20 (all washes lasting approximately 2 min). DAPI (100 ng ml−1) was applied to samples for 10 min. The samples were then washed again with 2× SSC 0.1% Tween-20 then 2× SSC. The samples were briefly washed in double-distilled H2O and mounted with ProLong Gold. The slides were sealed with nail polish. The samples were imaged either on a DeltaVision Elite Cell Imaging System (Applied Precision), on an Olympus widefield microscope (IX-71; Olympus) controlled by the SoftWoRx software v.6.5.2 (Applied Precision) and a ×60 objective lens with a CoolSNAP HQ2 camera (Photometrics), or on a Leica DMi8 widefield microscope using a ×63 oil objective lens. z stacks were acquired and used to generate maximum-intensity projections (ImageJ or LAS X) for downstream analysis. Images acquired on the Leica DMi8 were subjected to deconvolution using either small-volume computational clearing or large-volume computational clearing before making maximum-intensity projections.
For assessing mitotic segregation of oncogene and enhancer ecDNAs in SNU16 cells as shown in Fig. 4, cells were seeded onto fibronectin-coated 22 × 22 coverslips contained in a six-well culture plate at about 70% confluence. Then, 24 h after cell seeding, the cells were fixed with 4% PFA and permeabilized with 1× PBS containing 0.25% Triton X-100. The samples were blocked with 3% BSA-1× PBS for 1 h at room temperature, followed by primary antibody incubation (Aurora B kinase antibody; A300-431A; Thermo Fisher Scientific) (1:200 in 3% BSA) overnight at 4 °C. The sample was washed three times in 1× PBS followed by incubation with diluted an anti-rabbit Alexa Fluor 647 antibody (donkey anti-rabbit IgG (H+L) highly cross-adsorbed secondary antibody, Alexa Fluor 647, A31573, Invitrogen; 1:500 dilution in 3% BSA) for 1 h at room temperature. The sample is then washed three times in 1× PBS and fixed with 4% PFA for 20 min at room temperature. DNA-FISH was performed as described in the ‘Metaphase DNA-FISH’ section, with the conditions to heat denaturation changed to 80 °C for 20 min. Images were acquired on a Leica DMi8 widefield microscope using a ×63 oil objective, and each z plane was post-processed by small-volume computational clearing on LAS X before generating maximum-projection images.
Mitotic cell imaging analysis
To quantify fractions of ecDNAs segregated to each daughter cell in pairs of dividing cells as shown in Fig. 2 and Extended Data Figs. 5 and 7, ecDNA pixel intensity was quantified from maximum intensity projections using ImageJ. ecDNA pixel intensity was measured using the ‘Integrated Density’ measurement from ImageJ. Before quantification, the background signal from FISH probes was removed uniformly for the entire image until all background signal from the daughter cell nuclei was removed. We further filtered out images with poor quality, those with overlapping nuclei that did not allow for accurate segmentation and those showing cells with unclear daughter cell pairings based on Aurora kinase B staining. To measure the fractions of ecDNAs segregated to daughter cells after inhibitor treatments, segmentation of daughter cells and measurement of DNA-FISH abundance was performed on maximum-intensity projections using AIVIA Software (Leica Microsystems). Individual machine-learning-based pixel classifiers were trained on the channels corresponding to the FISH probes of interest and DAPI to create confidence masks for FISH signal and nuclei, respectively. The confidence masks were used to create a recipe to segment individual FISH puncta and assign each punctum to a segmented daughter cell. The fractional inheritance of each ecDNA species was estimated by comparing the FISH area in the daughter cells of each corresponding pair. The abundances of proteins of interest (RNA Pol II pSer2, CIP2A and BRD4) were quantified using AIVIA software by measuring the pixel intensity values in the segmented nuclei.
To quantify the fractions of oncogene and enhancer ecDNAs segregated to daughter cells as shown in Fig. 4, the images were split into the different FISH colours + DAPI channels, and the signal threshold was set manually to remove background fluorescence using Fiji (v.2.1.0/1.53c)70. Overlapping FISH signals were segmented using watershed segmentation. All FISH colour channels except for DAPI were stacked and ROIs were drawn manually to identify the two daughter cells, after which the colour channels were split again and image pixel areas occupied by FISH signals were analysed using particle analysis. Fractions of ecDNAs in each daughter cell were estimated by fractions of FISH pixels in the given daughter cell.
Intron RNA-FISH
Intron RNA-FISH was performed using Stellaris RNA FISH system (LGC Biosearch Technologies), with the manufacturer’s protocol for adherent cells. Intron RNA-FISH probe was designed against MYC intron 2 sequence (hg38) using the Stellaris Probe Designer tool (maximum number of probes = 48, oligo length = 20, minimum spacing length = 2), the final probe design for MYC intron 2 consists of 31 probes and was tagged with the Quasar 570 fluorophore. Images were acquired on the Leica DMi8 system using a ×63 oil objective to obtain z stack images, which underwent small-volume computational clearing before making maximum-intensity projections. For the RNase-A-treated negative control, cells were first fixed in 3.7% PFA, followed by digestion with RNase A (Thermo Fisher Scientific, EN0531) diluted to a final concentration of 200 µg ml−1 with 1× RNase-free PBS for 30 min at 37 °C. RNase A was washed off once with 1× RNase-free PBS before 70% ethanol permeabilization. Intron RNA-FISH staining was then continued as described in the manufacturer’s protocol for adherent cells.
Live-cell imaging
The live-cell imaging cell line was engineered from COLO 320DM cells obtained from ATCC. In brief, the engineering involved the following key steps: (1) CRISPR-mediated knock-in of 96× TetO array into intergenic sites next to MYC, followed by puromycin selection for TetO-positive cells; (2) lentiviral infection of TetR-mNeonGreen, followed by sorting of mNeonGreen positive cells using flow cytometry to enable labelling of TetO inserted MYC locus; (3) monoclonal expansion of cells and evaluation by microscopy to select for clones that forms distinct mNeonGreen puncta with a good signal-to-noise ratio; (4) lentiviral infection of H2B-emiRFP670 was conducted to fluorescently label histone H2B protein, followed by sorting of emiRFP670 and mNeonGreen double-positive cells using flow cytometry. The final monoclonal cells were analysed using metaphase DNA-FISH to confirm good TetO labelling efficiency and that amplicons remained as ecDNA structures.
Cells were seeded onto poly-d-lysine coated 96-well glass-bottom plates 2 days before imaging. On the day of imaging, the medium was switched to FluoroBrite DMEM (Gibco, A1896701) supplemented with 10% FBS and 1× GlutaMax. Prolong live antifade reagent (Invitrogen, P36975) was used at 1:200 dilution to suppress photobleaching. Cells were imaged on a top stage incubator (Okolab) fitted onto a Leica DMi8 widefield microscope with a ×63 oil objective, with temperature (37 °C), humidity and CO2 (5%) controlled throughout the imaging experiment.
Simulations of ecDNA segregation in pairs of daughter cells
To understand how co-segregation dynamics of ecDNAs in dividing cells may affect copy-number correlations in daughter cells, we simulated distributions of ecDNA copies among two daughter cells by random sampling using the sample function in R, for which the sample size is the total copy number of an ecDNA species multiplied by two (as a result of DNA replication). For a given fraction of one ecDNA species that co-segregates with the same fraction of another ecDNA species, the corresponding ecDNA copies were randomly distributed among two daughter cells but at the same ratio for both ecDNA species.
To compare observed ecDNA segregation with these simulations given a non-zero frequency of covalent fusions between two ecDNAs such as the low-level fusion events between different oncogene ecDNA species in various cell lines shown in Extended Data Fig. 2 or those between the enhancer and oncogene sequences shown in Fig. 4, the fraction of fused ecDNAs was treated as co-segregating ecDNAs in the simulations. To generate the expected distributions of enhancer and oncogene ecDNAs among daughter cells in Fig. 4, for each mitotic immunofluorescence and FISH image collected, the fractions of enhancer ecDNAs, oncogene ecDNAs and fused enhancer-oncogene ecDNAs were used to simulate 20 segregation events in which a fraction of ecDNAs corresponding to the fused molecules were perfectly co-segregated. The resulting copy-number correlations in simulated daughter cells represent the null distribution of ecDNAs explained by covalent fusion alone with no additional co-segregation between distinct ecDNA molecules.
ATAC-seq
ATAC-seq data for SNU16 were previously published under GEO accession GSE159986 (ref. 4). Adapter-trimmed reads were aligned to the hg19 genome using Bowtie2 (v.2.1.0). Aligned reads were filtered for quality using samtools (v.1.9)71, duplicate fragments were removed using Picard’s MarkDuplicates (v.2.25.3) and peaks were called using MACS2 (v.2.2.7.1)72 with a q-value cut-off of 0.01 and with a no-shift model.
ChIP–seq
ChIP–seq data for SNU16 were previously published under GEO accession GSE159986 (ref. 4). Paired-end reads were aligned to the hg19 genome using Bowtie2 (ref. 73) (v.2.3.4.1) with the --very-sensitive option after adapter trimming with Trimmomatic60 (v.0.39). Reads with MAPQ values of less than 10 were filtered using samtools (v.1.9) and PCR duplicates removed using Picard’s MarkDuplicates (v.2.20.3-SNAPSHOT). The ChIP–seq signal was converted to bigwig format for visualization using deepTools bamCoverage74 (v.3.3.1) with the following parameters: --bs 5 --smoothLength 105 --normalize Using CPM --scaleFactor 10.
Evolutionary modelling of ecDNA copy-number framework
ecDNA copy number was simulated over growing cell populations using a forward-time simulation implemented in Cassiopeia75 (https://github.com/YosefLab/Cassiopeia). All simulations performed in this study were of two distinct ecDNA species in a growing cell population. Simulations were parameterized with (1) initial ecDNA copy numbers (initial copy number for ecDNA species j is denoted as ); (ii) selection coefficients for cells carrying no ecDNA (s−,−), both ecDNAs (s+,+), or either ecDNA (s−,+ or s+,−; in this study, selection coefficients are treated as constant functions of the types of ecDNA species present in a cell); (3) a base birth rate (λbase = 0.5); (4) and a co-segregation coefficient (γ). Optionally, a death rate can also be specified (μ).
Starting with the parent cell, a birth rate is defined based on the selection coefficient acting on the cell, as λ1 = λbase × (1 + s). Then, a waiting time to a cell division event is drawn from an exponential distribution: tb ~ exp(−λ1). When a death rate is also specified, a time to a death event is also drawn from an exponential distribution: td ~ exp(−μ). If tb < td, a cell division event is simulated and a new edge is added to the growing phylogeny with edge length tb; otherwise, the cell dies and the lineage is stopped. This process will continue until a user-defined stopping condition is specified—either a target cell number (for example, 1 million) or a target time limit.
During a cell division, ecDNAs are split among daughter cells (d1 and d2) according to the co-segregation coefficient, γ, and the ecDNA copy numbers of the parent cell p. In this study, this co-segregation is simulated using two different strategies to determine the effects of co-segregation (see the ‘Alternative model of ecDNA co-evolution’ section below). In the following description, let indicate the copy number of ecDNA species j in daughter cell i and let Nj indicate the copy number of ecDNA species j in the parent cell.
ecDNA species 1 is randomly split distributed to each daughter cell:
Where binomial is the binomial probability distribution. To simulate co-segregation, for the second ecDNA species, copies are distributed to the daughter cells in proportion to the segregation coefficient γ and the copy number of the first ecDNA species in each daughter cell:
Then, the remainder of copies left over that were not passed with co-segregation are randomly distributed between daughter cells:
After this simulation, the output is a phylogeny T over l leaves (denoted by L) with ecDNA copy numbers for ecDNA species j in leaf i.
Evolutionary modelling of ecDNA co-assortment trends
To simulate the trends of ecDNA copy-number dynamics, we used the evolutionary modelling framework described previously (see the ‘Evolutionary modelling of ecDNA copy-number framework’ section). We used the following fixed parameters: selection acting on individual ecDNA (s−,+,s+,−) of 0.2, selection acting on cells without ecDNA (s−,−) of 0.0, a base birth rate (λbase) of 0.5, and initial ecDNA copy numbers for both species () of 5 in the parental cell. We varied co-selection (s+,+) and co-segregation (γ) between 0 and 1.0 and reported the fraction of cells reporting a copy-number of both ecDNAs above a threshold m (by default 1) and the Pearson correlation between ecDNA copy numbers in cells:
Where is the copy number of ecDNA species i in leaf l and is the vector of copy numbers of ecDNA species i across all cells.
For the results presented in Fig. 3b–e and Extended Data Fig. 8b, we simulated populations of 1 million cells and reported the average co-occurrence and correlation across 10 replicates.
Inference of evolutionary parameters
ABC was used to determine evolutionary parameters in cell line data, specifically selection acting on individual ecDNAs (assumed to be equal between ecDNAs (s−,+,s+,−), the level of co-selection (s+,+), and the co-segregation coefficient (γ). In brief, ABC takes a parameter set from a prior or proposal distribution and simulates a dataset from this parameter set. If the simulated dataset matches the observed dataset within specified error tolerance , then we accept the parameter set and update our posterior distribution . In our case, we defined the priors over each parameter as follows:
We used the evolutionary model presented above (see section titled “Evolutionary modelling of ecDNA copy-number framework”) to simulate datasets from the proposed parameter set θ, no death rate, a base birth rate λbase = 0.5, and selection acting on cells without ecDNA s−,− = 0.
Here our goal is to infer a posterior distribution over each evolutionary parameter given single-cell copy numbers observed from scATAC-seq data in a target cell line, denoted as yobs (see the ‘Paired scATAC-seq and scRNA-seq analysis’ section above). To accomplish this, we chose to derive summary statistics describing the co-occurrence (proportion of cells carrying more than 2 copies of each gene amplified as ecDNA) and the Pearson correlation between the log-transformed copy numbers of ecDNAs for guiding our inference, denoted by Cobs and ρobs, respectively. In each round of ABC, we simulated a dataset y0 of 500,000 cells and compared the summary statistics of this simulated dataset to the observed summary statistics using the following distance function:
where C0 and ρ0 are the simulated co-occurrence and Pearson correlation, respectively. We used a tolerance of ϵ = 0.05 as our target error, and each ABC instance was run for up to 3 days. Each simulation was initialized with a parental cell with equal copy-number of initial ecDNA (): in Fig. 3g this initial copy number was 5 although alternative initial conditions are explored in Extended Data Fig. 8f–h. We used the following summary statistics for each cell line: SNU16m1 (Cobs = 0.99, ρobs = 0.46); TR14 (Cobs = 0.96, ρobs = 0.26); GBM39-KT (Cobs = 0.67, ρobs = 0.36).
The specific implementation of this procedure was performed using a sequential Monte Carlo scheme (ABC-SMC) using the Python package pyabc (v.0.12.8). In brief, this approach performs sequential rounds of inference while computing a weight for the accepted parameters for each iteration. Further details of this procedure were reported previously76–79.
Cell-level co-segregation model of ecDNA co-evolution
Previously, we introduced the co-segregation on the ecDNA element level inside of each cell, where an ecDNA element carrying one species is linked to another element with a probability defined as the co-segregation parameter. Here, we introduce an alternative model, in which ecDNA co-segregation is implemented at the cellular level. In each cell division, if a cell is chosen for proliferation, the number of ecDNA copies in that cell are doubled. We first have the randomly segregation of both ecDNA species following a binomial distribution separately, and then pair those with high copy numbers into the same daughter cells with a probability . More precisely, γ describes the likelihood of extreme copy-number correlation, and 1 − γ describes the likelihood of extreme copy-number anticorrelation. If γ = 0.5, it is related to unbiased likelihood for both extreme scenarios, and it results in the modelling of standard random ecDNA proliferation without co-segregation.
In this model, the population growth is also modelled as a birth–death stochastic process and implemented by a standard Gillespie algorithm12. We start from a small initial population (a single cell or three cells) carrying a certain amount of ecDNA elements and recording the exact number of ecDNA copies for each cell through the simulation. Cells are chosen randomly but proportional to their fitness (1 + s) for proliferation, where s is the selection coefficient. Neutral proliferation is defined compared to fitness of cells without ecDNA (s = 0). If there is a fitness effect by carrying ecDNA, s > 0. For simplicity, in our models, we give a fixed selection coefficient for cells carrying either ecDNA and vary the selection coefficient for cells with both ecDNA to investigate the impact of co-selection in ecDNA co-evolution. For reporting, we discretize the population into three subpopulations, named pure, mix and free (no) ecDNA cells (Fig. 3g), which represent cells carrying just one type of ecDNA, both types or no ecDNA at all, respectively. For the results presented in Extended Data Fig. 8c–e, we simulated populations of 10,000 cells and reported summary statistics across 500 replicates.
Evolutionary modelling of drug intervention
The evolutionary model described previously (see the ‘Evolutionary modelling of ecDNA copy-number framework’ section) was used to evaluate the effect of pemigatinib treatment on SNU16m1 cells. To do so, we modified the framework to allow for a burn-in period to simulate population growth without drug and then introduced a perturbation to selection coefficients at a defined timepoint.
Specifically, we allowed the cell population to grow to 5,000 cells under the following conditions: base birth rate (λbase) of 0.5, a death rate (μ) of 2.5, an initial ecDNA copy number for both species () of 10, and the following selection coefficients: s−,− = 0; s−,+ = 0.15; s+,− = 0.15; s+,+ = 0.8 (here, let cells carrying only FGFR2 ecDNA be denoted by s+,− and cells only carrying MYC ecDNA by s−,+).
For the experiments presented in Extended Data Fig. 10a in which we examine the dynamics of ecDNA copy-number after pemigatinib treatments cross a range of values, we simulated pemigatinib treatment by modulating the co-segregation level and selection pressures acting on cells after the 5,000 cell burn-in population was simulated. Specifically, we explored co-segregation parameters between 0 and 1, and selection pressure values . We then simulated 500,000 cells from the pre-treatment group of 5,000 cells while maintaining the same values for γ, μ, λbirth, s−,− and s−,+.
For the pulsed pemigatinib treatment simulations presented in Fig. 4i, we used the same base birth rate, initial copy numbers, death rate and selection coefficients for the burn-in period of 5,000 cells. To simulate the first round of pemigatinib treatment, selection pressure values were set to s+,+ = s+,− = −0.1 and 100,000 cells were simulated from the initial 5,000 cell pre-treatment group and 25,000 cells were sampled at random to continue for the drug holiday. During the drug holiday, 1.2 million cells were simulated according to initial selection parameters from the 25,000 cells sampled from the simulated drug treatment, with a modified base birth rate of 0.4 to model recovery times after drug treatment. After the drug holiday, 200,000 cells were sampled at random and a further drug treatment was simulated up until at least 110 time units according to the same selection parameters used in the first round of simulated pemigatinib treatment. For time-dependent functions of copy number reported in Fig. 4i, the mean copy numbers of both ecDNA species were computed in time bins of 5 up until the introduction of pemigatinib and bins of 1 afterwards.
Evolutionary modelling of enhancer-only ecDNA
To examine the evolutionary principles of enhancer-only ecDNA, we used the previously described evolutionary model (see the ‘Evolutionary modelling of ecDNA copy-number framework’ section above) without death and fixed the following evolutionary parameters: s+,− = 0.2, s−,+ = 0, λbase = 0.5 and . We simulated ten replicates of 1-million cell populations a modulated co-selection coefficient s+,+ from [0, 1] and co-segregation coefficient γ from [0, 1]. In Fig. 4, we report the distribution of co-occurrence summary statistics C across these ten replicates.
Nanopore sequencing of SNU16 genomic DNA
Genomic DNA from approximately 2 million SNU16 cells was extracted using the MagAttract HMW DNA Kit (Qiagen, 67563) and prepared for long-read sequencing using the Ligation Sequencing Kit V14 (Oxford Nanopore Technologies SQK-LSK114) according to the manufacturer’s instructions. Libraries were sequenced on a PromethION (Oxford Nanopore Technologies) using a 10.4.1 flow cell (Oxford Nanopore Technologies FLO-PRO114M).
Base calling from raw POD5 data was performed using Dorado (Oxford Nanopore Technologies, v.0.2.1+c70423e). Reads were aligned to hg19 using Winnowmap2 (ref. 80) (v.2.03) with the following parameters: -ax map-ont. Structural variants were called using Sniffles81 (v.2.0.7) using the following additional parameters: --output-rnames.
Pemigatinib treatment of SNU16m1 and COLO 320DM cell lines
SNU16m1 and COLO 320DM cells were treated with 5 μM pemigatinib (Selleckchem, S0088), or with an equal volume of DMSO. Fresh pemigatinib was replenished approximately every 3–4 days. Approximately 1 million SNU16m1 cells were sampled from the DMSO condition, 300,000 cells from the pemigatinib-treated conditions at day 0, 7, 14, 21, 28, 35 and 42; genomic DNA was extracted using the DNeasy Blood & Tissue Kit (Qiagen, 69504), and subjected to WGS (see the ‘WGS’ section above). Approximately 2 million COLO 320DM cells were sampled at day 14, genomic DNA was extracted using the Quick DNA MiniPrep kit (Zymo Research; D0325) and subjected to WGS using the same procedure as above. Copy numbers for oncogene regions were computed using cnvkit (v.0.9.6.dev0)82.
Chemotherapy treatment of SNU16m1 cell line
SNU16m1 cells were treated with 10 μM etoposide (Selleckchem, S1225), 20 μM fluorouracil (Selleckchem, S1209), 100 μM hydroxyura (Selleckchem, S1896), or equal volume DMSO control for 20 days. 2,300,000 SNU16m1 cells were plated in T-75 flasks for treatment with chemotherapeutic drugs and approximately 1,000,000 cells were seeded in T-25 flasks for treatment with DMSO control. Fresh chemotherapy drug was replenished at least every 7 days. On day 20 of the experiment, the remaining cells were collected and genomic DNA was extracted using the Quick DNA MiniPrep kit (Zymo Research, D0325) and subjected to WGS and analysis (see the ‘WGS’ section above). Copy numbers for oncogene regions were computed using cnvkit (v.0.9.10)82.
Nutlin-3a treatment of TR14 cells and interphase DNA-FISH
A total of 175,000 TR-14 cells was seeded per well in 12-well plates. Cells were treated either with 0.1% DMSO or with 1 µl nutlin-3a (Sigma Aldrich, SML0580) for 6 days, without an additional wash-out period.
The samples were fixed using Carnoy’s solution (3:1 methanol:acetic acid). Fixed samples on coverslips or slides were briefly equilibrated in 2× SSC buffer. They were then dehydrated in ascending ethanol concentrations of 70%, 90% and 100% for approximately 2 min each. FISH probes were diluted in probe hybridization buffer and added to the sample with the addition of a coverslip or slide. The samples were denatured at 78 °C for 5 min and then hybridized at 37 °C overnight in a humid and dark chamber. The samples were washed twice in 0.4× SSC with 0.3% IGEPAL CA-630 for 2 min with agitation for the first 10–15 s. They were then washed once in 2× SSC with 0.1% IGEPAL CA-630 at room temperature for 2 min, again with agitation for the first 10–15 s. DAPI (100 ng ml−1) was applied to samples for 10 min. The samples were then washed again with 2× SSC and mounted with ProLong Antifade Mountant.
FISH and microscopy was performed in the same manner as TR14 was processed as described above (see the ‘Metaphase DNA-FISH image analysis’ section). Statistical significance was assessed using Wilcoxon rank-sum tests.
TP53 knockdown by shRNA
Lentiviruses were produced for TP53 knockdown using short hairpin RNA (shRNA) targeting TP53 (shTP53) or GFP (sgGFP) as a control. The shTP53 pLKO.1 puro plasmid was a gift from Y. Yu, Johannes Kepler Universität Linz. The shGFP pLKO.1 control plasmid was obtained from the RNAi Consortium, Broad Institute. HEK293T cells were transfected using TransIT-LT1 (Mirus) in a 2:1:1 ratio of lentiviral plasmid, psPAX2 and pMD2.G plasmids (Addgene) according to the TransIT-LT1 manufacturer’s protocol. Viral supernatant was collected 48 and 72 h after transfection, pooled, filtered and stored at −80 °C.
TR14 cells were transduced for 1 day in the presence of 8 µg ml−1 polybrene (Sigma-Aldrich). They were then grown in full medium for 1 day and selected with puromycin (2 μg ml−1) for 5–7 days.
Western immunoblotting
A total of 800,000 cells was seeded in six-well plates and treated with either 0.1% DMSO or with the indicated concentration of nutlin-3a (Sigma Aldrich, SML0580) for 6 days, without an additional wash-out period. Whole-cell protein lysates were then prepared by lysing cells in radioimmunoprecipitation assay buffer supplemented with cOmplete Protease inhibitor (Roche) and PhosphStop (Roche). Protein concentrations were determined using the bicinchoninic acid assay (Thermo Fisher Scientific). Then, 30 µg of protein was denatured in Laemmli buffer at 95 °C for 10 min. The lysates were loaded onto 16% Tris-Glycine (Thermo Fisher Scientific) for gel electrophoresis. Proteins were transferred onto polyvinylidene fluoride membranes (Roche), blocked with 5% dry milk for 1 h and incubated with primary antibodies overnight at 4 °C, followed by secondary antibodies for 1 h at room temperature (MDM2 antibody (SMP14), Santa Cruz Biotechnology, sc-965, 1:200 dilution; p53 Antibody (DO-1), Santa Cruz Biotechnology, sc-126, 1:500 dilution; goat anti-mouse IgG (H+L) secondary antibody, HRP, Invitrogen, 31430, 1:2,000 dilution; vinculin monoclonal antibody (VLN01), Invitrogen, MA5-11690, 1:250 dilution). Chemiluminescent signal was detected using SuperSignal West Femto Maximum Sensitivity Substrate (Thermo Fisher Scientific) and the Fusion FX7 imaging system (Vilber Lourmat) using ImageLab. Unprocessed western blot images are provided as source data.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41586-024-07861-8.
Supplementary information
Source data
Acknowledgements
We thank the members of the Chang and Mischel laboratories for discussions; J. D’Silva for suggestions in developing an evolutionary model of ecDNA co-inheritance; M. M. Hassan and N. S. Gray for providing the BRD4 bivalent degrader. This work was delivered as part of the eDyNAmiC team supported by the Cancer Grand Challenges partnership funded by Cancer Research UK (CGCATF-2021/100012 (P.S.M. and H.Y.C.), CGCATF-2021/100025 (V.B.), CGCATF-2021/100017 (A.G.H.), CGCATF-2021/100021 (B.F.C.) and CGCATF-2021/100020 (W.H. and B.W.)) and the National Cancer Institute (OT2CA278688 (P.S.M. and H.Y.C.), OT2CA278635 (V.B.), OT2CA278644 (A.G.H.), OT2CA278692 (B.F.C.) and OT2CA278670 (W.H. and B.W.)). This project was also supported by the National Institutes of Health (NIH) R35-CA209919 (H.Y.C.), and U24CA264379 and R01GM114362 (V.B.). K.L.H. was supported by a Stanford Graduate Fellowship and an NCI Predoctoral to Postdoctoral Fellow Transition Award (NIH F99CA274692); M.G.J. by an NCI Pathway to Independence Award (NIH K99CA286968); J.A.B. by NIH training grant T32HL120824; A.G.H. by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation; 398299703) and the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement no. 949172); B.W. by a Barts Charity Lectureship (MGU045) and a UKRI Future Leaders Fellowship (MR/V02342X/1). X.Y. is a Damon Runyon Fellow supported by the Damon Runyon Cancer Research Foundation (DRG-2474-22). H.Y.C. is an Investigator of the Howard Hughes Medical Institute.
Extended data figures and tables
Author contributions
K.L.H., J.T.L., P.S.M. and H.Y.C. conceived the project. K.L.H. analysed scATAC-seq and scRNA-seq data, analysed ecDNA copy numbers and colocalization using metaphase DNA-FISH images, performed simulations of ecDNA segregation in paired daughter cells, performed CRISPR–CATCH experiments and analyses, integrated ATAC-seq and ChIP–seq data, and analysed WGS data. M.G.J. developed the evolutionary modelling framework and performed evolutionary modelling and inference, generated Nanopore sequencing of SNU16 genomic DNA and analysed human tumour samples for evidence of ecDNA co-amplification. K.L.H. and E.J.C. analysed ecDNA segregation in mitotic immunofluorescence and DNA-FISH images. B.J.H. and M.G.J. performed chemotherapy and pemigatinib treatments in cell culture. I.T.-L.W. and J.T.L. performed immunofluorescence staining and DNA-FISH in mitotic cells and imaging. I.T.-L.W. performed RNA-FISH, BRD4 degrader and transcription inhibitor treatments, subsequent immunofluorescence–DNA-FISH staining and image acquisition, and live-cell imaging. X.Y. engineered the COLO 320DM cells for live-cell imaging. E.J.C. and A.G. performed BRD4 and triptolide immunofluorescence–DNA-FISH imaging acquisition and live-cell imaging. K.L.H. and B.J.H. performed CIP2A knockout. J.L. analysed ecDNA amplicon sequences in TCGA patient tumours using AmpliconClassifier. E.S. created the alternative model of ecDNA co-evolution. Z.L., B.M. and B.F.C. provided ZL-12A for the triptolide experiments. K.L.H., M.G.J., B.J.H. and R.L. prepared sequencing libraries for WGS and CRISPR–CATCH. K.L.H. and R.L. prepared sequencing libraries for scATAC-seq and scRNA-seq. I.T.-L.W., L.B., R.S. and J.R.D. performed DNA-FISH and imaging experiments. R.C.G. analysed scCircle-seq data. M.G.J., K.L.H., J.A.B., B.W., W.H., V.B., A.G.H., P.S.M. and H.Y.C. guided data analysis and provided feedback on experimental design. K.L.H., M.G.J. and H.Y.C. wrote the manuscript with input from all of the authors.
Peer review
Peer review information
Nature thanks Samuel Bakhoum and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Data availability
Sequencing data generated for this study have been deposited at the NCBI SRA under BioProject accession PRJNA1127616. Source imaging data generated for this study have been deposited in the Stanford Digital Repository83 (10.25740/ff315yn8920). AmpliconClassifier output files containing ecDNA coordinates in TCGA samples are publicly available at figshare84 (10.6084/m9.figshare.24768555.v1). WGS data from bulk SNU16 cells were previously generated (SRR530826, Genome Research Foundation). Paired scATAC-seq and scRNA-seq data for COLO 320DM cells were generated previously and published at the GEO (GSE159986). TR14 scCircle-seq data were deposited in the European Genome-phenome Archive (EGA; EGAS00001007026). CRISPR–CATCH sequencing data integrated from previous studies were deposited in the SRA under BioProject accessions PRJNA670737 and PRJNA777710. ATAC-seq and ChIP–seq data for SNU16 were previously published at the GEO (GSE159986). Source data are provided with this paper.
Code availability
The ecDNA evolutionary modelling framework used in this study is publicly available through Cassiopeia75 at GitHub (https://github.com/YosefLab/Cassiopeia). AmpliconClassifer is available at GitHub (https://github.com/jluebeck/AmpliconClassifier).
Competing interests
H.Y.C. is a co-founder of Accent Therapeutics, Boundless Bio, Cartography Biosciences and Orbital Therapeutics, and an advisor of 10x Genomics, Arsenal Biosciences, Chroma Medicine and Spring Discovery. V.B. is a co-founder, paid consultant, scientific advisory board member and has equity interest in Boundless Bio and Abterra. The terms of this arrangement have been reviewed and approved by the University of California, San Diego in accordance with its conflict-of-interest policies. M.G.J. consults for and holds equity in Vevo Therapeutics. P.S.M. and B.F.C. are co-founders and advisors of Boundless Bio. The other authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: King L. Hung, Matthew G. Jones, Ivy Tsz-Lo Wong, Ellis J. Curtis
These authors jointly supervised this work: Paul S. Mischel, Howard Y. Chang
Contributor Information
Paul S. Mischel, Email: pmischel@stanford.edu
Howard Y. Chang, Email: howchang@stanford.edu
Extended data
is available for this paper at 10.1038/s41586-024-07861-8.
Supplementary information
The online version contains supplementary material available at 10.1038/s41586-024-07861-8.
References
- 1.Wilson, E. B. The Cell in Development and Heredity (Macmillan, 1925).
- 2.Yan, X., Mischel, P. & Chang, H. Extrachromosomal DNA in cancer. Nat. Rev. Cancer24, 261–273 (2024). [DOI] [PubMed] [Google Scholar]
- 3.Hung, K. L., Mischel, P. S. & Chang, H. Y. Gene regulation on extrachromosomal DNA. Nat. Struct. Mol. Biol.10.1038/s41594-022-00806-7 (2022). [DOI] [PMC free article] [PubMed]
- 4.Hung, K. L. et al. ecDNA hubs drive cooperative intermolecular oncogene expression. Nature600, 731–736 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Turner, K. M. et al. Extrachromosomal oncogene amplification drives tumour evolution and genetic heterogeneity. Nature543, 122–125 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Cox, D., Yuncken, C. & Spriggs, A. I. Minute chromatin bodies in malignant tumours of childhood. Lancet286, 55–58 (1965). [DOI] [PubMed] [Google Scholar]
- 7.Shimizu, N. Gene amplification and the extrachromosomal circular DNA. Genes12, 1533 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hamkalo, B. A., Farnham, P. J., Johnston, R. & Schimke, R. T. Ultrastructural features of minute chromosomes in a methotrexate-resistant mouse 3T3 cell line. Proc. Natl Acad. Sci. USA82, 1126–1130 (1985). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Nathanson, D. A. et al. Targeted therapy resistance mediated by dynamic regulation of extrachromosomal mutant EGFR DNA. Science343, 72–76 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Levan, G., Mandahl, N., Bengtsson, B. O. & Levan, A. Experimental elimination and recovery of double minute chromosomes in malignant cell populations. Hereditas86, 75–90 (1977). [DOI] [PubMed] [Google Scholar]
- 11.deCarvalho, A. C. et al. Discordant inheritance of chromosomal and extrachromosomal DNA elements contributes to dynamic disease evolution in glioblastoma. Nat. Genet.50, 708 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lange, J. T. et al. The evolutionary dynamics of extrachromosomal DNA in human cancers. Nat. Genet.10.1038/s41588-022-01177-x (2022). [DOI] [PMC free article] [PubMed]
- 13.Levan, A. & Levan, G. Have double minutes functioning centromeres? Hereditas88, 81–92 (1978). [DOI] [PubMed] [Google Scholar]
- 14.Lundberg, G. et al. Binomial mitotic segregation of MYCN-carrying double minutes in neuroblastoma illustrates the role of randomness in oncogene amplification. PLoS ONE3, e3099 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Yi, E. et al. Live-cell imaging shows uneven segregation of extrachromosomal DNA elements and transcriptionally active extrachromosomal DNA hubs in cancer. Cancer Discov.12, 468–483 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Bergstrom, E. N. et al. Mapping clustered mutations in cancer reveals APOBEC3 mutagenesis of ecDNA. Nature602, 510–517 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Nikolaev, S. et al. Extrachromosomal driver mutations in glioblastoma and low-grade glioma. Nat. Commun.5, 5690 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hung, K. L. et al. Targeted profiling of human extrachromosomal DNA by CRISPR-CATCH. Nat. Genet.10.1038/s41588-022-01190-0 (2022). [DOI] [PMC free article] [PubMed]
- 19.Kim, H. et al. Extrachromosomal DNA is associated with oncogene amplification and poor outcome across multiple cancers. Nat. Genet.52, 891–897 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.L’Abbate, A. et al. Genomic organization and evolution of double minutes/homogeneously staining regions with MYC amplification in human cancer. Nucleic Acids Res.42, 9131–9145 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Stephens, P. J. et al. Massive genomic rearrangement acquired in a single catastrophic event during cancer development. Cell144, 27–40 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Shimizu, N., Misaka, N. & Utani, K. Nonselective DNA damage induced by a replication inhibitor results in the selective elimination of extrachromosomal double minutes from human cancer cells. Genes Chromosom. Cancer46, 865–874 (2007). [DOI] [PubMed] [Google Scholar]
- 23.Oobatake, Y. & Shimizu, N. Double-strand breakage in the extrachromosomal double minutes triggers their aggregation in the nucleus, micronucleation, and morphological transformation. Genes Chromosom. Cancer59, 133–143 (2020). [DOI] [PubMed] [Google Scholar]
- 24.Shimizu, N., Itoh, N., Utiyama, H. & Wahl, G. M. Selective entrapment of extrachromosomally amplified DNA by nuclear budding and micronucleation during S phase. J. Cell Biol.140, 1307–1320 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Trivedi, P., Steele, C. D., Au, F. K. C., Alexandrov, L. B. & Cleveland, D. W. Mitotic tethering enables inheritance of shattered micronuclear chromosomes. Nature618, 1049–1056 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Zhang, C.-Z. et al. Chromothripsis from DNA damage in micronuclei. Nature522, 179–184 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ly, P. et al. Selective Y centromere inactivation triggers chromosome shattering in micronuclei and repair by non-homologous end joining. Nat. Cell Biol.19, 68–75 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Chapman, O. S. et al. Circular extrachromosomal DNA promotes tumor heterogeneity in high-risk medulloblastoma. Nat. Genet.10.1038/s41588-023-01551-3 (2023). [DOI] [PMC free article] [PubMed]
- 29.Deshpande, V. et al. Exploring the landscape of focal amplifications in cancer using AmpliconArchitect. Nat. Commun.10, 392 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zhu, Y. et al. Oncogenic extrachromosomal DNA functions as mobile enhancers to globally amplify chromosomal transcription. Cancer Cell39, 694–707 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Koche, R. P. et al. Extrachromosomal circular DNA drives oncogenic genome remodeling in neuroblastoma. Nat. Genet.10.1038/s41588-019-0547-z (2019). [DOI] [PMC free article] [PubMed]
- 32.Satpathy, A. T. et al. Massively parallel single-cell chromatin landscapes of human immune cell development and intratumoral T cell exhaustion. Nat. Biotechnol.37, 925–936 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kumar, P. et al. ATAC-seq identifies thousands of extrachromosomal circular DNA in cancer and cell lines. Sci. Adv.6, eaba2489 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Guilhamon, P. et al. Single-cell chromatin accessibility profiling of glioblastoma identifies an invasive cancer stem cell population associated with lower survival. eLife10, e64090 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Funnell, T. et al. Single-cell genomic variation induced by mutational processes in cancer. Nature612, 106–115 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Nikolic, A. et al. Copy-scAT: deconvoluting single-cell chromatin accessibility of genetic subclones in cancer. Sci. Adv.7, eabg6045 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Carroll, S. M. et al. Characterization of an episome produced in hamster cells that amplify a transfected CAD gene at high frequency: functional evidence for a mammalian replication origin. Mol. Cell. Biol.7, 1740–1750 (1987). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Hoff, D. D. V., Needham-VanDevanter, D. R., Yucel, J., Windle, B. E. & Wahl, G. M. Amplified human MYC oncogenes localized to replicating submicroscopic circular DNA molecules. Proc. Natl Acad. Sci. USA85, 4804–4808 (1988). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Fuller, B. G. et al. Midzone activation of aurora B in anaphase produces an intracellular phosphorylation gradient. Nature453, 1132–1136 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Kanda, T., Sullivan, K. F. & Wahl, G. M. Histone–GFP fusion protein enables sensitive analysis of chromosome dynamics in living mammalian cells. Curr. Biol.8, 377–385 (1998). [DOI] [PubMed] [Google Scholar]
- 41.Lin, Y.-F. et al. Mitotic clustering of pulverized chromosomes from micronuclei. Nature10.1038/s41586-023-05974-0 (2023). [DOI] [PMC free article] [PubMed]
- 42.Singh, J. & Padgett, R. A. Rates of in situ transcription and splicing in large human genes. Nat. Struct. Mol. Biol.16, 1128–1133 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Kimura, H., Sugaya, K. & Cook, P. R. The transcription cycle of RNA polymerase II in living cells. J. Cell Biol.159, 777–782 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Bensaude, O. Inhibiting eukaryotic transcription. Which compound to choose? How to evaluate its activity? Transcription2, 103–108 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Manzo, S. G. et al. Natural product triptolide mediates cancer cell death by triggering CDK7-dependent degradation of RNA polymerase II. Cancer Res.72, 5363–5373 (2012). [DOI] [PubMed] [Google Scholar]
- 46.Liu, Z. et al. Proteomic ligandability maps of spirocycle acrylamide stereoprobes identify covalent ERCC3 degraders. J. Am. Chem. Soc.146, 10393–10406 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Chkhaidze, K. et al. Spatially constrained tumour growth affects the patterns of clonal selection and neutral drift in cancer genomic data. PLoS Comput. Biol.15, e1007243 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Marjoram, P. & Tavaré, S. Modern computational approaches for analysing molecular genetic variation data. Nat. Rev. Genet.7, 759–770 (2006). [DOI] [PubMed] [Google Scholar]
- 49.Liu, P. C. C. et al. INCB054828 (pemigatinib), a potent and selective inhibitor of fibroblast growth factor receptors 1, 2, and 3, displays activity against genetically defined tumor models. PLoS ONE15, e0231877 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Carroll, S. M. et al. Double minute chromosomes can be produced from precursors derived from a chromosomal deletion. Mol. Cell. Biol.8, 1525–1533 (1988). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Ruiz, J. C. & Wahl, G. M. Chromosomal destabilization during gene amplification. Mol. Cell. Biol.10, 3056–3066 (1990). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Snapka, R. M. & Varshavsky, A. Loss of unstably amplified dihydrofolate reductase genes from mouse cells is greatly accelerated by hydroxyurea. Proc. Natl Acad. Sci. USA80, 7533–7537 (1983). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Hoff, D. D. V. et al. Elimination of extrachromosomally amplified MYC genes from human tumor cells reduces their tumorigenicity. Proc. Natl Acad. Sci. USA89, 8165–8169 (1992). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Shoshani, O. et al. Chromothripsis drives the evolution of gene amplification in cancer. Nature591, 137–141 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Sarkaria, J. N. et al. Use of an orthotopic xenograft model for assessing the effect of epidermal growth factor receptor amplification on glioblastoma radiation response. Clin. Cancer Res.12, 2264–2271 (2006). [DOI] [PubMed] [Google Scholar]
- 56.Hassan, M. M. et al. Exploration of the tunability of BRD4 degradation by DCAF16 trans-labelling covalent glues. Preprint at bioRxiv10.1101/2023.10.07.561308 (2023). [DOI] [PubMed]
- 57.Labun, K. et al. CHOPCHOP v3: expanding the CRISPR web toolbox beyond genome editing. Nucleic Acids Res.47, W171–W174 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Picelli, S. et al. Tn5 transposase and tagmentation procedures for massively scaled sequencing projects. Genome Res.24, 2033–2040 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Corces, M. R. et al. An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nat. Methods14, 959–962 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics30, 2114 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics25, 1754–1760 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Luebeck, J. et al. Extrachromosomal DNA in the cancerous transformation of Barrett’s oesophagus. Nature616, 798–805 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Butler, A., Hoffman, P., Smibert, P., Papalexi, E. & Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol.36, 411–420 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Granja, J. M. et al. ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis. Nat. Genet. 53, 403–411 (2021). [DOI] [PMC free article] [PubMed]
- 65.Chamorro González, R. et al. Parallel sequencing of extrachromosomal circular DNAs and transcriptomes in single cancer cells. Nat. Genet.55, 880–890 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.González, R. C., Conrad, R., Kasack, K. & Henssen, A. G. scEC&T-seq: a method for parallel sequencing of extrachromosomal circular DNAs and transcriptomes in single human cells. Protoc. Exch. 10.21203/rs.3.pex-2180/v1 (2023). [DOI] [PMC free article] [PubMed]
- 67.Macaulay, I. C. et al. G&T-seq: parallel sequencing of single-cell genomes and transcriptomes. Nat. Methods12, 519–522 (2015). [DOI] [PubMed] [Google Scholar]
- 68.Krueger F. Trim Galore v.0.6.4 (GitHub, 2021).
- 69.Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at arxiv.org/abs/1303.3997 (2013).
- 70.Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods9, 676–682 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics25, 2078–2079 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Zhang, Y. et al. Model-based Analysis of ChIP-Seq (MACS). Genome Biol.9, R137 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods9, 357–359 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Ramírez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res.44, W160–W165 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Jones, M. G. et al. Inference of single-cell phylogenies from lineage tracing data using Cassiopeia. Genome Biol.21, 92 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Sisson, S. A., Fan, Y. & Tanaka, M. M. Sequential Monte Carlo without likelihoods. Proc. Natl Acad. Sci. USA104, 1760–1765 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Beaumont, M. A., Cornuet, J.-M., Marin, J.-M. & Robert, C. P. Adaptive approximate Bayesian computation. Biometrika96, 983–990 (2009). [Google Scholar]
- 78.Toni, T., Welch, D., Strelkowa, N., Ipsen, A. & Stumpf, M. P. H. Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems. J. R. Soc. Interface6, 187–202 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Lintusaari, J., Gutmann, M. U., Dutta, R., Kaski, S. & Corander, J. Fundamentals and recent developments in approximate bayesian computation. Syst. Biol.66, e66–e82 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Jain, C., Rhie, A., Hansen, N. F., Koren, S. & Phillippy, A. M. Long-read mapping to repetitive reference sequences using Winnowmap2. Nat. Methods19, 705–710 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Sedlazeck, F. J. et al. Accurate detection of complex structural variations using single-molecule sequencing. Nat. Methods15, 461–468 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Talevich, E., Shain, A. H., Botton, T. & Bastian, B. C. CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLoS Comput. Biol.12, e1004873 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Hung, K. et al. Coordinated inheritance of extrachromosomal DNAs in cancer cells – imaging datasets. Stanford Digital Repository10.25740/ff315yn8920 (2024). [DOI] [PubMed]
- 84.Hung, K. L. et al. Data for ‘Coordinated inheritance of extrachromosomal DNAs in cancer cells’. figshare10.6084/m9.figshare.24768555.v1 (2024). [DOI] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequencing data generated for this study have been deposited at the NCBI SRA under BioProject accession PRJNA1127616. Source imaging data generated for this study have been deposited in the Stanford Digital Repository83 (10.25740/ff315yn8920). AmpliconClassifier output files containing ecDNA coordinates in TCGA samples are publicly available at figshare84 (10.6084/m9.figshare.24768555.v1). WGS data from bulk SNU16 cells were previously generated (SRR530826, Genome Research Foundation). Paired scATAC-seq and scRNA-seq data for COLO 320DM cells were generated previously and published at the GEO (GSE159986). TR14 scCircle-seq data were deposited in the European Genome-phenome Archive (EGA; EGAS00001007026). CRISPR–CATCH sequencing data integrated from previous studies were deposited in the SRA under BioProject accessions PRJNA670737 and PRJNA777710. ATAC-seq and ChIP–seq data for SNU16 were previously published at the GEO (GSE159986). Source data are provided with this paper.
The ecDNA evolutionary modelling framework used in this study is publicly available through Cassiopeia75 at GitHub (https://github.com/YosefLab/Cassiopeia). AmpliconClassifer is available at GitHub (https://github.com/jluebeck/AmpliconClassifier).