Mutation divergence over space in tumour expansion

Haiyang Li; Zixuan Yang; Fengyu Tu; Lijuan Deng; Yuqing Han; Xing Fu; Long Wang; Di Gu; Benjamin Werner; Weini Huang

doi:10.1098/rsif.2023.0542

. 2023 Nov 22;20(208):20230542. doi: 10.1098/rsif.2023.0542

Mutation divergence over space in tumour expansion

Haiyang Li ^1,², Zixuan Yang ¹, Fengyu Tu ¹, Lijuan Deng ¹, Yuqing Han ¹, Xing Fu ¹, Long Wang ¹, Di Gu ^3,^✉, Benjamin Werner ^2,^✉, Weini Huang ^1,^4,^✉

PMCID: PMC10681009 PMID: 37989227

Abstract

Mutation accumulation in tumour evolution is one major cause of intra-tumour heterogeneity (ITH), which often leads to drug resistance during treatment. Previous studies with multi-region sequencing have shown that mutation divergence among samples within the patient is common, and the importance of spatial sampling to obtain a complete picture in tumour measurements. However, quantitative comparisons of the relationship between mutation heterogeneity and tumour expansion modes, sampling distances as well as the sampling methods are still few. Here, we investigate how mutations diverge over space by varying the sampling distance and tumour expansion modes using individual-based simulations. We measure ITH by the Jaccard index between samples and quantify how ITH increases with sampling distance, the pattern of which holds in various sampling methods and sizes. We also compare the inferred mutation rates based on the distributions of variant allele frequencies under different tumour expansion modes and sampling sizes. In exponentially fast expanding tumours, a mutation rate can always be inferred for any sampling size. However, the accuracy compared with the true value decreases when the sampling size decreases, where small sampling sizes result in a high estimate of the mutation rate. In addition, such an inference becomes unreliable when the tumour expansion is slow, such as in surface growth.

Keywords: spatial tumour heterogeneity, spatial sampling, agent-based model, tumour evolution, mutation divergence

1. Introduction

The accumulation of somatic mutations is a natural consequence of cell turnover during tissue formation and maintenance [1–3]. During cell divisions, genetic variants such as single-nucleotide polymorphisms (SNPs) and copy number variations can happen, which may lead to fitness differentials between cells and eventually tumour initiation [4,5]. While only a small subset of mutations will be driver events and confer fitness advantages, neutral passenger mutations continuously accumulate and increase intra-tumour heterogeneity (ITH) [6–8]. Tumours with high genetic heterogeneity are considered more difficult to treat and with higher recurrence rates in clinical practice [7–10]. This may be for example due to the higher chance of pre-existing resistance before in treatment in tumours with high ITH [11–16].

The patterns of ITH are driven by both spatial and temporal dynamics [17,18]. While temporal samples are often infeasible, the distribution of accumulated mutations has been used to reveal the evolutionary history of tumours [19]. Sottoriva et al. [20] introduced a ‘Big Bang’ model, where neutral passenger mutations can accumulate fast in the early expansion of tumours after tumorigenesis. The distribution of those neutral mutations follows a power law decay along the increase of mutation frequency by theoretical predictions [21,22], which is consistent with around one-third of tumour samples across many cancer types in The Cancer Genome Atlas (TCGA) database [23]. In large populations, new driver mutations can arise and even lead to clonal competitions in theory [24] as well as in experimental observations [25], which immediately increases the complexity of revealing tumour history and parameters through ITH pattern in bulk samples [26]. Spatial studies with multi-region sequencing have been carried out to compensate the practical constraints of obtaining temporal samples in solid tumours [17,18,27–32]. Different spatial models of cancer evolution are further developed to analyse the spatial impact on ITH, as well as to use spatial information, e.g. from multi-region genomic data to infer evolutionary parameters such as mutation rates and selection strength [31,33–37].

Using a three-dimensional lattice, Waclaw et al. [33] modelled exponentially expanding tumours with continuously arising mutations with and without dispersal. They showed that the ITH level and selective advantage of tumour cells have a negative correlation where tumours without driver mutations are more genetically heterogeneous. In addition, high heterogeneity leads to the rapid onset of resistance to chemotherapy, and a small amount of localized cellular dispersal can cause a faster growth of tumours formed of conglomerates of balls with more similar genetic alterations, compared with spherical tumours without cell dispersal. Ryser et al. [31] investigated the role of early tumour cell migration in shaping the patterns of private mutation in colorectal tumours through an extension of the ‘Big Bang model’ [20]. Their results showed that detecting the same private mutations in opposite sides of the final tumours indicates early cell mixing and movement, and that neutral evolution leads to high local ITH under the single expansion hypothesis of tumorigenesis. Chkhaidze et al. [34] modelled the spatial constraints of tumour growth through a pushing parameter, and showed that the phylogeny of multiple samples deviates more from the ground truth under stronger spatial constraints due to higher sampling bias. Combining theoretical and clinical data, Lenos et al. [38] and Fu et al. [39] also visualized the heterogeneity distribution and clonal diversity under volume and surface growth, where cells divide evenly throughout the whole tumour in volume growth but only in the boundary with the empty space in surface growth. Lenos et al. [38] quantitatively analysed the lineage tracing data from primary colon cancer and found that the stem cells driving tumour growth are mostly at tumour edges, and stem cell properties change over time depending on the cell location. Fu et al. [39] showed that the complete variability of clone size is due to spatio-temporal regulation of necrosis and that surface growth had a greater effect on sub-clonal diversity than volume growth. They also revealed that clonal diversity decreased sharply with the growth time in the surface model. More recently, Noble et al. [36] compared the tumour dynamics under different spatial structures in computational models, which reflect tissue architectures ranging from non-spatial as leukaemia, various gland structures as colorectal and breast cancer, to surface growth in certain types of liver cancer. While no spatial constraints result in rapid clonal expansion of cells carrying driver mutations, strong spatial constraint like surface growth will suppress selection and lead to tumour patterns similar to those under neutral evolution. Beside of lattice models, Gallaher et al. [40] applied the off-lattice model to systematically study and predict the spatial dynamic evolution of heterogeneous tumours in different treatment periods. Their results showed that evolution-based strategies by exploiting the cost of resistance can delay treatment failure, and fewer drugs and more vacation-oriented treatment can reduce tumour heterogeneity.

Beyond competition and constraints on pure physical space, the micro- and macro-environment play a significant role in tumour development especially on the non-genetic factors [41]. Anderson et al. [42] introduced hybrid models to simulate tumours in context, where the tumour environment such as extracellular matrix dynamics and oxygen or nutrient concentration are governed by partial differential equations, and cell reproduction and death are modelled in two-dimensional lattice with feedbacks between the two scales. Gerlee & Anderson [43] extended the original hybrid model and included mutations to investigate the impact of micro-environment on tumour clonal evolution. They observed more aggressive tumour phenotypes and a higher ITH in harsher environment with a low oxygen concentration. Hybrid models have been further developed to study for example tumour stroma interactions on prostate cancer [44], hierarchical and stem-cell driven tumour dynamics [45] and metastases [46]. More recently, Gallaher et al. [47] used a hybrid model to study phenotype heterogeneity in Glioblastoma, where a cell’s phenotype is an outcome of its inheritable traits and the influence of the environment modelled as a continuous field of platelet-derived growth factor distribution. Their results showed that both environmental and intrinsic factors are required to best fit the observed phenotypic heterogeneity and cell migration patterns in rat glioma data. While tumour–environment interactions most likely will impact on the genetic heterogeneity as well, here we focus on a simpler scenario and quantify how the physical space constraint and sampling distance will impact on the measured genetic heterogeneity between samples. While our conclusions are based on pure spatial competition, deviation from the mutation divergence patterns observed in our model can reflect other major driving mechanisms beyond space limitation, where hybrid models can be used to further extend our study.

While different growth modes of solid tumours are related to the strength of spatial constraints and likely to be tissue-specific [48], we simplify this as a continuous inter-cellular push rate (0 ≤ p ≤ 1) similar to Sottoriva et al. [49], Ryser et al. [31], Chkhaidze et al. [34] in an agent-based model in two- or three-dimensional lattice. Different values of the push rate, i.e. the likelihood of a cell pushing neighbour cells outward when there are no direct empty nodes around it in division, refer to different modes of tumour growth including the surface (p = 0) and exponential growth (p = 1) as two boundary examples.

More and more experimental and theoretical studies have demonstrated the importance of sampling itself on the interpretation of measured ITH. Opasic et al. [50] investigated how sampling will impact on the accuracy of identifying clonal mutations in a lattice model and showed non-random sampling from a circular spatial pattern might improve the classification of true clonal mutations compared with random sampling. Ling et al. [51] evaluated the spatial distribution of point mutations, by sequencing approximately 300 sampled regions within the circular range of a single tumour, and found that cells sampled farther away from the sample centre carry more mutations. Experimental work from Masugi et al. [52] and Zhao et al. [53] indicated that cell density and metastatic likelihood differ between tumour centre and margin. Quantitative measures of how sampling can impact on the ITH will provide theoretical foundations in how to implement multi-region sampling in practice in future. Here, we take random samples with various sizes in each simulation and analysis the average pattern of ITH between samples. In comparison, we also take multiple samples from the central ((2/3)R) and marginal regions ((1/3)R) of simulated tumours with radius R. We record the accumulation of point mutations, compare mutation frequencies above a detection limit and infer mutation rates in each sample. To compare the ITH between samples, we apply the Jaccard index to quantify the mutation composition difference, which has been used to quantify mutation diversity in much genetic data [54–59]. The more mutations shared between two samples, the higher the Jaccard index is and the lower the ITH is.

Our results show that under a given tumour expansion mode (fixed push rate), the ITH increases with the sampling distance, which is consistent with observations in experimental data [51,60]. This pattern holds for various sampling sizes, for random, centre and margin sampling, and for two- or three-dimensional simulations. The faster a tumour expands (larger push rates), the slower the ITH increases along the sampling distance. In addition, we construct the distributions of variant allele frequencies (VAF) and infer the mutation rate for each sample. When push rates are large such as in exponentially expanding tumours, a linear relation is always observed between the number of mutations and accumulative mutation frequencies independent of sampling sizes. Using this linear relation, we can estimate the mutation rates and find that the sampling size strongly impacts the inferred mutation rates. Smaller sampling sizes lead to an overestimate of inferred mutation rates as well as a larger variance among samples. Increasing the sampling size improves the inference with a more accurate mean compared with the true value and smaller variance among samples. When the tumour expansion is slower such as in surface growth, such inference becomes impossible because the linear relationship does not hold. The smaller the sampling size, the larger the VAF deviates from a linear relation.

2. Methods

2.1. Stochastic simulations of tumour growth in space and mutation accumulation

We use an agent-based model to simulate tumour growth [33,34,42,49,61–63] and perform the majority of our simulations in two-dimensional lattice where each cell has eight direct neighbours. Tumours develop from a single cell, which is seeded in the centre of the lattice. When a cell divides into two daughter cells, one cell remains in its original location (L_p), and the other daughter cell is located in a randomly selected empty space among its direct neighbours. If there is no free space, with a probability p (0 ≤ p ≤ 1), a new space can be created by pushing a randomly selected neighbour cell (at location L_d) and all the rest of the cells along the direction of L_p and L_d outwards for one position until an empty space was reached [34]. For two extreme values of the push rate, p = 0 refers to the surface growth where only cells in the outskirt of the tumour would divide and p = 1 refers to an exponential growth where all cells divide in each time step (figure 1). In a comparison, we implement a similar pushing algorithm as in [33,63], where instead of pushing a random neighbour, the nearest empty lattice point is first searched and the cells between the dividing cell and this empty point are pushed along the shortest path. To reduce our computational cost, we also performed the same agent-based model with pushing a random neighbour in three-dimensional lattice in a few parameter sets and compared the ITH patterns from two- and three-dimensional simulations.

Figure 1. — The spatial growth under different push rates. (a) when push rate p = 0, it refers to the surface growth, where only cells on the surface have an empty spot in its direct neighbours and can divide. For other values of the push rate, cells not on the surface can also divide by creating an empty spot among its neighbours through pushing. We allocated a unique ID for each new cell. The larger the cell ID, the later the cell was reproduced. (b) To reach the same tumour size, surface growth takes more generations of reproductions compared with exponential growth under p = 1. While mutation accumulation only happens in cell divisions in our model, for tumours of the same size, the push rate will impact on the mutation burden in tumours and also the spatial distribution of those mutations. The shadowed area is 100 simulations under the corresponding push rate, where their averages are shown as solid lines. (Figure 1b is 100 times simulation, cell number is around 2¹⁵, p = 1 grow to 2¹⁵ cells only 15 generations. When p = 0 divide 100 generations can grow approx. 27 500 cells.)

In each cell division, random point mutations can happen in both daughter cells compared with the parent cell. The number of new mutations in each daughter cell follows a Poisson distribution [6,23,26,33,64], where λ is the mean value representing the average mutations (mutation rate) per cell division. Thus, the probability to have k new mutations in one daughter cell is λ^k e^−λ/k!. If not specified, we use λ = 10, which is conventionally considered as the average number of point mutations per cell division in tumours [26,65,66].

2.2. Spatial sampling

To understand how mutations spread over space, we test two different spatial sampling methods, i.e. random sampling and centre-margin sampling. For the random sampling method, we sample 500 locations randomly in each simulated tumour and collect cells in a rectangular area around these locations (figure S1a in electronic supplementary material). To investigate the impact of sampling sizes on our diversity measures, we vary sample size from 100 to 3600 cells (around 0.6–21.6% of the whole tumour). Alternatively, we divide simulated tumours into the central region (a circle of two-thirds of the radius from the tumour centre) and the marginal region (the rest one-third ring structure). We randomly sample 500 rectangular areas with 100 cells in each sample (around 0.6% of the whole tumour) in the margin and 500 samples with 100 cells in the centre region (figure S1b in electronic supplementary material).

2.3. Measurements of intra-tumour heterogeneity between samples

We use a statistical measurement, the Jaccard index, to compare the diversity between samples. Supposing A and B are the set of mutations in two samples, the similarity of mutations between the two samples is given by

J (A, B) = \frac{| A \cap B |}{| A \cup B |} = \frac{| A \cap B |}{| A | + | B | - | A \cap B |},

2.1

where $A \cap B$ is the intersection of mutations between these two samples and $A \cup B$ is their union. Thus, by definition the Jaccard index will be between 0 and 1. By using the Jaccard index, we can quantify how similar mutations are accumulated in samples over space. More specifically, we quantify how the Jaccard index is related to the sampling distance, which is measured by the Euclidean distance between the centres of samples.

2.4. Mutation rate inference and Kolmogorov–Smirnov test

Another classical measurement for patterns of mutation accumulation in population genetics is the frequency distribution of mutations among tumour cells in each sample, which is called variant allele frequency (VAF) distribution in cancer research. We are interested in how different growth dynamics, such as push rates as well as spatial sampling, impact on the measured VAF distributions. For simplicity, we simulate only one driver event where genetic changes lead to the initiation of a single tumour cell, which seeds for the tumour growth and the accumulation of random neutral mutations during the tumour expansion. In this scenario, the accumulated mutations follow a theoretical expectation of power-law decay, where the number of mutations of a given frequency decrease along with the mutation frequency f [23]. The cumulative distribution is a linear relation as M(f) = (μ/β)((1/f) − (1/f_max)), and the slope could determine the effective mutation rate μ/β (the true mutation rate scaled by death). We constructed the cumulative VAFs for all samples and compared how the push rates, sampling methods and sizes would impact on this measurement. We quantify how much the observed cumulative agrees with a linear regression by Kolmogorov–Smirnov (KS) test [67], which measures the maximum distance of the linear regression with the observed cumulative VAF curve. If the observed cumulative VAFs agree with linear regression, we infer the mutation rates using the equation above.

3. Results

3.1. Spatial mixing increases and variance of mutation frequencies decreases with push rates

When the push rate is low (p = 0), cells grow slowly and mainly on the surface. From spatial patterns constructed by the cell ID (figure 1a), where the cell born later is assigned a larger ID number, we observed clear circular boundaries among early and later born cells. However, when the push rate increases, these spatial boundaries become loose. Instead, spatial mixing among early and later born cells appears. Similar effects are observed in the spatial pattern of mutations. In figure 2b, we demonstrate the spatial pattern of four different mutations randomly picked up from four cells born in the second generation. When p = 0, clear boundaries among cells carrying those mutations exist. With the increase in push rate, cells are carrying different mutations mix in space.

The earlier a mutation arises during the tumour growth, the higher frequency it reaches in the final tumour (figure 2c). This pattern is consistent under all push rates. Arising at the same tumour generation, the mean value of the frequency these mutations can reach in the final tumour is independent of the push rates. However, the variance increases monotonically when the push rate decreases.

3.2. The growth modes have a strong impact on the variant allele frequencies distribution and mutation rate inference

Next, we construct the VAF of all mutations accumulated through tumour growth. Mutations with the frequency of less than 0.01 in the final tumour are discarded, as in reality, it is hard to detect such lower frequency mutations in a standard sequencing depth. Figure 3a,b shows examples of single simulations under two boundaries of push rates, p = 0 and p = 1. Without mimicking sequencing noise in our simulation, when push rate p = 1, the VAF distribution is discrete with mutations at frequencies 0.5, 0.25, 0.125, 0.0625 and so on (figure 3 inset). The cumulative VAF distribution is a linear line and perfectly agrees with the theoretical expectation under neutral selection. On the contrary, when only surface growth is allowed under p = 0, there is a wide and relatively continuous VAF distribution covering all intermediate frequencies and even with few mutations exceeding 0.5 (figure 3b inset), which is the frequency of most clonal mutations in diploid populations. The cumulative VAF distribution under surface growth strongly deviates from a linear relation, which is quantified by the KS distance. The larger the KS distance, the further away the cumulative VAF distribution deviates from a linear regression.

We measure the KS distance for different push rates and sampling sizes under random sampling. Note, to eliminate the extremely low-frequency mutations (less than 0.01), we discard the last few dots in the cumulative VAF distributions for the linear regression (e.g. the last three dots in figure 3a,b). We found that the push rates have a strong impact on this measurement, where the KS distance keeps a relatively high level when push rates are small (figure 3d and figure S4a–c in electronic supplementary material). This means that mutation rate inferences based on a linear regression are not reliable under small p. The KS distance decreases when p becomes larger (figure S4d,e in electronic supplementary material), and a linear regression is reasonable across all sampling sizes (figure S2d,e in electronic supplementary material). Thus, we can infer the mutation rates based on the slope of the linear regression under large p.

Figure 3c shows the fitting results of 100 simulations with 500 random sampling each under p = 1. While increasing the sampling size, the inferred mutation rate is closer to the true value with reduced variance. On the contrary, although we also observe a close-to-linear relation under small p (figure S2a in electronic supplementary material), the inferred mutation rate is often an overestimate compared with the true value (figure 3c, dashed line). In addition, the smaller the sampling size is, the large the variance of the mutation rate is.

3.3. Intra-tumour heterogeneity increases with the sampling distance

In each simulation, we first randomly sample 500 areas with various sampling sizes. We measure the ITH between spatially non-overlapping samples and quantify how this heterogeneity changes with spatial distances. We compare the samples pairwise to calculate the Jaccard index, which is inversely proportional to intra-tumour heterogeneity. Meanwhile, the spatial distances between samples are defined by the Euclidean distance between the central points of each sample. For various sampling sizes and push rates, the Jaccard index between two random samples decreases, thus the ITH increases, monotonically with the spatial distances (figure 4, figure S5 in electronic supplementary material). When the push rate p = 0 (surface growth), the Jaccard index drops down to 0 very fast (figure 4a), where the non-overlapping samples have fewer and fewer shared mutations when the sampling distance increases. When p increases, given the same sampling size and distance, the Jaccard index increases. This agrees with the observation of spatial mixing of cells carrying different mutations. When p = 1, cell spatial mixing reaches the highest level, and we seldom observe any Jaccard index as 0 even under the smallest sampling size and largest sampling distance, and there are always shared mutations among those samples. While the results in figure 4 are based on single simulations, 249 500 Jaccard indexes are calculated after the pairwise combination of 500 samples in a single simulation. Such a large number gives a stable pattern, which is consistent with results over 100 simulations (figure S6 in electronic supplementary material). In addition, an alternative pushing algorithm with the shortest path to the nearest empty spot gives qualitatively similar results (figure S8 in electronic supplementary material). However, in this pushing algorithm, compared with the dividing cell the nearest empty spot in a lattice is more likely to be at the same side of the tumour, which leads to more clustering of cells carrying the same genetic mutations, compared with the case when pushing happens in a random direction. Consequently, the Jaccard index in this alternative pushing algorithm is lower under the same parameter set (figure S8 in electronic supplementary material).

Figure 4. — The mutation divergence over space. Pairwise comparison of mutation divergences between samples are measured by Jaccard index. As the sampling distance increases, fewer mutations are shared between the samples, and the smaller the Jaccard index is. (a) p = 0. (b) p = 0.125. (c) p = 1. This holds for any push rates and any sampling size. For the surface growth (a), the Jaccard index decreases faster to 0 when the sampling distance increases compared with exponential growth (c). This effect is stronger when the sampling size is smaller (e.g. orange dots compared with purple dots). (1-time simulation, sampling 500 points for each sample size, cell number is approx. 2¹⁴, cut last three points.)

To understand the impact of sampling methods, we divided the simulated tumours into the central region (2/3 R circle) and margin region (1/3 R ring width), where R is the tumour radius (figure S1b in electronic supplementary material). We randomly sampled 500 areas (100 cells, 196 cells, 400 cells) in the margin and centre, respectively (figure S7 in electronic supplementary material). Then, we compared the relationships of the Jaccard index and spatial distance between samples in the margin and the centre region under the different push rates. The patterns are very similar to those observed in completely random sampling. The Jaccard index decreases with the increase of the sampling distance, and the push rates lead to a higher Jaccard index under the same sampling size. In addition, we see the Jaccard index is slightly higher between samples in the central region compared with the margin region. This is more obvious when push rates are small, where the spatial constraint is stronger and thus less mutations are shared between spatially non-overlapping samples. In summary, the two sampling methods do not alter the pattern of how ITH increases with the sampling distance qualitatively with increasing p (less spatial constraints). However, there is a small quantitative difference if we sample in the margin or centre of tumours.

3.4. Patterns of mutant divergence remain similarly between two- and three-dimensional models

The majority of our simulations are based on two-dimensional lattice to reduce the computational cost of simulating tumours with large sizes and sampling distances, which results in a limitation as solid tumours are often three-dimensional. To explore this, we extend our agent-based model from two to three dimensions in some parameter sets (figure S9 in electronic supplementary material). We model the tumour growth up to 10⁵ cells in three-dimensional lattice and compare the Jaccard index between two- and three-dimensional simulations. In general, we observe a qualitatively similar pattern of Jaccard index over sampling distance and sampling sizes in three-dimensional compared with two-dimensional models. Interestingly, the diameter of a sample, which is also the shortest sampling distance of non-overlapping samples, determines a quantitatively comparable pattern of the Jaccard index (figure 5a–c). For a similar sampling size under two- and three-dimensional simulations, the Jaccard index is in general higher in two-dimensional compared with three-dimensional simulations even under the same sampling distance (figure 5d–f). This might be due to the fact that it takes more cell divisions to reach the same physical distance in three-dimensional compared with two-dimensional simulations, thus higher intra-tumour heterogeneity between samples.

Figure 5. — Comparison between the Jaccard index distribution between two- and three-dimensional simulations. Here, d refers to the diameter of a sample, where for the same value of d, there are more tumour cells in the samples in three-dimensional simulations compared with two-dimensional simulations. Meanwhile, the diameter of a sample is also the shortest sampling distance of two non-overlapping samples. (a–c) Compare the Jaccard distribution under the same sampling diameters between two- and three-dimensional simulations. (d–f) Compare the Jaccard distribution under a similar sampling size between two- and three-dimensional simulations. (1-time simulation under p = 0, sampling 500 points for each sample diameter or size, cell number is approx. 214 in the two-dimensional simulation and 105 in the three-dimensional simulation, cut last three points) were compared with the same sampling distance and the same number of sampling cells, respectively.

3.5. Pairwise intra-tumour heterogeneity increases with sampling distances in patients

We have demonstrated that the relationship between Jaccard index and sampling distance is qualitatively stable across two- and three-dimensional models as well as different sampling methods. We see a similar pattern in a study of colorectal tumours [60], where the pairwise comparison between samples from the same patient are available through whole exome sequencing data (see their main figure 4). More specifically, multi-region sampling ranging from 4 to 13 samples per tumour were taken for 10 carcinomas. The divergent exonic point mutations (their x-axis) between samples increases with physical distance (their y-axis), which is consistent with our results. In Ling et al. [51], 300 biopsies were collected from a hepatocellular carcinoma, 23 of which are whole exome sequencing samples. We extract the information of sampling distances and exonic mutations from their work, and construct the Jaccard index of these 23 samples pairwisely over their physical distance (see figure 6). In our simulations, we use sub-clonal mutations to calculate Jaccard index, because including clonal mutations shared by all cells does not add any real value but only shrinks the Jaccard index into a small range, thus with a lower resolution to distinguish patterns. Correspondingly, when we calculate the Jaccard index in these patient biopsies, we exclude the clonal mutations as well. While there might be some noise in the real data, where a clonal mutation rarely presents exactly in all samples, we compared two filters where mutations are excluded if they are present in more than half of or in all biopsies. We see a similar pattern with negative correlation between Jaccard index and sampling distance when we filter out the shared mutations in half of the samples.

Figure 6. — The pairwise Jaccard index between 23 samples with whole exome sequencing data in Ling *et al.* [51]. Panel (a) is modified from Ling *et al.* [51], where we normalize sampling distances between 0 and 100. The largest distance 100 refers to the diameter of the whole patient tissue. The 23 biopsy samples used in our analysis are labelled with red circles in Ling *et al.* [51], which are the samples with exome sequencing. (*b,c*), In our simulation, we used sub-clonal mutations, i.e. mutations do not present in all cells, to calculate Jaccard index. Here we filter out the clonal mutations from the data as well. As the tumour purity in real samples seldom reaches to 100%, the clonal mutations might not present in all samples. Here, we show (b) the Jaccard distribution after filtering the mutations present in half of all biopsies (Pearson correlation coefficient = −0.309, p-value = 1.69 × 10⁻⁶) and also (c) the Jaccard distribution after filtering the mutations present in all biopsies (Pearson correlation coefficient = 0.094, p-value = 0.152).

4. Discussion

We developed a computational model that tracked the dynamic movement of each cell and variation divergence, which revealed the relationship of spatial heterogeneity distribution with sampling size and tumour expansion modes. We used push rates to model slow and fast growth modes, where small push rates refer to surface growth and large push rates to exponential expansion without spatial constraints. We implemented two alternative pushing algorithms, where pushing happens in a random direction or towards the nearest empty spot. Furthermore, we recorded the mutation accumulation during all growth modes and applied different sampling methods, i.e. completely random sampling and margin-centre sampling, with various sampling sizes.

Under the surface growth (small push rates), the accumulation of mutations is concentrated in a continuous space, and mutations arising in different original cells can form clear boundaries in space. When the push rate increases, the mutations become more spatially dispersed, which agrees with the conclusion of Chkhaidze et al. [34] in simulating driver mutations under the action of pushing. We further showed that small push rates introduce high stochasticity in the system. While the final frequency that an early mutation can reach in small push rates has the same mean expectation compared with large push rates, the variance is much higher. We showed the cumulative VAF distribution follows a linear relationship with 1/frequency for sufficiently high push rates independent of sampling size. We can infer mutation rates based on the method proposed in Williams et al. [23]. However, a small sampling size will overestimate the mutation rates compared with the true value in our simulations. When the push rate decreases, the mutation cumulative VAF distributions deviate largely from a linear relation, and such a mutation rate inference becomes less meaningful. This is consistent with Chkhaidze et al. [34], where they also showed that the inference of mutation rates becomes less accurate under stronger spatial constraints. While they mainly focused on the patterns of phylogenetic trees under spatial constraints with random punch or needle sampling, we are interested in further quantifying the relation between ITH and sampling distance, which can be served as a baseline expectation of how mutation accumulation diverges when space competition and constraint is the main driving force of the ITH.

While mutation heterogeneity can reveal a tumour’s life history [68] and the patterns of ITH in space are important to understand and improve tumour treatments, systematic studies to quantify those properties are still rare [34,39]. We used the Jaccard index to quantify the mutation heterogeneity between samples and analyse how sampling distance, methods and sizes will impact on the ITH spatial pattern. Our results show that the sampling distance will quickly increase the ITH between samples when push rates are small, which is emphasized if the sampling size is also smaller. On the contrary, high push rates will always maintain a certain level of mutation spatial mixing even with a small sampling size. These results agree with some observations in clinical data. Gates et al. [69] sequenced primary glioma biopsy samples at defined distances and recorded Jaccard indices over sample distance. Their data showed that the genetic heterogeneity increased with the sample spatial distance. Similar observations were reported in Cross et al. [60], where the number of distinct exonic point mutations increase with the physical sample distance in colorectal tumours. We constructed the Jaccard index distribution of 23 biopsies from [51] over the relative sample distance and further confirmed our conclusion in patient samples. To test the robustness of this observation, we applied margin-centre sampling to compare with completely random sampling. Both the ITH patterns in the centre and margin regions are consistent with our results under completely random sampling, with the centre region revealing a lower ITH compared with the margin region. In addition, our results under three-dimensional simulations remain similar to our two-dimensional results with some quantitative deviations. In a summary, our model gives a baseline expectation of mutation divergence when only space competition is considered, which is a stable pattern independent of the sampling methods and the ecological growth mode of a tumour. Deviation from this may reflect other important mechanisms, e.g. the interactions with tumour microenvironment, which may break the mutation divergence pattern over space.

Our model provides a quantitative analysis of how growth modes, sampling distance and size impact on the measurements of intra-tumour heterogeneity. Those results confirm the importance of obtaining spatial information in understanding tumour evolution, as well as the possible deviation of estimated evolutionary properties such as mutation rates introduced by sampling details.

Contributor Information

Di Gu, Email: sveong@163.com.

Benjamin Werner, Email: b.werner@qmul.ac.uk.

Weini Huang, Email: weini.huang@qmul.ac.uk.

Ethics

This work did not require ethical approval from a human subject or animal welfare committee.

Data accessibility

All R code and Python code of full pipeline descriptions and settings used for mapping are available from the GitHub repository: https://github.com/SYSU-BioEvoLab/Spatial_Heterogeneity [70].

Supplementary material is available online [71].

Declaration of AI use

We have not used AI-assisted technologies in creating this article.

Authors' contributions

H.L.: formal analysis, investigation, methodology, software, validation, visualization, writing—original draft, writing—review and editing; Z.Y.: Formal analysis, investigation, software, visualization; F.T., L.D., Y.H., X.F. and L.W.: software; D.G.: conceptualization, data curation; B.W.: conceptualization, formal analysis, methodology, supervision, writing—review and editing; W.H.: conceptualization, formal analysis, funding acquisition, methodology, project administration, supervision, writing—original draft, writing—review and editing.

All authors gave final approval for publication and agreed to be held accountable for the work performed therein.

Conflict of interest declaration

We declare we have no competing interests.

Funding

H.L. was funded by the China Scholarship Council and Sun Yat-sen University between 2020 and 2021. W.H. was funded by (grant no. NSFC 32170624). B.W. was funded by the UKRI future leader fellowship and Barts Charity Lectureship (grant no. MGU045). G.D. was funded by The First Affiliated Hospital of Guangzhou Medical University.

References

1.Martincorena I, Campbell PJ. 2015. Somatic mutation in cancer and normal cells. Science 349, 1483-1489. ( 10.1126/science.aab4082) [DOI] [PubMed] [Google Scholar]
2.Martincorena I, et al. 2018. Somatic mutant clones colonize the human esophagus with age. Science 362, 911-917. ( 10.1126/science.aau3879) [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Abascal F, et al. 2021. Somatic mutation landscapes at single-molecule resolution. Nature 593, 405-410. ( 10.1038/s41586-021-03477-4) [DOI] [PubMed] [Google Scholar]
4.Greaves M, Maley CC. 2012. Clonal evolution in cancer. Nature 481, 306-313. ( 10.1038/nature10762) [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Greaves M, Maley CC. 2015. Evolutionary determinants of cancer. Cancer Discovery 5, 806-820. ( 10.1158/2159-8290.CD-15-0439) [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Bozic I, et al. 2010. Accumulation of driver and passenger mutations during tumor progression. Proc. Natl Acad. Sci. USA 107, 18 545-18 550. ( 10.1073/pnas.1010978107) [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Burrell RA, McGranahan N, Bartek J, Swanton C. 2013. The causes and consequences of genetic heterogeneity in cancer evolution. Nature 501, 338-345. ( 10.1038/nature12625) [DOI] [PubMed] [Google Scholar]
8.McGranahan N, Swanton C. 2014. Biological and therapeutic impact of intratumor heterogeneity in cancer evolution. Cell 27, 15-26. ( 10.1016/j.ccell.2014.12.001) [DOI] [PubMed] [Google Scholar]
9.Sun X, Yu Q. 2015. Intra-tumor heterogeneity of cancer cells and its implications for cancer treatment. Acta Pharmacol. Sin. 36, 1219-1227. ( 10.1038/aps.2015.92) [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Turajlic S, Sottoriva A, Graham T, Swanton C. 2019. Resolving genetic heterogeneity in cancer. Nat. Rev. Genet. 20, 404-416. ( 10.1038/s41576-019-0114-6) [DOI] [PubMed] [Google Scholar]
11.Wodarz D, Komarova NL. 2005. Evolutionary dynamics of drug resistance in cancer. Gene Therapy Mol. Biol. 9, 247-256. [Google Scholar]
12.Iwasa Y, Nowak MA, Michor F. 2006. Evolution of resistance during clonal expansion. Genetics 172, 2557-2566. ( 10.1534/genetics.105.049791) [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Durrett R, Moseley S. 2010. Evolution of resistance and progression to disease during clonal expansion of cancer. Theor. Popul. Biol. 77, 42-48. ( 10.1016/j.tpb.2009.10.008) [DOI] [PubMed] [Google Scholar]
14.Tomasetti C. 2012. On the probability of random genetic mutations for various types of tumor growth. Bull. Math. Biol. 74, 1379-1395. ( 10.1007/s11538-012-9717-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Turner NC, Reis-Filho JS. 2012. Genetic heterogeneity and cancer drug resistance. Lancet Oncol. 13, e178-85. ( 10.1016/S1470-2045(11)70335-7) [DOI] [PubMed] [Google Scholar]
16.Foo J, Michor F. 2014. Evolution of acquired resistance to anti-cancer therapy. J. Theor. Biol. 355, 10-20. ( 10.1016/j.jtbi.2014.02.025) [DOI] [PMC free article] [PubMed] [Google Scholar]
17.de Bruin EC, et al. 2014. Spatial and temporal diversity in genomic instability processes defines lung cancer evolution. Science 346, 251-2566. ( 10.1126/science.1253462) [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Seferbekova Z, Lomakin A, Yates LR, Gerstung M. 2023. Spatial biology of cancer evolution. Nature 24, 295-313. ( 10.1038/s41576-022-00553-x) [DOI] [PubMed] [Google Scholar]
19.Marusyk A, Almendro V, Polyak K. 2012. Intra-tumour heterogeneity: a looking glass for cancer? Nature 12, 323-334. ( 10.1038/nrc3261) [DOI] [PubMed] [Google Scholar]
20.Sottoriva A, et al. 2015. A Big Bang model of human colorectal tumor growth. Nat. Genet. 47, 209-216. ( 10.1038/ng.3214) [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Durrett R. 2013. Population genetics of neutral mutations in exponentially growing cancer cell populations. Ann. Appl. Probab. 23, 230-250. ( 10.1214/11-AAP824) [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Kessler DA, Levine H. 2013. Large population solution of the stochastic Luria–Delbrück evolution model. Proc. Natl Acad. Sci. USA 110, 11 682-11 687. ( 10.1073/pnas.1309667110) [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Williams MJ, Werner B, Barnes CP, Graham TA, Sottoriva A. 2016. Identification of neutral tumor evolution across cancer types. Nat. Genet. 48, 238-244. ( 10.1038/ng.3489) [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Park SC, Krug J. 2007. Clonal interference in large populations. Proc. Natl Acad. Sci. USA 104, 18 135-18 140. ( 10.1073/pnas.0705778104) [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Karlsson K, et al. 2023. Deterministic evolution and stringent selection during preneoplasia. Nature 618, 383-393. ( 10.1038/s41586-023-06102-8) [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Williams MJ, Werner B, Heide T, Curtis C, Barnes CP, Sottoriva A, Graham TA. 2018. Quantification of subclonal selection in cancer from bulk sequencing data. Nat. Genet. 50, 895-903. ( 10.1038/s41588-018-0128-6) [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Gerlinger M, et al. 2012. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N Engl. J. Med. 366, 883-892. ( 10.1056/NEJMoa1113205) [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Bashashati A, et al. 2013. Distinct evolutionary trajectories of primary high-grade serous ovarian cancers revealed through spatial mutational profiling. J. Pathol. 231, 21-34. ( 10.1002/path.4230) [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Zhang J, et al. 2014. Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing. Science 346, 256-259. ( 10.1126/science.1256930) [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Blokzijl F, et al. 2016. Tissue-specific mutation accumulation in human adult stem cells during life. Nature 538, 260-264. ( 10.1038/nature19768) [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Ryser MD, Min BH, Siegmund KD, Shibata D. 2018. Spatial mutation patterns as markers of early colorectal tumor cell mobility. Proc. Natl Acad. Sci. USA 115, 5774-5779. ( 10.1073/pnas.1716552115) [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Joshi K, et al. 2019. Spatial heterogeneity of the T cell receptor repertoire reflects the mutational landscape in lung cancer. Nat. Med. 25, 1549-1559. ( 10.1038/s41591-019-0592-2) [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Waclaw B, Bozic I, Pittman ME, Hruban RH, Vogelstein B, Nowak MA. 2015. A spatial model predicts that dispersal and cell turnover limit intratumour heterogeneity. Nature 525, 261-264. ( 10.1038/nature14971) [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Chkhaidze K, Heide T, Werner B, Williams MJ, Huang W, Caravagna G, Graham TA, Sottoriva A. 2019. Spatially constrained tumour growth affects the patterns of clonal selection and neutral drift in cancer genomic data. PLoS Comput. Biol. 15, e1007243. ( 10.1371/journal.pcbi.1007243) [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Werner B, et al. 2020. Measuring single cell divisions in human tissues from multi-region sequencing data. Nat. Commun. 11, 1035. ( 10.1038/s41467-020-14844-6) [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Noble R, Burri D, Le Sueur C, Lemant J, Viossat Y, Kather JN, Beerenwinkel N. 2022. Spatial structure governs the mode of tumour evolution. Nat. Ecol. Evol. 6, 207-217. ( 10.1038/s41559-021-01615-9) [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Schenck RO, Kim E, Bravo RR, West J, Leedham S, Shibata D, Anderson ARA. 2022. How homeostasis limits keratinocyte evolution. Proc. Natl Acad. Sci. USA 119, e2006487119. ( 10.1073/pnas.2006487119) [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Lenos KJ, et al. 2018. Stem cell functionality is microenvironmentally defined during tumour expansion and therapy response in colon cancer. Nat. Cell Biol. 20, 1193-1202. ( 10.1038/s41556-018-0179-z) [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Fu X, et al. 2022. Spatial patterns of tumour growth impact clonal diversification in a computational model and the TRACERx Renal study. Nat. Ecol. Evol. 6, 88-102. ( 10.1038/s41559-021-01586-x) [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Gallaher JA, Enriquez-Navas PM, Luddy KA, Gatenby RA, Anderson AR. 2018. Spatial heterogeneity and evolutionary dynamics modulate time to recurrence in continuous and adaptive cancer therapies. Cancer Res. 78, 2127-2139. ( 10.1158/0008-5472.CAN-17-2649) [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Bissell MJ, Radisky D. 2001. Putting tumours in context. Nat. Rev. Cancer 1, 46-54. ( 10.1038/35094059) [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Anderson AR, Weaver AM, Cummings PT, Quaranta V. 2006. Tumor morphology and phenotypic evolution driven by selective pressure from the microenvironment. Cell 127, 905-915. ( 10.1016/j.cell.2006.09.042) [DOI] [PubMed] [Google Scholar]
43.Gerlee P, Anderson AR. 2007. An evolutionary hybrid cellular automaton model of solid tumour growth. J. Theor. Biol. 246, 583-603. ( 10.1016/j.jtbi.2007.01.027) [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Basanta D, Strand DW, Lukner RB, Franco OE, Cliffel DE, Ayala GE, Hayward SW, Anderson AR. 2009. The role of transforming growth factor-beta-mediated tumor-stroma interactions in prostate cancer progression: an integrative approach. Cancer Res. 69, 7111-7120. ( 10.1158/0008-5472.CAN-08-3957) [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Scott JG, Hjelmeland AB, Chinnaiyan P, Anderson ARA, Basanta D. 2014. Microenvironmental variables must influence intrinsic phenotypic parameters of cancer stem cells to affect tumourigenicity. PLoS Comput. Biol. 10, 1-7. ( 10.1371/journal.pcbi.1003433) [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Araujo A, Cook LM, Lynch CC, Basanta D. 2014. An integrated computational model of the bone microenvironment in bone-metastatic prostate cancer. Cancer Res. 74, 2391-2401. ( 10.1158/0008-5472.CAN-13-2652) [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Gallaher JA, et al. 2020. From cells to tissue: how cell scale heterogeneity impacts glioblastoma growth and treatment response. PLoS Comput. Biol. 16, 1-27. ( 10.1371/journal.pcbi.1007672) [DOI] [PMC free article] [PubMed] [Google Scholar]
48.West J, Schenck RO, Gatenbee C, Robertson-Tessi M, Anderson AR. 2021. Normal tissue architecture determines the evolutionary course of cancer. Nat. Commun. 12, 1-9. ( 10.1038/s41467-021-22123-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Sottoriva A, Barnes CP, Graham TA. 2017. Catch my drift? Making sense of genomic intra-tumour heterogeneity. Bioch. et Biophys. Acta (BBA)-Rev. Cancer 1867, 95-100. ( 10.1016/j.bbcan.2016.12.003) [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Opasic L, Zhou D, Werner B, Dingli D, Traulsen A. 2019. How many samples are needed to infer truly clonal mutations from heterogenous tumours? BMC Cancer 19, 1-11. ( 10.1186/s12885-019-5597-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Ling S, et al. 2015. Extremely high genetic diversity in a single tumor points to prevalence of non-Darwinian cell evolution. Proc. Natl Acad. Sci. USA 112, E6496-E6505. ( 10.1073/pnas.1519556112) [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Masugi Y, et al. 2019. Characterization of spatial distribution of tumor-infiltrating CD8⁺ T cells refines their prognostic utility for pancreatic cancer survival. Mod. Pathol. 32, 1495-1507. ( 10.1038/s41379-019-0291-z) [DOI] [PubMed] [Google Scholar]
53.Zhao Y, et al. 2021. Selection of metastasis competent subclones in the tumour interior. Nat. Ecol. Evol. 5, 1033-1045. ( 10.1038/s41559-021-01456-6) [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Alioto TS, et al. 2015. A comprehensive assessment of somatic mutation detection in cancer using whole-genome sequencing. Nat. Commun. 6, 1-13. ( 10.1038/ncomms10001) [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Silva GO, Siegel MB, Mose LE, Parker JS, Sun W, Perou CM, Chen M. 2017. SynthEx: a synthetic-normal-based DNA sequencing tool for copy number alteration detection and tumor heterogeneity profiling. Genome Biol. 18, 1-14. ( 10.1186/s13059-017-1193-3) [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Araf S, et al. 2018. Genomic profiling reveals spatial intra-tumor heterogeneity in follicular lymphoma. Leukemia 32, 1261-1265. ( 10.1038/s41375-018-0043-y) [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Gendoo DM, et al. 2019. Whole genomes define concordance of matched primary, xenograft, and organoid models of pancreas cancer. PLoS Comput. Biol. 15, e1006596. ( 10.1371/journal.pcbi.1006596) [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Yan T, et al. 2019. Multi-region sequencing unveils novel actionable targets and spatial heterogeneity in esophageal squamous cell carcinoma. Nat. Commun. 10, 1-15. ( 10.1038/s41467-019-09255-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Pereira B, et al. 2021. Cell-free DNA captures tumor heterogeneity and driver alterations in rapid autopsies with pre-treated metastatic cancer. Nat. Commun. 12, 1-13. ( 10.1038/s41467-021-23394-4) [DOI] [PMC free article] [PubMed] [Google Scholar]
60.Cross W, et al. 2018. The evolutionary landscape of colorectal tumorigenesis. Nat. Ecol. Evol. 2, 1661-1672. ( 10.1038/s41559-018-0642-z) [DOI] [PMC free article] [PubMed] [Google Scholar]
61.Metzcar J, Wang Y, Heiland R, Macklin P. 2019. A review of cell-based computational modeling in cancer biology. JCO Clin. Cancer Inform. 2, 1-13. ( 10.1200/CCI.18.00069) [DOI] [PMC free article] [PubMed] [Google Scholar]
62.West J, Robertson-Tessi M, Anderson ARA. 2022. Agent-based methods facilitate integrative science in cancer. Trends Cell Biol. 33, 300-311. ( 10.1016/j.tcb.2022.10.006) [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Haughey MJ, Bassolas A, Sousa S, Baker AM, Graham TA, Nicosia V, Huang W. 2023. First passage time analysis of spatial mutation patterns reveals sub-clonal evolutionary dynamics in colorectal cancer. PLoS Comput. Biol. 19, 1-34. ( 10.1371/journal.pcbi.1010952) [DOI] [PMC free article] [PubMed] [Google Scholar]
64.Clarke R. 1946. An application of the Poisson distribution. J. Inst. Actuar. 72, 481-481. ( 10.1017/S0020268100035435) [DOI] [Google Scholar]
65.Frigola J, Sabarinathan R, Mularoni L, Muiños F, Gonzalez-Perez A, López-Bigas N. 2017. Reduced mutation rate in exons due to differential mismatch repair. Nat. Genet. 49, 1684-1692. ( 10.1038/ng.3991) [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Carlson J, et al. 2018. Extremely rare variants reveal patterns of germline mutation rate heterogeneity in humans. Nat. Commun. 9, 1-13. ( 10.1038/s41467-018-05936-5) [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Peacock JA. 1983. Two-dimensional goodness-of-fit testing in astronomy. Mon. Not. R. Astron. Soc. 202, 615-627. ( 10.1093/mnras/202.3.615) [DOI] [Google Scholar]
68.McGranahan N, Swanton C. 2017. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168, 613-628. ( 10.1016/j.cell.2017.01.018) [DOI] [PubMed] [Google Scholar]
69.Gates ED, et al. 2019. Spatial distance correlates with genetic distance in diffuse glioma. Front. Oncol. 9, 676. ( 10.3389/fonc.2019.00676) [DOI] [PMC free article] [PubMed] [Google Scholar]
70.Li H, et al. 2023. Data from: Mutation divergence over space in tumour expansion. GitHub repository. (https://github.com/SYSU-BioEvoLab/Spatial_Heterogeneity) [DOI] [PMC free article] [PubMed] [Google Scholar]
71.Li H, et al. 2023. Mutation divergence over space in tumour expansion. Figshare. ( 10.6084/m9.figshare.c.6926943) [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All R code and Python code of full pipeline descriptions and settings used for mapping are available from the GitHub repository: https://github.com/SYSU-BioEvoLab/Spatial_Heterogeneity [70].

Supplementary material is available online [71].

[RSIF20230542C1] 1.Martincorena I, Campbell PJ. 2015. Somatic mutation in cancer and normal cells. Science 349, 1483-1489. ( 10.1126/science.aab4082) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C2] 2.Martincorena I, et al. 2018. Somatic mutant clones colonize the human esophagus with age. Science 362, 911-917. ( 10.1126/science.aau3879) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C3] 3.Abascal F, et al. 2021. Somatic mutation landscapes at single-molecule resolution. Nature 593, 405-410. ( 10.1038/s41586-021-03477-4) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C4] 4.Greaves M, Maley CC. 2012. Clonal evolution in cancer. Nature 481, 306-313. ( 10.1038/nature10762) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C5] 5.Greaves M, Maley CC. 2015. Evolutionary determinants of cancer. Cancer Discovery 5, 806-820. ( 10.1158/2159-8290.CD-15-0439) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C6] 6.Bozic I, et al. 2010. Accumulation of driver and passenger mutations during tumor progression. Proc. Natl Acad. Sci. USA 107, 18 545-18 550. ( 10.1073/pnas.1010978107) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C7] 7.Burrell RA, McGranahan N, Bartek J, Swanton C. 2013. The causes and consequences of genetic heterogeneity in cancer evolution. Nature 501, 338-345. ( 10.1038/nature12625) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C8] 8.McGranahan N, Swanton C. 2014. Biological and therapeutic impact of intratumor heterogeneity in cancer evolution. Cell 27, 15-26. ( 10.1016/j.ccell.2014.12.001) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C9] 9.Sun X, Yu Q. 2015. Intra-tumor heterogeneity of cancer cells and its implications for cancer treatment. Acta Pharmacol. Sin. 36, 1219-1227. ( 10.1038/aps.2015.92) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C10] 10.Turajlic S, Sottoriva A, Graham T, Swanton C. 2019. Resolving genetic heterogeneity in cancer. Nat. Rev. Genet. 20, 404-416. ( 10.1038/s41576-019-0114-6) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C11] 11.Wodarz D, Komarova NL. 2005. Evolutionary dynamics of drug resistance in cancer. Gene Therapy Mol. Biol. 9, 247-256. [Google Scholar]

[RSIF20230542C12] 12.Iwasa Y, Nowak MA, Michor F. 2006. Evolution of resistance during clonal expansion. Genetics 172, 2557-2566. ( 10.1534/genetics.105.049791) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C13] 13.Durrett R, Moseley S. 2010. Evolution of resistance and progression to disease during clonal expansion of cancer. Theor. Popul. Biol. 77, 42-48. ( 10.1016/j.tpb.2009.10.008) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C14] 14.Tomasetti C. 2012. On the probability of random genetic mutations for various types of tumor growth. Bull. Math. Biol. 74, 1379-1395. ( 10.1007/s11538-012-9717-1) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C15] 15.Turner NC, Reis-Filho JS. 2012. Genetic heterogeneity and cancer drug resistance. Lancet Oncol. 13, e178-85. ( 10.1016/S1470-2045(11)70335-7) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C16] 16.Foo J, Michor F. 2014. Evolution of acquired resistance to anti-cancer therapy. J. Theor. Biol. 355, 10-20. ( 10.1016/j.jtbi.2014.02.025) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C17] 17.de Bruin EC, et al. 2014. Spatial and temporal diversity in genomic instability processes defines lung cancer evolution. Science 346, 251-2566. ( 10.1126/science.1253462) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C18] 18.Seferbekova Z, Lomakin A, Yates LR, Gerstung M. 2023. Spatial biology of cancer evolution. Nature 24, 295-313. ( 10.1038/s41576-022-00553-x) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C19] 19.Marusyk A, Almendro V, Polyak K. 2012. Intra-tumour heterogeneity: a looking glass for cancer? Nature 12, 323-334. ( 10.1038/nrc3261) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C20] 20.Sottoriva A, et al. 2015. A Big Bang model of human colorectal tumor growth. Nat. Genet. 47, 209-216. ( 10.1038/ng.3214) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C21] 21.Durrett R. 2013. Population genetics of neutral mutations in exponentially growing cancer cell populations. Ann. Appl. Probab. 23, 230-250. ( 10.1214/11-AAP824) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C22] 22.Kessler DA, Levine H. 2013. Large population solution of the stochastic Luria–Delbrück evolution model. Proc. Natl Acad. Sci. USA 110, 11 682-11 687. ( 10.1073/pnas.1309667110) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C23] 23.Williams MJ, Werner B, Barnes CP, Graham TA, Sottoriva A. 2016. Identification of neutral tumor evolution across cancer types. Nat. Genet. 48, 238-244. ( 10.1038/ng.3489) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C24] 24.Park SC, Krug J. 2007. Clonal interference in large populations. Proc. Natl Acad. Sci. USA 104, 18 135-18 140. ( 10.1073/pnas.0705778104) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C25] 25.Karlsson K, et al. 2023. Deterministic evolution and stringent selection during preneoplasia. Nature 618, 383-393. ( 10.1038/s41586-023-06102-8) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C26] 26.Williams MJ, Werner B, Heide T, Curtis C, Barnes CP, Sottoriva A, Graham TA. 2018. Quantification of subclonal selection in cancer from bulk sequencing data. Nat. Genet. 50, 895-903. ( 10.1038/s41588-018-0128-6) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C27] 27.Gerlinger M, et al. 2012. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N Engl. J. Med. 366, 883-892. ( 10.1056/NEJMoa1113205) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C28] 28.Bashashati A, et al. 2013. Distinct evolutionary trajectories of primary high-grade serous ovarian cancers revealed through spatial mutational profiling. J. Pathol. 231, 21-34. ( 10.1002/path.4230) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C29] 29.Zhang J, et al. 2014. Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing. Science 346, 256-259. ( 10.1126/science.1256930) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C30] 30.Blokzijl F, et al. 2016. Tissue-specific mutation accumulation in human adult stem cells during life. Nature 538, 260-264. ( 10.1038/nature19768) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C31] 31.Ryser MD, Min BH, Siegmund KD, Shibata D. 2018. Spatial mutation patterns as markers of early colorectal tumor cell mobility. Proc. Natl Acad. Sci. USA 115, 5774-5779. ( 10.1073/pnas.1716552115) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C32] 32.Joshi K, et al. 2019. Spatial heterogeneity of the T cell receptor repertoire reflects the mutational landscape in lung cancer. Nat. Med. 25, 1549-1559. ( 10.1038/s41591-019-0592-2) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C33] 33.Waclaw B, Bozic I, Pittman ME, Hruban RH, Vogelstein B, Nowak MA. 2015. A spatial model predicts that dispersal and cell turnover limit intratumour heterogeneity. Nature 525, 261-264. ( 10.1038/nature14971) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C34] 34.Chkhaidze K, Heide T, Werner B, Williams MJ, Huang W, Caravagna G, Graham TA, Sottoriva A. 2019. Spatially constrained tumour growth affects the patterns of clonal selection and neutral drift in cancer genomic data. PLoS Comput. Biol. 15, e1007243. ( 10.1371/journal.pcbi.1007243) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C35] 35.Werner B, et al. 2020. Measuring single cell divisions in human tissues from multi-region sequencing data. Nat. Commun. 11, 1035. ( 10.1038/s41467-020-14844-6) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C36] 36.Noble R, Burri D, Le Sueur C, Lemant J, Viossat Y, Kather JN, Beerenwinkel N. 2022. Spatial structure governs the mode of tumour evolution. Nat. Ecol. Evol. 6, 207-217. ( 10.1038/s41559-021-01615-9) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C37] 37.Schenck RO, Kim E, Bravo RR, West J, Leedham S, Shibata D, Anderson ARA. 2022. How homeostasis limits keratinocyte evolution. Proc. Natl Acad. Sci. USA 119, e2006487119. ( 10.1073/pnas.2006487119) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C38] 38.Lenos KJ, et al. 2018. Stem cell functionality is microenvironmentally defined during tumour expansion and therapy response in colon cancer. Nat. Cell Biol. 20, 1193-1202. ( 10.1038/s41556-018-0179-z) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C39] 39.Fu X, et al. 2022. Spatial patterns of tumour growth impact clonal diversification in a computational model and the TRACERx Renal study. Nat. Ecol. Evol. 6, 88-102. ( 10.1038/s41559-021-01586-x) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C40] 40.Gallaher JA, Enriquez-Navas PM, Luddy KA, Gatenby RA, Anderson AR. 2018. Spatial heterogeneity and evolutionary dynamics modulate time to recurrence in continuous and adaptive cancer therapies. Cancer Res. 78, 2127-2139. ( 10.1158/0008-5472.CAN-17-2649) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C41] 41.Bissell MJ, Radisky D. 2001. Putting tumours in context. Nat. Rev. Cancer 1, 46-54. ( 10.1038/35094059) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C42] 42.Anderson AR, Weaver AM, Cummings PT, Quaranta V. 2006. Tumor morphology and phenotypic evolution driven by selective pressure from the microenvironment. Cell 127, 905-915. ( 10.1016/j.cell.2006.09.042) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C43] 43.Gerlee P, Anderson AR. 2007. An evolutionary hybrid cellular automaton model of solid tumour growth. J. Theor. Biol. 246, 583-603. ( 10.1016/j.jtbi.2007.01.027) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C44] 44.Basanta D, Strand DW, Lukner RB, Franco OE, Cliffel DE, Ayala GE, Hayward SW, Anderson AR. 2009. The role of transforming growth factor-beta-mediated tumor-stroma interactions in prostate cancer progression: an integrative approach. Cancer Res. 69, 7111-7120. ( 10.1158/0008-5472.CAN-08-3957) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C45] 45.Scott JG, Hjelmeland AB, Chinnaiyan P, Anderson ARA, Basanta D. 2014. Microenvironmental variables must influence intrinsic phenotypic parameters of cancer stem cells to affect tumourigenicity. PLoS Comput. Biol. 10, 1-7. ( 10.1371/journal.pcbi.1003433) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C46] 46.Araujo A, Cook LM, Lynch CC, Basanta D. 2014. An integrated computational model of the bone microenvironment in bone-metastatic prostate cancer. Cancer Res. 74, 2391-2401. ( 10.1158/0008-5472.CAN-13-2652) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C47] 47.Gallaher JA, et al. 2020. From cells to tissue: how cell scale heterogeneity impacts glioblastoma growth and treatment response. PLoS Comput. Biol. 16, 1-27. ( 10.1371/journal.pcbi.1007672) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C48] 48.West J, Schenck RO, Gatenbee C, Robertson-Tessi M, Anderson AR. 2021. Normal tissue architecture determines the evolutionary course of cancer. Nat. Commun. 12, 1-9. ( 10.1038/s41467-021-22123-1) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C49] 49.Sottoriva A, Barnes CP, Graham TA. 2017. Catch my drift? Making sense of genomic intra-tumour heterogeneity. Bioch. et Biophys. Acta (BBA)-Rev. Cancer 1867, 95-100. ( 10.1016/j.bbcan.2016.12.003) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C50] 50.Opasic L, Zhou D, Werner B, Dingli D, Traulsen A. 2019. How many samples are needed to infer truly clonal mutations from heterogenous tumours? BMC Cancer 19, 1-11. ( 10.1186/s12885-019-5597-1) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C51] 51.Ling S, et al. 2015. Extremely high genetic diversity in a single tumor points to prevalence of non-Darwinian cell evolution. Proc. Natl Acad. Sci. USA 112, E6496-E6505. ( 10.1073/pnas.1519556112) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C52] 52.Masugi Y, et al. 2019. Characterization of spatial distribution of tumor-infiltrating CD8⁺ T cells refines their prognostic utility for pancreatic cancer survival. Mod. Pathol. 32, 1495-1507. ( 10.1038/s41379-019-0291-z) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C53] 53.Zhao Y, et al. 2021. Selection of metastasis competent subclones in the tumour interior. Nat. Ecol. Evol. 5, 1033-1045. ( 10.1038/s41559-021-01456-6) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C54] 54.Alioto TS, et al. 2015. A comprehensive assessment of somatic mutation detection in cancer using whole-genome sequencing. Nat. Commun. 6, 1-13. ( 10.1038/ncomms10001) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C55] 55.Silva GO, Siegel MB, Mose LE, Parker JS, Sun W, Perou CM, Chen M. 2017. SynthEx: a synthetic-normal-based DNA sequencing tool for copy number alteration detection and tumor heterogeneity profiling. Genome Biol. 18, 1-14. ( 10.1186/s13059-017-1193-3) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C56] 56.Araf S, et al. 2018. Genomic profiling reveals spatial intra-tumor heterogeneity in follicular lymphoma. Leukemia 32, 1261-1265. ( 10.1038/s41375-018-0043-y) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C57] 57.Gendoo DM, et al. 2019. Whole genomes define concordance of matched primary, xenograft, and organoid models of pancreas cancer. PLoS Comput. Biol. 15, e1006596. ( 10.1371/journal.pcbi.1006596) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C58] 58.Yan T, et al. 2019. Multi-region sequencing unveils novel actionable targets and spatial heterogeneity in esophageal squamous cell carcinoma. Nat. Commun. 10, 1-15. ( 10.1038/s41467-019-09255-1) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C59] 59.Pereira B, et al. 2021. Cell-free DNA captures tumor heterogeneity and driver alterations in rapid autopsies with pre-treated metastatic cancer. Nat. Commun. 12, 1-13. ( 10.1038/s41467-021-23394-4) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C60] 60.Cross W, et al. 2018. The evolutionary landscape of colorectal tumorigenesis. Nat. Ecol. Evol. 2, 1661-1672. ( 10.1038/s41559-018-0642-z) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C61] 61.Metzcar J, Wang Y, Heiland R, Macklin P. 2019. A review of cell-based computational modeling in cancer biology. JCO Clin. Cancer Inform. 2, 1-13. ( 10.1200/CCI.18.00069) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C62] 62.West J, Robertson-Tessi M, Anderson ARA. 2022. Agent-based methods facilitate integrative science in cancer. Trends Cell Biol. 33, 300-311. ( 10.1016/j.tcb.2022.10.006) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C63] 63.Haughey MJ, Bassolas A, Sousa S, Baker AM, Graham TA, Nicosia V, Huang W. 2023. First passage time analysis of spatial mutation patterns reveals sub-clonal evolutionary dynamics in colorectal cancer. PLoS Comput. Biol. 19, 1-34. ( 10.1371/journal.pcbi.1010952) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C64] 64.Clarke R. 1946. An application of the Poisson distribution. J. Inst. Actuar. 72, 481-481. ( 10.1017/S0020268100035435) [DOI] [Google Scholar]

[RSIF20230542C65] 65.Frigola J, Sabarinathan R, Mularoni L, Muiños F, Gonzalez-Perez A, López-Bigas N. 2017. Reduced mutation rate in exons due to differential mismatch repair. Nat. Genet. 49, 1684-1692. ( 10.1038/ng.3991) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C66] 66.Carlson J, et al. 2018. Extremely rare variants reveal patterns of germline mutation rate heterogeneity in humans. Nat. Commun. 9, 1-13. ( 10.1038/s41467-018-05936-5) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C67] 67.Peacock JA. 1983. Two-dimensional goodness-of-fit testing in astronomy. Mon. Not. R. Astron. Soc. 202, 615-627. ( 10.1093/mnras/202.3.615) [DOI] [Google Scholar]

[RSIF20230542C68] 68.McGranahan N, Swanton C. 2017. Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168, 613-628. ( 10.1016/j.cell.2017.01.018) [DOI] [PubMed] [Google Scholar]

[RSIF20230542C69] 69.Gates ED, et al. 2019. Spatial distance correlates with genetic distance in diffuse glioma. Front. Oncol. 9, 676. ( 10.3389/fonc.2019.00676) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C70] 70.Li H, et al. 2023. Data from: Mutation divergence over space in tumour expansion. GitHub repository. (https://github.com/SYSU-BioEvoLab/Spatial_Heterogeneity) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20230542C71] 71.Li H, et al. 2023. Mutation divergence over space in tumour expansion. Figshare. ( 10.6084/m9.figshare.c.6926943) [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Mutation divergence over space in tumour expansion

Haiyang Li

Zixuan Yang

Fengyu Tu

Lijuan Deng

Yuqing Han

Xing Fu

Long Wang

Di Gu

Benjamin Werner

Weini Huang

Roles

Abstract

1. Introduction

2. Methods

2.1. Stochastic simulations of tumour growth in space and mutation accumulation

Figure 1.

2.2. Spatial sampling

2.3. Measurements of intra-tumour heterogeneity between samples

2.4. Mutation rate inference and Kolmogorov–Smirnov test

3. Results

3.1. Spatial mixing increases and variance of mutation frequencies decreases with push rates

Figure 2.

3.2. The growth modes have a strong impact on the variant allele frequencies distribution and mutation rate inference

Figure 3.

3.3. Intra-tumour heterogeneity increases with the sampling distance

Figure 4.

3.4. Patterns of mutant divergence remain similarly between two- and three-dimensional models

Figure 5.

3.5. Pairwise intra-tumour heterogeneity increases with sampling distances in patients

Figure 6.

4. Discussion

Contributor Information

Ethics

Data accessibility

Declaration of AI use

Authors' contributions

Conflict of interest declaration

Funding

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases