Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2018 Jun 7;14(6):e1006233. doi: 10.1371/journal.pcbi.1006233

Variation of mutational burden in healthy human tissues suggests non-random strand segregation and allows measuring somatic mutation rates

Benjamin Werner 1,*, Andrea Sottoriva 1,*
Editor: Natalia L Komarova2
PMCID: PMC6007938  PMID: 29879111

Abstract

The immortal strand hypothesis poses that stem cells could produce differentiated progeny while conserving the original template strand, thus avoiding accumulating somatic mutations. However, quantitating the extent of non-random DNA strand segregation in human stem cells remains difficult in vivo. Here we show that the change of the mean and variance of the mutational burden with age in healthy human tissues allows estimating strand segregation probabilities and somatic mutation rates. We analysed deep sequencing data from healthy human colon, small intestine, liver, skin and brain. We found highly effective non-random DNA strand segregation in all adult tissues (mean strand segregation probability: 0.98, standard error bounds (0.97,0.99)). In contrast, non-random strand segregation efficiency is reduced to 0.87 (0.78,0.88) in neural tissue during early development, suggesting stem cell pool expansions due to symmetric self-renewal. Healthy somatic mutation rates differed across tissue types, ranging from 3.5 × 10−9/bp/division in small intestine to 1.6 × 10−7/bp/division in skin.

Author summary

Cairn proposed in 1975 that upon proliferation, cells might not segregate DNA strands randomly into daughter cells, but preferentially keep the ancestral (blue print) template strand in stem cells. This mechanism would allow to drastically reduce the rate of mutation accumulation in human tissues. Testing the hypothesis in human stem cells within their natural tissue environment remains challenging. Here we show that the patterns of mutation accumulation in human tissues with age support highly effective non-random DNA strand segregation after adolescence. In contrast, during early development in infants, DNA strand segregation is less effective, likely because stem cell populations are continuing to grow.

Introduction

The immortal DNA strand hypothesis, originally proposed by Cairns in 1975, poses that adult mammalian stem cells do not segregate DNA strands randomly after proliferation [1]. Instead, stem cells might preferentially retain the parental ancestral strand, whereas the duplicated strand is passed onto differentiated cells with limited life span (Fig 1). In principle, such hierarchical tissues could produce differentiated progeny indefinitely without accumulating any proliferation-induced mutations in the stem cell compartment [2,3]. Experimental evidence supporting this hypothesis comes from BrdU stain tracing experiments both in vitro and in vivo [47]. Evidence from spindle orientation bias in mouse models of normal and precancerous intestinal tissue corroborated these findings, suggesting that strand segregation is then lost during tumourigenesis [8]. However, many of the experiments suffer from uncertainties in stem cell identity and a definite mechanism of strand recognition remains unknown [9]. Hence why Cairns hypothesis remains controversial [10].

Fig 1. The Immortal DNA strand hypothesis.

Fig 1

a) During replication of the ancestral DNA strand, errors (dashed line) might occur. If these errors are not corrected by intrinsic DNA repair mechanisms, they become permanently fixed in daughter cells after the next cell division. However, the original ancestral strand is still present and can provide the blue print for additional non-mutated copies of DNA. b) In principle, a stem cell driven tissue allows for non-random DNA strand segregation. Preferentially segregating ancestral DNA strands into stem cells and duplicated strands into differentiated cells with limited life span can drastically reduce the accumulation of somatic mutations in tissues.

Orthogonal studies based on the expected accumulation of somatic mutations in healthy human tissues have argued against the immortal strand hypothesis [11,12]. However, the mere accumulation of somatic mutations in healthy tissue neither supports nor negates the immortal strand hypothesis in vivo. Here, we show that measuring the change of the mutational burden and, most crucially, the change of the variance of the mutational burden with age allows determining the probability of DNA strand segregation and the per cell mutation rate in healthy human tissues. First, we outline the approach and then apply it to genomic data from healthy human colon, small intestine, liver, skin and brain tissue. The data comes from four recent independent studies on mutational burden in healthy tissues [1316], which contain information on in total 39 individuals at different ages and analysed genomes of 341 single cells. We find evidence for non-random strand segregation in all adult tissues and significant differences in somatic mutation rates between tissues, but less prominent strand-segregation in brain tissue during early development.

Results

The expected change of mean and variance of mutational burden with age

We describe the accumulation of mutations with time in hierarchically organised human tissues by a stochastic mathematical and computational model, Fig 1. A detailed description and derivation of all equations is provided below (Materials and Methods). Briefly, our model considers a constant number of N stem cells that contribute to tissue homeostasis. Stem cells divide with a certain constant rate λ, e.g. once every week or month. During each division, the parental DNA strand is copied and χ novel mutations might occur on the daughter strand. Here χ is a random number that follows a Poisson distribution with mutation rate μ per bp/division and genome size L. Cell fate is also probabilistic in our model. Cells with the parental strand will keep a stem-cell fate with probability p, e.g. for p = 1 they will always remain stem cell, or differentiate otherwise, e.g. for p = 1/2 cell fate decisions are purely random (coin flip). We can understand the probability p as the probability of non-random strand segregation, e.g. p ≈ 1 suggest highly non-random strand segregation, whereas p = 1/2 corresponds to random strand segregation.

With this model, we can describe the accumulation of mutations over time explicitly (see Materials and Methods for more details). Assuming the mutation rate μ as well as the cell proliferation rate λ to be constant, we find that both the mutational burden μ˜ as well as the variance of the mutational burden σ2 are predicted to increase linearly with time t:

1λΔμ˜Δt=(1p)μL (1)
1λΔσ2Δt=(1p)μL+(μL)2p(1p), (2)

see Materials and Methods for a detailed derivation and Fig 2 for a verification by individual based computer simulations. However, the rates by which the mutational burden and the variance of the mutational burden increase over time depend differently on the mutation rate μ and the non-random strand segregation probability p. This allows us to independently measure the mutation rate μ and the non-random strand segregation probability p via:

p=Δσ2Δμ˜1Δσ2Δμ˜1+1λΔμ˜Δt (3)
μL=Δσ2Δμ˜1+1λΔμ˜Δt. (4)

Fig 2. Predicted mutational burden in individual stem cells with age.

Fig 2

a) We show simulated stochastic mutation accumulation in a stem cell population of constant size. Here N = 20,000 stem cells segregating DNA strands with probability p = 0.7 and a mutation rate of μ = 6 per cell division (corresponding to a mutation rate of μ = 10−9 per bp per cell division). b) Mutational burden and c) variance of the mutational burden increase linear. Linear regression (dashed lines) gives Δμ˜Δt=1.799 and Δσ2Δt=9.41. The expected exact values based on above parameters and Eqs (1) and (2) are Δμ˜Δt=1.8 and Δσ2Δt=9.36. Eqs (3) and (4) yield for the strand segregation probability p = 0.702 and for the mutation rate μ = 6.03, (exact values imposed on the simulation were p = 0.7 and μ = 6).

Importantly, measuring the change in mutational burden Δμ˜Δt and variance Δσ2Δt over time in combination with Eqs (3) and (4) determines the mutation rate μ (per cell divison) and the non-random strand segregation probability p for healthy tissues.

Measured mean and variance of mutational burden with age from sequencing data

In a recent publication Blokzijl and colleagues [13] measured mutation accumulation in healthy colon, small intestine and liver tissue by whole genome sequencing multiple single stem cell derived organoids of healthy donors of different ages. In addition, Martincorena and colleagues [14] measured mutational burden in multiple skin samples of four individuals with ages between 58 and 73 years. Furthermore, two recent publications [15,16] performed large-scale single cell whole genome sequencing of neurons at different ages. In the experiments by Blokzijl and colleagues [1,13], they isolated single cells and expanded those into organoids. These cells can therefore be thought of as tissue specific stem cells. In contrast, the other experiments [2,3,1416] do not directly measure mutational burden in stem but more differentiated progenitor cells. However, compared to the total number of cell divisions in the tissue, the number of divisions separating stem and progenitor cells are neglectable.

These datasets enable measurements for the change in mutational burden Δμ˜Δt and the variance Δσ2Δt of the mutational burden with age in those healthy human tissues, see Figs 3 & 4. Eqs (3) and (4) have a single undetermined parameter, the stem cell proliferation rate λ. Strictly speaking, they therefore only provide possible ranges for the mutation rate and the strand segregation probability. However, the possible ranges are narrow for any biologically meaningful stem cell proliferation rate, see Fig 5.

Fig 3. Mutational burden and variance in healthy human tissues.

Fig 3

Mutational burden and variance of the mutational burden in colon, small intestine liver and skin tissue in healthy adult humans of different ages, data taken from [13,14]. Open circle represent mutational burden of single cells, whereas dark grey dots represent the mean mutational burden or variance respectively. In all cases, the data well supports our expectation of a linearly increasing mean and variance with age. Linear regressions (dashed lines) give estimates for the change of the mutational burden and the variance with age, see main text (uncertainties represent standard errors). Eqs (3) and (4) then allow to estimate the non-random strand segregation probability as well as the per-cell mutation rate per cell division. In all cases, the probability of non-random strand segregation is high (median: p = 0.979 (0.97,0.99)), whereas the mutation rate per cell division varies between tissues and is highest in skin, see insets and main text for tissue specific estimates.

Fig 4. Mutational burden and variance in human neurons during early development and adulthood.

Fig 4

Mutational burden and variance measured from single whole genome sequencing of neurons in the prefrontal cortex and the hippocampus dental gyrus [15] as well as in single neurons during early childhood development after birth [16] (uncertainties represent standard errors). Mutation accumulation in early childhood is highly increased compared to adulthood. However, the per-cell mutation rate per division appears higher in adulthood. The non-random strand segregation in contrast is with p = 0.999 (0.998;0.9993) extremely high in adults, whereas with p = 0.876 (0.78;0.88) it is lower in early childhood. This can be understood as a consequence of cell population expansions due to symmetric self-renewals in early childhood. For details of mutation rate estimates, see the main text.

Fig 5. Dependence of parameter inferences on stem cell proliferation rate.

Fig 5

Inferences of a) the DNA strand segregation probability and b) mutation rate per cell division are robust against wide ranges of the stem cell proliferation rate λ. If stem cells divide once per week this implies (Eq (3)) for the probability of DNA strand segregation in colon: p = 0.973 (0.892; 0.996), small intestine: p = 0.969 (0.877; 0.995), liver: p = 0.988 (0.952; 0.998), prefrontal cortex: p = 0.999 (0.997; 0.9999), hippocampal dentale gyrus: p = 0.999 (0.997; 0.9999), skin: p = 0.985 (0.94; 0.998). Numbers in brackets correspond to the range of the DNA strand segregation probabilities for stem cell replication rates between once per month and every day respectively. In contrast for neurons during early development we find: p = 0.876 (0.67; 0.96) if cells divide every 48h (number in brackets correspond to cell divisions once per week and twice a day respectively). Based on Eq (4) we find for the in vivo mutation rate per base pair per cell division in colon: μ = 4.37 (4.27; 4.77) × 10−9, small intestine: μ = 3.54 (3.45; 3.91) × 10−9, liver: μ = 8.48 (8.39; 8.8) × 10−9, prefrontal cortex: μ = 7.68 (7.67; 7.7) × 10−8, hippocampal dentale gyrus: μ = 1.14 (1.14; 1.15) × 10−7, neurons during early development: μ = 1.23 (1.02; 1.47) × 10−8 and skin: μ = 1.57 (1.56; 1.65) × 10−7.

Estimations of non-random strand segregation probability in healthy human tissues

For all tissues, the experimental observations confirm our expectation of a linearly increasing mean and variance of the mutational burden. Using linear regression on the data in [47,1316], we find for colon that the change in mutational burden over time was: Δμ˜Δt=37.2±3.1, for small intestine: Δμ˜Δt=34.6±6.9, for liver: Δμ˜Δt=30.5±2.1, for prefrontal cortex: Δμ˜Δt=16.2±1.1 and for hippocampal dentate gyrus: Δμ˜Δt=21.8±7.9 mutations per whole genome per year. We found for skin: Δμ˜Δt=1.66±0.15 mutations per 0.69 Mb per year. We found for neurons during early development: Δμ˜Δt=4.2±1.3 mutations per whole genome per day. Uncertainties here are standard errors. Similarly, for the change of variance we found for colon: Δσ2Δt=985.5±103, for small intestine: Δσ2Δt=747.3±304, for liver: Δσ2Δt=1564±56, for prefrontal cortex: Δσ2Δt=7500±965, for hippocampal dentate gyrus: Δσ2Δt=15016±6234 mutations per whole genome per year, for skin: Δσ2Δt=5.23±0.37 mutations per 0.69 Mb per year and for neurons during early development: Δσ2Δt=252.2±191 mutations per whole genome per day (Figs 3 & 4).

If stem cells divide once per week this implies (Eq (3)) for the probability of DNA strand segregation in colon: p = 0.973 (0.971; 0.974), small intestine: p = 0.969 (0.966; 0.97), liver: p = 0.988 (0.987; 0.989), prefrontal cortex: p = 0.999 (0.998; 0.9993), hippocampal dentale gyrus: p = 0.999 (0.998; 0.9998), skin: p = 0.985 (0.983; 0.987). In contrast for neurons during early development we find: p = 0.876 (0.78; 0.88) if cells divide every 48h. Numbers in brackets correspond to the range of the DNA strand segregation probabilities given the upper and lower bound of the error estimates of the linear regressions. Dependencies of the estimates on the proliferation rate can be found in the caption of Fig 5.

This suggests highly effective non-random DNA strand segregation in human adult stem cells and is in line with previous observations of predominantly asymmetric stem cell divisions [68,17,18]. It would require extreme stem cell proliferation rates of approximately one division per stem cell per year for the data to be consistent with solely random strand segregation (p = 0.5), Fig 5. This is an unlikely scenario as all tissues analysed here are thought to have high stem cell proliferation rates [9,19,20]. Interestingly, during development non-random DNA strand segregation is less prominent. One explanation is an expanding stem cell population due to symmetric stem cell self-renewals during early development[1,10], which also would explain the increased accumulation of mutations early, as well as typical increased telomere shortening early in life [11,12,21].

Measurements of somatic mutation rates in healthy human tissues

Based on Eq (4) we find for the in vivo mutation rate per base pair per cell division in colon: μ = 4.37 (4.26; 4.46) × 10−9, small intestine: μ = 3.54 (2.61; 4.17) × 10−9, liver: μ = 8.48 (8.22; 8.77) × 10−9, prefrontal cortex: μ = 7.68 (7.18; 8.12) × 10−8, hippocampal dentale gyrus: μ = 1.14 (1.04; 1.68) × 10−7, neurons during early development: μ = 1.23 (0.43; 1.52) × 10−8 and skin: μ = 1.57 (1.54; 1.63) × 10−7. The ranges of these values agree with a recent estimate of the somatic mutation rate in human fibroblasts [1316,22] and are one to two orders of magnitude larger than germline mutation rates [13,23,24]. However, our method does not require precise estimates of the total number of cell divisions since conception (Fig 5). We find surprising differences in the somatic mutation rates across tissue types that cannot be explained by for example different stem cell proliferation rates alone. The mutation rate estimate in skin is particularly high. This might be due to the nature of the samples used by Martincorena and colleagues [14], as the mutational burden was measured in eye lids of individuals that were exposed to high levels of UV radiation for decades. It is plausible that this contributed to the very high mutation rate estimate. It remains to be seen, if these differences across tissues prevail for denser sampling in more individuals.

Explaining strand segregation in terms of symmetric stem cell divisions

Our analysis suggests in general highly effective non-random DNA strand segregation in human colon, small intestine, liver, skin and brain. However, approximately 1% to 5% of divisions in adults do not seem to segregate strands properly and stem cells accumulate additional mutations over time. The reason for this improper segregation could be either wrongly segregated strands during an asymmetric stem cell division or the loss of a stem cell by either a symmetric stem cell differentiation or cell death followed by a symmetric stem cell self-renewal. Arguments are made for both symmetric and asymmetric stem cell divisions in human tissues [15,16,2528]. We wondered if our approach provides a mean to distinguish both possibilities. We therefore implemented stochastic simulations of mutation accumulation in either asymmetric dividing stem cell populations with imperfect strand segregation or a stem cell population with a mix of symmetric and asymmetric divisions (S1 Fig). Both scenarios lead to linearly increasing mean and variance of the mutational burden, with small differences in the actual rates. However, as predicted, the ratio of the variance and the mean σ2/μ˜ are in both scenarios independent of time and on average the same (see also Eq (14)). Interestingly, the distribution of σ2/μ˜ differs. Whereas the variance of the distribution of σ2/μ˜ increases with time for symmetric stem cell divisions, it approximately remains constant for asymmetric stem cell divisions. However, measuring this effect reliably would require measuring the mean and variance of the mutational burden in many more independent samples of many more healthy humans of different ages than the currently available datasets. Hence, lack of resolution in currently available data precludes us to determine the cause of imperfect strand segregations. However, this effect might provide a future mean to quantitate the amount of symmetric self-renewal in human stem cell populations.

Discussion

Stem cells in fast proliferating healthy adult tissues such as colon have been reported to accumulate approximately 40 new mutations per year [13] (Fig 3). However, if mutation rates are in the order of 10−9 per base pair per cell division, which seems to be the current consensus and agrees with our measurements here, and the human genome consists of 6 × 109 base pairs, this would on average only allow for 6 to 7 divisions per stem cell per year. This is in contradiction to current measures on stem cell turnover rates in for example healthy colonic crypts [19,29]. This discrepancy is resolved by non-random strand segregation, where many stem cell proliferations would not induce novel mutations on the stem cell level and the effective observed mutation accumulation on a population level can remain low despite high stem cell turnover rates. A clear molecular mechanism of strand recognition remains unknown. However, direct and indirect evidence to which our observations may contribute increasingly hint on the importance of strand segregation to maintain genomic integrity within healthy human tissues.

Our joined inference of mutation rate and strand segregation probability also reveals that mutation rates per cell division are likely higher than was assumed in previous studies [11]. We therefore find stronger signals of strand segregation in human sequencing data than was thought previously [11]. Our inference neglects the effects of cell-division independent mutations that may contribute to mutational burden in tissues at a low rate. This can lead to an underestimation of the true strand-segregation probability as well as the per-cell mutation rate in human tissues, see S2 Fig.

A loss of strand segregation in stem cells implies a 50 to 100 times increased effective mutation rate on the cell population level without any other changes to the intrinsic DNA repair machinery. In a non-homeostatic setting, such as a growing tumour, in which the number of self-renewing cells (whether they are all or only a subset of cells) increases, the rate of random strand segregation events is much higher. This effect may contribute to the usually high mutational burden in cancers [3033]. However, we note that our model has been developed for normal tissue and does not account for chromosomal rearrangements in malignancies, which likely impact the estimation of mutation rates. It is an intriguing thought that early organ growth during development constitutes a very similar situation in which strand segregation is less effective within expanding stem cell populations and the increased rate of mutation accumulation early in life emerges as a natural consequence [34].

Materials and methods

We assume that homeostasis in a healthy adult human tissue is maintained by a constant pool of N stem cells. Each of these stem cells undergoes n cell divisions during a time interval Δt. With each division, a stem cell non-randomly segregates DNA strands with a probability p. If p = 1 the ancestral strand will remain in the stem cell and the duplicated strand will be passed onto a daughter cell that becomes a non-stem cell, whereas p = 0.5 implies random strand segregation (i.e. no strand segregation), see Fig 1. We assume the probability p to be the same for all stem cells and don’t account for possible variation by for example specific mutations that would change strand segregation probabilities for individual stem cells. The non-ancestral duplicated strand inherits on average μL novel mutations, where μ is the mutation rate per base pair per cell division and L the length of the copied genome (e.g. L ≈ 6 × 109 base pairs in humans). Throughout the manuscript we assume a constant mutation rate μ. In principal the mutation rate could depend on time explicitly, e.g. μμ(t). However, this would lead to non-linear dependencies, which is not supported by the currently available data, e.g. Figs 3 & 4. Thus assuming a constant mutation rate is retrospectively justified by the actual change of the mutational burden in human tissues.

It follows that for n cell divisions, the probability to segregate parental DNA strands k times is binomially distributed (k successes in n draws given a success probability of p)

P(k,n,p)=(nk)pk(1p)nk. (5)

This implies that on average E[k,n,p] = np cell divisions do not induce additional mutations in stem cells. However, n(1 − p) cell divisions will increase mutational burden within a single stem cell lineage, each division by a random number χ, given by a Poisson distribution:

P(χ)=(μL)χχ!eμL. (6)

The mutational burden χ˜ within a single stem cell lineage consequently increases by

χ˜=i=1nkχi. (7)

Exact expressions for the mutational burden μ˜ and variance σ2 for such distributions are known[35]. The mutational burden μ˜ after n stem cell divisions is given by

μ˜=E[χ˜]=E[nk]E[χ]=n(1p)μL, (8)

and the variance of the mutational burden σ2 is given by

σ2=E[nk]Var[χ]+(E[χ])2Var[nk]=n(1p)μL+n(μL)2p(1p). (9)

These expressions allow quantifying the change of the mutational burden as well as the change of the variance of the mutational burden after a number of Δn divisions per stem cell

Δμ˜Δn=μ2˜μ1˜n2n1=(1p)μL, (10)
Δσ2Δn=σ22σ12n2n1=(1p)μL+(μL)2p(1p). (11)

However, in actual data the number of stem cell divisions is unknown and change would be measured in time t. Assuming a constant rate of stem cell proliferations λ we can write Δn = λΔt. This allows us to rewrite above equations for the change of the mean and the variance of the mutational burden over real time t via

1λΔμ˜Δt=(1p)μL (12)
1λΔσ2Δt=(1p)μL+(μL)2p(1p). (13)

Importantly, both the change of the mutational burden Δμ˜Δt as well as the change of the variance of the mutational burden Δσ2Δt can be measured from human somatic mutation data, see Figs 3 & 4. Furthermore, Eqs (12) and (13) imply that the mutational burden as well as the variance of the mutational burden are expected to increase linearly with age in adult tissues. Even if strand segregation is highly effective, mutations still accumulate linearly with age. However, the rate of mutation accumulation is decreased by a factor of (1 − p). As we neither know the somatic mutation rate μ nor the stem cell proliferation rate λ with certainty, a linear increase in mutational burden with age at most suggests imperfect strand segregation (e.g. 0 ≤ p < 1). Importantly, the linear increase of both the mean and the variance in time is a result of the sum of Poisson distributed random variables and does by itself not imply the presence or absence of non-random strand segregation.

However, the ratio of variance and mean is independent of time t and the stem cell proliferation rate λ

Δσ2Δμ˜=1+μLp, (14)

and therefore provides natural bounds for possible mutation rates per cell division μ and strand segregation probabilities p in human tissues, see S1 Fig. Furthermore, rearranging Eq (12) and substituting μLp=μL1λΔμ˜Δt into Eq (14), the strand segregation probability p and the mutation rate μ disentangle, allowing us independent estimates via

p=Δσ2Δμ˜1Δσ2Δμ˜1+1λΔμ˜Δt (15)
μL=Δσ2Δμ˜1+1λΔμ˜Δt. (16)

The relative change of the mutational burden and the variance variance of the mutational burden allow estimates of the mutation rate μ (per cell division) and the non-random strand segregation probability p. Estimating the mutation rate as well as the strand segregation probability, we need to measure the change of the mutational burden as well as the change of the variance of the mutational burden. This requires multiple measurements of the mutational burden within single cells of a single individual that ideally would be followed over time. This is unpractical and such data currently does not exist. We therefore measure the mutational burden and variance in multiple cells of multiple individuals of different ages. To calculate the variance and the mean of the mutational burden, we require at least 3 samples per individual, see Figs 3 & 4. For completeness we also show expressions for the mutation rate μ and p in dependence of stem cell proliferations n. They are given by

p=Δσ2Δμ˜1Δσ2Δμ˜1+Δμ˜Δn (17)

and

μL=Δσ2Δμ˜1+Δμ˜Δn. (18)

We recognize that our model is based on some assumptions and approximations. For example, telomeres, the protective ends of chromosomes, shorten with each cell division. Upon reaching a critically short telomere length, cells enter senescent. Senescence is not modelled in our model, however we argue that since this is likely to occur at very old ages [21,36], this process is unlikely to influence our results significantly.

Supporting information

S1 Fig. Strand segregation in terms of symmetric stem cell divisions.

a) Mean mutational burden μ, b) mutational variance σ2, and c) the ratio of mutational variance and mutational burden σ2/μ for purely asymmetrically or a mix of symmetrically and asymmetrically dividing stem cells. Here we compare stochastic simulations for N = 5000 purely asymmetrically dividing stem cells with a strand segregation probability of p = 0.9 and stem cells with perfect strand segregation p = 1 but a fraction of 10% of stem cell divisions being symmetric differentiations followed by symmetric self-renewals. Both scenarios lead to a linear increase of mean and variance of mutational burden with minimal rate differences. However, as predicted, the ratio of variance and mean become time independent and are the same on average for both processes. However, the variance of the distribution of the ratio of the variance and mean increases with time for symmetric stem cell divisions but is approximately constant for asymmetric stem cell divisions. This effect might provide a future method to distinguish and quantitate the amount of symmetric self-renewal in human stem cell populations.

(TIFF)

S2 Fig. Influence of cell division independent background mutation rate on inference of non-random strand segregation probability and per-cell mutation rate.

Plots a) to d) show the non-random strand segregation probability p and the per cell division mutation rate μ based on Eqs (15) and (16) inferred from stochastic simulations if we in addition allow for a constant cell-division independent mutation rate that influences both the ancestral and the duplicated DNA strand equally. In the upper panels a) and b) the underlying true parameters per cell division are μ = 6 and p = 0.95, whereas in the lower panels c) and d) we have μ = 6 and p = 0.7. If the background mutation rate is 0, we recover the original parameters. Both the non-random strand segregation probability p as well as the per cell division mutation rate μ are slightly underestimated for an increasing background mutation rate. Importantly, the non-random strand segregation probability is always underestimated and inferences become biologically meaningless (e.g. p < 0.5) for large background mutation rates. The actual data suggests high non-random strand segregation probabilities (see main text) and therefore implies small background mutation rates compared to cell division induced mutations.

(TIFF)

Data Availability

All data is published and referenced accordingly in the manuscript.

Funding Statement

BW is supported by the Geoffrey W. Lewis Post-Doctoral Training fellowship. AS is supported by The Chris Rokos Fellowship in Evolution and Cancer and by Cancer Research UK (A22909). AS is also supported by the Wellcome Trust (202778/B/16/Z). This work was also supported by Wellcome Trust funding to the Centre for Evolution and Cancer (105104/Z/14/Z). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Cairns J. Mutation selection and the natural history of cancer. Nature. 1975. October 28;255:197–200. [DOI] [PubMed] [Google Scholar]
  • 2.Werner B, Dingli D, Lenaerts T, Pacheco JM, Traulsen A. Dynamics of Mutant Cells in Hierarchical Organized Tissues. PLoS Comput Biol. 2011. December 1;7(12):e1002290 doi: 10.1371/journal.pcbi.1002290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Rodriguez-Brenes IA, Wodarz D, Komarova NL. Minimizing the risk of cancer: tissue architecture and cellular replication limits. Journal of The Royal Society Interface. 2013. June 26;10(86):20130410–0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Shinin V, Gayraud-Morel B, Gomès D, Tajbakhsh S. Asymmetric division and cosegregation of template DNA strands in adult muscle satellite cells. Nat Cell Biol. 2006. June 25;8(7):677–82. doi: 10.1038/ncb1425 [DOI] [PubMed] [Google Scholar]
  • 5.Conboy MJ, Karasov AO, Rando TA. High Incidence of Non-Random Template Strand Segregation and Asymmetric Fate Determination In Dividing Stem Cells and their Progeny. Goodell MA, editor. PLoS biology. 2007. April 17;5(5):e102 doi: 10.1371/journal.pbio.0050102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Booth C, Potten CS. Gut instincts: thoughts on intestinal epithelial stem cells. Journal of Clinical Investigation. 2000. June 1;105(11):1493–9. doi: 10.1172/JCI10229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Falconer E, Chavez EA, Henderson A, Poon SSS, McKinney S, Brown L, et al. Identification of sister chromatids by DNA template strand sequences. Nature. Nature Publishing Group; 2010. July 1;463(7277):93–7. doi: 10.1038/nature08644 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Quyn AJ, Appleton PL, Carey FA, Steele RJC, Barker N, Clevers H, et al. Short Article. Cell Stem Cell. Elsevier Ltd; 2010. February 5;6(2):175–81. doi: 10.1016/j.stem.2009.12.007 [DOI] [PubMed] [Google Scholar]
  • 9.Lansdorp PM. Immortal Strands? Give Me a Break. Cell. 2007. June;129(7):1244–7. doi: 10.1016/j.cell.2007.06.017 [DOI] [PubMed] [Google Scholar]
  • 10.Legraverend C, Escobar M, Jay P. “‘The Immortal DNA Strand’”: Difficult to Digest? Cell Stem Cell. Elsevier Inc; 2010. April 2;6(4):298–9. doi: 10.1016/j.stem.2010.03.005 [DOI] [PubMed] [Google Scholar]
  • 11.Tomasetti C, Bozic I. The (not so) immortal strand hypothesis. Stem Cell Research. 2015. March 1;14(2):238–41. doi: 10.1016/j.scr.2015.01.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Kiel MJ, He S, Ashkenazi R, Gentry SN, Teta M, Kushner JA, et al. Haematopoietic stem cells do not asymmetrically segregate chromosomes or retain BrdU. Nature. 2007. August 29;449(7159):238–42. doi: 10.1038/nature06115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Blokzijl F, de Ligt J, Jager M, Sasselli V, Roerink S, Sasaki N, et al. Tissue-specific mutation accumulation in human adult stem cells during life. Nature. Nature Publishing Group; 2016. October 13;538(7624):260–4. doi: 10.1038/nature19768 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Martincorena I, Roshan A, Gerstung M, Ellis P, Van Loo Peter, McLaren S, et al. High burden and pervasive positive selection of somatic mutations in normal human skin. Science. 2015. May 18;348:880–6. doi: 10.1126/science.aaa6806 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Lodato MA, Rodin RE, Bohrson CL, Coulter ME, Barton AR, Kwon M, et al. Aging and neurodegeneration are associated with increased mutations in single human neurons. Science. 2018. January 29;359:555–9. doi: 10.1126/science.aao4426 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Bae T, Tomasini L, Mariani J, Zhou B, Roychowdhury T, Franjic D, et al. Different mutational rates and mechanisms in human cells at pregastrulation and neurogenesis. Science. 2017. December 7;543:550–555. eaan8690. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Yatabe Y, Tavare S, Shibata D. Investigating stem cells in human colon by using methylation patterns. Proceedings of the National Academy of Science. 2001. August 30;:1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Knoblich JA. Asymmetric cell division:recent developments and theirimplications for tumour biology. Nature Reviews Molecular Cell Biology. Nature Publishing Group; 2010. December 1;11(12):849–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Clevers H. The Intestinal Crypt,A Prototype Stem Cell Compartment. Cell. 2013. July 18;154(2):274–84. doi: 10.1016/j.cell.2013.07.004 [DOI] [PubMed] [Google Scholar]
  • 20.Tomasetti C, Vogelstein B. Variation in cancer risk among tissues can be explained by the number of stem cell divisions. Science. 2015. January 1;347(6217):78–81. doi: 10.1126/science.1260825 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Werner B, Beier F, Hummel S, Balabanov S, Lassay L, Orlikowsky T, et al. Reconstructing the in vivo dynamics of hematopoietic stem cells from telomere length distributions. eLife. 2015. October 15; 10.7554, e08687v2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Milholland B, Dong X, Zhang L, Hao X, Suh Y, Vijg J. Differences between germline and somatic mutation rates in humans and mice. Nature Communications. Nature Publishing Group; 2017. April 28;8:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Lynch M. Evolution of the mutation rate. Trends in Genetics. Elsevier Ltd; 2010. August 1;26(8):345–52. doi: 10.1016/j.tig.2010.05.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lynch M, Ackerman MS, Gout J-F, Long H, Sung W, Thomas WK, et al. Genetic drift, selection and the evolution of the mutation rate. Nature Reviews Genetics. Nature Publishing Group; 2016. November 1;17(11):704–14. doi: 10.1038/nrg.2016.104 [DOI] [PubMed] [Google Scholar]
  • 25.Sánchez-Danés A, Hannezo E, Larsimont J-C, Liagre M, Youssef KK, Simons BD, et al. Defining the clonal dynamics leading to mouse skin tumour initiation. Nature. Nature Publishing Group; 2016. August 18;536(7616):298–303. doi: 10.1038/nature19069 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Scheele CLGJ, Hannezo E, Muraro MJ, Zomer A, Langedijk NSM, van Oudenaarden A, et al. Identity and dynamics of mammary stem cells during branching morphogenesis. Nature. Nature Publishing Group; 2017. February 16;542(7641):313–7. doi: 10.1038/nature21046 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ju YS, Martincorena I, Gerstung M, Petljak M, Alexandrov LB, Rahbari R, et al. Somatic mutations reveal asymmetric cellular dynamics in the early human embryo. Nature. Nature Publishing Group; 2017. March 30;543(7647):714–8. doi: 10.1038/nature21703 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Dingli D, Traulsen A, Michor F. (A)Symmetric Stem Cell Replication and Cancer. PLoS Comput Biol. 2007;3(3):e53 doi: 10.1371/journal.pcbi.0030053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Basak O, Beumer J, Wiebrands K, Seno H, van Oudenaarden A, Clevers H. Induced Quiescence of Lgr5+ Stem Cells in Intestinal Organoids Enables Differentiation of Hormone- Producing Enteroendocrine Cells. Stem Cell. Elsevier; 2017. February 2;20(2):177–190.e4. [DOI] [PubMed] [Google Scholar]
  • 30.Greenman C, Stephens P, Smith R, Dalgliesh GL, Hunter C, Bignell G, et al. Patterns of somatic mutation in human cancer genomes. Nature. Nature Publishing Group; 2007. March 8;446(7132):153–8. doi: 10.1038/nature05610 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.McGranahan N, Swanton C. Clonal Heterogeneity and Tumor Evolution: Past, Present, and the Future. Cell. Elsevier Inc; 2017. February 9;168(4):613–28. doi: 10.1016/j.cell.2017.01.018 [DOI] [PubMed] [Google Scholar]
  • 32.Gerlinger M, Rowan AJ, Horswell S, Larkin J, Endesfelder D, Gronroos E, et al. Intratumor Heterogeneity and Branched Evolution Revealed by Multiregion Sequencing. New England Journal of Medicine. 2012. March 8;366(10):883–92. doi: 10.1056/NEJMoa1113205 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Davies H, Staaf J, Ramakrishna M, Glodzik D, Zou X, Martincorena I, et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature. Nature Publishing Group; 2016. May 2;:1–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Frank SA. Somatic evolutionary genomics: Mutations during development cause highly variable genetic mosaicism with risk of cancer and neurodegeneration. Proceedings of the National Academy of Sciences. 2010. January 26;107:1725–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Ross SM. Introduction to Probability Models Academic Press; 2014. [Google Scholar]
  • 36.Rodriguez-Brenes IA, Wodarz D, Komarova NL. Quantifying replicative senescenceas a tumor suppressor pathwayand a target for cancer therapy. Scientific Reports. Nature Publishing Group; 2015. November 27;:1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. Strand segregation in terms of symmetric stem cell divisions.

a) Mean mutational burden μ, b) mutational variance σ2, and c) the ratio of mutational variance and mutational burden σ2/μ for purely asymmetrically or a mix of symmetrically and asymmetrically dividing stem cells. Here we compare stochastic simulations for N = 5000 purely asymmetrically dividing stem cells with a strand segregation probability of p = 0.9 and stem cells with perfect strand segregation p = 1 but a fraction of 10% of stem cell divisions being symmetric differentiations followed by symmetric self-renewals. Both scenarios lead to a linear increase of mean and variance of mutational burden with minimal rate differences. However, as predicted, the ratio of variance and mean become time independent and are the same on average for both processes. However, the variance of the distribution of the ratio of the variance and mean increases with time for symmetric stem cell divisions but is approximately constant for asymmetric stem cell divisions. This effect might provide a future method to distinguish and quantitate the amount of symmetric self-renewal in human stem cell populations.

(TIFF)

S2 Fig. Influence of cell division independent background mutation rate on inference of non-random strand segregation probability and per-cell mutation rate.

Plots a) to d) show the non-random strand segregation probability p and the per cell division mutation rate μ based on Eqs (15) and (16) inferred from stochastic simulations if we in addition allow for a constant cell-division independent mutation rate that influences both the ancestral and the duplicated DNA strand equally. In the upper panels a) and b) the underlying true parameters per cell division are μ = 6 and p = 0.95, whereas in the lower panels c) and d) we have μ = 6 and p = 0.7. If the background mutation rate is 0, we recover the original parameters. Both the non-random strand segregation probability p as well as the per cell division mutation rate μ are slightly underestimated for an increasing background mutation rate. Importantly, the non-random strand segregation probability is always underestimated and inferences become biologically meaningless (e.g. p < 0.5) for large background mutation rates. The actual data suggests high non-random strand segregation probabilities (see main text) and therefore implies small background mutation rates compared to cell division induced mutations.

(TIFF)

Data Availability Statement

All data is published and referenced accordingly in the manuscript.


Articles from PLoS Computational Biology are provided here courtesy of PLOS

RESOURCES