Abstract
Populations of physiologically vital mitochondrial DNA (mtDNA) molecules evolve in cells under control from the nucleus. The evolution of populations of mixed mtDNA types is complicated and poorly understood, and variability of these controlled admixtures plays a central role in the inheritance and onset of genetic disease. Here, we develop a mathematical theory describing the evolution of, and variability in, these stochastic populations for any type of cellular control, showing that cell-to-cell variability in mtDNA and mutant load inevitably increases with time, according to rates that we derive and which are notably independent of the mechanistic details of feedback signaling. We show with a set of experimental case studies that this theory explains disparate quantitative results from classical and modern experimental and computational research on heteroplasmy variance in different species. We demonstrate that our general model provides a host of specific insights, including a modification of the often-used but hard-to-interpret Wright formula to correspond directly to biological observables, the ability to quantify selective and mutational pressure in mtDNA populations, and characterization of the pronounced variability inevitably arising from the action of possible mtDNA quality-control mechanisms. Our general theoretical framework, supported by existing experimental results, thus helps us to understand and predict the evolution of stochastic mtDNA populations in cell biology.
Introduction
Molecules of mitochondrial DNA (mtDNA) form dynamic evolutionary populations within cells, replicating and degrading according to cellular control signals.1, 2 mtDNA can vary due to mutation or artificial manipulation;3 the proportion of mutant mtDNA in a cell is referred to as heteroplasmy. mtDNA encodes vital aspects of the bioenergetic machinery of eukaroytic cells; mtDNA variability can thus have dramatic cellular consequences, including devastating genetic diseases and numerous other conditions,3 making a theoretical understanding of this complex evolutionary system important. Understanding the natural feedback control acting on mtDNA populations is also a vital step in the development of artificial approaches to control mitochondrial behavior with genetic tools.4, 5 Cell-to-cell variances of mtDNA copy number and heteroplasmy are of particular importance, owing to their implications for maternal transmission of dangerous mutations6 and the manifestation of pathologies dependent on the range of heteroplasmies present in a tissue7—even a very small proportion of cells exceeding a heteroplasmy threshold can lead to pathologies.8
Stochastic behavior underlies much of cell biology and contributes to this cell-to-cell variability; cellular processes including gene expression,9, 10, 11 DNA replication,12 and mitochondrial and mtDNA dynamics13, 14, 15, 16 are subject to fundamentally stochastic influences. Variability in mitochondria can be a leading contributor to cell physiological behavior, making mitochondria an important target for explanatory stochastic models.15 Existing studies have included stochastic modeling and numerical treatments of mitochondrial17 and mtDNA populations, making quantitative progress with the assumption of specific control mechanisms.1, 14, 18, 19, 20 Other theoretical studies have drawn on classical statistical genetics, notably including the well-known Wright formula,21, 22 to produce a description of partitioning of mtDNA populations at cell divisions, but the role of stochastic mtDNA dynamics between cell divisions is largely omitted. Although recent experimental studies are starting to shed light on cellular control of mtDNA,14, 16 a general theoretical framework is currently absent. Here we address this open question by constructing a general, bottom-up stochastic description of mtDNA populations subject to arbitrary cellular control mechanisms, providing analytic results for the predicted behavior associated with any mtDNA control mechanism, and adapting the classic Wright formula to account for and interpret stochastic mtDNA dynamics. Notably, our approach and results hold independently of the details of specific regulatory mechanisms underlying mtDNA feedback signaling, providing a general theoretical framework for control of stochastic mtDNA populations across different species and environments.
As we develop the theoretical framework to address mtDNA dynamics below, we will consider a set of applications of the theory, linking with existing experimental data from a variety of studies to validate our approach and obtain quantitative results and predictions on the processes governing mtDNA dynamics. We will focus on two questions arising from the study of mtDNA diseases: (1) how and at what rate does cell-to-cell heteroplasmy variance increase and (2) how selective pressure against a particular mtDNA mutation affects cellular mtDNA populations. The single-cell measurements required to address these questions directly remain challenging: we aim to show that mathematical theory, appropriately validated and refined with available data, provides a powerful alternative route to make quantitative progress understanding this important behavior. Box 1 summarizes the central biological messages arising from development and analysis of our theory.
Box 1. Key Biological Findings from Our Mathematical Model.
These results arise when the assumptions of our modeling approach hold: that cells are heteroplasmic, mtDNA replication and degradation are Poisson processes with the same rates for mutant and wild-type mtDNA, and feedback mechanisms depend only on the current state of the system. We justify these assumptions with reference to experimental data in the text and in Appendix A.
-
I.
When mean copy numbers are constant, at least one variance (wild-type or mutant) increases linearly with time (Equations 8, 9, and 10).
-
II.
When mean copy numbers are constant, heteroplasmy variance increases linearly with time with a rate that depends only on mtDNA copy numbers and the rate of mtDNA turnover, and not on any details of mechanism or signaling (Equation 12); many copy-number control mechanisms are therefore indistinguishable even from dynamical measurements of heteroplasmy (Figures 1 and 2).
-
III.
Control applied through (1) biogenesis rates and (2) degradation rates induce comparable behavior in the cellular mtDNA population (Figure 1; Supplemental Data Section 2).
-
IV.
A modified Wright formula (Equation 14; Figure 4) gives a general method to establish a quantitative link between observed normalized heteroplasmy variance 〈h2〉 ’ and observable (as opposed to effective) quantities (Figure 4).
-
V.
Quality control does not guarantee the clearing or stabilization of mutant load and can induce substantial variance in wild-type mtDNA (potentially challenging cells with diminished populations; Figures 5 and 6).
Material and Methods
We will consider mtDNA populations in cells that are heteroplasmic with two non-recombining haplotypes, though this treatment can readily be extended to more mtDNA types. We write a state with w wild-type mtDNAs and m mutant mtDNAs as {w,m}. We first consider the class of systems where both haplotypes are subject to the same degradation rate ν and the same replication rate λ, both of which may be general functions of both haplotype copy numbers. This model thus represents the situation where no direct selective difference exists between mutant and wild-type. This assumption holds for only some biological cases (see Burgstaller et al.23 and references therein for a review of studies where mtDNA types segregate unevenly) and will be relaxed later. We also assume that cellular control is based only on the current state of the cellular mtDNA population and not its history. The dynamics governing the system then consist of a set of Poisson processes:
(Equation 1) |
(Equation 2) |
(Equation 3) |
(Equation 4) |
This formalism captures a wide range of models for mtDNA dynamics (see below). We will begin with the assumption that the system does not undergo cell divisions and has a stationary state in the population mean of both haplotype copy numbers, and we will write this steady state as . This initial picture is more appropriate for quiescent cell types or mtDNA “set points” than for the pronounced changes in mtDNA copy number that occur during development.2, 14 We will later generalize this picture to allow for arbitrary changes in copy number.
We will first consider general results from this formalism, applicable to a wide variety of possible cellular behaviors. We will then illustrate its application with a range of previously proposed and new feedback mechanisms.
Results
Copy-Number Variance with Stable Population Means
Any control mechanism of the form in Equations 1, 2, 3, and 4 (including manifestations of feedback control) can be represented to linear order by a Taylor expansion of its rates about (the steady state exists by construction from our previous assumption):
(Equation 5) |
(Equation 6) |
It will readily be seen that to support a stable population mean at , . Assuming that w and m can be written as the sum of a deterministic and a fluctuating component, we use Van Kampen’s system size expansion to find a Fokker-Planck equation describing the behavior of w and m governed by Equations 5 and 6.24, 25 From this equation we extract expressions for the time behavior of the mean and variance of w and m (see Supplemental Data Sections 1 and 2).
We show in Supplemental Data Section 4 that attempting to identify a stable state for population variances and covariance yields the condition
(Equation 7) |
hence, any population mean state in which mutant content is nonzero does not admit a stationary solution for variances, unless . If then there is no further change to the system once steady state has been reached (no stochastic turnover occurs) and the system remains frozen thereafter. In other words, for a nonzero mutant population and nonzero mtDNA turnover, the variance of at least one mtDNA population will change with time.
The Fokker-Planck equation we derive can be used to compute the expected behaviors of (wild-type variance), (mutant variance), and (wild-type mutant covariance) for a given control mechanism. The variance and covariance solutions display some transient behavior, involving terms on the timescale . Because, for stability, are nonpositive and are nonnegative, is either a constant or an exponentially decaying function of time t. The expressions thus subsequently converge to linear trends for large t:
(Equation 8) |
(Equation 9) |
(Equation 10) |
The forms of the transient functions and the constants and are given in Supplemental Data Section 2 and are functions only of the difference between replication and degradation rates , steady-state copy numbers and , and mitophagy rate . Furthermore, the structure of these expressions is such that for and nonzero w and m, at most one of the can be zero, , and . Thus, around the mean , either wild-type variance or mutant variance or both increase linearly with time (Box 1, I). As time continues, the increasing variance means that extinction of one mtDNA becomes increasingly likely; implications of this behavior are explored below.
The mathematical structure of the solutions only ever involves the difference between replication and degradation rates , showing that control of (1) biogenesis rates and (2) degradation rates induce comparable behavior in the cellular mtDNA population (Box 1, III).
Heteroplasmy Statistics
As shown in Supplemental Data Section 5, a first-order Taylor expansion gives an approximation for the variance of :
(Equation 11) |
and we can then use previously obtained expressions for to compute this approximate heteroplasmy variance. Neglecting transient terms and using and in Equation 11 gives, after some algebra,
(Equation 12) |
where , with and ; thus, h is (mean) heteroplasmy and n is (mean) total copy number (recall that in steady state). In Equation 12 we have used normalized heteroplasmy variance , accounting for the dependence of on the magnitude of h; is the quantity most often reported in experimental studies.
In other words, when the system size expansion is valid (see below), for any control mechanism, heteroplasmy variance in the copy number steady state increases linearly with time with a rate that depends only on the copy numbers of the system and the timescale of random turnover (Box 1, II) (Equation 12 is independent of the β and δ terms in Equations 5 and 6). As we discuss later, this observation implies that many possible mechanisms could be responsible for the same observed trend in heteroplasmy variance, meaning that measurements of heteroplasmy variance alone, even if repeated at different time points, place only a limited mechanistic constraint on mtDNA dynamics.14 In Supplemental Data Section 7, we discuss experimental strategies that can more efficiently discriminate between different control mechanisms.
Transient Behavior and Cell Divisions and Validity of the Expansion
To obtain analytic insight, we have thus far focused on modeling mtDNA behavior using the system size expansion when a steady-state assumption had already been applied. Transient behavior can also be explored by employing the system size expansion directly on the appropriate master equation, using the full expressions for and (see Supplemental Data Sections 2 and 6). Relaxing the steady state assumption means that the ODEs describing variance behavior are analytically intractable for many forms of . However, they can simply be solved numerically and, as shown in subsequent sections, well match stochastic simulation (which of course is numerically far more intensive). This ODE approach fully accounts for non-equilibrium behavior—including transient relaxation, cell cycling, and so on—while the system size expansion remains appropriate (see below).
This analysis can readily be used to characterize the effect of partitioning mtDNAs at cell divisions. To compute the time behavior of variance where cell divisions occur at arbitrary times, we invoke a linear noise assumption,24, 25 first using the ODEs above to compute the variance behavior within one cell cycle. Partitioning rules for copy-number statistics are then applied, and the resulting post-partition statistics are used as the initial condition for a next phase of ODE solution. We here illustrate this process for binomial partitioning of mtDNAs to connect with recent studies in mice14 and HeLa,15 although with an appropriate choice of partitioning rules, this approach can be used to address any partitioning regime (for example, the sub-binomial case recently reported in fission yeast16). In the case of binomial partitioning, the appropriate partitioning rules are , following straightforwardly from the variance of a binomial distribution with p = 1/2. We will see below that this picture well describes the behavior of stochastic populations in dividing cells: hence, the total variance contributions of turnover between divisions and partitioning at divisions can be modeled as a linear sum, and the behavior of mechanisms across cell cycles is comparable to that within a cell cycle.
The results above hold for a nonzero mutant population. As copy-number variance increases, we expect extinction of one mtDNA type to become increasingly likely. To address this behavior, we must consider when the system size expansion itself, which is reliant on the validity of the linear noise approximation, holds. An important threat to this validity is a non-negligible extinction probability for one mtDNA type, whereupon a normal distribution no longer adequately models the copy-number distribution. Heuristically, this situation arises when, for example, . Another challenge arises due to the fact that, when λ and ν are functions of w and m, our linear theory is an approximation to the nonlinear dynamics that result. Highly nonlinear behavior (for example, pronounced discrete steps in rates occurring at critical copy numbers) will therefore not be perfectly captured, but the ability of our theory to reproduce simulation of the fully nonlinear dynamics (in Figures 1, 3, and 5 and Supplemental Data Section 3) suggests that the linear theory provides valuable insight into a wide range of biologically plausible behaviors. Treatments of fully nonlinear cases represent a substantial technical challenge that will be addressed in future work.
In cases where the validity of the system size expansion is challenged, the mean and variance of mtDNA populations are likely to be underestimated by the preceding analysis (see Supplemental Data Section 3), and the heteroplasmy variance will likely be overestimated, with the true increase in with time gradually becoming sublinear. Fixation is also neglected by the deterministic version of the mean equations of motion, which allow only an asymptotic descent to zero. Thus, the more general statement of our findings are that (1) for the period when extinction of either type is unlikely, variances and covariances change linearly (after transients), (2) as extinction of one type becomes more likely due to this increased variance, the increasing trend continues but departs from those linear forms (in particular, the increase of slows to become sublinear), and (3) when extinction of one type is almost certain, the system tends toward the behavior expected if only one type was present (ultimately stalling variance increase, as if in Equation 7). The results we focus on in the remainder of this report can be viewed as describing the “quasi-stationary state” where extinction is negligible; further quantitative details can be derived using, for example, adaptations of the system size expansion that address extinction.26
Specific Control Mechanisms and Comparison with Simulation
The previous results make no assumptions about the specific form of control applied to the mtDNA population, other than the fact that it depends only on current state and is manifest through the rates of Poissonian replication and degradation that are equal for both mtDNA species (and can be described with the system size expansion, as discussed above). We can exploit the generality of the preceding formalism to obtain results for any given (feedback) control mechanism, defined by a specific form of and in Equations 1, 2, 3, and 4.
We first consider the well-known “relaxed replication” model,1, 19 which involves stochastic mtDNA degradation, coupled with mtDNA replication that is physically modeled as a deterministic process. We propose that, if degradation is regarded as a stochastic process (due to its microscopic reliance on complicated processes and colocalizations in the cell), picturing replication (which also relies on complicated interactions on the microscopic scale) as a stochastic process leads to a consistent stochastic generalization (see Appendix A). The corresponding model has exactly the same expressions for rates as in Capps et al.1 (type A in Figure 1A), but replication rate is now interpreted as the rate of a stochastic, rather than a deterministic, process.
We also introduce several other models for mtDNA control to consider a range of potential functional forms, including differential and ratiometric control based on a target wild-type copy number, an absence of any feedback control, and others (types B–G in Figure 1A). The presence or absence of w and m in these expressions reflects what quantity is being sensed by the cell (wild-type mtDNA alone, mutant mtDNA alone, or a combination of the two). We further note that this general formalism can also incorporate physical constraints on the mtDNA population. For example, the hypothesis that mitochondrial concentration is controlled between cell divisions15 (recently confirmed in fission yeast16) could correspond to wopt, the “target” mtDNA number, being a linear function of cell volume in the models above, or could arise through passive birth-death dynamics (model D) with control implemented at the cell division stage (see previous section). Interpretations of these control mechanisms in terms of cellular sensing and the language of stochastic population processes are given in Supplemental Data Section 3.
Figure 1B illustrates the application of our analysis to these example control mechanisms in the absence of cell divisions. The close agreement between stochastic simulation and analytic results in steady state demonstrates the ability of our general theory to describe a wide range of different potential cellular control mechanisms. The long-term linear increases in one or both mtDNA variances are clear (Box 1, I), and trajectories of with the same steady state and turnover timescale are identical (Box 1, II). Figure 1C, including cell divisions, demonstrates close agreement between ODE solutions and stochastic simulation, further showing that the linear noise treatment successfully captures stochastic behavior over cell divisions. The close similarity of trajectories across divisions is a consequence of their aforementioned identity in steady-state conditions with no cell divisions (Figure 1B); the slow divergence is due to differences in mechanism behavior away from the steady state.
Applications I: Heteroplasmy Variance Increases at Constant Mean Copy Number
The increase of heteroplasmy variance with time is of profound importance in determining the inheritance and onset of mtDNA diseases. Because disease symptoms often manifest only when heteroplasmy exceeds a certain threshold,7 increasing heteroplasmy variance with time can lead to pathologies even if mean heteroplasmy does not change (because a higher cell-to-cell variance implies a greater probability of a given cell exceeding a threshold).14
We sought experimental evidence to support the linear increase of predicted by our theory. Time course measurements of single-cell heteroplasmy values remain limited; we identified results from the Drosophila germline27 and in the mouse germline for the NZB/BALB model28, 29 and the HB model.14 For these data, we compared the ability to fit the data of a null model involving constant (, where α is a constant) and an alternative model , where changes linearly with time as our theory predicts (Equation 12; Box 1, II). Using the Akaike information criterion (AIC) and assuming normally distributed noise on mean (, an assumption consistent with our linear approximation but which can be further refined as in Wonnapinij et al.29), we found that the alternative, time-varying model was favored in all cases, providing support for our theory (Figures 2A–2C).
These results are quantitatively consistent with a previous study on the dynamics of heteroplasmy variance during the mtDNA bottleneck in mice14 where a mechanism involving random mtDNA turnover and random mtDNA partitioning at cell divisions was found to best explain experimental observations. mtDNA in mice and rats often has a half-life of 10–100 days.23 This corresponds to day-1 and days. In Figure 1 we show the patterns of copy number and heteroplasmy means and variances for days and 103 mtDNA molecules per cell under different specific control strategies. The increase of from 0 to 10−3, corresponding for h = 0.1 to an increase in from 0 to 0.011, matches the scale of change observed in the mtDNA bottleneck (though the bottleneck is complicated by changing population size n and compensatory changing turnover ).14 Previous work has shown that the case with no feedback (D in Figure 1) describes well the behavior of mtDNA with cell divisions in mouse development.14 In Appendix A we discuss further connections with previous theoretical studies; experimental cell-to-cell measurements in more quiescent cell types, although currently lacking, will provide valuable further tests of our theory.
mtDNA Turnover in the Wright Formula
Powerful existing analyses of mtDNA population variance22, 29 with widespread influence3 have drawn upon a classical theory by Wright (and Kimura)21, 30 describing stochastic sampling of a population of elements between generations. The resulting expression for expected heteroplasmy variance is the well-known equation sometimes referred to as the Wright formula (though other equations also bear this name):3, 22
(Equation 13) |
where ne is an effective population size and g is a number of generations. The mapping of this effective theory to the complicated mtDNA system is valuable to develop intuition but cannot capture the detailed dynamics of individual mtDNA molecules, due to assumptions (see Appendix B) that mean the effective parameters of the theory (ne and g) cannot generally be interpreted as biological observables,22, 31 preventing quantitative analyses of mechanisms and dynamics.3 In particular, ne does not generally correspond to a minimum mtDNA copy number (see Appendix B), and, being a genetic rather than a physical parameter, “is unlikely ever to correspond closely to the number of anything.”31
The Wright formula, however, does accurately describe the heteroplasmy variance due to binomial sampling of 2ne real elements at cell divisions for an observable population size, and, as seen in the previous section, the additional effect of mtDNA turnover based on observable values can be included as an extra linear contribution. In general, this term will depend on the dynamics controlling the mtDNA population and can easily be calculated via the ODE approach above (Figure 1).
In the case where no systematic change in mtDNA population size occurs with time, we can use the observation that trajectories are often comparable across a variety of different possible cellular control mechanisms (and identical in the steady state; Figure 1 and Equation 12) to produce a simple approximate description linking to observables. The simple steady-state behavior is given by Equation 12. To construct an approximation, we use a simple estimate of mean population size over a cell cycle, writing , where n is the mtDNA population size immediately after division and thus gives a population size “average” over the changes within a cell cycle. Using the above analysis with , (representing the “average” populations of wild-type and mutant mtDNA), and (so that τ is the timescale of mtDNA degradation), the corresponding expression in terms of h and n is then given by a “turnover-adjusted” Wright formula (see Appendix B):
(Equation 14) |
where g is the number of cell divisions that have occurred and t is the amount of time that has expired since an initial state with . This expression is subject to the conditions for system size expansion validity described above; thus, as fixation probability increases, the increase of will drop below this prediction.
Figure 3 illustrates the agreement between Equation 14 and stochastic simulation for the range of control mechanisms we consider under different population sizes and heteroplasmies. It is worth reiterating that more exact solutions for a given control mechanism can easily be computed using the preceding ODE approach, and stochastic analysis can also be used to quantitatively describe the effects of more specific circumstances (for example, the systematically varying population size through the mtDNA bottleneck14). In the case of no such systematic variation, and, crucially, if the Poissonian model of Equations 1, 2, 3, and 4 holds, then Equation 14, a modified Wright formula, represents a simpler, approximate way to establish a quantitative link between observed normalized heteroplasmy variance and observable quantities (Box 1, IV), i.e., n (mtDNA copy number immediately after division), g (number of cell divisions), and τ (timescale of mtDNA turnover).
Applications II: Linking Physical and Genetic Rates with the Modified Wright Formula
The Wright formula is traditionally used to compare a heuristic, effective “bottleneck size” across experimental systems (for example, in studies of different organisms22 and of human disease32). In its uncorrected form, this bottleneck size can only be semiquantitatively treated—bottleneck sizes can be ranked, but absolute values and differences cannot be straightforwardly interpreted. Our adaptation allows us to use this formula to connect the rates of physical subcellular processes with the resulting rates of genetic change.
To illustrate this connection, we focus on a particular period during mouse development. Between 8.5 and 13.5 days post conception (dpc) in the developing mouse germline, cell divisions occur with a period of about 16 hr,33 giving g = 7 or 8 cell divisions in this period (of length t = 5 days). Copy-number measurements during this period show that the mean total number of mtDNA molecules per cell remains of the order of n = 2,000 (Figure 4A).28, 34, 35 During this period, heteroplasmy variance increases on average (but with substantial variability) from around 0.01 to 0.02 (Figure 4B).28, 36 Figure 4B shows a best-fit line to data, with slope 1.52 × 10−3 day−1 (5%–95% confidence intervals (1.11–1.92) × 10−3 day−1).
We can use these measurements in conjunction with the turnover-adjusted Wright formula (Equation 14) to obtain estimates for the rate of mtDNA turnover during this period. Using Equation 14 with the best-fit , and g = 7 divisions, n = 2,000 mtDNA molecules, t = 5 days gives the resulting estimate days (5%–95% confidence intervals 0.43–0.88 days, using the same values for n, g, t) for the characteristic timescale of mtDNA degradation. This increase in mtDNA turnover (relative to the day timescale in differentiated tissues23) in the germline during this developmental period matches quantitative results from a more detailed study of the bottleneck reporting τ within the range 0.38–2.1 days (based on posteriors for between 0.02 and 0.11 hr−1)14 and illustrates how a suitable mathematical model can be used to estimate biological quantities that are challenging to directly address with experiment.37
Influence of Mutations, Replication Errors, and Selective Differences: Quality Control of Replication Errors
Our approach is easily generalized to include other processes than those described by Equations 1, 2, 3, and 4: in Supplemental Data Section 6 we demonstrate that adding and changing appropriate processes allows us to analyze the effects on mtDNA mean and variance due to de novo mutations, replication errors, and multiple selective pressures. Our approach can be thus used to characterize variability arising from selection and mutation under any control mechanisms, without requiring stochastic simulation.
We can use this ability to explore a particular scientific question: if mtDNA replication errors occur and the cell attempts to clear the resulting mutant mtDNA through selective quality control, how do cellular mtDNA populations change? To investigate this question, we introduce the process (replication errors—leading to the production of a new mutant mtDNA—occurring with rate μ) and reparameterize Equation 4 as (an increase of ε in mutant degradation rate compared to wild-type degradation). We thus model the situation where replication errors arise and the cell attempts to clear them through quality control, while a control strategy for mtDNA populations is also in place.
Figure 5 illustrates the mean and variance of w and m in two different cases, distinguished by the relative magnitude of the selective difference (ε) and error rate (μ). This ratio is crucial in determining whether mutant mtDNA is cleared or increased: Figure 5A shows that mutant is cleared when (selection is sufficiently strong to overcome errors), but when selective difference ε is insufficiently high, mutant mtDNA mean and variance (and heteroplasmy) increase with time. In both cases, we also observe substantial differences in mtDNA behavior depending on the control model in place. Control models lacking an explicit target copy number (D, no feedback; F and G, immigration-like) experience substantial increases in wild-type variance while mutant is being removed. Models involving a target wild-type copy number and weak or no coupling to mutant mtDNA (B, C, E) admit an order-of-magnitude lower increase in wild-type variance as mutant is cleared. Relaxed replication (model A), which combines a target copy number with a strong coupling between mutant and wild-type mtDNA, displays an intermediate increase on wild-type variance as mutant is cleared.
Theoretical approaches that consider only the mean behavior of mtDNA populations (Figure 5A) cannot account for this cellular heterogeneity and the important fact that quality control acting to remove mutant mtDNA can also induce variability in wild-type mtDNA (Figure 5B). The action of quality control may therefore yield a subset of cells with wild-type mtDNA substantially lower than the mean value across cells—potentially placing a physiological challenge on those cells where wild-type mtDNA is decreased. In addition to the important point that the simple presence of quality control does not guarantee the clearing or stabilization of mutant load, we thus find that quality control may have substantial effects on wild-type as well as mutant mtDNA if cellular control couples the two species (Box 1, V).
Applications III: Variance Induced through Mutant Clearing
The A>G mutation at position 3243 in human mtDNA is the most common heteroplasmic pathological mtDNA mutation, giving rise to MELAS (mitochondrial encephalomyopathy, lactic acidosis, and stroke-like episodes), a multi-system disease. The dynamics of c.3243A>G heteroplasmy are complex and tissue dependent; its behavior in blood has been characterized in particular detail using a fluorescent PCR assay for heteroplasmy38 in a way that allows us to explore our theoretical predictions about mtDNA statistics with mutant clearing.
Pyle et al.38 took two blood samples, several years apart, from human patients, and quantified c.3243A>G heteroplasmy (h) and mean mtDNA copy number per cell for both samples. Although single-cell data are not presented in the publication, progress can be made with the averaged quantities. A strong decrease in (h) with time is observed for all patients, confirming that mutant mtDNA is being cleared (Figure 6A). The behavior of total mtDNA molecules per cell (w + m) is less consistent, with a range of large increases and moderate decreases in total number. As shown in Figures 6B and 6C, patient-to-patient variance in both w and m increases with time in conjunction with h decreases.
Although the measurements in Pyle et al.38 are averages over groups of cells, an approximate quantitative comparison of these data with the predictions of our theory can be made. In the spirit of “back-of-the-envelope” calculations,39 we estimate that each sample of cells giving rise to a measurement corresponds to approximately 103 cells (see Supplemental Data Section 8). Then, using an estimate of days (by comparison with other mammalian species, as above), we find that a selective pressure of day−1 matches the observed decrease in heteroplasmy, corresponding, for example, to a decrease from 0.15 to 0.14 over 8 years, as in Figure 6A. Solving the ODEs resulting from this system, using control model D as the simplest case, predicts increases in cell-to-cell copy-number variance of approximate magnitude and over 8 years. We can translate these cell-to-cell values into the variance expected across samples of cells by dividing by the number of samples (taken as 103 as above). The resulting variance corresponds, for example, to expected standard deviations in wild-type and mutant copy number after 8 years of 10.0 and 4.0, respectively, for a sample with and (consistent with the increasing spread of values in Figures 6B and 6C). We used the Kolmogorov-Smirnov test to test the alternative hypotheses that the experimentally observed and at the later time point differed from those predicted by our model; no test yielded p < 0.05 (see Supplemental Data Section 8). Of course, an absence of support for an alternative hypothesis cannot be taken as support for a null hypothesis, but shows that the existing experimental data are not incompatible with our model.
The observations in Figure 6 support the predictions made in Figure 5, where mutant load is decreased but variance in wild-type copy number (and mutant copy number) increases. The bottom-left quadrant of Figure 5B shows that different control mechanisms display similar initial behavior; follow-up studies on these subjects could be used to distinguish possible mechanisms for mtDNA control. For example, if the rate of wild-type variance increase decreases over time, models B, C, and E are more likely; if wild-type variance continues to increase, models A, D, F, and G are more likely. More detailed model discrimination based on the time behavior of w and m variances are possible (Figure 1) and can be performed using statistical methods accounting for mean and variance behavior.29, 40
Discussion
A general, bottom-up theory has been produced to describe the time behavior of cell-to-cell variance in mtDNA populations subject to controlled biogenesis and/or degradation, mutation, selection, and cell divisions. This theory is based around the microscopic behavior of mtDNA molecules, allowing a hitherto absent connection between widely used “effective” statistical genetics approaches (Equation 13) and measurable biological quantities, and motivating experiments to further elucidate the mechanisms acting to control mtDNA (described in Supplemental Data Section 7). We have shown that the predictions of this theory agree with experimental observations of mixed mtDNA populations and that the application of appropriately validated mathematical theory allows us to make estimates of important biological quantities that remain challenging to directly address with experiments. Our theory describes the cell-to-cell variability in mtDNA populations and thus provides a framework with which to understand the inheritance and onset of mtDNA diseases.6, 14
Our theoretical platform unifies several existing modeling approaches that have driven advances in the study of mtDNA populations. We have specifically demonstrated that the “relaxed replication” model1, 19, 34 (our model A), simple birth-death models14, 20 (our model D), and cellular controls based on homeostatic principles15, 16 (our models B, C, and G) can naturally be represented within our framework. As a result, analytic expressions for the expected behavior of heteroplasmy variance and other population statistics can readily be extracted for these and other mtDNA models (see Supplemental Data Section 3), allowing the detailed characterization of mtDNA dynamics, including the probability of crossing disease thresholds,7 which can be computed from heteroplasmy statistics.14 We have also used the theoretical ideas developed herein to refine a widely used model for mtDNA populations of changing size (the Wright formula), explicitly connecting it with cellular processes and allowing a link between physical and genetic quantities (Equation 14, Figure 4).
Further, our theory also describes the dynamics of heteroplasmy change with time in the presence of selective pressure for one mtDNA type. Given an initial heteroplasmy h0 and a selective pressure β (positive β corresponding to positive selection for the mutant mtDNA type), we find (see Supplemental Data Section 5) that heteroplasmy evolves according to:
(Equation 15) |
This behavior immediately motivates a transformation, allowing the evolution of heteroplasmy to be compared across different starting values h0:
(Equation 16) |
allowing, as in our previous work,23 heteroplasmy results from different biological samples to be compared together, accounting for different initial heteroplasmies (in other words, the same selective pressure will produce the same regardless of h0).
It is likely that control mechanisms found in biology have nonlinear forms (for example, sigmoidal response curves are common in cellular signaling). We have shown that a linearization satisfactorily describes some non-equilibrium behavior (for example, in the case of our cell division model), but further investigation of more general nonlinear behavior and modulation of wider cell behavior by mtDNA populations (for example, by influencing cell cycle progression15) are important future developments. In Supplemental Data Section 1 we discuss a linear stability analysis of our expressions for mean mtDNA behavior, which highlights a link between the “sensing” of an mtDNA species (in the sense that the presence of that species modulates replication or degradation rates) and the ability to control the mean level of that species.
The control of stochastic systems is a well-established field within control theory.41 Optimal control mechanisms addressing the mean and variance of stochastic processes have been derived in a variety of contexts (see, for example, Borkar42 and citations therein), particularly in financial applications,43 and often find tradeoffs between controlling the mean and variance of a process. We observe a comparable tradeoff, that tight control on moments of one species leads to loose control on another. We have focused on providing a general theoretical formalism with which to treat any given control mechanism; it is anticipated that the above treatment may also be of value in describing heterogeneity in other systems where replication and/or death rates of individuals depend on feedback from current numbers of individuals (for example, through terms describing competition for resources in ecology). Within the context of mtDNA populations, we anticipate that this theoretical framework will assist in understanding natural processes of mtDNA inheritance and evolution within an organismal lifetime (including segregation and increasing variance with age)14, 23 and informing applied approaches to control mitochondrial behavior with genetic tools.4, 5
Published: October 27, 2016
Footnotes
Supplemental Data include mathematical derivations, two figures, and two tables and can be found with this article online at http://dx.doi.org/10.1016/j.ajhg.2016.09.016.
Appendix A. Interpretation of Relaxed Replication
The relaxed replication model1, 19 describes a cellular population of mtDNA molecules according to the following algorithm. mtDNAs randomly degrade as a Poisson process with rate . Every time step , the value of C(w,m), a deterministic function of w and m, is computed, then mtDNAs are added to the population. The genetic properties of these added mtDNAs are random—each is assigned a genetic type based on a random sampling of the existing populations—but their physical properties (i.e., the total copy number added at each step) are deterministic. We argue that, as both replication and degradation of mtDNAs depend on complicated behavior and thermal, microscopic interactions, it makes more sense to model both processes as stochastic. Thus, C(w,m) is interpreted as the rate of a Poisson process describing replication, just as is the rate of a Poisson process describing degradation. This interpretation reconciles the nature of the two processes.
Although this feature is less interesting than the underlying scientific behavior, the original algorithm also raises a (not insurmountable) technical problem with implementation. If a time step is chosen, the algorithm will never add any mtDNA to the system. But, in order to suitably characterize the stochastic degradation without using the Gillespie algorithm,44 it is desirable to choose a time step as low as possible. There is therefore a risk that one or other of the deterministic replication and stochastic degradation processes is inadequately captured in a given simulation protocol.
We have illustrated the excellent agreement between our theoretical approaches and stochastic simulation (for example, Figure 1). To quantitatively connect with previous analyses of specific control strategies, we confirm that the behavior for model A (relaxed replication) matches that observed in previous simulation studies19 with a back-of-the-envelope calculation.39 The rate of variance increase with time with days (comparable in magnitude to the (1 − 10)× log 2 days used in Chinnery and Samuels19) and wopt = 1,000 from Figure 1 is roughly 40 day−1. Considering 50 years of evolution of this system, we expect a standard deviation of roughly in mutant copy number. This value is consistent with the simulations in Chinnery and Samuels.19
We connect to an additional numerical result in Capps et al.1 In the absence of a mutant population, the variance of the wild-type population was reported to be stable at , with the original model interpretation of mtDNA replication as deterministic. Under the interpretation of stochastic replication, an absent mutant population (mss = 0) permits stability in the wild-type population variance, which after a little algebra is calculated to be . Intuitively, modeling both replication and degradation as stochastic does not affect mean copy number but does increase variance.
Appendix B. Interpretation of Wright Formula for mtDNA
The Wright formula
(Equation 17) |
has been proposed as a model for the time evolution of heteroplasmy variance in a population with effective size ne subject to random partitioning at each of g generations. This picture has been successfully employed to investigate heteroplasmy distributions in real systems,22 with ne and g interpreted as parameters of the theory without immediate biological interpretation.
The mapping of the original genetic system considered by Wright21 to cellular populations of mtDNA requires some discussion. If “generations” are interpreted as cell divisions, the mechanism by which mtDNA copy number is redoubled between divisions is assumed by the model to be deterministic. Cell divisions will result in a halving of the mtDNA population. Application of the Wright model assumes that the original population is thenceforth recovered with no increased variance in the population. In other words, the mtDNA population is assumed to exactly double between divisions with no stochasticity in the process. As we underline in the main text, the effects of (inevitable) stochasticity due to mtDNA turnover are not explicitly captured by the Wright formula. Other complications exist, as described in Wonnapinij et al.,22 but play less important roles here. As a result, the “bottleneck size” ne cannot immediately be interpreted as an observable minimum cellular copy number of mtDNA molecules (a quantity that is reported by, for example, a qPCR experiment measuring cellular mtDNA content), but rather the size of an effective “founder” population.
Supplemental Data
References
- 1.Capps G.J., Samuels D.C., Chinnery P.F. A model of the nuclear control of mitochondrial DNA replication. J. Theor. Biol. 2003;221:565–583. doi: 10.1006/jtbi.2003.3207. [DOI] [PubMed] [Google Scholar]
- 2.St John J. The control of mtDNA replication during differentiation and development. Biochim. Biophys. Acta. 2014;1840:1345–1354. doi: 10.1016/j.bbagen.2013.10.036. [DOI] [PubMed] [Google Scholar]
- 3.Wallace D.C., Chalkia D. Mitochondrial DNA genetics and the heteroplasmy conundrum in evolution and disease. Cold Spring Harb. Perspect. Biol. 2013;5:a021220. doi: 10.1101/cshperspect.a021220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bacman S.R., Williams S.L., Pinto M., Peralta S., Moraes C.T. Specific elimination of mutant mitochondrial genomes in patient-derived cells by mitoTALENs. Nat. Med. 2013;19:1111–1113. doi: 10.1038/nm.3261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gammage P.A., Rorbach J., Vincent A.I., Rebar E.J., Minczuk M. Mitochondrially targeted ZFNs for selective degradation of pathogenic mitochondrial genomes bearing large-scale deletions or point mutations. EMBO Mol. Med. 2014;6:458–466. doi: 10.1002/emmm.201303672. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Burgstaller J.P., Johnston I.G., Poulton J. Mitochondrial DNA disease and developmental implications for reproductive strategies. Mol. Hum. Reprod. 2015;21:11–22. doi: 10.1093/molehr/gau090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rossignol R., Faustin B., Rocher C., Malgat M., Mazat J.P., Letellier T. Mitochondrial threshold effects. Biochem. J. 2003;370:751–762. doi: 10.1042/BJ20021594. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Baris O.R., Ederer S., Neuhaus J.F., von Kleist-Retzow J.C., Wunderlich C.M., Pal M., Wunderlich F.T., Peeva V., Zsurka G., Kunz W.S. Mosaic deficiency in mitochondrial oxidative metabolism promotes cardiac arrhythmia during aging. Cell Metab. 2015;21:667–677. doi: 10.1016/j.cmet.2015.04.005. [DOI] [PubMed] [Google Scholar]
- 9.Raj A., van Oudenaarden A. Nature, nurture, or chance: stochastic gene expression and its consequences. Cell. 2008;135:216–226. doi: 10.1016/j.cell.2008.09.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kaern M., Elston T.C., Blake W.J., Collins J.J. Stochasticity in gene expression: from theories to phenotypes. Nat. Rev. Genet. 2005;6:451–464. doi: 10.1038/nrg1615. [DOI] [PubMed] [Google Scholar]
- 11.Paulsson J. Summing up the noise in gene networks. Nature. 2004;427:415–418. doi: 10.1038/nature02257. [DOI] [PubMed] [Google Scholar]
- 12.Lygeros J., Koutroumpas K., Dimopoulos S., Legouras I., Kouretas P., Heichinger C., Nurse P., Lygerou Z. Stochastic hybrid modeling of DNA replication across a complete genome. Proc. Natl. Acad. Sci. USA. 2008;105:12295–12300. doi: 10.1073/pnas.0805549105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kopsidas G., Kovalenko S.A., Heffernan D.R., Yarovaya N., Kramarova L., Stojanovski D., Borg J., Islam M.M., Caragounis A., Linnane A.W. Tissue mitochondrial DNA changes. A stochastic system. Ann. N Y Acad. Sci. 2000;908:226–243. doi: 10.1111/j.1749-6632.2000.tb06650.x. [DOI] [PubMed] [Google Scholar]
- 14.Johnston I.G., Burgstaller J.P., Havlicek V., Kolbe T., Rülicke T., Brem G., Poulton J., Jones N.S. Stochastic modelling, Bayesian inference, and new in vivo measurements elucidate the debated mtDNA bottleneck mechanism. eLife. 2015;4:e07464. doi: 10.7554/eLife.07464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Johnston I.G., Gaal B., Neves R.P., Enver T., Iborra F.J., Jones N.S. Mitochondrial variability as a source of extrinsic cellular noise. PLoS Comput. Biol. 2012;8:e1002416. doi: 10.1371/journal.pcbi.1002416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Jajoo R., Jung Y., Huh D., Viana M.P., Rafelski S.M., Springer M., Paulsson J. Accurate concentration control of mitochondria and nucleoids. Science. 2016;351:169–172. doi: 10.1126/science.aaa8714. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Figge M.T., Reichert A.S., Meyer-Hermann M., Osiewacz H.D. Deceleration of fusion-fission cycles improves mitochondrial quality control during aging. PLoS Comput. Biol. 2012;8:e1002576. doi: 10.1371/journal.pcbi.1002576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Poovathingal S.K., Gruber J., Halliwell B., Gunawan R. Stochastic drift in mitochondrial DNA point mutations: a novel perspective ex silico. PLoS Comput. Biol. 2009;5:e1000572. doi: 10.1371/journal.pcbi.1000572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chinnery P.F., Samuels D.C. Relaxed replication of mtDNA: A model with implications for the expression of disease. Am. J. Hum. Genet. 1999;64:1158–1165. doi: 10.1086/302311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Johnston I.G., Jones N.S. Closed-form stochastic solutions for non-equilibrium dynamics and inheritance of cellular components over many cell divisions. Proc. Math. Phys. Eng. Sci. 2015;471:20150050. doi: 10.1098/rspa.2015.0050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wright S. Statistical genetics and evolution. Bull. Am. Math. Soc. 1942;48:223–246. [Google Scholar]
- 22.Wonnapinij P., Chinnery P.F., Samuels D.C. The distribution of mitochondrial DNA heteroplasmy due to random genetic drift. Am. J. Hum. Genet. 2008;83:582–593. doi: 10.1016/j.ajhg.2008.10.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Burgstaller J.P., Johnston I.G., Jones N.S., Albrechtová J., Kolbe T., Vogl C., Futschik A., Mayrhofer C., Klein D., Sabitzer S. MtDNA segregation in heteroplasmic tissues is common in vivo and modulated by haplotype differences and developmental stage. Cell Rep. 2014;7:2031–2041. doi: 10.1016/j.celrep.2014.05.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Van Kampen N. Elsevier; 1992. Stochastic Processes in Physics and Chemistry. [Google Scholar]
- 25.Elf J., Ehrenberg M. Fast evaluation of fluctuations in biochemical networks with the linear noise approximation. Genome Res. 2003;13:2475–2484. doi: 10.1101/gr.1196503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Assaf M., Meerson B. Extinction of metastable stochastic populations. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2010;81:021116. doi: 10.1103/PhysRevE.81.021116. [DOI] [PubMed] [Google Scholar]
- 27.Solignac M., Génermont J., Monnerot M., Mounolou J.C. Drosophila mitochondrial genetics: evolution of heteroplasmy through germ line cell divisions. Genetics. 1987;117:687–696. doi: 10.1093/genetics/117.4.687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Wai T., Teoli D., Shoubridge E.A. The mitochondrial DNA genetic bottleneck results from replication of a subpopulation of genomes. Nat. Genet. 2008;40:1484–1488. doi: 10.1038/ng.258. [DOI] [PubMed] [Google Scholar]
- 29.Wonnapinij P., Chinnery P.F., Samuels D.C. Previous estimates of mitochondrial DNA mutation level variance did not account for sampling error: comparing the mtDNA genetic bottleneck in mice and humans. Am. J. Hum. Genet. 2010;86:540–550. doi: 10.1016/j.ajhg.2010.02.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kimura M. Solution of a process of random genetic drift with a continuous model. Proc. Natl. Acad. Sci. USA. 1955;41:144–150. doi: 10.1073/pnas.41.3.144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Birky C.W., Jr. The inheritance of genes in mitochondria and chloroplasts: laws, mechanisms, and models. Annu. Rev. Genet. 2001;35:125–148. doi: 10.1146/annurev.genet.35.102401.090231. [DOI] [PubMed] [Google Scholar]
- 32.Monnot S., Gigarel N., Samuels D.C., Burlet P., Hesters L., Frydman N., Frydman R., Kerbrat V., Funalot B., Martinovic J. Segregation of mtDNA throughout human embryofetal development: m.3243A>G as a model system. Hum. Mutat. 2011;32:116–125. doi: 10.1002/humu.21417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lawson K.A., Hage W.J. Clonal analysis of the origin of primordial germ cells in the mouse. Ciba Found. Symp. 1994;182:68–84. doi: 10.1002/9780470514573.ch5. discussion 84–91. [DOI] [PubMed] [Google Scholar]
- 34.Cree L.M., Samuels D.C., de Sousa Lopes S.C., Rajasimha H.K., Wonnapinij P., Mann J.R., Dahl H.H., Chinnery P.F. A reduction of mitochondrial DNA molecules during embryogenesis explains the rapid segregation of genotypes. Nat. Genet. 2008;40:249–254. doi: 10.1038/ng.2007.63. [DOI] [PubMed] [Google Scholar]
- 35.Cao L., Shitara H., Horii T., Nagao Y., Imai H., Abe K., Hara T., Hayashi J., Yonekawa H. The mitochondrial bottleneck occurs without reduction of mtDNA content in female mouse germ cells. Nat. Genet. 2007;39:386–390. doi: 10.1038/ng1970. [DOI] [PubMed] [Google Scholar]
- 36.Jenuth J.P., Peterson A.C., Fu K., Shoubridge E.A. Random genetic drift in the female germline explains the rapid segregation of mammalian mitochondrial DNA. Nat. Genet. 1996;14:146–151. doi: 10.1038/ng1096-146. [DOI] [PubMed] [Google Scholar]
- 37.Diot A., Hinks-Roberts A., Lodge T., Liao C., Dombi E., Morten K., Brady S., Fratter C., Carver J., Muir R. A novel quantitative assay of mitophagy: Combining high content fluorescence microscopy and mitochondrial DNA load to quantify mitophagy and identify novel pharmacological tools against pathogenic heteroplasmic mtDNA. Pharmacol. Res. 2015;100:24–35. doi: 10.1016/j.phrs.2015.07.014. [DOI] [PubMed] [Google Scholar]
- 38.Pyle A., Taylor R.W., Durham S.E., Deschauer M., Schaefer A.M., Samuels D.C., Chinnery P.F. Depletion of mitochondrial DNA in leucocytes harbouring the 3243A->G mtDNA mutation. J. Med. Genet. 2007;44:69–74. doi: 10.1136/jmg.2006.043109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Johnston I.G., Rickett B.C., Jones N.S. Explicit tracking of uncertainty increases the power of quantitative rule-of-thumb reasoning in cell biology. Biophys. J. 2014;107:2612–2617. doi: 10.1016/j.bpj.2014.08.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Johnston I.G. Efficient parametric inference for stochastic biological systems with measured variability. Stat. Appl. Genet. Mol. Biol. 2014;13:379–390. doi: 10.1515/sagmb-2013-0061. [DOI] [PubMed] [Google Scholar]
- 41.Åström K. Dover; 2012. Introduction to Stochastic Control Theory. [Google Scholar]
- 42.Borkar V. Controlled diffusion processes. Probability Surveys. 2005;2:213–244. [Google Scholar]
- 43.Markowitz H. Portfolio selection. J. Finance. 1952;7:77–91. [Google Scholar]
- 44.Gillespie D. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 1977;81:2340–2361. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.