Abstract
Extant eukaryotic cells have a dynamic traffic network that consists of diverse membrane-bound organelles exchanging matter via vesicles. This endomembrane system arose and diversified during a period characterized by massive expansions of gene families involved in trafficking after the acquisition of a mitochondrial endosymbiont by a prokaryotic host cell >1.8 billion years ago. Here we investigate the mechanistic link between gene duplication and the emergence of new nonendosymbiotic organelles, using a minimal biophysical model of traffic. Our model incorporates membrane-bound compartments, coat proteins and adaptors that drive vesicles to bud and segregate cargo from source compartments, and SNARE proteins and associated factors that cause vesicles to fuse into specific destination compartments. In simulations, arbitrary numbers of compartments with heterogeneous initial compositions segregate into a few compositionally distinct subsets that we term organelles. The global structure of the traffic system (i.e., the number, composition, and connectivity of organelles) is determined completely by local molecular interactions. On evolutionary timescales, duplication of the budding and fusion machinery followed by loss of cross-interactions leads to the emergence of new organelles, with increased molecular specificity being necessary to maintain larger organellar repertoires. These results clarify potential modes of early eukaryotic evolution as well as more recent eukaryotic diversification.
Introduction
The stark distinction between prokaryotes and eukaryotes is a striking and surprising feature of cellular life. All extant eukaryotes share a large number of defining traits, such as mitochondria, active cytoskeletons, deformable membranes, vesicular traffic, and intracellular compartments (1). The eukaryotic cell plan emerged in the fossil record ∼1.8 billion years ago (GYA), and cells representing major extant eukaryote groups were evident by 1.3 GYA (2). Calibrated molecular phylogeny suggests that the most recent ancestor of all extant eukaryotes (the last eukaryotic common ancestor [LECA]) dates to ∼1.5 GYA (3), although other estimates place LECA more recently (4,5). What is clear is that LECA was already a sophisticated unicellular organism, possessing the complex molecular and phenotypic traits shared by all extant eukaryotic supergroups (6,7). The absence of intermediate forms bridging the prokaryote/eukaryote divide makes it challenging to reconstruct eukaryote evolution before LECA. There is a general consensus that mitochondrial endosymbiosis was a defining step in the origin of eukaryotes, and the nature of the host cell and the timing of this event relative to the emergence of other eukaryotic traits remain active areas of research (8). It has been argued that the mitochondrion was acquired by phagocytic cells that already possessed eukaryote-specific traits (1,4), but recent phylogenetic analyses instead support a prokaryotic archaebacterial host (9). Bioenergetic considerations suggest that the acquisition of mitochondria was a watershed event, setting the stage for the subsequent evolution of eukaryote-specific traits facilitated by a greatly expanded protein repertoire (10). At some point in this process, a recognizably eukaryotic cell with a functional vesicle-traffic apparatus must have emerged. 
Here we explore the evolutionary period leading from this proto-eukaryote to LECA, during which time the system of intracellular traffic and compartmentalization developed into its present form.
The eukaryotic endomembrane system consists of various organelles, such as the endoplasmic reticulum (ER), Golgi apparatus, endosomes, lysosomes, and plasma membrane, that exchange matter through vesicle-mediated traffic. (Throughout the text we will consistently use the term “organelle” to refer to nonendosymbiotic organelles that belong to the endomembrane system, rather than to symbiotic organelles such as mitochondria and plastids.) The traffic system exists in a dynamic steady state in which the sizes and compositions of the organelles remain approximately constant over time even though each organelle is receiving foreign material and losing its own (11). Eukaryotic cells can regain their internal organization after perturbations, so the structure of the endomembrane system is at least partly encoded by local molecular interactions (12–14). Cell-biological investigations have identified a suite of molecules that coordinate to set up and maintain the endomembrane system. Rab GTPases encode compartmental identity and have been implicated in nearly every step of vesicle-mediated traffic, including vesicle budding, uncoating, motility, and specific fusion (15–18). These GTP/GDP-binding proteins cycle between the cytoplasm (in their GDP-bound inactive state) and the membrane (in their GTP-bound active state). When a Rab is membrane bound and active, it can in turn regulate a large number of downstream effector proteins with diverse functions. Coat proteins such as clathrin, COP I, and COP II initiate vesicle formation by locally deforming the organelle membrane, and concentrate or deplete specific cargo molecules via adaptor-mediated interactions (19–23). Vesicles bud off from source compartments and can be transported to their destinations by motor proteins running on cytoskeletal tracks (23). 
The soluble NSF attachment protein receptor (SNARE) family of proteins are integral membrane proteins that occur in pairs (24,25): when a v-SNARE on a vesicle meets a specific partner t-SNARE on the target compartment, the two coil together, driving the membranes to fuse with one another. Molecules that regulate SNARE activity provide additional layers of specificity (26). After fusion occurs, the SNAREs are dissociated by N-ethylmaleimide-sensitive fusion protein (NSF) utilizing the energy of ATP hydrolysis, resetting the system for another round of vesicle budding and fusion.
Phyletic distributions suggest that LECA possessed all of the traffic-related protein families necessary to support a sophisticated endomembrane system (27). All extant eukaryotic cells express several paralogous Rabs, coats, and SNAREs—protein families that are essentially absent in prokaryotes. These paralogs arose during multiple pre-LECA gene family expansions, and different members of paralogous gene families tend to be associated with a specific subset of organelles or pathways of transport (28–33). These observations form the basis of a hypothesis proposed by Dacks and Field (27) and Dacks et al. (34) to explain how new organelles arose on the evolutionary branch leading to LECA subsequent to mitochondrial endosymbiosis. The hypothesis can be split into two parts, one mechanistic and one evolutionary: First, a given organelle is essentially determined by the budding and fusion machinery, which dictates how it exchanges matter with the rest of the system. Second, a new organelle can be generated by the duplication of the budding and fusion molecules associated with an existing organelle, and their subsequent divergence into specifically interacting subsets. Thus, an initially simple traffic system would become more complex as new organelles were added through the paralogous expansion of Rabs, coats, SNAREs, and other gene families involved in budding and fusion.
Here we explore the Dacks-Field evolutionary hypothesis using a biophysical model of eukaryotic endomembrane traffic. It is essential to formalize the hypothesis in mathematical terms so that we can examine the implications of constraints at multiple levels, i.e., physical constraints that place limits on membrane transport and the efficiency and specificity of molecular interactions, and biological constraints that require that at every step of evolution a functional traffic system must persist. Previous investigators have modeled endomembrane traffic using a variety of mathematical approaches (35–39). Our own model is detailed enough to include the machinery of vesicle budding and fusion, and general enough to allow complex, multicompartment steady states. We interrogate the model using simulations as well as bifurcation analyses, and find that the duplication of budding and fusion molecules can indeed give rise to new organelles, as long as molecular interactions obey certain minimum limits of specificity. Thus, the Dacks-Field hypothesis, which was originally posed in qualitative terms based only on phylogenetic signals of gene family expansion, is essentially compatible with our biophysical understanding of the present-day traffic machinery. This fits with the idea that the traffic molecules and their rules of interaction achieved their present form long before LECA and the emergence of the standard eukaryotic cell plan (27). If this is true, the same evolutionary forces that gave rise to LECA probably also contributed to the subsequent diversification of the eukaryotic supergroups, and might continue to operate in extant organisms.
Materials and Methods
The traffic model
The traffic model (Fig. 1) is realized as a system of ordinary differential equations describing how the sizes and compositions of membrane-bound compartments change with time. The components of this model are $N_S$ types of SNAREs, $N_C$ types of coats, and $M$ preexisting compartments (we only discuss cases in which $M > N_C, N_S$). Throughout the text, we use Greek indices to denote SNARE types (α, β = 1, …, $N_S$) or coat types (γ = 1, …, $N_C$), and Roman indices to indicate compartments (i, j = 1, …, M). Each compartment is described by its size and SNARE composition:
$s_v$ = constant size (membrane area) of a single vesicle
$s_j$ = variable size (membrane area) of compartment j
$x_{\alpha j}$ = number of molecules of SNARE α on compartment j
with
$$\sum_{j=1}^{M} s_j = S, \qquad \sum_{j=1}^{M} x_{\alpha j} = X_\alpha. \tag{1}$$
Vesicles are assumed to fuse to their destination compartments nearly instantaneously after their formation, so the total membrane area and total SNARE amounts, $S$ and $X_\alpha$, are obtained by summing over compartments alone. Coat proteins cause vesicles to bud off compartments; therefore, there are $N_C$ types of vesicles corresponding to the $N_C$ coat types. We assume for simplicity that coat proteins of each type are available in equal amounts, so the rate of budding of any type of vesicle depends only on the size of the source compartment. We assume a power-law dependence: the rate of budding of γ-coated vesicles from compartment j, in units of membrane area per unit time, is given by
$$B_{\gamma j} = \lambda\, s_j^{\mu}. \tag{2}$$
The parameter λ has dimensions of membrane area to the power 1 − μ per unit time. The dimensionless exponent μ describes the dependence of the budding rate on compartment size: the case μ = 0 gives vesicle budding independently of the source compartment size, as when coat proteins are rate limiting; the case μ = 1 is the mass-action situation in which the budding rate is proportional to the available source area; and other values of the exponent can be used to capture curvature-dependent budding rates.
SNARE proteins are themselves a variety of cargo. A vesicle packages all of the SNAREs present on the source compartment, but to different extents depending on the affinity of its coat and associated adaptors for each SNARE. We assume that the concentration of a SNARE on a vesicle is a constant multiple of its concentration on the source compartment. Let the amount of SNARE α on γ-coated vesicles from compartment j be $v_{\alpha\gamma j}$. We then have
$$\frac{v_{\alpha\gamma j}}{s_v} = \theta_{\gamma\alpha}\,\frac{x_{\alpha j}}{s_j}, \tag{3}$$
where $\theta_{\gamma\alpha}$ is the dimensionless parameter describing the affinity of coat γ for SNARE α. Because cargo packaging is an energy-dependent process, there are no thermodynamic constraints on $\theta_{\gamma\alpha}$ (40). The value $\theta_{\gamma\alpha}$ = 1 represents the special case in which budding vesicles of type γ sample SNARE α directly from the source compartment; higher or lower values, respectively, will produce vesicles enriched or depleted in SNAREs.
The fusion of vesicles into target compartments is mediated by the pairing of SNAREs, modulated by associated factors such as Rabs and tethers. The probability that a vesicle will fuse into a compartment is assumed to depend on the specificity with which the SNAREs on the vesicle interact with the SNAREs on the target, and on the product of the vesicular and compartmental SNARE concentrations. The rate at which γ-coated vesicles from compartment j fuse into compartment i is obtained by a weighted sum over all possible SNARE pairs:
$$F_{i,\gamma j} = \rho \left( \sum_{\alpha,\beta} \phi_{\alpha\beta}\, \frac{v_{\alpha\gamma j}}{s_v}\, \frac{x_{\beta i}}{s_i} \right)^{n} s_i^{\nu}, \tag{4}$$
where ρ is a scale factor, specific to each vesicle type γ and source compartment j, whose value we will later determine. The parameter $\phi_{\alpha\beta}$ captures the affinity of interaction between SNAREs α and β. We do not explicitly incorporate pairs of v- and t-SNAREs; however, v/t-like pairing of SNAREs can be modeled by an appropriate choice of $\phi_{\alpha\beta}$. There is experimental evidence that SNARE-mediated membrane fusion is a cooperative, nonlinear process (41,42); the exponent n describes the extent of SNARE cooperativity. The specific form of Eq. 4 (summing over all possible SNARE pairs and taking the nth power afterwards) is chosen so that having two distinct SNAREs that interact identically with all other components of the model is equivalent to having only one SNARE, but doubled in amount. Finally, the rate of fusion depends on the size of the target compartment, as described by the term $s_i^{\nu}$.
The exponents μ and ν describe how vesicle budding and fusion rates scale with compartment size. Setting μ = ν = 1 implies that budding and fusion rates should scale linearly with area. This is equivalent to assuming that each infinitesimal patch of membrane on a compartment can be treated as an independent unit whose likelihood of giving rise to a vesicle or fusing with one depends only on its local composition, and not on the size of the compartment on which it resides. Setting either μ or ν different from one assumes that a tiny patch of membrane can sense the compartment size. In real cells, it is plausible that the rates of surface processes such as vesicle budding and fusion increase with the compartment surface area in a nonlinear fashion, given the existence of membrane-curvature-sensing proteins (43,44) and their involvement in the vesicle-trafficking machinery (45–47).
We assume that vesicles fuse into compartments almost as soon as they form, that is, vesicles exist for a period much shorter than the timescale in which source compartments change composition. Using a simple one-species model of vesicular transport, Dmitrieff and Sens (39) showed that a finite-fusion-time version of this model can be mapped onto a simpler instantaneous fusion version. Our assumption of instantaneous vesicle fusion implies that the total rate of budding of γ-coated vesicles from one compartment (Eq. 2) is equal to the sum of their rates of fusion to all target compartments (including the source):
$$\sum_{i=1}^{M} F_{i,\gamma j} = B_{\gamma j} = \lambda\, s_j^{\mu}. \tag{5}$$
This sets the value of the factor ρ. Combining Eqs. 2, 4, and 5, we obtain
$$F_{i,\gamma j} = \lambda\, s_j^{\mu}\; \frac{\left( \sum_{\alpha,\beta} \phi_{\alpha\beta}\, \dfrac{v_{\alpha\gamma j}}{s_v}\, \dfrac{x_{\beta i}}{s_i} \right)^{n} s_i^{\nu}}{\sum_{k=1}^{M} \left( \sum_{\alpha,\beta} \phi_{\alpha\beta}\, \dfrac{v_{\alpha\gamma j}}{s_v}\, \dfrac{x_{\beta k}}{s_k} \right)^{n} s_k^{\nu}}. \tag{6}$$
Finally, the rates of change of compartment sizes and SNARE amounts are given by conservation laws:
$$\frac{ds_i}{dt} = \sum_{\gamma=1}^{N_C} \sum_{j=1}^{M} F_{i,\gamma j} - \sum_{\gamma=1}^{N_C} B_{\gamma i}, \qquad \frac{dx_{\alpha i}}{dt} = \sum_{\gamma=1}^{N_C} \sum_{j=1}^{M} F_{i,\gamma j}\, \frac{v_{\alpha\gamma j}}{s_v} - \sum_{\gamma=1}^{N_C} B_{\gamma i}\, \frac{v_{\alpha\gamma i}}{s_v}. \tag{7}$$
Equations 6 and 7 capture the complete content of our model.
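The model can be sketched in a few lines of code. The block below is a minimal illustration, not the authors' implementation (which used Mathematica's NDSolve). It assumes the rates take the form described in the text: a budding rate λ·s_j^μ, vesicle SNARE concentrations equal to θ times the source concentration, and fusion weights given by the nth power of the ϕ-weighted sum over SNARE pairs, times s_i^ν, normalized over all targets. The function names, the forward-Euler scheme, and the example parameter values are assumptions made for illustration only.

```python
def rates(s, x, theta, phi, lam=1.0, mu=1.1, nu=1.0, n=2):
    """Return (ds/dt, dx/dt) for compartment sizes s[i] and SNARE amounts x[a][i].

    theta[g][a]: affinity of coat g for SNARE a (hypothetical index order).
    phi[a][b]:   affinity between SNAREs a and b.
    """
    M, NS, NC = len(s), len(x), len(theta)
    conc = [[x[a][i] / s[i] for i in range(M)] for a in range(NS)]
    ds = [0.0] * M
    dx = [[0.0] * M for _ in range(NS)]
    for g in range(NC):
        for j in range(M):
            bud = lam * s[j] ** mu  # membrane area budded per unit time
            ves = [theta[g][a] * conc[a][j] for a in range(NS)]  # vesicle SNARE conc.
            # Fusion weight of this vesicle type for every target compartment.
            w = []
            for i in range(M):
                pair = sum(phi[a][b] * ves[a] * conc[b][i]
                           for a in range(NS) for b in range(NS))
                w.append(pair ** n * s[i] ** nu)
            total = sum(w)
            if total <= 0.0:
                # Assumption of this sketch: vesicles with no possible target
                # are simply not budded.
                continue
            ds[j] -= bud
            for a in range(NS):
                dx[a][j] -= bud * ves[a]
            for i in range(M):
                flux = bud * w[i] / total  # area arriving at i per unit time
                ds[i] += flux
                for a in range(NS):
                    dx[a][i] += flux * ves[a]
    return ds, dx


def euler_step(s, x, theta, phi, dt, **kw):
    """Advance the state by one forward-Euler step of length dt."""
    ds, dx = rates(s, x, theta, phi, **kw)
    s_new = [si + dt * d for si, d in zip(s, ds)]
    x_new = [[xa + dt * d for xa, d in zip(row, drow)]
             for row, drow in zip(x, dx)]
    return s_new, x_new


# Illustrative run: 2 coats, 2 SNAREs, 3 compartments (values are arbitrary).
theta = [[1.0, 0.1], [0.1, 1.0]]
phi = [[1.0, 0.1], [0.1, 1.0]]
s = [0.5, 0.3, 0.2]                       # total area 1
x = [[0.6, 0.3, 0.1], [0.1, 0.3, 0.6]]    # each SNARE totals 1
for _ in range(500):
    s, x = euler_step(s, x, theta, phi, dt=0.01)
```

Because every budded vesicle is delivered somewhere, the total membrane area and the total amount of each SNARE are conserved by construction, which is a useful sanity check on any implementation.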
Parameter values
For convenience, we take the number of coat and SNARE types to be equal ($N_C = N_S = N$), although the model can accommodate more general scenarios. We choose the units of time so that λ = 1, the units of membrane surface area so that the total membrane area $S$ = 1, and the units of amounts of each type of SNARE so that the total amount $X_\alpha$ = 1. The exponents are generally held fixed at the values μ = 1.1, ν = 1, and n = 2, unless otherwise mentioned. The remaining key parameters of our model are the coat-SNARE specificity matrix θ and the SNARE-SNARE specificity matrix ϕ. The fixed points of this system are invariant to rescaling θ, but their stabilities are not; in the present analysis, we restrict our attention to matrices θ with maximal value unity. Finally, the system is invariant to rescaling ϕ, which only appears in the numerator and denominator of Eq. 6; we scale ϕ so its maximal value is unity.
Bifurcation analysis
Consider an N coat, N SNARE, N compartment version of the model in which θ and ϕ are of the forms given in Eq. 8. Suppose all compartments are equally sized and there is a one-to-one map between each compartment i and its dominant SNARE α(i). For each i, the amount of SNARE α(i) on compartment i is y0, with the remaining (1 − y0) being distributed uniformly among the remaining compartments. Because all interaction parameters are completely symmetric, there is nothing to break the symmetry of this initial state. As the system evolves with time, it will stay on the line $s_i = 1/N$, $x_{\alpha(i)\,i} = y$, $x_{\alpha(i)\,j} = (1 - y)/(N - 1)$ for $j \neq i$, for all i (where $s_i$ is the size of compartment i and $x_{\alpha j}$ is the amount of SNARE α on compartment j). This line is therefore a one-dimensional (1D) dynamical system described by the dynamical variable y, whose time evolution is given by some function $f$: $dy/dt = f(y)$. For a given δ and ε, when f is plotted against y (see Fig. 4, A–H), we will have fixed points whenever f = 0. The stability of any fixed point is determined from the local slope of the graph: a fixed point is stable when $f'(y) < 0$. Bifurcations (i.e., the emergence or loss of stable fixed points) occur where f and $f'$ vanish simultaneously. This 1D dynamical system contains both the one-organelle (y = 1/N, all compartments identical) and N-organelle (y > 1/N, all compartments distinct) fixed points. We numerically analyze the emergence and stability of the 1D fixed points in δ-ε space, and verify by simulations that this correctly gives the stability of the fixed points of the full dynamical system.
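The 1D reduction can be illustrated with a short sketch. The code below evaluates the rate of change of the dominant SNARE amount y on the symmetric line, assuming the budding and fusion rules described in "The traffic model", equal compartment sizes 1/N, and specificity matrices with unit diagonal entries and off-diagonal entries δ (coat-SNARE) and ε (SNARE-SNARE). The function names, the grid scan, and the bisection refinement are illustrative choices, not the authors' procedure (which used Mathematica's FindRoot).

```python
def f(y, N, delta, eps, lam=1.0, mu=1.1, nu=1.0, n=2):
    """dy/dt on the symmetric line: y is the amount of the dominant SNARE."""
    s = 1.0 / N                      # every compartment has the same size
    z = (1.0 - y) / (N - 1)          # amount of each non-dominant SNARE
    conc = [[(y if a == i else z) / s for i in range(N)] for a in range(N)]
    theta = [[1.0 if g == a else delta for a in range(N)] for g in range(N)]
    phi = [[1.0 if a == b else eps for b in range(N)] for a in range(N)]
    bud = lam * s ** mu
    dy = 0.0
    for g in range(N):
        for j in range(N):
            ves = [theta[g][a] * conc[a][j] for a in range(N)]
            w = [sum(phi[a][b] * ves[a] * conc[b][i]
                     for a in range(N) for b in range(N)) ** n * s ** nu
                 for i in range(N)]
            total = sum(w)
            if total > 0.0:
                # SNARE 0 delivered to compartment 0 by (g, j) vesicles.
                dy += bud * w[0] / total * ves[0]
        # SNARE 0 removed from compartment 0 by g-coated vesicles.
        dy -= bud * theta[g][0] * conc[0][0]
    return dy


def n_organelle_fixed_points(N, delta, eps, grid=400):
    """Roots of f in (1/N, 1], located by sign changes and refined by bisection."""
    lo = 1.0 / N
    ys = [lo + (1.0 - lo) * k / grid for k in range(1, grid + 1)]
    roots = []
    for a, b in zip(ys, ys[1:]):
        fa, fb = f(a, N, delta, eps), f(b, N, delta, eps)
        if fa * fb < 0.0:
            for _ in range(60):
                m = 0.5 * (a + b)
                fm = f(m, N, delta, eps)
                if fa * fm <= 0.0:
                    b = m
                else:
                    a, fa = m, fm
            roots.append(0.5 * (a + b))
    return roots


def is_stable(y, N, delta, eps, h=1e-6):
    """A fixed point is stable when the local slope of f is negative."""
    slope = (f(y + h, N, delta, eps) - f(y - h, N, delta, eps)) / (2.0 * h)
    return slope < 0.0


# Illustrative query: N-organelle fixed points for N = 3, delta = eps = 0.1.
roots = n_organelle_fixed_points(3, 0.1, 0.1)
stable_roots = [r for r in roots if is_stable(r, 3, 0.1, 0.1)]
```

Two limiting cases can be checked by hand for N = 2. With perfect specificity (δ = ε = 0) f is positive just above y = 1/2, so the one-organelle state is unstable; with no specificity (δ = ε = 1) f is negative there, so the one-organelle state is stable.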
Simulations
All simulations and calculations were done in Wolfram Mathematica 7.0. Numerical solutions of ordinary differential equations were obtained using the function NDSolve. Roots of functions for the bifurcation analysis were determined numerically using the function FindRoot.
Results
Ingredients of the traffic model
The pioneering model of two coats and two SNAREs developed by Heinrich and Rapoport (35) reaches a steady state in which two initially similar compartments become distinct in composition, enriched in different SNAREs and sending out vesicles with different coats. This result is interpreted as the two compartments becoming distinct organelles. Our model is similar in spirit to the Heinrich-Rapoport model but has three distinguishing features: First, we treat physically distinct compartments and chemically distinct organelles differently, which allows us to separate the effects of biochemistry from those of available membranes. Second, we explicitly formulate the model using network equations with arbitrary numbers of proteins of each molecular variety, which allows us to study large systems with complex topologies. Third, we treat molecular interactions as parameters that can vary, for example, over evolutionary timescales.
The model pertains only to local molecular mechanisms of budding and fusion, and is formulated in biologically realistic terms supported by experimental evidence (Fig. 1, A–D; see “The traffic model” above). The traffic system consists of single-membrane-bound compartments. We do not explicitly include the de novo synthesis of new compartments, but this is implicit in our assumption that the number of compartments always exceeds the number of available coat and SNARE types. Coat proteins of different types (a shorthand that includes associated factors such as Rabs and adaptors) cause these compartments to give rise to vesicles of different compositions (19–23), and SNAREs (a shorthand that includes associated factors such as Rabs and tethers) on compartment and vesicle membranes specifically pair up to drive fusion (24–26). Because we are specifically interested in exploring the Dacks-Field evolutionary scenario, we focus on the budding and fusion machinery and ignore additional complexities such as cytoskeletal transport and transport via tubules. In the simple two-compartment case, over all parameter values we tested, our model produced results qualitatively similar to those obtained with the Heinrich-Rapoport model. This suggests that the results we describe below arise from broad biological considerations rather than from specific details of the model’s formulation.
Compartments segregate into compositionally distinct organelles
We studied the behavior of this model through numerical simulations (see “Simulations” above). For many forms of the specificity matrices θ and ϕ, and a variety of initial conditions, we find that the system approaches a steady state. If the genotype includes multiple varieties of coats and SNAREs, this steady state typically comprises many compositionally distinct compartments that can be sorted into subsets: compartments within one subset have identical compositions, and compartments in different subsets have distinct compositions. We refer to all compartments within a subset as being part of the same organelle (Fig. 1 E). In our formulation, as in most previously studied traffic models (36–38) (but not in Heinrich and Rapoport (35)), some degree of SNARE cooperativity (corresponding to n > 1) is necessary to obtain stable, nonidentical organelles (Fig. 2, A and B).
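The grouping of compartments into organelles can be made concrete with a small helper. The sketch below assumes a compartment's composition is its vector of SNARE concentrations (amount divided by size) and treats two compartments as the same organelle when these vectors agree within a tolerance. The function name, the tolerance, and the minimum-size cutoff for vanished compartments are hypothetical choices for illustration.

```python
def count_organelles(s, x, tol=1e-3, min_size=1e-9):
    """Group compartments with matching SNARE concentration vectors.

    s[i]    : size of compartment i
    x[a][i] : amount of SNARE a on compartment i
    Returns (number of organelles, label per compartment); compartments
    smaller than min_size are treated as vanished and labeled None.
    """
    reps = []      # representative composition of each organelle found so far
    labels = []
    for i in range(len(s)):
        if s[i] <= min_size:
            labels.append(None)
            continue
        comp = [x[a][i] / s[i] for a in range(len(x))]
        for k, rep in enumerate(reps):
            if max(abs(c - r) for c, r in zip(comp, rep)) <= tol:
                labels.append(k)
                break
        else:
            reps.append(comp)
            labels.append(len(reps) - 1)
    return len(reps), labels


# Hand-made steady state: four compartments forming two organelles.
sizes = [0.25, 0.25, 0.3, 0.2]
snares = [[0.45, 0.45, 0.06, 0.04],
          [0.05, 0.05, 0.54, 0.36]]
n_org, labels = count_organelles(sizes, snares)
```

In this example the first two compartments share one composition and the last two share another, even though the last two differ in size, so the four compartments make up two organelles.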
The manner in which the mass of an organelle is distributed among the compartments it is composed of depends on the relationship between the budding and fusion exponents μ and ν. If μ > ν (budding rates increase faster with surface area than do fusion rates), a compartment that is very large will give rise to many vesicles but will not receive as many in return, so it will tend to shrink; conversely, small compartments will tend to grow. The result at steady state is that each organelle is made up of several equally sized compartments (Fig. 2 C). If μ < ν, a large compartment will attract more vesicles than it loses, so it will grow at the expense of smaller compartments. At steady state, each organelle will be made up of a single large compartment, with all other compartments shrinking and eventually disappearing (Fig. 2 E). In the degenerate case of μ = ν, organelles are made up of several compartments of arbitrary sizes (Fig. 2 D). These dynamics are reminiscent of the switch between multiple small compartments and a single large compartment seen in organelles such as the Golgi or late endosomes under a variety of perturbations (48,49).
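The three regimes can be reproduced in a reduced sketch. It assumes a single organelle whose compartments all share the same composition, so that fusion weights differ between targets only through the size factor s_i^ν and each compartment's size changes at the rate (total budding) × s_i^ν / Σ_k s_k^ν − λ s_i^μ. This reduction, the function names, and the forward-Euler integration are assumptions of the illustration rather than part of the original analysis.

```python
def size_rates(s, mu, nu, lam=1.0):
    """ds_i/dt when all compartments share one composition."""
    total_bud = sum(lam * si ** mu for si in s)   # area budded per unit time
    z = sum(si ** nu for si in s)                 # normalization of fusion weights
    return [total_bud * si ** nu / z - lam * si ** mu for si in s]


def relax(s, mu, nu, dt=0.01, steps=20000):
    """Integrate the size dynamics with forward Euler, clamping sizes at zero."""
    s = list(s)
    for _ in range(steps):
        ds = size_rates(s, mu, nu)
        s = [max(si + dt * d, 0.0) for si, d in zip(s, ds)]
    return s


start = [0.6, 0.3, 0.1]
equalized = relax(start, mu=1.1, nu=1.0)        # mu > nu: sizes converge
winner = relax(start, mu=0.9, nu=1.0)           # mu < nu: largest takes over
frozen = relax(start, mu=1.0, nu=1.0, steps=100)  # mu = nu: sizes do not change
```

For two compartments of sizes a > b with ν = 1, the rate of change of a works out to ab(b^(μ−1) − a^(μ−1))/(a + b), which is negative when μ > 1 and positive when μ < 1, matching the qualitative argument above.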
The number of distinct organelles depends only on specific molecular interactions
As long as the number M of compartments is larger than the number N of coats and SNAREs, we find that the number of organelles (i.e., the number of distinct compartment compositions at steady state) depends only on the interaction matrices θ and ϕ. Fig. 3 shows the results of simulations with four different sets of parameters, with either 10 or 20 initial compartments. The corresponding values for θ and ϕ have simple biological interpretations (Fig. 3 A). In Fig. 3, B–D, each coat has one preferred SNARE that it packages better than it does all the others, and each SNARE has a high affinity for its own type and a low one for all the others, so two SNAREs of the same type make a good pair to cause fusion. If the preferences of coats for SNAREs (or SNAREs for SNAREs) are not very specific (Fig. 3 B), all compartments become identical in composition, and the number of organelles is one. However, if these preferences are sharp enough (Fig. 3, C and D), the number of organelles formed is the same as the number N of coats and SNAREs, and each organelle is given its identity by one dominant SNARE type.
Gene duplication and loss of cross-interactions can lead to the formation of new organelles
We next examine what happens when gene duplication changes the underlying genotype. A system with three coats and SNAREs with no cross-interactions has a steady state comprising three organelles (Fig. 3 D). Suppose now that one entire molecular set (coats, SNAREs, and all associated factors) is duplicated. In the resulting matrices, although there are four coats and SNAREs, the new protein copies have the same interactions as the old ones, so the number of truly distinct coats and SNAREs is only three. Indeed, we see that only three organelles are formed in steady state (Fig. 3 E). However, if the off-diagonal cross-interaction terms between new and old protein copies are reduced, with all else being held constant, the system switches to a four-organelle state (Fig. 3 F). Thus, the duplication of budding and fusion machinery can be seen as the driving force behind the emergence of new organelles.
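The duplication-then-divergence step amounts to a simple operation on the interaction matrices. The sketch below is an illustration under stated assumptions: a duplicated gene initially inherits every interaction of its parent (a copied row and column), and divergence is represented by lowering only the cross-interaction entries between the old and new copies. The helper names and the example values are hypothetical.

```python
def duplicate(matrix, k):
    """Append a copy of row k and column k: the new gene interacts like the old."""
    rows = [list(row) + [row[k]] for row in matrix]
    rows.append(list(rows[k]))
    return rows


def diverge(matrix, old, new, value):
    """Set the cross-interactions between the old and new copies to `value`."""
    out = [list(row) for row in matrix]
    out[old][new] = value
    out[new][old] = value
    return out


def distinct_types(matrix):
    """Number of rows with distinct interaction profiles."""
    return len({tuple(row) for row in matrix})


# Three SNAREs with no cross-interactions (identity-like phi).
phi3 = [[1.0, 0.0, 0.0],
        [0.0, 1.0, 0.0],
        [0.0, 0.0, 1.0]]

phi_dup = duplicate(phi3, 2)             # four SNAREs, last two indistinguishable
phi_div = diverge(phi_dup, 2, 3, 0.0)    # cross-interaction lost
```

Immediately after duplication the matrix is 4 × 4 but has only three distinct interaction profiles; once the cross-interaction between the two copies is removed, all four profiles differ. The same operations apply to the coat-SNARE matrix θ when a coat and its preferred SNARE are duplicated together.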
Larger organellar repertoires require increased molecular interaction specificity
Gene duplication followed by divergence typically results in a state with a larger organelle number, but this is not always the case. The emergence of new organelles appears to be contingent on the degree of specificity of all the molecules in the system, not just of the duplicated protein copies. To understand this parameter dependence, we performed a bifurcation analysis on a reduced highly symmetric subsystem in which the number of compartments is equal to the number N of coats and SNAREs (see “Bifurcation analysis” above). Suppose the matrices θ and ϕ are of the forms
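$$\theta_{\gamma\alpha} = \begin{cases} 1, & \gamma = \alpha \\ \delta, & \gamma \neq \alpha \end{cases} \qquad \phi_{\alpha\beta} = \begin{cases} 1, & \alpha = \beta \\ \varepsilon, & \alpha \neq \beta \end{cases} \tag{8}$$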
where δ and ε are dimensionless parameters with values between zero and one. As shown in Fig. 3, when δ and ε are small enough, N distinct organelles emerge (Fig. 3, C and D). However, when these parameters are closer to one (meaning that the coats do not discriminate much among the SNAREs, and the SNAREs do not discriminate among themselves), all compartments become identical in composition and the number of organelles is one (Fig. 3 B). For N = 2, 3, 4, and 5, we used a bifurcation analysis (Fig. 4, A–H; see Materials and Methods) to map out the regions of the δ-ε space that give rise to the N-organelle and one-organelle behaviors. For N = 2 (Fig. 4 I), we find a curve in δ-ε space across which N-organelle fixed points appear and the one-organelle fixed point simultaneously becomes unstable. For N ≥ 3 (Fig. 4 J), we find two curves: first, N-organelle fixed points appear but the one-organelle fixed point is still stable; second (as we approach the origin), the one-organelle fixed point becomes unstable. In the region between these two curves, the outcome depends on initial conditions. For increasing N, we can examine the size of the parameter region in δ-ε space that generates N-organelle behavior (Fig. 4 K). We find that as N increases, this region shrinks: δ and ε are both required to be smaller for N-organelle behavior to arise, although decreased specificity along one axis (say, of coat-SNARE interactions) can be compensated for, to a point, by increased specificity along the other axis (say, SNARE-SNARE interactions). In general, greater interaction specificities are required, across all the molecules in the system, to maintain a larger repertoire of distinct organelles.
Discussion
Our goal in this work was to assess the plausibility of the Dacks-Field hypothesis about pre-LECA eukaryote evolution, on the basis of molecular and functional constraints; that is, we sought to perform a biophysical analysis rather than an evolutionary analysis. We inserted all of the evolutionary ingredients (gene duplication and divergence) by hand to capture essential features of the known phylogeny of Rabs, coats, and SNAREs (34). This approach glosses over a variety of complications. For example, we assume that coats, SNAREs, and all their associated factors are encoded by individual genes, ignoring the issue of multimeric proteins and interacting complexes with potentially overlapping subunits. What is represented as a single duplication event in our analysis corresponds to multiple underlying duplication and divergence events. Understanding the frequency with which such a series of rare events might occur would require a detailed population-genetic analysis—one that includes effects of population size, selection, underlying mutation, recombination rates, and so on. Such an analysis is not only technically complex but requires a deep understanding of ancient conditions and selection pressures. However, we are not asking for the likelihood that some series of gene duplication events might occur—we want to check, given that some series of events did occur, whether their claimed effects on intracellular organization are consistent with known biophysical constraints. The Dacks-Field hypothesis is extremely specific: it posits that duplication and coevolution of the machinery underlying vesicle budding and fusion were sufficient by themselves to generate new organelles. This is falsifiable. It is possible, for example, that organelle numbers were determined by templating from parent to daughter cells, or that they arose through active cytoskeletal processes, or that they depended on having distinct endosymbiont genomes. 
What we do find is precisely what the basic qualitative form of the hypothesis predicts, with two important additions. First, we place quantitative limits on the degree of molecular specificity required to maintain a functional endomembrane system. Second, we show that specificity must increase across the entire system to support larger organellar repertoires. Indeed, many layers of specificity are built into present-day endomembrane systems (50). It is possible that unrelated selective pressures could have favored increased specificities in the short term, setting the stage for organellar diversification in the long term (51).
The acquisition of mitochondria was necessary but not sufficient for the emergence and diversification of the endomembrane system. The endosymbionts provided the energy to support a vast expansion in gene families (10), but this does not in itself explain the very structured type of gene duplication and divergence required by our model. If we start with an initial set of interacting proteins (some set of Rabs, coats, and SNAREs), all of these proteins must duplicate, the resulting pairs of initially identical proteins must break up into two subsets, and members of each subset must coevolve to maintain interactions among themselves while at the same time suppressing interactions with the other subset. Hybridization parsimoniously accounts for the emergence of multiple weakly interacting protein subsets (Fig. 5, A and B). Hybridizations between moderately diverged single-celled eukaryotes are relatively common in high-density populations (52), and endosymbiotic associations among eukaryotes may promote hybridization between more diverged varieties (53). There is another, more subtle route that leads to the desired result, one that can only operate in compartmentalized cells. Whole-genome duplication can produce multiple, simultaneous gene duplications (Fig. 5 C), but we must still account for the subsequent loss of cross-interactions. In the hybridization scenario, the precursor protein sets are segregated into two distinct cells. By analogy, after a whole-genome duplication and a few mutations in targeting sequences, precursor protein sets might become segregated into distinct compartments, a process termed neolocalization (54). Subsequent coevolution would tend to suppress cross-interactions, although some form of selection would be required to maintain within-set interactions (Fig. 5 D).
This retargeting scenario opens up the following interesting possibility: a cell with many existing compartments is more likely to achieve multiple weakly interacting protein subsets. Conversely, we have shown how duplicate protein subsets of Rabs, coats, and SNAREs can generate new compartments. This virtuous cycle may have triggered an accelerated phase of pre-LECA eukaryote evolution, resulting in the prokaryote/eukaryote divide we observe among extant organisms.
Conclusions
Given the patchy fossil record of unicellular organisms, the study of pre-LECA eukaryote evolution has so far been restricted to phylogenetic analyses and comparative genomics. Mathematical models of the endomembrane system can supplement these bioinformatic tools, providing a powerful means to falsify evolutionary hypotheses or evaluate their plausibility.
The emergence of stable nonidentical compartments seems to be a universal feature of endomembrane traffic models (35–39). We, too, find that initially heterogeneous compartments sort themselves into subsets whose members have identical compositions, and each such subset embodies a distinct organelle. The surprising result is that the number of organelles depends only on specific molecular interactions in a highly predictable manner, and not on the number of preexisting compartments or the initial state. Although we discovered this property in the context of a certain mathematical formulation, we imagine it occurs quite generally among a broad class of self-organized traffic models. Taking this further, we suggest that it is likely to be a verifiable property of real present-day cells. Crucially, this robust mapping from molecular genotype to compartmental phenotype paves the way for forces acting at the genetic level to drive changes in intracellular organization. Thus, the present-day structure of cells contains clues about the evolutionary processes through which they arose, processes that have continued to operate over billions of years.
Acknowledgments
We thank Upinder Bhalla, Jitu Mayor, and Mark Field for useful discussions. M.T. conceived the study, R.R. and M.T. developed the model, R.R. analyzed the model, and R.R. and M.T. wrote the paper.
M.T. was supported in part by a Wellcome Trust-DBT India Alliance Intermediate Fellowship (500103/Z/09/Z). A portion of this work was carried out during a long-term program at the Kavli Institute for Theoretical Physics.
Footnotes
Rohini Ramadas’s present address is Department of Mathematics, University of Michigan, Ann Arbor, Michigan.
References
- 1. de Duve C. The origin of eukaryotes: a reappraisal. Nat. Rev. Genet. 2007;8:395–403. doi: 10.1038/nrg2071.
- 2. Knoll A.H., Javaux E.J., Cohen P. Eukaryotic organisms in Proterozoic oceans. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2006;361:1023–1038. doi: 10.1098/rstb.2006.1843.
- 3. Parfrey L.W., Lahr D.J.G., Katz L.A. Estimating the timing of early eukaryotic diversification with multigene molecular clocks. Proc. Natl. Acad. Sci. USA. 2011;108:13624–13629. doi: 10.1073/pnas.1110633108.
- 4. Cavalier-Smith T. Deep phylogeny, ancestral groups and the four ages of life. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2010;365:111–132. doi: 10.1098/rstb.2009.0161.
- 5. Chernikova D., Motamedi S., Rogozin I.B. A late origin of the extant eukaryotic diversity: divergence time estimates using rare genomic changes. Biol. Direct. 2011;6:26. doi: 10.1186/1745-6150-6-26.
- 6. Parfrey L.W., Barbero E., Katz L.A. Evaluating support for the current classification of eukaryotic diversity. PLoS Genet. 2006;2:e220. doi: 10.1371/journal.pgen.0020220.
- 7. Koonin E.V. Preview. The incredible expanding ancestor of eukaryotes. Cell. 2010;140:606–608. doi: 10.1016/j.cell.2010.02.022.
- 8. Embley T.M., Martin W. Eukaryotic evolution, changes and challenges. Nature. 2006;440:623–630. doi: 10.1038/nature04546.
- 9. Williams T.A., Foster P.G., Embley T.M. A congruent phylogenomic signal places eukaryotes within the Archaea. Proc. Biol. Sci. 2012;279:4870–4879. doi: 10.1098/rspb.2012.1795.
- 10. Lane N., Martin W. The energetics of genome complexity. Nature. 2010;467:929–934. doi: 10.1038/nature09486.
- 11. Pelham H.R.B. The dynamic organisation of the secretory pathway. Cell Struct. Funct. 1996;21:413–419. doi: 10.1247/csf.21.413.
- 12. Fujiwara T., Oda K., Ikehara Y. Brefeldin A causes disassembly of the Golgi complex and accumulation of secretory proteins in the endoplasmic reticulum. J. Biol. Chem. 1988;263:18545–18552.
- 13. Zaal K.J.M., Smith C.L., Lippincott-Schwartz J. Golgi membranes are absorbed into and reemerge from the ER during mitosis. Cell. 1999;99:589–601. doi: 10.1016/s0092-8674(00)81548-2.
- 14. Marshall W.F. Origins of cellular geometry. BMC Biol. 2011;9:57. doi: 10.1186/1741-7007-9-57.
- 15. Stenmark H., Olkkonen V.M. The Rab GTPase family. Genome Biol. 2001;2:3007.1–3007.7. doi: 10.1186/gb-2001-2-5-reviews3007.
- 16. Zerial M., McBride H. Rab proteins as membrane organizers. Nat. Rev. Mol. Cell Biol. 2001;2:107–117. doi: 10.1038/35052055.
- 17. Grosshans B.L., Ortiz D., Novick P. Rabs and their effectors: achieving specificity in membrane traffic. Proc. Natl. Acad. Sci. USA. 2006;103:11821–11827. doi: 10.1073/pnas.0601617103.
- 18. Stenmark H. Rab GTPases as coordinators of vesicle traffic. Nat. Rev. Mol. Cell Biol. 2009;10:513–525. doi: 10.1038/nrm2728.
- 19. Aridor M., Weissman J., Balch W.E. Cargo selection by the COPII budding machinery during export from the ER. J. Cell Biol. 1998;141:61–70. doi: 10.1083/jcb.141.1.61.
- 20. Schekman R., Orci L. Coat proteins and vesicle budding. Science. 1996;271:1526–1533. doi: 10.1126/science.271.5255.1526.
- 21. Rothman J.E., Wieland F.T. Protein sorting by transport vesicles. Science. 1996;272:227–234. doi: 10.1126/science.272.5259.227.
- 22. Bonifacino J.S., Lippincott-Schwartz J. Coat proteins: shaping membrane transport. Nat. Rev. Mol. Cell Biol. 2003;4:409–414. doi: 10.1038/nrm1099.
- 23. Munro S. Organelle identity and the organization of membrane traffic. Nat. Cell Biol. 2004;6:469–472. doi: 10.1038/ncb0604-469.
- 24. Chen Y.A., Scheller R.H. SNARE-mediated membrane fusion. Nat. Rev. Mol. Cell Biol. 2001;2:98–106. doi: 10.1038/35052017.
- 25. McNew J.A., Parlati F., Rothman J.E. Compartmental specificity of cellular membrane fusion encoded in SNARE proteins. Nature. 2000;407:153–159. doi: 10.1038/35025000.
- 26. Gerst J.E. SNARE regulators: matchmakers and matchbreakers. Biochim. Biophys. Acta. 2003;1641:99–110. doi: 10.1016/s0167-4889(03)00096-x.
- 27. Dacks J.B., Field M.C. Evolution of the eukaryotic membrane-trafficking system: origin, tempo and mode. J. Cell Sci. 2007;120:2977–2985. doi: 10.1242/jcs.013250.
- 28. Bock J.B., Matern H.T., Scheller R.H. A genomic perspective on membrane compartment organization. Nature. 2001;409:839–841. doi: 10.1038/35057024.
- 29. Jékely G. Small GTPases and the evolution of the eukaryotic cell. Bioessays. 2003;25:1129–1138. doi: 10.1002/bies.10353.
- 30. McMahon H.T., Mills I.G. COP and clathrin-coated vesicle budding: different pathways, common approaches. Curr. Opin. Cell Biol. 2004;16:379–391. doi: 10.1016/j.ceb.2004.06.009.
- 31. Makarova K.S., Wolf Y.I., Koonin E.V. Ancestral paralogs and pseudoparalogs and their role in the emergence of the eukaryotic cell. Nucleic Acids Res. 2005;33:4626–4638. doi: 10.1093/nar/gki775.
- 32. Zhang D., Aravind L. Identification of novel families and classification of the C2 domain superfamily elucidate the origin and evolution of membrane targeting activities in eukaryotes. Gene. 2010;469:18–30. doi: 10.1016/j.gene.2010.08.006.
- 33. Elias M., Brighouse A., Dacks J.B. Sculpting the endomembrane system in deep time: high resolution phylogenetics of Rab GTPases. J. Cell Sci. 2012;125:2500–2508. doi: 10.1242/jcs.101378.
- 34. Dacks J.B., Poon P.P., Field M.C. Phylogeny of endocytic components yields insight into the process of nonendosymbiotic organelle evolution. Proc. Natl. Acad. Sci. USA. 2008;105:588–593. doi: 10.1073/pnas.0707318105.
- 35. Heinrich R., Rapoport T.A. Generation of nonidentical compartments in vesicular transport systems. J. Cell Biol. 2005;168:271–280. doi: 10.1083/jcb.200409087.
- 36. Gong H., Sengupta D., Schwartz R. Simulated de novo assembly of Golgi compartments by selective cargo capture during vesicle budding and targeted vesicle fusion. Biophys. J. 2008;95:1674–1688. doi: 10.1529/biophysj.107.127498.
- 37. Binder B., Goede A., Holzhütter H.G. A conceptual mathematical model of the dynamic self-organisation of distinct cellular organelles. PLoS ONE. 2009;4:e8295. doi: 10.1371/journal.pone.0008295.
- 38. Gong H., Guo Y., Schwartz R. Discrete, continuous, and stochastic models of protein sorting in the Golgi apparatus. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2010;81:011914. doi: 10.1103/PhysRevE.81.011914.
- 39. Dmitrieff S., Sens P. Cooperative protein transport in cellular organelles. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2011;83:041923. doi: 10.1103/PhysRevE.83.041923.
- 40. Traub L.M. Tickets to ride: selecting cargo for clathrin-regulated internalization. Nat. Rev. Mol. Cell Biol. 2009;10:583–596. doi: 10.1038/nrm2751.
- 41. Hua Y., Scheller R.H. Three SNARE complexes cooperate to mediate membrane fusion. Proc. Natl. Acad. Sci. USA. 2001;98:8065–8070. doi: 10.1073/pnas.131214798.
- 42. Mohrmann R., de Wit H., Sørensen J.B. Fast vesicle fusion in living cells requires at least three SNARE complexes. Science. 2010;330:502–505. doi: 10.1126/science.1193134.
- 43. Peter B.J., Kent H.M., McMahon H.T. BAR domains as sensors of membrane curvature: the amphiphysin BAR structure. Science. 2004;303:495–499. doi: 10.1126/science.1092586.
- 44. Huang K.C., Ramamurthi K.S. Macromolecules that prefer their membranes curvy. Mol. Microbiol. 2010;76:822–832. doi: 10.1111/j.1365-2958.2010.07168.x.
- 45. Cabrera M., Langemeyer L., Ungermann C. Phosphorylation of a membrane curvature-sensing motif switches function of the HOPS subunit Vps41 in membrane tethering. J. Cell Biol. 2010;191:845–859. doi: 10.1083/jcb.201004092.
- 46. Bigay J., Casella J.F., Antonny B. ArfGAP1 responds to membrane curvature through the folding of a lipid packing sensor motif. EMBO J. 2005;24:2244–2253. doi: 10.1038/sj.emboj.7600714.
- 47. Carlton J., Bujny M., Cullen P.J. Sorting nexin-1 mediates tubular endosome-to-TGN transport through coincidence sensing of high-curvature membranes and 3-phosphoinositides. Curr. Biol. 2004;14:1791–1800. doi: 10.1016/j.cub.2004.09.077.
- 48. Terasaki M. Dynamics of the endoplasmic reticulum and Golgi apparatus during early sea urchin development. Mol. Biol. Cell. 2000;11:897–914. doi: 10.1091/mbc.11.3.897.
- 49. Gabriely G., Kama R., Gerst J.E. Involvement of specific COPI subunits in protein sorting from the late endosome to the vacuole in yeast. Mol. Cell. Biol. 2007;27:526–540. doi: 10.1128/MCB.00577-06.
- 50. Dacks J.B., Peden A.A., Field M.C. Evolution of specificity in the eukaryotic endomembrane system. Int. J. Biochem. Cell Biol. 2009;41:330–340. doi: 10.1016/j.biocel.2008.08.041.
- 51. Koumandou V.L., Dacks J.B., Field M.C. Control systems for membrane fusion in the ancestral eukaryote; evolution of tethering complexes and SM proteins. BMC Evol. Biol. 2007;7:29. doi: 10.1186/1471-2148-7-29.
- 52. Libkind D., Hittinger C.T., Sampaio J.P. Microbe domestication and the identification of the wild genetic stock of lager-brewing yeast. Proc. Natl. Acad. Sci. USA. 2011;108:14539–14544. doi: 10.1073/pnas.1105430108.
- 53. Nowack E.C.M., Melkonian M. Endosymbiotic associations within protists. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2010;365:699–712. doi: 10.1098/rstb.2009.0188.
- 54. Marques A.C., Vinckenbosch N., Kaessmann H. Functional diversification of duplicate genes through subcellular adaptation of encoded proteins. Genome Biol. 2008;9:R54. doi: 10.1186/gb-2008-9-3-r54.