Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2024 May 18;123(13):1763–1780. doi: 10.1016/j.bpj.2024.05.015

Modeling reveals the strength of weak interactions in stacked-ring assembly

Leonila Lagunes 1,2, Koan Briggs 3, Paige Martin-Holder 4, Zaikun Xu 5, Dustin Maurer 5, Karim Ghabra 6, Eric J Deeds 1,2,5,
PMCID: PMC11267433  PMID: 38762753

Abstract

Cells employ many large macromolecular machines for the execution and regulation of processes that are vital for cell and organismal viability. Interestingly, cells cannot synthesize these machines as functioning units. Instead, cells synthesize the molecular parts that must then assemble into the functional complex. Many important machines, including chaperones such as GroEL and proteases such as the proteasome, comprise protein rings that are stacked on top of one another. While there is some experimental data regarding how stacked-ring complexes such as the proteasome self-assemble, a comprehensive understanding of the dynamics of stacked-ring assembly is currently lacking. Here, we developed a mathematical model of stacked-trimer assembly and performed an analysis of the assembly of the stacked homomeric trimer, which is the simplest stacked-ring architecture. We found that stacked rings are particularly susceptible to a form of kinetic trapping that we term “deadlock,” in which the system gets stuck in a state where there are many large intermediates that are not the fully assembled structure but that cannot productively react. When interaction affinities are uniformly strong, deadlock severely limits assembly yield. We thus predicted that stacked rings would avoid situations where all interfaces in the structure have high affinity. Analysis of available crystal structures indicated that indeed the majority—if not all—of stacked trimers do not contain uniformly strong interactions. Finally, to better understand the origins of deadlock, we developed a formal pathway analysis and showed that, when all the binding affinities are strong, many of the possible pathways are utilized. In contrast, optimal assembly strategies utilize only a small number of pathways. Our work suggests that deadlock is a critical factor influencing the evolution of macromolecular machines and provides general principles for understanding the self-assembly efficiency of existing machines.

Significance

Understanding the assembly of macromolecular machines is important for understanding a wide range of cellular processes. Here, we use mathematical models to study the assembly of stacked rings, which are a common motif in these machines. Our models revealed that these complexes can readily get “stuck” during assembly when the binding affinity between subunits is too strong. This suggests an evolutionary pressure to favor weaker interactions, and our analysis of solved structures confirmed this prediction. Our findings not only contribute to the fundamental understanding of assembly but also offer insights into the evolutionary pressures shaping the architecture of stacked rings and have implications for both cell and synthetic biology.

Introduction

Cells employ large macromolecular machines for the execution and regulation of almost every process vital to cell and organismal viability. For instance, protein homeostasis relies on the ribosome for protein synthesis (1), the proteasome for protein degradation (2), and chaperones such as GroEL to promote proper protein folding (3). These enzymes are all large multisubunit complexes (1,2,3), and they serve not just to catalyze the relevant chemical reactions but are also critical in regulating protein levels and function. Molecular machines are involved in a host of other processes: for instance, RuBisCo is critical for photosynthesis in algae and plants (4), ATP synthase is crucial for energy production in cells (5), and the microtubule is critical for cellular infrastructure and organization (6). It is clear that the proper functioning of these machines is critical to essentially all cellular life. Interestingly, however, cells cannot synthesize these machines as functioning units. Instead, cells synthesize the molecular parts that must then assemble in the functional multisubunit complex. Understanding how these machines assemble is thus crucial to understanding how they function and how they are regulated.

Since these complexes often perform central roles in cell function and metabolism, they also tend to be associated with a variety of diseases and conditions. For instance, dysregulation of proteasome function has been associated with diseases ranging from cancer to Alzheimer’s disease (7,8,9,10,11). As a result, understanding the assembly of these complex machines can provide not only a deeper understanding of cell function but also insights into disease processes. Finally, over the past 10 years, approaches have emerged that allow for the design of synthetic protein complexes that could prove useful in applications as diverse as targeted drug delivery and gene therapy (12,13,14). Studying the assembly of natural complexes can provide insights into general “design principles” that could allow for the design of synthetic machines that self-assemble more efficiently. Improving our understanding of macromolecular assembly thus has applications ranging from basic cell biology to applied synthetic biology. Advancing our understanding of macromolecular assembly thus holds implications spanning from fundamental cell biology to practical synthetic biology applications.

A common structural motif observed in many macromolecular machines are either rings or contain rings (15,16,17). Some structures are simple rings: glutamine synthetase (18,19), pyruvate kinase (20), the KaiC circadian clock protein (21), and AAA proteins (22,23) are all examples. Many complexes, however, involve rings that are stacked on top of one another. The prototypical example of this type of structure is the proteasome core particle (CP), which consists of four rings stacked on top of each other; bacterial proteases such as ClpXP/ClpAP and HslUV share a similar architecture (24). The GroEL/ES chaperonin complex and enzymes such as the nicotinamide mononucleotide adenylyltransferase and some E2 ubiquitin-conjugating enzymes also demonstrate a stacked-ring architecture (25). In synthetic designs, some complexes contain ring-like structures, such as viral capsids (26,27,28,29) and other nanorings (30). Rings and stacked rings are clearly a common motif in the evolution of macromolecular machines (16,17).

Experimentalists have studied the assembly of some machines, particularly the proteasome (2,31,32), the ribosome (33,34) and the GroEL/ES chaperonin (35,36,37). These studies have resulted in proposed assembly pathways that provide helpful pictures that summarize experimental data. While these pathways are generally well accepted in their respective communities, they have several limitations—perhaps most notably, they cannot make detailed, quantitative predictions regarding assembly dynamics. It is also difficult to understand, in many cases, why certain aspects of these pathways have evolved or how particular pathways influence assembly yields. As such, mathematical and biophysical models can be helpful in augmenting these experimental investigations. To date, the vast majority of computational models of self-assembly have focused on the assembly of viral capsids at various levels of biophysical resolution (38,39,40,41,42,43,44,45). While some work has been done using molecular dynamics to study the assembly of molecular machines such as the ribosome (46,47,48,49,50), there has been comparatively little work investigating the assembly of rings and stacked rings such as the proteasome. One difficulty is time scale; under standard in vitro assembly conditions, the proteasome CP self-assembly reaction takes about 3 h to reach completion (31,51), which is well beyond the reach of currently available biophysical simulation techniques (52,53). It is thus critical to develop computational and mathematical models of assembly that allow us to systematically explore assembly dynamics on realistic timescales.

A little over 10 years ago we introduced a mathematical model of the assembly of ring-like structures that attempted to overcome some of the limitations mentioned above (15). In that work we used chemical reaction network theory to develop ordinary differential equation (ODE)-based models of assembly dynamics. Using these models, we showed that ring-like structures can suffer from a severe form of kinetic trapping that we termed deadlock, which occurs due to the formation of incompatible stable intermediates. Specifically, when the binding affinities between the subunits are very strong, the subunits quickly form large intermediate structures that cannot react with one another due to steric clashes. These intermediates consume all the monomers, leading to an impasse whereby no further assembly can occur until the system reaches timescales where these intermediates start to dissociate. The stronger the interactions between the subunits, the longer it takes to for this deadlocked state to resolve, which suggested an evolutionary pressure to avoid very strong interactions in these types of structures. Interestingly, this model predicted that heteromeric rings would tend to have a single interaction that was weaker than the other two, and analysis of available experimental structures found that this is indeed the case for most heteromeric rings (15). While this model was highly informative, it was limited to simple ring-like structures.

In this work, we extend this previous model to the important case of stacked-ring structures, which, as described above, are an extremely common motif in macromolecular machines (e.g., the proteasome CP and GroEL). We focus here on a single stacked trimeric ring, which is simple enough to characterize completely and yet complex enough to exhibit behaviors that are distinct from the assembly of single rings. Our main finding is that deadlock in these cases can be significantly worse than in the previous case of single rings—while deadlock in a single ring might typically resolve on timescales of minutes to hours, deadlock for stacked rings can last for days or even longer. We showed that deadlock is worst when all of the interfaces in the structure have high binding affinities. This is true not only in models where there is a fixed initial concentration of subunits (modeling an in vitro assembly reaction (31,50,51)) but also in more realistic in vivo models where we include constant synthesis of subunits or degradation of all intermediates formed (15). These results led us to predict that evolutionary pressures would select against stacked trimers having strong binding affinities both within and between rings. We tested our prediction by analyzing solved stacked-trimer structures; we found that indeed the majority—if not all—of the stacked trimers did not contain two very strong interactions. In experimental studies of assembly, it is common for researchers to discuss the notion of “assembly pathways,” but to our knowledge there has been no attempt to rigorously define what an assembly pathway actually is. To better understand the origins of deadlock and why certain patterns of interaction affinity generate a deadlocked state, we defined an assembly pathway as a binary tree representing a scenario of how a group of subunits go from being monomers to form the fully assembled structure. Our assembly pathway analysis showed that when all the binding affinities are strong, many of the possible pathways are utilized during assembly, consuming subunits and creating high levels of deadlock. In contrast, when one of the interfaces is weaker, only a small number of pathways contribute significantly to assembly, suggesting that observed structures have evolved patterns of interaction affinities that enforce a more hierarchical assembly process (15). In sum, our work provides critical insight into the evolutionary pressures that have shaped the assembly of stacked rings. Furthermore, our work posits a potential design principle to optimize self-assembly efficiency in stacked rings: synthetic structures should likely be designed not to have strong interactions throughout but instead have a mixture of binding strengths to avoid deadlock during assembly. Since previous design approaches typically attempt to maximize the affinity of every interface (54,55,56,57,58), adopting a different approach may lead to a higher success rate for designing such structures. While further advances are necessary to model the assembly of more complex structures and synthetic designs, our work highlights the role of mathematical modeling in understanding self-assembly processes in evolution, biomedicine, and synthetic biology.

Results

Constructing a model of stacked-ring assembly

In this work we model the self-assembly of a stacked-ring-like protein complex containing n subunits like the one depicted in Fig. 1 A (left). As an example, in the X-ray structure from the Protein Data Bank (PDB: 2FO3) in Fig. 1 A (left), each subunit is depicted with a unique color. In this work, we consider the assembly of a stacked trimer containing two three-membered homomeric rings (n=6). To construct a model of a stacked trimer, we can visualize each subunit as a node in a graph and a stacked ring as two connected symmetric trimeric rings (Fig. 1 A, right), where the connections are representative of the noncovalent bonds between subunits. Stacked trimers have a three-fold axis of rotational symmetry as well as mirror symmetry between rings, so that the bottom ring is the mirror image of the top ring (16,17). Indeed, there are examples of homomeric stacked rings such as nicotinamide mononucleotide adenylyltransferase (59), periplasmic heme-binding protein ShuT (60), and glycerophosphodiesterase (61), to name a few.

Figure 1.

Figure 1

Stacked-trimer model schematic. (A) An example stacked trimer (X-ray structure from PDB: 2FO3) on the left is depicted with each monomer in a unique color. Any such stacked trimer can be represented as a graph with edges representing the noncovalent bonds between the six proteins. Each protein has three binding interfaces (bonds are represented by the solid gray lines). (B) Each protein can form two types of bonds, with proteins on the same ring (interface 1: within) and with proteins from the opposing ring (interface 2: between). Each bond type has associated Kd values, Kd,1 and Kd,2, respectively. (C) A list of all the intermediate species that can be formed during stacked-trimer assembly in our model. To see this figure in color, go online.

Similar to a previous model of single-ring assembly (15), the subunits are treated as identical but have a “sidedness,” that is, each subunit has three distinct interfaces: left, right, and across. The left side of one subunit can bind to the right side of another, and we call these the within-ring interactions (Fig. 1 B, left). The across side of one subunit can bind to the reflecting across side of another, in an up-down fashion, giving rise to what we call between-ring interactions (Fig. 1 B, right). Thus, each monomer contains three types of binding interfaces: two for binding within the ring (the left and right side of what we call interface 1) and one for binding between the rings (which we call interface 2, Fig. 1 B). For each bond type, there is an associated binding affinity. For interface 1, the binding affinity between each subunit is described by a dissociation constant Kd,1 and for interface 2, the binding affinity between the two subunits forming the between-ring bond is Kd,2. Note that these Kd values are defined based on the association and dissociation rates of the chemical reactions that form the corresponding dimers from the monomers (62). These individual Kd values thus just represent the equilibrium constants for the monomer-to-dimer reactions; in general, many of the other species that can form in this system are much more stable because they involve the formation of more than one interface by a single subunit simultaneously (63,64). The stability of such structures is discussed briefly below and in great detail in the supporting material.

We call any structure that is smaller than the stacked trimer an “intermediate,” that is, all intermediates are substructures of the fully assembled stacked ring. Formally, an intermediate is any connected subgraph of the fully assembled structure. As such, intermediates have a number of subunits between 1 and n1. We have enumerated all the possible intermediate structures for the stacked trimer in Fig. 1 C. Since every intermediate is a substructure of the fully assembled complex, we use a simple visual language for these intermediates. The filled-in circles represent subunits that are “present” in that structure, whereas unfilled circles represent subunits that are “absent” from that intermediate. Note that the unfilled circles represent all the available locations in the intermediate where other intermediates can bind. To enumerate the species, we first start with species 1, which is simply the monomer. We then add another monomer in every possible available location on the graph and determine whether the resulting structure forms a connected component. If it does, this is a new intermediate and we add it to the list. Doing so generates the two possible dimer species, species 2 and 3. Iterating this process of adding monomers to the two dimers generates all possible trimers, and continuing this process allowed us to generate all 11 possible intermediates as shown in Fig. 1 C. We denote an individual species “i” as Si.

To construct a mathematical model of stacked-trimer assembly, we next enumerated all the binding reactions between intermediates. To do so, we checked all possible pairwise combinations of the intermediates in all possible relative orientations (see supporting material section 1) to see if they could react to form another intermediate or the fully assembled structure. For example, consider species 1 and 2 (S1 and S2), the monomer, and the “Kd,2” dimer. If we rotate species S1 by 120°, note that it “fits into” an open spot in the species S2 structure. As a result, there is a reaction between S1 and S2 that generates another intermediate, in this case S6. Similarly, if we rotate S1 by 240°, that induces a reaction that generates S4 (see Fig. S1). Using this basic approach, we enumerated all the possible binding reactions that could occur between intermediates. The complete list can be found in supporting material sections 1.2–1.4.

The concentration of each species is represented by Xi, where i is the index of the intermediate and not the number of subunits in the structure. Note that this value is in general a function of time (i.e., Xi(t)); in our notation, however, we leave the dependence on time implicit. The number of subunits in an intermediate is given by η(Si), which is a function that maps from the species index to the number of subunits present in the given species. For example, S1 is the monomer (Fig. 1 C) and η(S1)=1, since there is only one subunit present. In contrast, S10 has η(S10)=4, since there are four subunits present in that intermediate. Additionally, we define XT=i=1nη(Si)Xi to be the total concentration of the subunits (i.e., the total concentration of all subunits in all the species combined).

Similar to previous models (15,65,66), we used the standard law of mass action to derive a system of ODEs that describes the time evolution of the concentration of all the species in the model based on these reactions (Fig. 1 C). The derivation is described in detail in supporting material section 1.5. In this work we make the simplifying assumption that the association rate between intermediates is the same, regardless of the size and structure of the intermediates in question. All association reactions in our model thus occur at the same rate, which we term k+. All binding reactions in the model are of course reversible, and the dissociation rates depend on both the number and the strength of the noncovalent bonds being formed between and/or within each ring. As in previous models of self-assembly, we assume that the intermediate structures are fairly rigid (15,64). This allows us to use straightforward thermodynamic arguments to calculate the dissociation constant for any given reaction, which we call the “effective” dissociation constant Keff. This calculation is described in detail in supporting material section 1.4 and is based on the number of each type of interface that is disrupted for a given dissociation reaction. For example, the dissociation of a single ring (S9) into a monomer (S1) and a dimer (S3) involves the dissolution of two Kd,1 interfaces, and as such should be much less energetically favorable than dissociation of a simple dimer (i.e., S3 dissociating to form two S1 molecules). Interestingly, there is also an entropic contribution to the difference in energetics between a simple dimer and a ring, which is described in detail in supporting material section 1.4. As a result, ring-like structures are very stable, so dissociation reactions that disrupt more than one noncovalent bond happen at very low rates, often many orders of magnitude slower than the dissociation rates of the dimers (15,64,67). Using these assumptions and straightforward calculations of how many “ways” two intermediates can combine to form another (see supporting material section 1.5), we derived a system of ODEs for the self-assembly of the stacked trimer. These ODEs were numerically solved using Python 3.9.7 for fixed parameter values (see supporting material). All of the code used for this work is available at: https://github.com/llagunes-324B21/stackedTrimer_detModel.

We define the assembly yield of an intermediate i as Yi=η(Si)XiXT for each species i=1,2,,12 at a given time point t. Note that Yi is simply the fraction of total monomers in the system that are found in that given intermediate. Take, for instance, Y12, the fraction of monomers in the fully assembled stacked trimer. If Y12=1 at some point in time, that would mean that 100% of the monomers are found in the stacked-trimer structure; if Y12=0.5, then only 50% of the monomers are found in the fully assembled structure. When Y12<1, this indicates that some monomers are sequestered in smaller intermediates. We use this definition of the assembly yield to investigate the role of the binding affinities on the formation of the stacked trimer.

We note that our previous models of the assembly of single rings considered a wider variety of structures, including rings of varying lengths and heteromeric rings with more than one different kind of subunit (15). The modeling framework described here could easily be extended to larger structures: one could consider a case where each ring contains seven subunits (i.e., n=14), or cases with more rings stacked together (i.e., four stacked rings as in the case of the proteasome CP). Interestingly, as these stacked-ring structures become larger, the number of species and reactions they can exhibit increases dramatically. For instance, for a stacked seven-member ring, we would need 307 chemical species and 18,150 reactions, compared to the 12 species and 34 reactions for the three-membered stacked ring considered here. Stacked-ring structures clearly entail considerably greater levels of combinatorial complexity than simple rings (15, 63, 68,69,70,71). Thus, while the framework described here can be readily extended to larger and more complex structures, here we focus on the stacked homomeric trimer, which is simple enough to characterize completely while still exhibiting complex behaviors distinct from those of single rings.

The assembly dynamics of stacked homomeric rings

To investigate the role the binding affinities Kd,1 and Kd,2 have in the formation of the stacked-trimer assembly yield, we simulated the dynamics obtained from our model of stacked-ring assembly starting from an initial condition of only monomers. As in previous work, to parameterize our model, we made the simplifying assumption that the rates of all the association reactions (i.e., the rate constants for all of the binding reactions) are the same, regardless of the identity of the reactants (15). We have taken the association rate to be k+=106 M−1 s−1, which is a reasonable value for protein-protein interactions (15,72). As described previously, changing the value of this rate constant in the model corresponds to a simple rescaling of the time units of the simulation (15). Thus, while the exact timescale depends in the specific value of the association rate, the general shape of the curves discussed in this work do not depend on the value of k+. For our first exploration of self-assembly dynamics in this model, we used an initial monomer concentration of 4 μM, i.e., X1=4×106 M.

We first consider the scenario for which the total concentration of subunits XT is fixed. In other words, there is no synthesis of new monomers, or any intermediate, and there is no protein degradation. The specific system of ODEs corresponding to this scenario may be found in supporting material section 1.5. We call this case the in vitro model to resemble in vitro biochemical experiments where self-assembly occurring in a solution has fixed total protein concentration (15,31,32). Fig. 2 A shows the assembly yield dynamics of the stacked trimer for a set of unique combinations of Kd,1 and Kd,2. In this model, we find that the assembly dynamics depend critically on the affinity between the subunits. When both affinities are extremely strong (Kd,1=Kd,2=1012 M), the overall yield is relatively low, reaching at most 0.5 (i.e., 50% full assembly) even on relatively long timescales (106 s, or around 11 days) (Fig. 2 A, green curve). As mentioned in the introduction, the fundamental reason for this is a type of kinetic trapping that we have termed “deadlock” (15). In this parameter regime, the monomers react very quickly with one another to form various larger intermediates, and, because of the strength of the interactions between the subunits, those intermediates are relatively stable. Most of them also cannot react with one another due to steric clashes. For instance, when Kd,1=Kd,2=1012 M, the system is dominated by mostly species 11 and species 7 during the “plateau” phase of the dynamics (supporting material section 2 and Fig. S6). These intermediates cannot react with one another; for example, species 8 (Fig. 1 B) cannot react with any other structure but a dimer or a monomer. Since the monomers and dimers are rapidly depleted in this parameter regime, species 8 cannot react with anything, thus leading to deadlock.

Figure 2.

Figure 2

Dynamics of stacked-trimer assembly. (A and B) Time-course plots for (A) in vitro and (B) in vivo model with different Kd,1 and Kd,2 parameter values. Initial monomer concentration is 4×106 M, and for the in vivo model the degradation rate δ is set to 2.8×104 s−1. (C and D) Assembly yield curves with respect to increasing initial monomer concentration with different Kd,1 and Kd,2 parameter values. Same color scheme as in (A) and (B) for both in vitro and in vivo models. Gray lines represent cases for which Kd,1 and Kd,2 increase from 103 M to 1012 M. To see this figure in color, go online.

For deadlock to resolve, larger intermediates must dissociate to generate smaller species that can then react (15). In the case of the stacked trimer, many of the intermediates are actually rings themselves (e.g., species 8 and 9). For simple thermodynamic reasons, rings are far more stable than chains (15, 64), and so this dissociation timescale can be theoretically quite long. As a result, while deadlock resolves in reasonable timescales for the case of the assembly of simple rings, it can last well beyond biologically reasonable timescales in the case of the stacked trimer (see supporting material) (Fig. 2 A). The assembly of stacked-ring structures can thus exhibit fundamentally different assembly dynamics than single rings (15).

In Fig. 2 A, we have highlighted several other examples of combinations of binding affinities that give rise to interesting and characteristic assembly dynamics. When both affinities are of intermediate strength (Kd,1=Kd,2=108 M, Fig. 2 A, yellow curve), we see similar levels of deadlock to the case of two extremely strong interactions discussed above. Making one of the interactions much weaker produces strikingly different results depending on which interaction is strong and which is weak. If the interaction within the rings is strong (Kd,1=1012 M, Kd,2=103 M, Fig. 2 A, purple curve), we see even more extreme deadlock than when both interactions are strong. This is because the species with the highest concentration during the deadlock phase is species 11, the intermediate with only one monomer missing. Additionally, there is a relatively high concentration of species 9, which is the single-ring structure. These two structures are not compatible and therefore cannot form a stacked trimer, which leads to deadlock, and since the binding affinity within the rings is very high, species 9 is not disassembling quickly enough to overcome deadlock (supporting material section 2 and Fig. S6). In contrast, if the between-ring interaction is strong (Kd,1=103 M, Kd,2=1012 M, Fig. 2 A, red curve), the impact of deadlock is much less. This indicates that it is not the total strength of the interactions within the structure, but rather the pattern of interaction strengths, that dictates the dynamics of assembly in stacked rings. Finally, when both interactions are very weak (Kd,1=Kd,2=103 M, Fig. 2 A, blue curve), the system completely avoids deadlock, but the assembly dynamics are quite slow because the smaller intermediates, like the two dimer species, are much less stable, and thus it takes considerably longer for larger intermediates to form (supporting material section 2 and Fig. S6).

The above model considers the case of an in vitro biochemical experiment, and it may be applicable to situations inside cells where monomeric species are pre-formed and then assemble in response to some incoming signal (73,74,75,76). In the case of many molecular machines, however, we anticipate that monomers would be constantly synthesized, and assembly takes place to replace structures that are lost due to degradation or dilution from cell division (77). We model this by introducing a constant synthesis rate of monomers and a first-order degradation term for all the intermediates; we call this the in vivo model. We fixed the degradation term δ to be 2.8×104 s, which models the case where, say, a bacterial cell divides approximately once very hour (15). In our in vivo models we did not consider cases where the total protein concentration changes over time. We thus set the monomer synthesis rate Q=δ×XT (supporting material section1.5). Fig. 2 B shows the assembly yield trajectories for the stacked trimer for combinations of Kd,1 and Kd,2. The curves represent the same Kd,1Kd,2 parameter combinations as in Fig. 2 A with the same color scheme. In this scenario, the assembly yield of the stacked trimer is relatively high for most Kd,1 and Kd,2 values. Deadlock is also less severe than seen in the in vitro model—while there are “plateaus” in the assembly dynamics (Fig. 2 A, green, purple, and red curves), these plateaus resolve on much faster timescales than in the in vitro case. This is because monomers are being constantly synthesized, and thus the system never truly “runs out” of them as it does in the in vitro model. The primary impact of deadlock in this case is the fact that the steady-state yield depends strongly on the pattern and strength of binding affinities. For instance, weak interactions lead to relatively low yields (Fig. 2 B, blue curve). Interestingly, having uniformly strong interactions (Kd,1=Kd,2=1012 M) leads to lower steady-state yields than cases where one of the interactions is weak (e.g., Kd,1=1012 M, Kd,2=103 M, Fig. 2 B, comparing the green and purple curves). This suggests that, as in the case of single rings, deadlock can impact steady-state yields even in models where monomers are constantly synthesized in the cell (15).

In the models thus far, the initial monomer concentration is fixed at 4 μM, i.e., X1=4×106 M. To characterize how the assembly yield of the stacked trimer responds to changes in the initial monomer concentration, we first considered the in vitro model, where there is no protein synthesis or degradation. In Fig. 2 C, we calculated the assembly yield of the stacked trimer after 24 h for different initial monomer concentrations and fixed Kd,1 and Kd,2 values. Note that in Fig. 2 C, the colored curves correspond to the same Kd,1 and Kd,2 combinations as in the previous panels. When both affinities are of intermediate strength (Kd,1=Kd,2=108 M, Fig. 2 C, yellow curve), we see that the assembly yield for the stacked trimer increases as the initial monomer concentration increases, reaches a peak, and then begins to decrease with increasing initial monomer concentration. In the first phase before the peak, the initial monomer concentration is low and thus there is slow binding between the intermediate structures. As a result, it takes longer for the stacked trimer to be assembled, leading to low assembly yields after 24 h. In the second phase after the peak, the initial concentration is high and yet the assembly yield for the stacked trimer decreases. This is because there is now fast binding between the intermediates due to the higher protein concentration. The rapid binding of monomers and other smaller intermediates leads to deadlock, as described above (Fig. 2 A). At the peak, a balance is achieved for this particular pattern of interaction affinities: binding is not too fast, so the system is able to avoid deadlock and achieve near 100% yield. This behavior is completely distinct from that observed for single rings; increasing the concentration of subunits for a single ring never reduces the yield at a particular time point (15). Rather, for single rings, increasing subunit concentration simply decreases the amount of time it takes for deadlock to appear. In stacked rings, however, increasing concentration can actually induce deadlock.

As with the assembly kinetics described in Fig. 2 A, the response of the system to changes in initial conditions depends critically on the pattern of binding affinity within the structure. For instance, when both interactions are very strong (Kd,1=1012 M, Kd,2=1012 M, Fig. 2 C, green curve), we see an increase in assembly yield up to about 40%. The interactions are so strong in this case that the system essentially cannot avoid deadlock regardless of the initial concentration of monomers. If we make the between-ring interaction much weaker (Kd,1=1012 M, Kd,2=103 M, Fig. 2 C, purple curve) we see essentially the same behavior, indicating that weakening the bonds between the rings does not make the system more robust to deadlock. Interestingly, however, if we reverse the situation so that the between-ring interaction is strong and the within-ring interaction is weak (Kd,1=103 M, Kd,2=1012 M, Fig. 2 C, red curve), we see the assembly yield for the stacked trimer increase to a maximum level and then decrease with increasing initial monomer concentration. In this case, the maximum assembly yield for the stacked trimer spans a longer region of initial monomer concentrations than in the intermediate strength (Kd,1=Kd,2=108 M) case. However, that optimal region reaches approximately 80% stacked-trimer assembly, suggesting that the system cannot fully avoid deadlock. Finally, when both affinities are weak (Kd,1=103 M, Kd,2=103 M, Fig. 2 C, blue curve), the system reaches 100% assembly at 24 h but only when protein concentrations are relatively high. This is due to the fact that while binding occurs quickly at higher monomer concentrations, the intermediates formed are unstable (particularly species 2–6, Fig. 1 C) and dissociate before further assembly reactions can occur. Under these conditions, assembly is very slow and higher protein concentrations are required to achieve 100% yield. Note that, even when both bonds are very weak, at even higher protein concentrations yields begin to decrease due to deadlock (supporting material section 1.5). Nonmonotone yield curves as a function of protein concentration are thus characteristic of in vitro assembly of the stacked trimer, which is different from the behavior of single rings (15).

Changing total protein concentration in the in vivo model is not as straightforward as the in vitro case. As discussed above, the total protein concentration in this model is just XT=Q/δ, i.e., the ratio of the monomer synthesis rate to the first-order degradation rate for all the species. One can thus change the total protein concentration by changing either the value of Q or δ. In this case we chose to modify Q, since this is most closely analogous to increasing total monomer concentration as in Fig. 2 C. This also models a case where, for instance, a bacterial population is growing at a constant rate (in this case, doubling every hour) and is attempting to increase the concentration of the stacked monomer by increasing the synthesis of the monomers. In Fig. 2 D, we show the steady-state yield of stacked trimers as a function of total protein concentration, highlighting particular patterns of interaction affinities with distinct colors as with all the other panels in the figure. In this case we do not observe nonmonotone behavior—the higher the value of the monomer synthesis rate, the higher the steady-state yield. The pattern of affinities within the structure, however, still has a large impact on assembly. When both interactions are very weak (Kd,1=Kd,2=103 M, Fig. 2 D, blue curve), assembly yield does not reach 100% until relatively high monomer concentrations. Interestingly, however, having both interactions very strong (Kd,1=Kd,2=1012 M, Fig. 2 D, green curve) generally results in lower assembly yields at any given total monomer concentrations than the other scenarios, particularly the cases where both interactions are of intermediate strength or the between-ring interaction is strong (Fig. 2 D, red and yellow curves).

The results in Fig. 2 D suggest that there may be strong evolutionary pressures that impact the patterns of affinities in stacked-ring structures. Increasing the value of Q represents a direct increase in the amount of energy investment the cell is making in the synthesis of monomers. The higher this value is, the higher is the total protein concentration, and the more ATP the cell is spending on the synthesis of these proteins. If we assume that only the fully assembled structure is functional, as is the case for the proteasome, GroEL (37, 78) and a host of other structures (79,80,81,82,83,84), the assembly yield represents the percentage of that energy investment that actually results in functional machines that can support cell physiology. In other words, this is the return on that energy investment. We would expect that there would be some evolutionary pressure to maximize that return in many cases, particularly in bacterial species where even a small waste of energy can have detectable fitness consequences (85,86). Some curves are always near the top of Fig. 2 D, suggesting that certain patterns of affinity almost always result in higher yields regardless of the total protein concentration present and thus would likely be evolutionarily favored.

To further assess the effect of the binding affinities on yield for the stacked trimer, we generated heat maps for both the in vitro and in vivo models. Fig. 3 A shows the assembly yield for the in vitro model after 24 h for values of Kd,1 and Kd,2 ranging from 103 M to 1012 M and a fixed initial concentration at 4 μM. The contour lines show highlighted assembly yield values. This type of plot allows us to visualize the effect of the binding affinities on assembly yield. If both Kd,1 and Kd,2 are weak (upper-right quadrant of the plot), we see that the assembly yield is greater than 95%. Interestingly, the lowest assembly yield occurs in the black-colored region delineated by the 40% contour line; this occurs when both Kd,1 and Kd,2 are both strong (lower-left quadrant) and when the between-ring interaction is much weaker than the within-ring interaction (i.e., when Kd,1 is strong and Kd,2 is weak). These results further highlight that assembly is vastly impacted by deadlock in the in vitro model when both interactions are strong or when Kd,1 is strong. Fig. 3 B shows a similar heatmap for the steady-state yield of the stacked trimer in the in vivo model. In this case, the degradation rate remains fixed at δ=2.8×104 s−1, the initial monomer concentration is also 4 μM, and we ranged Kd,1 and Kd,2 from 103 M to 1012 M. We see that when both Kd,1 and Kd,2 are strong (lower-left region), the assembly yield for the stacked trimer is not optimal. However, the lowest assembly yield occurs when both Kd,1 and Kd,2 are weak (upper-right region). Note that overall, the assembly yield is higher for all values of Kd,1 and Kd,2 in comparison with the in vitro model with the lowest assembly yield at approximately 82% in the in vivo model and 30% in the in vitro model, similar to the results in Fig. 2, A and B. Since both models show that when both binding affinities are strong the assembly yield for the stacked trimer is non-optimal, this suggests that there is an evolutionary pressure to select other combinations of binding strengths. In other words, we predict that stacked-trimer structures would not tend to have strong binding affinities both within and between the rings.

Figure 3.

Figure 3

Assembly yield for different values of Kd,1 and Kd,2. Heatmaps showing assembly yield after 24 h for (A) in vitro and (B) in vivo models with fixed initial monomer concentration of 4×106 M and where the binding affinities Kd,1 and Kd,2 increase from 1012 M to 103M. Note that all results for the in vivo case correspond to steady-state yields. Contour lines are shown to visualize assembly yield values. For the in vivo model, δ=2.8×104 s−1. To see this figure in color, go online.

Observed stacked-trimer structures avoid having two strong interactions

The findings above suggest that there is likely an evolutionary pressure for stacked rings to not exhibit strong binding affinities both within and between rings. To test this prediction, we considered solved structures of stacked trimeric rings. We used the PDBePISA database, which contains structural and chemical properties of macromolecular surfaces, interfaces, and assemblies for all structures in the PDB (87). We began by assembling a set of over 3000 homo-hexameric structures. Since PDBePISA does not annotate the general geometry of structures, we visually inspected each of these complexes to collect only those that form stacked trimeric rings. To avoid misclassifications, we defined a stacked trimer as one that has a similar topology to the one in Fig. 1 A (right), where there are three bound monomers on a top plane bound to three bound monomers on a bottom plane. Additionally, a stacked trimer would need to have rotational symmetry along each ring and mirror symmetry between the rings (see supporting material section 3). In other words, we removed any structures that were single rings, like those discussed in (15), chains or structures that did not match the topology considered in our model. This resulted in 1580 stacked-ring structures; a list of these structures with their PDB IDs is provided as an additional supplementary file in: https://github.com/llagunes-324B21/stackedTrimer_detModel.

Estimating binding affinities from the structures of protein-protein interfaces is difficult. In our analysis, we focus on the buried solvent-accessible surface area (BSASA), which can be used as proxy for binding affinities (15,88,89). In this case, if there is little contact between the two, then the BSASA is small and the binding affinity likely weak. Alternatively, if there is a large contact between the two monomers, the BSASA is relatively high and the binding affinity is likely stronger. We used the BSASA values calculated in PDBePISA for the interfaces between and within rings. For any stacked trimer, there are a total of six within-ring interfaces. For each structure, we used the average of all six, which we call the Kd,1 BSASA. Similarly, there are a total of three between-ring interfaces; the average of these three interfaces is the Kd,2 BSASA. The calculated Kd,1 and Kd,2 BSASA values for each structure are shown in Fig. 4 A (each red dot represents one of the 1580 stacked-trimer structures). In the scatterplot, we see that the majority of structures tend to cluster in the lower-left region where Kd,1 and Kd,2 BSASAs are relatively equal and not extremely large, suggesting that the majority of stacked trimers tend avoid very strong interactions. There are some structures that have a low Kd,1 BSASA with high Kd,2 BSASA. These structures have much more interface contact between the rings than within the rings. There are also two structures with the opposite pattern, with a high Kd,1 BSASA and a low Kd,2 BSASA. The fact that structures where the between-ring interface much stronger than the within-ring interface are much more common than the alternative matches our prediction that this pattern of assembly affinities leads to higher yields (Figs. 2 and 3) Interestingly, none of the stacked-trimer structures we found in the PDBePISA database show high BSASA for both the between and within interfaces (green rectangle in Fig. 4 A), consistent with our prediction that two strong bonds would not be selected for in the evolution of stacked trimers.

Figure 4.

Figure 4

Buried solvent-accessible surface area (BSASA) in PDB stacked-trimer structures. (A) We identified stacked trimers from the PDBePISA database and collected the BSASA for interfaces between and within rings for each. We used the BSASA within and between rings as a proxy for Kd,1 and Kd,2, respectively. (B) From the stacked trimers in (A), we resampled the data using a permutation test and determined how many cases had Kd,1 and Kd,2 BSASA in the excluded region (green rectangle in A). The histogram shows the total cases where Kd,1 and Kd,2 BSASA were in the green rectangle (counts) based on 10,000 trials. Note that out of the 10,000 trials, only 48 had zero cases where Kd,1 and Kd,2 BSASA were in the excluded region (green rectangle). To see this figure in color, go online.

Since we did not see any stacked trimers with both high Kd,1 and Kd,2 BSASAs, we performed a simple permutation test with 104 replicates to statistically test our prediction that stacked rings do not exhibit strong binding affinities for both within and between rings. Our permutation test consisted of resampling or “shuffling” the Kd,1 and Kd,2 BSASAs between all the structures and checking whether any of the shuffled structures were at a threshold of 4000 Å2 or higher for both interfaces (green rectangle in Fig. 4 A). In Fig. 4 B, we see the distribution of the number of cases observed in the green rectangle from this test. This distribution shows that only 48 trials had no cases where Kd,1 and Kd,2 BSASAs were in the green rectangle. This suggests that the observation of having no PDB structures with two strong interactions is unlikely to have occurred at random ( p<103). It is thus likely that evolution has avoided this scenario in order to lower the impact of deadlock on both assembly kinetics and steady-state yield.

Defining and enumerating assembly pathways

Our results thus far show that the assembly of stacked trimers yields deadlock similar to the assembly of single rings (15) and that patterns of binding affinities play a role in that deadlock. It is unclear, however, why certain patterns of affinities might lead to higher or lower levels of deadlock. For instance, why would having high affinity for both the between- and within-ring interactions lead to lower yields? To answer this question, we developed an approach to analyze assembly pathways. While the term “assembly pathway” is used broadly in the field (90,91,92,93,94), we are aware of no precise, formal, mathematical definition for what an assembly pathway actually is.

Informally, an assembly pathway should represent a distinct scenario representing how a group of subunits go from being monomers to form the fully assembled structure. Somewhat more formally, we define an assembly pathway to be a strict binary tree where the nodes are assembly intermediates (including the monomers and the fully assembled structure). An example is shown in Fig. 5 A. In this tree, each node has two child nodes and one parent node. The exceptions to this are the leaf nodes, which have no child nodes, and the root node, which has no parent node. The leaf nodes are always monomers, and an assembly pathway has exactly the same number of leaf nodes as there are subunits in the fully assembled structure (six in the case of the stacked trimer, Fig. 5). The root node is always the fully assembled structure. If two nodes are to be child nodes of a given parent, there must be a chemical reaction between the two child species that generates the parent. In other words, whenever we have two nodes below a parent node in the tree, they have to be able to combine to generate that particular intermediate (Fig. 5 A). A more complete and formal definition of assembly pathways is given in supporting material section 4.

Figure 5.

Figure 5

Example assembly pathways for a stacked trimer. Each panel shows an assembly pathway to form a stacked trimer. While we have enumerated all 46 pathways, five examples are shown here. The pathways begin at the bottom of the graphic and end with the stacked trimer at the top of the graphic. (A) Pathway 1 has a final step in which two trimeric rings bind to form a stacked trimer. (B) Pathway 2 is one pathway in which the final step is a reaction between two trimers (which in this case are not rings). (C) Pathway 5 represents an alternative to pathway 2 in (B) with the same final step. (D) Pathway 12 is one pathway for which the final step is the reaction between a tetramer and a dimer to form the stacked trimer. (E) Pathway 35 is one pathway in which the final reaction is to add a single monomer to a pentamer to form the stacked trimer. To see this figure in color, go online.

The pathway shown in Fig. 5 A represents one intuitive assembly pathway. Here, two monomers first combine to form a within-ring dimer; in other words, they bind to form one Kd,1 interface. Another monomer then binds to form a complete ring. Two of these rings then dimerize with one another to form the fully assembled structure (Fig. 5 A). While this is a very natural pathway, many other scenarios are possible. For instance, the pathways in Fig. 5, B and C involve the formation of trimers that are not rings but can nonetheless combine into the fully assembled stack trimer. Fig. 5 D shows a pathway where a dimer binds to a tetramer in the last step, and Fig. 5 E shows a case where monomers bind one at a time.

It is straightforward to define a recursive algorithm that can find all the assembly pathways that are possible for a given structure. To do so, we start with the root node and consider all the possible “final reactions” that have the fully assembled structure as their product. In the case of the stacked trimer, there are six such reactions, corresponding to six different ways that the final step in assembly can occur. Every one of these reactions has two reactants; these are the first two child nodes of the root node. For each child node, we can similarly enumerate all the unique reactions that have that species as their product; that is, all the possible sets of child nodes for that particular node in the tree. We can continue this process recursively until we have only monomers as reactants, keeping track of each unique way of constructing each intermediate in the process. Using this approach, we enumerated all 46 possible assembly pathways for the stacked trimer; these are shown in supporting material section 4 and Fig. S12. The pathways in Fig. 5 are numbered according to the particular recursive scheme that we used to enumerate them. The code used for assembly pathway definitions is available at https://github.com/llagunes-324B21/stackedTrimer_detModel.

Pathway contributions reveal the origins of deadlock

We hypothesized that variation in the values of Kd,1 and Kd,2 could change how much any one of these 46 pathways “contributes” to the formation of the fully assembled stacked trimer. To do so, however, we must first mathematically define this notion of contribution. As described in supporting material section 4.2, we have enumerated all the possible binding and unbinding reactions that can occur in the assembly of the stacked trimer. Every binding reaction can be written si+sjk+sk, where si and sj are the reactant species and sk the product (these s are any species in Fig. 1 C). We call the set of all binding reactions B and the subset of those reactions that have species “sk” as their product B(sk). More formal definitions of a “reaction” and a “set of reactions” is given in supporting material section 1.3.

We define a pathway’s contribution to the formation of the fully stacked trimer in a recursive way as follows. Consider first the fully assembled structure, species “s12” in Fig. 1 C. All the different reactions that can produce this structure are collected in the subset A(s12). Take some reaction rA(s12), and define the flux of the reaction r as F(r). For instance, call the binding reaction s1+s11s12 as “r1”: this is just the monomer binding to the pentamer to form the fully assembled stacked trimer, and clearly r1B(s12). The flux of this reaction is just F(r1)=k+x1x11. Note that these fluxes are, in general, dependent on time, but as before we leave this dependence on time implicit. The expressions for the fluxes can also be more complicated, depending on things like stoichiometric coefficients; a full description of how to define any given flux is given in supporting material section 1.5. Finally, it is important to remember that a flux is not an amount or concentration, but rather concentration per unit time. For instance, we might have F(r1)=1μM s−1, which means the reaction in question is producing 1 μM of the fully assembled structure every second.

Now that we have defined this flux, we can define the total flux of reactions that produce a given species sk as FT(sk)rB(sk)F(r). This just quantifies how much of that species is being formed per unit time, no matter which reaction is producing it. This finally allows us to define the critical quantity we need, which is the relative flux of a reaction Fˆ(r)F(r)/FT(sk), where FT(sk) is just the total flux of production of whatever species is the product of the reaction in question. This quantity can be naturally interpreted in the following way. Say we are considering the reaction r1 described above. Imagine we have a single new molecule of the fully assembled structure, which is the product of this reaction. Fˆ(r1) is just the probability that this molecule of s12 was produced by reaction r1.

By definition, every assembly pathway has some reaction rB(s12) as its “final step,” and there are only six such reactions (see the top of the panels in Fig. 6). We can calculate the relative flux of this reaction, which is the probability that all the pathways with that last step have produced any given “newly assembled” molecule of the fully assembled structure. To calculate the probability that an entire pathway is being used to form the fully assembled structure, we now need to consider the two species that are combining to form the fully assembled structure. For instance, “pathway 35” in Fig. 5 E has reaction r1:s1+s11s12 as its last step. Since s1 is the monomer, we do not need to consider how it is formed. Species s11, however, could be formed in several different ways, and pathway 35 indicates that it is formed by the reaction s1+s8s11, which we will call reaction “r2” for simplicity. Note that a fully assembled stacked trimer that uses pathway 35 requires that both r1 and r2 be used for the formation of the fully assembled structure; the probability that a stacked trimer is formed by these two reactions as the last and second-to-last reactions, respectively, is naturally Fˆ(r1)·Fˆ(r2).

Figure 6.

Figure 6

Assembly pathway contributions for different combinations of Kd,1 and Kd,2 in the in vitro model. (AF) Each panel shows all the assembly pathways, enumerated as in Fig. 4, and their calculated pathway contribution. Pathways are grouped based on the final assembly reaction as indicated by the graphics in the top of each graph labeled as “Final Reaction.” The black bar is the individual pathway contribution, and the gray region represents the sum of the pathway contributions for all the pathways in the group. To see this figure in color, go online.

Any assembly pathway can be seen as a subset of binding reactions; in other words, a pathway Pi can be thought of as a subset of B (PiB). We can follow the recursive logic above and realize that the probability of a molecule of the stacked trimer forming using that pathway is just the product of the relative fluxes of all the reactions in the pathway. This leads us to define the contribution of a pathway Pi as: C(Pi)rPiFˆ(r). While the definition in this case may seem a bit cumbersome, the final outcome is quite straightforward. The larger the value of C(Pi) for a given pathway, the higher the chance that a molecule of the fully assembled structure was formed according to the scenario described in the pathway. For instance, if C(P1)=1, the pathway in Fig. 5 A is the only way a stacked trimer is being formed; alternatively, if C(P1)=0, the scenario in Fig. 5 A is not occurring at all. Note that this definition is similar to those used in previous work quantifying how different biochemical pathways contribute to enzyme flux (95). Further information on the definition and calculation of pathway contribution is given in supporting material section 4.

In Fig. 6, we calculate the pathway contribution for each pathway in the in vitro scenario after 24 h with fixed binding affinity values. Here, pathways are grouped by their final association reaction, depicted on the top half of each contribution bar graph. There is only one pathway in which the final association step is the binding between two trimeric rings, pathway 1. There are three pathways in which the final association step is the binding between two trimers that are not rings, three pathways with two trimers in a different orientation, six pathways with a Kd,1 dimer binding to a tetramer, six pathways with a Kd,2 dimer binding to a tetramer, and finally 27 pathways where the final association step is the addition of a single monomer to a hexamer. In Fig. 6 A, we see that when both binding affinities Kd,1 and Kd,2 are weak, pathway 1 contributes the most to the overall assembly of the stacked trimer, with only two pathways from the last group contributing less than 20%. This is likely due to the fact that the trimer is naturally much more stable than any of the dimers (15), so when the interactions are weak, the only fully assembled structures that can form rely on this relatively stable intermediate. We see a similar trend in Fig. 6, B, C, and E, where one pathway contributes the most to the overall assembly of the stacked trimer. In Fig. 6 D, Kd,1 is relatively weak and Kd,2 is very strong; here we see that two pathways contribute almost equally, which are the pathways in which the final association reaction is between a Kd,2 dimer binding to a tetramer or a single monomer addition to the hexamer. Since Kd,2 is very strong in this scenario, this suggests that many of the monomers may be “stuck” in that Kd,2 dimer and not able to bind with other intermediates formed, which leads to deadlock. More interestingly, when both binding affinities Kd,1 and Kd,2 are very strong, Fig. 6 F shows many pathways contributing to the overall yield of the stacked trimer. This shows that when both binding affinities are strong, many of the possible pathways are taking place and consuming monomers. Since these pathways rely on different intermediate structures that may not be compatible, this leads to high levels of deadlock. In other words, having very strong affinities for both interactions in the structure leads to broad utilization of many pathways and deadlock, whereas having one of the interfaces significantly weaker induces a more hierarchical scenario where only one or a few pathways contribute (Fig. 6).

We see similar results for the in vivo model. Note that in Fig. 6 we are considering the relative fluxes at a particular time; in Fig. 7 we are considering pathway contributions at steady state for the in vivo model. When both binding affinities Kd,1 and Kd,2 are weak, Fig. 7 A shows that few pathways contribute the most to the assembly of the stacked trimer. Interestingly, Fig. 7, BD show that more pathways contribute to the stacked trimer assembly than in the in vitro case when one of the interactions is significantly weaker than the other. This is likely due to the constant synthesis of monomers allowing more pathways to contribute. Regardless, in all of these cases only a small number of pathways contribute significantly to the formation of the stacked trimer. However, when both binding affinities Kd,1 and Kd,2 are strong, Fig. 7, E and F show that many pathways contribute. As in the in vitro case, this leads to higher steady-state levels of incompatible intermediates and thus a lower overall flux of stacked-trimer production. This analysis of the contribution of assembly pathways indicates that specific patterns of affinity have evolved to enforce a more “hierarchical” assembly process that relies on only a few pathways (15).

Figure 7.

Figure 7

Assembly pathway contributions for different combinations of Kd,1 and Kd,2 in vivo model. (AF) Each panel shows all the assembly pathways, enumerated as in Fig. 4, and their calculated pathway contribution. Pathways are grouped based on the final assembly reaction as indicated by the graphics in the top of each graph, labeled as “Final Reaction.” The black bar is the individual pathway contribution, and the gray region represents the sum of the pathway contributions for all the pathways in the group. To see this figure in color, go online.

We also used this notion of pathway contribution to better understand how changing the initial subunit concentration can actually induce deadlock in the stacked rings (e.g., the yellow curve in Fig. 2 C). In this case, after 24 h of assembly the in vitro model exhibits nearly 100% yield at concentrations around 108 M but significantly lower yield at higher concentrations (say around 106 M) for the case where both affinities have intermediate strength (Kd,1=Kd,2=108 M). To understand this phenomenon, we calculated the pathway usage of the system at both concentrations at various time points and found that fewer pathways were utilized at lower concentrations than at higher concentrations early on in the assembly process (Fig. S16). This suggests that increasing the concentration of monomers increases the diversity of pathways employed during early assembly, thus generating a larger array of incompatible intermediates and leading to greater levels of deadlock.

Discussion

Almost every cellular process relies in some way or another on the function of a variety of different macromolecular machines. Understanding how these machines are assembled is thus critical to our overall understanding of their function and, ultimately, the regulation of a wide variety of cellular activities. In recent years, new work has emerged that focuses on the design of synthetic protein complexes for applications such as drug delivery, gene therapy, and biofuel production (12,13,14). Understanding the assembly of natural complexes can also provide potential “design principles” for the design of new synthetic machines in order to optimize self-assembly. This will also aid in advancing biotechnological applications and facilitate the integration of natural and synthetic systems for innovative solutions in medicine, biotechnology, and sustainable energy production.

Many experimental studies have described the assembly processes of various molecular machines (31,32,33,34,35,36,37,96). Although these experiments have expanded our understanding of the assembly of specific structures, including that of stacked-ring-like structures, many details of the assembly process remain elusive. For instance, this experimental work does not address how assembly is optimized to avoid phenomena such as kinetic trapping. Mathematical models offer an alternative approach to studying assembly, allowing us to describe the assembly process in great detail and understand how mechanisms of assembly have evolved to improve assembly efficiency (15). Specifically, both on-pathway and off-pathway kinetic trapping have been observed in a wide variety of mathematical and biophysical models of assembly processes, and previous work has suggested that structures, particularly ring-like structures, have likely evolved to avoid on-pathway kinetic trapping (in other words, deadlock).

In this work, we expanded on previous mathematical frameworks to understand the mechanisms of stacked-ring assembly. We found that the phenomenon of deadlock can be much more severe in the case of stacked rings than had been observed for simple ring-like structures. In particular, deadlock can result in much lower assembly yields and take much longer to resolve for stacked rings than it does for single rings. As in the case of single rings, the severity of deadlock depends on the strength of the interactions between the subunits in the ring; if all the interactions are strong, deadlock can severely reduce assembly yields on biologically realistic timescales. This suggested an evolutionary pressure to avoid cases where all the interfaces in a stacked ring have high affinity. Extensive analysis of solved structures of stacked rings suggests that evolution has indeed avoided this particular scenario. Interestingly, it is common for the design pipelines of synthetic macromolecular assemblies to minimize the binding energies of every interface. Our work suggests that this may be a highly suboptimal approach to the design of structures that self-assemble efficiently, a factor that may in part explain the high failure rate of these designs. Our work thus suggests simple principles that may be of use to future efforts in protein design. That being said, significant challenges remain in order to develop detailed mathematical models of the large structures that are typically the focus of such design efforts (54,55,56,57,58). As discussed above, the larger a structure is, the more chemical species and reactions are required to model its assembly. While the framework developed here is quite general, new approaches for enumerating these species and reactions, simulating the assembly process, and analyzing the results will be needed to apply this framework to the study and/or design of more complex structures.

We also use our models to explore the notion of an assembly pathway in greater detail. Many experimental studies discuss the notion of “assembly pathways” (97,98,99) but have no precise definition of what an assembly pathway actually is. Here we provide a more formal, mathematical definition of an assembly pathway, which we expect will provide a structured framework for understanding the intricate process of macromolecular assembly and enable precise analysis, prediction, and manipulation of these critical biological mechanisms. Application of this framework to our models revealed exactly why having strong interactions in these structures generates deadlock. There are many different scenarios whereby a set of monomers can proceed toward the fully assembled structure. When interactions are strong, the system simultaneously proceeds down all these paths, depleting resources and leading to a situation where a large number of incompatible intermediates are formed (i.e., deadlock). When some of the interactions are weak, however, only a small subset of these pathways are employed, leading to a naturally hierarchical assembly process that is much more resistant to kinetic trapping.

While we have developed a model of stacked-trimer assembly showing critical effects of binding affinities on assembly, our model has limitations. For example, our model does not explicitly model allosteric changes during assembly. We also consider a fixed binding affinity between monomers (although unique for between or within assembly), which may not accurately reflect the dynamics of protein-protein interactions or changes in affinity due to environmental conditions, signal responses, or conformation changes. In our model we also assume a homogeneous environment, where the spatial constraints are not considered, nor are crowding effects (52, 100,101,102,103). Our model can be expanded on to include such effects to study their influence on assembly yield in such cases.

Future research will be necessary to further link the results of models like these to experimental studies of macromolecular machines. For instance, the proteasome is a barrel-like structure made up of four heptameric rings in an α7β7β7α7 stoichiometry that is conserved across all kingdoms of life. In archaea, there is experimental evidence that the outer α7 rings assemble first and then the β subunits bind to the α7 ring (31,104). However, in bacteria the outer α7 rings cannot assemble on their own, and instead both α and β are necessary for self-assembly in vitro (2, 31). This has led to the proposal that these two different proteasomes utilize different assembly pathways, but detailed predictions on how these scenarios might lead to experimentally testable hypotheses is currently lacking. Thus, in addition to a need for more detailed experimental work, there is also a critical need for the development of mathematical and biophysical models of these more complex structures. These models will prove critical to our understanding of the design principles of macromolecular machines and the application of those principles to synthetic biology.

Author contributions

L.L., K.B., and E.J.D. designed research. L.L., K.B., P.M.-H., Z.X., D.M., and K.G. performed research. L.L., K.B., P.M.-H., K.G., and E.J.D. analyzed data. L.L. and E.J.D. wrote the manuscript.

Acknowledgments

We gratefully acknowledge insightful discussions with Tom Kolokotrones, Walter Fontana, John Karanicolas, Wonpil Im, Ilya Vakser. and Chris Fischer.

Funding for this work was provided by grants NSF MCB-1412262 and NIH R01-GM143378 to E.J.D., and NIH F32 GM143800 and the UC President’s Postdoctoral Fellowship Program to L.L.

Declaration of interests

The authors declare no competing interests.

Editor: Abhishek Singharoy.

Footnotes

Supporting material can be found online at https://doi.org/10.1016/j.bpj.2024.05.015.

Supporting material

Document S1. Supporting text, Figs. S1–S16, Table S1, and supporting references
mmc1.pdf (12.2MB, pdf)
Document S2. Article plus supporting material
mmc2.pdf (16.3MB, pdf)

References

  • 1.Yuan S., Zhou G., Xu G. Translation machinery: the basis of translational control. J. Genet. Genomics. 2024;51:367–378. doi: 10.1016/j.jgg.2023.07.009. [DOI] [PubMed] [Google Scholar]
  • 2.Budenholzer L., Cheng C.L., Hochstrasser M., et al. Proteasome Structure and Assembly. J. Mol. Biol. 2017;429:3500–3524. doi: 10.1016/j.jmb.2017.05.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Stan G., Lorimer G.H., Thirumalai D. Friends in need: How chaperonins recognize and remodel proteins that require folding assistance. Front. Mol. Biosci. 2022;9 doi: 10.3389/fmolb.2022.1071168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Prywes N., Phillips N.R., Savage D.F., et al. Rubisco Function, Evolution, and Engineering. Annu. Rev. Biochem. 2023;92:385–410. doi: 10.1146/annurev-biochem-040320-101244. [DOI] [PubMed] [Google Scholar]
  • 5.Zhou A., Rohou A., Rubinstein J.L., et al. Structure and conformational states of the bovine mitochondrial ATP synthase by cryo-EM. Elife. 2015;4 doi: 10.7554/eLife.10180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Nogales E. Structural Insights into Microtubule Function. Annu. Rev. Biochem. 2000;69:277–302. doi: 10.1146/annurev.biochem.69.1.277. [DOI] [PubMed] [Google Scholar]
  • 7.Chen Y., Zhang Y., Guo X. Proteasome dysregulation in human cancer: implications for clinical therapies. Cancer Metastasis Rev. 2017;36:703–716. doi: 10.1007/s10555-017-9704-y. [DOI] [PubMed] [Google Scholar]
  • 8.Fhu C.W., Ali A. Dysregulation of the ubiquitin proteasome system in human malignancies: A window for therapeutic intervention. Cancers. 2021;13 doi: 10.3390/cancers13071513. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Zheng Q., Huang T., Wang X., et al. Dysregulation of ubiquitin-proteasome system in neurodegenerative diseases. Front. Aging Neurosci. 2016;8 doi: 10.3389/fnagi.2016.00303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Versari D., Herrmann J., et al. Lerman A. Dysregulation of the ubiquitin-proteasome system in human carotid atherosclerosis. Arterioscler. Thromb. Vasc. Biol. 2006;26:2132–2139. doi: 10.1161/01.ATV.0000232501.08576.73. [DOI] [PubMed] [Google Scholar]
  • 11.Cao J., Zhong M.B., et al. Cai D. Endo-lysosomal pathway and ubiquitin-proteasome system dysfunction in Alzheimer’s disease pathogenesis. Neurosci. Lett. 2019;703:68–78. doi: 10.1016/j.neulet.2019.03.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Sparrow J., Edwards V., V, et al. Smith L. Synthetic peptide-based DNA complexes for nonviral gene delivery. Adv. Drug Deliv. Rev. 1998;30:115–131. doi: 10.1016/s0169-409x(97)00111-7. [DOI] [PubMed] [Google Scholar]
  • 13.Chao J., Liu H., et al. Fan C. Structural DNA nanotechnology for intelligent drug delivery. Small. 2014;10:4626–4635. doi: 10.1002/smll.201401309. [DOI] [PubMed] [Google Scholar]
  • 14.Wagner H.J., Weber W., Fussenegger M. Synthetic Biology: Emerging Concepts to Design and Advance Adeno-Associated Viral Vectors for Gene Therapy. Adv. Sci. 2021;8 doi: 10.1002/advs.202004018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Deeds E.J., Bachman J.A., Fontana W. Optimizing ring assembly reveals the strength of weak interactions. Proc. Natl. Acad. Sci. USA. 2012;109:2348–2353. doi: 10.1073/pnas.1113095109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Levy E.D., Pereira-Leal J.B., et al. Teichmann S.A. 3D complex: A structural classification of protein complexes. PLoS Comput. Biol. 2006;2:e155. doi: 10.1371/journal.pcbi.0020155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Levy E.D., Boeri Erba E., et al. Teichmann S.A. Assembly reflects evolution of protein complexes. Nature. 2008;453:1262–1265. doi: 10.1038/nature06942. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Almassy R.J., Janson C.A., et al. Eisenberg D. Novel subunit-subunit interactions in the structure of glutamine synthase. Nature. 1986;323:304–309. doi: 10.1038/323304a0. [DOI] [PubMed] [Google Scholar]
  • 19.Eisenberg D., Gill H.S., et al. Rotstein S.H. Structure-function relationships of glutamine synthetases. Biochim. Biophys. Acta. 2000;1477:122–145. doi: 10.1016/s0167-4838(99)00270-8. [DOI] [PubMed] [Google Scholar]
  • 20.Schormann N., Hayden K.L., et al. Chattopadhyay D. An overview of structure, function, and regulation of pyruvate kinases. Protein Sci. 2019;28:1771–1784. doi: 10.1002/pro.3691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Pattanayek R., Wang J., et al. Egli M. Visualizing a Circadian Clock Protein: Crystal Structure of KaiC and Functional Insights. Mol. Cell. 2004;15:375–388. doi: 10.1016/j.molcel.2004.07.013. [DOI] [PubMed] [Google Scholar]
  • 22.Vale R.D. AAA Proteins: Lords of the Ring. J. Cell Biol. 2000;150:13–19. doi: 10.1083/jcb.150.1.f13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Lupas A.N., Martin J. AAA proteins. Curr. Opin. Struct. Biol. 2002;12:746–753. doi: 10.1016/s0959-440x(02)00388-3. [DOI] [PubMed] [Google Scholar]
  • 24.Marques A.J., Palanimurugan R., et al. Dohmen R.J. Catalytic mechanism and assembly of the proteasome. Chem. Rev. 2009;109:1509–1536. doi: 10.1021/cr8004857. [DOI] [PubMed] [Google Scholar]
  • 25.van Wijk S.J.L., Timmers H.T.M. The family of ubiquitin-conjugating enzymes (E2s): deciding between life and death of proteins. Faseb. J. 2010;24:981–993. doi: 10.1096/fj.09-136259. [DOI] [PubMed] [Google Scholar]
  • 26.Dokland T. Freedom and restraint: themes in virus capsid assembly. Structure. 2000;8:157–162. doi: 10.1016/s0969-2126(00)00181-7. [DOI] [PubMed] [Google Scholar]
  • 27.Hagan M.F., Chandler D. Dynamic pathways for viral capsid assembly. Biophys. J. 2006;91:42–54. doi: 10.1529/biophysj.105.076851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Selivanovitch E., Douglas T. Virus capsid assembly across different length scales inspire the development of virus-based biomaterials. Curr. Opin. Virol. 2019;36:38–46. doi: 10.1016/j.coviro.2019.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Zhou W., Šmidlehner T., Jerala R. Synthetic biology principles for the design of protein with novel structures and functions. FEBS Lett. 2020;594:2199–2212. doi: 10.1002/1873-3468.13796. [DOI] [PubMed] [Google Scholar]
  • 30.Hamley I.W. Protein Assemblies: Nature-Inspired and Designed Nanostructures. Biomacromolecules. 2019;20:1829–1848. doi: 10.1021/acs.biomac.9b00228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Zühl F., Seemüller E., et al. Baumeister W. Dissecting the assembly pathway of the 20S proteasome. FEBS Lett. 1997;418:189–194. doi: 10.1016/s0014-5793(97)01370-7. [DOI] [PubMed] [Google Scholar]
  • 32.Witt S., Kwon Y.D., et al. Jap B.K. Proteasome Assembly Triggers a Switch Required for Active-Site Maturation. Structure. 2006;14:1179–1188. doi: 10.1016/j.str.2006.05.019. [DOI] [PubMed] [Google Scholar]
  • 33.Bernstein K.A., Gallagher J.E.G., et al. Baserga S.J. The small-subunit processome is a ribosome assembly intermediate. Eukaryot. Cell. 2004;3:1619–1626. doi: 10.1128/EC.3.6.1619-1626.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bogenhagen D.F., Ostermeyer-Fay A.G., et al. Garcia-Diaz M. Kinetics and Mechanism of Mammalian Mitochondrial Ribosome Assembly. Cell Rep. 2018;22:1935–1944. doi: 10.1016/j.celrep.2018.01.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Voziyan P.A., Fisher M.T. Polyols induce ATP-independent folding of GroEL-bound bacterial glutamine synthetase. Arch. Biochem. Biophys. 2002;397:293–297. doi: 10.1006/abbi.2001.2620. [DOI] [PubMed] [Google Scholar]
  • 36.O’Neil P.T., Machen A.J., et al. Fisher M.T. The Chaperonin GroEL: A Versatile Tool for Applied Biotechnology Platforms. Front. Mol. Biosci. 2018;5 doi: 10.3389/fmolb.2018.00046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ryabova N.A., Marchenkov V.V., et al. Semisotnov G.V. Molecular chaperone GroEL/ES: Unfolding and refolding processes. Biochemistry. 2013;78:1405–1414. doi: 10.1134/S0006297913130038. [DOI] [PubMed] [Google Scholar]
  • 38.Kumar M.S., Schwartz R. A parameter estimation technique for stochastic self-assembly systems and its application to human papillomavirus self-assembly. Phys. Biol. 2010;7 doi: 10.1088/1478-3975/7/4/045005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Thomas M., Schwartz R. A method for efficient Bayesian optimization of self-assembly systems from scattering data. BMC Syst. Biol. 2018;12 doi: 10.1186/s12918-018-0592-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Thomas M., Schwartz R. Quantitative computational models of molecular self-assembly in systems biology. Phys. Biol. 2017;14 doi: 10.1088/1478-3975/aa6cdc. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Nguyen H.D., Reddy V.S., Brooks C.L. Deciphering the kinetic mechanism of spontaneous self-assembly of icosahedral capsids. Nano Lett. 2007;7:338–344. doi: 10.1021/nl062449h. [DOI] [PubMed] [Google Scholar]
  • 42.Mannige R.V., Brooks C.L., 3rd Geometric considerations in virus capsid size specificity, auxiliary requirements, and buckling. Proc. Natl. Acad. Sci. USA. 2009;106:8531–8536. doi: 10.1073/pnas.0811517106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Mannige R.V., Brooks C.L. Periodic table of virus capsids: Implications for natural selection and design. PLoS One. 2010;5 doi: 10.1371/journal.pone.0009423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Grime J.M.A., Dama J.F., et al. Voth G.A. Coarse-grained simulation reveals key features of HIV-1 capsid self-assembly. Nat. Commun. 2016;7 doi: 10.1038/ncomms11568. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Banerjee P., Voth G.A. Conformational transitions of the HIV-1 Gag polyprotein upon multimerization and gRNA binding. Biophys. J. 2024;123:42–56. doi: 10.1016/j.bpj.2023.11.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Williamson J.R. Biophysical studies of bacterial ribosome assembly. Curr. Opin. Struct. Biol. 2008;18:299–304. doi: 10.1016/j.sbi.2008.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Burton B., Zimmermann M.T., et al. Wang Y. A computational investigation on the connection between dynamics properties of ribosomal proteins and ribosome assembly. PLoS Comput. Biol. 2012;8 doi: 10.1371/journal.pcbi.1002530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Perilla J.R., Goh B.C., et al. Schulten K. Molecular dynamics simulations of large macromolecular complexes. Curr. Opin. Struct. Biol. 2015;31:64–74. doi: 10.1016/j.sbi.2015.03.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Goyal A., Muthu K., et al. Ramadas K. Molecular dynamics simulation of the Staphylococcus aureus YsxC protein: Molecular insights into ribosome assembly and allosteric inhibition of the protein. J. Mol. Model. 2011;17:3129–3149. doi: 10.1007/s00894-011-0998-3. [DOI] [PubMed] [Google Scholar]
  • 50.Suppahia A., Itagi P., et al. Roelofs J. Cooperativity in Proteasome Core Particle Maturation. iScience. 2020;23 doi: 10.1016/j.isci.2020.101090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Sharon M., Witt S., Robinson C.V., et al. Mass spectrometry reveals the missing links in the assembly pathway of the bacterial 20 S proteasome. J. Biol. Chem. 2007;282:18448–18457. doi: 10.1074/jbc.M701534200. [DOI] [PubMed] [Google Scholar]
  • 52.Vakser I.A., Deeds E.J. Computational approaches to macromolecular interactions in the cell. Curr. Opin. Struct. Biol. 2019;55:59–65. doi: 10.1016/j.sbi.2019.03.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Vakser I.A., Grudinin S., et al. Deeds E.J. Docking-based long timescale simulation of cell-size protein systems at atomic resolution. Proc. Natl. Acad. Sci. USA. 2022;119 doi: 10.1073/pnas.2210249119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Tinberg C.E., Khare S.D., et al. Baker D. Computational design of ligand-binding proteins with high affinity and selectivity. Nature. 2013;501:212–216. doi: 10.1038/nature12443. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Allison B., Combs S., et al. Meiler J. Computational design of protein-small molecule interfaces. J. Struct. Biol. 2014;185:193–202. doi: 10.1016/j.jsb.2013.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Kosugi T., Iida T., et al. Koga N. Design of allosteric sites into rotary motor V1-ATPase by restoring lost function of pseudo-active sites. Nat. Chem. 2023;15:1591–1598. doi: 10.1038/s41557-023-01256-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Miller J.E., Srinivasan Y., et al. Yeates T.O. Designing Protease-Triggered Protein Cages. J. Am. Chem. Soc. 2022;144:12681–12689. doi: 10.1021/jacs.2c02165. [DOI] [PubMed] [Google Scholar]
  • 58.Bale J.B., Park R.U., et al. Baker D. Structure of a designed tetrahedral protein assembly variant engineered to have improved soluble expression. Protein Sci. 2015;24:1695–1701. doi: 10.1002/pro.2748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Pfoh R., Pai E.F., Saridakis V. Nicotinamide mononucleotide adenylyltransferase displays alternate binding modes for nicotinamide nucleotides. Acta Crystallogr. D Biol. Crystallogr. 2015;71:2032–2039. doi: 10.1107/S1399004715015497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Ho W.W., Li H., et al. Poulos T.L. Holo- and apo-bound structures of bacterial periplasmic heme-binding proteins. J. Biol. Chem. 2007;282:35796–35802. doi: 10.1074/jbc.M706761200. [DOI] [PubMed] [Google Scholar]
  • 61.Hadler K.S., Tanifum E.A., et al. Schenk G. Substrate-promoted formation of a catalytically competent binuclear center and regulation of reactivity in a glycerophosphodiesterase from Enterobacter aerogenes. J. Am. Chem. Soc. 2008;130:14129–14138. doi: 10.1021/ja803346w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Briggs K.E.M. The University of Kansas; 2018. Disrupted Pathways: Generating Tunable Macromolecular Assembly Pathways. [Google Scholar]
  • 63.Deeds E.J., Krivine J., et al. Fontana W. Combinatorial complexity and compositional drift in protein interaction networks. PLoS One. 2012;7 doi: 10.1371/journal.pone.0032032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Saiz L., Vilar J.M.G. Stochastic dynamics of macromolecular-assembly networks. Mol. Syst. Biol. 2006;2:2006.0024. doi: 10.1038/msb4100061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Zhang T., Schwartz R. Simulation study of the contribution of oligomer/oligomer binding to capsid assembly kinetics. Biophys. J. 2006;90:57–64. doi: 10.1529/biophysj.105.072207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Xie L., Smith G.R., et al. Schwartz R. Surveying capsid assembly pathways through simulation-based data fitting. Biophys. J. 2012;103:1545–1554. doi: 10.1016/j.bpj.2012.08.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Duke T.A.J., Le Novère N., Bray D. Conformational spread in a ring of proteins: A stochastic approach to allostery. J. Mol. Biol. 2001;308:541–553. doi: 10.1006/jmbi.2001.4610. [DOI] [PubMed] [Google Scholar]
  • 68.Mayer B.J., Blinov M.L., Loew L.M. Molecular machines or pleiomorphic ensembles: signaling complexes revisited. J. Biol. 2009;8:81. doi: 10.1186/jbiol185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Suderman R., Deeds E.J. Machines vs. Ensembles: Effective MAPK Signaling through Heterogeneous Sets of Protein Complexes. PLoS Comput. Biol. 2013;9 doi: 10.1371/journal.pcbi.1003278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Hlavacek W.S., Faeder J.R., et al. Goldstein B. The Complexity of Complexes in Signal Transduction. Biotechnol. Bioeng. 2003;84:783–794. doi: 10.1002/bit.10842. [DOI] [PubMed] [Google Scholar]
  • 71.Danos V., Feret J., Krivine J., et al. In: Verification, Model Checking, and Abstract Interpretation. Logozzo F., Peled D.A., Zuck L.D., editors. Lecture Notes in Computer Science; Heidelberg: 2008. Abstract interpretation of cellular signalling networks; pp. 83–97. [Google Scholar]
  • 72.Camacho C.J., Kimura S.R., et al. Vajda S. Kinetics of Desolvation-Mediated Protein-Protein Binding. Biophys. J. 2000;78:1094–1105. doi: 10.1016/S0006-3495(00)76668-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Tait S.W.G., Green D.R. Mitochondria and cell death: Outer membrane permeabilization and beyond. Nat. Rev. Mol. Cell Biol. 2010;11:621–632. doi: 10.1038/nrm2952. [DOI] [PubMed] [Google Scholar]
  • 74.Shiozaki E.N., Chai J., Shi Y. Oligomerization and activation of caspase-9, induced by Apaf-1 CARD. Proc. Natl. Acad. Sci. USA. 2002;99:4197–4202. doi: 10.1073/pnas.072544399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Teng X., Hardwick J.M. The apoptosome at high resolution. Cell. 2010;141:402–404. doi: 10.1016/j.cell.2010.04.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Qi H., Jiang Y., et al. Shuai J. Optimal pathways for the assembly of the Apaf-1·cytochrome: C complex into apoptosome. Phys. Chem. Chem. Phys. 2018;20:1964–1973. doi: 10.1039/c7cp06726g. [DOI] [PubMed] [Google Scholar]
  • 77.Mangan S., Alon U. Structure and function of the feed-forward loop network motif. Proc. Natl. Acad. Sci. USA. 2003;100:11980–11985. doi: 10.1073/pnas.2133841100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Shtilerman M., Lorimer G.H., Englander S.W. Chaperonin Function: Folding by Forced Unfolding. Science. 1999;284:822–825. doi: 10.1126/science.284.5415.822. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Klinge S., Woolford J.L. Ribosome assembly coming into focus. Nat. Rev. Mol. Cell Biol. 2019;20:116–131. doi: 10.1038/s41580-018-0078-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Kelman Z., O’donnell M. DNA POLYMERASE III HOLOENZYME: Structure and Function of a Chromosomal Replicating Machine. Annu. Rev. Biochem. 1995;64:171–200. doi: 10.1146/annurev.bi.64.070195.001131. [DOI] [PubMed] [Google Scholar]
  • 81.Farnung L., Vos S.M. Assembly of RNA polymerase II transcription initiation complexes. Curr. Opin. Struct. Biol. 2022;73 doi: 10.1016/j.sbi.2022.102335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Maurizi M.R. Proteasome assembly: Biting the hand…. Curr. Biol. 1998;8:453–456. doi: 10.1016/s0960-9822(98)70291-x. [DOI] [PubMed] [Google Scholar]
  • 83.Jonckheere A.I., Smeitink J.A.M., Rodenburg R.J.T. Mitochondrial ATP synthase: Architecture, function and pathology. J. Inherit. Metab. Dis. 2012;35:211–225. doi: 10.1007/s10545-011-9382-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Pagliano C., Saracco G., Barber J. Structural, functional and auxiliary proteins of photosystem II. Photosynth. Res. 2013;116:167–188. doi: 10.1007/s11120-013-9803-8. [DOI] [PubMed] [Google Scholar]
  • 85.Lynch M., Marinov G.K. The bioenergetic costs of a gene. Proc. Natl. Acad. Sci. USA. 2015;112:15690–15695. doi: 10.1073/pnas.1514974112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Schavemaker P.E., Lynch M. Flagellar energy costs across the tree of life. Elife. 2022;11 doi: 10.7554/eLife.77266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Krissinel E., Henrick K. Inference of macromolecular assemblies from crystalline state. J. Mol. Biol. 2007;372:774–797. doi: 10.1016/j.jmb.2007.05.022. [DOI] [PubMed] [Google Scholar]
  • 88.Horton N., Lewis M. Calculation of the free energy of association for protein complexes. Protein Sci. 1992;1:169–181. doi: 10.1002/pro.5560010117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Bougouffa S., Warwicker J. Volume-based solvation models out-perform area-based models in combined studies of wild-type and mutated protein-protein interfaces. BMC Bioinf. 2008;9:448. doi: 10.1186/1471-2105-9-448. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Kelly J.W. The alternative conformations of amyloidogenic proteins and their multi-step assembly pathways. Curr. Opin. Struct. Biol. 1998;8:101–106. doi: 10.1016/s0959-440x(98)80016-x. [DOI] [PubMed] [Google Scholar]
  • 91.Neuenkirchen N., Chari A., Fischer U. Deciphering the assembly pathway of Sm-class U snRNPs. FEBS Lett. 2008;582:1997–2003. doi: 10.1016/j.febslet.2008.03.009. [DOI] [PubMed] [Google Scholar]
  • 92.Yoda M., Kawamata T., et al. Tomari Y. ATP-dependent human RISC assembly pathways. Nat. Struct. Mol. Biol. 2010;17:17–23. doi: 10.1038/nsmb.1733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Mulder A.M., Yoshioka C., et al. Williamson J.R. Visualizing ribosome biogenesis: Parallel assembly pathways for the 30S subunit. Science. 2010;330:673–677. doi: 10.1126/science.1193220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Jacobs W.M., Reinhardt A., Frenkel D. Rational design of self-assembly pathways for complex multicomponent structures. Proc. Natl. Acad. Sci. USA. 2015;112:6313–6318. doi: 10.1073/pnas.1502210112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Shockley E.M., Rouzer C.A., et al. Lopez C.F. Signal integration and information transfer in an allosterically regulated network. NPJ Syst. Biol. Appl. 2019;5 doi: 10.1038/s41540-019-0100-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Chari A., Fischer U. Cellular strategies for the assembly of molecular machines. Trends Biochem. Sci. 2010;35:676–683. doi: 10.1016/j.tibs.2010.07.006. [DOI] [PubMed] [Google Scholar]
  • 97.Ligas J., Pineau E., et al. Meyer E.H. The assembly pathway of complex I in Arabidopsis thaliana. Plant J. 2019;97:447–459. doi: 10.1111/tpj.14133. [DOI] [PubMed] [Google Scholar]
  • 98.Gallastegui N., Groll M. The 26S proteasome: assembly and function of a destructive machine. Trends Biochem. Sci. 2010;35:634–642. doi: 10.1016/j.tibs.2010.05.005. [DOI] [PubMed] [Google Scholar]
  • 99.Goehring N.W., Beckwith J. Diverse paths to midcell: Assembly of the bacterial cell division machinery. Curr. Biol. 2005;15:R514–R526. doi: 10.1016/j.cub.2005.06.038. [DOI] [PubMed] [Google Scholar]
  • 100.Vakser I.A., Grudinin S., et al. Deeds E.J. Docking-based long timescale simulation of cell-size protein systems at atomic resolution. Proc. Natl. Acad. Sci. USA. 2022;119 doi: 10.1073/pnas.2210249119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.von Bülow S., Siggel M., et al. Hummer G. Dynamic cluster formation determines viscosity and diffusion in dense protein solutions. Proc. Natl. Acad. Sci. USA. 2019;116:9843–9852. doi: 10.1073/pnas.1817564116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Heo L., Sugita Y., Feig M. Protein assembly and crowding simulations. Curr. Opin. Struct. Biol. 2022;73 doi: 10.1016/j.sbi.2022.102340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Feig M., Sugita Y. Whole-cell models and simulations in molecular detail. Annu. Rev. Cell Dev. Biol. 2019;35:191–211. doi: 10.1146/annurev-cellbio-100617-062542. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Zwickl P., Kleinz J., Baumeister W. Critical elements in proteasome assembly. Nat. Struct. Biol. 1994;1:765–770. doi: 10.1038/nsb1194-765. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Supporting text, Figs. S1–S16, Table S1, and supporting references
mmc1.pdf (12.2MB, pdf)
Document S2. Article plus supporting material
mmc2.pdf (16.3MB, pdf)

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES