Abstract
Natural molecular machines contain protein components that undergo motion relative to each other. Designing such mechanically constrained nanoscale protein architectures with internal degrees of freedom is an outstanding challenge for computational protein design. Here we explore the de novo construction of protein machinery from designed axle and rotor components with internal cyclic or dihedral symmetry. We find that the axle-rotor systems assemble in vitro and in vivo as designed. Using cryoelectron microscopy we find that these systems populate conformationally variable relative orientations reflecting the symmetry of the coupled components and the computationally designed interface energy landscape. These mechanical systems with internal degrees of freedom are a step towards the design of genetically encodable nanomachines.
One-Sentence Summary:
Computationally designed self-assembling axle-rotor protein systems populate multiple rotational states.
Intricate protein nanomachines in nature have evolved to process energy and information by coupling biochemical free energy to mechanical work. Among the best studied and most sophisticated are protein rotary machines such as the F1 motor of ATPase or the bacterial flagellum, which contain axle-like and ring-like symmetric protein components capable of constrained dynamic motion relative to each other (1,2,3). Feynman’s 1959 lecture on nanotechnology as a means to leverage the properties of materials at the molecular scale(4) inspired interest in synthetic nanomachines(5,6). Synthetic chemists were the first to design molecules with mechanically coupled components(7–9). Nucleic acid nanotechnologies have more recently been used to construct rotary systems(10). Designed dynamic protein mechanical systems are of great interest given the richer functionality of proteins, but with this functionality comes more complex folding and a greater diversity of non-covalent interactions which, despite recent advances in design of static protein nanostructures(11–19), has made the design of protein machines an outstanding challenge(20).
We explored the design of protein mechanical systems through a first-principle, bottom-up approach that decouples operational principles from the complex evolutionary trajectory of natural nanomachines. Previous two component protein assembly design studies have focused on nanomaterials such as icosahedral nanocages(21) and two dimensional arrays(19) in which the components have fixed orientations relative to one another. Here we seek to design a nanoscale simple machine or kinematic pair (22,23) in which two protein components can move relative to one another, as a proof of concept of mechanically constrained heterooligomeric assembly that can undergo brownian diffusion along internal degrees of freedom (DOF). We used a hierarchical design approach with three steps: (i) de novo design of stable and rigid protein components suitable for assembly into constrained mechanical systems (ii) directed self-assembly of these components into hetero-oligomeric complexes, (iii) shaping of the multistate energetic landscape along the mechanical degrees of freedom. A major challenge is to design the interface between the two designed rigid bodies to have sufficiently low energy to drive self assembly, while still allowing relative motion of components. We started from a machine blueprint that consists of two coupled structural components resembling an axle and rotor (fig. 1A), in which, similar to natural protein rotary systems, the features of the energy landscape are determined by the symmetry of the interacting components, their shape complementarity and specific interactions across the interface.
Computational design of protein mechanical components
We first sought to design ring-like protein topologies, or rotors, with a range of inner diameter sizes capable of accommodating an axle-like binding partner (fig. 1B). In a first approach we started from de novo alpha-helical tandem repeat proteins (24), and redesigned them to form C1 single chain structures or symmetric C3 or C4 homooligomers. In a second approach we started from de novo helical repeat proteins (DHRs) and helical bundle heterodimers and used a hierarchical design procedure based on architecture-guided rigid helical fusion(14) to build C3 and C5 cyclic symmetric rotor structures. To facilitate subsequent microscopy characterization and modularity, we fused another set of DHRs at the outer side of the rotors, generating arm-like extensions (fig. 1A–B). Synthetic genes encoding these rotor designs (12xC3s, 12xC4s, 2xC5s) were synthesized and the proteins expressed in E. coli. All designed proteins were soluble after purification on nickel-nitrilotriacetic acid (Ni-NTA) columns and ~23% (6/26) had size exclusion chromatography (SEC) profiles that matched the expected theoretical elution profile for the oligomerization state (fig. S1–2, Table S1). These designs were further examined using small-angle X-ray scattering (SAXS)(25,26), negative stain electron microscopy or cryoelectron microscopy (cryoEM) (fig. S1). For the C3_R1 rotor, SAXS data analysis was consistent with the computational model (Volatility ratio (Vr)=4.684, Table S2, fig. S2) and we were able to determine using cryoEM a 6.0Å 3D reconstruction which was close to the design model (backbone RMSD=3.451Å, fig. 1B, fig. S1, fig. S4–5, Table S3). Similar results were obtained for another design of the same topology (C3_R2) (fig. S1). For the C4 design C4_1 we obtained a 7.9Å cryo electron density map closely consistent with the design model (backbone RMSD=1.8Å, fig. 1B, fig. S1, fig. S5–6, Table S3). C3 and C5 rotors with larger inner diameter and different topology (C3_R3, C5_2) were characterized using negative stain EM, yielding low resolution 3D reconstructions consistent with the design model (fig. 1B, fig. S1).
We next sought to design high aspect ratio protein components, or axles, onto which the designed rotor protein could be threaded, using three different design approaches. In a first approach, single helix backbones were parametrically generated and the sequence was optimized in D2, D3 or D4 dihedral symmetry using buried hydrogen bond networks and hydrophobic contacts to produce self-assembling homooligomer interfaces with the high level of specificity needed for dihedral assembly (fig. 2A). To increase the total mass and diversify the shape for subsequent EM analysis, the termini of these rod shape structures were rigidly fused to cyclic homooligomers of matching symmetry (i.e. Dn dihedral assemblies were fused with Cn cyclic assemblies) to create dumbbell shaped structures. In a second approach, two copies of designed cyclic homooligomers were assembled into dihedral structures by connecting them with rigid helical bundle connectors built using fragment sampling (fig. 2B). In a third approach, parametrically generated homotrimer backbones consisting of helical hairpin monomer topologies(27) were circularly permuted and elongated to generate extended C3 homooligomers (fig. 2C). Details of the methods, as well as scripts for carrying out the design calculations, are provided in the supplementary materials. Synthetic genes encoding axle designs generated from the three approaches (12xC3s, 12xC5s, 12xC8s, 6xD2s, 12xD3s, 6xD4s, 6xD5s, 12xD8s) were obtained and the proteins were expressed in E. coli. The designed proteins that were well-expressed, soluble, and readily purified by Ni-NTA affinity chromatography were further purified on SEC. Success rates for the first, second and third approach were 37.5% (6/16), 43% (14/32) and 33% (4/12) respectively as assessed by the match between estimated molecular weight (MW) from SEC with the MW of the design model (fig. 2D, fig. S1–3, Table S1). Designs with matching SEC traces were further examined using SAXS, negative stain electron microscopy, and cryoEM (fig. S1–3).
The first approach generated D2, D3 and D4 axle-like structures with folds featuring interdigitated helices with extended hydrogen bond networks. We obtained a 4.2Å 3D reconstruction of a D3 axle (D3_3) with backbone nearly identical to the design model (backbone RMSD=1.9Å, fig. 2A, fig. S3,-4, fig. S7); SAXS data were also consistent with the design model (Vr=6.0, Table S2, fig. S1–2). The central homohexameric 50 residue helices (D3_2) could also be solubly expressed and formed an oligomeric self-assembly that eluted at the expected volume (fig. S3, Table S1). D3 design D3_1 consisting of 36 residue long single helices was produced by chemical peptide synthesis and assembled into a homohexamer ( fig. S3, fig. S8), and fusion to wheel-like C3s generated a larger D3 oligomer as designed (D3_4, fig. S3). A D4 peptide homo-oligomer designed using the same approach (D4_1) had a SEC profile consistent with the expected oligomeric state (fig. S2–3, Table S1). Negative stain EM of a D2 design (D2_2) yielded a low resolution 3D reconstruction with the overall features of the design model (fig. 2D, fig. S3); the corresponding central 50 residue D2 peptide (D2_1) could also be expressed and the SEC elution volume corresponded to the expected oligomeric state (fig. S3, Table S1).
The second approach generated D3, D4, D5 and D8 axle-like structures with interdigitated helices with internal cavities in the D5 and D8 cases where each central helix only contacts the two neighboring ones (fig. 2B). We obtained a 7.4Å electron density map of a D8 design (D8_1) revealing a backbone structure nearly identical to the design model (backbone RMSD=2.9Å, fig. 2B, fig. S3, fig. S5–6). This cylinder-shaped homodecahexamer has a large central cavity, an 84 residue helix, and opposing N and C termini close to its center (fig. 2B, fig. S3). Negative stain EM 3D reconstructions of D8_2 and D8_3, D5_2 and D4_2 were consistent with the design models (fig. 2D, fig. S3). We converted several of these designs from dihedral to cyclic symmetry by connecting N and C termini, and two such designs, one C5 (C5_1) and one C8 (C8_1), yielded EM reconstructions with good agreement with the design model (fig. 2D, fig. S1, fig. S3). SAXS profiles of additional designs (4xD3s, 2xD4s and 1xD5) were consistent with the design models with Vr < 10 in most cases and measured MW within 15% of design model for D3_1, D3_8, and within 1% for D5_1 (Table S2, fig. S2–3).
The third approach yielded four C3 axles with smaller aspect ratios and overall sizes, containing a large wheel-like feature at one end, a narrow central three helix section and a six helix section at the other end. SAXS profiles together with SEC traces suggested that the designed oligomerization state is realized in solution (Vr~12, Table S1–2, fig. S1–2). For design C3 A1 we obtained a low resolution cryoEM map that recapitulates the general features of the design model, with prominent C3 symmetric DHR extremities and opposing prism-like extensions (fig. 2C, fig. S1, fig. S4).
Design of mechanically coupled axle-rotor assemblies
We next investigated the construction of mechanically constrained axle-rotor assemblies from the designed axles and rotors. As noted above, an inherent challenge for the de novo design of dynamic protein complexes is to incorporate sufficient energetically favorable interactions to enable directed self-assembly without creating deep energy minima that lock the assembly into a single state and prevent Brownian diffusion along the mechanical DOFs. We explored three approaches for constructing axle-rotor assemblies which result in interfaces with widely varying energetics, shape complementarity, and symmetry.
First, we sought to construct two-component assemblies in which the rigid body orientation of the axle and rotor was minimally constrained. We designed symmetry mismatched axle-rotor interfaces with low orientational specificity and loose interface packing, allowing only small numbers of close contacts across the interface and employing primarily electrostatic interactions between rotor and axle, which are longer range and less dependent on shape matching than the hydrophobic interactions generally used in protein design. To prevent potential disassembly at low concentrations due to weak axle-rotor interactions, we sought to kinetically trap the rotor around the axle by installing disulfide bonds at the rotor subunit-subunit interfaces. To gain stepwise control on the in vitro assembly process, we introduced buried histidine mediated hydrogen bond networks at the asymmetric interfaces between rotor subunits to enable pH controlled rotor assembly (fig. S9, see methods). To test this approach we selected three of the machine components described above -- a D3 axle, a C3 rotor and a C5 rotor -- and constructed axle-rotor assemblies with D3-C3 and D3-C5 symmetries (design D3-C5 and D3-C3 respectively, fig. 3A, fig. S10). To thread axles and rotor together, we computationally sampled rotational and translational DOF, and designed complementary electrostatic interacting surfaces excluding positively charged residues on the axle (Lysine and Arginine) and negatively charged residues (Aspartate and Glutamate) on the rotor. Due to the shape complementarity between the internal diameter of the rotors and the axle thickness, the interface is tight for D3-C3, constraining the rotor on the axle, and loose for D3-C5: by design, the D3-C3 can rotate and translate along the main symmetry axis (z), while the D3-C5 rotor has rotation and translation components along x, y and z (fig. 3A–B, fig. S11). Synthetic genes encoding the one axle and two rotor designs were obtained and the proteins were separately expressed in E. coli and purified by Ni-NTA affinity chromatography and SEC, which indicated that the surface redesign did not affect solubility or oligomerization state (fig. S1, fig. S3). Following stoichiometric mixing of the designed D3 axle and C3 rotor, EM analysis showed a collection of assembled and isolated axle and rotor molecules (fig. S9A, top panel). After dropping the pH and reducing the disulfide, the particles appeared as a mixture of opened, linear and hard to distinguish particles (fig. S9A, middle panel). After restoring the pH under oxidizing conditions, the particles appeared fully assembled by EM (fig. S9A, bottom panel). Biolayer interferometry assays showed that the rotor and axle associated rapidly with an approximate association rate of 103 M−1.s−1 and a Kd in the micromolar range (fig. S12). Similar results were obtained with D3-C5 rotary assemblies, and SEC and SAXS profiles were in agreement with the design model in both cases (Vr<15, Table S1–2, fig. S2, fig. S10).
Second, we experimented with more direct steric coupling to limit conformational variability primarily to rotation of the rotor around the axle. We employed shape complementary axle and rotor components to enable the incorporation of steric constraints restricting translation, leveraging Rosetta’s ability to design tightly packed interfaces and hydrogen-bond network mediated specificity(27). We designed 7 axle-rotor assemblies using this approach: three with C3 symmetric axles with C1 rotors (C3-C1_1-3, fig. S10) and four larger designs with C3 axles and rotors (C3-C3_1-4) (fig. 4A, fig. S10) with DHR arm extensions. The C3 symmetry matching of the rotor and axle differs from the mismatching in the other designed assemblies, and the extent of alignment of axle and rotor DHR arms relative to each other provides a measure of conformational variability. Design was carried out by systematically sampling rotational and translational DOF, removing arrangements with backbone to backbone clashes (see methods), and then using the Rosetta HBnet protocol and FastDesign(28) to identify interacting residues and optimize the interface energy. Each interface design trajectory generates widely different periodic energy landscapes according to interface metrics and design specifications (fig. S13), and results in shape complementary axle-rotor interfaces with an overall cogwheel topology. C3-C1 designs were experimentally screened for assembly by expressing rotor and axle pairs bicistronically and carrying out Ni-NTA purification relying on a single HIS tag on the rotor component (fig. S14A). 50% (6/12) expressed solubly and copurified, suggesting that the two components assembled in cells (fig. S14B), and three designs (C3-C1_1-3, fig. S10) were selected for further characterization. The SEC profiles in combination with native mass spectrometry indicated an oligomeric state consistent with the designed assembly, and SAXS data were also consistent with the design model (Vr<12 and MW within ~10% of expected values for C3-C1_3, and ~15% for C3-C1_1-2, Table S1–2, fig. S2, fig. S10, fig. S14C–D). The C3-C3 designs (C3-C3_1-4, fig. S10) were screened for in vitro assembly by stoichiometric mixing of axle and rotor, followed by SEC and SAXS analysis, which were consistent with assemblies of the expected oligomeric state (Vr<10, Table S1–2, fig. S2, fig. S10). Biolayer interferometry showed that the designed C3 axle and C3 rotor rapidly assemble with an approximate association rate of 103 M−1.s−1 and a Kd in the micromolar range (fig. S12).
Third, we sought to design further constrained axle-rotor assemblies by increasing the surface area of the interfaces between axle and rotor to enable more extensive sculpting of the energy landscape. We designed a symmetry mismatched assembly consisting of a D8 axle around which two C4 rotors are assembled (D8-C4), a symmetry mismatched assembly consisting of C5 axle and C3 rotor (C5-C3_1 and C5-C3_2), as well as a C8-C4 assembly corresponding to a circular permutation version of D8-C4 (C8-C4) (fig. 4A, fig. S10). The D8-C4 assembly with one axle for two rotors tests the incorporation of multiple coupled rotational DOF in a multicomponent system and also provides a simple way to monitor the position of rotors relative to each other by experimental structural characterization. For the D8-C4, C5-C3 and C8-C4 designs, since the symmetry of the rotor is internally mismatched to the axle, we used a quasisymmetric design protocol (see methods). The C4 rotor has internally C24 symmetry, and hence is symmetry matched to both D8 and C8 axles. In contrast, the C5-C3 arrangement has broken symmetry with a resulting energy landscape with 15 energy minima, with periodicities reflecting the constituent C5 and C3 symmetries (fig. S13). Twelve D8-C4 designs, twelve C5-C3 and six C8-C4 designs were screened for in vitro assembly by isolating axle and rotors individually by Ni-NTA purification and mixed stoichiometrically. We selected 4 of these designs for further experimental investigation and obtained SEC data indicative of assembly of axle-rotor complexes, while SAXS analysis of a C5-C3 design suggested assembly of the axle-rotor complex (Vr=6.9 and predicted MW within 6% of expectation, Table S1–2, fig. S2, fig. S9). Biolayer interferometry binding kinetics and negative stain EM data were also consistent with quantitative assembly into the designed hetero-oligomeric complex (fig. S10, fig. S13).
Correspondence between designed energy landscape and observed mechanical DOF
We subjected one construct from each design approach and symmetry class to single particle cryoEM examination and related these data to energy landscape calculations based on the design model (fig. 3–4). Comparison of the electron density data on the axle-rotor assemblies to data on the isolated rotors and axles suggest considerable variation in their rigid body orientations, as summarized in fig. S17–19 and S21.
For the D3-C3 and D3-C5 assemblies produced by the first approach, we obtained 2D class averages that clearly resembled projection maps computed from the design models, and 3D reconstructions in close agreement with the overall design model topology and designed hetero-oligomeric state (fig. 3C–D, fig. S15–16, Table S3). For both designs, the D3 axle was clearly visible and we obtained a high resolution structure of the axle nearly identical to the design model. 3D reconstructions in C1, C3 and D3 of the D3-C3 axle-rotor assembly at 7.8Å resolution showed clear density corresponding to the rotor in the middle of the axle with the C3 rotor arms clearly evident (fig. 3C, fig. S15). 3D reconstructions of the D3-C5 design also showed clear density for the rotor which could be isolated by masking the axle, but its resolution could not be further improved as the secondary structure placement relative to the axle appeared variable (fig. 3D, fig. S16). The particle alignment algorithm is likely dominated by features of the axle which is mostly in side-view in the data (S17–18), and thus the lack of resolution of the electron density corresponding to the rotor (see supplementary materials) is probably due to variability in the axle-rotor rigid body transform. Cryosparc 3D variability analysis(29) suggests that the rotor can populate multiple translational and rotational conformational states around the axle (Movie S1–4). Inspection of the cryoEM 3D reconstruction also suggests the rotor arms populate multiple positions along the rotational axis (fig. 3C–D, fig. S17–18). Rosetta energy landscapes generated by rotating and translating the rotor relative to the axle suggest that a broad range of orientations are energetically accessible (fig. 3B), and the rotor-axle rigid body orientation fluctuated in molecular dynamics simulations (MD), with the D3-C5 assembly showing increased displacement compared to D3-C3 (fig. 3B, fig. S11, fig. S17–18). Explicit modelling of conformational variability along the designed DOFs was necessary to produce computed projections closely resembling the experimental 2D class averages (fig. 3C–D, fig. S18). Taken together, the cryoEM data, Rosetta models and molecular dynamics simulations are consistent with the design goal of constrained mechanical coupling of axle and rotor components (see supplementary fig. S17–18 for summary of data indicating conformational sampling of rotor-axle rigid body DOFs).
Amongst the assemblies generated with the second approach, single particle cryoEM analysis of a C3-C3 assembly yielded 2D class averages with the axle and rotor clearly visible. Resolution was limited by the orientation bias of the particle in ice resulting in few side views, but we were able to obtain a 6.5Å 3D reconstruction which resembled the design model (fig. 4A, fig. S4, fig. S10, fig. S19, Table S3). 2D averages and the 3D reconstruction clearly capture the rotor component, but the axle was only partially resolved; the rotor has a mass greater than the C3 axle and clear “arm”-like features, which likely bias the alignment algorithm in its favor. Aligning on the rotor yielded a density map with diffuse density for the axle near the rotor (fig. S19). The contrast between the diffuse density for the axle and the well resolved density of the rotor likely reflects conformational variability (fig. 4C–D, fig. S4, fig. S18–19). The Rosetta energy landscape suggests that the axle-rotor assembly can primarily sample rotational rather than translational DOFs (fig. 4B), and rotational averaging increased the similarity between projections of the design model and the experimental data (fig. 4C–D, fig. S18–19). Taken together, the data (summarized in fig. S19), are consistent with variability along the rotational DOF, in accordance with the designed energy landscape which has 3 energy minima at a 60° rotation distance and 9 other 30° spaced degenerate alternative wells separated by low energy barriers (fig. 4B; fig. S13, fig. S20).
The D8-C4 design generated by the third approach has a rugged energy landscape, with a dynamic range of 151 kcal/mol (as estimated by Rosetta), with 8 steep wells spaced 45° stepwise along the rotational axis corresponding to the high symmetry of the interface (fig. 4B). Consistent with the deep minima in this landscape, we obtained a cryoEM map of ~5.9Å resolution that is close to the design model (fig. 4C–D, fig. S6, Table S3). 3D variability analysis calculations using Cryosparc(30) suggested two nearly equiprobable states in which the rotor arms are either aligned or offset, as in the eclipsed and staggered arrangements of ethane (fig. 4D–E, fig. S6, fig. S18, fig. S21, Movie S5). The two rotational states of one rotor relative to the other suggest energy minima spaced by 45° along the rotational axis, consistent with an 8-fold step like feature in the frequency spectrum analysis of the computed energy landscape (fig. S21). While cryoEM provides a frozen snapshot of molecules and not a real time measurement of diffusion, these data (summarized in fig. S20) suggest that the system populates multiple rotational states consistent with the designed energy landscape. Taken together, these results suggest that the explicit design of side-chain interactions and deep energy minima reduces the degeneracy of conformational states observed with purely electrostatic interactions, and support a correspondence between the energy landscape and the observed conformational variability.
Conclusions
Our proof of concept axle-rotor assemblies demonstrate that protein nanostructures with internal mechanical constraints can now be systematically designed. Key to this advance is the ability to computationally design protein components with complex complementary shapes, symmetries and topologies, such as the high aspect ratio dihedral axle structures (D2 homotetramers to D8 homo-16-mers (fig. 1–2) with oligomerization states and sizes considerably larger than previously designed dihedral structures. Our studies of assembly of these shape complementary homo-oligomeric components into higher order hetero-oligomeric structures with internal degrees of freedom provide insights towards the design of complex protein nanomachines. First, computational sculpting of the interface between the components can be used to promote self-assembly of constrained systems with chosen internal degrees of freedom. Second, the shape and periodicity of the resulting energy landscape is determined by the symmetry of components, the shape complementarity of the interface, and the balance between hydrophobic packing and conformationally promiscuous electrostatic interactions (fig. 3A–B, fig. 4A–B). Symmetry mismatch generates assemblies with larger numbers of energy minima than symmetry matched ones evident in the frequency domain (fig. S13, fig. S20), and explicit design of close sidechain packing across the interface results in deeper minima and higher barriers than non-specific interactions (fig. 3–4, fig. S13). In general, the surface area of the interface between axle and rotor scales with the number of subunits in the symmetry, with larger surface areas providing a larger energetic dynamic range accessible for design (fig. 3–4, fig. S13). The combination of the conformational variability apparent in the cryoEM data of D3-C3, D3-C5 and C3-C3 designs (fig. 3C–D, fig. 4C–D, fig. S4, fig. S15–19), the Rosetta and MD simulations (fig. 3B, fig. 4B, fig. S11), and the discrete states observed for the D8-C4 design (fig. 4D–E, fig. S6, fig. S21), suggests that these assemblies populate multiple rotational states (the axle-rotor assemblies also have multiple symmetrically identical yet physically distinct rotational states—for example, rotation of the C3 rotor around the C3 axle by 120 degrees--which cannot be distinguished by cryoEM). Our cryoEM analysis cannot distinguish whether the conformational variability reflects rotational motion or states captured during axle-rotor assembly, and do not report on energy barrier heights; time-resolved microscopy at the single molecule level will be required to reveal the dynamics of transitions between the different states, and relate the computational sculpting of the rotational energy landscapes to Brownian dynamics.
The internal periodic but asymmetric rotational energy landscapes of our mechanically coupled axle-rotor systems provide one of two needed elements for a directional motor. Coupling to an energy input to break detailed balance and drive directional motion remains to be designed: for example the interface between machine components could be designed for binding and catalysis of a small molecule fuel (22). Symmetry mismatch, which plays a crucial role in torque generation in natural motors (31,32), can incorporated synthetic protein motors as illustrated here for our rotor-axle assemblies. Modular assembly could lead to compound machines for advanced operation or integration within nanomaterials, and the components can be further functionalized using reversible heterodimer extensions (34) (fig. S22). Our protein systems can be genetically encoded for multicomponent self-assembly within cells (fig. S14) or in vitro (fig. S9, fig. S12) Taken together, these approaches could enable the engineering of a range of nanodevices for medicine, material sciences or industrial bioprocesses. More fundamentally, de novo design provides a bottom-up platform to explore the fundamental principles and mechanisms underlying nanomachine function that complements long standing studies of the elaborate molecular machines produced by natural evolution.
Supplementary Material
Acknowledgements
We thank Bridget Carragher, Clint Potter, Ed Eng, Laura Yen, and Misha Kopylov of the New York Structural Biology Center, for assistance and helpful discussions. We especially thank Sjors Scheres of MRC-LMB for helpful discussions and guidance regarding cryoEM data processing. We would like to thank the Rosetta@Home user base for donating their computational hours to run forward folding simulations. We thank Florian Busch and Vicki Wysocki at Ohio State University for providing expert support with native mass spectrometry experiments and Vicki Wysocki at Ohio State University for providing expert support with native mass spectrometry experiments. We especially thank Danny Sahtoe for all the scientific support and many insightful discussions. Thanks to Florian Praetorius for the brainstorming sessions dedicated to designing de novo protein motors. An additional thanks to Asim Bera and Matt Bick, and Justin Decarreaux for support in crystallography and optical microscopy respectively. Thanks to Tom Daniel for all the great interactions and fascinating ideas and discussion and Luis Ceze for accompanying the thought process regarding the design of protein nanomachines, either computational and mechanical. Thanks to Brian Coventry for very helpful advice and computational help, and Lance Stewart for expert help, advice, perspectives and discussions.
Funding
National Science Foundation (NSF) award 1629214 (DB)
A generous gift from the Audacious Project (DB)
the Open Philanthropy Project Improving Protein Design Fund (DB)
University of Washington Arnold and Mabel Beckman cryo-EM center (DB, DV, JK, JQ)
A S10 award funded the purchase of a Glacios microscope, award number S10OD032290 (DB, DV, JK, JQ)
The National Institute of Allergy and Infectious Diseases, DP1AI158186 and HHSN272201700059C (DV)
Pew Biomedical Scholars Award (DV)
Investigators in the Pathogenesis of Infectious Disease Awards from the Burroughs Wellcome Fund (DV)
Human Frontiers Science Program Long Term Fellowship (AC)
Washington Research Foundation Senior Fellow (AC)
Howard Hughes Medical Institute research (AC)
Howard Hughes Medical Institute Hanna Gray fellowship grant GT11817 (NB)
SAXS data was collected at the Advanced Light Source (ALS), SIBYLS beamline on behalf of US DOE-BER, through the Integrated Diffraction Analysis Technologies (IDAT) program. Additional support comes from the NIGMS project ALS-ENABLE (P30 GM124169) and a High-End Instrumentation Grant S10OD018483. The Berkeley Center for Structural Biology is supported in part by the National Institutes of Health (NIH), National Institute of General Medical Sciences, and the Howard Hughes Medical Institute.
The Advanced Light Source (ALS) is supported by the Director, Office of Science, Office of Basic Energy Sciences and US Department of Energy under contract number DE-AC02-05CH11231.
Some of this work was performed at the Pacific Northwest Center for Cryo-EM, which was supported by NIH grant U24GM129547 and performed at the PNCC at OHSU and accessed through EMSL (grid.436923.9), a DOE Office of Science User Facility sponsored by the Office of Biological and Environmental Research.
Molecular graphics and analyses were performed with UCSF Chimera, developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from NIH P41-GM103311.
Footnotes
Competing interests
AC, DB, JH, JK, NB and YH are inventors on a provisional patent application submitted by the University of Washington for the design, composition and function of the proteins created in this study.
Data and materials availability
All data are available in the main text or the supplementary materials. All the EM maps have been deposited in the EMDB (accession codes EMD-25575, EMD-25576, EMD-25577, EMD-25578, EMD-25579, EMD-25580).
References and Notes
- 1.Junge W & Nelson N ATP Synthase. Annu. Rev. Biochem 84, 631–657 (2015). [DOI] [PubMed] [Google Scholar]
- 2.Okuno D, Iino R & Noji H Rotation and structure of FoF1-ATP synthase. J. Biochem. (Tokyo) 149, 655–664 (2011). [DOI] [PubMed] [Google Scholar]
- 3.Deme JC et al. Structures of the stator complex that drives rotation of the bacterial flagellum. Nat. Microbiol 5, 1553–1564 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Feynman RP There’s Plenty of Room at the Bottom, in vol. 23 (5) 22–36 (California Institute of Technology Journal of Engineering and Science, 1959). [Google Scholar]
- 5.Zhang L, Marcos V & Leigh DA Molecular machines with bio-inspired mechanisms. Proc. Natl. Acad. Sci 115, 9397–9404 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Drexler K. Building molecular machine systems. Trends Biotechnol. 17, 5–7 (1999). [Google Scholar]
- 7.Feringa BL The Art of Building Small: From Molecular Switches to Molecular Motors. J. Org. Chem 72, 6635–6652 (2007). [DOI] [PubMed] [Google Scholar]
- 8.Sauvage J-P From Chemical Topology to Molecular Machines (Nobel Lecture). Angew. Chem. Int. Ed 56, 11080–11093 (2017). [DOI] [PubMed] [Google Scholar]
- 9.Cheng C & Stoddart JF Wholly Synthetic Molecular Machines. ChemPhysChem 17, 1780–1793 (2016). [DOI] [PubMed] [Google Scholar]
- 10.Ramezani H & Dietz H Building machines with DNA molecules. Nat. Rev. Genet 21, 5–26 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Baker D. What has de novo protein design taught us about protein folding and biophysics? Protein Sci. 28, 678–683 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Butterfield GL et al. Evolution of a designed protein assembly encapsulating its own RNA genome. Nature 552, 415–420 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chen Z. et al. De novo design of protein logic gates. Science 368, 78–84 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hsia Y. et al. Hierarchical design of multi-scale protein complexes by combinatorial assembly of oligomeric helical bundle and repeat protein building blocks. 10.1101/2020.07.27.221333 (2020) doi: 10.1101/2020.07.27.221333. [DOI]
- 15.Langan RA et al. De novo design of bioactive protein switches. Nature 572, 205–210 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Ueda G. et al. Tailored design of protein nanoparticle scaffolds for multivalent presentation of viral glycoprotein antigens. eLife 9, e57659 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Xu C. et al. Computational design of transmembrane pores. Nature 585, 129–134 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Divine R. et al. Designed proteins assemble antibodies into modular nanocages. Science 372, eabd9994 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ben-Sasson AJ et al. Design of biologically active binary protein 2D materials. Nature 589, 468–473 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kuhlman B & Bradley P Advances in protein structure prediction and design. Nat. Rev. Mol. Cell Biol 20, 681–697 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bale JB et al. Accurate design of megadalton-scale two-component icosahedral protein complexes. Science 353, 389–394 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Flechsig H & Mikhailov AS Simple mechanics of protein machines. J. R. Soc. Interface 16, 20190244 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.II. The kinematics of machines. Philos. Trans. R. Soc. Lond. Ser. Contain. Pap. Math. Phys. Character 187, 15–40 (1896). [Google Scholar]
- 24.Doyle L. et al. Rational design of α-helical tandem repeat proteins with closed architectures. Nature 528, 585–588 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Dyer KN et al. High-Throughput SAXS for the Characterization of Biomolecules in Solution: A Practical Approach. in Structural Genomics (ed. Chen YW) vol. 1091 245–258 (Humana Press, 2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hura GL et al. Comprehensive macromolecular conformations mapped by quantitative SAXS analyses. Nat. Methods 10, 453–454 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Boyken SE et al. De novo design of protein homo-oligomers with modular hydrogen-bond network-mediated specificity. Science 352, 680–687 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Maguire JB et al. Perturbing the energy landscape for improved packing during computational protein design. Proteins Struct. Funct. Bioinforma. 89, 436–449 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Punjani A & Fleet DJ 3D Variability Analysis: Resolving continuous flexibility and discrete heterogeneity from single particle cryo-EM. 10.1101/2020.04.08.032466 (2020) doi: 10.1101/2020.04.08.032466. [DOI] [PubMed]
- 30.Punjani A, Rubinstein JL, Fleet DJ & Brubaker MA cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017). [DOI] [PubMed] [Google Scholar]
- 31.Sobti M. et al. Cryo-EM structures provide insight into how E. coli F1Fo ATP synthase accommodates symmetry mismatch. Nat. Commun 11, 2615 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Majewski DD et al. Cryo-EM structure of the homohexameric T3SS ATPase-central stalk complex reveals rotary ATPase-like asymmetry. Nat. Commun 10, 626 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Sahtoe DD et al. Reconfigurable asymmetric protein assemblies through implicit negative design. 10.1101/2021.08.15.456388 (2021) doi: 10.1101/2021.08.15.456388. [DOI] [PMC free article] [PubMed]
- 34.Crick FHC The Fourier transform of a coiled-coil. Acta Crystallogr. 6, 685–689 (1953). [Google Scholar]
- 35.Grigoryan G & DeGrado WF Probing Designability via a Generalized Model of Helical Bundle Geometry. J. Mol. Biol 405, 1079–1100 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Fallas JA et al. Computational design of self-assembling cyclic protein homo-oligomers. Nat. Chem 9, 353–360 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Huang P-S et al. RosettaRemodel: A Generalized Framework for Flexible Backbone Protein Design. PLoS ONE 6, e24109 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Correnti CE et al. Engineering and functionalization of large circular tandem repeat protein nanoparticles. Nat. Struct. Mol. Biol 27, 342–350 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Case DA, Aktulga HM, Belfon K, Ben-Shalom IY, Brozell SR, Cerutti DS, Cheatham TE III, Cruzeiro VWD, Darden TA, Duke RE, Giambasu G, Gilson MK, Gohlke H, Goetz AW, Harris R, Izadi S, Izmailov SA, Jin C, Kasavajhala K, Kaymak MC, King E, Kovalenko A, Kurtzman T, Lee TS, LeGrand S, Li P, Lin C, Liu J, Luchko T, Luo R, Machado M, Man V, Manathunga M, Merz KM, Miao Y, Mikhailovskii O, Monard G, Nguyen H, O’Hearn KA, Onufriev A, Pan F, Pantano S, Qi R, Rahnamoun A, Roe DR, Roitberg A, Sagui C, Schott-Verdugo S, Shen J, Simmerling CL, Skrynnikov NR, Smith J, Swails J, Walker RC, Wang J, Wei H, Wolf RM, Wu X, Xue Y, York DM, Zhao S, and Kollman PA. Amber 2018. (University of California; ). [Google Scholar]
- 40.Tian C. et al. ff19SB: Amino-Acid-Specific Protein Backbone Parameters Trained against Quantum Mechanics Energy Surfaces in Solution. J. Chem. Theory Comput 16, 528–552 (2020). [DOI] [PubMed] [Google Scholar]
- 41.Roe DR & Brooks BR A protocol for preparing explicitly solvated systems for stable molecular dynamics simulations. J. Chem. Phys 153, 054123 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Roe DR & Cheatham TE PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. J. Chem. Theory Comput 9, 3084–3095 (2013). [DOI] [PubMed] [Google Scholar]
- 43.Virtanen P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Studier FW Protein production by auto-induction in high-density shaking cultures. Protein Expr. Purif 41, 207–234 (2005). [DOI] [PubMed] [Google Scholar]
- 45.Förster S, Apostol L & Bras W Scatter : software for the analysis of nano- and mesoscale small-angle scattering. J. Appl. Crystallogr 43, 639–646 (2010). [Google Scholar]
- 46.Schneidman-Duhovny D, Hammel M, Tainer JA & Sali A Accurate SAXS Profile Computation and its Assessment by Contrast Variation Experiments. Biophys. J 105, 962–974 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Piiadov Vassili, de Araujo Evandro A., Neto Mario O., Craievich Aldo F., Polikarpov Igor. SAXSMoW 2.0: Online calculator of the molecular weight of proteins in dilute solution from experimental SAXS data measured on a relative scale. Prot.Sci, (2018), DOI: 10.1002/pro.3528 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Nannenga BL, Iadanza MG, Vollmar BS & Gonen T Overview of Electron Crystallography of Membrane Proteins: Crystallization and Screening Strategies Using Negative Stain Electron Microscopy. Curr. Protoc. Protein Sci 72, 17.15.1–17.15.11 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Grant T, Rohou A & Grigorieff N cisTEM, user-friendly software for single-particle image processing. eLife 7, e35383 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Carragher B. et al. Leginon: An Automated System for Acquisition of Images from Vitreous Ice Specimens. J. Struct. Biol 132, 33–45 (2000). [DOI] [PubMed] [Google Scholar]
- 51.Tang G. et al. EMAN2: An extensible image processing suite for electron microscopy. J. Struct. Biol 157, 38–46 (2007). [DOI] [PubMed] [Google Scholar]
- 52.Zheng SQ et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Rohou A & Grigorieff N CTFFIND4: Fast and accurate defocus estimation from electron micrographs. J. Struct. Biol 192, 216–221 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Pettersen EF et al. UCSF Chimera?A visualization system for exploratory research and analysis. Comput. Chem 25, 1605–1612 (2004). [DOI] [PubMed] [Google Scholar]
- 55.Afonine PV et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr. Sect. Struct. Biol 74, 531–544 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Aiyer Sriram et al. “Evaluating Local and Directional Resolution of Cryo-EM Density Maps.” Methods in molecular biology (Clifton, N.J.) vol. 2215 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.VanAernum ZL et al. Rapid online buffer exchange for screening of proteins, protein complexes and cell lysates by native mass spectrometry. Nat. Protoc 15, 1132–1157 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.VanAernum ZL et al. Surface-Induced Dissociation of Noncovalent Protein Complexes in an Extended Mass Range Orbitrap Mass Spectrometer. Anal. Chem 91, 3611–3618 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are available in the main text or the supplementary materials. All the EM maps have been deposited in the EMDB (accession codes EMD-25575, EMD-25576, EMD-25577, EMD-25578, EMD-25579, EMD-25580).