Abstract
Cyclic symmetry is frequent in protein and peptide homo‐oligomers, but extremely rare within a single chain, as it is not compatible with free N‐ and C‐termini. Here we describe the computational design of mixed‐chirality peptide macrocycles with rigid structures that feature internal cyclic symmetries or improper rotational symmetries inaccessible to natural proteins. Crystal structures of three C2‐ and C3‐symmetric macrocycles, and of six diverse S2‐symmetric macrocycles, match the computationally‐designed models with backbone heavy‐atom RMSD values of 1 Å or better. Crystal structures of an S4‐symmetric macrocycle (consisting of a sequence and structure segment mirrored at each of three successive repeats) designed to bind zinc reveal a large‐scale zinc‐driven conformational change from an S4‐symmetric apo‐state to a nearly inverted S4‐symmetric holo‐state almost identical to the design model. These symmetric structures provide promising starting points for applications ranging from design of cyclic peptide based metal organic frameworks to creation of high affinity binders of symmetric protein homo‐oligomers. More generally, this work demonstrates the power of computational design for exploring symmetries and structures not found in nature, and for creating synthetic switchable systems.
Keywords: chelation, computational design, de novo design, non‐canonical amino acids, peptides, protein folding, racemic crystallography, symmetry
Short abstract
PDB Code(s): 6UFU, 6UG2, 6UG3, 6UG6, 6UGB, 6UGC, 6UCX, 6UD9, 6UDR, 6UDW, 6UDZ, 6UF4, 6UF7, 6UF8, 6UFA and 6UF9;
1. INTRODUCTION
Symmetric protein quaternary structures, which are built from many identical copies of a single protein chain, play key functional and structural roles in biology and have advantages as building blocks for nanomaterials since a small number of repeated interfaces can generate large and complex assemblies. 1 , 2 , 3 , 4 , 5 Such structures have also proved useful for engineering high‐avidity binders to symmetry‐matched targets of therapeutic interest. 6 Macromolecules with internal symmetry in their sequences and tertiary structures would be particularly advantageous for both nanomaterial and therapeutic development, since the complexity of dealing with multiple chains can be avoided. Metal–organic frameworks (MOFs), for example, are often built from internally‐symmetric small‐molecule chelators. 7 , 8 , 9 While there are many small molecules with internal symmetry, natural proteins and peptides with internal symmetry are extremely rare, at least in part because true internal symmetry requires that the chain be circular, with no free N‐or C‐terminus. Perhaps the best‐studied natural example of an internally‐symmetric peptide macrocycle is gramicidin S, a 10‐residue antimicrobial agent with C2 symmetry. 10 , 11 Small synthetic peptide macrocycles with simple, patterned repeating sequences have been found to give rise to C2 or C3 structural symmetries, 12 , 13 , 14 but to our knowledge there has been no systematic effort to explore the space spanned by such structures. We and other groups have designed asymmetric linear and macrocyclic foldamers, 15 , 16 , 17 , 18 , 19 but the peptide design field currently lacks general computational methods for designing symmetric molecules.
We set out to develop general methods for computationally designing internally‐symmetric peptide macrocycles with conformational rigidity imparted by large energy gaps between a symmetric ground state and all alternative conformations. To this end, we incorporated methods for sampling and designing with internal cyclic or improper rotational symmetries into the Rosetta heteropolymer design software. 18 , 19 , 20 , 21 , 22 Our sampling methods use robotics‐inspired kinematic closure methods 18 , 23 , 24 to provide analytical solutions for dihedral values yielding closed macrocycle conformations. Dihedral angles in all asymmetric units of the macrocycle (henceforth referred to as “lobes”) are required to match those in the first “reference” lobe to within a certain tolerance in the cyclic symmetric case, and after inversion in the case of improper rotational symmetries. Subsequent symmetric sequence design algorithms ensure that residue identities and conformations in neighboring lobes match (for cyclic symmetry) or match with chirality inversion and inversion of dihedral values (for improper rotational symmetry). We apply the methods to the design of peptide macrocycles with cyclic symmetries (C2 or C3) and improper S2 and S4 rotational symmetries that are inaccessible to homochiral proteins, but which can be accessed by heterochiral peptides.
2. RESULTS
2.1. Structures of designed peptide macrocycles with C2 and C3 symmetry
We developed a computational pipeline, summarized in the methods and in Figure S1, for designing internally‐symmetric peptide macrocycles. This pipeline involved steps of symmetric backbone conformational sampling, clustering, symmetric sequence design, filtration to discard designs with undesirable features, and final computational validation by large‐scale conformational sampling. We first applied this pipeline to create macrocycles with C2 and C3 symmetry, with asymmetric units ranging from 3 to 5 residues. To assess the completeness of our sampling of cyclic peptide conformational space, we defined four backbone dihedral bins (A, B, X, and Y), with A and B representing right‐handed helical and strand regions of Ramachandran space, respectively, and X and Y representing the mirror image bins (see Data S1). We compared the number of bin sequences (AABAYYXY, for example) sampled to the total number of unique bin patterns possible for each peptide size and symmetry type designed, with the latter determined analytically using Burnside's lemma 25 (see Section 2.1.3 of Data S1). Where possible, we also assessed whether the solution space for designs was fully sampled by examining the set of low‐energy structures obtained to determine whether both members of mirror‐related bin strings were represented, since the occurrence of just one member of such isoenergetic pairs indicates incomplete sampling of the solution space. We were able to sample with many‐fold redundancy over all bin patterns possible for peptides up to 24 residues in length, and found only a small subset for each length that were compatible with chain closure for each symmetry (Table 1). The mirror test suggested that the identified conformations completely cover the space of possibilities for C2‐symmetric macrocycles with up to eight residues, and C3‐symmetric macrocycles up to 18 residues. For higher‐order symmetries, such as C5, complete sampling by the mirror test was possible up to 30 residues (Table S1).
TABLE 1.
Symmetry type a | Size (residues) | Max. bin strings possible | Bin strings observed | Conformational clusters observed b |
---|---|---|---|---|
C2 | 6 | 24 | 8 | 3 |
8 | 70 | 13 | 6 | |
10 | 208 | 57 | 25 | |
12 | 700 | 15 | 16 | |
14 | 2,344 | 108 | 96 | |
16 | 8,230 | 1,122 | 898 | |
18 | 29,144 | 2,171 | 1,770 | |
20 | 104,968 | 2,585 | 2,108 | |
C3 | 6 | 10 | 1 | 1 |
9 | 24 | 4 | 2 | |
12 | 70 | 3 | 3 | |
15 | 208 | 4 | 4 | |
18 | 700 | 3 | 3 | |
21 | 2,344 | 14 | 11 | |
24 | 8,230 | 20 | 19 | |
S2 | 6 | 12 | 1 | 1 |
8 | 32 | 2 | 1 | |
10 | 104 | 9 | 6 | |
12 | 344 | 4 | 4 | |
14 | 1,172 | 173 | 85 | |
16 | 4,096 | 294 | 162 | |
18 | 14,572 | 469 | 303 | |
20 | 52,432 | 707 | 464 | |
S4 | 8 | 4 | 2 | 1 |
12 | 12 | 1 | 1 | |
16 | 32 | 7 | 7 | |
20 | 104 | 22 | 19 | |
24 | 344 | 18 | 17 | |
28 | 1,172 | 27 | 22 | |
32 | 4,096 | 13 | 12 |
We carried out large‐scale conformational sampling on each of the designed sequences, and selected for chemical synthesis peptide macrocycles with large energy gaps between the designed conformation and all alternative states. We began by synthesizing one C2‐symmetric and five C3‐symmetric peptide macrocycles and their mirror‐image enantiomorphs to facilitate the crystallization of the synthesized molecules from racemic mixtures. 26 With this approach we succeeded in crystallizing the C2‐symmetric peptide and three of the five C3‐symmetric peptides (Figure 1). The observed conformation of the C2 symmetric peptide (designated C2‐1) matched its designed conformation within a backbone heavy‐atom RMSD of 1 Å, but with notable deviation in the ψ dihedral angle of residue dSER1 and the ϕ dihedral angles of residues dGLU2 and AIB8, which inverted the dSER1‐dGLU2 amide bond, allowing it to form a new hydrogen bond with dGLU7. Despite this local structural change, the overall fold of the peptide was largely preserved. This is the first computationally‐designed macrocycle of which we are aware that uses 2‐aminoisobutyric acid (AIB), a non‐canonical, conformationally‐constrained amino acid, instead of proline to ensure conformational rigidity. Only one enantiomer was observed in this crystal structure, but an alternative crystal form was identified that incorporates both enantiomers, albeit with a distorted conformation (see Figure S2). Table S2 shows sequences, crystallization conditions, space groups, and backbone heavy‐atom RMSD values for all peptides.
Two of the three C3‐symmetric peptides that we succeeded in crystallizing closely matched the design models (Figure 1). Design C3‐1 adopted a conformation different from the design (Figure S3) likely due to three buried polar hydrogen atoms that form hydrogen bonds with carbonyl oxygens in adjacent peptide molecules. 27 , 28 Recognizing this to be an undesirable structural feature that was poorly penalized by our automated scoring function, from this point forward we filtered designs, selecting only those designs lacking unsatisfied buried polar groups. Crystal structures of peptides C3‐2 and C3‐3, which lack buried unsatisfied polar atoms, closely matched their designs, with backbone heavy‐atom RMSD values of 0.5 Å and 0.3 Å, respectively. The two have a common fold stabilized by three proline residues and three backbone hydrogen bonds.
2.2. Structures of designed peptide macrocycles with S2 symmetry
We next explored symmetries inaccessible to natural proteins. Unlike proteins built only from L‐amino acids, peptides built from mixtures of D‐ and L‐amino acids can access symmetries involving mirror operations, such as improper rotational symmetries. Using the same symmetric sampling, clustering, and sequence optimization strategy, we designed and synthesized a panel of 6 peptide macrocycles with S2 symmetry, ranging in size from 8 to 12 amino acids (designs S2‐1 through S2‐6). These peptides have a sequence that repeats twice, with the chirality of residues in the second lobe inverted relative to the first, yielding a second lobe with a conformation mirroring that of the first. Six designs were selected for synthesis representing a diverse range of backbone conformations and hydrogen bonding patterns. We were able to crystallize all six of these peptides, and to determine their structures by direct phasing methods.
In all cases the observed conformation closely matched the design (overlays in Figure 2, column f), with a maximum backbone heavy‐atom RMSD of 0.6 Å (peptide S2‐3) and a minimum of 0.4 Å (peptide S2‐1). The folds of these peptides are all stabilized by d‐ and l‐proline residues and backbone hydrogen bonds (Figure 2). The designed conformation of peptide S2‐6, a 12‐residue peptide with sequence aNkhPeAnKHpE (where lowercase and uppercase letters represent d‐ and l‐amino acid residues, respectively), has a crimped conformation remarkably forming eight backbone hydrogen bonds, of which six were preserved in the crystal structure. The remaining two were lost to a slight rotation of the amide bond between dASN8 and LYS9, which positioned the donor and acceptor groups where they could instead make hydrogen bonds to water. The crystal structure deviated from the design by a backbone heavy‐atom RMSD of only 0.4 Å. In all other cases, the designed backbone hydrogen bonding patterns were preserved, with two exceptions: a subtle relaxation of the backbones of peptides S2‐4 and S2‐5 replaced two direct backbone‐backbone hydrogen bonds in the designs with bridging water molecules (Figure 2d,e).
2.3. Metal‐induced conformational switching in a designed S4‐symmetric polypeptide macrocycle
We next explored the possibility of using the new design methods to create conformational switches. We sampled S4‐symmetric polypeptide conformations with four repeats in which alternating lobes had opposite chirality, and developed a computational strategy to select backbones that could present metal‐binding side‐chains for tetrahedral coordination of a central metal ion (see Data S1). We designed sequences with l‐ and d‐histidine residues positioned to chelate the metal ion, AIB and other residues to stabilize the fold, and apolar side‐chains on the surface to stabilize an alternative, inside‐out conformation in the absence of metal. The sequences of these designs repeat four times, with residues in the second and fourth lobes possessing chirality and conformations inverted relative to equivalent residues in the first and third lobes. We carried out large‐scale conformational sampling to select designs with low‐energy designed conformations with the histidine side‐chains positioned to bind zinc. In several designs, we noted that large‐scale conformational sampling predicted a second energy minimum corresponding to an inside‐out fold with the apolar side‐chains in the core and the histidines exposed. We selected a single design for synthesis, S4‐1, which has sequence KLqeXHklQEXhKLqeXHklQEXh, in which X represents AIB (Figure 3).
The 24‐amino acid polypeptide S4‐1 crystallized in space group . The backbone conformation in the crystal structure (which contained a single copy of the peptide in the asymmetric unit), matched the design with a backbone heavy‐atom RMSD of 0.3 Å, with a central metal ion (believed to be zinc) very close to that in the design model. The central metal‐coordinating l‐ and d‐histidine side‐chains matched the design with a side‐chain heavy‐atom RMSD of 0.1 Å. The zinc affinity of S4‐1 was 0.32 nM, as measured by competition with the colorimetric chelator 4‐(2‐pyridylazo)resorcinol (PAR) (Figure 3c and Data S1).
The central metal ion plays an important structural role in holo‐S4‐1, stabilizing a conformation that presents four apolar d‐ and l‐leucine side‐chains to aqueous solvent (Figure 3a,d,e,k,l), and as noted above large‐scale conformational sampling runs predicted an alternative conformation with these groups buried in the absence of the metal ion (Figure 3f,g). To explore this possibility, we solved the structure of the apo‐polypeptide. Without zinc, the polypeptide crystallized in space group P21212, and did indeed adopt a very different conformation from the holo‐structure, packing apolar d‐ and l‐leucine side‐chains against AIB residues in the core, and projecting the d‐ and l‐histidine side‐chains that previously coordinated zinc outward (Figure 3h,k,l). The observed apo‐state conformation matches the predicted alternative state to a backbone heavy‐atom RMSD of 1 Å (Figure 3i).
3. DISCUSSION
The 10 new peptide and polypeptide macrocycles presented here have diverse, rigid backbone folds closely matching the design models, in nearly all cases with sub‐Ångstrom accuracy. The structures were designed in four different symmetry classes (C2, C3, S2, and S4), the latter two of which are inaccessible to proteins or other natural macromolecules built from building‐blocks of only one handedness. Likely because of their symmetry, the success rate of the designs was quite high, both in terms of crystallization of the designed peptides (11 of 13) and their close match to predicted structures (10 of 11).
3.1. Expanding the set of usable chemical building‐blocks for computational design
Past design efforts aimed at producing folding macrocycles focused on peptides built from canonical L‐amino acids and their mirror‐image D‐amino acid counterparts. 18 , 19 Although large proteins are stabilized primarily by the formation of a core of well‐packed apolar side‐chains, smaller peptides lack large buried volumes, and tend to be stabilized by intrinsic conformational preferences of amino acid residues. Proline is the most conformationally‐constrained canonical amino acid, and has been essential to the success of past designs; however, proline cannot be used in all contexts. Since the proline side‐chain connects to the backbone nitrogen, eliminating the backbone amide hydrogen, proline cannot serve as a backbone hydrogen bond donor. Moreover, the proline delta carbon alters the conformational preferences of the preceding residue. And of course there are configurations in which the proline side‐chain can clash with other side‐chains. It is therefore convenient to have additional conformationally‐constrained amino acids from which to choose when designing. By adding support for designing with AIB, we were able to include an achiral yet conformationally‐constrained residue with narrow preferences for the regions of Ramachandran space favored by both l‐ and d‐proline, but which is nonetheless able to form backbone hydrogen bonds. This permitted the AIB residues in peptide C2‐1 to donate hydrogen bonds to serine side‐chains, as well as the incorporation of AIB in the holo‐S4‐1 design in the midst of a 310 helix, a secondary structure that would have been disrupted by proline. In the apo state, the AIB side‐chains were able to pack closely with nearby leucine side‐chains, an arrangement that would have resulted in side‐chain clashes had proline been substituted. We anticipate that the ability to expand the set of conformationally‐constrained amino acids for computational design will open new opportunities for stabilizing structures that cannot be stabilized by canonical amino acids or their mirror images alone.
3.2. Near‐exhaustive exploration of conformational space for larger molecules
The design of rigidly‐folded heteropolymers requires efficient means of sampling backbone conformations, both to identify conformational states compatible with a given function that can be stabilized by a suitable choice of sequence, and to validate designed sequences by exploring possible alternative low‐energy conformational states. This is particularly challenging when designing with non‐canonical amino acid residues, requiring unbiased sampling methods that are not reliant on known structures. Because N‐fold symmetric molecules have far fewer (1/N) conformational degrees of freedom than similarly sized asymmetric molecules, the new methods make comprehensive sampling tractable for much larger systems. By focusing on internally‐symmetric macrocycles, we were able to achieve exhaustive or near‐exhaustive coverage of the conformation spaces for peptides with up to 30 amino acids for the highest‐order symmetries (Table S1)—a size range well beyond that which can be sampled exhaustively for asymmetric macrocycles. Since many applications of designed, well‐folded heteropolymers require molecules that are able to present large binding interfaces (e.g., for nanomaterial self‐assembly or therapeutic target binding), or molecules large enough to possess internal binding pockets (e.g., for small‐molecule binding or catalysis), we anticipate that our computational methods for designing larger symmetric structures with non‐canonical building blocks will have broad applicability.
3.3. Design of metal‐dependent peptide conformational dynamics
Our most complex design, the 24‐residue S4‐1 polypeptide, binds zinc with sub‐nanomolar affinity, and undergoes a major conformational change in the presence of zinc. Both the apo and holo states are well‐structured, with the former burying apolar side‐chains and the latter exposing them (Figure 3). Engineered switching behavior could ultimately be used to create cell‐permeable molecules that could serve as drugs. Similar conformational switching behavior is observed in the natural peptide macrocycle cyclosporine A, and is thought to underlie this drug's cell permeability: in the lipid bilayer, cyclosporine is able to adopt a conformation in which all hydrogen bond donors and acceptors are internally satisfied, and apolar groups are lipid‐exposed, promoting its lipid‐solubility, while in an aqueous environment, the molecule changes conformation to expose its polar groups to water, promoting water‐solubility. 29 , 30 The ability to control this switching behavior with a metal ion like zinc could also provide a means of ensuring that the permeation is unidirectional: while extracellular free zinc concentrations are in the nanomolar to micromolar range, 31 high‐affinity zinc binding within the cell lowers the free zinc concentration to picomolar levels. 32 Achieving cellular permeability would likely require further design to ensure that the apolar‐exposed state internally satisfies all hydrogen bond donors and acceptors. A future extension of our approach would be to engineer a symmetric macrocycle drug able to undergo a conformational change to pass through a membrane and to present a large, symmetric surface to bind with high affinity and specificity to a symmetric interface on a homooligomeric intracellular target protein.
3.4. Concluding remarks
We have presented general methods for computational design and validation of symmetric, well‐folded polypeptide macrocycles, including those incorporating metals as structural elements, and have demonstrated robust ability to control structure with sub‐Ångstrom accuracy in laboratory experiments, culminating in an engineered, metal‐dependent conformational switch. The ability to design symmetric, well‐folded polypeptide macrocycles opens up new avenues for both therapeutic design and for bounded and unbounded nanomaterial design, and shows that methods originally developed for protein design can now be used to robustly design molecules quite unlike those that exist in nature.
4. MATERIALS AND METHODS
4.1. Modifications to the Rosetta software suite
Extensive modifications to the Rosetta software suite enabled the design of internally‐symmetric peptide macrocycles, including those able to coordinate a central metal ion. New Rosetta modules were implemented to be compatible with both the PyRosetta and RosettaScripts scripting languages, 33 , 34 allowing their use in the development of future, application‐specific design protocols. Rosetta source code and compiled executables are provided free of charge to academic, government, and not‐for‐profit users through http://www.rosettacommons.org, and the software is licenced to commercial users for a fee. Computational developments are summarized here and described in detail in Data S1.
The Rosetta symmetry code 35 was refactored to add support for mirror operations and improper rotational symmetries, and to correctly interconvert between mirrored amino acid types. Rosetta's simple_cycpep_predict and energy_based_clustering applications, both described previously, 18 , 19 were enhanced to allow sampling and clustering of quasi‐symmetric backbones with a given symmetry (where a quasi‐symmetric backbone is one that is nearly symmetric, but in which small deviations from perfect symmetry are allowed). A Rosetta module (“mover”) for converting quasi‐symmetric structures to fully symmetric structures, called the SymmetricCycpepAlignMover, was added. The interface and internal handling of non‐canonical amino acids during design was greatly reworked, with the user‐controlled PackerPalette introduced to control the set of chemical building‐blocks used for a given design task, permitting deprecation of many problematic idiosyncrasies present in the previous interface to streamline the design process.
To enable the design of metal‐binding peptides, Rosetta's CrosslinkerMover was enhanced with support for a range of metal coordination geometries, with support for asymmetric structures or for the symmetry classes compatible with a given coordination geometry. For example, this mover allows the design of a tetrahedrally‐coordinated zinc in an asymmetric structure or in a C2‐ or S4‐symmetric structure, with suitable repetition of conformations and amino acid identities of the liganding residues.
4.2. Computational design protocol
To design symmetric peptides, we first sampled quasi‐symmetric mainchain conformations using the simple_cycpep_predict application, and enumerated conformations with the energy_based_clustering application. In the case of the S4‐1 polypeptide, this step was modified to sample only those backbone conformations capable of coordinating a central metal ion. Next, with scripts written in the RosettaScripts scripting language, we converted quasi‐symmetric cluster centers to fully symmetric structures, and carried out sequence design with Rosetta's symmetric design algorithms. Finally, we validated each designed sequence by large‐scale conformational sampling, again using the simple_cycpep_predict application, to identify those designed sequences that uniquely favored the designed conformation. Computations were carried out on the University of Washington Hyak cluster, the Argonne National Laboratory Mira and Theta supercomputers, and the Simons Foundation Gordon and Iron clusters. Additionally, some validations were carried out using the Rosetta@Home distributed computing platform, which uses volunteer computers, cellular telephones, and mobile devices through the Berkeley Open Infrastructure for Network Computing (BOINC). 36 Full details of each step, sample command‐lines, and representative RosettaScripts scripts are provided in Data S1.
4.3. Solid‐phase peptide synthesis and purification
Peptides were synthesized using standard Fmoc solid‐phase peptide synthesis techniques using a CEM Liberty Blue peptide synthesizer with microwave‐heated coupling and deprotection steps. Peptides with 12 amino acids or fewer that contained l‐aspartate or l‐glutamate were synthesized tethered by the acidic side‐chain to preloaded Fmoc‐l‐Asp(Wang resin LL)‐ODmab or Fmoc‐l‐Glu(Wang resin LL)‐ODmab resin, and were cyclized on‐bead by a coupling reaction following deprotection of the C‐terminus with 2% (vol./vol.) hydrazine monohydrate treatment in dimethylformamide (DMF). Larger peptides were synthesized with the C‐terminus coupled to Cl‐TCP(Cl) resin from CEM, cleaved from the resin with 1% (vol./vol.) TFA treatment in dicholoromethane (DCM), and cyclized by a solution‐phase coupling reaction prior to final deprotection. Peptides were purified by reverse‐phase HPLC with a water‐acetonitrile gradient, lyophilized, and redissolved in buffer suitable for subsequent experiments (typically 100 mM HEPES, pH 7.5). Masses and purities were assessed by electrospray ionization mass spectrometry with a Thermo Scientific TSQ Quantum Access mass spectrometer. Full synthetic and purification protocols are described in Data S1.
4.4. X‐ray crystallography
Peptides were crystallized by hanging droplet vapour diffusion, with pH, buffer, ionic strength, and precipitants all optimized for each peptide. Growth conditions for the crystals of each peptide are described in Data S1. Diffraction data were collected at the Argonne National Laboratory Advanced Photon Source (APS) beamlines 24‐ID‐C and 24‐ID‐E.
4.5. Metal‐binding assays
To confirm zinc content of the S4‐1 and S4‐2 peptides, and to measure zinc affinity, we used a variant of the 4‐(2‐pyridylazo)resorcinol (PAR) assay described previously. 37 , 38 , 39 We carried this assay out in 96‐well plates (200 μL total solution volume per well). To confirm metal content, 10 μM polypeptide was denatured in 6 M guanidinium hydrochloride (Sigma‐Aldrich, St. Louis, MO), 100 mM HEPES, pH 7.5, and 200 μM PAR, and the change in absorbance at 490 nm was monitored using a SpectraMAX M5e plate reader (Molecular Devices, San Jose, CA). Standard curves were prepared with ZnCl2 to convert absorbance changes to zinc concentrations. The metal affinity of the S4‐1 and S4‐2 polypeptides was measured by competition with PAR, given the known dissociation constants of the PAR2‐Zn complex. Full protocols and mathematical details for both assays are provided in Data S1.
AUTHOR CONTRIBUTIONS
Christine Kang: Data curation; formal analysis; investigation; methodology; writing‐original draft; writing‐review and editing. Michael R. Sawaya: Data curation; formal analysis; methodology; validation; visualization; writing‐original draft; writing‐review and editing. Stephen Rettie: Formal analysis; methodology; resources; validation; writing‐original draft; writing‐review and editing. Xinting Li: Formal analysis; methodology; resources; validation; writing‐review and editing. Inna Antselovich: Formal analysis; methodology; validation; writing‐review and editing. Timothy Craven: Software; writing‐review and editing. Andrew Watkins: Methodology; software; writing‐review and editing. Jason Labonte: Software; writing‐review and editing. Frank DiMaio: Methodology; software; writing‐review and editing. Todd Yeates: Conceptualization; formal analysis; funding acquisition; methodology; project administration; resources; supervision; validation; writing‐original draft; writing‐review and editing. VK Mulligan: Conceptualization; Data curation; Software; Formal analysis; Investigation; Methodology; Validation; Visualization; Writing‐original draft; Writing‐review and editing. D. Bakar: Conceptualization; Methodology; Funding acquisition; Project administration; Resources; Supervision; Writing‐original draft; Writing‐review and editing.
CONFLICT OF INTEREST
V. K. M. is a co‐founder of Menten AI, in which he holds equity. The other authors report no conflicts of interest.
Supporting information
ACKNOWLEDGEMENTS
This work was supported by the Howard Hughes Medical Institute (D. B.), Washington Research Foundation (T. W. C.), Open Philanthropy (D. B.), DTRA grant HDTRA1‐19‐1‐0003 (D.B.), and the Simons Foundation (V. K. M.). An award of computer time was provided to D. B., F. D., and V. K. M. by the Innovative and Novel Computational Impact on Theory and Experiment (INCITE) program. This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE‐AC02‐06CH11357. Crystallographic diffraction data were collected at the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE‐AC02‐06CH11357. Specifically, we used the Northeastern Collaborative Access Team beamlines, which are funded by the National Institute of General Medical Sciences from the National Institutes of Health (P30 GM124165). The Pilatus 6M detector on the 24‐ID‐C beam line is funded by a NIH‐ORIP HEI grant (S10 RR029205). We thank Michael Collazo and Duilio Cascio of the UCLA‐DOE X‐ray Crystallography and Structure Determination Core (supported by U.S. Department of Energy award DE‐FC02‐02ER63421) for their assistance with crystallization and data collection. We thank Swetha Sundaram for assistance in crystal mounting. We also thank Andrew Leaver‐Fay, Sergey Lyskov, and Rocco Moretti for Rosetta support, and Alexander Ford for fruitful algorithmic discussions. Conformational sampling was performed with the Rosetta simple_cycpep_prediction application and peptide design was carried out with the rosetta_scripts application. Modifications to both to make this work possible are now included in public releases of the Rosetta software suite, which is available free of charge to academic, not‐for‐profit, and government users, and can be downloaded from http://www.rosettacommons.org. Instructions and inputs for running these applications, and all other materials necessary to support the results and conclusions, are in Data S1. All X‐ray crystal structures have been deposited in the Protein Data Bank (peptide C2‐1: PDB IDs 6UFU and 6UG2; peptide C3‐1: PDB IDs 6UG3 and 6UG6; peptide C3‐2: PDB ID 6UGB; peptide C3‐3: PDB ID 6UGC; peptide S2‐1: PDB ID 6UCX; peptide S2‐2: PDB ID 6UD9; peptide S2‐3: PDB IDs 6UDR and 6UDW; peptide S2‐4: PDB IDs 6UDZ and 6UF4; peptide S2‐5: PDB ID 6UF7; peptide S2‐6: PDB ID 6UF8; peptide S4‐1(holo): PDB ID 6UFA; peptide S4‐1(apo): PDB ID 6UF9).
Mulligan VK, Kang CS, Sawaya MR, et al. Computational design of mixed chirality peptide macrocycles with internal symmetry. Protein Science. 2020;29:2433–2445. 10.1002/pro.3974
Funding information Defense Threat Reduction Agency, Grant/Award Number: HDTRA1‐19‐1‐0003; Open Philanthropy; National Institutes of Health; National Institute of General Medical Sciences; Argonne National Laboratory; Office of Science, Grant/Award Number: AC02‐06CH11357; U.S. Department of Energy, Grant/Award Number: FC02‐02ER63421; Simons Foundation; Washington Research Foundation; Howard Hughes Medical Institute
REFERENCES
- 1. Yeates TO. Geometric principles for designing highly symmetric self‐assembling protein nanomaterials. Annu Rev Biophys. 2017;46:23–42. [DOI] [PubMed] [Google Scholar]
- 2. King NP, Sheffler W, Sawaya MR, et al. Computational design of self‐assembling protein nanomaterials with atomic level accuracy. Science. 2012;336:1171–1174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. King NP, Bale JB, Sheffler W, et al. Accurate design of co‐assembling multi‐component protein nanomaterials. Nature. 2014;510:103–108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Gonen S, DiMaio F, Gonen T, Baker D. Design of ordered two‐dimensional arrays mediated by noncovalent protein‐protein interfaces. Science. 2015;348:1365–1368. [DOI] [PubMed] [Google Scholar]
- 5. Hsia Y, Bale JB, Gonen S, et al. Design of a hyperstable 60‐subunit protein icosahedron. Nature. 2016;535:136–139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Strauch E‐M, Bernard SM, La D, et al. Computational design of trimeric influenza‐neutralizing proteins targeting the hemagglutinin receptor binding site. Nat Biotechnol. 2017;35:667–671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Férey G, Mellot‐Draznieks C, Serre C, et al. A chromium terephthalate‐based solid with unusually large pore volumes and surface area. Science. 2005;309:2040–2042. [DOI] [PubMed] [Google Scholar]
- 8. Furukawa H, Ko N, Go YB, et al. Ultrahigh porosity in metal‐organic frameworks. Science. 2010;329:424–428. [DOI] [PubMed] [Google Scholar]
- 9. Seo JS, Whang D, Lee H, et al. A homochiral metal–organic porous material for enantioselective separation and catalysis. Nature. 2000;404:982–986. [DOI] [PubMed] [Google Scholar]
- 10. Conti E, Stachelhaus T, Marahiel MA, Brick P. Structural basis for the activation of phenylalanine in the non‐ribosomal biosynthesis of gramicidin S. EMBO J. 1997;16:4174–4183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Hodgkin DC, Oughton BM. Possible molecular models for gramicidin S and their relationship to present ideas of protein structure. Biochem J. 1957;65:752–756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Mandal D, Nasrolahi Shirazi A, Parang K. Self‐assembly of peptides to nanostructures. Organ Biomol Chem. 2014;12:3544–3561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. van Maarseveen JH, Horne WS, Ghadiri MR. Efficient route to C2 symmetric heterocyclic backbone modified cyclic peptides. Org Lett. 2005;7:4503–4506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Pintér Á, Haberhauer G. Synthesis of chiral threefold and sixfold functionalized macrocyclic imidazole peptides. Tetrahedron. 2009;65:2217–2225. [Google Scholar]
- 15. Horne WS, Gellman SH. Foldamers with heterogeneous backbones. Acc Chem Res. 2008;41:1399–1408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Giuliano MW, Horne WS, Gellman SH. An alpha/beta‐peptide helix bundle with a pure beta3‐amino acid core and a distinctive quaternary structure. J Am Chem Soc. 2009;131:9860–9861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Slough DP, McHugh SM, Cummings AE, et al. Designing well‐structured cyclic pentapeptides based on sequence‐structure relationships. J Phys Chem B. 2018;122:3908–3919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Bhardwaj G, Mulligan VK, Bahl CD, et al. Accurate de novo design of hyperstable constrained peptides. Nature. 2016;538:329–335. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Hosseinzadeh P, Bhardwaj G, Mulligan VK, et al. Comprehensive computational design of ordered peptide macrocycles. Science. 2017;358:1461–1466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Leaver‐Fay A, Tyka M, Lewis SM, et al. ROSETTA3: An object‐oriented software suite for the simulation and design of macromolecules. Methods Enzymol. 2011;487:545–574. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Dang B, Wu H, Mulligan VK, et al. De novo design of covalently constrained mesosize protein scaffolds with unique tertiary structures. Proc Natl Acad Sci U S A. 2017;114:10852–10857. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Mulligan VK. The emerging role of computational design in peptide macrocycle drug discovery. Expert Opin Drug Discov. 2020;15:833–852. [DOI] [PubMed] [Google Scholar]
- 23. Coutsias EA, Seok C, Jacobson MP, Dill KA. A kinematic view of loop closure. J Comput Chem. 2004;25:510–528. [DOI] [PubMed] [Google Scholar]
- 24. Mandell DJ, Coutsias EA, Kortemme T. Sub‐angstrom accuracy in protein loop reconstruction by robotics‐inspired conformational sampling. Nat Methods. 2009;6:551–552. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Burnside W. On some properties of groups of odd order. Proc London Math Soc. 1900;33:162–184. [Google Scholar]
- 26. Yeates TO, Kent SBH. Racemic protein crystallography. Annu Rev Biophys. 2012;41:41–61. [DOI] [PubMed] [Google Scholar]
- 27. Ghadiri MR, Granja JR, Milligan RA, McRee DE, Khazanovich N. Self‐assembling organic nanotubes based on a cyclic peptide architecture. Nature. 1993;366:324–327. [DOI] [PubMed] [Google Scholar]
- 28. Ranganathan D, Lakshmi C, Haridas V, Gopikumar M. Designer cyclopeptides for self‐assembled tubular structures. Pure Appl Chem. 2000;72:365–372. [Google Scholar]
- 29. Bennett WM, Norman DJ. Action and toxicity of cyclosporine. Annu Rev Med. 1986;37:215–224. [DOI] [PubMed] [Google Scholar]
- 30. Witek J, Keller BG, Blatter M, Meissner A, Wagner T, Riniker S. Kinetic models of cyclosporin A in polar and apolar environments reveal multiple congruent conformational states. J Chem Inf Model. 2016;56:1547–1562. [DOI] [PubMed] [Google Scholar]
- 31. Frederickson CJ, Giblin LJ, Krężel A, et al. Concentrations of extracellular free zinc (pZn)e in the central nervous system during simple anesthetization, ischemia and reperfusion. Exp Neurol. 2006;198:285–293. [DOI] [PubMed] [Google Scholar]
- 32. Maret W. Zinc in cellular regulation: The nature and significance of “zinc signals”. IJMS. 2017;18:2285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Chaudhury S, Lyskov S, Gray JJ. PyRosetta: A script‐based interface for implementing molecular modeling algorithms using Rosetta. Bioinformatics. 2010;26:689–691. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Fleishman SJ, Leaver‐Fay A, Corn JE, et al. RosettaScripts: A scripting language interface to the Rosetta macromolecular modeling suite. PLoS ONE. 2011;6:e20161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. DiMaio F, Leaver‐Fay A, Bradley P, Baker D, André I. Modeling symmetric macromolecular structures in Rosetta3. PLoS ONE. 2011;6:e20450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Anderson DP. BOINC: A system for public‐resource computing and storage. In: Fifth IEEE/ACM international workshop on grid computing. Pittsburgh, PA: IEEE; 2004. pp. 4–10. Available from: http://ieeexplore.ieee.org/document/1382809/.
- 37. Hunt JB, Neece SH, Ginsburg A. The use of 4‐(2‐pyridylazo)resorcinol in studies of zinc release from Escherichia coli aspartate transcarbamoylase. Analyt Biochem. 1985;146:150–157. [DOI] [PubMed] [Google Scholar]
- 38. Crow JP, Sampson JB, Zhuang Y, Thompson JA, Beckman JS. Decreased zinc affinity of amyotrophic lateral sclerosis‐associated superoxide dismutase mutants leads to enhanced catalysis of tyrosine nitration by peroxynitrite. J Neurochem. 1997;69:1936–1944. [DOI] [PubMed] [Google Scholar]
- 39. Mulligan VK, Kerman A, Ho S, Chakrabartty A. Denaturational stress induces formation of zinc‐deficient monomers of cu,Zn superoxide dismutase: Implications for pathogenesis in amyotrophic lateral sclerosis. J Mol Biol. 2008;383:424–436. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.