Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 May 23.
Published in final edited form as: Acc Chem Res. 2019 Apr 1;52(5):1160–1167. doi: 10.1021/acs.accounts.9b00032

Noncoded Amino Acids in de Novo Metalloprotein Design: Controlling Coordination Number and Catalysis

Karl J Koebke 1, Vincent L Pecoraro 1,*
PMCID: PMC6533121  NIHMSID: NIHMS1021482  PMID: 30933479

CONSPECTUS:

The relationship between structure and function has long been one of the major points of investigation in Biophysics. Understanding how much, or how little, of a protein’s often complicated structure is necessary for its function can lead to directed therapeutic strategies and would allow one to design proteins for specific desired functions. Studying protein function by de novo design builds the functionality from the ground up in a completely unrelated and noncoded protein scaffold. Our lab has used this strategy to study heavy and transition metal binding within the TRI family of three stranded coiled coil (3SCC) constructs to understand coordination geometry and metalloenzyme catalytic control within a protein environment. These peptides contain hydrophobic layers within the interior of the 3SCC, which one can mutate to metal binding residues to create a minimal metal binding site, while solid phase synthesis allows our lab to easily incorporate a number of noncoded amino acids including D enantiomers of binding or secondary coordination sphere amino acids, penicillamine, or methylated versions of histidine. Our studies of Cd(II) binding to Cys3 environments have determined, largely through the use of 113Cd NMR and 111mCd PAC, that the coordination environment around a heavy metal can be controlled by incorporating noncoded amino acids in either the primary or secondary coordination spheres. We found mutating the metal binding amino acids to l-Pen can enforce trigonal Cd(II)S3 geometry exclusively compared to the mixed coordination determined for l-Cys coordination. The same result can be achieved with secondary sphere mutations as well by incorporating d-Leu above a Cys3. We hypothesize this latter effect is due to the increased steric packing above the metal binding site that occurs when the l-Leu oriented toward the N-terminus of the scaffold is mutated to d-Leu and oriented toward the C-terminus. Mutating the layer below Cys3 to d-Leu instead formed a mixed 4- and 5-coordinate Cd(II)S3(H2O) and Cd(II)S3(H2O)2 construct as steric bulk was decreased below the metal binding site. We have also applied noncoded amino acids to metalloenzyme systems by incorporating His residues that are methylated at the δ- or ε-nitrogen to enforce Cu(I) ligation to the opposite open nitrogen of His and found a 2 orders of magnitude increased catalytic efficiency for nitrite reductase activity with ε-nitrogen coordination compared to δ-nitrogen. These results exemplify the ability to tune coordination environment and catalytic efficiency within a de novo scaffold as well as the utility of noncoded amino acids to increase the chemist’s toolbox. By furthering our understanding of metalloprotein design one could envision, through our use of amino acids not normally available to nature, that protein design laboratories will soon be capable of outperforming the native systems previously used as their benchmark of successful design. The ability to design proteins at this level would have far reaching and exciting benefits within various fields including medical and industrial applications.

Graphical Abstract

graphic file with name nihms-1021482-f0001.jpg

INTRODUCTION

Understanding the relationship between protein structure and function is among the most important goals within Biophysics. Using de novo protein design, researchers can probe this relationship from a bottom-up approach by transplanting the function of one native protein into the tertiary structure of another previously unknown scaffold (unrelated to any natural sequence).13 It is estimated that 30−40% of proteins require a metal to function.4,5 De novo metalloprotein design allows one to determine the requirements to achieve a target metal coordination environment in a simplified protein scaffold. Our lab has focused on using three stranded coiled coils (3SCC) as scaffolds through the TRI system of peptides (Table 1).6 These scaffolds use a heptad repeat strategy with 3, 4, or 5 heptads of the sequence LKALEEK making up a single coil. Hydrophobic Leu residues in the a and d positions enforce the creation of a 3SCC. Mutating one or more of the Leu layers into the metal binding residues Cys or His allows us to create simplified binding sites for heavy or transition metals, respectively.

Table 1.

TRI Family Peptide Sequences Discussed in this Sectiona

peptide 1 2 9 16 23 30 37
TRI Ac-G LKALEEK LKALEEK LKALEEK LKALEEK G-NH2
TRIL16C Ac-G LKALEEK LKALEEK CKALEEK LKALEEK G-NH2
TRIL12AL16C Ac-G LKALEEK LKAAEEK CKALEEK LKALEEK G-NH2
TRIL16Pen Ac-G LKALEEK LKALEEK PenKAAEEK LKALEEK G-NH2
TRIL12HfLL16C Ac-G LKALEEK LKAHfLEEK CKALEEK LKALEEK G-NH2
a

The first residue within each heptad is labeled in the column heading with its overall position within the peptide.

One of Nature’s most impressive accomplishments is the breadth of functions that have been obtained utilizing a restrictive set of 20 commonly available amino acids and a pool of cofactors. Proteins designed in the lab, however, are under no such restriction, and several laboratories have shown the utility of noncoded amino acids to tune metal binding properties.7,8 Our lab benefits from the simplicity with which one can insert a noncoded amino acid through solid phase peptide synthesis. Thus, we have explored their use in several applications centered around either heavy metal coordination control913 or transition metal catalysis tuning.14 This Account will describe the tactics we have implemented using noncoded amino acids to understand coordination number and geometry control of Cd binding to sulfur rich proteins or modifications of the first and second coordination of Cu nitrite reductase (NiR) catalysts.

HEAVY METAL COORDINATION CONTROL

One of our lab’s larger goals is to probe heavy metal coordination to sulfur rich environments that are relevant to understanding metalloregulators and metallochaperones.1517 When we began these studies in the early 1990s, we hypothesized that an atomistic understanding of how these toxic metals bind to designed proteins would provide structural information that was difficult to obtain using the native systems. TRIL16C, which presents a Cys3 binding site between two Leu hydrophobic layers, was one of the earliest successes of this strategy binding Hg as HgS3.6 Hg(II) prefers to bind as a linear 2-coordinate complex with sulfurs, but evidence at the time had suggested that the metalloregulatory protein MerR utilized a Cys3 motif.18 Through spectroscopic and crystallographic studies, it was shown that Hg(II) could form such trigonal planar structures far in advance of the ultimate X-ray structure of mercurated MerR.6,19 Other metalloids such as trigonal pyramidal Pb(II) and As(III) could be interrogated in a similar vein.20,21 Utilizing similar control on Cd(II) proved more difficult since this ion has a less restrictive coordination number in the presence of thiolate ligands. TRIL16C successfully created a CdS3 binding site, but 111mCd perturbed angular correlation (PAC) experiments of this system determined that Cd(II) bound in two geometries with 40% being trigonal planar CdS3 and 60% being tetrahedral CdS3O.22

Our first application of noncoded amino acids sought to exploit changes in steric bulk around the TRIL16C binding site to control Cd(II) coordination. We compared Cd(II) binding in TRIL16C, TRIL16Pen (Pen = penicillamine), TRIL12AL16C, and TRIL12HfLL16C (HfL = hexafluoroleucine) (Table 1). Decreasing steric bulk above the Cys3 binding site in TRIL12AL16C successfully created a single species of CdS3(H2O) as determined by 111mCd PAC and allowed us, using our previous results with Cd(TRIL16C)3, to predict the 113Cd NMR chemical shift for a pure CdS3 system as ~680−700 ppm (Figure 1). We have since determined through examination of the Hg(TRIL12AL16C)3 crystal structure that opening this space allows four waters to insert within the layer above the bound metal, one of which binds to the metal creating the CdS3(H2O) site.23

Figure 1.

Figure 1.

PyMOL models of (left) Cd(II)(TRIL16Pen)3, (middle) Cd(II)(H2O)(TRIL12AL16C)3, and (right) Cd(II)(H2O)- (TRIL12HfLL16C)3 using the crystal structures of apo-(CSL16Pen)3 [pdb 3h5f] and Zn(II)(H2O)(GRANDCSL12AL16C)3 [pdb 5kb2] based on chemical knowledge. Cd(II) ions and water are represented as white and red spheres, respectively. Peptides are depicted with the N-terminus on top and C-terminus on bottom.

Increasing steric bulk in the 12th position using hexafluoroleucine produced unexpected results as it shifted the ratio of 4-/3-coordinate Cd(II) toward more 4-coordinate based on 113Cd NMR. We now realize that the incorporation of 18 fluorine atoms to the layer at the N-terminal side of the cysteine sulfur layer likely caused the Cd(II) to shift to the C-terminal side of the helix, which has more space to accommodate water allowing an increased coordination number (Figure 1). The second strategy utilizing non-native amino acids was to increase the steric bulk within the plane of the first coordination sphere by mutating the Cys3 binding residues to penicillamine. This replaces the β-methylene hydrogen atoms of Cys with methyl groups (Figure 2). The 113Cd NMR of Cd(II)(TRIL6Pen)3 contained a single peak at 684 ppm, a value within the calculated range, leaving us confident that we had achieved a CdS3 binding geometry. This was confirmed by 111mCd PAC that identified a single 3-coordinate trigonal planar species. Hence, via small structural changes to either the second or first coordination spheres, we altered the energy landscape to obtain the desired geometries. These lessons can be applied to both heavy metal binding and metalloenzyme catalytic sites.

Figure 2.

Figure 2.

Structure of (left) penicillamine and (right) hexafluoroleucine.

SECONDARY COORDINATION SPHERE NONCODED AMINO ACIDS

Penicillamine yielded the desired CdS3 center by modifying the sulfur atom plane. We next sought to create a construct that could enforce the same coordination on a Cys3 binding site by modifying steric bulk above the sulfur layer. Because the bulky hexafluoroleucine mutation above the Cys3 layer was unable to enforce a CdS3 structure, we used an alternate strategy exploiting the chirality of amino acids. We altered the original l-Leu diastereomer to its d-Leu variant to create TRIL12d-LL16C (Table 2). Given that the β carbon atoms of l-amino acids point toward the N-terminus of a 3SCC (while the d-diastereomer orients the side chain toward the C terminus), one should be able to remove the small solvent cavity above the metal without increasing the side chain steric bulk (Figure 3). 113Cd NMR of Cd(II)(TRIL12d-LL16C)3 exhibited a peak at 697 ppm, similar to that of TRIL16Pen. Also, 111mCd PAC was fit with a trigonal planar binding site,10 indicating that this design bound Cd(II) solely as CdS3.

Table 2.

TRI Family Peptide Sequences Used to Study Secondary Coordination Sphere Effects of Noncoded Amino Acids Discussed in this Sectiona

peptide 1 2 9 16 23 30 37
TRI Ac-G LKALEEK LKALEEK LKALEEK LKALEEK G-NH2
TRIL16C Ac-G LKALEEK LKALEEK CKALEEK LKALEEK G-NH2
TRIL12d-LL16C Ac-G LKALEEK LKAdLEEK CKALEEK LKALEEK G-NH2
GRANDL12d-LL16CL26AL30C Ac-G LKALEEK LKAdLEEK CKALEEK LKAAEEK CKALEEK G-NH2
TRIL16CL19d-L Ac-G LKALEEK LKALEEK CKAD-LEEK LKALEEK G-NH2
CoilSer (CS) Ac-E WEALEKK LAALESK LQALEKK LEALEHG -NH2
GRAND Ac-G LKALEEK LKALEEK LKALEEK LKALEEK LKALEEK G-NH2
a

The first residue within each heptad is labeled in the column head with its overall position within the peptide. Nomenclature for CS and GRAND follow the same numbering pattern as for TRI.

Figure 3.

Figure 3.

Schematic representation of the l- to d-Leu mutation at position 12. Reorientation of the amino acid side chain serves to increase the steric bulk in the plane above the Cys ligands. Reproduced with permission from ref 10. Copyright 2008 National Academy of Sciences, U.S.A.

Analysis of the X-ray structures of Hg(II)(TRIL6Pen)3 and Hg(II)(TRIL12d-LL16C)3 revealed that the exclusion of solvent was achieved in two different ways. With penicillamine, the β carbon atoms of cysteine must accommodate the more bulky methyl substitution, causing a change in the preferred side chain rotamers. The consequence is that the cysteine sulfur plane is translated toward the N-terminus decreasing the space for solvent. The d-Leu retains the original positioning for the sulfur plane, moving the Leu side chains toward the C-terminus of the scaffold. Effectively, we have built a small house within the 3SCC that can allow a small amount of water in when Cd(II) is present. By using penicillamine, we raise the floor of the house (sulfur layer forming the metal binding site) to displace the water, while d-Leu substitution effectively lowers the ceiling (hydrophobic layer above the metal binding site). Extending this analogy, placing an alanine above the metal binding site moves the ceiling up one level, generating pure 4 coordination.

The power of this strategy is exemplified by the creation of GRANDL12d-LL16CL26AL30C (dichromeric 3SCC), a peptide of increased length designed to incorporate at different sites exclusive 3-coordinate and 4-coordinate Cd(II) structures simultaneously (Table 2). This design separated the two metal binding sites by 20 Å and 2 layers of hydrophobic residues so one would expect the two sites to act independently of each other. Analyzing Cd(II) titrations of this dichromeric 3SCC by UV−vis gave the expected stoichiometry of 2:1 Cd/trimer and [Cd(II)]2(GRANDL12d-LL16CL26AL30C)32−. 113Cd NMR experiments showed two resonances with the first peak at 589 ppm and the second at 690 ppm. Thus, a 5% change in primary sequence created a scaffold that bound Cd(II) selectively in two different binding modes. The Cd(II)S3O site bound Cd(II) at least 10 times more tightly than the Cd(II)S3 site and fully formed at much lower pH. Thus, we determined that secondary sphere changes can influence the primary coordination sphere of a heavy metal, and such coordination sphere changes can have a notable effect on important thermodynamic properties such as binding affinity.

We hypothesized that incorporating d-Leu below the Cys3 site would lower the floor while keeping the ceiling at the same level. This change could allow for the first 5-coordinate Cd(II) (i.e., Cd(II)S3O2) in this scaffold. TRIL16CL19d-L with d-L below the Cys3 layer addresses this question (Table 2). This comparison utilized both TRI and CoilSer (CS) families of peptides.24 CS derivatives of TRI peptides have proven to have comparable properties while CS more easily forms diffraction quality crystals.25,26

Cd NMR experiments of Cd(II)(TRIL16CL19d-L)3 exhibited a peak at 600 ppm, which one would assign as exclusively Cd(II)S3O using previously established models. 111mCd PAC, however, required two unique sites to fit the data; one with parameters matching that expected for Cd(II)S3O (ωo = 0.316 rad ns−1) and a second with ωo= 0.159 rad ns−1, which we had not previously observed for the TRI system, in a roughly 1:1 mixture. Lower angular frequency is indicative of more symmetric charge distribution with perfectly tetrahedral or octahedral geometries exhibiting ωo of 0 rad ns−1; herefore, we proposed two different geometric possibilities27 (Figure 4). The first was a standard pseudotetrahedral Cd(II)S3O (ωo = 0.316 rad ns−1) as previously observed. The second structure (ωo= 0.159 rad ns−1) was a Cd(II)S3O2 coordination in which another exogenous water is bound to the C-terminal side of the bound Cd(II) as well as the N-terminal side. This latter hypothesis is strongly supported by the X-ray structure of the Hg(II)(TRIL16CL19d-L)3, which exhibits water molecules above and below the mercury ion.

Figure 4.

Figure 4.

PyMOL models of hypothesized Cd(II) coordination environments: (left) CdS3(H2O) in a tetrahedral geometry and (right) CdS3(H2O)2 in a trigonal bipyramidal geometry using the crystal structure of Hg(II)(TRIL16CL19d-L)3 based on chemical knowledge. Models create d using Zn(II)(H 2 O) - (GRANDCSL12AL16C)3 [pdb 5kb2]. Cd(II) ions are represented as white spheres, while water is represented as red spheres. Peptides are depicted with the N-terminus on top and C-terminus on bottom.

PRIMARY COORDINATION SPHERE NONCODED AMINO ACIDS

Changing the stereochemistry of the second coordination sphere can have a large effect on the structure of the metal center, but what about perturbing the chirality of an inner sphere ligand? Clearly, substitution of methyl groups on cysteine dramatically altered Cd(II) binding with an l-Pen derivative. What other changes might be observed using d-Pen? By crystallizing apo versions of both l- and d-containing peptides, we sought to understand how the S3 layer in (TRIL16Pen)3 was reoriented by inverting a stereocenter. This is an important design issue as we have shown that the cysteine sulfur layer is predisposed (sulfurs in a layer, but significant rotamer reorganization required) to bind Hg(II), Cd(II), and Zn(II) or preorganized (sulfurs form a layer that does not require rotamer reorganization) to bind Pb(II) and As(III).21 Thus, thermodynamic preferences will be influenced based on whether reorganization energy is necessary to incorporate the ion.

Crystal structures of (CSL16Pen)3 and (CSL16d-Pen)3 were solved to 1.86 and 1.71 Å resolution, respectively. Clear differences in the S3 layer were noted, with the l-Cys diastereopeptide forming a cavity that was predisposed to metal binding, with sulfur atoms oriented toward the N terminus. In contrast the structure of the d-Cys system placed the thiols down toward the C-terminus with the sulfur atoms pointed at the helical interface (Figure 5). Spectroscopic evidence indicated that the binding of Cd(II) differed greatly between these two peptides. 113Cd NMR results for CSL16Pen matched those observed for TRIL16Pen with a peak at 683 ppm while Cd(II)(CSL16d-Pen)3 exhibited a peak at 557 ppm, an even more upfield shifted resonance than seen for CdS3O structures (range of δ 570−600 ppm).11 Thus, this Cd(II) is likely either CdS3O2 or possibly CdS2O2.28 Thus, changing the stereoisomer of a side chain in the primary coordination sphere can change the coordination environment and the physical positioning of the bound metal.

Figure 5.

Figure 5.

PyMOL representations of (left) (CSL16Pen)3 [pdb 3h5f] and (right) (CSL16d-Pen)3 [pdb 3h5g]. Top-down view is done looking down from the N-terminus.

Given the difference in chemistry between l-Cys and l-Pen with Cd(II), we felt it prudent to evaluate how Cd(II) d-Cys peptides would behave. Previously, we had determined that the side chains of (CSL16C)3 were predisposed but not preorganized for Cd(II) binding in the hydrophobic interior.21 Upon metal binding, Cys side chains reorient, increasing the interlayer distance between sulfur atoms and the hydrophobic layer in the 12th position. This increase in interhelical distance was likely the cause of Cd(II)TRIL16C’s mixed coordination state of Cd(II)S3 and Cd(II)S3O. Our hypothesis was that using d-Cys, which points toward the C-terminus, would further increase the distance between the 12th and 16th layers allowing for a greater proportion of CdS3O.

Analysis of the Cd(TRIL16d-Cys)3 spectral properties led to conflicting conclusions. 113Cd NMR experiments gave a peak at 646 ppm, which predicts a 60% Cd(II)S3 and 40% Cd(II)S3O distribution. In contrast, 111mCd PAC spectra were fit with a single species of Cd(II)S3O. We hypothesized that this discrepancy was linked to the sensitivity of 113Cd NMR peaks to ligand to metal charge transfer (LMCT) changes in the Cys−Cd bond.29,30 This bond is affected by the Cys p-orbital and thus one would expect differences between l-Cys and d-Cys, while 111mCd PAC is unaffected by such differences in LMCT but is affected by Cd−Cys distance. This illustrates that the 111mCd PAC and 113Cd NMR structural correlation may have limitations when not strictly applied to l-Cys in an “a” site peptide environment. The final proposed model for Cd(II)(TRIL16d-Cys)3− was Cd(II)S3O with an elongated Cd−O bond and shorter Cd−S distances.

Diffraction quality crystals for apo-(CSL16d-C)3 as well as Zn(II)Cl(CSL16d-C)3 were compared to apo-(CSL16C)3 and Zn(II)(GRAND-CSL12AL16C)3 to evaluate the differences between l-Cys and d-Cys using Zn(II) as a mimic for tetrahedral Cd(II) (Figure 6). Comparison of the apo structures found no significant deviation from α-helicity, but an increased distance between the 12th and 16th layer from 4.92 Å (l-Cys) to 6.10 Å (d-Cys), which we hypothesized allowed the Cd(II)S3O coordination seen in Cd(II)(CSL16d-C)3. Comparing apo-(CSL16d-C)3 to Zn(II)(CSL16d-C)3, we found little reorganization upon metalation. Zn(II) binds d-Cys with a distorted tetrahedral Zn(II)S3Cl coordination environment, which we hypothesized would be a reasonable model for the structure of Cd(II)S3O. That the Zn(II) bound Cl had 100% occupancy within the crystal structure left no doubt that a smaller exogenous water ligand could occupy the same space. A similar analysis of Zn(II) binding to l-Cys using apo-(CSL16C)3 and Zn(II)(GRAND-CSL12AL16C)3 determined that the Cys ligands rotated ~80° to bind Zn(II) while d-Cys ligands rotated ∼10−30°. Thus, we had added primary sphere d-Cys mutation to the growing list of methods one could use to control Cd(II) coordination and determined that it could create a site that was preorganized for a tetrahedral metal rather than predisposed as were the l-Cys sites.

Figure 6.

Figure 6.

Preorganization of d-Cys versus the predisposition of l-Cys upon binding to Zn(II) in the 3SCCs. (top panels) Top down view from the N termini representing the 16 l-Cys side chain orientations in (a) [apo-(CSL16l-C)3 [PDB ID 5K92] and (b) Zn(II)(H2O)-(GRANDCSL12AL16l-C)3 [PDB ID 5KB2] and (c) an overlay between structures from panels a and b. (bottom panels) (d) Apo-(CSL16d-C)3, (e) Zn(II)Cl(CSL16d-C)3, and (f) an overlay between structures from panels d and e. Metal binding ligands are represented as sticks (sulfur atoms in yellow). Zn(II) ions in Zn(II)Cl(CSL16d-C)3 and Zn(II)(H2O)(GRANDCSL12AL16l-C)3 are given in gray and green, respectively. The coordinated water molecule is red, and the chloride ligand is orange. Adapted with permission from ref 12. Copyright 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

APPLYING NONCODED AMINO ACIDS TO DE NOVO CATALYTIC SYSTEMS

A second major thrust of our work is to apply lessons of de novo metalloprotein design we have learned from heavy metal systems to metalloenzymes utilizing His3 binding to transition metals. It should be noted that converting the metal binding ligand from cysteine to histidine has significant consequences. These include moving from a soft to an intermediate Lewis base, a large increase in size going from a sulfur atom to an imidazole ring, a decrease in negative charge, and a significant change in orientational bonding between a spherical thiolate ligand and two different azole nitrogens. Nonetheless, we were able to show that incorporating His3 sites into TRI systems and binding either Zn(II) or Cu(I) allowed for the recapitulation of carbonic anhydrase or Cu nitrite reductase (CuNiR) activities.19,31 X-ray structures of the Zn(II) containing peptides demonstrated that the TRI scaffolds were able to accommodate a tetrahedral metal by increasing the interhelical separations of the 3SCCs.19 Moreover, X-ray absorption spectroscopy of the copper peptides revealed that metal complexes ranging from 2- to 5-coordinate with as many as 3 bound histidines could be isolated. Reviews of our work on de novo carbonic anhydrases and, recently, nitrite reductases have been published, so this Account will focus largely on our application of noncoded amino acids with CuNiR.32,33

Our first CuNiR mimic within a 3SCC system was created by substituting Zn(II) with Cu(I/II) within the TRIW-H peptide used to recapitulate carbonic anhydrase activity (Figure 7).31 X-ray absorption spectroscopy (XAS) indicated that the Cu(I) coordination environment was likely His3 based on the Cu(I)−N/O distance of 1.93 Å, the presence of long distance scatterers expected with His ligation, and the intensity of the 1s → 4p pre-edge transition, which inversely correlates with Cu(I) complex coordination number.34 Electronic absorption and EPR spectroscopies were initially used to estimate the Cu(II) coordination geometry as His3(H2O)2, but more recently, we have revisited this question using XAS, and the average calculated Cu(II)−N/O distance of 1.94 Å as well as analysis of the multiscattering intensity leaves His3(H2O) as the most likely model.

Figure 7.

Figure 7.

(A) Representation of the model for Cu(TRIH)3 based on the structure of Hg(II)S[Zn(II)N(H2O)](CSL9CL23H)3+ [PDB 3PBJ]. (B) View of the Zn(II)(H2O)His32+ site along the pseudo-3-fold axis of Hg(II)S[Zn(II)N(H2O)](CSL9CL23H)3+ (Cyan), superimposed on the type 2 Cu(II)H2O(His)3 site in R. sphaeroides NiR (PDB 2DY2, backbone, green; side chain, pink). Coordinated water molecules are represented as spheres. (C) Side view of the same superposition in panel B. Peptides are depicted as either a side view with the N-terminus on top and C-terminus on bottom or top-down view looking down from the N-terminus. Adapted with permission from ref 31. Copyright 2012 National Academy of Sciences, U.S.A.

Cu(I) and Cu(II) KD values at pH 5.9 were determined to be 3.1 pM and 40 nM, respectively. The reduction potential of the Cu peptide at pH 5.9 was calculated using the Nernst equation to be +400 mV, a value more reminiscent of type 1, rather than type 2, copper proteins. An activity assay monitored the oxidation of ascorbate as NO2 was converted to NO. A pH dependent activity, with a maximum value for the first order rate constant of 4.6 × 10−4 s−1, was found. While this value is 7 orders of magnitude below native CuNiR, this model was capable of 1000s of turnovers without degradation and provided a starting point for investigating CuNiR activity within a de novo scaffold.

We hypothesized that one could apply secondary coordination sphere mutations around the Cu(I)His3 site of TRW-H to affect substrate access and coordination geometry similar to our above work with Cd(II)(Cys3) sites.14 The resting state of this catalyst is Cu(I)(TRIW-H)3 + due to an excess of the reducing agent ascorbate, which was confirmed as Cu(I)His3 by X-ray absorption spectroscopy (XAS). Native CuNiR systems, however, have a CuHis3(H2O) coordination environment in both the cuprous and cupric forms.35,36 We created a series of TRIW-H variants with differing steric bulk above or below the active site to investigate whether steric controls seen with heavy metal binding constructs could be used to enforce a Cu(I)His3(H2O) coordination state and what effects this would have on catalysis. We will focus on two constructs, TRIW-HL19A and TRIW-HL19d-L, which are direct analogues of changes made to control the metal coordination number of Cd(II) in the cysteine containing peptides (Table 3).

Table 3.

TRI-H Family Peptide Sequences for CuNiR Constructsa

peptide 1 2 9 16 23 30
TRIW-H Ac-G WKALEEK LKALEEK LKALEEK HKALEEK G-NH2
TRIW-HL19d-L Ac-G WKALEEK LKALEEK LKAD-LEEK HKALEEK G-NH2
TRIW-HL19A Ac-G WKALEEK LKALEEK LKAAEEK HKALEEK G-NH2
TRIW-δmH Ac-G WKALEEK LKALEEK LKALEEK δmHKALEEK G-NH2
TRIW-εmH Ac-G WKALEEK LKALEEK LKALEEK εmHKALEEK G-NH2
a

The first residue within each heptad is labeled in the column head with its overall position within the peptide.

We determined by XAS that opening space above the His3 site with TRIW-HL19A decreased Cu(I)’s coordination number to Cu(I)His2 based on the calculated Cu−N/O distance of 1.86 Å and the relative intensity of the 1s → 4p absorption band34 (Figure 8). Packing of Leu above the Cu(I)His3 precluded the His from rotating to create a linear 2-coordinate center. Upon decrease of that steric hindrance, Cu(I) adopted its preferred linear geometry. Cu(I)(TRIW-HL19d-L)3+ was, instead, practically indistinguishable from Cu(I)(TRIW-H)3+ in both Cu−N/O distance at 1.92 Å and 1s → 4p absorption band intensity and was assigned as a Cu(I)His3 geometry. Thus, we found that secondary sphere changes allowed for tuning the coordination and reactivity of a transition metal His3 site.

Figure 8.

Figure 8.

XANES of Cu(I) bound constructs of TRIW-H and secondary coordination sphere variants.

Surprisingly, while Cu(I)(TRIW-HL19A)3+ had moved further away from our goal of mimicking the coordination environment of CuNiR, its first-order rate constant was 75 times that of Cu(I)(TRIW-H)3+ or Cu(I)(TRIW-HL19d-L)3+. Other secondary sphere mutations, either above or below the His3 layer, revealed that decreasing the steric bulk above the CuHis3 decreased the coordination number to 2, while the same modification below the CuHis3 site left the coordination number unchanged. Surprisingly, both mutation types increased rates equivalently. This increase in first-order kinetics was correlated with a 1 eV shift to higher energy of the 1s → 4p absorption bands (8982.8 ± 0.2 eV seen in Cu(I)(TRIW-H)3+ vs 8983.6 ± 0.2 eV for Cu(I)(TRIW-HL19A)3+). This energy shift could be related to the net charge on the bound Cu(I) that could then affect the energy of the Cu(I)His3NO2 transition state. This could explain how systems with both 2- and 3-coordinate Cu(I) have the same increase in first order rate constant. Alternatively, the rate limitation could occur from the Cu(II) state.

We next applied noncoded amino acids to the primary coordination sphere of our CuNiR constructs, employing methylated histidine residues to enforce His binding by either the δ- or ε-nitrogen (using Nε or Nδ methylated histidine, respectively) (Figure 9). Previous metastudies of crystal structures deposited into the PDB had postulated that δ- or ε-nitrogen bound copper correlated with differing functions in nature, with δ-N being found in more electron transfer centers and ε-N in metalloenzymes.37,38 This rationale was further supported by the later discovery of the CuH center of peptidylglycine α-hydroxylating monooxygenase (PHM), an electron transfer center with the same His3 metal binding site as CuNiR but binding Cu with the δ-nitrogens rather than ε-nitrogens.39 Based on these factors, we hypothesized the CuNiR efficiency would be higher using Nε coordination than Nδ within our de novo constructs. We designed TRIW-δmH and TRIW-εmH and compared these to the original TRIW-H to investigate this effect (Table 3).

Figure 9.

Figure 9.

Structure of (left) Nε methylated His and (right) Nδ methylated His.

EXAFS, XANES, and EPR analysis indicated that Cu(II) coordination to (TRIW-H)3 was similar to Cu(II) coordination to (TRIW-δmH)3 with EXAFS best fit to 4-coordinate Cu(II)His3(H2O). Cu(II)(TRIW-εmH)3 on the other hand was more tetrahedral in its geometry based on intensity of the 1s → 3d transition and compressed hyperfine coupling constant, while EXAFS was best fit to 4-coordinate Cu(II)-His2(H2O)2. The reasoning for this difference, based on modeling the site within the crystal structure of Hg(II)S[Zn-(II)N(H2O)](CSL9CL23H)3+ (PDB 3PBJ), is likely that ligation to Nδ within a 3SCC does not enable all three His to ligate simultaneously and a Cu(II)His2(H2O)2 coordination was instead preferred similar to Cd(II) binding to (CSL16d-Pen)3. Investigations of Cu(I) coordination gave similar results with EXAFS and XANES analyses of Cu(I)(TRIW-δmH)3 matching Cu(I)(TRIW-H)3 with a 3-coodinate Cu(I)His3 geometry while Cu(I)(TRIW-εmH)3 was bound as a 2-coordinate Cu(I)His2.

Dissociation constants were determined for Cu(I) and Cu(II) for (TRIW-δmH)3 and (TRIW-εmH)3 and compared to literature values for (TRIW-H)3. While Cu(I) KD’s were largely unaffected by methylation of δ- or ε-nitrogen (all three constructs ∼2−6 pM), the Cu(II) KD of (TRIW-δmH)3 was 2 orders of magnitude looser binding (3.3 μM) than either (TRIW-εmH)3 or (TRIW-H)3 (0.04 μM). This difference led to a 130 mV increase in reduction potential for Cu-(TRIW-δmH)32+ (530 mV) versus Cu(TRIW-εmH)32+ or Cu(TRIW-H)32+ (400 mV). We hypothesize that this shift in reduction potential is caused by the inductive effect of the methyl group in comparison to hydrogen. Interestingly, we have shown that changing charge of residues at the helical interface of Cu(TRIW-H)32+ also caused a 100 mV shift in potential; however, this change was primarily due to changes in Cu(I) KD. This reduction potential shift came with a minimal perturbation to CuNiR activity. We hypothesized that these reduction potential changes were due to a hydrogen bond between the outer sphere amino acids and the His3 site. The comparison of methylated histidine containing constructs cannot be influenced by such a hydrogen bond, so other factors must be at play.

Looking at CuNiR activity, Cu(I)(TRIW-δmH)3+ had a 2 orders of magnitude higher first-order rate constant than Cu(I)(TRIW-εmH)3+. This is a significant difference between Nε or Nδ coordination (Table 4) and may be due to the inductive effect of the methyl group. A similar effect was hypothesized for catalytic differences between models of CuNiR using pyrazole versus imidazole.40,41 However, we caution that the differing Cu(I) and Cu(II) coordination environments between these two constructs and the shift in reduction potential complicates such analysis. Interestingly, the shift of 100 mV in potential that occurs due to modification of interhelical residues involved in H-bonding with the imidazole that influences Cu(I) affinity has only a 4-fold change in rates, whereas alkylation of the histidine causing the same 100 mV shift, but this time lessening Cu(II) affinity, results in a 100-fold increase in activity. This result may point to the sensitivity of the kinetics on Cu(II) rather than Cu(I). Cu(I)-(TRIW-δmH)3+ also has 300 times higher CuNiR activity based on first order rate constant than the original Cu(I)-(TRIW-H)3+, with a new record for a homogeneous aqueous system at 0.12 s−1. We argue that this is the first quantitative evidence for the catalytic differences between Nε or Nδ Cu coordination within a protein environment. It is true that the differing coordination spheres of these constructs can complicate such a direct comparison, but given our previous results showing several 2- and 3-coordinate Cu(I) constructs with indistinguishable activity, we do not believe the change in coordination between Cu(TRIW-δmH)3 and Cu(TRIW-εmH)3 is the primary origin of differences in NiR activity between these two constructs.14

Table 4.

Comparison of Cu(I) Coordination and Nitrite Reductase Rate for TRIW-H CuNiR Models

peptide Cu(I) coordination rate (s−1)
TRIW-H Cu(I)(His)3 4.6 × 10−4
TRIW-HL19A Cu(I)(His)2 3.5 × 10−2
TRIW-HL19d-L Cu(I)(His)3 9.7 × 10−4
TRIW-εmH Cu(I)(His)2 1.2 × 10−3
TRIW-δmH Cu(I)(His)3 0.12

In conclusion, we have shown that noncoded amino acids can be used to influence dramatically coordination number and the physical properties of designed metalloproteins. These modifications may be achieved through installation of residues with altered chirality, modified steric bulk, or changes in the electronic nature of a coordinating ligand. With Cd(II), coordination number could be controlled across the series CdS3, CdS3(H2O), and CdS3(H2O)2. With copper, coordination numbers from 2 to 5 could be observed. More important, through the use of noncoded amino acids, we not only created a new benchmark for homogeneous model CuNiR systems in aqueous solution but provided the first quantitative evidence for the catalytic differences between Nε or Nδ Cu coordination within a protein environment. Thus, we conclude that exploitation of a broader toolset of noncoded amino acids promises even greater structural and reactivity control in metalloprotein design.

Biographies

Karl J. Koebke obtained his Ph.D. from the University of Wisconsin—Milwaukee working with Prof. Andy Pacheco exploring the kinetics of nitric oxide dioxygenation in heme proteins. He then spent three years working as a postdoctoral researcher in the lab of Vincent Pecoraro where he designed a plethora of de novo constructs including nitrite reductase and cupredoxin mimics. He is currently working at Wayne State University with Prof. Timothy Stemmler on mitochondrial FeS cluster biogenesis.

Vincent L. Pecoraro is the John T. Groves Collegiate Professor of Chemistry at the University of Michigan, Ann Arbor. He has studied bioinorganic chemistry for his entire career with emphasis on manganese and vanadium systems. In addition to discovering metallacrowns, he has been a pioneer in metalloprotein design. He served as an Associate Editor of Inorganic Chemistry for over 20 years and is a Fellow of the ACS.

Footnotes

Notes

The authors declare no competing financial interest.

REFERENCES

  • (1).Huang P-S; Boyken SE; Baker D The coming of age of de novo protein design. Nature 2016, 537, 320–327. [DOI] [PubMed] [Google Scholar]
  • (2).Yu F; Cangelosi VM; Zastrow ML; Tegoni M; Plegaria JS; Tebo AG; Mocny CS; Ruckthong L; Qayyum H; Pecoraro VL Protein Design: Toward Functional Metalloenzymes. Chem. Rev 2014, 114, 3495–3578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (3).Lu Y; Berry SM; Pfister TD Engineering Novel Metalloproteins: Design of Metal-Binding Sites into Native Protein Scaffolds. Chem. Rev 2001, 101, 3047–3080. [DOI] [PubMed] [Google Scholar]
  • (4).Andreini C; Bertini I; Cavallaro G; Holliday GL; Thornton JM Metal ions in biological catalysis: from enzyme databases to general principles. JBIC, J. Biol. Inorg. Chem 2008, 13, 1205–1218. [DOI] [PubMed] [Google Scholar]
  • (5).Andreini C; Bertini I; Rosato A Metalloproteomes: A Bioinformatic Approach. Acc. Chem. Res 2009, 42, 1471–1479. [DOI] [PubMed] [Google Scholar]
  • (6).Dieckmann GR; McRorie DK; Tierney DL; Utschig LM; Singer CP; O’Halloran TV; Penner-Hahn JE; DeGrado WF; Pecoraro VL De Novo Design of Mercury-Binding Two- and Three-Helical Bundles. J. Am. Chem. Soc 1997, 119, 6195–6196. [Google Scholar]
  • (7).Mills JH; Khare SD; Bolduc JM; Forouhar F; Mulligan VK; Lew S; Seetharaman J; Tong L; Stoddard BL; Baker D Computational Design of an Unnatural Amino Acid Dependent Metalloprotein with Atomic Level Accuracy. J. Am. Chem. Soc 2013, 135, 13393–13399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (8).Clark KM; Tian S; van der Donk WA; Lu Y Probing the role of the backbone carbonyl interaction with the CuA center in azurin by replacing the peptide bond with an ester linkage. Chem. Commun 2017, 53, 224–227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (9).Lee K-H; Matzapetakis M; Mitra S; Marsh ENG; Pecoraro VL Control of Metal Coordination Number in de Novo Designed Peptides through Subtle Sequence Modifications. J. Am. Chem. Soc 2004, 126, 9178–9179. [DOI] [PubMed] [Google Scholar]
  • (10).Peacock AFA; Hemmingsen L; Pecoraro VL Using diastereopeptides to control metal ion coordination in proteins. Proc. Natl. Acad. Sci. U. S. A 2008, 105, 16566–16571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (11).Peacock AFA; Stuckey JA; Pecoraro VL Switching the Chirality of the Metal Environment Alters the Coordination Mode in Designed Peptides. Angew. Chem., Int. Ed 2009, 48, 7371–7374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (12).Ruckthong L; Peacock AFA; Pascoe CE; Hemmingsen L; Stuckey JA; Pecoraro VL d-Cysteine Ligands Control Metal Geometries within De Novo Designed Three-Stranded Coiled Coils. Chem. - Eur. J 2017, 23, 8232–8243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (13).Ruckthong L; Deb A; Hemmingsen L; Penner-Hahn JE; Pecoraro VL Incorporation of second coordination sphere d-amino acids alters Cd(II) geometries in designed thiolate-rich proteins. JBIC, J. Biol. Inorg. Chem 2018, 23, 123–135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (14).Koebke KJ; Yu F; Salerno E; Van Stappen CV; Tebo AG; Penner-Hahn JE; Pecoraro VL Modifying the Steric Properties in the Second Coordination Sphere of Designed Peptides Leads to Enhancement of Nitrite Reductase Activity. Angew. Chem., Int. Ed 2018, 57, 3954–3957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (15).O’Halloran T; Walsh C Metalloregulatory DNA-binding protein encoded by the merR gene: isolation and characterization. Science 1987, 235, 211–214. [DOI] [PubMed] [Google Scholar]
  • (16).Lin Y-F; Yang J; Rosen BP ArsD Residues Cys12, Cys13, and Cys18 Form an As(III)-binding Site Required for Arsenic Metallochaperone Activity. J. Biol. Chem 2007, 282, 16783–16791. [DOI] [PubMed] [Google Scholar]
  • (17).Liu T; Golden JW; Giedroc DP A Zinc(II)/Lead(II)/ Cadmium(II)-Inducible Operon from the Cyanobacterium Anabaena Is Regulated by AztR, an α3N ArsR/SmtB Metalloregulator. Biochemistry 2005, 44, 8673–8683. [DOI] [PubMed] [Google Scholar]
  • (18).Helmann J; Ballard B; Walsh C The MerR metalloregulatory protein binds mercuric ion as a tricoordinate, metal-bridged dimer. Science 1990, 247, 946–948. [DOI] [PubMed] [Google Scholar]
  • (19).Zastrow ML; Peacock AFA; Stuckey JA; Pecoraro VL Hydrolytic catalysis and structural stabilization in a designed metalloprotein. Nat. Chem 2012, 4, 118–123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (20).Touw DS; Nordman CE; Stuckey JA; Pecoraro VL Identifying important structural characteristics of arsenic resistance proteins by using designed three-stranded coiled coils. Proc. Natl. Acad. Sci. U. S. A 2007, 104, 11969–11974. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (21).Ruckthong L; Zastrow ML; Stuckey JA; Pecoraro VL A Crystallographic Examination of Predisposition versus Preorganization in de Novo Designed Metalloproteins. J. Am. Chem. Soc 2016, 138, 11979–11988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (22).Matzapetakis M; Farrer BT; Weng T-C; Hemmingsen L; Penner-Hahn JE; Pecoraro VL Comparison of the Binding of Cadmium(II), Mercury(II), and Arsenic(III) to the de Novo Designed Peptides TRI L12C and TRI L16C. J. Am. Chem. Soc 2002, 124, 8042–8054. [DOI] [PubMed] [Google Scholar]
  • (23).Ruckthong L; Stuckey JA; Pecoraro VL How Second Coordination Sphere Modifications Can Impact Metal Structures in Proteins: A Crystallographic Evaluation. Chem. - Eur. J 2019, 10.1002/chem.201806040. [DOI] [PMC free article] [PubMed]
  • (24).O’Neil K; DeGrado W A thermodynamic scale for the helix-forming tendencies of the commonly occurring amino acids. Science 1990, 250, 646–651. [DOI] [PubMed] [Google Scholar]
  • (25).Lovejoy B; Choe S; Cascio D; McRorie D; DeGrado W; Eisenberg D Crystal structure of a synthetic triple-stranded alpha-helical bundle. Science 1993, 259, 1288–1293. [DOI] [PubMed] [Google Scholar]
  • (26).Iranzo O; Ghosh D; Pecoraro VL Assessing the Integrity of Designed Homomeric Parallel Three-Stranded Coiled Coils in the Presence of Metal Ions. Inorg. Chem 2006, 45, 9959–9973. [DOI] [PubMed] [Google Scholar]
  • (27).Hemmingsen L; Bauer R; Bjerrum MJ; Adolph HW; Zeppezauer M; Cedergren-Zeppezauer E The Protein Conformation of Cd-Substituted Horse Liver Alcohol Dehydrogenase and its Metal-Site Coordination Geometry in Binary and Ternary Inhibitor Complexes. Eur. J. Biochem 1996, 241, 546–551. [DOI] [PubMed] [Google Scholar]
  • (28).Iranzo O; Jakusch T; Lee K-H; Hemmingsen L; Pecoraro VL The correlation of 113Cd NMR and 111mCd PAC spectroscopies provides a powerful approach for the characterization of the structure of Cd(II)-substituted Zn(II) proteins. Chem. - Eur. J 2009, 15, 3761–3772. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (29).Ramsey NF Magnetic Shielding of Nuclei in Molecules. Phys. Rev 1950, 78, 699–703. [Google Scholar]
  • (30).Ramsey NF; Purcell EM Interactions between Nuclear Spins in Molecules. Phys. Rev 1952, 85, 143–144. [Google Scholar]
  • (31).Tegoni M; Yu F; Bersellini M; Penner-Hahn JE; Pecoraro VL Designing a functional type 2 copper center that has nitrite reductase activity within α-helical coiled coils. Proc. Natl. Acad. Sci. U. S. A 2012, 109, 21234–21239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (32).Koebke KJ; Pecoraro VL Development of de Novo Copper Nitrite Reductases: Where We Are and Where We Need To Go. ACS Catal 2018, 8, 8046–8057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (33).Zastrow ML; Pecoraro VL Designing functional metalloproteins: From structural to catalytic metal sites. Coord. Chem. Rev 2013, 257, 2565–2588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (34).Kau LS; Spira-Solomon DJ; Penner-Hahn JE; Hodgson O; Solomon EI X-ray absorption edge determination of the oxidation state and coordination number of copper. Application to the type 3 site in Rhus vernicifera laccase and its reaction with oxygen. J. Am. Chem. Soc 1987, 109, 6433–6442. [Google Scholar]
  • (35).Kukimoto M; Nishiyama M; Murphy MEP; Turley S; Adman ET; Horinouchi S; Beppu T X-ray Structure and Site-Directed Mutagenesis of a Nitrite Reductase from Alcaligenes Faecalis S-6: Roles of Two Copper Atoms in Nitrite Reduction. Biochemistry 1994, 33, 5246–5252. [DOI] [PubMed] [Google Scholar]
  • (36).Murphy MEP; Turley S; Adman ET Structure of Nitrite Bound to Copper-containing Nitrite Reductase from Alcaligenes faecalis: MECHANISTIC IMPLICATIONS. J. Biol. Chem 1997, 272, 28455–28460. [DOI] [PubMed] [Google Scholar]
  • (37).Karlin S; Zhu Z-Y; Karlin KD The extended environment of mononuclear metal centers in protein structures. Proc. Natl. Acad. Sci. U. S. A 1997, 94, 14225–14230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (38).Libby E; Averill BA Evidence that the Type 2 copper centers are the site of nitrite reduction by Achromobacter cycloclastes nitrite reductase. Biochem. Biophys. Res. Commun 1992, 187, 1529–1535. [DOI] [PubMed] [Google Scholar]
  • (39).Prigge ST; Kolhekar AS; Eipper BA; Mains RE; Amzel LM Amidation of Bioactive Peptides: The Structure of Peptidylglycine α-Hydroxylating Monooxygenase. Science 1997, 278, 1300–1305. [DOI] [PubMed] [Google Scholar]
  • (40).Kujime M; Izumi C; Tomura M; Hada M; Fujii H Effect of a Tridentate Ligand on the Structure, Electronic Structure, and Reactivity of the Copper(I) Nitrite Complex: Role of the Conserved Three-Histidine Ligand Environment of the Type-2 Copper Site in Copper-Containing Nitrite Reductases. J. Am. Chem. Soc 2008, 130, 6088–6098. [DOI] [PubMed] [Google Scholar]
  • (41).Johnson BJ; Mankad NP Model Compounds of Copper-Containing Enzymes Involved in Bacterial Denitrification. Metalloenzymes in Denitrification: Applications and Environmental Impacts; The Royal Society of Chemistry: 2017; Chapter 10, pp 225–251. [Google Scholar]

RESOURCES