Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Feb 10.
Published in final edited form as: J Am Chem Soc. 2016 Jan 26;138(5):1543–1550. doi: 10.1021/jacs.5b10285

A miniature protein stabilized by a cation-π interaction network core

Timothy W Craven †,§, Min-Kyu Cho , Nathaniel J Traaseth , Richard Bonneau §,‡,#,*, Kent Kirshenbaum †,*
PMCID: PMC4867217  NIHMSID: NIHMS785698  PMID: 26812069

Abstract

The design of folded miniature proteins is predicated on establishing non-covalent interactions that direct the self-assembly of discrete thermo-stable tertiary structures. In this work, we describe how a network of cation-π interactions present in proteins containing “WSXWS motifs” can be emulated to stabilize the core of a miniature protein. This 19-residue protein sequence recapitulates a set of interdigitated arginine and tryptophan residues that stabilize a distinctive β-strand:loop:PPII-helix topology. Validation of the compact fold determined by NMR was carried out by mutagenesis of the cation-π network and by comparison to the corresponding disulfide-bridged structure. These results support the involvement of a coordinated set of cation-π interactions that stabilize the tertiary structure.

Keywords: cation-π interactions, mini-protein, “WSXWS” motifs, PPII-helices

INTRODUCTION

Miniature proteins afford an opportunity to isolate and evaluate non-covalent interactions that direct formation of the secondary and tertiary structures present in native proteins1-9. Previous studies of mini-proteins have elucidated thermodynamic and kinetic features of folding that may become convoluted within the context of more complex protein architectures. Within mini-proteins, the biophysical characteristics of particular interactions can therefore be elucidated more precisely. For example, the interactions between the guanidinium group of an arginine and the indole ring of tryptophan have now been extensively described10-19. These so-called cation-π interactions are representative of stabilizing non-covalent contacts between positively charged (Arg, Lys, His) and aromatic (Phe, Tyr, Trp) side-chain functionalities10, and provide distinctive contributions to protein structure and function11-15. Several studies have investigated the energetic contributions of binary cation-π interactions16-19. Their stabilizing strengths have been evaluated in the context of α-helices (−0.4 kcal/mol for residues positioned at i and i+4)17 and β-sheets (−0.20 to −0.48 kcal/mol)18-19. Computational studies have estimated the free energy of cation-π interactions can vary up to −5.5 kcal/mol16. The strengths of cation-π interactions have been demonstrated to increase with temperature20, establishing thermo-protective influences in thermophilic organisms21.

Networks of cation-π interactions are evident in x-ray crystal structures of numerous proteins. One example is the “WSXWS motif” found in fibronectin type III (Fn3) domain containing proteins (Figure 1). The cation-π interactions in these motifs can be observed in x-ray structures stabilizing a relatively rare class of secondary structure, the left-handed poly-proline type-II helix (PPII) (Figures 1, 2). In particular, an unusual PPII-helix devoid of proline residues has been observed in the sequences and structures of several Fn3 domains (Figures 1&2, Table 1). We sought to extract and replicate the β-strand:loop:PPII-helix topology observed in these motifs and determine if this tertiary structure could exist in the absence of the parent Fn3 protein domain. This report details the successful design of a 19-residue mini-protein that emulates the tertiary structure of WSXWS motifs including a core network of cation-π interactions.

Figure 1. Cation-π interaction networks stabilize “WSXWS motifs” in native proteins.

Figure 1

(A) Sequence logo22 and cartoon representation of the conserved secondary structural elements observed in WSXWS motifs. (B) Cartoon representation of the Fibronectin type III domain of IL3RB in grey (PDBid: 1EGJ) with the WSXWS motif highlighted in blue (β-strand), cyan (loop), and purple (PPII-helix). (inset) The cation-π interaction network found in the x-ray structure of IL3RB. The dotted yellow lines highlight the connectivity of the cation-π interaction network formed between arginine (blue) and tryptophan (purple) side chains.

Figure 2. Structural alignment of 15 X-ray structures of WSXWS motifs23-37.

Figure 2

Table 1. UniprotlDs and multiple sequence alignment of WSXWS motifs found in X-ray crystal structures23-37.

Uniprot ID Multiple Sequence Alignment
P40189    RIRCMKEDGKG….YWSDWSE
P14784    QVRVKPLQGEFT…TWSPWSQ
P31785    RVRSRFNPLCGSAQ.HWSEWSH
P78552    RIRVKTNKLCYEDDKLWSNWSQ
P40223    QMRCIRSSLPG….FWSPWSP
P19235    AVRARMAEPSFGG..FWSEWSE
Q99062    QIRCIRWPLPG….HWSDWSP
P42702    RIRCSTETFW…..KWSKWSN
P05710    QTRCKPDHG……YWSRWSQ
Q14627    VVRSKVNIYCSDDG.IWSEWSD
P08887    QLRAQEEFGQG….EWSEWSP
Q01344    QVRAAVSSMCREAG.LWSEWSQ
Q9HBE5    QVRAGPMPGSSYQG.TWSEWSD
P48357    QVRCKRLDGLG….YWSNWSN
P32927(a)    RVRVRTSRTGYNG..IWSEWSE
consensus    +++     WS WS
(a)

The common name for P32927 is IL3RB.

Note: Additional information about X-ray structures of WSXWS motifs including their PDB identifiers can be found in the supporting information (Figure S1).

WSXWS motifs in natural proteins

WSXWS motifs are found in the structures of proteins comprising an Fn3 domain and include a β-strand:loop:PPII topology (Figure 1) with a conserved ‘R…WSXWS’ primary sequence (Table 1). The structures feature a conserved arginine side chain intercalated between the two tryptophan residues (RWSXWS, Figure 3A). Additionally, the hydroxyl groups of the conserved serine residues (R…WSXWS) serve as hydrogen bond acceptors from an adjacent β-strand, as well as potential hydrogen bond donors to the i-1 carbonyl oxygen (Figure 3B). The serine backbone amide HN groups also provide hydrogen bond donors to the adjacent β-strand. In some proteins, such as IL3RB (Figure 1, Table 1), the WSXWS motif is stabilized by a core network of cation-π interactions involving three arginine residues on a β-strand, which interdigitate with two tryptophan residues of an adjacent PPII-helix. The WSXWS motifs have been the focus of diverse biological studies, as they are found in a wide array of extracellular receptors. These motifs are also subject to unique C-mannosylation of their conserved tryptophan residues (PDBid: 3TGX). In addition, a variety of genetic alterations within these sequences are associated with ‘X-linked severed combined immunodeficiency,’ (OMIMid: XSCID; a.k.a. “bubble boy syndrome”) revealing the critical role played by the WSXWS motifs.

Figure 3. Characteristic cation-π interactions and hydrogen bonds in WSXWS motifs from proteins found in Table 1.

Figure 3

(A) Plot of the distance between the Cε2 indole carbon atoms of each tryptophan to the Cζ of the central arginine guanidinium group highlighted with space-filled atoms in black→white. Different shades of black→white identify arginine rotamers having similar side-chain dihedral angles38. (Tables S10, S11). All values are <6.5Å, a characteristic cut-off distance of cation-π interactions. (B) Plot of the distance between the serine side-chain Oγ atom and the i-1 carbonyl oxygen (O) and the adjacent β-stand backbone amide nitrogen (N). Yellow and green circles correspond to the color-labeled serine residues found in the structural alignment. Black lines with arrows represent potential hydrogen bonds (H-bonds).

PPII-helices in miniature proteins

Proline-free PPII-helices are difficult to design. Unlike other secondary structure types, the PPII-helix does not feature a propagating hydrogen bonded network. With three residues per helical turn, the co-facial i and i+3 PPII positions are, on average, 8Å apart (Cα→Cα), and are too distant to generate strong non-covalent interactions. Both the Trp-cage and the Avian Pancreatic Peptide (APP) mini-proteins3,6 include a proline-rich PPII-helix packed against an α-helix generating an α-helix:loop:PPII topology. The structural rigidity engendered by multiple proline residues establishes an underlying foundation for interfacing α-helical elements. Thus, the rigidity of a PPII-helix has been demonstrated to augment the stability of proximal α-helical secondary structures. Here we invert this relationship by stabilizing a proline-free PPII-helix through a set of stabilizing tertiary contacts with a neighboring peptide β-strand. This study presents a previously unexplored strategy for enforcing the left-handed PPII secondary structure type.

EXPERIMENTAL METHODS

Peptide synthesis

All peptides were synthesized using standard FMOC solid-phase chemistry on 2-chlorotrityl (2-ClTrt) resin (Anaspec, Fremont, CA) using PyBop (benzotriazol-1-yl-oxytripyrrolidinophosphonium hexafluorophosphate) activating agent (Anaspec, Fremont, CA). All peptides were precipitated out of cold ether and purified by reverse-phase HPLC using preparative chromatography and eluted with H2O-Acetonitrile (ACN) gradients in 0.1% trifluoroacetic acid (TFA). After purification, all proteins were lyophilized three times in 1:1:1 (v:v:v) H2O:ACN:Acetic Acid to remove TFA salts and transferred into buffer with pH adjustment after protein addition. Protein stock solutions having 2.0 mM concentrations were stored at −20°C. All peptides were characterized by mass analysis using MALDI-TOF MS or ESI MS; sequence and purity were evaluated by HPLC and NMR spectroscopy analysis. Disulfide-bridged peptides were synthesized, purified, and characterized in an analogous way but instead were cleaved from 2-ClTrt resin using a 90:5:3:2 (v:v:v:v) mixture of TFA:thioanisole:ethanedithiol:anisole.

Circular Dichroism

CD spectra were collected using an Aviv (Lakewood, NJ) stopped flow CD Spectropolarimeter Model 202SF. The far-UV (190-260 nm) CD spectra were obtained using a 1 mm path length cell and in 10 mM PBS buffer at a pH of 7.5 at 100 μM protein concentrations. The near-UV (245-320 nm) CD spectra were obtained using a 1 cm path length cell and in 10 mM PBS buffer at a pH of 7.5 at 50 μM concentrations. The consensus CD spectra were established as the gaussian weighted average of 40 scans with evaluation of ellipticity every 0.5 nm, each having an averaging time of 0.5 seconds. Proper ‘blank’ spectra were subtracted from all multiple wavelength and single-wavelength spectra and thermal melts utilizing the equation (Θsample - Θ‘blank’)/(L•c•n) where L is the path length, c is the protein concentration, and n is the number of amino acids in the sequence. CD spectra are presented unsmoothed. Concentrations of protein stock solutions for conversion of millidegrees to mean residue ellipticity (MRE) were determined by UV absorbance at 280 nm using the standard extinction coefficients for the tryptophan (5690 M−1cm−1), tyrosine (1280 M−1cm−1), and disulfide-bridged cysteine (125 M−1cm−1) chromophores in 6M GnCl. Thermal melts were conducted using a 1°C/min temperature gradient, a 1 minute equilibration time at each temperature step, and an average of 20 single wavelength measurements. Spectra for GnCl concentrations that spanned the folded to unfolded transition are presented in Figure 4B and their MRE values at 280 nm were converted to ΔG° at 25°C using the equation ΔG° = −RTlnQ where Q = χ/1-χ and χ is the fraction of folded protein at the given concentration of GnCl. ΔG° values were plotted versus the concentration of GnCl and least-squares fit to a linear equation in order to extrapolate the y-intercept value corresponding to the ΔG° in native buffer at 25°C.

Figure 4. Spectroscopic evaluation of the TrpPlexus mini-protein.

Figure 4

(A) Near-UV Circular Dichroism spectra (50 μM protein, 1 cm path length, 10 mM PBS buffer pH 7.5) in units of mean residue ellipticity (MRE). Inset shows the relationship of the positive lobe of ellipticity at 228 nm as a function of temperature. (B) Far-UV CD spectra (100 μM protein, 1 mm path length, 10 mM PBS buffer pH 7.5). (C) GnCl titrations (50 μM protein, 1 cm path length). (inset) MRE values converted to fraction folded at a given concentration of GnCl from a least-squares fit to a sigmoidal function. (D) Sedimentation equilibrium analysis. (E) Temperature dependent fluorescence spectra (excitation 280 nm) in Arbitrary Units (A.U.). Inset provides dual wavelength fluorescence intensity plot (336 nm vs. 345 nm).

NMR spectroscopy

NMR spectra were collected using a Bruker (Billerica, MA) AVANCE III-600 NMR spectrometer equipped with a CPTCI-cryoprobehead. All samples were prepared in H2O and D2O that had been purged of dissolved O2 gas using the freeze-pump-thaw method. 1D and 2D 1H NMR were collected at protein concentrations of 1.8 mM in 90% H2O:10% D2O solutions at pH 3.8 (uncorrected). For determination of 3Jhn-hα values, 1D 1H NMR spectra were obtained with 32k complex data points. Total correlated spectroscopy (TOCSY) spectra were collected utilizing a mixing time of 120 ms using Bruker’s mlevesgpph pulse sequence. 2D 1H-1H nuclear Overhauser effect spectroscopy (NOESY) spectra were collected utilizing a 250 ms mixing time using Bruker’s noesyesgpph pulse sequence. All chemical shifts are reported relative to a 4,4-dimethyl-4-silapentane-1-sulfonic acid (DSS) internal standard.

NMR structure ensemble

NMR assignment and volume calculations were performed using the Sparky NMR package. Using the Rosetta Molecular Modelling package39, 50,000 structures were initialized with φ values to one of the two values dictated by an empirically derived Karplus equation (6.51•cos2(φ-60) − 1.76•cos(φ-60) + 1.635)40 for 3Jhn-hα values > 6.4 Hz (Table S5). Glycine residues were initialized and randomized into the full range of φ/ψ values. Distance constraints were calculated using the tryptophan indole reference cross-peak volume from W15Hη2→W15Hζ3 as this has a known reference distance of 2.46Å. Other indole ring volumes with identical distances were also found close to this reference volume (Vref). The distance from proton i to j was calculated using the equation di,j = 2.46•(Vref/Vi,j)(1/6). Based upon these calculated distances, NOEs were characterized as strong (1.8-2.5Å), medium (1.8-3.3Å), or weak (1.8-5.0Å). Structures were minimized simultaneously with respect to energy and distance constraints (with “soft” 2.5Å tolerances) using an Armijo type line-search algorithm within a Monte-Carlo Metropolis acceptance criterion algorithm. Randomized backbone φ moves were limited to values within the bounds dictated by the Karplus equation +/− 30°. The 25 lowest energy sequences were found to obey all distance constraints within tolerances and were aligned using their Cα backbone atoms. The structure was further supported by conducting identical calculations using the Xplor-NIH software package41-42. Using a standard Xplor-NIH simulated annealing minimization of 10,000 structures, sequences were found to converge to the same backbone structure determined by the Rosetta calculations (Figure S7).

Sedimentation Equilibrium

Sedimentation equilibrium measurements were carried out on a Beckman (Brea, CA) XL-A analytical ultracentrifuge equipped with absorption optics and an An-60 Ti rotor. Analysis was conduced at rotor speeds of 45,000 and 48,000 rpm at 4°C. Data were acquired at two wavelengths per rotor speed setting and processed globally for the best fit to a single-species model of absorbance versus radial distance. Solvent density and protein partial specific volume were calculated according to solvent and protein composition, respectively.

Fluorescence spectroscopy

Fluorescence spectra were obtained using a 1 cm by 2 mm path length cell (excitation path length versus emission path length) in 10 mM PBS buffer at a pH of 7.5 and 50 μM peptide concentrations. The fluorometer was configured with a calibrated Peltier temperature controller.

Miscellaneous

Least-squares fitting and graphical plotting was performed using the Scientific Python (scipy) scipy.optimize.curve_fit and matplotlib libraries. All 3-dimensional-cartoon protein graphics were created using the Schrodinger (New York, NY) PyMOL software package.

RESULTS AND DISCUSSION

Design and characterization of a mini-protein stabilized by a cation-π network

The native sequence on which to initiate the mini-protein design was chosen based on two primary criteria. First, we sought a sequence containing a loop featuring a β–hairpin, which could be nucleated using a d-Pro-Gly motif2,7. Most of the loop regions observed in WSXWS motifs are lacking in canonical forms of secondary structure that could be enforced with rational peptide design (Figure 2). Second, we wanted a sequence containing three arginine residues on the β-strand portion of the fold composing the cation-π interaction network. The only sequence meeting these two criteria is that of the WSXWS motif found in IL3RB (UniprotID: P32927). Thus, we began our investigation by synthesizing a modified 19-residue sequence from the WSXWS motif of IL3RB. The design included two modifications from the native sequence: D-Pro was incorporated at position 9 to promote a β-hairpin, and an Ile→Thr mutation was introduced at position 14 to enhance solubility. The resulting sequence, “RVRVRTSRpGYNGTWSEWS” is denoted as “TrpPlexus”: a portmanteau of ‘tryptophan’ and ‘plexus’, signifying the network arrangement of stabilizing interactions (Figures 4&5)43. TrpPlexus was found to be monomeric at a concentration of 1.8 mM at 4°C by sedimentation equilibrium analysis (Figure 4D)44. The apparent molecular mass was within 10% of that calculated for an ideal monomer, with no systematic deviation of the residuals. The fluorescence emission spectrum (Figure 4E) for TrpPlexus was observed to have two major emission wavelengths of similar intensities, one at 336nm and 345nm. A two-wavelength emission intensity plot is provided as a proxy for the change in emission spectra shape as a function of temperature. A linear relationship between the intensities of the two emission maxima wavelengths was observed and the spectra were found to be completely reversible.

Figure 5. Structure determination of the TrpPlexus mini-protein.

Figure 5

(A) NMR ensemble alignment of the 25 lowest energy structures satisfying distance constraints (Tables S1-S3). Highlighted regions with arrows show long-range NOE cross-peaks of Hε3 and Hζ2 of W15 and W18. Tryptophan indole ring and arginine side-chain atom descriptors are labeled on W15 and W18 as well as R1 and R5. (B) Cartoon of TrpPlexus secondary structure and color representations for each residue in sections A, D, and F. (C) 3Jhn-hα scalar couplings and Hα CSDs of the linear (black) and cyclic disulfide-bridged (green) sequences. (D) (left panels) NOESY spectra (250 ms mixing time, 1.8 mM protein, 10:90 (v:v) D2O:H2O, 4°C) with highlighted long-range NOE interactions present in the spectra for both linear and cyclic structures. (right panel) Unambiguous NOEs supporting the model of the disulfide-bridged cyclic structure. (E) Analytical HPLC trace monitoring the cyclization of the disulfide-bridged sequence. (F) Ramachandran plot of the backbone dihedral angles of the NMR ensemble structures with labels corresponding to the φ and ψ angle domains of the secondary structures present. (inset) Highlights of the β-strand and PPII regions.

Secondary and tertiary structure content of the mini-protein was initially evaluated using circular dichroism (CD) spectroscopy (Figure 4 A,B). In the near-UV (260-320 nm) CD spectrum, ellipticity can arise from indole and phenol absorbance bands when tryptophan or tyrosine side chains are present in an asymmetric environment. Strong CD signals in this region are exclusively observed for sequences containing significant tertiary structure45. Consequently, the near-UV CD signal is a stringent metric for the presence of tertiary structure. The strong near-UV CD signal for TrpPlexus was found to slightly increase in strength at elevated temperatures, indicating thermo-stability (Figure 4B, see discussion below). Conversely, the near-UV CD signal was observed to decrease in strength upon addition of the chaotropic agent guanidinium chloride (GnCl). The near-UV signal demonstrated two-state sigmoidal characteristics as a function of GnCl concentration. The variable GnCl concentration near-UV CD data correlate to a ΔG° for folding of approximately −6.7 kcal/mol in standard aqueous conditions at 25°C (Figure 4C). The far-UV (180-260 nm) CD spectrum includes features indicating the presence of a PPII-helix, such as a lobe of positive ellipticity at 228 nm46-48 that can be attributed to the n→π* transition of amide groups present in a PPII-helix, as well as minor contributions from aromatic side chains49 (Figure 4A). The far-UV CD spectra also display similar temperature-dependent behavior observed in the spectra of other PPII-helical peptides50.

In order to obtain an atomic-resolution solution structure, nuclear magnetic resonance (NMR) experiments were carried out on TrpPlexus (Figure 5). Proton chemical shifts were assigned in the standard manner using 1H-1H TOCSY experiments. Structural restraints to determine the three-dimensional structure were established using 2D 1H-1H NOESY experiments. The assigned NOE cross-peaks were indicative of interdigitation of arginine and tryptophan residues and therefore complemented the near-UV CD data showing the presence of a highly ordered tertiary structure (Figure 4B). Using a set of unambiguous distance constraints from the NOE spectrum including 12 long-range restraints (Tables S1-S3), we determined the three-dimensional solution structure of TrpPlexus that depicts a cation-π stabilized tertiary structure including a β-strand and a left-handed PPII-helix. The NMR ensemble structure is remarkably similar to the secondary and tertiary structure observed in the template WSXWS motif from the Fn3 domain of IL3RB with only minor differences. These small differences include alternate rotamers in the R8 and Y11 residues forming a more compact structure (Figure 5A), and small deviations of the backbone dihedral angles for the ensemble typically within 30° of the angles observed in the x-ray structure of IL3RB. The ω dihedral angle of d-Pro9 was determined to be in a predominantly trans amide conformation as evidenced by strong R8Hα→p9Hδ NOE cross-peaks. Additionally, the presence of NOEs from G10HN→p9Hδ supports the presence of a type I’ β-hairpin. In addition, the neighboring Asn-Gly (N12, G13) motif1,7,9 serves to terminate the nascent β-strand, allowing for a transition to a left-handed PPII-helix. The patterning and relative intensities of the inter-residue and sequential NOE cross-peaks from both tryptophan indole groups match those that would be expected for a PPII-helix displaying the most probable tryptophan rotamer38,51.

The resulting Hα chemical shift deviations (CSDs) from “random-coil” values were consistent in strength with those found in the CSDs of the WSXWS motifs in published NMR solution structures of whole Fn3 domains (PDBid(s): 2LFG, 1GCF, Figure S4). However, relative to CSDs of the Hα protons observed for other mini-proteins, those found for TrpPlexus are relatively small. To ascertain CD spectra and CSD values that would be expected for a “folded control” sequence, a cyclic version of TrpPlexus was designed with a disulfide-bridge (V2→C, S19→C). The disulfide-bond formation of the cyclic TrpPlexus was monitored by both analytical HPLC and ESI mass spectrometry (Figure 5E) to identify an HPLC retention time shift as well as the loss of the mass of the cysteine sulfhydryl protons upon disulfide formation. The linear cysteine-containing sequence was observed to cyclize into two conformations. The major conformer was isolated using preparative HPLC and analyzed by NMR and CD spectroscopy. The long-range NOEs seen in Figure 5D, the measured 3Jhn-hα scalar couplings, and Hα CSDs (Figure 5C) were found to be in good agreement between the linear and cyclized peptides. Furthermore, the CD characterizations of linear and disulfide-bridged structures were found to be nearly identical in strength and overall spectral shape (Figure 4A,B). The similarity of both CD and NMR results strongly validates the structure and stability of the linear TrpPlexus as depicted in Figure 5A.

To explore the influence of cationic groups on fold stability, mutants were designed that replaced particular guanidinyl (arginine) side chain groups with either amino (lysine) cations or charge-neutral urea (l-citrulline) groups (Figure 6). The near-UV CD signal monitored at 280 nm provides a metric of relative tertiary structure content (Figure 6B). The absence of a strong near-UV CD signal for the l-citrulline triple mutant supports the conclusion that cation-π, and not stacking interactions or hydrogen bonding, is a primary determinant of TrpPlexus’s folding. The near-UV CD signals, as a function of temperature, for TrpPlexus (“RRR”) and the “KRK” mutant demonstrate weakly sigmoidal transitions to stronger ellipticities at elevated temperatures (Figure 6). The near-UV CD signal as a function of temperature, while being an important indicator of tertiary structure, is also dependent on local environmental factors. It is possible that the weak transitions are the result of small changes in the distributions of tryptophan and cationic side-chain rotamers at elevated temperatures. For the other cationic group mutants “RKR” and “KKK”, the observed transition in the ellipticities can similarly be attributed to rotamer populations fluctuating as cation-π interactions become stronger at elevated temperatures20-21. These data can be described using a model in which a structure containing a β-hairpin is able to situate cationic and tryptophan residues on two adjacent peptide strands in proximity to one another. However, in order to achieve a discrete tertiary structure as opposed to a molten globule, a central arginine is required to interdigitate the two tryptophan residues, resulting in a gain of tertiary structure that enhances near-UV ellipticity. Thus, the central arginine residue is the critical determinant of tertiary structure as evidenced by near-UV CD signal strength.

Figure 6. Near-UV CD thermal melts for the TrpPlexus cation-π interaction network mutants.

Figure 6

(A) Schematic of the cation-π network of TrpPlexus detailing variable amino acid positions 1, 3, and 5 in the mutant sequences. (B) Near-UV CD thermal melts for TrpPlexus (“RRR”) and each mutant monitoring mean residue ellipticity at 280 nm (50 μM protein, 1 cm path length, 10 mM PBS buffer pH 7.5). Inset compares side-chain functionalities of the arginine (R), lysine (K), and l-citrulline (Cit) residues.

CONCLUSION

We demonstrate that establishing a network of solvent-exposed cation-π interactions can be an effective strategy in the design of a novel miniature protein tertiary structure. We also validate a strategy in which solvent-exposed cation-π interactions serve as an alternative to hydrophobic-core packing as a protein design paradigm16. Excising polypeptide sequences from protein structures having the following characteristics could prove valuable in the development of additional well-ordered mini-proteins: (1) rich in arginine and tryptophan side-chain tertiary contacts as well as contacts between the side chains of other residues containing cations (lysine and histidine) and π electron systems (phenylalanine and tyrosine), (2) secondary structure content supporting or enabling those contacts such as a β-hairpin, which can be stabilized and nucleated via canonical peptide design, and (3) an evolutionary conservation or homology between similar sequences maintaining a putative cation-π network. A plethora of small sequence motifs stabilized by cation-π interactions can be found in the Protein Databank. This study demonstrates a successful workflow for miniprotein discovery initiated by bioinformatic sequence/structure analysis, followed by canonical peptide design stabilizing small secondary structure motifs, and validation via thorough structural characterization.

This report has detailed a new miniature protein offering several distinctive features, which present new opportunities for future investigation in the context of an isolated well-folded system. These features include: (1) an extensive cation-π interaction network, as opposed to a binary isolated interaction, (2) a novel miniature protein topology composed of a β-strand:loop:PPII-helix, (3) an isolated β-strand in a monomeric protein, which could offer opportunities for modulating β-sheet formation, and (4) a left-handed poly-proline type-II helix formed in the absence of proline residues. The PPII-helix is stabilized through interdigitated tertiary contacts. This mode of PPII-helix stabilization in the context of a miniature protein could afford a new scaffold to emulate the i and i+3 PPII-helical motif positions recognized by SH3 and WW protein domains. The continued engineering of miniature protein structures to contain β -strand52, PPII53-54, or poly-arginine55-57 motifs may prove advantageous for developing a next generation of peptide therapeutics and biochemical tools.

Supplementary Material

Supplemental Material

ACKNOWLEDGMENT

The authors would like to thank Chin Lin for NMR assistance, Daniella Buccella for fluorescence spectroscopy advice and assistance, Neville Kallenbach for insightful discussions and critical manuscript review, and Min Lu (Rutgers University) for sedimentation equilibrium analysis.

Funding Sources

This work was supported by the National Science Foundation (Award CHE-1152317 to K.K.). T.W.C. would like to gratefully acknowledge the New York University Biology Department for the Fleur Strand Fellowship as well as the Graduate School of Arts and Sciences for the Horizon Fellowship in the Natural and Physical Sciences. R.B. would like to acknowledge support from the Simons Foundation. N.J.T. acknowledges support from the National Institutes of Health (R01AI108889). NMR data collected using the TCI CryoProbe at NYU was supported by an NIH S10 grant (OD016343).

ABREVIATIONS

CD

circular dichroism

NMR

nuclear magnetic resonance

PPII

left-handed poly-peptide helix

GnCl

guanidinium chloride

PDB

the Protein Database

PDBid

PDB identifier

APP

Avian Pacreatic Peptide

NOE

Nuclear Overhauser Effect

ESI

electrospray ionization

IL3RB

Cytokine receptor common subunit beta

OMIM

Online Mandelian Inheritance in Man database

OMIMid

OMIM database identifier

ACN

acetonitrile

MRE

mean residue ellipticity

CSD

chemical shift deviation (from ‘random coil’ values)

PBS

phosphate buffered saline

TFA

trifluoroacetic acid

MS

mass spectrometry

MALDI-TOF

Matrix Assisted Laser Desorption/Ionization-Time of flight

TOCSY

total correlation spectroscopy

NOESY

nuclear Overhauser effect spectroscopy

Fn3

fibronectin type III domain

Footnotes

ASSOCIATED CONTENT

Supporting Information. Supplementary includes additional characterization data and bioinformatics analyses in Tables S1-S10, Figures S1-S9, as well as NMR structure coordinates. This material is available free of charge via the Internet at http://pubs.acs.org.

NOTES AND REFERENCES

  • 1.Kortemme T, Ramirez-Alvarado M, Serrano L. Science. 1998;281:253–256. doi: 10.1126/science.281.5374.253. [DOI] [PubMed] [Google Scholar]
  • 2.Struthers MD, Cheng RP, Imperiali B. Science. 1996;271:342–345. doi: 10.1126/science.271.5247.342. [DOI] [PubMed] [Google Scholar]
  • 3.Neidigh JW, Fesinmeyer RM, Andersen NH. Nat. Struct. Biol. 2002;9:425–430. doi: 10.1038/nsb798. [DOI] [PubMed] [Google Scholar]
  • 4.Honda S, Yamasaki K, Sawada Y, Morii H. Structure. 2004;12:1507–1518. doi: 10.1016/j.str.2004.05.022. [DOI] [PubMed] [Google Scholar]
  • 5.Cochran AG, Skelton NJ, Starovasnik MA. Proc. Natl. Acad. Sci. U.S.A. 2001;98:5578–5583. doi: 10.1073/pnas.091100898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Hodges AM, Schepartz A. J. Am. Chem. Soc. 2007;129:11024–11025. doi: 10.1021/ja074859t. [DOI] [PubMed] [Google Scholar]
  • 7.Espinosa JF, Gellman SH. Angew. Chem., Int. Ed. 2000;39:2330–2333. doi: 10.1002/1521-3773(20000703)39:13<2330::aid-anie2330>3.0.co;2-c. [DOI] [PubMed] [Google Scholar]
  • 8.Ottesen JJ, Imperiali B. Nat. Struct. Biol. 2001;8:535–539. doi: 10.1038/88604. [DOI] [PubMed] [Google Scholar]
  • 9.Kier BL, Andersen NH. J. Am. Chem. Soc. 2008;130:14675–14683. doi: 10.1021/ja804656h. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Dougherty DA. Science. 1996;271:163–168. doi: 10.1126/science.271.5246.163. [DOI] [PubMed] [Google Scholar]
  • 11.Gallivan JP, Dougherty DA. Proc. Natl. Acad. Sci. U.S.A. 1999;96:9459–9464. doi: 10.1073/pnas.96.17.9459. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Trotta CR, Paushkin SV, Patel M, Li H, Peltz SW. Nature. 2006;441:375–377. doi: 10.1038/nature04741. [DOI] [PubMed] [Google Scholar]
  • 13.Xiu X, Puskar NL, Shanate JAP, Lester HA, Dougherty DA. Nature. 2009;458:534–537. doi: 10.1038/nature07768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Dagil R, Knudsen MJ, Olsen JG, O’Shea C, Franzmann M, Goffin V, Teilum K, Breinholt J, Kragelund BB. Structure. 2012;20:270–282. doi: 10.1016/j.str.2011.12.010. [DOI] [PubMed] [Google Scholar]
  • 15.Hilton DJ, Watowich SS, Katz L, Lodish HF. J. Bio. Chem. 1996;271:4699–4708. doi: 10.1074/jbc.271.9.4699. [DOI] [PubMed] [Google Scholar]
  • 16.Gallivan JP, Dougherty DA. J. Am. Chem. Soc. 2000;122:870–874. [Google Scholar]
  • 17.Shi Z, Olson C, Kallenbach N. J. Am. Chem. Soc. 2002;124:3284–3291. doi: 10.1021/ja0174938. [DOI] [PubMed] [Google Scholar]
  • 18.Tatko CD, Waters ML. Prot. Sci. 2003;12:2443–2452. doi: 10.1110/ps.03284003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Hughes RM, Waters ML. J. Am. Chem. Soc. 2006;128:12735–12742. doi: 10.1021/ja061656g. [DOI] [PubMed] [Google Scholar]
  • 20.Prajapati RS, Sirajuddin M, Durani V, Sreeramulu S, Varadarajan R. Biochemistry. 2006;45:15000–15010. doi: 10.1021/bi061275f. [DOI] [PubMed] [Google Scholar]
  • 21.Chakravarty S, Varadarajan R. Biochemistry. 2002;41:8152–8161. doi: 10.1021/bi025523t. [DOI] [PubMed] [Google Scholar]
  • 22.Crooks GE, Hon G, Chandonia JM, Brenner SE. Gen. Res. 2004;14:1188–1190. doi: 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Bravo J, Staunton D, Heath J, Jones E. EMBO J. 1998;17:1665–1674. doi: 10.1093/emboj/17.6.1665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wang X, Rickert M, Garcia K. Science. 2005;310:1159–1163. doi: 10.1126/science.1117893. [DOI] [PubMed] [Google Scholar]
  • 25.Laporte SL, Juo ZS, Vaclavikova J, Colf LA, Qi X, Heller NM, Keegan AD, Garcia KC. Cell. 2008;132:259–272. doi: 10.1016/j.cell.2007.12.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Aritomi M, Kunishima N, Okamoto T, Kuroki R, Ota Y, Morikawa K. Nature. 1999;401:713–717. doi: 10.1038/44394. [DOI] [PubMed] [Google Scholar]
  • 27.Syed RS, Reid SW, Li C, Cheetham JC, Aoki KH, Liu B, Zhan H, Osslund TD, Chirino AJ, Zhang J, Finer-Moore J, Elliott S, Sitney K, Katz BA, Matthews DJ, Wendoloski JJ, Egrie J, Stroud RM. Nature. 1998;395:511–516. doi: 10.1038/26773. [DOI] [PubMed] [Google Scholar]
  • 28.Tamada T, Honjo E, Maeda Y, Okamoto T, Ishibashi M, Tokunaga M, Kuroki R. Proc. Natl. Acad. Sci. U.S.A. 2006;103:3135–3140. doi: 10.1073/pnas.0511264103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Skiniotis G, Lupardus P, Martick M, Walz T, Garcia KC. Mol. Cell. 2008;31:737–748. doi: 10.1016/j.molcel.2008.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Rossjohn J, McKinstry WJ, Woodcock JM, McClure BJ, Hercus TR, Parker MW, Lopez AF, Bagley CJ. Blood. 2000;95:2491–2498. [PubMed] [Google Scholar]
  • 31.Broutin I, Jomain J, Tallet E, Agthoven J, Raynal B, Hoos S, Kragelund BB, Kelly PA, Ducruix A, England P, Goffin V. J. Biol. Chem. 2010;285:8422–8433. doi: 10.1074/jbc.M109.089128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lupardus PJ, Birnbaum M, Garcia K. Structure. 2010;18:332–342. doi: 10.1016/j.str.2010.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Varghese JN, Moritz RL, Lou MZ, Donkelaar A, Ji H, Ivancic N, Branson KM, Hall NE, Simpson RJ. Proc. Natl. Acad. Sci. U.S.A. 2002;99:15959–15964. doi: 10.1073/pnas.232432399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Huyton T, Zhang JG, Luo CS, Lou MZ, Hilton DJ, Nicola NA, Garrett TP. Proc. Natl. Acad. Sci. U.S.A. 2007;104:12737–12742. doi: 10.1073/pnas.0705577104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Patino E, Kotzsch A, Saremba S, Nickel J, Schmitz W, Sebald W, Mueller TD. Structure. 2011;19:1864–1875. doi: 10.1016/j.str.2011.08.015. [DOI] [PubMed] [Google Scholar]
  • 36.Hamming OJ, Kang L, Svensson A, Karlsen JL, Rahbek-Nielsen H;, Paludan SR, Hjorth SA, Bondensgaard K, Hartmann R. J. Biol. Chem. 2012;287:9454–9460. doi: 10.1074/jbc.M111.311084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Carpenter B, Hemsworth GR, Wu Z, Maamra M, Strasburger CJ, Ross RJ, Artymiuk PJ. Structure. 2012;20:487–497. doi: 10.1016/j.str.2012.01.019. [DOI] [PubMed] [Google Scholar]
  • 38.Dunbrack RL, Karplus MA. J. Mol. Biol. 1993;230:543–574. doi: 10.1006/jmbi.1993.1170. [DOI] [PubMed] [Google Scholar]
  • 39.Leaver-Fay A, et al. An Object-Oriented Software Suite for the Simulation and Design of Macromolecules. In: Johnson ML, Brand L, editors. Methods in Enzymology. Computer Methods, Part C. Vol. 487. Academic Press; New York: 2011. pp. 545–574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Vuister GW, Bax A. J. Am. Chem. Soc. 1993;115:7772–7777. [Google Scholar]
  • 41.Schwieters CD, Kuszewski JJ, Tjandra N, Clore GM. J. Magn. Res. 2003;160:66–74. doi: 10.1016/s1090-7807(02)00014-9. [DOI] [PubMed] [Google Scholar]
  • 42.Schwieters CD, Kuszewski JJ, Clore GM. Progr. NMR Spectroscopy. 2006;48:47–62. [Google Scholar]
  • 43.Within this report, the notation xy specifies a particular amino acid x at position number y in the 19 residue TrpPlexus sequence. Lower-case “p” denotes a D-Proline residue.
  • 44.Sedimentation equilibrium was conducted on the NMR sample used for solution structure determination.
  • 45.Kelly SM, Price NC. Curr. Prot. Pep. Sci. 2000;4:349–384. doi: 10.2174/1389203003381315. [DOI] [PubMed] [Google Scholar]
  • 46.Horng JC, Raines RT. Protein Sci. 2006;1:74–83. doi: 10.1110/ps.051779806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Chellgren BW, Creamer TP. Biochemistry. 2004;43:5864–5869. doi: 10.1021/bi049922v. [DOI] [PubMed] [Google Scholar]
  • 48.Krimm S, Tiffany ML. Israel J. Chem. 1974;12:189–200. [Google Scholar]
  • 49.The lack of a significant exciton couplet in the far-UV CD spectra indicates that the indole rings of the tryptophan residues of TrpPlexus are not in a stacked or edge-to-face configuration5,9.
  • 50.Brown AM, Zondlo NJ. Biochemistry. 2012;51:5041–5051. doi: 10.1021/bi3002924. [DOI] [PubMed] [Google Scholar]
  • 51.p = 0.54 for χ12 at −66°/100° and φ/ψ at −90°/150°. Strong W18Hδ1→W18HN and W15Hδ1→W15HN NOE cross-peaks were observed indicating the presence of this rotamer.
  • 52.Zheng J, Baghkhanian AM, Nowick JS. J. Am. Chem. Soc. 2013;135:6846–6852. doi: 10.1021/ja310817d. [DOI] [PubMed] [Google Scholar]
  • 53.Nguyen JT, Turck CW, Cohen FE, Zuckermann RN, Lim WA. Science. 1998;282:2088–2092. doi: 10.1126/science.282.5396.2088. [DOI] [PubMed] [Google Scholar]
  • 54.Bunagan MR, Yang X, Saven JG, Gai F. J. Phys. Chem. B. 2006;110:3759–3763. doi: 10.1021/jp055288z. [DOI] [PubMed] [Google Scholar]
  • 55.Appelbaum JM, LaRochelle JR, Smith BA, Balkin DM, Holub JM, Schephartz A. Chem. Bio. 2012;19:819–830. doi: 10.1016/j.chembiol.2012.05.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Smith B, Daniels DS, Coplin A, Jordan G, McGregor L, Schepartz A. J. Am. Chem. Soc. 2008;130:2948–2949. doi: 10.1021/ja800074v. [DOI] [PubMed] [Google Scholar]
  • 57.Li M, Tao Y, Shu Y, LaRochelle JR, Steinauer A, Thompson D, Schepartz A, Chen Z-Y, Liu D. J. Am. Chem. Soc. 2015;137:14084–14093. doi: 10.1021/jacs.5b05694. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material

RESOURCES