Abstract
Intrinsically disordered proteins (IDPs) are characterized by substantial conformational plasticity and undergo rearrangements of the time-averaged conformational ensemble on changes of environmental conditions (e.g., in ionic strength, pH, molecular crowding). In contrast to stably folded proteins, IDPs often form compact conformations at acidic pH. The biological relevance of this process was, for example, demonstrated by nuclear magnetic resonance studies of the aggregation prone (low pH) state of α-synuclein. In this study, we report a large-scale analysis of the pH dependence of disordered proteins using the recently developed meta-structure approach. The meta-structure analysis of a large set of IDPs revealed a significant tendency of IDPs to form α-helical secondary structure elements and to preferentially fold into more compact structures under acidic conditions. The predictive validity of this novel approach was demonstrated with applications to the tumor-suppressor BASP1 and the transcription factor Tcf4.
Keywords: intrinsically disordered proteins, protein meta-structure, pH dependence, structural biology, biomolecular NMR, EPR spectroscopy
Intrinsically disordered proteins (IDPs) display conformational plasticity and undergo structural rearrangements upon changes of pH. Here we provide experimental evidence for α-helix formation and structural compaction of IDPs under acidic conditions. Sequence analysis of a large set of IDPs revealed a general tendency to form α-helical secondary structure elements and to preferentially fold into more compact structures under acidic conditions suggesting that elements sensitive to protonation-induced stabilization might be crucial in folding-upon-binding events.
Introduction
Intrinsically disordered proteins (IDPs) display remarkable physicochemical properties that allow for unique regulatory functionalities in living organisms.1,2 It is well established that IDPs constitute interaction hubs in the protein interactomes of organisms and have therefore attracted considerable scientific interest.3 In addition to their innate conformational flexibility (plasticity), IDPs are also sensitive probes for changes of environmental parameters, particularly changes in pH values.4–6 In a recent thorough analysis, it was shown that the net charge (and proline content) is an important factor determining hydrodynamic radii and governing structural compaction of IDPs.7 For example, it was shown that although α-synuclein remains natively unfolded under acidic conditions, local secondary structure propensities in proximity to acidic residues change on variations in pH and the conformational ensemble becomes enriched in compact structures with pronounced local rigidity of the protein backbone.8 Similar physicochemical behavior was reported for the thymic hormone prothymosin-α9 Using a suite of different biophysical techniques (e.g., small-angle X-ray scattering (SAXS), UV, and NMR), it was shown that prothymosin-α cooperatively folds from a denatured state at neutral pH into a partially folded conformation at lower pH. This folding transition is initiated by the formation of α-helices in glutamic acid–rich regions in which the neutralization of negatively charged side-chains facilitates the hydrophobic collapse.9
Protonation and deprotonation are fundamental chemical mechanisms to modulate ionization states and physicochemical properties of biomolecules and therefore of great relevance for biological systems. In eukaryotic cells, the pH in different organelles is highly variable ranging from 7.2 to 7.4 in the cytosol and nucleus to 4.0–5.5 in the endosomes and lysosomes.10 Thus, pH dysregulation frequently leads to cellular dysfunction and ultimately to disease phenotypes. For example, tumor cells have been shown to behave abnormally under acidic cellular pH conditions11 and pH alterations in the acidic organelles lead to oncological processes and lysosomal storage disorder.12 In addition to modulating the ionization states of molecules and their interaction patterns with authentic binding partners, alteration of pH can also induce structural changes in proteins. Typically, proteins are only marginally stable and small changes in environmental conditions can induce substantial structural rearrangements. Given the small stability differences between the folded and unfolded state(s), slight changes of structure-disrupting factors (concentration of chemical denaturants, pH, or temperature) may lead to substantial and even cooperative changes in the population of folded and unfolded states, respectively. It is important to note that partial proton transfer is often sufficient for catalytic acceleration of chemical reactions and complete proton transfer might not be necessary. This is described in the concept of general versus specific acid catalysis in physical organic chemistry and enzymology. Protonation of a given substrate modifies its electronic state and may yield, for example, a strong electrophilic center that may be prone to attacks of nucleophiles. In return, this means that a reduction of electrostatic repulsion among deprotonated, acidic residues of a protein does not necessarily require complete protonation of acidic side chains.
In this study, we use the recently developed meta-structure concept, a computational approach,13 to analyze structural changes in IDPs under acidic conditions. The meta-structure analysis provides quantitative (on a per residue basis) information about residue structural complexity (compactness) and local secondary structure. Compactness is inversely related to local residue solvent-exposure, whereas local secondary structure values are defined analogously to the NMR secondary Cα chemical shift (α-helices: positive; β-sheets: negative). Numerous applications have already demonstrated the applicability of this approach.14–17 In this study, we computationally consider the effect of lowering the pH by altering the primary sequence through replacing the negatively charged carboxylate groups of Asp/Glu residues with neutral (hydrogen-bond donor) amide groups of Asn/Gln residues. The predictive validity of our meta-structure approach is illustrated by its application to the IDPs BASP1, a tumor-suppressor down regulated by the oncogenic transcription factor Myc,18 and Tcf4, a transcription factor involved in the Wnt signaling pathway.19
Results
The meta-structure derived compactness and local secondary structure parameters were used to assess pH-induced changes in protein structural features. Residue-specific compactness values quantify the spatial embedding of individual residues within the 3D protein structures. Residues in the interior of a structure exhibit large compactness values, whereas residues located on the surface and exposed to the solvent display small (even negative in case of conformationally highly flexible segments) values. The meta-structure derived secondary structure parameter is defined in analogy to the well-established NMR 13Cα chemical shift index, with positive values for α-helices and negative values indicating the presence of an extended conformation. First, we evaluated the performance of the approach by comparing calculated compactness values of IDPs (taken from the DisProt database) with well-folded proteins deposited in the PDB database. As can be seen from the histograms shown in Figure 1(A), IDPs are characterized by significantly smaller compactness values (∼230) compared to their well-folded counterparts (∼330), indicating that compactness values are valuable quantitative probes for structural compaction of proteins. Then, we analyzed whether the calculated local secondary structure parameters are indicative of α-helices and β-strands. To facilitate the analysis, protein backbone dihedral angles φ and ψ were extracted for a set of 857 proteins taken from the PDB database (protein structure database designed by Kontaxis et al.20). Dihedral angle combinations were transformed into a single geometrical parameter using the well-established relationship: 3cosΩ=1−4cos2[(φ+ψ)/2].21 Figure 1(B) shows that meta-structure derived secondary structure values are consistently positive for residues located in α-helical segments, whereas residues populating extended structural elements (β-strands or polyproline II helices) display negative values. It has to be noted that, although α-helices can be directly inferred from the meta-structure analysis, negative secondary structure values are found for both β-strands and other extended segments (polyproline II helices). As a summary, we conclude that changes in local structure preferences (e.g., α-helix to β-strand transition) can be correctly identified by this approach.
Figure 1.

Large-scale performance evaluation of meta-structure protein parameters. (A) Histogram of calculated residue compactness values for proteins taken from the PDB (black) or DisProt database (red). On average, significantly smaller compactness values are found for intrinsically disordered proteins. (B) Comparison between calculated secondary structure parameters and local secondary structure (expressed by the backbone parameter Ω). Protein backbone dihedral angles φ and ψ are transformed into a single geometric parameter using the well-established relationship: 3cosΩ = 1−4cos2[(φ+ψ)/2].21 Note that α-helical regions have Ω values between 90 and 110, whereas β-sheets are characterized by Ω ≍ 140–180. The protein dataset was taken from Bax and coworkers.20
We have applied our meta-structure approach to two well-studied examples of acid-induced structural changes of IDPs. A recent NMR analysis of the pH-dependence of α-synuclein revealed significant structural adaptions in the C-terminal (Asp/Glu-rich) region. Although at near neutral pH residues in the C-terminal part populate extended conformations, propensities for α-helical elements are increased on lowering pH.8 Surprisingly, independent FTIR data suggested an increase in populations of β-sheet conformations.22 The secondary structure change was accompanied by an overall compaction of the polypeptide chain at low pH as probed by pulsed-field gradient (PFG) measurements of the hydrodynamic radius, RH, and long-range paramagnetic relaxation enhancement data.8 Guinier analysis of SAXS data also showed that at neutral pH the radius of gyration of α-synuclein amounts to Rg = 40 Å, whereas it drops to ∼30 Å at low pH.22 The meta-structure analysis shown in Figure 2(A) convincingly supports these findings: (i) increase in α-helical propensities in the C-terminal (Asp- and Glu-rich) region; (ii) increased population of β-sheet conformations in the N-terminal region that houses the aggregation relevant NAC region of α-synuclein; and (iii) significant local compaction of the C-terminal region.
Figure 2.

Meta-structure derived pH dependence of intrinsically disordered proteins. The effect of protonation (low pH condition) is considered by replacing Asp and Glu with Asn and Gln residues in the primary sequence. The meta-structure parameters are calculated for both wild-type and “mutated” protein forms. Differences in meta-structure derived local secondary structure (black) and compactness values (gray) are related to pH-induced alterations of local backbone geometry and compaction of the polypeptide chain. The differences shown in the figure are defined as low pH–high pH. Data are given for (A) α-synuclein and (B) prothymosin α.
A second example is given by prothymosin α. It was shown that prothymosin α exhibits an increase in α-helical content from nearly 0% at high pH to about 15% at low pH, coincident with a considerable reduction of the radius of gyration (high pH: 37.8 Å; low pH: 27.6 Å).9 Inspection of Figure 2(B) shows that the significant increase in compactness and α-helical content compared to neutral conditions is correctly identified by the meta-structure analysis. Summing up, these two applications clearly demonstrated that the novel meta-structure approach provides reliable residue-specific information about the impact of protonation on local second structure and structural compaction.
In a second step, we applied our approach to two IDPs: BASP1, a tumor-suppressor, and Tcf4, a transcription factor involved in Wnt signaling. BASP1 is an N-myristoylated protein that is involved in neurite outgrowth and plasma membrane organization23 and has been shown to be involved in transcription regulation by acting as a co-suppressor of WT1 function (Wilms' tumor suppressor protein 1).24,25 Figure 3(A) shows the results of meta-structure analysis of BASP1. From the residue plot it can be deduced that under acidic conditions (pH 2) BASP1 displays larger fractions of α-helical regions as compared to pH 6. Furthermore, it samples more compact conformations, as indicated by the significantly larger compactness values. The meta-structure-predicted overall compaction of BASP1 was experimentally verified using PFG NMR diffusion measurements (PFG-DOSY), although significant scatter in the extracted diffusion constants impaired a detailed quantitative analysis. Additional support for the structural compaction of BASP1 was obtained from well-established NMR saturation experiments.26 It was shown that the SOFAST-HMQC experiment can efficiently be used to probe 1H-1H spin diffusion or NOE effects, when a selective inversion pulse (Hsat) is applied on aliphatic protons before the start of the pulse sequence. Two data sets are recorded with (Isat) and without (Iref) the inversion pulse Hsat. The intensity ratio (λNOE = Isat/Iref) measured in the two SOFAST-1H-15N-HMQC spectra depends on spin diffusion effects and thus probes the structure and dynamics of proton spin networks (density and mobility) in proteins.26 In well-structured, globular proteins spin diffusion is highly efficient leading to λNOE << 1, whereas in loosely folded proteins (random coils, molten globules) λNOE ≍ 1. Thus, we have applied HET-SOFAST-HMQC to BASP1 to probe structural compaction under high and low pH conditions. The significant decrease of λNOE on lowering pH (0.75–0.60) again indicates structural compaction of BASP1 under low pH conditions. The overall compaction of BASP1 at low pH was independently corroborated by electron paramagnetic resonance (EPR)-based double electron-electron resonance (DEER) measurements.27 Figure 3(B) shows DEER time traces obtained at different pH values (pH 1 and pH 6). Changes in solution pH clearly influence effective modulation depths, Δeff, and the pair-wise distance distributions, P(r). At low pH, the effective modulation depths, Δeff, is larger and P(r) is shifted toward shorter distances. More specific information about local structural dynamics changes was obtained using residue-specific NMR data. Inspection of Figure 3(C) displays a clear correlation between meta-structure derived [Fig. 3(C), top] and experimental NMR-derived differential secondary structural propensity (SSP) changes [SSP (pH2) - SSP (pH6); Fig. 3(C), middle]. Consistently, most pronounced secondary structural changes are found in the glutamic acid-rich region 30–100, whereas smaller secondary structural changes were observed for segments around residues 180 and 230. The regions of increased α-helical content at low pH coincide with regions displaying reduced local mobility under acidic conditions as indicated by differential 15N T2 values (15N T2 (pH6) - 15N T2 (pH2)). These regions appear as local maxima in the differential 15N T2 versus residue plot [Fig. 3(C), bottom]. Interestingly, the global compaction of BASP1 at low pH is further indicated by larger 15N T2 values (smaller apparent correlation time τC) [Fig. 3(C), bottom]. It should be noted that on structural compaction BASP1 retains significant internal mobility that is significantly different to the behavior of well-folded (ordered) proteins. These local changes in internal dynamics lead to residue-specific modulation of the experimental 15N T2 values, for example, local secondary structure formation.
Figure 3.

Experimental verification of meta-structure derived structural changes of BASP1 on lowering pH. (A) Differences in meta-structure derived local secondary structure (black) and compactness values (gray) between acidic and neutral conditions. (B) Schematic illustration of coarse-grained structural changes of BASP1 on changes in solution pH, as detected by DEER spectroscopy. On lowering pH, the two labelling sites (C110 and C229) spatially approach, indicating compaction of the IDP (top). This is evident from time traces (lower left) at pH 6 (red) and at pH 1 (blue) and the corresponding distance distributions (lower right). (C) Comparison between meta-structure derived secondary structure changes (upper trace), NMR derived secondary structure propensities (SSP) changes (middle trace) and differential local conformational dynamics probed by 15N T2 relaxation on lowering pH (lower trace).
The second IDP analyzed was the DNA-binding T-cell factor/lymphoid enhancer factor (Tcf4/LEF), a prominent binding partner of β-catenin. β-Catenin is a central unit in the Wnt signaling pathway, and constitutive activation of the Wnt signaling pathway is involved in the development of various human malignancies, like colorectal carcinomas, melanomas, and ovarian carcinomas.28 Although detailed structural information of unbound Tcf4 is still lacking, it was shown by CD spectroscopy that Tcf4 in its apo-state is devoid of significant secondary structure elements and is reminiscent of a random coil.29 This lack of global structure is also reflected in small meta-structure derived compactness. The average residue compactness of Tcf4 was found to be about 54, which is considerably smaller than the average value for a stably structured, globular protein (about 300).13 Meta-structure calculations performed under acidic conditions revealed that at low pH Tcf4 is more compact and significantly more α-helical [Fig. 4(A)]. To verify this prediction, NMR signal assignment was performed at neutral and low pH (2.0). The increase in α-helical content at low pH was clearly observed in the SSP analysis [Fig. 4(B)]. Interestingly, the two α-helical regions (α1 and α2) displaying more α-helical propensities under acidic than under neutral conditions constitute major parts of the Tcf4 interaction interface and provide most of the binding energy for the β-Catenin/Tcf4 complex [Fig. 4(C)]. It is particularly noteworthy, that the region α1 exhibits the largest SSP changes [Fig. 4(B,D)] and thus points to a significantly larger conformational plasticity of this part of Tcf4.
Figure 4.

NMR data and meta-structure analysis of Tcf4. (A) Differences in meta-structure derived local secondary structure (black) and compactness values (gray) between low and high pH conditions. (B) Differences of NMR-derived secondary structure propensities (Delta SSP) derived under acidic conditions and neutral pH as a function of residue position. Overall, only small 13Cα secondary chemical shift deviations from random coil values are observed for Tcf4 under neutral conditions indicative of marginal residual local secondary structure in Tcf4. Significant increases in 13Cα secondary chemical shifts on lowering the pH were identified in the recognition helices α1 and α2. Missing peaks are due to low signal-to-noise ratios resulting from considerable chemical exchange of amide protons with bulk water. (C) Conformational plasticity of the Tcf4 interaction region. Overlay of 3D structures of β-catenin/Tcf4 protein complexes (1JPW,1JDH). β-Catenin and Tcf4 molecules are shown in green and red (1JPW)39 or cyan and blue (1JDH).40 Interestingly, conformational plasticity (polymorphism, see text) was found for the recognition element α1. (D) The polymorphic recognition element α1 of Tcf4 binds into a basic β-catenin–binding cleft (1JDH).40 Negatively charged Tcf4 (cyan) residues located in the recognition element α1 are shown in orange, whereas β-catenin (gray) lysine and arginine residues located in the basic binding cleft are shown as blue surfaces.
Finally, to analyze whether the observed pH-dependent conformational changes might constitute a conserved feature within the IDP domain, we performed a large-scale analysis of human IDPs. Because the average mean compactness value for a protein in the PDB is about 300,13 we used a threshold of 200 for the selection of human IDPs. Only proteins with mean compactness values smaller than this threshold were considered (in total 1013 proteins were analyzed). We thus only consider globally disordered proteins and exclude globular proteins with (long) unstructured linker segments.
For this set of human IDPs, we calculated local secondary structure and compactness values for high and low pH forms. The histogram of pair-wise secondary structure and compactness differences is shown in Figure 5. Overall, we found a slight but significant preponderance of α-helical content at low pH and increased compactness values on decreasing pH seem to be a conserved feature for IDPs. Moreover, considering only longer residue stretches (larger than 7 residues) with consistent local secondary structure changes, the preponderance of α-helical structures at low pH becomes even more pronounced [Fig. 5(A), insert].
Figure 5.

Large-scale analysis of pH dependencies of (1013) human IDPs. Histograms of meta-structural changes for individual residues are shown. (A) Histogram of individual local secondary structure changes on lowering pH. Positive and negative deviations are indicative for larger α-helical or β-strand propensities. The slightly skewed distribution suggests a more pronounced tendency for IDPs to form α-helical structures at low pH conditions. This effect becomes more visible when considering meta-structure deviations for a continuous stretch of seven residues (corresponding to two α-helical turns). Relative counts for longer (larger than seven residues) continuous α-helical and β-sheet regions are shown in the inset (see text). The formation of α-helical structures is approximately three times more favourable compared to β-strand elements. (B) Histogram of individual compactness changes on lowering pH. Positive deviations indicate structural compaction of the polypeptide chain.
Discussion
IDPs are intriguingly sensitive to changing environmental conditions. Given that local changes of pH are of high relevance to in vivo functionalities of these protein species, local pH levels or proton concentrations are tightly controlled in living organisms. Although detailed studies of the effect of pH on IDP conformational space are rare, recent experimental findings indicate that a decreasing pH is likely to be accompanied by the formation of α-helices in glutamic acid-rich regions and a subsequent folding transition to a non-molten globule state for which the hydrophobic collapse is facilitated by neutralization of negatively charged side-chains. In this study, we have introduced a novel meta-structure approach that can be used to predict the pH dependence of IDPs using only primary sequence information. The pH-dependent differences in meta-structure derived local structure and compactness values were found to be in good agreement with experimental findings. Data obtained on a large set of intrinsically disordered human proteins show that IDPs fold under acidic conditions into more compact structures with higher α-helical content largely due to reduced electrostatic repulsion of negatively charged side chains. It was shown recently that the inherent context-independent α-helical propensities are significantly higher for Asn and Gln than for Asp and Glu, respectively.30 The meta-structure derived increases in α-helical propensities are not only related to individual (context-independent) amino acid specific properties but also take into account context-dependent (primary sequence) influences.
This observation is also particularly interesting in view of a recent finding showing that the evolution of IDP sequences is constrained and involves a significant bias in the amino acid composition.31 In contrast to globular, stably folded proteins structural compaction in IDPs is accompanied by the formation of α-helices and characterized by a low β-structure propensity.31 Therefore, avoiding aggregation despite preformed molecular recognition elements is an important driving force in protein sequence evolution of IDPs. We showed that similar mechanisms are responsible for maintaining soluble, functionally active and nonaggregating IDPs on protonation or under acidic conditions.
Furthermore, as illustrated with the β-catenin/Tcf4 protein complex, stabilization of preformed α-helices via (full or partial) protonation might also be of relevance for protein complex formation involving IDPs. Although globular proteins typically involve autonomously structured domains of about 60–300 amino acid lengths, IDPs mediate protein interaction using short sequence motifs. Depending on the context, they are called eukaryotic linear motifs,32 short linear motifs,33 molecular recognition features,34 or preformed structural elements (PSEs),35 although they are not entirely unrelated and share analogous features. Although protein recognition by IDPs often proceeds via folding-on-binding events (e.g., disorder-to-order transitions),36 there is growing evidence that even in the bound state IDPs (can) retain substantial conformational flexibilities, a conceptual view which is sometimes referred to as “fuzziness”.37 The Tcf4/β-catenin protein complex is an example for static disorder (“polymorphic”) as Tcf4 can bind in several distinct conformations to β-catenin.38 Specifically, the Tcf4/β-Catenin binding interface comprises three epitopes centered at Tcf4 residues D16, E29 and D43 (PDB: 1JPW, 1JDH).39,40 The central epitope (E29), α1, shows considerable plasticity and docks to β-Catenin in several different conformations (extended or α-helical). Inspection of the Tcf4/β-catenin structure (PDB: 1JPW, 1JDH)39,40 reveals that in this complex epitope α1 is surrounded by a basic patch comprising the positively charged β-Catenin residues K312, R342, K345, R376, and R386. The finding that protonation of negatively charged side-chains stabilizes α-helical structures provides an additional chemical mechanism for stabilizing recognition elements in IDP protein complexes. Favorable electrostatic interactions between the basic proton acceptor located at the surface of the IDP recognition element and the acidic proton donor of the interaction partner lead to stabilization of the partially PSE. However, it should be noted that to stabilize the basic proton acceptor complete proton transfer is not required, as even the realization of hydrogen bonds (e.g., partial proton transfer) can already lead to stabilization of the system. Based on our results obtained for the Tcf4/β-catenin protein complex we conclude that tuning of protonation efficiency might constitute a relevant chemical mechanism to modulate interaction affinities and associated recognition selectivity.
As a conclusion, the availability of a fast and reliable prediction tool for preformed α-helical elements sensitive to protonation-induced stabilization may be of great relevance for the (sequence-based) identification of folding-on-binding systems in different organisms and the analysis of recognition patterns with potential implications for large-scale predictions of IDP interaction partners.
Materials and Methods
Meta-structure description of proteins under acidic conditions
The pH dependence of IDPs was analyzed applying the recently developed meta-structure approach.13 In this study, we focus on the assessment of protonation events on the meta-structural parameters of IDPs. In brief, protein 3D structural information is converted into a network structure in which a node refers to a particular residue and edges indicate the existence of neighbourhood relationships (provided that the Cα-Cα distances are below a distance cutoff, typically 8 Å). Subsequently, the topological relationship between two residues is quantified by the shortest path length θ connecting these two residues in the network. The topological relationship between two residues (of type A and B) characteristically depends on the amino acid types and their primary sequence distance, lAB. The statistical distribution functions ρ(θ,A,B,lAB) were extracted from a selected subset of PDB structures.13 The statistical distribution functions ρ(θ,A,B,lAB) can subsequently be used to predict topological information solely based on the primary sequence. The input primary sequence is used to predict for each possible amino acid pair (of residue type A and B and separated by lab in the sequence) in the protein and based on ρ(θ,A,B,lAB) an average topology parameter, eij. Typically primary sequence separations lAB > 5 are not further differentiated and considered long-range. In brief, the topological parameter eij is related to the probability of finding a shortest path length θij=1 (direct neighbour contact). Summation of all pairwise contributions eij leads to the compactness value Ci of residue i. Large compactness Ci values are found for residues embedded in the interior of the structure and surrounded by many neighbouring residues whereas small (even negative) values are found for flexible residues located on the surface and exposed to the solvent. The local secondary structure of proteins is assessed by using statistical distribution function ρ(θ,A,B,lAB) with sequence separations lAB ≤ 4. The local secondary structure parameter Si is defined as Si = αPi-βPi, where αPi and βPi are defined as: αPi=Nα*γ2*γ3*γ4, βPi=Nβ*δ3*δ4. γi = ρ(1,i)*(1.0-ρ(2,i)/[ρ(2,i)*(1.0-ρ(1,i)];δi = ϒi−1. ρ(1,i) and ρ(2,i) denote the probabilities of finding shortest path length values of 1 and 2 between residues i and j. Nα and Nβ are empirical constants ensuring comparable compactness and local secondary structure values.13 Typically, residues located in α-helices display positive Si values whereas for residues located in extended regions (β-strands or polyproleine II helices) significantly smaller (negative) values were observed. Of particular relevance is the fact, that this sequence analysis provides quantitative information about the local secondary structure and residue compactness for each residue position.
The statistical distribution functions ρ(θ,A,B,lAB) used for the calculation of meta-structural parameters were extracted from proteins predominantly crystallized under near neutral pH conditions. Under these conditions the carboxylic groups of Asp and Glu side-chains are predominantly deprotonated (negatively charged). The residue types in ρ(θ,A,B,lAB) thus refer to negatively charged Asp and Glu side-chains. Because distribution functions for protonated Asp and Glu side-chains are not directly accessible through crystal structure analysis, we mimic protonation events by replacing Asp and Glu side-chains by Asn and Gln in the primary sequence. The rationale for our approach is as follows: Firstly, this amino acid replacement ensures electroneutrality (on protonation of carboxylate) of the side-chain. Secondly, the Asn/Gln amide groups are chemically comparable to protonated Asp/Glu carboxyl groups. This is due to the fact that both, amide groups in Asn/Gln and carboxyl groups of protonated (neutral) Asp and Glu, are planar and can act as hydrogen bond donors. The respective meta-structure parameters are calculated for both wild-type (negatively charged Asp/Glu side chains) and “mutated” protein (neutral Asn/Gln) forms. Subsequent differences in meta-structure derived local structure and compactness values are correlated with pH-induced alterations of local backbone geometry and compaction of the polypeptide chain.
NMR spectroscopy and signal assignment
All spectra were acquired at 298 K on an Agilent Direct Drive 700 MHz (BASP1) or Varian Unity 500 and 600 MHz (Tcf4, 1 mM, pH 4.5 and 2) spectrometer using standard 5 mm 1H-13C-15N triple-resonance probe heads. NMR data for Tcf4 were obtained on spectrometers at 298 K. Signal assignment was achieved as described elsewhere.41 PFG NMR diffusion measurements were performed and analyzed as described elsewhere.42
EPR spectroscopy
Double electron-electron (DEER)27,43,44 was applied to glassy solids obtained by freeze-quenching aqueous solutions of BASP1 double mutants (MTSL labeling at Cys-mutated residues 110 and 229 of the chicken analogue of human BASP1) after addition of 30 v/v % glycerol. More details of the experimental procedure are given in the Supporting Information.
Acknowledgments
M.A.H. acknowledges the support of Professor Dr. Hassan Eisa, Dr. Alaa El-Din Barghash, and Dr. Laila A. Abouzeid; Department of Pharmaceutical Organic Chemistry, Faculty of Pharmacy, Mansoura University, Egypt.
Supplementary material
Additional Supporting Information may be found in the online version of this article.
References
- 1.Tompa P. Intrinsically unstructured proteins. Trends Biochem Sci. 2002;27:527–533. doi: 10.1016/s0968-0004(02)02169-2. [DOI] [PubMed] [Google Scholar]
- 2.Fink AL. Natively unfolded proteins. Curr Opin Struct Biol. 2005;15:35–41. doi: 10.1016/j.sbi.2005.01.002. [DOI] [PubMed] [Google Scholar]
- 3.Uversky VN, Oldfield CJ, Midic U, Xie H, Xue B, Vucetic S, Iakoucheva LM, Obradovic Z, Dunker AK. Unfoldomics of human diseases: linking protein intrinsic disorder with diseases. BMC Genomics. 2009;10(Suppl 1):S7. doi: 10.1186/1471-2164-10-S1-S7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Uversky VN. Intrinsically disordered proteins and their environment: effects of strong denaturants, temperature, pH, counter ions, membranes, binding partners, osmolytes, and macromolecular crowding. Protein J. 2009;28:305–325. doi: 10.1007/s10930-009-9201-4. [DOI] [PubMed] [Google Scholar]
- 5.Uversky VN, Dunker AK. Understanding protein non-folding. Biochim Biophys Acta. 2010;1804:1231–1264. doi: 10.1016/j.bbapap.2010.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Uversky VN. Unusual biophysics of intrinsically disordered proteins. Biochim Biophys Acta. 2013;1834:932–951. doi: 10.1016/j.bbapap.2012.12.008. [DOI] [PubMed] [Google Scholar]
- 7.Marsh JA, Forman-Kay JD. Sequence determinants of compaction in intrinsically disordered proteins. Biophys J. 2010;98:2383–2390. doi: 10.1016/j.bpj.2010.02.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Cho MK, Nodet G, Kim HY, Jensen MR, Bernado P, Fernandez CO, Becker S, Blackledge M, Zweckstetter M. Structural characterization of alpha-synuclein in an aggregation prone state. Protein Sci. 2009;18:1840–1846. doi: 10.1002/pro.194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Uversky VN, Gillespie JR, Millett IS, Khodyakova AV, Vasiliev AM, Chernovskaya TV, Vasilenko RN, Kozovskaya GD, Dolgikh DA, Fink AL, Doniach S, Abramov VM. Natively unfolded human prothymosin alpha adopts partially folded collapsed conformation at acidic pH. Biochemistry. 1999;38:15009–15016. doi: 10.1021/bi990752+. [DOI] [PubMed] [Google Scholar]
- 10.Paroutis P, Touret N, Grinstein S. The pH of the secretory pathway: measurement, determinants, and regulation. Physiology. 2004;19:207–215. doi: 10.1152/physiol.00005.2004. [DOI] [PubMed] [Google Scholar]
- 11.Warburg O. Origin of cancer cells. Science. 1956;123:309–314. doi: 10.1126/science.123.3191.309. [DOI] [PubMed] [Google Scholar]
- 12.Schultz ML, Tecedor L, Chang M, Davidson BL. Clarifying lysosomal storage diseases. Trends Neurosci. 2011;34:401–410. doi: 10.1016/j.tins.2011.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Konrat R. The protein meta-structure: a novel concept for chemical and molecular biology. Cell Mol Life Sci. 2009;66:3625–3639. doi: 10.1007/s00018-009-0117-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Henen MA, Coudevylle N, Geist L, Konrat R. Toward rational fragment-based lead design without 3D structures. J Med Chem. 2012;55:7909–7919. doi: 10.1021/jm301016m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Mayer C, Slater L, Erat MC, Konrat R, Vakonakis I. Structural analysis of the Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1) intracellular domain reveals a conserved interaction epitope. J Biol Chem. 2012;287:7182–7189. doi: 10.1074/jbc.M111.330779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Schedlbauer A, Gandini R, Kontaxis G, Paulmichl M, Furst J, Konrat R. The C-terminus of ICln is natively disordered but displays local structural preformation. Cell Physiol Biochem. 2011;28:1203–1210. doi: 10.1159/000335852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Hasenohrl D, Konrat R, Blasi U. Identification of an RNase J ortholog in Sulfolobus solfataricus: implications for 5′-to-3′ directional decay and 5′-end protection of mRNA in Crenarchaeota. RNA. 2011;17:99–107. doi: 10.1261/rna.2418211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hartl M, Nist A, Khan MI, Valovka T, Bister K. Inhibition of Myc-induced cell transformation by brain acid-soluble protein 1 (BASP1) Proc Natl Acad Sci U S A. 2009;106:5604–5609. doi: 10.1073/pnas.0812101106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Clevers H, Nusse R. Wnt/beta-catenin signaling and disease. Cell. 2012;149:1192–1205. doi: 10.1016/j.cell.2012.05.012. [DOI] [PubMed] [Google Scholar]
- 20.Kontaxis G, Delaglio F, Bax A. Molecular fragment replacement approach to protein structure determination by chemical shift and dipolar homology database mining. Methods Enzymol. 2005;394:42–78. doi: 10.1016/S0076-6879(05)94003-2. [DOI] [PubMed] [Google Scholar]
- 21.Dickerson RE, Geis I. The structure and action of proteins. New York: Harper & Row; 1969. [Google Scholar]
- 22.Uversky VN, Li J, Fink AL. Evidence for a partially folded intermediate in alpha-synuclein fibril formation. J Biol Chem. 2001;276:10737–10744. doi: 10.1074/jbc.M010907200. [DOI] [PubMed] [Google Scholar]
- 23.Korshunova I, Caroni P, Kolkova K, Berezin V, Bock E, Walmod PS. Characterization of BASP1-mediated neurite outgrowth. J Neurosci Res. 2008;86:2201–2213. doi: 10.1002/jnr.21678. [DOI] [PubMed] [Google Scholar]
- 24.Carpenter B, Hill KJ, Charalambous M, Wagner KJ, Lahiri D, James DI, Andersen JS, Schumacher V, Royer-Pokora B, Mann M, Ward A, Roberts SG. BASP1 is a transcriptional cosuppressor for the Wilms' tumor suppressor protein WT1. Mol Cell Biol. 2004;24:537–549. doi: 10.1128/MCB.24.2.537-549.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Green LM, Wagner KJ, Campbell HA, Addison K, Roberts SG. Dynamic interaction between WT1 and BASP1 in transcriptional regulation during differentiation. Nucleic Acids Res. 2009;37:431–440. doi: 10.1093/nar/gkn955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Schanda P, Forge V, Brutscher B. HET-SOFAST NMR for fast detection of structural compactness and heterogeneity along polypeptide chains. Magn Reson Chem. 2006;44 doi: 10.1002/mrc.1825. Spec No:S177–S 184. [DOI] [PubMed] [Google Scholar]
- 27.Jeschke G, Chechik V, Ionita P, Godt A, Zimmermann H, Banham J, Timmel CR, Hilger D, Jung H. Deeranalysis2006—a comprehensive software package for analyzing pulsed ELDOR data. Appl Magn Reson. 2006;30:473–498. [Google Scholar]
- 28.Kobayashi M, Honma T, Matsuda Y, Suzuki Y, Narisawa R, Ajioka Y, Asakura H. Nuclear translocation of beta-catenin in colorectal cancer. Br J Cancer. 2000;82:1689–1693. doi: 10.1054/bjoc.1999.1112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Knapp S, Zamai M, Volpi D, Nardese V, Avanzi N, Breton J, Plyte S, Flocco M, Marconi M, Isacchi A, Caiolfa VR. Thermodynamics of the high-affinity interaction of TCF4 with beta-catenin. J Mol Biol. 2001;306:1179–1189. doi: 10.1006/jmbi.2001.4463. [DOI] [PubMed] [Google Scholar]
- 30.Moreau RJ, Schubert CR, Nasr KA, Torok M, Miller JS, Kennedy RJ, Kemp DS. Context-independent, temperature-dependent helical propensities for amino acid residues. J Am Chem Soc. 2009;131:13107–13116. doi: 10.1021/ja904271k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Naranjo Y, Pons M, Konrat R. Meta-structure correlation in protein space unveils different selection rules for folded and intrinsically disordered proteins. Mol Biosyst. 2012;8:411–416. doi: 10.1039/c1mb05367a. [DOI] [PubMed] [Google Scholar]
- 32.Neduva V, Russell RB. Linear motifs: evolutionary interaction switches. FEBS Lett. 2005;579:3342–3345. doi: 10.1016/j.febslet.2005.04.005. [DOI] [PubMed] [Google Scholar]
- 33.Davey NE, Shields DC, Edwards RJ. SLiMDisc: short, linear motif discovery, correcting for common evolutionary descent. Nucleic Acids Res. 2006;34:3546–3554. doi: 10.1093/nar/gkl486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Vacic V, Oldfield CJ, Mohan A, Radivojac P, Cortese MS, Uversky VN, Dunker AK. Characterization of molecular recognition features, MoRFs, and their binding partners. J Proteome Res. 2007;6:2351–2366. doi: 10.1021/pr0701411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Fuxreiter M, Simon I, Friedrich P, Tompa P. Preformed structural elements feature in partner recognition by intrinsically unstructured proteins. J Mol Biol. 2004;338:1015–1026. doi: 10.1016/j.jmb.2004.03.017. [DOI] [PubMed] [Google Scholar]
- 36.Wright PE, Dyson HJ. Intrinsically unstructured proteins: re-assessing the protein structure–function paradigm. J Mol Biol. 1999;293:321–331. doi: 10.1006/jmbi.1999.3110. [DOI] [PubMed] [Google Scholar]
- 37.Tompa P, Fuxreiter M. Fuzzy complexes: polymorphism and structural disorder in protein-protein interactions. Trends Biochem Sci. 2008;33:2–8. doi: 10.1016/j.tibs.2007.10.003. [DOI] [PubMed] [Google Scholar]
- 38.Hazy E, Tompa P. Limitations of induced folding in molecular recognition by intrinsically disordered proteins. ChemPhysChem. 2009;10:1415–1419. doi: 10.1002/cphc.200900205. [DOI] [PubMed] [Google Scholar]
- 39.Poy F, Lepourcelet M, Shivdasani RA, Eck MJ. Structure of a human Tcf4-beta-catenin complex. Nat Struct Biol. 2001;8:1053–1057. doi: 10.1038/nsb720. [DOI] [PubMed] [Google Scholar]
- 40.Graham TA, Ferkey DM, Mao F, Kimelman D, Xu W. Tcf4 can specifically recognize beta-catenin using alternative conformations. Nat Struct Biol. 2001;8:1048–1052. doi: 10.1038/nsb718. [DOI] [PubMed] [Google Scholar]
- 41.Geist L, Zawadzka-Kazimierczuk A, Saxena S, Zerko S, Kozminski W, Konrat R. (1)H, (13)C and (15)N resonance assignments of human BASP1. Biomol NMR Assign; 2012 doi: 10.1007/s12104-012-9436-4. doi: 10.1007/s12104-012-9436-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Platzer G, Schedlbauer A, Chemelli A, Ozdowy P, Coudevylle N, Auer R, Kontaxis G, Hartl M, Miles AJ, Wallace BA, Glatter O, Bister K, Konrat R. The metastasis-associated extracellular matrix protein osteopontin forms transient structure in ligand interaction sites. Biochemistry. 2011;50:6113–6124. doi: 10.1021/bi200291e. [DOI] [PubMed] [Google Scholar]
- 43.Jeschke G, Koch A, Jonas U, Godt A. Direct conversion of EPR dipolar time evolution data to distance distributions. J Magn Reson. 2002;155:72–82. doi: 10.1006/jmre.2001.2498. [DOI] [PubMed] [Google Scholar]
- 44.Martin RE, Pannier M, Diederich F, Gramlich V, Hubrich M, Spiess HW. Determination of end-to-end distances in a series of TEMPO diradicals of up to 2.8 nm length with a new four-pulse double electron electron resonance experiment. Angew Chem Int Ed. 1998;37:2834–2837. doi: 10.1002/(SICI)1521-3773(19981102)37:20<2833::AID-ANIE2833>3.0.CO;2-7. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
