Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Feb 10.
Published in final edited form as: J Phys Chem B. 2018 Dec 24;123(1):106–116. doi: 10.1021/acs.jpcb.8b10051

The Solution Ensemble of the C-Terminal Domain from the Transcription Factor Pdx1 Resembles an Excluded Volume Polymer

Erik C Cook , Debashish Sahu , Monique Bastidas , Scott A Showalter ‡,§,*
PMCID: PMC7874520  NIHMSID: NIHMS1661406  PMID: 30525611

Abstract

The pancreatic and duodenal homeobox 1 (Pdx1) is an essential pancreatic transcription factor. The C-terminal intrinsically disordered domain of Pdx1 (Pdx1-C) has a heavily biased amino acid composition; most notably, 18 of 83 residues are proline, including a hexa-proline cluster near the middle of the chain. For these reasons, Pdx1-C is an attractive target for structure characterization, given the availability of suitable methods. To determine the solution ensembles of disordered proteins, we have developed a suite of 13C direct-detect NMR experiments that provide high spectral quality, even in the presence of strong proline enrichment. Here, we have extended our suite of NMR experiments to include four new pulse programs designed to record backbone residual dipolar couplings (RDCs) in a 13C,15N-CON detection format. Using our NMR strategy, in combination with small angle x-ray scattering (SAXS) measurements and Monte Carlo simulations, we have determined that Pdx1-C is extended in solution, with a radius of gyration and internal scaling similar to that of an excluded volume polymer, and a subtle tendency toward a collapsed structure to the N-terminal side of the hexa-proline sequence. This structure leaves Pdx1-C exposed for interaction with trans-regulatory co-factors that contribute with Pdx1 to transcription control in the cell.

Graphical Abstract

graphic file with name nihms-1661406-f0001.jpg

INTRODUCTION

Intrinsically disordered proteins (IDPs) have fascinated both the biological and physical chemistry communities due to their ability to mediate interactions that are critical for signal transduction in the cell and because of the heterogeneous conformational ensembles that ideally suit them to this function. IDPs do show some of the same structural characteristics as their folded counterparts, such as the propensity for secondary structure,1 globular regions,2 and long-range contacts;3 their defining feature is not necessarily the absence of such structures, but rather their transient nature in the overall solution ensemble. That said, not all IDPs display significant secondary structure elements or compaction. For example, polyelectrolytic regions and high total charge polyampholytes tend to exist as extended self-avoiding random flight chains.2, 4 Substantial effort has been invested in classifying the ensemble characteristics of IDPs,5 with common models from the polymer physics community having emerged as especially helpful tools for this effort. For example, detailed polymer description of size and shape in the ensemble of the disordered protein NUS helped to reconcile apparent discrepancies between SAXS and FRET data that have concerned the literature for several years.6

In addition to the powerful data available from SAXS and FRET, advances in NMR spectroscopy now make it possible to collect extensive, site resolved data that describe the solution behavior of IDPs.7 Having recognized this, the community has begun to call for development of advanced NMR techniques that provide constraints on statistical IDP ensembles.8 In our laboratory, we seek to fulfill this need through the development of 13C direct-detect NMR techniques that dramatically improve the completeness of NMR data sets for disordered systems.9 These efforts have allowed us to transform our early and qualitative identification of α-helical structure in the C-terminus of the phosphatase FCP110 into a quantitative model for its solution ensemble.11 In addition, 13C direct-detect techniques have allowed us to quantitatively demonstrate sequence-driven enrichment in the cis-proline content of phosphorylated RNA polymerase II C-terminal domain and connect this conformational switch to readout by the downstream enzyme Ssu72.12

Motivated by our work with RNA polymerase II, we have turned our attention broadly to quantitative description of proline conformational states and the impact proline-enrichment has on the conformational ensembles of IDPs. Proline has long confounded NMR spectroscopists, owing to its lack of an amide hydrogen when found in polypeptide chains, and traditional reliance on the amide 1H-nucleus for NMR detection. Detection on the 13C-carbonyl nucleus removes this barrier and makes comprehensive NMR analysis of proline-rich IDPs practical. Proline-rich regions are both common in biology, where they often mediate protein-protein interactions, and of interest from a physical-chemical perspective. As proline residues lack an amide hydrogen, they possess no hydrogen bond donor with which to form stable secondary structures. Coupled to this, their reduced backbone torsional flexibility suits proline residues to serve as “extended spacers” that force disorder locally and enhance solvent exposure.13 Thus, proline paradoxically promotes disorder while simultaneously providing local stiffness that reduces the conformational entropy of the polypeptide chain – an important yet underappreciated concept that sculpts the ensembles of IDPs.

We have previously utilized the C-terminal disordered region of the transcription factor Pdx1, which we refer to as Pdx1-C throughout, to develop an efficient strategy for chemical shift assignment of IDPs.14 Recombinant human Pdx1-C is an 83 residue segment, containing 18 proline residues (22% of the sequence), with the most extreme cluster being a continuous 6-proline sequence located near the center of the construct. Pdx1 plays a central role in maintenance of the pancreatic β-cell phenotype,15 and is recognized for its significance in both diabetes16 and pancreatic cancer.17 Of further relevance for the current work, Pdx1-C is known to harbor a site of interaction with the E3-ubiquitin ligase substrate adaptor protein SPOP, which binds under low glucose conditions and promotes the proteolytic degradation of Pdx1.18 Thus, a better understanding of Pdx1-C structure and its relationship to function would be clinically significant.

Here we describe the solution ensemble of Pdx1-C, which we find to be highly extended and devoid of secondary structure outside the polyproline-II helix formed by the hexa-proline cluster. Our work relies on a combination of SAXS data with measurements of backbone 15N spin relaxation, residual dipolar couplings (RDCs), and NMR paramagnetic relaxation enhancement (PRE). Significantly, we find that the extent of non-local contacts reported through 13C direct-detect PRE depends on whether the excitation nucleus is also 13C (“protonless” 13C,15N-CON detection19), or 1H (13C,15N-(HACA)-CON detection20). The experimental PRE profiles documented here are consistent with simulated PRE profiles derived from Monte Carlo simulations using the Absinth force field,21 as implemented in the CAMPARI software package,22 and indicate a high similarity to a self-avoiding polymer chain. These findings illustrate how Pdx1-C is able to maintain an accessible conformation in solution, required for interaction with SPOP and potentially other co-regulators of gene expression in the β-cell.

MATERIALS AND METHODS

Construct generation.

The DNA insert coding for the C-terminal residues of Pdx1 (Pdx1-C; residues 205–283 of the human sequence) were synthesized by Geneart (Invitrogen) and subcloned into pET-49b. Expression of recombinant Pdx1-C from the resulting plasmid yields an N-terminal glutathione-S-transferase (GST) and a 6x histidine tag, separated from Pdx1-C by a 3C Protease cleavage site. The mutant Pdx1-C C227S, S273C, required for PRE studies, was generated using site-directed mutagenesis, per manufacturer protocols for the Q5 site-directed mutagenesis kit (New England Biolabs). Plasmids encoding wild-type or mutant Pdx1-C were transformed into BL-21 (DE3) Escherichia coli for protein expression.

Protein expression and purification.

To make protein with naturally abundant isotopes, as needed for SAXS measurements, BL-21 (DE3) E. coli were grown in Luria Broth (LB). For all NMR experiments, the E. coli were grown in M9 minimal media supplemented with 15N-ammonium chloride and U-13C glucose. In either case, E. coli were incubated at 37 °C under agitation until an OD600 of 0.6–0.8 was reached. Pdx1-C expression was induced by adding IPTG to a final concentration of 0.5 mM, after which cells were incubated for 4 hours at 37 °C under agitation. Cells were harvested by centrifugation and lysed by sonication. Cell debris was cleared through centrifugation and cell lysate applied to a Ni-NTA column pre-equilibrated with 50 mM Tris (ph 7.5), 500 mM sodium chloride, and 20 mM imidazole. The Ni-NTA column was then washed with 10 column volumes of 50 mM Tris (pH 7.5), 500 mM sodium chloride, 20 mM imidazole, and 0.1 % Triton X-100. Proteins were eluted from the Ni-NTA column with 50 mM Tris (ph 7.5), 500 mM sodium chloride, and 200 mM imidazole. The affinity tags on Pdx1-C were cleaved with 3C-Protease and buffer exchanged into 50 mM Tris (pH 7.5), 500 mM sodium chloride. Subsequently, Pdx1-C was separated from the cleaved affinity tags by passing the protein solution over a Ni-NTA column pre-equilibrated with the same buffer. An additional size exclusion chromatography purification was achieved through passage over a Sephacryl-200 column, which efficiently removed high molecular weight contaminants and any cleaved affinity tags that may have co-eluted from the Ni-NTA column with Pdx1-C.

Small angle x-ray scattering (SAXS).

Pdx1-C samples for SAXS were buffer exchanged into 50 mM HEPES (pH 7.0), 100 mM NaCl, and 5% glycerol (to protect against radiation damage). Samples were concentrated to > 5 mg mL−1 as determined using a Direct Detect FTIR spectrometer (EMD Millipore). SAXS data were collected at the Cornell High Energy Synchrotron Source (CHESS) on the G1 beamline. Incident radiation was estimated to be 9.9 keV, producing a flux of 8 × 1011 photons s−1, providing a q-space range of 0.01– 0.7 Å−1. Scattering from a silver behenate standard was used for q-axis mapping. Data collection was performed using dual Pilatus 100K-S detectors. Reduction of the 2D images to 1D scattering profiles was performed using BioXTAS Raw.23 For each experimental run, 50 μL of Pdx1-C solution was injected onto a Superdex 200 Increase (5×150 mm) column at a flow rate of 0.2 mL min−1 for in-line SEC-SAXS data acquisition. Final reported SAXS data sets represent the average of three SEC-SAXS runs, each of which produced a total of >100 exposures stored as 20 s frames. Solvent subtraction was performed using equivalent numbers of frames of the Pdx1-C elution peak and surrounding baseline data. No signs of aggregation, inter-particle effects, or radiation damage were observed. Guinier fitting was performed using the method of non-linear least squares in BioXTAS RAW.23 Data fitting of the Guinier region was restricted to the data points satisfying qRg < 0.8 as has been recommended for highly flexible proteins.24

General NMR spectroscopy.

Chemical shift assignments of Pdx1-C have been published previously and are deposited in the BMRB (access code 19596).14 Additional chemical shift assignments of MTSL-tagged Pdx1-C, used for the PRE experiments, were determined using (HACA)N(CA)CON, HNCO, and HN(CA)CO triple resonance experiments (see representative 2D spectra in Figure S1). All NMR data were collected on a Bruker Avance III 500 MHz spectrometer, equipped with a TCI Cryoprobe. NMR spectra were processed using Bruker TopSpin (Billerica, MA), followed by analysis in NMRFAM SPARKY.25

NMR spin relaxation.

15N R1 and R2 spin relaxation parameters were determined using 1HN-start CON experiments previously developed in our laboratory.26 The R1 and R2 spin relaxation experiments were collected as pseudo-3D spectra with a data matrix of 1024 × 512 points for each of the 8 interleaved acquisitions. Spectra were collected with 64-scans of signal averaging per FID. The R1 data were collected as a randomized series of 13C,15N-CONs with relaxation delays set to 50, 150, 250, 350, 450, 550, 650, and 750 ms. For the R2 data set, the relaxation delays were 16, 48, 80, 112, 144, 176, 208, 240, 272, and 304 ms. The sweep widths for the relaxation experiments were set to 19 ppm for the direct 13C-dimension, centered on 173 ppm, and 28 ppm for the indirect 15N-dimension, centered on 120 ppm.

Residual dipolar coupling (RDC) measurements.

NMR samples were prepared by dissolving Pdx1-C at a concentration of 1.1 mM in dilute buffer (50 mM sodium cacodylate, pH 6.5, 50 mM potassium chloride, 5 mM TCEP, 0.01% sodium azide, and 10% (v/v) D2O) prior to transfer into a Shigemi tube. The aligned sample was generated by swelling a neutral 7% polyacrylamide gel in the tube, with the plunger height set to achieve a compression ratio of 1.6:1.27 All spectra were collected at 298K. Spectra were acquired as in phase/antiphase pairs for improved spectral presentation of the data, using in-house pulse sequences for measurement of 1JHN (Figure S2), 1JCαHα (Figure S3), 1J (Figure S4), and 1JCαĆ (Figure S5); see Supporting Information for further description of how these experiments can be implemented. For spectral clarity, 1JCαHα and 1JCαĆ were rescaled by a factor of 0.5 using the accordion principle.28 All spectra for RDC measurements were collected with 64-scans of signal averaging and a 1024 × 512 data matrix in both the in-phase and antiphase spectra. The sweep width for the direct 13C-dimension was set to 14 ppm, centered at 172 ppm, while the indirect 15N-dimension had a 24.2 ppm sweep width centered at 118 ppm.

All pulse programs developed for RDC measurements, as well as those we have developed and reported previously, are available in Bruker Topspin-compatible format for open-access download through the Penn State ScholarSphere (https://scholarsphere.psu.edu/collections/5712mv15x), or by searching for the Corresponding Author’s surname on the ScholarSphere homepage. All 13C direct-detect pulse programs utilized in this study, but previously published, are also available in the same collection.

Paramagnetic relaxation enhancement (PRE) measurements.

Following purification, Pdx1-C was incubated in 50 mM Tris (pH 7.5), 150 mM NaCl, and 5 mM DTT for a minimum of 1 hour at 4 °C in the dark. Pdx1-C samples were then buffer exchanged into 50 mM Tris (pH 7.5) and 150 mM NaCl using PD-10 buffer exchange resin (GE Healthcare Life Sciences of Pittsburgh, PA). A 5x molar excess of MTSL, relative to the Pdx1-C molarity, was then added and the sample incubated overnight at 4 °C. Unconjugated MTSL was separated from conjugated Pdx1-C using PD-10 resin. MTSL-labeled proteins were then buffer exchanged into 50 mM sodium phosphate (pH 6.5), 50 mM potassium chloride, 0.01% sodium azide, and 10% (v/v) D2O for spectroscopy. 13C,15N-CON and 13C,15N-(HACA)CON spectra were acquired on this paramagnetic sample as described below. Following data acquisition, the MTSL radical was quenched using a 10x molar excess of sodium ascorbate. Identical NMR spectra were acquired under diamagnetic conditions. All spectra for the PRE measurements were collected with 32-scans of signal averaging and a 1024 × 512 data matrix. The sweep width for the direct 13C-dimension was set to 18 ppm, centered at 173 ppm, while the indirect 15N-dimension had a 40 ppm sweep width centered at 124 ppm.

CAMPARI simulations.

Monte Carlo simulations were performed using the CAMPARI software package.22 All simulations included Pdx1-C in a random starting conformation, represented using the OPLSS-AA/L force field, and embedded in a spherical droplet with a 250 Å radius. Sodium chloride pairs were modeled explicitly to give a final concentration of ~10 mM, with excess sodium to preserve neutrality. The ABSINTH implicit solvation model was used for all trajectories.21 Replica exchange trajectories were run with twenty replicas at temperatures evenly spanning the range from 230 – 420 K for 5.1 × 107 steps, with the first 1 × 106 steps discarded as equilibration. For the unbiased simulation, the move set was composed as follows: rigid body moves, 10 % (50 %, 2 Å, 20°); side chain moves, 20% (40 %, 30°); omega moves, 10 % (10 %, 4°); pucker moves, 10 %; backbone moves, 56.7% (30 %, 10°). Percentages in parentheses correspond to the fraction of fully randomizing moves, while distances and angles represent the maximum displacement and rotation, respectively. Two additional control simulations were run in which the proline ω-bonds were constrained to be either all-cis or all-trans, with up to a 5° rotation allowed per move. For the ω-constrained simulation, the move set was composed as follows: rigid body moves, 10 % (50 %, 2 Å, 20°); side chain moves, 20% (40 %, 30°); omega moves, 10 % (0 % randomizing moves); pucker moves, 10.0 %; backbone moves, 56.7% (30 %, 10°).

Simulation analysis.

All molecular graphics images were generated using UCSF Chimera.29 All other analysis was performed using custom Python scripts, with graphs generated in the Matplotlib (https://matplotlib.org) prior to export into Adobe Illustrator. Calculation of Rg from the simulated ensembles was performed using CRYSOL, which is a program used to evaluate solution scattering from known structures by fitting to experimental SAXS data30 as previously described.11

Simulated PRE data were generated using the Battiste-Wagner equation to estimate reduction in peak intensity through previously described methods,3, 31 which will be summarized here as this approach remains uncommon. Here, we have collected PRE using two pulse programs that differ in the gyromagnetic ratio of the excitation nucleus and so we have generated theoretical PRE curves separately for the 13C-start and 1H-start CON and (HACA)CON, respectively. Simulated curves were created to model MTSL attachment at either residue 227 or 273, producing a total of four simulated curves. All calculations were performed using in-house Python (v3.6, NumPv v1.15) scripts. The predicted ratio of 15N,13C cross-peak intensities under paramagnetic (oxidized; Iox) and diamagnetic (reduced; Ired) conditions is determined as:

IoxIred=R2exp(R2spt)R2+R2sp Equation 1

where R2sp is the paramagnetic relaxation enhancement due to the spin probe in the paramagnetic form, and R2 is the intrinsic transverse relaxation rate of the protein in the absence of the spin probe, and τ is the total INEPT transfer time in the preparation element of the pulse sequence. To estimate R2, we used the average full-width at half-height for all analyzed peaks in the spectrum. For each site, the distance separating the excitation 1Hα or 13Ć and the unpaired electron is calculated as:

r=[KR2sp(4τc+3τc1+ω1Hor13C2τc2)]16 Equation 2

where ω is the Larmor frequency of the excitation nuclear spin and τc is the rotational correlation time, estimated from the molecular weight of Pdx1-C using previously described methods that account for the internal flexibility of intrinsically disordered proteins.32 The parameter K is composed of fundamental constants:

K=115S(S+1)γ2g2β2 Equation 3

where γ is the gyromagnetic ratio of the nuclear spin, g the electronic g factor, and β the Bohr magneton. The parameters and constants used in our calculation were set to R2,H = 13 s−1, R2,C = 8.1 s−1, τ = 9.0 × 10−3 s, τc = 1.2 × 10−9 s, ω1H = 2π ∙ 500 × 106 s−1, ω1H = 2π ∙ 126 × 106 s−1, K1H = 1.23 × 10−32 cm6 s−2, and K13C = 7.778 × 10−34 cm6 s−2.

The sulfur atom in the side chain of C227 or C273 was used as an estimate for the location of the unpaired electron in MTSL. Distances between the sulfur atom and each Ć or Hα were used to estimate the reduction in the peak intensity using previously described methods.3, 31 The scaling profiles were calculated as the average distance between two residues in 3-dimensional space over all simulation frames, <Rij>, as a function of the sequence separation in the primary structure, |i-j|, using Equation 1:

Rij=1Zijminj|rmirnj Equation 4

where i and j are residue indexes; m and n are atoms in residues i and j, respectively; Zij is equal to the number of unique pairwise distances; and r indicates the position vector for an atom in a particular residue.

RESULTS AND DISCUSSION

SAXS of Pdx1-C.

Intrinsically disordered regions by definition adopt multiple conformations in space and/or time, necessitating characterization of both their average ensemble characteristics and the details of individual high-probability structural features. To begin this study of Pdx1-C, we performed SAXS measurements with the aim of determining what features are present in its solution ensemble at the nm length scale. In order to control for aggregation of other inter-particle interactions in the beamline, we made measurements at three starting concentrations (3.8, 7.6, and 9.8 mg/mL) prior to loading on a Superdex 200 Increase (5×150 mm) column from which the eluate solution flowed directly into the x-ray beamline for SAXS analysis. Data were averaged over the set of frames spanning the SEC peak determined to have approximately the same radius of gyration (Rg). For each run, buffer subtraction was achieved by utilizing a matched number of frames from the SEC chromatogram baseline. The raw scattering data are consistent with preservation of the same assembly state in solution across the full concentration range (Figure 1A) and the low-q region in particular suggests Pdx1-C is not aggregating under our conditions. Analysis of the data in the Guinier region of the scattering data (Figure 1B) is consistent with a solution Rg of 27.4 ± 0.8 Å for the 3.8 mg/mL data set, which will be the source of all discussed statistics and fitting parameters going forward (see Table S1 for a full summary). The fitted Rg value is not consistent with the expectation for a cooperatively folded 83-residue monomeric protein, but is reasonable for a monomeric yet disordered chain (and with the molecular weight determined by in-line MALS following exit from the SEC column; data not shown). Similarly, Kratky analysis suggests that Pdx1-C lacks a closed volume in solution, as evidenced by the divergence of the dimensionless Kratky data at high-q (Figure 1C). In summary, the SAXS profile of Pdx1-C is consistent with the presence of an ensemble of unfolded, but monomeric, conformers in solution.

Figure 1.

Figure 1.

SAXS characterization of Pdx1-C reveals an extended and disordered structure in solution. (A) Raw scattering data for Pdx1-C at concentrations ranging from 3.8 – 9.8 mg/mL. (B) Raw data in the Guinier region (red circles) and Guinier fits (black lines) for Pdx1-C. Data were fit to the Guinier approximation in Python using the method of non-linear least squares. (C) Kratky plots generated from the data in panel (A) show a linear increase as a function of q in the high-q region, which is suggestive of a disordered structure in solution. (D) Pairwise distance distribution plot, which demonstrates few features and suggests an extended, disordered structure for Pdx1-C in solution.

Fourier transformation of SAXS data reveal additional details of the conformational ensemble adopted by proteins in solution. Pairwise distance distribution analysis of Pdx1-C reveals a smooth distribution, best described by Rg = 26.2 Å (in excellent agreement with the Guinier analysis) and a maximum intramolecular distance of 95.7 Å. The pairwise distance distribution of Pdx1-C is otherwise surprisingly smooth and devoid of identifiable features (Figure 1D). For comparison, the experimental Rg compares well to the prediction for an excluded volume random chain (Rg,EV = 27 Å) and is only slightly extended compared with a re-calibration of the Flory scaling law intended to model intrinsically disordered protein behavior (Rg,IDP = 24.6 Å).33 These findings are unsurprising, given the high proline enrichment of Pdx1-C and, nearly complete lack of aromatic or large-chain aliphatic residues, modest fraction of charged residues (FCR = 0.20), and very low net charge per-residue (NCPR = −0.012); all of which are expected to favor a disordered conformation in solution.

Absence of secondary structure indicated by chemical shift and RDC measurements.

The absence of stable tertiary structure and the adoption of a relatively extended average conformation are not necessarily predictive of the secondary structure present at individual motifs within a polypeptide chain. Our previous work has produced complete Pdx1-C chemical shift assignments that are consistent with the absence of significant α-helix or β-strand conformation at any position in Pdx1-C (reproduced in Figure 2A).14 While our chemical shift data suggest the absence of secondary structure, they do not rule out the possibility of other persisting and/or highly populated local structural features at variance with respect to random-coil structure, which might otherwise be well-described as enriched tertiary contacts.

Figure 2.

Figure 2.

NMR characterization of Pdx1-C reveals an absence of detectable secondary structure. (A) Secondary chemical shift analysis from data deposited in the BMRB (accession number 19596). Residual dipolar couplings are displayed as a function of residue in the Pdx1-C sequence for (B) 1DNH, (C) 1DCαHα, (D) 1D, and (E) 1DCαĆ.

As an orthogonal measure of local order, we collected residual dipolar coupling (RDC) data for Pdx1-C in the presence of compressed polyacrylamide gels. RDCs are frequently relied on in contemporary investigations of IDP structure because they allow comparison of the relative orientations sampled by individual bond vectors in the polypeptide chain. An additional strength of RDC measurements in the context of protein disorder is their sensitivity to transient long-range interactions, such as those observed in the solution ensemble of α-synuclein.3 Consistent with our general strategy of reliance on 13C direct-detect spectroscopy, we have taken inspiration from the prior literature on scalar coupling measurement20, 3436 and developed variants of the 13C,15N-CON for this study. In this way, we have generated pulse programs that are fully described in the Supporting Information, which enabled measurement of 1JNH, 1JCαHα, 1J, and 1JCαĆ (see Figures S6S9 for representative spectra). For a random flight chain, the 1DNH RDC profile takes on a bell-shape, with the depth in the center determined by the overall length of the polypeptide.37 The 1DNH RDCs for Pdx1-C shown in Figure 2B are consistent with an inverted bell-shaped profile. Similarly, the 1DCαHα, 1D, and 1DCαĆ feature a bell-like character that is also consistent with a coil configuration for Pdx1-C in solution. Taken together, our SAXS, chemical shift, and RDC data suggest that Pdx1-C adopts an extended and highly random ensemble in solution, with little to no evidence for globularity.

Backbone NMR spin relaxation of Pdx1-C.

Conformational rearrangements of proteins often occur on the ns-μs timescales;38 unsurprisingly, the backbone dynamics of IDPs on these timescales are particularly extensive and have been heavily studied by NMR.3940 Therefore, in order to probe the backbone dynamics of Pdx1-C, we utilized our previously published 13C direct-detect, 15N spin relaxation experiments.26 In the presence of uniform 13C-enrichment, passive scalar couplings can introduce sinewave distortions onto the exponential decays used to measure 15N spin-relaxation. As previously described,26 our pulse sequences were designed to minimize the influence of passive coupling; the overall quality of the recorded decays is evident in representative decay curves that span the observed range of systematic distortions (Figure S10). T1 and T2 relaxation times are sensitive to both global reorientation and local fluctuations of the chain, as has been extensively modeled in studies of cooperatively folded proteins. For IDPs, where separation of global and local structure can lose its meaning, a variety of analysis techniques have been adopted.4144 In the case of Pdx1-C, the T1 and T2 profiles are largely featureless (Figure 3), yet the depressed transverse relaxation times, compared to a folded protein with an equivalent number of amino acids, are again consistent with overall disorder and a lack of collapsed structure. Interestingly, it does appear that the dynamics in the N-terminal half of Pdx1-C are non-uniform. Residues 210–225 appear to be more flexible than their neighbors, consistent with its inclusion of seven glycine residues. Similar variance in relaxation times in correlation with biased amino acid representation has been observed in previous studies of IDP dynamics.4546 In contrast, a reduction of T1 and T2 relative to baseline can suggest high contact density or the presence of secondary structure.47 While there is a slight depression in the T1 profile near residues 205–210, there is little other evidence for such contacts throughout the chain. In summary, the spin relaxation data suggest local structure near residues 205–225 that is distinct from the remainder of the Pdx1-C chain, but overall these data are consistent with the picture of an extended and highly dynamic backbone in solution.

Figure 3.

Figure 3.

Backbone 15N spin relaxation for Pdx1-C. (A) T1 and (B) T2 relaxation profiles as a function of residue position indicate the highly disordered nature of the Pdx1-C backbone. The resonance intensity of each well-resolved residue in 13C,15N-CON spectra was recorded and the resulting decay curves fitted to a single-exponential function. Error bars represent uncertainty in fitting.

PRE data for Pdx1-C.

Paramagnetic relaxation enhancement (PRE) is a powerful measurement for constraint of IDP ensembles because of its ability to detect minor populations that feature long range contacts or locally collapsed states. For example, sensitivity to minor states enabled detection of pathological but low-population conformations in the solution ensemble of α-synuclein.48 An additional feature that enhances the value of PRE as a structure constraint for expanded systems is that the high gyromagnetic ratio of the electron, compared with those of nuclear spins, leads PRE to produce distance constraints for contacts out to a range of 20 Å. Here, PRE was measured through attachment of the MTSL spin-label to the native cysteine residue C227, or to an engineered C227S/S273C double mutant.

Consistent with our preference for 13C direct-detection displayed throughout this study, PRE measurements were recorded in the 13C,15N-CON format. We note that CON-based detection allows measurement of a PRE intensity ratio for most proline residues, but due to spectral overlap the hexa-proline sequence at residues 239–244 was excluded from analysis. Unique to this phase of the study, spectra were acquired in both the “protonless” format and through the proton-start 13C,15N-(HACA)CON. The transverse relaxation rate contribution from proximity to an unpaired electron scales with both the electron g-factor and the gyromagnetic ratio of the observed nucleus.3 In the case of steady-state PRE, as detected here, this suggests that sensitivity to longer range or shorter lived contacts should be enhanced by selection of a high gyromagnetic ratio excitation nucleus (1H) and reduced through excitation on a low gyromagnetic ratio nucleus (13C). Indeed, this effect is observed in the PRE profiles displayed in Figure 4, where in all data panels the dashed line reports the expectation value of the PRE under the null hypothesis that Pdx1-C behaves as a self-avoiding random flight chain.4950 For both MTSL attachment sites the expected Gaussian-shaped well surrounding the site of attachment, representing PRE intensity loss due to obligatory proximity within the covalent network, is substantially broader in the 1H-start spectra (Figure 4A, C) than in the 13C-start spectra (Figure 4B, D), consistent with the expectation that a higher gyromagnetic ratio excitation nucleus will impart sensitivity to the PRE at greater distances.

Figure 4.

Figure 4.

PRE data for Pdx1-C demonstrate non-uniform local chain collapse and the absence of long-range contacts. (A,B) PRE profiles for MTSL conjugation to C227. (C,D) PRE profiles for MTSL conjugation to C273, in the C227S/S273C double-mutant. The profiles in panels A and C report data collected using the proton-start 13C,15N-(HACA)-CON, whereas panels B and D report data collected using the carbon start 13C,15N-CON. All data bars represent the average intensity ratio over duplicate measurements. The dashed line indicates the predicted profile for a self-avoiding random flight chain as described in the Materials and Methods. The solid line represents the back-calculated PRE profile from CAMPARI simulation trajectories. (E) The net charge per residue (NCPR) is plotted to reveal local clusters of positive (blue) and negative (red) charge along the primary structure of Pdx1-C. NCPR is calculated as the average charge over a 5-residue window and assigned to the central residue of the window.

As stated above, Pdx1-C possesses a native C227 residue that was exploited for our initial PRE measurements. The data for 1H-start (Fig. 4A) and 13C-start (Fig. 4B) PRE from C227-MTSL reveal two intriguing features. Recall that perturbations in the spin-relaxation profile near C227 suggested the possibility of globular contacts in the N-terminal half of Pdx1-C. Consistent with this observation, C227-MTSL showed short and medium range contacts consistent with a globule or compaction bias in the ensemble of Pdx1-C (Figure 4A and B), while the S273C mutant PRE profile (Figure 4C and D) was more similar to a random coil. This trend is hinted through somewhat reduced intensity ratios reaching all the way to the N-terminus, but is more pronounced in the region spanning residues 233–239. Further analysis of this trend required molecular simulations that will be discussed below.

Strikingly, there were no observable long-range contacts between the N-terminal and C-terminal half of Pdx1-C for either construct; all structural perturbations or deviations from random coil behavior are limited to relatively local contacts. This observation was surprising, given that the net charge per-residue (NCPR) indicates a tendency to negative charge near C227 and positive charge near the C-terminus of the protein that provided our initial motivation for selecting C227S/S273C as a second target for PRE studies. On a more local scale, although the N-terminal half of Pdx1-C displays a higher local NCPR, it is noteworthy that this arises mostly from reduced charge mixing compared to the C-terminal half. The degree of segregation in charge has been quantified by the Pappu laboratory in the form of a parameter (κ) that represents charge asymmetry over a sliding window.2 The N-terminal and C-terminal halves of Pdx1-C have a κ of 0.331 and 0.132. Thus, one possible explanation for the more collapsed structure of the N-terminal region in Pdx1-C is that it is driven by electrostatic interaction between the strongly negative patch near C227 and the positively charged patch consisting of residues 207–213.

CAMPARI simulations of Pdx1-C.

The analysis of SAXS and NMR data presented above left us with several molecular hypotheses that can best be tested through computational modelling of the Pdx1-C solution ensemble. Recently, Monte Carlo (MC) simulations performed in the CAMPARI software package, utilizing the ABSINTH implicit solvation model have been used to characterize the effects of phosphorylation on the ensemble conformations of Ash1 and the physical properties that contribute to liquid-liquid phase separation of NCID.5152 Similarly, we have recently succeeded in modelling the C-terminal region of the phosphatase FCP1 using this methodology, yielding results that were consistent with both SAXS and NMR measurements.11 Therefore, we chose to conduct MC simulations in ABSINTH to generate an ensemble of Pdx1-C for analysis in the context of the current data, with particular emphasis on reproduction of the PRE data sets described above.

Both our SAXS and PRE data indicated that Pdx1-C adopts a coil-like ensemble in solution. The radius of gyration was determined from the ensemble of CAMPARI-generated structures and was determined to be 25.0 ± 3.4 Å; while slightly smaller than expected, this average is indistinguishable within experimental uncertainty from the SAXS-derived Rg of 27.4 ± 0.77 Å. The simulated PRE data, derived from our MC calculation, is plotted as a solid black line in Figure 4AD. The MC data show extensive short-range contacts between residue 227 and the proximal N-terminal basic patch, but no long-range interaction with residues C-terminal to the central polyproline stretch (Figure 4A, B). Interestingly, PRE experiments performed in either a carbon-start or proton-start format were in excellent agreement with their respective simulated PRE data. This suggests that adjusting the gyromagnetic ratio term in the Battiste-Wagner equation and solving for the peak intensity change was enough to account for the differences in the width of the relaxation enhancement zone surrounding the site of MTSL attachment in carbon-start and proton-start experiments. Similarly, simulated PRE data of Pdx1-C C227S, S273C were in agreement with experimental PRE data (Figure 4C, D). Taken together, these data preclude any persistent long-range intramolecular interaction between the N- and C-terminal halves of Pdx1-C.

The coil-like character of our ensemble is intriguing, given that Pdx1-C features a low fraction of charged residues and net charge per residue, both which are sequence features that often correlate with collapse to a disordered globule state.2 We note that this general tendency was established for sequences depleted in proline residues, an abundance of which will tend to disfavor chain compaction in the disordered state. Given Pdx1-C’s high enrichment in proline, we next turned our attention to the potential impact of these residues on the chain’s dimensions. Proline is the only residue which appreciably populates the cis-peptide isomerization state in solvated polypeptides (~5% prolines are cis);53 because Pdx1-C is 22% proline there is a high probability of a non-trivial amount of cis-proline in any single conformer that contributes to the overall ensemble. Improved Monte Carlo methods described by the Radhakrishnan et al., which were utilized here, can reproduce experimental proline isomer distributions.54 Our MC data showed significant cis-proline in Pdx1-C, with the highest probabilities localized in the center of the protein (Figure 5A). To study the effects of cis-proline on the conformational ensemble of Pdx1-C, we computed two additional MC ensembles that describe the (experimentally hypothetical) states in which all prolines are either completely in the trans or completely in the cis configuration (though small fluctuations in the ω-torsion angles were allowed; see Materials and Methods). Theses all-trans or all-cis MC ensembles were used to compare with the unbiased ensemble, where the ω-bond was allowed to flip between cis and trans isomers for all proline residues. Aspherecity vs. Rg was determined for each model of the unbiased, all-cis, and all-trans Pdx1-C ensembles, as shown in Figure 5B, C, D (left panel). In the unbiased trajectory the mean Rg of the distribution was 25.0 ± 3.4 Å, as described above, and the mean asphericity A = 0.378 ± 0.165 (Fig. 5B). Locking all the prolines into a trans-peptide configuration does not change the mean Rg or asphericity, which were 24.8 ± 3.5 Å and 0.370 ± 0.164, respectively. The all-cis proline Pdx1-C trajectory yields a slight decrease in the Rg to 22.5 ± 3.1 Å, but this decrease is not significantly different within error from the mean in the unbiased trajectory; the same was true for the asphericity, which produced a mean value of 0.36 ± 0.16. Thus, even in an all cis-Pro context, the global chain dimensions are only marginally different from those in the state with native isomer distribution. Fitzkee and Rose have elegantly demonstrated that, consistent with expectations from polymer theory, local non-random structure generally has no impact on global coil-like chain dimensions.55 Consistent with this polymer view, our data demonstrate that proline isomerization cannot significantly alter the global dimensions of the Pdx1-C chain.

Figure 5.

Figure 5.

Analysis of CAMPARI trajectories to determine the influence of proline trans-to-cis isomerization on Pdx1-C compaction. (A) The cis-Pro isomer content is shown for the control CAMPARI trajectory at 310K. The (A) control, (B) all cis-Pro, and (d) all trans-Pro trajectories yield the displayed asphericity vs. Rg distributions (left) and internal scaling profiles (right). Analyzed structure ensembles were from the 310 K temperature replica. For the internal scaling plots, the excluded volume (EV) and Flory random coil (FRC) reference curves are shown in blue and red respectively.

While proline trans-to-cis isomerization will not collapse Pdx1-C on average, it remains likely that the persistence length of a polypeptide enriched in proline will be greater than that expected for a random polypeptide sequence. The scaling of internal distances for each of the three trajectories were calculated and are shown in Figure 5B, C, D (right panel). Internal distances are calculated as the average distance between all atoms in a pair of residues vs. the sequence separation of those two particular residues. Scaling profiles for excluded volume (EV) chains and a Flory random coil (FRC) are shown as references. The internal scaling profile of the unbiased and all-trans proline trajectories were not observably different, but the all-cis trajectory showed a significant overall reduction in the internal scaling parameter.

The analysis above demonstrates that on the global scale, Pdx1-C adopts an open and extended ensemble, inconsistent with globular collapse. On the other hand, our MC ensemble provides us with the opportunity to investigate whether Pdx1-C features segments of its primary structure that collapse into more globular states. Indeed, the PRE data recorded with MTSL attached to C227 suggests the existence of just such a cluster of local interactions. Figure 6 shows a contact map of the 310K replica from the trajectory of Pdx1-C with unbiased proline isomerization. To construct this map, distances were measured for each pairwise Cα-Cα in a model and averaged over all the models in the trajectory. Small distances are depicted in red, medium distances in yellow, and large distances in blue (quantitative scale bar shown in Figure 6). One of the most surprising elements of the Pdx1-C primary structure is a 6-residue polyproline segment encompassing residues 239–244. This polyproline region appears to separate Pdx1-C into two structurally isolated segments; few contacts are indicated between residues bridging this segment. Consistent with the PRE data, region 200–237 appears more globular than region 245–283. We note that the charged residues in the 200–237 region are poorly mixed, leading us to hypothesize that electrostatic attraction between clusters of charged residues in this region drives the chain toward the (slight) compaction observed (Figure 4E). In contrast, relative charge depletion and much higher proline content (~26%) in the C-terminal half of Pdx1-C yields an unusually extended ensemble that is depleted in local residue-residue contacts.

Figure 6.

Figure 6.

Heatmap of contacts observed in the Pdx1-C ensemble, constructed from the 310 K temperature replica in the REMC calculation. Colors depict distances from Cα to Cα as quantified in the color key. In order to generate a representative set of conformations, five structures were extracted at random from the computed ensemble and aligned by the central six proline residues shown in gray (residues 239–244)

CONCLUSIONS

NMR is well-suited to characterize the conformational ensembles adopted by IDPs in solution, even in the absence of stable secondary structure. Here, we have presented extensive NMR characterization of the highly proline-enriched IDP Pdx1-C, relying exclusively on the 13C direct-detect methodology our laboratory has developed. Toward this broader objective, we have introduced four new RDC experiments, which are available for download from our website. Further, PRE measurements are reported in both the 15N,13C-CON and 15N,13C-(HACA)CON detection format, yielding additional structure constraints. We note that it appears the 13C,15N-(HACA)CON is more sensitive to long-range contacts, which will merit further investigation with an IDP that features persistent long-range contacts in its solution ensemble. Here, our PRE data show that Pdx1-C is highly extended overall, with a slight tendency to globular structure at the N-terminus.

We set out to describe the solution ensemble of Pdx1-C, with the purpose of building necessary information regarding the unbound structure of this critical regulatory region of the full transcription factor. Pdx1-C is known to be the target of the E3-ubiquitin ligase substrate adaptor protein SPOP, which promotes Pdx1-C degradation under low blood glucose conditions.18 Often in the IDP community, there is a tendency to seek confirmation of secondary structure, such as partially stable α-helix motifs, in the unbound ensemble of IDPs, as these are presumed to be the site of interaction with biological partners. In the case of SPOP, available crystal structures of the protein’s MATH recruitment domain bound to client protein-derived peptides demonstrate that those peptides bind in an extended format;56 there is no secondary structure for one to seek in the unbound ensemble. However, the SPOP binding motif within Pdx1-C embedded within the region we find to possess the most local compaction, suggesting that SPOP must either capture a rare Pdx1-C conformer or induce a more extended structure in conjunction with binding. Overall, Pdx1-C serves as a valuable counterexample to the broad trends in the IDP literature: molecular interaction motifs are not always found within secondary structural element; yet, even in their absence, a detailed understanding of the unbound-state structure is necessary to develop full mechanistic understanding. The new pulse programs and extended analysis strategy reported here should be of general benefit to future efforts toward determination of IDP ensemble properties.

Supplementary Material

Supplementary Material File

ACKNOWLEDGMENT

This work was supported by a US National Science Foundation CAREER grant MCB-0953918 and grant MCB-1515974 to SAS, and NIH NRSA predoctoral fellowship F31GM101936 to MB. We would like to thank the Penn State ScholarSphere team (http://scholarsphere.psu.edu) for their help establishing permanent and stable web-hosting of our pulse programs. This work is based upon research conducted at the Cornell High Energy Synchrotron Source (CHESS), which is supported by the National Science Foundation and the National Institutes of Health/National Institute of General Medical Sciences under NSF awards DMR-1332208 and DMR-0936384, using the Macromolecular Diffraction at CHESS (MacCHESS) facility, which is supported by award GM-103485 from the National Institute of General Medical Sciences, National Institutes of Health.

Footnotes

Supporting Information. Table of SAXS acquisition and analysis parameters, representative Pdx1-C NMR spectra, timing diagrams for all RDC pulse programs, representative spectra from all pulse programs, representative spin relaxation decays. This material is available free of charge via the Internet at http://pubs.acs.org.

REFERENCES

  • 1.Camilloni C; De Simone A; Vranken WF; Vendruscolo M Determination of Secondary Structure Populations in Disordered States of Proteins Using Nuclear Magnetic Resonance Chemical Shifts. Biochemistry 2012, 51, 2224–2231. [DOI] [PubMed] [Google Scholar]
  • 2.Das RK; Pappu RV Conformations of Intrinsically Disordered Proteins Are Influenced by Linear Sequence Distributions of Oppositely Charged Residues. Proc. Natl. Acad. Sci. U.S.A 2013, 110, 13392–13397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Salmon L; Nodet G; Ozenne V; Yin G; Jensen MR; Zweckstetter M; Blackledge M NMR Characterization of Long-Range Order in Intrinsically Disordered Proteins. J. Am. Chem. Soc 2010, 132, 8407–8418. [DOI] [PubMed] [Google Scholar]
  • 4.Mao AH; Crick SL; Vitalis A; Chicoine CL; Pappu RV Net Charge Per Residue Modulates Conformational Ensembles of Intrinsically Disordered Proteins. Proc. Natl. Acad. Sci. U.S.A 2010, 107, 8183–8188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.van der Lee R; Buljan M; Lang B; Weatheritt RJ; Daughdrill GW; Dunker AK; Fuxreiter M; Gough J; Gsponer J; Jones DT, et al. Classification of Intrinsically Disordered Regions and Proteins. Chem. Rev 2014, 114, 6589–6631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Fuertes G; Banterle N; Ruff KM; Chowdhury A; Mercadante D; Koehler C; Kachala M; Estrada Girona G; Milles S; Mishra A, et al. Decoupling of Size and Shape Fluctuations in Heteropolymeric Sequences Reconciles Discrepancies in SAXS vs. FRET Measurements. Proc. Natl. Acad. Sci. U.S.A 2017, 114, E6342–E6351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gibbs EB; Cook EC; Showalter SA Application of NMR to Studies of Intrinsically Disordered Proteins. Arch. Biochem. Biophys 2017, 628, 57–70. [DOI] [PubMed] [Google Scholar]
  • 8.Sormanni P; Piovesan D; Heller GT; Bonomi M; Kukic P; Camilloni C; Fuxreiter M; Dosztanyi Z; Pappu RV; Babu MM, et al. Simultaneous Quantification of Protein Order and Disorder. Nat. Chem. Biol 2017, 13, 339–342. [DOI] [PubMed] [Google Scholar]
  • 9.Bastidas M; Gibbs EB; Sahu D; Showalter SA A Primer for Carbon-Detected NMR Applications to Intrinsically Disordered Proteins in Solution. Con. Magn. Reson. A 2015, 44, 54–66. [Google Scholar]
  • 10.Lawrence CW; Bonny A; Showalter SA The Disordered C-Terminus of the RNA Polymerase II Phosphatase Fcp1 Is Partially Helical in the Unbound State. Biochem. Biophys. Res. Commun 2011, 410, 461–465. [DOI] [PubMed] [Google Scholar]
  • 11.Gibbs EB; Showalter SA Quantification of Compactness and Local Order in the Ensemble of the Intrinsically Disordered Protein Fcp1. J. Phys. Chem. B 2016, 120, 8960–8969. [DOI] [PubMed] [Google Scholar]
  • 12.Gibbs EB; Lu F; Portz B; Fisher MJ; Medellin BP; Laremore TN; Zhang YJ; Gilmour DS; Showalter SA Phosphorylation Induces Sequence-Specific Conformational Switches in the RNA Polymerase II C-Terminal Domain. Nat. Commun 2017, 8, 15233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Stapley BJ; Creamer TP A Survey of Left-Handed Polyproline II Helices. Protein Sci. 1999, 8, 587–595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Sahu D; Bastidas M; Showalter SA Generating Nmr Chemical Shift Assignments of Intrinsically Disordered Proteins Using Carbon-Detected NMR Methods. Anal. Biochem 2014, 449, 17–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Babu DA; Deering TG; Mirmira RG A Feat of Metabolic Proportions: Pdx1 Orchestrates Islet Development and Function in the Maintenance of Glucose Homeostasis. Mol. Genet. Metab 2007, 92, 43–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Porter JR; Barrett TG Monogenic Syndromes of Abnormal Glucose Homeostasis: Clinical Review and Relevance to the Understanding of the Pathology of Insulin Resistance and Beta Cell Failure. J. Med. Genet 2005, 42, 893–902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Roy N; Takeuchi KK; Ruggeri JM; Bailey P; Chang D; Li J; Leonhardt L; Puri S; Hoffman MT; Gao S, et al. Pdx1 Dynamically Regulates Pancreatic Ductal Adenocarcinoma Initiation and Maintenance. Genes Dev. 2016, 30, 2669–2683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Liu A; Desai BM; Stoffers DA Identification of Pcif1, a Poz Domain Protein That Inhibits Pdx-1 (Mody4) Transcriptional Activity. Mol. Cell. Biol 2004, 24, 4372–4383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bermel W; Bertini I; Felli IC; Lee YM; Luchinat C; Pierattelli R Protonless NMR Experiments for Sequence-Specific Assignment of Backbone Nuclei in Unfolded Proteins. J. Am. Chem. Soc 2006, 128, 3918–3919. [DOI] [PubMed] [Google Scholar]
  • 20.Bermel W; Bertini I; Csizmok V; Felli IC; Pierattelli R; Tompa P H-Start for Exclusively Heteronuclear NMR Spectroscopy: The Case of Intrinsically Disordered Proteins. J. Magn. Reson 2009, 198, 275–281. [DOI] [PubMed] [Google Scholar]
  • 21.Vitalis A; Pappu RV Absinth: A New Continuum Solvation Model for Simulations of Polypeptides in Aqueous Solutions. J. Comput. Chem 2009, 30, 673–699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Vitalis A; Pappu RV Methods for Monte Carlo Simulations of Biomacromolecules. Annu. Rep. Comput. Chem 2009, 5, 49–76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hopkins JB; Gillilan RE; Skou S Bioxtas Raw: Improvements to a Free Open-Source Program for Small-Angle X-Ray Scattering Data Reduction and Analysis. J. Appl. Crystallogr 2017, 50, 1545–1553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Receveur-Brechot V; Durand D How Random Are Intrinsically Disordered Proteins? A Small Angle Scattering Perspective. Curr. Protein Pept. Sci 2012, 13, 55–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lee W; Tonelli M; Markley JL NMRFAM-Sparky: Enhanced Software for Biomolecular NMR Spectroscopy. Bioinformatics 2015, 31, 1325–1327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Lawrence CW; Showalter SA Carbon-Detected 15N NMR Spin Relaxation of an Intrinsically Disordered Protein: Fcp1 Dynamics Unbound and in Complex with Rap74. J. Phys. Chem. Lett 2012, 3, 1409–1413. [DOI] [PubMed] [Google Scholar]
  • 27.Sass HJ; Musco G; Stahl SJ; Wingfield PT; Grzesiek S Solution NMR of Proteins within Polyacrylamide Gels: Diffusional Properties and Residual Alignment by Mechanical Stress or Embedding of Oriented Purple Membranes. J. Biomol. NMR 2000, 18, 303–309. [DOI] [PubMed] [Google Scholar]
  • 28.Bodenhausen G; Ernst RR Direct Determination of Rate Constants of Slow Dynamic Processes by Two-Dimensional Accordion Spectroscopy in Nuclear Magnetic-Resonance. J. Am. Chem. Soc 1982, 104, 1304–1309. [Google Scholar]
  • 29.Pettersen EF; Goddard TD; Huang CC; Couch GS; Greenblatt DM; Meng EC; Ferrin TE Ucsf Chimera--a Visualization System for Exploratory Research and Analysis. J. Comput. Chem 2004, 25, 1605–1612. [DOI] [PubMed] [Google Scholar]
  • 30.Svergun D; Barberato C; Koch MHJ Crysol - a Program to Evaluate X-Ray Solution Scattering of Biological Macromolecules from Atomic Coordinates. Journal of Applied Crystallography 1995, 28, 768–773. [Google Scholar]
  • 31.Battiste JL; Wagner G Utilization of Site-Directed Spin Labeling and High-Resolution Heteronuclear Nuclear Magnetic Resonance for Global Fold Determination of Large Proteins with Limited Nuclear Overhauser Effect Data. Biochemistry 2000, 39, 5355–5365. [DOI] [PubMed] [Google Scholar]
  • 32.Dunlap TB; Guo HF; Cook EC; Holbrook E; Rumi-Masante J; Lester TE; Colbert CL; Vander Kooi CW; Creamer TP Stoichiometry of the Calcineurin Regulatory Domain-Calmodulin Complex. Biochemistry 2014, 53, 5779–5790. [DOI] [PubMed] [Google Scholar]
  • 33.Bernado P; Svergun DI Structural Analysis of Intrinsically Disordered Proteins by Small-Angle X-Ray Scattering. Mol. Biosyst 2012, 8, 151–167. [DOI] [PubMed] [Google Scholar]
  • 34.Bermel W; Bertini I; Felli IC; Peruzzini R; Pierattelli R Exclusively Heteronuclear NMR Experiments to Obtain Structural and Dynamic Information on Proteins. Chemphyschem 2010, 11, 689–695. [DOI] [PubMed] [Google Scholar]
  • 35.Novacek J; Haba NY; Chill JH; Zidek L; Sklenar V 4d Non-Uniformly Sampled HCBCACON and 1J(NCα)-Selective HCBCANCO Experiments for the Sequential Assignment and Chemical Shift Analysis of Intrinsically Disordered Proteins. J. Biomol. NMR 2012, 53, 139–148. [DOI] [PubMed] [Google Scholar]
  • 36.Permi P; Heikkinen S; Kilpelainen I; Annila A Measurement of 1J(NC’) and 2J(H(N))(C’) Couplings from Spin-State-Selective Two-Dimensional Correlation Spectrum. J. Magn. Reson 1999, 140, 32–40. [DOI] [PubMed] [Google Scholar]
  • 37.Louhivuori M; Paakkonen K; Fredriksson K; Permi P; Lounila J; Annila A On the Origin of Residual Dipolar Couplings from Denatured Proteins. J. Am. Chem. Soc 2003, 125, 15647–15650. [DOI] [PubMed] [Google Scholar]
  • 38.Kleckner IR; Foster MP An Introduction to Nmr-Based Approaches for Measuring Protein Dynamics. Biochim. Biophys. Acta 2011, 1814, 942–968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Jensen MR; Zweckstetter M; Huang JR; Blackledge M Exploring Free-Energy Landscapes of Intrinsically Disordered Proteins at Atomic Resolution Using NMR Spectroscopy. Chem. Rev 2014, 114, 6632–6660. [DOI] [PubMed] [Google Scholar]
  • 40.Salvi N; Abyzov A; Blackledge M Analytical Description of Nmr Relaxation Highlights Correlated Dynamics in Intrinsically Disordered Proteins. Angew. Chem. Int. Ed. Engl 2017, 56, 14020–14024. [DOI] [PubMed] [Google Scholar]
  • 41.Gill ML; Byrd RA; Palmer AG Dynamics of Gcn4 Facilitate DNA Interaction: A Model-Free Analysis of an Intrinsically Disordered Region. Phys. Chem. Chem. Phys 2016, 18, 5839–5849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.To V; Dzananovic E; McKenna SA; O’Neil J The Dynamic Landscape of the Full-Length Hiv-1 Transactivator of Transcription. Biochemistry 2016, 55, 1314–1325. [DOI] [PubMed] [Google Scholar]
  • 43.Khan SN; Charlier C; Augustyniak R; Salvi N; Dejean V; Bodenhausen G; Lequin O; Pelupessy P; Ferrage F Distribution of Pico- and Nanosecond Motions in Disordered Proteins from Nuclear Spin Relaxation. Biophys. J 2015, 109, 988–999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Abyzov A; Salvi N; Schneider R; Maurin D; Ruigrok RW; Jensen MR; Blackledge M Identification of Dynamic Modes in an Intrinsically Disordered Protein Using Temperature-Dependent NMR Relaxation. J. Am. Chem. Soc 2016, 138, 6240–6251. [DOI] [PubMed] [Google Scholar]
  • 45.Csizmok V; Felli IC; Tompa P; Banci L; Bertini I Structural and Dynamic Characterization of Intrinsically Disordered Human Securin by Nmr Spectroscopy. J. Am. Chem. Soc 2008, 130, 16873–16879. [DOI] [PubMed] [Google Scholar]
  • 46.Bertoncini CW; Rasia RM; Lamberto GR; Binolfi A; Zweckstetter M; Griesinger C; Fernandez CO Structural Characterization of the Intrinsically Unfolded Protein Beta-Synuclein, a Natural Negative Regulator of Alpha-Synuclein Aggregation. J. Mol. Biol 2007, 372, 708–722. [DOI] [PubMed] [Google Scholar]
  • 47.Baronti L; Erales J; Habchi J; Felli IC; Pierattelli R; Longhi S Dynamics of the Intrinsically Disordered C-Terminal Domain of the Nipah Virus Nucleoprotein and Interaction with the X Domain of the Phosphoprotein as Unveiled by NMR Spectroscopy. Chembiochem 2015, 16, 268–276. [DOI] [PubMed] [Google Scholar]
  • 48.Janowska MK; Baum J The Loss of Inhibitory C-Terminal Conformations in Disease Associated P123H Beta-Synuclein. Protein Sci. 2016, 25, 286–294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Bernado P; Mylonas E; Petoukhov MV; Blackledge M; Svergun DI Structural Characterization of Flexible Proteins Using Small-Angle X-Ray Scattering. J. Am. Chem. Soc 2007, 129, 5656–5664. [DOI] [PubMed] [Google Scholar]
  • 50.Tria G; Mertens HD; Kachala M; Svergun DI Advanced Ensemble Modelling of Flexible Macromolecules Using X-Ray Solution Scattering. IUCrJ 2015, 2, 207–217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Pak CW; Kosno M; Holehouse AS; Padrick SB; Mittal A; Ali R; Yunus AA; Liu DR; Pappu RV; Rosen MK Sequence Determinants of Intracellular Phase Separation by Complex Coacervation of a Disordered Protein. Mol. Cell 2016, 63, 72–85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Martin EW; Holehouse AS; Grace CR; Hughes A; Pappu RV; Mittag T Sequence Determinants of the Conformational Properties of an Intrinsically Disordered Protein Prior to and Upon Multisite Phosphorylation. J. Am. Chem. Soc 2016, 138, 15323–15335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Sarkar SK; Young PE; Sullivan CE; Torchia DA Detection of Cis and Trans X-Pro Peptide-Bonds in Proteins by 13C NMR - Application to Collagen. Proc. Natl. Acad. Sci. U.S.A 1984, 81, 4800–4803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Radhakrishnan A; Vitalis A; Mao AH; Steffen AT; Pappu RV Improved Atomistic Monte Carlo Simulations Demonstrate That Poly-L-Proline Adopts Heterogeneous Ensembles of Conformations of Semi-Rigid Segments Interrupted by Kinks. J. Phys. Chem. B 2012, 116, 6862–6871. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Fitzkee NC; Rose GD Reassessing Random-Coil Statistics in Unfolded Proteins. Proc. Natl. Acad. Sci. U.S.A 2004, 101, 12497–12502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Zhuang M; Calabrese MF; Liu J; Waddell MB; Nourse A; Hammel M; Miller DJ; Walden H; Duda DM; Seyedin SN, et al. Structures of SPOP-Substrate Complexes: Insights into Molecular Architectures of Btb-Cul3 Ubiquitin Ligases. Mol. Cell 2009, 36, 39–50. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material File

RESOURCES