Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2024 Jul 1:2024.06.27.601048. [Version 1] doi: 10.1101/2024.06.27.601048

The Potent PHL4 Transcription Factor Effector Domain Contains Significant Disorder

Blake D Fonda 1, Dylan T Murray 2,*
PMCID: PMC11244893  PMID: 39005418

Abstract

The phosphate-starvation response transcription-factor protein family is essential to plant response to low-levels of phosphate. Proteins in this transcription factor (TF) family act by altering various gene expression levels, such as increasing levels of the acid phosphatase proteins which catalyze the conversion of inorganic phosphates to bio-available compounds. There are few structural characterizations of proteins in this TF family, none of which address the potent TF activation domains. The phosphate-starvation response-like protein-4 (PHL4) protein from this family has garnered interest due to the unusually high TF activation activity of the N-terminal domain. Here, we demonstrate using solution nuclear magnetic resonance (NMR) measurements that the PHL4 N-terminal activating TF effector domain is mainly an intrinsically disordered domain of over 200 residues, and that the C-terminal region of PHL4 is also disordered. Additionally, we present evidence from size-exclusion chromatography, diffusion NMR measurements, and a cross-linking assay suggesting full-length PHL4 forms a tetrameric assembly. Together, the data indicate the N- and C-terminal disordered domains in PHL4 flank a central folded region that likely forms the ordered oligomer of PHL4. This work provides a foundation for future studies detailing how the conformations and molecular motions of PHL4 change as it acts as a potent activator of gene expression in phosphate metabolism. Such a detailed mechanistic understanding of TF function will benefit genetic engineering efforts that take advantage of this activity to boost transcriptional activation of genes across different organisms.

Keywords: transcription factor, effector, activator, solution nuclear magnetic resonance, protein structure, intrinsically disordered domain, phosphate starvation pathway

Introduction

Despite their ubiquitous role in biological processes, a mechanistic molecular understanding of how DNA-binding TF proteins alter gene expression is elusive. The action of DNA-binding TF proteins is generally due to a two-component architecture consisting of a DNA binding domain and a separate effector domain (Staby et al., 2017). Effector domains, either activating or repressing transcription, have been shown to be functionally interchangeable between TFs (Hummel et al., 2023). For example, genetically engineered chimeric proteins with DNA binding and transcriptional effector domains from different proteins modulate transcription according to the nature of the effector domain for the target gene of the DNA binding domain (Hummel, 2023).

At the molecular level, it is unclear how these effector domains modulate transcription. Some proposals implicate proline rich, glutamine rich, or acidic regions within the effector domain. For instance, the presence of acidic regions activate gene transcription, while deletion of acidic regions decrease transcription rates (Sanborn et al., 2021; Sigler, 1988; Soto et al., 2022). These effector domains increase transcription via interaction with partner proteins such as mediator but also interact with other regulatory proteins that inhibit their effect on transcription (Sanborn et al., 2021). For favorable protein-protein interactions to be possible between a TF effector domain and several different proteins, the ability to populate several distinct conformations is likely beneficial. Indeed, conformational flexibility has been evolutionarily selected for on a genome wide scale for the effector domains of transcription factors (Soto, et al. 2022). Thus, intrinsically disordered regions within effector domains of TFs have been hypothesized to be causative of modulating gene transcription.

The Arabidopsis TF phosphate starvation response like protein-4 (PHL4) belongs to a family of homologous phosphate-starvation response TF proteins (Wang et al., 2018). The best characterized family member, and the one with the closest sequence similarity to PHL4, is the protein phosphate starvation response-1 (PHR1), believed to be the central TF that is most-essential to the plant response to low phosphate availability (Rubio et al., 2001). All family members (PHR1, PHL1, PHL2, PHL3, and PHL4) share highly conserved amino acid sequences in the DNA binding and coiled-coil domains (Bustos et al., 2010; Wang, et al., 2018). Like PHR1, although to a different degree depending on the gene target, PHL4 can increase the transcription of these same genes in the presence of low phosphate levels (Wang et al., 2018). PHL4 is therefore functionally similar to the central TF of the phosphate response, PHR1.

While the coiled-coil and DNA binding domains explain promoter binding and cooperative enhancement of gene expression, these domains do not fully explain the interaction of PHR1 and PHL4 with the transcriptional machinery. The first 227 amino acids of PHL4 are not predicted to adopt a folded conformation by AlphaFold 2.0 (Jumper et al., 2021; Varadi et al., 2022), yet have significant sequence similarity, 48%, with PHR1. PHL4 has an extra 37 residues at the N-terminus, which could be related to additional functional activity relative to PHR1 (Wang, 2018). These first 227 residues of PHL4, the effector domain, were among the most potent in an Arabidopsis-based screen of effector domains for 400 TF proteins, in some cases inducing gene expression more than the state-of-the-art bioengineering effector domain VP16 (Hummel et al., 2023). The N-terminal effector domain of PHL4 is therefore likely an important component of transcriptional activation. In addition to the native function of PHL4, it is also a promising candidate for creating artificial TF proteins in genetically modified organisms (Hummel et al., 2023). It is highly active in multiple eukaryotes including yeast and N. benthamiana (Hummel et al., 2023), promising broad applicability of the PHL4 effector domain as a molecular tool to increase gene expression in genetically engineered crops. Additionally, the PHL4 effector domain is an important case study, helping to build a consensus regarding the shared features of potent TF proteins across eukaryotic organisms.

To the best of our knowledge, despite the importance of PHL4 in phosphate response pathways and as an exceptional transcriptional activator, there is no published structural characterization of the protein. As a starting point for dissecting the protein-protein and protein-nucleic acid interactions of PHL4 as it carries out TF activity, we characterize the structure and conformational flexibility of the protein, with particular focus on the N-terminal effector domain. Fluorescence and circular dichroism measurements report on global structure and order. Cross-linking, chromatography, and diffusion NMR measurements probe higher order oligomeric assembly. Solution NMR measurements determine residue-specific conformations and timescales of motion. Together these results form a foundation for addressing changes in these properties as PHL4 carries out TF activity.

Results

Pure preparations of full-length, wild-type PHL4 (herein referred to as PHL4FL) and the first 227 N-terminal residues (herein referred to as PHL4effector), were made using recombinant expression in E. coli. Each construct, illustrated in Supplementary Figure 1, contains a N-terminal 6x His tag with a tobacco etch virus (TEV) cleavage site that was removed before subsequent experiments. Following TEV cleavage, reverse phase affinity column chromatography yielded highly pure PHL4FL and PHL4effector as observed by SDS-PAGE in Supplementary Figure 1. The domain map in Figure 1a shows PHL4FL follows the typical architecture of a TF, with an effector domain, DNA binding domain, and accessory domains (Staby et al., 2017).

Figure 1. Bulk secondary structure content in PHL4.

Figure 1.

(a) A domain map for PHL4. HTH DBD is the homology predicted helix-turn-helix DNA binding domain, CC is the homology predicted coiled-coil domain, and residues 1–227 and 369–397 are to be shown in this work as primarily disordered. (b) Intrinsic Trp fluorescence emission spectra for PHL4effector with and without 6 M Gu-HCl. (c) PHL4FL intrinsic Trp fluorescence spectra with and without 6 M Gu-HCl. For both (c) and (d), the spectra are normalized so the maximal intensity is 1.0. (d) CD spectra of PHL4FL in red and PHL4effector in blue recorded in non-denaturing conditions. (e) A summary of the analysis of the CD spectra in (d) using BeStSel. “Others” in the BeStSel analysis encompasses disordered regions, 310-helix, π-helix, β-bridge, bend, and loop regions.

As an initial probe of structure formation in PHL4, we compared intrinsic Trp fluorescence in native and denaturing conditions. Figure 1b shows spectra in non-denaturing and guanidine hydrochloride (Gu-HCl) denaturing conditions for PHL4effector. No change in the spectral line shape or position is observed and the intensity maximum (λmax) is 353.0 nm, close to that of a fully water exposed Trp of 355.0 nm (Royer, 2006). Thus, the Trp residues in the PHL4effector are solvent exposed in both conditions. However, Figure 1c shows spectra of PHL4FL revealing an intensity maximum shift from 343.8 to 353.3 nm when the buffer is changed from native-like to denaturing. This shift is consistent with a reduction in solvent accessibility for Trp residues in PHL4FL in non-denaturing conditions. However, 343.8 nm is intermediate between the fully water exposed value (355 nm) and that of partially buried Trp (~335 nm) (Royer, 2006), indicating not all Trp species in PHL4FL are involved in structure formation in native-like buffer conditions. Of the three Trp residues in PHL4FL, two are in the effector domain at W113 and W149, and the third is residue W236 in the DNA binding region. The intrinsic Trp fluorescence spectra for PHL4FL and PHL4effector are therefore consistent with a structured conformation around W236, resulting in the partial burial away from solvent of this residue in PHL4FL, and residues W113 and W149 being solvent exposed and disordered in both PHL4FL and PHL4effector.

An unfolding assay of PHL4FL was then performed to estimate the ΔGfolding. PHL4FL was diluted into a series of buffered, near neutral pH, and increasingly denaturing Gu-HCl solutions and, after allowing the samples to equilibrate, intrinsic Trp fluorescence spectra were recorded. The mole fraction of folded protein at each denaturant concentration was calculated and fit to a sigmoidal relationship providing the apparent Gibbs energy change of folding, ΔGfolding, values for PHL4FL of −2.5 ± 0.3 kcal/mol based on intensity measurements or −2.3 ± 0.8 kcal/mol based on wavelength of maximum intensity (λmax) values following the analysis described in Monsellier and Bedouelle, 2005, and assuming a two state, folded to unfolded, model. The intrinsic Trp spectra and fitted curves are shown in Supplementary Figure 2. These ΔGfolding values are consistent with a weakly folded, or mostly disordered PHL4FL with small regions of well-defined structure. In comparison, the protein lysozyme, for example, with roughly 250 fewer amino acid residues than PHL4, has a much more favorable ΔGfolding of −8.5 kcal/mol (Li-Blamer et al., 2019).

Circular dichroism (CD) was then used to investigate the secondary structure present in PHL4FL and PHL4effector. CD uses the differential absorption of circularly polarized light by chiral protein structural motifs, like α-helix versus β-strand, to estimate global protein secondary structure content when compared with databases of CD spectra and known protein structures. Figure 1d shows the CD spectra of PHL4effector and PHL4FL. Analysis of the spectra using BeStSel (Micsonai et al., 2022) with the “Disordered-Ordered” classification is summarized in Figure 1e. The prediction for the PHL4effector spectrum is consistent with a disordered protein, while the prediction for PHL4FL is consistent with a mixture of ordered secondary structure motifs accounting for approximately half of the protein. In this analysis, unfolded or regions with non-standard structure are indicated as “Other” regions and account for the remaining half of the PHL4FL protein. The lack of secondary structure content determined by CD is consistent with the intrinsic Trp fluorescence results.

To characterize the oligomeric state of PHL4effector and PHL4, mass-spectral analysis was performed with two different spectrometer types: electro-spray with an orbitrap detector (ESI-orbitrap) and matrix assisted laser desorption ionization with a time-of-flight detector (MALDI-TOF). Supplementary Figure 3 shows the mass spectra and analysis for both PHL4FL and PHL4effector which are consistent with primarily monomeric protein. However, these methods are subject to the caveat that the ionization methods used can break apart weakly associated oligomers or miss lowly populated states in the presence of a dominant monomeric species (Leney and Heck, 2017, Rogawski and Sharon, 2022).

Since PHL4FL was shown by fluorescence and CD measurements to contain folded regions, a cross-linking SDS PAGE assay on PHL4FL was ran to confirm the oligomeric state. Figure 2a demonstrates that the PHL4FL monomer band runs on the gel as a 56 kDa mass, higher than the 44 kDa calculated from the protein primary sequence, which is also consistent with the SDS PAGE gel used to analyze the PHL4FL purification shown in Supplementary Figure 1. Distinct, higher molecular weight bands are also observed in the crosslinking gel in Figure 2a. Dividing the mass of the higher molecular weight bands by the 56 kDa monomer mass allows for the determination of oligomeric species present in each sample. These oligomeric states are depicted in Figure 2bd as dimer, tetramer, and higher oligomeric species. The lack of a band corresponding to a trimer indicates that PHL4FL is likely not a monomer or a trimer in solution, but rather is an even numbered oligomer, consistent with a dimer, tetramer, or higher state. The strong monomer band on the gel is unsurprising given that the gel sample conditions (8% w/v sodium dodecyl sulfate) can break apart any weakly associated complexes not covalently linked by glutaraldehyde.

Figure 2. PHL4FL cross-linking assay shows a lack of odd numbered oligomeric species.

Figure 2.

(a) PHL4FL incubated with increasing percentages of glutaraldehyde, doubling the concentration of glutaraldehyde in each successive lane, run on a 4–10% Bis-acrylimide gradient SDS PAGE gel. Arrows point to oligomeric species detected in 0.004% glutaraldehyde and the circle highlights aggregated protein unable to enter the gel. (b) Plot of the molecular weights of the bands observed in the gel in panel (a), arrows point to bands observed in both the 0% and 0.004% lanes. (c) and (d) Bar plots of the integrated gel band intensity versus the molecular weight, measured from the cubic spline fit to the data in panel (b). Dimer and tetramer are based on the observed molecular weight divided by the intense monomer band at 56 kDa. The percentages normalized so that the sum of all band intensities equals 100%.

To further clarify the oligomeric state of PHL4FL, size exclusion chromatography (SEC) was performed in non-denaturing and pH 7.4 buffered solution. Figures 3ab shows the chromatograms for PHL4effector and PHL4FL compared with protein molecular weight standards ran under identical buffer conditions. Since PHL4effector and PHL4FL contain disordered motifs as indicated by CD and fluorescence microscopy, the chromatograms were therefore used to calculate the radius of hydration, Rh, not molecular weight. This approach helps account for larger than expected Rh values for a given protein molecular weight due to extended conformations of disordered domains. Figure 3c shows the Rh and retention time relationship used to estimate Rh. For the protein standards, Rh values were obtained from the literature (Stetefeld et al., 2016; Talmard et al., 2007; Wilkins et al., 1999). PHL4effector and PHL4FL sizes measured by SEC are tabulated in Table 1.

Figure 3. Radius of hydration measurement for PHL4FL and PHL4effector.

Figure 3.

(a) SEC of PHL4effector in blue, overlayed with a chromatogram of standard proteins ran in the same conditions in black, with the labels corresponding to the molecular weights specified by the manufacturer. (b) SEC of PHL4FL in red versus standards ran in the same conditions. (c) Plot of the radius of hydration, Rh, reported in the literature for the SEC standards (Stetefeld et al., 2016; Talmard et al., 2007; Wilkins et al., 1999) as a function of retention volume with a best-fit exponential decay function. The Rh values for PHL4FL and PHL4effector, calculated from the best-fit function, are shown and red and blue circles. (d) Region of PHL4effector diffusion NMR 1H NMR spectrum integrated to determine Rh.

Table 1. Radius of hydration predicts size and folded state for each protein.

The SEC result is the value calculated from the measured SEC retention time. The diffusion NMR NMR result is calculated from the gradient-strength based decay observed in the 1H NMR spectra.

Protein Motif SEC Rh (nm)a Diffusion NMR Rh (nm)b Predicted Folded to Unfolded Rh Rangec Predicted Oligomeric State
Monomer: 33.5–59.5 Å
Dimer: 43.7–83.7 Å
PHL4FL Trimer: 51.0–102.1 Å Partially folded timer to octamer
97.0 ± 1.0 Å 86.9 ± 9.7 Å Tetramer: 67.0–117.7 Å
Octamer: 74.2–165.5 Å
PHL4effector 39.1 ± 0.9 Å 40.4 ± 0.1 Å Monomer: 27.1–45.2 Å Unfolded monomer
Dimer: 35.3–63.6 Å
a

Uncertainty is the standard deviation determined for the best-fit exponential for the retention times of the standard samples.

b

Uncertainty is the standard deviation from the Rh value calculated from three separate diffusion NMR measurements on each sample.

To corroborate the SEC-based Rh measurement, diffusion NMR spectroscopy measurements were made as an additional measure of Rh. The diffusion NMR measurement uses a series of increasing strength gradient pulses during a 1H NMR experiment causing a signal decay. Figure 3c shows the decay observed for PHL4effector. The rate of decay is related to the diffusion of each molecular species whose nuclei give rise to NMR peaks in the spectrum. The rate of diffusion for the protein is calibrated using the 1H NMR signal decay for an internal 1,4-dioxane standard. The diffusion rate data is then used to calculate Rh. Additional spectra and analysis are presented in Supplemental Figures 46. Diffusion NMR measured Rh values are also tabulated in Table 1. Data for horse myoglobin is included as a control with a known molecular weight and radius of hydration, and published diffusion NMR data.

The Rh measured using diffusion NMR and SEC are in agreement, in light of established literature showing that SEC and diffusion NMR Rh measurements likely carry uncertainties of a few Å for the small proteins lysozyme (20 ± 2 Å) and ovalbumin (30 ± 1.4 Å) (Dudás and Bodor., 2019; Parmar et al., 2009). Extrapolation of those uncertainties to the size of PHL4FL measured here suggest that the uncertainty is on the order of ±6 Å (Dudás and Bodor, 2019; Parmar and Muschol, 2009). The slight difference in Rh calculated between diffusion NMR and SEC methods also makes sense given these Rh analyses rest on a spherical approximation of protein shape. Additionally, potential protein-column interactions during the flow of proteins through porous resin during SEC could alter the observed Rh.

The Rh values from both the SEC and diffusion NMR analyses are compared to known relations between the size and number of residues in a protein. The upper and lower bounds in Table 1 come from the estimated Rh calculated from analytical relationships for completely folded proteins (lower bound) and for unfolded proteins (upper bound) (Dudás and Bodor, 2019). This calculation gives a range of Rh values based on the number of amino acid residues for each possible oligomeric state of PHL4effector and PHL4FL. For PHL4effector, the Rh measured using both SEC and diffusion NMR is consistent with monomeric, unfolded protein. For PHL4, on the other hand, the measured Rh value consistent with an oligomer bigger than a dimer.

The molecular weight estimates from diffusion NMR and SEC, the CD and fluorescence analysis of structural disorder, the cross-linking assay showing that the predominate species is an even integer multiple, and the unfolding assay showing a small ΔGfolding, suggest that PHL4FL is to likely be between a tetramer and an octamer with a mixture of folded and unfolded regions. We tentatively propose that these results are consistent with a tetrameric state, which would be consistent with the result for the homologous PHR1 protein, for which the coiled-coil domain forms tetramers (Ried et al., 2021). The tetrameric state upper bound is near the value calculated for a completely unfolded PHL4FL, which is compatible with the domain structure of PHL4FL in Figure 1a showing the structured coiled-coil and helix-turn-helix domains occupying only ~28% of the PHL4FL sequence and our solution NMR measurements showing the N- and C-termini are disordered (vide infra). For PHL4effector, the data robustly show that the effector domain in isolation is a monomer.

To gain structural insight at the level of individual amino acid residues, solution NMR was used to measure the chemical shifts of individual atoms in the protein. The NMR chemical shift values report on both amino acid type and protein secondary structure. Sequence specific assignments are necessary to use NMR to probe structural conformations present in our PHL4FL and PHL4effector samples. The NMR chemical shifts from each protein sample were recorded using a standard set of triple-resonance solution NMR experiments. Figure 4ab shows 1H–15N HSQC spectra for PHL4effector and PHL4FL annotated with the sequence specific assignments. These experiments and the data acquisition parameters are listed in Supplementary Table 1. Sequence specific assignments of the NMR chemical shifts were made manually for PHL4effector sequence, with 72% of the 227 residues in PHL4effector assigned. For PHL4FL, 59% of the N-terminal effector domain was assigned and 100% of the C-terminal residues from G369–E397. Out of all observed signals for PHL4effector, 73% were assigned, whereas for PHL4FL, 62% of the signals were assigned. Many of these unassigned peaks have low signal-to-noise, as shown in Figure 4 and described in Supplementary Tables 36.

Figure 4. 1H–15N HSQC spectra for PHL4effector and PHL4FL.

Figure 4.

(a) 1H–15N HSQC NMR spectrum of PHL4effector. (b) 1H–15N HSQC NMR spectrum of the PHL4FL. Grey peaks represent unassigned peaks. “n” and “s” labels depict ambiguously assigned Asn and Ser peaks that have unique 1H and 15N shifts but non-unique “i” and “i - 1” carbon chemical shifts.

For strong signals, the remaining unassigned NMR chemical shifts for PHL4effector and PHL4FL are primarily due to ambiguity in amino acid sequence. Supplemental Figures 1cd shows the N-terminal region contains an acidic region with an almost identical 12–13 amino acid sequence repeated three times. Other unassigned NMR chemical shifts are likely due to the well folded regions in the PHL4FL oligomer having large global rotational correlation times or slow local molecular motions, leading to reduced signal-to-noise in the NMR experiment and therefore a lack of connecting peaks observed in the spectra used for assignments.

The spectra in Figures 4a and 4b are highly similar to each other. The assigned NMR chemical shifts for PHL4effector appear at a nearly identical NMR chemical shift values in the full-length PHL4FL spectra for many residues. Supplementary Figure 7 quantifies the similarity in NMR chemical shifts via chemical shift perturbations, CSPs. Small CSPs indicate a lack of large structural differences in the assigned regions of PHL4FL and PHL4effector. In sum, the solution NMR measurements indicate the effector domain in isolation adopts similar conformations and has similar molecular motions as the effector domain within the full-length PHL4.

Secondary chemical shifts are used to predict protein secondary structure from the assigned NMR chemical shifts. Secondary chemical shifts are the difference between the observed NMR chemical shift value for a residue and the random coil conformation NMR chemical shift value for that amino acid type (Kjaergaard et al., 2011a, Kjaergaard et al., 2011b). For PHL4effector and PHL4FL, the secondary chemical shifts are represented as the difference between the CA and CB secondary chemical shifts in Figures 5a and 5c. Large positive values over several adjacent residues indicates an a-helical structure, while large negative values for neighboring residues are consistent with a β-strand structure (Wishart, 2011). These plots show that the majority of the secondary chemical shift magnitudes are less than 2, a commonly used cutoff for well-defined structure (Wishart, 2011). The secondary chemical shifts are therefore consistent with the assigned residues in both PHL4FL and PHL4effector lacking well-defined secondary structure.

Figure 5. NMR chemical shift assignments and 15N relaxation rates show a random coil-like conformation.

Figure 5.

(a) PHL4effector secondary chemical shift represented as the difference in CA and CB secondary chemical shifts. (b) TALOS-N predicted secondary structure for PHL4effector based on the NMR chemical shift assignments. (c) PHL4FL secondary chemical shifts represented as described for (a). (d) TALOS-N predicted secondary structure for PHL4FL based on the NMR chemical shift assignments. (e) PHL4effector R2 relaxation rates calculated from R1 and R measurements. (f) Measured PHL4effector R1 relaxation rates.

Additional structural information was obtained from a TALOS-N analysis of the assigned NMR chemical shifts. TALOS-N uses an artificial neural-network with a database of known protein NMR chemical shifts, structures, and sequences to predict secondary structure and backbone torsion angles based on NMR chemical shift assignments and protein sequence (Bax et al., 2013; Shen et al., 2009). The secondary structure predicted for PHL4effector and PHL4FL are depicted in figure 5b and 5d. Overwhelmingly, random coil is the dominant prediction for PHL4effector, which is consistent with our interpretation for the measured Rh, CD spectra, and Trp fluorescence assays. For PHL4FL, the N-terminal effector domain is mostly predicted to be in a random coil arrangement as well, with weak predictions for an a-helix from residues P160–D168. At the C-terminus of PHL4FL, in agreement with the secondary chemical shift analysis, the TALOS-N predictions predict a random coil arrangement.

Lastly, NMR relaxation experiments were used to determine the timescales of motion present in PHL4effector. Residue-specific longitudinal relaxation (R1) and longitudinal relaxation in the rotating frame (R1ρ) measurements were recorded. These data are used to calculate the residue-specific transverse relaxation rate (R2) (Libich et al., 2015; Palmer, 2004). Figures 5ef show the R1 and R2 rates with respect to residue number. Supplemental Figures 8ab shows these relaxation rates as a function of external magnetic field. Supplemental Figure 8c shows heteronuclear 1H–15N Nuclear Overhauser effect (NOE) enhancements. Relaxation parameters are dependent on both global and local timescales of motion, i.e. the reorientational correlation times, for each residue. Consistent with the lack of both global and local structure in PHL4effector, the R1 or R2 values are relatively small across the PHL4effector sequence in Figure 5ef and there is no strong 1H–15N NOE enhancement shown in Supplemental Figure 8c. If there were regions transiently adopting a folded conformation, elevated R2 values would be observed. For example, R2 values above 8 s−1 were observed for transient helical structure in the TDP-43 protein disordered domain (Conicella et al., 2016). As expected, due to additional freedom for movement for the end-of-chain residues, the N- and C-terminal residues do experience greater molecular motion, i.e. shorter reorientational correlation times producing smaller R1 and R2 values, than residues located in the central portion of the sequence.

Our measurements of intrinsic disorder in the effector domain within PHL4FL are not well predicted by computational algorithms. Supplemental Figure 9 shows the DISOPRED 3.0 (Jones et al., 2015) and Espriv (Walsh et al., 2012) analysis of the PHL4FL sequence do not strongly predict a long continuous disordered domain for the effector domain, instead predicting disorder for shorter segments in the vicinity of residues F175–N225 and the N-terminal region. Supplemental Figure 10 shows the AlphaFold 2.0 (Jumper et al., 2021; Varadi et al., 2022) prediction for PHL4. The prediction for the effector domain is mostly lacking secondary structure and has low confidence scores, consistent with our experimental characterization. Supplemental Figure 10b suggests a slight propensity for helical structure between residues P160–L171, consistent with the weak helical TALOS-N prediction for residues P160–D168, based on our solution NMR assignments. However, residues L161–D168 do not have enhanced relaxation parameters and therefore likely retain significant disorder in our samples. Across DISOPRED, ESPRITZ, and AlphaFold, the C-terminal region of PHL4FL is more confidently predicted to be disordered, in agreement with our solution NMR result for residues G369–E397.

Discussion

Our measurements indicate the effector domain of PHL4, both in isolation and in the full-length protein, is intrinsically disordered. Intrinsic Trp fluorescence and solution NMR data show that W113 and W149 are solvent accessible and disordered, while W236, conserved in the DNA binding domain of homologues like PHR1, is likely in a well-folded region. W236 is homologous to W230 in the PHR1 homologue, which has been shown to be rigidly folded in a dimeric conformation when bound to DNA from the P1BS promotor (Jiang et al., 2019). Thus, our results are consistent with structure formation in the putative DNA binding region of PHL4.

The solution NMR chemical shift and relaxation measurements are consistent with the effector domain exhibiting significant disorder both in isolation and within full-length PHL4. Diffusion NMR and SEC show the effector domain exists as a disordered monomer in solution. The oligomeric state of full-length PHL4 in solution appears to be an even-numbered oligomer from the cross-linking assay, likely between a tetramer and an octamer. Our diffusion NMR and SEC results are supportive of this interpretation but the combination of disordered and ordered segments suggested by CD measurements precludes an exact size determination using these methods. However, the coiled-coil region of the PHR1 homologue was observed to be responsible for tetrameric oligomerization in crystal structures, explaining the binding of phosphate response inhibitors like SPX1 (Jiang et al., 2019; Ried et al., 2021).

In combination with the identification of the disordered effector domain and C-terminal region in full length PHL4 from our multidimensional solution state NMR study, these bulk measurements point to structured oligomerization likely involving the DNA binding and coiled-coil domains of PHL4. The precise nature of the PHL4 oligomer remains to be determined, including the dependence on biological context, such as when interacting with the P1BS DNA binding site or partner proteins like SPX or those involved in cellular transcriptional machinery.

How does an intrinsically disordered effector domain fit in with the current picture of transcriptional activation by eukaryotic specific transcription factors? This question is important in light of results showing that the PHL4 effector domain is a multi-organism activating TF (Hummel et al., 2023). It has been suggested that, for mRNA encoding genes in plants, intrinsically disordered regions in transcription factors interact with the mediator complex, facilitating favorable contacts with enhancer sites bound by the DNA-binding domains (Buendía-Monreal et al., 2016). For instance, a recent study showed that the highly acidic repeat DFDLDMLGD in the viral TF VP16 interacts with a component of the native plant mediator complex, Med25 Activation Interaction Domain, potentially explaining the mechanism through which the acidic effector domain exerts its function to increase transcription of mRNA targets (Aguilar et al., 2014). Although the exact VP16 acidic amino acid sequence is not found in the PHL4 effector domain, there is an extreme bias toward aspartic acid residues, with one DDDD repeat and two DDD repeats within the first 35 residues of PHL4. This suggests that PHL4 might amribute some of its gene-activation function through a mechanism similar to the artificial TF VP16, via a highly acidic subregion within its effector domain. It remains to be discovered what accessory transcription components the PHL4 effector domain interact with. Discovery of the interaction network and strengths of interaction of the PHL4 effector domain with the supporting cast of transcriptional machinery could explain why the PHL4 effector domain is more effective than VP16 in activation of transcription for certain genes in some eukaryotes (Hummel et al., 2023).

Acidic regions are more common in activating transcription factors than the general proteome (Kotha et al., 2023; Soto et al., 2022), perhaps in order to make favorable contacts with the preinitiation complex (PIC) to effectively recruit and/or retain PIC binding at a promotor site. In PHL4, the initial acidic region within the N-terminal motif is separated from the predicted DNA binding region by what is shown here to be a long, mostly disordered region. This long, disordered region could provide the necessary conformational flexibility to allow the acidic region of PHL4 to effectively reach the PIC complex. As PHL4 is shown to affect many different genes (Wang et al., 2018) at varying distances from promotor sites like P1BS (Jiang et al. 2019), the disordered effector domain plasticity and dynamics could be a key component to accommodate the different configurations of upregulated genes and the associated promotor regions. One way to visualize this hypothesis is that PHL4 intrinsically disordered effector domain may act like a dynamic extension ladder, able to be shortened and lengthened as needed to increase transcription, whether binding at proximal promotor site or at distant enhancer site.

Conclusion

PHL4 is a plant TF protein that plays an important role in activating genes in response to low levels of phosphate. In addition, the PHL4 effector domain is a promising genetic tool to up-regulate proteins in multiple organism types. This study provides a structural characterization of PHL4 and its potent effector domain. The 227 amino acid N-terminal effector domain is shown by various techniques to be intrinsically disordered which, as a fragment, is a monomer in solution. This study also demonstrates Rh measurements demonstrate that full-length PHL4, although retaining significant disorder in its effector domain and C-terminus, likely forms an oligomer via regions outside of the effector domain. It remains to be shown whether these disordered regions adopt structure when PHL4 binds to DNA or interacts with partner proteins.

Materials and Methods

Protein Expression

The PHL4FL (uniport accession number Q8GXC2) and PHL4effector (N-terminal effector domain, residues 1–227) proteins were expressed with an N-terminal 6xHis tag followed by a TEV cleavage site. These sequences are shown in Supplemental Figure 1. The plasmids, generously provided by the laboratory of Dr. Patrick Shih at UC Berkeley, used pET28 expression vectors containing an inducible lac-operon system and Kanamycin antibiotic resistance. Protein concentrations were quantified using the extinction coefficient calculated using the online ProtParam tool (hmps://web.expasy.org/protparam/).

Bacterial expression procedures were the same for each construct except where specified. Plasmids were transformed into in-house prepared chemically competent BL21(DE3) E. coli cells using standard heat shock methods, spread onto Luria Broth (LB) agar plates containing 50 μg/mL kanamycin, and incubated overnight at 37 °C. A single isolated colony was used to inoculate 10 mL (PHL4FL) or 15 mL (PHL4effector), of Luria Broth (LB) media containing 50 μg/mL of kanamycin. The cultures were incubated with shaking at 37 °C and 220 RPM until an optical density at 600 nm (OD600) of 1.6–1.7 was reached. Glycerol cell stocks were made with equal parts liquid cell culture and 0.22 μm sterile filtered 50% v/v glycerol in water, inverted gently to mix, flash frozen in liquid nitrogen, and stored at −80 °C.

To start large scale protein expression for unlabeled proteins frozen cells were streaked from the −80 °C glycerol stocks with a sterile 200 μL pipet tip onto LB agar plates containing 50 μg/mL kanamycin and placed into a gravity convection incubator at 37 °C for 16 h. The plate was removed from the incubator and stored at room temperature for ~3 h until use. Cells from the plate were transferred using a sterile 200 μL pipet tip into a 100–500 mL starter culture, of sterile LB media containing 50 μg/mL kanamycin and 1% w/v glucose added to prevent un-induced protein expression. The culture was incubated at 37 °C in a gravity convection incubator. After incubation for 21 h, 80 mL of the culture is placed into 1 L of sterile LB media containing 1% w/v glucose and 50 μg/mL kanamycin, and incubated with shaking at 37 °C and 220 RPM until the OD600 reached 0.7−0.9. Isopropyl-β-D-thiogalactoside (IPTG) was added to 0.5 mM to induce protein expression. After 3 h further growth at 37 °C and 220 RPM, the cells are harvested by centrifugation at 6,000 g for 10 min. The cell pellet was scraped into conical tubes, flash frozen in liquid nitrogen, and stored at −80 °C.

Isotopically labeled protein was expressed identically to the unlabeled protein until the transfer to the 1 L LB media culture. Here, 4 L of LB media split evenly between 4 flasks of 4 L volume containing 50 μg/mL kanamycin and 1% w/v glucose was inoculated with 80 mL of the LB pre-culture per L of LB media. The 1 L cultures were incubated with shaking at 37 °C and 220 RPM until an OD600 of 0.7–0.9 was reached. Then the cells were harvested by centrifugation at 6,000 g for 10 m and resuspended in small volumes (~20 mL) of M9 minimal media using an automatic pipemor and serological pipeme, combined, and transferred to a single 1 L of M9 minimal media. The 1 L M9 culture contained 1 g of 15N labelled ammonium chloride and 2 g of 13C uniformly labelled α-D glucose (Cambridge Isotope Laboratories). The M9 culture was incubated for 30 min with shaking at 37 °C and 220 RPM before adding IPTG to 0.5 mM. After 3 h of additional incubation at 37 °C and 220 RPM, the cells were harvested by centrifugation at 6,000 g for 10 min, scraped into a plastic tube, flash frozen in liquid nitrogen, and stored at −80 °C.

Purification of Protein Constructs

The cell pellet was thawed on ice for 30 min, then partially resuspended using a serological pipette with ~25 mL of lysis buffer containing 6 M Gu-HCl, 1% v/v Triton X-100, 500 mM sodium chloride, 50 mM tris(hydroxymethyl)aminomethane (Tris-HCl) pH 7.5, 0.25 mg/mL hen egg white lysozyme (Fisher), and three tablets of EDTA-free Pierce protease inhibitor tablets. The partially resuspended cells were then sonicated in an ice bath with a Branson SFX-250 sonifier pulsed at 0.3 s on, 3 s off, 20 min total “on” time, using a 1/4-inch microtip. The sonicated solution was centrifuged at 4 °C and 75,600 g for 30 min. Protein was purified from the supernatant using a Bio-Rad NGC Quest 10 Plus liquid chromatography instrument and a 5 mL Cytiva HisTrap FF Ni2+ immobilized metal affinity chromatography (IMAC) column. The supernatant was loaded onto a column pre-equilibrated in 6 M urea, 20 mM 4-(2-hydroxyethel)-1-piperazineethanesulfonic acid (HEPES) pH 7.5, and 500 mM sodium chloride, then the column was washed with identical buffer containing an additional 20 mM imidazole until the absorbance at 280 nm (A280) returned to the baseline value. Then, the protein was eluted with the same buffer containing 200 mM imidazole. Purification buffers initially, for the PHL4FL unlabeled expression that yielded the protein for the SEC measurement as well as the 15N labeled PHL4effector in material, also contained 1 mM dithiothreitol, DTT, in the elution buffer. However, DTT was deemed unnecessary for the initial purification steps and therefore DTT was left out in subsequent purifications to no observed detriment, as determined by SDS PAGE gel analyses similar to those in Supplementary Figure 1.

To return the protein to a more native state for each protein preparation, the purified elution fractions were pooled and dialyzed overnight into 20 mM sodium phosphate pH 7.4, 100 mM sodium chloride, 1 mM DTT, so that less than 1% initial buffer remained at the end of the dialysis. The 6xHis purification tag was removed using TEV protease glycerol stocks (25% v/v glycerol, 1 mM EDTA, 5 mM DTT, 250 mM sodium chloride, 10 mM Tris-HCl pH 8.0) prepared in-house. The concentration of TEV in the reaction was changed from preparation to preparation initially, as follows. The PHL4FL sample used in the SEC test used a 1:200 ratio of TEV:protein with an initial TEV stock concentration of 108 μM. The PHL4FL 13C- and 15N-labeled sample for the NMR assignment experiments used a 1:20 ratio of TEV:protein, with an initial TEV stock concentration of 82 μM. At this point, the reaction was seen to be highly robust at fully cleaving the protein, therefore all subsequent protein preparations used a 1:250 ratio of TEV:protein, with all subsequent TEV stocks at an initial concentration between 6–7 μM.

In the TEV reaction solution, DTT was added to 1 mM initially, and the cleavage reaction was incubated for 19–30 h (PHL4FL) and 48 h (PHL4effector domain). Reactions were run until the protein monomer band in an SDS-PAGE gel shifts completely to the tag-cleaved molecular weight. Fresh 1 mM DTT was added after the first 3 h of reaction, and then once every ~24 h to ensure the TEV protease remained active. Depending on the activity of the TEV stock which is slightly different from prep to prep, this procedure of checking the cleavage reaction before purifying the cleaved protein ensured a minimum amount of uncleaved protein due to differences in activity of the TEV stocks. Supplementary Figure 1 shows SDS-PAGE gels for IMAC purified and TEV-cleaved proteins.

After the cleavage reaction, the solution was passed over a gravity column containing Bio-Rad Nuvia IMAC nickel(II) resin equilibrated in 20 mM sodium phosphate pH 7.4, 100 mM sodium chloride, 5 mM imidazole. The initial flow through was collected, and a few CV of the same buffer was used to rinse the column. These protein fractions were stored at 4 °C until used. The protein concentration in all fractions was quantified using absorption at 280 nm with extinction coefficients calculated using the Expasy ProtParam tool (Gasteiger et al., 2005).

Disorder Predictions

DISOPRED 3.0 was ran using default parameters on the UCL bioinformatics web portal (hmp://bioinf.cs.ucl.ac.uk/psipred/), accessed July 2023 (Jones and Cozzemo, 2015). ESpriv was ran with standard parameters utilizing the web portal (hmp://old.protein.bio.unipd.it/espriv/), accessed in December 2023 (Walsh et al., 2012; Walsh et al., 2014).

Cross-linking Assay

PHL4FL was thawed at room-temperature from −80 °C protein stocks consisting of equal parts TEV-cleaved PHL4 FL from the reverse IMAC purification and 50% v/v glycerol in H2O. Then, PHL4 FL was buffer exchanged into 20 mM sodium phosphate pH 7.4 using Amicon-Ultra 0.5 3kDa MWCO centrifuge filters according to the manufacturer’s instructions until the initial buffer remaining is less than 5%. Then, cross-linking reactions were performed with 15.4 μM protein and 0.004–0.0256% glutaraldehyde, with successive concentrations doubling in 20 mM sodium phosphate pH 7.4 buffer at 26 μL total volume, and incubation at 25 °C for 30 min. For glutaraldehyde concentrations above 0.016%, significant aggregation was observed, preventing the samples from entering the SDS-PAGE gels used for analysis. Reactions were quenched by adding 10 μL 4X SDS-PAGE loading buffer, containing 250 mM Tris-HCl pH 6.8, 8% w/v SDS, 0.2% w/v bromophenol blue, and 40% glycerol. 4 μL of 1 M DTT was then added. Samples were heated to 70 °C for 10 min, cooled at room temperature for ~5 min, then centrifuged at 14,000 g for 15 min, then loaded onto a 4–10% gradient gel and ran with tricine running buffer at 45 mA for ~20 min.

Size Exclusion Chromatography

Protein was buffer exchanged and concentrated in an Amicon-Ultra 0.5 mL 3 kDa MWCO centrifuge filter, per manufacturer’s instructions, into SEC buffer (20 mM sodium phosphate pH 6.2, 100 mM sodium chloride, and 1 mM DTT) until the initial buffer remaining was less than 5%. The protein concentration for PHL4 FL was 76.2 μM and the protein concentration for the PHL4effector domain was 96.8 μM. A 200 μL loading loop was pre-loaded with an excess volume of sample, and ran over a Bio-rad ENrich 650 column equilibrated in SEC buffer at a flow rate of 0.75 mL/min. The Bio-Rad gel filtration standards (product #1511901) were resuspended in 0.5 mL of the same SEC buffer, and ran using the same SEC running buffer as PHL4 FL, except that a 100 μL loading loop was used. The peak maxima in the chromatogram were determined using in-house Python scripts and the SciPy package “find_peaks” function. The Rh values for the standards from the literature (Stetefeld et al., 2016; Talmard et al., 2007; Wilkins et al., 1999) were plotted as a function of the natural logarithm of the retention time and fit to an exponential decay with the SciPy package “curve_fit” function.

Fluorescence Experiments

Fluorescence experiments all used a 45 μL 10 mm × 10 mm quarv cuveme (Starna). TEV-cleaved and purified PHL4 FL and the PHL4effector domain were diluted to 2.38 μM in either non-denaturing fluorescence buffer (20 mM sodium phosphate pH 7.4, 100 mM sodium chloride, and 1 mM DTT) or denaturing fluorescence buffer (6 M guanidinium-HCl, 20 mM sodium phosphate, 100 mM sodium chloride, and 1 mM DTT). Fluorescence spectra were recorded using a Cary Eclipse fluorescence spectrometer operating in emission mode with excitation fixed at 290 nm and a slit width for both excitation and emission of 5 nm. The excitation was fixed at 290 nm to avoid contribution from Tyr residues so that the resultant emission spectra would be dominated to the greatest possible extent by signal from Trp residues. The spectral range acquired was 300 to 475 nm, scanned at 120 nm/min in 1 nm intervals, with 0.5 s averaging, and the photomultiplier tube (PMT) set to high.

The denaturing condition spectrum in Figure 1 is the average of three scans, with the spectrum normalized so that the maximum intensity was 1.0. The non-denaturing spectrum in Figure 1, is data from one scan with a 5-point window smoothing function applied and normalized such that the maximum intensity was 1.0. An in-house Python script was used to determine λmax, using the “curve_fit” function from SciPy to fit a Taylor series expansion for fluorescence intensity to the data

I(λ)=a+bλ-λmax22+cλ-λmax26

where I is the intensity at wavelength λ in nm, a,b, and c are fiming constants, and λmax is the wavelength of the maximum intensity in the spectra in nm.

Fluorescence Unfolding Assay

Spectra were recorded of protein diluted to 2.38 μM in a mixture of the non-denaturing and denaturing fluorescence buffers, with final Gu-HCl concentrations from 0 to 6 M; 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0, 1.1, 1.2, 1.3, 1.4, 1.6, 1.8, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0 M. The samples were allowed to equilibrate overnight at room temperature. Spectra were recoded using a Cary Eclipse fluorescence spectrometer operating in emission mode with excitation fixed at 290 nm and a slit width for both excitation and emission of 5 nm. The excitation was fixed at 290 nm, the acquired spectral range was 300 to 420 nm, scanned at 120 nm/min in 1 nm intervals, with 0.5 s averaging, and the photomultiplier tube (PMT) set to high.

The experimental workflow and equation derivations are described in detail in Monsellier and Bedouelle, 2005, and the analysis here does not deviate substantially. The equations and analysis are restated here for clarity. The overall goal of the procedure is to obtain the mole fraction of unfolded molecular species (fU) as a function of the Gu-HCl concentration with the observed Gibbs energy of unfolding (ΔGunfolding) as a curve-fitting parameter. This analysis assumes a two-state unfolding process, FU, and is done via two approaches. In the first approach, the mole fraction of unfolded protein was calculated from λmax using the equation:

fU(Gu-HCl)=λmax-λFλU-λF

where fU is the apparent mole fraction of the unfolded molecules for a given Gu-HCl concentration, λF and λU are the wavelengths of maximum intensity for the 0 M Gu-HCl and 6 M Gu-HCl spectra, and λmax is the wavelength of maximum intensity for the given concentration of Gu-HCl. λmax is determined from a third order Taylor series polynomial fit to each spectral curve, according to the equation and procedure given in the Fluorescence Experiments section.

In the second approach, the unfolded mole fraction was calculated using the intensity value at the wavelength of maximum intensity difference between the 0 and 6 M spectra, calculated to be 334 nm for this data. A subset of the data, the first five Gu-HCl concentrations for the folded species and the last nine Gu-HCl concentrations for the unfolded species, chosen based on minimal perturbation of the observed spectrum, were independently fit to an equation of the form

YX(Gu-HCl)=mX[Gu-HCl]+YX,0MGuHCl

where YX is the intensity contribution at 334 nm at each Gu-HCl concentration, [Gu-HCl] is the Gu-HCl concentration, YX,0MGuHCl is the projected intensity at 0 M Gu-HCl, and mx is the perturbation to the fluorescence intensity at each Gu-HCl concentration. “X” subscripts can be either “U” representing the unfolded molecular species or “F” representing the folded molecular species. An in-house Python script using the “curve_fit” functionality in SciPy was used to solve for mx and Yx,0MGuHCl.

The measured fluorescence intensity at 334 nm Ymeas is then the sum of the contributions from the folded and unfolded molecular species

Ymeas([Gu-HCl])=YFfF+YUfU

where, fF is the mole fraction of folded protein and fU is the mole fraction of unfolded protein. Since the mole fractions of the folded and unfolded molecular species must sum to one, the above equation can be rewritten to obtain the mole fraction of the folded protein as a function of Ymeas and the concentration of Gu-HCl

fU([Gu-HCl])=Ymeas-YF,0MGu-HCl-mF[Gu-HCl]YU,0MGu-HCl-YF,0MGu-HCl+mU-mF[Gu-HCl]

Regardless of the method used to obtain fU, the ΔGunfolding is determined from a standard relationship between ΔGunfolding and the equilibrium constant, K, with a linear perturbation due to the Gu-HCl denaturant where ε is a measure of the strength of the perturbation

ΔGapp=-RTlnK+ε[GuHCl]
ΔGapp=-RlnfU1-fU+ε[GuHCl]

Which can be rearranged to give the fraction of unfolded molecular species as a function of the Gu-HCl denaturant concentration with ΔGunfolding obtained as a fitting parameter

fU([GuHCl])=11+expΔGunfolding-ε[GuHCl]RT

For the mole fractions derived from the first approach, a correction factor more accurately represents ΔGunfolding due to the non-linearity of the change in λmax with respect to the protein unfolding process

Ecorr=RTlog10b(0MGu-HCl)b(6MGu-HCl)

where b is the second order constant from the Taylor series fit to the spectrum at the specified Gu-HCl concentration. Supplementary Figure 2 shows the application of these equations to the PHL4FL data.

Circular Dichroism Spectroscopy

Pure, TEV-cleaved PHL4effector domain and PHL4FL protein was prepared by dialyzing the gravity column flow through fractions into 5 mM sodium phosphate pH 7.4 with 6–8 kDa molecular weight cut-off 0.32 mL/cm dialysis tubing overnight. The protein solutions were then harvested and diluted to 0.3 mg/mL in the same buffer used for the dialysis. The CD spectrum was acquired on a JASCO J-715 CD spectrometer, from 260–190 nm, in 1 nm increments with a 1 mm pathlength cuvette. Three scans were recorded for each sample, and the spectrometer was blanked with the same buffer used for the dialysis prior to running the protein sample. The three scans were averaged in Python prior to plotting. The “Disordered-Ordered Classification” tool from the BESTSEL web portal (Micsonai et al., 2022) was used to identify the general shape of the observed spectrum as disordered (PHL4effector) and ordered (PHL4FL). BESTSEL was used to determine the structural motifs for PHL4FL.

Mass Spectrometry

Protein for MALDI-MS was prepared by diluting the non-isotopically enriched samples (143 μM PHL4effector and 274 μM PHL4FL) from the diffusion NMR experiments into 10 mM Tris-HCl pH 7.5 buffer to a final protein concentration of 5 μM. MALDI-MS was recorded on a Bruker UltraFlextreme MALDI TOF/TOF spectrometer. Bruker Flex Analysis was used to convert the data to text format, then an in-house Python script was used to smooth the spectrum with a 13-point window function, equivalent to smoothing over a 0.5 m/z window. To find the mass spectral peaks, an in-house Python script was used with the “curve_fit” functionality in SciPy.

Protein ESI-MS was prepared by dialyzing 15 μM pure TEV-cleaved protein from the reverse IMAC into 10 mM ammonium carbamate pH 8 with a 6–8 kDa molecular cut-off, 0.32 mL/cm dialysis tubing overnight. The protein was then harvested and diluted to 9 μM with the same buffer used in dialysis. ESI-MS was recorded with a Thermo Q-Exative High-Field Orbitrap ran in full MS mode, from 1200 to 4000 m/z, with 45000 resolution, in positive polarity mode, a spray voltage of 2 kV, a capillary temp of 473 K, and the auxiliary gas heater temperature at 323 K. Spectral analysis was performed first by converting the data with MSConvert (PreoteoWizard 3.0) to UniDec format (Chambers et al., 2012). Then, UniDec was used similar to the method of Marty et al., 2015 to fit the data with 20-point gaussian smoothing, normalized data, and charge search regions of 4–70 (PHL4effector) and 4–80 (PHL4FL), sampled every 100 Da.

Diffusion NMR Experiments

Horse Myoglobin protein was purchased from Sigma Aldrich (M1882) and 22.6 mg was resuspended with 700 μL of NMR buffer (20 mM MES pH 6.2 buffer, 25 mM NaCl, and 5 mM TCEP). The tube was gently mixed on a rotary mixer for 1 h and left on the benchtop at ambient temperature for ~14 h. The Horse Myoglobin NMR sample was then 0.22 μm sterile filtered. The concentration was determined by absorbance at 280 nm and a calculated extinction coefficient to be 3.1 mM. Then 495 μL of the protein sample was mixed with 55 μL 99.8% deuterated water (Sigma-Aldrich). The diffusion NMR NMR samples for the PHL4FL and PHL4effector domain were prepared by concentrating pure non-isotopically labeled, TEV-cleaved protein from the reverse IMAC and buffer exchanging with Amicon Ultra 3 kDa molecular weight cut-off centrifuge filters according to manufacturer recommendations, using 3 to 5 repetitions of diluting the concentrated protein into NMR buffer. The number of repetitions was dependent on ensuring the initial buffer remaining was calculated to be less than 5%. Finally, 55 μL of 99.8% deuterated water (Sigma-Aldrich) was added to 495 μL of each protein solution, giving a final concentration of 246 μM for PHL4effector and 115 μM for PHL4FL. All spectra are recorded with 0.1% v/v 1,4 dioxane as an internal diffusion standard.

Spectra were recorded on a 300 MHz Avance Neo NMR spectrometer, using the ledbpgppr2s pulse sequence with water pre-saturation added to reduce baseline distortion from the water resonance (Wu et al., 1995). The bipolar gradient powers range from 2–95% in fixed increments of 6.2% (Wu et al., 1995). For each gradient value, 8 scans were added for PHL4effector, and 40 scans were added for PHL4FL. The spectrometer frequency was set to the center of the water resonance and the acquisition time was 3.375 s. For PHL4effector, the gradient pulse length, δ, was 3200 μs, while for PHL4FL, due to slower diffusion, 4000 μs was used. All experiments used a diffusion time, Δ, of 150 ms and an eddy current delay, Te, of 5 ms. Reference horse myoglobin diffusion NMR spectra were recorded at both 3200 μs and 4000 μs gradient pulse lengths to ensure the calibration was performed with the same experimental parameters for both PHL4effector and PHL4FL.

Exponential adiposation functions (1 Hz) were applied and time domain data were Fourier transformed using TopSpin 4.1.4. The spectra were referenced to the 1,4-dioxane resonance to 3.75 ppm. The peak areas were integrated using an in-house Python script and a custom midpoint integral formula for dioxane (3.7525–3.7475 ppm) and the protein (horse myoglobin, 6.0000–9.5000 ppm; PHL4effector, 1.0800–0.7500 ppm; PHL4FL, 1.0800–0.7500 ppm). The decay of the signals as a function of gradient power were fit using in-house Python scripts utilizing the SciPy “curve_fit” functionality with a Gaussian decay function, as described previously (Wilkins et al., 1999)

I(x)=Ae-decayx2

where I(x) is the integrated intensity as a function of gradient power x,A is an amplitude factor, and decay describes the decrease in signal with increasing gradient coil powers in the NMR experiment. This decay value was directly used to calculate the radius of hydration. Using dioxane as an internal standard with Rh = 2.12 Å, the Rh for a protein of unknown size was determined via (Wilkins et al., 1999)

Rh,unknown=CRh,dioxanedecaydioxanedecayunknown

where C is an experimentally derived conversion factor. To determine the C values, Rh was measured in triplicate for horse myoglobin at each gradient length with C set to 1. Comparison with the accepted Rh value for horse myoglobin (20.4 Å) (Zhou et al., 2002) allows the calculation of the C value for a given gradient length. The Rh value measured with a 3200 μs gradient length was 18.5 Å, requiring a C factor of 1.105. For the 4000 μs gradient length the calculated C factor was 1.045. These calibrated C factors were used to calculate the Rh values for PHLeffector domain and PHL4FL in Table 1.

NMR Chemical Shift Measurements

NMR experiments were performed on protein in 90% NMR Buffer and 10% deuterated water (99.8%, Sigma-Aldrich). The PHL4effector domain concentration was 73.2 μM for the 13C and 15N labeled sample and 98.2 μM for the 15N labeled sample. The PHL4FL concentration for the 13C and 15N labeled sample was 114.9 μM.

The NMR samples for PHL4FL and PHL4effector were prepared by concentrating the TEV-cleaved pure protein obtained from the reverse IMAC and buffer exchanging, by 3 to 5 repetitions of concentrating and diluting the protein, into NMR buffer using an Amicon Ultra 3 kDa molecular weight cut-off centrifuge filter according to manufacturer recommendations. The number of repetitions was dependent on ensuring the initial buffer remaining was calculated to be less than 5%. The final NMR sample was obtained by adding 55 μL 99.8% deuterated water (Sigma-Aldrich) to 495 μL of the protein solutions. The data acquisition parameters for the NMR experiments are in Supplemental Tables 1 and 2.

All NMR spectra used for the NMR chemical shift assignments were processed in NMRPipe (Delaglio et al., 1995). Data was converted for peak picking in NMRFAM SPARKY (Lee et al., 2015). Supplementary Tables 1 and 2 contain the data processing parameters.

Sequence specific assignment of the observed NMR chemical shifts for the effector domain were made manually with signal tables consisting of the ”i" and “i – 1” spin clusters resolved in the 3D and 2D spectra according to standard solution NMR assignment strategies. The procedure involves matching the CA and CB NMR chemical shifts from the “i – 1” signal table to the same NMR chemical shifts in the “i" signal table to determine the “i" and “i – 1” amide 15N and 1H NMR chemical shift values. This information is used in combination with the amino acid sequence and the known ranges of NMR chemical shifts for each amino acid type to make unambiguous sequence specific assignment of the observed NMR chemical shifts. Assignments for the PHL4FL C-terminal domain were made according to the same procedure. Due to significant similarity between the spectra of PHL4effector and PHL4FL, sequence specific assignment for much of the PHL4FL N-terminal domain were transferable from PHL4effector. To make assignments on the PHL4FL N-terminal domain, the peaks observed in PHL4effector spectra are filtered by removing peaks without intensity in the PHL4FL spectra. The assignments were then checked manually. The assigned and unassigned peaks are listed in Supplemental Tables 3 and 4 for PHL4effector and in Supplemental Tables 5 and 6 for PHL4FL.

Despite our best effort, including measuring HA and HB NMR chemical shifts for PHL4effector with an additional 3D experiment, the sequence specific assignments were unable to be extended further for PHL4effector and PHL4FL. For PHL4effector, the inability to extend assignments was primarily due to the high redundancy of the sequence, as some remaining unassigned peaks were of the same signal-to-noise as the assigned peaks but could not be placed on the sequence unambiguously. The N-terminal region of PHL4effector and PHL4FL were especially difficult due to pseudo-repetitive amino acid sequences, unique N and HN but non-unique CA, CB NMR chemical shifts, and in some cases, non-unique HA and HB NMR chemical shifts. Additionally, for PHL4FL, sequence specific assignments were also hampered by lower signal-to-noise likely due to the large size of the protein, resulting in less disordered regions of the protein having unfavorable reorientational correlation times for liquid state NMR experiments. This interpretation is also consistent with the smaller number of signals observed in the PHL4FL spectra, and the lack of any signal assignment in the coiled-coil and DNA binding regions of PHL4FL.

BMRB accession numbers for the assignments are 52524 (PHL4FL) and 52525 (PHL4effector).

Secondary chemical shifts are calculated as the difference between the observed chemical shift and the random coil chemical shift from the Poulson IDP calculator. The chemical shift calculation uses a pH of 6.2 and includes nearest neighbor effects (Kjaergaard et al., 2011b). Secondary chemical shift plots were made by plotting the difference of CA and CB secondary chemical shifts ΔδCA-ΔδCB.

NMR Relaxation Measurements

For all relaxation experiments, the 93.2 μM 15N-only labeled PHL4effector sample was used. Data acquisition and processing parameters are listed in Supplementary Table 1. The data were processed and signal intensities determined by integration using NMRPipe (Delaglio et al., 1995). Only non-overlapped peaks in the 1H–15N HSQC experiment were integrated and the bounds of integration were +/− 1 point in both spectra dimensions for the NOE experiment and +/− 2 points for the R1, R2, and R1ρ experiment. integrals were fit using the “curve_fit” SciPy function to an equation of the form (Palmer et al., 1991)

I=Ae-Rt

where A is an arbitrary scaling factor, t is the time in the pulse sequence during which the relaxation process occurred, and R is the relaxation time constant (R1, R2, or R1ρ). The uncertainties in Figure 5ef and Supplementary Figure 8ab are the standard deviations from the curve-fitting procedure. The 1H–15N NOE value calculated in Supplementary Figure 8c is the enhancement, defined as (Kharchenko et al., 2020)

NOEEnhancement=INOE,on-INOE,offINOE,off

where INOE represents the integrated intensity and the “on” and “off” subscript indicating if the 1H–15N NOE interaction was present during the experiment.

The R2 values in Figure 5e are calculated from R1 and R1ρ experiments described in Supplementary Table 1 using the equations described in (Lakomek et al., 2012; Libich et al., 2015)

offset=δspin,i-δcarrierωref
θ=arctanωspin-lockoffset
R2=R1ρ-R1cos2θsin2θ

where δspin,i is the 15N chemical shift in ppm units, δcarrier is the 15N carrier frequency in ppm units, ωref is the 0.0 ppm 15N chemical shift in MHz units, ωspin-lock was set to 2,000 Hz, and R1ρ and R1 are the values from the relaxation experiments.

Supplementary Material

Supplement 1
media-1.pdf (8.4MB, pdf)

Significance.

Transcription factor proteins upregulate genes and are essential to concerted biological response to environmental conditions like stress or low nutrient availability. In this work, we show the activating effector domain of the potent PHL4 transcription factor protein is primarily disordered, without well-defined secondary structure, and that the isolated effector domain behaves similarly in isolation as it does in the full-length protein. Our finding is consistent with protein transcription factors often having regions of disorder within their functional activator domains.

Acknowledgements

Professor Patrick Shih, Niklas Hummel, and Kasey Markel, generously provided the plasmids used in this work, as well as provided insight regarding transcription factor proteins. Dr. Ping Yu and Professor Jim Ames at UC Davis provided technical assistance with solution NMR experiments. Professor David Libich from UT Health San Antonio provided NMR relaxation pulse sequences and provided insight regarding the data analysis. Dr. Upasana Sridharan and Khaled Jami provided technical assistance with the mass spectrometry experiments. The UC Davis NMR Core Facility is partially supported though National Science Foundation award DBI-0722538 and award National Institutes of Health award S10RR013871–01A1. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or the National Science Foundation. UC Davis provided financial support for these experiments.

Footnotes

Description of Supplementary Material: Purification and sequence information for protein constructs; additional data and analysis from the denaturing assay; mass spectrometry data; additional NMR diffusion data and analysis; additional NMR spectra and analysis; disorder and structure predictions; tables of NMR data acquisition parameters, assigned NMR chemical shifts, and unassigned chemical shifts

References

  1. Aguilar X., Blomberg J., Brannström K., Olofsson A., Schleucher J., Bjorklund S. Interaction studies of the human and Arabidopsis thaliana Med25-ACID proteins with the herpes simplex virus VP16- And plant-specific Dreb2a transcription factors. PLoS One 2014; 9: 26–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bax A., Shen Y. Protein backbone and sidechain torsion angles predicted from NMR chemical shifts using artificial neural networks. J. Biomol. NMR 2013; 56: 227–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Buendía-Monreal M., Gillmor C.S. Mediator: A key regulator of plant development. Dev. Biol. 2016; 419: 7–18. [DOI] [PubMed] [Google Scholar]
  4. Bustos R., Castrillo G., Linhares F., Puga M.I., Rubio V., Pérez-Pérez J., et al. A central regulatory system largely controls transcriptional activation and repression responses to phosphate starvation in arabidopsis. PLoS Genet. 2010; 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Chambers M.C., Maclean B., Burke R., Amodei D., Ruderman D.L., Neumann S., et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 2012; 30: 918–920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Conicella A.E., Zerze G.H., Mimal J., Fawzi N.L. ALS Mutations Disrupt Phase Separation Mediated by α-Helical Structure in the TDP-43 Low-Complexity C-Terminal Domain. Structure 2016; 24: 1537–1549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Delaglio F., Grzesiek S., Vuister G., Zhu G., Pfeifer J., Bax A. NMRPipe: A multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 1995; 6:227–293. [DOI] [PubMed] [Google Scholar]
  8. Dudás E.F., Bodor A. Quantitative, Diffusion NMR Based Analytical Tool to Distinguish Folded, Disordered, and Denatured Biomolecules. Anal. Chem. 2019; 91: 4929–4933. [DOI] [PubMed] [Google Scholar]
  9. Gasteiger E., Hoogland C., Gamiker A., Duvaud S., Wilkins M.R., Appel R.D., et al. (Walker JM, editor). Protein identification and analysis tools on the ExPASy server. Humana Press; 2005. [DOI] [PubMed] [Google Scholar]
  10. Hummel N.F.C., Zhou A., Li B., Markel K., Ornelas I.J., Shih P.M. The trans-regulatory landscape of gene networks in plants. Cell Syst. 2023; 14: 501–511.e4. [DOI] [PubMed] [Google Scholar]
  11. Jiang M., Sun L., Isupov M.N., Limlechild J.A., Wu X., Wang Q., et al. Structural basis for the Target DNA recognition and binding by the MYB domain of phosphate starvation response 1. FEBS J. 2019; 286: 2809–2821. [DOI] [PubMed] [Google Scholar]
  12. Jones D.T., Cozzemo D. DISOPRED3: Precise disordered region predictions with annotated protein-binding activity. Bioinformatics 2015; 31: 857–863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Jumper J., Evans R., Privel A., Green T., Figurnov M., Ronneberger O., et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021; 596: 583–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Kharchenko V., Nowakowski M., Jaremko M., Ejchart A., Jaremko Ł. Dynamic 15N{1H} NOE measurements: a tool for studying protein dynamics. J. Biomol. NMR 2020; 74: 707–716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Kjaergaard M., Brander S., Poulsen F.M. Random coil chemical shift for intrinsically disordered proteins: effects of temperature and pH. J. Biomol. NMR 2011; 49: 139–149. [DOI] [PubMed] [Google Scholar]
  16. Kjaergaard M., Poulsen F.M. Sequence correction of random coil chemical shifts: Correlation between neighbor correction factors and changes in the Ramachandran distribution. J. Biomol. NMR 2011; 50: 157–165. [DOI] [PubMed] [Google Scholar]
  17. Kotha S.R., Staller M.V. Clusters of acidic and hydrophobic residues can predict acidic transcriptional activation domains from protein sequence Kaplan C, editor. Genetics 2023; 225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Lakomek N.-A., Ying J., Bax A. Measurement of 15N relaxation rates in perdeuterated proteins by TROSY-based methods. J. Biomol. NMR 2012; 53: 209–221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Lee W., Tonelli M., Markley J.L. NMRFAM-SPARKY: Enhanced software for biomolecular NMR spectroscopy. Bioinformatics 2015; 31: 1325–1327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Leney A.C., Heck A.J.R. Native Mass Spectrometry: What is in the Name? J. Am. Soc. Mass. Spectrom. 2017; 28: 5–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Li-Blamer X., Seelig J. Thermal and Chemical Unfolding of Lysozyme. Multistate Zimm-Bragg Theory Versus Two-State Model. J. Phys. Chem. B 2019; 123: 10181–10191. [DOI] [PubMed] [Google Scholar]
  22. Libich D.S., Tugarinov V., Clore G.M. Intrinsic unfoldase/foldase activity of the chaperonin GroEL directly demonstrated using multinuclear relaxation-based NMR. Proc. Natl. Acad. Sci. U. S. A. 2015; 112: 8817–8823. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Marty M.T., Baldwin A.J., Marklund E.G., Hochberg G.K.A., Benesch J.L.P., Robinson C. V. Bayesian deconvolution of mass and ion mobility spectra: From binary interactions to polydisperse ensembles. Anal. Chem. 2015; 87: 4370–4376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Micsonai A., Moussong É., Wien F., Boros E., Vadászi H., Murvai N., et al. BeStSel: Webserver for secondary structure and fold prediction for protein CD spectroscopy. Nucleic Acids Res. 2022; 50: W90–W98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Monsellier E., Bedouelle H. Quantitative measurement of protein stability from unfolding equilibria monitored with the fluorescence maximum wavelength. Protein Eng. Des. Sel. 2005; 18: 445–456. [DOI] [PubMed] [Google Scholar]
  26. Mureddu L., Vuister G.W. Simple high-resolution NMR spectroscopy as a tool in molecular biology. FEBS J. 2019; 286: 2035–2042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Nguyen H.T.K., Kim S.Y., Cho K.M., Hong J.C., Shin J.S., Kim H.J. A transcription factor γMYB1 binds to the P1BS cis-element and activates PLA2-γ expression with its co-activator γMYB2. Plant Cell Physiol. 2016; 57: 784–797. [DOI] [PubMed] [Google Scholar]
  28. Palmer A.G., Rance M., Wright P.E. Intramolecular motions of a zinc finger DNA-binding domain from Xfin characterized by proton-detected natural abundance carbon-13 heteronuclear NMR spectroscopy. J. Am. Chem. Soc. 1991; 113: 4371–4380. [Google Scholar]
  29. Palmer A.G. NMR Characterization of the Dynamics of Biomacromolecules. Chem. Rev. 2004; 104: 3623–3640. [DOI] [PubMed] [Google Scholar]
  30. Parmar A.S., Muschol M. Hydration and Hydrodynamic Interactions of Lysozyme: Effects of Chaotropic versus Kosmotropic Ions. Biophys. J. 2009; 97: 590–598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Ried M.K., Wild R., Zhu J., Pipercevic J., Sturm K., Broger L., et al. Inositol pyrophosphates promote the interaction of SPX domains with the coiled-coil motif of PHR transcription factors to regulate plant phosphate homeostasis. Nat. Commun. 2021; 12: 384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Rogawski R., Sharon M. Characterizing Endogenous Protein Complexes with Biological Mass Spectrometry. Chem. Rev. 2022; 122: 7386–7417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Royer C.A. Probing protein folding and conformational transitions with fluorescence. Chem. Rev. 2006; 106: 1769–1784. [DOI] [PubMed] [Google Scholar]
  34. Rubio V., Linhares F., Solano R., Martín A.C., Iglesias J., Leyva A., et al. A conserved MYB transcription factor involved in phosphate starvation signaling both in vascular plants and in unicellular algae. Genes Dev. 2001; 15: 2122–2133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Sanborn A.L., Yeh B.T., Feigerle J.T., Hao C. V., Townshend R.J.L., Lieberman Aiden E., et al. Simple biochemical features underlie transcriptional activation domain diversity and dynamic, fuzzy binding to Mediator. Elife 2021; 10: 1–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Shen Y., Delaglio F., Cornilescu G., Bax A. TALOS+: A hybrid method for predicting protein backbone torsion angles from NMR chemical shifts. J. Biomol. NMR 2009; 44: 213–223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Sigler P.B. Acid blobs and negative noodles. Nature 1988; 333: 210–212. [DOI] [PubMed] [Google Scholar]
  38. Soto L.F., Li Z., Santoso C.S., Berenson A., Ho I., Shen V.X., et al. Compendium of human transcription factor effector domains. Mol. Cell 2022; 82: 514–526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Staby L., O’Shea C., Willemoës M., Theisen F., Kragelund B.B., Skriver K. Eukaryotic transcription factors: paradigms of protein intrisic disorder. Biochem. J. 2017; 474: 2509–2532. [DOI] [PubMed] [Google Scholar]
  40. Stetefeld J., McKenna S.A., Patel T.R. Dynamic light scamering: a practical guide and applications in biomedical sciences. Biophys. Rev. 2016; 8: 409–427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Talmard C., Guilloreau L., Coppel Y., Mazarguil H., Faller P. Amyloid-beta peptide forms monomeric complexes with CuII and ZnII prior to aggregation. ChemBioChem 2007; 8: 163–165. [DOI] [PubMed] [Google Scholar]
  42. Varadi M., Anyango S., Deshpande M., Nair S., Natassia C., Yordanova G., et al. AlphaFold Protein Structure Database: Massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022; 50: D439–D444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Walsh I., Martin A.J.M., Di Domenico T., Tosamo S.C.E. ESpriv: accurate and fast prediction of protein disorder. Bioinformatics 2012; 28: 503–509. [DOI] [PubMed] [Google Scholar]
  44. Walsh I., Seno F., Tosamo S.C.E., Trovato A. PASTA 2.0: An improved server for protein aggregation prediction. Nucleic Acids Res. 2014; 42: 301–307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Wang Z., Zheng Z., Song L., Liu D. Functional characterization of arabidopsis PHL4 in plant response to phosphate starvation. Front. Plant Sci. 2018; 9: 1–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Wilkins D.K., Grimshaw S.B., Receveur V., Dobson C.M., Jones J.A., Smith L.J. Hydrodynamic radii of native and denatured proteins measured by pulse field gradient NMR techniques. Biochemistry 1999; 38: 16424–16431. [DOI] [PubMed] [Google Scholar]
  47. Wishart D.S. Interpreting protein chemical shift data. Prog. Nucl. Magn. Reson. Spectrosc. 2011; 58: 62–87. [DOI] [PubMed] [Google Scholar]
  48. Wu D.H., Chen A.D., Johnson C.S. An Improved Diffusion-Ordered Spectroscopy Experiment Incorporating Bipolar-Gradient Pulses. J. Magn. Reson. Ser. A 1995; 115: 260–264. [Google Scholar]
  49. Zhou H.X. Dimensions of denatured protein chains from hydrodynamic data. J. Phys. Chem. B 2002; 106: 5769–5775. [Google Scholar]
  50. Zhou Z., Wang Z., Lv Q., Shi J., Zhong Y., Wu P., et al. SPX proteins regulate Pi homeostasis and signaling in different subcellular level. Plant Signal. Behav. 2015; 10: 10–12. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1
media-1.pdf (8.4MB, pdf)

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES