Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2020 Jan 21;21(8):1178–1187. doi: 10.1002/cbic.201900677

Rapid Biophysical Characterization and NMR Spectroscopy Structural Analysis of Small Proteins from Bacteria and Archaea

Nina Kubatova 1, Dennis J Pyper 1, Hendrik R A Jonker 1, Krishna Saxena 1, Laura Remmel 1, Christian Richter 1, Sabine Brantl 2, Elena Evguenieva‐Hackenberg 3, Wolfgang R Hess 4, Gabriele Klug 3, Anita Marchfelder 5, Jörg Soppa 6, Wolfgang Streit 7, Maxim Mayzel 8, Vladislav Y Orekhov 8,11, Monika Fuxreiter 9, Ruth A Schmitz 10,, Harald Schwalbe 1,
PMCID: PMC7217052  PMID: 31705614

Abstract

Proteins encoded by small open reading frames (sORFs) have a widespread occurrence in diverse microorganisms and can be of high functional importance. However, due to annotation biases and their technically challenging direct detection, these small proteins have been overlooked for a long time and were only recently rediscovered. The currently rapidly growing number of such proteins requires efficient methods to investigate their structure–function relationship. Herein, a method is presented for fast determination of the conformational properties of small proteins. Their small size makes them perfectly amenable for solution‐state NMR spectroscopy. NMR spectroscopy can provide detailed information about their conformational states (folded, partially folded, and unstructured). In the context of the priority program on small proteins funded by the German research foundation (SPP2002), 27 small proteins from 9 different bacterial and archaeal organisms have been investigated. It is found that most of these small proteins are unstructured or partially folded. Bioinformatics tools predict that some of these unstructured proteins can potentially fold upon complex formation. A protocol for fast NMR spectroscopy structure elucidation is described for the small proteins that adopt a persistently folded structure by implementation of new NMR technologies, including automated resonance assignment and nonuniform sampling in combination with targeted acquisition.

Keywords: NMR spectroscopy, proteomics, small proteins, structural biology, structure–activity relationships


Reading expands horizons: Structural analysis of small proteins encoded by previously overlooked small open reading frames (sORFs) is achieved by using NMR spectroscopy. A protocol is established for fast NMR spectroscopy structure screening, including optimized sample preparation for systems of this size (14–78 aa), and determination of the structural conformations adopted by the systems.

graphic file with name CBIC-21-1178-g005.jpg

Introduction

For a long time, technical limitations in detection and assumptions in the automated gene annotation tools that were too strict prevented the identification of peptides and small proteins encoded by small open reading frames (sORF). Only after several small proteins were identified in different organisms and shown to be encoded by sORFs was the previous assumption that an ORF should have a minimum length of 100 codons called into question. In the last decade, increasing efforts to identify and study peptides and small proteins and their functions with diverse computational and biochemical methods, including ribosome profiling and mass spectrometry optimized for peptides,1, 2 have resulted in paradigm‐shifting discoveries. An increasing number of small proteins have been found to play important roles in a broad range of cellular functions, including cell division, morphogenesis, and stress response.3

The nomenclature and definition of the maximal size of the small proteins encoded by sORFs varies between 50 and 100 amino acids.4 Typically, peptides are distinguished from proteins by the shorter length of their chain. The exact cutoff is vague, but often set to 50–60 amino acid residues.3 In addition, many names are used for the small‐sized proteins, including micropeptide, microprotein (μ‐protein), miniprotein, and small protein. Even the same name is sometimes defined differently, such as the term microprotein.5, 6, 7 To overcome this diversity, we herein refer to Storz et al.,3 who apply the term “small proteins”, and set the upper sequence length to be fewer than 80 residues.

Small proteins were found in all three domains of life (Archaea, Bacteria, and Eukarya). Small proteins can be categorized according to their mode of biosynthesis as either geneencoded/ribosomally synthesized (RPs) or non‐gene‐encoded/nonribosomally synthesized peptides (NRPs). In contrast to NRPs, RPs are restricted to 22 (including selenocysteine and pyrrolysine) proteinogenic amino acids and often undergo diverse post‐translational modifications (PTMs), such as hydroxylation, methylation, halogenation, prenylation, acylation, thioether and thioester crosslinking, epimerization, and macrocyclization.8, 9, 10 These modifications potentially decrease their structural flexibility and favor structure formation, increase their stability by preventing degradation by proteases, and provide additional regulatory functions. Ribosomally synthesized and post‐translationally modified peptides (RiPPs) often bind to a structured interaction partner. Prominent examples include the antibiotics thiostrepton and micrococcin from the thiopeptides family that bind to the GTPase‐associated region (GAR) of the 50S ribosome and inhibit translation.11 Peptides with antibacterial properties, also known as antimicrobial peptides (AMPs), are essential as therapeutic antibiotics for drug development.12, 13, 14, 15

Many small proteins, if overexpressed, are toxic for bacteria.16, 17, 18 In Escherichia coli, they are often hydrophobic and contain single transmembrane helices. This promotes inner membrane insertion and frequently the loss of membrane potential.16 Among pathogenic bacteria, important examples include the phenol‐soluble modulins (short, amphipathic, α‐helical peptides in staphylococci).19 Many of the small toxic proteins belong to the class I toxin/antitoxin loci, in which expression of a generally stable small and hydrophobic toxin (<60 aa) is inhibited at the level of translation by a cis‐encoded antisense RNA.18, 20, 21, 22 Apart from well‐described examples, for example, essential functions in persister cell formation,23, 24 the functional roles of predicted chromosomally encoded toxin/antitoxin loci often remain unknown. They might be involved in genomic integrity, slowdown of growth under stress conditions or simply act as “selfish DNA”.18

The photosynthetic apparatus contains a number of small proteins with functions that are conserved from cyanobacteria to higher plants; some with fewer than 50 amino acids play a role in photosynthetic electron transport (Cytb 6 f complex; petL, petN, petM, petG 25, 26, 27), in photosystem II (genes psbM, psbT (ycf8), psbI, psbL, psbJ, psbY, psbX, psb30 (ycf12), psbN, psbF, psbK 28, 29), in photosystem I (psaM, psaJ, psaI 30), or with accessory functions in photosynthesis (hliC).31 With 29 amino acids, the cytochrome b 6 f complex subunit VIII, encoded by petN, is the shortest of these proteins.27

However, elucidation of the function of small proteins remains a challenging task and the structural characterization of small proteins can help in elucidating their molecular mechanisms of action. Despite their small size, which has impeded their recognition, small proteins have been found in every living cell.3 Their regulatory function encompasses a wide range, including regulation of the enzyme activity of both membrane‐bound and cytosolic proteins. They can further induce or modulate the conformation of larger proteins or other biomolecules, and thus, can be part of complex signal transduction processes.

Small proteins with cellular function are not exclusively found in bacteria or archaea, but several eukaryotic small proteins have been identified and their functions studied. One example is the selective interaction of the neuropeptide bradykinin (BK), a peptide agonist (nine amino acids), with the human BK G protein coupled receptor, a drug target for cardiovascular regulation. Joedicke et al. investigated the conformational differences of the analogous peptides desArg10‐kallidin (DAKD) and BK induced by interaction with the target, to understand their specific binding behavior.12 Even though the peptides are very similar, their behavior is substantially different. The two small proteins show differences upon receptor binding: the free and bound forms of the kallidin peptide are very similar, whereas the conformation of BK is rearranged upon binding to its receptor. The kallidin peptide binds in an open conformation, whereas parts of the BK peptide chain adopt specific folded conformations. These results show that receptor specificity of peptide ligands is dependent on the conformational and chemical space of peptides. Conformational changes are not necessarily required to achieve specific peptide–receptor interactions (Figure 1).

Figure 1.

Figure 1

A comparison of the structures of DAKD and BK in free and BR‐bound forms determined by means of NMR spectroscopy.12

Small proteins are most often unstructured. In the case of residual structure, the preferred structural motifs are often restricted to α‐helical elements. Further short motifs that mediate specific protein interactions can be identified from the sequence, but they might require longer flanking regions to function.32 The function of many small proteins is often related to their ability to undergo a disorder‐to‐order transition; thus adopting a folded state upon interaction with their biological targets. However, function might not require such a disorder‐to‐order transition. The intrinsically disordered proteins (IDPs), for example, lack a persistent three‐dimensional folded structure and can still be functional, in particular, in transcriptional and translational regulation and in signal transduction within the cell.33, 34

The number of identified small proteins is currently rapidly growing due to novel approaches, including high‐resolution ribosome profiling with stalled initiation complexes.35, 36 Hence, more efficient methods are required to investigate the structure–function relationships of small proteins.

Elucidation of the relationship between the protein sequence, structure, and function is a key part of computational biology studies. Typical prediction methods are, however, limited to inherently structured proteins and are not convenient for disordered proteins, the fold and stability of which potentially depend on their biological target. The estimation of a favorable, stable, bound conformation gives an important insight into the probability of the protein folding upon complex formation; this might be related to its function.37 These prediction methods,38, 39, 40 which were initially developed for IDPs, were successfully applied to the set of 27 small proteins characterized in this study. Computational prediction results were accompanied by experimental solution‐state NMR spectroscopy data of small proteins in their free form.

NMR spectroscopy is an experimental technique that allows the determination of structure and dynamics of isolated small proteins in solution, as well as within complexes with diverse cellular components.41, 42 NMR spectroscopy structure determination has been successfully integrated into other structural genomic approaches to increase the number of resolved 3D proteins structures.43, 44 The quality assessment of the determined NMR protein structure can be further performed by the Worldwide Protein Data Bank (wwPDB) validation report.45, 46

Furthermore, NMR spectroscopy provides an unbiased readout of the folding state of a protein. The method can equally well describe the structure of folded proteins and of partially folded and unstructured (random coil) states.47 This potential of unbiased characterization is particularly important for the structural investigation of small proteins because many are unlikely to adopt a stable folded conformation if isolated in solution. Furthermore, because the proteins of interest (POI) are small, their NMR spectroscopic investigation is, in principle, rather straightforward, including rapid de novo structure determination.

To perform a routine secondary‐structure screening of a high number of small proteins, the sample preparation strategy is required to be fast and precise. Hence, we describe the optimization of a pipeline to conduct structural studies of more than 20 different small proteins, which we investigate within a priority program on small proteins (SPP2002) that has been funded by the German Research Foundation (DFG) since 2018. We have established a routine work protocol and screened 27 small proteins for secondary structure. Within this screening, we observed all possible conformations: 1) entirely unstructured proteins (random‐coil state): 2) proteins with a fluctuating amount of residual structure, but no defined tertiary fold (molten globule), as well as specific structures with only lowly populated excited states; and 3) structured proteins.48, 49 We report herein on the protocol for structure determination of one exemplary small protein (SP‐22; PDB ID: 6Q2Z; BMRB ID: 34334).50 For high‐throughput structural approaches, a reduction of measurement time is advantageous. We used the nonuniform sampling (NUS) approach in combination with targeted acquisition (TA) and a new multidimensional decomposition (MDD) signal processing technique.51, 52, 53

Results and Discussion

Expression and purification strategy

We established an efficient workflow for the characterization of conformation and dynamics of small proteins (Figure 2). Within this workflow, small proteins containing fewer than 30 amino acids were synthesized by means of SPPS and purified by reversed‐phase HPLC. Small proteins containing more than 30 amino acids were heterologously expressed in E. coli. For NMR spectroscopy and structure determination, 15N‐ and 15N/13C‐labeling schemes were performed by using M9 minimal medium containing 15N‐labeled ammonium chloride and 13C‐labeled glucose. The small POI were expressed as N‐terminal small ubiquitin‐related modifier (SUMO) fusion proteins, which enhanced expression, improved solubility, masked possible toxicity, and reduced the proteolytic degradation of recombinant proteins. In particular, the SUMO tag allowed the generation of the exact desired N terminus of POI.54 The SUMO protease cleaved the tertiary SUMO folding motif without additional amino acids.55 This remarkable ability was very important for small POI, in which every additional amino acid can influence the structural properties. Addition of a hexahistidine tag to the SUMO fusion facilitated purification by means of tandem Ni‐NTA affinity chromatography56 followed by size exclusion chromatography. In case a construct did not express with the SUMO tag (e.g., for small protein SP‐24), a hexahistidine tag was linked to the protein sequence through a thrombin cleavage recognition site. Subsequent purification included Ni‐NTA affinity chromatography followed by size exclusion chromatography to separate the small protein from the cleaved hexahistidine tag. For POI that incorporated metal ions (e.g., Zn‐binding proteins), the Ni‐NTA affinity chromatography purification strategy was changed to affinity chromatography with glutathione‐sepharose beads57 by adding an N‐terminal glutathione S‐transferase (GST) fusion tag. Removal of the GST fusion partner after tobacco etch virus (TEV) cleavage was easily achieved by using size exclusion chromatography. The additional amino acids remaining on the N terminus after cleavage are glycine and alanine, with potential small influence on the structure in the elongated small protein. Identification and purity of the produced small POI were confirmed by means of SDS‐PAGE analysis and mass spectrometry (MALDI).

Figure 2.

Figure 2

Workflow for NMR spectroscopy structural investigations of small proteins, including synthesis/expression and purification steps. CD: circular dichroism.

Rapid secondary‐structure determination

After protein purification to a purity above 95 %, the POI were analyzed by means of CD spectroscopy (Figure 3 A) and 1D 1H NMR spectroscopy (Figure 3 C) to obtain global information about their secondary structures. Two‐dimensional 1H,15N HSQC NMR spectra, which can nowadays often be recorded by using the low natural abundance of 0.3 % in 15N, allow direct structural readout. The dispersion of the detected backbone amide signals is a structural fingerprint of the protein. Their position, intensities, and line widths are markers for the conformational state of the investigated protein: structured (Figure 3 D, left), unstructured (Figure 3 D, middle), or partially structured and undergoing millisecond intermediate exchange (Figure 3 D, right). Chemical shift analysis by TALOS allows a detailed and reliable site‐specific secondary‐structure prediction (Figure 3 E).58 TALOS analysis requires the full backbone resonance assignment of the small protein, for which further 3D NMR spectroscopy experiments on a 15N,13C‐labeled sample are needed.

Figure 3.

Figure 3

Left: SP‐22 small protein from Haloferax volcanii; middle: SP‐6 small protein from Methanosarcina mazei; right: SP‐10 small protein from Sinorhizobium fredii. A) CD spectra in phosphate buffer at pH 7. B) MALDI mass analysis of the purified small protein. C) The 1D 1H NMR spectra with an enlargement of the amide proton region. D) The 2D 1H,15N HSQC spectra at 600 MHz, 298 K. E) Left and middle: TALOS secondary‐structure prediction of the residues that are classified as “good”; right: expanded regions from the 2D 1H,15N HSQC spectrum for three signals showing differences in line width and intensity of the NMR signals, which are characteristic for molten globule‐type conformational behavior. These results agree perfectly with calculations by the Espritz method in free‐form propensity of structures SP‐22 (73.3 % folded), SP‐6 (0 % structured), and SP‐10 (61.3 % molten globule).

The throughput for structural screening of small proteins is limited by the NMR spectroscopy measurement time needed to record multidimensional spectra, their resonance assignment, and subsequent structure determination. Depending on the protein concentration, the data needed for determining a protein structure require at least three weeks of measurement time followed by data analysis. The reduced signal overlap achieved by the highly resolved multidimensional spectra is crucial for (manual or automated) signal assignment and structure calculation, but this procedure is very time‐consuming, and thus, not applicable for the screening of a large set of small proteins (and likewise for unstable proteins). The NUS approach provides a solution because it strongly reduces the experimental time and/or allows for higher resolution without the loss of important information, if the obtainable signal‐to‐noise ratio is sufficient.59

In addition, a TA and a new MDD signal‐processing technique51, 52, 53 applied for nonuniform sampled data dramatically reduces the NMR spectroscopy measurement time and simplifies the evaluation of the spectra. In combination with automated methods for signal list generation and resonance assignment (such as FLYA),60 this method is suitable for the rapid structural screening of small proteins.

Structure elucidation (by using CYANA with fully automated NOESY crosspeak assignment)60, 61, 62, 63 can be performed after completing backbone and side‐chain NMR resonance assignment. The final structure calculation generally includes NOE data, hydrogen bonds, ample dihedral angle restraints based on TALOS predictions, and 3 J(HN,Hα) coupling constants.64

Application of the pipeline for 11 bacterial small proteins and 17 archaeal small proteins

In total, 27 different small proteins, ranging from 14 to 78 amino acids, were screened. Ten small proteins were synthesized by means of SPPS (14–31 amino acids) and 17 small proteins were expressed in E. coli (38–71 amino acids). Of these, three small proteins could not be expressed, for four we initially obtained degraded samples, and for one further small protein structural investigation was not possible because of insolubility due to its hydrophobicity. We rapidly assessed the secondary‐structure screening of the remaining 19 small proteins by means of CD and NMR spectroscopy.

All small proteins with fewer than 30 amino acids were found to be unstructured. Six small proteins adopted a molten globule conformation. Thus, almost all small proteins with fewer than 30 amino acids were shown to be predominantly unstructured, whereas those between 30 and 80 amino acids tended to adopt a structured or partially structured conformation. The small size of the proteins clearly seems to restrict the possible structural motifs. Nevertheless, structures might be adopted for these small proteins upon interaction with their biological targets. Furthermore, five small proteins (all above 50 amino acids) were found to be structured or to contain a partially structured conformation. Interestingly, two of the structured small proteins contained at least one Zn2+‐binding motif, which might lead to structural stability of these proteins. The structural investigation of these small proteins was conducted by using 13C/15N isotope labeling (Table 1 and Table S1 A–D in the Supporting Information). Structure determination of one folded small protein has been published,50 and the requirements for structure determination and the quality of structure are exemplarily shown for the SP‐22 small protein from H. volcanii (Figure 4).

Table 1.

Structural analysis validated by means of NMR and CD spectroscopy of the small proteins screened in this study. The table is shown with an ascending molecular weight of small proteins. A full overview can be found in Table S1 A–D.

ID

aa

MW

Microorganism

CD and NMR

[kDa]

structural analysis

SP‐1

14

1.6

Bradyrhizobium japonicum

unstructured

SP‐2

14

1.8

Sinorhizobium meliloti

unstructured

SP‐3

18

1.9

Dinoroseobacter shibae

unstructured

SP‐5

23

2.6

M. mazei

unstructured

SP‐6

23

2.8

M. mazei

unstructured

SP‐7

27

2.9

S. meliloti

unstructured

SP‐8

28

3.1

M. mazei

unstructured

SP‐9

29

3.1

M. mazei

unstructured

SP‐10

31

3.7

S. fredii

molten globule

SP‐11

38

4.0

H. volcanii

molten globule

SP‐12

39

4.5

Bacillus subtilis

molten globule

SP‐13

43

4.8

S. fredii

molten globule

SP‐19

51

5.7

H. volcanii

structured

SP‐21

59

6.5

H. volcanii

structured

SP‐22

60

6.7

H. volcanii

structured

SP‐23

61

6.9

H. volcanii

molten globule

SP‐24

61

7.1

M. mazei

structured

SP‐25

61

7.2

H. volcanii

molten globule

SP‐27

78

8.1

H. volcanii

partially structured

Figure 4.

Figure 4

Structural characterization of SP‐22 small protein from H. volcanii. (PDB ID: 6Q2Z; BMRB ID: 34334). A) The amino acid sequence and the schematic representation of the secondary‐structure elements based on the solution‐state NMR spectroscopy structure. B) Sequential assignment for residues K3 to D23. The 3D HNCACB NMR spectrum was recorded at 700 MHz, 298 K; it contains 5 mm protein, 50 mm sodium phosphate buffer pH 7.5, 100 mm NaCl, 5 % D2O, 0.5 mm DSS. CA are shown in red and CB are highlighted in green. C) 1H,15N Best TROSY spectrum (600 MHz) of 5 mm small protein in 50 mm sodium phosphate buffer, pH 7.5, 100 mm NaCl, 95 % H2O/5 % D2O at 298 K. Backbone resonance assignment is indicated. D) Solution‐state NMR spectroscopy structure of SP‐22 protein. Ribbon representation of the best 20 structures is shown as a symmetrical dimer. The monomer consists of one α helix and four β‐sheet regions. Black and red represent two monomeric subunits. The figure was generated with PyMOL.50

For eight small proteins (four from archaea and four from bacteria), we could not establish protocols for expression. They range from 18–71 amino acids, and thus, do not show any bias in size. At this point, we restricted ourselves to study systems for which the above described workflow could easily be applied. Further efforts for some of the sequences are currently pursued, but beyond the scope of this contribution.

The small proteins screened in this study belong to different bacterial and archaeal organisms (Table S1 D), but the approach is not limited to prokaryotic systems and can be applied to eukaryotic small proteins, as well in future investigations.

Bioinformatic sequence‐to‐structure analysis

A bioinformatic sequence‐to‐structure analysis provides a first readout for possible secondary structure of the small protein. For this purpose, we used fast online structure prediction tools, including pep‐FOLD,65, 66 PEPstrMOD,67, 68 and SWISS‐MODEL.69, 70, 71, 72, 73 The sequence‐based secondary‐structure prediction for small proteins containing fewer than 30 residues generally predicted α‐helical regions. These predictions, however, were not confirmed by CD and NMR spectra, which actually reported a completely unstructured state (Table 1). The s2D method,40 however, which is based on NMR chemical shifts, predicted 12 out of 20 sequences dominantly (>50 %) coiled or disordered (Table S2). In addition, SWISS‐MODEL secondary‐structure predictions, which can be applied to small proteins with more than 30 amino acids, can provide a good starting point if homologous domains or structures with high sequence identity already exist and can therefore be used for generating a structural model of the small protein (Figure S1). SWISS‐MODEL and the s2D method can be used together to predict the structure and dynamics/disorder of small proteins.74

The second layer to link the sequence to structure is to estimate the preference for a folded, stable conformation. A series of experimental and computational results indicate that approximately 40 % of eukaryotic proteomes do not adopt a well‐defined structure, but interconvert between alternative conformations in their native state.75 IDP regions or whole domains often serve regulatory roles through interaction partners, which may induce their folding.33 These findings have triggered the development of a plethora of algorithms to predict the preference for a folded tertiary structure versus intrinsic disorder. The Espritz NMR spectroscopy method with the highest performance, relative to that of NMR data,39 predicted that 11 out of the 19 sequences analyzed in our study contained more than 50 % structured residues and 4 sequences had 30–50 % disordered residues (Tables 2 and S4). This was comparable to the results obtained by a meta approach based on a combination of predictors (PrDOS, DisoPred2, VSL2, IUPred;38 Table S3). Except for short sequences with fewer than 25 amino acids, disorder predictions were in accordance with our experimental data (Table 1). This problem was due to the compositional bias in disordered regions,76 which was difficult to evaluate in short sequences. Small proteins were found to be mostly unstructured, similarly to the protein sequences that were prone to “default” degradation by the 20S proteasomes,77 which significantly decreased the half‐life of these proteins.78 We found that the propensity of predicting residues to be in structured regions paralleled the experimentally detected increasing order in molten globule states and structured small proteins (Tables 2 and S4).

Table 2.

Espritz NMR spectroscopy predictions of sequences in free form. Dynamic transitions induced by interactions were computed by using the FuzPred method with a reference to the Espritz NMR spectroscopy free form. Small proteins were combined in classes with respect to experimental secondary‐structure screening analysis.

ID

aa

Free form [%]

Bound form [%]

Structured

Disordered

Structured

Disordered

folded

SP‐19

51

68.6

31.4

92.2

7.8

SP‐21

59

71.2

28.8

84.7

15.3

SP‐22

60

73.3

26.7

100

0

SP‐24

61

100

0

100

0

SP‐27

78

100

0

100

0

molten

SP‐10

31

61.3

38.8

93.5

6.5

globule

SP‐11

38

73.7

26.4

100

0

SP‐12

39

20.5

79.4

41

59

SP‐13

43

53.5

46.5

67.4

32.6

SP‐23

61

67.2

32.8

78.7

21.3

SP‐25

61

19.7

80.4

75.4

24.6

unstructured

SP‐1

14

78.6

21.4

100

0

SP‐2

14

21.4

78.6

78.6

21.4

SP‐3

18

0

100

61.1

38.9

SP‐5

23

21.7

78.2

87

13

SP‐6

23

0

100

52.2

47.8

SP‐7

27

0

100

63

37

SP‐8

28

0

100

57.1

42.9

SP‐9

29

79.3

20.7

100

0

Disordered peptides may be stabilized by interactions through “coupled folding to binding” mechanisms.79 Conformational heterogeneity, however, can also be retained in complexes. Complex formation then does not induce (complete) structure.37 Thus, it is important to predict the probability of folding upon binding. For this, we used the FuzPred algorithm.80 This method is based on the local compositional and dynamical bias in interacting regions, which can be quantified without prior knowledge of specific interaction partners (protein, RNA, DNA, metabolite, or other possible partners). The method has been validated on more than 2000 protein complexes in the PDB, including both structured and disordered assemblies. Based on FuzPred predictions, the structure propensity increased by interaction partners, that is, disordered regions tended to fold upon binding (Tables 2 and S2 C). The propensity of residues to adopt alternative conformations or exhibit conformational exchange in the bound state was fewer than 20 % in structured sequences and fewer than 35 % in molten globule states. In our screening, SP‐12 was the only sequence that was predicted to dynamically fluctuate in the presence of an interaction partner, possibly owing to the presence of tandem, multivalent motifs. Degradation‐prone small proteins could also gain structure upon binding, which might prolong their lifetimes.81 Unstructured sequences often still exhibit a considerable fraction of disordered residues in the bound state. These regions might be involved in further interactions or regulation of larger, supramolecular assemblies.82, 83

Screening of folding conditions

One of the main tasks in structural proteomics is to identify the optimal protein folding conditions suitable for structure elucidation by means of NMR spectroscopy. This procedure includes screening of the protein stability and ability to fold at different pH, buffers, salt concentration, and temperature. For a first rapid screening, we used CD spectroscopy because these experiments benefited from low sample costs (no isotope labeling needed) and fast implementation. The CD profile is indicative of unstructured and folded protein and can provide information about the presence of structural elements. NMR spectroscopy, however, can provide more precise and detailed information, and is the only possibility for screening proteins with different conformational states or partially folded molten globule states.

Influence of pH and salt concentration on protein conformation

The protein SP‐23 from archaeon H. volcanii, which is part of our screening, shows conformational changes that are dependent on stress and high salt conditions and is therefore an excellent example to show the influence of varied conditions. Changes in pH can dramatically affect the 2D 1H,15N HSQC protein spectra. The expected number of signals is observed at neutral pH 7, whereas a decrease to more acidic pH 6 leads to double the amount of signals; this clearly indicates the presence of conformational heterogeneity under these conditions. In contrast, an increase of the pH to more basic pH 8 leads to faster amide proton exchange and results in signal loss in the spectra (Figure S2 A). This effect is observed in secondary‐structure elements that undergo dynamic opening or closing events and, at basic pH, is difficult to correlate to structure or function of the POI.

One representative conformational marker in the 2D 1H,15N HSQC spectrum is the tryptophan indole side‐chain signal, which appears at an isolated spectral region downfield at around δ=10 ppm (1H). Although tryptophan occurs only once in the protein sequence, the spectrum shows a doublet signal at all screened pH conditions; thus suggesting the presence of a minor populated conformation.

Because this protein is involved in high salt regulation, we monitored the conformational changes upon increasing the NaCl concentration (Figure S2 B). A series of 2D 1H,15N HSQC spectra were recorded at pH 7, with NaCl concentrations ranging from 0 to 1 m. The 2D 1H,15N HSQC spectrum underwent significant changes upon addition of 100 mm NaCl, resulting in the loss of signals and appearance of new signals. A further increase of the salt concentration to 300 mm led to chemical shift perturbations (CSPs) of almost all of the signals, and thus, was indicative of charge screening of the protein. A substantial number of new, less intense signals appeared, which indicated an upcoming low populated conformation. An even further increase in the salt concentration only led to minor CSPs and reached a steady state at 500 mm salt concentration.

Temperature dependence on folding

In contrast to stable folded proteins with linear temperature‐dependent amide chemical‐shift changes (indicative of the presence or absence of hydrogen bonds), small proteins in molten globule states and in conformational equilibria are most often sensitive to changes in temperature, which may impact their preferred structural motif. The proper conditions for temperature‐induced folding can thus be screened easily and rapidly by following the amide signal changes in a series of 2D 1H,15N HSQC spectra measured at different temperatures.

For example, for one of the screened small proteins, SP‐21, we could detect temperature‐dependent conformational heterogeneity (Figure S3). Monitoring the changes of the amide chemical shift and its signal intensity upon temperature variation in 2D 1H,15N HSQC spectra provided information about the structural changes and the equilibrium shift between the conformations adopted at different temperatures.

NMR spectroscopy time‐saving methods

If the screened small protein appears to be folded, the following steps for elucidating the solution‐state NMR spectroscopy structure are conventionally rather time‐consuming. To speed up these steps, several methods have been developed that shorten the experimental time and processes for resonance assignment and NOESY crosspeak assignment. Reduction of the NMR spectroscopy measurement time can be achieved by applying TA in combination with the MDD signal‐processing technique on NUS data.51, 52, 53 Subsequently, a method for automated resonance assignment, FLYA,60 can be performed by using the program CYANA for automated NOESY crosspeak assignment and structure calculations.60, 61, 62, 63 The benefit of using this technique is exemplified for the SP‐22 small protein.

The NMR spectroscopy experiments used for manual assignment were also used as input for the automated assignment with FLYA. The manually determined chemical‐shift assignment was used as a gold standard. Most of the automated assignments were in good agreement with the manual assignment (Figure S4 A), and thus, demonstrated the high quality of the automated FLYA assignment method. Furthermore, FLYA, as a part of the TA procedure, could be performed in a few minutes, which was a dramatic reduction in comparison with the manual assignment process, which generally takes roughly two weeks. The set of experiments for the backbone assignment was acquired by using NUS and the TA technique and included the following 3D heteronuclear NMR spectroscopy experiments: HNCO, HN(CO)CA, HNCA, HN(CO)CACB, HNCACB, and HN(CA)CO. Analysis of the quality of the recorded spectra was performed by using FLYA. The manual process for structure determination by means of NMR spectroscopy, based on time‐consuming, conventional NMR spectroscopy methods, was used as a reference. The measurement time was reduced by more than 20 times (5 days for conventional methods and 4.5 h for TA in combination with MDD) without significant loss of spectral quality (Figures S4 and S5).

Conclusion

NMR spectroscopy can serve as a powerful tool for structural screening of small proteins to provide detailed information about their conformations. We showed that optimization of the folding conditions for structure determination of proteins could profit from rapidly performed CD and NMR spectroscopy experiments.

Using these methods, we demonstrate that most of the investigated small proteins do not adopt a persistent conformation and exist mostly in an unstructured or partially structured state. This is in perfect agreement with the results obtained by the Espritz NMR spectroscopy disorder prediction method performed for small proteins in the free state. Very likely, their small size restricts them to adopt a stable fold. However, these small proteins may need an interaction partner to gain a structured conformation through complex formation. FuzPred predictions of the structure propensity show an increased trend for folded conformation upon intermolecular interaction. Specific interaction partners have to be identified and their induced conformational changes have to be investigated in the future.

The measurement time needed for structural analysis can be reduced by using TA, in combination with the MDD signal processing technique, applied on NUS data. This protocol was successfully applied on two small proteins. The implemented automated FLYA assignment simplifies the evaluation of the spectra and speeds up the structural screening of small proteins.

The investigation of small proteins represents a new field of research that has increased rapidly in recent years. Genomic research will predict and generate a large number of new small proteins. For these small proteins, the function needs to be determined. Akin to the field of IDPs, the structure–function paradigm of molecular biology is challenging. Thus, it is important to holistically investigate these small proteins. Herein, we showed that NMR spectroscopy could be applied to rapidly screen for secondary structure and conformational properties of small proteins, including detailed structural analysis for those proteins that, despite being small, adopted a persistent three‐dimensional fold.

Conflict of interest

The authors declare no conflict of interest.

Supporting information

As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.

Supplementary

Acknowledgements

This work was supported by the Deutsche Forschungsgemeinschaft (DFG) within the SPP 2002 priority program and the Swedish Research Council (research grant 201504614). Work at BMRZ is supported by the state of Hessen.

N. Kubatova, D. J. Pyper, H. R. A. Jonker, K. Saxena, L. Remmel, C. Richter, S. Brantl, E. Evguenieva-Hackenberg, W. R. Hess, G. Klug, A. Marchfelder, J. Soppa, W. Streit, M. Mayzel, V. Y. Orekhov, M. Fuxreiter, R. A. Schmitz, H. Schwalbe, ChemBioChem 2020, 21, 1178.

Contributor Information

Prof. Dr. Ruth A. Schmitz, Email: rschmitz@ifam.uni-kiel.de.

Prof. Dr. Harald Schwalbe, Email: schwalbe@nmr.uni-frankfurt.de.

References

  • 1. Pueyo J. I., Magny E. G., Couso J. P., Trends Biochem. Sci. 2016, 41, 665–678. [DOI] [PubMed] [Google Scholar]
  • 2. Makarewich C. A., Olson E. N., Trends Cell Biol. 2017, 27, 685–696. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Storz G., Wolf Y. I., Ramamurthi K. S., Annu. Rev. Biochem. 2014, 83, 753–777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Scheidler C., Kick L., Schneider S., ChemBioChem 2019, 20, 1–9. [DOI] [PubMed] [Google Scholar]
  • 5. Kent S., Curr. Opin. Biotechnol. 2004, 15, 607–614. [DOI] [PubMed] [Google Scholar]
  • 6. Eguen T., Straub D., Graeff M., Wenkel S., Trends Plant Sci. 2015, 20, 477–482. [DOI] [PubMed] [Google Scholar]
  • 7. D'Lima N. G., Ma J., Winkler L., Chu Q., Loh K. H., Corpuz E. O., Budnik B. A., Lykke-Andersen J., Saghatelian A., Slavoff S. A., Nat. Chem. Biol. 2017, 13, 174–180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Arnison P. G., Bibb M. J., Bierbaum G., Bowers A. A., Bugni T. S., Bulaj G., Camarero J. A., Campopiano D. J., Challis G. L., Clardy J., Cotter P. D., Craik D. J., Dawson M., Dittmann E., Donadio S., Dorrestein P. C., Entian K.-D., Fischbach M. A., Garavelli J. S., Göransson U., Gruber C. W., Haft D. H., Hemscheidt T. K., Hertweck C., Hill C., Horswill A. R., Jaspars M., Kelly W. L., Klinman J. P., Kuipers O. P., Link A. J., Liu W., Marahiel M. A., Mitchell D. A., Moll G. N., Moore B. S., Müller R., Nair S. K., Nes I. F., Norris G. E., Olivera B. M., Onaka H., Patchett M. L., Piel J., Reaney M. J. T., Rebuffat S., Ross R. P., Sahl H.-G., Schmidt E. W., Selsted M. E., Severinov K., Shen B., Sivonen K., Smith L., Stein T., Süssmuth R. D., Tagg J. R., Tang G.-L., Truman A. W., Vederas J. C., Walsh C. T., Walton J. D., Wenzel S. C., Willey J. M., van der Donk W. A., Nat. Prod. Rep. 2013, 30, 108–160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Funk M. A., Van Der Donk W. A., Acc. Chem. Res. 2017, 50, 1577–1586. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. McIntosh J. A., Donia M. S., Schmidt E. W., Nat. Prod. Rep. 2009, 26, 537–559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Harms J. M., Wilson D. N., Schluenzen F., Connell S. R., Stachelhaus T., Zaborowska Z., Spahn C. M. T., Fucini P., Mol. Cell 2008, 30, 26–38. [DOI] [PubMed] [Google Scholar]
  • 12. Joedicke L., Mao J., Kuenze G., Reinhart C., Kalavacherla T., Jonker H. R. A., Richter C., Schwalbe H., Meiler J., Preu J., Michel H., Glaubitz C., Nat. Chem. Biol. 2018, 14, 284–290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Lewies A., Du Plessis L. H., Wentzel J. F., Probiotics Antimicrob. Proteins 2019, 11, 370–381. [DOI] [PubMed] [Google Scholar]
  • 14. Pfalzgraff A., Brandenburg K., Weindl G., Front. Pharmacol. 2018, 9, 1–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Mahlapuu M., Håkansson J., Ringstad L., Björn C., Front. Cell. Infect. Microbiol. 2016, 6, 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Hemm M. R., Paul B. J., Schneider T. D., Storz G., Rudd K. E., Mol. Microbiol. 2008, 70, 1487–1501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Unoson C., Wagner E. G. H., Mol. Microbiol. 2008, 70, 258–270. [DOI] [PubMed] [Google Scholar]
  • 18. Fozo E. M., Kawano M., Fontaine F., Kaya Y., Mendieta K. S., Jones K. L., Ocampo A., Rudd K. E., Storz G., Mol. Microbiol. 2008, 70, 1076–1093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Otto M., Int. J. Med. Microbiol. 2014, 304, 164–169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Fozo E. M., Hemm M. R., Storz G., Microbiol. Mol. Biol. Rev. 2008, 72, 579–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Brantl S., Jahn N., FEMS Microbiol. Rev. 2015, 39, 413–427. [DOI] [PubMed] [Google Scholar]
  • 22. Wagner E. G. H., Romby P., Adv. Genetics 2015, 90, 133–208. [DOI] [PubMed] [Google Scholar]
  • 23. Wagner E. G. H., Unoson C., RNA Biol. 2012, 9, 1513–1519. [DOI] [PubMed] [Google Scholar]
  • 24. Dörr T., Vulić M., Lewis K., PLoS Biol. 2010, 8, 29–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Baniulis D., Yamashita E., Whitelegge J. P., Zatsman A. I., Hendrich M. P., Hasan S. S., Ryan C. M., Cramer W. A., J. Biol. Chem. 2009, 284, 9861–9869. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Allen J. F., Trends Plant Sci. 2004, 9, 130–137. [DOI] [PubMed] [Google Scholar]
  • 27. Schneider D., Volkmer T., Rögner M., Res. Microbiol. 2007, 158, 45–50. [DOI] [PubMed] [Google Scholar]
  • 28. Guskov A., Kern J., Gabdulkhakov A., Broser M., Zouni A., Saenger W., Nat. Struct. Mol. Biol. 2009, 16, 334–342. [DOI] [PubMed] [Google Scholar]
  • 29. Wang Q., Pakrasi H. B., Carroll J. A., Satoh K., Kashino Y., Lauber W. M., Whitmarsh J., Biochemistry 2002, 41, 8004–8012. [DOI] [PubMed] [Google Scholar]
  • 30. Fromme P., Melkozernov A., Jordan P., Krauss N., FEBS Lett. 2003, 555, 40–44. [DOI] [PubMed] [Google Scholar]
  • 31. Knoppová J., Sobotka R., Tichý M., Yu J., Konik P., Halada P., Nixon P. J., Komenda J., Plant Cell 2014, 26, 1200–1212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Davey N. E., Cyert M. S., Moses A. M., Cell Commun. Signaling 2015, 13, 9–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Dyson H. J., Wright P. E., Nat. Rev. Mol. Cell Biol. 2005, 6, 197–208. [DOI] [PubMed] [Google Scholar]
  • 34. van der Lee R., Buljan M., Lang B., Weatheritt R. J., Daughdrill G. W., Dunker A. K., Fuxreiter M., Gough J., Gsponer J., Jones D. T., Kim P. M., Kriwacki R. W., Oldfield C. J., Pappu R. V., Tompa P., Uversky V. N., Wright P. E., Babu M. M., Chem. Rev. 2014, 114, 6589–6631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Meydan S., Marks J., Klepacki D., Sharma V., Baranov P. V., Firth A. E., Margus T., Kefi A., Vázquez-Laslop N., Mankin A. S., Mol. Cell 2019, 74, 481–493.e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Weaver J., Mohammad F., Buskirk A. R., Storz G., mBio 2019, 10, e02819-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Fuxreiter M., Curr. Opin. Struct. Biol. 2019, 54, 19–25. [DOI] [PubMed] [Google Scholar]
  • 38. Ishida T., Kinoshita K., Bioinformatics 2008, 24, 1344–1348. [DOI] [PubMed] [Google Scholar]
  • 39. Walsh I., Martin A. J. M., Di Domenico T., Tosatto S. C. E., Bioinformatics 2012, 28, 503–509. [DOI] [PubMed] [Google Scholar]
  • 40. Sormanni P., Camilloni C., Fariselli P., Vendruscolo M., J. Mol. Biol. 2015, 427, 982–996. [DOI] [PubMed] [Google Scholar]
  • 41. Wüthrich K., Angew. Chem. Int. Ed. 2003, 42, 3340–3363; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2003, 115, 3462–3486. [Google Scholar]
  • 42. Kay L. E., J. Magn. Reson. 2005, 173, 193–207. [DOI] [PubMed] [Google Scholar]
  • 43. Serrano P., Dutta S. K., Proudfoot A., Mohanty B., Susac L., Martin B., Geralt M., Jaroszewski L., Godzik A., Elsliger M., Wilson I. A., Wüthrich K., FEBS J. 2016, 283, 3870–3881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Gileadi O., Knapp S., Lee W. H., Marsden B. D., Müller S., Niesen F. H., Kavanagh K. L., Ball L. J., Von Delft F., Doyle D. A., Oppermann U. C. T., Sundström M., J. Struct. Funct. Genomics 2007, 8, 107–119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Montelione G. T., Nilges M., Bax A., Güntert P., Herrmann T., Richardson J. S., Schwieters C. D., Vranken W. F., Vuister G. W., Wishart D. S., Berman H., Kleywegt G. J., Markley J. L., Structure 2013, 21, 1563–1570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Rosato A., Tejero R., Montelione G. T., Curr. Opin. Struct. Biol. 2013, 23, 715–724. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Schwalbe H., Fiebig K. M., Buck M., Jones J. A., Grimshaw S. B., Spencer A., Glaser S. J., Smith L. J., Dobson C. M., Biochemistry 1997, 36, 8977–8991. [DOI] [PubMed] [Google Scholar]
  • 48. Grote J., Krysciak D., Petersen K., Güllert S., Schmeisser C., Förstner K. U., Krishnan H. B., Schwalbe H., Kubatova N., Streit W. R., Front. Microbiol. 2016, 7, 1858. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Hahn J., Thalmann S., Migur A., Freiherr von Boeselager R., Kubatova N., Kubareva E., Schwalbe H., Evguenieva-Hackenberg E., RNA Biol. 2016, 13, 00-00. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Kubatova N., Jonker H. R. A., Saxena K., Richter C., Vogel V., Schreiber S., Marchfelder A., Schwalbe H., ChemBioChem 2019, 20, 1–9. [DOI] [PubMed] [Google Scholar]
  • 51. Isaksson L., Mayzel M., Saline M., Pedersen A., Rosenlöw J., Brutscher B., Karlsson B. G., Orekhov V. Y., PLoS One 2013, 8, e62947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Orekhov V. Y., Jaravine V. A., Prog. Nucl. Magn. Reson. Spectrosc. 2011, 59, 271–292. [DOI] [PubMed] [Google Scholar]
  • 53. Jaravine V. A., Zhuravleva A. V., Permi P., Ibraghimov I., Orekhov V. Y., J. Am. Chem. Soc. 2008, 130, 3927–3936. [DOI] [PubMed] [Google Scholar]
  • 54. Malakhov M. P., Mattern M. R., Malakhova O. A., Drinker M., Weeks S. D., Butt T. R., J. Struct. Funct. Genomics 2004, 5, 75–86. [DOI] [PubMed] [Google Scholar]
  • 55. Marblestone J. G., Protein Sci. 2006, 15, 182–189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Zuo X., Mattern M. R., Tan R., Li S., Hall J., Sterner D. E., Shoo J., Tran H., Lim P., Sarafianos S. G., Kazi L., Navas-Martin S., Weiss S. R., Butt T. R., Protein Expr. Purif. 2005, 42, 100–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Harper S., Speicher D. W., Curr. Protoc. Protein Sci. 2008, 1–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Shen Y., Bax A., J. Biomol. NMR 2013, 56, 227–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Kazimierczuk K., Orekhov V., Magn. Reson. Chem. 2015, 53, 921–926. [DOI] [PubMed] [Google Scholar]
  • 60. Schmidt E., Güntert P., J. Am. Chem. Soc. 2012, 134, 12817–12829. [DOI] [PubMed] [Google Scholar]
  • 61. Varmazyari A., Khalili-Mahaleh J., Roshdi M., Kharazmi K., Adv. Environ. Biol. 2012, 6, 587–592. [Google Scholar]
  • 62. Güntert P. in Protein NMR Techniques (Ed.: A. K. Downing), Humana, Totowa, 2004, pp. 353–378. [Google Scholar]
  • 63. Güntert P., Prog. Nucl. Magn. Reson. Spectrosc. 2003, 43, 105–125. [Google Scholar]
  • 64. Vuister G. W., Bax A., J. Am. Chem. Soc. 1993, 115, 7772–7777. [Google Scholar]
  • 65. Shen Y., Maupetit J., Derreumaux P., Tufféry P., J. Chem. Theory Comput. 2014, 10, 4745–4758. [DOI] [PubMed] [Google Scholar]
  • 66. Thévenet P., Shen Y., Maupetit J., Guyon F., Derreumaux P., Tufféry P., Nucleic Acids Res. 2012, 40, W288–W293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Kaur H., Garg A., Raghava G. P. S., Protein Pept. Lett. 2007, 14, 626–631. [DOI] [PubMed] [Google Scholar]
  • 68. Singh S., Singh H., Tuknait A., Chaudhary K., Singh B., Kumaran S., Raghava G. P. S., Biol. Direct 2015, 10, 1–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Guex N., Peitsch M. C., Schwede T., Electrophoresis 2009, 30, S162–S173. [DOI] [PubMed] [Google Scholar]
  • 70. Benkert P., Biasini M., Schwede T., Bioinformatics 2011, 27, 343–350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Bertoni M., Kiefer F., Biasini M., Bordoli L., Schwede T., Sci. Rep. 2017, 7, 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Bienert S., Waterhouse A., De Beer T. A. P., Tauriello G., Studer G., Bordoli L., Schwede T., Nucleic Acids Res. 2017, 45, D313–D319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Waterhouse A., Bertoni M., Bienert S., Studer G., Tauriello G., Gumienny R., Heer F. T., De Beer T. A. P., Rempfer C., Bordoli L., Lepore R., Schwede T., Nucleic Acids Res. 2018, 46, W296–W303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Sormanni P., Piovesan D., Heller G. T., Bonomi M., Kukic P., Camilloni C., Fuxreiter M., Dosztanyi Z., Pappu R. V, Babu M. M., Longhi S., Tompa P., Dunker A. K., Uversky V. N., Tosatto S. C. E., Vendruscolo M., Nat. Chem. Biol. 2017, 13, 339–342. [DOI] [PubMed] [Google Scholar]
  • 75. Ward J. J., Sodhi J. S., McGuffin L. J., Buxton B. F., Jones D. T., J. Mol. Biol. 2004, 337, 635–645. [DOI] [PubMed] [Google Scholar]
  • 76. Tompa P., Trends Biochem. Sci. 2012, 37, 509–516. [DOI] [PubMed] [Google Scholar]
  • 77. Asher G., Reuven N., Shaul Y., BioEssays 2006, 28, 844–849. [DOI] [PubMed] [Google Scholar]
  • 78. van der Lee R., Lang B., Kruse K., Gsponer J., de Groot N. S., Huynen M. A., Matouschek A., Fuxreiter M., Babu M. M., Cell Rep. 2014, 8, 1832–1844. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Wright P. E., Dyson H. J., Curr. Opin. Struct. Biol. 2009, 19, 31–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. http://protdyn-fuzpred.org.
  • 81. Tsvetkov P., Reuven N., Shaul Y., Nat. Chem. Biol. 2009, 5, 778–781. [DOI] [PubMed] [Google Scholar]
  • 82. Wu H., Fuxreiter M., Cell 2016, 165, 1055–1066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Banani S. F., Lee H. O., Hyman A. A., Rosen M. K., Nat. Rev. Mol. Cell Biol. 2017, 18, 285–298. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.

Supplementary


Articles from Chembiochem are provided here courtesy of Wiley

RESOURCES