Skip to main content
Intrinsically Disordered Proteins logoLink to Intrinsically Disordered Proteins
. 2015 Mar 17;3(1):e1027032. doi: 10.1080/21690707.2015.1027032

The intrinsic disorder alphabet. III. Dual personality of serine

Vladimir N Uversky 1,2,3,4,*
PMCID: PMC5314895  PMID: 28232888

Abstract

Proteins are natural polypeptides consisting of 20 major amino acid residues, content and order of which in a given amino acid sequence defines the ability of a related protein to fold into unique functional state or to stay intrinsically disordered. Amino acid sequences code for both foldable (ordered) proteins/domains and for intrinsically disordered proteins (IDPs) and IDP regions (IDPRs), but these sequence codes are dramatically different. This difference starts with a very general property of the corresponding amino acid sequences, namely, their compositions. IDPs/IDPRs are enriched in specific disorder-promoting residues, whereas amino acid sequences of ordered proteins/domains typically contain more order-promoting residues. Therefore, the relative abundances of various amino acids in ordered and disordered proteins can be used to scale amino acids according to their disorder promoting potentials. This review continues a series of publications on the roles of different amino acids in defining the phenomenon of protein intrinsic disorder and represents serine, which is the third most disorder-promoting residue. Similar to previous publications, this review represents some physico-chemical properties of serine and the roles of this residue in structures and functions of ordered proteins, describes major posttranslational modifications tailored to serine, and finally gives an overview of roles of serine in structure and functions of intrinsically disordered proteins.

Keywords: amidase, carboxylesterase, intrinsically disordered protein, protein folding, protein structure, posttranslational modification, serine hydrolase, serine protease

Introduction

Proteins are the major building blocks of life. Although for the long time it was believed that any protein-based function should rely on unique protein structure, the modern understanding of the biologically active proteins is extended by consideration of the intrinsically disordered proteins (IDPs) and hybrid proteins containing both ordered domains and intrinsically disordered protein regions (IDPRs).1,2 These proteins play a number of important roles in various cellular processes.3-17 They are highly abundant in nature,18-22 have intriguing structural and functional properties,23 and are commonly involved in the pathogenesis of various diseases.7,24-29 IDPs/IDPRs are promiscuous binders, being known to interact in various protein-protein, protein-nucleic acid, and protein-small molecule interactions. Many IDPs/IDPRs are able to undergo at least partial disorder-to-order transition caused by their interactions with binding partners.3-6,9,17,30-36 In molecular recognition events, some IDPs/IDPRs are known to interact with several unrelated binding partners; i.e., are engaged in so-called one-to-many binding, and are able to fold differently at binding to different binding partners.16,30,37 The binding modes attainable by IDPs/IDPRs range from the formation of highly stable complexes, to action as dynamic, sensitive, controllable, and regulatable “on-off” switches in signaling networks, to interactions associated with constant “bound-unbound” transitions, to highly specific interactions without very high affinity.4,14 Although some partial folding during the IDP/IDPR-based interactions is very common,31,32 many other IDPs/IDPRs are involved in the formation of “fuzzy complexes,” where an IDP/IDPR keeps a certain amount of disorder in its bound state.35,38-40 Often, not the entire disordered chain is involved in the partner binding. Instead IDPs/IDPRs might contain specific interacting regions, which are only loosely structured in their unbound forms, but gain specific structure at binding.30,31,41,42 As a rule, IDPs/IDPRs return to their intrinsically disordered state after the completion of a particular function. All this constitutes an important arsenal of the unique physiological properties of IDPs/IDPRs that determines their ability to exert different functions in different cellular contests according to a specific conformational state.23

Although amino acid sequences code for both ordered proteins/domains and IDPs/IDPRs, the corresponding sequence codes are dramatically different.4,20,43 The aforementioned differences between amino acid sequence codes for order and disorder are many, with the mot general being amino acid compositions. In fact, in comparison to ordered proteins/domains, IDPs/IDPRs are globally depleted in order-promoting residues (cysteine, tryptophan, isoleucine, tyrosine, phenylalanine, leucine, histidine, valine, asparagine, and methionine) being instead enriched in disorder-promoting residues (lysine, glutamine, serine, glutamic acid, and proline).4,20,23,43-45

Serine is the third most common disorder-promoting residues. Figure 1 and Table 1 represent the result of a statistical analysis of the amino acid content of proteins in 4 standard datasets, such as DisProt,46 UniProt,47 PDB Select 25,48 and surface residues.45 The mean contents of serine in these data sets are 8.65 ± 0.43%, 6.83 ± 0.04%, 6.19 ± 0.06%, and 6.87 ± 0.13%, respectively (http://cprofiler.org/help.html).45 In other words, the content of serine residues in IDPs/IDPRs exceeds those of the averaged proteins in UniProt, the ordered proteins in PDB, and the surfaces of ordered proteins by the factor of 1.27, 1.40, and 1.26, respectively.

Figure 1.

Figure 1.

Amino acid determinants defining structural and functional differences between the ordered and intrinsically disordered proteins. Fractional difference in the amino acid composition (compositional profile) between the typical IDPs from the DisProt database46 and a set of completely ordered proteins48 calculated for each amino acid residue. The fractional difference was evaluated as (CDisProt-CPDB)/CPDB, where CDisProt is the content of a given amino acid in a DisProt databse, and CPDB is the corresponding content in the dataset of fully ordered proteins. Positive bars correspond to residues found more abundantly in IDPs, whereas negative bars show residues, in which IDPs are depleted. Amino acid types were ranked according to their decreasing disorder-promoting potential.44

Table 1.

Amino acid compositions of the standard data sets (modified from ref.45)

Residuea Disorder propensityb SwissProtc PDB S25d Surface residuese DisProtf
Pro (P) 1.000 4.83 ± 0.03 4.57 ± 0.05 5.63 ± 0.10 8.11 ± 0.63
Glu (E) 0.781 6.67 ± 0.04 6.65 ± 0.07 8.70 ± 0.17 9.89 ± 0.61
Ser (S) 0.713 6.83 ± 0.04 6.19 ± 0.06 6.87 ± 0.13 8.65 ± 0.43
Gln (Q) 0.665 3.95 ± 0.03 3.95 ± 0.05 5.21 ± 0.09 5.27 ± 0.37
Lys (K) 0.588 5.92 ± 0.05 6.37 ± 0.08 9.75 ± 0.16 7.85 ± 0.45
Ala (A) 0.450 7.89 ± 0.05 7.70 ± 0.08 6.03 ± 0.13 8.10 ± 0.35
Gly (G) 0.437 6.96 ± 0.04 7.16 ± 0.07 7.06 ± 0.11 7.41 ± 0.40
Asp (D) 0.407 5.35 ± 0.03 5.83 ± 0.05 8.18 ± 0.10 5.80 ± 0.30
Thr (T) 0.401 5.41 ± 0.02 5.63 ± 0.05 6.08 ± 0.11 5.56 ± 0.24
Arg (R) 0.394 5.40 ± 0.04 4.93 ± 0.06 6.56 ± 0.13 4.82 ± 0.23
Met (M) 0.291 2.38 ± 0.02 2.22 ± 0.04 1.13 ± 0.04 1.87 ± 0.10
Asn (N) 0.285 4.13 ± 0.04 4.58 ± 0.06 6.23 ± 0.15 3.82 ± 0.27
Val (V) 0.263 6.73 ± 0.03 6.72 ± 0.06 4.01 ± 0.06 5.41 ± 0.44
His (H) 0.259 2.29 ± 0.02 2.41 ± 0.04 2.60 ± 0.06 1.93 ± 0.11
Leu (L) 0.195 9.65 ± 0.04 8.68 ± 0.08 5.11 ± 0.08 6.22 ± 0.25
Phe (F) 0.117 3.96 ± 0.03 3.98 ± 0.04 2.38 ± 0.05 2.44 ± 0.13
Tyr (Y) 0.113 3.03 ± 0.02 3.50 ± 0.04 3.58 ± 0.08 2.13 ± 0.15
Ile (I) 0.090 5.90 ± 0.04 5.61 ± 0.06 2.77 ± 0.07 3.24 ± 0.13
Trp (W) 0.004 1.13 ± 0.01 1.44 ± 0.03 1.33 ± 0.05 0.67 ± 0.06
Cys (C) 0.000 1.50 ± 0.02 1.74 ± 0.05 0.78 ± 0.04 0.80 ± 0.08

aResidues are arranged according to their decreasing intrinsic disorder propensity;

bDisorder propensity is calculated based on the fractional difference in the amino acid compositions between the disordered and ordered proteins;

cSwissProt 51 is the set closest to the distribution of amino acids in nature among the 4 datasets;47

dPDB Select 25 is a subset of proteins from the Protein Data Bank with less than 25% sequence identity, biased toward the composition of proteins amenable to crystallization studies;48

eSurface residues determined by the Molecular Surface Package over a sample of PDB structures of monomeric proteins suitable for protein surface analysis;

fDisProt 3.4 comprised of a set of experimentally determined disordered regions.46

This article continues a series of publications on the intrinsic disorder alphabet. The goal of this series is the exploration of the amino acid determinants of protein intrinsic disorder. Some functions of serine in IDPs/IDPRs and in ordered proteins/domains are reviewed below to illustrate the panoply of serine-specific functions in disordered proteins and regions.

Structural properties of serine

Serine (Ser, S, see Fig. 2) is one of the 20 major amino acids found in proteins. Being encoded by the standard genetic code, serine is unique among the amino acids in that it has 6 codons, from 2 distinct groups: the TCN group (TCA, TCC, TCG, and TCT) and the AGY group (AGC and AGT).49 Since there is a noticeable bias toward TCN codons in conserved ancient essential proteins, it is believed that evolutionary the TCN group predates the AGY group.50 Serine has a chemical formula HO2CCH(NH2)CH2OH and is considered as a polar residue due to the presence of the hydroxyl group. The molecular mass of this nutritionally nonessential (dispensable) amino acid is of 105.09 Da, whereas the molecular mass of Ser residue is 87.09 Da. Serine is a water soluble (up to 250 mg/ml), uncharged amino acid that has a pKa of the carboxyl and amino groups of 2.21 and 9.15, respectively. The volume of serine residue is of 73 Å3 and its surface is of 115 Å2 (http://www.rfdn.org/bioinfo/APDbase.php). The side chain of serine contains one methylene group and the hydroxyl functional group (see Fig. 2), and serine is one of the 2 biological amino acids with a hydroxyl substituted side chain. Therefore, although serine has a non-charged side chain, it is a polar amino acid that often serves as the nucleophilic amino acid at the active sites of several enzymes. Serine residue has a non-polar surface of 56 Å2, and the estimated hydrophobic effect associated with the burial of this residue is 1.40 kcal/mol, whereas the estimated hydrophobic effect associated with the burial of side chain of this residue is 0.2 kcal/mol.51 In ordered proteins, serine residues are predominantly located on protein surface so that they have access to the solvent. In fact, 70% (375 of 533) of serine residues in analyzed structures of 9 folded proteins are classified as exposed since they have solvent exposed areas of >30 Å2, and 30% of serines in folded proteins possess solvent exposed areas of <10 Å2 and therefore are buried.52

Figure 2.

Figure 2.

Chemical structure of L-serine.

Some evolutionary properties of serine

According to François Jacob, protein evolution is much more similar to the tinkering process than to the free design,53 since “nature does not create a new protein function from a blank canvas nor with unlimited resources, but instead evolves through innovation with existing proteins.”54 Although this “evolutionary tinkering” hypothesis suggested that the probability of mutation is equal for all amino acid residues, recent study based on the analysis of the reconstructed ancestral sequences combined with the computational evaluation of the mutational probabilities of all the amino acid residues revealed that the mutational activities of different residues are highly divergent.54 This analysis revealed that serine evolves very fast by mutating very often and serving as a more likely target of mutations. Therefore, serine is the fastest-evolving amino acid residue that serves as a mutational hot spot being characterized by the highest mutability and high targetability.54 The authors argued that serine is very close in mutational space to most other amino acid residues, being, in most cases, only one nucleotide mutation away. Also, it less likely that the amino acid substitution to serine will be rejected by selection since serine does not possess a bulky or charged side chain, being characterized by moderate physico-chemical properties (see below).54 Despite the fact that serine clearly serves as fast-evolving mutational hub, the mutational activity of this residue in the phosphorylation sites is dramatically suppressed, indicating that, in special circumstances, the mutational properties of amino acid residues are modulated by the cells to prevent functionally important serine residues from evolving as fast as they would under normal circumstances.54

It has been also emphasized that serine is one of the several residues (together with cystein, metionine, histidine, and phenylalanine) that are being accrued significantly in the human proteome.55-57 Finally, based on the analysis of frequencies of mutations in the 1000 Genomes data and in databases collecting information on inherited diseases associated with variants, an important conclusion has been made that serine is involved in proportionally more disease-related mutations than polymorphisms.57

Biological significance of free serine

Metabolic significance of L-serine

Although L-serine is typically classified as one of the nutritionally nonessential (dispensable) amino acids, it is clearly indispensible metabolically due to its multifactorial involvement in a number of essential cellular processes.58 For example, serine, that can be synthesized in the body from an intermediate of the glycolytic pathway, 3-phosphoglycerate, serves as the major source of the one-carbon units required for the methylation reactions that take place via the generation of S-adenosylmethionine.58 Serine serves as a building block for amino acids (e.g. glycine, cysteine), taurine, phospholipids, and is important for the synthesis of purines, pyrimidines, tryptophan (in bacteria), and heme.59 For example, serine hydroxymethyltransferase is responsible for the conversion of serine to glycine (retro-aldol cleavage) and simultaneous conversion of tetrahydrofolate to N5,N10-methylene-tetrahydrofolate (hydrolysis).59 Since the retro-aldol cleavage reaction is reversible, the serine hydroxymethyltransferase controls the interconversion of serine and glycine. Furthermore, the N5,N10-methylene-tetrahydrofolate generated during the tetrahydrofolate hydrolysis serves as a precursor for the 5-methyl-tetrahydrofolate, which is an intermediate in the homocysteine methyltransferase (methionine synthase)-catalyzed methylation of homocysteine to methionine.58 Also, serine is involved in the “transsulfuration pathway” leading to cysteine synthesis.58 Here, cystathionine β-synthase first catalyzes the cystathionine synthesis from serine and homocysteine, and then cystathionine γ-lyase split cystathionine into cysteine and α-ketobutyrate.58 Therefore, there is an intricate metabolic relationship between serine metabolism, methionine, glycine, homocysteine, cystathionine, and cysteine.59

Because of this interlink, a decreased synthesis of serine is known to be accompanied by the decreased concentrations of serine, glycine, and 5-methyltetrahydrofolate.59 Decreased levels of serine production in humans are associated with the serine deficiency disorders originating from the inborn errors of serine metabolism (e.g., caused by the inherited deficiency in the 3-phosphoglycerate dehydrogenase (3PGDH), an enzyme playing a crucial role at the initial stages of the de novo serine biosynthesis from 3-phosphoglycerate). These serine deficiency disorders are manifested by the major neurological symptoms such as congenital microcephaly, seizures, psychomotor retardation or polyneuropathy.59 On the other hand, an increased activity of 3PGDH was reported in human colon carcinoma, rat sarcoma, and rat hepatoma cell lines during the proliferative phase.58,60–62 More recently, the markedly higher expression of the 3PGDH gene was found in several breast cancer cell lines and in estrogen receptor-negative breast cancers.58,63

Curiously, it has been shown that L-serine can also act as a glia-derived trophic factor that strongly supports the survival and differentiation of neurons.64 Another illustration of the indispensable biological role of L-serine is the fact that this amino acid serves as the only source for the formation of D-serine, which is known to play a critical role as a neuromodulator in the brain.58

Biological roles of D-serine

The fact that mammalian brain tissues contain substantial quantities of free D-serine was reported for the first time in 1992.65 D-serine concentrations in the mammalian CNS and peripheral tissues are unexpectedly high, non-uniformly distributed, vary with age,66 typically D-serine concentrations amount one-third of L-serine concentrations,67 but sometimes even exceed the concentrations of the L-enantiomer.68 Free D-serine, being localized predominantly to the forebrain, shows highest levels in the cerebral cortex, hippocampus, and striatum, followed by the limbic forebrain, diencephalon, and midbrain and is present at low levels in the pons, medulla, cerebellum, and spinal cord.68–72 The exceptionally high levels of free endogenous D-serine are found in the brain of mammals throughout their life, whereas peripheral tissues, blood serum, saliva, urine, and retina contain low or trace levels of this D-amino acid.73 Concentrations of free D-serine in human serum were suggested to correlate with damaged renal function.74 In the brain, D-serine is selectively localized to a subtype of glial cells known as protoplasmic type II astrocytes that enclose nerve terminals and are particularly enriched in the gray matter of the cerebral cortex, hippocampus, anterior olfactory nucleus, olfactory tubercle, and amygdala.75-77

Since one of the major pathways for the acquiring of D-serine is the serine racemase (SR)-catalyzed biosynthesis from L-serine, D-serine concentrations are highly correlated with the L-serine and glycine concentrations.66,78 The concentration of free D-serine in the brain depends on the activity of 2 enzymes: CR, which is responsible for both the synthesis and degradation of D-serine and D-amino acid oxidase (DAO) catalyzing D-serine degradation.79 Distribution of free D-serine in the brain correlates with the regional distribution of the N-methyl-D-aspartate receptors (NMDArs), suggesting their functional relationship.66 NMDArs are ionotropic receptor channels known to play an important role in glutamatergic synaptic transmission. They require simultaneous binding of glutamate molecule and co-agonist glycine at 2 different sites for activation. D-serine is known to act as a selective and potent agonist for the “glycine site” of NMDArs.66 Therefore, the NMDAr activity can be regulated by endogenous D-serine. Binding of glycine or D-serine to the “glycine site” of the NMDAr allows glutamate to open the ion channel, but also it decreases desensitization of the receptor.66 The biological importance D-serine is further supported by the fact that the enhanced stimulation of NMDArs was shown to be related to a number of neuropsychiatric disorders, such as schizophrenia and bipolar disorder, and to several acute and chronic neurodegenerative diseases, such as chronic pain, polyneuropathies, amyotrophic lateral sclerosis (ALS), stroke, epilepsy, Huntington's disease (HD), Alzheimer disease (AD), and Parkinson disease (PD).66,80

Serine in structures of ordered proteins

Serine is a short-chain, hydroxyl group-containing residue. High chemical reactivity of this residue defines its versatile roles as active site residue, as well as an important site of several posttranslational modifications (e.g., phosphorylation and O-glycosylation). The hydroxyl group of serine can serve both as a donor or an acceptor of a hydrogen bond, and this residue is commonly involved in hydrogen bond formation with solvent or backbone groups.81 Curiously, due to its short side chain and the ability to form a hydrogen bond with the protein backbone, serine is effectively mimicking proline. Therefore, serine can be found both on the protein surface and within the interior of a protein. Also, based on the analysis of the structural plasticity (or conformational flexibility) of various amino acid residues defined as the number of conformations exhibited by each residue in a set of known protein structures, it has been concluded that in addition to glycine and alanine, the list of highly flexible residues includes serine, threonine, aspartate, and lysine. Such high conformational plasticity of serine, threonine, aspartate, and lysine was explained by the involvement of these residues in strong side-chain interactions with other residues inside proteins.82,83

Hydrogen bonds between donor and acceptor groups in the protein backbone are the major determinants of the protein secondary structure. However, due to their spatial orientation relative to the backbone and the ability to affect backbone motions, different amino acids can favor or disfavor the formation of different secondary structure elements, such as α-helices, β-sheets, β-turns, and loops. Among the residues known to favor α-helices are alanine, cysteine, leucine, methionine, glutamic acid, glutamine, histidine, and lysine; β-structure is favored by valine, isoleucine, phenylalanine, tyrosine, tryptophan, and threonine; whereas β-turns are the place for the abundant presence of serine, glycine, proline, asparagine, and uncharged aspartic acid. Importantly, serine is one of the most unfavorable residues in the middle parts of α-helices due to its ability of making a second shared hydrogen bond to the peptide >C=O group of the residue n-3, thereby causing noticeable destabilization of an α-helix.84

One should keep in mind however, that although being positioned at the middle of an α-helix serine serve as helical destabilize, this residue might favorably contribute to the α-helix stability when found at the helical ends. This is due to the fact that α-helices found in peptides and proteins are not homogeneous entities, since although all the main chain hydrogen bonding groups within the helix body are satisfied by the secondary structure formation, each end of the helix has 4 unsatisfied groups that are either hydrogen bonded to solvent or to the side chain of the so-called cap residue.84 In other words, these first >N‒H groups and last 4 >C=O groups of an α-helix differ from the remaining residues in this helix by being unable to make intra-helical hydrogen bonds and therefore are involved in hydrogen bonding with alternative partners.85-87 Physico-chemical and statistical analysis suggested that certain residues are more preferable at the C- and N-termini of an α-helix, serving as the helical C- and N-caps.86 A global analysis of proteins with known structure revealed the presence of at least 7 distinct capping motifs. Three of these motifs were found at the helix N-terminus and 4 – at the C-terminus.87 One of the most common N-capping motifs is the Ser-X-X-Glu.87–89 Thermodynamic analysis of this Ser-X-X-Glu motif from the GCN4 leucine zipper dimer revealed that helix capping might play a significant role in protein folding since the free energy of helix stabilization associated with the hydrogen-bonding and hydrophobic interactions in this capping structure is −1.2 kcal/mol.88 Since the hydroxyl group of serine can serve as an acceptor of hydrogen bond from the free NH-groups in the first turn of an α-helix,84 this residue is commonly found in N-caps of α-helices, where serine side chain adopts the gauche rotamer and forms a hydrogen bond to the >N‒H group of the residue N3 in the N to C direction.90 Based on their N-cap reference, the amino acids were divided into two groups, the good N-cap residues (Asn, Asp, Ser, Thr, and Cys) and the bad N-cap residues (Gly, Ala, Val, Leu, Ile, Phe, Tyr, Trp, Pro, Cystine, Met, Lys, Arg, His, Glu, and Gln).90 It was pointed out that the good N-cap residues Ser, Thr, Asp, and Asn define the presence of strong structural constraints at a helix N-cap are greater than structural constraints found at any position yet examined in a protein.90 These good N-cap residues are characterized by a restricted range of allowed ψ values when at the N-cap.90

Curiously, serine was found in some C-capping motifs too.91 It was pointed out that the short side chain of serine does not contain enough non-polar atoms to be engaged in strong hydrophobic interactions and these relatively weak hydrophobic interactions are frequently compensated for by a local hydrogen bond of this side-chain back onto the helix (typically with one of the 4 nearest neighboring residues along the sequence).91 Finally, a systematic analysis of α-helical propensity of a series of dodecapeptides containing alanine, asparagine, aspartate, glutamine, glutamate, and serine at the N-terminus and arginine, lysine, and alanine at the C-terminus, it was concluded that the α-helix-stabilizing abilities of these residues can be ranged as follows: aspartate > asparagine > serine > glutamate > glutamine > alanine at the N-terminus and arginine > lysine > alanine at the C-terminus.92

As it was already mentioned, serine is a perfect residue for tight turns, where it can make hydrogen bonds to the neighboring backbone >N‒H and >C=O groups.81 However, the most common location of serine in ordered proteins is within the solvent-exposed parts of turns and loops, where this residue interacts with solvent.81 Also, serine is abundantly found within the disordered regions of ordered proteins primarily due to its low hydrophobicity, low restrictions in backbone conformations, and the ability to readily interact with solvent.81

It was emphasized that when located in the active site of several enzymes (see below), catalytic serine often serves as a nucleophile.93 Structurally, this nucleophile is always located in a very sharp turn at the tip of the strand-turn-helix motif, called the “nucleophile elbow” or “serine elbow.” This nucleophile elbow is positioned so that it can be easily approached by the substrate, as well as by the hydrolytic water molecule.93 The nucleophile elbow is identified by the consensus sequence Sm-X-Nu-X-Sm (Sm = small residue, X = any residue and Nu = nucleophile). The important consequences of the tightness of this functional strand-turn-helix motif are the presence of noticeable steric restrictions on residues located in the proximity of the nucleophilic amino acid residue and the need for the nucleophilic amino acid residue to adopt energetically unfavorable main chain torsion angles.93 As a result, in Ramachandran plot, such nucleophiles are located within the area with unfavorable angles.94

Finally, being a polar residue, serine is expected to positively contribute to protein solubility. This hypothesis was validated by the analysis of the solubility-changing substitutions in proteins, which revealed that serine, together with glutamic acid and aspartic acid, contributes more favorably to protein solubility than other hydrophilic residues (asparagine, glutamine, threonine, lysine, and arginine).95

Serine and functions of ordered proteins

Serine in the active sites of serine hydrolases

Serine hydrolases constitute one of the largest known enzyme family with a very broad range of biological activities.96 In the human proteome the serine hydrolase superfamily comprises almost 200 enzymes. These proteins are present in all organisms, being active in different cellular compartments. The class of serine hydrolases includes serine proteases or serine endopeptidases involved in various processes including the coagulation cascade.97 There are also amidases responsible for the metabolism of endogenous signaling molecules,98 penicillin-binding proteins responsible for antibiotic sensitivity,99 and carboxylesterases related to the hydrolytic biotransformation of a vast number of structurally diverse drugs.100 Some other serine hydrolases are intracellular and extracellular lipases catalyzing the hydrolysis of lipids,101 thioesterases including fatty acid synthase102 and acyl-CoA thiesterase,103 cholinesterases hydrolyzing the neurotransmitter acetylcholine into choline and acetic acid,104 and phospholipases catalyzing hydrolysis of phospholipids into fatty acids and other lipophilic entities.105 This list would be incomplete without mentioning of protein and glycan hydrolases, such as protein phosphatase methylesterase 1,106 acyloxyacyl hydrolase,107 and sialic acid acetylesterase,108 and several peptidases, such as prolylendopeptidase,109 dipeptidyl peptidase 4,110 and fibroblast activation protein.111

Since serine hydrolases are many, and since they have an enormous variety of functions and biological roles, being able to handle a wide array of structurally diverse substrates, the detailed analysis of structures and functions of these important proteins is definitely outside the scopes of this review. However, all of them, despite their immense structural and functional variability, have one very important feature in common, a feature that serves as a defining hallmark of this protein superfamily. This common trait is the presence in their active sites of a nucleophilic serine that is used for the hydrolysis of various substrates. The very peculiar feature of this nucleophilic serine is that it is typically activated by a proton relay involving an acidic residue (glutamic or aspartic acid) and a basic residue (typically histidine). The corresponding functional motifs are known as catalytic triads. Serine-containing catalytic triads can be grouped into Ser/His/Asp, Ser/His/His, Ser/His/Glu, Ser/Glu/Asp, Ser/Nterm, and Ser/cisSer/Lys.112-114 Besides the aforementioned triads, active sites of some serine hydrolases contain catalytic dyads, such as Ser/Lys and Ser/His.113-115

Serine and structure of IDPs/IDPRs

Although catalytic functions of many ordered proteins/domains clearly require the presence of serine residues in their active sites, and also serine residues provide crucial means for the specific regulation of these proteins/domains via various posttranslational modifications, such functionally/regulationally important serine residues, being strategically placed within the corresponding protein structures, are not too many. From the structural viewpoint, serine is not included to the list of residues favoring β-structure (valine, isoleucine, phenylalanine, tyrosine, tryptophan, and threonine) or α-helices (alanine, cysteine, leucine, methionine, glutamic acid, glutamine, histidine, and lysine), being, actually, one of the most unfavorable residues in the middle parts of α-helices. The two logical places for this residue to be needed for structure of ordered proteins are tight turns (which are indeed favored by serine, glycine, proline, asparagine, and uncharged aspartic acid) and helical caps. On the other hand, it is unlikely that protein (or protein region) containing excessive amount of serines would be able to spontaneously fold.

In agreement with this hypothesis, computational analysis of 46 extracellular proteins involved in complex carbohydrate degradation from Microbulbifer degradans that contain polyserine linkers (PSLs) revealed that these PSLs were not predicted to have a regular secondary structure being instead extended ‘loopy’ regions.116 This situation is further complicated by the ability of serines to be variously modified by numerous PTMs. For examples, the aforementioned heavily glycosylated Ser/Thr(/Pro)-rich domains serving as the protease-resistant spacers in some cell surface glycoproteins were proposed to have a ‘bottle brush’-like structure.117 Furthermore, despite the fact that serine is one of the polar residues polyserine is not water-soluble at chain lengths greater than 20 residues.118

Serine and functions of IDPs/IDPRs

Serine as a part of the PEST motifs

Many proteins involved in signaling pathways that control cell growth, differentiation, stress responses and physiological cell death contain specific degradation signals known as PEST motifs, since the sequences are enriched in proline (P), glutamic acid (E), serine (S), and threonine (T).119-122 These PEST motifs are crucial for controlled degradation of regulatory proteins, since these motifs define cellular instability of proteins carrying them by directing these proteins either to the ubiquitin-proteasome degradation or to the calpain cleavage.121,122

As typical for important regulatory regions in proteins, PEST-containing sequences are solvent exposed, conformationally flexible, and not visible in X-ray structures.119 In agreement with these observations, comprehensive computational analysis revealed that these PEST motifs, being disordered by themselves, are most frequently found within the IDPRs.121

Serine in entropic bristle domains

Some highly mobile protein regions were assigned the entropic chain activities that can be attributed to their entropic bristle domains (EBDs). Contrarily to the structurally stable domains of ordered proteins, the intrinsically disordered EBD is defined by a time-averaged occupancy of space by a polypeptide chain and therefore represents a time-average 3D region defined by the thermally driven motion of a certain protein region.123 Such EBDs can exclude lager molecules while allowing small molecules and water to move freely through it.123

Obviously, to serve as an EBD, a given fragment of a protein should be highly disordered and have specific amino acid composition. One of the illustrative examples of biologically active EBDs is given by the side-arms of neurofilament (NF) proteins.124 In fact, analysis of the amino acid sequence of the porcine NF medium polypeptide revealed that this protein has several peculiar features,125 including the presence of a highly disordered C-terminal tail (∼500 residues) with unique amino acid composition characterized by a high content of lysines (17.6%), glutamic acids (26.8%), valines (8.6%), and serines (7.5%).125 Although disordered state of this EBD is supported by the high level of charged residues, the abundance of serines should be noted too. In human NF-M, there are 89 serines (9.7%), 39 of which are located within the C-terminal tail (the last 504 residues). Similarly, human NF-H (a polypeptide comprising 1,026 residues) has 88 serines, 57 of which are found within the 613 residues-long C-terminal tail of this protein. In human NF-L that has 543 residues, there are 57 serine residues, 16 of which are located within the C-terminal tails of this protein (the last 147 residues). In addition to neurofilament polypeptides, EBDs were found in microtubule-associated protein 2 (MAP2)126 and NuMa.127 Analysis of the amino acid compositions of these proteins revealed that they follow the trend established by NFs and contain significant amount of serine residues (186 out of 1,827 residues in human MAP2 are serines, and there are 162 serines in the 2,115 residues-long human NuMa). The high abundance of serine residues in EBDs is not too surprising since serine is not only one of the hydrophilic residues, but is also characterized by a very high conformational variability.82

In biotechnology, EBDs were proposed to be used as protein solubility enhancers, and the translational fusion of a wide variety of natural and artificial EBDs represents an effective solubilizing tool.128 Among the more successful solubilizers were artificial EBDs containing the most disorder-promoting residues (Glu, Pro, Gln, and Ser) in the proportion Glu:Pro:Gln:Ser = 2:2:1:1; i.e., sequences containing >16% serine residues, as well as EBDs with a Asp:Glu:Pro:Gln:Ser:Gly = 1:2:2:1:2:1 composition (i.e., EBDs containing >22% serine residues), where a larger subset of disorder-promoting residues was added to avoid potential issues with expression problems associated with high levels of sequence redundancy.128

The aforementioned intrinsically disordered solubilizers, entropic bristles, are a member of a family of protein tags with a wide spectrum of biotechnological applications. This important subject was covered in a recent comprehensive review published in this journal, where a detailed description of various protein tags, both ordered and intrinsically disordered, was provided.129 The authors emphasized that the disorder content in these tags can range from 6 to 100%, and that less structure often means more function. They also showed that the protein tags with the most disorder can be used not only for the increase in the solubility of target proteins as the aforementioned entropic bristles do, but also can serve as important tools for extending half-life of proteins and for characterizing their biological properties, e.g., binding.129 An incomplete list of the disordered serine-containing tags shown to increase the half-life of a protein of interest inside the cell includes an unstructured recombinant polypeptide of 864 amino acids with the PESTAG composition, called XTEN,130,131 serine-glycine-containing HAP132 and proline-alanine-serine-containing PAS tags,133 as well as tags containing PESAK, PASTDH, and PASTD repeats.134,135

Some functions of serine-rich proteins

To get a clue on what proteins with high serine content can be used for biologically, functions of several proteins with polyserine regions are outlined below. Although for a long time, polyserine sequences have been ignored as the non-functional result of transcription slippage, because these and other homopolymer sequences are typically encoded by trinucleotide repeats at the DNA level,49 data overviewed below clearly shows that polyserine regions not only serve as flexible linkers, but have a rather diverse set of functions. Considered proteins either possess overall high serine content or have functionally important polyserine tracks (PSTs), polyserine linkers (PSLs), or other serine-rich regions. Notes below are arranged not by protein function or disorder contents, but according to the serine content in the corresponding proteins.

Polyserine linkers (PSLs) in bacterial proteins

Although domain likers composed predominantly of serine are rare, the analysis of Microbulbifer degradans proteome revealed the presence of 46 extracellular proteins containing polyserine linkers (PSLs) separating their carbohydrate-binding or catalytic domains from other binding domains.116 The common feature of these 46 M. degradans proteins and 65 PSL-containing bacterial proteins found in sequence depositories is their involvement in complex carbohydrate degradation. Based on these observations, it has been concluded that flexible, disordered nature of PSLs in these proteins enhances their functionality expanding the potential substrate target area available to the enzyme after a carbohydrate-binding module makes contact with a polymer or enhancing the substrate availability to an enzyme anchored to a bacterial outer membrane.116

Abundance and functions of proteins with long PSTs in human proteome

The search for human proteins containing PSTs of 10 residues or longer revealed 59 such proteins, containing a total of 66 serine homopolymers.49 Functional analysis of these proteins reveled that many of them are involved signaling, transcription regulation, DNA binding, or protein–protein interactions.49

Long serine homopolymer of Dictyostelid amoebae

The single-celled eukaryote Dictyostelium discoideum was recently shown to contain one of the longest serine-based homopolymers consisting of 306 tandem serine repeats.136 The authors emphasized that this gene has a paralog with 132 serine repeats and several orthologs with high serine repeat numbers, in various other Dictyostelid species, suggesting that this very long polyserine module is functional.136

Fibroins

Fibroins are insoluble structural proteins present in silk created by spiders and the larvae of many insects. Structurally, fibroin represents a supramolecular ensemble organized as layers of antiparallel β-sheets, tight packing of which defines the known rigid structure and tensile strength of silk. The major component of the fibroin is the recurrent amino acid sequence (Gly-Ser-Gly-Ala-Gly-Ala)n. Although this amino acid sequence feature of fibroins is rather conserved, some noticeable exceptions are known. Spiders can produce up to 7 different types of the protein-based silks/glues that have diverse physical properties.137 One of these 7 silks/glues is the egg case fibroin, tubuliform spidroin 1 (TuSp1). Analysis of the TuSp1 from the black widow spider, Latrodectus hesperus, revealed that this protein is composed of highly homogeneous repeats of 184 amino acids.137 This protein has multiple polyserine blocks and short polyalanine stretches. The overall serine content in the L. hesperus TuSp1 is very high 25.9%, and this high value is characteristic for TuSp1s from other organisms.

Profilaggrin

Human profilaggrin is a 4061 residue-long protein which is predicted to be almost 100% disordered. Filaggrin (FLG) and its precursor proFLG have a highly repetitive structure and a unique amino acid composition. For example, proFLG is characterized by very high serine content (24.1%). Among major biological functions of proFLG are its role as a precursor of FLG, a precursor of the natural moisturizing factor (NMF), a protein aggregating keratin intermediate filaments and promoting disulfide-bond formation among the intermediate filaments during terminal differentiation of mammalian epidermis, as well as multiple functions in skin homeostasis and barrier formation.138 ProFLG becomes heavily phosphorylated immediately after translation in the granular layer of epidermis and oral mucosa.139 This mature phosphorylated form of proFLG is insoluble and is stored in specific membrane-less keratohyalin granules.138 Therefore it is likely that the overly abundant serine residues in intrinsically disordered proFLG are used as easily accessible phosphorylation sites.

SRm160/300 splicing coactivator

The SRm160/300 splicing coactivator is important for the processing of a subset of constitutively spliced pre-mRNAs where it plays crucial role by promoting critical interactions between splicing factors bound to pre-mRNA, including snRNPs and SR family proteins.140 Being a component of the active spliceosome, SRm160/300 splicing coactivator is found in a pre-mRNA splicing complex with SFRS4, SFRS5, SNRP70, SNRPA1, SRRM1 and SRRM2. Sequence analysis revealed that similar to SRm160 (which has a serine content of 16.7%), the SRm300 (with very high serine content of 23.3%) contains domains rich in alternating serine and arginine residues, several unique repeated motifs rich in serine, arginine, and proline residues, and 2 very long (>30 consecutive residues) polyserine tracts.140

Mucin-16 and other human mucins

Mucin-16 or ovarian cancer-related tumor marker CA125 is a member of mucin family of proteins providing protective and lubricating barrier against particles and infectious agents at mucosal surfaces. Mucin-16 is a large protein (22,152 residues) with a strong bias toward serine (14.8%) and threonine (15.9%) residues. It has 3 serine-rich domains (residues 1638–3055, 3961–4385, and 7085–10,395) that contain 21.4%, 22.4% and 22.1% serines. It is predicted to be mostly disordered (with the amount of predicted disordered residues exceeding 75%). This protein is heavily glycosylated. It was already pointed out that mucins contain ∼80% carbohydrate and are characterized by the presence of numerous serine-based O-glycosylation sites.141 Therefore, even though not all mucins have officially defined serine-rich domains, it was not too surprising to find that the overall serine content in these proteins is high, systematically exceeding 11%, with some mucins having significantly large serine proportions (e.g., human mucin-12 (22.7%), mucin-17 (21.6%), mucin-3A (19.5%), mucin-22 (18.5%), etc.).

Gp40 protein of Cryptosporidium parvum

Cryptosporidium parvum is an intestinal apicomplexan parasite causing cryptosporidiosis, the diarrheal disease with the worldwide distribution.142 One of the proteins mediating sporozoite attachment and invasion of host cells is the 40 KDa glycoprotein (gp40), which is present in oocysts and sporozoites and is known to be shed from the parasite during invasion.143 The total serine content of gp40 is of 19.2%. Furthermore, it has been shown that the N-terminal region of gp40 (which is predicted to be highly disordered) contains a polyserine domain consisting of 19 contiguous serine residues that modulates the ability of the parasite to attach to and invade the host cells.143

Multifunctional cellulase CelAB

The endosymbiont Teredinibacter turnerae T7902T encodes a multifunctional cellulase CelAB, which binds cellulose and chitin, degrades multiple complex polysaccharides, and displays 2 catalytic activities, cellobiohydrolase and β-1,4(3) endoglucanase.144 This polyfunctionality of CelAB is determined by the presence of 2 catalytic and 2 carbohydrate-binding domains.144 Each domain of CelAB is separated by polyserine linker regions of 38, 55, and 54 residues in length, containing 32, 40, and 43 serine residues, respectively.144 Furthermore, the overal serine content of this protein is of 18.4%. It is possible that these polyserine linkers act similarly to the aforementioned PSLs in serve the M. degradans proteins,116 expanding the potential substrate target area available to the catalytic domains of CelAB after carbohydrate-binding domains binding to cellulose or chitin.

Human positive cofactor 4

A positive cofactor 4 (PC4) (which is also known as coactivator p15) is a member of the upstream stimulatory activity (USA)-derived cofactors that act complementary to the transcription factor IID (TFIID) machinery and are needed for the activator-dependent transcription in mammalian cells.145 PC4 is a 127 residue-long protein with serine-rich regions near the N-terminus,145 and the overall serine content of this protein is high (16.5%). PC4/p15 is able to bind both single-stranded and double-stranded DNA, and this DNA-binding activity is inhibited by the casein kinase II-catalyzed phosphorylation.146 Besides DNA binding, this protein is involved in interaction with the components of the TFIID complex.146 Using nuclear magnetic resonance (NMR) analysis of the full-length PC4 and its N- and C-terminal domains it has been concluded that although the C-terminal core domain (PC4ctd) is structured, the PC4 N-terminal domain (PC4ntd) is highly flexible and mostly unstructured.147 Furthermore, inter- and intramolecular interactions were shown to promote some local conformational changes in both the structured and unstructured subdomains.147

Human bHLHb5

Basic helix-loop-helix protein 5 (bHLHb5) is a member of the bHLHb family of transcription factors that are implicated in many aspects of neural development, including cell growth, differentiation, and cell migration.148 Being a transcription factor, bHLHb5 is predicted to be highly disordered and has high serine content (15.2%). Although human protein has a profound serine-rich region (residues 208–238) that incorporate a polyserine stretch (residues 226–234), only serine was found at the corresponding position in the mouse Bhlhb5 and hamster BETA3.148

Hox-D13

The HOX proteins constitute a highly conserved family of transcription factors encoded by HOX genes. Although these proteins are key regulators of embryonic development, they continue to be expressed throughout postnatal life.149 In human and mice, there are 39 human HOX genes which are located in 4 clusters (A-D) on different chromosomes and are assumed to have arisen from a primordial homeobox gene by duplication and divergence.149 In the genital tubercle (GT), HOXA13 and HOXD13 are the most highly expressed HOX genes.150 Mutations in HOXA13 and HOXD13 genes are known to cause syndromes affecting genitourinary development and defects of limb formation, such as synpolydactyly and hand-foot-genital syndrome.149 The products of the HOXA13 and HOXD13 genes, the Hox-A13 and Hox-D13 proteins, are similarly distributed in the GT suggesting that they may function in a redundant manner.150 As the majority of other transcription factors,151–153 both Hox-A13 and Hox-D13 are predicted to be highly disordered. The characteristic feature of the Hox-D13 proteins from different species ranging from amphibians (e.g., red spotted newt Notophthalamus viridescens) to marsupials (e.g., opossum and tammar), to mammals (e.g., human and mice) is the presence of high content of serine residues (>14%), with these residues being assembled into one or 2 polyserine regions.154

SRrp37

Analysis of proteins interacting with pNO40, a ribosomal 60S core subunit, by yeast 2-hybrid identified a novel SR-related protein, referred to as SRrp37.155 This protein serves as a splicing regulator located in the nuclear speckles and nucleoli155 and is predicted to be highly disordered. SRrp37 is involved in modulation of alternative pre-mRNA splicing with either 5' distal site activation or preferential use of 3' proximal site, and in case of infection by Herpes simplex virus (HSVI), this protein may act as a splicing inhibitor of HSVI pre-mRNA.155 The characteristic features of SRrp37 are high serine (13.5%) and arginine contents (12.8%) and the presence of 3 motifs, a serine-arginine (SR) dipeptide enriched region (190–220), a polyserine stretch (residues 258–282), and a potential nucleolar localization signal comprising a long array of basic amino acids (residues 286–312).155

Scaffolding protein piccolo

Piccolo is a 5065 residue-long scaffold protein, which is predicted to be highly disordered (it contains ∼85% disordered residues). High level of disorder is a characteristic feature of scaffold proteins that helps them to organize and maintain stable and dynamic protein complexes.156 Piccolo contains 12.4% serine residues and has 2 serine-rich regions (residues 4215–4278 and 4699–4765). The major function of piccolo is to serve as a scaffolding protein responsible for the organization of synaptic active zones, where it interacts with the cytomatrix at the active zone (CAZ)-associated structural protein (CAST) and other CAZ proteins, such as RIMs, Munc13s, Bassoon, and participates in synaptic vesicle trafficking.157,158

Adenomatous polyposis coli (APC) and its mutation cluster region (APC-MCR)

APC (adenomatous polyposis coli) is another large scaffold protein (2843 residues) which is massively enriched in serine residues that account for 15.3% of the entire sequence. APC serves as one of the central hubs involved in the spatiotemporal regulation of key Wnt signaling components.159,160 Wnt pathway involves 5 major proteins, Axin, CKI-α, GSK-3β, APC, and β-catenin, all of which were shown to contain several intrinsically disordered regions, which facilitate protein-protein interactions, post-translational modifications, and signaling.161 APC, which is predicted to be massively disordered (in fact, there are no globular domains in the 2000 amino acid long C-terminal region of APC162), is known to act as flexible concentrator that brings together all other proteins involved in the Wnt-pathway.161 Furthermore, intrinsically disordered APC helps the collection of β-catenin from cytoplasm, facilitates the β-catenin delivery to the binding sites on Axin, and controls the final detachment of β-catenin from Axin.161 APC is known to be mutated in around 80% of somatic colorectal cancer patients.163 The most frequently mutated regions of APC is known as mutation cluster region (APC-MCR, residues 1200–1550), which contains 19.1% serine residues and was shown to be highly disordered by several biophysical techniques.164

WNK1 Kinase

WNK [with no lysine (K)] protein kinases influence ion balance and are present in all multicellular and many unicellular organisms.165 Besides a protein kinase domain WNKs have no other currently known folded domains, but contain an autoinhibitory domain suppressing protein kinase activity, several 2 coiled-coil regions, and many protein-interaction motifs.165 An illustrative member of this interesting kinase family is a serine-rich WNK1 kinase, that contains 13% serines which are preferentially found within the 2000 residues-long, intrinsically disordered, serine-rich region. Although in resting cells, the majority of WNK1 is localized on cytoplasmic puncta, during cell division, this protein localizes to mitotic spindles.165

Lysyl-5 hydroxylase Jmjd6

In vertebrates, the hydroxylation of lysine residues in proteins involved in pre-mRNA splicing is catalyzed by an Fe(II)- and 2OG (2-oxoglutarate)-dependent oxygenase, jumonji-domain-containing protein 6 (Jmjd6),166–168 which is highly conserved throughout animal evolution and plays an important role in embryonic development.169 Loss of this protein function is associated with severe developmental defects and embryonic lethality.170 Jmjd6 has a total serine content of 11.2% and possesses several functional domains and conserved functional motifs, such as catalytic JmjC domain (residues Pro141–Arg305), 5 nuclear localization sequences (NLSs, residues 6–10, 91–95, 141–145, 167–170, and 373–378), an AT-hook motif (residues Lys300–Ser309), a SUMOylation site (Leu316–Glu319),171,172 and a polyserine region comprising 16 serine residues interrupted by 4 aspartate residues (Ser340–Ser359).169 Although the crystal structure for the human Jmjd6 has been resolved, only residues 1 to 334 of this protein were well defined, whereas the C-terminal serine-rich region (residues 335 to 403) was shown to be completely disordered.173 Several alternatively spliced variants of Jmjd6 were reported that lack the polyserine region,172 and functional analysis revealed that the alternatively spliced Jmjd6 without the polyserine domain was localized to the nucleolus, where it was involved in interaction with nucleolar proteins.169 These observations suggested that the Jmjd6 polyserine region might be related to the control of the nuclear/ nucleolar shuttling of this important protein.169

Polyserine tracts in vitellogenins: Insulated connectors

The female-specific, ancient egg-yolk precursor proteins known as vitellogenins (Vg) are found in most oviparous species.174 Although the major functions of Vgs are related to the lipid and ion transport during oogenesis, the honeybee (Apis mellifera) protein (AmVg) is also known to affect social behavior and life-span plasticity of this insect.175 Although the overall serine content of insect Vgs does not exceed that of an average IDP (∼8.5%), an important feature of insect Vgs is the presence of a repetitive and highly variable segment known as polyserine tract (PST) or polyserine linker (PSL), which connects conserved N-terminal and α-helical domains.175 The careful structural analysis of several PSTs from various Apocrita species revealed that highly pronounced sequence variation in this region is translated in noticeable structural variations.175 In fact, structurally, these PSTs ranged from highly extended and disordered conformations (e.g., the polyserine regions of honeybee Apis mellifera, bumblebee Bombus ignitus, and wasp Pimpla nipponica) to a more compact and helical (but still disordered) conformation as in the polyserine region of the jewel wasp Nasonia vitripennis.175 Furthermore, a casein kinase II (CKII) recognition motif was found in PSTs of insect Vgs and these regions were shown to undergo multiple phosphorylation by the CKII kinase.175 Since phosphorylated polyserine region was unexpectedly resistant to trypsin/chymotrypsin digestion, it has been concluded that the multi-site phosphorylation might shield the PST against unspecific cleavage.175

Dehydrin ERD14

The dehydrin family belongs to a class of Late Embryogenesis Abundant (LEA) proteins in plants that includes stress proteins accumulating in response to dehydration-related environmental stresses. Although the dehydrins’ expression in somatic cells is very low under normal conditions, it increases dramatically when dehydration is promoted by water stress, high salinity or cold. Members of the dehydrin family were shown to be intrinsically disordered both computationally128 and by experiment.176,177 One of the well-studied members of this family is the ERD14 protein (early response to dehydration 14) of Arabidopsis thaliana. Comprehensive structural analysis by NMR techniques revealed that although ERD14 is fully disordered under near native conditions, this protein has several short regions of somewhat restricted motion and 5–25% helical propensity.177 In agreement with its experimentally validated intrinsically disordered nature, amino acid sequence of ERD14 is highly enriched in glutamate (21.1%) and lysine residues (18.4%). Although the overall serine content of EDR14 is not too high (8.4%), it was pointed out that its polyserine domain (residues 76–90) serves as a primary target for the multi-site phosphorylation that activates calcium binding activityof this protein.178

Titin

Being the largest vertebrate protein, titin (in human - 34,350 residues) represents an indispensable component of the vertebrate striated muscles. Among numerous cellular functions of titin (also known as connectin) are contribution to the fine balance of forces between the 2 halves of the sarcomere which is crucial for the elasticity of muscle cells, as well as participation in chromosome condensation and chromosome segregation during mitosis of non-muscle cells. The ability of titin to reversibly extend relies on a set of PEVK segments, rich in proline (P), glutamate (E), valine (V), and lysine (K) residues. Although the overall fraction of serine residues in human titin is not too high (7.2%), this protein has a serine-rich domain (residues 34102–34244) with the serine content of 30.6%.

The H±-ATPase fromSaccharomyces cerevisiae

An electrogenic proton pump, the plasma membrane H+-ATPase from Saccharomyces cerevisiae, is involved in regulation of the intracellular pH and generation of the electrochemical proton gradient needed for the cell growth and development.179 Although the overall serine content of yeast H+-ATPase is relatively low (6.4%), the N-terminal tail of this protein contains a serine-rich cluster of 11 serine residues in the first 17 amino acids, including a stretch of 8 consecutive serine residues. Mutational and deletion analysis revealed that although this polyserine cluster was not important for the ATP hydrolysis, it definitely played an important role in the overall stability of this protein, since mutants with 8 or more serine residues converted to alanine showed significantly decreased abundance in the plasma membrane under stress conditions caused by the accelerated degradation rates.179

ICP4 of herpes simplex virus (HSV)

In herpes simplex virus type 1 (HSV-1), which is a member of the Alphaherpesvirinae, ICP4 is a large (175-kDa), homodimeric, DNA-binding phosphoprotein primarily found in the nucleus of the infected cell.180-182 As many other viral proteins,183 ICP4 is characterized by a diverse functionality, being required for the transcriptional activation of most of the essential early (E) and late (L) genes,184-188 acting as an expression repressor of latency-associated transcript (LAT),189-191 long/short transcripts (L/ST),192,193 and even serving as a regulator of own expression.182,184,194,195 DNA binding, dimerization, nuclear localization, and transcriptional activation of ICP4 are ascribed to discrete functional domains.182,194,196-198 Among those functional ICP4 domains, transcriptional activation is attributed to a large C-terminal domain (residues 775–1298) and to a relatively short serine-rich region near the ICP4 N-terminus (residues 143 to 210).182,197 The transcription activation of a protein with deleted transactivation C-terminal domain was noticeably altered,196,199-201 and subsequent deletion of the serine-rich N-terminal domain completely abrogated the ICP4 transactivation function.197,202,203

The overall serine content of ICP4 is relatively low (5.9%). However, among the important features of the serine-rich N-terminal domain are the presence of the polyserine tract (aa 184 to 198) and consensus sites for the cellular kinases protein kinase A (PKA), protein kinase C (PKC), and casein kinase II (CKII).204,205 Furthermore, this region is subjected for the in vitro autophosphorylation.182 Being poorly conserved among the alphaherpesviruses, this region always contain similar stretches of polyserine residues.182 Although for a long time, the functional significance of the polyserine motif was not known, based on the analysis of the viral growth in tissue culture and in neuronal and non-neuronal cells in vivo revealed that the polyserine tract supports an activity of ICP4 specifically required for growth in the sensory ganglia.182

E-cadherin

Calcium-dependent cell adhesion protein E-cadherin is an important player in the epithelial cell-cell adhesion that determines integrity and organization of any tissue, and loss of cell adhesion activates the detachment-induced apoptosis, anoikis.206,207 Being synthesized as a preproprotein, mature E-cadherin (residues 155–882 in human protein, UniProt ID: P12830) is a single-pass membrane protein with a conserved cytoplasmic domain (residues 731–882) and a divergent extracellular region (residues 155–709) composed of 5 cadherin domains separated by interdomain Ca2+ binding sites.208 Although the overall serine content of E-cadherin is not too high (just 4.8%), these residues are not uniformly spread through the sequence, and protein has a short serine-rich region (residues 828–851) located within the disordered C-terminal tail (residues 740–882). This serine-rich region is required for the O-GlcNAcylation of E-cadherin that plays a role in inhibition of the E-cadherin trafficking to the plasma membrane and accelerates anoikis.209 This is because the O-GlcNAcylation leads to the retaining of E-cadherin in the endoplasmic reticulum and blocks the E-cadherin interaction with the type I gamma phosphatidylinositol phosphate kinase (PIPKIγ), a protein required for recruitment of E-cadherin to adhesion sites.209

Epstein-Barr nuclear antigen-1 (EBNA1)

The Epstein-Barr nuclear antigen 1 (EBNA1) protein of Epstein-Barr virus (EBV) is critical for the EBV latent infection due to the involvement of this protein in the replication and segregation of the viral genome in proliferating host cells, and due to its roles in the transactivation of the expression of other EBV latency genes.210 The N-terminal 479 residues of the 641 residue-long EBNA1 are predicted to be disordered. This region has a strong compositional bias, being heavily enriched in Ala and Gly residues. This Ala/Gly-rich region (residues 87–325) is followed by the Gly/Arg-rich region (residues 328–380) and by the Ser-rich module (residues 383–393).

Although the overall serine content of EBNA1 is low (4.2 %), serines definitely play crucial roles in function of this protein. Binding of EBNA1 protein to casein kinase 2β-subunit (CK2β) recruits CK2 to promyelocytic leukemia nuclear bodies (PML NBs), leads to the increased phosphorylation of PML proteins, and triggers degradation of these proteins. This EBNA1-induced degradation of PML proteins in multiple carcinoma cell lines results in the loss of the PML NBs that would otherwise promote apoptosis and suppress viral lytic infection.211-213 Interaction of CK2β with EBNA1 is mediated via a novel CK2β binding pocket, is primed by phosphorylation of EBNA1 on Ser393 located within the polyserine region, and affects the ability of EBNA1 to induce PML degradation.214

Serine-based posttranslational modifications of proteins

At the later stages of protein biosynthesis, after their transcription, many proteins undergo intensive chemical alterations, collectively known as protein posttranslational modifications (PTMs), which range from the proteolytic cleavage to the covalent attachment of diverse functional groups or even proteins to specific side chains of a target protein. PTMs change local physical and chemical properties of a polypeptide chain, such as charge, hydrophobicity, and flexibility. These chemical modifications of a polypeptide chain after its biosynthesis extends the range of amino acid structures and properties, and consequently diversifies structures and functions of proteins. Although DNA typically encodes 20 primary amino acids, proteins contain more than 140 different residues, because of various PTMs. Altogether, as many as 400 post-translational modifications of proteins are known to occur physiologically.215,216

PTMs play a number of fundamental roles in regulating the folding of proteins, their targeting to specific subcellular compartments, their interaction with ligands or other proteins, and their functional state, such as catalytic activity in the case of enzymes or the signaling function of proteins involved in signal transduction pathways. Functions of some proteins rely on several different types of posttranslational modifications.

Although all amino acid side chains are known to undergo chemical diversification due to various PTMs, most often protein PTMs are found at side chains that can act as either strong (Cys, Met, Ser, Thr, Tyr, Lys, His, Arg, Asp, and Glu) or weak (Asn and Gln) nucleophiles, whereas the remaining residues (Pro, Gly, Leu, Ile, Val, Ala, Trp, and Phe) are rarely involved in covalent modifications of their side chains. Different residues are subjected to different types of PTMs. Among PTMs frequently affecting serine are phosphorylation, O-glycosylation, N-acetylation, O-acetylation, phosphopantetheinylation, autocleavage, N-ADP-ribosylation, amidation, N-decanoylation, O-octanoylation, O-palmitoylation, and sulfation. More rare serine-based PTMs found in bacteria are serine conversion to D-alanine and to the thioether amino acid lanthionine. Some of the illustrative examples of these serine-targeting PTMs are considered below.

Phosphorylation

It is estimated that functions of one-third of eukaryotic proteins are controlled via phosphorylation/dephosphorylation cycles that originate from carefully regulated protein kinase and phosphatase activities.217 Phosphorylation modulates the activity of numerous proteins involved in signal transduction, and regulates the binding affinity of transcription factors to their coactivators and DNA thereby altering gene expression, cell growth and differentiation.218 Phosphorylation sites are frequently located within functionally important protein domains. For example, the majority of phosphorylation sites of Mdm2 are located in its p53- and p14-ARF-binding regions,219 whereas ubiquitin-mediated degradation of many proteins is controlled by the phosphorylation of their PEST motifs.119

Eukaryotic protein kinases constitute one of the largest protein families. For example, yeast kinome includes 119 kinases, there are 1019 kinase- and 300 phosphatase-coding genes in Arabidopsis thaliana, there are 540 kinases in mouse kinome, whereas human genome contains ∼520 genes encoding kinases and more than 150 genes encoding phosphatases. Although kinases/phosphatases are very common proteins, in any given eukaryotic proteome, the number of kinases and phosphatases is noticeably smaller than the number of their potential substrates. In fact, on average, each eukaryotic protein kinase serves ∼20 substrates, whereas each human phosphatase is expected to dephosphorylate ∼65 clients.36

Although kinases/phosphatases are known to modify mostly 3 residues (serine, threonine, and tyrosine), it seems that the major phosphorylation target is serine. In fact, according to the PhosphoBase,220 of the 1700 phosphorylated residues found in 400 proteins, 1135 were serines.221 Curiously, numerous bioinformatics studies clearly linked intrinsic disorder with phosphorylation, since the vast majority of this PTM affect protein sites located within the disordered regions.221,222 In agreement with these computational studies, there are several well-characterized examples of phosphorylatable IDPs. An illustrative example of such experimentally validated IDPs targeted to phosphorylation is given by the mutation cluster region (MCR) of the signaling hub adenomatous polyposis coli (APC), which was shown to be was susceptible to proteolysis, lacked α-helical secondary structure, did not display thermal unfolding transition, displayed an extended conformation, and was accessible for phosphorylation by CK1ε in vitro.164

O-glycosylation

More than half of all proteins are glycosylated, and the molecular volume occupied by an oligosaccharide in a glycoprotein is frequently as large as that of the protein domain to which it is attached.117 O-glycosylation is the PTM process of the carbohydrate addition to the hydroxylated amino acids of proteins.223 O-glycosylation represents the addition of heterogeneous in size and structure oligosaccharides to a protein.223 O-linked glycosylation is normally initiated in the Golgi apparatus, most commonly by a N-acetyl galactosaminyltransferase that transfers a N-acetylgalactosamine (GalNAc) residue to the side chain of a serine or a threonine residue.117 Importantly, this event of the O-glycosylation initiation is dependent on the primary, secondary, and tertiary structure of the glycoprotein.224 Subsequent events of a stepwise enzymatic elongation by specific transferases yield several core structures, which are further elongated or modified by sialylation, sulfatation, acetylation, fucosylation, and polylactosamine-extension.117

Among the well-studied O-glycosylation targets in animals is a family of mucins, which are the structural components of the mucus gels that protect the respiratory, gastrointestinal, and reproductive tracts.141 Mucins contain ∼80% carbohydrate on a mass basis and have a high intrinsic viscosity due to their large size (250 – 2,000 kDa) and extreme hydrophilicity caused by the presence of numerous and heterogeneous oligosaccharides.141

Not only mucins, but many other cytoplasmic and nuclear proteins can be reversibly glycosylated with an O-linked GlcNAc, and this O-glycosylation might function in a similar manner as phosphorylation.117 Among the O-glycosylated proteins are low-density lipoprotein receptor,225 the epidermal growth factor (EGF) domains of different proteins, such as the bovine blood coagulation factors VII and IX, human coagulation factors VII and IX, human and bovine protein Z, tissue plasminogen activator, thrombospondin, urokinase, factors VII, IX, and XII, and murine fetal antigen 1,226 the neurotrophin receptor (nerve growth factor receptor, NGFR), IgA1, heparin- and gelatin-binding horse seminal plasma protein-1 (HSP-1), as well as many other biologically important proteins.117

It has been pointed out that since O-linked oligosaccharide chains are typically clustered over short peptide stretches in Ser/Thr(/Pro)-rich domains of several proteins, they can serve as multivalent carbohydrate antigenic or functional determinants for antibody recognition, mammalian cell adhesion, and microorganism binding.224 Furthermore, these heavily glycosylated Ser/Thr(/Pro)-rich domains may serve as protease-resistant spacers in cell surface glycoproteins.117 Curiously, sections of proteins with a high proline-content and many sugars attached to serines or threonines may assume a ‘bottle brush’-like structure.117

Phosphopantetheinylation

Phosphopantetheinylation is a crucial posttranslational modification that is absolutely necessary for the action of the polyketide synthases (PKSs) producing a large class of secondary metabolites known as polyketides227 and multifunctional non-ribosomal peptide synthetases (NRPSs) responsible for the biosynthesis of various peptide-based compounds.228 For example, NRPSs activate amino acid monomers first as aminoacyl-AMP mixed anhydride species then as covalent amino acyl-thioester intermediates, where the thiol group is provided by a 4′-phosphopantetheinyl group posttranslationally attached to a specific serine by a phosphopantetheinyl transferase (PPTase).227

Two illustrative examples of NRPSs are surfactin synthetase responsible for biosynthesis of the lipoheptapeptide antibiotic surfactin by Bacillus subtilis229 and the cyclosporin A producing fungal cyclosporin synthetase, which is a very large single chain enzyme containing more than 15,000 residues and 11 amino acid activation modules, each equipped with the adenylation domains and adjacent consensus serine sites marking condensation, epimerization, N-methylation, and peptidyl carrier protein (PCP) domains.230

Fugal iterative type I PKSs are complex enzymes responsible for the biosynthesis of various important polyketide compounds, such as statins, melanin pigments, and toxic aflatoxins.231 These type I PKSs are large machines consisting of multiple catalytic domains, such as starter acyltransferase (SAT), ketosynthase (KS), acyltransferase (AT), product template (PT), tandem acyl carrier proteins (ACPs) and Claisen cyclase (CLC) domains.231,232 On the contrary, type III PKSs (which, in plants, are responsible for biosynthesis of flavonoids and related compounds) are simple homodimers of ∼42 kDa.233

Autocleavage: Inteins and autocatalytic enzyme-like proteins

Inteins

Inteins are self-splicing elements that catalyze their own removal from the host protein and the formation of a peptide bond between their flanking protein regions (exteins) through a posttranslational process of protein splicing.234-237 There are 4 coupled nucleophilic displacements in the protein splicing mechanism: 1) an N→O(S) acyl shift of Ser/Cys at the intein N-terminus; 2) a transesterification reaction to form a branched intermediate with 2 N termini; 3) cyclization of the intein C-terminal Asn/Gln to release the intein; and 4) an O(S)→N acyl shift of Ser/Thr/Cys to form a native peptide bond between the exteins.238 As it follows from this description, nucleophilic nature of serine place this residue at the center of the series of transformations leading to protein self-splicing.239 In fact, the splice junctions of all inteins are closely related, with 3 types of amino acids being required at precise positions at the junctions: the last residue of the intein must be asparagine, and a hydroxyl or thiol residue (serine, threonine, cysteine) must be at the C-terminal side of each of the 2 junctions.239

One of the first reported examples of inteins is the generation of 2 enzymes via protein splicing from a single Saccharomyces cerevisiae VMA1 gene.236 Here, the self-splicing of the nascent 120-kDa translational product of VMA1 excises out the 50-kDa VMA1-derived endonuclease (VDE or VMA1 intein) and splices the N- and C-terminal exteins to form the mature Vma1p (a catalytic 70-kDa subunit of the vacuolar H+-ATPase).236

Autocatalytic enzyme-like proteins

Inteins can be considered as members of a family of autocatalytic enzyme-like proteins that perform chemical reactions using the same strategies as enzymes, but do not act on multiple substrates.238 Other examples of autoprocessing include RecA-assisted autocleavage of LexA or lambda repressor, autocleavage of N-terminal nucleophile amidohydrolase precursors (glycosylasparaginase, penicillin acylase, proteasome, etc.), processing of Hedgehog embryonic signaling proteins, activation of plasma proteins, and formation of pyruvoyl enzymes.240,241 The common feature of these processes, including protein splicing, is that all of them are initiated by a nucleophilic attack by the activated side-chain thiol of cysteine or hydroxyl group of serine or threonine at an amide bond.238

N-acetylation

Nα-terminal acetylation (Nt-acetylation) is a cotranslational or posttranslational modification found in many eukaryotic proteins by which the acetyl group is added to the α-amino group of their N-terminal residues. In fact, a majority of eukaryotic proteins, for example, more than 85% of human proteins and 68% of yeast proteins are cotranslationally Nt-acetylated.242 Several archaeal and prokaryotic proteins are also modified by N-terminal acetylation.

Since nascent eukaryotic proteins contain N-terminal Met, which, being retained in the mature protein and being followed by the “acetylation-permissive” residues, is usually acetylated at its α-amino group.243,244 If the residue at position 2 of a nascent protein has a small enough side chain, then the N-terminal Met is cleaved off by Met-aminopeptidases, resulting in N-terminal Ala, Val, Ser, Thr, Cys, Gly, or Pro.245 Similarly to the N-terminal Met, these N-terminal residues (with the exception of Pro) are often Nα-terminally acetylated.246–248 This is because the major N-terminal acetyltransferase (NAT), NatA, is able to act on subclasses of proteins with Ser-, Ala-, Thr-, Gly-, Cys-, and Val- N-termini.248 Since these amino acids are more frequently expressed in the N-termini of eukaryotic proteins, NatA is the major NAT corresponding to the whole number of its potential substrates.248 Furthermore, another eukaryotic NAT, NatD, is specifically responsible for the acetylation of proteins with the Ser-Gly-Gly- or Ser-Gly-Arg- N-termini, which are strongly resembling the N-termini of the human histones H2A and H4.249

Although for a long time the functional importance of this most common cotranslational modification was elusive, recent experiments with the yeast Saccharomyces cerevisiae revealed that the Nt-acetylated Met residue, as well as Nt-acetylated Ser, Ala, Val, Thr, and Cys residues, could act as a degradation signal (degron), targeted by the Doa10 ubiquitin ligase.244

O-acetylation

Traditionally, lysine residues were considered as the primary target of acetylation in proteins. However, recently, the ability of a bacterial effector from Yersinia YopJ to serve as an acetyltransferase responsible for acetylation of serine and threonine residues in the host cells was discovered.250,251 YopJ is known to inhibit mitogen-activated protein kinase (MAPK) and the nuclear factor kappaB (NFkappaB) signaling pathways playing an important role in innate immune response by preventing activation of the family of MAPK kinases (MAPKK). Here, YopJ was shown to block phosphorylation of MAPKK6 by acetylation of the critical serine and threonine residues in the activation loop of this important kinase.250 In a subsequent study, another important target of the YopJ acetylation was found, the MAP kinase kinase MEK2, where acetylation of the serine residues in the activation loop was shown to prevent the phosphorylation of these serine residues needed for the MEK2 activation and downstream signal propagation.251 Also, YopJ was able to acetylate a threonine residue in the activation loop of both the α and β subunits of the IKK kinase from the NF-kappaB pathway.251 Based on these observations, it has been concluded that the fact that serine and threonine can be not only phosphorylated and O-glycosylated but also (reversibly) acetylated adds a new level of complexity to the signaling machinery.252

ADP-ribosylation

Although arginine, cysteine, asparagine, and modified histidine (diphthamide) residues in proteins are well-established targets of the ADP-ribosylation (which is the transfer of the ADP-ribose moiety from NAD+ to specific amino acid residues on substrate proteins) via the action of various protein-mono-ADP-ribosyltransferases, the capability of formation of an acetal linkage between ADP-ribose and the hydroxyl group of a protein acceptor such as serine, threonine, tyrosine, hydroxyproline, or hydroxylysine residues has been demonstrated,253 indicating that these residues can be ADP-ribosylated. In agreement with this hypothesis, it has been established that an endoplasmic-reticulum-associated mono-ADP-ribosyltransferase PARP16/ARTD15 interacts with karyopherin-β1, and modifies neither acidic nor basic residues, but threonine or serine residues of this protein.254,255

O-palmitoylation

Palmitoylation is the covalent attachment of a palmitoyl group derived from the palmitic acid (which is a 16-carbon fatty acid) to cysteine (S-palmitoylation), serine or threonine residues (O-palmitoylation). This type of PTM is rather common among various cellular signaling proteins, since it mediates the interaction of proteins with membranes and other proteins and can control the biological activity of a protein.256 In S-palmitoylation, the palmitate attachment occurs through a labile thioester bond and is readily reversible in cells,257 whereas O-palmitoylation involves a serine -OH group to form an oxyester linkage256 via the process catalyzed by the membrane-bound O-acyltransferases.258 An illustrative example of a protein with a crucial O-palmitoylation is Wnt protein, acylation of which at a conserved serine residue Ser209 plays an important role in the Wnt secretion, since protein defective in acylation at Ser209 is not secreted from cells, being retained in the endoplasmic reticulum (ER).258 Another example is histone H4, which has been recently shown to be O-palmitoylated at Ser47 by action of the acyl-CoA:lysophosphatidylcholine acyltransferase (Lpcat1) in a calcium-regulated manner.259

O-decanoylation and O-octanoylation

Serine residues of some proteins or functional peptides are acylated with 8- or 10-carbon fatty acids, octanoate (C8) or decanoate (C10), respectively. An illustrative example of a polypeptide with these PTMs is ghrelin, which is an appetite-regulating hormone (“hunger hormone”) secreted by the food-deprived stomach.260 The aforementioned forms of ghrelin acylation are known to target Ser3 residue of this hormone.261 Fatty acid modifications of ghrelin are essential for the ghrelin-induced growth hormone release from the pituitary and also for the appetite stimulation.260 These acyl-modification of ghrelin are catalyzed by the ghrelin-O-acyl transferase.260

O-sulfonation

O-sulfonation of proteins on serine and threonine residues is one of the PTMs discovered via the use of the on-line high-performance liquid chromatography (HPLC) tandem electrospray mass spectrometry applied to proteins isolated by SDS-PAGE.262 Although O-sulfonation is the major type of cellular sulfonation and although different biopolymers (such as polysaccharides, steroids, catecholamines, and proteins) are known to be commonly subjected to O-sulfonation catalyzed by several cytosolic and membrane-bound sulfotransferases, the primary mode of action of these sulfotransferases on proteins was the catalysis of the direct protein sulfonation on the O4 position of tyrosine residues.263 Since O-sulfonation on tyrosine residues is one of the last PTMS affecting proteins transiting the trans-Golgi, this modification is present on many secreted and plasma membrane proteins of all metazoan species.263 In addition to the tyrosine-targeted O-sulfonation, serine and threonine residues were shown to be subjected to this type of PTM in several proteins of diverse class and function isolated from eukaryotes spanning the range from a unicellular parasite to humans.262

Serine-based PTMs in lantabiotics

Analysis of the lacticin 3147, which is a 2-component bacteriocin (i.e., a member of a large group of proteinaceous compounds produced by Gram-positive bacteria, which display an antimicrobial activity directed primarily against other Gram-positive organisms) generated by the Lactococcus lactis subspecies lactis DPC3147, revealed the presence of several specific PTMs.264 Among these PTMs is selective dehydration of the hydroxy amino acids (Ser and Thr) to form α,β-unsaturated residues dehydroalanine (Dha) and dehydrobutyrine (Dhb), respectively.265 Subsequently, Dha and Dhb can undergo Michael addition reactions with the thiol group of specific cysteine residues to form lanthionine (Lan) and β-methyl-lanthionine (MeLan) residues, which are specific to lantabiotics.264,266 Another unusual posttranslationally modified residue described for the lacticin 3147 is D-alanine, which is produced not due to a simple isomerase conversion from L-alanine, but as a result of a 2-step α-carbon stereoinversion reaction, where serine is initially dehydrated to Dha and then further modified by some stereospecific hydrogenating enzyme or enzyme system to produce D-alanine.264,267 Therefore, this posttranslational serine-D-alanine conversion represents the unique case of the incorporation of a d-amino acid in a ribosomally synthesized prokaryotic peptide.267

Amidation

Secreted proteins can undergo the C-terminal peptidyl-serine amidation, where the hydrolysis and oxidation of an interior Ser-Gly peptide results in the formation of a C-terminal peptidyl-serine amide. Here, glycine donates the “amine group” to the neighboring C-terminal residue generating an amide of the recipient amino acid, effectively removing the reactive carboxy group from the C-terminal amino acid. Although in mediating this amide formation, the only known donor amino acid is glycine, the recipient residue could be any amino acid with the exception of aspartate.268

Yin-Yang sites

Sections above illustrate the ability of serine to undergo multiple types of post-translational modifications (PTMs). Although these different serine-targeting PTMs are discussed individually, one should keep in mind that different PTMs may compete for the occupancy of a single residue. Obviously, this between-PTM competition is not unique to serine. However, one of the best studied classes of such PTM competitive sites is related to the cases which involve the competition between phosphorylation and O-glycosylation (Yin-Yang sites). Competition for the aforementioned Yin-Yang sites is determined by the remarkable similarities of global functional outcomes caused by these 2 PTMs. In fact, similar to phosphorylation, O-GlcNAcylation is known to be a rapidly cycling posttranslational modification that is directly involved in the regulation of many cellular processes.269 The phenomenon of the between-PTM competition was briefly discussed in the 'O-acetylation section', and several illustrative examples of proteins with such Yin-Yang sites are represented below. The murine estrogen receptor-β (mER-β) is one of the best studied cases, where there is an intricate functional competition between the O-GlcNAcylation and O-phosphorylation for Ser16.270 It has been shown that reciprocal occupancy of this residue by either O-phosphate or O-GlcNAc plays a crucial role in modulation of the degradation and activity of mER-β.270

Obviously, the cross-talk between O-GlcNAcylation and O-phosphorylation is not limited to the mER-β, being instead a very common phenomenon. In fact, when phosphorylation site occupancy of 711 sites was determined after the global increase in O-GlcNAcylation, almost all actively cycling phosphorylation sites were noticeably affected, suggesting that an interplay between these 2 PTMs may arise due to the steric competition for occupancy at the same or proximal sites and by the ability of each modification to regulate the enzymatic machinery of other PTM.271 A recent comprehensive review emphasized that this extensive interplay between the O-GlcNAcylation and O-phosphorylation is crucial for functionality of many proteins.269 Also, most anecdotal examples suggest that Yin-Yang sites happen to occur in disordered regions. This conclusion is supported by the resent statistical association study which clearly showed that although many PTM sites were systematically found within the disordered regions, target sites modified by more than one type of PTM showed even stronger preferences toward disordered regions than their single-PTM counterparts.222

Polyserine side of the polyglutamine disorders

A family neurodegenerative diseases including Huntington's disease, spino-cerebellar ataxia (SCA) 1, 2, 6, 7 and 3, spinobulbar muscular atrophy, and dentatorubal-pallidoluysian atrophy are autosomal dominant neurodegenerative conditions caused by the abnormal expansion of a (CAG)n repeat encoding a polyglutamine tract within the target protein (e.g., N-terminus of huntingtin).272 In addition to the polyglutamine expansion some human diseases and malformations (e.g., autosomal dominant oculopharyngeal muscular dystrophy (OPMD), synpolydactyly type II; cleidocranial dysplasia; holoprosencephaly (HPE5); hand-foot-genital syndrome; Blepharophimosis, ptosis, and epicanthus inversus; X-linked mental retardation with isolated growth hormone deficiency; X-linked infantile spasm syndrome; Partington syndrome; X-linked lissencephaly with ambiguous genitalia; congenital central hypoventilation syndrome/Ondine curse; and mental retardation X-linked 36 and 54273) are known to be caused by the polyalanine expansion mutations resulting from the abnormal expansion of the (GCG)n, (GCA)n, (GCT)n, and/or (GCC)n repeats all encoding polyalanine tracks.273-276

An intricate link was found between the polyglutamine and polyalanine expansion diseases, where the +1 frameshift of the original SCA3 gene was shown to result in shifting from the polyglutamine (CAG)n to the polyalanine (GCA)n-encoding frame, thereby defining the transition from the polyglutamine to the polyalanine expansion mutations.277,278 Based on these observations, it was hypothesized that since one of the serine codons is AGC, there is a chance that the +2 frameshift of the original polyglutamine-encoding frame (CAG)n would promote a transition to the polyserine-encoding frame (GCA)n.279 This hypothesis was confirmed by finding both polyalanine- and polyserine-containing huntingtin mutants in human postmortem Huntington's disease brains and in a transgenic mouse model of Huntington's disease.279 On a more general level, it seems that the aforementioned mechanism of homopolymeric conversion is rather common. In fact, recent studies showed that RNA transcripts with expanded CAG repeats can be translated in the complete absence of a starting ATG codon. This novel translation mechanism known as Repeat Associated Non-ATG translation (RAN-translation) occurs across expanded CAG repeats in all reading frames (CAG, AGC, and GCA) to produce homopolymeric proteins of long polyglutamine, polyserine, and polyalanine tracts.280,281 Furthermore, the possibility for RAN-translation is not limited to the expanded CAG repeats since the expanded CTG tracts expressing CUG transcripts also have RAN-translation at all 3 frames (CUG, UGC, and GCU), thereby generating polyleucine, polycysteine, and polyalanine.280

Disclosure of Potential Conflicts of Interest

No potential conflicts of interest were disclosed.

Funding

This work was supported in part by a grant from Russian Science Foundation RSCF № 14–24–00131

References

  • 1. Uversky VN. Intrinsically disordered proteins from A to Z. Int J Biochem Cell Biol 2011; 43:1090-103; PMID:21501695; http://dx.doi.org/ 10.1016/j.biocel.2011.04.001 [DOI] [PubMed] [Google Scholar]
  • 2. Tompa P. Unstructural biology coming of age. Curr Opin Struct Biol 2011; 21:419-25; PMID:21514142; http://dx.doi.org/ 10.1016/j.sbi.2011.03.012 [DOI] [PubMed] [Google Scholar]
  • 3. Wright PE, Dyson HJ. Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. J Mol Biol 1999; 293:321-31; PMID:10550212; http://dx.doi.org/ 10.1006/jmbi.1999.3110 [DOI] [PubMed] [Google Scholar]
  • 4. Dunker AK, Lawson JD, Brown CJ, Williams RM, Romero P, Oh JS, Oldfield CJ, Campen AM, Ratliff CM, Hipps KW, et al. Intrinsically disordered protein. J Mol Graph Model 2001; 19:26-59; PMID:11381529; http://dx.doi.org/ 10.1016/S1093-3263(00)00138-8 [DOI] [PubMed] [Google Scholar]
  • 5. Tompa P. Intrinsically unstructured proteins. Trends Biochem Sci 2002; 27:527-33; PMID:12368089; http://dx.doi.org/ 10.1016/S0968-0004(02)02169-2 [DOI] [PubMed] [Google Scholar]
  • 6. Dyson HJ, Wright PE. Intrinsically unstructured proteins and their functions. Nat Rev Mol Cell Biol 2005; 6:197-208; PMID:15738986; http://dx.doi.org/ 10.1038/nrm1589 [DOI] [PubMed] [Google Scholar]
  • 7. Iakoucheva LM, Brown CJ, Lawson JD, Obradovic Z, Dunker AK. Intrinsic disorder in cell-signaling and cancer-associated proteins. J Mol Biol 2002; 323:573-84; PMID:12381310; http://dx.doi.org/ 10.1016/S0022-2836(02)00969-5 [DOI] [PubMed] [Google Scholar]
  • 8. Dunker AK, Brown CJ, Lawson JD, Iakoucheva LM, Obradovic Z. Intrinsic disorder and protein function. Biochemistry 2002; 41:6573-82; PMID:12022860; http://dx.doi.org/ 10.1021/bi012159+ [DOI] [PubMed] [Google Scholar]
  • 9. Uversky VN. Natively unfolded proteins: a point where biology waits for physics. Protein Sci 2002; 11:739-56; PMID:11910019; http://dx.doi.org/ 10.1110/ps.4210102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Uversky VN, Oldfield CJ, Dunker AK. Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signaling. J Mol Recognit 2005; 18:343-84; PMID:16094605; http://dx.doi.org/ 10.1002/jmr.747 [DOI] [PubMed] [Google Scholar]
  • 11. Dunker AK, Cortese MS, Romero P, Iakoucheva LM, Uversky VN. Flexible nets. The roles of intrinsic disorder in protein interaction networks. Febs J 2005; 272:5129-48; PMID:16218947; http://dx.doi.org/ 10.1111/j.1742-4658.2005.04948.x [DOI] [PubMed] [Google Scholar]
  • 12. Xie H, Vucetic S, Iakoucheva LM, Oldfield CJ, Dunker AK, Uversky VN, Obradovic Z. Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. J Proteome Res 2007; 6:1882-98; PMID:17391014; http://dx.doi.org/ 10.1021/pr060392u [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Vucetic S, Xie H, Iakoucheva LM, Oldfield CJ, Dunker AK, Obradovic Z, Uversky VN. Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions. J Proteome Res 2007; 6:1899-916; PMID:17391015; http://dx.doi.org/ 10.1021/pr060393m [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Liu J, Faeder JR, Camacho CJ. Toward a quantitative theory of intrinsically disordered proteins and their function. Proc Natl Acad Sci U S A 2009; 106:19819-23; PMID:19903882; http://dx.doi.org/ 10.1073/pnas.0907710106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Kim PM, Sboner A, Xia Y, Gerstein M. The role of disorder in interaction networks: a structural analysis. Mol Syst Biol 2008; 4:179; PMID:18364713; http://dx.doi.org/ 10.1038/msb.2008.16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Oldfield CJ, Meng J, Yang JY, Yang MQ, Uversky VN, Dunker AK. Flexible nets: disorder and induced fit in the associations of p53 and 14-3-3 with their partners. BMC Genomics 2008; 9(Suppl 1):S1; PMID:18366598; http://dx.doi.org/ 10.1186/1471-2164-9-S1-S1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Wright PE, Dyson HJ. Linking folding and binding. Curr Opin Struct Biol 2009; 19:31-8; PMID:19157855; http://dx.doi.org/ 10.1016/j.sbi.2008.12.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Romero P, Obradovic Z, Kissinger CR, Villafranca JE, Garner E, Guilliot S, Dunker AK. Thousands of proteins likely to have long disordered regions. Pac Symp Biocomput 1998:437-48; PMID:9697202 [PubMed] [Google Scholar]
  • 19. Dunker AK, Obradovic Z, Romero P, Garner EC, Brown CJ. Intrinsic protein disorder in complete genomes. Genome Inform Ser Workshop Genome Inform 2000; 11:161-71; PMID:11700597 [PubMed] [Google Scholar]
  • 20. Uversky VN, Gillespie JR, Fink AL. Why are "natively unfolded" proteins unstructured under physiologic conditions? Proteins 2000; 41:415-27; PMID:11025552; http://dx.doi.org/ 10.1002/1097-0134(20001115)41:3%3c415::AID-PROT130%3e3.0.CO;2-7 [DOI] [PubMed] [Google Scholar]
  • 21. Ward JJ, Sodhi JS, McGuffin LJ, Buxton BF, Jones DT. Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J Mol Biol 2004; 337:635-45; PMID:15019783; http://dx.doi.org/ 10.1016/j.jmb.2004.02.002 [DOI] [PubMed] [Google Scholar]
  • 22. Xue B, Dunker AK, Uversky VN. Orderly order in protein intrinsic disorder distribution: disorder in 3500 proteomes from viruses and the three domains of life. J Biomol Struct Dyn 2012; 30:137-49; PMID:22702725; http://dx.doi.org/ 10.1080/07391102.2012.675145 [DOI] [PubMed] [Google Scholar]
  • 23. Uversky VN, Dunker AK. Understanding protein non-folding. Biochim Biophys Acta 2010; 1804:1231-64; PMID:20117254; http://dx.doi.org/ 10.1016/j.bbapap.2010.01.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Cheng Y, LeGall T, Oldfield CJ, Dunker AK, Uversky VN. Abundance of intrinsic disorder in protein associated with cardiovascular disease. Biochemistry 2006; 45:10448-60; PMID:16939197; http://dx.doi.org/ 10.1021/bi060981d [DOI] [PubMed] [Google Scholar]
  • 25. Xie H, Vucetic S, Iakoucheva LM, Oldfield CJ, Dunker AK, Obradovic Z, Uversky VN. Functional anthology of intrinsic disorder. 3. Ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins. J Proteome Res 2007; 6:1917-32; PMID:17391016; http://dx.doi.org/ 10.1021/pr060394e [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Uversky VN, Oldfield CJ, Dunker AK. Intrinsically disordered proteins in human diseases: introducing the D2 concept. Annu Rev Biophys 2008; 37:215-46; PMID:18573080; http://dx.doi.org/ 10.1146/annurev.biophys.37.032807.125924 [DOI] [PubMed] [Google Scholar]
  • 27. Midic U, Oldfield CJ, Dunker AK, Obradovic Z, Uversky VN. Unfoldomics of human genetic diseases: illustrative examples of ordered and intrinsically disordered members of the human diseasome. Protein Pept Lett 2009; 16:1533-47; PMID:20001916; http://dx.doi.org/ 10.2174/092986609789839377 [DOI] [PubMed] [Google Scholar]
  • 28. Uversky VN, Oldfield CJ, Midic U, Xie H, Xue B, Vucetic S, Iakoucheva LM, Obradovic Z, Dunker AK. Unfoldomics of human diseases: linking protein intrinsic disorder with diseases. BMC Genomics 2009; 10(Suppl 1):S7; PMID:19594884; http://dx.doi.org/ 10.1186/1471-2164-10-S1-S7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Midic U, Oldfield CJ, Dunker AK, Obradovic Z, Uversky VN. Protein disorder in the human diseasome: unfoldomics of human genetic diseases. BMC Genomics 2009; 10(Suppl 1):S12; PMID:19594871; http://dx.doi.org/ 10.1186/1471-2164-10-S1-S12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Oldfield CJ, Cheng Y, Cortese MS, Romero P, Uversky VN, Dunker AK. Coupled folding and binding with alpha-helix-forming molecular recognition elements. Biochemistry 2005; 44:12454-70; PMID:16156658; http://dx.doi.org/ 10.1021/bi050736e [DOI] [PubMed] [Google Scholar]
  • 31. Mohan A, Oldfield CJ, Radivojac P, Vacic V, Cortese MS, Dunker AK, Uversky VN. Analysis of molecular recognition features (MoRFs). J Mol Biol 2006; 362:1043-59; PMID:16935303; http://dx.doi.org/ 10.1016/j.jmb.2006.07.087 [DOI] [PubMed] [Google Scholar]
  • 32. Vacic V, Oldfield CJ, Mohan A, Radivojac P, Cortese MS, Uversky VN, Dunker AK. Characterization of molecular recognition features, MoRFs, and their binding partners. J Proteome Res 2007; 6:2351-66; PMID:17488107; http://dx.doi.org/ 10.1021/pr0701411 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Meszaros B, Simon I, Dosztanyi Z. Prediction of protein binding regions in disordered proteins. PLoS Comput Biol 2009; 5:e1000376; PMID:19412530; http://dx.doi.org/ 10.1371/journal.pcbi.1000376 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Dosztanyi Z, Meszaros B, Simon I. ANCHOR: web server for predicting protein binding regions in disordered proteins. Bioinformatics 2009; 25:2745-6; PMID:19717576; http://dx.doi.org/ 10.1093/bioinformatics/btp518 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Uversky VN. Multitude of binding modes attainable by intrinsically disordered proteins: a portrait gallery of disorder-based complexes. Chem Soc Rev 2011; 40:1623-34; PMID:21049125; http://dx.doi.org/ 10.1039/C0CS00057D [DOI] [PubMed] [Google Scholar]
  • 36. Uversky VN. Intrinsic disorder-based protein interactions and their modulators. Curr Pharm Des 2013; 42:4191-213; PMID: 23170892 [DOI] [PubMed] [Google Scholar]
  • 37. Hsu WL, Oldfield C, Meng J, Huang F, Xue B, Uversky VN, Romero P, Dunker AK. Intrinsic protein disorder and protein-protein interactions. Pac Symp Biocomput 2012:116-27; PMID:22174268 [PubMed] [Google Scholar]
  • 38. Meszaros B, Simon I, Dosztanyi Z. The expanding view of protein-protein interactions: complexes involving intrinsically disordered proteins. Phys Biol 2011; 8:035003; PMID:21572179; http://dx.doi.org/ 10.1088/1478-3975/8/3/035003 [DOI] [PubMed] [Google Scholar]
  • 39. Hazy E, Tompa P. Limitations of induced folding in molecular recognition by intrinsically disordered proteins. Chemphyschem 2009; 10:1415-9; PMID:19462392; http://dx.doi.org/ 10.1002/cphc.200900205 [DOI] [PubMed] [Google Scholar]
  • 40. Tompa P, Fuxreiter M. Fuzzy complexes: polymorphism and structural disorder in protein-protein interactions. Trends Biochem Sci 2008; 33:2-8; PMID:18054235; http://dx.doi.org/ 10.1016/j.tibs.2007.10.003 [DOI] [PubMed] [Google Scholar]
  • 41. Fuxreiter M, Simon I, Friedrich P, Tompa P. Preformed structural elements feature in partner recognition by intrinsically unstructured proteins. J Mol Biol 2004; 338:1015-26; PMID:15111064; http://dx.doi.org/ 10.1016/j.jmb.2004.03.017 [DOI] [PubMed] [Google Scholar]
  • 42. Chi SW, Kim DH, Lee SH, Chang I, Han KH. Pre-structured motifs in the natively unstructured preS1 surface antigen of hepatitis B virus. Protein Sci 2007; 16:2108-17; PMID:17766372; http://dx.doi.org/ 10.1110/ps.072983507 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Dunker AK, Garner E, Guilliot S, Romero P, Albrecht K, Hart J, Obradovic Z, Kissinger C, Villafranca JE. Protein disorder and the evolution of molecular recognition: theory, predictions and observations. Pac Symp Biocomput 1998:473-84; PMID:9697205 [PubMed] [Google Scholar]
  • 44. Radivojac P, Iakoucheva LM, Oldfield CJ, Obradovic Z, Uversky VN, Dunker AK. Intrinsic disorder and functional proteomics. Biophys J 2007; 92:1439-56; PMID:17158572; http://dx.doi.org/ 10.1529/biophysj.106.094045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Vacic V, Uversky VN, Dunker AK, Lonardi S. Composition Profiler: a tool for discovery and visualization of amino acid composition differences. BMC Bioinformatics 2007; 8:211; PMID:17578581; http://dx.doi.org/ 10.1186/1471-2105-8-211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Sickmeier M, Hamilton JA, LeGall T, Vacic V, Cortese MS, Tantos A, Szabo B, Tompa P, Chen J, Uversky VN, et al. DisProt: the Database of Disordered Proteins. Nucleic Acids Res 2007; 35:D786-93; PMID:17145717; http://dx.doi.org/ 10.1093/nar/gkl893 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, et al. The Universal Protein Resource (UniProt). Nucleic Acids Res 2005; 33:D154-9; PMID:15608167; http://dx.doi.org/ 10.1093/nar/gki070 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. The Protein Data Bank. Nucleic Acids Res 2000; 28:235-42; PMID:10592235; http://dx.doi.org/ 10.1093/nar/28.1.235 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Huntley MA, Golding GB. Selection and slippage creating serine homopolymers. Mol Biol Evol 2006; 23:2017-25; PMID:16877497; http://dx.doi.org/ 10.1093/molbev/msl073 [DOI] [PubMed] [Google Scholar]
  • 50. Diaz-Lazcoz Y, Henaut A, Vigier P, Risler JL. Differential codon usage for conserved amino acids: evidence that the serine codons TCN were primordial. J Mol Biol 1995; 250:123-7; PMID:7608964; http://dx.doi.org/ 10.1006/jmbi.1995.0363 [DOI] [PubMed] [Google Scholar]
  • 51. Karplus PA. Hydrophobicity regained. Protein Sci 1997; 6:1302-7; PMID:9194190; http://dx.doi.org/ 10.1002/pro.5560060618 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Bordo D, Argos P. Suggestions for "safe" residue substitutions in site-directed mutagenesis. J Mol Biol 1991; 217:721-9; PMID:2005621; http://dx.doi.org/ 10.1016/0022-2836(91)90528-E [DOI] [PubMed] [Google Scholar]
  • 53. Jacob F. Evolution and tinkering. Science 1977; 196:1161-6; PMID:860134; http://dx.doi.org/ 10.1126/science.860134 [DOI] [PubMed] [Google Scholar]
  • 54. Creixell P, Schoof EM, Tan CS, Linding R. Mutational properties of amino acid residues: implications for evolvability of phosphorylatable residues. Philos Trans R Soc Lond B Biol Sci 2012; 367:2584-93; PMID:22889909; http://dx.doi.org/ 10.1098/rstb.2012.0076 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Zuckerkandl E, Derancourt J, Vogel H. Mutational trends and random processes in the evolution of informational macromolecules. J Mol Biol 1971; 59:473-90; PMID:5571595; http://dx.doi.org/ 10.1016/0022-2836(71)90311-1 [DOI] [PubMed] [Google Scholar]
  • 56. Jordan IK, Kondrashov FA, Adzhubei IA, Wolf YI, Koonin EV, Kondrashov AS, Sunyaev S. A universal trend of amino acid gain and loss in protein evolution. Nature 2005; 433:633-8; PMID:15660107; http://dx.doi.org/ 10.1038/nature03306 [DOI] [PubMed] [Google Scholar]
  • 57. de Beer TA, Laskowski RA, Parks SL, Sipos B, Goldman N, Thornton JM. Amino acid changes in disease-associated variants differ radically from variants observed in the 1000 genomes project dataset. PLoS Comput Biol 2013; 9:e1003382; PMID:24348229; http://dx.doi.org/ 10.1371/journal.pcbi.1003382 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Kalhan SC, Hanson RW. Resurgence of serine: an often neglected but indispensable amino Acid. J Biol Chem 2012; 287:19786-91; PMID:22566694; http://dx.doi.org/ 10.1074/jbc.R112.357194 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. de Koning TJ, Poll-The BT, Jaeken J. Continuing education in neurometabolic disorders–serine deficiency disorders. Neuropediatrics 1999; 30:1-4; PMID:10222452; http://dx.doi.org/ 10.1055/s-2007-973447 [DOI] [PubMed] [Google Scholar]
  • 60. Snell K. Enzymes of serine metabolism in normal, developing and neoplastic rat tissues. Adv Enzyme Regul 1984; 22:325-400; PMID:6089514; http://dx.doi.org/ 10.1016/0065-2571(84)90021-9 [DOI] [PubMed] [Google Scholar]
  • 61. Snell K, Weber G. Enzymic imbalance in serine metabolism in rat hepatomas. Biochem J 1986; 233:617-20; PMID:3082329 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Snell K, Natsumeda Y, Weber G. The modulation of serine metabolism in hepatoma 3924A during different phases of cellular proliferation in culture. Biochem J 1987; 245:609-12; PMID:3117048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Locasale JW, Grassian AR, Melman T, Lyssiotis CA, Mattaini KR, Bass AJ, Heffron G, Metallo CM, Muranen T, Sharfi H, et al. Phosphoglycerate dehydrogenase diverts glycolytic flux and contributes to oncogenesis. Nat Genet 2011; 43:869-74; PMID:21804546; http://dx.doi.org/ 10.1038/ng.890 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Furuya S, Watanabe M. Novel neuroglial and glioglial relationships mediated by L-serine metabolism. Arch Histol Cytol 2003; 66:109-21; PMID:12846552; http://dx.doi.org/ 10.1679/aohc.66.109 [DOI] [PubMed] [Google Scholar]
  • 65. Hashimoto A, Nishikawa T, Hayashi T, Fujii N, Harada K, Oka T, Takahashi K. The presence of free D-serine in rat brain. FEBS Lett 1992; 296:33-6; PMID:1730289; http://dx.doi.org/ 10.1016/0014-5793(92)80397-Y [DOI] [PubMed] [Google Scholar]
  • 66. Fuchs SA, Berger R, Klomp LW, de Koning TJ. D-amino acids in the central nervous system in health and disease. Mol Genet Metab 2005; 85:168-80; PMID:15979028; http://dx.doi.org/ 10.1016/j.ymgme.2005.03.003 [DOI] [PubMed] [Google Scholar]
  • 67. Hashimoto A, Oka T, Nishikawa T. Extracellular concentration of endogenous free D-serine in the rat brain as revealed by in vivo microdialysis. Neuroscience 1995; 66:635-43; PMID:7644027; http://dx.doi.org/ 10.1016/0306-4522(94)00597-X [DOI] [PubMed] [Google Scholar]
  • 68. Hashimoto A, Kumashiro S, Nishikawa T, Oka T, Takahashi K, Mito T, Takashima S, Doi N, Mizutani Y, Yamazaki T, et al. Embryonic development and postnatal changes in free D-aspartate and D-serine in the human prefrontal cortex. J Neurochem 1993; 61:348-51; PMID:8515283; http://dx.doi.org/ 10.1111/j.1471-4159.1993.tb03575.x [DOI] [PubMed] [Google Scholar]
  • 69. Hashimoto A, Nishikawa T, Oka T, Takahashi K. Endogenous D-serine in rat brain: N-methyl-D-aspartate receptor-related distribution and aging. J Neurochem 1993; 60:783-6; PMID:8419554; http://dx.doi.org/ 10.1111/j.1471-4159.1993.tb03219.x [DOI] [PubMed] [Google Scholar]
  • 70. Nagata Y, Horiike K, Maeda T. Distribution of free D-serine in vertebrate brains. Brain Res 1994; 634:291-5; PMID:8131079; http://dx.doi.org/ 10.1016/0006-8993(94)91932-1 [DOI] [PubMed] [Google Scholar]
  • 71. Hashimoto A, Oka T, Nishikawa T. Anatomical distribution and postnatal changes in endogenous free D-aspartate and D-serine in rat brain and periphery. Eur J Neurosci 1995; 7:1657-63; PMID:7582120; http://dx.doi.org/ 10.1111/j.1460-9568.1995.tb00687.x [DOI] [PubMed] [Google Scholar]
  • 72. Hashimoto A, Oka T. Free D-aspartate and D-serine in the mammalian brain and periphery. Prog Neurobiol 1997; 52:325-53; PMID:9247969; http://dx.doi.org/ 10.1016/S0301-0082(97)00019-1 [DOI] [PubMed] [Google Scholar]
  • 73. Nishikawa T. Analysis of free D-serine in mammals and its biological relevance. J Chromatogr B Analyt Technol Biomed Life Sci 2011; 879:3169-83; PMID:21992750; http://dx.doi.org/ 10.1016/j.jchromb.2011.08.030 [DOI] [PubMed] [Google Scholar]
  • 74. Imai K, Fukushima T, Santa T, Homma H, Huang Y, Sakai K, Kato M. Distribution of free D-amino acids in tissues and body fluids of vertebrates. Enantiomer 1997; 2:143-5; PMID:9676266 [PubMed] [Google Scholar]
  • 75. Schell MJ, Molliver ME, Snyder SH. D-serine, an endogenous synaptic modulator: localization to astrocytes and glutamate-stimulated release. Proc Natl Acad Sci U S A 1995; 92:3948-52; PMID:7732010; http://dx.doi.org/ 10.1073/pnas.92.9.3948 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Schell MJ, Brady RO, Jr., Molliver ME, Snyder SH. D-serine as a neuromodulator: regional and developmental localizations in rat brain glia resemble NMDA receptors. J Neurosci 1997; 17:1604-15; PMID:9030620 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Wako K, Ma N, Shiroyama T, Semba R. Glial uptake of intracerebroventricularly injected D-serine in the rat brain: an immunocytochemical study. Neurosci Lett 1995; 185:171-4; PMID:7753484; http://dx.doi.org/ 10.1016/0304-3940(95)11253-S [DOI] [PubMed] [Google Scholar]
  • 78. de Koning TJ, Klomp LW. Serine-deficiency syndromes. Curr Opin Neurol 2004; 17:197-204; PMID:15021249; http://dx.doi.org/ 10.1097/00019052-200404000-00019 [DOI] [PubMed] [Google Scholar]
  • 79. Sacchi S, Caldinelli L, Cappelletti P, Pollegioni L, Molla G. Structure-function relationships in human D-amino acid oxidase. Amino Acids 2012; 43:1833-50; PMID:22865246; http://dx.doi.org/ 10.1007/s00726-012-1345-4 [DOI] [PubMed] [Google Scholar]
  • 80. Lo EH, Pierce AR, Matsumoto K, Kano T, Evans CJ, Newcomb R. Alterations in K+ evoked profiles of neurotransmitter and neuromodulator amino acids after focal ischemia-reperfusion. Neuroscience 1998; 83:449-58; PMID:9460753; http://dx.doi.org/ 10.1016/S0306-4522(97)00434-X [DOI] [PubMed] [Google Scholar]
  • 81. Richardson JS, Richardson DC. Principls and patterns of protein conformation. Prediction of Protein Structure and the Principles of Protein Conformation. In: Fasman GD, ed. New York: Plenom Press; 1990:2-98. [Google Scholar]
  • 82. Miller RT, Douthart RJ, Dunker AK. Learning an objective alphabet of amino acid conformations in protein. Tech Protein Chem 1993; 4:541-8; http://dx.doi.org/ 10.1016/B978-0-12-058757-5.50064-0 [DOI] [Google Scholar]
  • 83. Miller RT, Douthart RJ, Dunker AK. An alphabet of amino acid conformations in protein. The Twenty-Sixth Hawaii International Conference on System Sciences. Hawaii: IEEE, 1993:689-98. [Google Scholar]
  • 84. Kendrew JC, Watson HC, Strandberg BE, Dickerson RE, Phillips DC, Shore VC. The amino-acid sequence x-ray methods, and its correlation with chemical data. Nature 1961; 190:666-70; PMID:13752474; http://dx.doi.org/ 10.1038/190666a0 [DOI] [PubMed] [Google Scholar]
  • 85. Presta LG, Rose GD. Helix signals in proteins. Science 1988; 240:1632-41; PMID:2837824; http://dx.doi.org/ 10.1126/science.2837824 [DOI] [PubMed] [Google Scholar]
  • 86. Serrano L, Fersht AR. Capping and alpha-helix stability. Nature 1989; 342:296-9; PMID:2812029; http://dx.doi.org/ 10.1038/342296a0 [DOI] [PubMed] [Google Scholar]
  • 87. Aurora R, Rose GD. Helix capping. Protein Sci 1998; 7:21-38; PMID:9514257; http://dx.doi.org/ 10.1002/pro.5560070103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88. Lu M, Shu W, Ji H, Spek E, Wang L, Kallenbach NR. Helix capping in the GCN4 leucine zipper. J Mol Biol 1999; 288:743-52; PMID:10329176; http://dx.doi.org/ 10.1006/jmbi.1999.2707 [DOI] [PubMed] [Google Scholar]
  • 89. Zhou HX, Lyu P, Wemmer DE, Kallenbach NR. Alpha helix capping in synthetic model peptides by reciprocal side chain-main chain interactions: evidence for an N terminal "capping box". Proteins 1994; 18:1-7; PMID:8146119; http://dx.doi.org/ 10.1002/prot.340180103 [DOI] [PubMed] [Google Scholar]
  • 90. Doig AJ, MacArthur MW, Stapley BJ, Thornton JM. Structures of N-termini of helices in proteins. Protein Sci 1997; 6:147-55; PMID:9007987; http://dx.doi.org/ 10.1002/pro.5560060117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91. Ermolenko DN, Thomas ST, Aurora R, Gronenborn AM, Makhatadze GI. Hydrophobic interactions at the Ccap position of the C-capping motif of alpha-helices. J Mol Biol 2002; 322:123-35; PMID:12215419; http://dx.doi.org/ 10.1016/S0022-2836(02)00734-9 [DOI] [PubMed] [Google Scholar]
  • 92. Forood B, Feliciano EJ, Nambiar KP. Stabilization of alpha-helical structures in short peptides via end capping. Proc Natl Acad Sci U S A 1993; 90:838-42; PMID:8430094; http://dx.doi.org/ 10.1073/pnas.90.3.838 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93. Nardini M, Dijkstra BW. Alpha/beta hydrolase fold enzymes: the family keeps growing. Curr Opin Struct Biol 1999; 9:732-7; PMID:10607665; http://dx.doi.org/ 10.1016/S0959-440X(99)00037-8 [DOI] [PubMed] [Google Scholar]
  • 94. Quevillon-Cheruel S, Leulliot N, Graille M, Hervouet N, Coste F, Benedetti H, Zelwer C, Janin J, Van Tilbeurgh H. Crystal structure of yeast YHR049W/FSH1, a member of the serine hydrolase family. Protein Sci 2005; 14:1350-6; PMID:15802654; http://dx.doi.org/ 10.1110/ps.051415905 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95. Trevino SR, Scholtz JM, Pace CN. Measuring and increasing protein solubility. J Pharm Sci 2008; 97:4155-66; PMID:18240286; http://dx.doi.org/ 10.1002/jps.21327 [DOI] [PubMed] [Google Scholar]
  • 96. Baxter SM, Rosenblum JS, Knutson S, Nelson MR, Montimurro JS, Di Gennaro JA, Speir JA, Burbaum JJ, Fetrow JS. Synergistic computational and experimental proteomics approaches for more accurate detection of active serine hydrolases in yeast. Mol Cell Proteomics 2004; 3:209-25; PMID:14645503; http://dx.doi.org/ 10.1074/mcp.M300082-MCP200 [DOI] [PubMed] [Google Scholar]
  • 97. Davie EW, Fujikawa K, Kurachi K, Kisiel W. The role of serine proteases in the blood coagulation cascade. Adv Enzymol Relat Areas Mol Biol 1979; 48:277-318; PMID:367103 [DOI] [PubMed] [Google Scholar]
  • 98. Patricelli MP, Cravatt BF. Characterization and manipulation of the acyl chain selectivity of fatty acid amide hydrolase. Biochemistry 2001; 40:6107-15; PMID:11352748; http://dx.doi.org/ 10.1021/bi002578r [DOI] [PubMed] [Google Scholar]
  • 99. Pinho MG, de Lencastre H, Tomasz A. An acquired and a native penicillin-binding protein cooperate in building the cell wall of drug-resistant staphylococci. Proc Natl Acad Sci U S A 2001; 98:10886-91; PMID:11517340; http://dx.doi.org/ 10.1073/pnas.191260798 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100. Satoh T, Hosokawa M. The mammalian carboxylesterases: from molecules to functions. Annu Rev Pharmacol Toxicol 1998; 38:257-88; PMID:9597156; http://dx.doi.org/ 10.1146/annurev.pharmtox.38.1.257 [DOI] [PubMed] [Google Scholar]
  • 101. Svendsen A. Lipase protein engineering. Biochim Biophys Acta 2000; 1543:223-38; PMID:11150608; http://dx.doi.org/ 10.1016/S0167-4838(00)00239-9 [DOI] [PubMed] [Google Scholar]
  • 102. Wakil SJ. Fatty acid synthase, a proficient multifunctional enzyme. Biochemistry 1989; 28:4523-30; PMID:2669958; http://dx.doi.org/ 10.1021/bi00437a001 [DOI] [PubMed] [Google Scholar]
  • 103. Hunt MC, Siponen MI, Alexson SE. The emerging role of acyl-CoA thioesterases and acyltransferases in regulating peroxisomal lipid metabolism. Biochim Biophys Acta 2012; 1822:1397-410; PMID:22465940; http://dx.doi.org/ 10.1016/j.bbadis.2012.03.009 [DOI] [PubMed] [Google Scholar]
  • 104. Silman I, Sussman JL. Acetylcholinesterase: how is structure related to function? Chem Biol Interact 2008; 175:3-10; PMID:18586019; http://dx.doi.org/ 10.1016/j.cbi.2008.05.035 [DOI] [PubMed] [Google Scholar]
  • 105. De Maria L, Vind J, Oxenboll KM, Svendsen A, Patkar S. Phospholipases and their industrial applications. Appl Microbiol Biotechnol 2007; 74:290-300; PMID:17221199; http://dx.doi.org/ 10.1007/s00253-006-0775-x [DOI] [PubMed] [Google Scholar]
  • 106. Ogris E, Du X, Nelson KC, Mak EK, Yu XX, Lane WS, Pallas DC. A protein phosphatase methylesterase (PME-1) is one of several novel proteins stably associating with two inactive mutants of protein phosphatase 2A. J Biol Chem 1999; 274:14382-91; PMID:10318862; http://dx.doi.org/ 10.1074/jbc.274.20.14382 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107. Erwin AL, Munford RS. Deacylation of structurally diverse lipopolysaccharides by human acyloxyacyl hydrolase. J Biol Chem 1990; 265:16444-9; PMID:2398058 [PubMed] [Google Scholar]
  • 108. Surolia I, Pirnie SP, Chellappa V, Taylor KN, Cariappa A, Moya J, Liu H, Bell DW, Driscoll DR, Diederichs S, et al. Functionally defective germline variants of sialic acid acetylesterase in autoimmunity. Nature 2010; 466:243-7; PMID:20555325; http://dx.doi.org/ 10.1038/nature09115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109. Vanhoof G, Goossens F, Hendriks L, De Meester I, Hendriks D, Vriend G, Van Broeckhoven C, Scharpe S. Cloning and sequence analysis of the gene encoding human lymphocyte prolyl endopeptidase. Gene 1994; 149:363-6; PMID:7959018; http://dx.doi.org/ 10.1016/0378-1119(94)90177-5 [DOI] [PubMed] [Google Scholar]
  • 110. Chen X. Biochemical properties of recombinant prolyl dipeptidases DPP-IV and DPP8. Adv Exp Med Biol 2006; 575:27-32; PMID:16700505; http://dx.doi.org/ 10.1007/0-387-32824-6_3 [DOI] [PubMed] [Google Scholar]
  • 111. Aertgeerts K, Levin I, Shi L, Snell GP, Jennings A, Prasad GS, Zhang Y, Kraus ML, Salakian S, Sridhar V, et al. Structural and kinetic analysis of the substrate specificity of human fibroblast activation protein alpha. J Biol Chem 2005; 280:19441-4; PMID:15809306; http://dx.doi.org/ 10.1074/jbc.C500092200 [DOI] [PubMed] [Google Scholar]
  • 112. Dodson G, Wlodawer A. Catalytic triads and their relatives. Trends Biochem Sci 1998; 23:347-52; PMID:9787641; http://dx.doi.org/ 10.1016/S0968-0004(98)01254-7 [DOI] [PubMed] [Google Scholar]
  • 113. Ekici OD, Paetzel M, Dalbey RE. Unconventional serine proteases: variations on the catalytic Ser/His/Asp triad configuration. Protein Sci 2008; 17:2023-37; PMID:18824507; http://dx.doi.org/ 10.1110/ps.035436.108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114. Polgar L. The catalytic triad of serine peptidases. Cell Mol Life Sci 2005; 62:2161-72; PMID:16003488; http://dx.doi.org/ 10.1007/s00018-005-5160-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115. Paetzel M, Dalbey RE. Catalytic hydroxyl/amine dyads within serine proteases. Trends Biochem Sci 1997; 22:28-31; PMID:9020589; http://dx.doi.org/ 10.1016/S0968-0004(96)10065-7 [DOI] [PubMed] [Google Scholar]
  • 116. Howard MB, Ekborg NA, Taylor LE, Hutcheson SW, Weiner RM. Identification and analysis of polyserine linker domains in prokaryotic proteins with emphasis on the marine bacterium Microbulbifer degradans. Protein Sci 2004; 13:1422-5; PMID:15075401; http://dx.doi.org/ 10.1110/ps.03511604 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117. Van den Steen P, Rudd PM, Dwek RA, Opdenakker G. Concepts and principles of O-linked glycosylation. Crit Rev Biochem Mol Biol 1998; 33:151-208; PMID:9673446; http://dx.doi.org/ 10.1080/10409239891204198 [DOI] [PubMed] [Google Scholar]
  • 118. Cheng J, Deming TJ. Synthesis of polypeptides by ring-opening polymerization of alpha-amino acid N-carboxyanhydrides. Top Curr Chem 2011; 310:1-26; http://dx.doi.org/ 10.1007/128_2011_173 [DOI] [PubMed] [Google Scholar]
  • 119. Rechsteiner M, Rogers SW. PEST sequences and regulation by proteolysis. Trends Biochem Sci 1996; 21:267-71; PMID:8755249; http://dx.doi.org/ 10.1016/S0968-0004(96)10031-1 [DOI] [PubMed] [Google Scholar]
  • 120. Rogers S, Wells R, Rechsteiner M. Amino acid sequences common to rapidly degraded proteins: the PEST hypothesis. Science 1986; 234:364-8; PMID:2876518; http://dx.doi.org/ 10.1126/science.2876518 [DOI] [PubMed] [Google Scholar]
  • 121. Singh GP, Ganapathi M, Sandhu KS, Dash D. Intrinsic unstructuredness and abundance of PEST motifs in eukaryotic proteomes. Proteins 2006; 62:309-15; PMID:16299712; http://dx.doi.org/ 10.1002/prot.20746 [DOI] [PubMed] [Google Scholar]
  • 122. Belizario JE, Alves J, Garay-Malpartida M, Occhiucci JM. Coupling caspase cleavage and proteasomal degradation of proteins carrying PEST motif. Curr Protein Pept Sci 2008; 9:210-20; PMID:18537676; http://dx.doi.org/ 10.2174/138920308784534023 [DOI] [PubMed] [Google Scholar]
  • 123. Hoh JH. Functional protein domains from the thermally driven motion of polypeptide chains: a proposal. Proteins 1998; 32:223-8; PMID:9714161; http://dx.doi.org/ 10.1002/(SICI)1097-0134(19980801)32:2%3c223::AID-PROT8%3e3.0.CO;2-L [DOI] [PubMed] [Google Scholar]
  • 124. Brown HG, Hoh JH. Entropic exclusion by neurofilament sidearms: a mechanism for maintaining interfilament spacing. Biochemistry 1997; 36:15035-40; PMID:9424114; http://dx.doi.org/ 10.1021/bi9721748 [DOI] [PubMed] [Google Scholar]
  • 125. Geisler N, Fischer S, Vandekerckhove J, Plessmann U, Weber K. Hybrid character of a large neurofilament protein (NF-M): intermediate filament type sequence followed by a long and acidic carboxy-terminal extension. EMBO J 1984; 3:2701-6; PMID:6439558 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126. Voter WA, Erickson HP. Electron microscopy of MAP 2 (microtubule-associated protein 2). J Ultrastruct Res 1982; 80:374-82; PMID:7131650; http://dx.doi.org/ 10.1016/S0022-5320(82)80051-8 [DOI] [PubMed] [Google Scholar]
  • 127. Merdes A, Ramyar K, Vechio JD, Cleveland DW. A complex of NuMA and cytoplasmic dynein is essential for mitotic spindle assembly. Cell 1996; 87:447-58; PMID:8898198; http://dx.doi.org/ 10.1016/S0092-8674(00)81365-3 [DOI] [PubMed] [Google Scholar]
  • 128. Santner AA, Croy CH, Vasanwala FH, Uversky VN, Van YY, Dunker AK. Sweeping away protein aggregation with entropic bristles: intrinsically disordered protein fusions enhance soluble expression. Biochemistry 2012; 51:7250-62; PMID:22924672; http://dx.doi.org/ 10.1021/bi300653m [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129. Minde DP, Halff EF, Tans S. Designing disorder. Intrinsic Disord Protein 2013; 1:e26790; http://dx.doi.org/ 10.4161/idp.26790 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130. Schellenberger V, Wang CW, Geething NC, Spink BJ, Campbell A, To W, Scholle MD, Yin Y, Yao Y, Bogin O, et al. A recombinant polypeptide extends the in vivo half-life of peptides and proteins in a tunable manner. Nat Biotechnol 2009; 27:1186-90; PMID:19915550; http://dx.doi.org/ 10.1038/nbt.1588 [DOI] [PubMed] [Google Scholar]
  • 131. Geething NC, To W, Spink BJ, Scholle MD, Wang CW, Yin Y, Yao Y, Schellenberger V, Cleland JL, Stemmer WP, et al. Gcg-XTEN: an improved glucagon capable of preventing hypoglycemia without increasing baseline blood glucose. PloS One 2010; 5:e10175; PMID:20418955; http://dx.doi.org/ 10.1371/journal.pone.0010175 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132. Schlapschy M, Theobald I, Mack H, Schottelius M, Wester HJ, Skerra A. Fusion of a recombinant antibody fragment with a homo-amino-acid polymer: effects on biophysical properties and prolonged plasma half-life. Protein Eng Des Sel 2007; 20:273-84; PMID:17595342; http://dx.doi.org/ 10.1093/protein/gzm020 [DOI] [PubMed] [Google Scholar]
  • 133. Schlapschy M, Binder U, Borger C, Theobald I, Wachinger K, Kisling S, Haller D, Skerra A. PASylation: a biological alternative to PEGylation for extending the plasma half-life of pharmaceutically active proteins. Protein Eng Des Sel 2013; 26:489-501; PMID:23754528; http://dx.doi.org/ 10.1093/protein/gzt023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134. Buscaglia CA, Alfonso J, Campetella O, Frasch AC. Tandem amino acid repeats from Trypanosoma cruzi shed antigens increase the half-life of proteins in blood. Blood 1999; 93:2025-32; PMID:10068675 [PubMed] [Google Scholar]
  • 135. Alvarez P, Buscaglia CA, Campetella O. Improving protein pharmacokinetics by genetic fusion to simple amino acid sequences. J Biol Chem 2004; 279:3375-81; PMID:14612434; http://dx.doi.org/ 10.1074/jbc.M311356200 [DOI] [PubMed] [Google Scholar]
  • 136. Tian X, Strassmann JE, Queller DC. A conserved extraordinarily long serine homopolymer in Dictyostelid amoebae. Heredity (Edinb) 2014; 112:215-8; PMID:24084645; http://dx.doi.org/ 10.1038/hdy.2013.96 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137. Hu X, Lawrence B, Kohler K, Falick AM, Moore AM, McMullen E, Jones PR, Vierra C. Araneoid egg case silk: a fibroin with novel ensemble repeat units from the black widow spider, Latrodectus hesperus. Biochemistry 2005; 44:10020-7; PMID:16042378; http://dx.doi.org/ 10.1021/bi050494i [DOI] [PubMed] [Google Scholar]
  • 138. Harding CR, Aho S, Bosko CA. Filaggrin - revisited. Int J Cosmet Sci 2013; 35:412-23; PMID:23517450; http://dx.doi.org/ 10.1111/ics.12049 [DOI] [PubMed] [Google Scholar]
  • 139. De Benedetto A, Qualia CM, Baroody FM, Beck LA. Filaggrin expression in oral, nasal, and esophageal mucosa. J Invest Dermatol 2008; 128:1594-7; PMID:18172455; http://dx.doi.org/ 10.1038/sj.jid.5701208 [DOI] [PubMed] [Google Scholar]
  • 140. Blencowe BJ, Bauren G, Eldridge AG, Issner R, Nickerson JA, Rosonina E, Sharp PA. The SRm160/300 splicing coactivator subunits. RNA 2000; 6:111-20; PMID:10668804; http://dx.doi.org/ 10.1017/S1355838200991982 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 141. Gum JR, Jr. Mucin genes and the proteins they encode: structure, diversity, and regulation. Am J Respir Cell Mol Biol 1992; 7:557-64; PMID:1449803; http://dx.doi.org/ 10.1165/ajrcmb/7.6.557 [DOI] [PubMed] [Google Scholar]
  • 142. Griffiths JK. Human cryptosporidiosis: epidemiology, transmission, clinical disease, treatment, and diagnosis. Adv Parasitol 1998; 40:37-85; PMID:9554070; http://dx.doi.org/ 10.1016/S0065-308X(08)60117-7 [DOI] [PubMed] [Google Scholar]
  • 143. Ward H, Hamer D, Keusch G, Cevallos AM. GP40 and uses thereof. In: USPTO, ed. USA: New England Medical Center Hospitals, Inc., 2003. [Google Scholar]
  • 144. Ekborg NA, Morrill W, Burgoyne AM, Li L, Distel DL. CelAB, a multifunctional cellulase encoded by Teredinibacter turnerae T7902T, a culturable symbiont isolated from the wood-boring marine bivalve Lyrodus pedicellatus. Appl Environ Microbiol 2007; 73:7785-8; PMID:17933945; http://dx.doi.org/ 10.1128/AEM.00876-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 145. Ge H, Roeder RG. Purification, cloning, and characterization of a human coactivator, PC4, that mediates transcriptional activation of class II genes. Cell 1994; 78:513-23; PMID:8062391; http://dx.doi.org/ 10.1016/0092-8674(94)90428-6 [DOI] [PubMed] [Google Scholar]
  • 146. Kaiser K, Stelzer G, Meisterernst M. The coactivator p15 (PC4) initiates transcriptional activation during TFIIA-TFIID-promoter complex formation. EMBO J 1995; 14:3520-7; PMID:7628453 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147. Jonker HR, Wechselberger RW, Boelens R, Kaptein R, Folkers GE. The intrinsically unstructured domain of PC4 modulates the activity of the structured core through inter- and intramolecular interactions. Biochemistry 2006; 45:5067-81; PMID:16605275; http://dx.doi.org/ 10.1021/bi052531b [DOI] [PubMed] [Google Scholar]
  • 148. Xu ZP, Dutra A, Stellrecht CM, Wu C, Piatigorsky J, Saunders GF. Functional and structural characterization of the human gene BHLHB5, encoding a basic helix-loop-helix transcription factor. Genomics 2002; 80:311-8; PMID:12213201; http://dx.doi.org/ 10.1006/geno.2002.6833 [DOI] [PubMed] [Google Scholar]
  • 149. Lappin TR, Grier DG, Thompson A, Halliday HL. HOX genes: seductive science, mysterious mechanisms. Ulster Med J 2006; 75:23-31; PMID:16457401 [PMC free article] [PubMed] [Google Scholar]
  • 150. Scott V, Morgan EA, Stadler HS. Genitourinary functions of Hoxa13 and Hoxd13. J Biochem 2005; 137:671-6; PMID:16002988; http://dx.doi.org/ 10.1093/jb/mvi086 [DOI] [PubMed] [Google Scholar]
  • 151. Liu J, Perumal NB, Oldfield CJ, Su EW, Uversky VN, Dunker AK. Intrinsic disorder in transcription factors. Biochemistry 2006; 45:6873-88; PMID:16734424; http://dx.doi.org/ 10.1021/bi0602718 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 152. Dunker AK, Uversky VN. Drugs for 'protein clouds': targeting intrinsically disordered transcription factors. Curr Opin Pharmacol 2010; 10:782-8; PMID:20889377; http://dx.doi.org/ 10.1016/j.coph.2010.09.005 [DOI] [PubMed] [Google Scholar]
  • 153. Westerheide SD, Raynes R, Powell C, Xue B, Uversky VN. HSF transcription factor family, heat shock response, and protein intrinsic disorder. Curr Protein Pept Sci 2012; 13:86-103; PMID:22044151; http://dx.doi.org/ 10.2174/138920312799277956 [DOI] [PubMed] [Google Scholar]
  • 154. Chew KY, Yu H, Pask AJ, Shaw G, Renfree MB. HOXA13 and HOXD13 expression during development of the syndactylous digits in the marsupial Macropus eugenii. BMC Dev Biol 2012; 12:2; PMID:22235805; http://dx.doi.org/ 10.1186/1471-213X-12-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 155. Ouyang P. SRrp37, a novel splicing regulator located in the nuclear speckles and nucleoli, interacts with SC35 and modulates alternative pre-mRNA splicing in vivo. J Cell Biochem 2009; 108:304-14; PMID:19582790; http://dx.doi.org/ 10.1002/jcb.22255 [DOI] [PubMed] [Google Scholar]
  • 156. Cortese MS, Uversky VN, Dunker AK. Intrinsic disorder in scaffold proteins: getting more from less. Prog Biophys Mol Biol 2008; 98:85-106; PMID:18619997; http://dx.doi.org/ 10.1016/j.pbiomolbio.2008.05.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 157. Fenster SD, Kessels MM, Qualmann B, Chung WJ, Nash J, Gundelfinger ED, Garner CC. Interactions between Piccolo and the actin/dynamin-binding protein Abp1 link vesicle endocytosis to presynaptic active zones. J Biol Chem 2003; 278:20268-77; PMID:12654920; http://dx.doi.org/ 10.1074/jbc.M210792200 [DOI] [PubMed] [Google Scholar]
  • 158. Ohtsuka T. CAST: functional scaffold for the integrity of the presynaptic active zone. Neurosci Res 2013; 76:10-5; PMID:23517713; http://dx.doi.org/ 10.1016/j.neures.2013.03.003 [DOI] [PubMed] [Google Scholar]
  • 159. Aoki K, Taketo MM. Adenomatous polyposis coli (APC): a multi-functional tumor suppressor gene. J Cell Sci 2007; 120:3327-35; PMID:17881494; http://dx.doi.org/ 10.1242/jcs.03485 [DOI] [PubMed] [Google Scholar]
  • 160. McCartney BM, Nathke IS. Cell regulation by the Apc protein Apc as master regulator of epithelia. Curr Opin Cell Biol 2008; 20:186-93; PMID:18359618; http://dx.doi.org/ 10.1016/j.ceb.2008.02.001 [DOI] [PubMed] [Google Scholar]
  • 161. Xue B, Dunker AK, Uversky VN. The roles of intrinsic disorder in orchestrating the Wnt-pathway. J Biomol Struct Dyn 2012; 29:843-61; PMID:22292947; http://dx.doi.org/ 10.1080/073911012010525024 [DOI] [PubMed] [Google Scholar]
  • 162. Minde DP, Anvarian Z, Rudiger SG, Maurice MM. Messing up disorder: how do missense mutations in the tumor suppressor protein APC lead to cancer? Mol Cancer 2011; 10:101; PMID:21859464; http://dx.doi.org/ 10.1186/1476-4598-10-101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 163. Kinzler KW, Vogelstein B. Lessons from hereditary colorectal cancer. Cell 1996; 87:159-70; PMID:8861899; http://dx.doi.org/ 10.1016/S0092-8674(00)81333-1 [DOI] [PubMed] [Google Scholar]
  • 164. Minde DP, Radli M, Forneris F, Maurice MM, Rudiger SG. Large extent of disorder in Adenomatous Polyposis Coli offers a strategy to guard Wnt signalling against point mutations. PloS One 2013; 8:e77257; PMID:24130866; http://dx.doi.org/ 10.1371/journal.pone.0077257 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 165. Tu SW, Bugde A, Luby-Phelps K, Cobb MH. WNK1 is required for mitosis and abscission. Proc Natl Acad Sci U S A 2011; 108:1385-90; PMID:21220314; http://dx.doi.org/ 10.1073/pnas.1018567108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 166. Webby CJ, Wolf A, Gromak N, Dreger M, Kramer H, Kessler B, Nielsen ML, Schmitz C, Butler DS, Yates JR 3rd, et al. Jmjd6 catalyses lysyl-hydroxylation of U2AF65, a protein associated with RNA splicing. Science 2009; 325:90-3; PMID:19574390; http://dx.doi.org/ 10.1126/science.1175865 [DOI] [PubMed] [Google Scholar]
  • 167. Boeckel JN, Guarani V, Koyanagi M, Roexe T, Lengeling A, Schermuly RT, Gellert P, Braun T, Zeiher A, Dimmeler S. Jumonji domain-containing protein 6 (Jmjd6) is required for angiogenic sprouting and regulates splicing of VEGF-receptor 1. Proc Natl Acad Sci U S A 2011; 108:3276-81; PMID:21300889; http://dx.doi.org/ 10.1073/pnas.1008098108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 168. Mantri M, Loik ND, Hamed RB, Claridge TD, McCullagh JS, Schofield CJ. The 2-oxoglutarate-dependent oxygenase JMJD6 catalyses oxidation of lysine residues to give 5S-hydroxylysine residues. Chembiochem 2011; 12:531-4; PMID:22238144; http://dx.doi.org/ 10.1002/cbic.201000641 [DOI] [PubMed] [Google Scholar]
  • 169. Wolf A, Mantri M, Heim A, Muller U, Fichter E, Mackeen MM, Schermelleh L, Dadie G, Leonhardt H, Venien-Bryan C, et al. The polyserine domain of the lysyl-5 hydroxylase Jmjd6 mediates subnuclear localization. Biochem J 2013; 453:357-70; PMID:23688307; http://dx.doi.org/ 10.1042/BJ20130529 [DOI] [PubMed] [Google Scholar]
  • 170. Wolf A, Schmitz C, Bottger A. Changing story of the receptor for phosphatidylserine-dependent clearance of apoptotic cells. EMBO Rep 2007; 8:465-9; PMID:17471263; http://dx.doi.org/ 10.1038/sj.embor.7400956 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 171. Cikala M, Alexandrova O, David CN, Proschel M, Stiening B, Cramer P, Bottger A. The phosphatidylserine receptor from Hydra is a nuclear protein with potential Fe(II) dependent oxygenase activity. BMC Cell Biol 2004; 5:26; PMID:15193161; http://dx.doi.org/ 10.1186/1471-2121-5-26 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 172. Hahn P, Bose J, Edler S, Lengeling A. Genomic structure and expression of Jmjd6 and evolutionary analysis in the context of related JmjC domain containing proteins. BMC Genomics 2008; 9:293; PMID:18564434; http://dx.doi.org/ 10.1186/1471-2164-9-293 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 173. Hong X, Zang J, White J, Wang C, Pan CH, Zhao R, Murphy RC, Dai S, Henson P, Kappler JW, et al. Interaction of JMJD6 with single-stranded RNA. Proc Natl Acad Sci U S A 2010; 107:14568-72; PMID:20679243; http://dx.doi.org/ 10.1073/pnas.1008832107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 174. Hayward A, Takahashi T, Bendena WG, Tobe SS, Hui JH. Comparative genomic and phylogenetic analysis of vitellogenin and other large lipid transfer proteins in metazoans. FEBS Lett 2010; 584:1273-8; PMID:20188099; http://dx.doi.org/ 10.1016/j.febslet.2010.02.056 [DOI] [PubMed] [Google Scholar]
  • 175. Havukainen H, Underhaug J, Wolschin F, Amdam G, Halskau O. A vitellogenin polyserine cleavage site: highly disordered conformation protected from proteolysis by phosphorylation. J Exp Biol 2012; 215:1837-46; PMID:22573762; http://dx.doi.org/ 10.1242/jeb.065623 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 176. Kovacs D, Kalmar E, Torok Z, Tompa P. Chaperone activity of ERD10 and ERD14, two disordered stress-related plant proteins. Plant Physiol 2008; 147:381-90; PMID:18359842; http://dx.doi.org/ 10.1104/pp.108.118208 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 177. Szalaine Agoston B, Kovacs D, Tompa P, Perczel A. Full backbone assignment and dynamics of the intrinsically disordered dehydrin ERD14. Biomol NMR Assign 2011; 5:189-93; PMID:21336827; http://dx.doi.org/ 10.1007/s12104-011-9297-2 [DOI] [PubMed] [Google Scholar]
  • 178. Alsheikh MK, Heyen BJ, Randall SK. Ion binding properties of the dehydrin ERD14 are dependent upon phosphorylation. J Biol Chem 2003; 278:40882-9; PMID:12917402; http://dx.doi.org/ 10.1074/jbc.M307151200 [DOI] [PubMed] [Google Scholar]
  • 179. Hasper A, Soteropoulos P, Perlin DS. Modification of the N-terminal polyserine cluster alters stability of the plasma membrane H(+)-ATPase from Saccharomyces cerevisiae. Biochim Biophys Acta 1999; 1420:214-22; PMID:10446304; http://dx.doi.org/ 10.1016/S0005-2736(99)00100-5 [DOI] [PubMed] [Google Scholar]
  • 180. Courtney RJ, Benyesh-Melnick M. Isolation and characterization of a large molecular-weight polypeptide of herpes simplex virus type 1. Virology 1974; 62:539-51; PMID:4372790; http://dx.doi.org/ 10.1016/0042-6822(74)90414-0 [DOI] [PubMed] [Google Scholar]
  • 181. Pereira L, Wolff MH, Fenwick M, Roizman B. Regulation of herpesvirus macromolecular synthesis. V. Properties of alpha polypeptides made in HSV-1 and HSV-2 infected cells. Virology 1977; 77:733-49; PMID:193259; http://dx.doi.org/ 10.1016/0042-6822(77)90495-0 [DOI] [PubMed] [Google Scholar]
  • 182. Bates PA, DeLuca NA. The polyserine tract of herpes simplex virus ICP4 is required for normal viral gene expression and growth in murine trigeminal ganglia. J Virol 1998; 72:7115-24; PMID:9696805 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 183. Xue B, Blocquel D, Habchi J, Uversky AV, Kurgan L, Uversky VN, Longhi S. Structural disorder in viral proteins. Chem Rev 2014; 114:6880-911; PMID:24823319; http://dx.doi.org/ 10.1021/cr4005692 [DOI] [PubMed] [Google Scholar]
  • 184. DeLuca NA, Schaffer PA. Activities of herpes simplex virus type 1 (HSV-1) ICP4 genes specifying nonsense peptides. Nucleic Acids Res 1987; 15:4491-511; PMID:3035496; http://dx.doi.org/ 10.1093/nar/15.11.4491 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 185. Everett RD. Trans activation of transcription by herpes virus products: requirement for two HSV-1 immediate-early polypeptides for maximum activity. EMBO J 1984; 3:3135-41; PMID:6098466 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 186. Gelman IH, Silverstein S. Identification of immediate early genes from herpes simplex virus that transactivate the virus thymidine kinase gene. Proc Natl Acad Sci U S A 1985; 82:5265-9; PMID:2991915; http://dx.doi.org/ 10.1073/pnas.82.16.5265 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 187. Godowski PJ, Knipe DM. Transcriptional control of herpesvirus gene expression: gene functions required for positive and negative regulation. Proc Natl Acad Sci U S A 1986; 83:256-60; PMID:3001729; http://dx.doi.org/ 10.1073/pnas.83.2.256 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 188. O'Hare P, Hayward GS. Evidence for a direct role for both the 175,000- and 110,000-molecular-weight immediate-early proteins of herpes simplex virus in the transactivation of delayed-early promoters. J Virol 1985; 53:751-60; PMID:2983086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 189. Batchelor AH, O'Hare P. Regulation and cell-type-specific activity of a promoter located upstream of the latency-associated transcript of herpes simplex virus type 1. J Virol 1990; 64:3269-79; PMID:2161941 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 190. Batchelor AH, O'Hare P. Localization of cis-acting sequence requirements in the promoter of the latency-associated transcript of herpes simplex virus type 1 required for cell-type-specific activity. J Virol 1992; 66:3573-82; PMID:1316469 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 191. Rivera-Gonzalez R, Imbalzano AN, Gu B, Deluca NA. The role of ICP4 repressor activity in temporal expression of the IE-3 and latency-associated transcript promoters during HSV-1 infection. Virology 1994; 202:550-64; PMID:8030221; http://dx.doi.org/ 10.1006/viro.1994.1377 [DOI] [PubMed] [Google Scholar]
  • 192. Bohenzky RA, Papavassiliou AG, Gelman IH, Silverstein S. Identification of a promoter mapping within the reiterated sequences that flank the herpes simplex virus type 1 UL region. J Virol 1993; 67:632-42; PMID:8380459 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 193. Yeh L, Schaffer PA. A novel class of transcripts expressed with late kinetics in the absence of ICP4 spans the junction between the long and short segments of the herpes simplex virus type 1 genome. J Virol 1993; 67:7373-82; PMID:7901428 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 194. Paterson T, Everett RD. Mutational dissection of the HSV-1 immediate-early protein Vmw175 involved in transcriptional transactivation and repression. Virology 1988; 166:186-96; PMID:2842944; http://dx.doi.org/ 10.1016/0042-6822(88)90160-2 [DOI] [PubMed] [Google Scholar]
  • 195. O'Hare P, Hayward GS. Three trans-acting regulatory proteins of herpes simplex virus modulate immediate-early gene expression in a pathway involving positive and negative feedback regulation. J Virol 1985; 56:723-33; PMID:2999428 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 196. DeLuca NA, Schaffer PA. Activation of immediate-early, early, and late promoters by temperature-sensitive and wild-type forms of herpes simplex virus type 1 protein ICP4. Mol Cell Biol 1985; 5:1997-2008; PMID:3018543 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 197. Paterson T, Everett RD. The regions of the herpes simplex virus type 1 immediate early protein Vmw175 required for site specific DNA binding closely correspond to those involved in transcriptional regulation. Nucleic Acids Res 1988; 16:11005-25; PMID:2849757; http://dx.doi.org/ 10.1093/nar/16.23.11005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 198. Shepard AA, Imbalzano AN, DeLuca NA. Separation of primary structural components conferring autoregulation, transactivation, and DNA-binding properties to the herpes simplex virus transcriptional regulatory protein ICP4. J Virol 1989; 63:3714-28; PMID:2760981 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 199. DeLuca NA, McCarthy AM, Schaffer PA. Isolation and characterization of deletion mutants of herpes simplex virus type 1 in the gene encoding immediate-early regulatory protein ICP4. J Virol 1985; 56:558-70; PMID:2997476 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 200. Gu B, DeLuca N. Requirements for activation of the herpes simplex virus glycoprotein C promoter in vitro by the viral regulatory protein ICP4. J Virol 1994; 68:7953-65; PMID:7966586 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 201. Carrozza MJ, DeLuca NA. Interaction of the viral activator protein ICP4 with TFIID through TAF250. Mol Cell Biol 1996; 16:3085-93; PMID:8649420 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 202. Smith CA, Bates P, Rivera-Gonzalez R, Gu B, DeLuca NA. ICP4, the major transcriptional regulatory protein of herpes simplex virus type 1, forms a tripartite complex with TATA-binding protein and TFIIB. J Virol 1993; 67:4676-87; PMID:8392607 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 203. Samaniego LA, Webb AL, DeLuca NA. Functional interactions between herpes simplex virus immediate-early proteins during infection: gene expression as a consequence of ICP27 and different domains of ICP4. J Virol 1995; 69:5705-15; PMID:7637016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 204. Xia K, DeLuca NA, Knipe DM. Analysis of phosphorylation sites of herpes simplex virus type 1 ICP4. J Virol 1996; 70:1061-71; PMID:8551564 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 205. Xia K, Knipe DM, DeLuca NA. Role of protein kinase A and the serine-rich region of herpes simplex virus type 1 ICP4 in viral replication. J Virol 1996; 70:1050-60; PMID:8551563 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 206. Yap AS, Crampton MS, Hardin J. Making and breaking contacts: the cellular biology of cadherin regulation. Curr Opin Cell Biol 2007; 19:508-14; PMID:17935963; http://dx.doi.org/ 10.1016/j.ceb.2007.09.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 207. Grossmann J. Molecular mechanisms of "detachment-induced apoptosis–Anoikis". Apoptosis 2002; 7:247-60; PMID:11997669; http://dx.doi.org/ 10.1023/A:1015312119693 [DOI] [PubMed] [Google Scholar]
  • 208. Gumbiner BM. Regulation of cadherin adhesive activity. J Cell Biol 2000; 148:399-404; PMID:10662767; http://dx.doi.org/ 10.1083/jcb.148.3.399 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 209. Geng F, Zhu W, Anderson RA, Leber B, Andrews DW. Multiple post-translational modifications regulate E-cadherin transport during apoptosis. J Cell Sci 2012; 125:2615-25; PMID:22375065; http://dx.doi.org/ 10.1242/jcs.096735 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 210. Frappier L. EBNA1 and host factors in Epstein-Barr virus latent DNA replication. Curr Opin Virol 2012; 2:733-9; PMID:23031715; http://dx.doi.org/ 10.1016/j.coviro.2012.09.005 [DOI] [PubMed] [Google Scholar]
  • 211. Everett RD, Chelbi-Alix MK. PML and PML nuclear bodies: implications in antiviral defence. Biochimie 2007; 89:819-30; PMID:17343971; http://dx.doi.org/ 10.1016/j.biochi.2007.01.004 [DOI] [PubMed] [Google Scholar]
  • 212. Sivachandran N, Sarkari F, Frappier L. Epstein-Barr nuclear antigen 1 contributes to nasopharyngeal carcinoma through disruption of PML nuclear bodies. PLoS Pathog 2008; 4:e1000170; PMID:18833293; http://dx.doi.org/ 10.1371/journal.ppat.1000170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 213. Sivachandran N, Wang X, Frappier L. Functions of the Epstein-Barr virus EBNA1 protein in viral reactivation and lytic infection. J Virol 2012; 86:6146-58; PMID:22491455; http://dx.doi.org/ 10.1128/JVI.00013-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 214. Cao JY, Shire K, Landry C, Gish GD, Pawson T, Frappier L. Identification of a novel protein interaction motif in the regulatory subunit of casein kinase 2. Mol Cell Biol 2014; 34:246-58; PMID:24216761; http://dx.doi.org/ 10.1128/MCB.00968-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 215. Witze ES, Old WM, Resing KA, Ahn NG. Mapping protein post-translational modifications with mass spectrometry. Nat Methods 2007; 4:798-806; PMID:17901869; http://dx.doi.org/ 10.1038/nmeth1100 [DOI] [PubMed] [Google Scholar]
  • 216. Xin F, Radivojac P. Post-translational modifications induce significant yet not extreme changes to protein structure. Bioinformatics 2012; 28:2905-13; PMID:22947645; http://dx.doi.org/ 10.1093/bioinformatics/bts541 [DOI] [PubMed] [Google Scholar]
  • 217. Marks F. Protein Phosphorylation. New York, Basel, Cambridge, Tokyo: VCH Weinheim; 1996. [Google Scholar]
  • 218. Zor T, Mayr BM, Dyson HJ, Montminy MR, Wright PE. Roles of phosphorylation and helix propensity in the binding of the KIX domain of CREB-binding protein by constitutive (c-Myb) and inducible (CREB) activators. J Biol Chem 2002; 277:42241-8; PMID:12196545; http://dx.doi.org/ 10.1074/jbc.M207361200 [DOI] [PubMed] [Google Scholar]
  • 219. Hay TJ, Meek DW. Multiple sites of in vivo phosphorylation in the MDM2 oncoprotein cluster within two important functional domains. FEBS Lett 2000; 478:183-6; PMID:10922493; http://dx.doi.org/ 10.1016/S0014-5793(00)01850-0 [DOI] [PubMed] [Google Scholar]
  • 220. Kreegipuu A, Blom N, Brunak S. PhosphoBase, a database of phosphorylation sites: release 2.0. Nucleic Acids Res 1999; 27:237-9; PMID:9847189; http://dx.doi.org/ 10.1093/nar/27.1.237 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 221. Iakoucheva LM, Radivojac P, Brown CJ, O'Connor TR, Sikes JG, Obradovic Z, Dunker AK. The importance of intrinsic disorder for protein phosphorylation. Nucleic Acids Res 2004; 32:1037-49; PMID:14960716; http://dx.doi.org/ 10.1093/nar/gkh253 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 222. Pejaver V, Hsu WL, Xin F, Dunker AK, Uversky VN, Radivojac P. The structural and functional signatures of proteins that undergo multiple events of post-translational modification. Protein Sci 2014; 23:1077-93; PMID:24888500; http://dx.doi.org/ 10.1002/pro.2494 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 223. Carraway KL, Hull SR. O-glycosylation pathway for mucin-type glycoproteins. BioEssays 1989; 10:117-21; PMID:2658987; http://dx.doi.org/ 10.1002/bies.950100406 [DOI] [PubMed] [Google Scholar]
  • 224. Hounsell EF, Davies MJ, Renouf DV. O-linked protein glycosylation structure and function. Glycoconj J 1996; 13:19-26; PMID:8785483; http://dx.doi.org/ 10.1007/BF01049675 [DOI] [PubMed] [Google Scholar]
  • 225. Kuwano M, Seguchi T, Ono M. Glycosylation mutations of serine/threonine-linked oligosaccharides in low-density lipoprotein receptor: indispensable roles of O-glycosylation. J Cell Sci 1991; 98(Pt 2):131-4; PMID:1905298 [DOI] [PubMed] [Google Scholar]
  • 226. Harris RJ, Spellman MW. O-linked fucose and other post-translational modifications unique to EGF modules. Glycobiology 1993; 3:219-24; PMID:8358148; http://dx.doi.org/ 10.1093/glycob/3.3.219 [DOI] [PubMed] [Google Scholar]
  • 227. Walsh CT, Gehring AM, Weinreb PH, Quadri LE, Flugel RS. Post-translational modification of polyketide and nonribosomal peptide synthases. Curr Opin Chem Biol 1997; 1:309-15; PMID:9667867; http://dx.doi.org/ 10.1016/S1367-5931(97)80067-1 [DOI] [PubMed] [Google Scholar]
  • 228. Kleinkauf H, Von Dohren H. A nonribosomal system of peptide biosynthesis. Eur J Biochem 1996; 236:335-51; PMID:8612601; http://dx.doi.org/ 10.1111/j.1432-1033.1996.00335.x [DOI] [PubMed] [Google Scholar]
  • 229. Weinreb PH, Quadri LE, Walsh CT, Zuber P. Stoichiometry and specificity of in vitro phosphopantetheinylation and aminoacylation of the valine-activating module of surfactin synthetase. Biochemistry 1998; 37:1575-84; PMID:9484228; http://dx.doi.org/ 10.1021/bi9719859 [DOI] [PubMed] [Google Scholar]
  • 230. Weber G, Schorgendorfer K, Schneider-Scherzer E, Leitner E. The peptide synthetase catalyzing cyclosporine production in Tolypocladium niveum is encoded by a giant 45.8-kilobase open reading frame. Curr Genet 1994; 26:120-5; PMID:8001164; http://dx.doi.org/ 10.1007/BF00313798 [DOI] [PubMed] [Google Scholar]
  • 231. Hashimoto M, Nonaka T, Fujii I. Fungal type III polyketide synthases. Nat Prod Rep 2014; 31:1306-17; PMID:25182423; http://dx.doi.org/ 10.1039/C4NP00096J [DOI] [PubMed] [Google Scholar]
  • 232. Fujii I, Watanabe A, Sankawa U, Ebizuka Y. Identification of Claisen cyclase domain in fungal polyketide synthase WA, a naphthopyrone synthase of Aspergillus nidulans. Chem Biol 2001; 8:189-97; PMID:11251292; http://dx.doi.org/ 10.1016/S1074-5521(00)90068-1 [DOI] [PubMed] [Google Scholar]
  • 233. Abe I, Morita H. Structure and function of the chalcone synthase superfamily of plant type III polyketide synthases. Nat Prod Rep 2010; 27:809-38; PMID:20358127; http://dx.doi.org/ 10.1039/b909988n [DOI] [PubMed] [Google Scholar]
  • 234. Liu XQ. Protein-splicing intein: Genetic mobility, origin, and evolution. Ann Rev Genet 2000; 34:61-76; PMID:11092822; http://dx.doi.org/ 10.1146/annurev.genet.34.1.61 [DOI] [PubMed] [Google Scholar]
  • 235. Pietrokovski S. Intein spread and extinction in evolution. Trends Genet 2001; 17:465-72; PMID:11485819; http://dx.doi.org/ 10.1016/S0168-9525(01)02365-4 [DOI] [PubMed] [Google Scholar]
  • 236. Anraku Y, Mizutani R, Satow Y. Protein splicing: its discovery and structural insight into novel chemical mechanisms. IUBMB Life 2005; 57:563-74; PMID:16118114; http://dx.doi.org/ 10.1080/15216540500215499 [DOI] [PubMed] [Google Scholar]
  • 237. Gogarten JP, Hilario E. Inteins, introns, and homing endonucleases: recent revelations about the life cycle of parasitic genetic elements. BMC Evo Biol 2006; 6:94; PMID:17101053; http://dx.doi.org/ 10.1186/1471-2148-6-94 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 238. Noren CJ, Wang J, Perler FB. Dissecting the Chemistry of Protein Splicing and Its Applications. Angew Chemi 2000; 39:450-66; PMID:10671234; http://dx.doi.org/ 10.1002/(SICI)1521-3773(20000204)39:3%3c450::AID-ANIE450%3e3.0.CO;2-F [DOI] [PubMed] [Google Scholar]
  • 239. Clarke ND. A proposed mechanism for the self-splicing of proteins. Proc Natl Acad Sci U S A 1994; 91:11084-8; PMID:7972014; http://dx.doi.org/ 10.1073/pnas.91.23.11084 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 240. Perler FB, Xu MQ, Paulus H. Protein splicing and autoproteolysis mechanisms. Curr Opin Chem Biol 1997; 1:292-9; PMID:9667864; http://dx.doi.org/ 10.1016/S1367-5931(97)80065-8 [DOI] [PubMed] [Google Scholar]
  • 241. Perler FB. Breaking up is easy with esters. Nat Struct Biol 1998; 5:249-52; PMID:9546209; http://dx.doi.org/ 10.1038/nsb0498-249 [DOI] [PubMed] [Google Scholar]
  • 242. Van Damme P, Hole K, Pimenta-Marques A, Helsens K, Vandekerckhove J, Martinho RG, Gevaert K, Arnesen T. NatF contributes to an evolutionary shift in protein N-terminal acetylation and is important for normal chromosome segregation. PLoS Genet 2011; 7:e1002169; PMID:21750686; http://dx.doi.org/ 10.1371/journal.pgen.1002169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 243. Jornvall H. Acetylation of Protein N-terminal amino groups structural observations on alpha-amino acetylated proteins. J Theoret Biol 1975; 55:1-12; PMID:1207150; http://dx.doi.org/ 10.1016/S0022-5193(75)80105-6 [DOI] [PubMed] [Google Scholar]
  • 244. Hwang CS, Shemorry A, Varshavsky A. N-terminal acetylation of cellular proteins creates specific degradation signals. Science 2010; 327:973-7; PMID:20110468; http://dx.doi.org/ 10.1126/science.1183147 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 245. Frottin F, Martinez A, Peynot P, Mitra S, Holz RC, Giglione C, Meinnel T. The proteomics of N-terminal methionine cleavage. Mol Cell Proteomics 2006; 5:2336-49; PMID:16963780; http://dx.doi.org/ 10.1074/mcp.M600225-MCP200 [DOI] [PubMed] [Google Scholar]
  • 246. Polevoda B, Sherman F. N-terminal acetyltransferases and sequence requirements for N-terminal acetylation of eukaryotic proteins. J Mol Biol 2003; 325:595-622; PMID:12507466; http://dx.doi.org/ 10.1016/S0022-2836(02)01269-X [DOI] [PubMed] [Google Scholar]
  • 247. Goetze S, Qeli E, Mosimann C, Staes A, Gerrits B, Roschitzki B, Mohanty S, Niederer EM, Laczko E, Timmerman E, et al. Identification and functional characterization of N-terminally acetylated proteins in Drosophila melanogaster. PLoS Biol 2009; 7:e1000236; PMID:19885390; http://dx.doi.org/ 10.1371/journal.pbio.1000236 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 248. Arnesen T, Van Damme P, Polevoda B, Helsens K, Evjenth R, Colaert N, Varhaug JE, Vandekerckhove J, Lillehaug JR, Sherman F, et al. Proteomics analyses reveal the evolutionary conservation and divergence of N-terminal acetyltransferases from yeast and humans. Proc Natl Acad Sci U S A 2009; 106:8157-62; PMID:19420222; http://dx.doi.org/ 10.1073/pnas.0901931106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 249. Hole K, Van Damme P, Dalva M, Aksnes H, Glomnes N, Varhaug JE, Lillehaug JR, Gevaert K, Arnesen T. The human N-alpha-acetyltransferase 40 (hNaa40p/hNatD) is conserved from yeast and N-terminally acetylates histones H2A and H4. PloS One 2011; 6:e24713; PMID:21935442; http://dx.doi.org/ 10.1371/journal.pone.0024713 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 250. Mukherjee S, Keitany G, Li Y, Wang Y, Ball HL, Goldsmith EJ, Orth K. Yersinia YopJ acetylates and inhibits kinase activation by blocking phosphorylation. Science 2006; 312:1211-4; PMID:16728640; http://dx.doi.org/ 10.1126/science.1126867 [DOI] [PubMed] [Google Scholar]
  • 251. Mittal R, Peak-Chew SY, McMahon HT. Acetylation of MEK2 and I kappa B kinase (IKK) activation loop residues by YopJ inhibits signaling. Proc Natl Acad Sci U S A 2006; 103:18574-9; PMID:17116858; http://dx.doi.org/ 10.1073/pnas.0608995103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 252. Mukherjee S, Hao YH, Orth K. A newly discovered post-translational modification–the acetylation of serine and threonine residues. Trends Biochem Sci 2007; 32:210-6; PMID:17412595; http://dx.doi.org/ 10.1016/j.tibs.2007.03.007 [DOI] [PubMed] [Google Scholar]
  • 253. Cervantes-Laurean D, Loflin PT, Minter DE, Jacobson EL, Jacobson MK. Protein modification by ADP-ribose via acid-labile linkages. J Biol Chem 1995; 270:7929-36; PMID:7713889; http://dx.doi.org/ 10.1074/jbc.270.14.7929 [DOI] [PubMed] [Google Scholar]
  • 254. Di Paola S, Micaroni M, Di Tullio G, Buccione R, Di Girolamo M. PARP16/ARTD15 is a novel endoplasmic-reticulum-associated mono-ADP-ribosyltransferase that interacts with, and modifies karyopherin-ss1. PloS One 2012; 7:e37352; PMID:22701565; http://dx.doi.org/ 10.1371/journal.pone.0037352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 255. Feijs KL, Verheugd P, Luscher B. Expanding functions of intracellular resident mono-ADP-ribosylation in cell physiology. Febs J 2013; 280:3519-29; PMID:23639026; http://dx.doi.org/ 10.1111/febs.12315 [DOI] [PubMed] [Google Scholar]
  • 256. Bernstein LS, Linder ME, Hepler JR. Analysis of RGS protein palmitoylation. Methods Mol Biol 2004; 237:195-204; PMID:14501051 [DOI] [PubMed] [Google Scholar]
  • 257. Magee AI, Gutierrez L, McKay IA, Marshall CJ, Hall A. Dynamic fatty acylation of p21N-ras. EMBO J 1987; 6:3353-7; PMID:3322807 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 258. Takada R, Satomi Y, Kurata T, Ueno N, Norioka S, Kondoh H, Takao T, Takada S. Monounsaturated fatty acid modification of Wnt protein: its role in Wnt secretion. Dev Cell 2006; 11:791-801; PMID:17141155; http://dx.doi.org/ 10.1016/j.devcel.2006.10.003 [DOI] [PubMed] [Google Scholar]
  • 259. Zou C, Ellis BM, Smith RM, Chen BB, Zhao Y, Mallampalli RK. Acyl-CoA:lysophosphatidylcholine acyltransferase I (Lpcat1) catalyzes histone protein O-palmitoylation to regulate mRNA synthesis. J Biol Chem 2011; 286:28019-25; PMID:21685381; http://dx.doi.org/ 10.1074/jbc.M111.253385 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 260. Sato T, Nakamura Y, Shiimura Y, Ohgusu H, Kangawa K, Kojima M. Structure, regulation and function of ghrelin. J Biochem 2012; 151:119-28; PMID:22041973; http://dx.doi.org/ 10.1093/jb/mvr134 [DOI] [PubMed] [Google Scholar]
  • 261. Kaiya H, Kojima M, Hosoda H, Riley LG, Hirano T, Grau EG, Kangawa K. Amidated fish ghrelin: purification, cDNA cloning in the Japanese eel and its biological activity. J Endocrinol 2003; 176:415-23; PMID:12630926; http://dx.doi.org/ 10.1677/joe.0.1760415 [DOI] [PubMed] [Google Scholar]
  • 262. Medzihradszky KF, Darula Z, Perlson E, Fainzilber M, Chalkley RJ, Ball H, Greenbaum D, Bogyo M, Tyson DR, Bradshaw RA, et al. O-sulfonation of serine and threonine: mass spectrometric detection and characterization of a new posttranslational modification in diverse proteins throughout the eukaryotes. Mol Cell Proteomics 2004; 3:429-40; PMID:14752058; http://dx.doi.org/ 10.1074/mcp.M300140-MCP200 [DOI] [PubMed] [Google Scholar]
  • 263. Moore KL. The biology and enzymology of protein tyrosine O-sulfation. J Biol Chem 2003; 278:24243-6; PMID:12730193; http://dx.doi.org/ 10.1074/jbc.R300008200 [DOI] [PubMed] [Google Scholar]
  • 264. Ryan MP, Jack RW, Josten M, Sahl HG, Jung G, Ross RP, Hill C. Extensive post-translational modification, including serine to D-alanine conversion, in the two-component lantibiotic, lacticin 3147. J Biol Chem 1999; 274:37544-50; PMID:10608807; http://dx.doi.org/ 10.1074/jbc.274.53.37544 [DOI] [PubMed] [Google Scholar]
  • 265. Gross E, Morell JL. The structure of nisin. J Am Chem Soc 1971; 93:4634-5; PMID:5131162; http://dx.doi.org/ 10.1021/ja00747a073 [DOI] [PubMed] [Google Scholar]
  • 266. Sahl HG, Bierbaum G. Lantibiotics: biosynthesis and biological activities of uniquely modified peptides from gram-positive bacteria. Annu Rev Microbiol 1998; 52:41-79; PMID:9891793; http://dx.doi.org/ 10.1146/annurev.micro.52.1.41 [DOI] [PubMed] [Google Scholar]
  • 267. Skaugen M, Nissen-Meyer J, Jung G, Stevanovic S, Sletten K, Inger C, Abildgaard M, Nes IF. In vivo conversion of L-serine to D-alanine in a ribosomally synthesized polypeptide. J Biol Chem 1994; 269:27183-5; PMID:7961627 [PubMed] [Google Scholar]
  • 268. Cuttitta F, Kasprzyk PG, Treston AM, Avis I, Jensen S, Levitt M, Siegfried J, Mobley C, Mulshine JL. Autocrine groeth factors that regulate the proliferation of pulmonary malignances in man. Biology, Toxicology, and Carcinogenesis of Respiratory Epithelium. Thomassen DG, Nettesheim P, eds. USA: Hemisphere Publishing Corporation; 1990:228-70. [Google Scholar]
  • 269. Hart GW, Slawson C, Ramirez-Correa G, Lagerlof O. Cross talk between O-GlcNAcylation and phosphorylation: roles in signaling, transcription, and chronic disease. Annu Rev Biochem 2011; 80:825-58; PMID:21391816; http://dx.doi.org/ 10.1146/annurev-biochem-060608-102511 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 270. Cheng X, Hart GW. Alternative O-glycosylation/O-phosphorylation of serine-16 in murine estrogen receptor beta: post-translational regulation of turnover and transactivation activity. J Biol Chem 2001; 276:10570-5; PMID:11150304; http://dx.doi.org/ 10.1074/jbc.M010411200 [DOI] [PubMed] [Google Scholar]
  • 271. Wang Z, Gucek M, Hart GW. Cross-talk between GlcNAcylation and phosphorylation: site-specific phosphorylation dynamics in response to globally elevated O-GlcNAc. Proc Natl Acad Sci U S A 2008; 105:13793-8; PMID:18779572; http://dx.doi.org/ 10.1073/pnas.0806216105 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 272. Gusella JF, MacDonald ME. Molecular genetics: unmasking polyglutamine triggers in neurodegenerative disease. Nat Rev Neurosci 2000; 1:109-15; PMID:11252773; http://dx.doi.org/ 10.1038/35039051 [DOI] [PubMed] [Google Scholar]
  • 273. Brown LY, Brown SA. Alanine tracts: the expanding story of human illness and trinucleotide repeats. Trends iGenet 2004; 20:51-8; PMID:14698619; http://dx.doi.org/ 10.1016/j.tig.2003.11.002 [DOI] [PubMed] [Google Scholar]
  • 274. Tome FM, Fardeau M. Nuclear inclusions in oculopharyngeal dystrophy. Acta Neuropathol 1980; 49:85-7; PMID:6243839; http://dx.doi.org/ 10.1007/BF00692226 [DOI] [PubMed] [Google Scholar]
  • 275. Uyama E, Tsukahara T, Goto K, Kurano Y, Ogawa M, Kim YJ, Uchino M, Arahata K. Nuclear accumulation of expanded PABP2 gene product in oculopharyngeal muscular dystrophy. Muscle Nerve 2000; 23:1549-54; PMID:11003790; http://dx.doi.org/ 10.1002/1097-4598(200010)23:10%3c1549::AID-MUS11%3e3.0.CO;2-0 [DOI] [PubMed] [Google Scholar]
  • 276. Davies JE, Berger Z, Rubinsztein DC. Oculopharyngeal muscular dystrophy: potential therapies for an aggregate-associated disorder. Int J Biochem Cell Biol 2006; 38:1457-62; PMID:16530457; http://dx.doi.org/ 10.1016/j.biocel.2006.01.016 [DOI] [PubMed] [Google Scholar]
  • 277. Gaspar C, Jannatipour M, Dion P, Laganiere J, Sequeiros J, Brais B, Rouleau GA. CAG tract of MJD-1 may be prone to frameshifts causing polyalanine accumulation. Hum Mol Genet 2000; 9:1957-66; PMID:10942424; http://dx.doi.org/ 10.1093/hmg/9.13.1957 [DOI] [PubMed] [Google Scholar]
  • 278. Toulouse A, Au-Yeung F, Gaspar C, Roussel J, Dion P, Rouleau GA. Ribosomal frameshifting on MJD-1 transcripts with long CAG tracts. Hum Mol Genet 2005; 14:2649-60; PMID:16087686; http://dx.doi.org/ 10.1093/hmg/ddi299 [DOI] [PubMed] [Google Scholar]
  • 279. Davies JE, Rubinsztein DC. Polyalanine and polyserine frameshift products in Huntington's disease. J Med Genet 2006; 43:893-6; PMID:16801344; http://dx.doi.org/ 10.1136/jmg.2006.044222 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 280. Pearson CE. Repeat associated non-ATG translation initiation: one DNA, two transcripts, seven reading frames, potentially nine toxic entities! PLoS Genet 2011; 7:e1002018; PMID:21423665; http://dx.doi.org/ 10.1371/journal.pgen.1002018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 281. Zu T, Gibbens B, Doty NS, Gomes-Pereira M, Huguet A, Stone MD, Margolis J, Peterson M, Markowski TW, Ingram MA, et al. Non-ATG-initiated translation directed by microsatellite expansions. Proc Natl Acad Sci U S A 2011; 108:260-5; PMID:21173221; http://dx.doi.org/ 10.1073/pnas.1013343108 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Intrinsically Disordered Proteins are provided here courtesy of Taylor & Francis

RESOURCES