Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2014 Jun;26:121–130. doi: 10.1016/j.sbi.2014.06.002

The evolution of enzyme function in the isomerases

Sergio Martinez Cuesta 1, Nicholas Furnham 2, Syed Asad Rahman 1, Ian Sillitoe 3, Janet M Thornton 1
PMCID: PMC4139412  PMID: 25000289

Highlights

  • Isomerases usually evolve to become enzymes from other EC classes.

  • Conservation of substrates is more common than conservation of bond changes.

  • The relationship between functional similarity and sequence similarity is not linear.

Abstract

The advent of computational approaches to measure functional similarity between enzymes adds a new dimension to existing evolutionary studies based on sequence and structure. This paper reviews research efforts aiming to understand the evolution of enzyme function in superfamilies, presenting a novel strategy to provide an overview of the evolution of enzymes belonging to an individual EC class, using the isomerases as an exemplar.


Current Opinion in Structural Biology 2014, 26:121–130

This review comes from a themed issue on Sequences and topology

Edited by L Aravind and Christine A Orengo

For a complete overview see the Issue and the Editorial

Available online 5th July 2014

http://dx.doi.org/10.1016/j.sbi.2014.06.002

0959-440X/© 2014 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/3.0/).

Introduction

Enzymes are life's workforce. They catalyse the biochemical reactions that are the basis of metabolism in all living organisms. The major route for creating new enzyme functions is gene duplication and subsequent evolution of one enzyme to another with a novel, though usually related, function. Under the pressures of survival and reproduction, innovating new functions at the metabolic level allows organisms to adapt to an environment of changing chemical conditions [1]; for example, bacterial resistance to man-made chemicals such as drugs or pesticides.

Previous work

Previous studies focusing on analysing enzyme superfamilies [2, 3] and directed evolution experiments [4] discovered aspects of how enzyme evolution is influenced by aspects of the chemistry of enzymes. The overall chemical reaction is often changed by recruiting different catalytic residues within an active site, whilst conserving a few residues required for the catalysis of at least one mechanistic step of the overall reaction [5]. Similarly, binding different substrates is commonly achieved by changing the residues involved in substrate binding and conserving residues involved in the overall reaction [6]. There is substantial evidence supporting changes of the overall chemical reaction [7], as well as results reporting the importance of binding different substrates in the evolution of function in superfamilies [8••, 9•, 10••]. Commonly, enzyme superfamilies evolve by a combination of these two strategies [11, 12]. For instance, phosphate binding sites are often conserved, whilst the rest of the substrate can be changed during evolution [13, 14].

Other comprehensive studies on the variation of enzyme sequence and structure [15, 16•] and plasticity of active sites [17, 18•] have also been fundamental in understanding how homologous enzymes accommodate alternative chemistries. Similarly, research on the convergent evolution of enzyme mechanisms [19] and active sites [20] presented nature's strategies to evolve different structural solutions for the catalysis of similar reactions [21, 22•, 23]. The widespread interest in understanding the evolution and chemistry of enzymes has led to large scale collaborative projects such as the Enzyme Function Initiative (EFI) [24] which aims to determine enzyme function using both experimental and computational approaches. Starting from a comprehensive alignment of genomic regions, Zhao and co-workers from the EFI have identified the epimerase activity, pathway context and biological role in osmoprotection of a structurally characterised enzyme of unknown function from P. bermudensis using a combination of virtual screening, metabolomics, transcriptomics and biochemical experiments [25••].

To explore this area further, we review our current knowledge of the evolution of the isomerase class of reactions, using newly developed computational tools to compare enzyme reactions [26••] and their evolution [27]. This is a specialised class of enzymes, which catalyse geometrical and structural rearrangements between isomers.

Biological relevance of isomerases

Isomerases are present in the metabolism and genome of most living organisms, catalysing up to 4% of the biochemical reactions present in central metabolism, in particular, carbohydrate metabolism. They also play a crucial role in the metabolism of terpenoids and polyketides that are important in generating secondary metabolites, especially in plants (Figure 1a).

Figure 1.

Figure 1

Biological importance of isomerases. (a) Core metabolic pathways (the isomerase reactions are emboldened in black). Carbohydrate and terpenoid/polyketide metabolic pathways are highlighted in blue and green squares, (b) Distribution of known enzymes in the human and E. coli genomes, (c) EC classification of isomerases. (d) Bond changes, reaction centres and structure of substrates and products obtained from the reaction catalysed by alanine racemase (EC 5.1.1.1) using EC-BLAST.

The relative proportion of enzymes encoding for isomerase activity depends on the species. Whereas 2.6% of the genes encoding for enzymatic activity corresponds to isomerases in Homo sapiens, this proportion is higher in bacterial genomes such as Escherichia coli where they account for 6.2%. These figures correlate with the relative proportion of protein-coding genes encoding for enzymatic activity in general. Whereas in human, 20% of genes correspond to enzymes, this value increases to 37% in bacteria (Figure 1b).

The Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) maintains the most widely used functional classification of isomerases in the Enzyme Commission (EC) classification system [28]. Isomerases belong to the EC 5 primary class and they are grouped according to the chemistry of the reactions that they catalyse. They are subdivided in three hierarchical levels: 6 subclasses, 17 sub-subclasses and 231 serial numbers (Figure 1c). These serial numbers are associated with almost 300 biochemical reactions — for example EC 5.1.1.9 describes the racemisation of arginine, lysine or ornithine and it is therefore linked to three distinct reactions.

From a practical viewpoint, the total number of isomerase EC numbers (231) is small compared to other EC classes, which makes them attractive for manual analysis. Three of the six isomerase EC subclasses are similar to three EC primary classes (intramolecular oxidoreductases — EC 5.3 are designated from oxidoreductases — EC 1; intramolecular transferases — EC 5.4 from transferases — EC 2; and intramolecular lyases — EC 5.5 from lyases — EC 4, but refer to intramolecular reactions). Lastly, most of the isomerase reactions are unimolecular (one substrate and one product), which makes them relatively easy to compare.

Isomerases are used in many applications. In metabolic engineering, xylose isomerase (EC 5.3.1.5) has been traditionally used to convert glucose to fructose in the syrup industry and has recently been engineered to increase the yield of alcohol-based biofuels in S. cerevisiae [29]. In organic synthesis, several racemases and epimerases (EC 5.1) have been employed to resolve racemic mixtures in mild conditions and for the production of stereochemically pure amino acids [30]. Efforts in enzyme design have also managed to successfully convert racemases and epimerases acting on amino acids and derivatives (EC 5.1.1) into enzymes with lyase activity (EC 4) [31, 32]. Ultimately, some racemases and epimerases acting on amino acids are also targets for the development of antimicrobial drugs and the treatment of neuropathological disorders [33].

Studies linking chemical details of the catalytic reaction with how enzyme sequences evolve considering multiple enzyme superfamilies are scarce. Whereas some studies have focused on analysing only the chemistry [34, 35], other studies concentrated on sequence and structure evolution [15, 22•]. Some literature is however available addressing certain aspects of the chemistry and evolution of specific isomerases. In the 1990s, mandelate racemase (EC 5.1.2.2) and muconate-lactonizing enzyme (EC 5.5.1.1), members of the enolase superfamily, were among the first enzymes reported to be highly structurally similar yet catalysing different overall reactions. Several isomerases belonging to this superfamily have been studied over the last two decades [7]. Successive research efforts focused on ketosteroid isomerase (EC 5.3.3.1) have also been fundamental in understanding basic principles of enzyme catalysis [36••]. In addition, general strategies to assign isomerase specificity have been recently presented [37, 38, 25••], as well as comparative genomic techniques to discover new isomerases in bacterial genomes [39]. Other investigations have partially explored isomerases in several superfamilies such as the haloacid dehalogenase, crotonase, vicinal oxygen chelate, amidohydrolase, alkaline phosphatase, cupin, short-chain dehydrogenase/reductase and PLP-dependent aspartate amino-transferase superfamilies [23, 40, 41, 42•, 43].

Methods for analysing sequence, structure and functional relationships

Protein similarity networks have been used very successfully to map biological information to large sets of proteins [44, 43]. However, it is also necessary to include associated changes of catalytic function during evolution preferably in an automated fashion. FunTree is a resource developed to accomplish that goal [27] and it is maintained in collaboration with the CATH classification of protein structures [45]. By combining sequence, structure, phylogenetic, chemical and mechanistic information, it allows one to answer fundamental questions about the link between enzyme activities and their evolutionary history in the context of superfamilies. FunTree uses phylogenetic methods to infer ancestral enzymes in superfamilies and estimate their most likely functions [46]. By traversing the generated phylogenetic tree from ancestor to modern enzymes, explicit changes of function are identified between groups of enzymes belonging to a superfamily. Ultimately, each functional change is represented by two sets of enzymes catalysing two distinct functions so both functions and enzymes are comparatively analysed using functional and all-against-all sequence similarity.

To explore the evolution of the isomerases, we have calculated the functional similarity between enzymes using EC-BLAST [26••], a recently developed algorithm to automatically compare biochemical reactions. This approach introduces three measures of functional similarity — comparison of bond changes, reaction centres and structure similarity of substrate(s) and product(s) — derived from the biochemical reaction catalysed by any given enzyme (Figure 1d). Bond changes refer to cleavage, formation and order change of chemical bonds and changes in stereochemistry of atoms and bonds. Reaction centres are molecular substructures representing the local environment around the atoms involved in bond changes. Last, the complete two-dimensional structures of substrate(s) and product(s) are also considered in the comparisons. These three measures are then combined with mechanistic data from MACiE [47] and extensive literature searches in order to inform our analyses.

Review of current status and availability of data on isomerase reactions and their sequences

Information related to the nomenclature of enzymes is publicly available in the ENZYME database [48]. It actively follows the recommendations of the NC-IUBMB and the 24-Jul-2013 version contained 231 current 4-digit isomerase EC numbers. 199 of them have sequence information in UniprotKB [49] and 32 are orphan isomerase EC numbers, also known as orphan enzymes [50, 51], a term given to EC numbers where no gene has been associated with these reactions and no sequence information is available in protein sequence repositories. Almost half of the isomerase EC numbers with sequence information (96) are present in FunTree [27] and Figure 2a shows the distribution by EC 5 subclass.

Figure 2.

Figure 2

(a) Distribution of isomerases in EC classification, UniprotKB, PDB and FunTree. EC exchange matrices representing the changes in function during evolution of isomerases at the EC (b) class and (c) subclass levels. More frequent changes of isomerase function are highlighted in red. Green and blue boxes represent changes within isomerases and with other EC classes, respectively. (d) Frequency of EC changes involving isomerases by superfamily. The 32 superfamilies bearing multiple changes are illustrated.

Protein structural data are available for 126 isomerase EC numbers, which have at least one entry in the PDB [52]. The 96 isomerases currently present in FunTree include domains, which are distributed across 81 CATH superfamilies: 17 are mostly alpha, 5 mostly beta and 59 mixed alpha/beta. Some superfamilies include more isomerases than others, for example, the superfamily UDP-galactose 4-epimerase, domain 1 (CATH 3.90.25.10) includes 7 racemases and epimerases (EC 5.1). In FunTree, one-third of the 96 isomerases including more than one domain superfamily (multidomain), with most of them including two or three superfamilies, but rarely more. Exceptionally, the subclass ‘other isomerases’ (EC 5.99), which has two EC numbers (EC 5.99.1.2 and 5.99.1.3) is distributed across seven and eight superfamilies, respectively. These are types I and II DNA topoisomerases, which are characterised by multiple domains required for the complex process of winding DNA [53].

Observed changes of isomerase function

Change in EC number

Analysis of FunTree data on 58 domain superfamilies identified a total of 145 unique changes of isomerase activity that occurred during evolution. Only one-fifth of the changes occur between isomerases whereas the rest involve changing from isomerases to perform reactions in other EC primary classes (Figure 2b). This is strikingly different from enzymes in other EC classes where changes in lower levels of the EC classification are more common than changes in the primary classification [8••]. Among the 26 changes within isomerases, only 3 change the EC subclass and 23 change the EC serial number, indicating a change in substrate (Figure 2c). A previous limited study of 24 pairs of enzymes reported that changes involving isomerases and lyases (EC 5↔EC 4) occur more often than changes to other EC classes [5]. Other analyses provided further evidence of these changes by revealing the structural insights of the evolution of an isomerase from a family of lyases, namely N-succinylamino acid racemase (EC 5.1.1.-) from o-succinylbenzoate synthases (EC 4.2.1.113) in the enolase superfamily [54]. Our comprehensive analysis confirms that such changes are indeed prevalent, with 39% of the 119 changes in primary classification involving lyases.

Most domain superfamilies show multiple changes of reaction chemistry involving different EC classes (Figure 2d). The most adaptable superfamily domains are aldolase class I (CATH 3.20.20.70) and glutaredoxin (CATH 3.40.30.10), each of them exhibiting 10 changes of isomerase function. Whereas the glutaredoxin ‘isomerase’ domain only exhibits changes of isomerase, oxidoreductase and transferase reactions, the aldolase class I domain has also evolved to become a hydrolase and lyase (Figure 2d).

Correlation of sequence and function evolution

To gain an overview of the relationship between sequence and functional divergence, an overall representation of the sequence and functional similarity between the homologous enzymes that perform different catalytic reactions is presented in Figure 3. This illustrates that most sequences have diverged considerably, with sequence identities in the range lower than 40%. The three measures of functional similarity (Figure 3a–c) capture different properties of the change in function, but none of the plots show any linear relationship between sequence and functional divergence. In addition, the distributions for each of these measures look quite different. In Figure 3a, which assesses the overall bond changes, there are two clusters, one consists of changes exhibiting bond change conservation when the isomerase EC subclass is maintained, and in the second changes at the isomerase EC subclass or EC primary class do not exhibit bond change conservation. This partition is not observed in the comparisons by reaction centres and structures of substrate(s) and product(s) and in overall, the similarities tend to be more uniformly spread (Figure 3b,c). Remarkably, there are only a few changes in which enzymes retain a relatively high degree of sequence and functional similarity. For instance, the glycosyltransferase superfamily (CATH 1.50.10.20) exhibits a change of arabidiol synthase (EC 4.2.1.124) into thalianol synthase (EC 5.4.99.31) (circled in red in Figure 3a–c). This change involves different enzyme sequences from the terpenoid biosynthesis pathway of Arabidopsis thaliana that share high sequence identity (79%) and high reaction similarity (48% — bond change, 72% — reaction centre and 84% — structure similarity). They both act on (S)-2,3-epoxysqualene as the main substrate to synthesise a different product, which explains why the structure similarity is high.

Figure 3.

Figure 3

Sequence and functional similarity of the 145 changes of isomerase function. The three scatterplots represent global sequence identity against overall reaction similarity as calculated using three measures (a) bond change (b) reaction centre and (c) structure similarity of substrate(s) and product(s). Each point represents one change of enzyme function involving two sets of enzymes catalysing two distinct functions each [27]. Average global sequence identities and standard deviations (error bars) from all-against-all pairwise comparisons between sequences corresponding to one function and those corresponding to the second function. Circled in red, the change EC 4.2.1.124→EC 5.4.99.31 (see main text). Pearson's correlation coefficients (r) range from 0.35 to 0.41 and indicate weak but significant linear relationships (p-value < 0.001). (d) Distribution of bond change and structure similarities averaged by CATH superfamily.

In an attempt to analyse the chemical diversity of the domain superfamilies performing changes of function in isomerases, we divided the functional similarity space in four quadrants as depicted in Figure 3d. Each point represents a superfamily whose changes of isomerase function were averaged according to overall chemistry — as measured by bond change similarity — and structures of the reactants — in line with the similarity of the structures of substrate(s) and product(s). Half of the superfamilies shared average similarities of reactants higher than 50% (top two quadrants), whereas only about one-fourth exhibited average similarities of overall chemistry higher than 50% (right two quadrants). Particularly, there are only three instances where the overall chemistry is similar but the structures of the reactants significantly diverge (bottom right quadrant), highlighting that this is a rare event in the evolution of isomerase function.

An example — a family of SDRs acting on NDP-sugars from the UDP-galactose 4-epimerase superfamily

To explore one set of changes in more detail we have studied eight changes of isomerase function involving a group of nine enzymes catalysing transformations between nucleoside diphosphate sugars (NDP-sugars). These metabolites are common in bacterial secondary metabolic pathways and they are necessary in molecular recognition and signalling processes [42]. Several studies have revealed the structural, functional and mechanistic determinants of this group of evolutionary-related enzymes. They are epimerases (EC 5), dehydratases (EC 4), decarboxylases (EC 4) and oxidoreductases (EC 1) belonging to the subfamily of short-chain dehydrogenases/reductases (SDR) acting on NDP-sugars (Figure 4a) [55, 56, 57, 58]. The changes in function involve two-domain enzymes comprising a catalytic NAD(P)-binding Rossmann-like domain (CATH 3.40.50.720) and a domain known as UDP-galactose 4-epimerase (CATH 3.90.25.10), which confers substrate specificity. The active site is located in the interdomain cavity where a conserved Tyr, Lys and Ser/Thr form a catalytic triad. Reactivity takes place on the C4, C5 and C6 atoms of the sugar substructure through a mechanism involving a transient oxidation intermediate mediated by NAD [59]. The sequence data provide evidence that different catalytic amino acids are recruited to the active site in order to change the prevalent UDP-glucose 4-epimerase activity (EC 5.1.3.2) to other enzymatic activities. For instance, a base, Glu and an acid, Asp, are added to the catalytic triad in dTDP-glucose 4,6-dehydratase (EC 4.2.1.46) and GDP-mannose 4,6-dehydratase (EC 4.2.1.47) to perform the dehydration step which takes place in each of these overall reactions [58]. Since the reactivity takes place in the attached sugar moiety, the nucleoside diphosphate substructure (noted as X in Figure 4a) is not disrupted during catalysis and remains conserved in all enzymatic activities of this superfamily.

Figure 4.

Figure 4

The evolution of SDRs acting on NDP-sugars. (a) Overview of the EC changes involving isomerases and domain composition of UDP-glucose 4-epimerases (EC 5.1.3.2). Biochemical reactions are represented in boxes. Black arrows inside boxes denote chemical transformations whereas coloured arrows linking boxes represent EC changes. EC numbers with an asterisk indicate reactions for which we found mechanistic evidence in MACiE [47] or in literature searches. Changing substructures are highlighted in red whereas X corresponds to nucleoside diphosphate moieties (ADP, TDP, GDP, CDP, UDP) in which the base may change, but the ribose diphosphate (or sometimes the 2′-deoxy derivatives) is broadly conserved. Three scatterplots illustrating sequence and functional similarity for this superfamily (b) bond change, (c) reaction centre and (d) structure similarity of substrate(s) and product(s) as in Figure 3.

FunTree catalogues 8 changes of isomerase function within this family of enzymes (Figure 4a). They all share the same domain composition and therefore changes in function result directly from changes in sequence, rather than domain architecture. The analysis of sequence and functional similarities revealed that this family is divergent, with members sharing sequence identities in the 20–40% range. Bond change similarities revealed the already observed bimodal distribution due to the EC classification definitions (Figure 4b). Similarities by reaction centre remain low — not higher than 50% (Figure 4c) whereas overall, this set of functional changes tend to conserve structural similarity, due to the common binding of a conserved nucleoside diphosphate (Figure 4d).

Taken together, we think this overview of sequence and functional relationships may help identify possible sequences catalysing orphan isomerase EC numbers. For instance, comprehensive literature and database searches confirmed that the enzymatic activity UDP-glucosamine 4-epimerase (EC 5.1.3.16) is an orphan EC number. In 1959, it was first experimentally determined in rat liver by Maley [60]. The high functional similarity to the activities UDP-glucose 4-epimerase (EC 5.1.3.2), UDP-arabinose 4-epimerase (EC 5.1.3.5) and UDP-glucuronate 4-epimerase (EC 5.1.3.6) suggests that the sequence catalysing EC 5.1.3.16 may belong to the UDP-galactose 4-epimerase superfamily. Ultimately, experimental analysis will reveal whether candidate sequences actually perform this reaction.

Conclusions

Using isomerases as an example, this review highlighted how enzyme chemistry may change over time, as enzymes evolve to perform different enzyme reactions.

Isomerases are a rare class of enzymes. Unlike other EC classes such as the ligases (EC 6), their functional classification is rather complex. While racemases, epimerases and cis-trans isomerases (EC 5.1 and 5.2) are sensibly grouped according to changes of stereochemistry, intramolecular oxidoreductases, intramolecular transferases and intramolecular lyases (EC 5.3, 5.4 and 5.5) are very similar to the chemistry of other EC classes. The subclass ‘other isomerases’ (EC 5.99) sits apart from other subclasses and exhibits great diversity, as evidenced by the distinct chemistry of DNA topoisomerases.

The surprising observation from our study highlights that isomerases are more likely to evolve new functions in different EC primary classes, rather than evolve to perform different isomerase reactions. This is unlike the other EC classes where more than two-thirds of the exchanges happen within the same EC class. In addition we note that exchanges between isomerases and lyases (EC 4) are prevalent.

Isomerases change their overall chemistry and conserve the structure of their substrates more often than conserving the chemistry and changing substrates. This is also unlike other types of enzymes and reflects the mechanisms of isomerases, which can often incorporate mechanistic components from different classes to provide a different overall outcome while conserving the substrate binding abilities.

This study is based on exploring the evolution of separate domains. However many enzymes are multidomain and change their domain composition and function during evolution [61]. Cataloguing the evolution of each one of the composite domains can lead to multiple different evolutionary pathways. Further analysis of multidomain architecture and more experimental data would complement and broaden this analysis.

The chaotic nature of the sequence and function relationship in superfamilies including isomerases is evidenced by the lack of correlation between sequence and functional similarity. Variations in sequence are always very large revealing that changes happened long ago, emphasizing that evolutionary studies need to be undertaken on a superfamily basis. Here we gave an example of how combining knowledge from the chemistry and evolution of enzymes acting on nucleoside diphosphate sugars may help to characterise related orphan activities.

References and recommended reading

Papers of particular interest, published within the period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgements

SMC acknowledges Dr Tjaart de Beer and Dr Roman Laskowski for technical discussions and support from the EMBL International PhD Programme. NF and SAR acknowledge support from The Wellcome Trust (Grant No. 081989/Z/07/A). SAR and JMT acknowledge support from EMBL.

Contributor Information

Sergio Martinez Cuesta, Email: scuesta@ebi.ac.uk.

Janet M Thornton, Email: thornton@ebi.ac.uk.

References

  • 1.Copley S.D. Toward a systems biology perspective on enzyme evolution. J Biol Chem. 2012;287:3–10. doi: 10.1074/jbc.R111.254714. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Gerlt J.A., Babbitt P.C. Divergent evolution of enzymatic function: mechanistically diverse superfamilies and functionally distinct suprafamilies. Annu Rev Biochem. 2001;70:209–246. doi: 10.1146/annurev.biochem.70.1.209. [DOI] [PubMed] [Google Scholar]
  • 3.Todd A.E., Orengo C.A., Thornton J.M. Evolution of function in protein superfamilies, from a structural perspective. J Mol Biol. 2001;307:1113–1143. doi: 10.1006/jmbi.2001.4513. [DOI] [PubMed] [Google Scholar]
  • 4.Khersonsky O., Tawfik D.S. Enzyme promiscuity: a mechanistic and evolutionary perspective. Annu Rev Biochem. 2010;79:471–505. doi: 10.1146/annurev-biochem-030409-143718. [DOI] [PubMed] [Google Scholar]
  • 5.Bartlett G.J., Borkakoti N., Thornton J.M. Catalysing new reactions during evolution: economy of residues and mechanism. J Mol Biol. 2003;331:829–860. doi: 10.1016/s0022-2836(03)00734-4. [DOI] [PubMed] [Google Scholar]
  • 6.Nobeli I., Spriggs R.V., George R.A., Thornton J.M. A ligand-centric analysis of the diversity and evolution of protein–ligand relationships in E. coli. J Mol Biol. 2005;347:415–436. doi: 10.1016/j.jmb.2005.01.061. [DOI] [PubMed] [Google Scholar]
  • 7.Gerlt J.A., Babbitt P.C., Jacobson M.P., Almo S.C. Divergent evolution in enolase superfamily: strategies for assigning functions. J Biol Chem. 2012;287:29–34. doi: 10.1074/jbc.R111.240945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8••.Furnham N., Sillitoe I., Holliday G.L., Cuff A.L., Laskowski R.A., Orengo C.A., Thornton J.M. Exploring the evolution of novel enzyme functions within structurally defined protein superfamilies. PLoS Comput Biol. 2012;8:e1002403. doi: 10.1371/journal.pcbi.1002403. [DOI] [PMC free article] [PubMed] [Google Scholar]; Authors used FunTree [27] to explore changes among 71% of all known enzyme functions across 276 superfamilies. Main findings are: 5% of changes are due to addition and deletion of loops, 27% due to domain recruitment or loss and 15% of changes occur at the primary EC level, and therefore 85% take place between enzymes of the same EC class.
  • 9•.Duarte F., Amrein B.A., Kamerlin S.C.L. Modeling catalytic promiscuity in the alkaline phosphatase superfamily. Phys Chem Chem Phys. 2013;15:11160–11177. doi: 10.1039/c3cp51179k. [DOI] [PMC free article] [PubMed] [Google Scholar]; Authors review computational approaches to study catalytic promiscuity in enzyme superfamilies. They use the alkaline phosphatase superfamily as an example of chemistry-driven evolution.
  • 10••.Bastard K., Smith A.A.T., Vergne-Vaxelaire C., Perret A., Zaparucha A., De Melo-Minardi R., Mariage A., Boutard M., Debard A., Lechaplais C. Revealing the hidden functional diversity of an enzyme family. Nat Chem Biol. 2014;10:42–49. doi: 10.1038/nchembio.1387. [DOI] [PubMed] [Google Scholar]; Strategy combining bioinformatics and experimental techniques to determine enzymatic activities in a protein family where little functional information is known. By applying it to DUF849 Pfam family, they characterised the diversity of β-keto acid cleavage activities and in vivo roles of representative enzymes.
  • 11.Babbitt P.C. Definitions of enzyme function for the structural genomics era. Curr Opin Chem Biol. 2003;7:230–237. doi: 10.1016/s1367-5931(03)00028-0. [DOI] [PubMed] [Google Scholar]
  • 12.Chiang R.A., Sali A., Babbitt P.C. Evolutionarily conserved substrate substructures for automated annotation of enzyme superfamilies. PLoS Comput Biol. 2008;4:e1000142. doi: 10.1371/journal.pcbi.1000142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Amyes T.L., Richard J.P. Specificity in transition state binding: the Pauling model revisited. Biochemistry. 2013;52:2021–2035. doi: 10.1021/bi301491r. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Khersonsky O., Malitsky S., Rogachev I., Tawfik D. Role of chemistry versus substrate binding in recruiting promiscuous enzyme functions. Biochemistry. 2011;50:2683–2690. doi: 10.1021/bi101763c. [DOI] [PubMed] [Google Scholar]
  • 15.Meng E.C., Babbitt P.C. Topological variation in the evolution of new reactions in functionally diverse enzyme superfamilies. Curr Opin Struct Biol. 2011;21:391–397. doi: 10.1016/j.sbi.2011.03.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16•.Pandya C., Brown S., Pieper U., Sali A., Dunaway-Mariano D., Babbitt P.C., Xia Y., Allen K.N. Consequences of domain insertion on sequence-structure divergence in a superfold. Proc Natl Acad Sci U S A. 2013;110:E3381–E3387. doi: 10.1073/pnas.1305519110. [DOI] [PMC free article] [PubMed] [Google Scholar]; By exploring sequence and structural divergence, authors discovered co-evolution of the cap-domain and Rossmann-fold domain of members in the haloalkanoic dehalogenase superfamily. The cap-domain complements the catalytic Rossmann-fold domain by determining substrate specificity.
  • 17.Todd A.E., Orengo C.A., Thornton J.M. Plasticity of enzyme active sites. Trends Biochem Sci. 2002;27:419–426. doi: 10.1016/s0968-0004(02)02158-8. [DOI] [PubMed] [Google Scholar]
  • 18•.Dessailly B.H., Dawson N.L., Mizuguchi K., Orengo C.A. Functional site plasticity in domain superfamilies. Biochim Biophys Acta. 2013;1834:874–889. doi: 10.1016/j.bbapap.2013.02.042. [DOI] [PMC free article] [PubMed] [Google Scholar]; Authors discovered that structurally diverse superfamilies are plastic in the location of functional sites such as protein-protein binding, catalytic, ligand binding and nucleic acid binding sites. They found that although there is generally a preferred functional site across almost all members of a superfamily and particularly in diverse superfamilies, members tend to exploit other sites for developing other functions. Catalytic sites are distinctively specific in superfamilies compared to other functional sites and they tend to always occur in the same general location within a superfamily.
  • 19.Almonacid D.E., Yera E.R., Mitchell J.B.O., Babbitt P.C. Quantitative comparison of catalytic mechanisms and overall reactions in convergently evolved enzymes: implications for classification of enzyme function. PLoS Comput Biol. 2010;6:e1000700. doi: 10.1371/journal.pcbi.1000700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Gherardini P.F., Wass M.N., Helmer-Citterich M., Sternberg M.J.E. Convergent evolution of enzyme active sites is not a rare phenomenon. J Mol Biol. 2007;372:817–845. doi: 10.1016/j.jmb.2007.06.017. [DOI] [PubMed] [Google Scholar]
  • 21.Almonacid D.E., Babbitt P.C. Toward mechanistic classification of enzyme functions. Curr Opin Chem Biol. 2011;15:435–442. doi: 10.1016/j.cbpa.2011.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22•.Elias M., Tawfik D.S. Divergence and convergence in enzyme evolution: parallel evolution of paraoxonases from quorum-quenching lactonases. J Biol Chem. 2012;287:11–20. doi: 10.1074/jbc.R111.257329. [DOI] [PMC free article] [PubMed] [Google Scholar]; Authors investigated the evolution of actives sites, mechanism and promiscuous paraoxonase activity in three structurally different quorum-quenching lactonases. The structural similarity between the reaction intermediates in the native and promiscuous activities identifies the promiscuous activity as a starting point for the evolution of new enzymes that specialise as paraoxonases. This study is key to understand the mechanisms of bacterial resistance to pesticides.
  • 23.Galperin M.Y., Koonin E.V. Divergence and convergence in enzyme evolution. J Biol Chem. 2012;287:21–28. doi: 10.1074/jbc.R111.241976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Gerlt J.A., Allen K.N., Almo S.C., Armstrong R.N., Babbitt P.C., Cronan J.E., Dunaway-Mariano D., Imker H.J., Jacobson M.P., Minor W. The enzyme function initiative. Biochemistry. 2011;50:9950–9962. doi: 10.1021/bi201312u. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25••.Zhao S., Kumar R., Sakai A., Vetting M.W., Wood B.M., Brown S., Bonanno J.B., Hillerich B.S., Seidel R.D., Babbitt P.C. Discovery of new enzymes and metabolic pathways by using structure and genome context. Nature. 2013;502:698–702. doi: 10.1038/nature12576. [DOI] [PMC free article] [PubMed] [Google Scholar]; Authors propose a computational and experimental approach to predict enzymatic activities and metabolic pathways in bacterial genomes. They applied it to determine the epimerase activity of a structurally characterised enzyme, characterised the catabolic pathway where it is involved and its role as osmoprotectant and carbon/nitrogen source.
  • 26••.Rahman S.A., Cuesta S.M., Furnham N., Holliday G.L., Thornton J.M. EC-BLAST: a tool to automatically search and compare enzyme reactions. Nat Methods. 2014;11:171–174. doi: 10.1038/nmeth.2803. [DOI] [PMC free article] [PubMed] [Google Scholar]; A novel algorithm to measure similarity of enzyme reactions based on bond changes, reaction centres and structure of substrates and products.
  • 27.Furnham N., Sillitoe I., Holliday G.L., Cuff A.L., Rahman S.A., Laskowski R.A., Orengo C.A., Thornton J.M. FunTree: a resource for exploring the functional evolution of structurally defined enzyme superfamilies. Nucleic Acids Res. 2011;44(Database issue):1–7. doi: 10.1093/nar/gkr852. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.McDonald A., Tipton K. Fifty-five years of enzyme classification: advances and difficulties. FEBS J. 2014;281:583–592. doi: 10.1111/febs.12530. [DOI] [PubMed] [Google Scholar]
  • 29.Lee S.M., Jellison T., Alper H.S. Directed evolution of xylose isomerase for improved xylose catabolism and fermentation in the yeast Saccharomyces cerevisiae. Appl Environ Microbiol. 2012;78:5708–5716. doi: 10.1128/AEM.01419-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Schnell B., Faber K., Kroutil W. Enzymatic racemisation and its application to synthetic biotransformations. Adv Synth Catal. 2003;345:653–666. [Google Scholar]
  • 31.Seebeck F.P., Hilvert D. Conversion of a PLP-dependent racemase into an aldolase by a single active site mutation. J Am Chem Soc. 2003;125:10158–10159. doi: 10.1021/ja036707d. [DOI] [PubMed] [Google Scholar]
  • 32.Vick J.E., Gerlt J.A. Evolutionary potential of (beta/alpha)8-barrels: stepwise evolution of a ‘new’ reaction in the enolase superfamily. Biochemistry. 2007;46:14589–14597. doi: 10.1021/bi7019063. [DOI] [PubMed] [Google Scholar]
  • 33.Conti P., Tamborini L., Pinto A., Blondel A., Minoprio P., Mozzarelli A., De Micheli C. Drug discovery targeting amino acid racemases. Chem Rev. 2011;111:6919–6946. doi: 10.1021/cr2000702. [DOI] [PubMed] [Google Scholar]
  • 34.Hu X., Yan A., Tan T., Sacher O., Gasteiger J. Similarity perception of reactions catalyzed by oxidoreductases and hydrolases using different classification methods. J Chem Inf Model. 2010;50:1089–1100. doi: 10.1021/ci9004833. [DOI] [PubMed] [Google Scholar]
  • 35.Nath N., Mitchell J.B. Is EC class predictable from reaction mechanism? BMC Bioinform. 2012;13:60. doi: 10.1186/1471-2105-13-60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36••.Herschlag D., Natarajan A. Fundamental challenges in mechanistic enzymology: progress toward understanding the rate enhancements of enzymes. Biochemistry. 2013;52:2050–2067. doi: 10.1021/bi4000113. [DOI] [PMC free article] [PubMed] [Google Scholar]; Authors review principles of enzyme catalysis focusing on ketosteroid isomerase (EC 5.3.3.1) as a case study and highlight remaining questions in mechanistic enzymology.
  • 37.Song L., Kalyanaraman C., Fedorov A.A., Fedorov E.V., Glasner M.E., Brown S., Imker H.J., Babbitt P.C., Almo S.C., Jacobson M.P., Gerlt J.A. Prediction and assignment of function for a divergent N-succinyl amino acid racemase. Nat Chem Biol. 2007;3:486–491. doi: 10.1038/nchembio.2007.11. [DOI] [PubMed] [Google Scholar]
  • 38.Lukk T., Sakai A., Kalyanaraman C., Brown S.D., Imker H.J., Song L., Fedorov A.A., Fedorov E.V., Toro R., Hillerich B. Homology models guide discovery of diverse enzyme specificities among dipeptide epimerases in the enolase superfamily. Proc Natl Acad Sci U S A. 2012;109:4122–4127. doi: 10.1073/pnas.1112081109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39•.Rodionova I.A., Scott D.A., Grishin N.V., Osterman A.L., Rodionov D.A. Tagaturonate-fructuronate epimerase UxaE, a novel enzyme in the hexuronate catabolic network in Thermotoga maritima. Environ Microbiol. 2012;14:2920–2934. doi: 10.1111/j.1462-2920.2012.02856.x. [DOI] [PMC free article] [PubMed] [Google Scholar]; Discovery of a novel enzyme, d-tagaturonate/d-fructuronate epimerase, in the hexuronate catabolism of Thermotoga maritima using a combination of bioinformatics and experimental techniques.
  • 40.Glasner M.E., Gerlt J.A., Babbitt P.C. Evolution of enzyme superfamilies. Curr Opin Chem Biol. 2006;10:492–497. doi: 10.1016/j.cbpa.2006.08.012. [DOI] [PubMed] [Google Scholar]
  • 41.Nguyen T.T., Fedorov A.A., Williams L., Fedorov E.V., Li Y., Xu C., Almo S.C., Raushel F.M. The mechanism of the reaction catalyzed by uronate isomerase illustrates how an isomerase may have evolved from a hydrolase within the amidohydrolase superfamily. Biochemistry. 2009;48:8879–8890. doi: 10.1021/bi901046x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42•.Singh S., Phillips G.N., Thorson J.S. The structural biology of enzymes involved in natural product glycosylation. Nat Product Rep. 2012;29:1201–1237. doi: 10.1039/c2np20039b. [DOI] [PMC free article] [PubMed] [Google Scholar]; Comprehensive review exploring structural and mechanistic studies of enzymes involved in the biosynthesis of nucleotide sugars.
  • 43.Uberto R., Moomaw E.W. Protein similarity networks reveal relationships among sequence, structure, and function within the cupin superfamily. PLoS ONE. 2013;8:e74477. doi: 10.1371/journal.pone.0074477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Brown S.D., Babbitt P.C. Inference of functional properties from large-scale analysis of enzyme superfamilies. J Biol Chem. 2012;287:35–42. doi: 10.1074/jbc.R111.283408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Sillitoe I., Cuff A.L., Dessailly B.H., Dawson N.L., Furnham N., Lee D., Lees J.G., Lewis T.E., Studer R.A., Rentzsch R. New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures. Nucleic Acids Res. 2013;41(Database issue):D490–D498. doi: 10.1093/nar/gks1211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Paradis E. Springer; 2012. Analysis of Phylogenetics and Evolution with R. [Google Scholar]
  • 47.Holliday G.L., Andreini C., Fischer J.D., Rahman S.A., Almonacid D.E., Williams S.T., Pearson W.R. MACiE: exploring the diversity of biochemical reactions. Nucleic Acids Res. 2012;40(Database issue):D783–D789. doi: 10.1093/nar/gkr799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Bairoch A. The ENZYME database in 2000. Nucleic Acids Res. 2000;28:304–305. doi: 10.1093/nar/28.1.304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.The Uniprot Consortium Update on activities at the Universal Protein Resource (UniProt) in 2013. Nucleic Acids Res. 2013;41(Database issue):D43–D47. doi: 10.1093/nar/gks1068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Lespinet O., Labedan B. Orphan enzymes? Science. 2005;307:42. doi: 10.1126/science.307.5706.42a. [DOI] [PubMed] [Google Scholar]
  • 51.Pouliot Y., Karp P. A survey of orphan enzyme activities. BMC Bioinform. 2007;8:244. doi: 10.1186/1471-2105-8-244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Berman H.M., Kleywegt G.J., Nakamura H., Markley J.L. The future of the protein data bank. Biopolymers. 2013;99:218–222. doi: 10.1002/bip.22132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.O’Brien P.J. Catalytic promiscuity and the divergent evolution of DNA repair enzymes. Chem Rev. 2006;106:720–752. doi: 10.1021/cr040481v. [DOI] [PubMed] [Google Scholar]
  • 54.Glasner M.E., Fayazmanesh N., Chiang R.A., Sakai A., Jacobson M.P., Gerlt J.A., Babbitt P.C. Evolution of structure and function in the o-succinylbenzoate synthase/N-acylamino acid racemase family of the enolase superfamily. J Mol Biol. 2006;360:228–250. doi: 10.1016/j.jmb.2006.04.055. [DOI] [PubMed] [Google Scholar]
  • 55.Frey P., Hegeman A. Chemical and stereochemical actions of UDP-galactose 4-epimerase. Acc Chem Res. 2013;46:1417–1426. doi: 10.1021/ar300246k. [DOI] [PubMed] [Google Scholar]
  • 56.Kowatz T., Morrison J.P., Tanner M.E., Naismith J.H. The crystal structure of the Y140F mutant of ADP-l-glycero-d-manno-heptose 6-epimerase bound to ADP-beta-d-mannose suggests a one base mechanism. Protein Sci. 2010;19:1337–1343. doi: 10.1002/pro.410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Eixelsberger T., Sykora S., Egger S., Brunsteiner M., Kavanagh K.L., Oppermann U., Brecker L., Nidetzky B. Structure and mechanism of human UDP-xylose synthase: evidence for a promoting role of sugar ring distortion in a three-step catalytic conversion of UDP-glucuronic acid. J Biol Chem. 2012;287:31349–31358. doi: 10.1074/jbc.M112.386706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Hegeman A., Gross J., Frey P. Concerted and stepwise dehydration mechanisms observed in wild-type and mutated Escherichia coli dTDP-glucose 4,6-dehydratase. Biochemistry. 2002;41(8):2797–2804. doi: 10.1021/bi011748c. [DOI] [PubMed] [Google Scholar]
  • 59.Tanner M.E. Transient oxidation as a mechanistic strategy in enzymatic catalysis. Curr Opin Chem Biol. 2008;12:532–538. doi: 10.1016/j.cbpa.2008.06.016. [DOI] [PubMed] [Google Scholar]
  • 60.Maley F., Maley G.F. The enzymic conversion of glucosamine to galactosamine. Biochim Biophys Acta. 1959;31:577–578. doi: 10.1016/0006-3002(59)90047-2. [DOI] [PubMed] [Google Scholar]
  • 61.Bashton M., Chothia C. The generation of new protein functions by the combination of domains. Structure. 2007;15:85–99. doi: 10.1016/j.str.2006.11.009. [DOI] [PubMed] [Google Scholar]

RESOURCES