Abstract
Enzymatic approaches to challenges in chemical synthesis are increasingly popular and very attractive to industry given their green nature and high efficiency compared to traditional methods. In this historical review we highlight the developments across several fields that were necessary to create the modern field of biocatalysis, with enzyme engineering and directed evolution at its core. We exemplify the modular, incremental, and highly unpredictable nature of scientific discovery, driven by curiosity, and showcase the resulting examples of cutting‐edge enzymatic applications in industry.
Keywords: enzymes, protein engineering, history of biocatalysis, immobilization, green chemistry
In this historical review we highlight the developments across several disciplines that were necessary to create the modern field of biocatalysis and showcase the resulting examples of cutting‐edge enzymatic applications in industry.
1. Introduction
Mesophilic organisms can carry out reactions under mild conditions, enabled by excellent catalysts: enzymes. Humans have, unknowingly at first, used these enzymes to their advantage for millennia, for example to ferment sugars into alcohol (as early as 7000 BC). [1] So how did we get from biocatalysis being used unknowingly to the modern application at the forefront of chemical synthesis? In this review, the origin of modern biocatalysis is summarized (Figure 1), starting with the discovery of enzymes, and exploring the principles of biochemistry and molecular biology developed throughout the 20th century. These stepping stones lead to the biotechnological advancements at the turn of the century which resulted in increasingly sophisticated applications of enzymes, in particular in industry. This review aims at condensing these biological developments primarily from the point of view of a chemist and is primarily intended to help chemists, but also scientists from other disciplines, entering the field, while also showcasing selected cutting‐edge applications of enzymes that may be of interest to biologists as well.
2. Early enzymology‐demystifying life
In 1833, diastase (a mixture of amylases) was the first enzyme to be discovered, [2] quickly followed by other hydrolytic enzymes such as pepsin and invertase, [3] but the term enzyme was only coined in 1877 by Wilhelm Kühne. [4] The concept of catalysts, chemicals facilitating a reaction without undergoing any change themselves, was introduced in 1836 [5] by Berzelius who quickly hypothesized that enzymes were such catalysts. [6] Yeast, which had been observed in ethanolic fermentations, was also viewed as a catalyst, but soon it was discovered that it was a living organism which at the time seemed to contradict that concept. [7] Evidence from Pasteur that fermentation occurs in the absence of oxygen and failed attempts to isolate an enzyme able to carry out this transformation, were claimed as evidence by vitalists that a “life‐force” was necessary for these more complex transformations and that enzymes only carried out “simple” hydrolysis reactions. [8a] Indeed, this is often framed as a dispute between Pasteur and Liebig, with the former supporting vitalism and the latter supporting a mechanistic view that ascribes no special place to life. However, it appears more accurate to say that Pasteur supported the idea that fermentation was carried out by yeast through chemical means whereas Liebig opposed the idea of any causal link between yeast as a living organism and the catalytic fermentation reaction, instead thinking that the decay of yeast in the presence of oxygen was catalyzing the formation of alcohol from sugar. [8] Finally, in 1897, Eduard Büchner showed that a dead yeast extract could carry out the same fermentation reaction as living yeast, thus dealing the final blow to vitalism, which had already been on the decline (Nobel Prize in Chemistry 1907).[ 3 , 8a ]
The fermentation of sugars into ethanol and carbon dioxide was attributed to “zymase.” Further investigations started to reveal reaction intermediates, and dependency on phosphate and “co‐zymase” (A. Harden and, H. von Euler‐Chelpin; Nobel Prize in Chemistry 1929) and started to untangle glycolysis. However, the chemical nature of enzymes was still being debated. In 1926, James B. Sumner crystallized the first enzyme (urease), and confirmed it was a protein. [9] John H. Northrop also crystallized several other proteins, amongst them pepsin, trypsin, and chymotrypsin. [6] They were awarded the Nobel Prize in Chemistry in 1946; in his Nobel lecture, [9] Sumner remarks that
“The organic chemist has never been able to synthesize cane sugar, but by using enzymes, the biological chemist can synthesize not only cane sugar but also gum dextran, gum levan, starch and glycogen.”
Indeed, a whole range of industrial applications of mainly whole organisms but also some enzyme preparations had already been developed. For example, glycerol (used in the production of explosives) was produced on a 1000 ton per month scale in Germany during world war I, employing fermentation in yeast with the final acetaldehyde reduction step inhibited by sulfite, resulting in dihydroxyacetone phosphate reduction. By 1949, citric acid was almost exclusively produced using the fungus Aspergillus niger (ca. 26,000,000 pounds per year in the US alone), even though it was not understood how the organism produced it. [10] In 1934, a patent was granted for the condensation of acetaldehyde (produced in‐situ from glucose) with benzaldehyde catalyzed by whole yeast, giving l‐phenylacetylcarbinol, which was then further reacted to give l‐ephedrine, a stimulant used during anesthesia, as a decongestant, and also a pre‐cursor to illicit drugs such as methamphetamine (Scheme 1). [11] This procedure is still used today, highlighting the power of an efficient biocatalytic process. [12] Enzymes prepared from fungi or bacteria became alternatives to those initially obtained from plants or animals (e. g. amylases and proteases). Purified proteases were used to clarify beer since 1911, pectinases (from various fungi or malt) were used to clarify juices and wine. [9] In the early 1950s, several species of fungi were applied to the regio‐selective hydroxylation of steroids for the production of cortisone, which had been impossible using chemical means. [13]
By 1949 a vast number of enzyme classes had been discovered and characterized extensively. Many pathways and intermediates were fully uncovered, yet little was known with regard to the mechanism by which individual enzymes worked. [9] Through the famous lock‐and‐key model, proposed by Emil Fischer in 1894, [15] as well as the Michaelis‐Menten model of enzyme kinetics from 1913 (Equation 1, Figure 2) [16] it was understood that a substrate has to bind to the enzyme prior to catalysis, yet how this binding proceeds and how catalysis occurs afterwards was unsolved. The ratio of kcat/kuncat has been found to be as high as 1017, crowning enzymes as exceptional catalysts. Interestingly, most enzymes have similar kcat values (within two orders of magnitude), while the rate constants of the corresponding un‐catalyzed reactions (kuncat) vary wildly. [17]
Equation 1: The Michaelis‐Menten model and equation, in modern form: a) Enzyme and substrate combine in a reversible fashion to form the enzyme‐substrate complex, which then goes on to release the enzyme and product in an irreversible reaction. b) Under the assumption of a steady state concentration of the enzyme‐substrate complex, the Michaelis‐Menten equation can be written, describing the consumption of substrate depending on substrate concentration [S], maximum velocity vmax (itself dependent on enzyme concentration [E]0), and substrate affinity Km.
3. Enzyme structures and elucidation of mechanisms
In 1948 Linus Pauling proposed that enzymes had to stabilize the transition state rather than the substrate as proposed by Fischer. [18] The detailed concept of a transition state itself had only been developed less than two decades earlier. [19] This was further refined by Koshland in 1958, [20] proposing the concept of “induced‐fit,” explaining the specificity of enzymes on an abstract level. In parallel to this abstract understanding, a more detailed understanding of the structure of proteins was being developed. It had been hypothesized since the beginning of the 20th century that proteins were composed of chains of amino acids connected via amide bonds; [21] however, the order or even the relative amount of amino acids was not well understood, and indeed the peptide hypothesis itself was frequently questioned. [22] In 1951, Sanger determined the amino acid sequence (referred to as primary structure) of insulin, revealing that indeed as expected it was a well‐defined sequence of amino acids linked by amide bonds. [23] He received his first Nobel Prize in Chemistry in 1958 for this work.
Famously, Linus Pauling proposed how a chain of amino acids might fold into regular geometric features (i. e. α‐helices and β‐sheets; referred to as secondary structure, Figure 3) while sick in bed, based on his detailed understanding of the rigidity of the amide bond and “reasonable” interatomic distances. The rigorous understanding of chemical bonds had just been developed for which Pauling received his Nobel Prize in Chemistry in 1954. [24]
In the meantime, X‐ray crystallography, developed from 1912 onward by Max von Laue, (Nobel Prize in Physics in 1914) and William and Lawrence Bragg (Nobel Prize in Physics in 1915), had become more sophisticated and was being applied to increasingly complex compounds. Evidence for Pauling's secondary structure from X‐ray diffraction was reported by Max Perutz in 1951. [25] The first structures of proteins were solved in 1958–1960 by John Kendrew and Max Perutz (Nobel Prize in Chemistry in 1962). [26] This was initially met with some degree of disappointment as it revealed that proteins were “messy” (Figure 4) and squashed the hope that solving the structure of one protein would reveal the structure of all proteins (in contrast to DNA where that expectation largely held true). [27] However, as higher resolution structures were obtained the insight that could be gained into the mystery world of enzymes became apparent and many groups set forth to investigate not just proteins but enzymes (Figure 4). [28]
Structures of lysozyme were solved in 1965, [29] and included structures of the enzyme with inhibitors bound to it, revealing the location and residues of its active site. Other enzyme structures solved around this time include bovine carboxypeptidase A in 1967, [30] both with and without substrate bound‐revealing conformational changes (in agreement with the induced fit hypothesis) as well as key interactions between substrate and enzyme. The crystal structure of chymotrypsin (also in 1967) [31] paved the way to uncover the classic catalytic triad and oxyanion hole of proteases (as well as esterases and other hydrolytic enzymes; Figure 4). In 1971, the Protein Data Bank (PDB) was founded with seven structures, [28] reaching 50 structures in 1979, and 100 structures three years later. At the end of 2019 it contained almost 160,000 structures. [32]
In parallel to the increasing understanding of the structure of enzymes, newly developed physical‐chemistry techniques were also employed to elucidate mechanisms, such as detailed kinetics, isotopic labelling, isotope effects, and spectroscopic techniques. [33] The first mechanisms to be elucidated in that way were of enzymes employing co‐enzymes, as the structures (fragments) of co‐enzymes were determined before the structures of whole proteins. Indeed, as early as 1936, [34] Otto Warburg showed that certain pyridines (analogous to nicotinamide that could be obtained from hydrolysis “co‐zymase”) could transfer hydrides reversibly, implying that such a hydride transfer plays a role during glycolysis (he had previously received the Nobel Prize in Physiology or Medicine in 1931 for his work on the role of iron in respiration).
The full structure of thiamine (cocarboxylase) had been proved in 1936 by Williams and Cline. [35] The structures of pyridoxine, as well as of biologically relevant derivatives pyridoxal and pyridoxamine were established in the early 1940s by Esmond Snell soon after the discovery of transaminases. [36] Full structures of NAD(P)(H) (co‐zymase), [37] ATP, [38] and FAD [39] were proved by Alexander Todd in the late 1940s and 50s (Nobel Prize in Chemistry in 1957). The structure of Vitamin B12 (cyanocobalamin) was solved through X‐ray crystallography by Dorothy Hodgkin in 1955 [40] (Figure 5; Nobel Prize in Chemistry in 1964).
The chemistries of those co‐factors could be investigated in the absence as well as in the presence of their enzymes, and from this, mechanistic details could be inferred. In addition, structural analogues could be synthesized and their reactivities compared. For example, careful isotope labelling studies in the early 1950s revealed that one hydride of the pyridine ring of NAD(P)H was transferred during reduction/oxidation in a stereospecific manner, giving additional detail to Otto Warburg's mechanism (Figure 6). [41] In 1957, Breslow showed by NMR that an anion in position 2 of a thiazolium ring could exist, revealing the reactive center of thiamine (Figure 6). [42] The observation that pyridoxal, the co‐factor of transaminases, as well as structural analogues with electron withdrawing groups on the aromatic ring, can catalyze transamination in the absence of the enzyme allowed Alexander Braunstein and Esmond Snell to postulate independently a likely catalytic cycle in 1954, which later proved to be correct (Figure 7).[ 33b , 36b , 43 ]
However, these advances in the knowledge of how enzymes work had no immediate impact on industrial biocatalysis, which was largely limited by the low quantities most enzymes could be obtained in. Major developments at the time include the application of glucose isomerase for the production of high fructose corn syrup (HFCS) and the development of a penicillin acylase process for the production of 6‐aminopenicillanic acid (6‐APA, at the time obtained from chemical cleavage of penicillin), a building block for semi‐synthetic antibiotics such as ampicillin and amoxycillin (Scheme 2). [44] Key for the success of both applications was the discovery of the possibility to immobilize proteins with retention of their function as discovered in the 1950s. [45] This allowed the enzymes to be recycled and used in a continuous fashion, reducing cost by reducing the quantity of enzyme that has to be isolated. The HFCS processes became wide‐spread in the 1970s. [46] However, the production of 6‐APA via chemical hydrolysis predominated until the early 1990s, at least partially due to the difficulty of obtaining sufficient quantities of penicillin acylase before then. [47] A notable exception is Bayer, who used an immobilized penicillin acylase since 1972 as a closely guarded secret, employing E. coli strains that achieved a penicillin acylase content of ca. 20 %. [48] Processes to synthesize amino acids using immobilized enzymes (as well as whole cells) were being commercialized in Japan from 1973. [49]
4. The DNA revolution
In order for enzymes to enjoy more widespread use, their production had to be ramped up dramatically. In addition, to apply the insights into enzyme mechanisms described above, enzyme active sites had to be tweaked somehow. The key to both these problems was the understanding of how DNA encodes proteins, as well as the development of efficient ways of manipulating DNA. Of course, in parallel to the research into proteins described above, research into DNA was also ongoing. While DNA was originally viewed as less important than proteins (due to its simple make‐up of four building blocks), this view quickly changed with the discovery that it was the carrier of hereditary information by Avery in 1944. [50] Of course, the correct structure of DNA was postulated in 1953 by Watson and Crick [51] (Nobel Prize in Physiology or Medicine in 1962) from X‐ray diffraction data by Rosalind Franklin.
This quickly led to a postulation of how genetic information is encoded in DNA: the hypothesis that DNA encodes amino acid sequences [26e] and that the amino acid sequence alone determines the structure of proteins, as demonstrated by Anfinsen in 1961 [52] (Nobel Prize in Chemistry in 1972). The process of transcription of DNA into mRNA and the translation of mRNA into protein itself was broadly solved within a decade of the discovery of the structure of DNA; transcription as a concept was proposed by François Jacob and Jacques Monod in 1961 [53] (Nobel Prize in Physiology or Medicine in 1965). In the same year, mRNA was discovered, [54] the triplet code was established, [55] and the first codon (UUU) was solved by Nirenberg. [56] In competition with several other groups, [57] amongst them Gobind Khorana developing a sequence specific chemical synthesis of polynucleotides, the genetic code (Figure 8) was fully solved by 1966 and its universality established by 1967 [58] (Nirenberg, Khorana and Robert W. Holley (for the isolation of tRNA) received the Nobel Prize in Physiology or Medicine in 1968).
Understanding the meaning of the genetic code of course is of limited use, unless one can also read the DNA sequence. Frederick Sanger developed an ingenious hi‐jacking of normal DNA replication in 1977 (Figure 9): [59] by supplying a small quantity of nucleotides that could not be further extended (because they missed the 3′‐OH), strands of DNA truncated after every A (or C, G, or T; depending on which was supplied as the dideoxynucleotide) were produced, which could be separated by size using gel electrophoresis. Repeating this experiment for all four nucleotides, the sequence of bases in the template could be deduced. This is earned Frederick Sanger his second Noble Prize in Chemistry in 1980. While initially DNA was visualized using radioactive labels, this was quickly replaced by using fluorescently labelled dideoxynucleotides, allowing for all four bases to be present in the same reaction mixture, increasing throughput. Using capillary electrophoresis, automated sequencing became possible. [60] Next‐generation sequencing, introduced in the mid‐2000s, involves the massive parallelized sequencing of many smaller DNA segments that are then assembled in‐silico. [61] This has significantly reduced the cost and time of genome sequencing, and thus resulted in a dramatic increase of available genomes, with over 55000 genomes deposited in the NCBI database, as of August 2020. [62]
At the same time and leading on from Avery's experiment, the transmission of genetic information in bacteria was being investigated. In 1952, Joshua Lederberg coined the term “plasmid” to describe such transmissible DNA and discovered the nature of its transmission (Nobel Prize in Physiology or Medicine in 1958). [63] Also in 1952, Salvador Luria [64] and Giuseppe Bertani [65] observed that bacteriophages from one strain of E. coli have a decreased virulence in another, but upon growth in the second strain would show increased virulence for it and a decreased virulence for the original strain, observing the effect of restriction enzymes (so called because they restrict the growth of bacteriophage). This effect was then also observed for several other bacteria. However, it was only in the early 1960s that the nature of these enzymes as site specific endonucleases, and that host bacteria protect their own DNA through modification (methylation), was proposed by Werner Arber. [66] Mathew Meselson isolated the first such restriction endonuclease in 1968 (EcoK1). [67] These early restriction enzymes recognized a specific sequence but did not cut in a specific location, and are now known as type I restriction enzymes. Type II restriction enzymes, cutting DNA in specific locations (Figure 10), were discovered by Hamilton Smith in 1970 (HindII and HindIII). [68] In conjunction with gel electrophoresis, this allowed the digestion of DNA into fragments of defined size which could then be separated, as shown by Daniel Nathans in 1971. [69] Arber, Smith, and Nathans won the Nobel Prize in Physiology or Medicine in 1978.
As many type II restriction enzymes produce palindromic single stranded overhangs, it was then realized that DNA from different sources could be stitched together if cut with the same restriction enzyme, through the action of DNA ligase. The first such “recombinant” DNA was reported by Paul Berg in 1972 (Nobel Prize in Chemistry in 1980). [70] It thus became possible to introduce any piece of DNA from any organism into (for example) E. coli. [71] Thus, plasmids for convenient introduction of such recombinant DNA were being developed in the 1970s. [70c] On of the most famous of these plasmids is pBR322, developed by Bolivar and Rodriguez (BR) in 1977.[ 70c , 72 ] This plasmid made use of two antibiotic resistance genes and several unique restriction sites within them to allow for the selection of colonies that had a) up‐taken the plasmid and b) up‐taken a plasmid containing an insert (Figure 11). The propagation of recombinant DNA in a new host is referred to as cloning. The pUC series of plasmids (UC for University of California), derived from pBR322, allowed for colorimetric detection of inserts. [73] Finally, the pET series of vectors, also derived from pBR322, was created in the late 1980s and included a T7 promotor, allowing for the selective expression of the DNA insert (ET for Expression by T7 RNA Polymerase).[ 74a , 74b ] Strains of E. coli were generated containing the gene for the T7 polymerase, under the control of a modified lac promoter (lacUV5), as a lysogen of the DE3 phage.[ 74b , 74c , 74d , 74e ] Alternative promotor systems were also developed, such as the aforementioned lac promoter, as well as the trc promoter, pL promoter, and tetA promoter, and more, each with their own advantages and draw‐backs. [75]
These developments in molecular biology revolutionized enzymology and biocatalysis. For the first time, the DNA sequence of an enzyme of interest could be determined and cloned, and the enzyme could be over‐expressed in E. coli (or another suitable organism) and thus be obtained in sufficient quantities to be studied and used in industrial applications. The first recombinant protein produced was insulin in 1978, and the commercial production of human insulin started in 1982. [76] Prior to that, insulin had to be isolated from pigs or cows and often had limited and inconsistent efficacy, as well as inconsistent supply. [77] DNA recombinant technology also allowed penicillin acylase to be obtained in sufficient quantities and enabled its widespread application toward the synthesis of 6‐APA, as mentioned above.[ 45e , 48 ] Indeed, penicillin acylase was one of the first enzymes expressed recombinantly, in 1979, only one year after insulin.[ 45d , 48 ] The availability of this enzyme also enabled the development of its application in the reverse direction, catalyzing the amide bond formed between 6‐APA and the side chains found in semi‐synthetic antibiotics such as amoxicillin and ampicillin (Scheme 3, also see below for a discussion of the role of immobilization).[ 44 , 47 , 78 ] Around the same time, recombinant chymosin (which selectively hydrolyzes casein between residues F105 and M106, resulting in the curdling of milk) started replacing natural rennet, obtained from calf stomachs, in cheese‐making. [79] This provided a cheaper, more stable supply for cheesemaking as well as more consistent results due to a higher purity. By 2006, up to 80 % of all rennet was recombinant chymosin and cheese production in the US had increased over two‐fold. [80]
5. Directed evolution and the beginnings of modern biocatalysis
As great as natural enzymes are at carrying out their function, they often present drawbacks which make them unsuitable for industrial applications such as their lack of stability, (co‐)solvent‐tolerance, or a very limited substrate scope. While immobilization can address the stability problems (see below), [81] it quickly became desirable to be able to change the properties of the enzymes themselves. Of course, to some extend this had already been done routinely, through strain optimization. Whole organisms were subjected to mutation‐inducing conditions, such as radiation or chemical agents, and the resulting strains were screened for favorable phenotypes. [82] Through this method, strains producing larger quantities of desirable products, either specific enzymes (such as in the case of penicillin acylase at Bayer mentioned above) or chemicals could be obtained, and entirely new pathways could be introduced. [83] However, these approaches were slow, unlikely to directly change the properties of any specific enzyme, and could only really be applied to organisms with sufficiently short replication‐cycles. However, with the availability of recombinant DNA, as well as the understanding of enzymes and their mechanisms as outlined above, introduction of specific mutations into a target enzyme was now within reach, irrespective of the organism it originated from. Indeed, a general method for site‐directed mutagenesis was being reported by Michael Smith in 1978 (Nobel Prize in Chemistry in 1993), the same year as the cloning of insulin was achieved. By designing DNA primers, harboring the desired mutations, complimentary to the target sequence to be mutated, and extending with DNA polymerase using the target sequence as a template, copies containing the mutation could be made (Figure 12). [84] Of course, efficient syntheses of specific DNA sequences such as those developed by Khorana, [85] Gillam, [86] and Caruthers [87] were instrumental for this. [88]
Using this approach, the role of catalytic residues could be directly investigated. For example changing the cysteine in the catalytic triad found in tyrosyl tRNA synthase to a serine (as found in esterases mentioned above) greatly reduces the efficiency of the enzyme. [89] Indeed, the role of several residues in several enzymes could now be quantified, confirming and in some cases revising mechanisms that had been postulated based on crystallography. [90a] However, using this technique to introduce desirable properties into enzymes was quickly met with the realization that the effect of mutations was often unpredictable, and rational engineering of enzymes was often not successful. However, a shift was quickly made toward a more random approach such as the use of site‐saturation mutagenesis, where targeted residues were changed to all possible amino acids rather than a specific one. Through this, some progress was made such as the introduction of a stabilizing mutation into subtilisin, a protease with application in laundry detergents,[ 81 , 90 ] or enhanced thermostability of glucose isomerase. [91]
With the development of the polymerase chain reaction (PCR, Figure 12) in the 1980s by Kary Mullis (Nobel Prize in Chemistry in 1993), it became possible to produce large numbers of copies of DNA sequences from a single template. [92] By modulating the fidelity of the polymerase, random mutations could be introduced into the amplified product (error‐prone or epPCR, Figure 12). In the early 1990s, Frances Arnold used this technique to create large libraries of mutants to which she then applied evolutionary pressure. In her own words, [93] she
“rejected microbial growth or survival selections favored by microbiologists and geneticists. Thus we turned to good old‐fashioned analytical chemistry to develop reproducible, reliable screens that reported what mattered to us.”
In doing so she managed to produce a variant of subtilisin E that could tolerate high concentrations of DMF, introducing a total of 10 mutations. [94] Thus, the field of directed evolution was born. In 1994, Pim Stemmer introduced the concept of DNA shuffling (Figure 12), mimicking DNA recombination which occurs in organisms as a way to increase genetic diversity, and applying it to recombinant DNA in vitro. Without being restricted to genes from a single species, very diverse proteins could be mixed together to create new sequences very distant from natural ones. [95] This technique proved very powerful on its own, but especially when combined with epPCR, allowing the combination of mutations from several mutants without the need for any understanding of how the different mutations would interact with each other. [96] Frances Arnold received the Nobel Prize in Chemistry in 2018 for the directed evolution of enzymes. In addition to co‐solvent tolerance, directed evolution was quickly used to create enzymes with improved thermostability,[ 97a , 97b , 97c , 97d ] pH stability, [97c] as well as enhanced activity at low temperatures,[ 97c , 97e ] activity toward unnatural substrates,[ 96 , 98 ] modified enantioselectivity, [99] or combinations of the above. Thus, it quickly established itself as a powerful tool in protein engineering across structurally and functionally diverse classes of enzymes. Using random mutagenesis methods it was quickly realized that beneficial mutations were often found in unexpected parts of the enzymes, explaining why early rational attempts struggled at accomplishing these modifications.[ 81 , 93 , 100 ]
The sudden availability of biocatalysts with properties suitable for industrial applications, as well as the ability to create those properties at will, made them very attractive for use in synthetic applications that so far had been considered out of reach. [101] Of course, catalysis itself had become a major field of interest in synthetic chemistry in the second half of the 20th century, as the environmental impact of traditional (stoichiometric) chemistry was gaining attention. [102] This gained more traction with the conceptual development of green chemistry in the 1990, in parallel to the advances made in biocatalysis outlined above. [103] Thus, it is not surprising that biocatalysis formed a key strategy of accomplishing the goals of green chemistry from the start. [102b] Indeed, it promises to address many of the “12 principles of green chemistry” (Figure 13), in particular with regard to hazardous reagents and waste, energy requirements, number of steps, and their inherently renewable and biodegradable nature. [104] Indeed, the number of biocatalytic processes in industry started increasing rapidly and continues to do so to this day: there were around 60 processes in 1990, 134 processes in 2002, and several hundred by 2019. [105]
Perhaps one of the most successful examples developed at the time was the use of lipases, in particularly CalB from Candida antarctica, in organic solvents, allowing ester and amide formation without competing hydrolysis, which is frequently employed in (dynamic) kinetic resolutions of chiral alcohols and amines. The latter was developed at BASF [106] and is often referred to as the “BASF process” or “ChiPros technology” (Scheme 4). [107] By 2004, multiple BASF plants produced chiral amines on a >1000 ton scale per year, and this process is still in use today. The reactions can be carried out without solvent, are often nearly quantitative, both amine and amide are readily isolated, and the undesired product can be recycled, making this process highly efficient. [108]
6. “Smart” libraries and applications of enzyme engineering
While the BASF process uses a wild‐type enzyme, stabilized through immobilization, a strong interest in enzyme engineering has developed in industry. One major barrier to directed evolution is screening. In Frances Arnold's original paper, [94a] the enzyme was secreted from colonies of bacteria and digestion of the substrate in plates could be easily observed by a decrease in turbidity, thus allowing the screening of a large number of variants relatively easily. However, in general screening is not straightforward and usually it is the bottleneck. This is tackled on two fronts: development of faster high‐throughput screens, as well as reduction of the size of libraries by increasing the proportion of hits. While the former is often highly specific to the (class of) enzymes being evolved, more general concepts exist for the latter. [109] For example, the structure of the enzyme may be used to assess which recombinations are more likely not to disrupt the overall fold of the enzyme, in a process called SCHEMA, [110] which can then be used to reduce the size of combinatorial libraries.
Advances in the understanding of protein structures as well as dynamics have increasingly allowed target residues to be identified with more reliability than was previously possible, reducing the need for random mutagenesis across the whole gene although it remains a valuable tool. [111] This is accomplished using increasingly sophisticated bioinformatic tools, to allow for the docking of substrates into active sites, molecular dynamic simulations, and protein structure modeling, which can aid to predict the likely effect of potential mutations. One very powerful tool that has emerged is Rosetta, which also has been applied for the complete de‐novo design of proteins. [112] The availability of increasing numbers of structures of diverse enzymes within a given family, and the even larger availability of sequences (due to next generation high‐throughput sequencing technologies) allow points of natural variation to be identified which may then be targeted. The flexibility of residues as determined by X‐ray crystallography may also help identify target regions. [113] Alternatively, random mutagenesis might be used to identify hotspots which are then further investigated by more targeted mutagenesis. [114] In addition, the amino acids found in nature for a given position can inform which substitutions to include in a given library. [115]
Several residues may be targeted together, to increase the chance of detecting synergistic effects of mutations. One such approach is combinatorial active site testing (CASTing), developed by Manfred Reetz, [116] whereby multiple residues lining the active site are saturated at the same time, allowing for synergistic effects between mutations to emerge. This has been particularly successful in changing enantioselectivities and substrate scopes of enzymes.[ 107 , 117 ] Amine dehydrogenases (AmDH) were created from amino acid dehydrogenases in this way. [118] Several sites of interest (each potentially consisting of multiple residues) may be targeted sequentially, in a process call Iterative Saturation Mutagenesis (ISM), also developed by Reetz. [119]
Statistical tools and machine learning are also a powerful way to increase the efficiency of directed evolution, [120] such as the use of protein sequence activity relationships (ProSAR). [121] In an initial library, mutations are classified as beneficial, neutral, or detrimental and can inform which mutations to incorporate into subsequent libraries, as opposed to taking the best overall variant and generating a new library. This strategy was successfully applied by Codexis in the engineering of a halohydrin dehalogenase (HHDH) for the synthesis of (R)‐4‐cyano‐3‐hydroxybutyrate, a key intermediate for the synthesis of atorvastatin, a cholesterol‐lowering drug. [122] Overall, the volumetric productivity was improved 4000‐fold over 18 rounds of evolution and 35 mutations were introduced, meeting the process requirements for the enzyme. The authors note that half of the mutations introduced in the final variant were initially not present in the best variant when selected and would have been missed in a hit‐based approach. While this approach can reduce screening efforts, it requires a larger sequencing effort. However, this has become increasingly possible as the cost of DNA sequencing has steadily declined. [81]
Codexis and Merck combined several of these approaches to engineer a transaminase for the synthesis of sitagliptin. Starting from an enzyme with no activity toward the substrate and minimal activity toward a truncated analogue, 11 rounds of engineering (Figure 14) led to a catalyst that outcompeted the alternative rhodium‐catalyzed reductive amination process in terms of efficiency, yield, enantioselectivity, and waste formation. Overall, 27 mutations were introduced using a combination of site‐saturation mutagenesis, combinatorial libraries (including diversity from homologous sequences), proSAR, and epPCR, screening a total of 36480 variants. [123] Highlighting that, even with the use of tools to maximize the efficiency of evolution, a huge screening effort may still be required for significant catalyst improvements.
In another more recent example, GSK evolved an imine reductase (IRED) to meet the process requirements for the synthesis of the LSD1 inhibitor GSK2879552, currently in clinical trials. [124] Screening of their in‐house panel (of at least 85 IREDS) [125] revealed a suitable candidate for mutagenesis. Given the scarcity of structural data on IREDs and that their highly dynamic mechanism is not fully understood, an initial round of site‐saturation mutagenesis was carried out on 256 out of 296 positions. Beneficial mutations from that round were then used to generate combinatorial libraries, which were then analyzed using the proprietary CodeEvolver software from Codexis. Statistical analysis was performed to identify pairwise interactions of beneficial mutations which were then included in another combinatorial library in a final third round of evolution, yielding an enzyme with 13 mutations that met or exceeded the process requirements, resulting in improved sustainability metrics over the previous route (Figure 15). The enzyme was then used to synthesize 1.4 kg of GSK2879552 for use in additional rounds of clinical trials.
Increasingly, directed evolution has also been applied to create enzymes carrying out reactions not observed in nature, by exploiting the promiscuous nature of enzymes (Figure 16). Frances Arnold's group reported on the evolution of a cytochrome c (cyt c), [126] a protein without any catalytic role in nature, to form carbon‐silicon bonds, a reaction not observed in nature. After screening several P450 enzymes, myoglobins, and cyt c variants, they identified a cyt c from Rhodothermus marinus with low levels of catalytic activity for this reaction. Iterative site‐saturation mutagenesis of just three key residues – an iron coordinating methionine, and two additional residues close to the heme group – resulted in a catalyst with a total turnover number (TTN) of >1500, a>33‐fold improvement over the wild‐type (wt) and a >375‐fold improvement over free heme, outperforming the best chemical catalysts for this reaction. In addition, the turnover frequency (TOF) was increased 7‐fold, the reaction proceeded with nearly perfect enantioselectivity, and was chemoselective for carbene insertion into silanes over alcohols and amines (Figure 16).
Clearly, in addition to the generation of efficient libraries, the choice of template was key in both examples above. Indeed, modern genomics has created huge databases of genes and it has become incredibly easy to identify new sequences that likely have a given catalytic function. The dramatic reduction in cost of synthetic genes has made it possible to create panels of these sequences, in a way producing a “smart” library of sequences already pre‐selected by natural evolution. The likely function of a DNA sequence may be determined from sequence similarity to other proteins of known function (using search algorithms such as BLAST), [127] as well as the identification of motifs. [128] In addition, structure based searches of crystal structures of unknown function may yield new biocatalyst. [129]
7. Enzyme immobilization and flow chemistry
Protein immobilization, which, as already outlined above, is a key strategy for enzyme stabilization and reusability, has also become more advanced and a whole plethora of different strategies and supports are available (Figure 17). Those are needed in part because protein immobilization can be highly unpredictable, and because of application‐dependent requirements on the immobilized catalyst (such as particle size, swelling, hydrophilicity/hydrophobicity, etc.). Enzyme immobilization may be mechanical or physiochemical, the latter can be further divided into covalent or noncovalent/adsorption immobilization. Immobilized proteins may also be used in continuous (i. e. flow chemistry) processes, which is particularly attractive for its scalability, improved efficiency, and increased control. [130]
Mechanical immobilization relies on the entrapment of the enzyme in a matrix that restricts its movement (Figure 17A). This has the advantage that the enzyme itself is not being modified, while allowing for its environment to be fine‐tuned. However, mass transfer to and from the enzyme is often impaired. [131] Furthermore, leaching of the enzyme can occur. [132] Adsorption onto a solid support is another simple immobilization strategy. However, it too often suffers from leaching of the enzyme from the support. It its simplest form (hydrophobic, hydrophilic or ionic interactions between support and enzyme. Figure 17B), no control over the orientation of the enzyme is achieved and the entrance to the active site may become blocked. [133] One of the most widely used biocatalysts, CalB, is immobilized in this way (Novozym 435), through hydrophobic interactions between the enzyme and an acrylic resin support. While leaching is an issue in an aqueous environment, this is suppressed in the organic solvents in which it is usually used. [134]
More specific adsorption is possible with the use of tags. Attached at either the N‐ or C‐terminal of the protein, they can help orient the enzyme in a favorable position. Examples include the use of a polyhistidine‐tag (His‐tag, Figure 17C), originally developed for efficient protein purification in 1988, [135] which coordinates to transition metal cations, streptavidin with its remarkably high affinity for biotin, and sugar‐lectin interactions. While a His‐tag is encoded genetically, the other two examples require biotinylation or glycosylation, respectively. This also allows for enzyme purification and immobilization to be combined into a single step. While these interactions are stronger than the simple adsorption described above, low levels of leaching can still occur and pose problems for applications in flow, where any enzyme leaching from the column will be lost (as opposed to batch processes where temporarily detached enzyme remains in the vicinity of the support and can, in principle, reattach). One example are EziG beads, made of controlled porosity glass which has been modified to coordinate to metal ions. [136a] They appear to be predominantly used in organic solvents, [136] which appear to suppress leaching although use in an aqueous environment without leaching has also been reported (using Fe3+ as the cation). [137]
The problem of leaching can be fully avoided using covalent immobilization (Figure 17D). However, it often results in severe distortions of the enzyme and loss of activity, although this is highly dependent on the enzyme and support and often unpredictable. Leaching may also still occur for multimeric proteins if not all subunits are covalently attached. Moreover, once the enzyme has degraded the support cannot be reused whereas with adsorption the enzyme may be desorbed and replaced with fresh enzyme. [134] The orientation of the enzyme may be controlled by initially adsorbing the enzyme onto the support using tags, followed by the formation of the covalent attachment (Figure 17E). The distortion of the enzyme can be alleviated using small protein tags, with the covalent attachment points being on that protein rather than the enzyme itself (Figure 17F). [138] However, as the enzyme is more exposed to solvent, the stabilizing effect of immobilization is reduced. One elegant tag directed covalent immobilization is the use of the SpyTag/SpyCatcher system, a peptide and protein that spontaneously form an isopeptide bond when coming together. [139]
Enzymes may also be covalently cross‐linked into cross‐linked enzyme aggregates (CLEAs), an immobilization without a support. However, the poorly defined properties (such as particle size) of these aggregates often render them unsuitable for flow applications. However, adsorption of these aggregates onto a support with more defined properties can alleviate this issue. Glucose isomerase, immobilized in this way, is being used for the production of HFCSs in flow on a 10 million tons per year scale, using 500 tons of the immobilized catalyst.[ 45f , 108b , 140 ] Regardless of the immobilization technique used, tuning the characteristics of the support with respect to e. g. their size, pore‐size, hydrophilicity/hydrophobicity, etc. is also key. For example, tuning the composition of the support reduced the synthesis/hydrolysis (S/H) ratio of covalently immobilized penicillin acylase (Scheme 5). This was a key step in rendering it suitable for the kinetically controlled synthesis of various semi‐synthetic β‐lactam antibiotics.[ 44 , 47 , 78 ]
In some cases, subunit dissociation followed by leaching can significantly reduce the operational stability of covalently immobilized biocatalysts. In those cases, coating the biocatalysts either before or after immobilization with other polymers, such as polyethylenimine (PEI) or activated dextran can help maintain the quaternary structure. [131] However, as with any immobilization, excessive rigidification of the enzyme can result in a loss of catalytic efficiency if structural rearrangements necessary for catalysis are impeded. In addition, the increased complexity of more sophisticated immobilization techniques often outweighs any benefits bestowed on the catalyst. In general, the simplest catalysts that can meet the process requirements is the preferred one.
Synthetic cascades, without the need for purification of intermediates are very attractive due to the reduced amount of waste that is produced. In addition, intermediates that are too unstable to be isolated may be telescoped to the next step, offering alternative routes to classical synthesis. Flow chemistry, being inherently modular by design, is a very important platform for such cascades. Sequential reactions can be compartmentalized, avoiding incompatibility between reagents as well as allowing the conditions to be fine‐tuned for each reaction. [130] Cascades involving biocatalysis may either be multiple enzymatic reactions combined in sequence, or chemo(catalytic) reactions combined with enzymatic reactions. [141]
A nice example of the former was demonstrated by Contente and Paradisi, [142] who developed a cascade in flow converting amines into alcohols, employing a transaminase and either an alcohol dehydrogenase (ADH) or ketoreductase (KRED) that had been immobilized covalently on epoxide functionalized methacrylate beads (Scheme 6A). By compartmentalizing both catalysts, reaction temperatures and times were optimized independently for each step. In addition, in‐line purification steps allowed the removal of product. Recycling of the aqueous phase containing co‐factors and buffer salts was also demonstrated, reducing the overall amount of the cofactors required (from 1 : 100 to 1 : 2000) while also eliminating the aqueous waste stream. Uwe Bornscheuer's group demonstrated a Suzuki‐Miyaura coupling in batch to produce a biaryl ketone substrate for a subsequent transaminase‐catalyzed amination in flow (Scheme 6B). [143] Here, the ability of the transaminase reaction to tolerate 30 % (v/v) DMF as well as salts and palladium from the first reaction step was key. Compatibility between palladium and enzyme catalysts can be a problem, as was the case for a halogenase‐Suzuki‐coupling cascade in batch reported by Latham et al (Scheme 6C). [144] Using free enzyme, ultrafiltration or compartmentalization with a semi‐permeable membrane had to be used to physically separate the enzyme and Pd catalyst. Alternatively, immobilization of the halogenase into CLEAs was also successful.
In another collaboration between Codexis and Merck, a three‐step, nine‐enzyme cascade to synthesize the HIV drug islatravir was developed (Figure 18). [145] This involved engineering of five enzymes to accept unnatural substrates, as well as enzyme immobilization to simplify the final purification. For this, a cost‐effective affinity immobilization using a His‐tag was chosen for the first two steps, while the last step used free enzymes. Enzymes from seven organisms were used, and each step involved enzymes from either two or three organisms and one or three evolved enzymes. Thus, by bringing together the right enzymes from the right organisms (a testament to the vast number of genome sequences that are available) and applying directed evolution only were needed, the number of steps in the synthesis of islatravir was cut more than in half (from 12–18 steps), and the overall yield was almost doubled (51 % vs 37 % previously reported [146] ). Atom economy was improved, overall waste was reduced, and hazardous reagents (such as a Birch reduction in Fukuyama et al.’s synthesis [146] ) were avoided.
8. Summary and Outlook
Through the elucidation of enzymatic mechanisms and enzyme structures, as well as the development of powerful tools for DNA manipulation, engineered enzymes are being applied in increasingly complex syntheses. However, challenges remain. Enzyme engineering is time consuming and while excellent enzyme variants have been created, it is not always successful. Even though significant advances have been made in the understanding of protein folding and predicting the effects of mutations, we still rely on the principles of directed evolution developed by Frances Arnold. Further research into the properties of enzymes is necessary to increase their predictability, which will lead to a future where enzymes can be applied more routinely.
Promising approaches include the use of increasingly sophisticated machine learning, based on large sets of sequences of known function (either wild‐type sequences or mutants).[ 120 , 147 ] Exploring a large sequence space has become increasingly possible due to the advances in gene synthesis, building on DNA synthetic strategies and improvements in DNA sequencing mentioned earlier. [148] Understanding the trajectories of substrates in addition to their interactions once inside the active site (the aim of traditional substrate docking) allows new target residues to be identified. [149] Additionally, the de‐novo design of proteins has offered a window into entirely new protein sequences unknown in nature. In addition to offering exciting new catalysts, this is also an invaluable way of testing our understanding of protein folding and the factors that influence catalytic efficiency. [150] Expanding the genetic code to include unnatural amino acids with functional groups not found in nature may further increase enzyme performance and open up new reactions currently outside of the scope of biocatalysis. [151] Synthetic biology, which focusses on the introduction of new metabolic pathways into organisms to produce valuable chemicals from cheap and renewable starting materials, is another exciting emerging field. [152]
More fundamental challenges also remain, such as protein expression which is often difficult to predict. While E. coli has been undoubtedly the expression organism of choice, due to the vast number of molecular biology tools available and the ease with which it can be grown, some proteins cannot be expressed at high levels or in soluble form and, recently, concerns about low levels of endotoxins that are sufficient to cause an immune response have been raised. [153] Thus, alternative expression systems are needed such as fungi and extremophilic archaea to allow for the expression of proteins incompatible with E. coli and bypass potential toxicity. [154]
Lastly, it is still extremely challenging to bring a biocatalytic process to market. Often, even heavily engineered enzymes fall short in terms of space‐time‐yield compared to the best heterogeneous catalysts (particularly challenging for bulk chemicals). Additionally, a significant investment of both time and money is necessary to develop a biocatalytic process. This is especially true for synthetic biology, e. g. in the case of the anti‐malarial drug artemisinin which required 10 years of research and >$150 million to engineer an organism that could produce it. [155] Clearly, additional research to address these issues is needed and a closer interaction between academia and industry could further speed up the process. Nonetheless, the many examples of successful biocatalytic processes mentioned in this review (as well as many not mentioned) highlight the power of enzymes and the bright future of the field of biocatalysis.
Conflict of interest
The authors declare no conflict of interest.
Biographical Information
Growing up in Germany, Christian M. Heckmann studied Chemistry at the University of Nottingham, earning his MSci in 2017. He then started a PhD with Francesca Paradisi in collaboration with Johnson Matthey, studying biocatalytic approaches to the synthesis of chiral amines.
Biographical Information
Prof. Francesca Paradisi is the Chair of Pharmaceutical and Bioorganic Chemistry at the University of Bern. Biocatalysis as a sustainable approach to synthesis of valuable products is the focus of her research group. In particular, the group developed a number of enzyme‐based processes in continuous flow, reducing the gap between academic discovery and industrial application.
Acknowledgements
This work was supported by the Biotechnology and Biological Sciences Research Council through the iCASE scheme in collaboration with Johnson Matthey [grant number BB/M008770/1].
C. M. Heckmann, F. Paradisi, ChemCatChem 2020, 12, 6082.
References
- 1. McGovern P. E., Zhang J., Tang J., Zhang Z., Hall G. R., Moreau R. A., Nuñez A., Butrym E. D., Richards M. P., Wang C. S., Cheng G., Zhao Z., Wang C., Proc. Natl. Acad. Sci. USA 2004, 101, 17593–17598. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Armstrong E. F., Nature 1933, 131, 535–537. [Google Scholar]
- 3.E. Büchner, “Eduard Büchner – Nobel Lecture: Cell-free fermentation,” can be found under https://www.nobelprize.org/prizes/chemistry/1907/buchner/lecture/, 1907.
- 4.W. Kühne, in Verhandlungen Des Naturhistorisch-Medicinischen Vereins Zu Heidelb, 1877, pp. 190–193.
- 5. Robertson A. J. B., Platinum Met. Rev. 1975, 19, 64–69. [Google Scholar]
- 6.J. H. Northrop, “John H. Northrop – Nobel Lecture: The preparation of pure enzymes and virus proteins,” can be found under https://www.nobelprize.org/prizes/chemistry/1946/northrop/lecture/, 1946. [PubMed]
- 7. Manchester K. L., Trends Biotechnol. 1995, 13, 511–515. [DOI] [PubMed] [Google Scholar]
- 8.
- 8a. Kohler R., J. Hist. Biol. 1971, 4, 35–61; [DOI] [PubMed] [Google Scholar]
- 8b. Hein G. E., J. Chem. Educ. 1961, 38, 614–619. [Google Scholar]
- 9.J. B. Sumner, “James B. Sumner – Nobel Lecture: The chemical nature of enzymes,” can be found under https://www.nobelprize.org/prizes/chemistry/1946/sumner/lecture/, 1946.
- 10.
- 10a. Von Loesecke H. W., Chem. Eng. News 1945, 23, 1952–1959; [Google Scholar]
- 10b.H. Tauber, The Chemistry and Technology of Enzymes, Wiley, New York, 1949.
- 11. Hildebrandt G., Klavehn W., Manufacture of Laevo-1-Phenyl-2-Methylaminopropanol-1, 1934, US1956950 A. [Google Scholar]
- 12. Maroney K. A. N., Culshaw P. N., Wermuth U. D., Cresswell S. L., Forensic Sci. Int. 2014, 235, 52–61. [DOI] [PubMed] [Google Scholar]
- 13. Peterson D. H., Murray H. C., J. Am. Chem. Soc. 1952, 74, 1871–1872. [Google Scholar]
- 14. Breuer M., Pohl M., Hauer B., Lingen B., Anal. Bioanal. Chem. 2002, 374, 1069–1073. [DOI] [PubMed] [Google Scholar]
- 15. Fischer E., Ber. Dtsch. Chem. Ges. 1894, 27, 2985–2993. [Google Scholar]
- 16. Johnson K. A., Goody R. S., Biochemistry 2011, 50, 8264–8269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Bommarius A. S., Annu. Rev. Chem. Biomol. Eng. 2015, 6, 319–345. [DOI] [PubMed] [Google Scholar]
- 18. Pauling L., Nature 1948, 161, 707–709. [DOI] [PubMed] [Google Scholar]
- 19. Eyeing H., Chem. Rev. 1935, 17, 65–77. [Google Scholar]
- 20.
- 20a. Koshland D. E., Proc. Natl. Acad. Sci. USA 1958, 44, 98–104; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20b. Koshland D. E., J. Cell. Comp. Physiol. 1959, 54, 245–258. [DOI] [PubMed] [Google Scholar]
- 21. Fischer E., Ber. Dtsch. Chem. Ges. 1907, 40, 1754–1767. [Google Scholar]
- 22. Fruton J. S., Ann. N. Y. Acad. Sci. 1979, 325, 1–20. [Google Scholar]
- 23.
- 23a. Sanger F., Tuppy H., Biochem. J. 1951, 49, 463–481; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23b. Sanger F., Tuppy H., Biochem. J. 1951, 49, 481–490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.
- 24a. Pauling L., Corey R. B., J. Am. Chem. Soc. 1950, 72, 5349; [Google Scholar]
- 24b. Pauling L., Corey R. B., Proc. Natl. Acad. Sci. USA 1951, 37, 729–740; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24c.L. Pauling, “Linus Pauling – Nobel Lecture: Modern structural chemistry,” can be found under https://www.nobelprize.org/prizes/chemistry/1954/pauling/lecture/, 1954;
- 24d. Pauling L., Corey R. B., Branson H. R., Proc. Natl. Acad. Sci. USA 1951, 37, 205–211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Perutz M. F., Nature 1951, 167, 1053–1054. [DOI] [PubMed] [Google Scholar]
- 26.
- 26a. Kendrew J. C., Bodo G., Dintzis H. M., Parrish R. G., Wyckoff H., Phillips D. C., Nature 1958, 181, 662–666; [DOI] [PubMed] [Google Scholar]
- 26b. Kendrew J. C., Dickerson R. E., Strandberg B. E., Hart R. G., Davies D. R., Phillips D. C., Shore V. C., Nature 1960, 185, 422–427; [DOI] [PubMed] [Google Scholar]
- 26c. Perutz M. F., Rossmann M. G., Cullis A. F., Muirhead H., Will G., North A. C. T., Nature 1960, 185, 416–422; [DOI] [PubMed] [Google Scholar]
- 26d. Perutz M. F., Science 1963, 140, 863–870; [DOI] [PubMed] [Google Scholar]
- 26e. Kendrew J. C., Science 1963, 139, 1259–1266. [DOI] [PubMed] [Google Scholar]
- 27.
- 27a. de Chadarevian S., Protein Sci. 2018, 27, 1136–1143; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27b. Sir Bragg Lawrence, Rep. Prog. Phys. 1965, 28, 1–14. [Google Scholar]
- 28. Jaskolski M., Dauter Z., Wlodawer A., FEBS J. 2014, 281, 3985–4009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.
- 29a. Blake C. C. F., Koenig D. F., Mair G. A., North A. C. T., Phillips D. C., Sarma V. R., Nature 1965, 206, 757–761; [DOI] [PubMed] [Google Scholar]
- 29b. Johnson L. N., Phillips D. C., Nature 1965, 206, 761–763. [DOI] [PubMed] [Google Scholar]
- 30. Reeke G. N., Hartsuck J. A., Ludwig M. L., Quiocho F. A., Steitz T. A., Lipscomb W. N., Proc. Natl. Acad. Sci. USA 1967, 58, 2220–2226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.
- 31a. Matthews B. W., Sigler P. B., Henderson R., Blow D. M., Nature 1967, 214, 652–656; [DOI] [PubMed] [Google Scholar]
- 31b. Blow D. M., Birktoft J. J., Hartley B. S., Nature 1969, 221, 337–340; [DOI] [PubMed] [Google Scholar]
- 31c. Henderson R., J. Mol. Biol. 1970, 54, 341–354; [DOI] [PubMed] [Google Scholar]
- 31d. Blow D. M., Acc. Chem. Res. 1976, 9, 145–152. [Google Scholar]
- 32.“PDB Statistics: Overall Growth of Released Structures Per Year,” can be found under https://www.rcsb.org/stats/growth/growth-released-structures, n.d.
- 33.
- 33a. Bugg T. D. H., Nat. Prod. Reports 2001, 18, 465–493; [DOI] [PubMed] [Google Scholar]
- 33b. Westheimer F. H., Adv. Phys. Org. Chem. 1985, 21, 1–35. [Google Scholar]
- 34. Warburg O., Christian W., Helv. Chim. Acta 1936, 19, E79–E88. [Google Scholar]
- 35. Williams R. R., Cline J. K., J. Am. Chem. Soc. 1936, 58, 1504–1505. [Google Scholar]
- 36.
- 36a. Snell E. E., Guirard B. M., Williams R. J., J. Biol. Chem. 1942, 143, 519–531; [Google Scholar]
- 36b. Snell E. E., J. Am. Chem. Soc. 1944, 66, 2082–2088; [Google Scholar]
- 36c. Harris S. A., Heyl D., Folkers K., J. Am. Chem. Soc. 1944, 66, 2088–2092; [Google Scholar]
- 36d. Braunstein A. E., Kritzmann M. G., Nature 1946, 158, 102–103; [DOI] [PubMed] [Google Scholar]
- 36e. Braunstein A. E., Kritzmann M. G., Nature 1937, 140, 503–504. [Google Scholar]
- 37.
- 37a. Haynes L. J., Todd A. R., J. Chem. Soc. 1950, 303–308; [Google Scholar]
- 37b. Haykes L. J., Hughes N. A., Kenner G. W., Todd A., J. Chem. Soc. 1957, 3727–3732; [Google Scholar]
- 37c. Hughes N. A., Kenner G. W., Todd A., J. Chem. Soc. 1957, 3733–3738. [Google Scholar]
- 38.
- 38a. Baddiley J., Todd A. R., J. Chem. Soc. 1947, 648–651; [DOI] [PubMed] [Google Scholar]
- 38b. Baddiley J., Michelson A. M., Todd A. R., J. Chem. Soc. 1949, 582–586. [Google Scholar]
- 39.
- 39a. Christie S. M. H., Kenner G. W., Todd A. R., Nature 1952, 170, 924; [DOI] [PubMed] [Google Scholar]
- 39b. Christie S. M. H., Kenner G. W., Todd A. R., J. Chem. Soc. 1954, 46–52. [Google Scholar]
- 40.
- 40a. Hodgkin D. C., Pickworth J., Robertson J. H., Trueblood K. N., Prosen R. J., White J. G., Nature 1955, 176, 325–328; [DOI] [PubMed] [Google Scholar]
- 40b. Hodgkin D. C., Kamper J., Mackay M., Pickworth J., Trueblood K. N., White J. G., Nature 1956, 178, 64–66. [DOI] [PubMed] [Google Scholar]
- 41.
- 41a. Loewus F. A., Westheimer F. H., Vennesland B., J. Am. Chem. Soc. 1953, 75, 5018–5023; [Google Scholar]
- 41b. Westheimer F. H., Fisher H. F., Conn E. E., Vennesland B., J. Am. Chem. Soc. 1951, 73, 2403. [Google Scholar]
- 42.
- 42a. Breslow R., J. Am. Chem. Soc. 1957, 79, 1762–1763; [Google Scholar]
- 42b. Breslow R., J. Am. Chem. Soc. 1958, 80, 3719–3726. [Google Scholar]
- 43.
- 43a. Snell E. E., J. Am. Chem. Soc. 1945, 67, 194–197; [Google Scholar]
- 43b. Metzler D. E., Ikawa M., Snell E. E., J. Am. Chem. Soc. 1954, 76, 648–652; [Google Scholar]
- 43c. Snell E. E., J. Biol. Chem. 1944, 154, 313–314. [Google Scholar]
- 44. Bruggink A., Synthesis of β-Lactam Antibiotics, 2001. [Google Scholar]
- 45.
- 45a. Grubhofer N., Schleith L., Naturwissenschaften 1953, 40, 508; [Google Scholar]
- 45b. Michee F., Dinkloh E., Chem. Ber. 1951, 84, 210–212; [Google Scholar]
- 45c. Manecke G., Singer S., Gillert K.-E., Naturwissenschaften 1958, 45, 440–441; [Google Scholar]
- 45d. Bornscheuer U. T., Buchholz K., Eng. Life Sci. 2005, 5, 309–323; [Google Scholar]
- 45e. Elander R. P., Appl. Microbiol. Biotechnol. 2003, 61, 385–392; [DOI] [PubMed] [Google Scholar]
- 45f. Messing R. A., Filbert A. M., J. Agric. Food Chem. 1975, 23, 920–923. [DOI] [PubMed] [Google Scholar]
- 46. Hanover M. L., White J. S., Am. J. Clin. Nutr. 1993, 58, 724S-732S. [DOI] [PubMed] [Google Scholar]
- 47. Bruggink A., Roos E. C., De Vroom E., Org. Process Res. Dev. 1998, 2, 128–133. [Google Scholar]
- 48. Buchholz K., Appl. Microbiol. Biotechnol. 2016, 100, 3825–3839. [DOI] [PubMed] [Google Scholar]
- 49. Hsiao H. Y., Walter J. F., Anderson D. M., Hamilton B. K., Biotechnol. Genet. Eng. Rev. 1988, 6, 179–219. [DOI] [PubMed] [Google Scholar]
- 50. Dawes H., Curr. Biol. 2004, 14, R605–R607. [DOI] [PubMed] [Google Scholar]
- 51. Watson J. D., Crick F. H. C., Nature 1953, 171, 737–738. [DOI] [PubMed] [Google Scholar]
- 52. Anfinsen C. B., Haber E., White S. F. H., Proc. Natl. Acad. Sci. USA 1961, 47, 1309–1314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Jacob F., Monod J., J. Mol. Biol. 1961, 3, 318–356. [DOI] [PubMed] [Google Scholar]
- 54.
- 54a. Gros F., Hiatt H., Gilbert W., Kurland C. G., Risebrough R. W., Watson J. D., Nature 1961, 190, 581–585; [DOI] [PubMed] [Google Scholar]
- 54b. Brenner S., Jacob F., Meselson M., Nature 1961, 190, 576–581. [DOI] [PubMed] [Google Scholar]
- 55. Crick F. H. C., Barnett L., Brenner S., Watts-Tobin R. J., Nature 1961, 192, 1227–1232. [DOI] [PubMed] [Google Scholar]
- 56. Nirenberg M. W., Matthaei J. H., Proc. Natl. Acad. Sci. USA 1961, 47, 1588–1602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.
- 57a. Nirenberg M. W., Matthaei J. H., Jones O. W., Proc. Natl. Acad. Sci. USA 1962, 48, 104–109; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57b. Leder P., Nirenberg M., Proc. Natl. Acad. Sci. USA 1964, 52, 420–427; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57c. Lengyel P., Speyer J. F., Ochoa S., Proc. Natl. Acad. Sci. USA 1961, 47, 1936–1942; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57d. Speyer J. F., Lengyel P., Basilio C., Ochoa S., Proc. Natl. Acad. Sci. USA 1962, 48, 63–68; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57e. Söll D., Ohtsuka E., Jones D. S., Lohrmann R., Hayatsu H., Nishimura S., Khorana H. G., Proc. Natl. Acad. Sci. USA 1965, 54, 1378–1385; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57f. Nirenberg M., Trends Biochem. Sci. 2004, 29, 46–54. [DOI] [PubMed] [Google Scholar]
- 58. Marshall R. E., Caskey C. T., Nirenberg M., Science 1967, 155, 820–826. [DOI] [PubMed] [Google Scholar]
- 59. Sanger F., Nicklen S., Coulson A. R., Proc. Natl. Acad. Sci. USA 1977, 74, 5463–5467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.
- 60a. Rosenblum B., Lee L. G., Spurgeon S. L., Khan S. H., Menchen S. M., Heiner C. R., Che S. M., Nucleic Acids Res. 1997, 25, 4500–4504; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60b. Lee L. G., Spurgeon S. L., Heiner C. R., Benson S. C., Rosenblum B. B., Menchen S. M., Graham R. J., Constantinescu A., Upadhya K. G., Cassel J. M., Nucleic Acids Res. 1997, 25, 2816–2822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Goodwin S., McPherson J. D., McCombie W. R., Nat. Rev. Genet. 2016, 17, 333–351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.“Genome Information by Organism,” can be found under https://www.ncbi.nlm.nih.gov/genome/browse#!/overview/, n. d.
- 63.
- 63a. Lederberg J., Plasmid 1998, 39, 1–9; [DOI] [PubMed] [Google Scholar]
- 63b. Lederberg J., Physiol. Rev. 1952, 32, 403–430. [DOI] [PubMed] [Google Scholar]
- 64. Luria S. E., Human M. L., J. Bacteriol. 1952, 64, 557–569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Bertani G., Weigle J. J., J. Bacteriol. 1953, 65, 113–121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.
- 66a. Arber W., Annu. Rev. Microbiol. 1965, 19, 365–378; [DOI] [PubMed] [Google Scholar]
- 66b. Linn S., Arber W., Proc. Natl. Acad. Sci. USA 1968, 59, 1300–1306; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66c. Arber W., Lin S., Annu. Rev. Biochem. 1969, 38, 467–500. [DOI] [PubMed] [Google Scholar]
- 67.
- 67a. Meselson M., Yuan R., Nature 1968, 217, 1110–1114; [DOI] [PubMed] [Google Scholar]
- 67b. Nathans D., Smith H. O., Annu. Rev. Biochem. 1975, 44, 273–293. [DOI] [PubMed] [Google Scholar]
- 68.
- 68a. Smith H. O., Welcox K. W., J. Mol. Biol. 1970, 51, 379–391; [DOI] [PubMed] [Google Scholar]
- 68b. Kelly T. J., Smith H. O., J. Mol. Biol. 1970, 51, 393–409. [DOI] [PubMed] [Google Scholar]
- 69.
- 69a. Roberts R. J., Proc. Natl. Acad. Sci. USA 2005, 102, 5905–5908; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69b. Danna K., Nathans D., Proc. Natl. Acad. Sci. USA 1971, 68, 2913–2917. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.
- 70a. Mertz J. E., Davis R. W., Proc. Natl. Acad. Sci. USA 1972, 69, 3370–3374; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70b. Jackson D. A., Symons R. H., Berg P., Proc. Natl. Acad. Sci. USA 1972, 69, 2904–2909; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70c.T. A. Brown, Gene Cloning and DNA Analysis: An Introduction, John Wiley & Sons Ltd, 2016;
- 70d. Lobban P. E., Kaiser A. D., J. Mol. Biol. 1973, 78, 453–471. [DOI] [PubMed] [Google Scholar]
- 71. Cohen S. N., Chang A. C., Boyer H. W., Helling R. B., Proc. Natl. Acad. Sci. USA 1973, 70, 3240–3244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.
- 72a. Bolivar F., Rodriguez R. L., Betlach M. C., Boyer H. W., Gene 1977, 2, 75–93; [DOI] [PubMed] [Google Scholar]
- 72b. Bolivar F., Rodrguez R. L., Greene P. J., Betlach M. C., Heyneker H. L., Boyer H. W., Gene 1977, 2, 95–113. [PubMed] [Google Scholar]
- 73.
- 73a. Vieira J., Messing J., Gene 1982, 19, 259–268; [DOI] [PubMed] [Google Scholar]
- 73b. Yanisch-Perron C., Vieira J., Messing J., Gene 1985, 33, 103–119. [DOI] [PubMed] [Google Scholar]
- 74.
- 74a. Rosenberg A. H., Lade B. N., Chui D. S., Lin S. W., Dunn J. J., Studier F. W., Gene 1987, 56, 125–135; [DOI] [PubMed] [Google Scholar]
- 74b. Studier F. W., Rosenberg A. H., Dunn J. J., Dubendorff J. W., Methods Enzymol. 1990, 185, 60–89; [DOI] [PubMed] [Google Scholar]
- 74c. Studier F. W., Moffattf B. A., J. Mol. Biol. 1986, 189, 113–130; [DOI] [PubMed] [Google Scholar]
- 74d. Novy R., Morris B., Innovations 2001, 13, 13–15; [Google Scholar]
- 74e. Daegelen P., Studier F. W., Lenski R. E., Cure S., Kim J. F., J. Mol. Biol. 2009, 394, 634–643. [DOI] [PubMed] [Google Scholar]
- 75.
- 75a. Terpe K., Appl. Microbiol. Biotechnol. 2006, 72, 211–222; [DOI] [PubMed] [Google Scholar]
- 75b. Rosano G. L., Ceccarelli E. A., Front. Microbiol. 2014, 5, 172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Johnson I. S., Science 1983, 219, 632–637. [DOI] [PubMed] [Google Scholar]
- 77. Gualandi-Signorini A. M., Giorgi G., Eur. Rev. Med. Pharmacol. Sci. 2001, 5, 73–83. [PubMed] [Google Scholar]
- 78. Marsden S. R., Mestrom L., McMillan D. G. G., Hanefeld U., ChemCatChem 2020, 12, 426–437. [Google Scholar]
- 79.
- 79a. Harris T. J. R., Lowe P. A., Lyons A., Thomas P. G., Eaton M. A. W., Millican T. A., Patel T. P., Bose C. C., Carey N. H., Doel M. T., Nucleic Acids Res. 1982, 10, 2177–2187; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79b. Emtage J. S., Angal S., Doel M. T., Harris T. J., Jenkins B., Lilley G., Lowe P. A., Proc. Natl. Acad. Sci. USA 1983, 80, 3671–3675; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79c. Hicks C. L., O'Leary J., Bucy J., J. Dairy Sci. 1988, 71, 1127–1131; [Google Scholar]
- 79d. Kumar A., Grover S., Sharma J., Batish V. K., Crit. Rev. Biotechnol. 2010, 30, 243–58; [DOI] [PubMed] [Google Scholar]
- 79e. Gilliland G. L., Winborne E. L., Nachman J., Wlodawer A., Proteins Struct. Funct. Bioinf. 1990, 8, 82–101. [DOI] [PubMed] [Google Scholar]
- 80. Johnson M. E., Lucey J. A., J. Dairy Sci. 2006, 89, 1174–1178. [DOI] [PubMed] [Google Scholar]
- 81. Bornscheuer U. T., Huisman G. W., Kazlauskas R. J., Lutz S., Moore J. C., Robins K., Nature 2012, 485, 185–194. [DOI] [PubMed] [Google Scholar]
- 82.
- 82a. Carlson E. A., Mutat. Res. - Rev. Mutat. Res. 2013, 752, 1–5; [DOI] [PubMed] [Google Scholar]
- 82b. Witkin E. M., Bacteriol. Rev. 1976, 40, 869–907; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82c. Muller H. J., Science 1927, 66, 84–87; [DOI] [PubMed] [Google Scholar]
- 82d. Crow J. F., Abrahamson S., Genetics 1997, 147, 1491–1496; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82e. Lederberg J., Lederberg E. M., J. Bacteriol. 1952, 63, 399–406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.
- 83a. Adrio J. L., Demain A. L., FEMS Microbiol. Rev. 2006, 30, 187–214; [DOI] [PubMed] [Google Scholar]
- 83b. Lerner S. A., Wu T. T., Lin E. C. C., Science 1964, 146, 1313–1315; [DOI] [PubMed] [Google Scholar]
- 83c. Parekh S., Vinci V. A., Strobel R. J., Appl. Microbiol. Biotechnol. 2000, 54, 287–301. [DOI] [PubMed] [Google Scholar]
- 84. Hutchison C. A., Phillips S., Edgell M. H., Gillam S., Jahnke P., Smith M., J. Biol. Chem. 1978, 253, 6551–6560. [PubMed] [Google Scholar]
- 85. Khorana H. G., Pure Appl. Chem. 1968, 17, 349–382. [Google Scholar]
- 86. Gillam S., Rottman F., Jahnke P., Smith M., Proc. Natl. Acad. Sci. USA 1977, 74, 96–100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87. Matteucci M. D., Caruthers M. H., J. Am. Chem. Soc. 1981, 103, 3185–3191. [Google Scholar]
- 88. Beaucage S. L., Iyer R. P., Tetrahedron 1992, 48, 2223–2311. [Google Scholar]
- 89. Winter G., Fersht A. R., Wilkinson A. J., Zoller M., Smith M., Nature 1982, 299, 756–758. [DOI] [PubMed] [Google Scholar]
- 90.
- 90a. Brannigan J. A., Wilkinson A. J., Nat. Rev. Mol. Cell Biol. 2002, 3, 964–70; [DOI] [PubMed] [Google Scholar]
- 90b. Estell D. A., Graycar T. P., Wells J. A., J. Biol. Chem. 1985, 260, 6518–6521. [PubMed] [Google Scholar]
- 91. Quax W. J., Mrabet N. T., Luiten R. G. M., Schuurhuizen P. W., Stanssens P., Lasters I., Bio/Technology 1991, 9, 738–742. [DOI] [PubMed] [Google Scholar]
- 92.
- 92a.J. M. S. Bartlett, D. Stirling, Eds., Methods in Molecular Biology: PCR Protocols, Humana Press Inc., Totowa, 2003;
- 92b. Saiki R. K., Scharf S., Faloona F., Mullis K. B., Horn G. T., Erlich H. A., Arnheim N., Science 1985, 230, 6–11; [DOI] [PubMed] [Google Scholar]
- 92c. Saik R. K., Gelfand D. H., Stoffel S., Scharf S. J., Higuchi R., Horn G. T., Mullis K. B., Erlich H. A., Science 1988, 239, 487–491. [DOI] [PubMed] [Google Scholar]
- 93. Arnold F. H., Angew. Chem. Int. Ed. 2019, 58, 14420–14426. [DOI] [PubMed] [Google Scholar]
- 94.
- 94a. Chen K., Arnold F. H., Bio/Technology 1991, 9, 1073–1077; [DOI] [PubMed] [Google Scholar]
- 94b. Chen K., Arnold F. H., Proc. Natl. Acad. Sci. USA 1993, 90, 5618–5622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.
- 95a. Stemmer W. P. C., Nature 1994, 370, 389–391; [DOI] [PubMed] [Google Scholar]
- 95b. Stemmer W. P. C., Proc. Natl. Acad. Sci. USA 1994, 91, 10747–10751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96. Moore J. C., Arnold F. H., Nat. Biotechnol. 1996, 14, 458–467. [DOI] [PubMed] [Google Scholar]
- 97.
- 97a. Cherry J. R., Lamsa M. H., Schneider P., Vind J., Svendsen A., Jones A., Pedersen A. H., Nat. Biotechnol. 1999, 17, 379–384; [DOI] [PubMed] [Google Scholar]
- 97b. Zhao H., Giver L., Shao Z., Affholter J. A., Arnold F. H., Nat. Biotechnol. 1998, 16, 258–261; [DOI] [PubMed] [Google Scholar]
- 97c. Ness J. E., Welch M., Giver L., Bueno M., Cherry J. R., Borchert T. V., Stemmer W. P. C., Minshull J., Nat. Biotechnol. 1999, 17, 893–896; [DOI] [PubMed] [Google Scholar]
- 97d. Giver L., Gershenson A., Freskgard P. O., Arnold F. H., Proc. Natl. Acad. Sci. USA 1998, 95, 12809–12813; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97e. Taguchi S., Ozaki A., Momose H., Appl. Environ. Microbiol. 1998, 64, 492–495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.
- 98a. Kumamaru T., Suenaga H., Mitsuoka M., Watanabe T., Furukawa K., Nat. Biotechnol. 1998, 16, 663–666; [DOI] [PubMed] [Google Scholar]
- 98b. Yano T., Oue S., Kagamiyama H., Proc. Natl. Acad. Sci. USA 1998, 95, 5511–5515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.
- 99a. May O., Nguyen P. T., Arnold F. H., Nat. Biotechnol. 2000, 18, 317–320; [DOI] [PubMed] [Google Scholar]
- 99b. Liebeton K., Zonta A., Schimossek K., Nardini M., Lang D., Dijkstra B. W., Reetz M. T., Jaeger K. E., Chem. Biol. 2000, 7, 709–718. [DOI] [PubMed] [Google Scholar]
- 100.
- 100a. Arnold F. H., Acc. Chem. Res. 1998, 31, 125–131; [Google Scholar]
- 100b. Arnold F. H., Volkov A. A., Curr. Opin. Chem. Biol. 1999, 3, 54–59; [DOI] [PubMed] [Google Scholar]
- 100c. Reetz M. T., Jaeger K.-E., Top. Curr. Chem. 1999, 200, 31–57; [Google Scholar]
- 100d. Arnold F. H., Nature 2001, 409, 253–257; [DOI] [PubMed] [Google Scholar]
- 100e. Cobb R. E., Chao R., Zhao H., AlChE J. 2013, 59, 1432–1440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.
- 101a. Schmid A., Dordick J. S., Hauer B., Kiener A., Wubbolts M., Witholt B., Nature 2001, 409, 258–268; [DOI] [PubMed] [Google Scholar]
- 101b. Schoemaker H. E., Mink D. L., WubboLts M. G., Science 2003, 299, 1694–1697. [DOI] [PubMed] [Google Scholar]
- 102.
- 102a. Murphy M. A., Found. Chem. 2018, 20, 121–165; [Google Scholar]
- 102b. Anastas P. T., Kirchhoff M. M., Williamson T. C., Appl. Catal. A 2001, 221, 3–13; [Google Scholar]
- 102c. Trost B. M., Science 1991, 254, 1471–1477. [DOI] [PubMed] [Google Scholar]
- 103.
- 103a. Anastas P. T., Kirchhoff M. M., Acc. Chem. Res. 2002, 35, 686–694; [DOI] [PubMed] [Google Scholar]
- 103b. Linthorst J. A., Found. Chem. 2010, 12, 55–68. [Google Scholar]
- 104.
- 104a. Ni Y., Holtmann D., Hollmann F., ChemCatChem 2014, 6, 930–943; [Google Scholar]
- 104b. Sheldon R. A., Woodley J. M., Chem. Rev. 2018, 118, 801–838. [DOI] [PubMed] [Google Scholar]
- 105.
- 105a. Straathof A. J. J., Panke S., Schmid A., Curr. Opin. Biotechnol. 2002, 13, 548–556; [DOI] [PubMed] [Google Scholar]
- 105b. Sheldon R. A., Brady D., Bode M. L., Chem. Sci. 2020, 11, 2587–2605; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105c. Woodley J. M., Appl. Microbiol. Biotechnol. 2019, 103, 4733–4739. [DOI] [PubMed] [Google Scholar]
- 106. Balkenhohl F., Ditrich K., Hauer B., Aktiengesellschaft B., J. für Prakt. Chemie 1997, 339, 381–384. [Google Scholar]
- 107. Reetz M. T., J. Am. Chem. Soc. 2013, 135, 12480–12496. [DOI] [PubMed] [Google Scholar]
- 108.
- 108a. Breuer M., Ditrich K., Habicher T., Hauer B., Keßeler M., Stürmer R., Zelinski T., Angew. Chem. Int. Ed. 2004, 43, 788–824; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2004, 116, 806–843; [Google Scholar]
- 108b. Basso A., Serban S., Mol. Catal. 2019, 479, 110607. [Google Scholar]
- 109.
- 109a. Qu G., Li A., Acevedo-Rocha C. G., Sun Z., Reetz M. T., Angew. Chem. Int. Ed. 2020, 59, 13204–13231; [DOI] [PubMed] [Google Scholar]
- 109b. Reetz M. T., Angew. Chem. Int. Ed. 2011, 50, 138–174; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2011, 123, 144–182. [Google Scholar]
- 110. Voigt C. A., Martinez C., Wang Z. G., Mayo S. L., Arnold F. H., Nat. Struct. Biol. 2002, 9, 553–558. [DOI] [PubMed] [Google Scholar]
- 111. Bommarius A. S., Blum J. K., Abrahamson M. J., Curr. Opin. Chem. Biol. 2011, 15, 194–200. [DOI] [PubMed] [Google Scholar]
- 112. Leman J. K., Weitzner B. D., Lewis S. M., Adolf-Bryfogle J., Alam N., Alford R. F., Aprahamian M., Baker D., Barlow K. A., Barth P., Basanta B., Bender B. J., Blacklock K., Bonet J., Boyken S. E., Bradley P., Bystroff C., Conway P., Cooper S., Correia B. E., Coventry B., Das R., De Jong R. M., DiMaio F., Dsilva L., Dunbrack R., Ford A. S., Frenz B., Fu D. Y., Geniesse C., Goldschmidt L., Gowthaman R., Gray J. J., Gront D., Guffy S., Horowitz S., Huang P. S., Huber T., Jacobs T. M., Jeliazkov J. R., Johnson D. K., Kappel K., Karanicolas J., Khakzad H., Khar K. R., Khare S. D., Khatib F., Khramushin A., King I. C., Kleffner R., Koepnick B., Kortemme T., Kuenze G., Kuhlman B., Kuroda D., Labonte J. W., Lai J. K., Lapidoth G., Leaver-Fay A., Lindert S., Linsky T., London N., Lubin J. H., Lyskov S., Maguire J., Malmström L., Marcos E., Marcu O., Marze N. A., Meiler J., Moretti R., Mulligan V. K., Nerli S., Norn C., Ó’Conchúir S., Ollikainen N., Ovchinnikov S., Pacella M. S., Pan X., Park H., Pavlovicz R. E., Pethe M., Pierce B. G., Pilla K. B., Raveh B., Renfrew P. D., Burman S. S. R., Rubenstein A., Sauer M. F., Scheck A., Schief W., Schueler-Furman O., Sedan Y., Sevy A. M., Sgourakis N. G., Shi L., Siegel J. B., Silva D. A., Smith S., Song Y., Stein A., Szegedy M., Teets F. D., Thyme S. B., Wang R. Y. R., Watkins A., Zimmerman L., Bonneau R., Nat. Methods 2020, 17, 665–680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113. Reetz M. T., Carballeira J. D., Vogel A., Angew. Chem. Int. Ed. 2006, 45, 7745–7751; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2006, 118, 7909–7915. [Google Scholar]
- 114. Dalby P. A., Curr. Opin. Struct. Biol. 2011, 21, 473–480. [DOI] [PubMed] [Google Scholar]
- 115.
- 115a. Cobb R. E., Si T., Zhao H., Curr. Opin. Chem. Biol. 2012, 16, 285–291; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115b. Moore J. C., Rodriguez-Granillo A., Crespo A., Govindarajan S., Welch M., Hiraga K., Lexa K., Marshall N., Truppo M. D., ACS Synth. Biol. 2018, 7, 1730–1741. [DOI] [PubMed] [Google Scholar]
- 116.
- 116a. Reetz M. T., Bocola M., Carballeira J. D., Zha D., Vogel A., Angew. Chem. Int. Ed. 2005, 44, 4192–4196; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2005, 117, 4264–4268; [Google Scholar]
- 116b. Reetz M. T., Wang L. W., Bocola M., Angew. Chem. Int. Ed. 2006, 45, 1236–1241; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2006, 118, 1258–1263; [Google Scholar]
- 116c. Reetz M. T., Carballeira J. D., Peyralans J., Höbenreich H., Maichele A., Vogel A., Chem. Eur. J. 2006, 12, 6031–6038. [DOI] [PubMed] [Google Scholar]
- 117. Reetz M. T., Chem. Rec. 2016, 16, 2449–2459. [DOI] [PubMed] [Google Scholar]
- 118.
- 118a. Abrahamson M. J., Vázquez-Figueroa E., Woodall N. B., Moore J. C., Bommarius A. S., Angew. Chem. Int. Ed. 2012, 51, 3969–3972; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2012, 124, 4036–4040; [Google Scholar]
- 118b. Abrahamson M. J., Wong J. W., Bommarius A. S., Adv. Synth. Catal. 2013, 355, 1780–1786; [Google Scholar]
- 118c. Bommarius B. R., Schürmann M., Bommarius A. S., Chem. Commun. 2014, 50, 14953–14955. [DOI] [PubMed] [Google Scholar]
- 119. Reetz M. T., Prasad S., Carballeira J. D., Gumulya Y., Bocola M., J. Am. Chem. Soc. 2010, 132, 9144–9152. [DOI] [PubMed] [Google Scholar]
- 120. Yang K. K., Wu Z., Arnold F. H., Nat. Methods 2019, 16, 687–694. [DOI] [PubMed] [Google Scholar]
- 121. Fox R., J. Theor. Biol. 2005, 234, 187–199. [DOI] [PubMed] [Google Scholar]
- 122. Fox R. J., Davis S. C., Mundorff E. C., Newman L. M., Gavrilovic V., Ma S. K., Chung L. M., Ching C., Tam S., Muley S., Grate J., Gruber J., Whitman J. C., Sheldon R. A., Huisman G. W., Nat. Biotechnol. 2007, 25, 338–344. [DOI] [PubMed] [Google Scholar]
- 123. Savile C. K., Janey J. M., Mundorff E. C., Moore J. C., Tam S., Jarvis W. R., Colbeck J. C., Krebber A., Fleitz F. J., Brands J., Devine P. N., Huisman G. W., Hughes G. J., Science 2010, 329, 305–310. [DOI] [PubMed] [Google Scholar]
- 124. Schober M., MacDermaid C., Ollis A. A., Chang S., Khan D., Hosford J., Latham J., Ihnken L. A. F., Brown M. J. B., Fuerst D., Sanganee M. J., Roiban G. D., Nat. Can. 2019, 2, 909–915. [Google Scholar]
- 125. Roiban G. D., Kern M., Liu Z., Hyslop J., Tey P. L., Levine M. S., Jordan L. S., Brown K. K., Hadi T., Ihnken L. A. F., Brown M. J. B., ChemCatChem 2017, 9, 4475–4479. [Google Scholar]
- 126. Kan S. B. J., Lewis R. D., Chen K., Arnold F. H., Science 2016, 354, 1048–1051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127. Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J., J. Mol. Biol. 1990, 215, 403–410. [DOI] [PubMed] [Google Scholar]
- 128. Jiang J., Chen X., Zhang D., Wu Q., Zhu D., Appl. Microbiol. Biotechnol. 2015, 99, 2613–2621. [DOI] [PubMed] [Google Scholar]
- 129. Steffen-Munsberg F., Vickers C., Thontowi A., Schätzle S., Tumlirsch T., SvedendahlHumble M., Land H., Berglund P., Bornscheuer U. T., Höhne M., ChemCatChem 2013, 5, 150–153. [Google Scholar]
- 130. Britton J., Majumdar S., Weiss G. A., Chem. Soc. Rev. 2018, 47, 5891–5918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.M. Romero-Fernández, F. Paradisi, in Catal. Immobil. Methods Appl (Eds.: M. Benaglia, A. Puglisi), 2020, pp. 409–435.
- 132. Thompson M., Penafiel I., Cosgrove S. C., Turner N. J., Org. Process Res. Dev. 2018, acs.oprd.8b00305. [Google Scholar]
- 133.J. M. Guisan, Ed., Immobilization of Enzymes and Cells, Humana Press Inc., 2013.
- 134. Sheldon R. A., Adv. Synth. Catal. 2007, 349, 1289–1307. [Google Scholar]
- 135. Hochuli E., Bannwarth W., Döbeli H., Nat. Biotechnol. 1988, 6, 1321–1325. [Google Scholar]
- 136.
- 136a. Engelmark Cassimjee K., Kadow M., Wikmark Y., Svedendahl Humble M., Rothstein M. L., Rothstein D. M., Bäckvall J. E., Chem. Commun. 2014, 50, 9134–9137; [DOI] [PubMed] [Google Scholar]
- 136b. Böhmer W., Volkov A., Engelmark Cassimjee K., Mutti F. G., Adv. Synth. Catal. 2020, 362, 1858–1867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137. Böhmer W., Knaus T., Volkov A., Slot T. K., Shiju N. R., Engelmark Cassimjee K., Mutti F. G., J. Biotechnol. 2019, 291, 52–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.
- 138a. Planchestainer M., Padrosa D. R., Contente M. L., Paradisi F., Catalysts 2018, 8, 40; [Google Scholar]
- 138b. Padrosa D. R., De Vitis V., Contente M. L., Molinari F., Paradisi F., Catalysts 2019, 9, 232. [Google Scholar]
- 139.
- 139a. Anderson G. P., Liu J. L., Shriver-Lake L. C., Zabetakis D., Sugiharto V. A., Chen H. W., Lee C. R., Defang G. N., Wu S. J. L., Venkateswaran N., Goldman E. R., Anal. Chem. 2019, 91, 9424–9429; [DOI] [PubMed] [Google Scholar]
- 139b. Wong J. X., Gonzalez-Miro M., Sutherland-Smith A. J., Rehm B. H. A., Front. Bioeng. Biotechnol. 2020, 8, 1–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 140. Jensen V. J., Rugh S., Methods Enzymol. 1987, 136, 356–370. [Google Scholar]
- 141. Schrittwieser J. H., Velikogne S., Hall M., Kroutil W., Chem. Rev. 2018, 118, 270–348. [DOI] [PubMed] [Google Scholar]
- 142. Contente M. L., Paradisi F., Nat. Can. 2018, 1, 452–459. [Google Scholar]
- 143. Dawood A. W. H., Bassut J., de Souza R. O. M. A., Bornscheuer U., Chem. Eur. J. 2018, 24, 16009–16013. [DOI] [PubMed] [Google Scholar]
- 144. Latham J., Henry J. M., Sharif H. H., Menon B. R. K., Shepherd S. A., Greaney M. F., Micklefield J., Nat. Commun. 2016, 7, 1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145. Huffman M. A., Fryszkowska A., Alvizo O., Borra-Garske M., Campos K. R., Canada K. A., Devine P. N., Duan D., Forstater J. H., Grosser S. T., Halsey H. M., Hughes G. J., Jo J., Joyce L. A., Kolev J. N., Liang J., Maloney K. M., Mann B. F., Marshall N. M., McLaughlin M., Moore J. C., Murphy G. S., Nawrat C. C., Nazor J., Novick S., Patel N. R., Rodriguez-Granillo A., Robaire S. A., Sherer E. C., Truppo M. D., Whittaker A. M., Verma D., Xiao L., Xu Y., Yang H., Science 2019, 366, 1255–1259. [DOI] [PubMed] [Google Scholar]
- 146. Fukuyama K., Ohrui H., Kuwahara S., Org. Lett. 2015, 17, 828–831. [DOI] [PubMed] [Google Scholar]
- 147. Mazurenko S., Prokop Z., Damborsky J., ACS Catal. 2020, 10, 1210–1223. [Google Scholar]
- 148. Hon J., Borko S., Stourac J., Prokop Z., Zendulka J., Bednar D., Martinek T., Damborsky J., Nucleic Acids Res. 2020, 48, W104–W109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 149. Vavra O., Filipovic J., Plhak J., Bednar D., Marques S. M., Brezovsky J., Stourac J., Matyska L., Damborsky J., Bioinformatics 2019, 35, 4986–4993. [DOI] [PubMed] [Google Scholar]
- 150.
- 150a. Zeymer C., Hilvert D., Annu. Rev. Biochem. 2018, 87, 131–157; [DOI] [PubMed] [Google Scholar]
- 150b. Siegel J. B., Zanghellini A., Lovick H. M., Kiss G., Lambert A. R., Clair J. L. S., Gallaher J. L., Hilvert D., Gelb M. H., Stoddard B. L., Houk K. N., Michael F. E., Baker D., Science 2010, 329, 309–313; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 150c. Obexer R., Godina A., Garrabou X., Mittl P. R. E., Baker D., Griffiths A. D., Hilvert D., Nat. Chem. 2017, 9, 50–56; [DOI] [PubMed] [Google Scholar]
- 150d. Macdonald D. S., Garrabou X., Klaus C., Verez R., Mori T., Hilvert D., J. Am. Chem. Soc. 2020, 142, 10250–10254. [DOI] [PubMed] [Google Scholar]
- 151. Drienovská I., Roelfes G., Nat. Can. 2020, 3, 193–202. [Google Scholar]
- 152. Keasling J. D., Metab. Eng. 2012, 14, 189–195. [DOI] [PubMed] [Google Scholar]
- 153.
- 153a. Schwarz H., Schmittner M., Duschl A., Horejs-Hoeck J., PLoS One 2014, 9, e113840; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 153b. Mamat U., Wilke K., Bramhill D., Schromm A. B., Lindner B., Kohl T. A., Corchero J. L., Villaverde A., Schaffer L., Head S. R., Souvignier C., Meredith T. C., Woodard R. W., Microb. Cell Factories 2015, 14, 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 154.
- 154a. Fernández F. J., Vega M. C., in Advanced Technologies for Protein Complex Production and Characterization. Advances in Experimental Medicine and Biology (Ed.: Vega M. C.), Springer, Cham, Heidelberg, 2016, pp. 15–24; [PubMed] [Google Scholar]
- 154b. Allers T., Bioeng. Bugs 2010, 1, 290–292; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 154c. Larsen M. W., Bornscheuer U. T., Hult K., Protein Expr. Purif. 2008, 62, 90–97; [DOI] [PubMed] [Google Scholar]
- 154d. Terpe K., Appl. Microbiol. Biotechnol. 2006, 72, 211–222. [DOI] [PubMed] [Google Scholar]
- 155. Hauer B., ACS Catal. 2020, 10, 8418–8427. [Google Scholar]