ABSTRACT
The cotranslational incorporation of pyrrolysine (Pyl), the 22nd proteinogenic amino acid, into proteins in response to the UAG stop codon represents an outstanding example of natural genetic code expansion. Genetic encoding of Pyl is conducted by the pyrrolysyl-tRNA synthetase (PylRS) and its cognate tRNA, tRNAPyl. Owing to the high tolerance of PylRS toward diverse amino acid substrates and great orthogonality in various model organisms, the PylRS/tRNAPyl-derived pairs are ideal for genetic code expansion to insert noncanonical amino acids (ncAAs) into proteins of interest. Since the discovery of cellular components involved in the biosynthesis and genetic encoding of Pyl, synthetic biologists have been enthusiastic about engineering PylRS/tRNAPyl-derived pairs to rewrite the genetic code of living cells. Recently, considerable progress has been made in understanding the molecular phylogeny, biochemical properties, and structural features of the PylRS/tRNAPyl pair, guiding its further engineering and optimization. In this review, we cover the basic and updated knowledge of the PylRS/tRNAPyl pair’s unique characteristics that make it an outstanding tool for reprogramming the genetic code. In addition, we summarize the recent efforts to create efficient and (mutually) orthogonal PylRS/tRNAPyl-derived pairs for incorporation of diverse ncAAs by genome mining, rational design, and advanced directed evolution methods.
KEYWORDS: pyrrolysyl-tRNA synthetase, tRNAPyl, archaea, genetic code expansion, noncanonical amino acids, synthetic biology
INTRODUCTION
With few exceptions, all living organisms genetically encode 20 amino acids according to the rules of the genetic code. It is now known that variations to the standard genetic code table are present in nature, including, but not limited to, codon reassignment and ambiguous decoding (1). A notable example is site-specific incorporation of Pyl into polypeptides in response to an in-frame UAG stop codon in certain anaerobic archaea and bacteria (2, 3). Inspired by surprising findings of genetic code flexibility, scientists aim to site-specifically install distinct noncanonical amino acids (ncAAs) with unique physicochemical properties into proteins in living cells. Pioneered by Peter Schultz and others, numerous useful ncAAs have been genetically encoded in various model organisms via genetic code expansion technology (4–6) (Fig. 1), which facilitates the investigation of protein structure and function and enables the development of novel methods to study, control, and evolve biological processes, as summarized in other reviews (7–11).
FIG 1.
Genetic encoding of noncanonical amino acids (ncAAs). The schematic diagram showing the orthogonal aaRS/tRNA pair (grayish purple) decodes the amber stop codon to incorporate ncAA (grayish purple diamond) site-specifically into proteins by genetic code expansion. The resulting products include the full-length ncAA-containing proteins of interest and the undesired truncated polypeptide.
Developing orthogonal aminoacyl-tRNA synthetase and tRNA (aaRS/tRNA) pairs, the central biomolecules ensuring accurate genetic code interpretation, is a prerequisite for genetically encoding “designer” ncAAs. The cross talk between the orthogonal aaRS/tRNA pair and the endogenous tRNAs and aaRSs should be minimized. In addition, the introduced tRNA should be able to decode a specific codon. The UAG stop codon is commonly used since it is not assigned to an amino acid and is the rarest codon in many organisms such as Escherichia coli. Finally, by repurposing the amino acid substrate specificity of the orthogonal aaRS, the orthogonal tRNA is acylated with the desired ncAA, and the resulting ncAA-tRNA is installed into the growing polypeptide during protein translation (Fig. 1). As a natural amber suppressor, tRNAPyl is specifically charged by its cognate PylRS enzyme to form Pyl-tRNAPyl (12, 13), which is then decoded in the same manner as canonical tRNAs during ribosomal protein biosynthesis (Fig. 2A). Despite the PylRS/tRNAPyl pair allowing site-specific incorporation of Pyl into proteins in response to the in-frame UAG codon in the Pyl-decoding organisms, the reassignment of UAG codon to Pyl also leads to the extension of many cellular proteins where UAG specifies translation termination. Pioneering studies demonstrated natural PylRS/tRNAPyl pairs could enable genetic encoding of Pyl and its analogs in E. coli (12, 14, 15). Since then, the PylRS/tRNAPyl-derived pairs have become a popular choice for synthetic biologists to expand the genetic code with ncAAs.
FIG 2.
Biosynthesis and genetic encoding of Pyl in nature and the domain organization of PylRS enzymes. (A) Schematic diagram showing the Pyl biosynthesis pathway. Two lysine molecules (shown as black and cyan, respectively) are used to synthesize the Pyl in three steps by a cascade of enzymes consisting of PylB, PylC, and PylD. The free Pyl is ligated to tRNAPyl, a natural amber suppressor, by its cognate PylRS enzyme to form Pyl-tRNAPyl. By decoding the in-frame amber codon, Pyl is installed into proteins in Pyl-utilizing organisms. (B) Domain organization of the three classes of PylRS enzymes (PylSn–PylSc fusion, PylSn+PylSc, and ΔPylSn). The C-terminal catalytic domain (PylSc), the N-terminal tRNA-binding domain (PylSn), and linker region are labeled in orange, light blue, and dark blue, respectively. The representative species containing each class of PylRS are indicated. The crystal structures of PylSc of M. mazei (PDB accession no. 2ZIM), D. hafniense (PDB accession no. 2ZNI), and “Ca. Methanomethylophilus alvus” (PDB accession no. 6JP2) are shown.
In this review, we provide essential and updated information about key features of the PylRS/tRNAPyl pair that make this tool highly amenable to genetic code expansion. More details on the discovery, ncAA substrate diversity, and structural and functional properties of the native and engineered PylRS/tRNAPyl pair can be found in previous reviews (16–19). Particularly, we focus on key developments in improving the efficiency and (mutual) orthogonality of PylRS/tRNAPyl for cotranslational and site-specific incorporation of many useful ncAAs in the past 5 years; the relevant new approaches and mechanistic understandings accompanying the development and optimization processes are also discussed.
BIOSYNTHESIS AND GENETIC ENCODING OF Pyl
The discovery of Pyl was initiated by Krzycki et al. They discovered the presence of an in-frame UAG stop codon within a gene encoding the methylamine methyltransferase in Methanosarcina barkeri (20, 21). The chemical identity of Pyl was later determined at the position of an UAG-directed residue based on the high-resolution crystal structures of monomethylamine methyltransferase (3, 22). The pylT and pylS genes encoding tRNAPyl and PylRS, respectively, are found in the vicinity of methyltransferase gene cluster (23). Unlike tRNA-dependent pathways for the biosynthesis and genetic encoding of certain amino acids such as selenocysteine (24, 25), the free Pyl is synthesized from two lysine molecules and specifically ligated to tRNAPyl catalyzed by the PylRS enzyme in all Pyl-utilizing organisms (Fig. 2A). Biosynthesis of Pyl is mediated by a cascade of enzymes, including PylB, PylC, and PylD enzymes. Specifically, the radical S-adenosyl-methionine enzyme PylB converts one lysine to 3-methylornithine, which is then ligated to the second lysine moiety with the help of the ATP-dependent enzyme PylC. Finally, the intermediate product is oxidized by the amino acid dehydrogenase PylD to produce Pyl (Fig. 2A) (26). Heterogenous expressing of the pylTSBCD gene cassette in E. coli was found to enable cotranslational incorporation of Pyl by amber suppression (27), indicating that all the natural components responsible for biosynthesis and genetic encoding of Pyl have been identified.
Biological function of Pyl.
The Pyl encoding system is concentrated in methylotrophic methanogens (28). In these Pyl-utilizing organisms, biosynthesis and genetic encoding of Pyl are needed to produce the methylamine methyltransferase, an enzyme that allows the cell to utilize methylamines as a carbon source and energy. Deletion of pylT gene in Methanosarcina acetivorans was found to disable the cell growth on methylamines (29). The appearance of Pyl in the active sites of the methylamine methyltransferases and the co-occurrence of the Pyl operon and at least one methylamine methyltransferase in Pyl-utilizing organisms strongly suggest the essential role of Pyl in catalyzing methane formation (3, 30). In contrast, Pyl, in the archaeal tRNAHis guanylyltransferases, is a dispensable residue and is only required to produce the full-length protein via amber suppression (31); thus, the biological function of Pyl is not solely restricted to its catalytic role. Interestingly, deduction of the genetic code from 21 to 20 amino acids in M. acetivorans by deletion of tRNAPyl leads to a global change of the proteome such as reduced level of methanol methanogenesis enzymes, which indicates that the genetic code expansion of Pyl is related to the proteome composition and cellular metabolism (32). Overall, the exact biological role of Pyl might differ in distinct native environments and requires further investigation.
PylRS: A UNIQUE aaRS FOR COTRANSLATIONAL INSTALLATION OF Pyl
As the dedicated aaRS for genetic encoding of Pyl, PylRS has been extensively characterized to understand its phylogenetic, structural, and functional characteristics. Most PylRS enzymes consist of the C-terminal catalytic domain (PylSc) that resembles that of the class II aaRSs (33, 34) and the N-terminal tRNA-binding domain (PylSn) that shares no sequence homology to any known RNA-binding proteins (35). Based on the different compositions of PylSn and PylSc, PylRS enzymes are categorized into three classes, PylSn–PylSc fusion, PylSn+PylSc, and ΔPylSn (Fig. 2B). This classification is consistent with recent phylogenetic analyses based on PylSc sequences (28, 36, 37). The two domains of the PylSn–PylSc fusion enzyme are linked by an unstructured loop with different lengths (Fig. 2B). The pylS genes encoding the PylSn–PylSc fusion class enzymes have been identified in a total of 47 archaeal genomes, even including nonmethanogenic archaea (28). Derivatives of Methanosarcina mazei PylRS (MmPylRS) and M. barkeri PylRS (MbPylRS) represent the most widely used PylSn–PylSc fusion enzymes, which enables the site-specific incorporation of more than 100 ncAAs into different proteins so far (16). The PylSn+PylSc class enzymes are expressed as different polypeptides encoded by separate pylS genes (35). This class of PylRS enzymes is commonly observed in Pyl-utilizing bacteria. Interestingly, the co-occurrence of individual genes encoding PylSc and PylSn was also found in seven archaeal genomes, including two Asgardarchaeota lineages (28, 38). As the most recently discovered class, the ΔPylSn enzymes are encoded in the genomes that lack gene homologs encoding PylSn. Overall, these newly identified pylS genes extend our knowledge about the diversity and molecular phylogeny of PylRS enzymes in nature.
The recent discovery of ΔPylSn enzymes.
The discovery of numerous ΔPylSn enzymes opens up new opportunities to expand the PylRS-based toolbox for genetic code expansion. A notable example of ΔPylSn enzyme is that from “Candidatus Methanomethylophilus alvus” (MaPylRS), which displayed robust UAG readthrough activity in E. coli cells and cell-free systems (39, 40). However, a recent biochemical study showed that the aminoacylation activity of MaPylRS is less competent than MmPylRS (28), suggesting other factors might be responsible for the high amber suppression activity of MaPylRS in vivo (for example, the solubility of MaPylRS is much higher than MmPylRS [40]). In addition, MaPylRS and its cognate, MatRNAPyl, could be engineered to become mutually orthogonal, with PylRS belonging to the PylSn–PylSc fusion class (39, 41–43). Based on the sequence similarity, two distinct clusters of ΔPylSn enzymes (class A and class B) have been disclosed; measurements of amber suppression activity in E. coli found these ΔPylSn enzymes preferentially work with the tRNAPyls belonging to the same class (44) (Fig. 3A). Another surprising discovery about the ΔPylSn class enzyme is the simultaneous presence of two distinct PylRS enzymes encoded in the genome of Methanomassiliicoccus luminyensis B10 and “Candidatus Methanohalarchaeum thermophilum” HMET1 (45). Interestingly, the two wild-type PylRS/tRNAPyl pairs from HMET1 share different identity elements and are mutually orthogonal in haloarchaeon Haloferax volcanii (45) (Fig. 3A). The combination of engineered ΔPylSn and PylSn–PylSc fusion enzymes enables the simultaneous incorporation of multiple distinct ncAAs into a single polypeptide (39, 41–44, 46–48) (discussed below).
FIG 3.
Schematic diagram showing the mutually orthogonal PylRS/tRNAPyl pairs found in nature and developed by engineering. (A) Mutually orthogonal PylRS/tRNAPyl pairs found in nature. The left panel shows the mutually orthogonal pairs identified from HMET1; the unique features (motif 2 loop with a single amino acid deletion in PylRS2 and A73 discriminator base in tRNAPyl2) accounting for mutual orthogonality are highlighted in red. The right panel shows two distinct clusters of ΔPylSn enzymes (class A and class B), and some of these ΔPylSn/ΔPylSntRNAPyl pairs display mutual orthogonality. Some unique nucleotides between class A and class B ΔPylSntRNAPyl are indicated in brown and black, respectively. (B) Mutually orthogonal PylRS/tRNAPyl pairs by engineering. The variable loop of ΔPylSntRNAPyl (such as MatRNAPyl) recognized by the PylSn domain could be engineered to become orthogonal with PylRS belonging to the PylSn–PylSc fusion class (such as MmPylRS). Triply orthogonal PylRS/tRNAPyl pairs could be further created by evolving the variable loop of class A ΔPylSntRNAPyl and both the acceptor stem and variable loop of the class B ΔPylSntRNAPyl with PylSn–PylSc full-length pairs. The evolved sites are shown in cyan.
High tolerance for amino acid substrates.
PylRS harbors a deep catalytic pocket and has low selectivity toward the side chain and α-amine of substrate amino acids (34, 49–51). The remarkably broad polyspecificity of PylRS for the substrate amino acids has been illustrated by crystal structures of the wild-type and engineered PylRS enzymes in complex with Pyl and its analogs (19). A recent study provides new insights into how bulky lysine derivatives are recognized by the MmPylRS variant bearing two mutations (Y306A and Y384F) by determining crystal structures of PylSc in complex with 14 different ncAAs with diverse functional applications (such as click chemistry) (52), which reveals distinct binding modes and additional key residues for the substrate interaction of PylRS enzyme. Only a few modifications of the PylRS catalytic pocket are required to repurpose this enzyme to recognize phenylalanine and histidine derivatives (53–56). In addition, PylRS variants with improved specificity for aromatic ncAA analogs have been recently developed via the semirational design of the enzyme-substrate binding pocket (57). As PylRS weakly recognizes the α-amine group, noncanonical substrates with high backbone diversity, including α-hydroxyacid, Nα-methyl-amino acid, and d-Pyl derivatives, could be recognized by the wild-type and evolved PylRS (58, 59). Although high plasticity in the active site of PylRS is desirable for genetic code expansion of diverse ncAAs, the cross-reactivity between evolved PylRS enzymes and different ncAA substrates should be paid attention to in future applications of simultaneous incorporation of multiple ncAAs with structural similarity into a single polypeptide.
Recognition of tRNAPyl by PylRS.
PylRS has evolved to specifically recognize tRNAPyl, a noncanonical tRNA bearing several unique features (17). In contrast to the majority of aaRS enzymes that employ the tRNA anticodon as an identity element (60), PylRS does not recognize the anticodon of tRNAPyl based on the biochemical and structural data (14, 61, 62). This notable feature allows the facile engineering of tRNAPyl to decode diverse codons, such as unnatural codons and four-base codons (63, 64). Early aminoacylation assays discovered that the discriminator base (G73) and the first base pair in the acceptor stem (G1·C72) in tRNAPyl serve as the major identity elements (14, 65). The crystal structure of the PylRS/tRNAPyl pair from Desulfitobacterium hafniense further illustrates the key role of motif 2 loop in interacting G73 and the G1-C72 base pair in a base-specific manner (61). PylRS also has evolved a specialized tRNA binding surface to accommodate the compact core of tRNAPyl while rejecting the bulkier tertiary core of other canonical tRNAs (61). The crystal structure of the PylSn domain of MmPylRS in complex with MmtRNAPyl further explains the high orthogonality of the PylSn–PylSc fusion class enzyme (62). Mechanistically, the steric constraint is employed by the PylSn domain to accept the shortened variable loop of tRNAPyl but excludes other canonical tRNAs with a larger variable arm (62). Based on this recognition mode, MatRNAPyl with an expanded variable loop has been designed to be orthogonal to PylSn–PylSc fusion PylRS while still serving as the substrate for MaPylRS, a ΔPylSn class enzyme (41, 42) (Fig. 3B). In general, recent studies have deepened our understanding of the molecular basis of orthogonality adopted by the PylRS/tRNAPyl pair, which guides the design and creation of (mutually) orthogonal tools for genetic code expansion.
PylRS/tRNAPyl: AN OUTSTANDING GENETIC CODE EXPANSION TOOL
As mentioned above and discussed in other reviews (16, 17), the PylRS/tRNAPyl pair exhibits high orthogonality due to its many unique properties compared to other canonical aaRS/tRNA. For this reason, the PylRS/tRNAPyl pair has become the most popular workhorse of synthetic biologists for genetic code expansion. Over the past 5 years, the versatility and robustness of the PylRS/tRNAPyl pair for genetic encoding ncAAs have been further demonstrated in more organisms such as Synechococcus elongatus (66), Xenopus oocyte (67), Lactococcus lactis (68), Pseudomonas aeruginosa (69), and Bombyx mori (70). Also, PylRS/tRNAPyl pair derivatives have also been heavily employed in the cell-free system (40, 71–73). In this section, we focus on recent progress in the incorporation of diverse ncAAs implemented by PylRS/tRNAPyl-derived pairs, their optimization in the aspects of efficiency and substrate specificity, and the development of the mutually orthogonal tools for simultaneously encoding multiple distinct ncAAs.
Genetic encoding of a wide range of function-oriented ncAAs by PylRS/tRNAPyl pairs.
The high expansibility of the PylRS substrate spectrum has been employed to create PylRS variants for the incorporation of a variety of “designer” ncAAs with novel chemical and physical properties. These recent efforts have enabled many exciting applications, including cross-linking, regulation of protein function, posttranslational modifications (PTMs), probing, and protein imaging. Detailed information about the chemical names of these ncAAs, application purposes, and the sequence of PylRS variants using MmPylRS as a reference sequence are summarized in Table 1.
TABLE 1.
Information of recently developed PylRS variants that recognize different ncAA substrates and relevant applications in different heterogenous organismsa
| Source | Host | Mutation site | ncAA | Application |
|---|---|---|---|---|
| MmPylRS | E. coli | A302T/V346A/W348A/L401V/W417T (75) | (S)-2-amino-3-(4-(3-bromopropoxy)-3-ethynylphenyl) propanoic acid (EB3) | Crosslinking |
| E. coli | L301M/Y306A/L309A/C348F (128) | Nε-heptanoyl-l-lysine (HepoK) | Crosslinking | |
| E. coli and HEK293 cells | A302I/N346T/C348I/Y384L/W471K (76) | Fluorosulfate-l-tyrosine (FSY) | Crosslinking | |
| E. coli and mammalian cells | Y306L/L309A/N346A/C348M/W417T (129) | o-Sulfonyl fluoride-O-methyltyrosine (SFY) | Crosslinking | |
| E. coli | C348W/W417S (125) | S-allyl cysteine (Sac) | Crosslinking | |
| E. coli | Y306A/Y384F (80) | o-Nitrobenzyl alcohol lysine (o-NBAK) | Photo-crosslinking | |
| E. coli | A302T/N346A/C348G/Y384F/W417T (130) | Dibenzo[b,f][1,4,5]thiadiazepine-based alanine (DBTDA) | Photo-crosslinking | |
| HEK293 cells, HeLa cells, CHO cells | R91K/G131E/Y306A/Y384F (131) | Nε-(para-trifluoromethyl-diazirinyl-benzyloxycarbonyl)-l-lysine (pTmdZLys) | Photo-crosslinking | |
| E. coli and HEK293 cells | Y306V/L309A/C348F/Y384F (79) | N-methyl pyrroletetrazole-lysine (mPyTK) | Photo-crosslinking | |
| E. coli and HEK293 cells | L309G/N346A/C348I/V401K/W417I (88) | Thyronine (Thy) | Protein labeling | |
| E. coli with deletion of release factor 1 | Y306A/Y384F (132) | Trans-cyclooct-2-ene-lysine (TCO*-AK) | Protein labeling (fluorescence) | |
| E. coli and HeLa cells | N346Q/C348S/V401G/W417T/G419G (95) | l-β-(Quinolin-6-yl) alanine (Qui) | Protein labeling | |
| E. coli | R61K/M300L/A302G/L309F/C348A/E444G (133) | Nε-allyloxycarbonyl-l-lysine (AlockOH) | Protein labeling (self-cleavage) | |
| E. coli | F271L/F313C, F271L/F313M, and F271N/F313I (84) | Nε-thioacetyl-l-lysine (TAcK) | PTM (acetylation) | |
| E. coli | Y306L/C348I/Y384F (86) | Azidonorleucine (AznL) | PTM (acetylation) | |
| E. coli | A302S/L309M/I322L/N346A/W348G/W417T (87) | Phosphortyrosine (pTyr) | PTM (phosphorylation) | |
| E. coli and HEK293 cells | N346D/C348S/Y384F (134) | o-Chlorophenylalanine (o-ClF) | Substrate expansion | |
| E. coli | N346S/C348M/V401G (57) | 3-Benzothienyl-l-alanine (Bta) and 3-(1-naphthyl)-l-alanine (1-NaA) | Substrate expansion | |
| E. coli | N346G/C348M/W417L (57) | 3-Chloro-l-tyrosine (3-ClY) and 3-bromo-l-tyrosine (3-BrY) | Substrate expansion | |
| MbPylRS | E. coli and HEK293 cells | L274[309]A/C313[348]F/Y349[384]F (78) | o-4-Fluorophenyl lysine (FPheK) | Crosslinking |
| HEK293 cells | Y271[306]A/L274[309]A/C313[348]A (74) | (S)-2-amino-6-(6-bromohexanamido) hexanoic acid n lysine (BrCnK) | Crosslinking | |
| E. coli | Y271[306]C/N311[346]Q/Y349[384]F/V366[401]C (135) | 2,3-Diaminopropionic acid (DAP) | Photo-crosslinking | |
| E. coli | C323[358]W/W382[417]T (136) | Se-allyl selenocysteine (Asec) | Caged amino acid | |
| E. coli and HEK293 cells | N311[346]Q/C313[348]S (137) | N6-((2-azidoacetyl) glycyl)-l-lysine (AzGGK) | Caged amino acid | |
| E. coli and HeLa cells | L270[305]F/L274[309]M/N311[346]A/C313[348]G (81) | (2 R)-2-amino-3-fluoro-3-(4-((2-nitrobenzyl) oxy) phenyl) propanoic acid (FnbY) | Photocage | |
| HEK293 cells | L270[305]G/N311[346]G/C313[348]A (138) | 3-(6-Alkyl-s-tetrazin-3-yl) phenylalanine (Tet-v3.0) | Protein labeling | |
| E. coli and HEK293 cells | N311[346]S/C313[348]G/V366[401]A/W382[417]T (93) | Acridonylalanine (Acd) | Protein labeling (fluorescence) | |
| E. coli | A267[302]Q/N311[346]S/C313[348]W (139) | 4-Thiazoylalanine (4ThzA) | Protein labeling | |
| E. coli | N311[346]Q/Y349[384]F (139) | 3-(3-Thienyl) alanine (3ThiA) | Protein labeling | |
| E. coli and HEK293 cells | L266[301]M/L270[305]I/L274[309]A/C313[348]F (89) | Nε-trifluoroacetyl-l-lysine (TFAcK) | Protein labeling (NMR) | |
| E. coli and HEK293 cells | Y271[306]M/C313(348)T (85) | Nε-l-lactyl-l-lysine (LacK) | PTM (lactylation) | |
| E. coli and HEK293 cells | C313[348)T/Y349[384]F (85) | Nε-β-hydroxybutyryl-l-lysine (BhbK) | PTM (butyrylation) | |
| E. coli and HEK293 cells | Y271[306]A (85) | Nε-lipoyl-l-lysine (LipoK) | PTM (lipoylation) | |
| E. coli | T13[48]I/I36[71]V/C313[348]W/W382[417]S (140) | S-allyl-l-cysteine (Sac) | Substrate expansion | |
| MaPylRS | E. coli | Y126[306]G/M129[309]A/V168[348]F/H227[405]T/Y228[406]P/L229[407]I (77) | Fluorosulfonyloxybenzoyl-l-lysine (FSK) | Crosslinking |
| E. coli and mammalian cells | Y126[306]L/M129[309]A/N166[346]A/V168[348]M/W239[417]T (129) | o-Sulfonyl fluoride-O-methyltyrosine (SFY) | Crosslinking | |
| E. coli | L125[305]F/N166[346]A/V168[348]G (82) | o-2-Nitrobenzyl-β-fluorotyrosine (FnbY), o-2-nitrobenzyl-3-fluoromethyl tyrosine (FmnbY) | Photocage | |
| E. coli-based cell-free system | Y126[306]A (40) | Nε-((((E)-cyclooct-2-en-1-yl) oxy) carbonyl)-l-lysine (TCO*K) | Protein labeling (fluorescence) | |
| E. coli-based cell-free system | V168[348]G/A223[401]C/Y206[384]F (92) | N6-(((trimethylsilyl) methoxy) carbonyl)-l-lysine (TMSK) | Protein labeling (NMR) | |
| E. coli HEK293 cells | Y126[306]T/M129[309]R/V168[348]H/H227[405]I/Y228[406]P and Y126[306]T/M129[309]R/V168[348]H/Y206[384]W (47) | Nε-benzoyllysine (BzK) | PTM (benzoylation) | |
| G1PylRS (PylRS from methanogenic archaeon ISO4-G1) | E. coli-based cell-free system | V167[348]G/A221[401]C/Y204[384]F (92) | N6-(((trimethylsilyl) methoxy) carbonyl)-l-lysine (TMSK) | Protein labeling (NMR) |
| E. coli | L124[305]G/Y125[306]F/N165[346]G/V167[348]F/Y204[384]W/A221[401]G/W237[417]Y (91) | 7-Fluoro-l-tryptophan (7FTrp) | Protein labeling (NMR) | |
| DhPylRS (PylRS from Desulfitobacterium hafniense) | E. coli | N176[367]A/T178[368]G (104) | Para-azido-l-Phenylalanine (AzF) | Crosslinking |
| ChPylRS (chimeric PylRS consisting of PylSn MbPylRS and MmPylRS PylSc) | E. coli and Bacillus stearothermophilus | C348G/V401C/Y384F, and V401K/Y384F (92) | N6-(((trimethylsilyl) methoxy) carbonyl)-l-lysine (TMSK) | Protein labeling (NMR) |
The mutation site corresponding to the residue of MmPylRS is listed in square brackets.
Genetic encoding of ncAAs bearing the cross-linking side chains to selectively form the covalent bond is a valuable approach to identifying weak and transient protein interactions. For instance, using a PylRS variant with the ability to install lysine derivatives that contain structurally flexible bromoalkyl moieties into proteins, a method was developed to capture the stabilized protein complex in a proximity-dependent way (74). A similar method called genetically encode chemical cross-linkers (GECX) was also invented, which is based on covalent bond formation between the alkyl bromide group of the ncAA in the target protein and the cysteine residue in partner proteins (75). Notably, GECX facilitates the subsequent mass spectrometric analysis of partner proteins through biotin enrichment with the help of an additional bio-orthogonally reactive group in the ncAA (75). In addition, by the genetic encoding of fluorosulfate-containing ncAAs with latent bioreactivity to cross-link with diverse natural residues, a novel method called sulfur-fluoride exchange (SuFEx) was recently developed (76, 77). Evolved PylRS variants also allowed the site-specific incorporation of ncAAs with reactive aryl carbamate groups into proteins, which could also be utilized for cross-linking purified proteins under the basic environment (78). To map protein interactions with spatiotemporal resolution in living cells, several lysine-derived photo-cross-linkers were designed and installed into proteins of interest by genetic code expansion using developed PylRS enzymes (79, 80).
To precisely and optically control protein function with residue selectivity, photocaged ncAAs bearing photocleavable protecting groups have been site-specifically incorporated into target proteins. As for the regulation of cross-linking process, an ncAA bearing the photocaged quinone methide was successfully inserted into proteins using an engineered MbPylRS enzyme; the quinone methide could be liberated to cross-link with nearby nucleophilic residues upon photoactivation (81). Genetic code expansion of the photocaged quinone methide could also be employed to endow proteins with diverse functionalities via facile protein labeling (82). Additionally, by genetically encoding a photocaged ncAA that could trap diverse acyl-enzyme complexes upon activation, a powerful method was recently developed to identify unknown substrates of various hydrolases (83).
The capability to site-specifically incorporate ncAAs with designed PTMs into proteins provides a powerful way to elucidate the role of these modifications in protein function. By repurposing the amino acid substrate specificity, many PylRS/tRNAPyl-derived pairs were recently developed to genetically encode PTMs on the lysine residue, including a nonhydrolyzable analog of lysine acetylation (84), lipoylation, and newly discovered benzoylation, lactylation, and β-hydroxybutyrylation (47, 85). In addition, site-specific insertion of lysine succinylation into proteins was achieved by combing the PylRS-based genetic code expansion with traceless Staudinger ligation (86). Except for PTMs on the lysine residue, phosphotyrosine could be directly installed into different proteins by the engineered MmPylRS variant in two steps: the phosphotyrosine analog was incorporated in response to the amber stop codon, followed by deprotection to generate the native phosphotyrosine (87).
Genetic code expansion using the PylRS/tRNAPyl pair derivatives enables many applications of ncAA-based probes. For example, genetic encoding of a novel ncAA called thyronine could be used for fluorescent detection of peroxynitrite in vivo (88). Various ncAAs with the fluorine atom have been site-specifically installed into proteins by engineered PylRS variants and serve as novel nuclear magnetic resonance (NMR) probes to study conformational changes of these proteins with essential physiological functions (89–91). In addition, a PylRS variant was recently evolved to genetically encode a trimethylsilyl analog on lysine residue as a probe for 1H NMR spectra, facilitating the investigation of protein structure and function (92).
Compared with fluorescent fusion proteins or self-labeling tags, site-specific incorporation of ncAAs for protein imaging might have some advantages, such as minimal perturbation of protein function and an expanded range of fluorophores (7). A fluorescent ncAA with high photostability and a long lifetime was recently reported to be directedly inserted into proteins by engineered MbPylRS for live cell imaging (93). A PylRS variant was identified to produce proteins with ncAA composed of styrene, an unstrained alkene; the ncAA residue could be subsequently labeled with novel fluorophore via the bio-orthogonal reaction (94). Interestingly, a pH-dependent fluorescent protein was developed using a PylRS derivative-assisted genetic code expansion; this protein exhibits enhanced fluorescence at the acid pH condition through the switch of chromophore charge enabled by a novel quinoline-containing ncAA (95).
Improving the ncAA incorporation efficiency by PylRS/tRNAPyl pairs.
Although PylRS/tRNAPyl pair derivatives enable widespread applications of genetic code expansion in living organisms, in vitro aminoacylation experiments reveal PylRS enzymes exhibiting significantly lower turnover rates than other canonical aaRSs (18). Also, compared with the native substrate Pyl, many ncAAs are poorly recognized by the wild-type and engineered PylRS enzymes (1 to 2 orders of magnitude lower affinity) (28, 62). Thus, improving the catalytic activity of PylRS enzymes is an important field in the optimization of genetic code expansion tools.
As many key residues in the amino acid binding pocket of PylSc have been determined based on crystal structures, PylRS variants with improved substrate specificity (listed in Table 1) could be obtained by successive rounds of selections from the saturation mutagenesis library corresponding to these residues (96). The strategy of positive selection to select active PylRS variants often relies on cell survival based on antibiotic tolerance, fluorescence intensity measurement, and colorimetric assay (97, 98). Considering the advantages of throughput, sensitivity, and quantifiability, the fluorescence-activated cell sorting (FACS) technology was utilized to discover designed PylRS derivatives with enhanced incorporation efficiency and selectivity toward ncAAs (99, 100). To speed up the process of laboratory evolution, which typically requires a month's effort via successive rounds of positive and negative selections, advanced methods, such as phage-assisted continuous evolution (PACE) (101), phage- and robotics-assisted near-continuous evolution (PRANCE), a high-throughput version of PACE (102), and phage-assisted noncontinuous evolution (PANCE) that does not require specialized equipment (62), were developed for directed evolution of PylRS. The strategy of positive selection employed in the phage-assisted evolution is to link the catalytic activity of PylRS to the production of pIII protein, an essential protein for phage infectivity, via amber suppression (101). One advantage of these new methods is that no prior knowledge about the enzyme structure is needed for directed evolution, which is similar to the error-prone PCR-based laboratory evolution. Using these advanced methods, superior PylRS variants bearing various beneficial mutations were evolved which exhibit improved kcat and affinity for both tRNAPyl and ncAA substrates (62, 101). Employment of these evolved PylRS enzymes achieved a much improved yield of ncAA-containing proteins (62, 101, 103). As the PylSn and PylSc domains of different PylRS enzymes share high sequence similarity, transplantation of the beneficial mutations from evolved PylRS variants to other PylRS derivatives serve as a simplified way to promote the ncAA incorporation efficiency (39, 97, 98, 101, 104). Interestingly, manipulating the length of the linker connecting PylSn and PylSc was also found to be effective to increase the ncAA incorporation efficiency (105), highlighting the feasibility of rational design strategy to optimize the catalytic efficiency of PylRS.
As the protein stability affects its robustness to be engineered (106, 107), enhancing this property of the PylRS enzyme could be employed to promote its activity for ncAA incorporation. For instance, the PylRS enzymes from thermophilic archaea exhibit enhanced thermostability and are more tolerant to mutations in the catalytic pocket than MbPylRS (108); these thermophilic PylRS derivatives outperform the mesophilic counterpart for installing diverse ncAAs into proteins (108). Improving the stability of MmPylRS by site-directed mutations that prevent the enzyme from hydrolysis in E. coli was found to increase the amber suppression activity (109). In addition, enhancing the solubility of PylRS enzyme is also an effective strategy to improve the efficiency of genetic code expansion. For example, by fusing a soluble tag called small metal binding protein (SmbP) to MmPylRS enzyme derivatives, drastic improvement of the enzyme solubility and increased production of ncAA-containing proteins were achieved (110). To facilitate the directed evolution of PylRS enzyme by PACE, a chimeric PylRS with enhanced solubility was utilized (101). Also, employment of MaPylRS exhibits 5-fold-higher solubility than MmPylRS and could enable efficient ncAA incorporation into an antibody fragment using the cell-free system (40).
The introduced tRNAPyl, a noncanonical tRNA with many characteristic features, is not optimal for amber suppression in heterologous cells, probably owing to the lack of evolutionary tuning (111, 112). A recent study found that MmPylRS displays 2-fold-higher activity of amber suppression in vivo for MatRNAPyl than MmtRNAPyl. However, the aminoacylation efficiency for MatRNAPyl is worse than that of MmtRNAPyl (28). This finding suggests that factors other than aminoacylation efficiency would affect the overall efficiency of ncAA incorporation in heterogenous organisms. An improved tRNAPyl variant for ncAA insertion was developed in E. coli by directed evolution of the acceptor and T stem, a key region in tRNA involved in the interaction with elongation factor Tu (113). By rational design, two novel classes of tRNAPyl variants were recently developed in mammalian cells which exhibited enhanced intracellular concentration, intracellular stability, and ncAA incorporation efficiency (114). As posttranscriptional modification is widespread in all domains of life and modified nucleosides on MbtRNAPyl were found (13, 114), manipulation of the posttranscriptional modification in tRNAPyl might open up new directions for its further optimization in heterologous organisms.
Development of mutually orthogonal PylRS/tRNAPyl-based pairs.
Simultaneous incorporation of distinct ncAAs into a single protein requires multiple mutually orthogonal aaRS/tRNA pairs. Due to the excellent orthogonality of the PylRS/tRNAPyl pair to canonical aaRS/tRNA pairs, the joint use of PylRS/tRNAPyl-based pairs and other developed aaRS/tRNA pair derivatives (such as the tyrosyl pair from archaea) allows encoding of two to three distinct ncAAs in one gene (46, 115–117). Increased understanding of key features in PylRS that underly the molecular basis of orthogonality also contributes to the creation of mutually orthogonal PylRS/tRNAPyl derivatives. The N-terminal domain of the PylSn–PylSc fusion enzyme such as MmPylRS, which is absent in the ΔPylSn enzyme (e.g., MaPylRS), interacts with the T arm and variable loop of its cognate tRNAPyl. Thus, it is possible to engineer these regions in MatRNAPyl to interfere with its recognition by the PylSn domain of the PylSn–PylSc fusion class enzyme; the engineered tRNAPyl variants still retain the interaction with the cognate ΔPylSn enzyme (Fig. 3B). By rational modification and directed evolution of the variable loop, MatRNAPyl variants were developed which exhibit orthogonality to the full-length MmPylRS enzyme and remain as the substrate for MaPylRS (41, 42). Combining these mutually orthogonal pairs allowed the genetic encoding of two distinct ncAAs into one protein in both E. coli and mammalian cells (41–43). The unique properties of the engineered MatRNAPyl variant (the acceptor stem and modified variable loop) could be transplanted into another tRNAPyl to create a hybrid tRNA with good orthogonality to the MmPylRS/MmtRNAPyl pair and high activity in mammalian cells (118). Two classes of ΔPylSn enzymes and their cognate tRNAPyls (ΔPylSntRNAPyl) were recently identified (44). Taking advantage of inherent mutual orthogonality between some of these ΔPylSn/ΔPylSntRNAPyl pairs, triply orthogonal PylRS/tRNAPyl pairs were further developed by evolving the variable loop of class A ΔPylSntRNAPyl and both the acceptor stem and variable loop of the class B ΔPylSntRNAPyl (44) (Fig. 3B). A novel mechanism underlying the natural orthogonality of two novel PylRS/tRNAPyl pairs was recently disclosed which provides a compelling example of a single base change at the discriminator position, leading to additional coding capacity (45). The unique features accounting for mutual orthogonality consist of the A73 discriminator base of HMET1 tRNAPyl2 (almost all tRNAPyl contains the canonical G73 base) and a shorter motif 2 loop in HMET1 PylRS2 (45) (Fig. 3A). Overall, recent works highlight the feasibility of creating mutually orthogonal PylRS/tRNAPyl pair derivatives based on distinct tRNA recognition modes.
Genetic encoding of different ncAAs in a single polypeptide also requires additional “blank” codons other than the amber codon, which could be the liberated stop and sense codons created via whole-genome synthesis and new codons with synthetic nucleotides or four bases (119). In contrast to other developed aaRS/tRNA pairs that are mainly restricted to decoding the commonly used amber codon, the PylRS/tRNAPyl pair is recognized as a versatile tool to decode different codons by engineering. For instance, the anticodon of tRNAPyl could be modified accordingly to decode the UAA stop codon (46, 117), different sense codons (TCG and TCA) (120), the unnatural codons based on the dNaM/dTPT3 pair (63), and various four-base codons (48). These (mutually) orthogonal PylRS/tRNAPyl derivatives with the capability to decode a variety of “blank” codons allowed cotranslational installation of multiple distinct (up to four) ncAAs (46, 48, 117, 120, 121).
CONCLUDING REMARKS AND OUTLOOK
Over the past 2 decades, employment and engineering of PylRS/tRNAPyl pairs to site-specifically insert a variety of useful ncAAs into proteins have attracted increasing attention. The optimization process of PylRS for ncAA incorporation is accelerated by recent progress in understanding the biochemical and structural properties of PylRS enzymes and the development of advanced methods for the directed evolution of proteins (62, 101, 102). The recent discovery of a novel class of PylRS enzymes, especially the ΔPylSn class enzymes, further enriches the toolbox for genetic code expansion, laying the foundation for creating active and (mutually) orthogonal PylRS/tRNAPyl-derived pairs.
Continued efforts on developing more PylRS/tRNAPyl-based aaRS/tRNA pairs for genetic encoding of diverse ncAA substrates and improving their catalytic power are worth adhering to. As the orthogonality of engineered PylRS/tRNAPyl pairs has been confirmed in a variety of living organisms, transplantation of key orthogonal components from PylRS/tRNAPyl pairs to other aaRS/tRNA pairs would serve as a promising strategy to generate additional orthogonal translation systems, as demonstrated in a previous study (122). Recent works also revealed different tRNA recognition modes adopted by natural and engineered PylRS/tRNAPyl pairs to maintain mutual orthogonality (41, 42, 44); it might be possible to rationally design more (mutually) orthogonal PylRS/tRNAPyl pairs by combining distinct identity elements. To further increase the efficiency of genetic code expansion, efforts need to focus on improving the solubility, especially for pylSn–pylSc fusion enzymes, aminoacylation activity, and compatibility of tRNAPyl with the ribosome and other translational components in heterogeneous cells, as suggested in recent works (28, 40). To bypass the potential problem of ncAA uptake and the requirement for a supply of exogenous sources of ncAAs, coupling metabolic engineering of ncAA biosynthesis and genetic code expansion is a promising strategy to evolve PylRS/tRNAPyl-derived pairs for enhanced activity (123–126). As the complete Pyl biosynthesis pathway is known and functional in heterologous cells (26, 127), utilization of autonomous cells with the capability to synthesize Pyl, the authentic substrate for PylRS, may facilitate to improve the amber suppression activity of engineered PylRS/tRNAPyl pairs in future directed evolution experiments. Once the superior PylRS/tRNAPyl pairs are developed, the molecular architecture of amino acid binding pockets, which is highly conserved in all PylRS enzymes (28), could be rapidly redesigned by transplanting key residues from other developed PylRS variants with different ncAA substrate specificity (52). Overall, we envision an increased mechanistic understanding of how PylRS/tRNAPyl-derived pairs work in heterogenous organisms, and persistent optimization efforts would further unleash their potential for applications of genetic code expansion.
ACKNOWLEDGMENTS
This work was supported by grants from the National Key Research and Development Program of China (no. 2018YFA0900100 to X.F.), the National Natural Science Foundation of China (no. 31901029 to X.F.), the Natural Science Foundation of Guangdong Province, China (no. 2021A1515010995 to X.F.), and the Guangdong Provincial Key Laboratory of Genome Read and Write (no. 2017B030301011). Due to space limitations, we apologize to others whose works were not cited.
Biographies

Xuemei Gong is a Ph.D. student at University of Chinese Academy of Sciences, Beijing. Her research is focused on expanding the genetic code of cells for simultaneous incorporation of distinct noncanonical amino acids (ncAAs). She is interested in developing novel aminoacyl-tRNA synthetases/tRNA pairs, especial the pyrrolysyl-tRNA synthetase/tRNAPyl, as tools to install the distinct ncAAs with novel functions.

Haolin Zhang is a Research Associate of Synthetic Biology in BGI-Research. His interests are focused on the application of genetic code expansion (GCE) in the characterization of biological pathways and industrial applications. His research focuses on improving the incorporation efficiency of noncanonical amino acids in yeasts and archaea to explore GCE application in these organisms.

Yue Shen currently serves as the Chief Scientist of Synthetic Biology in BGI-Research, Deputy Director of the Guangdong Provincial Key Laboratory of Genome Read and Write, PI of Shenzhen Synthetic Biology Innovation Institute, and Visiting Professor of Center for Synthetic Genomics at SIAT, CAS. She earned her Ph.D. in Molecular Biology from the University of Edinburgh. She is one of the key members in the synthetic yeast consortium (Sc2.0). Her research focuses on the development of DNA synthesis technologies and instruments, synthetic genomics and its downstream applications, and DNA-based data storage. She has published over 40 papers in prestigious journals, including Science, Genome Research, Nature Communications, and Nature Computational Science, and was selected as one of the Top Ten Scientific Advances of China in 2017.

Xian Fu is a principal investigator at the synthetic biology platform of BGI-Research at Shenzhen. He earned his Ph.D. from University of Florida in 2016, mentored by Dr. Julie Maupin-Furlow. His Ph.D. thesis uncovered key components involved in targeted proteolysis and ubiquitin-like protein modification in archaea. He finished the postdoctoral training in Dr. Dieter Söll’s lab at Yale University to engineer aminoacyl-tRNA synthetase and tRNA pairs for genetic code expansion (GCE) and selenoprotein biosynthesis. By combining synthetic genomics, genome mining, and advanced directed evolution methods, his main research interests focus on developing an efficient GCE system in yeasts and archaea to genetically encode a variety of “designer” noncanonical amino acids (ncAAs) for investigation of fundamental biology and industrial applications. So far, he has published 24 peer-reviewed articles. He also serves on the editorial board of Frontiers in Genetics.
Contributor Information
Xian Fu, Email: fuxian1@genomics.cn.
Julie A. Maupin-Furlow, University of Florida
REFERENCES
- 1.Ling J, O'Donoghue P, Soll D. 2015. Genetic code flexibility in microorganisms: novel mechanisms and impact on physiology. Nat Rev Microbiol 13:707–721. 10.1038/nrmicro3568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Gaston MA, Jiang R, Krzycki JA. 2011. Functional context, biosynthesis, and genetic encoding of pyrrolysine. Curr Opin Microbiol 14:342–349. 10.1016/j.mib.2011.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Hao B, Gong W, Ferguson TK, James CM, Krzycki JA, Chan MK. 2002. A new UAG-encoded residue in the structure of a methanogen methyltransferase. Science 296:1462–1466. 10.1126/science.1069556. [DOI] [PubMed] [Google Scholar]
- 4.Chin JW. 2014. Expanding and reprogramming the genetic code of cells and animals. Annu Rev Biochem 83:379–408. 10.1146/annurev-biochem-060713-035737. [DOI] [PubMed] [Google Scholar]
- 5.Wang L, Brock A, Herberich B, Schultz PG. 2001. Expanding the genetic code of Escherichia coli. Science 292:498–500. 10.1126/science.1060077. [DOI] [PubMed] [Google Scholar]
- 6.Wang L, Xie J, Schultz PG. 2006. Expanding the genetic code. Annu Rev Biophys Biomol Struct 35:225–249. 10.1146/annurev.biophys.35.101105.121507. [DOI] [PubMed] [Google Scholar]
- 7.Chin JW. 2017. Expanding and reprogramming the genetic code. Nature 550:53–60. 10.1038/nature24031. [DOI] [PubMed] [Google Scholar]
- 8.Courtney T, Deiters A. 2018. Recent advances in the optical control of protein function through genetic code expansion. Curr Opin Chem Biol 46:99–107. 10.1016/j.cbpa.2018.07.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Kelemen RE, Erickson SB, Chatterjee A. 2018. Synthesis at the interface of virology and genetic code expansion. Curr Opin Chem Biol 46:164–171. 10.1016/j.cbpa.2018.07.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Qin X, Liu T. 2022. Recent advances in genetic code expansion techniques for protein phosphorylation studies. J Mol Biol 434:167406. 10.1016/j.jmb.2021.167406. [DOI] [PubMed] [Google Scholar]
- 11.Tang H, Zhang P, Luo X. 2022. Recent technologies for genetic code expansion and their implications on synthetic biology applications. J Mol Biol 434:167382. 10.1016/j.jmb.2021.167382. [DOI] [PubMed] [Google Scholar]
- 12.Blight SK, Larue RC, Mahapatra A, Longstaff DG, Chang E, Zhao G, Kang PT, Green-Church KB, Chan MK, Krzycki JA. 2004. Direct charging of tRNACUA with pyrrolysine in vitro and in vivo. Nature 431:333–335. 10.1038/nature02895. [DOI] [PubMed] [Google Scholar]
- 13.Polycarpo C, Ambrogelly A, Berube A, Winbush SM, McCloskey JA, Crain PF, Wood JL, Soll D. 2004. An aminoacyl-tRNA synthetase that specifically activates pyrrolysine. Proc Natl Acad Sci USA 101:12450–12454. 10.1073/pnas.0405362101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ambrogelly A, Gundllapalli S, Herring S, Polycarpo C, Frauer C, Soll D. 2007. Pyrrolysine is not hardwired for cotranslational insertion at UAG codons. Proc Natl Acad Sci USA 104:3141–3146. 10.1073/pnas.0611634104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Polycarpo CR, Herring S, Berube A, Wood JL, Soll D, Ambrogelly A. 2006. Pyrrolysine analogues as substrates for pyrrolysyl-tRNA synthetase. FEBS Lett 580:6695–6700. 10.1016/j.febslet.2006.11.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Wan W, Tharp JM, Liu WR. 2014. Pyrrolysyl-tRNA synthetase: an ordinary enzyme but an outstanding genetic code expansion tool. Biochim Biophys Acta 1844:1059–1070. 10.1016/j.bbapap.2014.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Tharp JM, Ehnbom A, Liu WR. 2018. tRNAPyl: structure, function, and applications. RNA Biol 15:441–452. 10.1080/15476286.2017.1356561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Crnkovic A, Suzuki T, Soll D, Reynolds NM. 2016. Pyrrolysyl-tRNA synthetase, an aminoacyl-tRNA synthetase for genetic code expansion. Croat Chem Acta 89:163–174. 10.5562/cca2825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yanagisawa T, Umehara T, Sakamoto K, Yokoyama S. 2014. Expanded genetic code technologies for incorporating modified lysine at multiple sites. Chembiochem 15:2181–2187. 10.1002/cbic.201402266. [DOI] [PubMed] [Google Scholar]
- 20.Burke SA, Lo SL, Krzycki JA. 1998. Clustered genes encoding the methyltransferases of methanogenesis from monomethylamine. J Bacteriol 180:3432–3440. 10.1128/JB.180.13.3432-3440.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Paul L, Ferguson DJ, Jr, Krzycki JA. 2000. The trimethylamine methyltransferase gene and multiple dimethylamine methyltransferase genes of Methanosarcina barkeri contain in-frame and read-through amber codons. J Bacteriol 182:2520–2529. 10.1128/JB.182.9.2520-2529.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hao B, Zhao G, Kang PT, Soares JA, Ferguson TK, Gallucci J, Krzycki JA, Chan MK. 2004. Reactivity and chemical synthesis of L-pyrrolysine-the 22(nd) genetically encoded amino acid. Chem Biol 11:1317–1324. 10.1016/j.chembiol.2004.07.011. [DOI] [PubMed] [Google Scholar]
- 23.Srinivasan G, James CM, Krzycki JA. 2002. Pyrrolysine encoded by UAG in Archaea: charging of a UAG-decoding specialized tRNA. Science 296:1459–1462. 10.1126/science.1069588. [DOI] [PubMed] [Google Scholar]
- 24.Meng K, Chung CZ, Soll D, Krahn N. 2022. Unconventional genetic code systems in archaea. Front Microbiol 13:1007832. 10.3389/fmicb.2022.1007832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fu X, Soll D, Sevostyanova A. 2018. Challenges of site-specific selenocysteine incorporation into proteins by Escherichia coli. RNA Biol 15:461–470. 10.1080/15476286.2018.1440876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Gaston MA, Zhang L, Green-Church KB, Krzycki JA. 2011. The complete biosynthesis of the genetically encoded amino acid pyrrolysine from lysine. Nature 471:647–650. 10.1038/nature09918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Longstaff DG, Larue RC, Faust JE, Mahapatra A, Zhang L, Green-Church KB, Krzycki JA. 2007. A natural genetic code expansion cassette enables transmissible biosynthesis and genetic encoding of pyrrolysine. Proc Natl Acad Sci USA 104:1021–1026. 10.1073/pnas.0610294104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Guo LT, Amikura K, Jiang HK, Mukai T, Fu X, Wang YS, O'Donoghue P, Soll D, Tharp JM. 2022. Ancestral archaea expanded the genetic code with pyrrolysine. J Biol Chem 298:102521. 10.1016/j.jbc.2022.102521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Mahapatra A, Patel A, Soares JA, Larue RC, Zhang JK, Metcalf WW, Krzycki JA. 2006. Characterization of a Methanosarcina acetivorans mutant unable to translate UAG as pyrrolysine. Mol Microbiol 59:56–66. 10.1111/j.1365-2958.2005.04927.x. [DOI] [PubMed] [Google Scholar]
- 30.Krzycki JA. 2004. Function of genetically encoded pyrrolysine in corrinoid-dependent methylamine methyltransferases. Curr Opin Chem Biol 8:484–491. 10.1016/j.cbpa.2004.08.012. [DOI] [PubMed] [Google Scholar]
- 31.Heinemann IU, O'Donoghue P, Madinger C, Benner J, Randau L, Noren CJ, Soll D. 2009. The appearance of pyrrolysine in tRNAHis guanylyltransferase by neutral evolution. Proc Natl Acad Sci USA 106:21103–21108. 10.1073/pnas.0912072106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.O'Donoghue P, Prat L, Kucklick M, Schafer JG, Riedel K, Rinehart J, Soll D, Heinemann IU. 2014. Reducing the genetic code induces massive rearrangement of the proteome. Proc Natl Acad Sci USA 111:17206–17211. 10.1073/pnas.1420193111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Eriani G, Delarue M, Poch O, Gangloff J, Moras D. 1990. Partition of tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs. Nature 347:203–206. 10.1038/347203a0. [DOI] [PubMed] [Google Scholar]
- 34.Kavran JM, Gundllapalli S, O'Donoghue P, Englert M, Soll D, Steitz TA. 2007. Structure of pyrrolysyl-tRNA synthetase, an archaeal enzyme for genetic code innovation. Proc Natl Acad Sci USA 104:11268–11273. 10.1073/pnas.0704769104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Jiang R, Krzycki JA. 2012. PylSn and the homologous N-terminal domain of pyrrolysyl-tRNA synthetase bind the tRNA that is essential for the genetic encoding of pyrrolysine. J Biol Chem 287:32738–32746. 10.1074/jbc.M112.396754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Borrel G, Gaci N, Peyret P, O'Toole PW, Gribaldo S, Brugere JF. 2014. Unique characteristics of the pyrrolysine system in the 7th order of methanogens: implications for the evolution of a genetic code expansion cassette. Archaea 2014:374146. 10.1155/2014/374146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Mukai T, Crnkovic A, Umehara T, Ivanova NN, Kyrpides NC, Soll D. 2017. RNA-dependent cysteine biosynthesis in bacteria and archaea. mBio 8:e00561-17. 10.1128/mBio.00561-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Sun J, Evans PN, Gagen EJ, Woodcroft BJ, Hedlund BP, Woyke T, Hugenholtz P, Rinke C. 2021. Recoding of stop codons expands the metabolic potential of two novel Asgardarchaeota lineages. ISME Communications 1:30. 10.1038/s43705-021-00032-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Yamaguchi A, Iraha F, Ohtake K, Sakamoto K. 2018. Pyrrolysyl-tRNA synthetase with a unique architecture enhances the availability of lysine derivatives in synthetic genetic codes. Molecules 23:2460. 10.3390/molecules23102460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Seki E, Yanagisawa T, Kuratani M, Sakamoto K, Yokoyama S. 2020. Fully productive cell-free genetic code expansion by structure-based engineering of Methanomethylophilus alvus pyrrolysyl-tRNA synthetase. ACS Synth Biol 9:718–732. 10.1021/acssynbio.9b00288. [DOI] [PubMed] [Google Scholar]
- 41.Willis JCW, Chin JW. 2018. Mutually orthogonal pyrrolysyl-tRNA synthetase/tRNA pairs. Nat Chem 10:831–837. 10.1038/s41557-018-0052-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Meineke B, Heimgartner J, Lafranchi L, Elsasser SJ. 2018. Methanomethylophilus alvus Mx1201 provides basis for mutual orthogonal pyrrolysyl tRNA/aminoacyl-tRNA synthetase pairs in mammalian cells. ACS Chem Biol 13:3087–3096. 10.1021/acschembio.8b00571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Beranek V, Willis JCW, Chin JW. 2019. An evolved Methanomethylophilus alvus pyrrolysyl-tRNA synthetase/tRNA pair is highly active and orthogonal in mammalian cells. Biochemistry 58:387–390. 10.1021/acs.biochem.8b00808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Dunkelmann DL, Willis JCW, Beattie AT, Chin JW. 2020. Engineered triply orthogonal pyrrolysyl–tRNA synthetase/tRNA pairs enable the genetic encoding of three distinct non-canonical amino acids. Nat Chem 12:535–544. 10.1038/s41557-020-0472-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Zhang H, Gong X, Zhao Q, Mukai T, Vargas-Rodriguez O, Zhang H, Zhang Y, Wassel P, Amikura K, Maupin-Furlow J, Ren Y, Xu X, Wolf YI, Makarova KS, Koonin EV, Shen Y, Soll D, Fu X. 2022. The tRNA discriminator base defines the mutual orthogonality of two distinct pyrrolysyl-tRNA synthetase/tRNAPyl pairs in the same organism. Nucleic Acids Res 50:4601–4615. 10.1093/nar/gkac271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Tharp JM, Vargas-Rodriguez O, Schepartz A, Soll D. 2021. Genetic encoding of three distinct noncanonical amino acids using reprogrammed initiator and nonsense codons. ACS Chem Biol 16:766–774. 10.1021/acschembio.1c00120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Cao L, Liu J, Ghelichkhani F, Rozovsky S, Wang L. 2021. Genetic Incorporation of ϵ-N-benzoyllysine by engineering Methanomethylophilus alvus pyrrolysyl-tRNA synthetase. Chembiochem 22:2530–2534. 10.1002/cbic.202100218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Dunkelmann DL, Oehm SB, Beattie AT, Chin JW. 2021. A 68-codon genetic code to incorporate four distinct non-canonical amino acids enabled by automated orthogonal mRNA design. Nat Chem 13:1110–1117. 10.1038/s41557-021-00764-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Yanagisawa T, Ishii R, Fukunaga R, Kobayashi T, Sakamoto K, Yokoyama S. 2008. Crystallographic studies on multiple conformational states of active-site loops in pyrrolysyl-tRNA synthetase. J Mol Biol 378:634–652. 10.1016/j.jmb.2008.02.045. [DOI] [PubMed] [Google Scholar]
- 50.Yanagisawa T, Sumida T, Ishii R, Yokoyama S. 2013. A novel crystal form of pyrrolysyl-tRNA synthetase reveals the pre- and post-aminoacyl-tRNA synthesis conformational states of the adenylate and aminoacyl moieties and an asparagine residue in the catalytic site. Acta Crystallogr D Biol Crystallogr 69:5–15. 10.1107/S0907444912039881. [DOI] [PubMed] [Google Scholar]
- 51.Takimoto JK, Dellas N, Noel JP, Wang L. 2011. Stereochemical basis for engineered pyrrolysyl-tRNA synthetase and the efficient in vivo incorporation of structurally divergent non-native amino acids. ACS Chem Biol 6:733–743. 10.1021/cb200057a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Yanagisawa T, Kuratani M, Seki E, Hino N, Sakamoto K, Yokoyama S. 2019. Structural basis for genetic-code expansion with bulky lysine derivatives by an engineered pyrrolysyl-tRNA synthetase. Cell Chem Biol 26:936–949.e13. 10.1016/j.chembiol.2019.03.008. [DOI] [PubMed] [Google Scholar]
- 53.Wang YS, Russell WK, Wang Z, Wan W, Dodd LE, Pai PJ, Russell DH, Liu WR. 2011. The de novo engineering of pyrrolysyl-tRNA synthetase for genetic incorporation of L-phenylalanine and its derivatives. Mol Biosyst 7:714–717. 10.1039/c0mb00217h. [DOI] [PubMed] [Google Scholar]
- 54.Wang YS, Fang X, Wallace AL, Wu B, Liu WR. 2012. A rationally designed pyrrolysyl-tRNA synthetase mutant with a broad substrate spectrum. J Am Chem Soc 134:2950–2953. 10.1021/ja211972x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Wang YS, Fang X, Chen HY, Wu B, Wang ZU, Hilty C, Liu WR. 2013. Genetic incorporation of twelve meta-substituted phenylalanine derivatives using a single pyrrolysyl-tRNA synthetase mutant. ACS Chem Biol 8:405–415. 10.1021/cb300512r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Xiao H, Peters FB, Yang PY, Reed S, Chittuluru JR, Schultz PG. 2014. Genetic incorporation of histidine derivatives using an engineered pyrrolysyl-tRNA synthetase. ACS Chem Biol 9:1092–1096. 10.1021/cb500032c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Tseng HW, Baumann T, Sun H, Wang YS, Ignatova Z, Budisa N. 2020. Expanding the scope of orthogonal translation with pyrrolysyl-tRNA synthetases dedicated to aromatic amino acids. Molecules 25:4418. 10.3390/molecules25194418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Kobayashi T, Yanagisawa T, Sakamoto K, Yokoyama S. 2009. Recognition of non-α-amino substrates by pyrrolysyl-tRNA synthetase. J Mol Biol 385:1352–1360. 10.1016/j.jmb.2008.11.059. [DOI] [PubMed] [Google Scholar]
- 59.Li YM, Yang MY, Huang YC, Li YT, Chen PR, Liu L. 2012. Ligation of expressed protein α-hydrazides via genetic incorporation of an α-hydroxy acid. ACS Chem Biol 7:1015–1022. 10.1021/cb300020s. [DOI] [PubMed] [Google Scholar]
- 60.Giege R, Sissler M, Florentz C. 1998. Universal rules and idiosyncratic features in tRNA identity. Nucleic Acids Res 26:5017–5035. 10.1093/nar/26.22.5017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Nozawa K, O'Donoghue P, Gundllapalli S, Araiso Y, Ishitani R, Umehara T, Soll D, Nureki O. 2009. Pyrrolysyl-tRNA synthetase-tRNAPyl structure reveals the molecular basis of orthogonality. Nature 457:1163–1167. 10.1038/nature07611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Suzuki T, Miller C, Guo LT, Ho JML, Bryson DI, Wang YS, Liu DR, Soll D. 2017. Crystal structures reveal an elusive functional domain of pyrrolysyl-tRNA synthetase. Nat Chem Biol 13:1261–1266. 10.1038/nchembio.2497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Zhang Y, Ptacin JL, Fischer EC, Aerni HR, Caffaro CE, San Jose K, Feldman AW, Turner CR, Romesberg FE. 2017. A semi-synthetic organism that stores and retrieves increased genetic information. Nature 551:644–647. 10.1038/nature24659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Niu W, Schultz PG, Guo J. 2013. An expanded genetic code in mammalian cells with a functional quadruplet codon. ACS Chem Biol 8:1640–1645. 10.1021/cb4001662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Herring S, Ambrogelly A, Polycarpo CR, Soll D. 2007. Recognition of pyrrolysine tRNA by the Desulfitobacterium hafniense pyrrolysyl-tRNA synthetase. Nucleic Acids Res 35:1270–1278. 10.1093/nar/gkl1151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Chemla Y, Friedman M, Heltberg M, Bakhrat A, Nagar E, Schwarz R, Jensen MH, Alfonta L. 2017. Expanding the genetic code of a photoautotrophic organism. Biochemistry 56:2161–2165. 10.1021/acs.biochem.7b00131. [DOI] [PubMed] [Google Scholar]
- 67.Infield DT, Lueck JD, Galpin JD, Galles GD, Ahern CA. 2018. Orthogonality of pyrrolysine tRNA in the Xenopus oocyte. Sci Rep 8:5166. 10.1038/s41598-018-23201-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Bartholomae M, Baumann T, Nickling JH, Peterhoff D, Wagner R, Budisa N, Kuipers OP. 2018. Expanding the genetic code of Lactococcus lactis and Escherichia coli to incorporate non-canonical amino acids for production of modified lantibiotics. Front Microbiol 9:657. 10.3389/fmicb.2018.00657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Zheng H, Lin S, Chen PR. 2020. Genetically encoded protein labeling and crosslinking in living Pseudomonas aeruginosa. Bioorg Med Chem 28:115545. 10.1016/j.bmc.2020.115545. [DOI] [PubMed] [Google Scholar]
- 70.Lu W, Wang R, Wang P, Ma S, Xia Q. 2021. Genetic code expansion system for tight control of gene expression in Bombyx mori cell lines. Insects 12:1081. 10.3390/insects12121081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Chemla Y, Ozer E, Shaferman M, Zaad B, Dandela R, Alfonta L. 2019. Simplified methodology for a modular and genetically expanded protein synthesis in cell-free systems. Synth Syst Biotechnol 4:189–196. 10.1016/j.synbio.2019.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Van Raad D, Huber T. 2021. In vitro protein synthesis in semipermeable artificial cells. ACS Synth Biol 10:1237–1244. 10.1021/acssynbio.1c00044. [DOI] [PubMed] [Google Scholar]
- 73.Seki E, Yanagisawa T, Yokoyama S. 2018. Cell-free protein synthesis for multiple site-specific incorporation of noncanonical amino acids using cell extracts from RF-1 deletion E. coli strains. Methods Mol Biol 1728:49–65. 10.1007/978-1-4939-7574-7_3. [DOI] [PubMed] [Google Scholar]
- 74.Cigler M, Muller TG, Horn-Ghetko D, von Wrisberg MK, Fottner M, Goody RS, Itzen A, Muller MP, Lang K. 2017. Proximity-triggered covalent stabilization of low-affinity protein complexes in vitro and in vivo. Angew Chem Int Ed Engl 56:15737–15741. 10.1002/anie.201706927. [DOI] [PubMed] [Google Scholar]
- 75.Yang B, Tang S, Ma C, Li ST, Shao GC, Dang B, DeGrado WF, Dong MQ, Wang PG, Ding S, Wang L. 2017. Spontaneous and specific chemical cross-linking in live cells to capture and identify protein interactions. Nat Commun 8:2240. 10.1038/s41467-017-02409-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Wang N, Yang B, Fu C, Zhu H, Zheng F, Kobayashi T, Liu J, Li S, Ma C, Wang PG, Wang Q, Wang L. 2018. Genetically encoding fluorosulfate-l-tyrosine to react with lysine, histidine, and tyrosine via SuFEx in proteins in vivo. J Am Chem Soc 140:4995–4999. 10.1021/jacs.8b01087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Liu J, Cao L, Klauser PC, Cheng R, Berdan VY, Sun W, Wang N, Ghelichkhani F, Yu B, Rozovsky S, Wang L. 2021. A genetically encoded fluorosulfonyloxybenzoyl-l-lysine for expansive covalent bonding of proteins via SuFEx chemistry. J Am Chem Soc 143:10341–10351. 10.1021/jacs.1c04259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Xuan W, Shao S, Schultz PG. 2017. Protein crosslinking by genetically encoded noncanonical amino acids with reactive aryl carbamate side chains. Angew Chem Int Ed Engl 56:5096–5100. 10.1002/anie.201611841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Tian Y, Jacinto MP, Zeng Y, Yu Z, Qu J, Liu WR, Lin Q. 2017. Genetically encoded 2-aryl-5-carboxytetrazoles for site-selective protein photo-cross-linking. J Am Chem Soc 139:6078–6081. 10.1021/jacs.7b02615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Hu W, Yuan Y, Wang C-H, Tian H-T, Guo A-D, Nie H-J, Hu H, Tan M, Tang Z, Chen X-H. 2019. Genetically encoded residue-selective photo-crosslinker to capture protein-protein interactions in living cells. Chem 5:2955–2968. 10.1016/j.chempr.2019.08.020. [DOI] [Google Scholar]
- 81.Liu J, Li S, Aslam NA, Zheng F, Yang B, Cheng R, Wang N, Rozovsky S, Wang PG, Wang Q, Wang L. 2019. Genetically encoding photocaged quinone methide to multitarget protein residues covalently in vivo. J Am Chem Soc 141:9458–9462. 10.1021/jacs.9b01738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Liu J, Cheng R, Van Eps N, Wang N, Morizumi T, Ou WL, Klauser PC, Rozovsky S, Ernst OP, Wang L. 2020. Genetically encoded quinone methides enabling rapid, site-specific, and photocontrolled protein modification with amine reagents. J Am Chem Soc 142:17057–17068. 10.1021/jacs.0c06820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Tang S, Beattie AT, Kafkova L, Petris G, Huguenin-Dezot N, Fiedler M, Freeman M, Chin JW. 2022. Mechanism-based traps enable protease and hydrolase substrate discovery. Nature 602:701–707. 10.1038/s41586-022-04414-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Venkat S, Nannapaneni DT, Gregory C, Gan Q, McIntosh M, Fan C. 2017. Genetically encoding thioacetyl-lysine as a non-deacetylatable analog of lysine acetylation in Escherichia coli. FEBS Open Bio 7:1805–1814. 10.1002/2211-5463.12320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Ren C, Wu Q, Xiao R, Ji Y, Yang X, Zhang Z, Qin H, Ma JA, Xuan W. 2022. Expanding the scope of genetically encoded lysine post-translational modifications with lactylation, β-hydroxybutyrylation and lipoylation. Chembiochem 23:e202200302. 10.1002/cbic.202200302. [DOI] [PubMed] [Google Scholar]
- 86.Wang ZA, Kurra Y, Wang X, Zeng Y, Lee YJ, Sharma V, Lin H, Dai SY, Liu WR. 2017. A versatile approach for site-specific lysine acylation in proteins. Angew Chem Int Ed Engl 56:1643–1647. 10.1002/anie.201611415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Hoppmann C, Wong A, Yang B, Li S, Hunter T, Shokat KM, Wang L. 2017. Site-specific incorporation of phosphotyrosine using an expanded genetic code. Nat Chem Biol 13:842–844. 10.1038/nchembio.2406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Li S, Yang B, Kobayashi T, Yu B, Liu J, Wang L. 2020. Genetically encoding thyronine for fluorescent detection of peroxynitrite. Bioorg Med Chem 28:115665. 10.1016/j.bmc.2020.115665. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Zhang F, Zhou Q, Yang G, An L, Li F, Wang J. 2018. A genetically encoded 19F NMR probe for lysine acetylation. Chem Commun (Camb) 54:3879–3882. 10.1039/c7cc09825a. [DOI] [PubMed] [Google Scholar]
- 90.Wang X, Liu D, Shen L, Li F, Li Y, Yang L, Xu T, Tao H, Yao D, Wu L, Hirata K, Bohn LM, Makriyannis A, Liu X, Hua T, Liu ZJ, Wang J. 2021. A genetically encoded F-19 NMR probe reveals the allosteric modulation mechanism of cannabinoid receptor 1. J Am Chem Soc 143:16320–16325. 10.1021/jacs.1c06847. [DOI] [PubMed] [Google Scholar]
- 91.Qianzhu H, Abdelkader EH, Herath ID, Otting G, Huber T. 2022. Site-specific incorporation of 7-fluoro-L-tryptophan into proteins by genetic encoding to monitor ligand binding by 19F NMR spectroscopy. ACS Sens 7:44–49. 10.1021/acssensors.1c02467. [DOI] [PubMed] [Google Scholar]
- 92.Abdelkader EH, Qianzhu H, Tan YJ, Adams LA, Huber T, Otting G. 2021. Genetic encoding of N6-(((trimethylsilyl)methoxy)carbonyl)-l-lysine for NMR studies of protein–protein and protein–ligand interactions. J Am Chem Soc 143:1133–1143. 10.1021/jacs.0c11971. [DOI] [PubMed] [Google Scholar]
- 93.Jones CM, Robkis DM, Blizzard RJ, Munari M, Venkatesh Y, Mihaila TS, Eddins AJ, Mehl RA, Zagotta WN, Gordon SE, Petersson EJ. 2021. Genetic encoding of a highly photostable, long lifetime fluorescent amino acid for imaging in mammalian cells. Chem Sci 12:11955–11964. 10.1039/d1sc01914g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Shang X, Song X, Faller C, Lai R, Li H, Cerny R, Niu W, Guo J. 2017. Fluorogenic protein labeling using a genetically encoded unstrained alkene. Chem Sci 8:1141–1145. 10.1039/c6sc03635j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Fu C, Kobayashi T, Wang N, Hoppmann C, Yang B, Irannejad R, Wang L. 2018. Genetically encoding quinoline reverses chromophore charge and enables fluorescent protein brightening in acidic vesicles. J Am Chem Soc 140:11058–11066. 10.1021/jacs.8b05814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.O'Donoghue P, Ling J, Wang YS, Soll D. 2013. Upgrading protein synthesis for synthetic biology. Nat Chem Biol 9:594–598. 10.1038/nchembio.1339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Owens AE, Grasso KT, Ziegler CA, Fasan R. 2017. Two-tier screening platform for directed evolution of aminoacyl–tRNA synthetases with enhanced stop codon suppression efficiency. Chembiochem 18:1109–1116. 10.1002/cbic.201700039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Sharma V, Zeng Y, Wang WW, Qiao Y, Kurra Y, Liu WR. 2018. Evolving the N-terminal domain of pyrrolysyl-tRNA synthetase for improved incorporation of noncanonical amino acids. Chembiochem 19:26–30. 10.1002/cbic.201700268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Hohl A, Karan R, Akal A, Renn D, Liu X, Ghorpade S, Groll M, Rueping M, Eppinger J. 2019. Engineering a polyspecific pyrrolysyl-tRNA synthetase by a high throughput FACS screen. Sci Rep 9:11971. 10.1038/s41598-019-48357-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Kwok HS, Vargas-Rodriguez O, Melnikov SV, Soll D. 2019. Engineered aminoacyl-tRNA synthetases with improved selectivity toward noncanonical amino acids. ACS Chem Biol 14:603–612. 10.1021/acschembio.9b00088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Bryson DI, Fan C, Guo LT, Miller C, Soll D, Liu DR. 2017. Continuous directed evolution of aminoacyl-tRNA synthetases. Nat Chem Biol 13:1253–1260. 10.1038/nchembio.2474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.DeBenedictis EA, Chory EJ, Gretton DW, Wang B, Golas S, Esvelt KM. 2022. Systematic molecular evolution enables robust biomolecule discovery. Nat Methods 19:55–64. 10.1038/s41592-021-01348-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Fischer JT, Soll D, Tharp JM. 2022. Directed evolution of Methanomethylophilus alvus pyrrolysyl-tRNA synthetase generates a hyperactive and highly selective variant. Front Mol Biosci 9:850613. 10.3389/fmolb.2022.850613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Fladischer P, Weingartner A, Blamauer J, Darnhofer B, Birner-Gruenberger R, Kardashliev T, Ruff AJ, Schwaneberg U, Wiltschi B. 2019. A semi-rationally engineered bacterial pyrrolysyl-tRNA synthetase genetically encodes phenyl azide chemistry. Biotechnol J 14:e1800125. 10.1002/biot.201800125. [DOI] [PubMed] [Google Scholar]
- 105.Jiang HK, Lee MN, Tsou JC, Chang KW, Tseng HW, Chen KP, Li YK, Wang YS. 2020. Linker and N-terminal domain engineering of pyrrolysyl-tRNA synthetase for substrate range shifting and activity enhancement. Front Bioeng Biotechnol 8:235. 10.3389/fbioe.2020.00235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Bloom JD, Labthavikul ST, Otey CR, Arnold FH. 2006. Protein stability promotes evolvability. Proc Natl Acad Sci USA 103:5869–5874. 10.1073/pnas.0510098103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Bloom JD, Silberg JJ, Wilke CO, Drummond DA, Adami C, Arnold FH. 2005. Thermodynamic prediction of protein neutrality. Proc Natl Acad Sci USA 102:606–611. 10.1073/pnas.0406744102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Hu L, Qin X, Huang Y, Cao W, Wang C, Wang Y, Ling X, Chen H, Wu D, Lin Y, Liu T. 2020. Thermophilic pyrrolysyl-tRNA synthetase mutants for enhanced mammalian genetic code expansion. ACS Synth Biol 9:2723–2736. 10.1021/acssynbio.0c00257. [DOI] [PubMed] [Google Scholar]
- 109.Cho CC, Blankenship LR, Ma X, Xu S, Liu W. 2022. The pyrrolysyl-tRNA synthetase activity can be improved by a P188 mutation that stabilizes the full-length enzyme. J Mol Biol 434:167453. 10.1016/j.jmb.2022.167453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Koch NG, Baumann T, Budisa N. 2021. Efficient unnatural protein production by pyrrolysyl-tRNA synthetase with genetically fused solubility tags. Front Bioeng Biotechnol 9:807438. 10.3389/fbioe.2021.807438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Wang J, Kwiatkowski M, Forster AC. 2016. Kinetics of tRNAPyl -mediated amber suppression in Escherichia coli translation reveals unexpected limiting steps and competing reactions. Biotechnol Bioeng 113:1552–1559. 10.1002/bit.25917. [DOI] [PubMed] [Google Scholar]
- 112.Uhlenbeck OC, Schrader JM. 2018. Evolutionary tuning impacts the design of bacterial tRNAs for the incorporation of unnatural amino acids by ribosomes. Curr Opin Chem Biol 46:138–145. 10.1016/j.cbpa.2018.07.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Fan C, Xiong H, Reynolds NM, Soll D. 2015. Rationally evolving tRNAPyl for efficient incorporation of noncanonical amino acids. Nucleic Acids Res 43:e156. 10.1093/nar/gkv800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Serfling R, Lorenz C, Etzel M, Schicht G, Bottke T, Morl M, Coin I. 2018. Designer tRNAs for efficient incorporation of non-canonical amino acids by the pyrrolysine system in mammalian cells. Nucleic Acids Res 46:1–10. 10.1093/nar/gkx1156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Venkat S, Sturges J, Stahman A, Gregory C, Gan Q, Fan C. 2018. Genetically incorporating two distinct post-translational modifications into one protein simultaneously. ACS Synth Biol 7:689–695. 10.1021/acssynbio.7b00408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Tharp JM, Liu WR. 2018. Using amber and ochre nonsense codons to code two different noncanonical amino acids in one protein gene. Methods Mol Biol 1728:147–154. 10.1007/978-1-4939-7574-7_9. [DOI] [PubMed] [Google Scholar]
- 117.Italia JS, Addy PS, Erickson SB, Peeler JC, Weerapana E, Chatterjee A. 2019. Mutually orthogonal nonsense-suppression systems and conjugation chemistries for precise protein labeling at up to three distinct sites. J Am Chem Soc 141:6204–6212. 10.1021/jacs.8b12954. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Meineke B, Heimgartner J, Eirich J, Landreh M, Elsasser SJ. 2020. Site-specific incorporation of two ncAAs for two-color bioorthogonal labeling and crosslinking of proteins on live mammalian cells. Cell Rep 31:107811. 10.1016/j.celrep.2020.107811. [DOI] [PubMed] [Google Scholar]
- 119.Fu X, Huang Y, Shen Y. 2022. Improving the efficiency and orthogonality of genetic code expansion. BioDesign Res 2022:9896125. 10.34133/2022/9896125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Robertson WE, Funke LFH, de la Torre D, Fredens J, Elliott TS, Spinck M, Christova Y, Cervettini D, Boge FL, Liu KC, Buse S, Maslen S, Salmond GPC, Chin JW. 2021. Sense codon reassignment enables viral resistance and encoded polymer synthesis. Science 372:1057–1062. 10.1126/science.abg3029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Fischer EC, Hashimoto K, Zhang Y, Feldman AW, Dien VT, Karadeema RJ, Adhikary R, Ledbetter MP, Krishnamurthy R, Romesberg FE. 2020. New codons for efficient production of unnatural proteins in a semisynthetic organism. Nat Chem Biol 16:570–576. 10.1038/s41589-020-0507-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Ding W, Zhao H, Chen Y, Zhang B, Yang Y, Zang J, Wu J, Lin S. 2020. Chimeric design of pyrrolysyl-tRNA synthetase/tRNA pairs and canonical synthetase/tRNA pairs for genetic code expansion. Nat Commun 11:3154. 10.1038/s41467-020-16898-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Voller JS, Budisa N. 2017. Coupling genetic code expansion and metabolic engineering for synthetic cells. Curr Opin Biotechnol 48:1–7. 10.1016/j.copbio.2017.02.002. [DOI] [PubMed] [Google Scholar]
- 124.Chen Y, Tang J, Wang L, Tian Z, Cardenas A, Fang X, Chatterjee A, Xiao H. 2020. Creation of bacterial cells with 5-hydroxytryptophan as a 21st amino acid building block. Chem 6:2717–2727. 10.1016/j.chempr.2020.07.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Exner MP, Kuenzl T, To TM, Ouyang Z, Schwagerus S, Hoesl MG, Hackenberger CP, Lensen MC, Panke S, Budisa N. 2017. Design of S-allylcysteine in situ production and incorporation based on a novel pyrrolysyl-tRNA synthetase variant. Chembiochem 18:85–90. 10.1002/cbic.201600537. [DOI] [PubMed] [Google Scholar]
- 126.Mehl RA, Anderson JC, Santoro SW, Wang L, Martin AB, King DS, Horn DM, Schultz PG. 2003. Generation of a bacterium with a 21 amino acid genetic code. J Am Chem Soc 125:935–939. 10.1021/ja0284153. [DOI] [PubMed] [Google Scholar]
- 127.Ho JML, Miller CA, Smith KA, Mattia JR, Bennett MR. 2021. Improved pyrrolysine biosynthesis through phage assisted non-continuous directed evolution of the complete pathway. Nat Commun 12:3914. 10.1038/s41467-021-24183-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Fu C, Chen Q, Zheng F, Yang L, Li H, Zhao Q, Wang X, Wang L, Wang Q. 2019. Genetically encoding a lipidated amino acid for extension of protein half-life in vivo. Angew Chem Int Ed Engl 58:1392–1396. 10.1002/anie.201811837. [DOI] [PubMed] [Google Scholar]
- 129.Li S, Wang N, Yu B, Sun W, Wang L. 2022. Genetically encoded chemical crosslinking of carbohydrate. Nat Chem 15:33–42. 10.1038/s41557-022-01059-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Xiong Q, Zheng T, Shen X, Li B, Fu J, Zhao X, Wang C, Yu Z. 2022. Expanding the functionality of proteins with genetically encoded dibenzo[b,f][1,4,5]thiadiazepine: a photo-transducer for photo-click decoration. Chem Sci 13:3571–3581. 10.1039/D1SC05710C. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Kita A, Hino N, Higashi S, Hirota K, Narumi R, Adachi J, Takafuji K, Ishimoto K, Okada Y, Sakamoto K, Tomonaga T, Takashima S, Mizuguchi H, Doi T. 2016. Adenovirus vector-based incorporation of a photo-cross-linkable amino acid into proteins in human primary cells and cancerous cell lines. Sci Rep 6:36946. 10.1038/srep36946. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Kipper K, Lundius EG, Curic V, Nikic I, Wiessler M, Lemke EA, Elf J. 2017. Application of noncanonical amino acids for protein labeling in a genomically recoded Escherichia coli. ACS Synth Biol 6:233–255. 10.1021/acssynbio.6b00138. [DOI] [PubMed] [Google Scholar]
- 133.Ohtake K, Mukai T, Iraha F, Takahashi M, Haruna KI, Date M, Yokoyama K, Sakamoto K. 2018. Engineering an automaturing transglutaminase with enhanced thermostability by genetic code expansion with two codon reassignments. ACS Synth Biol 7:2170–2176. 10.1021/acssynbio.8b00157. [DOI] [PubMed] [Google Scholar]
- 134.Vatansever EC, Yang KS, Geng ZZ, Qiao Y, Li P, Xu S, Liu WR. 2022. A designed, highly efficient pyrrolysyl-tRNA synthetase mutant binds o-chlorophenylalanine using two halogen bonds. J Mol Biol 434:167534. 10.1016/j.jmb.2022.167534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Huguenin-Dezot N, Alonzo DA, Heberlig GW, Mahesh M, Nguyen DP, Dornan MH, Boddy CN, Schmeing TM, Chin JW. 2019. Trapping biosynthetic acyl-enzyme intermediates with encoded 2,3-diaminopropionic acid. Nature 565:112–117. 10.1038/s41586-018-0781-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Liu J, Zheng F, Cheng R, Li S, Rozovsky S, Wang Q, Wang L. 2018. Site-specific incorporation of selenocysteine using an expanded genetic code and palladium-mediated chemical deprotection. J Am Chem Soc 140:8807–8816. 10.1021/jacs.8b04603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Fottner M, Brunner AD, Bittl V, Horn-Ghetko D, Jussupow A, Kaila VRI, Bremm A, Lang K. 2019. Site-specific ubiquitylation and SUMOylation using genetic-code expansion and sortase. Nat Chem Biol 15:276–284. 10.1038/s41589-019-0227-4. [DOI] [PubMed] [Google Scholar]
- 138.Jang HS, Jana S, Blizzard RJ, Meeuwsen JC, Mehl RA. 2020. Access to faster eukaryotic cell labeling with encoded tetrazine amino acids. J Am Chem Soc 142:7245–7249. 10.1021/jacs.9b11520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Pott M, Tinzl M, Hayashi T, Ota Y, Dunkelmann D, Mittl PRE, Hilvert D. 2021. Noncanonical heme ligands steer carbene transfer reactivity in an artificial metalloenzyme. Angew Chem Int Ed Engl 60:15063–15068. 10.1002/anie.202103437. [DOI] [PubMed] [Google Scholar]
- 140.Koch NG, Goettig P, Rappsilber J, Budisa N. 2021. Engineering pyrrolysyl-tRNA synthetase for the incorporation of non-canonical amino acids with smaller side chains. Int J Mol Sci 22:11194. 10.3390/ijms222011194. [DOI] [PMC free article] [PubMed] [Google Scholar]



