Abstract
Site-specific incorporation of distinct non-canonical amino acids into proteins via genetic code expansion requires mutually orthogonal aminoacyl-tRNA synthetase/tRNA pairs. Pyrrolysyl-tRNA synthetase (PylRS)/tRNAPyl pairs are ideal for genetic code expansion and have been extensively engineered for developing mutually orthogonal pairs. Here, we identify two novel wild-type PylRS/tRNAPyl pairs simultaneously present in the deep-rooted extremely halophilic euryarchaeal methanogen Candidatus Methanohalarchaeum thermophilum HMET1, and show that both pairs are functional in the model halophilic archaeon Haloferax volcanii. These pairs consist of two different PylRS enzymes and two distinct tRNAs with dissimilar discriminator bases. Surprisingly, these two PylRS/tRNAPyl pairs display mutual orthogonality enabled by two unique features, the A73 discriminator base of tRNAPyl2 and a shorter motif 2 loop in PylRS2. In vivo translation experiments show that tRNAPyl2 charging by PylRS2 is defined by the enzyme's shortened motif 2 loop. Finally, we demonstrate that the two HMET1 PylRS/tRNAPyl pairs can simultaneously decode UAG and UAA codons for incorporation of two distinct noncanonical amino acids into protein. This example of a single base change in a tRNA leading to additional coding capacity suggests that the growth of the genetic code is not yet limited by the number of identity elements fitting into the tRNA structure.
INTRODUCTION
Accurate translation of the genetic code depends on several molecular determinants in a tRNA species (identity elements) that are responsible for the recognition by the cognate aminoacyl-tRNA synthetase (aaRS) leading to correct aminoacyl-tRNA (aa-tRNA) formation (1). One important tRNA identity element, the 4th nucleotide from the 3′-end was recognized early on (2) and named the discriminator base; in combination with several other nucleotides/identity elements (mainly in the anticodon and the acceptor helix), it is involved in providing acylation accuracy for at least eighteen aaRSs (1).
Pyrrolysine (Pyl) is the 22nd genetically encoded amino acid found in many archaea and certain bacteria (3). Pyrrolysyl-tRNA synthetase (PylRS) together with tRNAPyl supply Pyl-tRNAPyl for incorporation of Pyl into proteins in response to the UAG codon (4). Because many PylRS variants possess remarkably relaxed amino acid specificity (5), and the enzyme does not recognize the tRNA anticodon as identity element (6), PylRS became the tRNA synthetase of choice for synthetic biologists to expand the genetic code with noncanonical amino acids (5,7). Most PylRS enzymes consist of two functional domains, an N-terminal tRNA-binding domain and a C-terminal catalytic domain (CTD) (8,9). The two domains are either contained in the same polypeptide or occur as separate proteins. Recently, the ΔPylSn class of PylRS enzymes was discovered in certain archaea (10–12). The ΔPylSn lack the N-terminal domain and display strong activity of the catalytic CTD alone (13,14). In the few years since their discovery, ΔPylSn enzymes have become a popular tool for incorporation of noncanonical amino acids (ncAAs) in a variety of organisms (13–21).
Candidatus Methanohalarchaeum thermophilum HMET1 is a member of the extreme halophilic methanogens (22); they belong to the class Methanonatronarchaeia in which the type species is Methanonatronarchaeum thermophilum (22,23). The phylogenetic affinity of this group remains a matter of debate (24–26), but recent phylogenetic analysis suggests that it forms the deepest clade in the Halobacteriota superphylum that includes classes Halobacteria, Methanosarcinia, Methanomicrobia, Archaeoglobi, and Methanonatronarchaeia (27). Regardless of the exact position of this clade, it is becoming increasingly clear that methanogenesis originated early in archaeal evolution, likely in the last common ancestor of archaea, and was later lost independently in extant non-methanogen lineages within TACK superphylum and possibly later within Halobacteriota superphylum (25,27,28). Pyrrolysine biosynthesis and incorporation of Pyl into several proteins specific to methanogenesis (e.g. methylamine methyltransferases) is observed in the methanogenic lineages of Halobacteriota and Thermoplasmatota (super)phyla. Thus, it is currently inferred that the Pyl system originated in the common ancestor of these two phyla which are sister groups in the archaeal phylogeny (29). Most genomes encode a single PylRS gene. One of the few exceptions is the HMET1 genome, which harbors two different PylRS genes.
Here we characterize the unusual properties and demonstrate the mutual orthogonality of the two distinct Pyl incorporation systems in Candidatus Methanohalarchaeum thermophilum HMET1.
MATERIALS AND METHODS
Plasmids
All plasmids used in this study are listed in Supplementary Table S1. Cassettes encoding PylRS, tRNAPyl and Flag-SAMP1 or Flag-SAMP1(SAMP1UAG24) genes were synthesized and cloned into pJAM202c-based vectors by Gibson assembly. To construct plasmids containing noncognate PylRS/tRNAPyl pairs, PylRS and tRNAPyl coding sequences were swapped between different vectors by inserting the noncognate tRNAPyl gene into the vector devoid of tRNAPyl via Gibson assembly. Cassette amplification, removal of PylRS or tRNAPyl gene from expression vectors, and site-directed mutagenesis were done by PCR with primers listed in Supplementary Table S2.
Strains and culture conditions
Strains used in this study are listed in Supplementary Table S1. Escherichia coli (E. coli) DH5α was used for routine selection and propagation of plasmid DNA. E. coli GM2163 was used to prepare plasmid DNA that are free of Dam and Dcm methylations prior to H. volcanii transformation. E. coli strains were grown in Luria-Bertani (LB) medium at 37°C and H. volcanii cells were grown at 42°C in Hv-YPC by rotary shaking at 200 rpm. Media were supplemented with ampicillin (Amp, 0.1 mg/ml), spectinomycin (Sm, 50 μg/ml), tetracycline (Tet, 12.5 μg/ml), kanamycin (Km, 50 mg/ml), novobiocin (Nv, 0.5 μg/ml), Nϵ-Boc-l-lysine (BocK, 1mM) and 3-iodo-l-Phe (3-I-Phe, 1mM) as needed. Solid medium containing 2.0% (wt/vol) agar was used to culture H. volcanii cells. H. volcanii colonies were streaked from the -80°C freezer stocks onto Hv-YPC agar plates and freshly isolated colonies of H. volcanii were inoculated into 2.5 ml medium and grown to log phase (optical density at 600 nm [OD600], 0.6 to 1.2). The log-phase cells were then subcultured into 4.0 ml fresh Hv-YPC medium for future assays.
In vivo activity and solubility assays for PylRS/tRNAPyl pairs in E. coli
To measure the amber suppression activity of PylRS/tRNAPyl pairs in E. coli, pSCW11-PylRS/tRNAPyl and pBAD-sfGFP(150TAG) plasmids were co-transformed into E. coli TOP10 cells via electroporation. The transformed cells were recovered for 1 hour in 500 μl SOC medium at 37°C. Then, 100 μl of the transformed cells was inoculated into 1 mL fresh medium (LB containing 50 μg/ml spectinomycin, 50 μg/ml ampicillin and 0.2% arabinose) in 15 mL culture tube, supplemented with or without 1 mM BocK. OD600 and sfGFP fluorescence (λex: 480 nm; λem: 520 nm) were measured during overnight culture at 37°C. For PylRS solubility assay, the E. coli BL21(DE3) cells containing different pET28a-PylRS plasmids were incubated at 37°C, grown to OD600 at 0.6, followed by protein expression induction by addition of 1 mM IPTG at 20 °C for 16 hr. E. coli cells were harvested via centrifugation (5,000 g for 10 min at 25 °C) and divided into two portions. One portion was resuspended in 2X reducing SDS loading buffer and boiled for 15 minutes prior to SDS-PAGE analysis, which represents the whole cell lysate portion. To prepare the supernatant extract, the harvested cells of the second portion were lysed by French press homogenization (12,000 lb/in2) in lysis buffer A (20 mM Tris-HCl [pH 7.4], 1 mM DTT and 300 mM or 500 mM) supplemented with 1 mg·ml−1 DNase I (Tiangen) and 1X protease inhibitor cocktail (Roche). The cell lysate was clarified by centrifugation (20,000 g for 30 min at 4 °C) to remove the cell debris. The supernatant faction (20 μl) was collected and mixed with an equal volume of 2X SDS-PAGE loading buffer prior to SDS-PAGE analysis. The amount of soluble PylRS proteins by two different treatments was determined on 12% reducing SDS-PAGE gel and stained with Coomassie G250 dye.
In vivo activity assay for the HMET1 PylRS/tRNAPyl pairs in H. volcanii
To examine the amber suppression activity, H. volcanii cells expressing different plasmids containing reporter proteins and different combinations of the wild-type or engineered PylRS/tRNAPyl (listed in Supplementary Table S1) were re-patched on the Hv-YPC plate, then were grown to log phase (OD600 at 0.4 to 0.8) in Hv-YPC medium. The mid-log phase cells (OD600, 0.6 to 0.8) were inoculated into 4 ml fresh Hv-YPC medium supplemented Nv, with or without 1mM ncAAs (BocK and 3-I-Phe) and grown to stationary phase (OD600, 2.0 to 3.0). Cell pellets were harvested by centrifugation (10,000 g, 5 min, 25°C) and used to analyze the expression level of SAMP1 reporter proteins by immunoblotting as described below.
SDS-PAGE and immunoblotting analysis
Cell pellets (2 OD600 units) were resuspended in 100 μl of 2X SDS-PAGE loading buffer by vigorous pipetting and vortexing and were then boiled for 15 minutes prior to separation by SDS-PAGE analysis. The OD600 of the cell culture was measured to normalize the loading amount of total cellular proteins (0.1 units per lane) and equivalent levels of protein loading were confirmed by Coomassie G250 staining of parallel gels. Total cellular proteins were separated on 12% reducing SDS-PAGE gel followed by electroblotting onto polyvinylidene difluoride (PVDF) membranes (Amersham) according to the BioRad standard protocol. Flag-tagged proteins were detected by alkaline phosphatase-linked anti-Flag M2 monoclonal antibody (Sigma-Aldrich). Immunoreactive antigens were detected by chemiluminescence using horseradish peroxidase (HRP) substrate (Immobilon).
Immunoprecipitation
H. volcanii cells were harvested by centrifugation (8,000 g, 10 min, 25°C) and washed twice with TBS buffer (150 mM NaCl, 50 mM Tris-HCl, pH 7.4). Cell pellets were stored at -20°C until further use. The harvested cells were lysed by French press homogenization (12,000 lb/in2) in lysis buffer composed of TBS supplemented with 0.1 mM CaCl2, 2 mM MgCl2, DNase I and protease inhibitor cocktail. Cell debris was removed by centrifugation (20,000 g, 40 min, 4°C) and filtration (0.22 μm) and the clarified cell lysate was then applied to an anti-Flag column pre-equilibrated with TBS. The column was 1 cm in diameter and was filled with 0.2–0.4 ml anti-Flag G1 affinity resin (Genscript). The supernatant was incubated with resin at room temperature for 1 h. The bound proteins were washed in 15 mL TBS prior to elution in 100 μl 2X reducing SDS-PAGE loading buffer. The purified Flag-tagged proteins were separated on 12% reducing SDS-PAGE gel and stained with Coomassie G250 dye. The protein band representing SAMP1(SAMP1UAG24)-MoaE conjugate at 50 kDa was cut for further analysis with liquid chromatography-tandem mass spectrometry (LC-MS/MS).
In-gel digestion and LC-MS/MS analysis
Proteins of interest were first excised from the gel. After destaining, reduction and alkylation, the proteins in gel were digested by 0.02 μg/μl trypsin. The peptides were then extracted by acetonitrile. After drying, the peptides were dissolved in 2% (vol/vol) acetonitrile containing 0.1% (vol/vol) formic acid. A total of 1 μg peptides was injected into an Q Exactive HF hybrid quadrupole-Orbitrap mass spectrometer equipped with an UltiMate 3000 UHPLC (Thermo Scientific). The LC conditions and MS parameters were described previously (30). The raw data was analyzed by Proteome Discoverer 1.4 with the Mascot search algorithm. Mascot was set up to search the database including all cellular proteins of H. volcanii DS2, common pollution proteins, SAMP1-MoaE protein with BocK and 20 natural amino acids at G24 residue and SAMP1-MoaE protein with BocK, 3-I-Phe and 20 natural amino acids at G14 and G24 residue assuming the digestion with trypsin. Precursor and fragment mass tolerance was set at 10 ppm and 50 mmu, respectively. Carbamidomethyl of cysteine was specified as a fixed modification. Deamidation of Asn and Gln and oxidation of Met were specified as variable modifications. The false-discovery rate (FDR) was specified at 1.0% to filter the peptides. Finally, the MS/MS spectrums were labeled by pLabel software.
3D homology structural modeling and analysis
HMET1 PylRS1 and PylRS2 were comparatively modelled by Phyre2 (Protein Homology/analogY Recognition Engine (31) and 100% of residues modelled at > 90% confidence. Visualization and electrostatic potentials analysis of HMET1 PylRS1 and PylRS2 and M. mazei PylRS catalytic domain (PDB 2ZIM) were conducted by UCSF Chimera (32). Superposition of the model structure of HMET1 PylRS1/2 onto Desulfitobacterium hafniense (D. hafniense) PylRS-tRNAPyl complex (PDB 2ZNI) and superposition of crystal structures of H. volcanii SAMP1 and UbaA onto E. coli MoeB-MoaD complex (PDB 3PO0) were also obtained using UCSF Chimera.
Sequence and phylogenetic analysis
Amino acid sequences and comparative genomic data were retrieved from the archaeal clusters of orthologous genes (arCOG) database (33). Multiple alignment of PylS proteins was constructed using MUSCLE (34). The maximum-likelihood phylogenetic tree was built using IQ-TREE version 2.1.2 based on the best-fitting model, LG + I + G4 (35). PSI-BLAST (36) with default parameters was used to search for PylS homologs in the NCBI non-redundant (NR) database. To identify genes with in-frame stop codons, arCOG profiles were mapped to six-frame genome translations using PSI-BLAST (36),and then, the genome fragments corresponding to the full-length hit matches were scanned for stop codons (excluding the first and the last 20 codons).
RESULTS
Occurrence of the ΔPylSn-type pyrrolysyl-tRNA synthetase among archaea
Examination of the archaeal clusters of orthologous genes (arCOGs) (33) revealed that PylRS is encoded in 44 of the 524 archaeal genomes, of which the ΔPylSn enzyme is encoded only in nine complete or nearly complete genomes (Supplementary Table S3). Among the latter, Methanomassiliicoccus luminyensis B10 (Thermoplasmatales) and Candidatus Methanohalarchaeum thermophilum HMET1 (Methanonatronarchaeia) encompass two paralogous versions of ΔPylSn genes (Figure 1 and Supplementary Table S3). These are the only known organisms that encode two distinct PylRS enzymes (22). Using PylRS from HMET1 (OKY77552.1) as a query, we additionally searched the NCBI non-redundant (NR) database (December 2021) to identify other such genomes, but no sufficiently complete genomes with two or more paralogs of ΔPylSn were found. Therefore, for further analysis, we proceeded with complete genomes from the arCOG database (Supplementary Table S3). Using the multiple amino acid sequence alignment of the catalytic domains of non-redundant PylRS sequences, we reconstructed a phylogenetic tree and mapped the Pyl biosynthesis and incorporation genes pylS/pylD/pylC gene neighborhoods to the branches of this tree (Figure 1, Supplementary Table S4 and Additional File 1). Generally, the tree topology was compatible with the phylogeny of the respective archaeal species, with monophyletic Methanosarcinales, Methanonatronarchaeia and Thermoplasmales. The single exception was pylS from Methermicoccus shengliensis DSM 18856, which belongs to the phylum Methanosarcinales and appears to represent a rare case of horizontal gene transfer from a phylogenetically distant archaeal lineage and displacement of Pyl biosynthesis genes. These observations suggest that duplications of pylS genes and tRNAPyl genes (pylT) encoded upstream of pylS occurred independently in HMET1 and M. luminyensis B10.
Figure 1.
Phylogenetic tree of PylRS proteins and the gene neighborhoods of pylS genes. Maximum-likelihood, unrooted phylogenetic tree was built using IQ-tree based on the multiple alignment of PylRS catalytic domain (348 amino-acid sites) with aBayes support values shown for internal branches. The root was placed between Halobacteriota and Thermoplasmatota phyla according to the established archaeal phylogeny (27). The tree shown in this figure includes only ΔPylSn sequences; the complete tree is provided in the Additional File 1 in Supplementary Information. The leaves are identified by GenBank protein accession numbers and species names. Two species encoding two ΔPylSn proteins are highlighted in orange and magenta for Candidatus Methanohalarchaeum thermophilum HMET1 and Methanomassiliicoccus luminyensis B10, respectively. Asterisk indicates the sequence from Methermicoccus shengliensis, which does not belong to the Thermoplasmatales lineage. Genomic neighborhood of pylS genes is mapped to each leaf and is shown to the right of the tree. The genes are shown roughly to scale as arrows. If the intergenic distance was shortened for compactness, it is shown by a dashed line or, if genes are not next to each other, using the ‘//’ symbol as a separator. Protein names are provided below the arrows if available. The genes are colored according to their general functions as follows: yellow, genes involved in Pyl biosynthesis; pale blue, genes involved in methanogenesis; green, tightly linked pair of genes coding for putative amino acid transporter RhaT and uncharacterized protein of arCOG04962; white, all other genes. The genes containing in-frame stop codon are marked by red rectangles. Detailed information on these neighborhoods is provided in Supplementary Table S4.
Further analysis of HMET1 and M. luminyensis B10 genomes led to the identification of different sets of genes that appear to be duplicated in addition to pylS. In HMET1, the duplicated genes include the Pyl biosynthesis pylB gene encoding (2R,3R)-3-methylornithine synthase, mtmB encoding monomethylamine methyltransferase and mtbC1 encoding the methylamine corrinoid protein, whereas in M. luminyensis B10, three genes for Pyl biosynthesis (pylB, pylC, pylD), mtmB and mtbC1 are duplicated. Considering that these genes often appear in the vicinity of pylS, most likely, in both cases, the duplication involved a large DNA region, with several genes lost or transposed to other locations during subsequent evolution. Next, we analyzed the mono-, bi- and trimethylamine methyltransferase genes (mtmB, mtbB and mttB) containing an in-frame stop codon in the nine genomes encoding ΔPylSn; the number of TAG-containing genes in these genomes ranged from two to six (Supplementary Figure S1). Although the available data are insufficient for rigorous statistical testing, these observations suggest that pylS duplication is not tightly linked to duplication(s) of the Pyl encoding genes. Thus, the occurrence of two PylRS enzymes in HMET1 appears not to be necessitated by an increased number of UAG codons in this genome.
Candidatus Methanohalarchaeum thermophilum HMET1 encodes two distinct PylRS/tRNAPyl pairs
Extension of our genomic analysis of pylS genes to the cognate tRNAPyl species led to the remarkable discovery, in the HMET1 strain, of two tRNAPyl (pylT) genes (Figure 2A), with different discriminator bases (position 73 of the tRNA) (2). Like all other known tRNAPyl species, HMET1 tRNAPyl1 uses the discriminator base G, whereas HMET1 tRNAPyl2 has A as discriminator base. The pylT genes are located adjacent to the pylS genes: pylT1 and pylT2 are immediately upstream of pylS1 and pylS2, respectively (Figure 1). These pylT genes contain one or two introns with bulge-helix-bulge splicing motifs that presumably are removed during processing of the HMET1 pre-tRNAPyl (37,38) (Supplementary Figure S2). HMET1 tRNAPyl1 resembles the canonical tRNAPyl except for the presence of an extra nucleotide between the D-arm and the anticodon arm (Figure 2A), similar to the tRNAPyl of methanogenic archaeon ISO4-G1 (39). In contrast, tRNAPyl2 contains base substitutions in the variable loop, in addition to the noncanonical A73 (Figure 2A). Could PylRS1 with tRNAPyl1 and PylRS2 with tRNAPyl2 form cognate pairs?
Figure 2.
Prediction and structural analysis of the two HMET1 PylRS/tRNAPyl pairs. (A) Inferred cloverleaf structures of HMET1 tRNAPyl1 and tRNAPyl2 compared with the M. alvus tRNAPyl. (B) 3D structural model of the two HMET1 PylRSs (all residues modeled at > 90% confidence) compared with the X-ray crystal structure of M. alvus PylRS (PDB 6JP2). Electrostatic potential of the selected PylRSs is represented by coulombic surface coloring with the unit of the potential colored in the range of values − 10 (red), 0 (white), and 10 (blue) kcal/mol · e using Chimera v 1.12.
PylRS1 and PylRS2 show 53% sequence identity; the predicted 3D structures of both enzymes are similar and closely resemble the catalytic domain of Methanomethylophilus alvus (M. alvus) PylRS and other enzymes of the Class II aaRS family as demonstrated by homology modeling (Figure 2B). HMET1, found in hypersaline environments, contains enzymes that often have reduced levels of hydrophobic residues (40). Not surprisingly, the surface of HMET1 PylRSs encompasses more charged amino acids compared with M. alvus PylRS (Figure 2B). Taken together, we discovered two novel PylRS/tRNAPyl pairs that naturally co-exist in the extremophile HMET1.
Development of a facile system for in vivo characterization of the HMET1 PylRS/tRNAPyl pairs
To determine whether the PylRS/tRNAPyl pairs are functional in vivo, we first investigated the ability of PylRS2/tRNAPyl2 pair to produce superfolder green fluorescent protein (sfGFP) from the sfGFP(150TAG) gene in the presence and in the absence of the non-proteinogenic amino acid BocK, a known substrate for many PylRS enzymes. Expression in E. coli by Methanosarcina mazei (M. mazei) and M. alvus PylRS/tRNAPyl pairs produces a large amount of soluble sfGFP (positive control), whereas expression of the HMET1 PylRS1/tRNAPyl1 pair yielded only insoluble PylRS in the supernatant extract after cell lysis (Supplementary Figure S3). This finding is not surprising because halophilic enzymes often aggregate and misfold in low ionic conditions (41). Thus, functional characterization of HMET1 PylRS/tRNAPyl translation products in the well-established E. coli system is unfeasible.
Therefore, we decided to use H. volcanii, a model halophilic euryarchaeon (42), as the chassis for functional characterization of the two PylRS/tRNAPyl pairs from HMET1. A system intended for genetic code expansion in H. volcanii requires: (i) high level expression of the PylRS/tRNAPyl pair; and (ii) a reporter protein that can be readily detected for measuring the BocK incorporation in response to an in-frame UAG stop codon. We developed a plasmid-based cassette for co-expressing the PylRS/tRNAPyl pair and a small archaeal ubiquitin-like modifier protein (SAMP1) as the reporter in H. volcanii (Figure 3A). The reasons for the choice of SAMP1 were: (i) SAMP1 is small and a homolog of ubiquitin, which has often been utilized as the reporter protein for genetic code expansion experiments (43), (ii) the known structure of SAMP1 can guide us to select permissive sites for BocK incorporation; (iii) only full-length SAMP1 can modify protein substrates, such as MoaE (the large subunit of molybdopterin synthase), and the SAMP1 conjugate can be readily detected by immunoblotting (44). In contrast, unsuccessful UAG suppression leads to truncated SAMP1 protein, which is unable to modify cellular substrates. Immunoblotting was utilized to estimate UAG suppression of the reporter protein.
Figure 3.
Heterologous protein expression in H. volcanii. (A) Schematic representation of the shuttle plasmid for co-expression of genes encoding PylRS/tRNAPyl pairs and SAMP1 protein. Promoters, terminators, and RBS are indicated. To estimate UAG suppression activity of HMET1 PylRS/tRNAPyl pairs, a SAMP1 gene with UAG as codon 24 (SAMP1UAG24) was utilized. (B) Soluble PylRS1 and PylRS2 proteins in the supernatant extract after cell lysis were detected via anti-His tag antibody immunoblotting (IB). (C) The expression of the wild-type Flag-SAMP1 was detected via anti-Flag antibody IB in cell lysate. The lower band corresponds to the full-length SAMP1 that is not conjugated with other proteins and the upper band corresponds to the SAMP1-MoaE conjugate. (D) The predicted structural model of SAMP1-UbaA complex based on the crystal structure of a bacterial MoeB-MoaD complex (1JW9). The glycine 14 and 24 residue (cyan) of the wild-type SAMP1 (PDB 3PO0) for site-specific incorporation of ncAAs are indicated. Experiments were performed in duplicate, and representative images are shown.
For expression of tRNAPyl in H. volcanii, we placed the tRNA coding sequence and 15 nt each of upstream and downstream flanking sequences under control of the constitutive tRNALys promoter (45). A polycistronic expression system driven by a strong constitutive promoter was utilized for producing PylRS and the reporter protein SAMP1 (Figure 3A). The recombinant PylRS protein with an N-terminal 6xHis-tag expressed well in H. volcanii and was soluble in the supernatant extract after cell lysis (Figure 3B). In order to increase the expression level of the samp1 gene, a ribosome binding site (RBS) sequence was inserted ahead of the SAMP1 coding sequence (46), resulting in substantially increased abundance of free SAMP1 protein as well as its major conjugated form (SAMP1-MoaE) (Figure 3C).
Next, we sought to determine the permissive site(s) in the SAMP1 protein for site-specific incorporation of BocK. Our structural modeling showed that Gly24 is away from the SAMP1-UbaA (the H. volcanii ubiquitin-activating E1 enzyme homolog) interface, and Gly24 is located in the solvent-exposed loop (Figure 3D). Thus, a SAMP1 gene with UAG as codon 24 (SAMP1UAG24) provides the reporter protein SAMP1BocK24 where BocK should have a minimal impact on the structure and function of SAMP1. Because SAMP1 was mainly in the conjugated form under our culture conditions (Figure 3C), we evaluated the in vivo activity of the HMET1 PylRS/tRNAPyl pairs by measuring the protein abundance of the SAMP1UAG24-MoaE conjugate in the following experiments.
Both HMET1 PylRS/tRNAPyl pairs are active and orthogonal in H. volcanii
In order to test whether the two HMET1 PylRS/tRNAPyl pairs are functional and orthogonal in H. volcanii, we transformed plasmids encoding the PylRS/tRNAPyl pairs and SAMP1UAG24, and cultured the cells in the presence or in the absence of 1 mM BocK. The ability to form UAG suppressor BocK-tRNAPyl was assayed by monitoring the production of the SAMP1Bock24-MoaE conjugate via anti-Flag immunoblotting. The results demonstrated that both HMET1 PylRS/tRNAPyl pairs facilitated efficient amber suppression as assessed by comparing the abundance of the produced SAMP1-MoaE conjugate to that formed by the wild-type under the same conditions (Figure 4A-B, lanes 4 versus 2). We observed minimal UAG readthrough by HMET1 PylRS/tRNAPyl pairs in the absence of BocK (Figure 4A-B, lane 3) or when the HMET1 PylRS or tRNAPyl were omitted (Figure 4A-B, lanes 5 to 8). Thus, both HMET1 PylRS/tRNAPyl pairs are orthogonal with respect to the aaRS enzymes and tRNA species endogenously present in H. volcanii.
Figure 4.
Both HMET1 PylRS/tRNAPyl pairs are active and orthogonal in H. volcanii. (A-B) Activity and orthogonality tests of HMET1 PylRS/tRNAPyl pairs in the presence and absence of BocK when compared to the abundance of wild-type SAMP1-MoaE conjugate formed under the same conditions. SAMP1-MoaE and SAMP1Bock24-MoaE conjugates were detected by anti-Flag antibody immunoblotting in cell lysate. Experiments were performed in triplicates, and representative images are shown. (C) High mass accuracy tandem mass spectrometry MS/MS unambiguously confirmed site-specific incorporation of BocK mediated by PylRS1/tRNAPyl1 (left) and PylRS2/tRNAPyl2 (right). Representative MS/MS spectra of BocK-containing peptides derived by collision-induced dissociation of the doubly charged precursor after trypsin digestion of the purified SAMP1Bock24-MoaE conjugate. The y-ion (colored blue) and b-ion (colored red) series detected are indicated.
To further validate BocK incorporation, the reporter protein was analyzed with LC-MS/MS. The SAMP1BocK24 and SAMP1Bock24-MoaE conjugate were purified by immunoprecipitation, and they represent the two major bands on the SDS-PAGE gel (Supplementary Figure S4). The conjugate band was extracted from the gel and then digested by trypsin and analyzed with tandem MS. A series of b and y ions unambiguously indicate that BocK was incorporated at the UAG-specified position 24 (Figure 4C). This result confirms that native HMET1 PylRS/tRNAPyl pairs incorporate BocK into proteins via genetic code expansion in H. volcanii.
The discriminator base G73 is a major tRNAPyl identity element (6). Because HMET1 tRNAPyl1 contains the canonical G73 while tRNAPyl2 carries A73, we tested the effect of swapping the N73 base (between tRNAPyl1 and tRNAPyl2) on UAG suppression. The G73A mutation in tRNAPyl1 decreased suppression efficiency markedly (Figure 4A, lanes 9–10), which is consistent with a previous result using a G73A Methanosarcina barkeri tRNAPyl variant that reduced suppression efficiency by ≈ 70% (6). Interestingly, the A73G mutation in tRNAPyl2 also led to a drastic decrease in amber suppression efficiency (Figure 4B, lanes 9–10). These findings support the notion that PylRS1 with tRNAPyl1, and PylRS2 with tRNAPyl2 are cognate pairs and that the nature of the discriminator base alone determines the mutual orthogonality of tRNAPyl1 and tRNAPyl2 (Figure 1).
The two HMET1 PylRS/tRNAPyl pairs are mutually orthogonal
To better understand the role of the A or G discriminator base in tRNAPyl1 and tRNAPyl2, respectively, we analyzed the predicted structures of PylRS1 and PylRS2 superimposed onto the crystal structure of the Desulfitobacterium hafniense PylRS-tRNAPyl complex (47). We were especially interested in the aaRS–tRNA interaction surface close to the discriminator base, focusing on the enzyme's motif 2 loop that is proximal to N73 and plays an essential role in the recognition of the tRNA CCA terminus. To our surprise, we found that the PylRS2 motif 2 loop is shorter than that of PylRS1 (Figure 5A).
Figure 5.
The A73 discriminator of tRNAPyl2 and a shorter motif 2 loop in PylRS2 determine the mutual orthogonality between the two HMET1 PylRS/tRNAPyl pairs. (A) 3D structural models of HMET1 PylRS1 and PylRS2 (blue and green) in complex with Desulfitobacterium hafniense tRNAPyl (yellow). Magnified view shows the recognition of the CCA terminus by the motif 2 loop. (B-D) Production of SAMP1Bock24-MoaE conjugate was mediated by cognate, non-cognate and engineered PylRS/tRNAPyl pairs analyzed with anti-Flag immunoblotting (IB) in cell lysate. mPylRS1 (PylRS1 with motif 2 loop from PylRS2) and mtRNAPyl1 (tRNAPyl1 with G73A mutation) are labelled red. Migration of SAMP1Bock24-MoaE conjugate is indicated. Schematic on the right summarizing the activity and functional orthogonality between different cognate, non-cognate and engineered HMET1 PylRS/tRNAPyl pairs. Black arrows indicate high activity, and dashed grey arrows indicate minimal activity. Experiments were performed in triplicates, and representative images are shown. See Materials and Methods for details.
Next, we aimed to study whether the two PylRS/tRNAPyl pairs are mutually orthogonal in their aminoacylation specificity when expressed in H. volcanii. We swapped the PylRS and tRNAPyl coding sequences between expression vectors producing PylRS/tRNAPyl pairs and the SAMP1BocK24 reporter protein. Minimal readthrough of the amber codon was observed when the HMET1 PylRS paired with its non-cognate tRNAPyl in the presence or in the absence of BocK (Figure 5B, lanes 5–6 and 11–12). This result demonstrates that these two HMET1 PylRS/tRNAPyl pairs are mutually orthogonal in H. volcanii. To further validate this result, we rationally transplanted each unique feature (A73 or shorter motif 2 loop) from the PylRS2/tRNAPyl2 pair to the PylRS1/tRNAPyl1 pair. The PylRS1 with the short motif 2 loop from PylRS2 and tRNAPyl1 with G73A mutation are designated as mPylRS1 and mtRNAPyl1, respectively (Figure 5C-D). We compared the production of the SAMP1BocK24-MoaE conjugate by these engineered PylRS/tRNAPyl pairs to that mediated by HMET1 cognate PylRS/tRNAPyl pairs. Surprisingly, mPylRS1/tRNAPyl2 and PylRS2/mtRNAPyl1 pairs showed UAG suppression efficiency comparable to that of HMET1 cognate PylRS1/tRNAPyl pairs (Figure 5C-D, lanes 7 versus 3). In contrast, no SAMP1Bock24-MoaE conjugate was detected in the presence of engineered mPylRS1/tRNAPyl1 and PylRS1/mtRNAPyl1 pairs (Figure 5C-D, lane 9). These results demonstrate that the shorter motif 2 loop in PylRS2 combined with the discriminator base A73 in tRNAPyl2 determine the mutual orthogonality of the two PylRS/tRNAPyl pairs in HMET1.
Motif 2 loop length of PylRS2 determines its ability to acylate tRNAPyl2
To better understand the relationship of the short motif 2 loop in PylRS2 and the discriminator base A73 in tRNAPyl2, we constructed and examined a multiple amino acid sequence alignment of HMET1 PylRS1 and PylRS2 with the other known C-terminal domain-only PylRS enzymes. The most remarkable observed difference was that HMET1 PylRS2 contains a shorter (by one amino acid residue) motif 2 loop (Figure 6A). Next, we sought to demonstrate that this single amino acid deletion accounts for its activity towards tRNAPyl2. We constructed a series of PylRS2 variants with a shortened motif 2 by rational design (sV1-sV6). We also generated a series of PylRS2 variants with the full-length canonical motif 2 loop as found in other PylRSs (cV1-6). We compared the production of the SAMP1BocK24-MoaE conjugate by these engineered PylRS/tRNAPyl pairs to that mediated by the wild-type PylRS2/tRNAPyl2 pair. All PylRS2 variants (sV1-6) with one amino acid deletion in the motif 2 loop could still function with tRNAPyl2, although these PylRS2/tRNAPyl pairs varied in UAG suppression efficiency (Figure 6B, lanes 4, 6, 8, 10, 12 and 14). In contrast, we observed minimal amber codon readthrough by engineered PylRS/tRNAPyl pairs consisting of ‘full-length’ PylRS2 variants (cV1-6) and tRNAPyl2 in the presence of BocK (Figure 6C, lanes 4, 6, 8, 10, 12 and 14), compared to the wild-type PylRS2/tRNAPyl2 pair (Figure 6C, lane 2). These findings suggest that the length of motif 2 loop in PylRS2 determines the capacity of these enzyme variants to acylate the non-canonical tRNAPyl2 (with the discriminator base A73).
Figure 6.
Motif 2 loop length of PylRS2 determines its ability to acylate tRNAPyl2. (A) Multiple amino acid sequence alignment of ΔPylSn enzymes from HMET1, M. luminyensis B10 and M. alvus. The predicted secondary structure elements including α-helices, β-sheets and random coils are indicated above the PylRS sequence. Residues corresponding to the motif 2 loop of PylRS are highlighted in red. (B) Production of SAMP1Bock24-MoaE conjugate by PylRS2 variants (sV1-6) co-expressed with HMET1 tRNAPyl2 analyzed with anti-Flag IB in cell lysate. The sV1-6 are PylRS2 variants with a shortened motif 2 loop (one amino acid deletion). (C) Production of SAMP1Bock24-MoaE conjugate by PylRS2 variants (cV1-6) co-expressed with HMET1 tRNAPyl2 analyzed with anti-Flag IB in cell lysate. The cV1-V6 are PylRS2 variants with the full-length canonical motif 2 loop as found in other six ΔPylSn enzymes from M. mazei, D. hafniense, M. alvus, Candidatus Methanoplasma termitum MpT1, Candidatus Methanomassiliicoccus intestinalis Issoire-Mx1 and Methanomassiliicoccus luminyensis B10. The amino acid sequence of motif 2 loop in PylRS2 variants (sV1-6 and cV1-6) is shown in the table on top of the immunoblot images. Migration of SAMP1Bock24-MoaE conjugate is indicated. Experiments were performed in duplicate, and representative images are shown. See Materials and Methods for details.
In the absence of BocK, the UAG readthrough efficiency of SAMP1UAG24 by tRNAPyl2 and PylRS variants (sV1-6) was low, but non-negligible (Figure 6B, lanes 3, 5, 7, 9, 11 and 13). To determine the nature of the mis-incorporated canonical amino acids, the relevant SAMP1-MoaE conjugate products formed by the PylRS2 variants sV1 and sV6 (Figure 6B, lanes 3 and 13, respectively) were isolated and analyzed by LC-LC/MS. Lys and Gln were found to be mis-incorporated when PylRS2 variant sV1 (DSKN) was used (Supplementary Figure S5), and Glu, Lys and Gln were identified using PylRS2 variant sV6 (SGKN) (Supplementary Table S5 and Supplementary Figure S6). To answer the question whether these mis-incorporated amino acids are carried by tRNAPyl2 to the ribosome, we undertook an experiment in the absence of HMET1 tRNAPyl2. Only minimal UAG readthrough was observed with PylRS2 variants (sV1-sV6) in the presence or in the absence of BocK (Supplementary Figure S7A and Supplementary Figure S7B, lanes 9–14 versus lanes 3–8). Thus, it appears that tRNAPyl2 is mainly responsible for the observed low level misincorporation. As shown above, the PylRS2/tRNAPyl2 pair is completely orthogonal with respect to the endogenous H. volcanii aaRSs and tRNAs (Figure 5B and Supplementary Figure S7B lane1); thus, we suggest that engineering of the PylRS2 motif 2 could impair binding to tRNAPyl2, thus allowing endogenous aaRSs to charge canonical amino acids (e.g. Lys or Gln) onto tRNAPyl2.
Encoding two distinct ncAAs into one protein by the mutually orthogonal HMET1 PylRS/tRNAPyl pairs
As a demonstration of the orthogonality of the two PylRS/tRNAPyl systems, we decided to incorporate two distinct ncAAs into a single polypeptide. To achieve this goal, engineering of the amino acid-binding pocket of PylRS1 and PylRS2 for selective incorporation of two different ncAAs, and construction of a tRNAPyl species able to suppress UAA or UGA were required. The repertoire of ncAAs that can be acylated to tRNAPyl by PylRS variants has been greatly expanded in recent years so that PylRS variants exist that are selective for several ncAA substrates but exclude others (5). For instance, in previous work, we constructed a highly active PylRS mutant (IFRS; Asn346Ser/Cys348Ile) that is selective for 3-iodo-l-Phe (3-I-Phe) but rejects BocK (48). From the alignment of the amino acid-binding pockets of HMET1 PylRS1 and IFRS, we identified Asn163 and Val165 in PylRS1 as the residues corresponding to Asn346 and Cys348 in IFRS, and introduced the corresponding Asn163Ser and Val165Ile amino acid substitutions into HMET1 PylRS1 (PylRS1-SI). The PylRS1-SI/tRNAPyl1 pair exhibited strong amber suppression in a 3-I-Phe-dependent manner as measured by the production of a SAMP1(3I)24-MoaE conjugate (Figure 7A). Given that PylRS lacks anticodon recognition (6) and our PylRS/tRNAPyl pairs would decode diverse codons (49), we next mutated the tRNAPyl2 anticodon to generate tRNAPyl2UUA which would facilitate BocK incorporation into SAMP1 at the position directed by the UAA codon. Based on the analysis of the SAMP1 structure, the Gly14 codon was mutated to UAA for BocK incorporation (Figure 3D). We transformed H. volcanii with plasmids encoding the HMET1 PylRS2/tRNAPyl2UUA pair and SAMP1(UAA14), and then cultured the cells in the presence or absence of 1 mM BocK. The PylRS2/tRNAPyl2UUA pair exhibited ochre suppression in a BocK-dependent manner (Figure 7B). Only a small amount of SAMP1BocK14-MoaE conjugate was formed by the PylRS2/tRNAPyl2UUA pair, which is consistent with the previous finding that UAA/UGA mediated suppression is inefficient in H. volcanii (45). Thus, we now developed PylRS/tRNAPyl pairs for site-specific incorporation in H. volcanii of 3-I-Phe and BocK directed by UAG and UAA codons, respectively.
Figure 7.
Encoding two distinct ncAAs using mutually orthogonal HMET1 PylRS/tRNAPyl pairs. (A) Production of SAMP1(3I)24-MoaE conjugate by PylRS1-SI/tRNAPyl1 pair analyzed by anti-Flag immunoblotting (IB) in cell lysate. (B) Production of the SAMP1BocK14-MoaE conjugate by PylRS2/tRNAPyl2UUA pair analyzed by anti-Flag IB in cell lysate. (C) Detection of purified SAMP1(3I)14Bock24-MoaE conjugate produced by PylRS1-SI/tRNAPyl1 and PylRS2/tRNAPyl2UUA pairs. Total 100 ml H. volcanii cells expressing the one-plasmid system were grown in the presence and absence of 3-I-phe and BocK followed by the anti-Flag immunoprecipitation to enrich the SAMP1(3I)14Bock24-MoaE conjugate. Migration of SAMP1(3I)14Bock24-MoaE conjugate is indicated. The same amount of cellular proteins were utilized for immuno-enrichment of Flag-tagged proteins confirmed by Coomassie blue staining of the input. Experiments were performed in duplicates, and representative images are shown. See Materials and Methods for details.
To demonstrate simultaneous incorporation of 3-I-Phe and BocK into a single protein in response to two distinct codons in H. volcanii, we chose a one-plasmid system encoding two copies of SAMP1UAA14, UAG24 and one copy of PylRS1-SI/tRNAPyl1 and PylRS2/tRNAPyl2UUA (Supplementary Figure S8). H. volcanii cells expressing the one-plasmid system were grown in the presence or absence of 3-I-Phe and BocK followed by the anti-Flag immunoprecipitation to enrich the reporter proteins. Production of a SAMP1(3I)14Bock24-MoaE conjugate was dependent on the addition of both ncAAs (Figure 7C). Further LC-MS/MS analysis of the purified reporter protein confirmed that BocK and 3-I-Phe were incorporated at the TAA and TAG-directed positions respectively (Supplementary Figure S9). This result demonstrates that the HMET1 PylRS/tRNAPyl pairs function in the same heterologous H. volcanii cell to incorporate distinct ncAAs into a single polypeptide programmed by different stop codons. Thus, mutual orthogonality, most likely, was the original property of these PylRS/tRNAPyl pairs in HMET1.
A different scenario has been reported previously where PylRS/tRNA pairs from different archaea were shown to be mutually orthogonal based on their tRNA recognition mode. This allowed the rational design of two orthogonal PylRS/tRNA pairs with distinct amino acid specificities that can operate together in a host cell (13,17).
DISCUSSION
Potential applications of HMET1 PylRS/tRNAPyl-derived pairs in archaea
A variety of PylRS/tRNAPyl-derived pairs have been developed in bacteria and eukaryotic cells to incorporate useful Pyl derivatives into proteins of interest (50), enabling many important applications including identification of weak protein-protein interactions in living cells (51), generation of physiologically relevant proteins with authentic post-translational modifications (e.g. acetylation) (52), and protein labelling for cellular imaging (53). However, genetic incorporation of ncAA into protein mediated by engineered aaRS/tRNA pairs in archaea, the third domain of life, has never been reported before. As shown in this work, it is facile to generate designated PylRS/tRNAPyl pairs in H. volcanii by transplanting previously reported mutations into HMET1 PylRS/tRNAPyl pairs. We envision HMET1 PylRS/tRNAPyl-derived pairs could be further engineered and optimized to enrich the toolbox for dissecting fundamental biological processes in archaea.
Halophilic enzymes are of major interest to many industrial applications as they serve as efficient catalysts to produce biomaterials and biofuels in low water activity conditions (54). For instance, halophilic pyrophosphatase displays superior catalytic activity at high concentrations of organic solvents (55). However, expression of halophilic enzymes in heterologous hosts, such as E. coli, is often problematic because these proteins tend to misfold and aggregate in a low ionic strength environment. Here we demonstrate that H. volcanii can function as a facile chassis to characterize novel enzymes, discovered by mining metagenomic and microbial genomic data, from halophiles that are usually difficult or impossible to culture. In particular, novel aaRS candidates identified in the microbial dark matter (56,57) could be good targets. Furthermore, the ability to genetically encode ncAAs has greatly expanded structural and functional repertoire of proteins. Examples include catalytic efficiency and thermal stability of proteins improved by genetic code expansion (58–60). Thus, introducing ncAAs into proteins of interest in H. volcanii in vivo using the HMET1 PylRS/tRNAPyl-derived pairs can be expected to open a new avenue for halophilic protein engineering and biotechnological applications.
Why have two PylRS/tRNAPylpairs in the same organism?
A simple argument suggests that two ΔPylSn gene copies could be needed for efficient translation of in-frame UAG stop codons with Pyl in the relevant genes in the HMET1 genome. However, the data in Supplementary Figure S1 indicate that this is not the case because the HMET1 genome contains an average number of such genes compared with the seven other archaea that only possess a single ΔPylSn copy. These two PylRS enzymes might not be expressed under the same growth conditions, but rather, in response to different environmental cues, ensuring the presence of at least one active PylRS/tRNAPy pair at all times during the growth cycle. The fact the HMET1 PylRS/tRNAPyl pairs are mutually orthogonal is far more interesting. Although the translation of UAG codons with BocK suggests that both HMET1 PylRS enzymes recognize Pyl, their in vivo amino acid specificities in HMET1 cells remain to be determined biochemically. The possibility remains that one HMET1 PylRS/tRNAPyl pair evolved a different amino acid specificity, necessitating the simultaneous presence of the two mutually orthogonal pairs.
Which of the two PylRS/tRNAPyl is ancestral?
The duplication of the ΔPylSn gene in HMET1, followed by the emergence of two orthogonal ΔPylSn/tRNAPyl pairs, is so far unique among the available genomes. The only detectable ΔPylSn protein in the current databases with a deletion in the motif 2 loop is KXB05636.1, a truncated PylRS with 62% sequence identity to HMET1 PylRS2 (61). This protein is encoded in a contig from a candidate division MSBL1 archaeon SCGC-AAA382A03 that also belongs to Methanonatronarchaeia, suggesting that the duplication event occurred at some point in the evolution of this class of archaea. Furthermore, the tree built using pylT genes does not show any major deviations from the pylS tree (Supplementary Figure S10), suggesting that pylT was a part of the duplicated region, evolving in parallel with the cognate PylRS (Figure 1). Comparison of the pylS gene neighborhoods (Supplementary Table S4) suggests that PylRS2 is the ancestral form because it is encoded in the vicinity of the pair of genes encoding the putative permease RhaT and an uncharacterized protein from arCOG04962 (Figure 1). The latter pair of tightly linked genes is present in similar neighborhoods in the majority of Thermoplasmatales genomes that contain a single pylS gene, so this gene arrangement is likely to be ancestral. If this is the case, the duplication of a large region encoding pylS, pylB and several other genes, including methylamine methyltransferases with an in-frame UAG codon, most likely, occurred before the deletion in the PylRS2 motif 2 loop. A similar independent duplication that occurred in the lineage of M. luminyensis B10 did not result in analogous changes in either ΔPylSn or the cognate tRNAPyl. It remains to be determined whether the ΔPylSn/tRNAPyl pairs in this species are mutually orthogonal.
Why is the PylRS2/tRNAPyl2 pair absent in the overwhelming majority of Pyl encoding organisms?
As mentioned above, apart from HMET1, only a candidate division MSBL1 archaeon SCGC-AAA382A03 (61) may have a PylRS2-like enzyme (with a 1 aa deletion). Likewise, the presence of a discriminator A73 tRNAPyl so far was detected only in these two organisms (61). Why did PylRS identity switch to a G73 discriminator base? A possible reason might be the increased substrate specificity conferred by the G73 discriminator base given that the majority of tRNAs use the A73 identity element. For PylRS enzymes (e.g. Desulfitobacterium hafniense) a G73→A73 switch lowers the catalytic efficiency ∼100 fold (62).
tRNA identity and genetic code
The mutually orthogonal PylRS/tRNAPyl pairs in HMET1 provide the clearest example yet for the importance of a single base change in the discriminator position leading to a concurrent identity change.
The co-existence and natural orthogonality of the two HMET1 PylRS/tRNA pairs also offer unique insights into the prospect of a still expanding genetic code. Studies based on the conspicuous absence of the tRNAGlyACC species from eukaryotic genomes suggested that genetic code expansion halted due to functional and structural constraints in the tRNA structure that limit the effective use of newly evolved tRNA identities (63). However, the HMET1 PylRS/tRNA pairs showcase that new tRNA identities can emerge with minimal changes of a tRNA if an aaRS partner is able to adapt to those changes. Similar cases were recently reported for discriminator base variants of a methanogen tRNACys coupled with a cysteinyl-tRNA synthetase variant with a mutated small loop (64), and for a Streptomyces tRNA with Ala anticodon that is exclusively acylated with Pro by a noncanonical prolyl-tRNA synthetase (65). Thus, simultaneous/tandem duplication of tRNA and aaRS genes might lead to orthogonal aaRS/tRNA pairs able to evolve novel amino acid specificities.
DATA AVAILABILITY
The mass spectrometry raw data have been deposited to the ProteomeXchange Consortium under the dataset identifier PXD027053 as well as CNGB Sequence Archive (CNSA) of China National GeneBank DataBase (CNGBdb) with accession number CNP0001963.
Supplementary Material
ACKNOWLEDGEMENTS
We are grateful to the China National GeneBank for DNA synthesis and Yuliang Dong, Fei Guo and Yue Zheng at BGI-Shenzhen for their assistance with the immunoprecipitation experiments. We thank Dr. Thorsten Allers for providing the H. volcanii H26 strain.
Notes
Present address: Qianqian Zhao, Department of Rheumatology and Immunology, Shenzhen People's Hospital, The Second Clinical Medical College of Jinan University, Shenzhen, 518020, China; Integrated Chinese and Western Medicine Postdoctoral Research Station, Jinan University, Guangzhou, 510632, China.
Present address: Takahito Mukai, Department of Life Science, College of Science, Rikkyo University, 3-34-1 Nishi-Ikebukuro, Toshima-ku, Tokyo 171-8501, Japan.
Present address: Kazuaki Amikura, Department of Interdisciplinary Space Science, Institute of Space and Astronautical Science, Japan Aerospace Exploration Agency, Kanagawa, Japan.
Contributor Information
Haolin Zhang, BGI-Shenzhen, Shenzhen, 518083, China.
Xuemei Gong, BGI-Shenzhen, Shenzhen, 518083, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
Qianqian Zhao, BGI-Shenzhen, Shenzhen, 518083, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
Takahito Mukai, Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06511, USA.
Oscar Vargas-Rodriguez, Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06511, USA.
Huiming Zhang, BGI-Shenzhen, Shenzhen, 518083, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
Yuxing Zhang, BGI-Shenzhen, Shenzhen, 518083, China; Sino-Danish College, University of the Chinese Academy of Sciences, Beijing, China.
Paul Wassel, Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida, Gainesville, FL 32611, USA.
Kazuaki Amikura, Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06511, USA.
Julie Maupin-Furlow, Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida, Gainesville, FL 32611, USA; Genetics Institute, University of Florida, Gainesville, FL 32611, USA.
Yan Ren, BGI-Shenzhen, Shenzhen, 518083, China.
Xun Xu, BGI-Shenzhen, Shenzhen, 518083, China.
Yuri I Wolf, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
Kira S Makarova, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
Eugene V Koonin, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
Yue Shen, BGI-Shenzhen, Shenzhen, 518083, China; Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen 518120, China; Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.
Dieter Söll, Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06511, USA; Department of Chemistry, Yale University, New Haven, CT 06511, USA.
Xian Fu, BGI-Shenzhen, Shenzhen, 518083, China; Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen 518120, China.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
National Key Research and Development Program of China (No. 2018YFA0900100); National Natural Science Foundation of China (No. 31901029 to X.F. and No. 31800078 to Y. S.); Natural Science Foundation of Guangdong Province, China (No. 2021A1515010995 to X.F.); Guangdong Provincial Key Laboratory of Genome Read and Write (No. 2017B030301011); National Institute of General Medical Sciences (R35GM122560 and 3R35GM122560-05S1 to D.S.; R01GM057498 to J.M.F.); Department of Energy, Office of Science, Office of Basic Energy Sciences (DE-FG02-98ER20311 to D.S.; DE-FG02-05ER15650 to J.M.F.). The YIW, KSM and EVK research is supported by the NIH Intramural Research Program at the National Library of Medicine, US Department of Health and Human Services.
Conflict of interest statement. None declared.
REFERENCES
- 1. Giege R., Sissler M., Florentz C.. Universal rules and idiosyncratic features in tRNA identity. Nucleic Acids Res. 1998; 26:5017–5035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Crothers D.M., Seno T., Söll D. Is there a discriminator site in transfer RNA. Proc. Natl. Acad. Sci. USA. 1972; 69:3063–3067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Gaston M.A., Jiang R., Krzycki J.A.. Functional context, biosynthesis, and genetic encoding of pyrrolysine. Curr. Opin. Microbiol. 2011; 14:342–349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Polycarpo C., Ambrogelly A., Berube A., Winbush S.M., McCloskey J.A., Crain P.F., Wood J.L., Söll D. An aminoacyl-tRNA synthetase that specifically activates pyrrolysine. Proc. Natl. Acad. Sci. USA. 2004; 101:12450–12454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Wan W., Tharp J.M., Liu W.R.. Pyrrolysyl-tRNA synthetase: an ordinary enzyme but an outstanding genetic code expansion tool. Biochim. Biophys. Acta. 2014; 1844:1059–1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Ambrogelly A., Gundllapalli S., Herring S., Polycarpo C., Frauer C., Söll D. Pyrrolysine is not hardwired for cotranslational insertion at UAG codons. Proc. Natl. Acad. Sci. USA. 2007; 104:3141–3146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. de la Torre D., Chin J.W.. Reprogramming the genetic code. Nat. Rev. Genet. 2021; 22:169–184. [DOI] [PubMed] [Google Scholar]
- 8. Crnkovic A., Suzuki T., Söll D., Reynolds N.M.. Pyrrolysyl-tRNA synthetase, an aminoacyl-tRNA synthetase for genetic code expansion. Croat. Chem. Acta. 2016; 89:163–174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Krahn N., Tharp J.M., Crnković A., Söll D. Engineering aminoacyl-tRNA synthetases for use in synthetic biology. Enzymes. 2020; 48:351–395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Borrel G., Gaci N., Peyret P., O’Toole P.W., Gribaldo S., Brugere J.F.. Unique characteristics of the pyrrolysine system in the 7th order of methanogens: implications for the evolution of a genetic code expansion cassette. Archaea. 2014; 2014:374146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Mukai T., Crnkovic A., Umehara T., Ivanova N.N., Kyrpides N.C., Söll D. RNA-dependent cysteine biosynthesis in bacteria and archaea. Mbio. 2017; 8:e00561–e00517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Chibani C.M., Mahnert A., Borrel G., Almeida A., Werner A., Brugère J.F., Gribaldo S., Finn R.D., Schmitz R.A., Moissl-Eichinger C.. A catalogue of 1,167 genomes from the human gut archaeome. Nat Microbiol. 2022; 7:48–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Willis J.C.W., Chin J.W.. Mutually orthogonal pyrrolysyl-tRNA synthetase/tRNA pairs. Nat. Chem. 2018; 10:831–837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Yamaguchi A., Iraha F., Ohtake K., Sakamoto K.. Pyrrolysyl-tRNA synthetase with a unique architecture enhances the availability of lysine derivatives in synthetic genetic codes. Molecules. 2018; 23:2460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Seki E., Yanagisawa T., Kuratani M., Sakamoto K., Yokoyama S.. Fully productive cell-free genetic code expansion by structure-based engineering of methanomethylophilusalvus pyrrolysyl-tRNA synthetase. ACS Synth Biol. 2020; 9:718–732. [DOI] [PubMed] [Google Scholar]
- 16. Beranek V., Willis J.C.W., Chin J.W.. An evolved methanomethylophilusalvus pyrrolysyl-tRNA synthetase/tRNA pair is highly active and orthogonal in mammalian cells. Biochemistry. 2019; 58:387–390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Meineke B., Heimgärtner J., Lafranchi L., Elsässer S.J.. Methanomethylophilus alvus Mx1201 provides basis for mutual orthogonal pyrrolysyl tRNA/aminoacyl-tRNA synthetase pairs in mammalian cells. ACS Chem. Biol. 2018; 13:3087–3096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Tharp J.M., Vargas-Rodriguez O., Schepartz A., Söll D. Genetic encoding of three distinct noncanonical amino acids using reprogrammed initiator and nonsense codons. ACS Chem. Biol. 2021; 16:766–774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Cao L., Liu J., Ghelichkhani F., Rozovsky S., Wang L.. Genetic incorporation of ϵ-N-benzoyllysine by engineering methanomethylophilusalvus pyrrolysyl-tRNA synthetase. ChemBioChem. 2021; 22:2530–2534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Dunkelmann D.L., Willis J.C.W., Beattie A.T., Chin J.W.. Engineered triply orthogonal pyrrolysyl–tRNA synthetase/tRNA pairs enable the genetic encoding of three distinct non-canonical amino acids. Nature Chem. 2020; 12:535–544. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Liu J., Cao L., Klauser P.C., Cheng R., Berdan V.Y., Sun W., Wang N., Ghelichkhani F., Yu B., Rozovsky S.et al.. A genetically encoded fluorosulfonyloxybenzoyl-L-lysine for expansive covalent bonding of proteins via sufex chemistry. J. Am. Chem. Soc. 2021; 143:10341–10351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Sorokin D.Y., Makarova K.S., Abbas B., Ferrer M., Golyshin P.N., Galinski E.A., Ciordia S., Mena M.C., Merkel A.Y., Wolf Y.I.et al.. Discovery of extremely halophilic, methyl-reducing euryarchaea provides insights into the evolutionary origin of methanogenesis. Nat Microbiol. 2017; 2:17081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Sorokin D.Y., Merkel A.Y., Abbas B., Makarova K.S., Rijpstra W.I.C., Koenen M., Sinninghe Damsté J.S., Galinski E.A., Koonin E.V., van Loosdrecht M.C.M.. Methanonatronarchaeum thermophilum gen. nov., sp. nov. and 'Candidatus methanohalarchaeum thermophilum', extremely halo(natrono)philic methyl-reducing methanogens from hypersaline lakes comprising a new euryarchaeal class methanonatronarchaeia classis nov. Int. J. Syst. Evol. Microbiol. 2018; 68:2199–2208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Aouad M., Borrel G., Brochier-Armanet C., Gribaldo S.. Evolutionary placement of methanonatronarchaeia. Nat Microbiol. 2019; 4:558–559. [DOI] [PubMed] [Google Scholar]
- 25. Martijn J., Schön M.E., Lind A.E., Vosseberg J., Williams T.A., Spang A., Ettema T.J.G.. Hikarchaeia demonstrate an intermediate stage in the methanogen-to-halophile transition. Nat. Commun. 2020; 11:5490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Sorokin D.Y., Makarova K.S., Abbas B., Ferrer M., Golyshin P.N., Galinski E.A., Ciorda S., Mena M.C., Merkel A.Y., Wolf Y.I.et al.. Reply to 'Evolutionary placement of methanonatronarchaeia. Nat Microbiol. 2019; 4:560–561. [DOI] [PubMed] [Google Scholar]
- 27. Rinke C., Chuvochina M., Mussig A.J., Chaumeil P.A., Davín A.A., Waite D.W., Whitman W.B., Parks D.H., Hugenholtz P.. A standardized archaeal taxonomy for the genome taxonomy database. Nat Microbiol. 2021; 6:946–959. [DOI] [PubMed] [Google Scholar]
- 28. Borrel G., Adam P.S., Gribaldo S.. Methanogenesis and the wood-ljungdahl pathway: an ancient, versatile, and fragile association. Genome Biol Evol. 2016; 8:1706–1711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Brugère J.F., Atkins J.F., O’Toole P.W., Borrel G.. Pyrrolysine in archaea: a 22nd amino acid encoded through a genetic code expansion. Emerg Top Life Sci. 2018; 2:607–618. [DOI] [PubMed] [Google Scholar]
- 30. Li S., He Y., Lin Z., Xu S., Zhou R., Liang F., Wang J., Yang H., Liu S., Ren Y.. Digging more missing proteins using an enrichment approach with proteominer. J. Proteome Res. 2017; 16:4330–4339. [DOI] [PubMed] [Google Scholar]
- 31. Kelley L.A., Mezulis S., Yates C.M., Wass M.N., Sternberg M.J.. The phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 2015; 10:845–858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Pettersen E.F., Goddard T.D., Huang C.C., Couch G.S., Greenblatt D.M., Meng E.C., Ferrin T.E.. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 2004; 25:1605–1612. [DOI] [PubMed] [Google Scholar]
- 33. Makarova K.S., Wolf Y.I., Koonin E.V.. Towards functional characterization of archaeal genomic dark matter. Biochem. Soc. Trans. 2019; 47:389–398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Edgar R.C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004; 32:1792–1797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Minh B.Q., Schmidt H.A., Chernomor O., Schrempf D., Woodhams M.D., von Haeseler A., Lanfear R.. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 2020; 37:1530–1534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Altschul S.F., Madden T.L., Schäffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J.. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997; 25:3389–3402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Marck C., Grosjean H.. Identification of BHB splicing motifs in intron-containing tRNAs from 18 archaea: evolutionary implications. RNA. 2003; 9:1516–1531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Sugahara J., Yachie N., Arakawa K., Tomita M.. In silico screening of archaeal tRNA-encoding genes having multiple introns with bulge-helix-bulge splicing motifs. RNA. 2007; 13:671–681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Hoffman K.S., Crnković A., Söll D. Versatility of synthetic tRNAs in genetic code expansion. Genes (Basel). 2018; 9:537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Siglioccolo A., Paiardini A., Piscitelli M., Pascarella S.. Structural adaptation of extreme halophilic proteins through decrease of conserved hydrophobic contact surface. BMC Struct. Biol. 2011; 11:50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Mevarech M., Frolow F., Gloss L.M.. Halophilic enzymes: proteins with a grain of salt. Biophys. Chem. 2000; 86:155–164. [DOI] [PubMed] [Google Scholar]
- 42. Hartman A.L., Norais C., Badger J.H., Delmas S., Haldenby S., Madupu R., Robinson J., Khouri H., Ren Q., Lowe T.M.et al.. The complete genome sequence of haloferax volcanii DS2, a model archaeon. PLoS One. 2010; 5:e9605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Yang A., Ha S., Ahn J., Kim R., Kim S., Lee Y., Kim J., Söll D., Lee H.Y., Park H.S.. A chemical biology route to site-specific authentic protein modifications. Science. 2016; 354:623–626. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Fu X., Adams Z., Liu R., Hepowit N.L., Wu Y., Bowmann C.F., Moskovitz J., Maupin-Furlow J.A.. Methionine sulfoxide reductase a (MsrA) and its function in ubiquitin-like protein modification in archaea. Mbio. 2017; 8:e01169-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Bhattacharya A., Köhrer C., Mandal D., RajBhandary U.L.. Nonsense suppression in archaea. Proc. Natl. Acad. Sci. USA. 2015; 112:6015–6020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Fu X., Liu R., Sanchez I., Silva-Sanchez C., Hepowit N.L., Cao S., Chen S., Maupin-Furlow J.. Ubiquitin-like proteasome system represents a eukaryotic-like pathway for targeted proteolysis in archaea. Mbio. 2016; 7:e00379-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Nozawa K., O’Donoghue P., Gundllapalli S., Araiso Y., Ishitani R., Umehara T., Söll D., Nureki O.. Pyrrolysyl-tRNA synthetase-tRNAPyl structure reveals the molecular basis of orthogonality. Nature. 2009; 457:1163–1167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Guo L.T., Wang Y.S., Nakamura A., Eiler D., Kavran J.M., Wong M., Kiessling L.L., Steitz T.A., O’Donoghue P., Söll D. Polyspecific pyrrolysyl-tRNA synthetases from directed evolution. Proc. Natl. Acad. Sci. USA. 2014; 111:16724–16729. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Wan W., Huang Y., Wang Z., Russell W.K., Pai P.J., Russell D.H., Liu W.R.. A facile system for genetic incorporation of two different noncanonical amino acids into one protein in escherichia coli. Angew. Chem. Int. Ed Engl. 2010; 49:3211–3214. [DOI] [PubMed] [Google Scholar]
- 50. Chin J.W. Expanding and reprogramming the genetic code of cells and animals. Annu. Rev. Biochem. 2014; 83:379–408. [DOI] [PubMed] [Google Scholar]
- 51. Hino N., Okazaki Y., Kobayashi T., Hayashi A., Sakamoto K., Yokoyama S.. Protein photo-cross-linking in mammalian cells by site-specific incorporation of a photoreactive amino acid. Nat. Methods. 2005; 2:201–206. [DOI] [PubMed] [Google Scholar]
- 52. O’Donoghue P., Ling J., Wang Y.S., Söll D. Upgrading protein synthesis for synthetic biology. Nat. Chem. Biol. 2013; 9:594–598. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Lang K., Chin J.W.. Cellular incorporation of unnatural amino acids and bioorthogonal labeling of proteins. Chem. Rev. 2014; 114:4764–4806. [DOI] [PubMed] [Google Scholar]
- 54. DasSarma S., DasSarma P.. Halophiles and their enzymes: negativity put to good use. Curr. Opin. Microbiol. 2015; 25:120–126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. McMillan L.J., Hepowit N.L., Maupin-Furlow J.A.. Archaeal inorganic pyrophosphatase displays robust activity under high-salt conditions and in organic solvents. Appl. Environ. Microbiol. 2016; 82:538–548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Mukai T., Amikura K., Fu X., Söll D., Crnkovic A.. Indirect routes to aminoacyl-tRNA: the diversity of prokaryotic cysteine encoding systems. Front Genet. 2022; 12:794509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Mukai T., Reynolds N.M., Crnković A., Söll D. Bioinformatic analysis reveals archaeal tRNATyr and tRNATrp identities in Bacteria. Life. 2017; 7:8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Xiao H., Nasertorabi F., Choi S.H., Han G.W., Reed S.A., Stevens R.C., Schultz P.G.. Exploring the potential impact of an expanded genetic code on protein function. Proc. Natl. Acad. Sci. USA. 2015; 112:6961–6966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Li J.C., Liu T., Wang Y., Mehta A.P., Schultz P.G.. Enhancing protein stability with genetically encoded noncanonical amino acids. J. Am. Chem. Soc. 2018; 140:15997–16000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Ohtake K., Mukai T., Iraha F., Takahashi M., Haruna K.I., Date M., Yokoyama K., Sakamoto K.. Engineering an automaturing transglutaminase with enhanced thermostability by genetic code expansion with two codon reassignments. ACS Synth Biol. 2018; 7:2170–2176. [DOI] [PubMed] [Google Scholar]
- 61. Guan Y., Haroon M.F., Alam I., Ferry J.G., Stingl U.. Single-cell genomics reveals pyrrolysine-encoding potential in members of uncultivated archaeal candidate division MSBL1. Environ Microbiol Rep. 2017; 9:404–410. [DOI] [PubMed] [Google Scholar]
- 62. Herring S., Ambrogelly A., Polycarpo C.R., Söll D. Recognition of pyrrolysine tRNA by the desulfitobacteriumhafniense pyrrolysyl-tRNA synthetase. Nucleic Acids Res. 2007; 35:1270–1278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Saint-Leger A., Bello C., Dans P.D., Torres A.G., Novoa E.M., Camacho N., Orozco M., Kondrashov F.A., Ribas de Pouplana L.. Saturation of recognition elements blocks evolution of new tRNA identities. Sci. Adv. 2016; 2:e1501860. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Perona J.J., Rauch B.J., Driggers C.M.. Rampelotto P.H. Molecular Mechanisms of Microbial Evolution. 2018; Cham: Springer International Publishing; 371–408. [Google Scholar]
- 65. Vargas-Rodriguez O., Badran A.H., Hoffman K.S., Chen M., Crnkovic A., Ding Y., Krieger J.R., Westhof E., Söll D., Melnikov S.. Bacterial translation machinery for deliberate mistranslation of the genetic code. Proc. Natl. Acad. Sci. USA. 2021; 118:e2110797118. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The mass spectrometry raw data have been deposited to the ProteomeXchange Consortium under the dataset identifier PXD027053 as well as CNGB Sequence Archive (CNSA) of China National GeneBank DataBase (CNGBdb) with accession number CNP0001963.