Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Jun 5.
Published in final edited form as: J Am Chem Soc. 2022 Oct 4;144(41):19196–19203. doi: 10.1021/jacs.2c08985

A conserved histidine residue drives extein dependence in an enhanced atypically split intein

Giridhar Sekar a,#, Adam J Stevens b,$, Anahita Z Mostafavi b, Pulikallu Sashi a,#, Tom W Muir b,*, David Cowburn a,*
PMCID: PMC10241006  NIHMSID: NIHMS1859971  PMID: 36194550

Abstract

Split intein-mediated protein trans splicing (PTS) is widely applied in chemical biology and biotechnology to carry out traceless and specific protein ligation. However, the external residues immediately flanking the intein (exteins), can reduce the splicing rate, thereby limiting certain applications of PTS. Splicing by a recently developed intein with atypical split architecture (‘Cat’) exhibits a stark dependence on the sequence of its N-terminal extein residues. Here, we further developed Cat using error prone PCR and a cell-based selection assay to produce Cat* having greatly enhanced PTS activity in the presence of unfavorable N-extein residues. We then applied solution nuclear magnetic resonance spectroscopy and molecular dynamics simulations to explore how dynamics of a conserved B-block histidine residue (His78) contribute to this extein dependence. The enhanced extein tolerance of Cat* reported here should expand the applicability of atypically split inteins, and the mechanism highlights common principles that contribute to extein dependence.

Keywords: Protein trans splicing (PTS), Intein, Protein Chemistry, Protein Engineering, NMR Spectroscopy, MD simulation

Introduction

Protein splicing is a biochemical reaction wherein an intervening protein (intein) undergoes a posttranslational auto-processing event in which it self-excises while simultaneously ligating its flanking external residues (exteins) (Fig. S1) 1,2. Although most identified inteins are contiguous domains and undergo cis-splicing, some are endogenously found as fractured genes. The resulting naturally split inteins are expressed as two separate domains, the N-intein (IntN) and C-intein (IntC) which condense prior to undergoing protein trans-splicing (PTS) 3,4. Some naturally split inteins splice with exceptionally fast rates (t½ <1 min), and these ultrafast split inteins serve as powerful tools for protein engineering and synthetic biology 58. For example, ultrafast split inteins have been applied to synthetically incorporate chemical modifications such as fluorescent probes or posttranslational modifications within recombinantly expressed proteins, both in vitro and in vivo911. Moreover, split inteins have enabled the generation of cyclic peptide libraries for screening inhibitors of protein-protein interactions, and gene therapy strategies to deliver large genetic cargo1214.

Naturally split inteins canonically fracture to form an IntN and IntC of approximately 100 and 35 amino acids respectively. However, inteins with atypical split sites (small IntN and large IntC) have also been discovered 1517. Since the shorter intein fragment is readily accessible through chemical synthesis, atypically split inteins are especially suited for the N-terminal modification of recombinant proteins 18. Previously, we applied consensus design of inteins from the T4-bacteriophage-type DNA-packaging terminase large subunit (TerL) to engineer a consensus atypical intein (Cat) 17. Cat exhibits enhanced splicing activity and stability in ambient conditions compared to other members of the TerL family. However, despite these advantages, substitutions at the N-extein −1 position from the endogenous glutamate residue profoundly decrease splicing activity. Such extein dependence is common among inteins evolved within the context of specific flanking residues, and can significantly impair applications of PTS.

Prior efforts have both elucidated the molecular mechanism by which extein residues impact splicing and engineered inteins with minimal extein dependence 1923. Inherently, a Cys, Thr, or Ser residue is mandatory at the first position of the C-extein (+1) as the nucleophile for formation of the branched intermediate. Additional residues resembling the native proximal N- and C- extein sequence are often favored as well. For example, naturally split inteins from the α subunit of DNA Polymerase III (DnaE) strictly depend on the +2 C-extein residue19,21,22. This +2-position residue interacts with a catalytic F-block histidine (His125), and mutations that augment His125 dynamics can partially restore splicing activity in unfavorable extein contexts 2022. Furthermore, the atypically split VidaL intein favors glycine at the −1 position, which may facilitate scissile amide bond distortion through hydrogen bonding interactions with the side chains of a B-block Histidine and F-block Tyrosine23,24. However, efforts to relieve N-extein dependence in VidaL have been unsuccessful, and a naturally split intein with an atypical split site that exhibits minimal extein dependence has yet to be identified or engineered.

Here, through employing an unbiased approach using error prone polymerase chain reaction (PCR) and a cell-based selection, we identify mutations in the Cat intein that accelerate splicing in nonnative extein contexts. The enhanced sequence tolerance is validated in vitro, and the mechanism is explored through NMR spectroscopy and molecular dynamics (MD) simulations. A key catalytic histidine residue (His78) and a single substitution at an adjacent non-catalytic residue (S79P) are found to dramatically impact the activity, structure, and dynamics of the intein active site.

Results

Identification of a Cat mutation that imparts enhanced N-extein tolerance

Although Cat exhibits ultrafast splicing with its native Glu−1 residue (1.17± 0.10 ×10−2 s−1), we previously showed that N-extein substitutions such as Gly−1 (7.92 ± 0.33 ×10−5 s−1) or Ala−1 (2.14 ± 0.08 ×10−4 s−1) drastically reduce reaction rates (Fig. 1a, Table S1)16. This dependence is mainly localized to the −1 ExtN residue, as −2, +2 and +3 C-extein mutations are tolerated. To engineer enhanced activity under nonnative N-extein residues, we adapted a previously reported cell selection system in which kanamycin resistance is coupled to PTS activity through reconstitution of a split aminoglycoside-3’-phosphotransferase (KanR) from two individually transcribed fragments (Fig. 1b)20. We carried out error prone PCR on the CatN and CatC inteins, transformed the plasmid library into DH5α E. coli cells, and selected for increased survival in the presence of an unfavorable Ala−1 residue. A colony containing the mutations S70T and S79P (Cat*) was identified and shown to infer increased survival against kanamycin (Fig. 1c).

Figure 1.

Figure 1.

Extein-dependent protein trans-splicing of the Cat TerL split intein. (a) Left: Schematic depicting Cat ultrafast trans-splicing under a favorable extein context. Cat is embedded within a target protein of interest (POI) and flanked by its native extein residues (glutamate at the N-extein and cysteine at the C-extein). The −1 position of the N-extein is highlighted in red. Right: Schematic depicting Cat splicing when embedded within a POI in the context of an unfavorable extein context. Glycine is present at the −1 N-extein position and is highlighted in red. (b) Depiction of the PTS-dependent E. coli selection system. Top: The kanamycin resistance protein, KanR, is split and fused to N- and C-intein fragments (CatN and CatC). The region subject to error prone PCR is highlighted in blue. Bottom: Depiction of the selection system, wherein transformed E. coli cells are subject to increasing concentrations of kanamycin (Kan) on an LB-Agar plate. (c) LB-Agar selection of E. coli transformed with split KanR plasmids containing Cat with −1 glutamate (top), Cat Glu−1, Cat Ala−1 (middle), and Cat* Ala−1 (bottom).

We next validated the enhanced activity of the Cat* intein using an in vitro splicing assay (Fig. 2a). We individually expressed and purified intein fragments fused to the model N- and C-extein domains, Maltose Binding Protein (MBP) and Green Fluorescent protein (GFP) respectively. For both Cat and Cat*, three residues from the endogenous N- and C- exteins were preserved as linkers, with mutations present at the −1 N-extein position. The cognate intein fragments were mixed to a final concentration of 2 μM, incubated at 30 ˚C, and reaction progress was monitored by gel electrophoresis. Enhanced activity for Cat* is observed in the presence of Ala−1 (10-fold increase in splicing rate, t½ = 4 m), Gly−1 (16-fold increase, t½ = 9.2 m), Ser−1 (2.5- fold increase, t½ = 3 m), and Pro−1 (1.5-fold increase, t½ = 98 m) N-exteins compared to Cat (Fig. 2b, 2c, supplementary Table S1). Although activity decreases by 2-fold for Glu−1, Cat* splicing still proceeds at an exceptionally fast rate (t½ = 2 m, Fig. 2b, 2c).

Figure 2.

Figure 2.

Characterization of the enhanced splicing activity of Cat*. (a) Schematic of the in vitro trans-splicing assay. An N-extein maltose binding protein (MBP) is fused to CatN, and a C-extein green fluorescent protein (GFP) is fused to CatC. The N-extein −1 residue is shown as a red X. (b) Reaction progress curves for splicing of Cat and Cat* at 30 °C with the indicated −1 N-extein residue. (c) In vitro splicing half-lives of Cat (red) and Cat* (blue) with indicated −1 N-extein residues (mean ± SD, n = 3). (d) In vitro splicing half-lives of Cat Gly−1 and the indicated mutations derived from Cat* (mean ± SD, n = 3). (e) Depiction of the Cat active site (PDB 6DSL) highlighting the proximity of the S79P mutation (cyan) to the active site B-block loop. The catalytic residues Thr75 and His78 are rendered as sticks. The values for Cat are reproduced from previously reported data16 in all panels.

To delineate the contribution of each mutation in Cat*, we measured splicing of Cat S79P and Cat S70T in the presence of Gly−1 (Fig. 2d). Introduction of S79P into Cat largely accounts for the enhanced activity of Cat* (t½ = 11.8 m), while no increase in splicing rate is observed for Cat S70T (t½ = 152 m). Mapping these mutations onto the previously reported structure of Cat (PDB 6DSL), Ser70 is located on the periphery of the intein, while Ser79 is present within the intein functional site (Fig. 2e). Importantly, Ser79 sits directly adjacent to the B-block intein loop, which contains both catalytic Ser75 and His78 residues responsible for activation of the N-splice junction.

N-extein substitutions in Cat impact B-block catalytic residues

To understand how the S79P mutation restores activity in Cat*, we began by investigating the structural consequences of −1 N-extein substitutions. Cat Glu−1, Cat Gly−1, and Cat S79P Gly−1 constructs containing inactivating C1A and N134A mutations were expressed in 15N, 13C isotopically enriched media, complexed, purified to homogeneity by size-exclusion chromatography, and analyzed by NMR spectroscopy. Backbone resonances for the three constructs were assigned, and chemical shift perturbations (CSP) between each substitution were calculated (Table S2, Fig. S2). The 1H-15N HSQCs of all complexes display very similar fingerprints, which demonstrates that large structural changes such as misfolding do not account for the observed differences in activity.

The CSP values for the Cat E-1G substitution were next mapped onto the previously reported Cat structure (PDB ID 6DSL) (Fig. 3a). Residues near the site of the N-extein substitution, particularly the catalytic residues C1A and His78, show significant values of CSP. In addition, a well resolved Hδ2/Cδ2 crosspeak assigned for His78 in Cat Glu−1 could not be determined for Cat Gly−1 in the 1H-13C aromatic HSQCs (Fig 3b). Correspondingly, one less crosspeak is also observed for histidine Hε1/Cε1 resonances in Cat Gly−1 than Cat Glu−1, indicating line broadening of His78 imidazole resonances. Increased local dynamics, such as loss of a steric restriction resulting from the E-1G mutation, can account for these line broadening effects. The B-block histidine is highly conserved and contributes to both the linear intermediate formation and the branched intermediate resolution splicing steps25. Thus, mutations that alter His78 dynamics may in turn augment splicing kinetics.

Figure 3.

Figure 3.

Structural and dynamic effects of Cat N-extein substitutions. (a) Differences in backbone chemical shifts between Cat Glu−1 and Cat Gly−1. The weighted average chemical shift perturbation (CSP) was calculated for each residue and is mapped onto the solution structure of Cat (PDB ID: 6DSL). The substituted −1 residue is depicted in green, and active site residues that exhibit significant chemical shifts are rendered as sticks. (b) 1H −13C-HSQC spectra of Cat in the context of Glu −1(left), Gly −1 (middle), or S79P Gly −1 (right). (c) Differences in backbone chemical shifts between Cat Gly−1 and Cat S79P Gly−1. The weighted average chemical shift perturbation (CSP) was calculated for each residue and is mapped onto the solution structure of Cat (PDB ID: 6DSL). The S79P mutation is depicted in green, and reactive site residues that exhibit significant chemical shifts are rendered as sticks

The CSP values between Cat Gly−1 and Cat S79P Gly−1 were then mapped on the Cat structure with significant chemical shift perturbations predominantly localized in the active site (Fig. 3c). S79P strongly impacts the backbone chemical shifts of Asp115, Tyr125, and the adjacent block B loop (Asp76, Asn77, Phe80, Gly81). While His78 Hδ2/Cδ2 cross peaks are not observed in the Cat Gly−1 1H-13C aromatic HSQC spectrum due to line broadening, the S79P mutation reverses this effect (Fig. 3b). Moreover, a peak in the Cat S79P Gly−1 histidine Hε1/Cε region is also restored. This observation suggests that the dynamic state of the His78 side chain in Cat S79P Gly−1 more resembles that of Cat Glu−1 than Cat Gly−1.

Sidechain conformational dynamics of His78 modulate reaction rate in Cat

Molecular dynamics simulations were carried out to explore the molecular basis for −1 N-extein dependence. In the NMR structure ensemble of Cat (PDB 6DSL), which contains Glu−1, the side chain of His78 occupies two main conformations. The imidazole group Nδ1 is oriented proximal to the −1 N-extein scissile peptide bond in the first conformation but sits distal in the second (Fig S3). Prior studies suggest that the proximal orientation of Nδ1 is optimal for catalysis through interaction with the ExtN −1 backbone carbonyl oxygen. Since the −1 ExtN side chain interacts with His78, we hypothesized that N-extein substitution could modify His78 conformational dynamics. MD simulation trajectories were thus calculated from structures originating from both configurations for Cat Glu−1, Cat Gly−1, and Cat S79P Gly−1. When the His78 sidechain starts in the proximal configuration, limited χ2 torsion angle change is observed irrespective of the −1 residue or the S79P mutation (Fig. S4). However, when the simulation originates from the distal configuration for Cat Glu−1, the imidazole ring undergoes a flip and reorients itself to achieve the proximal conformation thereby reducing the distance between His78 Nδ1 and Glu−1 carbonyl oxygen. (Fig 4a, 4b, S5). The ring flip is preceded by a restructuring of the loop spanning residues 78 to 90 wherein the loop extends to open the active site and enable the His78 sidechain to reorient itself (Fig. 4a, Movie S1). Distances calculated from the trajectories between Cα atoms of Gly81 and Ser133 increase by about 4 Å prior to the ring flip, indicating an extension of the loop away from the active site. (Fig 4c, S5). The retraction of the loop residues also corresponds with an increase in the void volume around the His78 side chain, possibly to accommodate its reorientation (Fig. 4d, Table S3). In the Cat Gly−1 simulation, however, the His78 side chain remains in the distal configuration with no observed β-hairpin reconfiguration. This unfavorable distal configuration may thus contribute to the decreased splicing activity observed for the Gly−1 extein (Movie S2). For Cat S79P Gly−1, the His78 sidechain undergoes a reorientation like that in the Cat Glu−1 simulation. Flipping of the imidazole ring is once again preceded by restructuring of the residue 78–90 loop region and an increase in void volume around the His78 sidechain and results in a reduced distance between His78 Nδ1 and the Gly−1 carbonyl oxygen. (Fig 4c, 4d, S5, Movie S3). Interestingly in proximal configuration simulations, this loop restructuring is observed for Cat S79P in proximal configuration simulations as well, but not in any of other cases (Fig S4). Thus, the in silico observation that S79P augments residue 78–90 loop dynamics and facilitates His78 reorientation to the proximal configuration provides a possible mechanism for the enhanced Cat* extein tolerance.

Figure 4.

Figure 4.

Structural and dynamic effects of Cat S79P mutation. (a) Structural overlay of residues 74–89 from the MD simulation for Cat Glu−1 (black), Cat Gly−1 (gray), and Cat S79P Gly−1 (orange) at t = 0, 150, and 600 ns. His78 is represented as a stick and an arrow is depicted adjacent to the Gly81-Asp83 loop. (b) His78 χ2 angles calculated from individual MD simulation trajectories for Cat Glu−1 (left), Cat Gly−1 (center), and Cat S79P Gly−1 (right). (c) Distances between Gly81 (Cα) and Ser133 (Cα) computed from MD simulation trajectories for Cat Glu−1 (left), Cat Gly−1 (center), and Cat S79P Gly−1 (right) (d) CASTp void volume calculations surrounding His78 for Cat Glu−1 (left), Cat Gly−1 (center), and Cat S79P Gly−1 (right). An arrow is shown depicting the point of His78 rotation.

Discussion and Conclusions

Extein dependence can hinder applications of split intein-mediated protein trans-splicing and necessitate insertion of nonnative residues into an exogenous protein of interest. This limitation motivates efforts to both understand and resolve extein dependence within split inteins. Here, we applied a cell-based selection assay to identify a variant of the atypically split Cat TerL intein (Cat*) with enhanced extein tolerance at the −1 N-extein position. An S79P mutation accounts for this enhanced activity and is adjacent to His78. We then applied NMR spectroscopy to demonstrate that S79P acts locally to augment active site residues, including His78 side chain dynamics. Lastly, MD simulations suggest these His78 dynamics may directly alter splicing activity by impacting its distance to the −1-residue backbone carbonyl oxygen.

The enhanced promiscuity of Cat* expands the intein toolkit available to protein engineers by enabling splicing under previously unfavorable extein environments and should facilitate the N-terminal modification of proteins due to its synthetically accessible N-intein. Specifically, applications of naturally split inteins that could benefit from this enhanced promiscuity include efforts overcome cargo limitations of adeno-associated virus gene therapies, generate cyclic peptide libraries for drug screening, and synthetically modify proteins in cells23,2628. Each of these applications of protein trans-splicing could benefit from a greater ability to conserve native residues adjacent to the intein insertion site.

Examining this work in the context of prior efforts to engineer splicing promiscuity potentially reveals common principles to design split inteins with minimal extein dependence. The observation that Glu−1 interacts with catalytic residues within the active site and steers His78 is reminiscent of prior investigations into the C-extein dependence of DnaE inteins 21. In Npu DnaE intein, a phenylalanine at the +2 C-extein position interacts with and guides the position of a catalytic F-block histidine (His125) to facilitate succinimide formation. For both Cat and Npu, loss of a steric interaction from introduction of an extein residue such as glycine or alanine leads to unfavorable orientation of active site catalytic residues. Interestingly, much like the restoration of activity from the S79P mutation directly adjacent to His78 in Cat reported here, introduction of a “GEP” substitution in the loop adjacent to His125 (residues 122–124) in DnaE inteins enhances activity in unfavorable extein contexts22. In both cases, NMR and MD simulations suggest the substitutions compensate for losing the steric extein interaction by augmenting dynamics and conformation of the nearby catalytic histidine. However, some efforts to overcome extein dependence have been more challenging, such as for the atypically split VidaL intein, which requires a −1 Gly23. In VidaL, Gly−1 is thought to facilitate distortion of the scissile amide bond. It is thus possible that inteins in which extein dependence results from loss of a side chain interaction, such as Npu and Cat, may be more suitable for engineering than those whose extein dependence is caused by introduction of unfavorable steric bulk, as is the case for VidaL.

Intriguingly, the identified mutations in Cat* (S70T, S79P) correspond to the native residues in the previously reported AceL TerL intein16. In addition, both Thr70 and Pro79 are the second most prevalent residues at their respective positions within the TerL alignment from which the Cat sequence was derived (Fig. S6). Cat, which contains Ser70 and Ser79, may therefore be less promiscuous than AceL (and other uncharacterized TerL inteins) despite its increased activity and stability. Consensus protein engineering assumes that the most common residues within a multiple sequence alignment (MSA) are selected to improve fitness, but because inteins evolve within the context of a specific insertion site (e.g. TerL), a consensus intein may optimize activity for the endogenous extein and therefore not favor extein substitutions. This finding agrees with the general observation that enzyme specialization often accompanies enhanced catalytic efficiency29. The counterbalance between splicing efficiency and promiscuity is further noted in the decreased WT extein splicing rates for both the Cat* (~2 fold with WT Glu−1 N-extein) and the “GEP” DnaE mutations (~2 fold with Phe+2 C-extein). Future studies that identify and engineer highly efficient and robust inteins through consensus design should therefore benefit from additional characterization and expansion of extein promiscuity.

There have been few inquiries into why certain split inteins naturally exhibit partial N- or C-extein promiscuity, although significant effort has been invested to understand the cause of extein dependence and to increase splicing promiscuity. For example, while the splicing rate of Cat depends on the identity of the −1 N-extein position, it is largely unperturbed by C- extein substitutions17. Conversely, splicing in Npu depends on the identity of the +2 position, but tolerates substitutions at the −1 position30. For Cat, which lacks an F-block histidine, branch resolution may be accelerated by a penultimate G-block histidine that does not interact with the C-extein. For Npu, however, it is unclear why N-extein mutations do not result in a significant loss of activity. A thorough understanding of natural intein promiscuity could enable grafting of endogenous motifs to facilitate promiscuity or identifying novel promiscuous inteins from sequence databases.

Materials and Methods

NMR Spectroscopy

NMR experiments were performed on a CatN /CatC complex (500 μM) enriched with 13C and 15N isotopes. The intein fragments were rendered inactive through CatN(C1A) and CatC(N134A) mutations. Polypeptides were individually expressed as SUMO fusion constructs in isotopically enriched M9 minimal media, purified and the fusion tag was removed by Ulp1 protease. Intein fragments were combined in 1.5:1 ratio of CatN: CatC and the complex formed was purified by size exclusion chromatography using a GE HiPrep 16/60 Sephacryl S-200 column. NMR experiments were performed on a Bruker DRX600 NMR spectrometer equipped with a TCI cryoprobe.

Molecular Dynamics Simulation

Starting structures for Molecular Dynamics (MD) simulation were obtained from the previously determined NMR structure of the Cat intein complex (PDB ID: 6DSL). The first and the fifth structure from the PDB file were used as starting structures which have the major and minor His78 configurations. Cys1 and Asn134 residues were modeled to replace the inactivating alanine mutations in the PDB structure file using Chimera31. Additionally, the different mutants investigated in this work were modeled using Chimera to obtain respective starting structures. Standard all-atom molecular dynamics simulations in explicit water were performed using the GPU enhanced Amber 18 software code. The ff14SB force field was used for the protein molecules. TIP-3P explicit water model was used with a box having a 10 Å buffer limit to the solute molecule. Molecules were neutralized using additional Na+ ions as required. The system was prepared by performing energy minimization, heating, and density equilibration. Positional restraints were applied on the backbone of the protein using the SHAKE algorithm. The system was further equilibrated for 25 ps before the production was set up for a total simulated time of 500–700 ns. Void volume calculations were performed using the CastP web server 32.

Splicing Assay

Splicing reactions of CatN carrying maltose binding protein as the N-extein (MBP-CatN) and CatC with GFP as the C-extein (CatC-GFP) were performed as follows. Equal volumes of MBP-CatN and CatC-GFP (4 μM stocks) were incubated at 30 °C and mixed to start the PTS reaction. 10 μL aliquots were removed at the required time points and quenched by mixing with 10 μL of 2x Laemmli sample buffer (BioRad). Samples were analyzed by SDS-PAGE. Gel photographs were captured using the BioRad GelDoc imaging system and ImageJ software was used to quantify the bands in each lane.

Error Prone PCR Mutagenesis and Kanamycin Resistance Screening

Fast acting mutants were selected from a kanamycin resistance screen coupled to the intein splicing activity. KanR was encoded as two split intein precursor genes KanRN-CatN and KanRC-CatC using a pBlusecript plasmid under the T3 promoter (referred to hence as pKanR). The pKanR plasmid was transformed into 40 μL E. coli DH ultracompetent cells (NEB) and outgrown in 800 μL of SOC medium at 37 °C. 160 μL of the outgrown cells were plated on LB agar plates containing 100 mg/L of ampicillin and the required concentration of kanamycin (0, 5, 7.5, 10, 20 and 30 μg/L). The plates were incubated overnight at 37 °C.

Error prone PCR was performed on a pKanR plasmid carrying an FA extein using the Diversify PCR random mutagenesis kit (Takara, Mountain View, CA). The primer design was done such that the CatN and CatC sequences alone were subject to the mutagenesis and amplification by PCR while leaving the KanR gene and the ribosome binding sites of the plasmid unperturbed. The fragments amplified by error prone PCR were cloned into plasmids to obtain a pKanR mutant library. The library was transformed into DH cells and plated in increasing kanamycin concentrations as described above. Colonies viable in higher kanamycin concentration were selected for subsequent gene sequencing and characterization.

Supplementary Material

Supplementary material
Suppl. Movie 1
Download video file (71.5MB, mp4)
Suppl. Movie 2
Download video file (61.6MB, mp4)
Suppl Move 3
Download video file (68.2MB, mp4)

ACKNOWLEDGMENTS

The authors thank members of the Muir lab for valuable discussion, and Dr. Sean Cahill for NMR advice and assistance. This work was supported by the US National Institutions of Health (R37-GM086868, S10-OD016432, P30-CA013330) and NSF (Fellowship to AJS and DGE-1148900, XSEDE Allocation).

Footnotes

ASSOCIATED CONTENT

Supplementary information for methods, figures, and tables are available for this manuscript.

The authors declare no competing financial interest.

References

  • 1.Shah NH & Muir TW Inteins: Nature’s Gift to Protein Chemists. Chem Sci 5, 446–461 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Novikova O, Topilina N. & Belfort M. Enigmatic distribution, evolution, and function of inteins. J Biol Chem 289, 14490–7 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Shah NH, Eryilmaz E, Cowburn D. & Muir TW Naturally split inteins assemble through a “capture and collapse” mechanism. J Am Chem Soc 135, 18673–81 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wu Q. et al. Conserved residues that modulate protein trans-splicing of Npu DnaE split intein. Biochem J 461, 247–55 (2014). [DOI] [PubMed] [Google Scholar]
  • 5.Zettler J, Schutz V. & Mootz HD The naturally split Npu DnaE intein exhibits an extraordinarily high rate in the protein trans-splicing reaction. FEBS Lett 583, 909–14 (2009). [DOI] [PubMed] [Google Scholar]
  • 6.Carvajal-Vallejos P, Pallisse R, Mootz HD & Schmidt SR Unprecedented rates and efficiencies revealed for new natural split inteins from metagenomic sources. J Biol Chem 287, 28686–96 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Shah NH, Dann GP, Vila-Perello M, Liu Z. & Muir TW Ultrafast protein splicing is common among cyanobacterial split inteins: implications for protein engineering. J Am Chem Soc 134, 11338–41 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Stevens AJ et al. Design of a Split Intein with Exceptional Protein Splicing Activity. J Am Chem Soc 138, 2162–5 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.David Y, Vila-Perello M, Verma S. & Muir TW Chemical tagging and customizing of cellular chromatin states using ultrafast trans-splicing inteins. Nat Chem 7, 394–402 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Borra R, Dong D, Elnagar AY, Woldemariam GA & Camarero JA In-cell fluorescence activation and labeling of proteins mediated by FRET-quenched split inteins. J Am Chem Soc 134, 6344–53 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Charalambous A, Andreou M. & Skourides PA Intein-mediated site-specific conjugation of Quantum Dots to proteins in vivo. J Nanobiotechnology 7, 9 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Castillo F. & Tavassoli A. Genetic Selections with SICLOPPS Libraries: Toward the Identification of Novel Protein-Protein Interaction Inhibitors and Chemical Tools. Methods Mol Biol 2001, 317–328 (2019). [DOI] [PubMed] [Google Scholar]
  • 13.Yao Z. et al. Split Intein-Mediated Protein Ligation for detecting protein-protein interactions and their inhibition. Nat Commun 11, 2440 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Levy JM et al. Cytosine and adenine base editing of the brain, liver, retina, heart and skeletal muscle of mice via adeno-associated viruses. Nat Biomed Eng 4, 97–110 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Bachmann AL & Mootz HD An Unprecedented Combination of Serine and Cysteine Nucleophiles in a Split Intein with an Atypical Split Site. J Biol Chem 290, 28792–804 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Thiel IV, Volkmann G, Pietrokovski S. & Mootz HD An atypical naturally split intein engineered for highly efficient protein labeling. Angew Chem Int Ed Engl 53, 1306–10 (2014). [DOI] [PubMed] [Google Scholar]
  • 17.Stevens AJ, Sekar G, Gramespacher JA, Cowburn D. & Muir TW An Atypical Mechanism of Split Intein Molecular Recognition and Folding. J Am Chem Soc 140, 11791–11799 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Bachmann AL & Mootz HD N-terminal chemical protein labeling using the naturally split GOS-TerL intein. J Pept Sci 23, 624–630 (2017). [DOI] [PubMed] [Google Scholar]
  • 19.Amitai G, Callahan BP, Stanger MJ, Belfort G. & Belfort M. Modulation of intein activity by its neighboring extein substrates. Proc Natl Acad Sci U S A 106, 11005–10 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Lockless SW & Muir TW Traceless protein splicing utilizing evolved split inteins. Proc Natl Acad Sci U S A 106, 10999–1004 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Shah NH, Eryilmaz E, Cowburn D. & Muir TW Extein residues play an intimate role in the rate-limiting step of protein trans-splicing. J Am Chem Soc 135, 5839–47 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Stevens AJ et al. A promiscuous split intein with expanded protein engineering applications. Proc Natl Acad Sci U S A 114, 8538–8543 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Burton AJ, Haugbro M, Parisi E. & Muir TW Live-cell protein engineering with an ultra-short split intein. Proc Natl Acad Sci U S A 117, 12041–12049 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Neugebauer M, Bocker JK, Matern JC, Pietrokovski S. & Mootz HD Development of a screening system for inteins active in protein splicing based on intein insertion into the LacZalpha-peptide. Biol Chem 398, 57–67 (2017). [DOI] [PubMed] [Google Scholar]
  • 25.Du Z. et al. Highly conserved histidine plays a dual catalytic role in protein splicing: a pKa shift mechanism. J Am Chem Soc 131, 11581–9 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tornabene P. et al. Intein-mediated protein trans-splicing expands adeno-associated virus transfer capacity in the retina. Sci Transl Med 11(2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Tavassoli A. SICLOPPS cyclic peptide libraries in drug discovery. Curr Opin Chem Biol 38, 30–35 (2017). [DOI] [PubMed] [Google Scholar]
  • 28.Khoo KK et al. Chemical modification of proteins by insertion of synthetic peptides using tandem protein trans-splicing. Nat Commun 11, 2284 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Khersonsky O. & Tawfik DS Enzyme promiscuity: a mechanistic and evolutionary perspective. Annu Rev Biochem 79, 471–505 (2010). [DOI] [PubMed] [Google Scholar]
  • 30.Cheriyan M, Pedamallu CS, Tori K. & Perler F. Faster protein splicing with the Nostoc punctiforme DnaE intein using non-native extein residues. J Biol Chem 288, 6202–11 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Pettersen EF et al. UCSF Chimera--a visualization system for exploratory research and analysis. J Comput Chem 25, 1605–12 (2004). [DOI] [PubMed] [Google Scholar]
  • 32.Tian W, Chen C, Lei X, Zhao J. & Liang J. CASTp 3.0: computed atlas of surface topography of proteins. Nucleic Acids Res 46, W363–W367 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material
Suppl. Movie 1
Download video file (71.5MB, mp4)
Suppl. Movie 2
Download video file (61.6MB, mp4)
Suppl Move 3
Download video file (68.2MB, mp4)

RESOURCES