Graphical abstract
Lessons from molecular evolution: the acquired knowledge on the human MGMT is the starting point for applying a rational design to other AGTs, in order to obtain alternative (in our case thermostable) homologues and expand the “SNAP-tag technology” to organisms and/or reaction conditions incompatible with the activity of the commercial protein-tags.
Abbreviations: AGT, OGT, MGMT, O6-alkyl-guanine-DNA-alkyl-transferase; BC, O2-benzyl-cytosine; BG, O6-benzyl-guanine; HTH, helix-turn-helix motif; IMAC, immobilized metal affinity chromatography; SLP, Self-Labelling Protein-tag
Keywords: Protein-tag, Protein labelling, Orthogonal substrate specificity, Thermozymes, Protein engineering
Abstract
Self-labelling protein tags (SLPs) are resourceful tools that revolutionized sensor imaging, having the versatile ability of being genetically fused with any protein of interest and undergoing activation with alternative probes specifically designed for each variant (namely, SNAP-tag, CLIP-tag and Halo-tag). Commercially available SLPs are highly useful in studying molecular aspects of mesophilic organisms, while they fail in characterizing model organisms that thrive in harsh conditions. By applying an integrated computational and structural approach, we designed a engineered variant of the alkylguanine-DNA-alkyl-transferase (OGT) from the hyper-thermophilic archaeon Saccharolobus solfataricus (SsOGT), with no DNA-binding activity, able to covalently react with O6-benzyl-cytosine (BC-) derivatives, obtaining the first thermostable CLIP-tag, named SsOGT-MC8.
The presented construct is able to recognize and to covalently bind BC- substrates with a marked specificity, displaying a very low activity on orthogonal benzyl-guanine (BG-) substrate and showing a remarkable thermal stability that broadens the applicability of SLPs. The rational mutagenesis that, starting from SsOGT, led to the production of SsOGT-MC8 was first evaluated by structural predictions to precisely design the chimeric construct, by mutating specific residues involved in protein stability and substrate recognition. The final construct was further validated by biochemical characterization and X-ray crystallography, allowing us to present here the first structural model of a CLIP-tag establishing the molecular determinants of its activity, as well as proposing a general approach for the rational engineering of any O6-alkylguanine-DNA-alkyl-transferase turning it into a SNAP- and a CLIP-tag variant.
1. Introduction
Self-labelling protein-tags (SLPs) catalyse the covalent, highly specific and irreversible binding of the fluorogenic moiety of synthetic ligands designed by mimicking the physiological protein substrate. Their relatively small dimension makes them powerful tools for a huge number of applications [1]. To this aim, in 2003, the group of Prof. Kai Johnsson introduced for the first time an engineered version of the human O6-alkyl-guanine-DNA-alkyl-transferase (hMGMT), by applying a molecular evolution approach in order to: i), erase the protein DNA binding activity, ii) increase the catalytic activity toward the pseudo-substrate O6-benzyl-guanine (BG-) derivatives [2], [3], [4], [5], [6] (Fig. 1). The unique features of this protein that covalently retain a part of the BG- substrate allowed a huge number “biotech” applications: conceptually, all desired chemical groups, if previously conjugated to the benzyl moiety, could be covalently linked to the protein [2], [3], [4], [5], [6]. Accordingly, an impressive number of papers appeared in literature, leading to the commercialization of this engineered hMGMT as “SNAP-tag” (New England Biolabs) whose application cover various disciplines, from medicine to biotechnology: genetically fused to a protein of interest (POI), it has been proposed as valid alternative to fluorescent proteins (FPs) and affinity-tags proteins (as GST, MBP, His-tag, etc.) mainly in fluorescence microscopy [7], but also in protein purification/immobilization methodologies [8]. Noteworthy, despite the disadvantage to utilise an external substrate, SNAP-tag revealed particularly useful in some contexts, as anaerobic conditions, where FPs fail to produce a fluorescent signal [9]. Furthermore, the high enzyme specificity for the substrate and the extreme versatility of the production of BG-conjugates make the SNAP-tag an ideal tool for the production and development of new generation biosensors [10], [11].
Fig. 1.
Scheme of the irreversible reaction of SNAP- and CLIP-tag on their respective BG- and BC-derivative substrates. Upon the reaction, the benzyl moiety is covalently linked to the catalytic cysteine (white triangle). If desired chemical groups (indicated as 1 and 2) are previously conjugated to BG- and BC-, it is possible to achieve a specific multi-protein labelling by these commercial SLPs.
Afterwards, the need to get an available SLP with an orthogonal substrate specificity with respect to the SNAP-tag for in vivo multi-protein labelling, induced Johnsson and co-workers to a further development, again by molecular evolution, of a variant of the hMGMT (the CLIP-tag), able to recognise O2-benzyl-cytosine (BC-) derivatives [12] (Fig. 1). This is peculiar, because the great majority of model organisms does not react with BC-, avoiding any endogenous activity [12]. The advantage of possessing two orthogonal SLPs to be genetically fused to two respective POIs is particularly suitable for in vivo and in vitro protein–protein interaction studies, by methodologies as the Selective crosslinking of interacting proteins (S-CROSS) [13], as well as by employing FRET fluorophore pairs conjugated to the BG- and BG- substrates.
Despite the effort pushed on the hMGMT variants in order to increase their general stability [6], it is known that the SNAP-tag and the CLIP-tag have the limitation of being employable in vitro under moderate reaction conditions and in vivo in “mesophilic” model organisms characterized by a temperature optimum lower than 40 °C. Although these conditions cover most applications, these SLPs are not suitable for studies on (hyper)thermophilic organisms and, more generally, organisms that thrive in non-permissive environmental conditions (high pressure, extremes of pH, high ionic strength, etc.). To fill this gap, from 2012 on, Perugino and collaborators identified an O6-alkyl-guanine-DNA-alkyl-transferase activity in the hyperthermophilic archaea Saccharolobus solfataricus: as expected, this protein is involved in in vivo direct DNA repair by alkylating agent treatment [14]. The fruitful lesson from the SNAP-tag, where the aminoacidic residues involved in the recognition of the BG- derivatives were identified, led to the production by site-directed mutagenesis of a S. solfataricus OGT variant (defined SsOGT-H5, Fig. 2 and Fig. S1), which has been fully characterized: it is unable to bind DNA but still retaining a high catalytic activity on the fluorescein BG-derivative (SNAP-Vista® Green, hereinafter BG-FL, Fig. S2) [14]. Furthermore, SsOGT-H5 revealed a strong thermostability comparable to the wild type and a general resistance to other chemical-physical denaturants [14]. Unlike hMGMT and relative variants, the OGT homologue from S. solfataricus is not sensitive to chelating agents [14], suggesting the absence of any structural Zinc ion: this was then confirmed by the 3D resolution of this archaeal protein [15]. All these premises constituted the starting point for exploiting this mutant as an innovative SLP in “thermophilic” contexts. Indeed, SsOGT-H5 was used as a protein tool genetically fused to a thermostable enzyme [16], as well as it underwent heterologous expression in thermophilic organisms [16], [17]. In particular, we purified at high yield and purity the SsOGT-H5-lacS fusion protein by means of immobilized metal affinity chromatography (IMAC), as well as by incubating the E. coli cell free extract at high temperature (70 °C) in order to obtain a selective precipitation of E. coli proteome and recover the heterologously expressed thermostable protein in the soluble fraction [16]. Moreover, the presence of this thermostable SLP (hereinafter TSSNAP) also allowed a fourfold increase the thermostability of the alpha-carbonic anhydrase from Sulfurihydrogenibium yellowstonense [18], [19]. The heterologous expression of the TSSNAP was successfully achieved in the thermophilic bacterium Thermus thermophilus HB27EC, as well as in the hyperthermophilic archaea Sulfolobus islandicus [16], [17]. In both cases, these organisms were permeable to BG-FL, making possible the determination and the measure of the activity of TSSNAP, demonstrating the correct folding at high temperatures in these organisms [16], [17].
Fig. 2.
Commercial and thermostable SLPs. Cartoon representation of SLPs, with BG- and BC- substrate specificity coloured as indicated in the legend.
By means of computational and structural approach, we have rationally driven the tuning of the substrate specificity of the TSSNAP from BG- to BC-derivatives, in order to produce and fully characterize the first, to date, thermostable CLIP-tag (TSCLIP; Fig. 2 and Fig. S1). The structure-based analysis allowed us to detect the peculiar amino acids involved in substrate specificity for this class of enzymes, as well as to propose a general approach for the engineering of any O6-alkyl-guanine-DNA-alkyl-transferase turning it into a SNAP- and a CLIP-tag variant.
2. Results and discussion
2.1. Production of a thermostable CLIP-tag by rational design
2.1.1. General considerations
Currently, several dozen O6-alkyl-guanine-DNA-alkyl-transferases (E.C: 2.1.1.63) have been fully characterized (see BRENDA web site, https://www.brenda-enzymes.org/enzyme.php?ecno=2.1.1.63; see UniProt web site, https://www.uniprot.org/uniprot/?query=2.1.1.63&sort=score) and more than forty 3D structures were present in Protein Data Bank (including 3D structures of variants, in complex with DNA or ligands, etc.), highlighting the molecular mechanisms of these proteins in the direct DNA repair and substrate specificity. Spanning from mesophilic to thermophilic environments, AGTs display a different primary structure, whereas all of them show a typical protein architecture, mainly consisting of two globular domains [8], [20]: a poorly conserved Nter domain, whose function is not still well understood (likely involved in regulation, cooperative binding, and stability; [15], [21], [22]), and a highly conserved Cter domain housing all the functional elements for protein activity; i.e., the helix-turn-helix motif (HTH) responsible for the DNA binding, the -V/IPCHRVV/I- amino acid consensus sequence including the catalytic cysteine, and the active site loop that is mainly involved in the substrate specificity and it is characterized by a conformational plasticity for which it is annotated as a structural element undergoing spatial rearrangement along the catalytic cycle (Fig. 2) [8], [21], [22], [23], [24], [25].
The 3D structure of SsOGT revealed some peculiarities of this thermostable protein in the Nter domain, as the presence of a S—S bridge (C29-C31), the absence of any structural Zinc ion, and the interconnection between the Nter and the Cter domain by the D27-R133 residues interaction [15]. Site directed mutagenesis targeting the above-mentioned residues demonstrated their role on thermal stability of this protein [15]. Based on these data, we explore a structure- and computational-based pipeline to design a thermostable OGT specifically active on BC-derivatives for biotechnological application which require an SLP with an orthogonal substrate recognition both for molecular in vitro studies and for in vivo multi-protein labelling.
Accordingly, the experimental workflow described below includes three key steps: i) primary sequence design; ii) in-silico structural analysis to select the molecular determinants of thermal stability and substrate specificity; iii) engineered protein purification and biochemical characterization.
2.1.2. The ChimeraCLIP mutant
The molecular evolution approach leading to the production of the SNAP-tag and the CLIP-tag starting from hMGMT, clearly revealed that, as expected, the mutations for the enhancement of the activity towards BG-, as well as for the change in substrate specificity towards BC-derivatives mainly fall in the Cter domain, in particular in the H4 helix of the HTH motif and the active site loop (Fig. 2 and S1) [6], [12]. The first attempt to produce a thermostable OGT active on BC-derivatives was indeed the construction of a chimeric gene, by combining the Nter domain (including the connecting loop) of SsOGT with the Cter domain of the CLIP-tag. Trying to assure enough stability to this chimera, the well-known D27-R133 ion-pair of SsOGT [15] was introduced, by replacing a glycine residue in the CLIP-tag Cter domain with an arginine residue (G160R mutation). This chimera (ChimeraCLIP; Fig. 2 and Fig. S1) inside E. coli cells revealed a specific activity on CLIP-Cell™ TMR-Star (hereinafter BC-TMR; Fig. S2), demonstrating that the selectivity offered by the Cter domain coming from the commercial tag was not influenced by the presence of a different Nter domain from the SsOGT (Fig. S3a). This very promising preliminary result, however, was hampered by the purification trials failure, probably due to the strong instability of the chimeric protein.
2.1.3. The tailored gene expressing SsOGTCLIP
Afterwards, a synthetic gene of SsOGT was produced, where its H4 helix and the active site loop were substituted by the same structures from CLIP-tag (the so-defined “CLIP” region in Fig. S1), leading to the purification of SsOGTCLIP (Fig. 2 and Fig. S1).
In order to rationalize the target protein design, we generated the predicted tertiary structure of SsOGTCLIP and ChimeraCLIP using Robetta (https://robetta.bakerlab.org/) server [30]: by optimal superposition of the predicted models with the experimental structure of SsOGT (PDB ID: 4ZYE) [15], we were able to validate the potential improved stability of SsOGTCLIP, mainly based on the introduction, among the others, of three aminoacidic substitutions belonging to the wild type SsOGT, i.e., K78E, F84W and H147L. In particular, the mentioned mutations led to the instauration of an ion pair, a H-bonds and a hydrophobic contact, respectively (Fig. 3). Interestingly, two of the resulting contact are responsible for molecular bridge connecting the N-terminal to the C-terminal domain. Indeed, the side chain of W84 is involved in a H-bond with the carboxylic oxygen of the I58 main chain, on the other hand L147 of H5 produced a hydrophobic interaction with F53 that lies on H1 helix.
Fig. 3.
Molecular contacts gained by SsOGTCLIP with respect to ChimeraCLIP.Fig. 3a represents the ion pair between E78 and K81; Fig. 3b shows the Nter-to-Cter contact based on H-bond of W84 with I58; Fig. 3c highlights the hydrophobic interaction between L147 and F53. Each panel is indicated with the corresponding letter (in red) in the cartoon representing the optimal superposition of SsOGTCLIP and ChimeraCLIP predicted structures, depicted in orange and grey, respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Although the heterologous expression of this protein was quite low, a few amounts was purified and tested with BG- and BC- fluorescent derivatives. As shown in Fig. S3b, by a qualitative assay with fluorescent substrates, we revealed that TSSNAP and SsOGTCLIP possess a strong specificity towards their relative substrates, and an undetectable activity with the orthogonal ones, even after a prolonged incubation (see 5.4 Section). Unfortunately, also in the case of SsOGTCLIP, heat treatment above 50 °C resulted in a thermal denaturation and in vitro protein precipitation of this protein (data not shown). The insertion of mesophilic fragments from the CLIP-tag in the scaffold of the SsOGT led to protein instability as in the heterologous expression as well as in the thermal denaturation. For these reasons, we decided to maintain the overall primary structure of SsOGT as much as possible, in order to hit only few aminoacidic residues in the wild-type protein, trying to preserve the thermophilic feature of the designed construct. Since biochemical evidences were in good agreement with the comparative structural analysis, we applied again the computational approach to drive the design of the SsOGT-MC8 mutant.
2.1.4. Biochemical characterization of the SsOGT-MC8 mutant
Detailed information from the in-silico studies on SsOGT were then applied for the construction of a synthetic gene, in which eight aminoacidic residues in the wild type were substituted, in particular four of them (S100A, R102A, M106T, K110E) were kept as in the TSSNAP enzyme [16], in order to avoid any DNA binding activity. Other substitutions (G105N, S109D, G130P, S132L) coincide with the aminoacidic residues typical of the CLIP-tag enzyme (Fig. S1) from which we just excluded the Y-to-E mutation on H3 helix keeping the Y90 in the primary structure of SsOGT-MC8, in order to maintain the active site loop anchored to the H4 helix by establishing H-bond with Y90 (the structure-based rationale of such aminoacidic mutation is discussed below).
The expressed protein, the SsOGT-MC8 (Fig. 2) was easily purified from the E. coli cell free extract by affinity chromatography. The catalytic activity (in terms of second-order rate constant; [12], [13], [15], [16], [26]) of this tag was determined at the relative physiological temperature (65 °C): as the previously citated engineered variants, it exhibits activity on the BC-TMR substrate, whereas it was not possible to measure any activity on orthogonal BG-FL substrate, displaying a specular behaviour with respect to the TSSNAP (Table 1), thus impeding in both cases to determine a BC/BG ratio value. On the other hand, purified CLIP-tag reacts only 102 faster on BC-TMR than on BG-FL, partially in agreement with previous data [12], [26], whereas SNAP-tag is generally more specific on BG-derivatives (103 faster; Table 1) [12], [26]. The residual reactivity of CLIP-tag towards its non-respective substrate, makes it advisable to first label the SNAP-tag to minimize cross-reactions [26].
Table 1.
Catalytic activity of commercial and thermostable SLPs on BG- and BC-derivatives.
| enzyme | T (°C) |
kBC (s−1 M−1) |
kBG (s−1 M−1) |
Ref. |
|---|---|---|---|---|
| SNAP-tag | 25 | 2.6 × 101 (BC-FL) | 2.8 × 104 (BG-FL) | [12] |
| SNAP-tag | 25 | 3.2 × 102 (BC-TMR) | 4.3 × 105 (BG-TMR) | [26] |
| CLIP-tag | 25 | 1.1 × 103 (BC-FL) | 1.0 × 101 (BG-FL) | [12] |
| CLIP-tag | 25 | 1.9 × 104 (BC-TMR) | 8.3 × 101 (BG-TMR) | [26] |
| CLIP-tag | 25 | 1.2 × 105 (BC-TMR) | 9.4 × 103 (BG-FL) | this study |
| SsOGT-H5 | 65 | NDa (BC-TMR) | 2.0 × 104 (BG-FL) | this study |
| SsOGT-MC8 | 65 | 9.1 × 103 (BC-TMR) | NDa (BG-FL) | this study |
a ND = not determined.
An analysis of the cross-reactivity of thermostable SLPs was therefore performed by employing two different approaches. First, by using fluorescent substrates in competition with customised non-fluorescent nucleobases (BG-1 and BC-2, Fig. S2 and S4) [15], [27], IC50 values were determined. As shown in Table 2, all enzymes particularly prefer fluorescent substrates respect to our customised product (Fig. S5), leading to very high IC50 values, if compared to classical AGTs’ inhibitor, as BG- and Lomeguatrib [28]. Nevertheless, SsOGT-MC8 and CLIP-tag displayed an expected high orthogonal specificity, but this behaviour could be an effect of the substrate pairs used [26].
Table 2.
Cross-reactivity of SLPs by competitive inhibition method (IC50), by using fluorescent derivatives as substrates and non-fluorescent ones as competitors.
| enzyme | T (°C) |
substrate | competitor | IC50 (μM) |
|---|---|---|---|---|
| CLIP-tag | 25 | BC-TMR | BG-1 | 310.2 ± 81.7 |
| CLIP-tag | 25 | BC-TMR | BC-2 | 11.8 ± 3.9 |
| SsOGT-H5 | 65 | BG-FL | BG-1 | 1.5 ± 0.3 |
| SsOGT-H5 | 65 | BG-FL | BC-2 | 17.3 ± 16.4 |
| SsOGT-MC8 | 65 | BC-TMR | BG-1 | NDa |
| SsOGT-MC8 | 65 | BC-TMR | BC-2 | 127.6 ± 31.1 |
aND = not determined.
The second approach involved the utilization of a BG-conjugated agarose resin (SNAP-Capture Pull Down Resin, New England Biolabs), generally suitable for the SNAP-tag selective immobilization. As shown in Fig. 4a, the degree of the covalent protein immobilization depends from the total amount of protein (input, I) and the unbound protein in the flowthrough (FT) by following the equation in Fig. 4a. Only a specific activity on BG-derivatives (green enzymes in Fig. 4a) leads to the immobilization, increasing the value in term of percentage of bound protein on the resin.
Fig. 4.
Substrate specificity of CLIP-tag and SsOGT-MC8 by immobilization studies. a) A fixed amount of protein (indicated as input, I) is added to a specific amount of a BG-resin: if the covalent reaction occurs (green enzymes), the resin captures the protein reducing the amount of non-bound protein (indicated as flow-through, FT), as measurable after SDS-PAGE analysis. On the contrary, orthogonal activity (red enzymes) does not allow the covalent binding of the protein on the resin. The value of the specific covalent binding is calculated by the equation shown in the figure; b) SDS-PAGE and histogram of the relative efficiency of binding of commercial and thermostable SLPs. Gels are an example of three independent experiments for each protein. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Considering that this experiment was performed at room temperature (see Section 5.7), as expected, SNAP-tag is effectively the most active on the BG-resin, followed by the TSSNAP and SsOGT, confirming that the mutant is more active than the wild type at room temperature [16] (Fig. 4b). Because the inactive mutant SsOGT C119A [15] does not interact with the BG- substrate on the resin, the covalent binding of BG-reactive SLPs is clearly from their specific activity (Fig. 4b).
The addition of BC-reactive SLPs on BG-resin (red bars in the histogram of Fig. 4b) resulted in a partial but not negligible immobilization of the CLIP-tag, whereas SsOGT-MC8 displayed a much lower protein immobilization (Fig. 4b), probably due to the low activity of this protein-tag at room temperature. All data clearly demonstrated that this variant of SsOGT is effectively specific on BC-derivatives and could be used in combination with orthogonal BG-derivatives with a low risk of cross-reactivity.
Finally, we tested the thermal stability of SsOGT-MC8 by Differential Scan Fluorimetry (DSF), as described [15], [16], [29]. It is very important to underline that this technique is usually carried out from 20 °C to 95 °C at a ramping of 1 % (scan rate of 1 min/°C × cycle). Under these conditions, SNAP-tag showed a Tm over 65 °C, revealing an increased stability of ca. 17 °C respect to the wild type hMGMT counterpart [6]. In the case of thermostable OGTs from Saccharolobus solfataricus and Pyrococcus furiosus, they were not effectively denatured and it was not possible determine Tm values [15], [16], [29]. The experimental protocol was then opportunely modified by increasing to 5 min/°C × cycle and maintaining the temperature limits [15], [16], [29]. This prolonged heat treatment considerably reduced the Tm value of the SNAP-tag (Table 3 and Fig. S6), while the SsOGT-MC8 variant showed a stability to thermal denaturation almost identical to TSSNAP [16], which is slightly lower than the SsOGT wild type (80 °C) [16]. Indeed, the mutations introduced in SsOGT in order to change the substrate specificity did not affect the thermal stability of the SsOGT-MC8 mutant, thus proposing it as a valid alternative of the CLIP-tag under high temperature conditions, both in vivo and in vitro.
Table 3.
Protein stability of SLPs by the Differential Scan Fluorimetry method.
| enzyme | rate (min/°C × cycle) |
Tm | note |
|---|---|---|---|
| SNAP-tag | 5 | 51.32 ± 0.73 | this study |
| SsOGT-H5 | 5 | 73.87 ± 2.74 | this study |
| SsOGT-MC8 | 5 | 72.32 ± 1.03 | this study |
2.1.5. In vivo expression of TSSNAP and SsOGT-MC8 in thermophilic bacteria. The marked thermostability and the substrate specificity of this new variant have been tested in an in vivo experiment in the thermophilic bacterium T. thermophilus HB27EC, as described for TSSNAP [16]. After an overnight culture at 65 °C, transformed cells with the empty pMK184 plasmid, as well as the constructs containing the ogt-H5 and ogt-MC8 genes, were centrifuged and resuspended in PBS 1 × buffer in the presence of fluorescent substrates. As seen for BG-FL [16], cells are also permeable to BC-TMR (Fig. 5), suggesting that also this substrate can be utilised in this model organism. Although the protein expression is far from being optimised, a specific fluorescent signal is only present where the tags with the own substrate are incubated (1 h at 65 °C), whereas the reaction with the orthogonal one did not reveal any fluorescent signal, reproducing the identical in vitro results employing purified proteins (Fig. 5). The presence of a specific BC- activity from the heterologous expression of the ogt-MC8 gene clearly demonstrated that this SLP is correctly folded and, as the TSSNAP, could be employed in thermophilic model organisms as TSCLIP.
Fig. 5.
In vivo heterologous expression of TSSNAP and TSCLIP in the thermophilic bacterium T. thermophilus HB27EC. The ogt-H5 and ogt-MC8 genes were cloned into the pMK184 shuttle vector for their heterologous expression in HB27EC strain. Overnight cultured cells were washed and incubated at 65 °C for 1 h in the presence of 1.0 μM fluorescent substrates (as indicated) and then subjected to SDS-PAGE (further details are described in the Section 5.4). The HB27EC/pMK184 strain was used as negative control, as well as 1 μg of each purified protein was used as positive control. In the H5 lane was also loaded the protein marker.
2.1.5. The 3D structure of TSCLIP
The presented crystallographic studies allow us to obtain the first structural model of a SLP with substrate specificity towards the BC-substrates (PDB ID: 8AES). TSCLIP produced crystals that diffracted at 2.8 Å of resolution (Table 4) and thanks to the high-quality electron density map we were able to assign the orientation of all the amino acids along their sidechains, building the structural model of SsOGT-MC8 that shows the typical folding architecture of all AGTs, consisting of two globular domains connected by a long loop. Since the sequence is composed for the 95 % of the aminoacidic residues of SsOGT, the engineered construct maintains the overall molecular architecture of SsOGT including the determinants of the protein stability discussed below. In particular, the Nter domain (a.a. 1–53) folds into an anti-parallel β-sheet, connected to a conserved α-helix (H1) by a random-coiled region; that is further stabilized by the peculiar disulphide bridge established between the C29 and C31. As mentioned above, the main modifications with respect to wild type SsOGT occurred on the C-terminal domain, as it is directly involved in substrate recognition; indeed, it houses the catalytic C119 residue within the conserved -PCHR- signature, the “inactivated” HTH motif, responsible in wild type protein for the DNA minor groove binding, the active site loop and the ligand-binding pocket composed of the ‘asparagine hinge’ and H4 helix.
Table 4.
Data collection and refinement statistics.
| dataset | SsOGT-MC8 |
|---|---|
| Data collection | |
| Wavelength (Å) | 0.9184 |
| Resolution range (Å) | 47.38–2.8 (2.9–2.8) |
| Unit cell constants | |
| a (Å) | 100.835 |
| b (Å) | 119.435 |
| c (Å) | 180.237 |
| α, β, γ (°) | 90 |
| Space group | P 21 21 21 |
| N° of molecules in asua | 10 |
| N° of reflections | 185,454 |
| N° of unique reflections | 53,272 (7791) |
| Rpim (%) | 8.1 |
| Rmerge (%) | 13.8 |
| Rmeas (%) | 16.2 |
| I/σ | 7.1 (2.2) |
| Completeness (%) | 98.4 (99.3) |
| CC1/2 | 0.988 (0.562) |
| Multiplicity | 3.5 |
| Refinement | |
| R-factor (%) | 19.6 |
| R-free (%)b | 25.3 |
| Number of non-hydrogen atoms | 12,153 |
| macromolecules | 12,086 |
| ligands | 0 |
| solvent | 67 |
| Average B-factor (Å2) | 54 |
| macromolecules | 54.48 |
| solvent | 46.69 |
| Average RMSDc | |
| bond (Å) | 0.009 |
| angle (°) | 1.18 |
| Ramachandran statistics | |
| Residues (%) | |
| in most favoured regions | 92.34 |
| in additional allowed regions | 7.06 |
| in disallowed regions | 0.60 |
| Clashscore | 9.51 |
| Rotamer outliers (%) | 4.58 |
By optimal superposition of TSCLIP with the wild type protein, we observed the repositioning of discrete regions in TSCLIP, in particular the active site loop moves of 2.6 Å from the central core of the protein towards the bulk solvent; it could be referred to the simultaneous presence of P130 that imposes rigidity and L132 that is characterized by higher flexibility (B-factor = 57.35) compared to the wild type serine (B-factor = 26.27). As consequence of such conformational change, we notice that R133 side chain shifts towards H4 inducing a restriction of the upper part of the active site pocket. However, in the new conformation R133 still maintains the salt bridge with D27, whose importance for protein stability has been described (Fig. S7) [15], [25], [30].
The crystallographic analysis of TSCLIP allowed us to determine the molecular contacts responsible for the achieved stability of such protein compared to SsOGTCLIP, confirming the reliability of protein engineering process performed based of the predicted structure [31].
As observed in Fig. 6, TSCLIP gained an intrahelical ion bond between D95 and K91, and a second H-bond between E78 and S96, connecting H2 and H3 helices, both bettering the overall stability of the enzyme; the main chain I128-K138 H-bond was restored too, as occurred in SsOGT, anchoring the H5 helix to the ligand binding cavity wall, opposing the one constituted by the HTH. In addition, the substitution of a glutamic acid with a tyrosine (E90Y) with respect to SsOGTCLIP has a huge relevance, for the fact that Y90 is involved in the coordination of a H-bond-based network with V122, P130 main chain and K138: all these residues are located on the ligand entry site, making these interactions important functional elements for the selectivity towards BC-TMR substrates.
Fig. 6.
Molecular contacts gained by TSCLIP with respect to SsOGTCLIP.Fig. 6a is a zoom-in of the network of contacts coordinated by Y90 in which we highlighted the potential H-bond between Y90 and the main chain of P130, V122 and the side chain of K138 respectively. Fig. 6b represents the H3 intra-helix ion pair between D95 and K91 and the H-bond established by S96 and E78; the latter substitutes the intra-helix contact of SsOGTCLIP (orange) with an inter-helix bond in TSCLIP (raspberry). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Being the first crystal structure of a SLP with binding activity on BC-derivatives we were able to depict the role of each selected mutations (G105N, S109D, G130P, S132L, Fig. S1) introduced to guarantee the substrate specificity, performing also a comparative analysis with the mutated residues included in CLIP-tag [12]. In particular, N105 and D109 on H4 helix, introduced for the coordination of cytosine moiety, and P130 on the opposite surface contribute to narrow the accessibility to the substrate binding pocket by steric hindrance (Fig. 7).
Fig. 7.
Surface representation of crystal structures of SsOGT and TSCLIP bound to the BC- and BG-derivatives.Fig. 7a refers to TSCLIP (raspberry) bound to BC (on the left) and BG (on the right) with a zoom on the active site entrance showing the difficult accessibility of BG imidazole moiety; Fig. 7b refers to the same comparison performed on wild type SsOGT (cyan) (representative of TSSNAP) highlighting the wider active site gate of this construct. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
In fact, by comparing the active site gate of the wild type SsOGT with those of TSCLIP (both crystallographic structures) in complex with a BC molecule optimally superposed to the BG ligand of the SNAP-BG complex (PDB ID: 3KZZ) it is evident that SsOGT-MC8 active site entry is shaped to host the BC moiety, while it clashes with the BG compound. Considering the steric contribution, the P130 plays a key role (Fig. 7) in re-shaping the active pocket entry, also because its conformation is stabilized by H-bond with Y90. Along playing a relevant role in the interaction with P130, Y90 (Y114E mutation in CLIP-tag) is the only residue that has not been modified by following the CLIP-tag generation protocol [12].
Johnsson and co-workers speculated on the possible role of such tyrosine in the formation of a hydrogen bond with the N3 of BG to stabilize the developing negative charge on the leaving group guanine; consequently, they decided to mutate it into glutamic acid to abolish BG binding activity [12], [23]. On the contrary, TSCLIP is characterized by a higher substrate specificity also in presence of such tyrosine. Differently from CLIP-tag, in TSCLIP the selectivity has been achieved also by steric hindrance, obtained as result of the inserted mutations, that allow the binding of BC- instead of BG- derivatives.
This observation is further confirmed by the biochemical investigation in which it was not possible to determine the kinetic of binding of BG due to its complete exclusion from the active site pocket.
3. Conclusions and perspectives
This work, resulting from the lesson of the molecular evolution performed on the hMGMT, is the demonstration that the knowledge acquired by the group of prof. Johnsson led to the identification of residues involved in the substrate specificity of this enzyme [12]. Since the AGTs are evolutionary conserved [14], this information could become a tool to apply mutations rapidly and unequivocally to all known AGTs, in order to develop new engineered SLPs. The only limitation of this approach is that the identified AGTs must be sensitive to benzyl-guanine inhibition (note that E. coli AdaC is not inhibited by BG, so it is not active on BG-derived substrates [32], [33]). As shown in Fig. S8, the “SNAP-tag technology” could potentially be used in each model organism: if the chemical-physical growth conditions are compatible with commercial SNAP- and CLIP-tag, it is advisable to exploit them, given their high catalytic activity on BG- and BC-derived substrates. In other cases, it is possible to search for a specific BG- activity using fluorescent substrates and, starting from the acquired knowledge, proceed to the identification and appropriately the modification of the endogenous AGT, in order to produce homologous SNAP- and CLIP-tag, whose behaviours are compatible with the in vivo conditions of the organism used.
Using this methodology and starting from an archaeal OGT, which is stable at high temperatures and to the most common denaturing agents [14], we engineered it obtaining a pair of thermostable SLPs (TSSNAP and, for the first time, TSCLIP). These new protein-tags could allow an expansion of this technology in (hyper)thermophilic organisms and in all non-permissive reaction conditions for commercial SLPs.
4. Experimental Section/Methods
4.1. General experimental procedures
IR spectra were recorded on an Avatar 370 FT-IR Techno-Nicolet apparatus. 1H (400 MHz) and13C (100 MHz) NMR spectra were measured on Bruker Avance 400 MHz spectrometer or on a Bruker Avance 500 MHz. Chemical shifts were referenced to the residual solvent signal (CDCl3: δH = 7.25, δC = 77.0, CD3OD: δH = 3.34, δC = 49.0, DMSO: δH = 2.50, δC = 39.5, (CO(CD3)2: δH = 2.05, δC = 206.7, 29.9). Reactions were monitored by thin-layer chromatography (TLC) on Merck 60 F254 (0.25 mm) plates, visualized by staining with 5 % H2SO4 in EtOH and heating. Organic phases were dried with Na2SO4 before evaporation. Chemical reagents and solvents were purchased from Sigma-Aldrich, TCI Europe or Fluorochem and were used without further purification unless stated otherwise. Petroleum ether with boiling point of 40–60 °C was used. Silica gel 60 (70–230 mesh) was used for gravity column chromatography (GCC).
4.1.1. Synthesis of BG-1
to a stirred solution of BG-NH2 (900 mg, 3.31 mmol, 1 eq. mol) in AN/DMF dry 1:1 (10 mL), 6-azidoesanoic acid (804 mg, 3.981 mmol, 1.2 eq. mol), triethylamine (1.85 mL, 13.27 mmol, 4 eq. mol) and T3P (sol. 50 % in EtOAc, 2.45 mL, 3.98 mmol, 1.2 eq. mol) were sequentially added. The reaction was stirred under N2 at 40 °C overnight, quenched with a saturated solution of NH4Cl and extracted with EtOAc. The organic phases were washed with brine, dry over sodium sulfate and evaporated under reduced pressure. The crude was purified by gravity column chromatography on silica gel (gradient PE-EtOAc from 3:7 to EtOAc-MeOH 85:15) to afford BG-1 as a white solid (500 mg, 35 % yield). 1H NMR (400 MHz, CD3OD) δ 7.85 (brs, 1H), 7.50 (d, J = 6.2 Hz, 2H), 7.31 (d, J = 6.2 Hz, 2H), 5.54 (s, 2H), 4.38 (s, 2H), 3.28 (t, J = 6.7 Hz, 2H), 2.26 (t, J = 7.3 Hz, 2H), 1.63 (m, 4H), 1.41 (m, 2H). HRESIMS m/z [M + H]+ 410.20480 (calcd for C19H24N9O2, 410.20475).
4.1.2. Synthesis of BC-2
to a stirred solution of BC-NH2 (140 mg, 0.61 mmol, 1 eq. mol) in AN/DMF dry 1:1 (4 mL), 6-azidoesanoic acid (116 mg, 0.73 mmol, 1.2 eq. mol), triethylamine (333 μL, 2.44 mmol, 4 eq. mol) and T3P (sol. 50 % in EtOAc, 447 μL, 0.73 mmol, 1.2 eq. mol) were sequentially added. The reaction was stirred under N2 at 40 °C overnight, quenched with a saturated solution of NH4Cl and extracted with EtOAc. The organic phases were washed with brine, dry over sodium sulfate and evaporated under reduced pressure. The crude was purified by gravity column chromatography on silica gel (gradient PE-EtOAc from 3:7 to EtOAc-MeOH 85:15) to afford BC-2 as a pale-yellow solid (102 mg, 40 % yield). 1H NMR (400 MHz, CO(CD3)2) δ 7.97 (brs, 1H), 7.92 (d, J = 6.7 Hz, 2H), 7.40 (d, J = 6.6 Hz, 2H), 7.29 (d, J = 6.6 Hz, 2H), 6.20 (d, J = 5.7 Hz, 1H), 5.29 (s, 2H), 4.39 (d, J = 5.9, 2H), 3.33 (t, J = 6.8 Hz, 2H), 2.25 (t, J = 7.3 Hz, 2H), 1.63 (m, 4H), 1.42 (m, 2H). HRESIMS m/z [M + H]+ 370.19834 (calcd for C18H24N7O2, 370.19860).
4.2. DNA constructs
To obtain the E. coli expression vectors, similar procedures were applied for all the proteins. In particular, pQE-CLIP-tag and pQE-ogtMC8 plasmids were used as templates to amplify the relative genes, by using Nco-NZY-Fwd / QE-Rev oligonucleotides pairs (5′-AGGAGATATACCATGGCACACCATCACCATCACCATACGG-3′ / 5′-CATTACTGGAT CTATCAACAGGAG-3′). Subsequently, the amplified fragments and the pHTP1 (NZYTech, Portugal) recipient vector were digested with Nco I and Hind III restriction enzymes and ligated. Then, the resulting ligation mixture was used to transform E. coli DH5α competent cells and positive colonies were confirmed by DNA sequence analyses. Whereas pQE-ogtH5 was cloned as previously described [14]. In addition, to reach the final vector for the heterologous expression of SsOGT-MC8 mutant in T. thermophilus HB27EC strain [34], pQE-MC8 plasmid was first digested with BamHI and HindIII enzymes to recover the MC8 gene fragment, which was then subcloned in the pMK-ogtH5 [16].
4.3. Protein expression and purification
TSSNAP was purified as previously described [16]. CLIP-tag and SsOGT-MC8 proteins were expressed in E. coli BL21 Rosetta2 (DE3) cells, grown at 37 °C in Lysogeny Broth (LB) medium supplemented with 50 mg/L kanamycin and 30 mg/L chloramphenicol [35], [36], [37]. The protein expression was induced with 1 mM isopropyl-thio-β-d-galactoside (IPTG) when an OD600nm of 0.5–0.6 was reached. The biomass was collected and resuspended 1:3 (w/v) in purification buffer A (50 mM phosphate, 300 mM NaCl; pH 8.0) supplemented with 1 % Triton X-100 and stored overnight at −20 °C. Subsequently, it was treated in ice with lysozyme and DNAse for 60 min and sonicated as described [14]. After a centrifugation of 30 min at 60,000 × g, the cell extract was recovered and applied to a Protino Ni-NTA Column 1 mL (Macherey-Nagel) for His6-tag affinity chromatography, accordingly to the procedure previously described [27]. The eluted protein fractions were collected, dialysed against phosphate buffered saline (PBS 1×, 20 mM phosphate buffer, NaCl 150 mM, pH 7.3) and confirmed by SDS-PAGE analysis. To test the activity of purified proteins, 5 μM of enzyme was incubated with 10 μM of the relative substrate BG-FL and BC-TMR (New England Biolabs, USA) in 1 × Fluo Reaction Buffer (50 mM phosphate, 0.1 M NaCl, 1.0 mM DTT, pH 6.5) at 25 °C for CLIP-tag and 65 °C for SsOGT-H5 and SsOGT-MC8. After stopped the reaction in Laemmli buffer 1 × (formamide 95 %; EDTA 20 mM; bromophenol 0.05 %), samples were loaded on SDS-PAGE gel and analysed by gel-imaging technique on a VersaDoc 4000™ system (Bio-Rad) as previously reported [14], [15], [22].
4.4. In vivo fluorescent assay
For the in vivo assay, T. thermophilus HB27EC cells transformed with pMK184 (used as control), pMK-ogtH5 and pMK-ogtMC8 plasmids were grown at 65 °C in SC selective medium (tryptone 8 g/L, yeast extract 4 g/L, NaCl 3 g/L, in mineral water pH 7.5) supplemented with 30 mg/L kanamycin as late as stationary phase (OD600nm > 1.5) [16], [38]. Cell pellets from 1 mL were resuspended in 0.1 mL of SC medium in presence of 3 μM of BG-FL and BC-TMR fluorescent substrates and incubated at 65 °C for 1 h. After the reaction, cells were first washed twice with 1 mL of SC medium, then denatured for 15 min at 100 °C by adding a Laemmli buffer 1 × and directly loaded on SDS-PAGE.
4.5. Substrate specificity assay by competitive IC50 inhibition method
The substrate specificity of CLIP-tag, SsOGT-H5 and SsOGT-MC8 proteins were evaluated on BG-N3 and BC-N3 substrates by the competitive inhibition assay performed as described [15], [27], [29]. By using fixed concentrations of the fluorescent BC-TMR substrate (5 μM) and enzymes (5 μM), an increasing concentration of guanine/cytosine-azide derivatives (0–1 mM) was added to the mixtures. The reactions were incubated for 60 min at 25 °C for CLIP-tag and 65 °C for MC8 respectively and then stopped by adding Laemmli buffer 1 ×. Subsequently, the samples were loaded on SDS-PAGE and the fluorescent bands were measured by gel-imaging on a VersaDoc 4000™ system (Bio-Rad), by applying green LED/605 bandpass filter. Then, obtained data were plotted by IC50 equation [15], [25]. As control, 5 μM of SsOGT-H5 protein was incubated at 65 °C with 5 μM of the fluorescent substrate BG-FL in presence of an increasing amount of the BG/BC-azide derivatives (0–1 mM). In this case, fluorescent bands were visualized on VersaDoc 4000™ system (Bio-Rad), by applying a blue LED/530 bandpass filter.
4.6. Rate constants determination
5 μM of purified CLIP-tag, SsOGT-H5 and SsOGT-MC8 proteins were incubated in PBS 1 × buffer with an excess of BG-FL/BC-TMR substrates (20 μM) at 25 °C and 65 °C respectively, accordingly to the method described [12], [27].. Aliquots were taken at different times and the reactions were immediately stopped in 1 × Leammli Buffer. Finally, the samples were loaded on SDS-PAGE for the gel-imaging analysis and data were fitted to a pseudo-first-order reaction model using the GraFit 5.0 software package (Erithacus Software ltd.), and second-order rate constants k (in s−1 M−1) were obtained by dividing the pseudo-first-order constant by the concentration of substrate.
4.7. Capture and protein immobilization of SLPs
The SNAP-Capture Pull Down resin (New England Biolabs, binding capacity: 1 mg pure protein/mL of bed resin) 80 µL was washed with 1 × PBS and incubated with 120 µL of 1 mg / mL of protein (SNAP-tag / CLIP-tag / H5 / MC8 / C119A mutant). After one minute, 5 uL of supernatant (protein plus resin) was withdrawn and the rest was incubated for 16 h at room temperature. Afterwards, 5 μL of supernatant (protein plus resin) was withdrawn and all samples were first loaded on SDS-PAGE and then protein bands were determined by coomassie staining. Visible gel-imaging was used for the determination of intensity of each protein band.
4.8. Thermal stability analysis by using DSF method
The stability of the OGTs variants was analysed by the differential scan fluorimetry method (DSF) by following the protocol previously described [6], [15], [16], [25], [39]. Triplicates of each condition containing 25 μM of enzyme in PBS 1 × buffer and SYPRO Orange dye 1× (Invitrogen, USA) were subjected to a scan of 70 cycles at temperatures from 20 to 95 °C for 5 min/°C × cycle, in a CFX96 Touch Real-Time PCR (Bio-Rad). Relative fluorescence data were normalized to the maximum fluorescence value and plotted vs temperature. The resulting sigmoidal curves allowed the determination of the inflection points (Tm values) by fitting the Boltzmann equation [6], [15], [16], [25], [39].
4.9. Crystallographic studies
For crystallization trials, the protein was concentrated up to 7.5 mg/mL using Amicon Ultra-0.5 mL centrifugal filter devices (membrane MWCO = 10 kDa). Crystallization was performed by means of a robot-assisted (Oryx4; Douglas Instruments) sitting-drop-based spare-matrix strategy, using screen kits from Hampton Research and Qiagen, by the vapor diffusion method. Optimal SsOGT-MC8 crystals grew in two weeks at 20 °C in a drop obtained by mixing equal volumes of a protein and a reservoir solution containing 0.2 M tri-Potassium citrate and 20 % (w/v) PEG3350 in a final droplet volume of 2 µL, equilibrated against 50 μL of the reservoir solution. For X-ray data collection, crystals were cryoprotected in the precipitant solution supplemented with 12 % glycerol, mounted in a cryo-loop and flash frozen in liquid nitrogen for subsequent X-ray diffraction analysis. The crystals diffracted at 2.8 Å of resolution and data were collected at the ID30-B beamline at the European Synchrotron Radiation Facility (ESRF, Grenoble, France) equipped with a Pilatus PILATUS3 6 M 1000 µm Si sensor (Dectris) [40], at a wavelength of 1.008 Å. The diffraction data were indexed with XDS program [41], which assigned the crystal to the orthorhombic space-group P212121 with the following cell dimension: a = 100.83 Å, b = 119.43 Å, c = 180.24 Å, α = β = γ = 90°. This cell contained 10 molecules per asymmetric unit, with a corresponding solvent content of 56.9 % and a Matthews coefficient of 2.85. Data processing was carried out using the CCP4 program suite [42] and in particular the structure was solved by molecular replacement using the program Phaser of the PHENIX software suite [43]; the structure of the wild type SsOGT protein was used as the search model (pdb: 4ZYE) and the overall sequence identity between the two proteins was 95 %. The resulting electron density map was good enough to allow iterative cycles of manual model building using Coot [44] and refinement using phenix.refine [43] (Table 4). The atomic coordinates and structure factors of the SsOGT-MC8 have been deposited in the Protein Data Bank (http://www.rcsb.org) under the accession code PDB ID: 8AES.
4.10. Tertiary structure mode
Tertiary structure model of ChimeraCLIP, SsOGTCLIP and CLIP-tag were generated by Robetta (https://robetta.bakerlab.org/) servers [31]. The software PROCHECK from CCP4 program suite [44] was used to analyse the Ramachandran plot of generated models along with the bond angles and bond lengths of the protein structure.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
Acknowledgements
R.Me. and R.Ma. contributed equally to this work as First Author. G.P. would like to thank all the authors in the performing experiments and writing the present manuscript; Chiara Nobile, Marco Petruzziello and Giovanni del Monaco for technical assistance.
This work was supported by Fondazione CARIPLO (Ricerca biomedica condotta da giovani ricercatori, project 2020-3589).
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.csbj.2022.09.015.
Contributor Information
Riccardo Miggiano, Email: riccardo.miggiano@uniupo.it.
Giuseppe Perugino, Email: giuseppe.perugino@unina.it.
Appendix A. Supplementary data
The following are the Supplementary data to this article:
References
- 1.Liss V., Barlag B., Nietschke M., Hensel M. Self-labelling enzymes as universal tags for fluorescence microscopy, super-resolution microscopy and electron microscopy. Sci Rep. 2015;5:17740–17752. doi: 10.1038/srep17740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Juillerat A., Gronemeyer T., Keppler A., Gendreizig S., Pick H., et al. Directed evolution of O6-alkylguanine-DNA alkyltransferase for efficient labeling of fusion proteins with small molecules in vivo. Chem Biol. 2003;10:313–317. doi: 10.1016/s1074-5521(03)00068-1. [DOI] [PubMed] [Google Scholar]
- 3.Keppler A., Gendreizig S., Gronemeyer T., Pick H., Vogel H., et al. A general method for the covalent labeling of fusion proteins with small molecules in vivo. Nat Biotechnol. 2003;21:86–89. doi: 10.1038/nbt765. [DOI] [PubMed] [Google Scholar]
- 4.Gronemeyer T., Chidley C., Juillerat A., Heinis C., Johnsson K. Directed evolution of O6-alkylguanine-DNA alkyltransferase for applications in protein labelling. Protein Eng Des Sel. 2006;19:309–316. doi: 10.1093/protein/gzl014. [DOI] [PubMed] [Google Scholar]
- 5.Hinner M.J., Johnsson K. How to obtain labeled proteins and what to do with them. Curr Opin Biotechnol. 2010;21:766–776. doi: 10.1016/j.copbio.2010.09.011. [DOI] [PubMed] [Google Scholar]
- 6.Mollwitz B., Brunk E., Schmitt S., Pojer F., Bannwarth M., et al. Directed evolution of the suicide protein O⁶-alkylguanine-DNA alkyltransferase for increased reactivity results in an alkylated protein with exceptional stability. Biochemistry. 2012;51:986–994. doi: 10.1021/bi2016537. [DOI] [PubMed] [Google Scholar]
- 7.Keppler A., Pick H., Arrivoli C., Vogel H., Johnsson K. Labeling of fusion proteins with synthetic fluorophores in live cells. Proc Natl Acad Sci USA. 2004;10:9955–9959. doi: 10.1073/pnas.0401923101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mattossovich R., Merlo R., Miggiano R., Valenti A., Perugino G. O6-alkylguanine-DNA alkyltransferases in microbes living on the edge: from stability to applicability. Int J Mol Sci. 2020;21:2878. doi: 10.3390/ijms21082878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Streett H., Charubin K., Papoutsakis E.T. Anaerobic fluorescent reporters for cell identification, microbial cell biology and high-throughput screening of microbiota and genomic libraries. Curr Opin Biotechnol. 2021;71:151–163. doi: 10.1016/j.copbio.2021.07.005. [DOI] [PubMed] [Google Scholar]
- 10.Brun M.A., Tan K.-T., Griss R., Kielkowska A., Reymond L., Johnsson K. A semisynthetic fluorescent sensor protein for glutamate. J Am Chem Soc. 2012;134:7676–7678. doi: 10.1021/ja3002277. [DOI] [PubMed] [Google Scholar]
- 11.Xue L., Yu Q., Griss R., Schena A., Johnsson K. Bioluminescent antibodies for point-of-care diagnostics. Angew Chem Int Ed. 2017;56:7112–7116. doi: 10.1002/anie.201702403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Gautier A., Juillerat A., Heinis C., Corrêa I.R., Kindermann M., et al. An engineered protein tag for multiprotein labeling in living cells. Chem & Biol. 2008;15:128–136. doi: 10.1016/j.chembiol.2008.01.007. [DOI] [PubMed] [Google Scholar]
- 13.Gautier A., Nakata E., Lukinavicius G., Tan K.-T., Johnsson K. Selective cross-linking of interacting proteins using self-labeling tags. J Am Chem Soc. 2009;131:17954–17962. doi: 10.1021/ja907818q. [DOI] [PubMed] [Google Scholar]
- 14.Perugino G., Vettone A., Illiano G., Valenti A., Ferrara M.C., et al. Activity and regulation of archaeal DNA alkyltransferase: conserved protein involved in repair of DNA alkylation damage. J Biol Chem. 2012;287:4222–4231. doi: 10.1074/jbc.M111.308320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Perugino G., Miggiano R., Serpe M., Vettone A., Valenti A., et al. Structure-function relationships governing activity and stability of a DNA alkylation damage repair thermostable protein. Nuc Ac Res. 2015;43:8801–8816. doi: 10.1093/nar/gkv774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Vettone A., Serpe M., Hidalgo A., Berenguer J., del Monaco G., et al. A novel thermostable protein-tag: optimization of the Sulfolobus solfataricus DNA-alkyl-transferase by protein engineering. Extremophiles. 2016;20:1–13. doi: 10.1007/s00792-015-0791-9. [DOI] [PubMed] [Google Scholar]
- 17.Visone V., Han W., Perugino G., del Monaco G., She Q., et al. In vivo and in vitro protein imaging in thermophilic archaea by exploiting a novel protein tag. PLoS ONE. 2017;12:e0185791. doi: 10.1371/journal.pone.0185791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Merlo R., Del Prete S., Valenti A., Mattossovich R., Carginale V., et al. An AGT-based protein-tag system for the labelling and surface immobilization of enzymes on E. coli outer membrane. J Enz Inhib Med Chem. 2019;34:490–499. doi: 10.1080/14756366.2018.1559161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Del Prete S., Merlo R., Valenti A., Mattossovich R., Rossi M., et al. Thermostability enhancement of the α-carbonic anhydrase from Sulfurihydrogenibium yellowstonense by using the anchoring-and-self-labelling-protein-tag system (ASLtag) J Enz Inhib Med Chem. 2019;34:946–954. doi: 10.1080/14756366.2019.1605991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Fang Q., Kanugula S., Pegg A.E. Function of domains of human O6-alkylguanine-DNA alkyltransferase. Biochemistry. 2005;44:15396–15405. doi: 10.1021/bi051460d. [DOI] [PubMed] [Google Scholar]
- 21.Daniels D.S., Mol C.D., Arvai A.S., Kanugula S., Pegg A.E., et al. Active and alkylated human AGT structures: a novel zinc site, inhibitor and extrahelical base binding. EMBO J. 2000;19:1719–1730. doi: 10.1093/emboj/19.7.1719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Miggiano R., Casazza V., Garavaglia S., Ciaramella M., Perugino G., et al. Biochemical and structural studies of the Mycobacterium tuberculosis O6-methylguanine methyltransferase and mutated variants. J Bac. 2013;195:2728–2736. doi: 10.1128/JB.02298-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Daniels D.S., Woo T.T., Luu K.X., Noll D.M., Clarke N.D., et al. DNA binding and nucleotide flipping by the human DNA repair protein AGT. Nat Struct Mol Biol. 2004;11:714–720. doi: 10.1038/nsmb791. [DOI] [PubMed] [Google Scholar]
- 24.Rossi F., Morrone C., Massarotti A., Ferraris D.M., Valenti A., et al. Crystal structure of a thermophilic O6-alkylguanine-DNA alkyltransferase-derived self-labeling protein-tag in covalent complex with a fluorescent probe. Biochem Biophys Res Commun. 2018;500:698–703. doi: 10.1016/j.bbrc.2018.04.139. [DOI] [PubMed] [Google Scholar]
- 25.Morrone C., Miggiano R., Serpe M., Massarotti A., Valenti A., et al. (107) Interdomain interactions rearrangements control the reaction steps of a thermostable DNA alkyltransferase. Biochim Biophys Acta Gen Subj. 1861:86–96. doi: 10.1016/j.bbagen.2016.10.020. [DOI] [PubMed] [Google Scholar]
- 26.Wilhelm J., Kühn S., Tarnawski M., Gotthard G., Tünnermannm J., et al. Kinetic and structural characterization of the self-labeling protein tags HaloTag7, SNAP-tag, and CLIP-tag. Biochemistry. 2021;60:2560–2575. doi: 10.1021/acs.biochem.1c00258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Merlo R., Caprioglio D., Cillo M., Valenti A., Mattossovich R., et al. The SNAP- tag technology revised: an effective chemo-enzymatic approach by using a universal azide-based substrate. J Enz Inhib Med Chem. 2021;36:85–97. doi: 10.1080/14756366.2020.1841182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Farag N., Mattossovich R., Merlo R., Nierzwicki Ł., Palermo G., et al. Folding-upon-repair DNA nanoswitches for monitoring the activity of DNA repair enzymes. Angew Chem Int Ed. 2021;60:7283–7289. doi: 10.1002/anie.202016223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Mattossovich R., Merlo R., Fontana A., d’Ippolito G., Terns M.P., et al. A journey down to hell: new thermostable protein-tags for biotechnology at high temperatures. Extremophiles. 2020;24:81–91. doi: 10.1007/s00792-019-01134-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Miggiano R., Valenti A., Rossi F., Rizzi M., Perugino G., et al. Every OGT is illuminated… by fluorescent and synchrotron lights. Int J Mol Sci. 2017;18:2613. doi: 10.3390/ijms18122613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Kim D.E., Chivian D., Baker D. Protein structure prediction and analysis using the Robetta server. Nuc Ac Res. 2004;32:W526–W531. doi: 10.1093/nar/gkh468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Elder R.H., Margison G.P., Rafferty J.A. Differential inactivation of mammalian and Escherichia coli O6-alkylguanine-DNA alkyltransferases by O6-benzylguanine. Biochem J. 1994;298:231–235. doi: 10.1042/bj2980231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Goodtzova K., Kanugula S., Edara S., Pauly G.T., Moschel R.C., et al. Repair of O6-benzylguanine by the Escherichia coli Ada and Ogt and the human O6-alkylguanine-DNA alkyltransferase. J Biol Chem. 1997;272:8332–8339. doi: 10.1074/jbc.272.13.8332. [DOI] [PubMed] [Google Scholar]
- 34.Swarts D.C., Jore M.M., Westra E.R., Zhu Y., Janssen J.H., et al. DNA-guided DNA interference by a prokaryotic Argonaute. Nature. 2014;507:258–261. doi: 10.1038/nature12971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bertani G. Studies on lysogenesis. I. The mode of phage liberation by lysogenic Escherichia coli. J Bacteriol. 1951;62:293–300. doi: 10.1128/jb.62.3.293-300.1951. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bertani G. Lysogeny at mid-twentieth century: P1, P2, and other experimental systems. J Bacteriol. 2004;186:595–600. doi: 10.1128/JB.186.3.595-600.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ezraty B., Henry C., Hérisse M., Denamur E., Barras F. Commercial Lysogeny Broth culture media and oxidative stress: a cautious tale. Free Radic Biol Med. 2014;74:245–251. doi: 10.1016/j.freeradbiomed.2014.07.010. [DOI] [PubMed] [Google Scholar]
- 38.Cava F., Hidalgo A., Berenguer J. Thermus thermophilus as biological model. Extremophiles. 2009;13:213. doi: 10.1007/s00792-009-0226-6. [DOI] [PubMed] [Google Scholar]
- 39.Niesen F.H., Berglund H., Vedadi M. The use of differential scanning fluorimetry to detect ligand interactions that promote protein stability. Nat Protoc. 2007;2:2212–2221. doi: 10.1038/nprot.2007.321. [DOI] [PubMed] [Google Scholar]
- 40.McCarthy A.A., Barrett R., Beteva A., Caserotto H., Dobias F., et al. ID30B - a versatile beamline for macromolecular crystallography experiments at the ESRF. J Synchrotron Radiat. 2018;1(25):1249–1260. doi: 10.1107/S1600577518007166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Kabsch W. XDS. Acta Crystallogr D. 2010;66:125–132. doi: 10.1107/S0907444909047337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Potterton L., Agirre J., Ballard C., Cowtan K., Dodson E., et al. CCP4i2: the new graphical user interface to the CCP4 program suite. Acta Crystallogr D Struct Biol. 2018;74:68–84. doi: 10.1107/S2059798317016035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Adams P.D., Afonine P.V., Bunkoczi G., Chen V.B., Davis I.W., et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D Biol Crystallogr. 2010;66:213–221. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Collaborative Computational Project Number 4 (CCP4) (1994) The CCP4 suite: programs for protein crystallography. Acta Crystallogr D Biol Crystallogr. 1994;50:760–763. doi: 10.1107/S0907444994003112. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.








