Abstract
Labeling of native proteins invites interest from diverse segments of science. However, there remains the significant unmet challenge in precise labeling at a single site of a protein. Here, we report the site-specific labeling of natural or easy-to-engineer N-terminus Gly in proteins with remarkable efficiency and selectivity. The method generates a latent nucleophile from N-terminus imine that reacts with an aldehyde to deliver an aminoalcohol under physiological conditions. It differentiates N-Gly as a unique target amongst other proteinogenic amino acids. The method allows single-site labeling of proteins in isolated form and extends to lysed cells. It administers an orthogonal aldehyde group primed for late-stage tagging with an affinity tag, 19F NMR probe, and a fluorophore. A user-friendly protocol delivers analytically pure tagged proteins. The mild reaction conditions do not alter the structure and function of the protein. The cellular uptake of fluorophore-tagged insulin and its ability to activate the insulin-receptor mediated signaling remains unperturbed.
Subject terms: Chemical modification, Chemical tools, Chemical modification, Chemical tools
Single-site labelling of proteins is desirable, e.g., for analytical purposes. Here, the authors developed a method in which they use an aldol-type reaction to modify proteins at N-terminal glycine residues in an efficient and selective manner, which is also applicable to cell lysates.
Introduction
Single-site labeling of proteins facilitates investigation of several biological processes through attachment of biophysical probes, imaging probes, and toxins. Such labeling emerges through pre-engineered protein equipped with unnatural amino acids1, an amino acid sequence recognized by enzymes2, and single cysteine3,4 or a cysteine placed in a π-clamp5. The understanding of organic chemistry with proteins has nurtured the growth of bioconjugation. These biomolecules can be perceived as a macromolecular substrate with multiple nucleophilic functional groups (NuP). The obvious route for bioconjugation would involve its reaction with an electrophile (Fig. 1a). For single-site labeling in such cases, a functional group has to compete with all the other nucleophilic residues (chemoselectivity) and its additional copies (site-selectivity). The chemoselective labeling of low-frequency residues (Cys6,7, Tyr8, and Trp9) can address the latter to some extent. For other cases, the single-site labeling of side chain functionalities could be driven through ligand–protein interaction10, linchpin directed modification11 or targeting the ε-amine of most reactive Lys residue (Nε-NH2)12–16. Alongside, the N-terminus α-amine (Nα-NH2) has established its place as a notable reactivity hotspot. In a remarkable example, the biomimetic transformation of N-terminus residue delivers a carbonyl group primed for subsequent orthogonal reactions17.
The kinetic labeling of Nα-NH2 can be achieved through its chemoselective modification with an electrophile as determined by a few groups including us (Fig. 1a)18–30. The alternate route involves a latent electrophile (EL) generated in the form of imine from Nα-NH2 and aldehyde (Fig. 1b). It gives us the opportunity to address the two challenges in two separate steps. If we can generate EL chemoselectively, the nucleophile is only required to address the site-selectivity. This approach delivers reductive alkylation in a chemoselective transformation (path a, Fig. 1b)31. However, the site-selectivity gets compromised very early in the reaction in the presence of competing imines. On the contrary, the nucleophilic attack of backbone amide to imine generated by Nα-NH2 and 2-pyridinecarboxaldehyde can render isolable imidazolidinone with high selectivity (path b, Fig. 1b)32. Unfortunately, none of these methodologies can distinguish one N-terminus residue from the other. In this perspective, N-terminal Cys containing proteins can render unique reactivity to form thiazolidine with an aldehyde or 2-cyanobenzothiazole (path c, Fig. 1b)33–35. However, it is a challenge to identify unique reactivity for other N-terminus residues. In particular, selective targeting of N-terminus Gly poses a prominent complexity as there is no side-chain residue to assist.
Here, we demonstrate that an aldehyde with appropriately designed hydrogen bond acceptor can result in the exclusive labeling of N-terminus Gly residue. The remarkable selectivity of C–C bond formation extends from an amino acid to structurally diverse proteins. Besides, the protocol operates conveniently in a mixture of proteins or cell lysate. The residue-specific installation of an orthogonal aldehyde group renders analytically pure tagged proteins in high yields through the late-stage single-site introduction of the desired probe.
Results
We hypothesized that a multi-step transformation could provide a roadmap to address the challenge of chemoselectivity and residue selectivity. The initial generation of the EL with capability to generate another reactive center (latent nucleophile, NuL, path d) could provide the platform. While the first step can address chemoselectivity, the latter can give us an opportunity to explore the residue specificity. Also, the NuL generated with a protein will be required to display orthogonal reactivity in the presence of proteinogenic nucleophiles (Fig. 1c). At first, the challenge was to design an imine that can generate the NuL under physiological conditions. We hypothesized that H-bond acceptors in the imine group (Fig. 1b) could assist proton shuttling and stabilize the NuL. We anticipated that the intramolecular H-bond acceptors might overcome the competing intermolecular interactions with protein and solvent. Subsequently, we can explore the reactivity of NuL towards an external electrophile (Fig. 1c). Also, we anticipated that the unsubstituted amino acid (Gly) could potentially render a reactivity profile different from all the other substituted amino acids.
Aminoalcohol formation with Gly under ambient conditions
We selected ortho-substituted benzaldehyde to garner the advantage of geometrical constraint imposed by the aromatic ring. We placed one H-bond acceptor ortho to the aldehyde in the form of a methoxy group (2a, Fig. 2a). The reaction of Gly amide 1a with reagent 2a fails to result in any adduct (Fig. 2a). Next, we designed the reagent 2b that proffers a carbonyl group as H-bond acceptor. It was exciting to note the formation of isolable product (3ab, 27% conversion, LC-MS). Encouraged by this result, we investigated the effect of two oxygen centers as H-bond acceptors in the reagent 2c. We found it to be inefficient in improving the conversion to the product (3ac, 33%, LC-MS). The redesigned reagent 2d led to 63% conversion of the product that further enhanced (3a, > 95%, LC-MS) with reagent 2e. A non-polar derivative of product (3a) was purified and characterized as aminoalcohol after thorough mass and NMR analysis (Supplementary Figs. 13–27).
Unsubstituted amino acid versus substituted amino acids
In another exciting result, we noted that the imine formed by Ala 1b remained inert towards any subsequent transformation contrary to Gly 1a (Fig. 2b and Supplementary Table 2). With this data in hand, we examined whether all the other proteinogenic amino acids (1c–1t) behave similarly to Ala 1b. No trace of product 3b–3t except imine formation 4b–4t was observed with the amino acids 1b–1t (Fig. 2b and Supplementary Table 2). In a control experiment, the vortexing of Phe (1n) under the reaction conditions did not result in its racemization (Fig. 2c and Supplementary Fig. 48). These results reconfirm the presence of a high barrier to the formation of the NuL in substituted amino acids (1b–1t, Fig. 2d). We believe that the unfavorable orientation of the H-bond acceptor in these cases inhibit the process. Also, the imine from the Lys side chain (Nɛ-NH2) remains unreactive for any subsequent transformation (see Supplementary Fig. 73).
N-terminus Gly labeling of large peptides and proteins
To explore the translation of aminoalcohol chemistry, we selected melittin 6a [26 residues, one Nα-NH2 (Gly), three Lys Nε-NH2]. The reaction of reagent 2e with 6a led to mono-labeled melittin 7a (52% conversion, Fig. 3a); in a control experiment, RNase A (N-terminus Lys) resulted in no detectable product in 72 h. We confirmed the site of labeling (N-terminus Gly) by MS (labeled G1-Q26, m/z 3027.51 [M+H]+) followed by MS-MS (Supplementary Fig. 49). Next, we selected a diagnostically relevant essential hemoprotein myoglobin 6b [153 residues, one Nα-NH2 (Gly), nineteen Lys Nε-NH2] and vortexed it with aldehyde 2e. The mono-labeled myoglobin (7b, 40% conversion) is formed exclusively in this case (Fig. 3a). Peptide mapping of the enzymatic digest (labeled G1-K16, m/z 1996.37 [M+H]+) and MS-MS confirmed the site of labeling at N-terminus Gly (Supplementary Fig. 50). It was imperative to evaluate the chemistry with recombinant proteins. The engineering of Gly at the N-terminus of proteins is convenient36. In bacteria, the N-terminal Met is excised from proteins exposing the penultimate Gly with high efficiency37. Also, the recombinant proteins expressed with a recognition sequence for prescission protease renders the desired N-terminus Gly post proteolytic digestion38. We adopted the latter route and generated small Ubiquitin-like modifier (SUMO1) protein 6c with a Gly residue at the N-terminus (see Supplementary information for details). The labeling of 6c with 2e went smoothly to result in N-terminus mono-labeled SUMO1 (7c, 43% conversion, Fig. 3a). Peptide mapping (labeled G1-L3, m/z 466.37 [M+H]+) and MS-MS confirmed the site of labeling at N-terminus Gly (Supplementary Fig. 51). Next, we selected insulin 6d that contains two N-terminus [51 residues, two Nα-NH2, chain A (Gly), chain B (Phe), one Lys Nε-NH2]. The imine formed by Nα-NH2 (Phe) of chain B is established to exhibit the preferential reactivity21–23. However, the vortexing of insulin 6d with the reagent 2e results in the mono-labeled insulin 7d in 71% conversion (Fig. 3a). The disulfide reduction of 7d, peptide mapping, and MS-MS led to the identification of Nα-NH2 (Gly) modification in chain A (labeled G1-N21, m/z 2563.00 [M+H]+, Supplementary Fig. 52). Next, we selected a mixture of seven proteins (6d–6j, Fig. 3b). Gratifyingly, the labeling experiment with reagent 2e resulted in exclusive labeling of N-terminus Gly bearing insulin 6d to render mono-labeled product 7d (42% conversion, Fig. 3b and Supplementary Fig. 53). The methodology exhibits remarkable selectivity as none of the other residues in any protein interfere.
Access to analytically pure tagged proteins
At this stage, understanding the impact of functional group derivatizations of 2e on the reaction was essential for achieving the downstream tasks. We found that the amide linkage could be used while redesigning the reagent 2e to a symmetric bis-aldehyde (2f and 2g, Fig. 4a) with the handle for late-stage modification. Initially, the reaction of insulin with bis-aldehyde 2f resulted in poor conversions. However, the incorporation of polyethylene glycol (PEG)3 linker in redesigned aldehyde 2g regained the solubility of reagent and efficiency of transformation (88%, step 1, Fig. 4b; for the reaction of proteins 6a–6c and PfAOS1 6k with 2g, see Supplementary Figs. 62–64). We did not observe the formation of insulin–insulin complex (Supplementary Table 3 and Fig. 65). Initially, we examined the efficiency of late-stage installation with benzyloxyamine. The product characterization involved disulfide reduction, peptide mapping, and MS-MS (Supplementary Figs. 54 and 55). Subsequently, we showcased the potential of this transformation with derivatives of O-hydroxylamine such as 19F NMR probe 10a, biotin 10b, and a fluorophore 10c (step 2a, Fig. 4b). The excellent overall conversion (85%, 87%, and 87%, respectively) over the two steps (steps 1 and 2a; route 1) while retaining the selectivity is noteworthy (Supplementary Figs. 56–59). 19F NMR probe attached insulin 11a shows a sharp NMR signal at −62.80 ppm (Fig. 4 and Supplementary Fig. 66). The incubated mixture of biotinylated insulin 11b and streptavidin (Stv-n) renders a sharp band at ~70 kDa for Stv-11b complex (Supplementary Fig. 67). The coumarin attached insulin 11c results in fluorescence emission at 428 nm (excitation at 333 nm, Supplementary Fig. 68).
Our next goal was to integrate a purification protocol with the aminoalcohol chemistry (steps 1, 2b, and 3; route 2; Fig. 4b). Initially, we mixed the reaction mixture of step 1 (9 + 6d) with hydrazide functionalized resin (step 2b, Fig. 4b). This step exhibits excellent efficiency and results in > 95% single-site immobilization of the labeled protein (9). The unreacted protein (6d) was recovered with > 95% efficiency and recycled. Finally, the immobilized protein (13) was released with O-hydroxylamine derivatives (10a–10c) from resin through transoximization (step 3, Fig. 4b). The centrifugal spin concentration results in the analytically pure single-site tagged protein (Supplementary Figs. 60 and 61). The single-site immobilized labeled insulin rendered efficient parallel installation of 19F NMR probe, biotin, and fluorophore (three steps, two purifications; 54%, 57%, and 58% overall isolated yields; Supplementary Figs. 60 and 61). The coumarin tagged insulin (11c) exhibits excellent product stability over a broad range of pH (3–11) at 25 °C for 36 h (Supplementary Fig. 76).
Bioactivity assays
Subsequently, we asked how the labeling would affect the structure and activity of insulin. The earlier investigations suggest that Nα-NH2 (Gly) of chain A is important for insulin structure and function39,40. The circular dichroism of coumarin tagged insulin (11c) confirmed the conservation of structure (Supplementary Fig. 69). We examined the activity in cell-based assays using 11c through its ability to activate insulin receptor (IR) mediated signaling. Activation of insulin-dependent IR-signaling was assessed by the increase in levels of phospho-Akt (pSer-473) and subsequent uptake of insulin inside the cells. The prior is detected by phospho-Akt (pAkt) specific antibodies on western blotting and immunofluorescence. The latter is assessed through the coumarin fluorophore signal inside cells. While the untreated (mock) cells exhibit basal pAkt reactivity, cells treated with 11c or untagged insulin (6d) render enhanced phospho-Akt (pAkt) reactivity relative to loading control protein (GAPDH) (Supplementary Fig. 78a). The quantitation of pAkt signals relative to loading control GAPDH is represented in Supplementary Fig. 78b. These observations directed us towards the estimation of EC50 values for native and modified insulin41. Both native and modified insulin activated the insulin receptor and pAkt generation with similar efficacy (Fig. 5a). The EC50 values were estimated to be 2.7 and 2.9 nM for native (6d) and modified insulin (S29), respectively (Fig. 5b). In the cellular uptake assays, we observed that cells treated with 11c display coumarin signal accumulated throughout the cells indicating their regular uptake (Fig. 5c, 1st row, middle panel). This signal can be competed out in the presence of excess untagged insulin (11c:6d, 1:3; Fig. 5c, 1st row, right panel). Concomitant to this, we observe a similar increase in pAkt signals (pSer-473) in tagged or untagged insulin-treated cells (Fig. 5d, 1st row middle and right panels). These observations unambiguously establish that aminoalcohol derivatization of N-Gly in insulin results in no adverse effect on its capability to activate the insulin receptor signaling. Next, we investigated the SUMOylation reaction that provides a versatile tool to identify and characterize novel SUMO enzymes and their substrates42. We reconstituted the SUMOylation reaction with modified SUMO1 and monitored the SUMO1-substrate conjugate. The data concludes that the N-Gly modification of SUMO1 does not interfere with its biochemical activity (for details, see Supplementary Fig. 79).
Residue-specific labeling of a protein in cell lysate
Finally, we challenged the methodology to precisely label the N-terminus Gly containing protein in a cell lysate prepared from E. coli BL21 (DE3) cells overexpressing GST-SUMO1. Proteolytic digest generates N-terminal Gly containing SUMO1 in a mixture of multiple other proteins present in the lysate sample (lanes 1 and 2, Fig. 6). Such a lysate was vortexed with the symmetrical bis-aldehyde 2g. The late-stage installation of coumarin on the N-terminal Gly residue is visualized by SDS-PAGE and fluorescence imaging. The labeling of SUMO1 is observed without noticeable participation of any other protein (lane 4, Fig. 6).
Discussion
In summary, we deliver a methodology that enables single-site residue-specific labeling of a protein. We demonstrated that a systematic multi-step approach could provide the platform to solve the multifaceted selectivity challenges. The chemoselective generation of an electrophilic intermediate followed by site- and residue-specific formation of a latent nucleophile is the key to success. The reaction proceeds exclusively with the N-terminus Gly residue of a protein. The selectivity remains uncompromised even when the method is extended to a protein in the cell lysate. The reaction excludes the N-terminus α-amine of other proteinogenic amino acids and multiple Lys residues bearing ε-amine. The absence of competing reactions is one of the most noteworthy features of this chemical technology. The symmetric bis-aldehyde reagent allows installation of various tags on single-site labeled protein. A user-friendly purification protocol enables the isolation of analytically pure tagged protein. The overall yield of the product after three steps is remarkable. The protocol offers convenient imaging of insulin in HEK293T cells without perturbing its structure and activity.
Methods
Single-site labeling of a protein
In a 1.5 mL microcentrifuge tube, protein (3 nmol) was mixed with sodium bicarbonate buffer (120 µl, 0.1 M, pH 7.8). To this solution, 2-(2-formylphenoxy) acetic acid 2e (1500 nmol) in DMSO (30 µl) from a freshly prepared stock solution was added and vortexed at 25 °C. The overall concentration of protein and 2-(2-formylphenoxy)acetic acid was 20 µM and 10 mM respectively. After 24–48 h, the reaction mixture was diluted with acetonitrile:water (10:90, 3000 µl). Unreacted 2-(2-formylphenoxy)acetic acid and salts were removed by using Amicon® Ultra-0.5 mL 3-kDa or 10-kDa MWCO centrifugal spin concentrator. The protein mixture was further washed with Millipore Grade I water (5 × 0.4 mL). The sample was analyzed by ESI-MS or MALDI-ToF-MS. The aqueous sample was concentrated by lyophilization before subjecting it to digestion, peptide mapping, and sequencing by MS-MS.
Late-stage tagging and purification of a protein
Hydrazide beads (200 µl, hydrazide resin loading: 16 µmol mL−1) were taken in a 5 mL fritted polypropylene chromatography column with end tip closures. Phosphate buffer (0.1 M, pH 7.0, 5 × 1 mL) was used to wash the beads. The beads were re-suspended in phosphate buffer (100 µl, 0.1 M, pH 7.0). Protein mixture (250 µM) in phosphate buffer (150 µl, 0.1 M, pH 7.0) and aniline (100 mM) in phosphate buffer (100 µl, 0.1 M, pH 7.0) were added to the beads followed by end-to-end rotation (30 rpm, rotary mixer) at 25 °C. The progress of the immobilization of the labeled protein on hydrazide resin was monitored by UV-absorbance of the supernatant. After 8–10 h, the supernatant was collected and the beads were washed with phosphate buffer (0.3 M, pH 7.3, 4 × 1 mL) and KCl (1 M, 3 × 1 mL) to remove the adsorbed protein from resin. The beads were further washed with the phosphate buffer (0.3 M, pH 7.0, 4 × 1 mL) and re-suspended (phosphate buffer, 200 µl, 0.3 M, pH 7.0). To release the labeled protein from its immobilized derivative, aniline (100 mM) in phosphate buffer (100 µl, 0.3 M, pH 7.0) and coumarin or fluoro or biotin derivatives (only one at a time) of O-hydroxylamine (50 µl, 150 mM in DMSO) were added followed by vortex at 25 °C for 6–8 h. The supernatant was collected while the salts, aniline, and O-hydroxylamine were removed using the spin concentrator (3 kDa MWCO). The purity of the labeled protein was confirmed by ESI-MS. The purification protocol renders analytically pure tagged protein.
Residue-specific labeling of a protein in cell lysate
The cell lysate was prepared from E. coli BL21 (DE3) cells overexpressing GST-SUMO1 (section 2d in Supplementary information). Protein concentration was determined by BCA assay and adjusted to 1.0 mg mL−1. Protein (10 nmol) in sodium bicarbonate buffer (120 µl, 0.1 M, pH 7.8) was taken in a 1.5 mL microcentrifuge tube. To this solution, N,N’-(((oxybis(ethane-2,1-diyl))bis(oxy))bis(propane-3,1-diyl))bis(2-(2-formylphenoxy)acetamide) 2g (5000 nmol) in DMSO (30 µl) from a freshly prepared stock solution was added and vortexed at 25 °C. After 24 h, the reaction mixture was diluted with acetonitrile:buffer (10:90, 1500 µl). Unreacted N,N’-(((oxybis(ethane-2,1-diyl))bis(oxy))bis(propane-3,1-diyl))bis(2-(2-formylphenoxy)acetamide) 2g and salts were removed by spin concentrator (0.5 mL 3-kDa MWCO). The solution was further washed with sodium bicarbonate buffer (0.1 M, pH 7.8) and concentrated to 160 µl. To the concentrated sample in sodium bicarbonate buffer, derivatives of O-hydroxylamine 7-((3-(aminooxy)propyl)thio)-4-methyl-2H-chromen-2-one (2 µmol) 10c in DMSO (40 µl) from a freshly prepared stock solution was added to convert mono-labeled protein to its oxime derivative for 3–6 h. The excess of O-alkoxyamine and salts were removed by the spin concentrator. The sample was analyzed by SDS-PAGE.
Biological assays
Imaging assays: The HEK293T (Cell Repository, NCCS, Pune India) cells were grown in a six-well plate with coverslips in Dulbecco’s modified Eagle’s medium (DMEM) containing 1% serum for 24 h. Subsequently, the cells were washed twice with PBS and treated with coumarin tagged 11c and untagged insulin 6d (3 µg mL−1, 0.5 µM) for 30 min in 10% FBS containing DMEM media. For competition experiment, tagged and untagged insulin were used in 1:3 ratios in final 2.0 µM insulin concentration. Post-treatment, cells were again washed twice with PBS and fixed using 100% chilled methanol for 15 min at −20 °C. The cells were then rehydrated and permeabilized with rehydration buffer (10 mM Tris, 150 mM NaCl, 0.1% TritonX-100) for 10 min. For coumarin tagged insulin imaging, nuclei were stained with Hoechst 33342 (Invitrogen) directly after permeabilization and images were taken. For pAkt imaging, cells were blocked with 5% Normal Goat Serum (NGS) for 30 min at room temperature after rehydration. The cells were stained overnight with pAkt antibodies (1:200, Cell Signaling Technology) at 4 °C. After primary antibody incubation, cells were washed three times with PBS-T (5 min each). Alexa Fluor-568 conjugated goat anti-rabbit IgG (1:800, Life Technologies) secondary antibody was used against pAkt. After this, the nuclei were stained with Hoechst 33342 (Invitrogen), and fluorescence images were captured on APOTOME/Zeiss LSM 780 confocal microscope. All image analysis was performed using ZEN (Zeiss) or Image J software.
EC50 analysis
For the assay, 50,000 human embryonic kidney cells (HEK293T) were grown in each well of a 96- well plate to ~50% confluence in DMEM supplemented with 10% fetal bovine serum (FBS) and antibiotics in a humidified incubator with 5% CO2 at 37 °C. The cells were serum starved for 12 h in no serum DMEM media and then treated with varying concentrations (100, 10, 5, 2.5, 1, 0.5, 0.1 nM) of insulin and modified insulin in no serum DMEM media for 40 min at 37 °C. After the treatment, the insulin-containing media was removed, cells were washed and intracellular level of pAkt Ser473 was measured using HTRF pAkt Ser473 kit (Cisbio, France, Catalogue No. 64AKSPET and Perkin Elmer, Catalogue No. TRF4002C) following the manufacturer's protocol. In brief, cells were lysed in 50 µl of supplemented lysis buffer under mild horizontal shaking at room temperature for 45 min. The homogenized cell lysate (16 µl) was added to 4 µl of the premixed antibody solutions in the HTRF 96-well detection plate. After 4 h of incubation at room temperature, the absorbance was recorded in a Cytation 5 multi-mode reader (BioTek Instrument, USA).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This work is supported by SERB, DBT, DAE, Ramanujan Fellowship, and IISER Bhopal. L.P., S.R.A., D.G.R., U.S. and M.G. are the recipients of a research fellowship from CSIR, UGC, and IISER Bhopal. We thank Dr. Benjamin Rotstein (University of Ottawa, Canada) for his critical comments. We thank Dr. Manoj Kumar Bhat and Dr. Jomon Joseph of NCCS, Pune for providing the IRβ over-expressing CHO cells.
Author contributions
V.R. and L.P. conceived the research. L.P., S.R.A. and U.V.B. designed and performed the bioconjugation experiments. R.K.M. designed the experiments related to insulin bioactivity assay. U.S., M.G., L.P. and U.V.B. performed the biological assays. D.G.R. synthesized and characterized a set of reagents. All the authors wrote the manuscript.
Data availability
All data supporting the findings of this study are available within the Article and its accompanying Supplementary Information file. The source data underlying Figs. 5a, 6b and Supplementary Figures 63b, 67, 68, 69, 72, 78a, 79c is provided in the Source Data file. Raw data for all the NMR spectra and HPLC-MS/ESI-MS/MALDI-ToF-MS can be accessed from the corresponding author upon reasonable request.
Competing interests
V.R. is the founder of Plabeltech Private Limited. A patent application has been filed on this work with V.R. and L.P. as inventors (Patent application no. WO-2018104962-A1, June 14, 2018). The remaining authors declare no competing interests.
Footnotes
Journal peer review information: Nature Communications thanks Knud Jensen and other anonymous reviewer(s) for their contribution to the peer review of this work.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary Information accompanies this paper at 10.1038/s41467-019-10503-7.
References
- 1.Krall N, da Cruz FP, Boutureira O, Bernardes GJL. Site-selective protein- modification chemistry for basic biology and drug development. Nat. Chem. 2016;8:103–113. doi: 10.1038/nchem.2393. [DOI] [PubMed] [Google Scholar]
- 2.Rashidian M, Dozier JK, Distefano MD. Enzymatic labeling of proteins: techniques and approaches. Bioconjugate Chem. 2013;24:1277–1294. doi: 10.1021/bc400102w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Shinmi D, et al. One-step conjugation method for site-specific antibody-drug conjugates through reactive cysteine-engineered antibodies. Bioconjugate Chem. 2016;27:1324–1331. doi: 10.1021/acs.bioconjchem.6b00133. [DOI] [PubMed] [Google Scholar]
- 4.Bernardim B, et al. Stoichiometric and irreversible cysteine-selective protein modification using carbonylacrylic reagents. Nat. Commun. 2016;7:13128. doi: 10.1038/ncomms13128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zhang C, et al. π-Clamp-mediated cysteine conjugation. Nat. Chem. 2016;8:120–128. doi: 10.1038/nchem.2413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mantovani G, et al. Design and synthesis of N-maleimido-functionalized hydrophilic polymers via copper-mediated living radical polymerization: a suitable alternative to PEGylation chemistry. J. Am. Chem. Soc. 2005;127:2966–2973. doi: 10.1021/ja0430999. [DOI] [PubMed] [Google Scholar]
- 7.Backus MK, et al. Proteome-wide covalent ligand discovery in native biological systems. Nature. 2016;534:570–574. doi: 10.1038/nature18002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Joshi NS, Whitaker LR, Francis MB. A three-component Mannich-type reaction for selective tyrosine bioconjugation. J. Am. Chem. Soc. 2004;126:15942–15943. doi: 10.1021/ja0439017. [DOI] [PubMed] [Google Scholar]
- 9.Antos JM, Francis MB. Selective tryptophan modification with rhodium carbenoids in aqueous solution. J. Am. Chem. Soc. 2004;126:10256–10257. doi: 10.1021/ja047272c. [DOI] [PubMed] [Google Scholar]
- 10.Chen G, et al. Reactivity of functional groups on the protein surface: development of epoxide probes for protein labeling. J. Am. Chem. Soc. 2003;125:8130–8133. doi: 10.1021/ja034287m. [DOI] [PubMed] [Google Scholar]
- 11.Adusumalli SR, et al. Single-site labeling of native proteins enabled by a chemoselective and site-selective chemical technology. J. Am. Chem. Soc. 2018;140:15114–15123. doi: 10.1021/jacs.8b10490. [DOI] [PubMed] [Google Scholar]
- 12.Purushottam L, Adusumalli SR, Chilamari M, Rai V. Chemoselective and site-selective peptide and native protein modification enabled by aldehyde auto- oxidation. Chem. Commun. 2017;53:959–962. doi: 10.1039/C6CC09555K. [DOI] [PubMed] [Google Scholar]
- 13.Hacker MS, et al. Global profiling of lysine reactivity and ligandability in the human proteome. Nat. Chem. 2017;9:1181–1190. doi: 10.1038/nchem.2826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chilamari M, Purushottam L, Rai V. Site-selective labeling of native proteins by a multicomponent approach. Chem. Eur. J. 2017;23:3819–3823. doi: 10.1002/chem.201605938. [DOI] [PubMed] [Google Scholar]
- 15.Chilamari M, Kalra N, Shukla S, Rai V. Single-site labeling of lysine in proteins through a metal-free multicomponent approach. Chem. Commun. 2018;54:7302–7305. doi: 10.1039/C8CC03311K. [DOI] [PubMed] [Google Scholar]
- 16.Matos MJ, et al. Chemo- and regioselective lysine modification on native proteins. J. Am. Chem. Soc. 2018;140:4004–4017. doi: 10.1021/jacs.7b12874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Gilmore JM, Scheck RA, Esser-Kahn AP, Joshi NS, Francis MB. N-terminal protein modification through a biomimetic transamination reaction. Angew. Chem. Int. Ed. 2006;45:5307–5311. doi: 10.1002/anie.200600368. [DOI] [PubMed] [Google Scholar]
- 18.Singudas R, Adusumalli SR, Joshi PN, Rai V. A phthalimidation protocol that follows protein defined parameters. Chem. Commun. 2015;51:473–476. doi: 10.1039/C4CC08503E. [DOI] [PubMed] [Google Scholar]
- 19.Chen X, Muthoosamy K, Pfisterer A, Neumann B, Weil T. Site-selective lysine modification of native proteins and peptides via kinetically controlled labeling. Bioconjugate Chem. 2012;23:500–508. doi: 10.1021/bc200556n. [DOI] [PubMed] [Google Scholar]
- 20.Chan AO, et al. Modification of N-terminal α-amino groups of peptides and proteins using ketenes. J. Am. Chem. Soc. 2012;134:2589–2598. doi: 10.1021/ja208009r. [DOI] [PubMed] [Google Scholar]
- 21.O’ Harte FPM, et al. Structure, Antihyperglycemic activity and cellular actions of a novel diglycated human insulin. Peptides. 2000;21:1519–1526. doi: 10.1016/S0196-9781(00)00306-5. [DOI] [PubMed] [Google Scholar]
- 22.O’ Harte FPM, Højrup P, Barne CR, Flatt PR. Identification of the Site of Glycation of Human Insulin. Peptides. 1996;17:1323–1330. doi: 10.1016/S0196-9781(96)00231-8. [DOI] [PubMed] [Google Scholar]
- 23.Mei H, Yu C, Chan KK. NB1-C16-Insulin: Site-Specific. Synth., Purif. Biol. Act. Pharm. Res. 1999;16:1680–1686. doi: 10.1023/a:1018993629553. [DOI] [PubMed] [Google Scholar]
- 24.Busch, G. K. et al. Specific N-terminal protein labelling: use of FMDV 3Cpro protease and native chemical ligation. Chem. Commun. 3369–3371 (2008). [DOI] [PubMed]
- 25.Geoghegan KF, Stroh JG. Site-directed conjugation of nonpeptide groups to peptides and proteins via periodate oxidation of a 2-amino alcohol. application to modification at n-terminal serine. Bioconjugate Chem. 1992;3:138–146. doi: 10.1021/bc00014a008. [DOI] [PubMed] [Google Scholar]
- 26.Bandyopadhyay A, Cambray S, Gao J. Fast and selective labeling of N-terminal cysteines at neutral pH via thiazolidino boronate formation. Chem. Sci. 2016;7:4589. doi: 10.1039/C6SC00172F. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wagner AM, et al. N-terminal protein modification using simple aminoacyl transferase substrates. J. Am. Chem. Soc. 2011;133:15139–15147. doi: 10.1021/ja2055098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Williamson DJ, Fascione MA, Webb ME, Turnbull WB. Efficient N-terminal labeling of proteins by use of sortase. Angew. Chem. Int. Ed. 2012;51:9377–9380. doi: 10.1002/anie.201204538. [DOI] [PubMed] [Google Scholar]
- 29.Raj M, Wu H, Blosser SL, Vittoria MA, Arora PS. Aldehyde capture ligation for synthesis of native peptide bonds. J. Am. Chem. Soc. 2015;137:6932–6940. doi: 10.1021/jacs.5b03538. [DOI] [PubMed] [Google Scholar]
- 30.Spears RJ, Fascione MA. Site-selective incorporation and ligation of protein aldehydes. Org. Biomol. Chem. 2016;14:7622. doi: 10.1039/C6OB00778C. [DOI] [PubMed] [Google Scholar]
- 31.Chen D, Disotuar MM, Xiong X, Wang Y, Chou DH. Selective N-terminal functionalization of native peptides and proteins. Chem. Sci. 2017;8:2717–2722. doi: 10.1039/C6SC04744K. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.MacDonald JI, Munch HK, Moore T, Francis MB. One-step site-specific modification of native proteins with 2-pyridinecarboxyaldehydes. Nat. Chem. Biol. 2015;11:326–331. doi: 10.1038/nchembio.1792. [DOI] [PubMed] [Google Scholar]
- 33.Casi G, Huguenin-Dezot N, Zuberbühler K, Scheuermann J, Neri D. Site-specific traceless coupling of potent cytotoxic drugs to recombinant antibodies for pharmacodelivery. J. Am. Chem. Soc. 2012;134:5887–55892. doi: 10.1021/ja211589m. [DOI] [PubMed] [Google Scholar]
- 34.Bandyopadhyay A, Cambray S, Gao J. Fast and selective labeling of N-terminal Cysteines via Boronic acid-accelerated conjugation. Chem. Sci. 2016;7:4589–4593. doi: 10.1039/C6SC00172F. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Ren H, et al. A biocompatible condensation reaction for the labeling of terminal cysteine residues on proteins. Angew. Chem. Int. Ed. 2009;48:9658–9662. doi: 10.1002/anie.200903627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Waugh DS. An overview of enzymatic reagents for the removal of affinity tags. Protein Expr. Purif. 2011;80:283–293. doi: 10.1016/j.pep.2011.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Hirel Ph-H, Schmitter J-M, Dessen P, Fayat G, Blanquet S. Extent of N-terminal methionine excision from Escherichia coli proteins is governed by the side-chain length of the penultimate amino acid. Proc. Natl Acad. Sci. USA. 1989;86:8247–8251. doi: 10.1073/pnas.86.21.8247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Cordingley MG, Callahan PL, Sardana VV, Garsky VM, Colonno RJ. Substrate requirements of human rhinovirus 3C protease for peptide cleavage in vitro. J. Biol. Chem. 1990;265:9062–9065. [PubMed] [Google Scholar]
- 39.Mayer JP, Zhang F, DiMarchi RD. Insulin structure and function. Biopolymers. 2007;88:687–713. doi: 10.1002/bip.20734. [DOI] [PubMed] [Google Scholar]
- 40.Thibaudeau K, et al. Synthesis and evaluation of insulin-human serum albumin conjugates. Bioconjugate Chem. 2005;16:1000–1008. doi: 10.1021/bc050102k. [DOI] [PubMed] [Google Scholar]
- 41.Disotuar Maria M., Petersen Mark E., Nogueira Jason M., Kay Michael S., Chou Danny Hung-Chieh. Synthesis of hydrophobic insulin-based peptides using a helping hand strategy. Organic & Biomolecular Chemistry. 2019;17(7):1703–1708. doi: 10.1039/C8OB01212A. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wilkinson KA, Henley JM. Mechanisms, regulation and consequences of protein SUMOylation. Biochem J. 2010;428:133–145. doi: 10.1042/BJ20100158. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data supporting the findings of this study are available within the Article and its accompanying Supplementary Information file. The source data underlying Figs. 5a, 6b and Supplementary Figures 63b, 67, 68, 69, 72, 78a, 79c is provided in the Source Data file. Raw data for all the NMR spectra and HPLC-MS/ESI-MS/MALDI-ToF-MS can be accessed from the corresponding author upon reasonable request.