Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2012 Nov 9.
Published in final edited form as: J Am Chem Soc. 2011 Oct 19;133(44):17855–17868. doi: 10.1021/ja207220g

Nonenzymatic assembly of natural polyubiquitin chains of any linkage composition and isotopic labeling scheme

Carlos Castañeda a, Jia Liu b, Apurva Chaturvedi a, Urszula Nowicka a, T Ashton Cropp b, David Fushman a
PMCID: PMC3226840  NIHMSID: NIHMS330535  PMID: 21962295

Abstract

Polymeric chains made of a small protein ubiquitin act as molecular signals regulating a variety of cellular processes controlling essentially all aspects of eukaryotic biology. Uncovering the mechanisms that allow differently linked polyubiquitin chains to serve as distinct molecular signals requires the ability to make these chains with the native connectivity, defined length, linkage composition, and in sufficient quantities. This however has been a major impediment in the ubiquitin field. Here we present a robust, efficient, and widely accessible method for controlled iterative non-enzymatic assembly of polyubiquitin chains using recombinant ubiquitin monomers as the primary building blocks. This method uses silver-mediated condensation reaction between the C-terminal thioester of one ubiquitin and the ε-amine of a specific lysine on the other ubiquitin. We augment the non-enzymatic approaches developed recently by using removable orthogonal amine-protecting groups, Alloc and Boc. The use of bacterially expressed ubiquitins allows cost-effective isotopic enrichment of any individual monomer in the chain. We demonstrate that our method yields completely natural polyubiquitin chains (free of mutations and linked through native isopeptide bonds) of essentially any desired length, linkage composition, and isotopic labeling scheme, and in milligram quantities. Specifically, we successfully made Lys11-linked di-, tri-, and tetra-ubiquitins, Lys33-linked di-ubiquitin, and a mixed-linkage Lys33,Lys11-linked tri-ubiquitin. We also demonstrate the ability to obtain, by high-resolution NMR, residue-specific information on ubiquitin units at any desired position in such chains. This method opens up essentially endless possibilities for rigorous structural and functional studies of polyubiquitin signals.

Introduction

Polymeric chains composed of a small protein ubiquitin (Ub) function as molecular signals in numerous cellular processes in eukaryotes, including protein turnover, progression through the cell cycle, transcriptional activation, antigen processing, and vesicular trafficking of proteins (reviewed in 17). Ub monomers in these chains are usually linked via an isopeptide bond between the C-terminal G76 of one Ub (referred to here as the distal Ub) and the ε-amino group of one of seven lysines of the other Ub (proximal). This post-translational modification is performed and tightly controlled by a series of enzymes, and the regiospecificity of the isopeptide linkage is defined by various Ub-conjugating enzymes, E2s.8,9 The same chemistry is involved in the attachment of a monomeric Ub (monoUb) or a polyubiquitin (polyUb) chain (via its proximal Ub unit) to a lysine residue of the target protein.

The functional outcome of polyubiquitinaton of a target protein depends on the length of the polyUb tag and the lysine residue involved in the Ub-Ub linkage. For example, long (n≥4) polyUb chains linked via K48 serve as the principal signal targeting proteins for degradation by the 26S proteasome2,10,11, whereas K63-linked chains act as regulatory rather than proteolytic signals in a variety of non-degradative processes.1215 The biological roles and the recognition features of the polyUb chains linked through the other five lysines (K6, K11, K27, K29, and K33) or head-to-tail, as well as mixed-linkage or branched chains are poorly understood and are currently the focus of extensive research. A central question in ubiquitin biology concerns how the broad functional range of Ub signaling is achieved. Despite the wealth of information on the various processes controlled or regulated by polyubiquitination, the molecular mechanisms underlying the ability of different polyUb chains to act as distinct molecular signals for diverse cellular events remain poorly understood. It is believed that the specificity of the recognition signal carried by a particular polyUb chain is determined by the unique conformations that a particular chain can adopt, which in turn are dictated by the linkage type.16 For example, the conformational differences between K48-linked and K63-linked chains17,18 are responsible for the ability of K48-linked polyUb to bind K48-selective receptors in a “sandwich”-like mode 19, which contrasts the avid binding of K63-linked chains to their specific receptors.20

Studies of the relationship between the linkage, structure, and function of the polyUb signals require the ability to generate these chains with the native connectivity, controlled length, defined linkage and composition, and in sufficient (mg-scale) quantities. This, however, is a significant challenge, because lysine-specific E2 enzymes are not available for all linkages. The problem is further exacerbated for in vitro ubiquitination of target proteins, because of the need for substrate-specific Ub-ligases (E3) which are often not known. Until recently, controlled enzymatic assembly of polyUb chains of defined length required the introduction of chain-terminating mutations that inevitably resulted in polyUb chains having surrogate linkages or some lysines permanently replaced with other residues (e.g. Arg or Cys).2123 We have demonstrated24 that this problem can be addressed by using lysines with removable protecting groups (incorporated into Ub as genetically encoded unnatural amino acids), thus allowing controlled assembly of native polyUb chains of any desired length. Nevertheless, the lack of linkage-specific E2s and substrate-specific E3s remains a significant bottleneck.

These limitations have motivated several research groups to develop non-enzymatic methods to form the isopeptide bond. 2535 In particular, it has been demonstrated that by using total chemical synthesis combined with native isopeptide chemical ligation (through mercaptolysine residues) it is possible to make Ub2 chains 28,29 and even Ub4.33 While this is a remarkable accomplishment that opens new opportunities to build and characterize polyUb chains, these methods are limited in their broad implementation by the need to use total chemical synthesis, which is not readily available in every biochemical laboratory. Also, the need to desulfurize the final product, in order to obtain a native Gly-Lys isopeptide linkage, limits the applicability of various mercaptolysine-based approaches in cases of substrate proteins containing sulfur-bearing side chains, e.g. cysteines.

Even given access to such polyUb chains, another major challenge remains, namely the need to study these chains by high resolution structural methods. The weak noncovalent intra-chain interactions in polyUb complicate the use of crystallographic methods: the chains either evade crystallization or the resulting crystal structures might not represent the physiologically relevant conformations.16,17,36,37 This highlights the necessity to study polyUb chains in solution, and thus makes NMR the method of choice for such studies. Unfortunately, the homopolymeric nature of polyUb, combined with the chemical and spectroscopic similarity of Ub monomers, makes it almost impossible to resolve the NMR signals from the individual Ub units within the chain, required for monomer-specific characterization, unless the individual Ubs are differentially isotopically labeled (23,38, also Figure S1). The latter task, however, is essentially impractical for chemically synthesized chains due to the high cost of incorporating isotopic labeling into chemical synthesis. The need to circumvent the formidable cost of total peptide synthesis of isotope-labeled proteins motivated us to seek alternative ways of achieving non-enzymatic ubiquitination, which would (1) be affordable and accessible, (2) yield fully natural polyUb chains of any desired length and linkage, and (3) allow isotopic labeling of any selected Ub moiety in the polyUb chain. We believe that this can be achieved by taking advantage of accessible and cost-effective production of isotope-enriched proteins using bacterial expression.

Here we present a robust and cost-effective method for iterative non-enzymatic assembly of bacterially expressed Ub monomers into an all-natural polyUb chain (i.e. containing no mutations and linked through native isopeptide bonds) of any desired length and composition. This method uses a silver-mediated condensation reaction 39 between a thioester on the C-terminus of one Ub and the ε-amine of a specific lysine in the other Ub (or another target protein). Our strategy is based on incorporation (as an unnatural amino acid) of a lysine derivative bearing a removable protecting group (e.g., Boc-lysine (Lys(Boc))) at the desired position in the protein (for the isopeptide linkage) and blocking all other primary amines with an orthogonal protecting group, allyloxycarbonyl (Alloc), followed by removal of one or the other protecting group as necessary (Scheme 1). We were inspired by the recent work by Virdee et al.30 that utilized an analogous strategy to make K6- and K29-linked Ub2s. The novelty and advantages of the method presented here are (1) the use of the Alloc group which, unlike the carbobenzyloxy (Cbz) group used in 30, is entirely orthogonal to the Boc group and requires less harsh conditions for removal, and that our method allows (2) synthesis of polyUb chains of any length/composition and (3) isotopic labeling of any desired Ub monomer in the chain. Using this method, we successfully assembled K11-linked Ub2, Ub3, and Ub4, K33-linked Ub2, and mixed-linkage, K33,K11-linked Ub3 chains containing all natural isopeptide linkages. Moreover, by 15N enrichment of a specific Ub within the Ub2 and Ub3 chains, we were able, for the first time, to characterize, using high-resolution NMR, Ub units at any position in such chains.

Scheme 1.

Scheme 1

(a) Assembly of K11-linked Ub2 from recombinant Ub monomers. The two Ubs are shaded differently to illustrate the concept of unit-specific isotopic labeling. In particular, the Ub2 chain with an isotope-labeled proximal Ub (shaded gray) is assembled from the unlabeled Ub monomer and isotope-labeled K11Boc Ub. Similarly, Ub2 with a labeled distal Ub is made from an isotope-labeled Ub and unlabeled K11Boc Ub. The same concept applies to making Ub2 chains linked via other lysines (by incorporating Lys(Boc) at other Lys positions) and/or with both Ub units labeled but with different isotopes (2H, 15N, 13C or combination thereof) uniformly or using different residue- or group-specific labeling schemes. (b) Structure of Alloc and Boc protecting groups on a lysine residue.

Materials and Methods

Assembly of Ub chains

Below we describe in detail all steps and protocols involved in the non-enzymatic assembly of a Ub chain, using Ub2 as an example (Scheme 1). Iterative assembly of longer chains (Schemes 2, 3) from preassembled Ub2 or longer constructs involves similar procedures.

Scheme 2.

Scheme 2

Assembly of homogeneously K11-linked Ub3 from Ub monomer and K11-linked Ub2. To illustrate the concept of unit-specific isotope labeling, the middle Ub unit (highlighted in gray) in this chain is isotope-labeled, as in the current study.

Scheme 3.

Scheme 3

Schematics on the assembly of long Ub chains with either homogeneous or mixed linkages. Shown in (a,b) are two methods to form a homogeneously K11-linked Ub4 from either (a) two K11-linked dimers, or (b) a combination of a homogeneously K11-linked Ub3 and K11Boc Ub monomer. (c) The ability to make Ub chains with mixed linkages can be easily exploited with our present method. Here we illustrate the formation of a mixed K33,K11-linked Ub3 from a K33-linked Ub2 and K11Boc Ub monomer. The dotted line indicates the isopeptide bond that would be formed during the ligation reaction. Longer Ub chains can be assembled iteratively by using the same concepts as illustrated here.

Preparation of Lys(Boc)-containing Ub

Plasmids pTXB1, containing the E. coli codon-optimized Ub gene with the TAG mutation at residue position 11 or 33, and pSUP-PylT-PylS, containing cellular machinery to incorporate Lys(Boc) as a genetically encoded unnatural amino acid at the TAG codon, were double transformed into E. coli BL21(DE3) chemically competent cells as detailed elsewhere.24 15N-labeled wild type (WT) Ub and 15N-labeled Ub containing Lys(Boc) were expressed in E. coli using auto-inducing minimal media with 15NH4Cl as the sole source of nitrogen40 and purified as described.24

Generation of Ub-SR

Ub variants with the thioester group (Ub-SR) at the C-terminal G76 were generated from respective Ub monomers (unlabeled or 15N labeled) using the following procedure. A 1 mL solution of 5 mg/mL Ub was prepared in 20 mM sodium phosphate buffer (pH 8.0) with 10 mM ATP, 10 mM MgCl2, 100 mM sodium 2-mercaptoethane-sulphonate (MESNA), and 250 nM of Ub-activating enzyme E1. 31 The mixture was incubated at 37°C for 6 hours and then stored at 4°C. The E1 enzyme was precipitated by adding a drop of glacial acetic acid to the solution. The protein solution was buffer-exchanged into dH2O containing 0.4% TFA and subsequently lyophilized. Ub-SR generation was confirmed by ESI-MS (electrospray ionization mass spectrometry).

Alloc protection of Ub monomers

The Alloc protection reaction took place by initially dissolving 5 mg of each Ub monomer in 450 μL DMSO for 15–20 minutes on a platform rocker. To this solution, 17 μL of diisopropylethylamine (DIEA) and 75 μL of freshly made 40 mg/mL Alloc-OSu (from TCI America) solution in DMSO were added and allowed to react for at least 1 hour at room temperature on the platform rocker. Complete Alloc protection was monitored by performing ESI-MS on the sample. For each 100 μL of protein solution, the protein was precipitated into a small white pellet by mixing ~2 mL of ice-cold ether, vortexing for 15 seconds, and centrifuging for 10 minutes at 4°C. The top organic layer was removed after centrifugation and two more rounds of ether precipitation were conducted. The pellet was then air-dried for 15–20 minutes.

Boc deprotection of Ub containing Lys(Boc) at desired position for isopeptide linkage

The pellet (5 mg) from the previous step was dissolved in 500 μL of ice-cold 3:2 TFA/dH2O solution and left to react for at least 2–3 hours at 4°C.30 Complete dissolution occurred after 15 minutes; and can be aided by gently pipetting the solution. Removal of Boc group (molecular weight loss of 100 Da) was monitored by ESI-MS. For every 100 μL of solution, the protein was precipitated with three rounds of cold ether as described above. Typically the protein precipitated instantly after initial ether addition, and the resulting white pellet was air-dried for 15–20 minutes.

Ligation of Ub monomers

Approximately 2 mg of each of the two protected Ub monomers were dissolved in DMSO and then added together to a total volume of 90 μL as originally described.30 DIEA (4 μL), hydroxysuccinimide (H-OSu) (1 μL of fresh 390 mg/mL solution dissolved in DMSO), and AgNO3 (1 μL of fresh 57 mg/mL solution dissolved in DMSO) were added to the Ub monomers, mixed, and incubated at room temperature in the dark for at least 20–30 hours. Formation of ligated Ub2 species was confirmed by SDS-PAGE. The solution was subjected to three rounds of ether precipitation as described above and then air-dried. The pellet typically was slightly yellowish in color.

Global Alloc deprotection

Ub was deprotected using 50 mol % chloro-pentamethylcyclopentadienyl-cyclooctadiene-ruthenium(II) ([Cp*Ru(cod)Cl], from Sigma) and 50 equivalents of thiophenol, relative to the number of moles of protected amines in the solution (counting all lysine and histidine residues and the N-terminal amine). For example, for 4 mg of total Ub monomer and assuring slight excess over nine amine protection sites on each Ub molecule, the reaction pellet was dissolved in 420 μL of DMSO to which 240 μL of H2O, 116.2 μL of 20 mM [Cp*Ru(cod)Cl] (freshly dissolved in DMSO), and 23.8 μL of neat thiophenol were added. The resulting dark-brown/black solution was divided into 200 μL aliquots in PCR tubes. The reactions were incubated in a thermal cycler at 50°C for 2 hours. After this the tubes were allowed to cool for 10–15 minutes; the resulting solution was typically a dark orange cloudy solution with black precipitate.

Approximately 100 μL of each reaction tube were aliquoted into 2 mL eppendorf tubes for cold ether precipitation as described above. Typically, at least 10 rounds of ether precipitation were necessary to form a small dark-brown pellet in each of these eppendorf tubes. After each round of centrifugation, the top organic layer was very carefully removed, so as to not disturb the aqueous bottom layer. When cold ether was added, vigorous vortexing for 15 seconds was performed to ensure proper mixing of the solution. After pellet formation, the pellet was allowed to air dry for 10–15 minutes.

Renaturation and purification of the ligation product

The above protein solution from Alloc deprotection was initially dissolved in a total of 500 μL – 1 mL of filtered 6 M GdnHCl, 20 mM sodium phosphate (pH 6.8) solution for 10–15 minutes. The solution was centrifuged for 5 minutes to remove the undissolved precipitate. To this solution, 200 μL of 20 mM sodium phosphate (pH 6.8) buffer containing 130 mM NaCl were added step-wise for 10–15 minutes at a time, to a final volume of 10–15 mL. The clear solution was then transferred to 3K MWCO dialysis tubing and dialyzed overnight against 2 L of the same buffer at 4°C.

The next morning, the dialysate was concentrated to < 1 mL using Amicon Ultra-15 3K MWCO units and then purified on a Superdex 75 120 mL column on an AKTA FPLC system. The fractions pertaining to the desired Ub species were collected, concentrated, and exchanged into NMR buffer (20 mM sodium phosphate, pH 6.8). Purity was assessed to be > 99% based on SDS-PAGE gel analysis. Typically from an input of 5 mg of Ub monomers, 0.7 mg – 1 mg of Ub2 was obtained. To ensure that complete Alloc deprotection took place, a small protein aliquot (3–4 μL of a 100 μM sample) was analyzed by ESI-MS.

Ub3 assembly from Ub2 and Ub

To assemble K11-linked Ub3 with the middle Ub 15N-labeled, approximately 0.75 mg of K11-linked Ub2 (15N-labeled on the distal Ub containing K11Boc) was reacted with 1.25 mg of thioesterified WT Ub. Both proteins were protected with Alloc groups, and K11-linked Ub2 was treated with TFA to expose the ε-amine of K11 for the chemical condensation reaction. Immediately prior to the reaction, each protein was dissolved in 45 μL of DMSO. After mixing the proteins together, the remaining ingredients (DIEA, AgNO3, and H-OSu) were added. Alloc deprotection was performed assuming 3 mg of Ub monomer with ten Alloc-protected amines per Ub. This overestimation of the amount of Ub ensured complete Alloc deprotection. The reaction pellet was dissolved in 315 μL of DMSO to which 180 μL of H2O, 87.2 μL of 20 mM [Cp*Ru(cod)Cl] (freshly dissolved in DMSO), and 17.9 μL of neat thiophenol were added. Protein was precipitated with ether, renatured, and purified as described above. A small aliquot (5 μL of 40 μM protein) was analyzed by ESI-MS to confirm that all Alloc groups on K11-linked Ub3 were removed (Supporting Information). Total amount of purified Ub3 was determined to be 0.3 mg (15% yield).

To assemble K33,K11-linked Ub3 with the middle Ub 15N-labeled, 0.75 mg of K33-linked Ub2 (15N-labeled on the proximal Ub) was reacted with 3 mg (an unintentional excess amount) of K11Boc Ub. The C-terminus of K33-Ub2 was thioesterified using MESNA and E1 enzyme as described above; complete thioesterification was verified by ESI-MS. The protein was protected with Alloc groups as described above, assuming 2 mg of Ub, and checked by ESI-MS. K11Boc Ub was protected with Alloc groups and treated with TFA as described above. For the condensation reaction, each protein was dissolved in 45 μL of DMSO before mixing both proteins together. DIEA, AgNO3, and H-OSu were added in the same amounts as described above. Subsequent Alloc deprotection took place assuming a total of 5 mg of Ub monomer with ten Alloc-protected amines per Ub. The reaction pellet was dissolved in 525 μL of DMSO to which 300 μL of H2O, 145.3 μL of 20 mM [Cp*Ru(cod)Cl] (freshly dissolved in DMSO), and 29.8 μL of neat thiophenol were added. Protein was renatured and purified as detailed above. Total amount of K33,K11-linked Ub3 was 0.35 mg (9% yield). The apparent lower yield reflects the excess amount of unreacted K11Boc Ub present in the reaction.

NMR experiments

All NMR measurements were performed on Avance III 600 MHz spectrometer (Bruker Biospin) equipped with TXI cryoprobe using standard or in-house pulse sequences; the sample temperature was set to 23°C. The data were processed using NMRPipe41 and analyzed using Sparky42.

Mass spectrometry

For Ub monomers and some Ub2, high resolution mass spectra of m/z 250–2500 were acquired with JEOL AccuTOF-CS mass spectrometer in electrospray positive mode using flow injection. To determine the molecular weight, spectra were deconvoluted using MagTran software with the maximum charge set to 30. For dilute Ub2 and Ub3 samples, high resolution mass spectra of m/z 400–2000 (or 600–4000 for protein with alloc protection group) were acquired on Thermo Scientific LTQ Orbitrap XL mass spectrometer using flow injection. The resolution was 60,000 at m/z 400. Deconvolution was carried out using either the Xtract program in Xcalibur software or MagTran with a maximum charge set to 30.

Results

We devised a strategy for iterative non-enzymatic assembly of Ub chains of any desired length and linkage composition using bacterially expressed recombinant Ub monomers as primary building blocks. At the heart of this approach is the formation of an isopeptide bond through chemical condensation reaction39 between the thioesterified C-terminus of one Ub and a specific lysine residue of the other Ub (Scheme 1). The same reaction can be used for site-specific non-enzymatic (poly)ubiquitination of any other protein. To control the reaction and direct the linkage to a specific lysine, the latter is initially protected with the Boc group (introduced as a genetically encoded unnatural amino acid, Lys(Boc)24), while all other amines on both reactant proteins are protected with the Alloc groups. Subsequent removal of the Boc group from Lys(Boc) makes this lysine the only available amine to interact with the C-terminal thioester (SR) of another Ub to condense into an isopeptide bond. This step is conceptually similar to that in ref30, with the principal difference being the use of the Alloc protecting group instead of the Cbz group. The complete orthogonality of the Alloc and Boc groups (see below) allows their removal entirely independently from one another. This enables iterative Ub chain assembly (Schemes 2, 3) to construct all-natural Ub chains of any desired length and linkage, comprised of either homogeneous or mixed linkages (Scheme 3), and to have any Ub in the chain isotopically enriched at will (Schemes 1, 2). The basic steps in our iterative ubiquitination/chain-assembly procedure are as follows (Schemes 13):

  1. activation (by thioesterification) of the C-terminus of Ub (or a Ub chain) for the ligation with a target protein (e.g., another Ub or some other protein);

  2. incorporation of a lysine side chain bearing a removable protecting group (e.g., Lys(Boc)) as a genetically encoded unnatural amino acid at the desired position in the target protein;

  3. protection of all available amines (lysine and histidine side chains and the N-terminal amine) on both reactant proteins with a different, orthogonal protecting group (Alloc);

  4. removal of the Boc protecting group to expose the specific Lys side chain as the sole ligation site;

  5. ligation of the two proteins via chemical condensation reaction;

  6. complete removal of all Alloc groups;

  7. renaturation and purification of the desired product.

Optimization of the Alloc deprotection reaction

Central to the success of this method is complete Alloc deprotection. Recently it has been shown 43 that Alloc protecting groups can be efficiently removed using relatively mild conditions in neutral aqueous solution using chloro-pentamethylcyclopentadienyl-cyclooctadiene-ruthenium(II) ([Cp*Ru(cod)Cl], Ru catalyst) and thiophenol. However, application of these exact conditions to Ub protein, which can have up to nine Alloc groups attached to it simultaneously, failed to completely remove all Alloc groups. To determine the correct conditions for Alloc deprotection, we tested multiple experimental conditions on a 15N- labeled sample of WT Ub (Figure S2). After many rounds of optimization, we discovered the correct amounts of Ru catalyst, thiophenol, and water to use in the Alloc deprotection reaction. The most critical features to the success of the reaction are (i) near-stoichiometric amounts of Ru catalyst (50% mol) relative to the moles of amine present, (ii) at least 30% H2O (v/v), and (iii) temperature raised to at least 50°C. When any of these conditions were not met, ESI-MS analysis revealed that Ub species typically still contained a variable number of Alloc protecting groups covalently attached to Ub (Figure S2). For example, lowering the amount of ruthenium metal in the reaction did not yield full deprotection. Once the Alloc deprotection reaction began, typically the reaction reached completion after a maximum of two hours. Allowing reactions to proceed overnight did not change the ESI-MS results.

Thioesterification of the C-terminus of ubiquitin

To prepare the monomers for the ligation reaction (Scheme 1) the C-terminus of the (distal) Ub must be first activated with a thioester functional group. Traditionally, Ub-SR is made via MESNA-induced cleavage reaction of a Ub-intein-CBD (chitin binding domain) fusion construct. Unfortunately, this approach is slow (each cleavage reaction takes 24–40 hours), necessitates HPLC purification (removal of an unwanted byproduct of unactivated Ub), and produces Ub-SR in low yield (only 10 mg of pure Ub-SR from expression in 2 L of culture). A promising alternative, first introduced by Oualid et al. 31, generates Ub-SR by reacting wild type (WT) Ub with Ub-activating enzyme, E1, and MESNA. We found this approach very simple and efficient at generating Ub-SR. Complete thioesterification of Ub’s C-terminus occurs after 6 hours of treatment with the E1 enzyme and MESNA, resulting in addition of 125 Da (MES) to the molecular weight of Ub, as detected by ESI-MS (Figure 1a). Subsequent treatment with glacial acetic acid precipitates E1 and lowers the pH to maintain Ub-SR stability. Finally, buffer exchange removes other reaction components to yield a pure Ub-SR product. This method produces Ub-SR at high yield, since it is easy to express and purify WT Ub in large amounts (50–100 mg from 1 L culture) and react with E1 to make Ub-SR. Importantly, the same procedure works nicely for activating Lys(Boc) Ub as well as all preassembled polyUb chains that we tested (see below).

Figure 1.

Figure 1

ESI-MS spectra of the various steps in the non-enzymatic assembly of K11-linked Ub2. (a) Preparation of Ub for incorporation as the distal unit in K11-linked Ub2 15N- labeled on the proximal Ub. The molecular weight of wild-type Ub is 8564 Da (black). Reaction with E1 and MESNA adds a C-terminal thioester functional group to Ub, increasing its molecular weight by 125 Da to 8689 Da (blue), and fully converts all Ubs into Ub-SR. Alloc protection of Ub-SR (red) adds a total of nine Alloc groups (each Alloc protecting group is 84 Da). (b) Preparation of 15N K11Boc Ub for incorporation as the proximal Ub. The molecular weight of 15N K11Boc Ub relative to WT Ub is increased by 200 Da to 8765 Da (black) as a result of 15N isotopic enrichment (100 Da) and addition of the Boc protecting group (100 Da). Alloc protection (blue) adds eight Alloc groups to the protein, one less than for Ub-SR because of the Boc protection on residue K11. TFA treatment (red) removes only the Boc group on K11, reducing the molecular weight by 100 Da to 9336 Da. (c) After chemical condensation and complete Alloc deprotection, the expected molecular weight of purified K11-linked Ub2 is 17210 Da as shown, resulting from the sum of one unlabeled Ub (8564 Da) and one 15N-labeled Ub (8664 Da) and the loss of one water molecule from the K11 isopeptide linkage. See also Fig. S10.

Assembly of natural K11-linked Ub2

As a proof of principle, we first assembled natural K11-linked Ub2 15N-labeled on the proximal Ub. We chose this di-Ub because we can compare its properties to the enzymatically synthesized K11-linked Ub2 construct made using the K11-specific E2 Ube2s. K11 linkages could be as abundant as K48 and K63 linkages,44 and K11-linked polyUb chains appear to act both as regulatory and proteolytic signals.7,45 To assemble the K11-linked Ub2 non-enzymatically, we prepared two proteins, WT Ub and 15N K11Boc Ub. The ligation via chemical condensation reaction was carried out as outlined above and indicated in Scheme 1.

All of the unprotected amines (lysines, N-terminus, and the histidine30) on both the distal Ub-SR and the proximal 15N K11Boc Ub were protected with the Alloc group (molecular weight 84 Da). ESI-MS results confirmed that a total of nine Alloc groups were covalently attached to Ub-SR (Figure 1a). On 15N K11Boc Ub, generally only eight Alloc groups were attached, consistent with the fact that K11 still contains a Boc group (Figure 1b). The Alloc-protected 15N K11Boc Ub was then Boc-deprotected at residue 11 via TFA treatment (see Materials and Methods). ESI-MS confirmed a loss of 100 Da, attributed to the molecular weight of a single Boc group (Figure 1b). Importantly, the Boc group removal occurred without affecting the Alloc groups. Both Alloc-protected Ub-SR and Alloc-protected 15N Ub (bearing deprotected K11) were reacted together with AgNO3 and H-OSu to form an Alloc-protected K11-linked Ub2. Typically the reaction yields between 30–50% formation of Ub2 as monitored by SDS-PAGE (Figure 2a).

Figure 2.

Figure 2

Chemical assembly of K11-linked di- and tri-Ub chains. (a,c,e) Coomassie-stained 15% SDS PAGE gels of the chemical condensation reactions of (a) K11-linked Ub2 15N-labeled on the proximal Ub, (c) K11-linked Ub3 with the middle Ub unit 15N-labeled, and (e) K33,K11-linked Ub3 with the middle Ub unit 15N-labeled. K11-linked Ub2 (panel a) was assembled from Alloc-protected monomers of Ub-SR (Ub-SRA) and TFA-treated 15N-labeled K11Boc Ub (TFA K11B UbA). K11-linked Ub3 (panel c) was assembled from Alloc-protected Ub-SR and distal-15N-labeled K11-linked Ub2 (TFA K11B Ub2A) whose distal-K11 side chain was made available for ligation by TFA treatment. K33,K11-linked Ub3 (panel e) was assembled from Alloc-protected proximal-15N- labeled K33-linked Ub2 (K33-Ub2-SR) and TFA-treated K11Boc Ub (TFA K11B UbA). Formation of Ub2 or Ub3 is observed after 20 hours of reacting the proteins with AgNO3, H-OSu, and DIEA. 0.3 – 0.4 μL are loaded from the reaction directly onto gel. Notations used here: “TFA” indicates that the corresponding protein was treated with TFA to remove the Boc group in order to make the corresponding Lys available for the ligation reaction, while the superscript “A” indicates that the protein is Alloc-protected.

(b,d,f) Size-exclusion chromatograms of (b) K11-linked Ub2, (d) homogeneously K11-linked Ub3, and (f) mixed-linkage K33,K11-linked Ub3 performed after Alloc deprotection and protein renaturation to purify the desired product from the unreacted components. The ability to separate Ub2 or Ub3 products from unreacted species is illustrated in Fig. S8.

After Alloc deprotection followed by renaturation, the desired Ub2 product was purified from the unreacted monomers using size exclusion chromatography (Figure 2b), concentrated, and buffer exchanged into the appropriate NMR buffer. The complete removal of all Alloc protecting groups from the purified Ub2 was confirmed by ESI-MS (Figure 1c). As expected, the observed molecular weight of the all-natural Ub2 with uniform 15N labeling on the proximal Ub was 17210 Da.

Using Scheme 1 it is also straightforward to assemble a K11-linked Ub2 15N-labeled on the distal Ub. This can be accomplished by either (i) 15N labeling the (distal) WT Ub rather than K11Boc Ub in Scheme 1, or (ii) using 15N K11Boc Ub for the distal unit and unlabeled K11Boc Ub for the proximal. Note that using K11Boc Ub as the distal unit is essential for assembling longer chains by elongation on the distal end (see below). Therefore, as proof of principle, we prepared K11-linked Ub2 15N-labeled on the distal Ub in this manner. All of the steps of Scheme 1 were followed as is, and verified by ESI-MS (Figure 3). Most importantly, Alloc deprotection did not remove the remaining Boc group on the distal Ub (Figure 3c, also Figure S3). Together with the results in Figure 1b, this demonstrates the complete orthogonality of the Boc and Alloc protecting groups. Only after purification of the K11-linked Ub2, the protein was treated with 3% TFA for 4–6 hours to remove the remaining Boc group on the distal Ub. A significant advantage of this step is that it can be completed without the need to fully denature the product (Ub2).

Figure 3.

Figure 3

ESI-MS spectra of various Ub components in the assembly of K11-linked Ub2 with the distal Ub unit both 15N-labeled and K11Boc-protected. (a) Preparation of 15N K11Boc Ub for incorporation as the distal unit in distal-15N- labeled K11-linked Ub2. The molecular weight of 15N K11Boc is 8764 Da (black). Reaction with E1 and MESNA adds a C-terminal thioester functional group to the Ub protein, increasing its molecular weight by 125 Da to 8889 Da (blue), and fully converts all Ubs into 15N-K11Boc-SR. Alloc protection of 15N-K11Boc-Ub-SR (red) adds a total of eight Alloc groups (each Alloc protecting group is 84 Da). (b) Preparation of K11Boc Ub for incorporation as the proximal Ub. The molecular weight of K11Boc Ub is 8665 Da (black) as a result of addition of the Boc protecting group (100 Da). Alloc protection (blue) adds eight Alloc groups to the protein, similar to the distal 15N-K11Boc-SR Ub protein. TFA treatment (red) removes only the Boc group on K11, reducing the molecular weight by 100 Da to 9236 Da. (c) After chemical condensation and complete Alloc deprotection, the expected molecular weight of the purified distal-15N-labeled and distal-K11Boc K11-linked Ub2 is 17310 Da as shown, resulting from the sum of one Ub (8564 Da) and one 15N-labeled K11Boc Ub (8764 Da) and the loss of one water molecule from the K11 isopeptide linkage.

Unit-specific characterization of natural K11-linked Ub2

NMR is exquisitely sensitive to nuances in the chemical environment of the nucleus under observation that are not detectable by other methods. In the 1H-15N TROSY-HSQC46 and SOFAST-HMQC47 experiments used here, primarily signals from the backbone 1H-15N amide bonds are recorded, thus providing a characteristic spectral ‘fingerprint’ of a protein. To examine the resulting Ub2 by NMR, we recorded 1H-15N TROSY spectra of the K11-linked Ub2 chains 15N-labeled on the distal or the proximal Ub, assembled as detailed above, and overlaid them separately on 1H-15N TROSY spectra of WT monoUb (Figure 4a,b). The NMR spectra of both Ub units in the chemically ligated K11-linked Ub2 revealed an excellent spectral dispersion of NMR resonances indicative of a fully renatured Ub2. No minor peaks were observed, indicating that the chemically ligated K11-linked Ub2 is free of contaminating species. On the distal Ub, the largest signal shifts compared to monoUb were observed for the C-terminal residues G75 and G76, which is consistent with the formation of an isopeptide linkage involving the C-terminus of the distal Ub. The K11 backbone amide signal is not present in the distal-Ub spectrum because Lys(Boc) at residue 11 in this Ub was introduced as an unlabeled (naturally abundant, 14N) amino acid, which is “invisible” in the 15N NMR spectra. Likewise, the K11 backbone amide signal is absent also in the spectrum of the 15N-labeled proximal Ub (Figure 4b). Most importantly, the G76-K11 isopeptide signal is also absent (Figure S4), and there are no unaccounted for signals in the spectrum. Taken together, these observations indicate that the resulting Ub2 species is comprised of a single, K11-specific isopeptide linkage. Moreover, the overall similarity of the NMR spectra of both Ub units with the spectra of monoUb and the relatively small magnitudes of the signal shifts (except for the C-terminus of the distal Ub) strongly suggest that the structure of each Ub unit in the chemically assembled K11-linked Ub2 is intact.

Figure 4.

Figure 4

Overlay of 1H-15N TROSY-HSQC spectra of (a) the distal Ub and (b) the proximal Ub in all-natural K11-linked Ub2 (red) and of WT Ub (blue). The spectral differences between K11-linked di-Ub and mono-Ub, quantified as amide chemical shift perturbations (CSPs), are plotted as a function of the residue number for (c,e) distal Ub and (d,f) proximal Ub in (c,d) all-natural K11-linked Ub2 and (e,f) enzymatically-synthesized K11-linked Ub2. The CSPs were calculated as Δδ = [(ΔδH)2 + (ΔδN/5)2]1/2, where ΔδH and ΔδN are chemical shift differences for 1H and 15N, respectively. Residues with significant CSPs are indicated on the spectra in panels a and b. Note that the absence of the K11 backbone amide signal in the spectra of K11-linked Ub2 (panels a and b) serves as a direct confirmation of the incorporation of the unlabeled Lys at residue 11.

To validate that the chemically ligated natural K11-linked Ub2 had the same structural/conformational properties as the enzymatically assembled chain, we compared the 1H-15N NMR spectra of this K11-linked Ub2 with those of the K11-linked Ub2 made using K11-specific E2 enzyme, Ube2s. Unfortunately, the enzymatically assembled Ub2 contained chain-terminating mutations (K11R and K63R on the distal Ub and K63R and D77 on the proximal Ub), complicating direct comparison of the spectra of the two Ub2 chains. Therefore, we compared NMR spectra for each Ub unit in these chains with its respective unconjugated Ub monomer. Residue-specific chemical shift perturbations (CSPs) were quantified and plotted in Figures 4c–f. The remarkable similarity between the CSPs in the chemically ligated (Figure 4c,d) and enzymatically assembled (Figure 4e,f) K11-linked Ub2s, serves as a strong indicator that the two chains have identical structural and conformational properties.

Notably, the two Ub units in K11-linked Ub2 show strikingly different CSP patterns, namely, significant CSPs are spread throughout the proximal Ub but almost absent in the distal Ub (except for the ligated C-terminus). This is in contrast with the CSP patterns observed in K48-linked17 and K63-linked18 Ub2s, raising the question about the nature of the CSPs detected in the proximal Ub: do these reflect some noncovalent interdomain contacts in K11-linked Ub2 or merely are caused by the chemical (isopeptide) modification of K11? A close inspection shows that the residues with significant CSPs in the proximal Ub cluster around K11 in the 3D structure of Ub, thus suggesting that these perturbations could be due to changes in the microenvironment of K11 upon the isopeptide linkage formation. To test this hypothesis, we measured the CSPs in K11Boc Ub vs. WT Ub (Figure S5a), where the only difference between the two proteins is the presence of the Boc protecting group on the side chain of K11. Indeed, the results show a remarkable overall similarity between the CSP patterns in K11Boc Ub (vs. WT Ub, Figure S5a) and in the proximal Ub of K11-linked Ub2 (Figure 4d,f). This strongly suggests that the spectral perturbations observed in the latter are primarily caused by the chemical modification of the side chain of K11 resulting from the isopeptide linkage. Further structural studies by solution NMR (currently underway) are required to fully address this issue.

Assembly and characterization of K33-linked Ub2

To demonstrate that our method is not exclusive to K11 linkages, we also assembled, for the first time, all-natural K33-linked Ub2 15N-labeled on the proximal Ub. K33-linked polyUb chains have been found to play a non-proteolytic, regulatory role in T-cell signaling.48 We made this chain using the same approach as in Scheme 1, but substituting K11Boc Ub with K33Boc Ub. The ESI-MS data (Figure S6) confirm that the purified K33-linked Ub2 has the expected molecular weight of 17210 Da, assuming one Ub is 15N-labeled and the other is not.

NMR characterization of the proximal Ub in K33-linked Ub2 revealed an excellent spectral dispersion in both 15N and 1H dimensions (Figure 5), strongly indicating that this Ub unit is well folded. Overall, the 1H-15N spectrum of the proximal Ub unit is very similar to that of monoUb (Figure 5a,c), and the relatively small magnitude of the observed CSPs (Figure 5c) suggests that structurally this Ub unit is essentially intact. As expected, the 1H-15N signal of K33 is absent, because this residue was introduced as an unlabeled Boc-containing amino acid. Note that the CSP pattern for the proximal Ub in this chain is distinct from that in the K11-linked Ub2 (Figure 4d). Interestingly, in K33-linked Ub2, the CSPs are quite small for the hydrophobic patch residues L8, I44, and V70, and instead, cluster around residues 12–17 and 26–35 located close in space or adjacent to K33. This suggests that the majority of the spectral perturbations observed in this Ub unit are a consequence of the chemical modification of K33 by the isopeptide linkage. To further support this conclusion, we overlaid the spectra of K33Boc Ub and WT Ub (Figure 4b) and calculated the corresponding CSPs (Figure 5d). Indeed, the pattern of spectral perturbations for K33Boc Ub is overall very similar to that for the proximal Ub in K33-linked Ub2, with the exception of few residues around V70 in K33Boc Ub.

Figure 5.

Figure 5

(a) Overlay of 1H-15N TROSY spectra of the proximal Ub in K33-linked Ub2 (red) and WT monoUb (blue). (b) Overlay of 1H-15N TROSY spectra of K33Boc Ub (red) and WT Ub (blue). (c,d) The spectral differences between residues in the (c) proximal Ub of K33-linked Ub2 and WT Ub and in (d) K33Boc Ub and WT Ub are quantified as amide chemical shift perturbations (CSPs). Residues with significant CSPs are indicated on the spectra in panels a and b. The absence of the K33 backbone amide signal in the spectra of K33-linked Ub2 and K33Boc Ub (panels a and b, respectively) serves as a direct confirmation of the incorporation of the unlabeled Lys at residue 33.

Assembly of longer Ub chains

The power of our approach is in its ability to easily extend from producing natural di-Ubs to natural polyUb chains of any desired length, either homogeneously linked or comprised of mixed linkages. In Scheme 2, we devised an iterative approach that allowed us to assemble, for the first time, an all-natural Ub3 chain comprised of only K11 linkages. Moreover, the middle Ub unit in this chain was enriched with 15N to enable its characterization by heteronuclear NMR. Furthermore, to demonstrate the ability to make even longer chains, we also assembled a K11-linked Ub4 chain (see below).

As the preparatory step for the construction of K11-linked Ub3, we assembled and purified K11-linked Ub2 whose distal Ub both was 15N-labeled and still contained the Boc group on K11 (Scheme 2). Figure S5b demonstrates the ability to successfully renature and examine by NMR the K11-linked Ub2, with the Boc group attached to the distal Ub. Treating an aliquot of this Ub2 with 3% TFA removed the Boc group without denaturing the protein and yielding a completely natural K11-linked Ub2 15N labeled on the distal Ub (Figure 4a). This illustrates that our iterative chain assembly procedure allows characterization of any intermediate species in the process of forming longer chains. For example, as shown in Figure S5, NMR can be used to detect the presence of a Boc group on 15N-labeled Ub species. This provides a useful tool for monitoring and understanding the effects of Ub chain elongation on the structure and intra-chain interactions of individual Ub units and on the conformational properties of the polyUb chain.

As illustrated in Scheme 2, the assembly of K11-linked Ub3 involved the use of two proteins, Ub-SR and K11-linked Ub2 containing 15N-labeled K11Boc Ub as the distal unit. First, Ub-SR was prepared from WT Ub using E1 and MESNA as described above. Second, Alloc protection was performed on both Ub-SR and K11-linked Ub2. Subsequent TFA treatment of the Ub2 unmasked K11 on the distal Ub, enabling the formation of an isopeptide linkage with the C-terminus of Ub-SR. Ligation of the two proteins was performed as described above, and after 16–20 hours formation of Ub3 was confirmed using SDS-PAGE (Figure 2c). Following Alloc deprotection, the Ub3 was renatured and purified. The size exclusion chromatogram in Figure 2d revealed that the desired Ub3 species can be isolated from the unreacted monoUb and Ub2 components. Finally, ESI-MS of the purified Ub3 product confirmed that it indeed has the correct molecular weight (25758 Da) expected for all-natural Ub3 with one Ub 15N-labeled (Figure S7). It should be mentioned here, for completeness, that the Ub3 chain could also be assembled by reacting K11-linked Ub2-SR with monoUb, as exemplified in Scheme 3c for a mixed-linkage chain.

Unit-specific characterization of K11-linked Ub3

Isotopic labeling of the middle Ub in K11-linked Ub3 allowed us to selectively examine by NMR this specific Ub unit in the chain (Figure 6). The excellent spectral dispersion of both 1H and 15N resonances of the backbone amides clearly indicates that the middle Ub is fully folded despite being tethered to two other Ub units in the chain. The 1H transverse relaxation time (T2) of amide protons in K11-linked Ub3 was 19 ms, consistent with the 1H T2 expected of a protein of this molecular weight, and between the T2 values of 25 ms measured for its precursor, K11-linked Ub2, as well as K48-linked Ub217 and 13 ms for K48-linked Ub4.17 Also the average 15N T1 of 870 ms measured for the middle Ub is consistent with the size of the resulting chain.19

Figure 6.

Figure 6

Overlay of 1H-15N TROSY spectra of the middle Ub in all-natural homogeneously K11-linked Ub3 (red) and in blue, (a) WT Ub, (b) the distal Ub of K11-linked Ub2, and (c) the proximal Ub of K11-linked Ub2. The spectral differences between residues in the middle Ub unit of K11-linked Ub3 and (d) WT Ub, (e) distal Ub of K11-linked Ub2, and (f) proximal Ub of K11-linked Ub2 are quantified as amide chemical shift perturbations (CSPs). Residues with significant CSPs are indicated on the spectra in panels a–c.

The observed spectral differences between the middle Ub in K11-linked Ub3 and monoUb are a composite of the perturbations observed above for both distal and proximal Ubs upon formation of K11-linked Ub2 (Figure 6d). This is not unexpected given that the middle Ub unit is both proximal and distal with regard to the two Ub units flanking it. To deconvolute the observed spectral perturbations in the middle Ub, we calculated the corresponding CSPs versus the distal and proximal Ubs of natural K11-linked Ub2 (Figures 6e and 6f, respectively). The CSPs versus the distal Ub of K11-linked Ub2 (Figure 6e) are remarkably similar to the CSPs in the proximal Ub (versus monoUb) upon formation of the K11-linked Ub2, shown in Figure 4d. Also the directions of the signal shifts (Figure 6b) are identical to those in Figure 4b. The remarkable similarity of the CSP patterns in Figures 4d and 6e strongly indicates that (i) the K11 isopeptide linkage is formed between the middle Ub and the distal Ub in K11-linked Ub3, and (ii) no additional Ub/Ub interface (compared to that in K11-linked Ub2) exists between the two Ub units in this chain.

Likewise, the CSPs in the middle Ub versus the proximal Ub of K11-linked Ub2 (Figure 6f) are similar to those observed for the distal Ub (versus monoUb) upon its incorporation into K11-linked Ub2 (Figure 4c). The strongest CSPs are observed for the C-terminal residues G75 and G76. Such perturbations are typically the hallmark of G76’s involvement in an isopeptide bond.17,18,23 Thus, this observation verifies that the C-terminus of the middle Ub in K11-linked Ub3 is connected via an isopeptide bond to the proximal Ub. Interestingly, the magnitude of the CSPs for the rest of the residues in the middle Ub is somewhat higher than in the distal Ub of K11-linked Ub2 (Figure 4c). Here the elevated CSPs cluster around residues 7–15, 40–50, and 67–70, which make up part of the hydrophobic surface patch in Ub. It is conceivable that there is an increase in Ub/Ub interfacial contacts between the middle Ub and the proximal Ub in K11-linked Ub3 compared to K11-linked Ub2. This might indicate an onset of additional inter-Ub contacts as the chain gets longer. However, it is important to bear in mind that the magnitudes of these CSPs are quite small overall, especially compared to those observed in K48-linked Ub217, suggesting that the Ub/Ub interfaces that form in K11-linked Ub3 are transient.

Assembly and characterization of mixed-linkage Ub3

To demonstrate the versatility of our method, we also assembled, for the first time, a mixed-linkage Ub3 chain comprised of K33- and K11-linkages, as outlined in Scheme 3c. For this we used K33-linked Ub2 (assembled and characterized above, with the proximal Ub 15N labeled), and K11Boc Ub. The Ub2 was treated with E1 and MESNA to activate (thioesterify) its C-terminus and then subjected to Alloc protection of all available seventeen amines (see an example in Fig. S11). K11Boc Ub underwent Alloc protection of all available amines followed by the removal of the Boc group to expose K11 as the sole lysine side chain for the ligation reaction. Ligation of the two proteins was performed as described above, and after 16–20 hours the formation of Ub3 was confirmed by SDS-PAGE (Figure 2e). Following Alloc deprotection, the Ub3 was renatured and purified (Figure 2f). The correct size (number of Ub units) of the assembled Ub3 chain was independently confirmed by NMR relaxation measurements, which revealed a decrease in the 1H T2 value from 25 ms for the precursor K33-linked Ub2 to 18.5 ms for the final Ub3 product, similar to that for K11-linked Ub3 (see above).

The 1H-15N NMR spectra of the middle Ub in the mixed-linkage K33,K11 Ub3 chain are shown in Figure 7. As with the K11-linked Ub3 (see above), the spread of the NMR signals and the strong similarity with the spectra of monoUb indicate that the structure of this Ub unit is essentially intact. Similar to the middle Ub in K11-linked Ub3, the spectral perturbations in the middle Ub of K33,K11-linked Ub3 are a composition of the perturbations in the respective Ub2 constructs, reflecting the role of the middle Ub as both the distal and the proximal Ub with respect to its neighbors in the chain.

Figure 7.

Figure 7

Overlay of 1H-15N SOFAST spectra of the middle Ub in all-natural mixed K33,K11-linked Ub3 (red) and in blue, (a) WT Ub, (b) the proximal Ub of K33-linked Ub2. The spectral differences between residues in the middle Ub unit of K33,K11-linked Ub3 and (c) WT monoUb and (d) proximal Ub of K33-linked Ub2 are quantified as amide chemical shift perturbations (CSPs). Residues with significant CSPs are indicated on the spectra in panels a and b.

Assembly of K11-linked tetra-Ub

As proof of principle, we also assembled K11-linked Ub4 using as reaction components the above-characterized homogeneously K11-linked Ub3 and K11Boc Ub. Following Scheme 3b, we first treated K11-linked Ub3 with E1 enzyme to generate a thioester at its C-terminus, and subsequently Alloc-protected all the amines. As detailed above, we used K11Boc Ub as the proximal Ub and protected all the amines with Alloc groups prior to TFA treatment that exposed the ε-amine of K11 for the ligation reaction. The two proteins were mixed and ligated for 20 hours, and the formation of a Ub4 product was confirmed by SDS-PAGE (Figure 8). Combined with the data shown above, this result demonstrates the ability to assemble Ub chain of any desired length using the method proposed here.

Figure 8.

Figure 8

Coomassie-stained urea SDS-PAGE gel demonstrating the formation of homogeneously K11-linked Ub4 via a 20-hour incubation of Alloc-protected middle-15N-labeled K11-linked Ub3-SR and TFA-treated, Alloc-protected K11Boc Ub monomer (TFA K11Boc UbA). The contaminating Ub2 band from an impure sample of K11Boc is inert as it is visible both before and after the ligation reaction. The sample after ligation was loaded at higher concentration to emphasize the formation of Ub4.

Discussion

Deciphering the “ubiquitin code” requires understanding of the linkage-structure-function relationship for various polyUb signals. Structural and functional studies of polyUb chains necessitate the ability to generate natural Ub chains of controlled length, linkage composition, and isotopic labeling (for studies in their native milieu). With the methods devised in this paper, we have demonstrated, for the first time, the ability to make completely natural Ub chains of any desired length and linkage composition, and at the same time to isotopically label any Ub in the resulting chain. The latter renders such chemically assembled chains amenable for unit-specific studies by NMR and any other biophysical technique where isotopic labeling is critical, e.g., for contrast variation in small angle neutron scattering (SANS). Our method can also be used to incorporate modified Ub variants at any specific location in the chain, to study the effect of monomer/site-directed mutations on inter-subunit interactions, chain structure, and receptor recognition, and to enable monomer-selective labeling with various physical probes (e.g. fluorophores, spin-labels, paramagnetic ions) for structural and binding studies by fluorescence/FRET, ESR, NMR etc.

Our approach builds upon several recent significant breakthroughs in enzyme-free ubiquitination. First, our method was inspired by the successful demonstration by Virdee et al.30 that natural di-Ub chains can be synthesized via the GOPAL (genetically encoded orthogonal protection and activated ligation) approach. We use the same basic concept, but replace the Cbz protecting group with the Alloc group.

Second, our method was facilitated by the recent finding by several research groups 43,49 of a relatively mild set of conditions for removal of the Alloc protecting group from a target lysine on a protein. As we showed here (Figure S3), the use of Ru catalyst together with thiophenol in a partially aqueous pH-neutral solution allows complete Alloc deprotection of multiple amines in a protein without affecting the Boc protecting group. This is in contrast to the Cbz group used in ref 30, whose removal with a strong acid cocktail (TFMSA, TFA) also removes the Boc group. Moreover, Boc deprotection can be achieved without affecting the Alloc protecting groups (Figure 3b). This demonstrates a complete, bi-directional orthogonality of the Boc and Alloc groups, i.e. each group can be removed independently of the other. This feature is absolutely essential for the iterative assembly of Ub chains longer than Ub2. Also critical is the fact that Alloc protection does not interfere with the thioester-activated C-terminus of Ub (Figures 1a, 3a). Together with the ability to activate the C-terminus of an existing chain, this feature is essential for chain elongation on the proximal end. It is worth mentioning here that the milder, Alloc removal conditions proved harmless to Ub chains and have potentially important utility for applying the same strategy to ubiquitinate other proteins which could be less stable than Ub in acidic conditions.

Third, the recent use of the E1 enzyme combined with MESNA by Ouilad et al.31 to activate the C-terminus of Ub with a thioester functional group had profound implications for our iterative assembly of longer Ub chains. Indeed, as demonstrated above, the same method can be used to efficiently activate the C-terminal G76 of various Ub chains studied here: K11-linked Ub2 and Ub3 and K33-linked Ub2, and is likely to work with essentially any polyUb chain. By ESI-MS we verified (see an example in Fig. S11) that we can activate the C-terminus of a Ub2 with E1 and MESNA with high yield and with no need to purify the product, as it is completely converted into thioester-activated Ub2.

As demonstrated here on multiple examples, our method of non-enzymatic chain assembly allows isotopic labeling of any Ub unit at will and in any type of Ub chain. Consider, for example, a tri-Ub chain. Here we successfully achieved isotopic labeling of the middle Ub in two different Ub3 chains, homogeneously K11-linked and mixed-linkage, K33,K11-Ub3. In order to label the distal Ub in Ub3, one can use an isotope-labeled Ub-SR in Scheme 2 and an unlabeled K11-linked Ub2. Similarly, to obtain a K11-linked Ub3 isotope-labeled on the proximal Ub, one can simply substitute the K11-linked Ub2 in Scheme 2 with a similar Ub2 isotope-labeled on the proximal Ub. Moreover, this method is not limited to labeling a single Ub unit in the chain: the use of recombinant Ub monomers enables a “mix-and-match” approach, where several (or all) Ub units could be isotopically labeled, each using a different labeling scheme.

The ability to isotopically label individual Ub units in the chain is absolutely critical for unit-specific structural and functional studies of polyUb in solution (see Fig. S1) Thus, selective isotopic labeling allowed us to obtain, for the first time, residue-specific information on the middle Ub unit in a Ub3 chain. An important observation here is that the spectral perturbations in the middle Ub are a composition of the spectral perturbations in both the distal and proximal Ubs upon formation of the corresponding Ub2 fragments. Moreover, a detailed analysis revealed that the non-covalent Ub/Ub contacts in K11-linked Ub3 and in K33,K11-linked Ub3 are relatively weak or transient, such that the observed spectral perturbations in the middle Ub largely reflect alterations in the electronic environment caused by the formation of isopeptide linkages.

Our methodology can be easily extended to iteratively assemble completely natural polyUb chains of any length and linkage composition (as exemplified in Scheme 3) and, if required, with any desired composition of isotopic enrichments of the individual Ub units. For example, a Ub4 comprised only of K11 linkages can be built by either reacting two K11-linked Ub2 chains or a K11-linked Ub3 with monoUb (as demonstrated in Figure 8).

Overall, the proposed method of non-enzymatic Ub chain assembly using thioester-based, silver-mediated chemical condensation combined with Alloc/Boc-based protection/deprotection described here is extremely robust: (1) the method is relatively simple and straightforward, with the most time-intensive step being the ether precipitation of the proteins from the Alloc deprotection reaction, (2) all Ub chains assembled so far (including Ub3s) have been denatured and refolded successfully, and (3) the K11-linked Ub2 chains assembled using our method possess the same spectral and structural properties as the Ub2s produced enzymatically. The latter is an important control as we begin to make chains consisting of non-canonical linkages (K6, K27, K29, K33) for which no linkage-specific E2 enzymes are known. In summary, this makes our method particularly well suited for synthesis of any Ub chain containing homogeneous or mixed linkages, and potentially for E2- and E3-free ubiquitination of other (target) proteins with these chains.

Ultimately, the goal is to be able to attach Ub chains of any length and linkage composition to any target protein (with no need for specific E2 and E3 enzymes) to begin to understand the biological impact of ubiquitination. Such studies have been hindered by the unavailability of substrate-specific Ub ligases (E3). Our non-enzymatic ubiquitination method can be extended to enable in vitro construction of a fully natural ubiquitinated substrate. For example, the reaction shown in Scheme 1 (or Scheme 3b,c) can be used to attach a Ub-SR (or a pre-assembled polyUb-SR chain) to a recombinant target protein bearing Lys(Boc) at the desired ubiquitination site. The latter can be bacterially expressed and purified using the same protocol as for Lys(Boc)-Ub (24,50, see also Materials and Methods). Alternatively, polyUb-SR chains pre-assembled using our method can be attached through native chemical ligation to a mercaptolysine on a substrate of interest obtained using recently developed techniques for site-specific incorporation of a mercaptolysine into a target protein using chemical synthesis 28,29 or bacterial expression 34. This provides essentially endless possibilities for non-enzymatic ubiquitination of virtually any substrate protein that can be refolded or can withstand TFA or desulfurization treatment, which will open new, previously unavailable opportunities for structural and functional studies of the outcome of ubiquitination.

Conclusions

The dearth of native polyUb chains of any defined length, linkage composition, and isotopic labeling has been a major hindrance to the ubiquitin field. To address this challenge, we developed an affordable and widely accessible method for controlled, iterative non-enzymatic assembly of completely natural polyUb chains using recombinant monomers as the primary building blocks. Moreover, the use of bacterially expressed Ub monomers allows cost-effective isotopic enrichment of any individual Ub unit in the chain using various isotopic labeling schemes (uniform or residue/group selective) already developed for biomolecular NMR applications (e.g.,51). This opens up broad opportunities to study polyUb chains by high-resolution NMR, SANS, and other biophysical and biochemical methods, to obtain unit-specific atomic-resolution information on the structure, intra-chain interactions, and receptor-recognition properties of the polyUb signal. Using this method, we synthesized homogeneously K11-linked Ub2, Ub3, and Ub4, and K33-linked Ub2, as well as a mixed-linkage K33,K11-linked Ub3. These Ub3 and Ub4 chains have never been assembled before. This allowed us to characterize by NMR, for the first time, the middle Ub unit in homogeneously K11-linked Ub3 and in K33,K11-linked Ub3, as well as the proximal Ub in K33-linked Ub2. With this chain-assembly method in hand, it is now possible to generate and study essentially any polyUb chain, both homogeneously linked and with mixed linkages, in order to uncover the structure and receptor recognition of the polyUb signal and its processing by linkage-specific deubiquitinases. Importantly, the assembled chains contain natural isopeptide linkages and can be made from wild type Ub monomers, with no need for permanent mutations, thus yielding completely natural Ub chains of essentially any length and in milligram (or larger) quantities. Our method can be extended to (poly)ubiquitinate a target protein, with no need for specific E2 and E3 enzymes, and could also be used to form an isopeptide bond between virtually any two proteins.

Supplementary Material

1_si_001

Acknowledgments

Supported by the National Institutes of Health (GM084396 to T.A.C and GM065334 to D.F.) and the National Science Foundation (CHE-0848398 to T.A.C and NSF Postdoctoral Fellowship DBI-0905967 to C.A.C.). We thank Yan Wang (U. Maryland Proteomics facility) for help with MS studies, and Mike McCrane and Steve Rokita for HPLC advice and usage. We also thank Michael Rape (UC Berkeley) for the Ube2s plasmid.

Footnotes

Supporting Information Available: Supplementary figures presenting the results of ESI-MS and NMR analyses. This material is available free of charge via the Internet at http://pubs.acs.org.

Contributor Information

T. Ashton Cropp, Email: tacropp@vcu.edu.

David Fushman, Email: fushman@umd.edu.

References

  • 1.Hershko A, Ciechanover A. Annu Rev Biochem. 1998;67:425–480. doi: 10.1146/annurev.biochem.67.1.425. [DOI] [PubMed] [Google Scholar]
  • 2.Glickman MH, Ciechanover A. Physiol Rev. 2002;82:373–428. doi: 10.1152/physrev.00027.2001. [DOI] [PubMed] [Google Scholar]
  • 3.Muratani M, Tansey WP. Nat Rev Mol Cell Biol. 2003;4:192–201. doi: 10.1038/nrm1049. [DOI] [PubMed] [Google Scholar]
  • 4.Aguilar RC, Wendland B. Curr Opin Cell Biol. 2003;15:184–90. doi: 10.1016/s0955-0674(03)00010-3. [DOI] [PubMed] [Google Scholar]
  • 5.Osley MA. Biochim Biophys Acta. 2004;1677:74–8. doi: 10.1016/j.bbaexp.2003.10.013. [DOI] [PubMed] [Google Scholar]
  • 6.Ikeda F, Dikic I. EMBO Rep. 2008;9:536–42. doi: 10.1038/embor.2008.93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wickliffe K, Williamson A, Jin L, Rape M. Chem Rev. 2009;109:1537–48. doi: 10.1021/cr800414e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ye Y, Rape M. Nat Rev Mol Cell Biol. 2009;10:755–64. doi: 10.1038/nrm2780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.David Y, Ziv T, Admon A, Navon A. J Biol Chem. 2010;285:8595–604. doi: 10.1074/jbc.M109.089003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Chau V, Tobias JW, Bachmair A, Marriott D, Ecker DJ, Gonda DK, Varshavsky A. Science. 1989;243:1576–83. doi: 10.1126/science.2538923. [DOI] [PubMed] [Google Scholar]
  • 11.Thrower JS, Hoffman L, Rechtenstein M, Pickart CM. EMBO J. 2000;19:94–102. doi: 10.1093/emboj/19.1.94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hicke L, Dunn R. Annu Rev Cell Dev Biol. 2003;19:141–72. doi: 10.1146/annurev.cellbio.19.110701.154617. [DOI] [PubMed] [Google Scholar]
  • 13.Sun L, Chen ZJ. Curr Opin Cell Biol. 2004;16:119–26. doi: 10.1016/j.ceb.2004.02.005. [DOI] [PubMed] [Google Scholar]
  • 14.Spence J, Sadis S, Haas A, Finley D. Mol Cell Biol. 1995;15:1265–73. doi: 10.1128/mcb.15.3.1265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Spence J, Gali RR, Dittmar G, Sherman F, Karin M, Finley D. Cell. 2000;102:67–76. doi: 10.1016/s0092-8674(00)00011-8. [DOI] [PubMed] [Google Scholar]
  • 16.Pickart CM, Fushman D. Curr Opin Chem Biol. 2004;8:610–6. doi: 10.1016/j.cbpa.2004.09.009. [DOI] [PubMed] [Google Scholar]
  • 17.Varadan R, Walker O, Pickart C, Fushman D. J Mol Biol. 2002;324:637–47. doi: 10.1016/s0022-2836(02)01198-1. [DOI] [PubMed] [Google Scholar]
  • 18.Varadan R, Assfalg M, Haririnia A, Raasi S, Pickart C, Fushman D. J Biol Chem. 2004;279:7055–7063. doi: 10.1074/jbc.M309184200. [DOI] [PubMed] [Google Scholar]
  • 19.Varadan R, Assfalg M, Raasi S, Pickart C, Fushman D. Mol Cell. 2005;18:687–98. doi: 10.1016/j.molcel.2005.05.013. [DOI] [PubMed] [Google Scholar]
  • 20.Sims JJ, Cohen RE. Mol Cell. 2009;33:775–83. doi: 10.1016/j.molcel.2009.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Piotrowski J, Beal R, Hoffmann L, Wilkinson KD, Cohen RE, Pickart CM. J Biol Chem. 1997;272:23712–21. doi: 10.1074/jbc.272.38.23712. [DOI] [PubMed] [Google Scholar]
  • 22.Pickart CM, Raasi S. Methods Enzymol. 2005;399:21–36. doi: 10.1016/S0076-6879(05)99002-2. [DOI] [PubMed] [Google Scholar]
  • 23.Varadan R, Assfalg M, Fushman D. In: Ubiquitin and Protein Degradation, Methods in Enzymology, Vol.399 part B. part B. Deshaies RJ, editor. Vol. 399. 2005. pp. 177–192. [DOI] [PubMed] [Google Scholar]
  • 24.Castaneda CA, Liu J, Kashyap TR, Singh RK, Fushman D, Cropp TA. Chem Commun (Camb) 2011;47:2026–8. doi: 10.1039/c0cc04868b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Chatterjee C, McGinty RK, Pellois JP, Muir TW. Angew Chem Int Ed Engl. 2007;46:2814–8. doi: 10.1002/anie.200605155. [DOI] [PubMed] [Google Scholar]
  • 26.Yang R, Pasunooti KK, Li F, Liu XW, Liu CF. J Am Chem Soc. 2009;131:13592–3. doi: 10.1021/ja905491p. [DOI] [PubMed] [Google Scholar]
  • 27.Ajish Kumar KS, Haj-Yahya M, Olschewski D, Lashuel HA, Brik A. Angew Chem Int Ed Engl. 2009;48:8090–4. doi: 10.1002/anie.200902936. [DOI] [PubMed] [Google Scholar]
  • 28.Kumar KS, Spasser L, Erlich LA, Bavikar SN, Brik A. Angew Chem Int Ed Engl. 2010;49:9126–31. doi: 10.1002/anie.201003763. [DOI] [PubMed] [Google Scholar]
  • 29.Yang R, Pasunooti KK, Li F, Liu XW, Liu CF. Chem Commun (Camb) 2010;46:7199–201. doi: 10.1039/c0cc01382j. [DOI] [PubMed] [Google Scholar]
  • 30.Virdee S, Ye Y, Nguyen DP, Komander D, Chin JW. Nat Chem Biol. 2010;6:750–7. doi: 10.1038/nchembio.426. [DOI] [PubMed] [Google Scholar]
  • 31.El Oualid F, Merkx R, Ekkebus R, Hameed DS, Smit JJ, de Jong A, Hilkmann H, Sixma TK, Ovaa H. Angew Chem Int Ed Engl. 2010;49:10149–53. doi: 10.1002/anie.201005995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kumar KS, Spasser L, Ohayon S, Erlich LA, Brik A. Bioconjug Chem. 2011;22:137–43. doi: 10.1021/bc1004735. [DOI] [PubMed] [Google Scholar]
  • 33.Kumar KS, Bavikar SN, Spasser L, Moyal T, Ohayon S, Brik A. Angew Chem. 2011;50:6137–6141. doi: 10.1002/anie.201101920. [DOI] [PubMed] [Google Scholar]
  • 34.Virdee S, Kapadnis PB, Elliott T, Lang K, Madrzak J, Nguyen DP, Riechmann L, Chin JW. J Am Chem Soc. 2011;133:10708–11. doi: 10.1021/ja202799r. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.For a recent example of using non-natural amino acids for making an isopeptide mimic see Eger S, Scheffner M, Marx A, Rubini M. J Am Chem Soc. 2010;132:16337–9. doi: 10.1021/ja1072838.
  • 36.Eddins MJ, Varadan R, Fushman D, Pickart CM, Wolberger C. J Mol Biol. 2007;367:204–211. doi: 10.1016/j.jmb.2006.12.065. [DOI] [PubMed] [Google Scholar]
  • 37.Datta AB, Hura GL, Wolberger C. J Mol Biol. 2009;392:1117–24. doi: 10.1016/j.jmb.2009.07.090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.For a first demonstration of using chemical ligation of recombinant proteins for selective segmental isotopic labeling of protein domains see Xu R, Ayers B, Cowburn D, Muir TW. Proc Natl Acad Sci USA. 1999;96:388–393. doi: 10.1073/pnas.96.2.388.
  • 39.Aimoto S. Biopolymers. 1999;51:247–65. doi: 10.1002/(SICI)1097-0282(1999)51:4<247::AID-BIP2>3.0.CO;2-W. [DOI] [PubMed] [Google Scholar]
  • 40.Studier FW. Protein Expr Purif. 2005;41:207–34. doi: 10.1016/j.pep.2005.01.016. [DOI] [PubMed] [Google Scholar]
  • 41.Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, Bax A. J Biomol NMR. 1995;6:277–93. doi: 10.1007/BF00197809. [DOI] [PubMed] [Google Scholar]
  • 42.Goddard TD, Kneller DG. SPARKY3. University of California; San Francisco: [Google Scholar]
  • 43.Streu C, Meggers E. Angew Chem Int Ed Engl. 2006;45:5645–8. doi: 10.1002/anie.200601752. [DOI] [PubMed] [Google Scholar]
  • 44.Xu P, Duong DM, Seyfried NT, Cheng D, Xie Y, Robert J, Rush J, Hochstrasser M, Finley D, Peng J. Cell. 2009;137:133–45. doi: 10.1016/j.cell.2009.01.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Williamson A, Wickliffe KE, Mellone BG, Song L, Karpen GH, Rape M. Proc Natl Acad Sci U S A. 2009;106:18213–8. doi: 10.1073/pnas.0907887106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Pervushin K, Riek R, Wider G, Wuthrich K. Proc Natl Acad Sci USA. 1997;94:12366–71. doi: 10.1073/pnas.94.23.12366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Schanda P, Kupce E, Brutscher B. J Biomol NMR. 2005;33:199–211. doi: 10.1007/s10858-005-4425-x. [DOI] [PubMed] [Google Scholar]
  • 48.Huang H, Jeon MS, Liao L, Yang C, Elly C, Yates JR, 3rd, Liu YC. Immunity. 2010;33:60–70. doi: 10.1016/j.immuni.2010.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Ai HW, Lee JW, Schultz PG. Chem Commun (Camb) 2010;46:5506–8. doi: 10.1039/c0cc00108b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Liu J, Castaneda CA, Wilkins BJ, Fushman D, Cropp TA. Bioorg Med Chem Lett. 2010;20:5613–6. doi: 10.1016/j.bmcl.2010.08.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Tugarinov V, Hwang PM, Kay LE. Annu Rev Biochem. 2004;73:107–46. doi: 10.1146/annurev.biochem.73.011303.074004. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1_si_001

RESOURCES