Highlights
► I review enzymes commonly used to remove affinity tags. ► Both endoproteases and exoproteases are covered. ► The specificity and biochemical properties of proteases are discussed.
Keywords: Carboxypeptidase, Aminopeptidase, PreScission protease, Affinity tag, Polyhistidine tag, His-tag, TEV protease, Enterokinase, Enteropeptidase, Thrombin, 3C protease, Factor Xa
Abstract
Although they are often exploited to facilitate the expression and purification of recombinant proteins, every affinity tag, whether large or small, has the potential to interfere with the structure and function of its fusion partner. For this reason, reliable methods for removing affinity tags are needed. Only enzymes have the requisite specificity to be generally useful reagents for this purpose. In this review, the advantages and disadvantages of some commonly used endo- and exoproteases are discussed in light of the latest information.
Introduction
Affinity tags have become essential tools for the production of recombinant proteins in a wide variety of settings, from basic research to high-throughput structural biology. Not only do they facilitate the detection and purification of their fusion partners, as originally intended, but they may also have a beneficial impact on the yield of recombinant proteins and, in some cases, increase their solubility and even promote their proper folding [2], [3].
Despite these important advantages, the Achilles heel of the affinity tagging strategy always has been and remains the removal of tags. Whereas many tagged proteins retain their structural integrity and biological activity, others clearly do not, e.g., [4], [5], [6], [7], [8], [9], [10], [11]. Therefore, whenever possible, it is prudent to remove tags from recombinant proteins. Although both chemical and enzymatic methods have been used to cleave fusion proteins at designed sites, only the natural proteolytic enzymes have the requisite specificity to be broadly useful reagents for this purpose. Because they are not as versatile and therefore generally less useful than trans-acting reagents, neither the self-cleaving inteins [12] and self-cleaving variants of subtilisin [13] will be discussed here, nor will the Ulp1 protease since it only cleaves SUMO tags [14]. Rather, this review will focus on the most generally applicable and commonly used enzymatic reagents for the removal of affinity tags (Table 1 ). Since the last comprehensive review of this topic [3] much research on these reagents has been conducted. As a result, a wealth of new information has accumulated on the advantages, disadvantages, and biochemical characteristics of various reagents.
Table 1.
Endo- and exoproteases for the removal of affinity tags.
| Enzyme | Source(s) | Molecular weight (kDa) | Tagged forms | Inhibitors | Recognition Site | Notes |
|---|---|---|---|---|---|---|
| Endoproteases | ||||||
| Enteropeptidase | Duodenum E. coli S. cerevisiae |
110 + 35 | His6 | Reducing agents | DDDDK↓ | P1′ ≠ Pro, Trp 35 KDa light chain is active by itself |
| Thrombin | Plasma CHO cells |
32 + 4.5 | None | Reducing agents | LVPR↓GS | |
| Factor Xa | Plasma HEK 293 cells |
42 + 17 | None | Reducing agents Chelating agents Phosphate ions |
LVPR↓GS | Very promiscuous |
| TEV Protease | E. coli | 27 | His6 MBP GST Strep II |
Thiol alkylating agents | ENLYFQ↓G | P1′ can vary [68] P2′ ≠ Pro Ac-TEV™ = S219 V mutant |
| Rhinovirus 3C Protease | E. coli | 27 | His6 GST His6-GST |
Thiol alkylating agents | LEVLFQ↓GP | Same as PreScission™ protease |
| Exoproteases | ||||||
| Carboxypeptidase A | Pancreas E. coli S. cerevisiae S. frugiperda (baculovirus) |
33 | His6 | Reducing agents Chelating agents |
C-terminal amino acids except Pro, Lys and Arg | Asp, Glu, Gly cleaved slowly |
| Carboxypeptidase B | Pancreas E. coli P. pastoris |
35 | none | Reducing agents Chelating agents |
C-terminal Lys and Arg | Will cleave hydrophobic resides under certain conditions [119], [120] |
| DAPase | Kidney S. frugiperda (baculovirus) |
23 + 16 + 6 | His6 | Reducing agents Thiol alkylating agents |
N-terminal dipeptides | P2 ≠ Pro, Lys, Arg P1 ≠ Pro |
Endoproteases
For many years, serine proteases such as activated blood coagulation factor X (factor Xa), 1 enterokinase (hereafter referred to by its more appropriate moniker enteropeptidase), and α-thrombin were the reagents of choice for removing affinity tags, yet the literature is replete with reports of fusion proteins that were cleaved by these proteases at locations other than the designed site. Over the last decade or so, it has become increasingly evident that certain viral proteases have far more stringent sequence specificity, which has led to an upsurge in their popularity. These enzymes have a chymotrypsin-like fold with an atypical catalytic triad in which cysteine replaces serine, and they exhibit an absolute requirement for glutamine in the P1 2 position of their substrates. The nuclear inclusion protease from tobacco etch virus (TEV) is probably the best-characterized enzyme of this type. The other is the human rhinovirus 3C protease. In stark contrast to factor Xa, enteropeptidase and thrombin, there have been very few if any reports of cleavage at noncanonical sites in designed fusion proteins by these viral proteases.
The stringent specificity of the viral proteases probably can be attributed to their low turnover rates. The number of substrate residues that are recognized by the serine proteases and the viral proteases is similar (e.g., LVPRGS and ENLYFQS in the case of thrombin and TEV protease, respectively). The Michaelis constants (K M) for the two classes of enzymes are also similar, falling in the low to mid micromolar range, but the catalytic rate constants (k cat) of the viral proteases are on the order of 100-fold lower than those of the serine proteases, resulting in much slower turnover rates [15], [16], [17], [18], [19]. Each class of protease undoubtedly associates transiently with suboptimal recognition sites, but on average, a catalytic event is far more likely to occur when a serine protease does so because its k cat is so much greater than that of the typical viral protease. The practical ramification of this observation is that one must use considerably more viral protease than serine protease to digest a fixed amount of fusion protein at a similar rate. However, this is not a significant handicap because, unlike the serine proteases, large quantities of recombinant viral proteases can easily be produced in Escherichia coli. This advantage, coupled with their more stringent sequence specificity, has made viral proteases the reagents of choice for endoproteolytic removal of affinity tags.
Enteropeptidase
Native enteropeptidase is a disulfide-linked heterodimer composed of “heavy” and “light” chains (with apparent molecular weights of 110 and 35 kDa, respectively), which are extensively glycosylated [20]. Although originally purified from natural sources [20], [21], recent advances have facilitated the production of a recombinant enteropeptidase light chain in the periplasm of E. coli [22], [23], making it more economical to manufacture and yielding a product free of contaminating proteases. Moreover, the 26 kDa light chain (which, when expressed in E. coli, is devoid of glycosylation) retains the specificity of the native enzyme and is even more active on fusion protein substrates than the heterodimer [22]. The availability of recombinant enteropeptidase also presents an opportunity for the attachment of affinity tags to the enzyme to facilitate its purification and separation from the products of an enteropeptidase digest [24]. One drawback of enteropeptidase, however, is that the light chain contains 4 disulfide bonds that are essential for catalytic activity, thus this enzyme is incompatible with buffers containing reducing agents like dithiothreitol (DTT). On the other hand, a recent study showed enteropeptidase to be relatively insensitive to a wide variety of detergents [25], which may enhance its value as a tool for the production of membrane proteins.
A significant advantage of enteropeptidase is that no critical specificity determinants are located on the C-terminal side of the scissile bond in its substrates (Table 1). Consequently, when an affinity tag is joined to the N-terminus of the protein of interest, in most cases enteropeptidase is able to generate a digestion product with a native N-terminus. A comprehensive study examining the relative processing rates of otherwise identical fusion proteins with all twenty possible amino acids in the P1′ position confirmed this property of enteropeptidase and revealed the rank order of processing efficiency [26]. Only proline and tryptophan were not well tolerated in the P1′ position. Another study examining the importance of the P1–P5 positions concluded that the P1 lysine was the most important specificity determinant, followed by the aspartate residues in the P2, P3, P5 and P4 positions, respectively, with the latter position contributing very little to specificity [27]. Interestingly these investigators found that the sequence DDDDR was cleaved more efficiently than the canonical DDDDK.
The principal drawback of enteropeptidase is its promiscuity [28], [29], which is particularly troublesome when a cryptic cleavage site is located within the protein of interest. Although efforts have been made to mitigate this problem by varying the recognition site [30] and tinkering with the enzyme [31], a satisfactory solution has yet to present itself.
Thrombin
Thrombin is typically purified from bovine plasma [32]. Although recombinant human thrombin has been produced in Chinese hamster ovary cells for clinical applications [33], as of now no convenient procedure for the expression and purification of recombinant thrombin at the bench-level scale has been described. The lack of a ready source of genetically engineered thrombin is a noteworthy disadvantage because, unlike enteropeptidase and the viral proteases (below), no affinity tags can be added to facilitate its removal following the digestion of a fusion protein.
Like many serine proteases, thrombin can be inactivated by phenylmethylsulfonyl fluoride (PMSF) or 4-(2-aminoethyl) benzenesulfonyl fluoride hydrochloride (AEBSF). Alternatively, a biotinylated form of thrombin that can be adsorbed on avidin or streptavidin resin is commercially available (Novagen, Madison, WI). Like enteropeptidase, thrombin is a disulfide-linked heterodimer. It also has three intramolecular disulfide bonds in one of its two chains [34], rendering it sensitive to reducing agents. The optimum temperature for thrombin activity is 45 °C and the enzyme maintains 20% of its maximal activity at 15 °C [35]. Thrombin is active over a pH range of 5–10, with a pH optimum of about 9.5 in the absence of NaCl and 8.3 in the presence of 1 M NaCl [35]. At pH 8.3, its activity increases with rising NaCl concentration up to at least 1 M. Glycerol and polyethylene glycol (PEG 600) enhance the stability of thrombin. Like enteropeptidase, thrombin is relatively resistant to a wide variety of detergents [25].
The thrombin cleavage site most frequently used in fusion protein substrates (LVPR↓GS) has an interesting history. This sequence closely resembles that of a natural thrombin cleavage site (LVPR↓GF) in human factor VIII. Yet the factor VIII sequence was not cleaved as efficiently as others in a comparative study [36], raising the question of why it was viewed as a particularly good thrombin site in the first place. Later, in their seminal paper describing the first application of glutathione S-transferase (GST) as an affinity tag, Smith and Johnson replaced the P2′ phenylalanine in the factor VIII sequence with a serine residue purely for the purpose of creating a BamHI restriction endonuclease cleavage site that would facilitate the construction of GST fusion proteins [37]. After that, LVPR↓GS was universally espoused as the canonical thrombin cleavage site, irrespective of whether or not BamHI was utilized for cloning, e.g., [38]. Therefore, it is possible this is not the linear epitope that is most efficiently cleaved by thrombin.
Specificity studies with thrombin have shown a range of sequence selectivity of 656-fold and 33-fold at the P2′ and P3′ positions, respectively [39]. Proline and the negatively charged amino acids greatly diminished processing efficiency when present in the P2′ position of peptide substrates (the “canonical” serine was not tested in this study). The identity of the residue in the P3′ position was not nearly as influential, although the negatively charged residues were most inhibitory. In another study, little bias was observed in the P2 and P3 sites, except for the exclusion of acidic residues. Interestingly, a strong preference was found for serine in the P1′ position, in contrast to the “canonical” glycine, which was a distant fourth, after alanine and threonine [40]. When a large library of different peptide sequences was screened for thrombin cleavage, the results revealed that the P1 arginine is the most conserved residue, followed by the P1′ glycine [41]. Yet a remarkable level of promiscuity was observed, consistent with reports of cleavage at cryptic sites in fusion proteins [42]. The nominal requirement for a Gly-Ser dipeptide in the P1′ and P2′ positions, which would result in the retention of two nonnative residues on the N-terminus of the protein of interest following thrombin digestion, is a marked disadvantage of thrombin relative to enteropeptidase.
Factor Xa
A blood clotting enzyme like thrombin, the γ-carboxylated glycoprotein factor Xa is either isolated from blood plasma or expressed recombinantly and secreted from mammalian cells [43], [44], [45], [46], [47]. Despite the availability of recombinant factor Xa, no affinity-tagged forms of the enzyme have been engineered to date. Factor Xa is composed of two disulfide-linked polypeptide chains with apparent molecular weights of 17 and 42 kDa, each of which contains a number of internal disulfide bonds, rendering the enzyme sensitive to reducing agents. Factor Xa also binds calcium ions and therefore should not be used in the presence of chelating agents such as ethylene glycol tetraacetic acid (EGTA) and ethylenediaminetetraacetic acid (EDTA). The sensitivity of Factor Xa to various detergents has also been studied [25].
The specificity determinants of factor Xa have been examined in some detail [48], [49], [50], [51]. Although it is a commonly held belief that this enzyme is insensitive to the identity of the residues on the C-terminal side of the scissile bond (its recognition site is usually denoted as IEGR↓), this is not the case. In the P1′ position, processing efficiency varies over a 50-fold range, with hydrophobic residues being most favorable and negatively charged residues and proline being least favorable. An even greater range of processing efficiency (160-fold) was observed in the P2′ position, with threonine and proline being particularly poorly tolerated [51]. As with enteropeptidase and thrombin, cleavage by factor Xa at undesired cryptic sites remains a common problem with no apparent solution [42].
TEV protease
Virologist William Dougherty and his colleagues were the first to characterize TEV protease in the late 1980s. Initially identified as a 49 kDa processing product of the viral polyprotein and termed “nuclear inclusion protein Ia” or “NIa” [52], later work demonstrated that the protease activity resides in its C-terminal 27 kDa domain [53]. Aligning the sequences of experimentally determined processing sites suggested that the recognition site for TEV protease is a linear epitope consisting of seven amino acid residues. This conjecture was confirmed by demonstrating that the sequence ENLYFQ↓S could function as a TEV protease cleavage site when placed in an artificial context [54] and led to the realization that TEV protease might be a useful reagent for the removal of affinity tags by site-specific endoproteolysis [55].
Early efforts to overproduce a polyhistidine-tagged form the 27 kDa catalytic domain of TEV protease in E. coli met with only limited success, yielding approximately 4 mg of pure protein per liter of bacterial culture [55]. In later work, the yield was improved dramatically by utilizing a solubility-enhancing fusion partner [56] in conjunction with a tRNA-encoding accessory plasmid or synonymous codon substitutions in the TEV protease open reading frame [57]. Further improvements in the production of soluble His-tagged TEV protease were subsequently realized, yielding up to 400 mg/L or 15 mg/g of cell paste [58], [59]. TEV protease has also been fused to a number of other affinity tags, including glutathione S-transferase [59], maltose binding protein (MBP) [56], and the Streptag II [60]. TEV protease expression vectors can be obtained from a variety of sources, including the American Type Culture Collection (ATCC), the Addgene plasmid repository (http://www.addgene.org), and the Protein Structure Initiative Biological Materials Repository (http://psimr.asu.edu/). Straightforward procedures for the expression and purification of TEV protease have been described [61], [62].
The 27 kDa catalytic domain of TEV protease readily cleaves itself near its C-terminus to generate a truncated enzyme with greatly diminished activity [63], [18]. However, autolysis of TEV protease can be avoided by introducing amino acid substitutions in the vicinity of the internal cleavage site [62], [18]. One of these mutants (S219V) is approximately 100-fold more resistant to autoinactivation than the wild-type protease and, fortuitously, also has moderately greater catalytic activity [18]. The S219V mutant is marketed by Invitrogen under the trade name Ac-TEV protease. Additional mutations have been introduced into TEV protease as another means of improving its solubility [64], [65].
The specificity determinants of TEV protease have been thoroughly investigated. Early experiments by Dougherty and colleagues demonstrated a strong preference for glutamic acid in the P6 position of a TEV protease recognition site, little or no selectivity in the P5 position, a moderate preference for leucine in the P4 position, a strong preference for tyrosine in the P3 position, a roughly equal tolerance for phenylalanine, cysteine and isoleucine in the P2 position (with all other residues being poorly tolerated in this position), a strong bias in favor of glutamine in the P1 position, and some degree of selectivity in the P1′ position, with serine, isoleucine, and asparagine being the most favorable of the residues tested [66]. These results led to the notion that the “consensus” TEV protease cleavage site could be defined as Glu-Xaa-Xaa-Tyr-Xaa-Gln↓Ser/Gly and the assumption that those positions denoted as “Xaa” could be freely substituted. However, more recent experiments have shown that TEV protease is quite sensitive even to conservative amino acid substitutions in the P4 and P2 positions [67]. In another study, the P1′ specificity of TEV protease was systematically assessed in the context of a model fusion protein [68]. These investigators found that the enzyme was surprisingly tolerant of a wide variety of residues in this position. Small aliphatic residues such as glycine, alanine, serine, methionine, and cysteine were exceptionally well tolerated, whereas the negatively charged and β-branched hydrophobic residues were inhibitory to processing. These findings upended the dogma that a serine or glycine residue is essential in the P1′ position of a TEV protease recognition site, revealing instead that, like enteropeptidase and factor Xa, in many cases TEV protease is capable of generating digestion products with native N-termini. As noted, the discrepancies between early and later results are probably due to differences in the experimental methods employed [68]. In the initial studies carried out by Dougherty and colleagues, the concentration of enzyme (which was the full-length NIa protein rather than the 27 kDa catalytic domain) exceeded that of the substrates, which were far below the K M of the enzyme. Later experiments were conducted under more realistic reaction conditions, using pure preparations of enzyme and substrate with a substantial molar excess of the latter. An important footnote to early studies of TEV protease specificity that is sometimes overlooked by today’s protein engineers is that proline (but no other residue) is decidedly inhibitory in the P2′ position of its substrates [66]. Other studies in which large combinatorial libraries of peptides have been interrogated by TEV protease are in good general agreement with the results of systematic mutagenesis experiments [69], [70].
The crystal structure of catalytically inactive (C151A) TEV protease in complex with the peptide substrate TTENLYFQSGT revealed that although the P7, P2′ and P3′ residues were clearly visible in the electron density map, their side chains do not engage in any noteworthy interactions with the enzyme (no density was observed for the P8 threonine residue) [71]. Of the side chains in the P6–P1′ positions of the peptide, only that of P5 asparagine does not make intimate contact with the enzyme, consistent with the observed lack of sequence specificity at this position [66]. The tolerance for a wide variety of side chains in the P1′ position of TEV protease substrates [68] is possible because the S1′ “pocket” of the enzyme is actually a long, shallow groove on its surface. Consequently, the side chain of the P1′ residue is partially exposed to solvent rather than completely buried within the complex (Fig. 1 A). This is not a general property of potyviral proteases, however. In the co-crystal structure of the related (53% amino acid identity) tobacco vein mottling virus (TVMV) protease with its canonical peptide substrate, the S1′ pocket is small and round and the side chain of the P1′ serine residue projects directly into it (Fig. 1B) [72]. Hence, from a biotechnological standpoint, it is fortuitous that TEV protease, with its relaxed P1′ specificity, emerged as a reagent of choice for removing affinity tags.
Fig. 1.

Space-filling representations of the S1′ pockets of TEV protease (A) and TVMV protease (B). The canonical peptides that were co-crystallized with each enzyme are shown as stick representations. The side chains of the P1′ serine residues are encased in mesh. Note that the side chain of the P1′ serine projects along the surface of a shallow groove in TEV protease whereas the side chain of the corresponding serine points into a small, shallow pocket in TVMV protease.
Quite a lot is known about the performance of TEV protease under various reaction conditions and in the presence of different additives. It is active over a pH range between 6 and 9, but inactive at or below pH 5 [63]. In the same study, the enzyme was reported to be relatively insensitive to NaCl concentrations between 0.1 and 2.0 M. In a later study, TEV protease activity was observed to be greatest in the absence of monovalent salt, but decreased only moderately at NaCl concentrations up to 200 mM [73]. Although the optimum temperature for TEV protease activity is 30–34 °C, the enzyme retains significant activity at 4 °C [73], [74]. Activity drops off abruptly above 37 °C, probably due to denaturation of the enzyme [73]. TEV protease is not inhibited by 2.5% sucrose or 0.1% Triton X-100, but is completely inactivated by 0.01% sodium dodecyl sulfate [75]. The enzyme is insensitive to the protease inhibitors PMSF, AEBSF, and pepstatin A, relatively tolerant of aprotinin (up to 0.3 mg/ml) and leupeptin (up to 100 μM), and can tolerate up to 0.5 M urea [75], [76]. Its catalytic activity is impeded by some detergents, however [25], [77], [78]. TEV protease activity is not impeded by reducing agents such as dithiothreitol (DTT) or chelators like EDTA, but, as expected, it is highly sensitive to thiol alkylating agents like iodoacetamide [75].
Although TEV protease is by far the best studied and most widely used potyviral protease, several others have also been characterized to varying degrees. These include the TVMV protease [73], [79], the plum pox virus protease [80], and the turnip mosaic virus protease [81], [82]. None of these enzymes offer any compelling advantages over TEV protease, however. TVMV protease expression vectors can be obtained from the Addgene plasmid repository (http://www.addgene.org).
Rhinovirus 3C protease
Because human rhinoviruses may account for the majority of common colds and have been associated with acute and chronic bronchitis along with other respiratory tract illnesses, the literature pertaining to rhinovirus 3C protease is dominated by the quest for inhibitors of this enzyme [83]. The same is true of the 3C-like proteases encoded by related viruses that cause polio, hepatitis A, and severe acute respiratory syndrome (SARS), among other diseases. Indeed, the poliovirus 3C protease was among the earliest enzyme of its type to be characterized [84], [85], [86]. It is not entirely clear why human rhinovirus 14 3C protease emerged from this group of picornaviruses as the prototypical enzyme for endoproteolytic removal of affinity tags from recombinant proteins. Similarly, there seems to be no compelling reason why TEV protease was selected for this role from among well over one hundred potyviruses that infect plants. In both cases, it may simply be a matter of happenstance. In any event, the tobacco etch virus and the human rhinovirus are members of the same viral superfamily [87], [88]. Consequently, their 3C-like proteases are structurally and functionally related to one another and share many common characteristics.
Rhinovirus 14 3C protease is commercially available in the form of a glutathione S-transferase (GST) fusion protein called PreScission™ protease [89] and as a dual His6-GST fusion protein. PreScission™ protease and rhinovirus 3C protease were identified as distinct enzymes with differing sequence specificity in the latest review article to include an in-depth survey of enzymatic methods for tag removal [3], but this is not the case. Rhinovirus 3C protease has also been fused to His6 and MBP tags in E. coli [90], to DsbA [91], and very recently to the biotin acceptor peptide (BAP) [92]. The yield of BAP-tagged 3C was reported to be ∼6 mg/g of wet cell paste. However, only about 60% of the BAP-tagged 3C protease was biotinylated in vivo (inexplicably, it was purified by conventional methods instead of by affinity chromatography on monomeric avidin resin). The yields of other tagged forms of the enzyme have not been reported. Unfortunately, rhinovirus 3C expression vectors are not presently available from open plasmid repositories, making them somewhat difficult to acquire.
Knowledge about the substrate specificity of human rhinovirus 3C protease has been gained mainly through studies of the enzyme from serotype 14. One comparative analysis of peptide substrates derived from natural 3C cleavage sites in the rhinovirus polyprotein established that the 2C/3A site was cleaved most efficiently [93] and that the specificity determinants are confined to the P5–P2′ sites [94], leading to the conclusion that the sequence ETLFQ↓GP is the optimum recognition site. At about the same time and using a very similar approach, a different group found that the 3B/3C cleavage site was the most efficiently processed of the natural sites [95] and that the specificity determinants are confined to the P5–P2′ sites [19], leading them to identify a different consensus sequence: PVVVQ↓GP. However, the canonical recognition site identified in the product literature accompanying PreScission™ protease is LEVLFQ↓GP. Walker and colleagues [90] speculated that this unnatural recognition site may have evolved in part from the observation that the peptide EVLFQPG was hydrolyzed nearly five times more efficiently than the parental ETLFQGP peptide derived from the natural 2C/3A polyprotein processing site [94]. In the same study, a longer peptide encompassing the 2C/3A processing site (DSLETLFQGPVYKDL) was cleaved about twice as efficiently as the shorter ETLFQGP peptide. Although the longer peptide includes the natural leucine residue in the P6 position, there was insufficient data to conclude that its presence (rather than some other difference between the two peptides) was responsible for the observed twofold variation in processing efficiency. Yet in a more recent study, replacing a leucine in the P6 position of a peptide substrate with an arginine led to a 10-fold reduction in the catalytic efficiency (k cat/KM) of processing by rhinovirus 3C protease [96]. Hence, it seems that the octapeptide LEVLFQPG, the so-called “PreScission protease site”, approximates the optimum recognition site. Although, as discussed above, two groups arrived at somewhat different conclusions regarding the optimum sequence for a rhinovirus 3C protease cleavage site, both were in agreement on one key point: the requirement for the Gly-Pro dipeptide immediately after the scissile bond is very stringent. This is a noteworthy disadvantage of 3C compared to TEV protease; there is virtually no opportunity to produce a recombinant protein with a native N-terminus after digesting a fusion protein with 3C protease. There have been no “substrate protease” experiments performed with 3C protease (i.e., no genetic selections or screens), and consequently the identity of the “ideal” 3C cleavage site rests entirely on the systematic mutagenesis studies.
The impact of protease inhibitors, additives, and reaction conditions on the activity of rhinovirus 3C protease has been investigated [96], [97], [98], and in general, the results are very similar to those reported for TEV protease. However, rhinovirus 3C protease is rumored to have greater catalytic activity than TEV protease at low temperature (4 °C). Yet an exhaustive search of the literature failed to identify any instances in which the temperature-dependence of the two proteases has been compared directly. Instead, one finds only anecdotal statements that are unsupported by facts [99]. It would be gratifying to finally see this issue addressed experimentally.
While, as discussed above, the human rhinovirus serotype 14 3C protease has been widely used as a tool for endoproteolytic cleavage of fusion proteins, it suffers from the major disadvantage of leaving behind a Gly-Pro dipeptide on the N-terminus of a recombinant protein after digestion. The 3C proteases encoded by SARS virus [100] and poliovirus [101] also have a stringent requirement for specific residues on the C-terminal side of the scissile bond. However, the hepatitis A virus 3C protease has more relaxed specificity, as it has been shown to tolerate a variety of small aliphatic residues in the P1′ position and any residue except arginine and proline in the P2′ position [102], [103], [104]. This enzyme is straightforward to overproduce in E. coli and can be purified to greater than 95% homogeneity in two conventional chromatography steps, yielding more than 6 mg per liter of bacterial culture [105]. Perhaps it is worth reexamining the utility of hepatitis A 3C protease as a reagent for the endoproteolytic removal of affinity tags.
An advantage of affinity-tagged proteases
The ease of overproducing affinity-tagged forms of TEV and rhinovirus 3C proteases in E. coli has inspired the development of a generic protocol for protein purification [2], [3] that is outlined in Fig. 2 . The same affinity tag is attached to both the protein of interest and to an endoprotease. The tagged protein of interest (POI) is first purified from a soluble extract by some type of affinity chromatography, depending on the tag that is used. In the second step, the affinity tag is removed from the POI by the correspondingly tagged endoprotease. Finally, the digestion products are subjected to the same form of affinity chromatography a second time, in this case to remove undigested fusion protein substrate, the tagged protease, the cleaved tag, and any endogenous proteins that bound to the affinity resin during the first round of chromatography, leaving only the untagged POI in the unbound effluent. Variations of this strategy have been developed by a number of groups and have found widespread use in the structural genomics community, e.g., [106], [107], [108], [109].
Fig. 2.

A generic strategy for protein purification that utilizes an affinity-tagged endoprotease. See text for discussion.
Troubleshooting endoproteolytic cleavage of affinity tags
As mentioned above, the removal of affinity tags has always been the Achilles heel of the fusion protein strategy. The problem of cleavage at secondary sites has been mitigated to a large degree by the use of viral proteases with very stringent sequence specificity. Still, it is not uncommon to encounter a situation where a fusion protein is cleaved very inefficiently by the chosen protease. There are several possible reasons for this. The most trivial explanation can be attributed to a failure to recognize the incompatibility of certain proteases with buffer components and additives. For example, a calcium-dependent protease such as factor Xa should never be used to cleave a fusion protein in a phosphate buffer (the most common buffer used for the purification of His-tagged proteins) because of the potential for the formation of insoluble calcium phosphate.
The inability of a protease to cleave a fusion protein may also be caused by steric hindrance, which can be of several types. For example, the cleavage site may be too close to ordered structure in the target protein. An example of such a situation is illustrated in Fig. 3 . In this case, the substrate is an MBP-SycH fusion protein. SycH is an export chaperone for the protein tyrosine phosphatase YopH in Yersinia pestis, the plague-causing bacterium. When SycH is expressed in E. coli as a fusion to the C-terminus of MBP, with a TEV protease recognition site between the two domains, the fusion protein is not cleaved very efficiently (Fig. 3A). (In this experiment, the fusion protein was cleaved in vivo, using a separate plasmid expression vector to produce TEV protease [110].) In a second-generation construct, five glycine residues were inserted between the TEV protease recognition site and the N-terminus of SycH. As shown in Fig. 3B, this results in much more efficient processing of the fusion protein by TEV protease. However, no crystals of SycH with the additional glycine residues appended to its N-terminus were ever obtained. Instead, the form of SycH that was eventually crystallized had no extra residues added to its N-terminus. The structure revealed that the N-terminus of SycH is an integral part of the folded protein [111], consistent with the hypothesis that the close proximity of the TEV protease recognition site to the (structured) N-terminus of SycH in the original fusion protein was responsible for its failure to be cleaved efficiently and that the addition of five glycine residues alleviated this impediment.
Fig. 3.

Overcoming steric hindrance by inserting “spacer” residues between a protease recognition site and the N-terminus of the protein of interest. (A) Expression of MBP-SycH fusion protein with a TEV protease recognition site immediately adjacent to the N-terminal residue of SycH in the absence (−) or presence (+) of TEV protease. (B) Expression of an otherwise identical MBP-SycH fusion protein in which five consecutive glycine residues were inserted between the TEV protease recognition site and the N-terminal residue of SycH in the absence (−) and presence (+) of TEV protease.
The second level of steric hindrance applies to oligomeric proteins. It may be intuitively obvious how the presence of a tag like the 42 kDa MBP on a multimeric protein might create steric problems for affinity tag removal, yet even a tag as small as polyhistidine can present similar difficulties [112].
Finally, steric hindrance can sometimes result from the use of solubility-enhancing affinity tags like MBP [56] and NusA [113]. While these solubility enhancers can maintain their fusion partners in a soluble state in E. coli, and maybe in other heterologous hosts as well, their fusion partners may or may not be folded into their native conformations. At least some of these fusion proteins probably exist in the form of soluble aggregates that may not be readily accessible to an endoprotease, as has been shown to be the case for one MBP fusion protein [114].
Exoproteases
Exoproteases (aminopeptidases and carboxypeptidases) are not used nearly as frequently as endoproteases for the removal of affinity tags. This is mainly because most of the time affinity tags are joined to the N-terminus of a protein of interest, where they can best exert a positive effect on the yield and solubility of their fusion partners [2], and endoproteases like enteropeptidase and TEV protease can remove N-terminal affinity tags without leaving behind any nonnative residues. Yet sometimes there are compelling reasons to add a tag to the C-terminus of a recombinant protein. For example, a C-terminal tag is less likely to interfere with the removal of signal peptides and the secretion of proteins. However, endoproteolytic removal of C-terminal tags is complicated by the fact that the principal specificity determinants of endoproteolytic enzymes (e.g., Factor Xa, thrombin, enteropeptidase, tobacco etch virus and rhinovirus 3C proteases) are located on the N-terminal side of the scissile bond. Consequently, the removal of a C-terminal tag by any of them would leave behind a significant number of non-native residues (six in the case of TEV protease, which is equivalent in number to a hexahistidine tag). One solution to this problem is to use an exopeptidase, specifically a carboxypeptidase, to remove short C-terminal tags such as polyhistidine.
Aminopeptidases have also been used to remove short affinity tags from the N-termini of recombinant proteins [3], although not very frequently because endoproteases can accomplish this task quite well.
Metallocarboxypeptidases
Of the many varieties of carboxypeptidases, those belonging to the M14A subfamily of zinc-dependent metallocarboxypeptidases, historically known as the “digestive carboxypeptidases,” have proven to be particularly useful for the processive hydrolysis of C-terminal residues from peptides and globular proteins [115], [116]. Digestive carboxypeptidases can be classified on the basis of their sequence specificity. For example, type A carboxypeptidases preferentially remove C-terminal amino acid residues having aromatic or branched aliphatic side chains [117] whereas type B carboxypeptidases exhibit a strong preference for basic amino acids [118]. However, carboxypeptidase B will remove aromatic residues under certain conditions [119], [120]. The A-type carboxypeptidases are further subdivided into type A1 and type A2 isoforms in rodents and humans [121]. Carboxypeptidase A1 preferentially catalyzes the removal of aliphatic residues from peptide substrates, while the A2 isoforms show higher specificity toward aromatic residues like phenylalanine and tryptophan. Carboxypeptidase A2 is not present in the bovine pancreas. Instead, the single bovine carboxypeptidase A (BoCPA) has relatively broad substrate specificity [121]. Exoproteolytic removal of C-terminal His-tags and polyarginine tags from globular proteins has been achieved using BoCPA and bovine carboxypeptidase B (BoCPB), respectively [116], [122], [123], [124], [125], [126], [127], [128], [129]. Neither enzyme will digest a C-terminal proline residue. Thus, incorporation of a proline at the appropriate location is one way to ensure that the digestion product will have a homogeneous endpoint.
A- and B-type carboxypeptidases are synthesized as zymogens with an N-terminal signal peptide that targets them for secretion from cells. Following secretion, the non-catalytic domain (termed the prodomain) is removed by proteolytic cleavage to give rise to the catalytically active carboxypeptidase. The prodomains not only maintain the enzymes in a catalytically dormant state, but they are also required for their proper folding [130]. The mature ∼30–35 kDa BoCPA and BoCPB contain one and three disulfide bonds, respectively. Consequently, it is not advisable to use them in the presence of reducing agents.
As zinc-dependent enzymes, the digestive carboxypeptidases are inhibited by chelating agents like EDTA and 1,10-phenanthroline [120]. Sodium dodecyl sulfate (SDS) strongly inhibits carboxypeptidase A but hardly affects carboxypeptidase B activity (up to 1% w/v), while octyl-(polydisperse)oligooxyethylene and lauryl glutamate have only a mild inhibitory effect on both enzymes [131]. Carboxypeptidases A and B are optimally active between pH 7 and 9; they are inhibited by mercury but stimulated by cobalt [132], [133].
Because of their complex folding and activation pathways, the metallocarboxypeptidases have been challenging enzymes to overproduce in heterologous systems. Even now, virtually every commercial preparation is obtained from natural sources. One group described the production of pre-pro-BoCPA in the yeast Saccharomyces cerevisiae as a secreted zymogen and its subsequent activation by trypsin, but the yield of recombinant protein was not reported [134]. More recently, proBoCPA was produced as a thioredoxin fusion protein in E. coli, but the yield of purified fusion protein (prior to activation and purification of BoCPA) was only 0.8 mg per liter of bacterial culture and most of the fusion protein accumulated in the form of insoluble aggregates [135]. Efforts have also been made to overproduce mammalian carboxypeptidase A subtypes A1 and A2 in yeast [130], [136], [137], [138], but their narrower specificity makes these enzymes less appealing than BoCPA for the removal of C-terminal affinity tags from recombinant proteins. Moreover, none of the recombinant mammalian type A carboxypeptidases was engineered to include an affinity tag. Recently, a polyhistidine-tagged type A carboxypeptidase (MeCPA) from the fungal entomopathogen Metarhizium anisopliae has been secreted from baculovirus-infected insect cells and purified to homogeneity [117]. However, the yield of pure, active enzyme was a disappointing 0.25 mg per liter of conditioned medium. Yet this material proved to be highly active and, as predicted [139], the specificity of MeCPA was found to be even broader than that of BoCPA (Fig. 4 ). The same group has subsequently developed a strategy to produce MeCPA in E. coli that may be generally applicable to metallocarboxypeptidases. The proMeCPA zymogen is fused to the C-terminus of MBP to maintain it in a soluble form in the cytosol while the protein disulfide isomerase DsbC is co-expressed from a separate plasmid. This procedure yields about 1 mg of pure MeCPA per liter of bacterial culture (Austin et al., in preparation).
Fig. 4.

Relative processing efficiency (kcat/KM) of peptide substrates by MeCPA (black bars) and BoCPA (gray bars). The peptide substrates used were VSQNPKX, wherein X was the variable amino acid. No processing of peptides terminating in Arg, Lys, or Pro was observed for either enzyme. Data were compiled from [117].
Porcine carboxypeptidase B has been expressed at very high levels (250 mg per liter) by secretion of the zymogen from the yeast Pichia pastoris [140]. Rat procarboxypeptidase B has been expressed in E. coli, refolded, activated by trypsin and purified to yield 40 mg per liter of culture medium [141]. However, neither of these enzymes included an affinity tag to facilitate their purification or their removal from the products of a carboxypeptidase digest. There is currently no known source of recombinant BoCPB.
Aminopeptidases
Interestingly, there are no known aminopeptidase counterparts to the digestive carboxypeptidases. However, endoproteases that exhibit little or no specificity for residues on the C-terminal side of the scissile bond (e.g., enteropeptidase and TEV protease) can be used to remove N-terminal affinity tags without leaving any extra residues on the N-terminus of the digestion product in most cases. Nevertheless, N-terminal exopeptidases are occasionally used for the removal of affinity tags. The TAGZyme™ system, a commercial product, is presently the only noteworthy tool of this type [142].
The TAGZyme system consists of three enzymatic reagents: dipeptidyl-aminopeptidase I (DAPase), glutamine cyclotransferase (Qcyclase), and pyroglutamylaminopeptidase (pGAPase). The principal enzyme is DAPase, which catalyzes the stepwise excision of dipeptides from the N-terminus of a polypeptide. Natural “road blocks” to DAPase digestion are a lysine or arginine in the first position of a dipeptide or a proline in either position. Consequently, DAPase cannot remove any N-terminal affinity tag. For instance, it could only remove a FLAG-tag (DYKDDDDK) if the tag were modified so that the proximal lysine residue did not occupy the first position of the second dipeptide, and it could not remove a polyarginine tag (RRRRR) or a Strep II tag (WSHPQFEK) under any circumstances. Instead, it is mainly used as a reagent for removing N-terminal polyhistidine tags, which need not contain any road-blocking residues. Incorporation of an arginine or lysine residue following the polyhistidine sequence will terminate DAPase digestion at that point, leaving behind the basic residue on the N-terminus of the digestion product. Alternatively, if a true “native” N-terminus is desired, then this can be accomplished by inserting a glutamine residue immediately after the polyhistidine sequence and utilizing, in addition to DAPase, the other two enzymes, Qcyclase and pGAPase. When glutamine is exposed at the N-terminus of the polypeptide chain, it can be cyclized to pyroglutamate by the action of Qcyclase, rendering it resistant to further digestion by DAPase. The pyroglutamyl residue is then excised by pGAPase to yield a product with a native N-terminus.
DAPase , which is also known as cathepsin C, is composed of three polypeptide chains, at least two of which are linked together by one or more disulfide bonds [143]. Consequently, its enzymatic activity is sensitive to reducing agents. Moreover, as a cysteine protease, DAPase is inhibited by thiol akylating agents. DAPase is active over a pH range of 4–8 and exhibits maximal activity at pH 5.5 [143]. A polyhistidine-tagged form of DAPase has been secreted from baculovirus-infected insect cells, yielding approximately 50 mg of active enzyme per liter [144].
The TAGZyme™ system has been further refined such that all three enzymatic reagents are modified by the addition of polyhistidine tags, thereby facilitating their removal from a digestion product [145]. Although clever and elegant, this approach is complicated by the fact that it requires, in most cases, three distinct enzymes to produce a digestion product with a native N-terminus. Additionally, care must be taken to remove the DAPase prior to the addition of pGAPase. Vectors and methods for the production of these reagents are proprietary and therefore not readily available to researchers. As a result, the cost of these reagents is a potential impediment to their use on a large scale, such as would be required for structural biology.
Concluding remarks
Considering its high specificity yet relatively good tolerance for a wide variety of residues in the P1′ position of its recognition site, coupled with its ease and economy of production and the ready availability of expression vectors from open sources (including affinity-tagged variants), TEV protease is probably the single best endoprotease for the removal of N-terminal affinity tags. The ideal reagent would have all of these attributes along with totally relaxed P1′ specificity. Perhaps such an enzyme can be identified from among those encoded by other potyviruses or be engineered by rational design. As for the digestive carboxypeptidases, it would be useful to investigate whether a combination of A- and B-type enzymes can be used to remove a wider variety of short affinity tags from the C termini of recombinant proteins. Moreover, since the termini of proteins are often disordered, the ability to trim unstructured residues (apart from affinity tags) from their C termini might improve their propensity to crystallize. Yet it is unclear if the activity of these carboxypeptidases will be deterred upon encountering ordered structure in a globular protein. This needs to be tested. In principle, DAPase could be used in a similar fashion to remove disordered residues from the N-termini of proteins, subject to the same caveat. Proline is a roadblock for all of these exopeptidases and it would be useful to find a means of circumventing this issue. Finally, there are at least two ways in which endoproteases and exoproteases might be used in concert with each other that have yet to be explored. First, large C-terminal affinity tags (globular proteins) could be removed first by endoproteolytic cleavage at a designed site and then by treatment of the product with one or more carboxypeptidases to remove the remnants of the endoprotease recognition site from its C-terminus. The second potential application pertains to X-ray crystallography. It has been shown that the addition of trace amounts of relatively nonspecific endoproteases (e.g., chymotrypsin, subtilisin, thermolysin) to concentrated solutions of proteins prior to setting them up in crystallization trials sometimes enhances the growth of crystals [146], [147]. In some cases, the endoproteolytic cleavage events occur within disordered internal loops in the protein, in which case further trimming of the “loose ends” left by the endoproteases might be affected by exopeptidases.
Acknowledgment
This work was supported by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research.
Footnotes
Abbreviations used: factor Xa, blood coagulation factor X; TEV, tobacco etch virus; DTT, dithiothreitol; PMSF, phenylmethylsulfonyl fluoride; AEBSF, 4-(2-aminoethyl) benzenesulfonyl fluoride hydrochloride; PEG, polyethylene glycol; GST, glutathione S-transferase; EGTA, ethylene glycol tetraacetic acid; EDTA, ethylenediaminetetraacetic acid; MBP, maltose binding protein; ATCC, American Type Culture Collection; TVMV, tobacco vein mottling virus; SARS, severe acute respiratory syndrome; BAP, biotin acceptor peptide; POI, protein of interest; BoCPA, bovine carboxypeptidase A; BoCPB, bovine carboxypeptidase B; SDS, sodium dodecyl sulfate; DAPase, dipeptidyl-aminopeptidase I; Qcyclase, glutamine cyclotransferase; pGAPase, pyroglutamylaminopeptidase.
The nomenclature used here to describe individual amino acids in protease recognition sites and corresponding amino acid-binding sites in proteases was introduced by Schechter and Berger [1].
References
- 1.Schechter I., Berger A. On the size of the active site in proteases. Biochem. Biophys. Res. Com. 1967;27:157–162. doi: 10.1016/s0006-291x(67)80055-x. [DOI] [PubMed] [Google Scholar]
- 2.Waugh D.S. Making the most of affinity tags. Trends Biotechnol. 2005;23:316–320. doi: 10.1016/j.tibtech.2005.03.012. [DOI] [PubMed] [Google Scholar]
- 3.Arnau J., Lauritzen C., Petersen G.E., Pedersen J. Current strategies for the use of affinity tags and tag removal for the purification of recombinant proteins. Protein Expr. Purif. 2006;48:1–13. doi: 10.1016/j.pep.2005.12.002. [DOI] [PubMed] [Google Scholar]
- 4.Sun-Lailam B.B., Hevel J.M. Efficient cleavage of problematic tobacco etch virus (TEV)-protein arginine methyltransferase constructs. Anal. Biochem. 2009;387:130–132. doi: 10.1016/j.ab.2008.12.031. [DOI] [PubMed] [Google Scholar]
- 5.Wu J., Filutowicz M. Hexahistidine (His6)-tag dependent protein dimerization: a cautionary tale. Acta Biochim. Pol. 1999;46:591–599. [PubMed] [Google Scholar]
- 6.Woestenenk E.A., Hammarström M., van den Berg S., Härd T., Berglund H. His tag effect on solubility of human proteins produced in Escherichia coli: a comparison between four expression vectors. J. Struct. Funct. Genom. 2004;5:217–229. doi: 10.1023/b:jsfg.0000031965.37625.0e. [DOI] [PubMed] [Google Scholar]
- 7.Chant A., Kraemer-Pecore C.M., Watkin R., Kneale G.G. Attachment of a histidine tag to the minimal zinc finger protein of the Aspergillus nidulans gene regulatory protein AreA causes a conformational change at the DNA-binding site. Protein Expr. Purif. 2005;39:152–159. doi: 10.1016/j.pep.2004.10.017. [DOI] [PubMed] [Google Scholar]
- 8.Amor-Mahjoub M., Suppini J.P., Gomez-Vrielyunck N., Ladjimi M. The effect of the hexahistidine-tag in the oligomerization of HSC70 constructs. J. Chromatogr. B. 2006;844:328–334. doi: 10.1016/j.jchromb.2006.07.031. [DOI] [PubMed] [Google Scholar]
- 9.Renzi F., Panetta G., Vallone B., Brunori M., Arceci M., Bozzoni I., Laneve P., Caffarelli E. Large-scale purification and crystallization of the endoribonuclease XendoU: trouble-shooting with His-tagged proteins. Acta Crystallogr. F. 2006;62:298–301. doi: 10.1107/S1744309106006373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gaberc-Porekar V., Menart V., Jevsevar S., Vidensek A., Stalc A. Histidines in affinity tags and surface clusters for immobilized metal-ion affinity chromatography of trimeric tumor necrosis factor alpha. J. Chromatogr. A. 1999;852:117–128. doi: 10.1016/s0021-9673(99)00374-x. [DOI] [PubMed] [Google Scholar]
- 11.Horchani H., Quertani S., Gargouri Y., Sayari A. The N-termninal His-tag and the recombination process affect the biochemical properties of Staphylococcus aureus lipase produced in Escherichia coli. J. Mol. Catal. B Enzyme. 2009;61:194–201. [Google Scholar]
- 12.Li Y. Self-cleaving fusion tags for recombinant protein production. Biotechnol. Lett. 2011;33:869–881. doi: 10.1007/s10529-011-0533-8. [DOI] [PubMed] [Google Scholar]
- 13.Ruan B., Fisher K.E., Alexander P.A., Doroshko V., Bryan P.N. Engineering subtilisin into a fluoride-triggered processing protease useful for one-step protein purification. Biochemistry. 2004;43:14539–14546. doi: 10.1021/bi048177j. [DOI] [PubMed] [Google Scholar]
- 14.Panavas T., Sanders C., Butt T.R. SUMO fusion technology for enhanced protein production in prokaryotic and eukaryotic expression systems. Methods Mol. Biol. 2009;497:303–317. doi: 10.1007/978-1-59745-566-4_20. [DOI] [PubMed] [Google Scholar]
- 15.Isetti G., Maurer M.C. Employing mutants to study thrombin residues responsible for factor XIII activation peptide recognition: a kinetic study. Biochemistry. 2007;46:2444–2452. doi: 10.1021/bi0622120. [DOI] [PubMed] [Google Scholar]
- 16.Bromfield K.M., Quinsey N.S., Dugan P.J., Pike R.N. Approaches to selective peptidic inhibitors of factor Xa. Chem. Biol. Drug Des. 2006;68:11–19. doi: 10.1111/j.1747-0285.2006.00404.x. [DOI] [PubMed] [Google Scholar]
- 17.Gasparian M.E., Ostapchenko V.G., Schulga A.A., Dolgikh D.A., Kirpichnikov M.P. Expression, purification, and characterization of human enteropeptidase catalytic subunit in Escherichia coli. Protein Expr. Purif. 2003;31:133–139. doi: 10.1016/s1046-5928(03)00159-1. [DOI] [PubMed] [Google Scholar]
- 18.Kapust R.B., Tözsér J., Fox J.D., Anderson D.E., Cherry S., Copeland T.D., Waugh D.S. Tobacco etch virus protease: mechanism of autolysis and rational design of stable mutants with wild-type catalytic proficiency. Protein Eng. 2001;14:993–1000. doi: 10.1093/protein/14.12.993. [DOI] [PubMed] [Google Scholar]
- 19.Long A.C., Orr D.C., Cameron J.M., Dunn B.M., Kay J. A consensus sequence for substrate hydrolysis by rhinovirus 3C proteinase. FEBS Lett. 1989;258:75–78. doi: 10.1016/0014-5793(89)81619-9. [DOI] [PubMed] [Google Scholar]
- 20.Liepnieks J.J., Light A. The preparation and properties of bovine enterokinase. J. Biol. Chem. 1979;254:1677–1683. [PubMed] [Google Scholar]
- 21.Anderson L.E., Walsh K.A., Neurath H. Bovine enterokinase, purification, specificity, and some molecular properties. Biochemistry. 1977;16:3354–3360. doi: 10.1021/bi00634a011. [DOI] [PubMed] [Google Scholar]
- 22.Collins-Racie L.A., McColgan J.M., Grant K.L., DiBlasio-Smith E.A., McCoy J.M., LaVallie E.R. Production of recombinant bovine enterokinase catalytic subunit in Escherichia coli using the novel secretory partner DsbA. Biotechnology. 1995;13:982–987. doi: 10.1038/nbt0995-982. [DOI] [PubMed] [Google Scholar]
- 23.Huang L., Ruan H., Gu W., Xu Z., Cen P., Fan L. Functional expression and purification of bovine enterokinase light chain in recombinant Escherichia coli. Prep. Biochem. Biotechnol. 2007;37:205–217. doi: 10.1080/10826060701386695. [DOI] [PubMed] [Google Scholar]
- 24.Choi S.I., Song H.W., Moon J.W., Seong B.L. Recombinant enterokinase light chain with affinity tag: expression from Saccharomyces cerevisiae and its utilities in fusion protein technology. Biotechnol. Bioeng. 2001;75:718–724. doi: 10.1002/bit.10082. [DOI] [PubMed] [Google Scholar]
- 25.Vergis J.M., Wiener M.C. The variable detergent sensitivity of proteases that are utilized for recombinant protein affinity tag removal. Protein Expr. Purif. 2011;78:139–142. doi: 10.1016/j.pep.2011.04.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hosfield T., Lu Q. Influence of the amino acid residue downstream of (Asp)4Lys on enterokinase cleavage of a fusion protein. Anal. Biochem. 1999;269:10–16. doi: 10.1006/abio.1998.3084. [DOI] [PubMed] [Google Scholar]
- 27.Kim Y.-T., Nishii W., Matsushima M., Inoue H., Ito H., Park S.J., Takahashi K. Substrate specificities of porcine and bovine enteropeptidases toward the peptide Val-(Asp)4-Lys-Ile-Val-Gly and its analogs. Biosci. Biotechnol. Biochem. 2008;72:905–908. doi: 10.1271/bbb.70732. [DOI] [PubMed] [Google Scholar]
- 28.Boulware K.T., Daugherty P.S. Protease specificity determination by using cellular libraries of peptide substrates (CLiPS) Proc. Natl Acad. Sci. USA. 2006;103:7583–7588. doi: 10.1073/pnas.0511108103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Liew O.W., Ching Chong J.P., Yandle T.G., Brennan S.O. Preparation of recombinant thioredoxin fused N-terminal proCNP: analysis of enterokinase cleavage products reveals new enterokinase cleavage sites. Protein Expr. Purif. 2005;41:332–340. doi: 10.1016/j.pep.2005.03.006. [DOI] [PubMed] [Google Scholar]
- 30.M.E. Gasparian, M.L. Bychkov, D.A. Golgikh, M.P. Kirpichnikov, Strategy for improvement of enteropeptidase efficiency in tag removal processes, Protein Expr. Purif. (2011) Epub ahead of print. [DOI] [PubMed]
- 31.Chun H., Joo K., Lee J., Shin H.C. Design and efficient production of bovine enterokinase light chain with higher specificity in E. coli. Biotechnol. Lett. 2011;33:1227–1232. doi: 10.1007/s10529-011-0562-3. [DOI] [PubMed] [Google Scholar]
- 32.Owen W.G., Esmon C.T., Jackson C.M. The conversion of prothrombin to thrombin. I. Characterization of the reaction products formed during the activation of bovine prothrombin. J. Biol. Chem. 1974;249:594–605. [PubMed] [Google Scholar]
- 33.Bishop P.D., Lewis K.B., Schultz J., Walker K.M. Comparison of recombinant human thrombin and plasma-derived human alpha-thrombin. Semin. Thromb. Hemostasis. 2006;32:86–97. doi: 10.1055/s-2006-939558. [DOI] [PubMed] [Google Scholar]
- 34.Bush-Pelc L.A., Marino F., Chen Z., Pineda A.O., Mathews F.S. E. Di Cera, Important role of the cys-191 cys-220 disulfide bond in thrombin function and allostery. J. Biol. Chem. 2007;282:27165–27170. doi: 10.1074/jbc.M703202200. [DOI] [PubMed] [Google Scholar]
- 35.Le Borgne S., Graber M. Amidase activity and thermal stability of human thrombin. Appl. Biochem. Biotechnol. 1994;48:125–135. doi: 10.1007/BF02796167. [DOI] [PubMed] [Google Scholar]
- 36.Chang J.-Y. Thrombin specificity: requirement for apolar amino acids adjacent to the thrombin cleavage site of polypeptide substrate. Eur. J. Biochem. 1985;151:217–224. doi: 10.1111/j.1432-1033.1985.tb09091.x. [DOI] [PubMed] [Google Scholar]
- 37.Smith D.B., Johnson K.S. Single-step purification of polypeptides expressed in Escherichia coli as fusions with glutathione S-transferase. Gene. 1988;67:31–40. doi: 10.1016/0378-1119(88)90005-4. [DOI] [PubMed] [Google Scholar]
- 38.Houbrechts A., Moreau B., Abagyan R., Mainfroid V., Preaux G., Lamproye A., Poncin A., Goormaghtigh E., Ruysschaert J.M., Martial J.A., Goraj K. Second -generation octarellins: two new de novo (beta/alpha)8 polypeptides designed for investigating the influence of beta-residue packing on the alpha/beta-barrel structure stability. Protein Eng. 1995;8:249–259. doi: 10.1093/protein/8.3.249. [DOI] [PubMed] [Google Scholar]
- 39.Le Bonniec B.F., Myles T., Johnson T., Knight C.G., Tapparelli C., Stone S.R. Characterization of the P2’ and P3’ specificities of thrombin using fluorescence-quenched substrates and mapping of the subsites by mutagenesis. Biochemistry. 1996;35:7114–7122. doi: 10.1021/bi952701s. [DOI] [PubMed] [Google Scholar]
- 40.Petrassi H.M., Williams J.A., Li J., Tumanut C., Ek J., Nakai T., Masick B., Backes B.J., Harris J.L. A strategy to profile prime and non-prime proteolytic substrate specificity. Bioorg. Med. Chem. Lett. 2005;15:3162–3166. doi: 10.1016/j.bmcl.2005.04.019. [DOI] [PubMed] [Google Scholar]
- 41.Shilling O., Overall C.M. Proteone-derived database-searchable peptide libraries for identifying protease cleavage sites. Nat. Biotechnol. 2008;26:685–694. doi: 10.1038/nbt1408. [DOI] [PubMed] [Google Scholar]
- 42.Jenny R.J., Mann K.G., Lundblad R.L. A critical review of the methods for cleaving fusion proteins with thrombin and factor Xa. Protein Expr. Purif. 2003;31:1–11. doi: 10.1016/s1046-5928(03)00168-2. [DOI] [PubMed] [Google Scholar]
- 43.Jackson C.M., Johnson T.F., Hanahan D.J. Studies on bovine Factor X. I. Large-scale purification of the bovine plasma protein posessing factor X activity. Biochemistry. 1968;7:4492–4505. doi: 10.1021/bi00852a046. [DOI] [PubMed] [Google Scholar]
- 44.Heidtmann H.-H., Kontermann R.E. Cloning and recombinant expression of mouse coagulation factor X. Thrombosis Res. 1998;92:33–41. doi: 10.1016/s0049-3848(98)00110-8. [DOI] [PubMed] [Google Scholar]
- 45.Rudolph A.E., Mullane M.P., Porche-Sorbet R., Tsuda S., Miletich J.P. Factor Xa St. Louis II: identification of a glycine substitution at residue 7 and characterization of the recombinant protein. J. Biol. Chem. 1996;271:28601–28606. doi: 10.1074/jbc.271.45.28601. [DOI] [PubMed] [Google Scholar]
- 46.Rudolph A.E., Mullane M.P., Porche-Sorbet R., Miletich J.P. Expression, purification and characterization of recombinant human factor X. Protein Expr. Purif. 1977;10:373–378. doi: 10.1006/prep.1997.0752. [DOI] [PubMed] [Google Scholar]
- 47.Manithody C., Yang L., Rezaie A.R. Role of basic residues of the autolysis loop in the catalytic function of factor Xa. Biochemistry. 2002;41:6780–6788. doi: 10.1021/bi0255367. [DOI] [PubMed] [Google Scholar]
- 48.Lottenberg R., Hall J.A., Pautler E., Zupan A., Christensen U., Jackson C.M. The action of factor Xa on peptide p-nitroanilide substrates: substrate selectivity and examination of hydrolysis with different reaction conditions. Biochim. Biophys. Acta. 1986;874:326–336. doi: 10.1016/0167-4838(86)90032-4. [DOI] [PubMed] [Google Scholar]
- 49.Harris J.L., Backes B.J., Leonetti F., Mahrus S., Ellman J.A., Craik C.S. Rapid and general profiling of protease specificity by using combinatorial fluorogenic substrate libraries. Proc. Natl Acad. Sci. USA. 2000;97:7754–7759. doi: 10.1073/pnas.140132697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Hsu H.-J., Tsai K.-C., Sun Y.-K., Chang H.-J., Huang Y.-J., Yu H.-M., Lin C.-H., Mao S.-S., Yang A.-S. Factor Xa active site substrate specificity with substrate phage display and computational molecular modeling. J. Biol. Chem. 2008;283:12343–12353. doi: 10.1074/jbc.M708843200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Ludeman J.P., Pike R.N., Bromfield K.M., Duggan P.J., Cianci J., Le Bonniec B., Shjisstock J.C., Bottomley S.P. Determination of the P1’, P2’ and P3’ subsite-specificity of factor Xa. Int. J. Biochem. Cell Biol. 2003;35:221–225. doi: 10.1016/s1357-2725(02)00128-0. [DOI] [PubMed] [Google Scholar]
- 52.Carrington J.C., Dougherty W.G. Small nuclear inclusion protein encoded by a plant potyvirus genome is a protease. J. Virol. 1987;61:2540–2548. doi: 10.1128/jvi.61.8.2540-2548.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Dougherty W.G., Parks T.D. Post-translational processing of the tobacco etch virus 49-kDa small nuclear inclusion polyprotein: identification of an internal cleavage site and delimitation of VPg and proteinase domains. Virology. 1991;183:449–456. doi: 10.1016/0042-6822(91)90974-g. [DOI] [PubMed] [Google Scholar]
- 54.Carrington J.C., Dougherty W.G. A viral cleavage site cassette: Identification of amino acid sequences required for tobacco etch virus polyprotein processing. Proc. Natl Acad. Sci. USA. 1988;85:3391–3395. doi: 10.1073/pnas.85.10.3391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Parks T.D., Leuther K.K., Howard E.D., Johnston S.A., Dougherty W.G. Release of proteins and peptides from fusion proteins using a recombinant plant virus proteinase. Anal. Biochem. 1994;216:413–417. doi: 10.1006/abio.1994.1060. [DOI] [PubMed] [Google Scholar]
- 56.Kapust R.B., Waugh D.S. Escherichia coli maltose-binding protein is uncommonly effective at promoting the solubility of polypeptides to which it is fused. Protein Sci. 1999;8:1668–1674. doi: 10.1110/ps.8.8.1668. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Kapust R.B., Routzahn K.M., Waugh D.S. Processive degradation of nascent polypeptides, triggered by tandem AGA codons, limits the accumulation of recombinant tobacco etch virus protease in Escherichia coli BL21(DE3) Protein Expr. Purif. 2002;24:61–70. doi: 10.1006/prep.2001.1545. [DOI] [PubMed] [Google Scholar]
- 58.van den Berg S., Löfdahl P.-A., Härd T., Berglund H. Improved solubility of TEV protease by directed evolution. J. Biotechnol. 2006;121:291–298. doi: 10.1016/j.jbiotec.2005.08.006. [DOI] [PubMed] [Google Scholar]
- 59.Blommel P.G., Fox B.G. A combined approach to improving large-scale production of tobacco etch virus protease. Protein Expr. Purif. 2007;55:53–68. doi: 10.1016/j.pep.2007.04.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Miladi B., Bouallagui H., Dridi C., El Marjou A., Boeuf G., Di Martino P., Dufour F., Elm’Selmi A. A new tagged-TEV protease: Construction, optimisation of production, purification and test activity. Protein Expr. Purif. 2011;75:75–82. doi: 10.1016/j.pep.2010.08.012. [DOI] [PubMed] [Google Scholar]
- 61.Tropea J.E., Cherry C., Waugh D.S. Expression and purification of soluble His(6)-tagged TEV protease. Methods Mol. Biol. 2009;498:297–307. doi: 10.1007/978-1-59745-196-3_19. [DOI] [PubMed] [Google Scholar]
- 62.Lucast L.J., Batey R.T., Doudna J.A. Large-scale purification of a stable form of recombinant tobacco etch virus protease. BioTechniques. 2001;30:544–0546. doi: 10.2144/01303st06. [DOI] [PubMed] [Google Scholar]
- 63.Parks T.D., Howard E.D., Wolpert T.J., Arp D.J., Dougherty W.G. Expression and purification of a recombinant tobacco etch virus NIa proteinase: biochemical analyses of the full-length and a naturally occurring truncated proteinase form. Virology. 1995;210:194–201. doi: 10.1006/viro.1995.1331. [DOI] [PubMed] [Google Scholar]
- 64.Cabrita L.D., Gilis D., Robertson A.L., Dehouck Y., Rooman M., Bottomley S.P. Enhancing the stability and solubility of TEV protease using in silico design. Protein Sci. 2007;16:2360–2367. doi: 10.1110/ps.072822507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.van den Berg S., Löfdahl P.A., Härd T., Berglund H. Improved solubility of TEV protease by directed evolution. J. Biotechnol. 2006;121:291–298. doi: 10.1016/j.jbiotec.2005.08.006. [DOI] [PubMed] [Google Scholar]
- 66.Dougherty W.G., Cary S.M., Parks T.D. Molecular genetic analysis of a plant virus polyprotein cleavage site: a model. Virology. 1989;171:356–364. doi: 10.1016/0042-6822(89)90603-x. [DOI] [PubMed] [Google Scholar]
- 67.Tözsér J., Tropea J.E., Cherry S., Bagossi P., Copeland T.D., Wlodawer A., Waugh D.S. Comparison of the substrate specificity of two potyvirus proteases. FEBS J. 2005;272:514–523. doi: 10.1111/j.1742-4658.2004.04493.x. [DOI] [PubMed] [Google Scholar]
- 68.Kapust R.B., Tözsér J., Copeland T.D., Waugh D.S. The P1’ specificity of tobacco etch virus protease. Biochem. Biophys. Res. Commun. 2002;294:949–955. doi: 10.1016/S0006-291X(02)00574-0. [DOI] [PubMed] [Google Scholar]
- 69.Kostallas G., Samuelson P. Novel fluorescence-assisted whole-cell assay for engineering and characterization of proteases and their substrates. Appl. Environ. Microbiol. 2010;76:7500–7508. doi: 10.1128/AEM.01558-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Boulware K.T., Jabaiah A., Daugherty P.S. Evolutionary optimization of peptide substrates for proteases that exhibit rapid hydrolysis kinetics. Biotechnol. Bioeng. 2010;106:339–346. doi: 10.1002/bit.22693. [DOI] [PubMed] [Google Scholar]
- 71.Phan J., Zdanov A., Evdokimov A.G., Tropea J.E., Peters H.K., 3rd, Kapust R.B., Li M., Wlodawer A., Waugh D.S. Structural basis for the substrate specificity of tobacco etch virus protease. J. Biol. Chem. 2002;277:50564–50572. doi: 10.1074/jbc.M207224200. [DOI] [PubMed] [Google Scholar]
- 72.Sun P., Austin B.P., Tözsér J., Waugh D.S. Structural determinants of tobacco vein mottling virus substrate specificity. Protein Sci. 2010;19:2240–2251. doi: 10.1002/pro.506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Nallamsetty S., Kapust R.B., Tözsér J., Cherry S., Tropea J.E., Copeland T.D., Waugh D.S. Efficient site-specific processing of fusion proteins by tobacco vein mottling virus protease in vito and in vitro. Protein Expr. Purif. 2004;38:108–115. doi: 10.1016/j.pep.2004.08.016. [DOI] [PubMed] [Google Scholar]
- 74.Polayes D.A., Parks T.D., Joghston S.A., Dougherty W.G. Application of TEV protease in protein production. Methods Mol. Med. 1998;13:169–183. doi: 10.1385/0-89603-485-2:169. [DOI] [PubMed] [Google Scholar]
- 75.Polayes D.A., Goldstein A., Ward G., Hughes A.J., Jr. TEV protease, recombinant. A site-specific protease for efficient cleavage of affinity tags from expressed proteins. FOCUS. 1994;16:2–5. [Google Scholar]
- 76.Dougherty W.G., Parks D., Cary S.M., Bazan J.F., Fletterick R.J. Characterization of the catalytic residues of the tobacco etch virus 49-kDa proteinase. Virology. 1989;172:302–310. doi: 10.1016/0042-6822(89)90132-3. [DOI] [PubMed] [Google Scholar]
- 77.Mohanty A.K., Simmons C.R., Wiener M.C. Inhibition of tobacco etch virus protease activity by detergents. Protein Expr. Purif. 2003;27:109–114. doi: 10.1016/s1046-5928(02)00589-2. [DOI] [PubMed] [Google Scholar]
- 78.Lundbäck A.K. S. van den Berg, J. Hebert, H. Berglund, S. Eschaghi, Exploring the activity of tobacco etch virus protease in detergent solutions. Anal. Biochem. 2008;382:69–71. doi: 10.1016/j.ab.2008.07.018. [DOI] [PubMed] [Google Scholar]
- 79.Hwang D.C., Kim D.H., Kang B.H., Song B.D., Choi K.Y. Molecular cloning, expression and purification of nuclear inclusion A protease from tobacco vein mottling virus. Mol. Cells. 2000;10:148–155. doi: 10.1007/s10059-000-0148-8. [DOI] [PubMed] [Google Scholar]
- 80.Zheng N., de Jesús Pérez J., Zhang Z., Domínguez E., Garcia J.A., Xie Q. Specific and efficient cleavage of fusion proteins by recombinant plum pox virus NIa protease. Protein Expr. Purif. 2008;57:153–162. doi: 10.1016/j.pep.2007.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Kim D., Kang B.H., Han J.S., Choi K.Y. Temperature and salt effects on the proteolytic function of turnip mosaic potyvirus nuclear inclusion protein a exhibiting a low-temperature optimum activity. Biochim. Biophys. Acta. 2000;1480:29–40. doi: 10.1016/s0167-4838(00)00101-1. [DOI] [PubMed] [Google Scholar]
- 82.Kang H., Lee Y.J., Goo J.H., Park W.H. Determination of the substrate specificity of turnip mosaic virus NIa protease using a genetic method. J. Gen. Virol. 2001;82:3115–3117. doi: 10.1099/0022-1317-82-12-3115. [DOI] [PubMed] [Google Scholar]
- 83.Wang Q.M., Chen S.-H. Human rhinovirus 3C protease as a potential target for the development of antiviral agents. Curr. Protein Pept. Sci. 2007;8:19–27. doi: 10.2174/138920307779941523. [DOI] [PubMed] [Google Scholar]
- 84.Nicklin M.J., Kräusslich H.G., Toyoda H., Dunn J.J., Wimmer E. Poliovirus polypeptide precursors: expression and in vitro processing by exogenous 3C and 2A proteinases. Proc. Natl Acad. Sci. USA. 1987;84:4002–4006. doi: 10.1073/pnas.84.12.4002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Nicklin M.J., Harris K.S., Pallai P.V., Wimmer E. Poliovirus proteinase 3C: large-scale expression, purification, and specific cleavage activity on natural and synthetic substrates in vitro. J. Virol. 1988;62:4586–4593. doi: 10.1128/jvi.62.12.4586-4593.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Pallai P.V., Burkhardt F., Skoog M., Schreiner K., Bax P., Cohen K.A., Hansen G., Palladino D.E.H., Harris K.S., Nicklin M.J., Wimmer E. Cleavage of synthetic peptides by purified poliovirus 3C proteinase. J. Biol. Chem. 1989;264:9738–9741. [PubMed] [Google Scholar]
- 87.Gorbalenya A.E., Donchenko A.P., Blinov V.M., Koonin E.V. Cysteine proteases of positive strand RNA viruses and chymotrypsin-like serine proteases. A distinct protein superfamily with a common structural fold. FEBS Lett. 1989;243:103–114. doi: 10.1016/0014-5793(89)80109-7. [DOI] [PubMed] [Google Scholar]
- 88.Bazan J.F., Fletterick R.J. Detection of a trypsin-like serine protease domain in flaviviruses and pestiviruses. Virology. 1989;171:637–639. doi: 10.1016/0042-6822(89)90639-9. [DOI] [PubMed] [Google Scholar]
- 89.Leong L.E. The use of recombinant fusion proteases in the affinity purification of recombinant proteins. Mol. Biotechnol. 1999;12:269–274. doi: 10.1385/MB:12:3:269. [DOI] [PubMed] [Google Scholar]
- 90.Walker P.A., Leong L.E., Ng P.W., Tan S.H., Waller S., Murphy D., Porter A.G. Efficient and rapid affinity purification of proteins using recombinant fusion proteases. Biotechnology. 1994;12:601–605. doi: 10.1038/nbt0694-601. [DOI] [PubMed] [Google Scholar]
- 91.Zhang Y., Olsen D.R., Nguyen K.B., Olson P.S., Rhodes E.T., Mascarenhas D. Expression of eukaryotic proteins in soluble form in Escherichia coli. Protein Expr. Purif. 1998;12:159–165. doi: 10.1006/prep.1997.0834. [DOI] [PubMed] [Google Scholar]
- 92.J. Youell, D. Fordham, K. Firman, Production and single-step purification of EGFP and a biotinylated version of the human rhinovirus 14 3C protease, Protein Expr. Purif. (2011) Epub ahead of print. [DOI] [PubMed]
- 93.Cordingley M.G., Register R.B., Callahan P.L., Garsky V.M., Colonno R.J. Cleavage of small peptides in vitro by human rhinovirus 14 3C protease expressed in Escherichia coli. J. Virol. 1989;63:5037–5045. doi: 10.1128/jvi.63.12.5037-5045.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Cordingley M.G., Callahan P.L., Sardana V.V., Garsky V.M., Colonno R.J. Substrate requirements of human rhinovirus 3C protease for peptide cleavage in vitro. J. Biol. Chem. 1990;265:9062–9065. [PubMed] [Google Scholar]
- 95.Orr D.C., Long A.C., Kay J., Dunn B.M., Cameron J.M. Hydrolysis of a series of synthetic peptide substrates by the human rhinovirus 14 3C proteinase, cloned and expressed in Escherichia coli. J. Gen. Virol. 1989;70:2931–2942. doi: 10.1099/0022-1317-70-11-2931. [DOI] [PubMed] [Google Scholar]
- 96.Davis G.J., Wang Q.M., Cox G.A., Johnson R.B., Wakulchik M., Dotson C.A., Villarreal E.C. Expression and purification of recombinant rhinovirus 14 3CD proteinase and its comparison to the 3C prorteinase. Arch. Biochem. Biophys. 1997;346:125–130. doi: 10.1006/abbi.1997.0291. [DOI] [PubMed] [Google Scholar]
- 97.Wang Q.M., Johnson R.B., Cox G.A., Villarreal E.C., Loncharich R.J. A continuous colorimetric assay for rhinovirus-14 3C protease using peptide p-nitroanilides as substrates. Anal. Biochem. 1997;252:238–245. doi: 10.1006/abio.1997.2315. [DOI] [PubMed] [Google Scholar]
- 98.Wang Q.M., Johnson R.B. Activation of human rhinovirus-14 3C protease. Virology. 2001;280:80–86. doi: 10.1006/viro.2000.0760. [DOI] [PubMed] [Google Scholar]
- 99.Rubio V., Shen Y., Saijo Y., Liu Y., Gusmaroli G., Dinesh-Kumar S.P., Deng X.W. An alternative tandem affinity purification strategy applied to Arabidopsis protein complex isolation. The Plant J. 2005;41:767–778. doi: 10.1111/j.1365-313X.2004.02328.x. [DOI] [PubMed] [Google Scholar]
- 100.Fan K., Ma L., Han X., Liang H., Wei P., Liu Y., Lai L. The substrate specificity of SARS coronavirus 3C-like proteinase. Biochem. Biophys. Res. Comm. 2005;329:934–940. doi: 10.1016/j.bbrc.2005.02.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Weidner J.R., Dunn B.M. Development of synthetic peptide substrates for the poliovirus 3C proteinase. Arch. Biochem. Biophys. 1991;286:402–408. doi: 10.1016/0003-9861(91)90058-q. [DOI] [PubMed] [Google Scholar]
- 102.Petithory J.R., Masiarz F.R., Kirsch J.F., Santi D.V., Malcom B.A. A rapid method for determination of endoproteinase substrate specificity: specificity of the 3C proteinase from hepatitis A virus. Proc. Natl Acad. Sci. USA. 1991;88:11510–11514. doi: 10.1073/pnas.88.24.11510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Huitema C., Eltis L.D. A fluorescent protein-based biological screen of proteinase activity. J. Biomol. Screen. 2010;15:224–229. doi: 10.1177/1087057109357790. [DOI] [PubMed] [Google Scholar]
- 104.Jewell D.A., Swietnicki W., Dunn B.M., Malcom B.A. Hepatitis A virus 3C proteinase substrate specificity. Biochemistry. 1992;31:7862–7869. doi: 10.1021/bi00149a017. [DOI] [PubMed] [Google Scholar]
- 105.Malcom B.A., Chin S.M., Jewell D.A., Stratton-Thomas J.R., Thudium K.B., Ralston R., Rosenberg S. Expression and characterization of recombinant hepatitis A virus 3C proteinase. Biochemistry. 1992;31:3358–3363. doi: 10.1021/bi00128a008. [DOI] [PubMed] [Google Scholar]
- 106.Kim Y., Bigelow L., Borovilos M., Dementieva I., Duggan E., Eschenfeldt W., Hatzos C., Joachimiak G., Li H., Maltseva N., Mulligan R., Quartey P., Slather A., Stols L., Volkart L., Wu R., Zhou M., Joachimiak A. High-throughput protein purification for X-ray crystallography and NMR. Adv. Protein Chem. Struct. Biol. 2008;75:85–105. doi: 10.1016/S0065-3233(07)75003-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Nallamsetty S., Waugh D.S. A generic protocol for the expression and purification of recombinant proteins in Escherichia coli using a combinatorial His6-maltose binding protein fusion tag. Nat. Protoc. 2007;2:383–391. doi: 10.1038/nprot.2007.50. [DOI] [PubMed] [Google Scholar]
- 108.Tropea J.E., Cherry S., Nallamsetty S., Bignon C., Waugh D.S. A generic method for the production of recombinant proteins in Escherichia coli using a dual hexahistidine-maltose-binding protein affinity tag. Methods Mol. Biol. 2007;363:1–19. doi: 10.1007/978-1-59745-209-0_1. [DOI] [PubMed] [Google Scholar]
- 109.Donnelly M.I., Zhou M., Millard C.S., Clancy S., Stols L., Eschenfeldt W.H., Collart F.R., Joachimiak A. An expression vector tailored for large-scale, high-throughput purification of recombinant proteins. Protein Expr. Purif. 2006;47:446–454. doi: 10.1016/j.pep.2005.12.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Kapust R.B., Waugh D.S. Controlled intracellular processing of fusion proteins by TEV protease. Protein Expr. Purif. 2000;19:312–318. doi: 10.1006/prep.2000.1251. [DOI] [PubMed] [Google Scholar]
- 111.Phan J., Tropea J.E., Waugh D.S. Structure of the Yersinia pestis type III secretion chaperone SycH in complex with a stable fragment of YscM2. Acta Crystallogr. D. 2004;60:1591–1594. doi: 10.1107/S0907444904017597. [DOI] [PubMed] [Google Scholar]
- 112.Kenig M., Peternel S., Gaberc-Porekar V., Menhart V. Influence of protein oligomericity on final yield after affinity tag removal in purification of recombinant proteins. J. Chromatogr. A. 2006;110:293–306. doi: 10.1016/j.chroma.2005.09.089. [DOI] [PubMed] [Google Scholar]
- 113.Davis G.D., Elisee C., Newham D.M., Harrison R.G. New fusion protein systems designed to give soluble expression in Escherichia coli. Biotechnol. Bioeng. 1999;65:382–388. [PubMed] [Google Scholar]
- 114.Nominé Y., Ristriani T., Laurent C., Lefèvre J.F., Weiss E., Travé G. Formation of soluble inclusion bodies by HPV E6 oncoprotein fused to maltose-binding protein. Protein Expr. Purif. 2001;23:22–32. doi: 10.1006/prep.2001.1451. [DOI] [PubMed] [Google Scholar]
- 115.Vendrell J., Querol E., Avilés F.X. Metallocarboxypeptidases and their protein inhibitors: structure, function and biomedical properties. Biochim. Biophys. Acta. 2000;1477:284–298. doi: 10.1016/s0167-4838(99)00280-0. [DOI] [PubMed] [Google Scholar]
- 116.Fernández D., Pallarès I., Vendrell J., Avilés F.X. Progress in metallocarboxypeptidases and their small molecular weight inhibitors. Biochimie. 2010;92:1484–1500. doi: 10.1016/j.biochi.2010.05.002. [DOI] [PubMed] [Google Scholar]
- 117.Austin B.P., Tözsér J., Bagossi P., Tropea J.E., Waugh D.S. The substrate specificity of Metarhizium anisopliae and Bos taurus carboxypeptidases A: insights into their use as tools for the removal of affinity tags. Protein Expr. Purif. 2011;77:53–61. doi: 10.1016/j.pep.2010.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Tan A.K., Eaton D.L. Activation and characterization of procarboxypeptidase B from human plasma. Biochemistry. 1995;34:5811–5816. doi: 10.1021/bi00017a012. [DOI] [PubMed] [Google Scholar]
- 119.Zisapel N., Kurn-Abramowitz N., Sokolovsky M. Basic and non-basic substrates of carboxypeptidase B. Eur. J. Biochem. 1973;35:507–511. doi: 10.1111/j.1432-1033.1973.tb02866.x. [DOI] [PubMed] [Google Scholar]
- 120.Wintersberger E., Cox D.J., Neurath H. Bovine pancreatic procarboxypeptidase B. I. Isolation, properties, and activation. Biochemistry. 1962;1:1069–1078. doi: 10.1021/bi00912a017. [DOI] [PubMed] [Google Scholar]
- 121.Gardell S.J., Craik C.S., Clauser E., Goldsmith E.J., Stewart C.-B., Graf M., Rutter W.J. A novel rat carboxypeptidase, CPA2: characterization, molecular cloning and evolutionary implications on substrate specificity in the carboxypeptidase gene family. J. Biol. Chem. 1988;263:17828–17836. [PubMed] [Google Scholar]
- 122.Lilus G., Persson M., Bulow L., Mosbach K. Metal affinity precipitation of proteins carrying genetically attached polyhistidine affinity tails. Eur. J. Biochem. 1991;198:499–504. doi: 10.1111/j.1432-1033.1991.tb16042.x. [DOI] [PubMed] [Google Scholar]
- 123.Garcia K.C., Tallquist M.D., Pease L.R., Brunmark A., Scott C.A., Degano M., Stura E.A., Peterson P.A., Wilson I.A., Teyton L. αβ T cell receptor interactions with syngeneic and allogeneic ligands: affinity measurements and crystallization. Proc. Natl Acad. Sci. USA. 1997;94:13838–13843. doi: 10.1073/pnas.94.25.13838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Persson K., Åslund L., Grahn B., Hanke J., Heby O. Trypanosoma cruzi has not lost its S-adenosylmethionine decarboxylase: characterization of the gene and the encoded enzyme. Biochem. J. 1998;333:527–537. doi: 10.1042/bj3330527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Celia J., Wilson-Kubalek E., Milligan R.A., Teyton L. Structure and function of a membrane-bound murine MHC class I molecule. Proc. Natl Acad. Sci. USA. 1999;96:5634–5639. doi: 10.1073/pnas.96.10.5634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Smith J.C., Derbyshire R.B., Cook E., Dunthorne L., Viney J., Brewer S.J., Sassenfeld H.M., Bell L.D. Chemical synthesis and cloning of a poly(arginine)-coding gene fragment designed to aid polypeptide purification. Gene. 1984;32:321–327. doi: 10.1016/0378-1119(84)90007-6. [DOI] [PubMed] [Google Scholar]
- 127.Fuchs S.M., Raines R.T. Polyarginine as a multifunctional fusion tag. Protein Sci. 2005;14:1538–1544. doi: 10.1110/ps.051393805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Rickert M., Wang X., Boulanger M.J., Goriatcheva N., Garcia K.C. The structure of interleukin-2 complexed with its alpha receptor. Science. 2005;308:1477–1480. doi: 10.1126/science.1109745. [DOI] [PubMed] [Google Scholar]
- 129.Hirabayashi J., Kasai K. Arginine-tail method, an affinity tag procedure utilizing anhydrotrypsin agarose. J. Chromatogr. 1992;597:181–187. doi: 10.1016/0021-9673(92)80108-7. [DOI] [PubMed] [Google Scholar]
- 130.Phillips M.A., Rutter W.J. Role of the prodomain in folding and secretion of rat pancreatic carboxypeptidase A1. Biochemistry. 1996;35:6771–6776. doi: 10.1021/bi960113o. [DOI] [PubMed] [Google Scholar]
- 131.Kamo M., Tsugita A. Carboxypeptidase digestion in the presence of detergents. J. Biochem. 1987;102:243–246. doi: 10.1093/oxfordjournals.jbchem.a122047. [DOI] [PubMed] [Google Scholar]
- 132.Auld D.S., Vallee B.L. Kinetics of carboxypeptidase A. The pH dependence of tripeptide hydrolysis catalyzed by zinc, cobalt, and manganese enzymes. Biochemistry. 1970;9:4352–4359. doi: 10.1021/bi00824a016. [DOI] [PubMed] [Google Scholar]
- 133.Zhang X.-Y., Li S.-X., Yuan Q.-S. The renaturation of procarboxypeptidase B by urea gradient gel filtration and some properties of recombinant carboxypeptidase B. Protein Peptide Lett. 2005;12:671–676. doi: 10.2174/0929866054696145. [DOI] [PubMed] [Google Scholar]
- 134.Goo J.H., Kim K.H., Choi K.Y. Cloning, sequencing and expression of the gene encoding a major allotypic preprocarboxypeptidase A from bovine pancreas. Gene. 1995;165:333–334. doi: 10.1016/0378-1119(95)00454-e. [DOI] [PubMed] [Google Scholar]
- 135.Seddi R., Chaix J.C., Puigserver A., Guo X.J. Expression of a soluble and activatable form of bovine procarboxypeptidase A in Escherichia coli. Protein Expr. Purif. 2003;27:220–228. doi: 10.1016/s1046-5928(02)00573-9. [DOI] [PubMed] [Google Scholar]
- 136.Laethem R.M., Blumenkopf T.A., Cory M., Elwell L., Moxham C.P., Ray P.H., Walton L.M., Smith G.K. Expression and characterization of human pancreatic preprocarboxypeptidase A1 and preprocarboxypeptidase A2. Arch. Biochem. Biophys. 1996;332:8–18. doi: 10.1006/abbi.1996.0310. [DOI] [PubMed] [Google Scholar]
- 137.Reverter D., Ventura S., Villegas V., Vendrell J., Avilés F.X. Overexpression of human procarboxypeptidase A2 in Pichia pastoris and detailed characterization of its activation pathway. J. Biol. Chem. 1998;273:3535–3541. doi: 10.1074/jbc.273.6.3535. [DOI] [PubMed] [Google Scholar]
- 138.Darnis S., Juge J., Marino C., Avilés F.X., Puigserver A., Chaix J.C., Guo X.J. Cloning, sequencing and functional expression of a cDNA encoding porcine pancreatic preprocarboxypeptidase A1. Eur. J. Biochem. 1999;259:719–725. doi: 10.1046/j.1432-1327.1999.00091.x. [DOI] [PubMed] [Google Scholar]
- 139.Joshi L., St. Leger R.J. Cloning, expression, and substrate specificity of MeCPA, a zinc carboxypeptidase that is secreted into ifected tissues by the fungal entomopathogen Metarhizium anisopliae. J. Biol. Chem. 1999;274:9803–9811. doi: 10.1074/jbc.274.14.9803. [DOI] [PubMed] [Google Scholar]
- 140.Ventura S., Villegas V., Sterner J., Larson J., Vendrell J., Hershberger C.L., Avilés F.X. Mapping the pro-region of carboxypeptidase B by protein engineering. J. Biol. Chem. 1999;274:19925–19933. doi: 10.1074/jbc.274.28.19925. [DOI] [PubMed] [Google Scholar]
- 141.Li S.-X., Zhang Y.-J., Tian L.-P., Yuan Q.-S., Gong Y. Cloning and expression of a new rat procarboxypeptidase B gene in Escherichia coli and purification of recombination carboxypeptidase B. Protein Peptide Lett. 2003;10:581–590. doi: 10.2174/0929866033478627. [DOI] [PubMed] [Google Scholar]
- 142.Pedersen J., Lauritzen C., Madsen M.T., Dahl S.W. Removal of N-terminal polyhistidine tags from recombinant proteins using engineered aminopeptidases. Protein Expr. Purif. 1999;15:389–400. doi: 10.1006/prep.1999.1038. [DOI] [PubMed] [Google Scholar]
- 143.Dolenc I., Turk B., Pungercic G., Ritonja A., Turk V. Oligomeric structure and substrate induced inhibition of human cathepsin C. J. Biol. Chem. 1995;270:21626–21631. doi: 10.1074/jbc.270.37.21626. [DOI] [PubMed] [Google Scholar]
- 144.Lauritzen C., Pedersen J., Madsen M.T., Justesen J., Martensen P.M., Dahl S.W. Active recombinant rat dipeptidyl aminopeptidase I (cathepsin C) produced using the baculovirus expression system. Protein Expr. Purif. 1998;14:434–442. doi: 10.1006/prep.1998.0976. [DOI] [PubMed] [Google Scholar]
- 145.Schäfer F., Schäfer A., Steinert K. A highly specific sysem for efficient enzymatic removal of tags from recombinant proteins. J. Biomol. Tech. 2002;13:158–171. [PMC free article] [PubMed] [Google Scholar]
- 146.Dong A., Xu X., Edwards A.M., Chang C., Chruszcz M., Cuff M., Cymborowski M., Di Leo R., Egorova O., Evdokimova E., Filippova E., Gu J., Guthrie J., Ignatchenko A., Joachimiak A., Klostermann N., Kim Y., Korniyenko Y., Minor W., Que Q., Savchenko A., Skarina T., Tan K., Yakunin A., Yee A., Yim V., Zhang R., Zheng H., Akutsu M., Arrowsmith C., Avvakumov G.V., Bochkarev A., Dahlgren L.-G., Dhe-Paganon S., Dimov S., Dombrovski L., Finerty P., Flodin S., Flores A., Graslund S., Hammerstrom M., Herman M.D., Hong B.-S., Hui R., Johansson I., Liu Y., Nilsson M., Nedyalkova L., Nordlund P., Nyman T., Min J., Ouyang H., Park H.-W., Qi C., Rabeh W., Shen L., Shen Y., Sukumard D., Tempel W., Tong Y., Tresagues L., Vedadi M., Walker J.R., Weigelt J., Welin M., Wu H., Xiao T., Zeng H., Zhu H. In situ proteolysis for protein crystallization and structure determination. Nat. Methods. 2007;4:1019–1021. doi: 10.1038/nmeth1118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 147.Wernimont A., Edwards A. In situ proteolysis to generate crystals for structure determination: and update. PLoS ONE. 2009;4:e5094. doi: 10.1371/journal.pone.0005094. [DOI] [PMC free article] [PubMed] [Google Scholar]
