Abstract
Contrary to the results of direct expression, various human proteins (ferritin light-chain, epithermal growth factor, interleukin-2, prepro-ghrelin, deletion mutants of glutamate decarboxylase and arginine deiminase, and mini-proinsulin) were all soluble in Escherichia coli cytoplasm when expressed with the N-terminus fusion of ferritin heavy-chain (FTN-H). Through systematic investigations, we have found that a specific peptide motif within FTN-H has a high affinity to HSP70 chaperone DnaK, and that the peptide motif was composed of a hydrophobic core of three residues (Ile, Phe and Leu) and two flanking regions enriched with polar residues (Gly, Gln and Arg). It was also observed that all the recombinant proteins expressed with the fusion of FTN-H formed spherical nanoparticles with diameters of 10–15 nm, as confirmed by the transmission electron microscopy image. The protein nanoparticles are non-covalently cross-linked supra-molecules formed by the self-assembly function of FTN-H. Upon the formation of the supra-molecule, its size is likely to be limited by the assembly properties of FTN-H, thereby keeping the self-assembled particles soluble. This study reports on the dual function of FTN-H for fusion expression and solubility enhancement of heterologous proteins: (i) high-affinity interaction with DnaK and (ii) formation of self-assembled supra-molecules with limited and constant sizes, thereby avoiding the undesirable formation of insoluble macro-aggregates of heterologous proteins.
INTRODUCTION
Ferritin plays a central role in iron storage and in the maintenance of intracellular iron balance for the growth and survival of most organisms (1–3). The functions of ferritin suggest that it may serve as a cytoprotective protein, minimizing oxygen-free radical formation by sequestering intracellular iron (4). Ferritin molecules isolated from vertebrates are composed of two types of subunits (heavy chain and light chain), whereas those from plants and bacteria contain only heavy chains. Mammalian heavy chain and light chain show ∼54% sequence identity. In mixed subunit 24mer (heteropolymers) with a spherical shell structure, heavy (H) and light (L) chain subunits have similar conformations of principally four α helix bundles (5–8). Four years ago, the Korean Food and Drug Administration prohibited the commercial manufacture of animal ferritin (from equine spleen extracts), which had been used to treat human iron deficiency, because of its probable complication of virus infection. Thereafter, the production of recombinant human ferritin in Escherichia coli has been used as a possible substitute for equine ferritin extracts.
It was reported that contrary to ferritin L-chain (FTN-L), ferritin H-chain (FTN-H) and H–L hybrid (FTN-H fused at the N-terminus of FTN-L) are soluble in E.coli cytoplasm, probably due to their enhanced folding efficiency. The recombinant ferritin H–L hybrid as well as the H-chain showed the same iron storage activity as natural copolymers of human ferritin (9). Various human proteins [epidermal growth factor (EGF), interleukin-2 (IL-2), prepro-ghrelin (ppGRN), a deletion mutant of glutamate decarboxylase (GAD512–585), a deletion mutant of arginine deiminase (ADI132–410) and minipro-insulin (mp-INS)], forming inclusion bodies upon direct expression, were all soluble in E.coli cytoplasm when expressed with the N-terminus fusion of ferritin H-chain. Considering the structural diversity of the above human proteins fused to ferritin H-chain, it seems reasonable to assume that the de novo folding of proteins with ferritin H-chain fusion is not determined by direct protein–protein interaction between FTN-H and the target protein, but instead, the assistance of a global folding enhancer, a molecular chaperone, is much more important. Misfolding and aggregation of proteins are major damaging consequences of stress situations, such as heat shock and pathophysiological states (10–12). The central cellular defense against such damage is a molecular chaperone, which prevents aggregation, assists in refolding and mediates the degradation of misfolded proteins (13,14). Chaperones can cooperate in vitro as part of a functional network, in which ‘holder’ chaperones prevent the aggregation of misfolded proteins, whereas ‘folder’ chaperones actively assist in refolding (15,16). The Hsp70 chaperone DnaK participates in the folding of a subset of newly synthesized proteins, prevents protein aggregation in a denatured condition and refolds misfolded proteins.
Recently, Vandenbroeck et al. (17) identified conserved DnaK-binding sites in the N- and C-terminal halves of helix B and C of diverse 4-helix-bundle proteins, respectively, that belong to the superfamily of interferon-γ/interleukin-10-related cytokines. These cytokines belong to a relatively small group of homodimeric proteins with highly interdigitated interfaces exhibiting the strongly hydrophobic character of the interior core of a single-chain folded domain. They proposed that the binding of DnaK may constitute the hallmark of a novel conserved regulatory mechanism, in which HSP70-like chaperones assist in the formation of a hydrophobic dimeric ‘folding’ interface. Hidaka et al. (18) also found that the propeptide of guanylyl cyclase, which activates peptide II (GCAP-II), an endogenous ligand of guanylyl cyclase C, has a dual function in the proper folding of a mature peptide. This means that the N-terminus propeptide of pro-GCAP-II is critical, not only for disulfide-coupled folding but also for the net stabilization (or dimerization) of pro-GCAP-II.
In this study, diverse recombinant proteins, commonly with FTN-H fusion, were observed to form stable homopolymeric supra-molecules without the formation of insoluble protein aggregates. Seemingly, the supra-molecule size is limited by assembly properties of FTN-H, thereby keeping the supra-molecules soluble. The target proteins localized inside the spherical supra-molecules might have been in a protective environment against undesirable protein aggregation. We also identified a specific DnaK-binding motif within FTN-H and demonstrated the efficacy of FTN-H as a potent solubility enhancer upon heterologous protein expression in E.coli.
MATERIALS AND METHODS
PCR-based site-directed mutagenesis
Site-directed mutagenesis for preparing various mutants of FTN-H and light chain and glutathione S-transferase (GST) was performed according to the standard protocols for overlap PCR (19). PCRs were performed in a final volume of 20 μl with the following reagents: 20 mM Tris–HCl, pH 8.3, 100 pmol of each primer, 200 μM dNTPs, 1.5 mM magnesium chloride and 2.5 U Taq polymerase (BRL). PCR cycling was performed as follows: initial denaturation at 94°C for 120 s and then 20 cycles of 94°C (45 s), 60°C (60 s) and 72°C (120 s). After the completion of 20 cycles, the reaction mixture was incubated for 5 min at 72°C. The products were gel purified after agarose gel electrophoresis.
Recombinant strains and plasmids
After PCR amplification using appropriate primers, each of the recombinant genes encoding human ferritin H- and L-chain (FTN-H and FTN-L) and their mutants, human EGF, human interleukin-2 (hIL-2), GST::(prepro-ghrelin) and other various fusion mutants (FTN-H::FTN-L, FTN-H::EGF, FTN-H::hIL-2, FTN-H::ppGRN, FTN-H::GAD512–585, FTN-H::ADI132–410 and FTN-H::mp-INS) was inserted into the NdeI–HindIII site of plasmid pET28a (Novagen) to construct expression vector. Upon the fusion construction above, specific cleavage sequence for thrombin protease was placed between C-terminus of FTN-H and N-terminus of target protein. After complete DNA sequencing of all the gel-purified plasmid vectors, E.coli strain BL21(DE3) [F− ompT hsdSB(rB−mB−)] was transformed with the plasmid expression vectors, and kanamycin-resistant transformants were selected. For the constitutive co-expression of DnaK chaperone with each of the recombinant proteins above, E.coli dnaK gene was cloned into a compatible ampicillin-resistant plasmid pHCE19 (2) (Bioleaders Inc., Korea) (20). The resulting DnaK expression vector transformed the same bacterial host, E.coli BL21 (DE3), simultaneously with the expression vector for the target recombinant gene. Both ampicillin- and kanamycin-resistant transformants were selected using Luria–Bertani (LB) agar plates supplemented with ampicillin (100 mg/l) and kanamycin (100 mg/l).
Gene expression, purification of recombinant proteins and bioactivity assay
For the shake flask experiments (250 ml Erlenmeyer flasks, 37°C, 200 r.p.m. Shaking Incubator IB-3100, New Power Eng., Korea), LB media containing 100 mg kanamycin (or both ampicillin and kanamycin where applicable) per liter of culture was used. When the culture turbidity (OD600) reached 0.5, the gene expression was induced with the addition of isopropyl-β-d-thiogalactopyranoside (IPTG) (0.5 mM). After further 3–4 h cultivation, the recombinant cells were harvested by centrifugation (6000 r.p.m. for 5 min) and the cell pellets were resuspended in 5 ml distilled water. Cell disruption was done by using Branson Sonifier (Branson Ultrasonics Corp., Danbury, CT). The cell-free supernatant and insoluble protein aggregates were separated at 13 000 r.p.m. for 10 min. The isolated inclusion bodies, if any, were then washed twice with 1% Triton X-100. The cell-free supernatant and the washed inclusion bodies were subject to PAGE analysis, using 14% Tris-glycine precast gel (Novex, San Diego, CA). Coomassie-stained protein bands were scanned and analyzed using a densitometer (Duoscan T1200, Bio-Rad, Hercules, CA).
The purification of recombinant proteins was accomplished using metal affinity chromatography. The corresponding recombinant gene was cloned into the pET28a (Novagen) plasmid vector as mentioned previously, and expressed so that the soluble recombinant protein was synthesized as a fusion protein containing N-terminus six histidine residues. The resulting recombinant fusion protein with polyhistidine tag has a high affinity for ProBond resin (Invitrogen). Recombinant protein was purified through a Ni2+ column with ProBond resin according to the following procedures. The cell-free supernatant containing 1–2 mg of polyhistidine-tagged recombinant protein was loaded onto ProBond resin (Ni+2) column. Prior to sample loading, the resin was washed twice with 10 column volumes of binding buffer (50 mM potassium phosphate, 300 mM KCl and 20 mM imidazole, pH 7.0). Binding buffer contains 20 mM imidazole to minimize non-specific binding of untagged protein contaminants, and binding was carried out in a batch mode at 4°C. Subsequently, the resin was washed twice with 5–8 ml Tris–HCl (10 mM Tris, pH 8.0) prior to thrombin digestion step. Thrombin solution 50 μl (1 U/μl, Amersham, Catalog no. 27-0846-01) was mixed with Tris buffer 4950 μl, and the resulting reaction buffer was added to the column. After overnight thrombin digestion (16 h) at room temperature (22–25°C), the reaction solution was collected and properly concentrated using microcon YM-10 (Millipore, Ireland) before SDS–PAGE analysis.
The enzyme activity of recombinant arginine deiminase (ADI) was assayed by colorimetric determination of the enzyme reaction product, citrulline. The reaction mixture (0.1 M potassium phosphate, pH 6.5, 10 mM l-arginine and 0.1 ml enzyme solution in a final volume of 1 ml) was incubated at 37°C for 5 min, and the amount of citrulline was determined with diacetyl-monoxime (21). The sample heated to 95°C for 10 min and the optical density (A540 nm) was then determined. The protein concentration was determined using the Protein Assay Kit (Bio-Rad) with BSA as a standard. One unit of ADI is the amount of enzyme catalyzing 1 μmol of arginine to citrulline per min at 37°C under the assay conditions.
Peptide synthesis and in vitro DnaK-binding assay
Organic synthesis and purification of two peptides (GRIFLQD and GRALFQD) were implemented by Peptron, Inc. (Taejon, Korea). Each peptide was dissolved in Tris buffer (40 mM Tris–HCl and 0.5 mM ATP, pH 7.8), and the peptide concentration in the solution was adjusted to 10 mg/l. An aliquot of 20 μl of each peptide solution was dropped on PVDF membrane that was already wetted by methanol, and the blotted PVDF membrane was completely dried. [Through Ponceu S staining (0.25% Ponceu S in 1% acetic acid solution), we confirmed that a comparable amount of each peptide or protein was stably immobilized on the PVDF membrane.] The dried PVDF membrane was put into skim milk solution and slowly stirred for 30 min. Subsequently, the PVDF membrane was transferred into Tris buffer solution (40 mM Tris–HCl and 0.5 mM ATP, pH 7.8) containing 100 nmol of DnaK (Stressgen). After 1 h, the membrane was washed twice with phosphate-buffered saline buffer, followed by immunoblotting assay using anti-DnaK Ab (monoclonal IgG, mouse, Stressgen) and anti-mouse immunoglobulin G–horseradish peroxidase as primary and secondary antibodies, respectively.
Transmission electron microscopy (TEM)
Unstained samples of a purified protein solution were prepared for electron microscopy by air-drying small drops of a sample solution onto carbon-coated copper electron microscopy grids. To obtain stained images of protein nanoparticles, the electron microscopy grids containing air-dried samples were incubated with a 2% (w/v) aqueous uranyl acetate solution for 10 min at room temperature and washed 3–4 times with distilled water. Protein nanoparticle images were examined using the Philips Technai 120 kV electron microscope. Electron diffraction patterns were recorded from a selected area that is well occupied with protein nanoparticles in order to obtain high diffraction intensities. Particle size distributions were made by measuring 50 protein nanoparticles.
RESULTS
Solubility enhancer vector system using gene fusion of FTN-H
In a previous study, recombinant human ferritin H-, L-chain, and H–L hybrid were produced using the T7 expression system of E.coli (9). Interestingly, the recombinant human ferritin H-chain (FTN-H) and H–L hybrid were intracellular soluble and active proteins. However, despite high-sequence homology with FTN-H, recombinant human ferritin L-chain (FTN-L) was aggregated into inclusion bodies in E.coli cytoplasm (9,22). With the expectation that the N-terminus fusion of FTN-H could be a plausible solution for soluble recombinant protein synthesis in E.coli, we attempted to heterologously express the following proteins, all of which were produced as inclusion bodies upon direct expression in E.coli: EGF, hIL-2, human ppGRN, deletion mutants of human GAD (GAD512–585), Mycoplasma ADI (ADI132–410) and human mini-proinsulin (mp-INS) (23). Figure 1 shows that the intracellular solubility of all recombinant fusion proteins (i.e. FTN-H::EGF, FTN-H::IL-2, FTN-H::ppGRN, FTN-H::GAD512–585, FTN-H::ADI132–410 and FTN-H::mp-INS) was significantly enhanced. For instance, almost 80% of synthesized FTN-H::ppGRN was soluble in E.coli cytoplasm, although most ppGRN formed inclusion bodies even by GST fusion (data not shown). ADI catalyzes the hydrolysis of arginine into citrulline and ammonia, which have been reported to inhibit cell proliferation by arresting cells in the G1 phase and also to demonstrate anti-cancer activity. As shown in Figures 1 and 2A, we confirmed that ∼30% of the expressed fusion protein, FTN-H::ADI132–410, was soluble in bacterial cytoplasm and biologically active as well, clearly showing arginine-degrading activity.
For the purification of ppGRN, FTN-H::ppGRN in a cell-free supernatant from whole-cell lysate was loaded to an Ni+2 column and retained inside the column due to the affinity interaction between Ni+2 and the N-terminus polyhistidine tag of FTN-H::ppGRN. Thrombin was subsequently added to the column to separate it from ppGRN by cleaving the fusion construct. As shown in Figure 2B (lane 3), ppGRN was successfully recovered through thrombin digestion of the fusion construct, FTN-H::ppGRN. Complete separation and purification of ppGRN was achieved after the removal of thrombin from the eluted solution. The other target proteins in the FTN-H fusion constructs were successfully purified through the same procedure (data not shown).
Search for putative DnaK-binding motif(s) in FTN-H
When 2-HEDS [2-hydroxyethyl disulfide (HOCH2CH2-S-S-CH2CH2OH), 15 mM] was present in the growth culture of the recombinant E.coli, almost 100% of the recombinant FTN-H formed insoluble aggregates after gene expression. However, in the absence of 2-HEDS, >80% of the recombinant FTN-H was soluble in bacterial cytoplasm (Figure 3A). 2-HEDS is the oxidized form of DTT. When 2-HEDS is present in bacterial cytoplasm, its reduction is subject to the inducement of thiol oxidation of proteins, thereby promoting the formation of mis-linked disulfide bonds and protein aggregation. Comparing the results of reducing and non-reducing SDS–PAGE analyses, the aggregation of recombinant FTN-H by 2-HEDS seems to have occurred via intramolecular mislinkage of disulfide bridges (data not shown). From Figure 3A, even in the presence of 2-HEDS, the solubility of recombinant FTN-H was significantly increased when the HSP70 molecular chaperone, DnaK, was constitutively co-expressed. The co-expression of DnaK was carried out using a constitutive promoter localized in another expression vector, pHCE19 (2) (Bioleaders Inc., Korea) (20). Thus, it was possible to maintain the high cytoplasmic concentration of DnaK before the T7 expression of FTN-H was induced by IPTG. As shown in Figure 3B, the insoluble nature of recombinant FTN-L, however, never changed even with DnaK co-expression using the same constitutive promoter system. The results shown in Figure 3 suggest that FTN-H must be capable of having a significant interaction with DnaK, presumably via special amino acid residues acting as high-affinity motif(s) for DnaK.
DnaK generally recognizes and binds to the hydrophobic region of the unfolded polypeptide chain. Thus, to identify the DnaK-binding motif(s), if any, the five hydrophobic clusters consisting of three amino acids (34VYL36, 81IFL83, 105LHL107, 133FIE35 and 161APE163) within the FTN-H sequence were first considered. They showed a relatively high hydrophobicity based on their theoretical hydropathy profile (Figure 4A). Using the site-directed mutagenesis technique, each of the five hydrophobic clusters was mutated one at a time by replacing the hydrophobic amino acids in each cluster with arginine residue (34VYL36→34RRR36, 81IFL83→81RRR83, 105LHL107→105RHR107, 133FIE135→ 133RRE135 and 161APE163→161RRE163), thereby resulting in the construction of five FTN-H-derived mutants. Each mutant was expressed in the E.coli T7 expression system, and its intracellular solubility was analyzed. The purpose of mutating the wild-type sequence was to identify the hydrophobic patch that is recognized and bound by DnaK. If a certain hydrophobic cluster is responsible for DnaK binding, replacing the hydrophobic amino acids comprising the hydrophobic patch with strongly charged amino acids, such as arginine, will significantly affect the solubility of the polypeptide chain in bacterial cytoplasm. After gene expression, three FTN-H mutants (34RRR36, 81RRR83 and 105RHR107) were aggregated into inclusion bodies in E.coli cytoplasm (Figure 4B), implying that the three hydrophobic tripeptides (34VYL36, 81IFL83 and 105LHL107) could be putative motifs interacting with DnaK. Although DnaK was constitutively co-expressed with the three mutants (34RRR36, 81RRR83 and 105RHR107) using the same constitutive promoter system, the inclusion body formation of the three FTN-H mutants was never prevented (Figure 5). This is contrary to the case of wild-type FTN-H, in which the soluble FTN-H synthesis was resumed by DnaK co-expression even in the presence of 2-HEDS (Figure 3A). Therefore, it seems that there must be a special motif(s) in FTN-H with a high affinity to DnaK, leading to the syntheses of soluble FTN-H and other soluble heterologous proteins with FTN-H fusion.
Identification and validation of a specific DnaK-binding motif
Case of the ferritin-light chain mutant
The alignment of the amino acid sequences between FTN-H and FTN-L shows that the sequence of a putative DnaK-binding motif, 81IFL83, is completely different from the aligned cognate sequence (77ALF79) of FTN-L (Figure 6A). Through site-directed mutagenesis, the nucleotide sequence encoding 77ALF79 of FTN-L was replaced with the sequence coding for 77IFL79 so that FTN-L might have the putative DnaK-binding motif of FTN-H. The resulting substitution mutant of FTN-L (77IFL79) was expressed in E.coli. Surprisingly, the results of SDS–PAGE analysis showed that >50% of the synthesized FTN-L mutant was soluble in the bacterial cytoplasm (Figure 6B), whereas almost 100% of wild-type FTN-L was aggregated into inclusion bodies (Figures 1 and 3). Figure 6C shows that it is very similar to the case of FTN-H, in which most of the FTN-L mutant was aggregated by the addition of 2-HEDS, but the synthesis of the soluble FTN-L mutant was resumed by DnaK co-expression (Figure 6C). Consequently, it is evident that the tripeptide motif (IFL) from FTN-H played a critical role in increasing the solubility of the FTN-L mutant, probably by attracting the DnaK chaperone.
Case of the (GST mutant)::ppGRN
As described earlier in this study (Figure 1), the GST fusion protein, GST::ppGRN, was almost 100% aggregated upon expression in E.coli. The fusion expression of ppGRN was attempted using a GST mutant containing a DnaK-binding motif from FTN-H. First, several hydrophobic clusters in the N-terminus region of native GST were mutated by replacing the hydrophobic amino acids with arginine residues, and the cytoplasmic solubility of the resulting GST mutants was analyzed after their expression in the same recombinant E.coli system. As a result, a high degree of aggregation was observed in the synthesis of a GST mutant (85LAP87→85RRR87) (Figure 7A). This is very similar to the previous results obtained with FTN-H mutants (Figure 4). Thus, the hydrophobic cluster (85LAP87) and the sequence around 85LAP87 seem to be important sites in the interaction between GST and DnaK. To prepare GST mutants containing the putative DnaK-binding motif (81IFL83) of FTN-H, we first tried to replace the 85LAP87 of GST with the 81IFL83 of FTN-H, thereby constructing a substitution mutant of GST, 85LAP87→85IFL87 (GST-m1). We also constructed the following additional mutants of GST containing 80RIFLQ84 and 79GRIFLQD85 of FTN-H: 84LLAPV88→84RIFLQ88 (GST-m2) and 83QLLAPVN89→83GRIFLQD89 (GST-m3). Then, three fusion proteins of ppGRN (GST-m1::ppGRN, GST-m2::ppGRN and GST-m3::ppGRN) were synthesized using the same T7 gene expression system. From the results of the SDS–PAGE analysis, it is surprising that ∼50% of the synthesized GST-m3::ppGRN was in the form of soluble protein in E.coli cytoplasm (Figure 7B). Figure 7C shows that almost 100% re-aggregation of GST-m3::ppGRN was also observed in the presence of 2-HEDS, but that the soluble GST-m3::ppGRN appeared again after DnaK co-expression, which is very similar to the cases of FTN-H and the FTN-L mutant.
Apparently, the two mutants, GST-m3 and FTN-L (77IFL79), have a common foreign sequence, GRIFLQD, originating from FTN-H, which is therefore likely to be a peptide motif with a high affinity to DnaK. It is generally accepted that DnaK binds to the hydrophobic region in an almost extended polypeptide chain of unfolded protein at the initial stage of protein folding. Therefore, the important factor determining the DnaK binding may not be protein conformation but the sequence structure of the binding motif. The sequence, GRIFLQD, consists of hydrophilic (polar) amino acid residues flanking the hydrophobic tripeptide. In the middle of the tripeptide, an amino acid with an aromatic side chain (Phe) is located (Figure 8). Presumably, such a unique sequence in the extended polypeptide chain can be easily recognized by DnaK.
In vitro DnaK-binding assay
Through an in vitro assay experiment, we investigated the actual binding of DnaK to the specific peptide GRIFLQD (named ‘peptide A’) above. In addition to the peptide A, we prepared another peptide, GRALFQD (named ‘peptide B’), which exactly corresponds to the 79th–85th amino acids of FTN-L that were replaced by peptide A upon the preparation of the FTN-L mutant (Figure 6B). From the results of Ponceau S staining (Figure 9A), it seems obvious that both peptides A and B were well immobilized on the PVDF membrane. The addition of DnaK to the immobilized peptides and the subsequent immunoblotting analysis using anti-DnaK monoclonal Ab showed that DnaK bound only to peptide A (Figure 9A), which clearly shows that the sequence 83GRILFQD89 within FTN-H has a high binding affinity for DnaK.
We also implemented an additional in vitro assay of DnaK binding to FTN-H and the FTN-L mutant containing GRIFLQD, and to wild type FTN-L that did not contain GRIFLQD (negative control). Since DnaK binds to an unfolded nascent chain at the early stage of folding, we performed the overlay assay using purified and unfolded FTN-H, FTN-L and FTN-L mutant. (The purified proteins were unfolded using a denaturation buffer containing 1 mM DTT and 8 M urea.) Evidently, from Figure 9B, DnaK bound to only the proteins containing GRIFLQD, i.e. FTN-H and the FTN-L mutant, whereas DnaK binding never happened to wild-type FTN-L. Through this additional in vitro assay, we confirmed that the peptide motif GRILFQD (i.e. the 83rd–89th amino acids of FTN-H) has a high binding affinity to DnaK and seems to be directly related to the synthesis of soluble heterologous proteins.
Supra-molecule formation by self-assembly function of N-terminus FTN-H
From TEM images of the purified FTN-H-fusion proteins (Figure 10A), it is very interesting that all the fusion proteins formed spherical nanoparticles with diameters of 10–15 nm, i.e. non-covalently cross-linked supra-molecules by self-assembly. The particle size seems to be quite uniform for each fusion protein. Considering that all the heterologous proteins (EGF, IL-2, ppGRN, GAD512–585 and mpINS) were aggregated into inclusion bodies when directly expressed in E.coli, the results imply that the supra-molecule formation in the shape of a spherical shell may provide significant advantage(s) in preventing protein aggregation. It seems that upon supra-molecule formation, there must be a limitation in the increase of the supra-molecule size due to the assembly properties of FTN-H. Therefore, self-assembled supra-molecules were produced with a constant size of 10–15 nm, thereby avoiding the undesirable formation of insoluble macro-aggregates of heterologous proteins.
Immunoblotting analysis (western blot) following SDS–PAGE analysis of the purified FTN-H::ppGRN showed an apparent immunoreactive band (Figure 10B). When western blot was carried out after native PAGE, however, no immunoreactive bands were visualized (Figure 10C), which clearly indicates that a part of ppGRN, recognized by anti-ppGRN monoclonal Ab, was localized inside the supra-molecules. Thus, another plausible advantage of the self-assembled supra-molecule is its prevention of protein aggregation by non-specific hydrophobic interaction through the localization of critical hydrophobic patches of the target protein inside the supra-molecule.
DISCUSSION
As newly translated polypeptides emerge from the ribosome, they face a formidable task in the crowded environment of the cell: folding into their correct structure for proper functioning. Molecular chaperones are essential for the correct folding of a significant fraction of cellular proteins under both physiological and stress conditions (24), acting in a complex network as a main defense against protein aggregation. Some chaperones exhibit a holding activity, preventing polypeptides from aggregation (25–27). Others show a complementary folding role to assist in refolding and solubilization from aggregates (28,29). Chaperones also minimize aggregation by mediating the degradation of proteins that cannot be properly folded (30). The specific contribution of particular chaperones to this multifunctional folding network has only been partially identified. The Hsp70 chaperone DnaK serves as a central element of the chaperone network, assisting in de novo protein folding, protein translocation, oligomer dissociation and prevention of stress-induced protein aggregation (31–33). DnaK mediates protein folding through ATP-dependent interaction with short linear peptide segments that are exposed on unfolded proteins (34).
The heterologous expression of various human proteins (EGF, IL-2, ppGRN, GAD512–585, ADI132–410 and mp-INS) was attempted through the fusion of the target protein to the C-terminus of FTN-H, all of which were produced as inclusion bodies when directly expressed in E.coli. As a result, the intracellular solubility of recombinant fusion proteins (FTN-H::EGF, FTN-H::IL-2, FTN-H::ppGRN, FTN-H::GAD512–585, FTN-H::ADI132–410 and FTN-H::mp-INS) was significantly enhanced (Figure 1). As explained above, DnaK plays a central role in the initial folding of the nascent polypeptide chain, and thus it seems reasonable to assume that FTN-H has a significant interaction with DnaK and contains specific motif(s) that DnaK efficiently recognizes and binds to. The results shown in Figures 3 and 9 strongly suggest that this assumption is not irrational. Recently, Vandenbroeck et al. (17) identified conserved DnaK-binding sites in the N- and C-terminal halves of helix B and C of diverse 4-helix-bundle proteins, respectively, that belong to the superfamily of interferon-γ/interleukin-10-related cytokines. In this study, we also attempted to identify the specific amino acid residues of FTN-H, if any, with a high affinity to DnaK. Through systematic investigation using site-directed mutagenesis, the oxidative and forced protein aggregation technique and the DnaK co-expression vector system (Figure 4), several putative binding motifs were selected. It has been finally confirmed through an in vitro binding assay that DnaK recognizes an extended peptide strand composed of a hydrophobic core of three residues (Ile, Phe and Leu) and two flanking regions enriched in polar residues (Gly, Arg, Gln and Arg), such as the sequence GRIFLQD (Figure 8), which seems similar to the findings of Rudiger et al. (35). They reported that the DnaK-binding motif consists of a hydrophobic core of four or five residues enriched particularly not only in Leu, but also in Ile, Val, Phe and Tyr and two flanking regions enriched in basic residues. Although recombinant FTN-L was all aggregated into inclusion bodies in E.coli cytoplasm, the cytoplasmic solubility of the recombinant FTN-L mutant containing the putative DnaK-binding motif (GRIFLQD) was significantly increased (Figure 6B). The efficacy of the binding motif was also validated using a GST mutant (GST-m3) containing the same motif sequence, i.e. the fusion protein GST-m3::ppGRN was expressed as a cytoplasmic soluble protein. The sequence GRIFLQD of FTN-H is located in the long loop between the second and the third helix of the FTN-H, and the tertiary structure of the FTN-H shows that the flexible long loop containing the binding motif is localized in the protein exterior. It is therefore likely that association and dissociation of DnaK at the binding motif would be quite efficient at the early stage of protein folding.
Similar to native human ferritin that is heteropolymer (with the shape of a spherical shell) of 24 subunits of two types, heavy chain and light chain, all the recombinant proteins, including FTN-H, FTN-H::EGF, FTN-H::IL-2, FTN-H::ppGRN, FTN-H::GAD512–585, FTN-H::ADI132–410 and FTN-H::mp-INS, have formed stable homopolymers (i.e. supra-molecules) by non-covalent self-association. This was confirmed by the TEM images showing nanometer-scale spherical particles of the recombinant proteins (Figure 10A). Recently, Louzada et al. (36) proposed the mechanism of a quaternary structure (dimer) formation in the folding pathway of glutathione reductase (GR). They proposed that the assembly of the native GR dimer proceed via the initial formation of molten-globule monomers that dimerize to form a partially folded, expanded dimer. Structural rearrangements then take place at the dimer interface, leading to a final, compact dimer structure. Based on this proposed mechanism, it is presumed that the intermolecular association of ferritin (or ferritin-fusion) monomers to form homopolymeric supra-molecules happens at the level of the partially folded (molten-globule) intermediate, and not at the level of the completely folded and thermodynamically optimal polypeptide chain. Protein aggregation, i.e. the off-pathway aggregation step, often initiates from partially folded intermediates upon intracellular chain folding and association (37). Within cells, nascent chains form sequentially on the ribosome, partially fold, and sometimes fail to reach their native state because of the intervention of various factors in a heterologous environment, including non-specific hydrophobic interactions with the intracellular cytoplasmic proteins (38), which sometimes cause the formation of large and insoluble protein aggregates, i.e. inclusion bodies. Supposing that as assumed earlier, the supra-molecules of FTN-H or its fusion mutants are formed through the self-association of partially folded intermediates, they may have a distinct advantage in preventing the formation of insoluble protein aggregates; i.e. upon supra-molecule formation, the supra-molecule size would be limited by the assembly properties of FTN-H, and therefore, self-assembled supra-molecules with constant sizes (10–15 nm) could be produced, thereby avoiding the undesirable formation of insoluble macro-aggregates of recombinant proteins (Figure 11).
As shown in Figure 11, a part of a target protein can be localized within the spherical shell of the supra-molecule upon the intermolecular association of a partially folded ferritin monomer, and thus, a protective environment against undesirable protein aggregation can be created. Evidently, from the results of the native PAGE and western blot analysis for FTN-H::ppGRN (Figure 10B and C), an anti-ppGRN mAb-specific epitope was shown to be localized inside the supra-molecule. Hydrophobic patches of ppGRN may be hidden together inside, which, otherwise, may be involved in a non-specific hydrophobic interaction with the adjacent polypeptide chains, leading to protein aggregation.
In conclusion, this study proposes the dual function of FTN-H used for the fusion expression of heterologous proteins: (i) high-affinity interaction and association with DnaK and (ii) formation of self-assembled supra-molecules with limited and constant sizes, thereby preventing the undesirable formation of insoluble macro-aggregates of heterologous proteins and possibly creating a protective environment against protein aggregation. Similarly, Hidaka et al. (18) found that the propeptide of guanylyl cyclase activating peptide II (GCAP-II), an endogenous ligand of guanylyl cyclase C, has a dual function in the proper folding of the mature peptide. Thus, the N-terminus propeptide of pro-GCAP-II is critical not only for disulfide-coupled folding but also for the net stabilization (or dimerization) of pro-GCAP-II. The ferritin supra-molecular structure at the nanometer scale (or spherical protein nanoparticles) can contribute to the enhancement of the stability of synthesized recombinant proteins and can also be used as a potent 3D probe structure for an ultra-sensitive protein chip if a special expression system is designed so that the captured probe peptide can be properly displayed at the surface of the ferritin nanoparticles.
Acknowledgments
This work was supported by Korea Research Foundation Grants (KRF-2002-041-D00196 and KRF-2004-005-D00057) and Korea Science and Engineering Foundation Grant (R01-2005-000-10355-0). This work was also supported by Microbial Genomics & Applications Center Grant (Microbial Genomics and Applications R & D Program). Funding to pay the Open Access publication charges for this article was provided by R01-2005-000-10355-0.
Conflict of interest statement. None declared.
REFERENCES
- 1.Yang X., Arosio P., Chasteen D. Molecular diffusion into ferritin: pathways, temperature dependence, incubation time and concentration effects. Biophys. J. 2000;78:2049–2059. doi: 10.1016/S0006-3495(00)76752-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Harrison P.M., Arosio P. The ferritins: molecular properties, iron storage function and cellular regulation. Biochim. Biophys. Acta. 1996;1275:161–203. doi: 10.1016/0005-2728(96)00022-9. [DOI] [PubMed] [Google Scholar]
- 3.Treffry A., Zhao Z., Quail M.A., Guest J.R., Harrison P.M. How the presence of three iron binding sites affects the iron storage function of the ferritin (EcFtnA) of Escherichia coli. FEBS Lett. 1998;432:212–218. doi: 10.1016/s0014-5793(98)00867-9. [DOI] [PubMed] [Google Scholar]
- 4.Balla G., Jacob H.S., Balla J., Rosenberg M., Nath K., Apple F., Eaton J.W., Vercellotti G.M. Ferritin: a cytoprotective antioxidant strategem of endothelium. J. Biol. Chem. 1992;267:18148–18153. [PubMed] [Google Scholar]
- 5.Lawson D.M., Artymiuk P.J., Yewdall S.J., Smith J.M., Livingstone J.C., Treffry A., Luzzago A., Levi S., Arosio P., Cesareni G. Solving the structure of human H ferritin by genetically engineering intermolecular crystal contacts. Nature. 1991;349:541–544. doi: 10.1038/349541a0. [DOI] [PubMed] [Google Scholar]
- 6.Liu X., Jin W., Theil E.C. Opening protein pores with chaotropes enhances Fe reduction and chelation of Fe from the ferritin biomineral. Proc. Natl Acad. Sci. USA. 2003;100:3653–3658. doi: 10.1073/pnas.0636928100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Santambrogio P., Levi S., Cozzi A., Rovida E., Albertini A., Arosio P. Production and characterization of recombinant heteropolymers of human ferritin H and L chains. J. Biol. Chem. 1993;268:12744–12748. [PubMed] [Google Scholar]
- 8.Guo J.H., Juan S.H., Aust S.D. Mutational analysis of the four alpha-helix bundle iron-loading channel of rat liver ferritin. Arch. Biochem. Biophys. 1998;352:71–77. doi: 10.1006/abbi.1998.0581. [DOI] [PubMed] [Google Scholar]
- 9.Lee J., Kim S.W., Kim Y.H., Ahn J.Y. Active human ferritin H/L-hybrid and sequence effect on folding efficiency in Escherichia coli. Biochem. Biophys. Res. Commun. 2002;298:225–229. doi: 10.1016/s0006-291x(02)02429-4. [DOI] [PubMed] [Google Scholar]
- 10.Allen S.P., Polazzi J.O., Gierse J.K., Easton A.M. Two novel heat shock genes encoding proteins produced in response to heterologous protein expression in Escherichia coli. J. Bacteriol. 1992;174:6938–6947. doi: 10.1128/jb.174.21.6938-6947.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bardwell J.C., Craig E.A. Ancient heat shock gene is dispensable. J. Bacteriol. 1988;170:2977–2983. doi: 10.1128/jb.170.7.2977-2983.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Bjellqvist B., Sanchez J.C., Pasquali C., Ravier F., Paquet N., Frutiger S., Hughes G.J., Hochstrasser D. Micropreparative two-dimensional electrophoresis allowing the separation of samples containing milligram amounts of proteins. Electrophoresis. 1993;14:1375–1378. doi: 10.1002/elps.11501401212. [DOI] [PubMed] [Google Scholar]
- 13.Buchberger A., Valencia A., McMacken R., Sander C., Bukau B. The chaperone function of DnaK requires the coupling of ATPase activity with substrate binding through residue E171. EMBO J. 1994;13:1687–1695. doi: 10.1002/j.1460-2075.1994.tb06433.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Buchberger A., Schröder H., Hesterkamp T., Schönfeld H.-J., Bukau B. Substrate shuttling between the DnaK and GroEL systems indicates a chaperone network promoting protein folding. J. Mol. Biol. 1996;261:328–333. doi: 10.1006/jmbi.1996.0465. [DOI] [PubMed] [Google Scholar]
- 15.Buchberger A., Reinstein J., Bukau B. The DnaK chaperone system: mechanism and comparison with other Hsp70 systems. In: Bukau B., editor. Molecular Chaperones and Folding Catalysts: Regulation, Cellular Function and Mechanisms. Amsterdam, The Netherlands: Harwood Academic Publishers; 1999. pp. 609–635. [Google Scholar]
- 16.Bukau B., Horwich A.L. The Hsp70 and Hsp60 chaperone machines. Cell. 1998;92:351–366. doi: 10.1016/s0092-8674(00)80928-9. [DOI] [PubMed] [Google Scholar]
- 17.Vandenbroeck K., Alloza I., Brehmer D., Billiau A., Proost P., McFerran N., Rudiger S., Walker B. The conserved helix C region in the superfamily of interferon-gamma/interleukin-10-related cytokines corresponds to a high-affinity binding site for the HSP70 chaperone DnaK. J. Biol. Chem. 2002;277:25668–25676. doi: 10.1074/jbc.M202984200. [DOI] [PubMed] [Google Scholar]
- 18.Hidaka Y., Ohno M., Hemmasi B., Hill O., Forssmann W.G., Shimonishi Y. In vitro disulfide-coupled folding of guanylyl cyclase-activating peptide and its precursor protein. Biochemistry. 1998;37:8498–8507. doi: 10.1021/bi9731246. [DOI] [PubMed] [Google Scholar]
- 19.Pogulis R.J., Vallejo A.N., Pease L.R. In vitro recombination and mutagenesis by overlap extension PCR. Methods Mol. Biol. 1996;57:167–76. doi: 10.1385/0-89603-332-5:167. [DOI] [PubMed] [Google Scholar]
- 20.Lee J., Kim H.C., Kim S.W., Kim S.W., Hong S.I., Park Y.H. Interplay of SOS induction, recombinant gene expression and multimerization of plasmid vectors in Escherichia coli. Biotechnol. Bioeng. 2002;80:84–90. doi: 10.1002/bit.10354. [DOI] [PubMed] [Google Scholar]
- 21.Boyde T.R., Rahmatullah M. Optimization of conditions for the colorimetric determination of citrulline, using diacetyl monoxime. Anal. Biochem. 1980;15:424–31. doi: 10.1016/0003-2697(80)90404-2. [DOI] [PubMed] [Google Scholar]
- 22.Kim S.W., Kim Y.H., Lee J. Thermal stability of human ferritin: concentration dependence and enhanced stability of an N-terminal fusion mutant. Biochem. Biophys. Res. Commun. 2001;289:125–129. doi: 10.1006/bbrc.2001.5931. [DOI] [PubMed] [Google Scholar]
- 23.Shin C.S., Hong M.S., Bae C.S., Lee J. Enhanced production of human mini-proinsulin in fed-batch cultures at high cell density of Escherichia coli BL21(DE3)[pET-3aT2M2] Biotechnol. Prog. 1997;13:249–257. doi: 10.1021/bp970018m. [DOI] [PubMed] [Google Scholar]
- 24.Tomoyasu T., Mogk A., Langen H., Goloubinol P., Bukau B. Genetic dissection of the roles of chaperones and proteases in protein folding and degradation in the Escherichia coli cytosol. Mol. Microbiol. 2001;40:397–413. doi: 10.1046/j.1365-2958.2001.02383.x. [DOI] [PubMed] [Google Scholar]
- 25.Ehrnsperger M., Graber S., Gaestel M., Buchner J. Binding of non-native protein to Hsp25 during heat shock creates a reservoir of folding intermediates for reactivation. EMBO J. 1997;16:221–229. doi: 10.1093/emboj/16.2.221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Veinger L., Diamant S., Buchner J., Goloubinol P. The small heat-shock protein IbpB from Escherichia coli stabilizes stress-denatured proteins for subsequent refolding by a multichaperone network. J. Biol. Chem. 1998;273:11032–11037. doi: 10.1074/jbc.273.18.11032. [DOI] [PubMed] [Google Scholar]
- 27.Mogk A., Tomoyasu T., Goloubinol P., Rudiger S., Roder D., Langen H., Bukau B. Identification of thermolabile Escherichia coli proteins: prevention and reversion of aggregation by DnaK and ClpB. EMBO J. 1999;18:6934–6949. doi: 10.1093/emboj/18.24.6934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Parsell D.A., Kowal A.S., Singer M.A., Lindquist S. Protein disaggregation mediated by heat-shock protein Hsp104. Nature. 1994;372:475–478. doi: 10.1038/372475a0. [DOI] [PubMed] [Google Scholar]
- 29.Ben-Zvi A.P., Goloubinol P. Review: mechanisms of disaggregation and refolding of stable protein aggregates by molecular chaperones. J. Struct. Biol. 2001;135:84–93. doi: 10.1006/jsbi.2001.4352. [DOI] [PubMed] [Google Scholar]
- 30.Huang H.C., Sherman M.Y., Kandror O., Goldberg A.L. The molecular chaperone DnaJ is required for the degradation of a soluble abnormal protein in Escherichia coli. J. Biol. Chem. 2001;276:3920–3928. doi: 10.1074/jbc.M002937200. [DOI] [PubMed] [Google Scholar]
- 31.Gottesman S., Wickner S., Maurizi M.R. Protein quality control: triage by chaperones and proteases. Genes Dev. 1997;11:815–823. doi: 10.1101/gad.11.7.815. [DOI] [PubMed] [Google Scholar]
- 32.Bukau B., Schmid F.X., Buchner J. Assisted protein folding. In: Bukau B., editor. Molecular Chaperones and Folding Catalysts: Regulation, Cellular Function and Mechanisms. Amsterdam, The Netherlands: Harwood Academic Publishers; 1999. pp. 3–10. [Google Scholar]
- 33.Hartl F.U., Hayer-Hartl M. Molecular chaperones in the cytosol: from nascent chain to folded protein. Science. 2002;295:1852–1858. doi: 10.1126/science.1068408. [DOI] [PubMed] [Google Scholar]
- 34.Ewalt K.L., Hendrick J.P., Houry W.A., Hartl F.U. In vivo observation of polypeptide flux through the bacterial chaperonin system. Cell. 1997;90:491–500. doi: 10.1016/s0092-8674(00)80509-7. [DOI] [PubMed] [Google Scholar]
- 35.Rudiger S., Germeroth L., Schneider-Mergener J., Bukau B. Substrate specificity of the DnaK chaperone determined by screening cellulose-bound peptide libraries. EMBO J. 1997;16:1501–1507. doi: 10.1093/emboj/16.7.1501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Louzada P.R., Sebollela A., Scaramello M.E., Ferreira S.T. Predissociated dimers and molten globule monomers in the equilibrium unfolding of yeast glutathione reductase. Biophys. J. 2003;85:3255–3261. doi: 10.1016/S0006-3495(03)74743-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Hasse-Pettingell C.A., King J. Formation of aggregates from a thermolabile in vivo folding intermediate in P22 tailspike maturation. A model for inclusion body formation. J. Biol. Chem. 1988;263:4977–4983. [PubMed] [Google Scholar]
- 38.Mitraki A., King J. Amino acid substitutions influencing intracellular protein folding pathways. FEBS Lett. 1992;307:20–25. doi: 10.1016/0014-5793(92)80894-m. [DOI] [PubMed] [Google Scholar]