Skip to main content
PLOS One logoLink to PLOS One
. 2024 Apr 29;19(4):e0299701. doi: 10.1371/journal.pone.0299701

Polyaminated, acetylated and stop codon readthrough of recombinant Francisella tularensis universal stress protein in Escherichia coli

Benjamin Girardo 1, Lawrence M Schopfer 2, Yinshi Yue 1, Oksana Lockridge 2,*, Marilynn A Larson 1
Editor: Subbulakshmi Chidambaram3
PMCID: PMC11057771  PMID: 38683788

Abstract

Recombinant Francisella tularensis universal stress protein with a C-terminal histidine-tag (rUsp/His6) was expressed in Escherichia coli. Endogenous F. tularensis Usp has a predicted molecular mass of 30 kDa, but rUsp/His6 had an apparent molecular weight of 33 kDa based on Western blot analyses. To determine the source of the higher molecular weight for rUsp/His6, post translational modifications were examined. Tryptic peptides of purified rUsp/His6 were subjected to liquid chromatography tandem mass spectrometry (LC-MS/MS) and fragmentation spectra were searched for acetylated lysines and polyaminated glutamines. Of the 24 lysines in rUsp/His6, 10 were acetylated (K63, K68, K72, K129, K175, K201, K208, K212, K233, and K238) and three of the four glutamines had putrescine, spermidine and spermine adducts (Q55, Q60 and Q267). The level of post-translational modification was substoichiometric, eliminating the possibility that these modifications were the sole contributor to the 3 kDa extra mass of rUsp/His6. LC-MS/MS revealed that stop codon readthrough had occurred resulting in the unexpected addition of 20 extra amino acids at the C-terminus of rUsp/His6, after the histidine tag. Further, the finding of polyaminated glutamines in rUsp/His6 indicated that E. coli is capable of transglutaminase activity.

Introduction

Bacteria, plants, archaea, fungi, and select invertebrates have universal stress proteins (Usp) whose proposed function is to protect against a variety of stressors (i.e., reactive oxygen species and starvation), assist in iron scavenging, and sense ion and energy levels [1, 2]. Escherichia coli and Mycobacterium tuberculosis contain 6 and 10 Usp isoforms, respectively, and associated studies have provided knowledge on the potential function of Usp [36]. However, the presence of multiple Usp isoforms has complicated data interpretation in these investigations. In contrast to other bacteria, Francisella tularensis contains a single copy, highly conserved usp gene. F. tularensis causes the zoonotic disease tularemia, which can be fatal if left untreated. A first step in our goal to elucidate the function of Usp in this intracellular pathogen was to express the recombinant Usp and produce antibodies for detection of Usp in Francisella tularensis.

F. tularensis Usp has a theoretical molecular weight of 30,188 Da calculated from the amino acid sequence (accession number WP_003021757). Recombinant F. tularensis Usp (rUsp/His6) has a C-terminal histidine tag comprised of LEHHHHHH, which contributes 1065 Da for a combined molecular weight of 31,253. However, Western blots probed with an antibody to F. tularensis Usp indicated that rUsp/His6 has an apparent molecular weight of 33 kDa, about 3 kDa heavier than predicted for this protein.

Since recombinant proteins frequently carry non-native post-translational modifications [79], a likely explanation for the excess mass of F. tularensis rUsp/His6 was post-translational modification by E. coli. For this reason, recombinant proteins intended for therapeutic use in humans are rigorously characterized for undesired modifications. Previous studies showed that 0.8% of the insulin Lispro protein expressed in E. coli was acetylated on lysine 31 of the B chain [8] and the recombinant human IL-10 protein was partly acetylated on 4 out of 13 lysines [9]. The E. coli expression system modified recombinant human histones and recombinant human heat shock protein HSP70 by adding putrescine, spermidine, and spermine [7]. When acetylated lysines lose their positive charge, a less soluble protein is often produced that is prone to aggregation [10]. Post-translational modifications are typically substoichiometric, so only a small percentage of the associated residues in a given recombinant protein are modified.

In view of these precedents, we searched for acetylated lysines and polyaminated glutamines in purified F. tularensis rUsp/His6 expressed in E. coli. Our study provides evidence that F. tularensis rUsp/His6 was acetylated on 10 out of 24 lysines and polyaminated on 3 out of 4 glutamines, but these modified peptides were found in low abundance, suggesting a frequency of less than 1% for a given residue. Therefore, the low frequency of post-translational modifications led us to search for an alternative explanation for the extra 3 kDa molecular weight of F. tularensis rUsp/His6. Mass spectrometry data identified the unexpected addition of 20 amino acids after the C-terminal histidine tag. Although stop codon readthrough has been shown to occur [1114], this phenomenon in the pET expression system for recombinant protein production in E. coli has not been previously reported. The pET expression system was described more than thirty years ago [15] and remains one of the most popular systems for recombinant protein production. Improvements to the pET expression system have been made [16], but additional precautions are warranted to prevent stop codon readthrough.

Materials and methods

Nickel-nitrilotriacetic acid (Ni-NTA) chromatography resin, Thermo Scientific #88221. Protein G agarose, Protein Mods LLC, IgG binding capacity 20 mg/mL #PGG. CNBr-activated Sepharose 4-Fast Flow. Amersham Biosciences #17-0981-01. C18 spin columns Pierce #89870. Steriflip vacuum filtration system 0.22 μm membrane Millipore #SCGP00525. Trypsin sequencing grade modified, Promega #V5113. Precast polyacrylamide gels Bio-Rad #4561044 (12%). Polyvinylidene fluoride (PVDF) 0.45 μm Immobilon-P transfer membrane, Millipore #IPVH00010. Enhanced Chemiluminescence (ECL) reagents Radiance Plus, Azure Biosystems #AC2103. Affinity purified rabbit anti-denatured Usp polyclonal, produced in-house. Donkey anti-rabbit IgG conjugated to horse radish peroxidase (HRP), Jackson ImmunoResearch #711-035-152. Bicinchoninic acid (BCA) protein assay, Pierce #23255. pET28a, Novagen #69864–3. 2x Yeast extract tryptone culture medium (2YT), Sigma #Y2377.

Bacterial strains and culturing

F. tularensis LVS (NR-28537) was obtained from BEI Resources. E. coli BL21(DE3) was purchased from Novagen. F. tularensis was subcultured on chocolate agar plates (Remel) for 2 days at 37°C and then used to inoculate filter-sterilized Chamberlain’s Chemically Defined Medium (CDM). Chamberlain’s medium contains 40 mg spermine phosphate per liter [17]. F. tularensis LVS was grown at 37°C in CDM to mid-log growth phase. E. coli BL21(DE3) containing pET28a expressing F. tularensis rUsp/His6 was grown in 2YT at 37°C to an optical density at 600 nm of approximately 1, prior to induction with 0.5 mM IPTG at 18°C for 20 h. To check the solubility and approximate yield of rUsp/His6, an aliquot containing 25 x 1010 of bacterial cells was lysed by sonication in 25 mL of 25 mM Tris HCl, 100 mM NaCl, 20 mM sodium fluoride, 55 mM sodium ortho-vanadate, 55 mM pyrophosphate, 1 mM PMSF, and 2 μg/mL aprotinin. The cell suspension was clarified by centrifugation at 4000 rpm (2683xg) for 30 min in a table top Sorvall centrifuge cooled to 10˚C. Fine particles were removed by filtration through a 0.22 μm membrane. The final protein concentration was 3 mg/mL based on a BCA assay.

Cloning of recombinant F. tularensis Usp into E. coli expression plasmid

The usp gene (locus tag FTT-RS01275) encoding the universal stress protein from F. tularensis SCHU S4 was amplified using PCR with the forward primer 5’-TCTCCATGGCGTACAAAAAGGTTTTATTAGC and reverse primer 5’-ATCTCTCGAGTTTAAGCCTTACAACCAAAACATC. The resulting amplicon was digested with NcoI and XhoI restriction endonucleases and cloned into the respective sites in the pET28a expression plasmid. This cloning design was expected to produce a recombinant F. tularensis Usp with a C-terminus histidine tag, which adds 8 residues (LEHHHHHH) to the 278 amino acid protein for affinity purification. The DNA sequence of the F. tularensis usp gene in pET28a was confirmed before the plasmid was transfected into E. coli BL21(DE3) for expression. To ensure that F. tularensis usp which was cloned into pET28a would produce the correct amino acid sequence, Sanger sequencing of this expression plasmid was performed in both directions with appropriate primers, by the UNMC Genomics Core Facility.

Purification of recombinant F. tularensis Usp with C-terminal histidine tag

After induction of the pET vector for the expression of F. tularensis rUsp/His6 in E. coli BL21(DE3), bacterial cells were washed in PBS and lysed in 120 mL of 20 mM sodium phosphate, 0.5 M NaCl, 30 mM imidazole (pH 7.4) by passage through an EmulsiFlex-C3 (Avestin). The lysate was clarified by centrifugation in a Sorvall centrifuge for 30 min at 2683xg, followed by filtration through a 0.22 μm membrane. F. tularensis rUsp/His6 was purified by Ni-NTA chromatography by gravity flow through 15 mL Ni-NTA packed into Pharmacia column C16/20. The Ni-NTA column was equilibrated with 100 mL binding buffer (20 mM sodium phosphate, 0.5 M NaCl, 30 mM imidazole, pH 7.4) before the clarified lysate was manually loaded. Material that did not bind was washed off with 150 mL binding buffer. Washing was complete when the absorbance at 280 nm of the effluent was the same 0.085 as the absorbance of the binding buffer. Bound protein was eluted with elution buffer (20 mM sodium phosphate, 0.5 M imidazole, 0.5 M NaCl pH 7.4). Four mL fractions were collected manually and tested for Usp by Western blot. The highest concentration of Usp was in fractions 3–5. Pooled fractions were dialyzed against 4 L of phosphate buffered saline (pH 7.4) at 4˚C to remove imidazole and to reduce the salt concentration. Twenty mg of purified F. tularensis rUsp/His6 was recovered from 200 mL culture.

Production of polyclonal antibodies to denatured F. tularensis rUsp/His6 in rabbits

Purified F. tularensis rUsp/His6 in phosphate buffered saline (pH 7.4) was turbid at 2 mg/mL, but almost clear at 0.5 mg/mL. A 10 mL solution of 0.5 mg/mL F. tularensis rUsp/His6 in phosphate buffered saline was heated in a boiling water bath for 10 min to denature the protein. The denatured F. tularensis rUsp/His6 solution was sent to Pacific Immunology (Ramona, CA) for injection into two New Zealand White rabbits. Each rabbit was immunized five times with antigen in Incomplete Freund’s Adjuvant over a period of three months. Serum was collected on day 49 and at 14-day intervals thereafter. A total of 313 mL of serum was collected.

Affinity purification of antibodies to denatured F. tularensis rUsp/His6 from rabbit serum

The antiserum (313 mL) was clarified by centrifugation, and IgG was enriched by ammonium sulfate precipitation [18]. The ammonium sulfate pellet was suspended in 140 mL of 10 mM Tris HCl (pH 8), 0.02% azide and desalted by dialysis against 4 x 4 liters of 10 mM Tris HCl (pH 8), 0.1% azide at 4°C. The dialyzed protein had a volume of 172 mL. Total IgG was isolated on a 30 mL column of Protein G agarose. After the desalted protein suspension was applied to the Protein G column, the column was washed with 200 mL of 10 mM Tris HCl (pH 8). Bound protein was eluted with 0.1 M glycine (pH 2.7). Three mL fractions were collected into tubes preloaded with 0.3 mL of 1 M Tris HCl (pH 8). The eluted samples were immediately mixed to raise the pH of the eluted proteins from pH 2.7 to 8. The total amount of rabbit IgG collected was 430 mg.

Antibodies to denatured F. tularensis rUsp/His6 were separated from total rabbit IgG by affinity chromatography to immobilized rUsp/His6. Immobilization was performed by coupling 4 mg of denatured, purified rUsp/His6 to 3 mL of swollen CNBr-activated Sepharose beads. Eighteen mg of affinity purified rabbit antibodies to denatured F. tularensis rUsp/His6 was recovered from 430 mg of total rabbit IgG (4%).

Western blot

F. tularensis lysate and purified rUsp/His6 were separated on 12% precast SDS gels (Bio-Rad). Proteins were transferred from the polyacrylamide gel to 0.45 μm PVDF membrane on a Bio-Rad Trans-Blot SP semi-dry transfer cell. Blots were hybridized with 5 μl (3.5 μg) affinity purified rabbit polyclonal antibodies to denatured F. tularensis rUsp/His6 in 20 mL Tris buffered saline containing 5% dry milk, overnight at 4°C. The secondary antibody, donkey anti-rabbit IgG conjugated to horse radish peroxidase, was diluted 2 μL (0.4 mg/mL in 50% glycerol) into 20 mL Tris buffered saline, 5% dry milk. Washed blots were developed with ECL reagents from Azure Bioimaging and images were acquired on the Azure 600 Bioimaging device.

Experimental design and statistical rationale

A total of 29 Orbitrap data sets for F. tularensis rUsp/His6 were searched for acetylated and polyaminated adducts using Protein Prospector software. Data sets typically covered 92 to 98.9% of the amino acid sequence of the Usp protein. We manually evaluated each candidate MS/MS spectrum and found convincing evidence for acetylated and polyaminated peptides in 6 Orbitrap data sets. We rejected potential adducts in 23 data sets because the MS/MS spectra did not convince us that an adduct was the only possible interpretation of the data.

We required the MS/MS spectrum to contain ions that identify the modified residue. We searched for alternative interpretations of the MS/MS data and rejected candidate adducts that could be explained by substituting amino acids for the presumed adduct. For example, 128 Da spermidine could also represent 128 Da lysine; 71 Da putrescine could also represent 71 Da Alanine. The substituted amino acids had to fit the known amino acid sequence of F. tularensis Usp. Examples of rejected candidate adducts are described in the Discussion section and include an MS/MS spectrum of a false positive.

Sample preparation for mass spectrometry

Samples were digested with trypsin either in solution or in gel slices. Our protocol for in solution digestion is to reduce disulfide bonds with 10 mM dithiothreitol, alkylate with 50 mM iodoacetamide, digest with trypsin overnight at 37°C, and desalt on a C18 spin column.

The in-gel protocol for the analysis of F. tularensis Usp begins by cutting out a Coomassie blue stained gel slice migrating in the 30 to 35 kDa range. The gel slice is destained with 50% acetonitrile, dried, reduced with dithiothreitol in 100 mM ammonium bicarbonate (pH 8), alkylated with iodoacetamide, washed with 50% acetonitrile, and dried in a vacuum centrifuge [19]. Protein in the gel slice is digested with trypsin in 100 mM ammonium bicarbonate (pH 8) overnight at 37°C. Peptides are extracted with 50% acetonitrile, 0.1% TFA, and dried. The peptides are cleaned on a C18 spin column, dried, and dissolved in 20 μl water.

Protein concentration in the peptide solution was estimated from absorbance at 280 nm in a Nanodrop spectrophotometer (Thermo Fisher Scientific). An absorbance at 280 nm of 1 corresponds to a protein concentration of 1 μg/μl. Injection of 1 μl peptide solution at 1 μg/μl protein concentration yielded 99% coverage of F. tularensis Usp when evaluating purified rUsp/His6.

Liquid chromatography-tandem mass spectrometry

Peptide separation was performed with a Thermo RSLC Ultimate 3000 ultra-high pressure liquid chromatography system (Thermo Scientific) at 36°C, as previously described [20]. Solvent A was 0.1% formic acid in water, and solvent B was 0.1% formic acid in 80% acetonitrile. Peptides were loaded onto an Acclaim PepMap 100 C18 trap column (75 μm x 2 cm; Thermo Scientific catalog number 165535) at a flow rate of 4 μL/min and washed with 98% solvent A/2% solvent B for 10 minutes. Desalted peptides were pumped into a Thermo Easy-Spray PepMap RSLC C18 column (75 μm x 50 cm with 2 μm particles, Thermo Scientific catalog number ES803) and separated at a flow rate of 300 nL/min using a gradient of 9 to 50% solvent B in 30 min, 50 to 99% solvent B in 40 min, hold at 99% solvent B for 10 min, 99 to 9% solvent B in 4 min, and hold at 9% solvent B for 16 min.

Eluted peptides were sprayed directly into a Thermo Orbitrap Fusion Lumos Tribrid mass spectrometer (Thermo Scientific). Data were collected using data dependent acquisition. A survey full scan MS (from 350–1800 m/z) was acquired in the Orbitrap with a resolution of 120,000. The AGC target (Automatic Gain Control for setting the ion population in the Orbitrap before collecting the MS) was set at 4 x 105 and the ion filling time was set at 50 msec. The 25 most intense ions with charge state of 2–6 were isolated in a 3 sec cycle and fragmented using high-energy collision induced dissociation with 35% normalized collision energy. Fragment ions were detected in the Orbitrap with a mass resolution of 30,000 at 200 m/z. The AGC target for MS/MS was set at 5 x 104, and dynamic exclusion was set at 30 sec with a 10 ppm mass window. Data were reported in *.raw format. The *.raw data files were converted to *.mgf files using MSConvert, a ProteoWizard Tool from SourceForge.

Protein prospector search for modified peptides

The search parameters on the Batch-Tag Web page in Protein Prospector (prospector.ucsf.edu/prospector/mshome.htm) were as follows. Database: User protein. User Protein Sequence: pasted the FASTA file of protein WP_003021757 into the window for User Protein Sequence. Taxonomy: Francisella tularensis. Precursor charge range: 2, 3, 4, 5. Parent Tol: 20 ppm, Frag Tol: 30 ppm. Instrument: ESI-Q-high-res. Digest: No enzyme. (Note that the no enzyme digestion option is justified by the overnight tryptic digestion, which provides ample opportunity for non-specific cleavage.) Max Missed Cleavages: 2. Constant mods: none. Expectation Calc Method: None. Variable Mods: Carbamidomethyl (C), Oxidation (M). User Defined Variable Modifications: Acetyl K. C2O1H2; mass modification range 41.9 to 42.1. Putrescine Q. C4H9N1; mass modification range 70.9 to 71.3. Spermidine Q. C7N2H16; mass modification range 127.9 to 128.4. Spermine Q. C10H23N3; mass modification range 185.0 to 185.4.

Results

Immunoblot comparison of endogenous F. tularensis Usp and rUsp/His6

Western blot analyses showed that the endogenous F. tularensis Usp has an apparent molecular weight of ~30 kDa (Fig 1, lane 1), consistent with its theoretical molecular weight of 30188 Da. However, F. tularensis rUsp/His6 is larger at ~33 kDa (Fig 1, lane 2). The histidine tag accounts for 1 kDa, leaving unexplained an excess mass of 2 kDa. The salt in a protein sample can affect migration in an SDS gel. To be sure the difference in molecular weight is not an artifact of salt, the protein samples were mixed to produce a uniform salt concentration. Fig 1 lanes 4 and 5 show 2 bands for the mixed sample, thus ruling out the possibility that salt contributes to the higher molecular weight observed for rUsp/His6. It was concluded that the observed molecular weight difference is real and that F. tularensis rUsp/His6 expressed in E. coli is at least 3 kDa heavier than endogenous F. tularensis Usp.

Fig 1. Western blot hybridized with affinity purified polyclonal antibodies to denatured F. tularensis rUsp/His6.

Fig 1

Lane 1, 0.5 μg F. tularensis lysate. Lane 2, 0.01 μg purified F. tularensis rUsp/His6. Lane 3, protein molecular weight markers with sizes denoted in kilodaltons (kDa). Lanes 4 and 5, combined 0.5 μg F. tularensis lysate plus 0.01 μg F. tularensis rUsp/His6.

These results demonstrated that the affinity purified rabbit polyclonal antibodies for denatured F. tularensis rUsp/His6 is specific for Usp, as indicated by the single band obtained for endogenous F. tularensis Usp in the lysate (Fig 1 lane 1). In addition, the ability to detect endogenous Usp in 0.5 μg of F. tularensis lysate suggests the Usp protein is abundant.

Post-translational modifications on F. tularensis rUsp/His6

A possible explanation for the higher than expected molecular weight of the F. tularensis rUsp/His6 was initially postulated to be due to post-translational modifications. We used LC-MS/MS to generate fragmentation spectra of isolated rUsp/His6 peptides. The spectra were searched for acetylated and polyaminated adducts. Fig 2 presents convincing evidence for acetylated K238 in peptide AAENK(42)NDLIVVGSHR of rUsp/His6. Masses for the y11 and y12 ions are heavier by the 42 Da of the acetyl group, thus supporting acetylation of K238. The location of the 42 Da adduct is further defined by the y1 to y10 ions whose masses are free of 42 Da. Table 1 lists the peptides containing acetylated lysines in F. tularensis rUsp/His6. Of the 24 lysines in Usp, 10 are acetylated in rUsp/His6. It was estimated that less than 1% of any specific peptide was acetylated.

Fig 2. MS/MS spectrum for acetylated lysine 238 in F. tularensis rUsp/His6.

Fig 2

The exact location of the 42.01 Da acetyl adduct is defined by the y1 to y10 ion series and the y11 ion whose structures are shown. The mass of the y11 ion at 1279.71 Da is 42.01 Da higher than the mass of the unmodified y11 ion. Masses of the y1 to y10 ions are consistent with the absence of an adduct on residues NDLIVVGSHR. The MH+2 parent ion has a mass of m/z 832.9394.

Table 1. Acetylated lysines in E. coli expressed F. tularensis rUsp/His6 (Accession # WP_003021757).

Peptide Acetylated K Score % matched intensity
SIEQEAK63EALDK K63+42.01 37.2 76.9
EALDK68LVTK K68+42.01 36.0 96.0
LVTK72ISGIK K72+42.01 36.2 92.5
AK129CDVLTVR K129+42.01 31.0 90.6
DIAK175LYSAK K175+42.01 24.9 71.9
TYETDK201VETTLDK K201+42.01 42.0 95.1
VETTLDK208FAEK K208+42.01 43.0 98.8
FAEK212NGITGEK K212+42.01 44.4 97.5
SVMIGGISNSLLEK233A K233+42.01 38.4 80.3
AAENK238NDLIVVGSHR K238+42.01 55.5 96.2

Score is a statistic calculated by Protein Prospector. The higher the score, the more confidence in the assignment. A score of 20 or less is a poor score with little confidence in the assignment. A score of 30 or higher reflects high confidence in the assigned adduct. % matched intensity reflects the number and intensity of peaks that fit the assigned peptide. A 100% matched intensity means every peak in the spectrum is assigned to the peptide.

Marakasova [21] identified 280 acetylated proteins in F. novicida strain U112 by proteome analysis of proteins that had been enriched for acetylation by addition of glucose and acetate into the culture medium or by direct chemical acetylation of Nε-lysines in the cell lysate with acetyl phosphate. One of the acetylated proteins in this set was Usp. The Usp in F. novicida was acetylated on K180, K146, K219, and K212 [21]. Table 1 shows only acetylated K212 is common to both F. novicida U112 and F. tularensis Usp. Additionally, both SWISS-MODEL [22, 23] and AlphaFold [24, 25] 3D modeling of F. tularensis Usp (Q5NI44) indicated that K212 and the other nine acetylated lysines in Table 1 are exposed on the surface of the Usp protein.

The MS/MS spectrum in Fig 3 provides evidence for the presence of putrescine-modified glutamine 267 in E. coli expressed F. tularensis rUsp/His6. The putrescine adduct adds 71 Da to the masses of the b5 to b9 ions, whose structures are shown. Masses of the y1 to y7 ions define the amino acid sequence for an unmodified portion of the peptide.

Fig 3. MS/MS spectrum for putrescine adduct on glutamine 267 of F. tularensis rUsp/His6.

Fig 3

The b5 to b9 ions support putrescine on Q267 because they carry an added mass of 71 Da. The y-ions do not have an added mass of 71, which means putrescine is not on residues VDVLVVR. The MH+3 parent ion mass is m/z 545.3059. The parent ion minus a water molecule, MH-H2O+3, has a peak at m/z 539.2938.

Evidence for a spermidine adduct on glutamine 267 in peptide QANVDVLVVR of F. tularensis rUsp/His6 is provided in Fig 4. The y-ions support the peptide sequence. None of the y-ions include the mass of spermidine. This leaves glutamine as the spermidine-modified residue. The mass of the parent ion supports the presence of the 128.1 Da spermidine in this peptide.

Fig 4. MS/MS spectrum for the spermidine adduct on Q267 of F. tularensis rUsp/His6.

Fig 4

The structures of y1 to y9 ions are shown; none of the y-ions include the mass for spermidine. The added mass of 128.1 Da from spermidine is on glutamine. The MH+2 parent ion at m/z 620.8537 supports the presence of spermidine in this peptide. A peak for the parent ion minus water is at 611.8492 in charge state +2.

The MS/MS spectrum in Fig 5 supports the presence of a spermine adduct on glutamine 60 in peptide IVDFQHSIEQ(185.1)E of F. tularensis rUsp/His6. The masses of the y2 to y10 ions are consistent with an added mass of 185.1 Da on each of the structures shown. Table 2 summarizes the polyamine adducts identified by mass spectrometry in purified F. tularensis rUsp/His6. These findings provide evidence that a given glutamine such as Q267 can be modified by putrescine, spermidine, or spermine, suggesting that modifications fluctuate and are influenced by factors such as growth conditions.

Fig 5. MS/MS spectrum for the spermine adduct on glutamine 60 of F. tularensis rUsp/His6.

Fig 5

The structures of the y-ions show that all carry an added mass of 185 Da from spermine. The MH+3 parent ion is m/z 510.5917.

Table 2. Polyamine adducts in F. tularensis rUsp/His6.

Peptide Polyamine adduct Score % matched intensity
SILHQ267ANVDVLVVR Q267 putrescine +71 35.6 86.1
VVTVIDCVAPFAPSIVDFQ55H Q55 spermidine +128.1 47.6 92.0
ILHQ267ANVDVLVVR Q267 spermidine +128.1 33.5 44.9
FFLGSTANSILHQ267AN Q267 spermidine +128.1 45.4 88.2
FFLGSTANSILHQ267A Q267 spermidine +128.1 42.4 85.2
LGSTANSILHQ267A Q267 spermidine +128.1 31.8 62.1
Q267ANVDVLVVR Q267 spermidine +128.1 28.5 64.9
APSIVDFQHSIEQ60EA Q60 spermine +185.1 46.1 84.1
IVDFQHSIEQ60EA Q60 spermine +185.1 25 72.6
FFLGSTANSILHQ267A Q267 spermine +185.1 45.4 88.2

Stop codon readthrough in E. coli expressed F. tularensis rUsp/His6

Although we found acetylated lysines and polyaminated glutamines, the greater than expected size of F. tularensis rUsp/His6 cannot be explained by post-translational modifications in E. coli since only a small percent of a given residue was modified. Even if each residue were 100% modified, the mass from the adducts would not add up to 3 kDa. Further, a ladder of bands would be expected if F. tularensis rUsp/His6 were a mixture of adducts with various masses, a result that was not observed. As depicted in Fig 1, immunoblot analysis consistently showed that F. tularensis rUsp/His6 migrated as a single band at 33 kDa, while endogenous F. tularensis Usp migrated as a single band at 30 kDa.

Our search for an alternative explanation led us to examine the possibility that F. tularensis rUsp/His6 comprised extra amino acids originating from the pET28 expression vector. When we updated the F. tularensis rUsp/His6 amino acid sequence to include residues from pET28, the mass spectrometry data confirmed that extra pET28 residues are present at the C-terminus. The C-terminus residues in F. tularensis rUsp/His6 included DPAANKARKEAELAAATAEQ, after the expected histidine tag (LEHHHHHH). Trypsin digestion of rUsp/His6 yielded peptide EAELAAATAEQ as shown in Fig 6. These findings demonstrated that a single TGA stop codon in the pET28 expression vector is not sufficient to terminate translation. The predicted molecular weight of F. tularensis rUsp/His6 containing the stop codon readthrough residues was 33.29 kDa, a mass consistent with the Western blot results (Fig 1).

Fig 6. The 20 amino acid stop codon readthrough peptide, DPAANKARKEAELAAATAEQ, was contributed by the pET28a expression vector.

Fig 6

Trypsin digestion of rUsp/His6 yielded the EAELAAATAEQ peptide. The b2-b6 ion series and the y1-y5 ion series in the MS/MS spectrum support the peptide sequence. The MH+2 parent ion is at m/z 552.2644. The stop codon readthrough is in frame with the histidine tag, adding a total of 28 amino acids to the C-terminus of rUsp/His6 via peptide LEHHHHHHDPAANKARKEAELAAATAEQ.

Discussion

Mechanisms of post-translation modifications in bacteria

In bacteria, lysine acetylation occurs by two mechanisms: enzymatically with acetyltransferase and nonenzymatically with acetyl phosphate [26]. E. coli encode 26 acetyltransferases of which half have been experimentally validated to have enzymatic activity and are predicted to have different targets [27]. Acetylation is one of the mechanisms by which bacteria respond to stress, nutritional status, and inflammation [28], and can influence virulence and host-pathogen interactions [26, 29, 30]. Protein acetylation can occur on Nα-amino groups such as the N-termini of proteins or on Nε-amino groups of lysine residues. Nα-acetylation can tag the protein for degradation, which is often irreversible, whereas Nε-acetylation of lysine residues is reversible and dynamic, depending on the environment [27, 31]. Nε-acetylation of lysine residues can affect many protein functions, including enzymatic activity, stability, and protein- or DNA-protein interactions [32]. Nevertheless, most post-translational modifications are substoichiometric, which means that modifications are not present on all molecules of a given protein [30].

Polyaminated proteins are the product of the enzymatic activity of transglutaminase. Certain bacteria such as Streptomyces mobaraensis (formerly classified as Streptoverticillium mobaraense) secrete huge amounts of inactive pro-transglutaminase. Protease cleavage of an N-terminal peptide activates microbial transglutaminase [33]. Active transglutaminase can polyaminate and crosslink proteins. The crosslinking activity of microbial transglutaminase is used in the production of sausages, ham, yogurt, cheese, imitation crab and other foods [34]. E. coli strain K-12 is not known to have transglutaminase activity, although the UniProt database lists four entries for transglutaminase-like domain-containing proteins. Hemolytic strains of E. coli do have transglutaminase activity in a cytotoxic necrotizing factor that activates small Rho GTPases by deamidating Gln 63 [35]. Our report of polyaminated recombinant rUsp/His6 protein provides evidence for low levels of transglutaminase activity in E. coli strain BL21(DE3).

Active transglutaminase in bacteria generally is associated with the bacterial cell wall, where its crosslinking activity strengthens the cell wall. However, transglutaminase activity is tightly controlled. Uncontrolled transglutaminase activity in an E. coli expression system was lethal to the bacteria due to excessive transglutaminase catalyzed protein crosslinking [33, 36].

Mechanisms of stop codon readthrough in bacteria

Proper termination of protein synthesis occurs at UAA, UAG, and UGA stop codons, and is catalyzed by release factors [37]. Stop codon readthrough has been described for proteins expressed in both bacteria and eukaryotes [11]. One mechanism that contributes to stop codon readthrough is due to suppressor tRNAs that carry mutations in the anticodon for the relevant stop codon [38]. Environmental and metabolic stress substantially increase the level of stop codon readthrough during protein synthesis in bacteria, promoting translational heterogeneity and phenotypic diversity [14]. However, since the rUsp/His6 protein has a single, defined mass of 33 kDa, the cause of stop codon readthrough is not likely explained by stress.

The pET28 plasmid has TGA in the position of the readthrough stop codon, which follows the histidine tag (see Addgene web page https://www.addgene.org/browse/sequence_vdb/2565/). The TGA stop codon is susceptible to readthrough [11, 12]. Therefore, the use of TGA and other factors (i.e., mRNA structural features) during translation apparently contributed to stop codon readthrough in E. coli.

Evaluation of protein prospector search results

We rejected most of the computer-generated candidates for adducts because they did not provide convincing evidence for the adduct. Acceptable spectra have a Matched Intensity in the 60 to 100% range. We rejected a candidate when we could fit unassigned peaks in the MS/MS spectrum to an alternative amino acid sequence. The most convincing MS/MS spectra have ions on both sides of the modified residue as in Fig 2 for acetylated K238 in F. tularensis rUsp/His6, and no peaks that can be assigned to an alternative amino acid sequence.

Even though proteins were digested with a high-quality trypsin, nonspecific cleavage occurs. We find more peptides with a NO ENZYME search than with a TRYPSIN search. Peptides whose C-terminal is not positively charged, such as Glu (E) in Fig 5 do not fragment to yield y1 ions, in all likelihood because the amino acid at this position does not support a positive charge. Nevertheless, we accept the spermine (+185 Da) adduct on Q60 of F. tularensis rUsp/His6 in Fig 5 because the added mass of +185 Da on y2 to y10 ions strongly supports the spermine (+185 Da) adduct. In addition, there are no unassigned peaks in this spectrum to offer an alternative interpretation.

Examples of rejected hits

Protein Prospector suggested Q33 of F. tularensis rUsp/His6 was modified by spermidine in peptide NNTQ(128.1)VLKVVTVIDCVAPFAPSIVDFQHSIEQEAK. The matched series intensity was 100%, which means all peaks in the MS/MS spectrum were assigned to the modified peptide. Supporting b- ions ranged from b5 to b13, but there were no b1, b2, b3, or b4(128) ions. As it turned out, the residue to the N-terminal side of the peptide is K. A missed cleavage would yield peptide KNNTQVLKVVTVIDCVAPFAPSIVDFQHSIEQEAK. The dehydro-mass of K is 128 Da. Therefore, the mass of KNNTQ is equal to the mass of NNTQ(128). Both the adduct and the missed cleavage peptide fit the MS/MS spectrum and have the same parent ion mass of m/z 3925.05. This ambiguity in interpretation led us to reject this candidate.

Spermidine modified Q60 in peptide APSIVFQHSIEQ(128.1)EA of rUsp/His6 was rejected despite the presence of supporting y3 to y13 ions. The MS/MS spectrum in Fig 7 has no y1 or y2 ions. It was noticed that an added K at the C-terminus adds 128 Da. The MS/MS spectrum had actually identified APSIVFQHSIEQEAK. The 128 Da mass was from K and not from spermidine.

Fig 7. False positive spermidine adduct (+128) on Q60 of rUsp/His6.

Fig 7

On first inspection the y3 to y13 ion series, the score of 50.2 and the matched intensity of 89% gave strong support for a spermidine adduct on Q60. However, the absence of y1 and y2 ions leaves open an alternative interpretation. Lysine and spermidine have the same 128 Da mass. The parent ion for the spermidine adduct above is m/z MH+3 600.3044 and that of the Usp peptide APSEIVDFQHSIEQEAK, including K at the C-terminus, is almost identical at m/z 600.3039. It was concluded that the evidence for a spermidine adduct on Q60 is inadequate. The candidate adduct was therefore rejected.

Supporting information

S1 Raw images. Uncropped, unadjusted Western blot in text as Fig 1.

(DOCX)

pone.0299701.s001.docx (2.7MB, docx)
S1 Graphical abstract

(TIF)

pone.0299701.s002.tif (3.5MB, tif)

Acknowledgments

MS data were obtained by the Mass Spectrometry and Proteomics Core Facility at the University of Nebraska Medical Center. Protein Prospector programs are available at no cost https://prospector.ucsf.edu[prospector.ucsf.edu]. Protein Prospector was developed in the University of California San Francisco Mass Spectrometry Facility, directed by Dr. Alma Burlingame. Proteomics Toolkit http://db.systemsbiology.net:8080/proteomicsToolkit was used to confirm amino acid sequences in MS/MS spectra.

Abbreviations

Usp

universal stress protein

rUsp/His6

recombinant universal stress protein with a C-terminal histidine tag

LC-MS/MS

liquid chromatography tandem mass spectrometry

Data Availability

The mass spectrometry data have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository with the data set identifier PXD045793.

Funding Statement

Department of Defense/Defense Threat Reduction Agency (DTRA) grant number HDTRA1-22-1-0022 (MAL) dtra.mil The funder played no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. Cancer Center Support Grant P30CA036727 from National Institutes of Health The funder played no role in the study design, data collection and analysis, decision to publish or preparation of the manuscript.

References

  • 1.Tkaczuk KL, Shumilin IA, Chruszcz M, Evdokimova E, Savchenko A, Minor W. Structural and functional insight into the universal stress protein family. Evol Appl. 2013;6(3):434–49. doi: 10.1111/eva.12057 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Vollmer AC, Bark SJ. Twenty-Five Years of Investigating the Universal Stress Protein: Function, Structure, and Applications. Adv Appl Microbiol. 2018;102:1–36. doi: 10.1016/bs.aambs.2017.10.001 [DOI] [PubMed] [Google Scholar]
  • 3.Nachin L, Nannmark U, Nystrom T. Differential roles of the universal stress proteins of Escherichia coli in oxidative stress resistance, adhesion, and motility. J Bacteriol. 2005;187(18):6265–72. doi: 10.1128/JB.187.18.6265-6272.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Nystrom T, Neidhardt FC. Cloning, mapping and nucleotide sequencing of a gene encoding a universal stress protein in Escherichia coli. Mol Microbiol. 1992;6(21):3187–98. doi: 10.1111/j.1365-2958.1992.tb01774.x [DOI] [PubMed] [Google Scholar]
  • 5.Hingley-Wilson SM, Lougheed KE, Ferguson K, Leiva S, Williams HD. Individual Mycobacterium tuberculosis universal stress protein homologues are dispensable in vitro. Tuberculosis (Edinb). 2010;90(4):236–44. doi: 10.1016/j.tube.2010.03.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Sharma D, Lata M, Faheem M, Khan AU, Joshi B, Venkatesan K, et al. Role of M.tuberculosis protein Rv2005c in the aminoglycosides resistance. Microb Pathog. 2019;132:150–5. doi: 10.1016/j.micpath.2019.05.001 [DOI] [PubMed] [Google Scholar]
  • 7.Yu CH, Chou CC, Lee YJ, Khoo KH, Chang GD. Uncovering protein polyamination by the spermine-specific antiserum and mass spectrometric analysis. Amino Acids. 2015;47(3):469–81. doi: 10.1007/s00726-014-1879-8 [DOI] [PubMed] [Google Scholar]
  • 8.Szewczak J, Bierczynska-Krzysik A, Piejko M, Mak P, Stadnik D. Isolation and Characterization of Acetylated Derivative of Recombinant Insulin Lispro Produced in Escherichia coli. Pharm Res. 2015;32(7):2450–7. doi: 10.1007/s11095-015-1637-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Pflumm MN, Gruber SC, Tsarbopoulos A, Wylie D, Pramanik B, Bausch JN, et al. Isolation and characterization of an acetylated impurity in Escherichia coli-derived recombinant human interleukin-10 (IL-10) drug substance. Pharm Res. 1997;14(6):833–6. doi: 10.1023/a:1012127228239 [DOI] [PubMed] [Google Scholar]
  • 10.Kuczynska-Wisnik D, Moruno-Algara M, Stojowska-Swedrzynska K, Laskowska E. The effect of protein acetylation on the formation and processing of inclusion bodies and endogenous protein aggregates in Escherichia coli cells. Microb Cell Fact. 2016;15(1):189. doi: 10.1186/s12934-016-0590-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Dabrowski M, Bukowy-Bieryllo Z, Zietkiewicz E. Translational readthrough potential of natural termination codons in eucaryotes—The impact of RNA sequence. RNA Biol. 2015;12(9):950–8. doi: 10.1080/15476286.2015.1068497 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Baggett NE, Zhang Y, Gross CA. Global analysis of translation termination in E. coli. PLoS Genet. 2017;13(3):e1006676. doi: 10.1371/journal.pgen.1006676 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Loughran G, Chou MY, Ivanov IP, Jungreis I, Kellis M, Kiran AM, et al. Evidence of efficient stop codon readthrough in four mammalian genes. Nucleic Acids Res. 2014;42(14):8928–38. doi: 10.1093/nar/gku608 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Zhang H, Lyu Z, Fan Y, Evans CR, Barber KW, Banerjee K, et al. Metabolic stress promotes stop-codon readthrough and phenotypic heterogeneity. Proc Natl Acad Sci U S A. 2020;117(36):22167–72. doi: 10.1073/pnas.2013543117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Rosenberg AH, Lade BN, Chui DS, Lin SW, Dunn JJ, Studier FW. Vectors for selective expression of cloned DNAs by T7 RNA polymerase. Gene. 1987;56(1):125–35. doi: 10.1016/0378-1119(87)90165-x [DOI] [PubMed] [Google Scholar]
  • 16.Shilling PJ, Mirzadeh K, Cumming AJ, Widesheim M, Kock Z, Daley DO. Improved designs for pET expression plasmids increase protein production yield in Escherichia coli. Commun Biol. 2020;3(1):214. doi: 10.1038/s42003-020-0939-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chamberlain RE. Evaluation of Live Tularemia Vaccine Prepared in a Chemically Defined Medium. Appl Microbiol. 1965;13(2):232–5. doi: 10.1128/am.13.2.232-235.1965 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Harlow E, Lane D. Antibodies A Laboratory Manual. Cold Spring Harbor Laboratory. 1988:298–9. [Google Scholar]
  • 19.Peeples ES, Schopfer LM, Duysen EG, Spaulding R, Voelker T, Thompson CM, et al. Albumin, a new biomarker of organophosphorus toxicant exposure, identified by mass spectrometry. Toxicol Sci. 2005;83(2):303–12. doi: 10.1093/toxsci/kfi023 [DOI] [PubMed] [Google Scholar]
  • 20.Lockridge O, Schopfer LM. Naturally Occurring Epsilon Gamma Glutamyl Lysine Isopeptide Crosslinks in Human Neuroblastoma SH-SY5Y Cells. ACS Omega. 2022;7(25):21978–86. doi: 10.1021/acsomega.2c02502 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Marakasova E, Ii A, Nelson KT, van Hoek ML. Proteome Wide Profiling of N-epsilon-Lysine Acetylation Reveals a Novel Mechanism of Regulation of the Chitinase Activity in Francisella novicida. J Proteome Res. 2020;19(4):1409–22. [DOI] [PubMed] [Google Scholar]
  • 22.Bienert S, Waterhouse A, de Beer TA, Tauriello G, Studer G, Bordoli L, et al. The SWISS-MODEL Repository-new features and functionality. Nucleic Acids Res. 2017;45(D1):D313–D9. doi: 10.1093/nar/gkw1132 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46(W1):W296–W303. doi: 10.1093/nar/gky427 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Varadi M, Anyango S, Deshpande M, Nair S, Natassia C, Yordanova G, et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 2022;50(D1):D439–D44. doi: 10.1093/nar/gkab1061 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596(7873):583–9. doi: 10.1038/s41586-021-03819-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Luu J, Carabetta VJ. Contribution of N(epsilon)-lysine Acetylation towards Regulation of Bacterial Pathogenesis. mSystems. 2021;6(4):e0042221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.VanDrisse CM, Escalante-Semerena JC. Protein Acetylation in Bacteria. Annu Rev Microbiol. 2019;73:111–32. doi: 10.1146/annurev-micro-020518-115526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Ma Q, Wood TK. Protein acetylation in prokaryotes increases stress resistance. Biochem Biophys Res Commun. 2011;410(4):846–51. doi: 10.1016/j.bbrc.2011.06.076 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ren J, Sang Y, Tan Y, Tao J, Ni J, Liu S, et al. Acetylation of Lysine 201 Inhibits the DNA-Binding Ability of PhoP to Regulate Salmonella Virulence. PLoS Pathog. 2016;12(3):e1005458. doi: 10.1371/journal.ppat.1005458 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Macek B, Forchhammer K, Hardouin J, Weber-Ban E, Grangeasse C, Mijakovic I. Protein post-translational modifications in bacteria. Nat Rev Microbiol. 2019;17(11):651–64. doi: 10.1038/s41579-019-0243-0 [DOI] [PubMed] [Google Scholar]
  • 31.Hu LI, Lima BP, Wolfe AJ. Bacterial protein acetylation: the dawning of a new age. Mol Microbiol. 2010;77(1):15–21. doi: 10.1111/j.1365-2958.2010.07204.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Polevoda B, Sherman F. The diversity of acetylated proteins. Genome Biol. 2002;3(5):reviews0006. doi: 10.1186/gb-2002-3-5-reviews0006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Pasternack R, Dorsch S, Otterbach JT, Robenek IR, Wolf S, Fuchsbauer HL. Bacterial pro-transglutaminase from Streptoverticillium mobaraense—purification, characterisation and sequence of the zymogen. Eur J Biochem. 1998;257(3):570–6. doi: 10.1046/j.1432-1327.1998.2570570.x [DOI] [PubMed] [Google Scholar]
  • 34.Strop P. Versatility of microbial transglutaminase. Bioconjug Chem. 2014;25(5):855–62. doi: 10.1021/bc500099v [DOI] [PubMed] [Google Scholar]
  • 35.Schmidt G, Selzer J, Lerm M, Aktories K. The Rho-deamidating cytotoxic necrotizing factor 1 from Escherichia coli possesses transglutaminase activity. Cysteine 866 and histidine 881 are essential for enzyme activity. J Biol Chem. 1998;273(22):13669–74. doi: 10.1074/jbc.273.22.13669 [DOI] [PubMed] [Google Scholar]
  • 36.Takehana S, Washizu K, Ando K, Koikeda S, Takeuchi K, Matsui H, et al. Chemical synthesis of the gene for microbial transglutaminase from Streptoverticillium and its expression in Escherichia coli. Biosci Biotechnol Biochem. 1994;58(1):88–92. doi: 10.1271/bbb.58.88 [DOI] [PubMed] [Google Scholar]
  • 37.Youngman EM, McDonald ME, Green R. Peptide release on the ribosome: mechanism and implications for translational control. Annu Rev Microbiol. 2008;62:353–73. doi: 10.1146/annurev.micro.61.080706.093323 [DOI] [PubMed] [Google Scholar]
  • 38.Eggertsson G, Soll D. Transfer ribonucleic acid-mediated suppression of termination codons in Escherichia coli. Microbiol Rev. 1988;52(3):354–74. doi: 10.1128/mr.52.3.354-374.1988 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Raw images. Uncropped, unadjusted Western blot in text as Fig 1.

(DOCX)

pone.0299701.s001.docx (2.7MB, docx)
S1 Graphical abstract

(TIF)

pone.0299701.s002.tif (3.5MB, tif)

Data Availability Statement

The mass spectrometry data have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository with the data set identifier PXD045793.


Articles from PLOS ONE are provided here courtesy of PLOS

RESOURCES