Skip to main content
Frontiers in Microbiology logoLink to Frontiers in Microbiology
. 2015 Mar 5;6:149. doi: 10.3389/fmicb.2015.00149

Mining proteomic data to expose protein modifications in Methanosarcina mazei strain Gö1

Deborah R Leon 1,, A Jimmy Ytterberg 1,, Pinmanee Boontheung 1,, Unmi Kim 2,, Joseph A Loo 1,3,4, Robert P Gunsalus 2,4,*, Rachel R Ogorzalek Loo 3,4,*
PMCID: PMC4350412  PMID: 25798134

Abstract

Proteomic tools identify constituents of complex mixtures, often delivering long lists of identified proteins. The high-throughput methods excel at matching tandem mass spectrometry data to spectra predicted from sequence databases. Unassigned mass spectra are ignored, but could, in principle, provide valuable information on unanticipated modifications and improve protein annotations while consuming limited quantities of material. Strategies to “mine” information from these discards are presented, along with discussion of features that, when present, provide strong support for modifications. In this study we mined LC-MS/MS datasets of proteolytically-digested concanavalin A pull down fractions from Methanosarcina mazei Gö1 cell lysates. Analyses identified 154 proteins. Many of the observed proteins displayed post-translationally modified forms, including O-formylated and methyl-esterified segments that appear biologically relevant (i.e., not artifacts of sample handling). Interesting cleavages and modifications (e.g., S-cyanylation and trimethylation) were observed near catalytic sites of methanogenesis enzymes. Of 31 Methanosarcina protein N-termini recovered by concanavalin A binding or from a previous study, only M. mazei S-layer protein MM1976 and its M. acetivorans C2A orthologue, MA0829, underwent signal peptide excision. Experimental results contrast with predictions from algorithms SignalP 3.0 and Exprot, which were found to over-predict the presence of signal peptides. Proteins MM0002, MM0716, MM1364, and MM1976 were found to be glycosylated, and employing chromatography tailored specifically for glycopeptides will likely reveal more. This study supplements limited, existing experimental datasets of mature archaeal N-termini, including presence or absence of signal peptides, translation initiation sites, and other processing. Methanosarcina surface and membrane proteins are richly modified.

Keywords: S-layers, archaeal surface proteins, Methanosarcina mazei, prokaryotic glycosylation, membrane proteins, concanavalin A

Introduction

Knowledge about Archaea and their proteins is limited, making their characterization important. Fortunately, tools are available to identify proteins at high throughput, while bioinformatic analyses can overlay existing knowledge onto this kingdom. Nevertheless, protein modifications unique to these organisms and/or rare in well-studied microbes may elude us, primarily because high throughput proteomic methods focus on matching peptide fragment data to what can be anticipated, primarily from the genome sequence.

Progress in understanding archaeal cell surface structures has been hindered by the limited availability of experimental results, and any new protein modifications that are revealed may hint at function. Here peptide tandem mass spectrometry (MS/MS) datasets from previous investigations of Methanosarina S-layer and surface-exposed proteins (Francoleon et al., 2009; Rohlin et al., 2012), as well as from mixtures recovered by concanavalin A binding were selected for further analysis, motivated by interest in how protein modifications can impact organisms' interactions with their environment and with other organisms.

The above-mentioned emphasis on matching high throughput proteomic data to predictions means that unassigned mass spectra, (often 90% of all data) are ignored (Savitski et al., 2005; Baumgartner et al., 2008; Falkner et al., 2008; Menschaert et al., 2009; Hahne et al., 2013). In principle, unassigned proteomic data could be a treasure trove. In practice, its value depends on sample complexity (whether it contains ~5000 or 50,000 tryptic peptides), the protein and/or peptide separation strategies employed for its acquisition [single-dimension liquid chromatography (LC) of tryptic peptides vs. two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) of proteins followed by peptide LC], and on the quality of the tandem mass spectrometry (MS/MS) data.

There are many reasons why peptide tandem mass spectra may be unassigned in these complex experiments:

  1. The spectra may be poor in quality and/or low in information content.

  2. The MS/MS spectra may be derived from a mixture, rather than from a single peptide.

  3. The spectra may be derived from a peptide that is modified in a manner not considered by the algorithm.

High throughput workflows compromise between speed and depth of analysis, with “success” generally assessed by the number of proteins identified at a specified peptide or protein false discovery rate (FDR). This gene-centric approach does not differentiate between modified and/or processed forms of proteins. Typically, only modifications related to sample handling are considered (Schmidt et al., 2005) to balance sensitivity (number of peptides or proteins identified) with specificity (low false discovery rate). If post-translational modifications (PTMs) are pursued in high-throughput environments, modified peptides are typically enriched via PTM-specific immunoprecipitation or other affinity capture methods; e.g., phosphopeptide binding to TiO2.

Skillful data mining can recover valuable information about unanticipated modifications from high throughput proteomic data. Some strategies include:

  1. Performing an error-tolerant search (Creasy and Cottrell, 2002); i.e., a second search limited to proteins detected in the initial search. It attempts to match mass spectra by considering one additional modification-type from the UniMod collection or single amino acid substitutions (Creasy and Cottrell, 2004). It also relaxes enzyme specificity, e.g., only the N- or C-terminus of a peptide must conform to trypsin's known cleavage specificity, rather than both.

  2. Searching against the genome sequence translated in 6-reading frames (three forward and three reverse strand translations). This approach overcomes DNA sequencing errors, missed open reading frames, incorrect start or stop sites, and alternate initiation. It can also be employed in an error-tolerant mode to overcome nucleobase substitutions.

  3. Attempting to match spectra by assuming that one residue's mass has been shifted by some amount (within a specified range) or by clustering related spectra (Savitski et al., 2006; Bandeira et al., 2007; Falkner et al., 2008; Wilhelm and Jones, 2014).

  4. Manual or computer-assisted de novo interpretation of mass spectra.

Evaluating accuracy is challenging for any data mining strategy. When searches are limited to relatively common modifications, it may be possible to calculate separate false discovery rates for the modified peptides. But when almost anything is possible, confirmatory information must be sought elsewhere.

Here, we illustrate how proteomic datasets can be mined to recover information about Methanosarcina mazei protein modifications and describe some characteristic mass signatures that assist in validating that modifications are present. Clearly, methods including antibody blotting, functional group specific staining, and chemical derivatization also provide essential verification. The focus of this manuscript is on mining existing data to recover information about unanticipated modifications. The knowledge may suggest protein forms (proteoforms) to track in future studies, follow-up experiments to confirm the modifications, it may hint at protein function, or it may simply improve protein annotations.

Materials and methods

Cell cultivation

M. mazei Gö1 was grown at 37°C as single cells (non-aggregated) in pH 6.8 basal mineral medium prepared by the Hungate technique and supplemented with 0.05 M methanol as the sole source of carbon and energy (Sowers et al., 1993). Medium osmolarity was defined by 0.2 M NaCl. Cultivation employed 10-mL anaerobic tubes (Difco, Sparks, MD) sealed with a N2-CO2 (4:1) atmosphere. Cultures were harvested at an average OD600 of ~1.5.

Crude extract/lysate preparation

Eight tubes of 10-mL anaerobic cultures were unsealed and their contents were transferred to 15-mL Falcon™ centrifuge tubes. Cells were sedimented at room temperature for 10 min in a swing bucket rotor at 1125 × g. After centrifugation, 500 μL of chilled lysis buffer was added to each tube [2% (w/v) CHAPS (3-(3-cholamidopropyl) dimethylammonio-1-propanesulfonate) in pH 7.5, 50 mM Tris, 0.15 M NaCl, 1 mM CaCl2, and 1 mM MnCl2 supplemented with 2.25 μL Sigma P8465 protease inhibitor]. The cell pellets were disrupted further by multiple freeze/thaw cycles interspersed with vortexing. The lysates were transferred to 1.5 mL microcentrifuge tubes for centrifugation at 16,000 × g for 15 min at 4°C. The soluble lysate was retained for analysis.

Concanavalin a glycoprotein enrichment

Our previous purification was modified slightly (Francoleon et al., 2009). Briefly, each of four 5-mL centrifuge columns (Pierce, Rockford, IL) was loaded with a 2 mL slurry of Con A-coupled agarose beads (Vector Laboratories, Burlingame, CA). The beads were washed with 3 mL of 50 mM Tris, 0.15 M NaCl (pH 7.5) seven times, followed by six equilibrating washes with 2 mL binding buffer [BB, 50 mM Tris, 0.15 M NaCl, 1 mM CaCl2, 1 mM MnCl2 (pH 7.5)]. After equilibration, 1 mL of lysate and 1 mL of BB buffer were added to each column for incubation in a room temperature rotor. After 30 min, the column flow through was collected by centrifugation and discarded. The protein-bound beads were washed 10 times to minimize non-specific binding: (1) five washes, each with 2 mL of BB buffer supplemented with 0.1 % Tween-20, followed by (2) five 2-mL washes with 50 mM of (NH4)HCO3 (pH 7.8). Glycoproteins were eluted from the lectin media twice: (1) Con A beads were incubated in 2 mL of elution buffer [50 mM (NH4)HCO3/0.2 M methyl-α-D-mannopyranoside/0.2 M methyl-α-D-glucopyranoside (pH 7.8)] for 10 min at room temperature, and centrifuged to recover the eluate. (2) The elution was repeated with 1 mL of buffer. The combined eluate was concentrated to 1.6 μg/μL (Pierce BCA assay) by ultrafiltration through an Amicon® 50 kDa cut-off cellulose membrane (Millipore, Billerica, MA).

Multiple enrichments were also performed on a smaller scale in a manner similar to that described above, but without ultrafiltration.

In-solution trypsin, Glu-C and Asp-N proteolysis

Each proteolytic digestion used 50 μL of concentrated Con A eluate (~78 μg of total protein) which, prior to digestion, was precipitated at −20°C overnight in 9 volumes of chilled acetone. Protein precipitate was recovered by centrifugation at 4°C, 16,000 × g for 20 min. Pellets were washed in 500 μL of chilled 80% acetone/10% methanol/0.2% acetic acid.

For trypsin digestion, the protein was resuspended in 20 μL of room temperature dimethyl sulfoxide (DMSO) with 600 rpm shaking for 30 min (Ytterberg et al., 2006). The solution was diluted to 30% DMSO/50 mM (NH4)HCO3 and sequencing grade trypsin (Promega, Madison, WI) was added at a 1:20 enzyme:protein ratio (w/w). Digestion proceeded overnight at 37°C with 300 rpm shaking.

For Glu-C digestion, precipitated protein was resuspended in 131 μL of pH 7.8, 25 mM (NH4)HCO3and 1:20 (w/w) Glu-C:protein (sequencing grade Staphylococcus aureus Protease V-8, Roche, Indianapolis, IN). The reaction proceeded for 4 h with 300 rpm shaking in a 25°C incubator.

Asp-N cleavage employed sequencing grade Pseudomonas fragi Asp-N (Roche) at 1:20 Asp-N:protein (w/w). The protein pellet, resuspended in 131 μL of pH 7.5, 25 mM Tris buffer, was incubated for 4 h at 37°C with 300 rpm shaking.

Nano-HPLC and data dependent MS/MS

Aliquots (~5 μg) of trypsin- and Glu-C-digested peptides were dried by vacuum centrifugation and resuspended in 125 μL of 5% formic acid (FA) to ~40 ng/μL. Asp-N digested proteins (~10 μg) were desalted using a C-18 spin column (Pierce), dried, and resuspended to ~40 ng/μL in 5% formic acid.

Peptide mixtures were analyzed by liquid chromatography-tandem mass spectrometry (LC-MS/MS) with ESI (electrospray ionization) on an Applied BioSystems QSTAR® Pulsar XL quadrupole time-of-flight mass spectrometer equipped with a nanoelectrospray interface (Protana), a Proxeon (Odense) nano-bore stainless steel emitter (30 μm ID), and an LC Packings nano-LC system as described previously (Francoleon et al., 2009). Homemade pre- (150 × 5 mm) and analytical (75 μm × 150 mm) columns were packed with Jupiter Proteo C12 4-μm resin (Phenomenex). Typically 6 μL of sample was loaded onto the precolumn, washed with 0.1% FA for 4 min and transferred to the LC column. The 200 nL/min mobile phase gradient employed 3–6% B in 6 s, 6–24% B in 18 min, 24–36% B in 6 min, 36–80% B in 2 min, and 80% B for 7.9 min. The column was equilibrated with 3% B for 15 min prior to the next run. Eluents used were 0.1% FA (aq) (solvent A) and 95% CH3CN containing 0.1% FA (solvent B).

Peptide product ion spectra were recorded automatically by IDA (information-dependent analysis) software on the mass spectrometer. Protein sequence searches employed a conservative mass tolerance of 0.3 Da for both precursor and product ions, and 1 (trypsin) or 2 (Asp-N and Glu-C) missed cleavages. Proteins hits were accepted based on ≥2 ascribed peptides, at least of one which possessed a MOWSE score ≥26 (p ≤ 0.02) with MuDPIT scoring. Identifications based on single peptides are presented separately. Correspondences between MS/MS spectra and all ascribed sequences were also verified manually.

Nano-HPLC and data independent acquisition MS/MS

The peptide mixtures described above were also analyzed by LC-ESI-MS/MS on a Xevo™ quadrupole time-of-flight MS (Waters Corporation) equipped with a Universal NanoFlow Sprayer interface and pre-cut Pico Tip Emitter (360 μm OD × 20 μm ID, 10 μm tip; 2.5″ long), connected on-line to a nanoACQUITY® UltraPerformance® HPLC system (Waters Corporation). The nanoACQUITY® system was equipped with Waters' 5 μm Symmetry C18, 180 μm × 20 mm reversed-phase trap and 1.7 μm BEH130 C18, 75 μm × 100 mm reversed-phase analytical columns. Both columns were maintained at 40°C. Typically 3 μL of samples were injected onto the precolumn in aqueous 1% CH3CN/0.1% FA at a flow rate of 5 μL/min for 3 min. Mobile phase A was water with 0.1% FA (v/v) and mobile phase B was CH3CN with 0.1% FA. After desalting and concentrating in the trap column, peptides were transferred to the analytical column and resolved by a gradient of 3–60% mobile phase B delivered over 30 min at a flow rate of 300 nL/min, followed by a 15 min wash with 95% B and 15 min re-equilibration at the initial conditions (3% B, 97% A).

The Xevo™ quadrupole time-of-flight MS was operated in positive ion, V-mode with an average resolution of 9500 FWHM. Full scan mass spectra were acquired from 50 to 2000 m/z. LC-MS and LC-MSE data were collected in alternating low and high collision energy modes throughout the run (Silva et al., 2005), with each spectrum acquired for 1 s per mode.

Proteins were identified using the ProteinLynx Global SERVER™ version 2.4 search engine (PLGS, Waters Corporation). All ions were lock mass corrected, de-isotoped, and decombulated (charge state reduced). PLGS software ascribed collision induced dissociation (CID) product ions to their precursor peptides by time-aligning low- and high-energy-detected ions with a retention time tolerance of approximately ±0.05 min. Sequence searches were restricted to fully tryptic products with up to one missed cleavage, variable methionine oxidation and N-terminal Gln/Glu conversion to pyro-Glu, and peptide and product ion tolerances of 10 and 25 ppm, respectively. Proteins hits were accepted based on ≥2 ascribed peptides, each with 3 or more product ions, and at least seven fragment ions per protein. Correspondences between MS/MS spectra and ascribed sequences were also evaluated manually.

Results and discussion

We identified 154 proteins from concanavalin A pull down fractions and cell surface labeling. The following sections describe these proteins and the additional information that can be recovered by data mining.

Identificationl of concanavalin a interacting proteins

The archaeal cytoplasmic membrane has been described as fulfilling the role of the eukaryotic endoplasmic reticulum (Yurist-Doutsch et al., 2008) because archaeal glycosylation machinery is membrane-bound. Cell N-linked glycoproteins are expected to localize to the cytoplasmic membrane or the associated outer cell envelope region (Eichler, 2003; Albers et al., 2006; Messner, 2009). Thus, glycoprotein capture methods (e.g., lectin affinity chromatography) complement cell surface labeling (Francoleon et al., 2009) for enriching archaeal surface and membrane proteins.

To support ongoing studies characterizing protein glycosylation, M. mazei cell lysate proteins were lectin affinity-captured through direct and indirect binding using Con A, for which the subset of direct binders typically includes glycoproteins containing α-D-mannose and α-D-glucose (Kornfeld and Ferris, 1975; Baenziger and Fiete, 1979; Debray et al., 1981; Jaipuri et al., 2008). Here, we describe what can be learned from LC-MS/MS analyses of the Con A eluate with subsequent data mining. Further studies that employed additional dimensions of separation [e.g., hydrophilic interaction chromatography (HILIC) followed by reversed phase liquid chromatography], in concert with advanced ion activation techniques including infrared multiphoton dissociation (Zubarev, 2004; Cooper et al., 2005) will be discussed elsewhere (Leon et al., in preparation). Those methods greatly increase the numbers of glycoproteins observed, improve ability to localize glycosylation sites, and characterize individual glycan chains. Nevertheless, useful information can be gleaned from these initial LC-MS/MS experiments.

Not all proteins recovered from the Con A eluate are necessarily glycosylated. Indirect binding partners do not bind Con A directly, but associate with one or more direct (glycosylated) interactors; e.g., other subunits of a non-covalent complex. From M. mazei Gö1, 99 Con A eluate proteins were identified by LC-MS/MS with ≥2 peptides (Table 1). An additional 55 protein identifications based on a single peptide are presented in Supplemental Table S-1.

Table 1.

proteins detected by two or more peptides from concanavalin A eluate.

Uniprot accession MM# Name Unique peptides observed Mascot protein score Exprota SignalPb SecPc LipoPd
Q8Q0X9 MM0001 Dipeptide ABC transporter 5 229 1   Y  
Q8Q0X8 MM0002 Dipeptide ABC transporter 20 555 1   Y  
Q8Q0S0 MM0066 Hypothetical protein 5 216 2 Y Y  
Q8Q0K8 MM0128 Lon protease, membrane-bound 8 276        
Q8Q0F2 MM0184 Rps3Ae 2 51        
Q8Q035 MM0303 Hypothetical protein 2 39 1   Y  
Q8PZM5 MM0467 S-layer paralog 17 606   Y Y  
Q8PZ66 MM0628 Methylenetetrahydromethanopterin reductase 6 276        
Q8PZ61 MM0633 Multiheme cytochrome c, hypothetical 2 90     Y  
Q8PZ47 MM0647 Oligosaccharyl transferase 2 91     Y  
Q8PYZ8 MM0700 HppA 3 39   Y* Y  
Q8PYY5 MM0714 Fructose bis phosphate aldolase 2 29        
Q8PYY3 MM0716 Hypothetical protein 3 105 1 Y Y  
Q60187 MM0779 AtpB 6 189        
Q60186 MM0780 AtpA 11 368        
Q60184 MM0782 AtpC 4 154        
Q60183 MM0783 AtpE 4 34,13        
F1SVJ4 MM0784 AtpK 5 365 1 Y* Y  
O59659 MM0785 AtpI 4 115     Y  
Q8PYS4 MM0786 AtpH 2 75        
Q8PYJ5 MM0866 Periplasmic serine protease 4 242 1   Y  
Q8PYIO MM0882 Hypothetical protein 3 106     Y  
Q8PYE0 MM0922 Eif5a, IF5A 2 131        
Q8PY73 MM0991 Thioredoxin 5 150 2 Y   Y
Q8PY52 MM1012 Rpl1P 3 62        
Q8PY39 MM1025 ThiC 3 58        
Q8PXZ6 MM1070 MtaA1 methylcobalamin:CoM methyltransferase 7 374        
Q8PXZ5 MM1071 4Fe:4S ferredoxin, hypothetical 2 121        
Q8PXZ3 MM1073 MtaC2 methyl corrinoid protein 6 230        
Q8PXZ2 MM1074 MtaB2 9 250        
Q8PXZ1 MM1075 Putative regulatory protein 2 92 1   Y  
Q8PXX0 MM1096 Thermosome, gamma subunit 5 137        
Q8PXW0 MM1106 Putative phosphoserine phosphatase 5 81        
Q8PXV8 MM1108 Mtd, F420-dependent methylenetetrahydromethanopterin dehydrogenase 3 126        
Q8PXI2 MM1236 HtpX protease 2 25   Y Y  
Q8PXH8 MM1240 Methyl CoM reductase, alpha subunit 7 237        
Q8PXH7 MM1241 Methyl CoM reductase, gamma subunit 10 563        
Q8PXH4 MM1244 Methyl CoM reductase, beta subunit 13 467        
Q8PXG8 MM1250 Cation transporter 2 66   Y* Y  
Q8PXG7 MM1251 Cation transporter 2 109   Y* Y  
Q8PX89 MM1333 Zinc ABC transporter 10 486 2 Y   Y
Q8PX82 MM1340 Pyruvate synthase, alpha subunit 2 61        
Q8PX60 MM1362 Putative aliphatic sulfonate binding 11 556 2 Y Y  
Q8PX58 MM1364 S-layer paralog 28 1093 1 Y Y  
Q8PX43 MM1379 Thermosome, alpha subunit 10 329        
Q8PWZ8 MM1424 SecF 2 85 1 Y Y  
Q8PWZ7 MM1425 SedD 3 144   Y Y  
Q8PWX0 MM1456 USP-like 4 153        
P80650 MM1540 MtrH 9 276        
P80656 MM1541 MtrG 4 192     Y  
P80654 MM1542 MtrF 2 105     Y  
O59640 MM1543 MtrA 6 309        
P80655 MM1544 MtrB 4 222     Y  
O59638 MM1545 MtrC 5 127     Y  
P80653 MM1546 MtrD 2 133   Y Y  
P80651 MM1547 MtrE 4 117   Y Y  
Q8PWN5 MM1549 Na/proline symporter 4 211   Y* Y  
Q8PWN3 MM1551 Hypothetical protein 4 75 1 Y Y  
Q8PWM3 MM1561 ABC transporter, tungsten binding protein 2 72   Y* Y  
Q8PWJ6 MM1589 Surface layer protein B 3 128 1 Y Y  
Q8PWE2 MM1647 MtaB1 14 357        
Q8PWE1 MM1648 MtaC1 10 533        
Q8PWA1 MM1695 Hypothetical protein 10 425 2 Y Y Y
Q8PW83 MM1713 Hypothetical protein 2 173     Y  
Q8PW41 MM1760 Rps2p 30S ribosomal protein S2 3 104     Y  
Q8PW15 MM1789 Hypothetical protein 2 64     Y  
Q8PVY9 MM1816 Conserved protein 6 238 1 Y Y  
Q8PVW4 MM1843 HdrE, heterodisulfide reductase 6 210   Y Y  
Q8PVV1 MM1859 DdpA, ABC transporter 3 29 1 Y Y  
Q8PVM4 MM1939 Glutamine binding protein 2 115 2 Y   Y
Q8PVI7 MM1976 S-layer protein, (identified) 151 5310 1 Y Y  
Q8PVI6 MM1977 Hypothetical protein 3 118 2 Y Y Y
Q8PVG6 MM1999 Hypothetical protein 4 103 2 Y Y Y
Q8PVG5 MM2000 Hypothetical protein 4 82 2 Y Y Y
Q8PVD2 MM2033 Stomatin-like protein 2 52   Y*    
Q8PVA2 MM2069 Iron III dicitrate binding protein 5 179 2 Y Y Y
Q8PV27 MM2147 SecY 2 67     Y  
Q8PV17 MM2157 Rpsllp, 30S ribosomal protein S11p 2 81     Y  
Q50227 MM2171 Cytochrome b, F420 non-reducing hydrogenase II 2 87     Y  
Q8PUU5 MM2234 Hypothetical protein 2 94 1 Y Y  
Q8PUR8/AAM31960 MM2264 Ef1A 10 301        
Q8PUM8 MM2305 Na/proline symporter 2 51   Y Y  
Q8PUL4 MM2320 EchA 3 75   Y* Y  
Q8PU81 MM2460 Dipeptide oligopeptide binding protein 16 556 2 Y Y Y
F1SVE1 MM2479 FpoO, F420H2 dehydrogenase 2 83        
F1SVH9 MM2481 FpoM, F420H2 dehydrogenase 2 63   Y Y  
F1SVK0 MM2482 FpoL, F420H2 dehydrogenase 2 91   Y Y  
F1SVE0 MM2483 FpoK, F420H2 dehydrogenase 2 64   Y Y  
Q8PU59 MM2487 FpoH, F420H2 dehydrogenase 3 97   Y* Y  
P27094 MM2505 DnaK, chaperone protein 13 451        
Q8PU26 MM2531 Hypothetical protein 3 113 1   Y  
Q8PTZ0 MM2567 ABC transporter, periplasmic binding protein 14 420 2 Y    
Q8PTY1 MM2576 Ferrous iron transport protein B 8 276     Y  
Q8PTS5 MM2637 Ion channel transporter 3 58     Y  
Q8PTQ7 MM2656 Peptidyl-prolyl cis-trans isomerase 2 28        
Q8PTI7 MM2728 Hypothetical protein 2 61 1 Y* Y  
Q8PT25 MM2893 Hypothetical protein 6 193 1 Y Y  
Q8PSQ2 MM3024 Hypothetical protein 3 65 1 Y Y  
Q8PS84 MM3198 Lipoprotein, hypothetical 5 134        
a

Exprot (Saleh et al., 2010); 1, Predicted type I signal peptidase substrate; 2, Predicted type II signal peptidase substrate.

b

SignalP (Bendtsen et al., 2004b); Y, Predicted signal peptide by SignalP 3.0.

c

SecP (Bendtsen et al., 2004a, 2005); Y, Predicted substrate for leaderless secretion by SecretomeP 2.0.

d

LipoP (Juncker et al., 2003); Y, Predicted lipoprotein.

*

Predicted as secreted only by the SignalP eukaryotic predictor.

Standard proteomic search strategies do not reveal unknown glycopeptides, because it is not possible to predict the amount by which peptide masses are incremented. However, manual MS/MS analysis of, e.g., HILIC fractions, reveals some precursor masses (peptides) that are obviously glycosylated, because they dissociate to release low mass-to-charge ratio (m/z) ions, known as oxonium ions, that are characteristic of different sugars (Mechref, 2012). Ions at 163.06 and 127.06 m/z; e.g., signal that hexoses are present, while 204.09, 186.09, and 168.09 m/z reflect present N-acetylhexosamines.

Bona fide glycosylated proteins, revealed by their oxonium ions in peptide MS/MS spectra, were identified from Con A eluate as MM0002, MM0716, MM1364, and S-layer protein MM1976. It is unlikely that these are the only glycoproteins within M. mazei. Indeed, glycoprotein-specific staining (Francoleon et al., 2009) highlights many bands. Supplemental Table S-3 lists, from the proteins observed, those first predicted by SignalP 3.0 to be secreted and then by the NetNGlyc server (Blom et al., 2004) to be potentially glycosylated. The table lists 41 candidate glycoproteins. Interestingly, protein MM0002 was not predicted by SignalP to contain a signal peptide, although it is a candidate for leaderless secretion by SecretomeP, and NetNGlyc does suggest 2 Asn sites as potentially glycosylated. Hence, the list of potentially N-glycosylated proteins can be longer, should proteins secreted leaderlessly also be considered. The predictions from SignalP version 4.0 comprise a subset of the version 3.0 predictions. All but two of the proteins absent from the newer algorithm's list of signal peptide-containing sequences (MM1329 and MM2033) are also candidates for leaderless secretion (according to the SecretomeP algorithm), and thus potentially N-glycosylated. Additional glycoproteins likely await discovery.

It should also be clarified that the tandem MS conditions and protein quantities required for high-throughput peptide identification differ from those employed in oligosaccharide and glycoprotein analysis. Glycoprotein analyses will employ 10–100 fold more material and the MS/MS conditions will be customized for each analyte.

The utility of data mining is that, by employing a relatively simple enrichment method without exhaustive chromatographic purifications, these four proteins (MM0002, MM0716, MM1364, and MM1976) were revealed as glycosylated and bearing at least hexose and N-acetylhexosamine saccharides. It also indicates how glycopeptide knowledge can be extracted from tandem mass spectrometry data, even without upstream enrichment, bonus knowledge when experiments are not specifically targeting glycosylation. Adding chromatographic dimensions will increase the number of glycopeptides detected, because glycopeptide intensities are often suppressed by co-eluting non-glycopeptides, and because the chromatographic conditions that best resolve different glycopeptides differ from those best resolving peptides, in general. Analyses probing more M. mazei glycopeptides and at greater depth are underway.

Prediction of archaeal protein N-termini

Signal peptides are key participants in protein translocation, but our ability to predict them confidently, especially for archaeal species, has been limited by availability of experimental data (Armengaud, 2009). Even among bacteria, e.g., Mycobacterium smegmatis, predictions were found to be erroneous for up to 19% of protein N-termini (Gallien et al., 2009). Complications delineating open reading frames (ORFs) arise when multiple AUG codons lie near the DNA 5' end (ambiguity that often prompts annotators to select the longest open reading frame) (Prats et al., 1992; Kozak, 1997; Meinnel and Giglione, 2008; Fournier et al., 2012), when organisms employ alternative initiation codons; e.g., GUG or UUG (Klunker et al., 2003; Falb et al., 2006; Yamazaki et al., 2006; Meinnel and Giglione, 2008; Running and Reilly, 2009; Elzanowski and Ostell, 2013), or when translation initiates at multiple sites (Prats et al., 1992). Modifications to N-termini are rarely predictable, and their prevalence varies with organism. Strikingly, 14–19% of protein N-termini in halophiles Halobacterium salinarum and Natronomonas pharaonis are acetylated (Falb et al., 2006). Clearly, experimentally characterizing the N-termini of microbial proteins is important.

In a computational approach, SignalP 3.0 (Bendtsen et al., 2004b), LipoP (Juncker et al., 2003), and SecretomeP 2.0 (Bendtsen et al., 2004a, 2005) algorithms were employed to predict secreted proteins, while Exprot predictions for M. mazei Gö1 were obtained from Supplemental information in Saleh et al. (2010). SignalP used gram-positive, gram-negative, and eukaryotic-type models, while SecretomeP was applied employing bacterial models (Bendtsen et al., 2004a, 2005). Predictions for proteins recovered by Con A binding are displayed in Supplemental Table S-2. Predicted and experimental results are compared in the following section and indicate a tendency of prediction algorithms to over-predict signal peptide excision.

Experimentally detected N- and C-termini

Clearly, experimental approaches tailored to recovering as many N-terminal peptides as possible (Ogorzalek Loo et al., 2002; Gevaert et al., 2003; Dormeyer et al., 2007; Shen et al., 2007; Russo et al., 2008; Yamaguchi et al., 2008; Gallien et al., 2009; Xu and Jaffrey, 2010; Fournier et al., 2012; Kim et al., 2013; Venne et al., 2013) provide large datasets for evaluating and enhancing prediction algorithms and improving protein database annotations. Nevertheless, datasets acquired by other experimental approaches with different goals can be harvested to yield equivalent information for a smaller number of proteins, some of which are missed by large-scale “terminalomics” studies, especially because many large-scale approaches recover only free amino termini. Data harvests are also more likely to reveal instances where multiple N-terminal forms are present (e.g., modified and unmodified).

Information about protein N-termini is not automatically returned by database searching algorithms. Although most algorithms now consider both excised and retained initiator methionines when attempting to match MS/MS spectra, N-terminal acetylation is only considered if specified in the search parameters. Because each variable modification (i.e., one which may be present or absent) that must be considered in the search process adds to the time required for completion and often reduces specificity, only abundant variable modifications are usually considered by high-throughput proteomics studies.

From our Con A eluate studies, some LC-MS/MS spectra spanned M. mazei protein N- or C-termini, allowing us to compile that information in Table 2. M. acetivorans C2A and M. mazei Gö1 N- and C-termini information obtained previously (Francoleon et al., 2009), are also included. Information was recovered by semi-tryptic and error-tolerant searches. Semi-tryptic searches seek matches for MS/MS spectra to peptides in which only one terminus matches trypsin's known cleavage specificity. Examples may include (i) peptides with non-Lys or Arg C-termini, but with N-termini reflecting cleavage after Lys or Arg, or (ii) peptide N-termini inconsistent with cleavage after Lys or Arg, but with C-terminal Lys or Arg. Error-tolerant searches, described in the Introduction, consider a large range of potential modifications.

Table 2.

M. mazei and M. acetivorans protein termini recovered.

graphic file with name fmicb-06-00149-i0001.jpg

*Several LC-MS/MS experiments were unable to differentiate between the two proteins; none of the 6 peptides identified was unique.

Red entries correspond to identifications based on a single peptide.

Of M. mazei protein N-termini recovered by concanavalin A binding, only S-layer protein MM1976 underwent signal peptide excision. These experimental results contrasted with those from prediction algorithms SignalP 3.0 (Bendtsen et al., 2004b) and Exprot (Saleh et al., 2010), which were found to over-predict the presence of signal peptides. Although SignalP 3.0 correctly predicted the MM1976 signal peptide, it also predicted leaders for 7 other proteins, while Exprot predicted signal peptides for 6 other proteins (Supplemental Table S-2). The newer algorithm SignalP 4.0 predicted leaders for only MM1362 and MM1547, in addition to MM1976. The poor correlation between prediction and experiment underscores previous conclusions about major problems predicting proteins lacking signal peptides (Antelmann et al., 2001).

Interestingly, SecretomeP 2.0, a machine-learning approach developed to predict non-classically secreted proteins in mammals and bacteria; i.e., proteins exported without a classical N-terminal signal peptide, predicted leaderless secretion of 14 proteins in Supplemental Table S-2 (Bendtsen et al., 2004a). Overlap for 8 of these SecretomeP 2.0 predictions with those by SignalP 3.0 is reasonable, because SecretomeP was trained on datasets of secreted protein sequences that had their signal peptides deleted. Detection of the other six proteins (MM0866, MM1009, MM1075, MM1221, MM1542, and MM1362) by our study would seem to verify these SecretomeP predictions.

M. mazei N-terminal peptides were recovered from 28 proteins over the course of this and previous studies. These protein identifications were supported by MS/MS spectra from multiple peptides in all but 3 cases. (Observing multiple peptides from a protein is one criterion for assessing confidence in the protein's identification.) Table 2 also includes data for 3 M. acetivorans N-termini. Heterogeneous N-termini were observed from 4 of the 31 proteins; e.g., Acetyl-VDAASTGLFLDAAGMK and Acetyl-MVDAASTGLFLDAAGMK were observed from A1A0 H+ ATPase subunit K (MM0784), indicating partial methionine excision prior to acetylation. MM0784 matched predictions in all other respects; the 4 tryptic peptides observed verified 100% of the 8-kDa proteolipid's sequence. Strong b1 ions (i.e., peptide fragments corresponding to protonated Ac-Val - H2O or protonated Ac-Met – H2O (m/z 142.09 and 174.06, respectively) in the MS/MS spectra confirmed the modification as N-acetylation (Yalcin et al., 1995). Because b1 ions are generally observed in MS/MS spectra of N-terminally acetylated peptides, but otherwise relatively rare, their presence provides powerful validation of the modified peptide that is independent of the statistical score; i.e., search algorithms do not accord special significance to b1 ions.

Partial methionine processing has been noted in other archaeal species (Falb et al., 2006). That the N-terminus of the A1 ATPase proteolipid subunit was found to be blocked was not surprising; blocked N-termini have frequently been observed for F0 proteolipids; e.g., N-formylmethionine in bacteria (E. coli and Bacillus, UniProt Accessions P68699 and P00845, respectively), yeast mitochondria (Sebald et al., 1979), and wheat (Howe et al., 1982) and spinach (P69447) chloroplasts.

Putative regulatory protein MM1075 (MtaR) was detected as M.SENAGTSTVIVDK (where M.S denotes methionine excision) and a formylated version, M.SENAGTSTVIVDK, modified at S7 or T8. MS/MS spectra revealed ions y1y5, and y7y11, localizing the modification to S7 or T8. (See Figure 1). Technically, TE, or SD substitutions or C2H4 addition would also match the incremented mass, but the multiple base substitutions required to convert a Thr (ACC codon) to Glu (GAA/GAG) or a Ser (UCU) to Asp (GAU/GAC) are not easily reconciled, leading us to favor interpretation as formylation. High mass accuracy measurements can distinguish addition of CO (formylation) from C2H4 (27.995 vs. 28.031 Da).

Figure 1.

Figure 1

N-terminal peptide of M. mazei MM1075 (MtaR). (A) unmodified, and (B) O-formylated at Ser7 or Thr8. Unidentified m/z 673, *, lies 1 Da below the predicted m/z for unmodified y6.

O-formylation may be important to controlling MM1075's function, adapting cells to acetate- and methanol-dependent growth. It's mRNA levels are 200–500 times higher in methanol- vs. trimethylamine-grown cells, and still higher for acetate culture (Hovey et al., 2005; Krätzer et al., 2009), while the operon's other genes, MM1073 and MM1074, are among the most highly regulated genes known in the Archaea (Bose et al., 2006). It will be interesting to monitor modifications of MM1075 as culture conditions are varied. For example, evidence that the ratio of formylated:unformylated MM1075 varies with; e.g., substrate or with length of time since the substrate was switched, would support a role in adaptation for the modification.

Surface layer protein modifications

The M. mazei sheath or S-layer protein, MM1976, is one of the most abundant proteins made by the cell (Francoleon et al., 2009; Rohlin et al., 2012). Con A binding enriched cell lysates for this protein, permitting characterization of low stoichiometry modifications. N-termini for the S-layer protein were especially varied, although all were consistent with signal peptide cleavage after residue 24. By abundance, the major N-terminal peptide observed was ADVIEIR, although peptides 14, 40, and 42 Da heavier were also found. The modified peptides followed ADVIEIR in elution by <1, 8, and 3 min, respectively.

N-terminal addition of 42 Da, localized by y1y6 and b2 ions, (see Francoleon et al., 2009, Figures 3C,D) was initially attributed to α-amino acetylation. However, none of the tandem mass spectra acquired for this precursor yielded a b1-ion, generally considered diagnostic of N-terminal acetylation (Yalcin et al., 1995). Careful mass measurements on b2 product ions better matched modifications of composition C2H2O, rather than C3H6. The M. acetivorans ortholog, MA0829, also displayed evidence of signal peptide cleavage, yielding the N-terminal tryptic peptides VDVIEIR and a +42 Da variant, similarly lacking its b1 product ion by MS/MS (Francoleon et al., 2009 see Francoleon et al., 2009, Figures 3A,B). Further investigation is required to confidently ascribe N-terminal modifications for these low abundance variants.

MS/MS spectra suggest that the M. mazei +40 Da modification also localizes to the N-terminus, as y1y6, a2, and b2b5 ions were observed. Again, the b1-ion was not seen. The late elution of the +40 Da species relative to unmodified ADVIEIR, leads us to questions whether it might arise from in-source collision-induced dissociation (CID) of an exceptionally labile, hydrophobic group, leaving behind only a residual −N = CH-CH = O, −N = C(CH3)2, or −N = CH-CH2-CH3 N-terminus. Available mass accuracy narrowed consideration to the latter two possibilities. As an alternative to production by CID, the C3H4 increment could correspond to a Schiff base formed by addition of propionaldehdye, although the mechanism for such a modification is unclear. Further effort is required to characterize this C3H4 modification.

In some MM1976 ADVIEIR peptides, Glu5 was incremented by 14 Da, consistent with methyl esterification. Elsewhere, artifactual methyl adducts were attributed to incubation in acidic methanol during gel fixation or staining (Parker et al., 1998; Xing et al., 2008). Here the analyses displaying the modification were performed on proteins digested in solution; i.e., not subjected to staining. Their only exposure to methanol was shorter, at lower concentration and at lower temperature than conditions known to esterify. Thus, the 14 Da adducts are unlikely to be artifacts. Other instances of relevant methyl esters have been described previously (Hoelz et al., 2006).

Other modified proteins

The predicted N-terminal peptide for MM1540, subunit H of tetrahydrosarcinopterin S-methyl transferase (MtrH) was confirmed to be MFKFDKKQE. Interestingly, peptide M. ASAWDWLR (residues 232–239) was also observed, potentially reflecting protein processing, anomalous cleavage, or alternate initiation, although we could find no rationalization for the latter possibility. That MM1540 is the catalytic subunit of the S-methyl transferase and binds methyl tetrahydrosarcinopterin (Hippler and Thauer, 1999) encourages speculation that an inadvertent methyl transfer to Met231 instead of the coenzyme M thiol might lead to a sulfonium-activated cyanogen bromide-like cleavage at the observed position.

The N-terminal peptide of M. acetivorans C2A MA0456 (MtaC1, methanol-5-hydroxybenzimidazolyl cobamide co-methyl transferase) was previously recovered as MLDFTEASLK and in its methionine sulfoxide form. Methionines are often oxidized under experimental conditions. A related peptide 58-Da heavier than the unmodified species was also observed (Figure 2). Tandem MS of the latter peptide localized the +58-Da to the first residue, revealing intense 190- and 126-Da product ions, consistent with the b1 ion from an N-terminally acetylated peptide and a corresponding 64-Da neutral loss product, unique to methionine sulfoxide. As discussed earlier, b1 observations strongly suggest N-terminal acetylation. Larger b-ions also showed 64-Da neutral loss products, establishing the variant M. acetivorans peptide as Ac-MoxLDFTEASLK. Peptide Ac-MoxLDFTEASLKK, was also observed. Previous mass analyses of intact M. acetivorans proteins reported only the free amino terminal form (Patrie et al., 2006). Interestingly, we observed only non-acetylated MLDFTEASLK from the M. mazei ortholog, MM1648, despite the larger protein quantity available for analysis. Note that M. mazei initiates translation 10 residues downstream of the originally annotated position) (Deppenmeier et al., 2002).

Figure 2.

Figure 2

N-terminal peptides of M. acetivorans MA0456 (MtaC1). (A) Unmodified MLDFTEASLK, (B) Low abundance variant Ac-MoxLDFTEASLK. Intense 190-Da ions correspond to the b1 product from the N-terminally acetylated peptide. Larger b-ions show 64-Da neutral loss products characteristic of methionine sulfoxide. [Based on its measured mass, the product ion observed at m/z 129.11 is attributed to y1–H2O (m/z 129.10), rather than unmodified b1 (m/z 129.01)].

Careful data mining revealed additional modifications to MA0456 (MtaC1). Three versions of peptide 149–159 were found: ANGYDVVDLGR, the Asn2 deamidated peptide, and a peptide 42-Da heavier than predicted, with its modification localized to the first residue by y10 and b2 ions (Figure 3). The absence of a b1-ion and accurate mass measurement ruled out Nα-acetylation for the heavier peptide, but could support an AI/L substitution or Nα-trimethylation. However, amino acid substitution is hard to rationalize from the nucleotides coding Ala (GCA) vs. those coding Leu/Ile (CTX, TTA, TTG/ATT, ATC, ATA), as is sub-stoichiometric substitution. Nα-trimethylation would require prior cleavage of the protein to expose N-terminal ANGYDVVDLGR for modification. Interestingly, the peptide lies near His136, an axial ligand to the Co2+ of the MtaC1 corrinoid co-factor that accepts CH3 from methanol and subsequently transfers it to coenzyme M (Sauer et al., 1997; Randaccio et al., 2007). Neutral losses of 59-Da, sometimes observed from trimethylated residues, were not apparent in these spectra. Additional experiments are required to verify the source of this substitution; e.g., genetic drift (DNA), sloppy transcription (RNA), miscoding (translation), or post-translational modification, but at present trimethylation is favored.

Figure 3.

Figure 3

Peptide ANGYDVVDLGR of M. acetivorans protein MA0456 (MtaC1) is present in two forms. (A) Peptide with N-terminal +42-Da modification, consistent with trimethylation or a C3H6 composition. (B) Unmodified peptide.

Despite the larger protein amounts available for the two M. mazei orthologs, MM1648 (MtaC1) and MM1073 (MtaC2), 42-Da incremented peptides ANGYNVVDLGR and ANGYDVVDLGR, respectively, were not observed. Only unmodified and -17 Da variants were observed, with the latter reflecting succinimides, unremarkable for Asn-Gly bonds. Instead, a semi-tryptic peptide was observed in M. mazei MM1073 (MtaC2). Peptide 138–150, C*HVAEGDVHDIGK was incremented by 25-Da at its N-terminus, consistent with cyanylation see the MS/MS spectrum displayed in Figure 4). Such modification seems remarkable, but may reflect a radical-induced side reaction given that this region binds the corrinoid cofactor. A classic chemical cleavage scheme relies on S-cyanocysteine's base-catalyzed ability to cleave the N-terminal peptide bond to yield iminothiazolidinyl peptides (Jacobson et al., 1973; Degani and Patchornik, 1974; Nefsky and Bretscher, 1989; Wu and Watson, 1997). Uncleaved S-cyanocysteine 134–150 (GTVVC*HVAEGDVHDIGK) was not observed by LC-MS. At present, we cannot differentiate in vivo modification from modification induced by exposure to air during sample handling.

Figure 4.

Figure 4

M. mazei semi-tryptic peptide from MM1073 (MtaC2). Peptide 138–150, C*HVAEGDVHDIGK is incremented by 25-Da at its N-terminus, consistent with cyanylation. The sequence is common to segment 138–150 of methanol corrinoid proteins MM1648 (MtaC1) and MM1073 (MtaC2).

Limited experiments performed on M. acetivorans MA0456 (MtaC1), revealed only unmodified tryptic peptide GTVVCHVAEGDVHDIGK (124–140), but those studies cannot be considered conclusive, because smaller quantities of protein and a narrower range of sample handling conditions were pursued. In particular, the M. acetivorans samples were reduced with dithiothreitol (DTT), whereas cyanylated M. mazei protein was observed from untreated samples. Excess DTT removes the cyanide group from internal cysteines, thereby stopping the cleavage reaction (Degani and Patchornik, 1974; Nefsky and Bretscher, 1989).

Protein complexes for energy transfer-acquisition

Numerous protein complexes were recovered by Con A fractionation, including, tetrahydromethanopterin S-methyl transferase (mtr), methylcobalamin:CoM methyltransferase (mta), methyl CoM reductase (mcr), F420H2 dehydrogenase (fpo), heterodisulfide reductase (hdr), and A1A0 H+ ATPase (aha). Conceivably, Con A fractionation could show utility for enriching select M. mazei protein complexes in support of other studies or for additional characterization. Hence, we considered which subunits were identified in the eluate. A second reason for interest in the proteins and complexes detected is that mass spectrometrists often discount the presence of certain proteins in mixtures as reflective of contamination. These assumptions are not always justified. Here we sought evidence that proteins not annotated as surface- or membrane-localized, or not known to be glycosylated might rationally be carried along in the Con A fractionation by interactions with other proteins. Should most of the detected proteins be rationalized, there would be additional impetus to explore the cellular localization of any remaining proteins.

All eight tetramethyl methanopterin S-methyltransferase subunits (MtrABCDEFGH) were detected, including integral membrane proteins MtrC, MtrD, and MtrE (Fischer et al., 1992; Lienard et al., 1996; Lienard and Gottschalk, 1998; Thauer, 1998; Kahnt et al., 2007). The abundance of this complex enabled N- and C-termini to be defined for many of the subunits, as well as providing some unexpected observations. Tryptic peptides IVTDEDKGIFDR (40–51) and IVTDXDK (40–46), with X corresponding to Glu-28 Da, were observed from catalytic subunit MtrH (MM1540). The latter peptide could reflect decarboxylation of Glu44, perhaps from exposure to reactive species.

Strikingly, Con A binding recovered representatives from all types of the organism's membrane-bound hydrogenases involved in electron transfer: the F420 non-reducing hydrogenase (Vht), Ech hydrogenase, and the F420H2 dehydrogenase (Fpo) (Künkel et al., 1997, 1998). These included seven proteins encoded in the F420H2 dehydrogenase gene cluster MM2479-2491, the hydrogenase components EchA and EchB (MM2320, MM2321), and F420–non-reducing hydrogenase MM2171 (VhtC). Previously, we recovered a large subunit of the F420H2 dehydrogenase [MM2170 (VhtA) or MM2313] by biotin-tagging (Francoleon et al., 2009). Membrane-bound heterodisulfide reductase proteins HdrE and HdrD (MM1843 and MM1844) were also recovered, as was MM0628, provider of reduced F420 (Bäumer et al., 2000; Thauer et al., 2008).

The methyl coenzyme M product (CH3-S-CoM) is reductively demethylated by coenzyme B (CoB-SH) in a process catalyzed by the membrane-associated methyl coenzyme M reductase complex, Mcr (Hoppert and Mayer, 1990). All three α, β, and γ subunits (MM1240, McrA; MM1244, McrB; MM1241, McrC) were eluted from concanavalin A. From α-subunit McrA, 1-N-methyl-histidine was observed in peptide HAALVSMGEMLPAR (271–284), consistent with previous observations in M. barkeri (Grabarse et al., 2000). Because peptides spanning residues 285, 465, and 472 were not recovered, we could not determine if the unusual amino acids 5-methylarginine, S-methylcysteine, and thioglycine were present, as in other methanogens (Grabarse et al., 2000; Kahnt et al., 2007). However, the N-terminus was found to be acetylated. The peptide 271–284 MS/MS spectrum, illustrated in Figure 5A, demonstrates that the 1-N-methyl histidine residue enhances b-ion intensities from tryptic peptides. Peptides 271–284 and 1–7 were only observed in modified form. Similarly, 1-N-methyl histidine was found in the active site region of the M. acetivorans ortholog, MA4546. (See Figure 5B).

Figure 5.

Figure 5

MS/MS spectra of the 271–284 tryptic peptides from the alpha subunit of membrane-associated methyl coenzyme M reductase (McrA) show that the 1-N-methyl histidine residue signals its presence with strongly enhanced b-ion intensities. (A) M. mazei MM1240 H*AALVSMGEMLPAR, and (B) M. acetivorans MA4546 H*AALVSMoxGEMoxLPAR.

That protein products from 8 of 10 A1A0 ATPase-related genes were recovered via Con A elution, (ahaABCDE, ahaHIK) suggests that this capture method may be useful in isolating these unstable complexes (Müller et al., 1999). Previous experiments (Lemker et al., 2001, 2003) purifying ATPase subcomplexes from F1F0 ATPase-negative E. coli cells over-expressing the M. mazei ahaA-ahaG operon recovered subunits A, B, C, D, and F. Questions persist regarding the participation in ATPase of M. mazei AhaG, considered authentic based on observation of an appropriately migrating SDS-PAGE band following heterologous expression in Escherichia coli (Lemker et al., 2001), but homologs of which are absent in several archaea. In our studies, recovery of AhaG may have been reduced, because the small (6.3-kDa) protein is expected to yield only two tryptic peptides larger than 900 Da, one of which is very hydrophobic. Although the mass spectrometer is capable of detecting tryptic peptides below 900 Da in mass, the peptides are often lost upstream in reversed phase HPLC, because their hydrophilicity causes them to elute with salt. Large hydrophobic peptides are also problematic for the chromatography because they fail to elute during the analysis. Thus, we cannot rule out AhaG as a component of the A1A0 ATPase.

Protein complexes for methanol metabolism

Observed in these studies were products from two of the three operons coding methanol-specific methyl cobalamin:CoM methyltransferases: (1) MtaA1 (MM1070) with heterodimeric MtaB2/MtaC2 (MM1074/MM1073), and (2) MtaB1/MtaC1 (MM1647/MM1648). The MtaB/MtaC complexes transfer a methyl group from methanol to the corrinoid cofactor of MtaC. Subsequently, that methyl is transferred to coenzyme M (HS-CoM), catalyzed by MtaA1. Alternative enzymes catalyzing transfer to HS-CoM (MtaA2 and MtbA), were not observed. That MtbA was absent is unsurprising, as it is specific for growth on H2/CO2 or trimethylamine (Harms and Thauer, 1996; Hovey et al., 2005), and Ding et al. (2002) did not identify ortholog MtaA2 from methanol-cultivated M. thermophila. However, methanol-induced expression of MtaB3 and MtaC3 was established in M. thermophila (Ding et al., 2002), leading us to address their absence in our data. First, we would not expect these methyl transferases to bind concanavalin A or associate with Con A binders, because MtaB3 and MtaC3 are generally considered soluble. Also, several MtaB3 (MM0175) peptides are non-unique, (shared with other isozymes) complicating its identification from complex mixtures. DNA microarray analyses indicated that mtaB1/mtaC1 were induced 10–33X in methanol, while mtaB2/mtaC2 were induced only in acetate (Hovey et al., 2005). Quantifying roughly, by comparing numbers of peptides recovered, we see that the trend in protein abundances follows the same direction as the transcripts: 14 peptides vs. 9 for MtaB1 vs. MtaB2 and 10 peptides vs. 6 for MtaC1 vs. MtaC2.

Additional proteins recovered by Con A

MM0633, a hypothetical protein containing a multi-heme cytochrome c domain suggested as part of a membrane-bound complex, belongs to a gene cluster showing elevated expression under aceticlastic growth (Hovey et al., 2005). In these methylotrophic studies, however, MM0633 was the only cluster member observed. As it lacks transmembrane regions, we may wonder if its presence reflects interaction with some other membrane protein or glycosylation.

The oligosaccharyl transferase (MM0647) detected in the ConA pull down experiment (Table 1) is a product of one of three aglB homologs encoded in the M. mazei genome (MM646, MM0647, MM2210) (Magidovich and Eichler, 2009). Its detection makes MM0647 a logical candidate for the AglB oligosaccharyl transferase that links glycans to asparagines on surface layer protein MM1976 and on other N-linked glycoproteins. Two minor S-layer proteins similar to MM1976, MM0467, and MM1364, were identified where the latter was also shown to be glycosylated (described below). Ongoing analyses of M. mazei N-linked glycans will reveal more about the oligosaccharides transferred.

Proteins with roles in cobalt and iron uptake were also observed: MM2069, an iron ABC transporter, MM1999 and MM2000 involved in cobalt uptake, MM0893 (CbiM), a cobalt ATP-dependent transporter, and MM0994 (CbiC). Identifications for CbiC and CbiM were based on a single recovered peptide for each, a lower standard of confidence. Numerous other transporters were also observed.

Previously detected Hsp70 analog MM2505 and the membrane-bound ATP-dependent protease LonB were also found in Con A eluate, along with two of three subunits comprising the M. mazei thermosome (Bateman et al., 2004), a eukaryotic-type chaperonin complex. Previously, surface-biotinylation with streptavidin affinity chromatography retrieved all 3 M. mazei subunits [MM1379 (α), MM0072 (β), and MM1096 (γ)], confirming the proposal (Trent et al., 2003) that a fraction of thermosome (or rosettasome) complexes are membrane-localized. In the present M. mazei lectin capture, as well as for our previous M. acetivorans surface-tagging and capture efforts (Francoleon et al., 2009), the thermosome γ-subunit was not recovered. Archaeal thermosomes vary in whether their double ring structures are composed of identical subunits, or of two or three different sequences; e.g., the Methanopyrus kandleri complex is homomeric (Andrä et al., 1998). Indeed, M. mazei proteins most closely related to the M. kandleri thermosome, MK1006, are MM1379, MM1096, and MM0072, respectively.

Protein export and processing

Machinery to transport proteins across the membrane is essential to protein secretion. Recent cryo-electron microscopy studies revealed important components of this machinery in yeast, where protein transport across the endoplasmic reticulum begins with the signal peptide of the nascent chain engaging the signal recognition particle (SRP) in the cytoplasm. Co-translational translocation is initiated when the signal peptide is transferred to the protein conducting channel, overlapping 4 binding sites on the large ribosomal subunit. The archaeal analog to translocation across the ER is transport across the cell membrane. The hetero-trimeric protein-conducting channel (akin to yeast Sec61α, β, and γ) consists of integral membrane proteins: MM2147, MM1372, and MM1009, respectively, all of which we observed (Becker et al., 2009; Kampmann and Blobel, 2009), along with accessory factors MM1424 (SecF) and MM1425 (SecD). As in Eukarya and Bacteria, ribosomes contact membranes via Sec-based sites (Ring and Eichler, 2004), consistent with our observations of associated ribosomal proteins. The SecP algorithm predicts secretion of ribosomal proteins MM1760, MM2124, MM2135, and MM2157 (Bendtsen et al., 2004a, 2005), although the relationship between non-classical secretion predictions, disordered regions, and protein-protein or protein-nucleotide interactions is yet unclear. The mass spectrometry/proteomics community often cites the presence of ribosomal proteins in cell membrane preparations as evidence of poor quality, but it is important to consider that some presence in preparations enriching membrane-associated complexes is legitimate.

Of the total number of M. mazei open reading frames detected by our study, genome sequence analysis (Deppenmeier et al., 2002) annotated about 20% as hypothetical, thus highlighting the efficacy of the Con A pull-down approach for discovery. Of the 28 hypothetical proteins, 24 were predicted to be secreted by Exprot, SignalP, SecP, and/or LipoP (Juncker et al., 2003; Bendtsen et al., 2004a,b, 2005; Saleh et al., 2010). In addition, we find that hypothetical proteins MM0716 and MM1364 are glycosylated. Glycosylation of MM1364 correlates with homology to known Methanosarcinae S-layer proteins (Francoleon et al., 2009), that also bear glycans.

Conclusions

LC-MS/MS analyses of proteolytically-digested concanavalin A eluate from M. mazei Gö1 cell lysates led to the identification of 154 proteins. Among these, constituents of membrane-bound or membrane-associated complexes known from the literature were well-represented, including all 8 subunits of tetrahydromethanopterin S-methyl transferase (Mtr), seven proteins encoded by the F420H2 dehydrogenase fpo operon, the 3 subunits of methyl coenzyme M reductase (Mcr), the protein products from 8 of 10 A1A0 ATPase-related genes (ahaABCDEHIK), and components of the machinery translocating proteins across the cell membrane [protein channel constituents MM2147, MM1372, and MM1009 and accessory factors MM1424 (SecF) and MM1425 (SecD)]. All of these proteins do not bear Con A-interacting saccharides, because lectin binding is performed under non-denaturing conditions. However, the results can be useful in considering strategies to enrich or isolate select membrane complexes from M. mazei, and perhaps other Methanosarcinae, in order to monitor dynamic changes in protein modifications and/or retrieve complexes from strains not engineered to synthesize tagged proteins for easy retrieval.

Tandem mass spectrometry data associated with protein identifications can be mined to recover novel information that is not automatically provided by the high-throughput analyses. Here it was found that S-layer protein MM1976 was present in multiple forms, including four variants of its N-terminal peptide ADVIEIR. Instances of protein formylation, methyl esterification, methylation, and cyanylation were also found. Knowledge of unanticipated modifications, even if not providing immediate insight, does suggest features to monitor for evidence of dynamic changes. Knowledge gained by data mining can also complement what is obtained from experiments specifically targeting that modification, because the experimental conditions (e.g., chromatography resin and elution conditions) are often different. A high throughput LC-MS/MS run injects only a few hundred nanograms of a peptide mixture. Further effort is underway to characterize unknown glycans, the sites they modify, and other post-translational modifications, particularly in extensively modified S-layer protein MM1976.

N-termini recovered from a subset of proteins secreted to the membrane or cell surface provide a dataset for comparison to signal peptide algorithms. Disagreement between the number of proteins predicted vs. detected with signal peptide-excised N-termini suggests that leaderless secretion is of greater importance than present models imply.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This research was supported by the Department of Energy Biosciences Division grant award DE-FG02-08ER64689 to RPG, by the UCLA-DOE Institute of Genomics and Proteomics (DE-FC03-02ER6342) to JL and RPG, by the Ruth L. Kirschstein National Research Service Award (Grant GM007185) to DL and by the National Institutes of Health (R01 GM085402) to ROL and JL.

Supplementary material

The Supplementary Material for this article can be found online at: http://www.frontiersin.org/journal/10.3389/fmicb.2015.00149/abstract

References

  1. Albers S.-V., Szabó Z., Driessen A. J. M. (2006). Protein secretion in the archaea: multiple paths towards a unique cell surface. Nat. Rev. Microbiol. 4, 537–547. 10.1038/nrmicro1440 [DOI] [PubMed] [Google Scholar]
  2. Andrä S., Frey G., Jaenicke R., Stetter K. O. (1998). The thermosome from Methanopyrus kandleri possesses an NH+4-dependent atpase activity. Eur. J. Biochem. 255, 93–99. 10.1046/j.1432-1327.1998.2550093.x [DOI] [PubMed] [Google Scholar]
  3. Antelmann H., Tjalsma H., Voigt B., Ohlmeier S., Bron S., Van Dijl J. M., et al. (2001). A proteomic view on genome-based signal peptide predictions. Genome Res. 11, 1484–1502. 10.1101/gr.182801 [DOI] [PubMed] [Google Scholar]
  4. Armengaud J. (2009). A perfect genome annotation is within reach with the proteomics and genomics alliance. Curr. Opin. Microbiol. 12, 292–300. 10.1016/j.mib.2009.03.005 [DOI] [PubMed] [Google Scholar]
  5. Baenziger J. U., Fiete D. (1979). Structural determinants of concanavalin a specificity for oligosaccharides. J. Biol. Chem. 254, 2400–2407. [PubMed] [Google Scholar]
  6. Bandeira N., Tsur D., Frank A., Pevzner P. A. (2007). Protein identification by spectral networks analysis. Proc. Natl. Acad. Sci. U.S.A. 104, 6140–6145. 10.1073/pnas.0701130104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bateman A., Coin L., Durbin R., Finn R. D., Hollich V., Griffiths-Jones S., et al. (2004). The pfam protein families database. Nucleic Acids Res. 32, D138–D141. 10.1093/nar/gkh121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bäumer S., Ide T., Jacobi C., Johann A., Gottschalk G., Deppenmeier U. (2000). The F420H2 Dehydrogenase from Methanosarcina mazei is a Redox-Driven proton pump closely related to NADH Dehydrogenases. J. Biol. Chem. 275, 17968–17973. 10.1074/jbc.M000650200 [DOI] [PubMed] [Google Scholar]
  9. Baumgartner C., Rejtar T., Kullolli M., Akella L. M., Karger B. L. (2008). SeMoP: a new computational strategy for the unrestricted search for modified peptides using LC-MS/MS data. J. Proteome Res. 7, 4199–4208. 10.1021/pr800277y [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Becker T., Bhushan S., Jarasch A., Armache J.-P., Funes S., Jossinet F., et al. (2009). Structure of monomeric yeast and mammalian Sec61 complexes interacting with the translating ribosome. Science 326, 1369–1373. 10.1126/science.1178535 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bendtsen J. D., Jensen L. J., Blom N., Von Heijne G., Brunak S. (2004a). Feature based prediction of non-classical and leaderless protein secretion. Protein Eng. Des. Sel. 17, 349–356. 10.1093/protein/gzh037 [DOI] [PubMed] [Google Scholar]
  12. Bendtsen J. D., Kiemer L., Fausbøll A., Brunak S. (2005). Non-classical protein secretion in bacteria. BMC Bioinformatics 5:58. 10.1186/1471-2180-5-58 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bendtsen J. D., Nielsen H., Von Heijne G., Brunak S. (2004b). Improved prediction of signal peptides: signalP 3.0. J. Mol. Biol. 340, 783–795. 10.1016/j.jmb.2004.05.028 [DOI] [PubMed] [Google Scholar]
  14. Blom N., Ponten T. S., Gupta R., Gammeltoft S., Brunak S. (2004). Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics 4, 1633–1649. 10.1002/pmic.200300771 [DOI] [PubMed] [Google Scholar]
  15. Bose A., Pritchett M. A., Rother M., Metcalf W. W. (2006). Differential regulation of the three methanol methyltransferase isozymes in Methanosarcina acetivorans C2A. J. Bacteriol. 188, 7274–7283. 10.1128/JB.00535-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Cooper H. J., Håkansson K., Marshall A. G. (2005). The role of electron capture dissociation in biomolecular analysis. Mass Spectrom. Rev. 24, 201–222. 10.1002/mas.20014 [DOI] [PubMed] [Google Scholar]
  17. Creasy D. M., Cottrell J. C. (2004). Unimod: protein modifications for mass spectrometry. Proteomics 4, 1534–1536. 10.1002/pmic.200300744 [DOI] [PubMed] [Google Scholar]
  18. Creasy D. M., Cottrell J. S. (2002). Error tolerant searching of uninterpreted tandem mass spectrometry data. Proteomics 2, 1426–1434. [DOI] [PubMed] [Google Scholar]
  19. Debray H., Decout D., Strecker G., Spik G., Montreuil J. (1981). Specificity of twelve lectins towards oligosaccharides and glycopeptides related to n-glycosylproteins. Eur. J. Biochem. 117, 41–55. 10.1111/j.1432-1033.1981.tb06300.x [DOI] [PubMed] [Google Scholar]
  20. Degani Y., Patchornik A. (1974). Cyanylation of sulfhydryl groups by 2-nitro-5-thiocyanobenzoic acid. High-yield modification and cleavage of peptides at cysteine residues. Biochemistry 13, 1–11. 10.1021/bi00698a001 [DOI] [PubMed] [Google Scholar]
  21. Deppenmeier U., Johann A., Hartsch T., Merkl R., Schmitz R. A., Martinez-Arias R., et al. (2002). The genome of Methanosarcina mazei: evidence for lateral gene transfer between Bacteria and Archaea. J. Mol. Microbiol. Biotechnol. 4, 453–461. [PubMed] [Google Scholar]
  22. Ding Y.-H. R., Zhang S.-P., Tomb J.-F., Ferry J. G. (2002). Genomic and proteomic analyses reveal multiple homologs of genes encoding enzymes of the methanol:coenzyme m methyltransferase system that are differentially expressed in methanol- and acetate-grown Methanosarcina thermophila. FEMS Microbiol. Lett. 215, 127–132. 10.1111/j.1574-6968.2002.tb11381.x [DOI] [PubMed] [Google Scholar]
  23. Dormeyer W., Mohammed S., Van Breukelen B., Krijgsveld J., Heck A. J. R. (2007). Targeted analysis of protein termini. J. Proteome Res. 6, 4364–4645. 10.1021/pr070375k [DOI] [PubMed] [Google Scholar]
  24. Eichler J. (2003). Facing extremes: archaeal surface-layer (glyco)proteins. Microbiology 149, 3347–3351. 10.1099/mic.0.26591-0 [DOI] [PubMed] [Google Scholar]
  25. Elzanowski A., Ostell J. (2013). The Genetic Codes [Online]. National Center for Biotechnology Information (NCBI). Available online at: http://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi?mode=c (Accessed August 30 2014).
  26. Falb M., Aivaliotis M., Garcia-Rizo C., Bisle B., Tebbe A., Kliein C., et al. (2006). Archaeal N-terminal protein maturation commonly involves N-terminal acetylation: a large-scale proteomics survey. J. Mol. Biol. 362, 915–924. 10.1016/j.jmb.2006.07.086 [DOI] [PubMed] [Google Scholar]
  27. Falkner J. A., Falkner J. W., Yocum A. K., Andrews P. C. (2008). A spectral clustering approach to MS/MS identification of post-translational modifications. J. Proteome Res. 7, 4614–4622. 10.1021/pr800226w [DOI] [PubMed] [Google Scholar]
  28. Fischer R., Gärtner P., Yeliseev A., Thauer R. K. (1992). N5-Methyltetrahydromethanopterin: coenzyme M methyltransferase in methanogenic archaebacteria is a membrane protein. Arch. Microbiol. 158, 208–217. 10.1007/BF00290817 [DOI] [PubMed] [Google Scholar]
  29. Fournier C. T., Cherny J. J., Truncali K., Robbins-Pianka A., Lin M. S., Krizanc D., et al. (2012). Amino termini of many yeast proteins map to downstream start codons. J. Prot. Res. 11, 5712–5719. 10.1021/pr300538f [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Francoleon D. R., Boontheung P., Yang Y., Kim U., Ytterberg A. J., Denny P. A., et al. (2009). S-layer, surface-accessible, and concanavalin A binding proteins of Methanosarcina acetivorans and Methanosarcina mazei. J. Prot. Res. 8, 1972–1982. 10.1021/pr800923e [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Gallien S., Perrodou E., Carapito C., Deshayes C., Reyrat J. M., Van Dorsselaer A., et al. (2009). Orthoproteogenomics: multiple proteomes investigation through orthology and a new MS-based protocol. Genome Res. 19, 128–135. 10.1101/gr.081901.108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Gevaert K., Goethals M., Martens L., Van Damme J., Staes A., Thomas G. R., et al. (2003). Exploring proteomes and analyzing protein processing by mass spectrometric identification of sorted N-terminal peptides. Nat. Biotechnol. 21, 566–569. 10.1038/nbt810 [DOI] [PubMed] [Google Scholar]
  33. Grabarse W., Halhlert F., Shima S., Thauer R. K., Ermler U. (2000). Comparison of three Methyl-Coenzyme M reductases from phylogenetically distant organisms: unusual amino acid modification, conservation and adaptation. J. Mol. Biol. 303, 329–344. 10.1006/jmbi.2000.4136 [DOI] [PubMed] [Google Scholar]
  34. Hahne H., Pachi F., Ruprecht B., Maier S. K., Klaeger S., Helm D., et al. (2013). DMSO enhances electrospray response, boosting sensitivity of proteomic experiments. Nat. Methods 10, 989–991. 10.1038/nmeth.2610 [DOI] [PubMed] [Google Scholar]
  35. Harms U., Thauer R. K. (1996). Methylcobalamin:coenzyme M methyltransferase isoenzymes MtaA and MtbA from Methanosarcina barkeri. Eur. J. Biochem. 235, 653–659. 10.1111/j.1432-1033.1996.00653.x [DOI] [PubMed] [Google Scholar]
  36. Hippler B., Thauer R. K. (1999). The energy conserving methyltetrahydromethanopterin:coenzyme M methyltransferase complex from methanogenic archaea: function of the subunit MtrH. FEBS Lett. 449, 165–168. 10.1016/S0014-5793(99)00429-9 [DOI] [PubMed] [Google Scholar]
  37. Hoelz D. J., Arnold R. J., Dobrolecki L. E., Abdel-Aziz W., Loehrer A. P., Novotny M. V., et al. (2006). The discovery of labile methyl esters on proliferating cell nuclear antigen by MS/MS. Proteomics 6, 4808–14818. 10.1002/pmic.200600142 [DOI] [PubMed] [Google Scholar]
  38. Hoppert M., Mayer F. (1990). Electron microscopy of native and artificial methylreductase high-molecular-weight complexes in strain Gö1 and Methanococcus voltae. FEBS Lett. 267, 33–37. 10.1016/0014-5793(90)80281-M [DOI] [PubMed] [Google Scholar]
  39. Hovey R., Lentes S., Ehrenreich A., Salmon K., Saba K., Gottschalk G., et al. (2005). DNA microarray analysis of Methanosarcina mazei Gö1 reveals adaptation to different methanogenic substrates. Mol. Gen. Genomics 273, 225–239. 10.1007/s00438-005-1126-9 [DOI] [PubMed] [Google Scholar]
  40. Howe C. J., Auffret A. D., Doherty A., Bowman C. M., Dyer T. A., Gray J. C. (1982). Location and nucleotide sequence of the gene for the proton translocating subunit of wheat chloroplast ATP synthase. Proc Natl Acad Sci U.S.A. 79, 6903–6907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Jacobson G. R., Schaffer M. H., Stark G. R., Vanaman T. C. (1973). Specific chemical cleavage in high yield at the amino peptide bonds of cysteine and cystine residues. J. Biol. Chem. 248, 6583–6591. [PubMed] [Google Scholar]
  42. Jaipuri F. A., Collet B. Y. M., Pohl N. L. (2008). Synthesis and quantitative evaluation of glycero-D-manno-heptose binding to concanavalin A by fluorous-tag assistance. Angew. Chem. Int. Ed. 47, 1707–1710. 10.1002/anie.200704262 [DOI] [PubMed] [Google Scholar]
  43. Juncker A. S., Willenbrock H., Von Heijne G., Brunak S., Nielsen H., Krogh A. (2003). Prediction of lipoprotein signal peptides in gram-negative bacteria. Protein Sci. 12, 1652–1662. 10.1110/ps.0303703 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Kahnt J., Buchenau B., Mahlert F., Krüger M., Shima S., Thauer R. K. (2007). Post-translational modifications in the active site region of Methyl-Coenzyme M reductase from methanogenic and methanotrophic archaea. FEBS J. 274, 4913–4921. 10.1111/j.1742-4658.2007.06016.x [DOI] [PubMed] [Google Scholar]
  45. Kampmann M., Blobel G. (2009). Nascent proteins caught in the act. Science 326, 1352–1353. 10.1126/science.1183690 [DOI] [PubMed] [Google Scholar]
  46. Kim J.-S., Dai Z., Aryal U. K., Moore R. J., Camp I. D. G., Qian W.-J. (2013). Resin-assisted enrichment of n.terminal peptides for characterizing proteolytic processing. Anal. Chem. 85, 6826–6832. 10.1021/ac401000q [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Klunker D., Haas B., Hirtreiter A., Figueiredo L., Naylor D. J., Pfeifer G., et al. (2003). Coexistence of group I and group II chaperonins in the archaeon Methanosarcina mazei. J. Biol. Chem. 278, 33256–33267. 10.1074/jbc.M302018200 [DOI] [PubMed] [Google Scholar]
  48. Kornfeld R., Ferris C. (1975). Interaction of immunoglobulin glycopeptides with concanavalin A. J. Biol. Chem. 250, 2614–2619. [PubMed] [Google Scholar]
  49. Kozak M. (1997). Interpreting cDNA sequences: some insights from studies on translation. Mamm. Genome 7, 563–574. [DOI] [PubMed] [Google Scholar]
  50. Krätzer C., Carini P., Hovey R., Deppenmeier U. (2009). Transcriptional profiling of methyltransferase genes during growth of Methanosarcina mazei on Trimethylamine. J. Bacteriol. 191, 5108–5115. 10.1128/JB.00420-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Künkel A., Vaupel M., Heim S., Thauer R. K., Hedderich R. (1997). Heterodisulfide reductase from methanol-grown cells of Methanosarcina barkeri is not a flavoenzyme. Eur. J. Biochem. 244, 226–234. 10.1111/j.1432-1033.1997.00226.x [DOI] [PubMed] [Google Scholar]
  52. Künkel A., Vorholt J. A., Thauer R. K., Hedderich R. (1998). An Escherichia coli Hydrogenase-3-type hydrogenase in methanogenic archaea. FEBS J. 252, 467–476. 10.1046/j.1432-1327.1998.2520467.x [DOI] [PubMed] [Google Scholar]
  53. Lemker T., Grüber G., Schmid R., Müller V. (2003). Defined subcomplexes of the A1 ATPase from the Archaeon Methanosarcina mazei Gö1: biochemical properties and redox regulation. FEBS Letters 544, 206–209. 10.1016/S0014-5793(03)00496-4 [DOI] [PubMed] [Google Scholar]
  54. Lemker T., Ruppert C., Stöger H., Wimmers S., Müller V. (2001). Overproduction of a functional A1 ATPase from the Archaeon Methanosarcina Gö1 in Escherichia coli. Eur. J. Biochem. 268, 3744–3750. 10.1046/j.1432-1327.2001.02284.x [DOI] [PubMed] [Google Scholar]
  55. Lienard T., Becher B., Marschall M., Bowien S., Gottschalk G. (1996). Sodium ion translocation by N5-methyltetrahydromethanopterin : coenzyme M methyltransferase from Methanosarcina mazei Gö1 reconstituted in ether lipid liposomes. Eur. J. Biochem. 239, 857–864. 10.1111/j.1432-1033.1996.0857u.x [DOI] [PubMed] [Google Scholar]
  56. Lienard T., Gottschalk G. (1998). Cloning, sequencing and expression of the genes encoding the sodium translocating N5-Methyltetrahydromethanopterin:coenzyme M methyltransferase of the methyloptrophic archaeon Methanosarcina mazei Gö1. FEBS Lett. 425, 204–208. 10.1016/S0014-5793(98)00229-4 [DOI] [PubMed] [Google Scholar]
  57. Magidovich H., Eichler J. (2009). Glycosyltransferases and oligosaccharyltransferases in archaea: putative components of the N-Glycosylation pathway in the third domain of life. FEMS Microbiol. Lett. 300, 122–130. 10.1111/j.1574-6968.2009.01775.x [DOI] [PubMed] [Google Scholar]
  58. Mechref Y. (2012). Use of CID/ETD mass spectrometry to analyze glycopeptides. Curr. Protoc. Protein Sci. 68:12.11.1–12.1.11. 10.1002/0471140864.ps1211s68 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Meinnel T., Giglione C. (2008). Tools for analyzing and predicting N-Terminal protein modifications. Proteomics 8, 626–649. 10.1002/pmic.200700592 [DOI] [PubMed] [Google Scholar]
  60. Menschaert G., Vandekerckhove T. T. M., Landuyt B., Hayakawa E., Schoofs L., Luyten W., et al. (2009). Spectral clustering in peptidomics studies helps to unravel modification profile of biologically active peptides and enhances peptide identification rate. Proteomics 9, 4381–4388. 10.1002/pmic.200900248 [DOI] [PubMed] [Google Scholar]
  61. Messner P. (2009). Prokaryotic protein glycosylation is rapidly expanding from “curiosity” to “ubiquity”. Chem. Biochem. 10, 2151–2154. 10.1002/cbic.200900388 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Müller V., Ruppert C., Lemker T. (1999). Structure and Function of the A1A0-ATPases from methanogenic archaea. J. Bioenerg. Biomembr. 31, 15–27. [DOI] [PubMed] [Google Scholar]
  63. Nefsky B., Bretscher A. (1989). Landmark mapping: a general method for localizing cysteine residues within a protein. Proc Nat. Acad. Sci. U.S.A. 86, 3549–3553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Ogorzalek Loo R. R., Du P., Loo J. A., Holler T. (2002). In-vivo labeling: a glimpse of the dynamic proteome and additional constraints for protein identification. J. Am. Soc. Mass Spectrom. 13, 804–812. 10.1016/S1044-0305(02)00408-7 [DOI] [PubMed] [Google Scholar]
  65. Parker K. C., Garrels J. I., Hines W., Butler E. M., Mckee A. H. Z., Patterson D., et al. (1998). Identification of yeast proteins from two-dimensional gels. working out spot cross-contamination. Electrophoresis 19, 1920–1932. 10.1002/elps.1150191110 [DOI] [PubMed] [Google Scholar]
  66. Patrie S. M., Ferguson J. T., Robinson D. E., Whipple D., Rother M., Metcalf W. W., et al. (2006). Top-Down Mass Spectrometry of <60 kDa proteins from Methanosarcina acetivorans using Q-FTMS with automated octopole collisionally activated dissociation (OCAD). Mol. Cell. Proteomics 5, 14–36. 10.1074/mcp.M500219-MCP200 [DOI] [PubMed] [Google Scholar]
  67. Prats A. C., Vagner S., Prats H., Amalric F. (1992). Cis-Acting elements involved in the alternative translation initiation process of human basic fibroblast growth factor mRNA. Mol. Cell. Biol. 12, 4796–4805. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Randaccio L., Geremia S., Wuerges J. (2007). Crystallography of vitamin B12 proteins. J. Organometallic Chem. 692, 1198–1215 10.1016/j.jorganchem.2006.11.040 [DOI] [Google Scholar]
  69. Ring G., Eichler J. (2004). Membrane binding of ribosomes occurs at SecYE-based sites in the archaea Haloferax volcanii. J. Mol. Biol. 336, 997–1010. 10.1016/j.jmb.2004.01.008 [DOI] [PubMed] [Google Scholar]
  70. Rohlin L., Leon D. R., Kim U., Loo J. A., Ogorzalek Loo R. R., Gunsalus R. P. (2012). Identification of the major expressed S-Layer and cell surface-layer-related proteins in the model methanogenic archaea: Methanosarcina barkeri fusaro and Methanosarcina acetivorans C2A. Archaea 2012:873589. 10.1155/2012/873589 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Running W., Reilly J. P. (2009). Ribosomal proteins of Deinococcus radiodurans: their solvent accessibility and reactivity. J. Proteome Res. 8, 1228–1246. 10.1021/pr800544y [DOI] [PubMed] [Google Scholar]
  72. Russo A., Chandramouli N., Zhang L., Deng H. (2008). Reductive glutaraldehydation of amine groups for identification of protein N-termini. J. Proteome Res. 7, 4178–4182. 10.1021/pr800224v [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Saleh M., Song C., Nasserulla S., Leduc L. G. (2010). Indicators from archaeal secretomes. Microbiol. Res. 165, 1–10. 10.1016/j.micres.2008.03.002 [DOI] [PubMed] [Google Scholar]
  74. Sauer K., Harms U., Thauer R. K. (1997). Methanol:coenzyme M methyltransferase from Methanosarcina barkeri. purification, properties, and encoding genes of the corrinoid protein MT1. Eur. J. Biochem. 243, 670–677. 10.1111/j.1432-1033.1997.t01-1-00670.x [DOI] [PubMed] [Google Scholar]
  75. Savitski M. M., Nielsen M. L., Zubarev R. A. (2005). New data base-independent, sequence tag-nased scoring of peptide MS/MS data validates mowse scores, recovers below threshold data, singles out modified peptides, and assesses the quality of MS/MS techniques. Mol. Cell. Proteomics 4, 1180–1188. 10.1074/mcp.T500009-MCP200 [DOI] [PubMed] [Google Scholar]
  76. Savitski M. M., Nielsen M. L., Zubarev R. A. (2006). Modificomb, a new proteomic tool for mapping substoichiometric post-translational modifications, finding novel types of modifications, and fingerprinting complex protein mixtures. Mol. Cell. Proteomics 5, 935–948. 10.1074/mcp.T500034-MCP200 [DOI] [PubMed] [Google Scholar]
  77. Schmidt A., Kellermann J., Lottspeich F. (2005). A novel strategy for quantitative proteomics using isotope-coded protein labels. Proteomics 5, 4–15. 10.1002/pmic.200400873 [DOI] [PubMed] [Google Scholar]
  78. Sebald W., Graf T., Lukins H. B. (1979). The Dicyclohexylcarbodiimide-binding protein of the mitochondrial ATPase complex from Neurospora crassa and Saccharomyces cerevisiae identification and isolation. Eur. J. Biochem. 93, 587–599. 10.1111/j.1432-1033.1979.tb12859.x [DOI] [PubMed] [Google Scholar]
  79. Shen P.-T., Hsu J.-L., Chen S.-H. (2007). Dimethyl isotope-coded affinity selection for the analysis of free and blocked N-Termini of proteins using LC-MS/MS. Anal. Chem. 79, 9520–9530. 10.1021/ac701678h [DOI] [PubMed] [Google Scholar]
  80. Silva J. C., Denny R., Dorschel C. A., Gorenstein M., Kass I. J., Li G. Z., et al. (2005). Quantitative proteomic analysis by accurate mass retention time pairs. Anal. Chem. 77, 2187–2200. 10.1021/ac048455k [DOI] [PubMed] [Google Scholar]
  81. Sowers K. R., Boone J. E., Gunsalus R. P. (1993). Disaggregation of Methanosarcina spp. and growth as single cells at elevated osmolarity. Appl. Environ. Microbiol. 59, 3832–3839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Thauer R. K. (1998). Biochemistry of methanogenesis: a tribute to marjory stephenson. Microbiology 144, 2377–2406. 10.1099/00221287-144-9-2377 [DOI] [PubMed] [Google Scholar]
  83. Thauer R. K., Kaster A.-K., Seedorf H., Buckel W., Hedderich R. (2008). Methanogenic archaea: ecologically relevant differences in energy conservation. Nat. Rev. Microbiol. 6, 579–591. 10.1038/nrmicro1931 [DOI] [PubMed] [Google Scholar]
  84. Trent J. D., Kagawa H. K., Paavola C. D., Mcmillan R. A., Howard J., Jahnke L., et al. (2003). Intracellular localization of a group II chaperonin indicates a membrane-related function. Proc. Natl. Acad. Sci. U.S.A. 100, 15589–15594. 10.1073/pnas.2136795100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Venne A. S., Vögtle F.-N., Meisinger C., Sickmann A., Zahedi R. P. (2013). Novel highly sensitive, specific, and straightforward strategy for comprehensive N.terminal proteomics reveals unknown substrates of the mitochondrial peptidase Icp55. J. Prot. Res. 12, 3823–3830. 10.1021/pr400435d [DOI] [PubMed] [Google Scholar]
  86. Wilhelm T., Jones A. M. E. (2014). Identification of related peptides through the analysis of fragment ion mass shifts. J. Prot. Res. 13, 4002–4011. 10.1021/pr500347e [DOI] [PubMed] [Google Scholar]
  87. Wu J., Watson J. T. (1997). A novel methodology for assignment of disulfide bond pairings in proteins. Protein Sci. 6, 391–398. 10.1002/pro.5560060215 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Xing G., Zhang J., Chen Y., Zhao Y. (2008). Identification of four novel types of in vitro protein modifications. J. Proteome Res. 7, 4603–4608. 10.1021/pr800456q [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Xu G., Jaffrey S. R. (2010). N-CLAP: global profiling of N-Termini by chemoselective labeling of the α-amine of proteins. Cold Spring Harb. Protoc. 10:pdb.prot5528. 10.1101/pdb.prot5528 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Yalcin T., Khouw C., Csizmadia I. G., Peterson M. R., Harrison A. G. (1995). Why are b Ions stable species in peptide spectra? J. Am. Soc. Mass Spectrom. 6, 1165–1174. 10.1016/1044-0305(95)00569-2 [DOI] [PubMed] [Google Scholar]
  91. Yamaguchi M., Nakayama D., Shima K., Kuyama H., Ando E., Okamura T.-A., et al. (2008). Selective isolation of N-terminal peptides from proteins and their de novo sequencing by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry without regard to unblocking of N-terminal amino acids. Rapid Commun. Mass Spectrom. 22, 3313–3319. 10.1002/rcm.3735 [DOI] [PubMed] [Google Scholar]
  92. Yamazaki S., Yamazaki J., Nishijima K., Otsuka R., Mise M., Ishikawa H., et al. (2006). Proteome analysis of an aerobic hyperthermophilic crenarchaeon, Aeropyrum pernix K1. Mol. Cell. Prot. 5, 811–823. 10.1074/mcp.M500312-MCP200 [DOI] [PubMed] [Google Scholar]
  93. Ytterberg A. J., Peltier J. B., van Wijk K. J. (2006). Protein profiling of plastoglobules in chloroplasts and chromoplasts. A surprising site for differential accumulation of metabolic enzymes. Plant Physiol. 140, 984–997. 10.1104/pp.105.076083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Yurist-Doutsch S., Chaban B., Vandyke D. J., Jarrell K. F., Eichler J. (2008). Sweet to the extreme: protein glycosylation in archaea. Mol. Microbiol. 68, 1079–1084. 10.1111/j.1365-2958.2008.06224.x [DOI] [PubMed] [Google Scholar]
  95. Zubarev R. A. (2004). Electron-capture dissociation tandem mass spectrometry. Curr. Opin. Biotechnol. 15, 12–16. 10.1016/j.copbio.2003.12.002 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Frontiers in Microbiology are provided here courtesy of Frontiers Media SA

RESOURCES