Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2023 Sep 29;22(11):3418–3426. doi: 10.1021/acs.jproteome.3c00488

Orbitrap Mass Spectrometry and High-Field Asymmetric Waveform Ion Mobility Spectrometry (FAIMS) Enable the in-Depth Analysis of Human Serum Proteoforms

Jake T Kline , Michael W Belford , Cornelia L Boeser , Romain Huguet , Ryan T Fellers §, Joseph B Greer §, Sylvester M Greer , David M Horn , Kenneth R Durbin §, Jean-Jacques Dunyach , Nagib Ahsan ∥,, Luca Fornelli †,∥,*
PMCID: PMC10629265  PMID: 37774690

Abstract

graphic file with name pr3c00488_0008.jpg

Blood serum and plasma are arguably the most commonly analyzed clinical samples, with dozens of proteins serving as validated biomarkers for various human diseases. Top-down proteomics may provide additional insights into disease etiopathogenesis since this approach focuses on protein forms, or proteoforms, originally circulating in blood, potentially providing access to information about relevant post-translational modifications, truncations, single amino acid substitutions, and many other sources of protein variation. However, the vast majority of proteomic studies on serum and plasma are carried out using peptide-centric, bottom-up approaches that cannot recapitulate the original proteoform content of samples. Clinical laboratories have been slow to adopt top-down analysis, also due to higher sample handling requirements. In this study, we describe a straightforward protocol for intact proteoform sample preparation based on the depletion of albumin and immunoglobulins, followed by simplified protein fractionation via polyacrylamide gel electrophoresis. After molecular weight-based fractionation, we supplemented the traditional liquid chromatography–tandem mass spectrometry (LC-MS2) data acquisition with high-field asymmetric waveform ion mobility spectrometry (FAIMS) to further simplify serum proteoform mixtures. This LC-FAIMS-MS2 method led to the identification of over 1000 serum proteoforms < 30 kDa, outperforming traditional LC-MS2 data acquisition and more than doubling the number of proteoforms identified in previous studies.

Keywords: Protein depletion, blood, biomarkers, serum, top-down proteomics, Orbitrap, mass spectrometry, FAIMS, PEPPI, proteoform

Introduction

Top-down proteomics (TDP) is defined as the high-throughput investigation of protein forms, or proteoforms,1 typically performed by mass spectrometry (MS).2 Despite recent technological advances in separation techniques3 and high-resolution mass spectrometry,4,5 TDP still offers relatively limited throughput compared to its peptide-centric counterpart, bottom-up proteomics (BUP).6 However, while BUP analyzes short peptides derived from the enzymatic digestions of proteins, and thus suffers from the protein inference problem,7 TDP offers the possibility to interrogate the actual protein molecules present in the cell or tissue of interest. This feature makes TDP a powerful tool for the investigation of clinical samples and, more specifically, for the discovery and validation of biomarkers.

In light of this, it is not surprising that the application of TDP has seen a rapid expansion in translational studies over the past decade.8 Intact protein analysis by MS has been used on a variety of clinical samples and tissues, including (but not limited to) cerebrospinal fluid,9 cardiac tissue,10,11 gingival crevicular fluid,12 brain tissue,13,14 and saliva.1518 Kelleher and co-workers analyzed the various proteoforms of peripheral blood mononuclear cells (PBMCs) in an attempt to identify biomarkers for transplant rejection.19 Such work has recently led to the creation of the largest proteoform repository to date, the Blood Proteoform Atlas, which includes >30 000 unique proteoforms.20

Blood serum and plasma arguably represent the most widely used and easily accessible samples for clinical studies, and also the largest sources of known disease biomarkers.21,22 Historically, BUP has been extensively applied to the characterization of these tissues,23 whose complexity arises from the extreme dynamic range of concentration (spanning 11 orders of magnitude) of their protein components. More than 90% of the total protein content in human plasma is attributed to only 22 highly abundant proteins (HAPs).24 Therefore, alongside strategies based on the direct analysis of the whole protein content of serum and plasma,24 approaches based on the depletion of HAPs, and particularly human serum albumin and immunoglobulins, have been proposed and validated and are now easily applicable in any clinical proteomics laboratory thanks to a large variety of commercially available protein depletion kits.25,26 Surprisingly, the application of MS-based top-down proteomics for the large-scale exploration of proteoforms present in human serum and plasma has been limited. A notable exception is given by the study from Cheon et al.,27 who applied extensive molecular weight (MW)-based off-line fractionation of human plasma proteoforms using gel-eluted liquid fraction entrapment electrophoresis (GELFrEE).28 By combining multiple HAP depletion methods and through the analysis of multiple GELFrEE fractions, the authors identified a total of 442 low MW proteoforms (<25 kDa) from 71 proteins.27 To our knowledge, this study represents the largest high-throughput TDP study on human plasma to date, since other studies either focused on the peptidome (i.e., mostly proteins <10 kDa)29 or selectively targeted the proteoforms of plasma or serum proteins of interest, such as apolipoproteins (e.g., ApoC-III and ApoA-I)3033 or transthyretin.34

Here, we propose a simplified method for the in-depth analysis of the low MW intact proteome (<30 kDa) of human serum. In the attempt to make the experimental protocol accessible to the largest possible number of laboratories, we replaced GELFrEE (whose commercial version has been discontinued) with a method named polyacrylamide-gel-based prefractionation for analysis of intact proteoforms and protein complexes by mass spectrometry (PEPPI-MS),35 which is based on a simple SDS-PAGE apparatus typically available to any protein biochemistry research group. We opted for the use of HAP depletion using two different commercial kits targeting albumin and immunoglobulins to limit the quantity of antibody light chain (∼25 kDa) contaminating the 0–30 kDa portion of the serum proteome under examination. The performances of the two kits were compared to evaluate whether their use may result in the unsought enrichment of specific protein groups, as previously reported for other kits benchmarked using BUP.36,37

To improve the depth of intact proteome coverage, in our Orbitrap MS-based TDP experiments we supplemented in-solution PEPPI-MS proteoform fractionation with gas-phase fractionation (GPF) of proteoform cations using high-field asymmetric waveform ion mobility spectrometry (FAIMS).38 Recent top-down studies,39 also involving complex human samples such as brain tissue from Alzheimer’s patients,13 demonstrated that FAIMS can substantially increase the number of identified proteoforms, particularly low-abundance ones. In FAIMS, protein cations are separated based on their differential mobility once subjected to an asymmetric waveform (i.e., including low and high electric field components). A compensatory DC voltage (CV) is applied to prevent collision with the electrodes of ions with specific mobility characteristics. Therefore, incremental changes in the applied CV values allow for the transmission of different fractions of the ions originally generated via electrospray ionization. While in the first FAIMS-TDP studies the CV was maintained constant throughout the whole duration of a liquid chromatography-tandem mass spectrometry (LC-MS2) experiment and progressively increased across consecutive runs (a practice known as “external stepping”),13,39 more recently Tholey and co-workers have demonstrated that an “internal stepping” strategy, whereby multiple CV values are applied in cycle within the same LC-MS2 experiment, can be efficiently implemented, thus reducing the overall number of runs required for thorough sample characterization.4042

Our TDP analysis of two PEPPI fractions (0–15 and 15–30 kDa) with and without FAIMS (with internal CV stepping) confirmed the utility of GPF via differential mobility spectrometry, which in conjunction with MW-based fractionation can lead to the identification of more than 1000 low MW serum proteoforms from a limited number of LC-FAIMS-MS2 runs.

Experimental Section

Serum Protein Extraction and Fractionation

Human pooled serum was purchased from Millipore Sigma (catalog number H4522). Two commercial kits were used to deplete 60 μL of serum of albumin and immunoglobulins following the manufacturers’ protocols: High Select HSA/Immunoglobulin Depletion Midi Spin Columns (catalog number A36367; Thermo Fisher Scientific, Rockford, IL) and ProteoExtract (catalog number 122642; Millipore Sigma, MA, USA). Two duplicate samples were prepared using each kit, with each duplicate equally split in two prior to protein fractionation (see below) so that the same input samples would be later analyzed in the non-FAIMS and FAIMS experiments.

Protein content of depleted serum was quantified using a NanoDrop OneC microvolume UV–vis spectrophotometer (Thermo Fisher Scientific, Rockford, IL) measuring absorbance at 280 nm, and a volume corresponding to 300 μg of protein was acetone precipitated overnight at −20 °C. Each protein pellet was resuspended in 1% SDS (w/v) and subsequently fractionated based on molecular weight (MW) into two fractions, 0–15 and 15–30 kDa, using PEPPI-MS on a 12% T polyacrylamide gel. Additional details for the PEPPI-MS protocol are provided in the Supporting Information. A 10 μL aliquot of recovered protein solution from each fraction was used for visualizing the result of PEPPI using SDS-PAGE followed by silver staining. Immediately prior to LC-MS2 analysis, samples were desalted and SDS removed using methanol/chloroform/water precipitation and reconstituted in mobile phase A (vide infra). Alternatively, the other two SDS cleanup methods were applied for a specific comparison. Anion-exchange disk-assisted sequential sample preparation (AnExSP) was performed based on a previously published method.41 Briefly, the sample volume was reduced using centrifugal 3 kDa MWCO ultrafiltration columns (Amicon Ultra, Millipore Sigma, St Louis, MO). Next, SDS was removed from samples by buffer exchanging twice with 8 M urea using the centrifugal column. Following SDS removal, the sample was buffer exchanged twice into 100 mM ammonium bicarbonate (ABC) at pH 8.5 using spin filtration and concentrated to ∼15 μL prior to recovery based on the manufacturer’s suggested protocol. Next, a StageSpinTip was prepared using a 6 mm diameter disk of Empore anion SPE (3M, Saint Paul, MN) in a 200 μL pipet tip. The StageSpinTip was conditioned twice with 80 μL of methanol (7000g for 3 min for remaining centrifugation steps) and then equilibrated twice with 80 μL of 100 mM ABC. The sample was then loaded onto the tip and the volume increased to 200 μL with 100 mM ABC prior to centrifugation. The sample was washed three times with 200 μL of 100 mM ABC. Finally, the sample was eluted in two steps using 40 μL of 0.5% formic acid (v/v) and 50% methanol (v/v). The sample volume was vacuum concentrated to ∼5 μL, then the volume was increased to 20 μL with mobile phase A (vide infra). Sample cleanup based on HiPPR Detergent Removal Columns (Thermo Scientific, catalog no. 87779) followed the manufacturer’s suggested protocol using LC-MS grade water. Cleaned samples were vacuum concentrated to a volume of ∼5 μL, which was brought up to 20 μL with mobile phase A (vide infra) prior to column injection.

Intact Protein Mass Spectrometry

Proteoform separation was obtained using an Ultimate 3000 UHPLC system (Thermo Scientific, Sunnyvale, CA), with trap and analytical columns (150 μm i.d. by 30 mm length and 100 μm i.d. by 250 mm length, respectively) packed in-house with macroporous polystyrene-divinylbenzene PLRP-S material (5 μm particle size, 1000 Å pore size; Agilent, Santa Clara, CA). Columns were heated at 55 °C, and the analytical flow rate was set at 1 μL/min. Mobile phase A consisted of 5% acetonitrile and 0.2% formic acid (v/v) in water, while mobile phase B was 5% H2O and 0.2% formic acid in acetonitrile (v/v). An 85 min chromatographic method (total run time), including two final washes preceding column re-equilibration, was used (details in the Supporting Information). The column outlet was connected online to a custom-made nanoelectrospray ionization source (built according to the instructions provided by the University of Washington Proteomics Resources and fitted with narrow mounting posts to be positioned in front of the FAIMS unit) coupled to an Orbitrap Eclipse Tribrid mass spectrometer (Thermo Scientific, San Jose, CA), which was operated in intact protein mode. Data-dependent acquisition was based on broadband mass spectra (MS1) recorded at 120 000 (at 200 m/z) resolving power (r.p.) over a 400–2000 m/z window, and HCD MS2 spectra were recorded at 60 000 r.p. (at 200 m/z) after the fragmentation of the most abundant ions with charge states between +5 and +50. HCD normalized collision energy (NCE) of 35% was applied, and dynamic exclusion was enabled for 45 s. The target values for the automatic gain control (AGC) were set at 4.8e5 and 2.5e5 charges for MS1 and MS2, respectively. Both MS1 and MS2 acquisitions were based on a single time-domain transient (i.e., “microscan”) per mass spectrum. In experiments not using FAIMS, a 3 s data acquisition cycle time was used, and three technical replicates of each PEPPI fraction were run (total number of collected RAW files: 24). In experiments using FAIMS Pro (Thermo Scientific, San Jose, CA), the compensation voltage (CV) was stepped during the runs, utilizing the values reported in Table S1, applying 3 CV values per LC-FAIMS-MS2 run. In this case, each 0–15 kDa PEPPI fraction was run three times, while larger 15–30 kDa fractions were subjected to two LC-MS2 runs (total number of collected RAW files: 20). FAIMS experiments also used shorter MS1–MS2 cycle times of 1.5–2 s to accommodate for the use of multiple CV values in the same run. All mass spectrometry data files are uploaded on MassIVE (repository number MSV000092574).

Analysis of Top-down MS Data

RAW files were searched against a human UniProt database (DB) derived from an .xml file using ProSight PD 4.2 (Proteinaceous, Inc., Evanston, IL), part of Proteome Discoverer 3.0 (Thermo Scientific). Standard processing and consensus workflows for “high–high” data analysis were used. For FAIMS experiments, the FAIMS CV was kept unspecified to account for varying CV values within a single RAW file (so that all MS1 spectra were considered in the analysis, regardless of the CV value used for their acquisition). MS1 spectra were deconvoluted within the high–high cRAWler node by using the Xtract algorithm. The DB search was two-tiered, including an “annotated proteoform” (mass tolerances: 2.2 Da and 10 ppm for precursor and fragment ions, respectively) and a “subsequence search” (mass tolerances set at 10 ppm). Up to four PTMs were considered per proteoform, based on the UniProt annotation. Unless otherwise stated, three proteoform-spectrum matches (PrSMs) were allowed per precursor. PrSMs were filtered using a three-level, context-dependent FDR calculation with a cutoff set at 1%.43 The FDR was recalculated based on the grouping of the file, i.e., one FDR calculation was performed for the FAIMS versus non-FAIMS comparison and another one for the High Select versus ProteoExtract comparison. Plots were generated by using GraphPad Prism 9 (GraphPad Software). GO analysis was performed using DAVID.44

Results and Discussion

FAIMS Improves the Coverage of the Intact Serum Proteome

TDP inherently suffers from signal dilution problems that are proportional to the mass of the analyzed proteins. The combination of broad isotopic clusters and wide charge state envelopes that characterize protein ions generated via electrospray ionization typically restricts TDP to the analysis of proteins <30 kDa and limits the spectral dynamic range.45,46 Therefore, multidimensional prefractionation of proteoforms is widely applied in TDP to reduce sample complexity. However, these strategies increase sample preparation efforts and produce a large number of protein fractions, ultimately requiring longer data acquisition and analysis times. Here, GPF via FAIMS was evaluated as a means to counterbalance limited sample prefractionation. To benchmark the utility of FAIMS, depleted protein samples were fractionated by PEPPI-MS into only two fractions, according to the workflow graphically described in Figure 1.

Figure 1.

Figure 1

Workflow followed for the TDP analysis of commercial serum. Note that two identical sets of samples were subjected to PEPPI-MS to minimize variation in the proteoforms to be identified via Orbitrap MS with or without the use of FAIMS.

As shown in Figure 2A, PEPPI is characterized by a high degree of reproducibility which is also important for the implementation of label-free quantitative TDP studies.47 Each protein fraction was later subjected to LC-MS2 analysis, with or without FAIMS. Notably, MW-based fractionation is advantageous because CV values in LC-FAIMS-MS2 experiments can be adjusted to the expected MW of the proteoforms under analysis, following the findings by Gerbasi et al.39

Figure 2.

Figure 2

Summary of the results in the presence or absence of FAIMS Pro. (A) Results of MW-based fractionation of serum proteins with PEPPI-MS, visualized on a silver stained PAGE gel. (B) Mass distribution of proteoforms (at 1% FDR) without (blue bars) and with (red bars) FAIMS. (C) Venn diagram of unique accession numbers identified in the two groups of experiments (without FAIMS, blue; FAIMS, red). (D) Venn diagram of unique number of proteoforms with and without FAIMS. The results presented here include the analysis of serum samples depleted using both kits under evaluation.

The comparison of non-FAIMS results (based on the analysis of each PEPPI fraction in triplicate) versus the FAIMS results (based on three runs for each 0–15 kDa fraction and two runs for each 15–30 kDa fraction, always using different internally stepped CV values) demonstrate a +35% increase (i.e., 894 unique proteoforms identified with FAIMS versus 661 without) in the identification of proteoforms < 15 kDa when FAIMS is used (Figure 2B). As displayed in Figure 2C and D (and summarized in Table S2), FAIMS globally outperforms standard experiments in both the number of identified proteoforms (1165 versus 947) and UniProt accession numbers (105 versus 81). These results are obtained through efficient GPF, as exemplified in Figure 3 and Figure S1, which show that the application of different CV values can unveil the presence of different coeluting proteoforms with various masses and charge state distributions. Importantly, a comparison of technical replicate runs performed with or without FAIMS demonstrates that when FAIMS is not applied, most of the identified proteoforms in each run remain the same; conversely, the application of different CV values across runs changes substantially the identifications obtained across LC-FAIMS-MS2 experiments on the same PEPPI fraction (Figure S2).

Figure 3.

Figure 3

Effect of different compensation voltage values (CV) on the detection of human serum proteoforms. This example refers to the 0–15 kDa fraction. (A) Total ion chromatograms at CV values of −50 (top), −40 (middle), and −30 V (bottom). Up to three different CV values were used in a single LC-MS2 run. (B) Corresponding MS1 spectra obtained at the same retention time by applying different CV values. The three spectra show different species and prevalent charge states of the same proteoforms.

Additional analysis of the spectral intensity of precursor ions linked to identified proteoforms demonstrates that GPF does not uniquely facilitate the identification of particularly low-abundance proteoforms. Figure 4 shows the distribution of intensities of precursor ions corresponding to three groups of identified proteoforms: the comparison of the precursors of proteoforms identified in both non-FAIMS and FAIMS data sets (in blue and pink, respectively) shows a statistically significant increase in the precursor intensity when GPF is applied.

Figure 4.

Figure 4

Comparison of the intensities of proteoforms shared between FAIMS and non-FAIMS to the intensities of proteoforms exclusive to FAIMS. Shared proteoform intensities from non-FAIMS acquisition are in blue; those for FAIMS acquisition are in pink, while intensities for FAIMS-specific proteoforms are in red. Dotted lines represent the 75% (top) and 25% (bottom) quartiles. Dashed lines represent the median. Significance was determined by the Kolmogorov–Smirnov test; ***P value = 0.0007; ****P value < 0.0001.

Even though the precursor ions of proteoforms uniquely identified in FAIMS (displayed in red) are characterized by reduced intensity compared with the rest of the precursors that led to an identification in the LC-FAIMS-MS2 runs, their intensity is not low in absolute terms (but rather similar to the majority of precursors included in the non-FAIMS data set). This reflects the capability of FAIMS to reduce spectral congestion so that additional precursors can be selected for fragmentation in data-dependent acquisition experiments. Overall, these results indicate that the complexity of the intact proteome of serum is so high that only a minor fraction of proteoforms (631, corresponding to 43% of the total) and proteins (59, corresponding to 45% of the total) is shared between the two data sets: when the identifications of FAIMS and non-FAIMS experiments are combined, the number of identifications reaches 1481 proteoforms from 130 protein accessions. The LC-FAIMS-MS2 strategy is clearly advantageous over regular LC-MS2 (+23% and +30% in the number of identified proteoforms and UniProt accession numbers, respectively), and this is demonstrated also by the results of a DB search where the third technical replicate for the FAIMS 0–15 kDa fraction was not considered (Figure S3), yet the number of identifications is substantially unaltered. Collectively, these results suggest that the use of three FAIMS-based runs per sample with more closely spaced CV values could help further expand the proteome coverage in future studies, and they also indicate that the third LC-FAIMS-MS2 run for the 0–15 kDa fraction, which used particularly negative CV values (usually applied to small polypeptides, like in BUP experiments), did not account for a significant number of unique proteoform identifications—possibly also due to limitations in the sample preparation (vide infra).

As further proof of the inherent complexity of the serum proteome, we compared these TDP results against a recently published BUP data set collected using the same exact commercial pooled serum sample and utilizing the same commercial HAP depletion kits (Figure S4).48 Surprisingly, only 12% of UniProt accession numbers are shared between the TDP and BUP data sets (when considering only the proteins < 30 kDa identified in the BUP study, the fraction decreases to 9%). By extending the comparison to include the results of a larger BUP study based on the analysis of clinical serum samples,49 the differences in protein identification across data sets are even more apparent.

Main Sources of Variability among Human Serum Proteoforms

Regarding chemical modifications discovered by TDP, 535 and 520 proteoforms of the non-FAIMS and the FAIMS data sets, respectively, include at least one PTM (corresponding to 56% and 45% of the identifications, respectively), indicating that serum proteoforms are frequently modified (Table S2 and Figure S5). Acetylation and phosphorylation are the most frequently observed PTMs. Protein truncation also plays a substantial role in expanding the proteoform variability in serum. Considering the five UniProt accessions represented by the largest number of proteoforms in this TDP data set, N-terminal sequence truncation (beyond the removal of initiator Met or of signal peptide) is present in 12%–66% of the cases. Double truncation (i.e., contemporary truncation at the N- and C-terminus) is observed in up to 41% of the identified proteoforms derived from a single gene (Figure S6).

We posit that truncation is one of the reasons that explain why the majority of identified proteoforms have a mass < 15 kDa (Figure 2B). The almost complete lack of proteoforms in the 20–25 kDa mass bin is justified when considering that the BUP analysis of the same pooled serum used for the present TDP study has led to substantially more identifications in the 0–15 kDa range than in the 15–30 kDa bin (61 vs 33, respectively, as reported in Figure S4). Additionally, while the gel displayed in Figure 2A shows an apparent protein band at MW ∼ 25 kDa, that more than likely corresponds to just one protein, a residual immunoglobulin light chain. The DB search identified both lambda and kappa light chains of human immunoglobulin; however only a few distinct sequences are included in the DB, and thus the number of identified proteoforms is not as high as for other abundant proteins (e.g., apolipoprotein A-I). The large number of small proteoforms present in the sample explains also why the use of FAIMS had a major impact almost exclusively on analytes < 15 kDa. In separate studies based on the analysis of different samples such as whole cell lysates, FAIMS increased the number of proteoform identifications throughout the whole 0–30 kDa range.47 Therefore, we conclude that the results reported here reflect the specific compositional characteristics of the analyzed pooled serum sample.

A Closer Look to Serum Depletion and TDP

Differences were observed when RAW files (with and without FAIMS) collected after the depletion of albumin and immunoglobulins were compared with the High Select and ProteoExtract kits (Figure 5). Figure 5A and B show that, although the majority of proteoforms are shared between the two data sets (772 unique proteoforms, corresponding to 51% of the 1510 total), only 64 UniProt accession numbers (corresponding to 42% of the 152 total) were identified in both data sets.

Figure 5.

Figure 5

Summary of comparative analysis of High Select and ProteoExtract serum depletion kits. High Select results are in brown, while ProteoExtract results are in purple. (A) Venn diagram of unique accession numbers (at 1% FDR). (B) Venn diagram of unique proteoforms (at 1% FDR). (C) Mass distribution of proteoforms identified using the two kits with almost identical profiles. (D) Comparison of most prominent functional categories of some of the protein accessions uniquely identified by High Select or ProteoExtract.

Notably, such a discrepancy is not explained in terms of proteoform mass bias, since the mass distributions of proteoforms identified using the two kits are virtually identical (Figure 5C). Due to the above-illustrated complexity of the serum proteome and the stochastic nature of precursor selection in data-dependent acquisition, minor discrepancies across data sets are to be expected. However, a more detailed analysis of the lists of UniProt accession numbers identified uniquely in one of the two data sets suggests that minor differential protein enrichment may have occurred as a consequence of the use of the two different HAP depletion kits. Specifically, GO analysis on the 43 UniProt accession numbers unique for the samples depleted with High Select reveals that most of these proteins are hydrophilic, with several being glycosylated or generally polar. Conversely, some of the 45 UniProt accession numbers unique to the ProteoExtract kit are enriched in two major protein groups: those including immunoglobulin-like domains and protein kinases (Figure 5D). This may imply that the ProteoExtract kit has potentially a lower affinity and/or binding capacity than the High Select one for antibody chains and other immunoglobulin-like proteins when compared to the High Select option. Grand average of hydropathy (GRAVY) analysis performed on both lists of unique UniProt accessions confirms that the average hydrophobicity of the proteins identified using the High Select is slightly lower than in the case of the ProteoExtract kit (i.e., the hydrophobic proteins being 9.3% and 11.3% in High Select and ProteoExtract, respectively); however, no statistically significant difference is observed when comparing the two sets of GRAVY values (Figure S7). This seemingly confirms that the observed difference in depletion selectivity is likely to be attributed to unintentional, nonspecific enrichment of proteins characterized by particular domains or post-translational modification. However, while some of the proteins potentially enriched by each HAP depletion kit are represented by multiple proteoforms (this is the case of proteins included in the “Ig-like domains” category, for instance), others are represented by single proteoforms, as in the case of proteins linked to the “kinase” category enriched by the use of the ProteoExtract kit. Additionally, several of the functional categories into which the GO analysis grouped the two groups of proteins uniquely identified by each kit were the same. All considered, the actual differences in performance between the two tested depletion kits seem to be limited.

The Relevance of Detergent Cleanup Methods

Since the analysis of the initially collected 44 RAW files demonstrated that the majority of identifiable proteoforms have a mass < 15 kDa, we benchmarked also SDS cleanup methods that are known to outperform the traditional MeOH/CHCl3/H2O precipitation method in the recovery of small proteins. A separate set of serum samples was subjected to HAP depletion with the High Select kit and subsequent PEPPI-MS fractionation prior to LC-FAIMS-MS2 analysis (details in Table S3). SDS was removed from samples using the MeOH/CHCl3/H2O procedure, a commercial HiPRR detergent removal cartridge, and the anion exchange method described by Takemori et al.41 The summary of results is displayed in Figure 6, which shows that the MeOH/CHCl3/H2O method is globally outperformed by the other two, with differences that are particularly marked for the smaller proteoforms: while the standard method led to the identification of 87 and 125 proteoforms in the 0–5 and 5–10 kDa mass bins, respectively, the HiPRR cartridge produced 186 and 223 proteoform identifications (an increase of +114% and +78%, respectively), and the anion exchange method returned 316 (+263%) and 308 (+146%) proteoform IDs.

Figure 6.

Figure 6

Summary of the comparison of three SDS removal methods. (A) Global count of identified UniProt accession numbers and unique proteoforms. (B) Mass distribution of identified proteoforms. (C) Venn diagram of identified UniProt accession numbers. (D) Venn diagram of identified proteoforms. Results of the three cleanup methods are color-coded based on the legend included in the figure.

These observations are in line with what was reported by Takemori et al. and suggest that future TDP studies of human serum or plasma may be based on different SDS removal protocols depending on the targeted proteoform mass range. Finally, it is important to underline that while the anion exchange cleanup protocol produced the highest number of proteoform identifications < 15 kDa, it is by far the most complex and time-consuming of the three tested methods.

Brief Considerations on Serum Biomarkers

Ideally, increasing the number of proteoforms identified from human serum or plasma could lead to new biomedical insights or the ability to perform more advanced clinical analyses. Therefore, to evaluate the applicability of TDP to biomarker identification, we considered a list of validated serum and plasma biomarkers compiled by Anderson in 2010.50 The list includes a total of 109 biomarkers approved by the FDA throughout 2008, in addition to the other 96 protein targets used in various clinical assays in the USA (Table S4). Importantly, while only a small fraction of these proteins have masses < 30 kDa (e.g., just 34% of the FDA-approved ones), TDP also detects truncation products of larger proteins, as previously implied. Figure 7 and Figure S8 show that the list of UniProt accession numbers globally identified in this study includes a total of 20 of the 109 FDA-approved biomarkers and 30 of the extended list.

Figure 7.

Figure 7

Biomarkers in the TDP data set. This is a comparison of the proteins (i.e., UniProt accession numbers) from this TDP study against the extended list of plasma biomarkers proposed by Anderson (201 total UniProt accessions without including immunoglobulin chains). (A) Venn diagram showing the biomakers included in the FAIMS and non-FAIMS TDP data sets. (B) Venn diagram differentiating the biomarkers identified by the mean of the High Select and ProteoExtract depletion kits.

GPF is advantageous also for biomarker identification: the LC-FAIMS-MS2 strategy slightly outperforms the standard LC-MS2 acquisition with 29 identified biomarkers versus 25, respectively (Figure 7A). The two depletion kits have overall comparable performance but also show a total of five biomarkers identified by only one of the two kits (Figure 7B). However, it is important to underline that the biomarkers uniquely identified by either of the two depletion kits are represented by 1–4 proteoforms each, and hence we cannot rule out the possibility that some of these differential identifications are to be interpreted not as the product of actual kit-dependent enrichment but rather as an effect of the stochastic nature of data-dependent acquisition (as visible in Figure S2).

Conclusions

Recent advances in mass spectrometry instrumentation, including new ion mobility devices such as FAIMS, and the introduction of sophisticated data acquisition strategies are enabling fast and extensive BUP analysis of the plasma/serum proteome for clinical analysis.51 However, these impressive achievements necessarily lack the possibility of precisely reconstructing the original landscape of proteoforms circulating in patients’ blood, thus limiting the potential for biomarker discovery. The top-down study presented here demonstrates that modern Orbitrap technology coupled with FAIMS allows researchers to investigate a sufficiently large portion of the serum proteome when supported by streamlined sample preparation and protein MW-based fractionation as a viable clinical analysis technique with a relatively higher potential for biomarker discovery. While commonly applied detergent removal strategies such as methanol/chloroform/water precipitation can lead to overall satisfactory results, the present study also demonstrates that alternative cleanup protocols may be beneficial for specifically targeting clinically relevant low molecular weight serum proteoforms.

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jproteome.3c00488.

  • Table S1, list of compensation voltage values for main study analysis; Table S3, list of compensation voltage values for SDS cleanup comparison; Figure S1, total ion chromatograms obtained at different compensation voltages in the LC-FAIMS-MS2 experiments; Figure S2, analysis of technical reproducibility across runs; Figure S3, comparison of the number of identified UniProt accessions and proteoforms as a function of PrSMs per precursor; Figure S4, comparison of top-down and bottom-up results; Figure S5, analysis of post-translational modifications in each data set; Figure S6, analysis of type and frequency of sources of variability found on the five UniProt accessions represented by the highest number of distinct proteoforms; Figure S7, GRAVY index score of serum proteins; Figure S8, comparison of the TDP data sets (UniProt accession numbers) against the list of FDA-approved plasma biomarkers proposed by Anderson (PDF)

  • Table S2, list of identified proteoforms in experiments with and without FAIMS (XLSX)

  • Table S4, extended list of FDA-approved plasma biomarkers according based on Anderson 2010 (XLSX)

Author Contributions

J.T.K. performed research and analyzed data. R.T.F., J.B.G., K.R.D., D.M.H., and S.M.G. contributed to data analysis and data analysis software development. M.W.B., C.L.B., R.H., and J.J.D. contributed to mass spectrometry and FAIMS setup. N.A. codesigned the experiments and performed data analysis. L.F. contributed to data analysis and experimental design and wrote the manuscript. All authors read and approved the final manuscript.

The study was supported by the research grant number R35SGM147397 awarded by the National Institute of General Medical Sciences of the National Institutes of Health to L.F. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

The authors declare the following competing financial interest(s): Michael W. Belford, Cornelia L. Boeser, Romain Huguet, Sylvester M. Greer, David M. Horn, and Jean-Jacques Dunyach are employees of Thermo Fisher Scientific. Ryan T. Fellers, Joseph B. Greer, and Kenneth R. Durbin are part of Proteinaceous, Inc.

Supplementary Material

pr3c00488_si_001.pdf (2.1MB, pdf)
pr3c00488_si_002.xlsx (407.7KB, xlsx)
pr3c00488_si_003.xlsx (14.8KB, xlsx)

References

  1. Smith L. M; Kelleher N. L Proteoform: a single term describing protein complexity. Nat. Methods 2013, 10 (3), 186–187. 10.1038/nmeth.2369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Toby T. K.; Fornelli L.; Kelleher N. L. Progress in Top-Down Proteomics and the Analysis of Proteoforms. Annu. Rev. Anal Chem. (Palo Alto Calif) 2016, 9 (1), 499–519. 10.1146/annurev-anchem-071015-041550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. McCool E. N.; Lubeckyj R. A.; Shen X.; Chen D.; Kou Q.; Liu X.; Sun L. Deep Top-Down Proteomics Using Capillary Zone Electrophoresis-Tandem Mass Spectrometry: Identification of 5700 Proteoforms from the Escherichia coli Proteome. Anal. Chem. 2018, 90 (9), 5529–5533. 10.1021/acs.analchem.8b00693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Denisov E.; Damoc E.; Makarov A. Exploring frontiers of orbitrap performance for long transients. Int. J. Mass Spectrom. 2021, 466, 116607 10.1016/j.ijms.2021.116607. [DOI] [Google Scholar]
  5. Baba T.; Ryumin P.; Duchoslav E.; Chen K.; Chelur A.; Loyd B.; Chernushevich I. Dissociation of Biomolecules by an Intense Low-Energy Electron Beam in a High Sensitivity Time-of-Flight Mass Spectrometer. J. Am. Soc. Mass Spectrom. 2021, 32 (8), 1964–1975. 10.1021/jasms.0c00425. [DOI] [PubMed] [Google Scholar]
  6. Ntai I.; LeDuc R. D.; Fellers R. T.; Erdmann-Gilmore P.; Davies S. R.; Rumsey J.; Early B. P.; Thomas P. M.; Li S.; Compton P. D.; Ellis M. J.; Ruggles K. V.; Fenyo D.; Boja E. S.; Rodriguez H.; Townsend R. R.; Kelleher N. L. Integrated Bottom-Up and Top-Down Proteomics of Patient-Derived Breast Tumor Xenografts. Mol. Cell Proteomics 2016, 15 (1), 45–56. 10.1074/mcp.M114.047480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Nesvizhskii A. I.; Aebersold R. Interpretation of shotgun proteomic data: the protein inference problem. Mol. Cell Proteomics 2005, 4 (10), 1419–1440. 10.1074/mcp.R500012-MCP200. [DOI] [PubMed] [Google Scholar]
  8. Brown K. A.; Melby J. A.; Roberts D. S.; Ge Y. Top-down proteomics: challenges, innovations, and applications in basic and clinical research. Expert Rev. Proteomics 2020, 17 (10), 719–733. 10.1080/14789450.2020.1855982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Desiderio C.; D’Angelo L.; Rossetti D. V.; Iavarone F.; Giardina B.; Castagnola M.; Massimi L.; Tamburrini G.; Di Rocco C. Cerebrospinal fluid top-down proteomics evidenced the potential biomarker role of LVV- and VV-hemorphin-7 in posterior cranial fossa pediatric brain tumors. Proteomics 2012, 12 (13), 2158–2166. 10.1002/pmic.201100499. [DOI] [PubMed] [Google Scholar]
  10. Zhang J.; Guy M. J.; Norman H. S.; Chen Y. C.; Xu Q.; Dong X.; Guner H.; Wang S.; Kohmoto T.; Young K. H.; Moss R. L.; Ge Y. Top-down quantitative proteomics identified phosphorylation of cardiac troponin I as a candidate biomarker for chronic heart failure. J. Proteome Res. 2011, 10 (9), 4054–4065. 10.1021/pr200258m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Dong X.; Sumandea C. A.; Chen Y. C.; Garcia-Cazarin M. L.; Zhang J.; Balke C. W.; Sumandea M. P.; Ge Y. Augmented phosphorylation of cardiac troponin I in hypertensive heart failure. J. Biol. Chem. 2012, 287 (2), 848–857. 10.1074/jbc.M111.293258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Iavarone F.; Olianas A.; Patini R.; Gallenzi P.; Di Tonno L.; Desiderio C.; Cabras T.; Manconi B.; Vincenzoni F.; Cordaro M.; Messana I.; Urbani A.; Castagnola M. Top down proteomic analysis of gingival crevicular fluid in deciduous, exfoliating and permanent teeth in children. J. Proteomics 2020, 226, 103890 10.1016/j.jprot.2020.103890. [DOI] [PubMed] [Google Scholar]
  13. Fulcher J. M.; Makaju A.; Moore R. J.; Zhou M.; Bennett D. A.; De Jager P. L.; Qian W. J.; Pasa-Tolic L.; Petyuk V. A. Enhancing Top-Down Proteomics of Brain Tissue with FAIMS. J. Proteome Res. 2021, 20 (5), 2780–2795. 10.1021/acs.jproteome.1c00049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Davis R. G.; Park H. M.; Kim K.; Greer J. B.; Fellers R. T.; LeDuc R. D.; Romanova E. V.; Rubakhin S. S.; Zombeck J. A.; Wu C.; Yau P. M.; Gao P.; van Nispen A. J.; Patrie S. M.; Thomas P. M.; Sweedler J. V.; Rhodes J. S.; Kelleher N. L. Top-Down Proteomics Enables Comparative Analysis of Brain Proteoforms Between Mouse Strains. Anal. Chem. 2018, 90 (6), 3802–3810. 10.1021/acs.analchem.7b04108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Olianas A.; Guadalupi G.; Cabras T.; Contini C.; Serrao S.; Iavarone F.; Castagnola M.; Messana I.; Onali S.; Chessa L.; Diaz G.; Manconi B. Top-Down Proteomics Detection of Potential Salivary Biomarkers for Autoimmune Liver Diseases Classification. Int. J. Mol. Sci. 2023, 24 (2), 959. 10.3390/ijms24020959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Manconi B.; Liori B.; Cabras T.; Vincenzoni F.; Iavarone F.; Lorefice L.; Cocco E.; Castagnola M.; Messana I.; Olianas A. Top-down proteomic profiling of human saliva in multiple sclerosis patients. J. Proteomics 2018, 187, 212–222. 10.1016/j.jprot.2018.07.019. [DOI] [PubMed] [Google Scholar]
  17. Serrao S.; Firinu D.; Olianas A.; Deidda M.; Contini C.; Iavarone F.; Sanna M. T.; Boroumand M.; Amado F.; Castagnola M.; Messana I.; Del Giacco S.; Manconi B.; Cabras T. Top-Down Proteomics of Human Saliva Discloses Significant Variations of the Protein Profile in Patients with Mastocytosis. J. Proteome Res. 2020, 19 (8), 3238–3253. 10.1021/acs.jproteome.0c00207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Wu S.; Brown J. N.; Tolic N.; Meng D.; Liu X.; Zhang H.; Zhao R.; Moore R. J.; Pevzner P.; Smith R. D.; Pasa-Tolic L. Quantitative analysis of human salivary gland-derived intact proteome using top-down mass spectrometry. Proteomics 2014, 14 (10), 1211–1222. 10.1002/pmic.201300378. [DOI] [PubMed] [Google Scholar]
  19. Toby T. K.; Abecassis M.; Kim K.; Thomas P. M.; Fellers R. T.; LeDuc R. D.; Kelleher N. L.; Demetris J.; Levitsky J. Proteoforms in Peripheral Blood Mononuclear Cells as Novel Rejection Biomarkers in Liver Transplant Recipients. Am. J. Transplant 2017, 17 (9), 2458–2467. 10.1111/ajt.14359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Melani R. D.; Gerbasi V. R.; Anderson L. C.; Sikora J. W.; Toby T. K.; Hutton J. E.; Butcher D. S.; Negrao F.; Seckler H. S.; Srzentic K.; Fornelli L.; Camarillo J. M.; LeDuc R. D.; Cesnik A. J.; Lundberg E.; Greer J. B.; Fellers R. T.; Robey M. T.; DeHart C. J.; Forte E.; Hendrickson C. L.; Abbatiello S. E.; Thomas P. M.; Kokaji A. I.; Levitsky J.; Kelleher N. L. The Blood Proteoform Atlas: A reference map of proteoforms in human hematopoietic cells. Science 2022, 375 (6579), 411–418. 10.1126/science.aaz5284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Geyer P. E.; Holdt L. M.; Teupser D.; Mann M. Revisiting biomarker discovery by plasma proteomics. Mol. Syst. Biol. 2017, 13 (9), 942. 10.15252/msb.20156297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Surinova S.; Schiess R.; Huttenhain R.; Cerciello F.; Wollscheid B.; Aebersold R. On the development of plasma protein biomarkers. J. Proteome Res. 2011, 10 (1), 5–16. 10.1021/pr1008515. [DOI] [PubMed] [Google Scholar]
  23. Shen Y.; Kim J.; Strittmatter E. F.; Jacobs J. M.; Camp D. G. 2nd; Fang R.; Tolie N.; Moore R. J.; Smith R. D. Characterization of the human blood plasma proteome. Proteomics 2005, 5 (15), 4034–4045. 10.1002/pmic.200401246. [DOI] [PubMed] [Google Scholar]
  24. Hortin G. L.; Sviridov D. The dynamic range problem in the analysis of the plasma proteome. J. Proteomics 2010, 73 (3), 629–636. 10.1016/j.jprot.2009.07.001. [DOI] [PubMed] [Google Scholar]
  25. Lee P. Y.; Osman J.; Low T. Y.; Jamal R. Plasma/serum proteomics: depletion strategies for reducing high-abundance proteins for biomarker discovery. Bioanalysis 2019, 11 (19), 1799–1812. 10.4155/bio-2019-0145. [DOI] [PubMed] [Google Scholar]
  26. Gianazza E.; Miller I.; Palazzolo L.; Parravicini C.; Eberini I. With or without you - Proteomics with or without major plasma/serum proteins. J. Proteomics 2016, 140, 62–80. 10.1016/j.jprot.2016.04.002. [DOI] [PubMed] [Google Scholar]
  27. Cheon D. H.; Nam E. J.; Park K. H.; Woo S. J.; Lee H. J.; Kim H. C.; Yang E. G.; Lee C.; Lee J. E. Comprehensive Analysis of Low-Molecular-Weight Human Plasma Proteome Using Top-Down Mass Spectrometry. J. Proteome Res. 2016, 15 (1), 229–244. 10.1021/acs.jproteome.5b00773. [DOI] [PubMed] [Google Scholar]
  28. Tran J. C.; Doucette A. A. Gel-eluted liquid fraction entrapment electrophoresis: an electrophoretic method for broad molecular weight range proteome separation. Anal. Chem. 2008, 80 (5), 1568–1573. 10.1021/ac702197w. [DOI] [PubMed] [Google Scholar]
  29. Shen Y.; Liu T.; Tolic N.; Petritis B. O.; Zhao R.; Moore R. J.; Purvine S. O.; Camp D. G.; Smith R. D. Strategy for degradomic-peptidomic analysis of human blood plasma. J. Proteome Res. 2010, 9 (5), 2339–2346. 10.1021/pr901083m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Seckler H. D. S.; Fornelli L.; Mutharasan R. K.; Thaxton C. S.; Fellers R.; Daviglus M.; Sniderman A.; Rader D.; Kelleher N. L.; Lloyd-Jones D. M.; Compton P. D.; Wilkins J. T. A Targeted, Differential Top-Down Proteomic Methodology for Comparison of ApoA-I Proteoforms in Individuals with High and Low HDL Efflux Capacity. J. Proteome Res. 2018, 17 (6), 2156–2164. 10.1021/acs.jproteome.8b00100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Wilkins J. T.; Seckler H. S.; Rink J.; Compton P. D.; Fornelli L.; Thaxton C. S.; LeDuc R.; Jacobs D.; Doubleday P. F.; Sniderman A.; Lloyd-Jones D. M.; Kelleher N. L. Spectrum of Apolipoprotein AI and Apolipoprotein AII Proteoforms and Their Associations With Indices of Cardiometabolic Health: The CARDIA Study. J. Am. Heart Assoc 2021, 10 (17), e019890 10.1161/JAHA.120.019890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Nicolardi S.; van der Burgt Y. E.; Dragan I.; Hensbergen P. J.; Deelder A. M. Identification of new apolipoprotein-CIII glycoforms with ultrahigh resolution MALDI-FTICR mass spectrometry of human sera. J. Proteome Res. 2013, 12 (5), 2260–2268. 10.1021/pr400136p. [DOI] [PubMed] [Google Scholar]
  33. Demus D.; Naber A.; Dotz V.; Jansen B. C.; Bladergroen M. R.; Nouta J.; Sijbrands E. J. G.; Van Hoek M.; Nicolardi S.; Wuhrer M. Large-Scale Analysis of Apolipoprotein CIII Glycosylation by Ultrahigh Resolution Mass Spectrometry. Front Chem. 2021, 9, 678883 10.3389/fchem.2021.678883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Theberge R.; Infusini G.; Tong W.; McComb M. E.; Costello C. E. Top-Down Analysis of Small Plasma Proteins Using an LTQ-Orbitrap. Potential for Mass Spectrometry-Based Clinical Assays for Transthyretin and Hemoglobin. Int. J. Mass Spectrom. 2011, 300 (2–3), 130–142. 10.1016/j.ijms.2010.08.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Takemori A.; Butcher D. S.; Harman V. M.; Brownridge P.; Shima K.; Higo D.; Ishizaki J.; Hasegawa H.; Suzuki J.; Yamashita M.; Loo J. A.; Loo R. R. O.; Beynon R. J.; Anderson L. C.; Takemori N. PEPPI-MS: Polyacrylamide-Gel-Based Prefractionation for Analysis of Intact Proteoforms and Protein Complexes by Mass Spectrometry. J. Proteome Res. 2020, 19 (9), 3779–3791. 10.1021/acs.jproteome.0c00303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Zolotarjova N.; Martosella J.; Nicol G.; Bailey J.; Boyes B. E.; Barrett W. C. Differences among techniques for high-abundant protein depletion. Proteomics 2005, 5 (13), 3304–3313. 10.1002/pmic.200402021. [DOI] [PubMed] [Google Scholar]
  37. Liu G.; Zhao Y.; Angeles A.; Hamuro L. L.; Arnold M. E.; Shen J. X. A novel and cost effective method of removing excess albumin from plasma/serum samples and its impacts on LC-MS/MS bioanalysis of therapeutic proteins. Anal. Chem. 2014, 86 (16), 8336–8343. 10.1021/ac501837t. [DOI] [PubMed] [Google Scholar]
  38. Purves R. W.; Prasad S.; Belford M.; Vandenberg A.; Dunyach J. J. Optimization of a New Aerodynamic Cylindrical FAIMS Device for Small Molecule Analysis. J. Am. Soc. Mass Spectrom. 2017, 28 (3), 525–538. 10.1007/s13361-016-1587-6. [DOI] [PubMed] [Google Scholar]
  39. Gerbasi V. R.; Melani R. D.; Abbatiello S. E.; Belford M. W.; Huguet R.; McGee J. P.; Dayhoff D.; Thomas P. M.; Kelleher N. L. Deeper Protein Identification Using Field Asymmetric Ion Mobility Spectrometry in Top-Down Proteomics. Anal. Chem. 2021, 93 (16), 6323–6328. 10.1021/acs.analchem.1c00402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kaulich P. T.; Cassidy L.; Winkels K.; Tholey A. Improved Identification of Proteoforms in Top-Down Proteomics Using FAIMS with Internal CV Stepping. Anal. Chem. 2022, 94 (8), 3600–3607. 10.1021/acs.analchem.1c05123. [DOI] [PubMed] [Google Scholar]
  41. Takemori A.; Kaulich P. T.; Cassidy L.; Takemori N.; Tholey A. Size-Based Proteome Fractionation through Polyacrylamide Gel Electrophoresis Combined with LC-FAIMS-MS for In-Depth Top-Down Proteomics. Anal. Chem. 2022, 94 (37), 12815–12821. 10.1021/acs.analchem.2c02777. [DOI] [PubMed] [Google Scholar]
  42. Leipert J.; Kaulich P. T.; Steinbach M. K.; Steer B.; Winkels K.; Blurton C.; Leippe M.; Tholey A. Digital Microfluidics and Magnetic Bead-based Intact Proteoform Elution for Quantitative Top-down Nanoproteomics of Single C. elegans Nematodes. Angew. Chem., Int. Ed. 2023, 62, e202301969 10.1002/anie.202301969. [DOI] [PubMed] [Google Scholar]
  43. LeDuc R. D.; Fellers R. T.; Early B. P.; Greer J. B.; Shams D. P.; Thomas P. M.; Kelleher N. L. Accurate Estimation of Context-Dependent False Discovery Rates in Top-Down Proteomics. Mol. Cell Proteomics 2019, 18 (4), 796–805. 10.1074/mcp.RA118.000993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Sherman B. T.; Hao M.; Qiu J.; Jiao X.; Baseler M. W.; Lane H. C.; Imamichi T.; Chang W. DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic Acids Res. 2022, 50 (W1), W216–21. 10.1093/nar/gkac194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Compton P. D.; Zamdborg L.; Thomas P. M.; Kelleher N. L. On the scalability and requirements of whole protein mass spectrometry. Anal. Chem. 2011, 83 (17), 6868–6874. 10.1021/ac2010795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Fornelli L.; Toby T. K. Characterization of large intact protein ions by mass spectrometry: What directions should we follow?. Biochim Biophys Acta Proteins Proteom 2022, 1870 (4), 140758 10.1016/j.bbapap.2022.140758. [DOI] [PubMed] [Google Scholar]
  47. Kline J. T.; Belford M. W.; Huang J.; Greer J. B.; Bergen D.; Fellers R. T.; Greer S. M.; Horn D. M.; Zabrouskov V.; Huguet R.; Boeser C. L.; Durbin K. R.; Fornelli L. Improved Label-Free Quantification of Intact Proteoforms Using Field Asymmetric Ion Mobility Spectrometry. Anal. Chem. 2023, 95 (23), 9090–9096. 10.1021/acs.analchem.3c01534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Ahsan N.; Fornelli L.; Najar F. Z.; Gamagedara S.; Hossan M. R.; Rao R. S. P.; Punyamurtula U.; Bauer A.; Yang Z.; Foster S. B.; Kane M. A. Proteomics evaluation of five economical commercial abundant protein depletion kits for enrichment of diseases-specific biomarkers from blood serum. Proteomics 2023, e2300150 10.1002/pmic.202300150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Han C. L.; Sheng Y. C.; Wang S. Y.; Chen Y. H.; Kang J. H. Serum proteome profiles revealed dysregulated proteins and mechanisms associated with fibromyalgia syndrome in women. Sci. Rep 2020, 10 (1), 12347 10.1038/s41598-020-69271-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Anderson N. L. The clinical plasma proteome: a survey of clinical assays for proteins in plasma and serum. Clin Chem. 2010, 56 (2), 177–185. 10.1373/clinchem.2009.126706. [DOI] [PubMed] [Google Scholar]
  51. Rice S. J.; Belani C. P. Optimizing data-independent acquisition (DIA) spectral library workflows for plasma proteomics studies. Proteomics 2022, 22 (17), e2200125 10.1002/pmic.202200125. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

pr3c00488_si_001.pdf (2.1MB, pdf)
pr3c00488_si_002.xlsx (407.7KB, xlsx)
pr3c00488_si_003.xlsx (14.8KB, xlsx)

Articles from Journal of Proteome Research are provided here courtesy of American Chemical Society

RESOURCES