Abstract
Due to their oftentimes ambiguous nature, phosphopeptide positional isomers can present challenges in bottom-up mass spectrometry-based workflows as search engine scores alone are often not enough to confidently distinguish them. Additional scoring algorithms can remedy this by providing confidence metrics in addition to these search results, reducing ambiguity. Here we describe challenges to interpreting phosphoproteomics data and review several different approaches to determine sites of phosphorylation for both data-dependent and data-independent acquisition-based workflows. Finally, we discuss open questions regarding neutral losses, gas-phase rearrangement, and false localization rate estimation experienced by both types of acquisition workflows and best practices for managing ambiguity in phosphosite determination.
Introduction:
Post-translational modifications (PTMs) are a significant regulatory mechanism of protein activity in cells and can operate on a much faster time scale than gene regulation. The addition of phosphates,1 glycans,2 methyls,3 acetyls,4 and even other proteins5,6 can act as signals, activate or change the function of specific proteins, or mark proteins for degradation. Phosphorylation is a common form of cell signaling PTM, where cascades of kinases phosphorylating other kinases amplify signals from the cell surface to the nucleus. There are estimated to be over 500 unique kinases in the human genome, which phosphorylate over 200,000 unique phosphosites.7 While phosphorylation has been observed naturally on many residues including aspartic acid,8 arginine,9 cysteine,10 and histidine,11,12 the most commonly observed phosphorylation sites in proteomics experiments are serine (S), threonine (T), and tyrosine (Y).
Given the physiological significance of PTMs, it is important to reliably determine which proteins within a sample are modified. While PTMs change a protein’s overall mass and charge, allowing for analysis with techniques like gel electrophoresis,13 liquid chromatography (LC) coupled tandem mass spectrometry (MS/MS) is the preferred technique for monitoring most PTMs.14 Unlike gel electrophoresis, MS/MS provides sequence-level information across a protein which can indicate the amino acid-specific site of the modification. Advancements in search engine and site localization algorithms, as well as data acquisition methods, have made it possible to identify tens of thousands of sites of phosphorylation in a single MS/MS acquisition. The review of these advancements is the focus of this work.
There are three general approaches to analyzing PTMs with LC-MS/MS: top-down, which measures PTMs on intact proteins, bottom-up (sometimes referred to as shotgun proteomics), which uses enzymatic digestion to process proteins into peptides before measurement, and middle-down, which is a hybrid of top-down and bottom-up. Top-down proteomics can identify the number of occupied phosphosites on a protein molecule, but it can sometimes be a challenge to determine the exact sites of phosphorylation without additional experiments.15 In contrast, bottom-up proteomics simplifies identifying sites of phosphorylation by analyzing shorter peptide sequences at the cost of context in multiply-phosphorylated proteins.16 In some experiments17 these techniques are used in conjunction to take advantage of both methods. Middle-down proteomics attempts to address the complications from both of these methods by generating large peptides using restricted enzymatic proteolysis.18,19 While bottom-up methods can typically achieve a higher depth of measured phosphosites, top-down and middle-down make it easier to study interactions between multiple sites of phosphorylation or crosstalk between different types of PTMs.20,21
Bottom-up proteomics methods for measuring phosphopeptides:
Figure 1 outlines a basic workflow of bottom-up proteomics for PTM analysis. As proteomics samples are processed, proteins are typically denatured and digested with an enzyme such as trypsin, which cuts each protein at regular sites. Due to the relatively low abundance of naturally occurring phosphopeptides, samples must first be enriched using techniques such as metal-affinity chromatography,22,23 titanium dioxide,24 strong cation exchange,25 or antibodies26,27 to produce quantifiable amounts of phosphopeptides from the sample. Phosphopeptides are then separated using an LC, ionized using electrospray, and taken into the MS where both precursor and sequence-specific fragment ions are measured. From those measurements, data analysis helps identify peptides and assign specific sites of modification.
Figure 1:

Outline of a standard bottom-up LC-MS/MS phosphoproteomics workflow. (a) Peptide samples are prepared by lysis, reduction, alkylation, and enzymatic digestion. (b) After digestion, phosphopeptides are enriched and prepared for LC-MS/MS measurement. (c) Peptides are chromatographically separated to reduce interference from competing signals and improve mass spectrum interpretation. (d) Eluting peptides are ionized and fragmented to produce sequence-level data. (e) Pattern recognition algorithms are used to identify phosphopeptides and fragmentation data is interpreted to identify the most likely site of phosphorylation in each peptide.
There are several different ways that a precursor ion can be ionized and fragmented, which have additional considerations when analyzing PTMs. While most current experiments use positive ion mode for ionizing phosphopeptides, negative ion mode has historically been a powerful technique for identifying phosphopeptides28 and the combination of both positive and negative ion modes can improve phosphopeptide detection confidence.29 Phosphopeptides are commonly analyzed using resonance collision-induced dissociation (CID)30 or beam-type CID31 (commonly referred to on Thermo instruments by the branded name, HCD32). CID methods produce b- and y-type ions, where beam-type CID undergoes multiple fragmentation events and typically produces long runs of y-ions while resonance CID results in the most energetically favored fragmentation pathways producing both b- and y-type ions with similar efficiency. For PTMs, both approaches of ion generation have advantages and disadvantages for assigning phosphopeptides.33 Additionally, CID methods impart energy into peptides directly through collision,34,35 which can cause the loss of PTMs during fragmentation as neutral losses,36 and complicate their interpretation. An alternative fragmentation approach uses electron transfer dissociation (ETD)37,38 or electron capture dissociation (ECD).39 ETD and ECD use free radicals to make peptides unstable, fragmenting them along the backbone while typically leaving PTMs intact.40 However, these methods typically produces fewer intense fragment ions when compared with CID, which can result in lower numbers of peptide identifications.41 Tradeoffs between different fragmentation techniques suggest that using a combinatorial approach could help better assign phosphopeptides.42,43
Once ions are fragmented, they can be analyzed in a variety of ways. Data-dependent acquisition (DDA)44 uses precursor MS1 measurements to trigger selected (data-dependent) MS2s preferentially on the most abundant peptide signals. Instruments configured to collect DDA measurements typically use a top-N configuration, where the top-N most abundant precursor ions are selected for MS2 acquisition from the most recently collected MS1 spectrum. In general, this cycle is repeated every one to five seconds so that a new MS1 spectrum can indicate peptides that are newly eluting from the HPLC column into the mass spectrometer. In most DDA configurations, peptides elute over many cycles and a process called dynamic exclusion45,46 is employed where recently measured precursor m/zs are put on an exclusion list for a specified duration such that precursors are not repeatedly re-measured in each cycle. As such, most peptides are identified by only a single MS2, allowing for more data acquisition time spent measuring low-abundance peptides. No matter what fragmentation technique is used, database search engines47–52 such as Mascot53 assign numerical scores to each potential peptide-spectrum match (PSM).54 Search engines can be configured to search for variable PTMs,55 which add the mass of modifications to user-specified amino acids.
Data-independent acquisition (DIA)56,57 is an alternate acquisition method where MS2s are collected in a systematic manner regardless of precursor intensities. Systematic measurements from extracted fragment ion chromatograms can improve overall confidence in detecting specific phosphopeptides because each ion is measured multiple times over the elution profile of the peptide. However, precursor masses can be critical to assigning PTMs and peptide-centric DIA search engines58 have to infer the specific precursor mass of a peptide from a wider precursor isolation window. As a result, care must be taken to either associate those fragment ions with precursors using tools like DIA-Umpire59 or search for ions that are specific to the modified peptide. Targeted measurements, such as Parallel reaction monitoring (PRM),60 can be thought of as a subtype of DIA where specific precursors are selected for fragmentation in targeted retention time windows. This approach marries the benefits of DIA with specific precursor information at the cost of measuring only a limited number of peptides per acquisition.
Challenges in interpreting phosphopeptide positional isomers:
The sheer number of kinases61 and prevalence of serines, threonines, and tyrosines means that many proteins can be phosphorylated in multiple locations, where those sites are frequently at neighboring amino acids.62 Modified peptides that differ only in which acceptor amino acid is phosphorylated are isomers because they have the same molecular formula, yet have different structures. As such, knowing the specific site of phosphorylation can be extremely informative, but these positional isomers can be very difficult to differentiate. For example, Figure 2 shows major phosphosites of Human CDK1 (cyclin-dependent kinase 1), an important regulator of the cell cycle by controlling cell division.63 The three most commonly-cited phosphosites (T14, Y15, and Y19) all fall within the same tryptic peptide: IGEGTYGVVYK, which has exact conservation across human, mouse, rat, chicken, and fruit flies, with nearly identical homology in yeast (VGEGTYGVVYK). Each of these amino acids serves the same inhibitory function but differs in what upstream kinases phosphorylate them. T14 and Y15 can both be phosphorylated by MYT1,64 while Y15 can also be phosphorylated by PKC65 and WEE1.66 Little is understood about kinases that target Y19. Palou et al62 observed that the SWE1 ortholog of WEE1 phosphorylates Y19 in S. cerevisiae, although this finding should be viewed skeptically as tyrosine phosphorylation signaling is rare or nonexistant in yeast67 underlining the need to validate antibody-based phosphorylation assays with mass spectrometry.68 Regardless, phosphorylation at each site produces the same CDK1 inhibition, yet observing each site indicates different upstream biology.69
Figure 2:

CDK phosphosites with at least 10 literature references. Highlighted phosphosites represent well-documented positional isomers. These positional isomers may serve similar functions and be located adjacent to one another, such as inhibition (T14, Y14, Y19), but can differ in what kinases can phosphorylate them. Phosphosite reference counts were derived from PhosphoSitePlus.79
The easiest and most confident way to know the localization of peptides is to add stable isotope-labelled synthetic phosphopeptide standards to each sample.70,71 However, the cost and scale of generating a massive library of synthetic phosphopeptide standards remains impractical,72,73 requiring computational methods to assign sites of phosphorylation. Due to the prevalence of positional isomers within the proteome, the use of standard search engine scores is often not enough to confidently identify isoforms.74 That said, simply considering the difference between the top two highest-scoring peptides can be an effective approach to localization. This concept was first considered in 1995 when Yates et al55 used deltaCn, or the normalized difference between the best and second-best cross-correlation scores to identify phosphopeptides. The Mascot delta score (MD-score) approach75 similarly considers the top two highest scores of a Mascot search, where the difference in Ions Scores relates to the separation between the quality of ions assigned to the first and second best peptides considered for that PSM. Since positional isomers have the same precursor mass and many of the same fragment ions, generally both the first and second-best peptides considered for a PSM are different positional isomers of the same peptide. As such, the MD-score frequently becomes a measure of how much better the top positional isomer scores above the other isomeric forms. Similarly, the SLIP score76 for Protein Prospector77,78 computes the difference between expectation values, analogous to probability values adjusted for the number of precursors considered within a given mass tolerance.
While search engine scores consider the presence or absence of every potential sequence-specific fragment ion, only some ions change between positional isomers. Figure 3 shows a spectrum from the Phosphopedia database80 associated with each singly-phosphorylated form of the IGEGTYGVVYK peptide from CDK1 discussed previously in Figure 2. Notably, each positional isomer has the same precursor mass, but different fragmentation patterns. For example, y2 to y5 (as well as b6 to b10) fragment ions are mass-shifted between the pY15 and pY19 isomers. These peaks are called site-determining ions and can be used to differentiate the isoforms of a specific peptide. In some cases, only a few site-determining ions are produced, such as only y6 and b5 differentiating between pT14 and pY15. Notably, the overall relative intensities of the fragmentation patterns,34 as well as the LC retention times,80 can differ between serine/threonine and tyrosine phosphorylated peptides, which can be used as an additional source of localization information. However, serine and threonine phosphorylated peptides generally have more similar relative fragmentation patterns and retention times due to the closeness of the chemistries of those amino acids. Spectra are considered “ambiguous” if no site-determining ions are present or if those ions cannot be distinguished from noise.81
Figure 3:

Resonance CID MS2 spectra from the CDK1 peptide IGEGTYGVVYK from the Phosphopedia library.80 Each spectrum corresponds to one of three positional isomers. Peaks are colored based on what isomer they can be used to identify. (a) The pT14 isomer; pink b/y-type ions uniquely identify this positional isomer. (b) The pY15 isomer; this particular species does not possess any unique site-determining ions and can only be identified in comparison to pT14 or pY19, with C-terminal ions shared with pT14 (blue) and N-terminal ions shared with pY19 (green). (c) The pY19 isomer; orange b/y-type ions uniquely identify this positional isomer.
When a phosphopeptide has only two acceptor sites, each positional isomer has an equal number of unique ions that could distinguish that isomer. In peptides with more than two acceptor sites, the internal acceptor sites will not have any unique site-determining ions that can be used to definitively identify it. For instance, in the peptide IGEGTYGVVYK, the pT14 isomer can be distinguished from both pY15 and pY19 via the unique b5 and y6 fragment ions. Similarly, pY19 can be distinguished from both pT14 and pY15 by the isomer-specific b6–b9 and y2–y5 fragment ions. In contrast, the phosphopeptide pY15 shares the shifted/unshifted C-terminus fragments with pT14 and the shifted/unshifted N-terminus fragments with pY19. This can create an interpretation challenge if the pT14 and pY19 isomers were to co-elute, where pY15 can only be confidently identified if the other isomers can be ruled out. As such, most approaches to identifying the site of phosphorylation consider the delta between the best and second-best matches. In this case, the pY15 isomer can be localized compared to pT14 or pY19, but not both simultaneously. If the delta between the top and second-best match is sufficiently large, that result would imply that any further result (third-best species and so forth) would not score better than the second-best match, allowing them to be discounted from consideration.
The variety and scope of positional isomers in phosphopeptide-enriched datasets are most easily visualized by considering precursor-extracted ion chromatograms across retention time. Figure 4a presents a three-dimensional landscape of precursor intensities in a human phosphopeptide-enriched sample downloaded from the MassIVE repository (ProteomeXchange dataset PXD042974). By focusing on a narrow 800.5 to 803 m/z range from 15 to 26 minutes, this plot shows the isotopic structure of peptides in this range, indicating charge, as well as peptides with the same precursor mass and isotopic pattern, suggesting the same molecular formula. Three peptides have precursor mass patterns suggesting positional isomers, where the yellow peptide was identified as singly phosphorylated SRTESITATSPASMVGGKPGSFR from IRS1. Based on the precursor data, this peptide has two positional isomers. In Figure 4b and the corresponding blowup Figure 4c, the ambiguous isomer (pS307|pT309) is identified by the blue ions while the ambiguous isomer (pT311|pS312) is identified by the red ions, but pS307 and pT309 as well as pT311 and pS312 cannot be uniquely distinguished from each other due to missing ions. In situations with localization ambiguity, existing literature can act as a powerful tool82 where pS307 and pS312 have been measured in 64 and 87 studies, respectively, from data in PhosphositePlus. Meanwhile, the other acceptor sites in SRTESITATSPASMVGGKPGSFR have only been observed in a handful of high-throughput studies. This result suggests the presence of IRS1 phosphorylation at S307 and S312, which are both phosphorylated by p70-S6 kinase.83 The purple peptide (m/z=801.733, z=3) and red peptide (m/z=800.867, z=2) are unassigned but show similar precursor patterns.
Figure 4:

Positional isomers observed in a human phosphopeptide-enriched sample. (a) A three-dimensional precursor intensity landscape, where highlighted precursors are suggestive of positional isomers producing multiple peaks with the same m/z value and isotopic distribution, but at different retention times. The yellow peak corresponds to the singly phosphorylated IRS1 peptide SRTESITATSPASMVGGKPGSFR while the red and purple peaks are unassigned. If the retention time between these peaks is not sufficiently different enough, dynamic exclusion in DDA may cause only one isoform to be analyzed. (b) Extracted chromatograms of the peptide SRTESITAPASMVGGKPGSFR, where site-determining ions specific to the ambiguous SRTE(pSIT)ATSPASMVGGKPGSFR (e.g., either pS307 or pT309) are highlighted in red and site-determining ions belonging to SRTESITA(TpS)PASMVGGKPGSFR (e.g., either pT311 or pS312) are highlighted in blue. Ions belonging to both peptides are dashed gray lines. (c) A zoomed-in view of (b) showing lower-intensity peak shapes in greater detail.
While dynamic exclusion is a key tool for measuring low-abundance peptides with DDA, the approach can create complications when analyzing phosphopeptide positional isomers. Because positional isomers have the same molecular formula, they also have the same precursor mass. As such, if those positional isomers elute in close proximity to each other, the precursor mass from analyzing the first eluting positional isomer may still be on the instrument exclusion list during the time the second isomer elutes. In this case, the second positional isomer may never trigger an MS2 spectrum, even if its overall signal is higher than the first isomer.84 Similarly, inherent stochasticity caused by the data-dependent nature of DDA may cause the first isomer to be missed if it is low in abundance, leaving room to collect an MS2 spectrum of the second isomer. Left unchecked, these characteristics of DDA cause a dramatic drop in the reproducibility of DDA-based phosphoproteomics experiments relative to DDA experiments of unmodified peptides.85 Furthermore, positional isomers with the same precursors and nearby elution times can complicate precursor-based quantification and quantification approaches that use “match-between-runs”86 where MS2-based detections in one injection can be transferred to other injections based on retention time and precursor mass matching alone. These factors underline the need for using isomer-specific fragment ions for detecting positional isomers.
Computational approaches to localize phosphates to specific sites in phosphopeptides:
Ascore74 is the earliest method that utilizes site-determining ions to localize phosphate groups. The Ascore algorithm consists of three major steps. First, a set of fragment ions is generated for all known positional isomers from a given sequence. Second, these peaks are allocated into a series of 100 m/z wide bins and a binomial probability scorer is used to generate a Peptide Score for each isoform based on the number of matching peaks. The peak depth, or the ‘n’ most intense peaks within each 100 m/z window where n is an integer from 1 to 10 is determined by the largest delta Peptide Score for a given value of n. The last step is to recalculate the Peptide Score for the two highest scoring isomers, but considering only site-determining ions. In this scorer, the probability P(x) for PSM x is calculated as:
| (Eq. 1) |
where N is the number of possible ions that could differentiate between the two isomers and n is the number of those ions present in the spectrum. Assuming unit resolution, p is set to the peak depth divided by 100 (as the windows are 100 m/z wide). This formulation can be interpreted as the integration of the binomial distribution for the probability that the same or higher number of ions observed could be observed by random chance. Finally, the Peptide Score is set to −10*log10(P(x)) to ensure increasing numbers for more significant matches. The Ascore is the difference between these two site-specific Peptide Scores. This approach for scoring only site-determining ions for localization is demonstrably better than calculating the delta score between first and best-scoring peptides when considering all possible fragment ions, as demonstrated by Ascore’s precision and sensitivity improvements over both the MD-score and deltaCn.74
The fundamental Ascore algorithm using the binomial distribution for calculating localization probabilities has been implemented in several other contexts and tools.87 For example, the Phosphate Localization score (PLscore)88 simplifies the algorithm by utilizing a single peak density and extending the principles of Ascore to InsPecT89 results. PhosphoRS90 expands the idea of peak depth to allow for variable amounts of peaks within each window, thus accounting for an unequal distribution of peaks across m/z windows in a single spectrum. SLoMo (site localization of modifications)91 is designed to allow for ETD/ECD data to be analyzed, while the Cscore (complementary score)42 allows for both ETD/ECD and CID data to be utilized concurrently, increasing the confidence of peptide site localization. PhosCalc92 can be used to analyze MS3 spectra93 in addition to MS2 spectra, which can help assist in CID-based experiments where backbone fragmentation may be poor. Another method for utilizing MS3 in conjunction with MS2 spectra is the PTM score used in MSQuant,94 where localization is performed using the same approach as Ascore but limiting peak depth to four.
Other algorithms have been developed to score site-determining ions without assuming peaks are measured at random in a binomial distribution. For example, LuciPHOr95 considers the likelihood odds ratio that a fragment peak would be assigned correctly or randomly and computes the cumulative sum of the log odds to assess each considered positional isomer, where the confidence of the reported isomer is the delta of the top two permutations. PTMProphet96 uses a Bayesian approach to assess the confidence at each possible site using mixture models. This approach allows for ambiguous results to be returned when the exact site cannot be determined, but it can be limited to specific amino acids. PTMiner97 is an approach for open-searching, where a database search engine is allowed to detect unanticipated modifications considering large mass tolerances. This strategy uses an empirical Bayesian-based approach to iteratively learn the prior probabilities for observing each type of modification on any given amino acid. Similar to the open-searching approach, pSite98 uses a support vector machine to consider the confidence of each individual amino acid in de novo-derived sequences that have been assigned to proteins using sequence alignment.99,100
While Ascore and related algorithms primarily utilize database searching, the use of spectral libraries allows for incorporating sequence-specific features such as intensity values, neutral losses, and unusual fragments into a localization analysis process.101 Recent methods allowed for the development of computationally predicted spectral libraries, where non-phosphorylated spectra are shifted by +80 Da to simulate phosphorylation.102,103 First developed for Orbitraps, this method was expanded upon by Suni et al,101,104 using a two-step process of enzymatic dephosphorylation followed by in silico rephosphorylation and has been extended to ToF platforms.105 Dephosphorylated peptides share many of the same peaks as phosphorylated peptides but are dramatically easier to detect. As such, library searching for “re-phosphorylated” forms of dephosphorylated peptides greatly improves phosphopeptide detection using SpectraST.106 In this approach, localization is performed by recalculating the original deltaDot (delta score of the dot product) for the first and second-best matches that correspond to the same peptide sequence. DeepFLR107 also utilizes library searching to assess site localization where phosphopeptide libraries are built from deep learning-based predictions. It is important to note that library searching localization is based on changes in relative peak intensity, and these algorithms may not require the presence or absence of any site determining ions to localize peptides. A summary of these algorithms can be found in Table 1.
Table 1:
DDA-Based Localization Algorithms
| Name | Category | Summary Description | Reference |
|---|---|---|---|
| Ascore | Binomial | Uses sequence information to predict fragmentation patterns; looks for the presence or absence of site-determining ions. | 74 |
| Cscore | Binomial | Implementation of Ascore that utilizes both ETD and CID data concurrently to increase localization confidence. | 42 |
| DeepFLR | Library Matching | Constructs a spectral library using deep learning; localization scores are calculated using delta cosine similarity. | 107 |
| LuciPHOr | Probabilistic | Computes the likelihood odds that a given peak is matched at random before calculating a cumulative sum of the log odds to represent the entire peptide. The score itself is the delta of the two most likely isoforms. | 95 |
| MD-Score | Delta Score | Difference between the first and second best results from the MASCOT search engine for positional isomers. | 75 |
| PhosCalc | Binomial | Implementation of Ascore that can be applied to both MS2 and MS3 spectra. | 92 |
| PhosphoRS | Binomial | Implementation of Ascore that uses variable peak densities per 100 m/z window to account for unequal peak distribution in a spectrum. | 90 |
| Phosphorylation Localization Score (PLS) | Binomial | Extends Ascore method to InsPecT; uses a single peak density to simplify the scoring process. | 88 |
| PTMiner | Binomial | Designed for high mass-tolerance open searching; uses an empirical Bayesian-based iterative learning strategy for site localization. | 97 |
| PTMProphet | Bayesian | Uses a Bayesian mixture model to evaluate confidence at each possible site. | 96 |
| PTM score (MSQuant) | Binomial | Implementation of Ascore that allows for the analysis of MS3 spectra along with MS2 spectra; only considers the top 4 most intense peaks per 100 m/z window. | 94 |
| pSite | Bayesian | Site-based approach that first evaluates individual site confidence using support vector machines for de novo derived sequences assigned through sequence alignment. | 98 |
| SLIP | Delta Score | Designed to work with Protein Prospector using expectation values. The SLIP score is the difference between the first and second-best expectation values. | 76 |
| SLoMo | Binomial | Implementation of Ascore that allows ETD and ECD data to be used. | 91 |
| Suni et al. | Library Matching | Constructs a spectral library by applying mass shifts to enzymatically dephosphorylated peptide spectra; localization scores are recalculated deltaDot scores from SpectraST using these libraries. | 101 |
Phosphoproteomics analysis methods for data-independent acquisition:
While DDA typically attempts to fragment only a single peptide in each MS2 spectrum, DIA actively generates highly multiplexed MS2 spectra where multiple peptides are co-fragmented at the same time and extracted computationally.108 Specter109 is one approach to deconvolve this data using a non-negative least squares approach to find the simplest linear combination of library spectra that interpret a given window. Given complete data, this approach is able to distinguish highly similar library spectra including phosphopeptide positional isomers. However, this approach uses intensity alone to identify positional isomers, with no consideration for site-determining ions. Time alignment is another method of grouping fragment ion signals. For example, PIQED110 leverages the DIA-Umpire59 tool to match time-extracted fragment ions with specific precursor ions in MS1 spectra to create pseudo-DDA spectra that can be searched using standard search engines. PIQED creates a pipeline that processes pseudo-DDA spectra through search engines, such as Comet,111 using statistical analysis with the TPP,112 including PTMProphet.96
Other approaches to analyzing phosphopeptides measured with DIA are built into DIA-specific processing tools. Inference of peptidoforms (IPF)113 is a Bayesian hierarchical model integrated into the OpenSWATH114 workflow that determines the most likely positional isomer (peptidoform) at a given retention time based on the elution pattern of site-determining ions. Thesaurus84 is built into EncyclopeDIA115 and uses an Ascore-like approach to model the probability of observing site-determining ions by chance. DIA MS2s are typically far more complex than DDA MS2s, and thus it is incorrect to assume the probability of observing an ion by chance is related to the number of peaks considered per m/z window. As a result, Thesaurus constructs a background distribution for each m/z window based on the frequency that any given m/z value is observed. Finally, site localization has been built into Spectronaut through directDIA116,117 using a deconvolution approach similar to DIA-Umpire. A summary of these algorithms can be found in Table 2.
Table 2:
DIA-Based Methods
| Name | Category | Summary Description | Reference |
|---|---|---|---|
| Inference of Peptidoforms (IPF) | Bayesian | Applies a Bayesian hierarchical model to spectral library data to determine the most likely positional isomer or ‘peptidoform’ at a given retention time based on elution pattern. | 113 |
| PIQED | Workflow | Sequences DIA-Umpire to deconvolve DIA data through the creation of pseudo-DDA spectra that can be localized through TPP-compatible methods like PTMProphet. | 110 |
| Spectronaut | Closed Source | Allows for analysis of DIA without libraries through directDIA using deconvolution methods similar to DIA-Umpire. | 117 |
| Thesaurus | Probabilistic | Creates a background distribution using differential frequencies of peptides in a dataset and then calculates probabilities based on whether site-determining ions can be matched in that background distribution by random chance. | 84 |
Estimating false localization rates across datasets:
Several experimental factors, such as data acquisition parameters, chromatography, instrument resolution, peak selection, and even the localization approach used, can cause results between algorithms to vary greatly.118 For normal proteomics experiments, false discovery rate (FDR) estimation can control for false positives within a set of PSMs.119,120 The target-decoy approach uses the score distribution of decoy peptides to model a null distribution from which to evaluate target peptide matches.121 This principle works because the decoy-based null distribution closely approximates incorrect target matches, where: i) all decoy matches are incorrect, ii) all correct matches are from the target database, and iii) incorrect matches are equally likely to be from the target or decoy database.
A similar concept for analyzing the accuracy of site localization algorithms is the false localization rate (FLR). However, the FLR approach cannot make use of standard decoys because they model incorrect peptides, not incorrect localizations.81 One key difference is that incorrect localizations still score similarly to correct peptides with only a few site-determining ions mismatched. As a result, FLR estimations lack a standardized metric, resulting in arbitrary cutoffs being used to evaluate the performance of localization algorithms.118
Chalkley et al.81 assert that the true FLR rate of a dataset can only be truly measured with prior knowledge of correct localizations, which may be modeled with synthetic peptides or singly-phosphorylated peptides. Most current methods of FLR estimation utilize biologically improbable amino acids to represent decoys, such as with PhosphoFLR,122 which considers decoy localizations to alanine and leucine. DeepFLR107 additionally tested an alternate decoy generation approach that randomly swapped a phosphorylated amino acid with an unphosphorylated amino acid, which maintains biologically plausibly modified amino acids at the cost of introducing sequence order changes where the resulting peptide is not necessarily present in the sample. The authors determined that this approach was more effective at estimating the real FLR than moving the phosphate to a randomly selected, biologically improbable amino acid. One possible explanation of this result is that exchanging amino acids is better suited for deep learning prediction tasks, which may account for the improvement in accuracy. However, localization engines that do not use fragment intensities likely will not benefit from this technique and may produce worse results due to introducing sequence alterations.
Other methods for FLR estimation are integrated into localization algorithms themselves. SLIP scoring76 takes advantage of amino acid distribution and constructs decoys from glutamine and proline localizations, which are residues commonly observed near actual phosphosites but may vary from dataset to dataset. LuciPHOr95 uses a Bayesian approach similar to PeptideProphet123 to consider score distributions of S/T/Y localizations to other amino acids. PTMProphet96 computes probabilities for each potential site of phosphorylation and calculates the localization mean best probability as a reasonable estimate for FLR.
Benchmarking studies can also be helpful in validating FLR estimates between algorithms. Locard-Paulet et al.124 exhaustively evaluated the performance of 22 different proteomic pipelines using synthetic phosphopeptide data with known phosphosites. These pipelines were composed of search engines, localization algorithms, and validation strategies embedded within different phosphoproteomics software tools. The direct comparison of FLR used in this study serves as a useful resource for adapting the score thresholds between pipelines, helping to mitigate the variation in FLR between algorithms at the same threshold.
Open questions when analyzing phosphoproteomics datasets:
Regardless of the data acquisition method used, several key challenges still persist within the site localization process. The use of resonance or beam-type CID-derived fragmentation techniques produces b- and y-ions for site identification,118 however, this technique creates additional dataset-complicating ions. These new ions include neutral losses and gas-phase re-arrangements, which may result in failure to detect or correctly localize the phosphosite.125 Efforts to incorporate and adjust for these complications within scoring techniques likewise face similar challenges, particularly with regard to creating a standardized method of reporting false localization rates (FLRs).81
In CID-based mass spectrometry, the phosphate group of a phosphopeptide can often be lost in the form of phosphoric acid (H3PO4) in positive ion mode or metaphosphoric acid (HPO3) in negative ion mode.126,127 Serine and threonine residues undergo these neutral losses through beta elimination at a rapid rate due to the labile nature of the phosphate group,128 while tyrosine is typically unable to undergo neutral losses. However, isobaric tag labeling with tandem mass tags (TMT) has been observed to have phosphotyrosine neutral losses.129 For example, in Figure 3a, the CDK1-pT14 isomer has several neutral loss peaks (y7–98 = 811.435 m/z, y8–98 = 868.456 m/z, y9–98 = 997.499 m/z, and y10–98 = 1054.520 m/z) that are absent in the pY15 and pY19 spectra. These losses can undercut localization by shifting the mass of modified residues since the beta elimination neutral loss of phosphoric acid from phosphorylated serine and phosphorylated threonine produces the same fragment mass as beta elimination of water from an unmodified serine or threonine.118 Some site-localization algorithms still attempt to use phosphate neutral loss ions as site-determining ions, but this has been demonstrated to increase localization error rates.101 Library searching may provide an avenue to consider phosphoric acid neutral losses by leveraging differences in the observed intensity for each ion type.
During the fragmentation process of CID-based experiments, the same gas phase beta-elimination reactions that induce neutral losses can result in the translocation of the phosphate group to a nearby acceptor site.125,130 Gas-phase rearrangements result in chimeric spectra131 that appear to be the product of both isomers but at the elution time of the original isomer. While these gas-phase products are typically low in abundance, it has been estimated that as many as 37% of positional isomers that appear to be identically co-eluting are actually side-products formed in the gas-phase,84 matching estimates from Plaumbo and Reid125 showing that 36% of tested phosphopeptides underwent some degree of site rearrangement. Plaumbo and Reid suggest that CID-based proteomic data should be re-evaluated before making conclusions regarding biological significance, especially if localization software considers the possibility of multiple positional isomers from the same MS2 spectrum. In these cases, positional isomers can be validated by considering known kinase motifs, comparisons to similar data produced using ETD or ECD fragmentation, or by comparison to CID MS2 spectra of synthetic phosphopeptide standards.125 While gas-phase rearrangement is reproducibly common across resonance and beam-type CID instruments, Aguiar et al132 argue that from a practical perspective, it may have a limited impact on phosphopeptide localization. This study considered two combinatorial libraries of singly phosphorylated peptides with two acceptor sites and found that rearrangement products were rarely observed to decrease localization accuracy in +2H MS2 spectra. The authors further argue that using ETD, which does not produce rearrangement products, did not dramatically affect localization confidence in Ascore. While combinatorial libraries contain significantly more phosphopeptides than the pool of phosphopeptides studied by Palumbo and Reid, the limited combinations of chemistry restrain the practical conclusions that can be drawn from this result, leaving the real effects of gas-phase rearrangement on proteomics studies still somewhat unresolved.
Regardless, chimeric spectra containing fragments produced by multiple isomers resulting from gas-phase rearrangement or simply shortened LC gradients are a major challenge for detecting middle positional isomers. Since middle isomers such as IGEGTpYGVVYK from CDK1 (Figure 3) also produce site localizing ions that could be the result of combinations of other isomers (i.e., IGEGpTYGVVYK and IGEGTYGVVpYK), it can be additionally challenging to confidently confirm these sites. In these cases, changes in retention time, fragment intensity, or quantification are critical to confidently validate these peptides. The effective and automated use of these factors to improve site localization, in combination with literature mining or evolutionary conservation to assess the prior probability of observing a phosphosite, remain fruitful computational problems for future researchers.
Conclusion:
The field of phosphoproteomics has enjoyed rapid growth over the last twenty years as proteomics instrumentation and methodologies have improved. While mass spectrometry is a powerful method for detecting phosphopeptides, challenges remain regarding determining the exact sites of phosphorylation, which are needed to improve our understanding of the biological context surrounding these modifications. New approaches to making mass spectrometry measurements with data-independent acquisition are improving our ability to detect and quantify phosphosites. Despite these advances, neutral losses, gas-phase rearrangements, and lack of standard FLR metrics remain open challenges in both DDA-based and DIA-based proteomic workflows. Representing ambiguity in incomplete phosphosite localization, both in the literature as well as in databases and data standard formats such as mzIdentML,133,134 will be critical to our ability to interpret and reuse results from phosphoproteomics studies.
Acknowledgments:
We thank Damien B. Wilburn for insightful discussions. This research was supported by NIH funded R35GM150723, R01GM133981, and R21CA267394 to BCS.
Footnotes
Competing interests:
The authors declare the following competing interests: BCS is a founder and shareholder in Proteome Software, which operates in the field of proteomics.
References:
- (1).Mann M, Ong SE, Grønborg M, Steen H, Jensen ON, and Pandey A (2002) Analysis of protein phosphorylation using mass spectrometry: deciphering the phosphoproteome. Trends Biotechnol. 20, 261–268. [DOI] [PubMed] [Google Scholar]
- (2).Hart GW, Housley MP, and Slawson C (2007) Cycling of O-linked beta-N-acetylglucosamine on nucleocytoplasmic proteins. Nature 446, 1017–1022. [DOI] [PubMed] [Google Scholar]
- (3).Rathert P, Dhayalan A, Ma H, and Jeltsch A (2008) Specificity of protein lysine methyltransferases and methods for detection of lysine methylation of non-histone proteins. Mol. Biosyst 4, 1186–1190. [DOI] [PubMed] [Google Scholar]
- (4).Zhao S, Xu W, Jiang W, Yu W, Lin Y, Zhang T, Yao J, Zhou L, Zeng Y, Li H, Li Y, Shi J, An W, Hancock SM, He F, Qin L, Chin J, Yang P, Chen X, Lei Q, Xiong Y, and Guan K-L (2010) Regulation of cellular metabolism by protein lysine acetylation. Science 327, 1000–1004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (5).Kirkpatrick DS, Denison C, and Gygi SP (2005) Weighing in on ubiquitin: the expanding role of mass-spectrometry-based proteomics. Nat. Cell Biol 7, 750–757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (6).Hendriks IA, and Vertegaal ACO (2016) A comprehensive compilation of SUMO proteomics. Nat. Rev. Mol. Cell Biol 17, 581–595. [DOI] [PubMed] [Google Scholar]
- (7).Cesaro L, and Pinna LA (2020) Prevalence and significance of the commonest phosphorylated motifs in the human proteome: a global analysis. Cell. Mol. Life Sci 77, 5281–5298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (8).Collet JF, Stroobant V, Pirard M, Delpierre G, and Van Schaftingen E (1998) A new class of phosphotransferases phosphorylated on an aspartate residue in an amino-terminal DXDX(T/V) motif. J. Biol. Chem 273, 14107–14112. [DOI] [PubMed] [Google Scholar]
- (9).Fuhrmann J, Schmidt A, Spiess S, Lehner A, Turgay K, Mechtler K, Charpentier E, and Clausen T (2009) McsB is a protein arginine kinase that phosphorylates and inhibits the heat-shock regulator CtsR. Science 324, 1323–1327. [DOI] [PubMed] [Google Scholar]
- (10).Sun F, Ding Y, Ji Q, Liang Z, Deng X, Wong CCL, Yi C, Zhang L, Xie S, Alvarez S, Hicks LM, Luo C, Jiang H, Lan L, and He C (2012) Protein cysteine phosphorylation of SarA/MgrA family transcriptional regulators mediates bacterial virulence and antibiotic resistance. Proc. Natl. Acad. Sci. U. S. A 109, 15461–15466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (11).Fuhs SR, and Hunter T (2017) pHisphorylation: the emergence of histidine phosphorylation as a reversible regulatory modification. Curr. Opin. Cell Biol 45, 8–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Leijten NM, Heck AJR, and Lemeer S (2022) Histidine phosphorylation in human cells; a needle or phantom in the haystack? Nat. Methods 19, 827–828. [DOI] [PubMed] [Google Scholar]
- (13).DeKroon RM, Robinette JB, Osorio C, Jeong JSY, Hamlett E, Mocanu M, and Alzate O (2012) Analysis of protein posttranslational modifications using DIGE-based proteomics. Methods Mol. Biol 854, 129–143. [DOI] [PubMed] [Google Scholar]
- (14).Witze ES, Old WM, Resing KA, and Ahn NG (2007) Mapping protein post-translational modifications with mass spectrometry. Nat. Methods 4, 798–806. [DOI] [PubMed] [Google Scholar]
- (15).Ge Y, Rybakova IN, Xu Q, and Moss RL (2009) Top-down high-resolution mass spectrometry of cardiac myosin binding protein C revealed that truncation alters protein phosphorylation state. Proc. Natl. Acad. Sci. U. S. A 106, 12658–12663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (16).Siuti N, and Kelleher NL (2007) Decoding protein modifications using top-down mass spectrometry. Nat. Methods 4, 817–821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (17).Borchers CH, Thapar R, Petrotchenko EV, Torres MP, Speir JP, Easterling M, Dominski Z, and Marzluff WF (2006) Combined top-down and bottom-up proteomics identifies a phosphorylation site in stem-loop-binding proteins that contributes to high-affinity RNA binding. Proc. Natl. Acad. Sci. U. S. A 103, 3094–3099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (18).Wu C, Tran JC, Zamdborg L, Durbin KR, Li M, Ahlf DR, Early BP, Thomas PM, Sweedler JV, and Kelleher NL (2012) A protease for “middle-down” proteomics. Nat. Methods 9, 822–824. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Cristobal A, Marino F, Post H, van den Toorn HWP, Mohammed S, and Heck AJR (2017) Toward an Optimized Workflow for Middle-Down Proteomics. Anal. Chem 89, 3318–3325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (20).Hunter T (2007) The age of crosstalk: phosphorylation, ubiquitination, and beyond. Mol. Cell 28, 730–738. [DOI] [PubMed] [Google Scholar]
- (21).van der Laarse SAM, Leney AC, and Heck AJR (2018) Crosstalk between phosphorylation and O-GlcNAcylation: friend or foe. FEBS J. 285, 3152–3167. [DOI] [PubMed] [Google Scholar]
- (22).Andersson L, and Porath J (1986) Isolation of phosphoproteins by immobilized metal (Fe3+) affinity chromatography. Anal. Biochem 154, 250–254. [DOI] [PubMed] [Google Scholar]
- (23).Stensballe A, Andersen S, and Jensen ON (2001) Characterization of phosphoproteins from electrophoretic gels by nanoscale Fe(III) affinity chromatography with off-line mass spectrometry analysis. Proteomics 1, 207–222. [DOI] [PubMed] [Google Scholar]
- (24).Pinkse MWH, Uitto PM, Hilhorst MJ, Ooms B, and Heck AJR (2004) Selective isolation at the femtomole level of phosphopeptides from proteolytic digests using 2D-NanoLC-ESI-MS/MS and titanium oxide precolumns. Anal. Chem 76, 3935–3943. [DOI] [PubMed] [Google Scholar]
- (25).Beausoleil SA, Jedrychowski M, Schwartz D, Elias JE, Villén J, Li J, Cohn MA, Cantley LC, and Gygi SP (2004) Large-scale characterization of HeLa cell nuclear phosphoproteins. Proc. Natl. Acad. Sci. U. S. A 101, 12130–12135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (26).Rush J, Moritz A, Lee KA, Guo A, Goss VL, Spek EJ, Zhang H, Zha X-M, Polakiewicz RD, and Comb MJ (2005) Immunoaffinity profiling of tyrosine phosphorylation in cancer cells. Nat. Biotechnol 23, 94–101. [DOI] [PubMed] [Google Scholar]
- (27).Stokes MP, Farnsworth CL, Moritz A, Silva JC, Jia X, Lee KA, Guo A, Polakiewicz RD, and Comb MJ (2012) PTMScan direct: identification and quantification of peptides from critical signaling proteins by immunoaffinity enrichment coupled with LC-MS/MS. Mol. Cell. Proteomics 11, 187–201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (28).Shou W, Verma R, Annan RS, Huddleston MJ, Chen SL, Carr SA, and Deshaies RJ (2002) Mapping phosphorylation sites in proteins by mass spectrometry, in Methods in Enzymology, pp 279–296. Academic Press. [DOI] [PubMed] [Google Scholar]
- (29).Busman M, Schey KL, Oatis JE, and Knapp DR (1996) Identification of phosphorylation sites in phosphopeptides by positive and negative mode electrospray ionization-tandem mass spectrometry. J. Am. Soc. Mass Spectrom 7, 243–249. [DOI] [PubMed] [Google Scholar]
- (30).Louris JN, Cooks RG, Syka JEP, Kelley PE, Stafford GC, and Todd JFJ (1987) Instrumentation, applications, and energy deposition in quadrupole ion-trap tandem mass spectrometry. Anal. Chem 59, 1677–1685. [Google Scholar]
- (31).Nemeth-Cawley JF, and Rouse JC (2002) Identification and sequencing analysis of intact proteins via collision-induced dissociation and quadrupole time-of-flight mass spectrometry. J. Mass Spectrom 37, 270–282. [DOI] [PubMed] [Google Scholar]
- (32).Olsen JV, Macek B, Lange O, Makarov A, Horning S, and Mann M (2007) Higher-energy C-trap dissociation for peptide modification analysis. Nat. Methods 4, 709–712. [DOI] [PubMed] [Google Scholar]
- (33).Jedrychowski MP, Huttlin EL, Haas W, Sowa ME, Rad R, and Gygi SP (2011) Evaluation of HCD- and CID-type fragmentation within their respective detection platforms for murine phosphoproteomics. Mol. Cell. Proteomics 10, M111.009910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (34).DeGnore JP, and Qin J (1998) Fragmentation of phosphopeptides in an ion trap mass spectrometer. J. Am. Soc. Mass Spectrom 9, 1175–1188. [DOI] [PubMed] [Google Scholar]
- (35).Sleno L, and Volmer DA (2004) Ion activation methods for tandem mass spectrometry. J. Mass Spectrom 39, 1091–1112. [DOI] [PubMed] [Google Scholar]
- (36).Lehmann WD, Krüger R, Salek M, Hung C-W, Wolschin F, and Weckwerth W (2007) Neutral loss-based phosphopeptide recognition: a collection of caveats. J. Proteome Res 6, 2866–2873. [DOI] [PubMed] [Google Scholar]
- (37).Syka JEP, Coon JJ, Schroeder MJ, Shabanowitz J, and Hunt DF (2004) Peptide and protein sequence analysis by electron transfer dissociation mass spectrometry. Proc. Natl. Acad. Sci. U. S. A 101, 9528–9533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (38).Mechref Y (2012) Use of CID/ETD mass spectrometry to analyze glycopeptides. Curr. Protoc. Protein Sci Chapter 12, 12.11.1–12.11.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (39).Zubarev RA, Kelleher NL, and McLafferty FW (1998) Electron Capture Dissociation of Multiply Charged Protein Cations. A Nonergodic Process. J. Am. Chem. Soc 120, 3265–3266. [DOI] [PubMed] [Google Scholar]
- (40).Hart-Smith G (2014) A review of electron-capture and electron-transfer dissociation tandem mass spectrometry in polymer chemistry. Anal. Chim. Acta 808, 44–55. [DOI] [PubMed] [Google Scholar]
- (41).Molina H, Matthiesen R, Kandasamy K, and Pandey A (2008) Comprehensive comparison of collision induced dissociation and electron transfer dissociation. Anal. Chem 80, 4825–4835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (42).Hansen TA, Sylvester M, Jensen ON, and Kjeldsen F (2012) Automated and high confidence protein phosphorylation site localization using complementary collision-activated dissociation and electron transfer dissociation tandem mass spectrometry. Anal. Chem 84, 9694–9699. [DOI] [PubMed] [Google Scholar]
- (43).Vandenbogaert M, Hourdel V, Jardin-Mathé O, Bigeard J, Bonhomme L, Legros V, Hirt H, Schwikowski B, and Pflieger D (2012) Automated phosphopeptide identification using multiple MS/MS fragmentation modes. J. Proteome Res 11, 5695–5703. [DOI] [PubMed] [Google Scholar]
- (44).Stahl DC, Swiderek KM, Davis MT, and Lee TD (1996) Data-controlled automation of liquid chromatography/tandem mass spectrometry analysis of peptide mixtures. J. Am. Soc. Mass Spectrom 7, 532–540. [DOI] [PubMed] [Google Scholar]
- (45).Davis MT, Spahr CS, McGinley MD, Robinson JH, Bures EJ, Beierle J, Mort J, Yu W, Luethy R, and Patterson SD (2001) Towards defining the urinary proteome using liquid chromatography-tandem mass spectrometry. II. Limitations of complex mixture analyses. Proteomics 1, 108–117. [DOI] [PubMed] [Google Scholar]
- (46).Kohli BM, Eng JK, Nitsch RM, and Konietzko U (2005) An alternative sampling algorithm for use in liquid chromatography/tandem mass spectrometry experiments. Rapid Commun. Mass Spectrom 19, 589–596. [DOI] [PubMed] [Google Scholar]
- (47).Eng JK, McCormack AL, and Yates JR (1994) An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom 5, 976–989. [DOI] [PubMed] [Google Scholar]
- (48).Craig R, and Beavis RC (2004) TANDEM: matching proteins with tandem mass spectra. Bioinformatics 20, 1466–1467. [DOI] [PubMed] [Google Scholar]
- (49).Tabb DL, Fernando CG, and Chambers MC (2007) MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis. J. Proteome Res 6, 654–661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (50).Cox J, Neuhauser N, Michalski A, Scheltema RA, Olsen JV, and Mann M (2011) Andromeda: a peptide search engine integrated into the MaxQuant environment. J. Proteome Res 10, 1794–1805. [DOI] [PubMed] [Google Scholar]
- (51).Bern M, Kil YJ, and Becker C (2012) Byonic: advanced peptide and protein identification software. Curr. Protoc. Bioinformatics Chapter 13, 13.20.1–13.20.14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (52).Kim S, and Pevzner PA (2014) MS-GF+ makes progress towards a universal database search tool for proteomics. Nat. Commun 5, 5277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (53).Perkins DN, Pappin DJ, Creasy DM, and Cottrell JS (1999) Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20, 3551–3567. [DOI] [PubMed] [Google Scholar]
- (54).Eng JK, Searle BC, Clauser KR, and Tabb DL (2011) A face in the crowd: recognizing peptides through database search. Mol. Cell. Proteomics 10, R111.009522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (55).Yates JR 3rd, Eng JK, McCormack AL, and Schieltz D (1995) Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. Anal. Chem 67, 1426–1436. [DOI] [PubMed] [Google Scholar]
- (56).Venable JD, Dong M-Q, Wohlschlegel J, Dillin A, and Yates JR (2004) Automated approach for quantitative analysis of complex peptide mixtures from tandem mass spectra. Nat. Methods 1, 39–45. [DOI] [PubMed] [Google Scholar]
- (57).Gillet LC, Navarro P, Tate S, Röst H, Selevsek N, Reiter L, Bonner R, and Aebersold R (2012) Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol. Cell. Proteomics 11, O111.016717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (58).Ting YS, Egertson JD, Payne SH, Kim S, MacLean B, Käll L, Aebersold R, Smith RD, Noble WS, and MacCoss MJ (2015) Peptide-Centric Proteome Analysis: An Alternative Strategy for the Analysis of Tandem Mass Spectrometry Data. Mol. Cell. Proteomics 14, 2301–2307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (59).Tsou C-C, Avtonomov D, Larsen B, Tucholska M, Choi H, Gingras A-C, and Nesvizhskii AI (2015) DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics. Nat. Methods 12, 258–64, 7 p following 264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (60).Peterson AC, Russell JD, Bailey DJ, Westphall MS, and Coon JJ (2012) Parallel reaction monitoring for high resolution and high mass accuracy quantitative, targeted proteomics. Mol. Cell. Proteomics 11, 1475–1488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (61).Manning G, Whyte DB, Martinez R, Hunter T, and Sudarsanam S (2002) The protein kinase complement of the human genome. Science 298, 1912–1934. [DOI] [PubMed] [Google Scholar]
- (62).Schweiger R, and Linial M (2010) Cooperativity within proximal phosphorylation sites is revealed from large-scale proteomics data. Biol. Direct 5, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (63).Kalous J, Jansová D, and Šušor A (2020) Role of Cyclin-Dependent Kinase 1 in Translational Regulation in the M-Phase. Cells 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (64).Mueller PR, Coleman TR, Kumagai A, and Dunphy WG (1995) Myt1: a membrane-associated inhibitory kinase that phosphorylates Cdc2 on both threonine-14 and tyrosine-15. Science 270, 86–90. [DOI] [PubMed] [Google Scholar]
- (65).LaGory EL, Sitailo LA, and Denning MF (2010) The protein kinase Cdelta catalytic fragment is critical for maintenance of the G2/M DNA damage checkpoint. J. Biol. Chem 285, 1879–1887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (66).Szmyd R, Niska-Blakie J, Diril MK, Renck Nunes P, Tzelepis K, Lacroix A, van Hul N, Deng L-W, Matos J, Dreesen O, Bisteau X, and Kaldis P (2019) Premature activation of Cdk1 leads to mitotic events in S phase and embryonic lethality. Oncogene 38, 998–1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (67).Gnad F, de Godoy LMF, Cox J, Neuhauser N, Ren S, Olsen JV, and Mann M (2009) High-accuracy identification and bioinformatic analysis of in vivo protein phosphorylation sites in yeast. Proteomics 9, 4642–4652. [DOI] [PubMed] [Google Scholar]
- (68).Mazhar S, Leonard D, Sosa A, Schlatzer D, Thomas D, and Narla G (2020) Challenges and Reinterpretation of Antibody-Based Research on Phosphorylation of Tyr307 on PP2Ac. Cell Rep. 30, 3164–3170.e3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (69).Palou G, Palou R, Zeng F, Vashisht AA, Wohlschlegel JA, and Quintana DG (2015) Three Different Pathways Prevent Chromosome Segregation in the Presence of DNA Damage or Replication Stress in Budding Yeast. PLoS Genet. 11, e1005468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (70).Keshishian H, McDonald ER 3rd, Mundt F, Melanson R, Krug K, Porter DA, Wallace L, Forestier D, Rabasha B, Marlow SE, Jane-Valbuena J, Todres E, Specht H, Robinson ML, Jean Beltran PM, Babur O, Olive ME, Golji J, Kuhn E, Burgess M, MacMullan MA, Rejtar T, Wang K, Mani DR, Satpathy S, Gillette MA, Sellers WR, and Carr SA (2021) A highly multiplexed quantitative phosphosite assay for biology and preclinical studies. Mol. Syst. Biol 17, e10156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (71).Searle BC, Chien A, Koller A, Hawke D, Herren AW, Kim Kim J, Lee KA, Leib RD, Nelson AJ, Patel P, Ren JM, Stemmer PM, Zhu Y, Neely BA, and Patel B (2023) A Multipathway Phosphopeptide Standard for Rapid Phosphoproteomics Assay Development. Mol. Cell. Proteomics 22, 100639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (72).Barber KW, Muir P, Szeligowski RV, Rogulina S, Gerstein M, Sampson JR, Isaacs FJ, and Rinehart J (2018) Encoding human serine phosphopeptides in bacteria for proteome-wide identification of phosphorylation-dependent interactions. Nat. Biotechnol 36, 638–644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (73).Gassaway BM, Li J, Rad R, Mintseris J, Mohler K, Levy T, Aguiar M, Beausoleil SA, Paulo JA, Rinehart J, Huttlin EL, and Gygi SP (2022) A multi-purpose, regenerable, proteome-scale, human phosphoserine resource for phosphoproteomics. Nat. Methods [DOI] [PMC free article] [PubMed] [Google Scholar]
- (74).Beausoleil SA, Villén J, Gerber SA, Rush J, and Gygi SP (2006) A probability-based approach for high-throughput protein phosphorylation analysis and site localization. Nat. Biotechnol 24, 1285–1292. [DOI] [PubMed] [Google Scholar]
- (75).Savitski MM, Lemeer S, Boesche M, Lang M, Mathieson T, Bantscheff M, and Kuster B (2011) Confident phosphorylation site localization using the Mascot Delta Score. Mol. Cell. Proteomics 10, M110.003830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (76).Baker PR, Trinidad JC, and Chalkley RJ (2011) Modification site localization scoring integrated into a search engine. Mol. Cell. Proteomics 10, M111.008078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (77).Clauser KR, Baker P, and Burlingame AL (1999) Role of accurate mass measurement (+/− 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. Anal. Chem 71, 2871–2882. [DOI] [PubMed] [Google Scholar]
- (78).Chalkley RJ, Baker PR, Huang L, Hansen KC, Allen NP, Rexach M, and Burlingame AL (2005) Comprehensive analysis of a multidimensional liquid chromatography mass spectrometry dataset acquired on a quadrupole selecting, quadrupole collision cell, time-of-flight mass spectrometer: II. New developments in Protein Prospector allow for reliable and comprehensive automatic analysis of large datasets. Mol. Cell. Proteomics 4, 1194–1204. [DOI] [PubMed] [Google Scholar]
- (79).Hornbeck PV, Zhang B, Murray B, Kornhauser JM, Latham V, and Skrzypek E (2015) PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res. 43, D512–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (80).Lawrence RT, Searle BC, Llovet A, and Villén J (2016) Plug-and-play analysis of the human phosphoproteome by targeted high-resolution mass spectrometry. Nat. Methods 13, 431–434. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (81).Chalkley RJ, and Clauser KR (2012) Modification site localization scoring: strategies and performance. Mol. Cell. Proteomics 11, 3–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (82).Lee DCH, Jones AR, and Hubbard SJ (2015) Computational phosphoproteomics: from identification to localization. Proteomics 15, 950–963. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (83).Xu X, Sarikas A, Dias-Santagata DC, Dolios G, Lafontant PJ, Tsai S-C, Zhu W, Nakajima H, Nakajima HO, Field LJ, Wang R, and Pan Z-Q (2008) The CUL7 E3 ubiquitin ligase targets insulin receptor substrate 1 for ubiquitin-dependent degradation. Mol. Cell 30, 403–414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (84).Searle BC, Lawrence RT, MacCoss MJ, and Villén J (2019) Thesaurus: quantifying phosphopeptide positional isomers. Nat. Methods 16, 703–706. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (85).Wolf-Yadlin A, Hautaniemi S, Lauffenburger DA, and White FM (2007) Multiple reaction monitoring for robust quantitative proteomic analysis of cellular signaling networks. Proc. Natl. Acad. Sci. U. S. A 104, 5860–5865. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (86).Lim MY, Paulo JA, and Gygi SP (2019) Evaluating False Transfer Rates from the Match-between-Runs Algorithm with a Two-Proteome Model. J. Proteome Res 18, 4020–4026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (87).Phanstiel DH, Brumbaugh J, Wenger CD, Tian S, Probasco MD, Bailey DJ, Swaney DL, Tervo MA, Bolin JM, Ruotti V, Stewart R, Thomson JA, and Coon JJ (2011) Proteomic and phosphoproteomic comparison of human ES and iPS cells. Nat. Methods 8, 821–827. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (88).Albuquerque CP, Smolka MB, Payne SH, Bafna V, Eng J, and Zhou H (2008) A multidimensional chromatography technology for in-depth phosphoproteome analysis. Mol. Cell. Proteomics 7, 1389–1396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (89).Tanner S, Shu H, Frank A, Wang L-C, Zandi E, Mumby M, Pevzner PA, and Bafna V (2005) InsPecT: identification of posttranslationally modified peptides from tandem mass spectra. Anal. Chem 77, 4626–4639. [DOI] [PubMed] [Google Scholar]
- (90).Taus T, Köcher T, Pichler P, Paschke C, Schmidt A, Henrich C, and Mechtler K (2011) Universal and confident phosphorylation site localization using phosphoRS. J. Proteome Res 10, 5354–5362. [DOI] [PubMed] [Google Scholar]
- (91).Bailey CM, Sweet SMM, Cunningham DL, Zeller M, Heath JK, and Cooper HJ (2009) SLoMo: automated site localization of modifications from ETD/ECD mass spectra. J. Proteome Res 8, 1965–1971. [DOI] [PubMed] [Google Scholar]
- (92).MacLean D, Burrell MA, Studholme DJ, and Jones AM (2008) PhosCalc: a tool for evaluating the sites of peptide phosphorylation from mass spectrometer data. BMC Res. Notes 1, 30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (93).Ulintz PJ, Yocum AK, Bodenmiller B, Aebersold R, Andrews PC, and Nesvizhskii AI (2009) Comparison of MS(2)-only, MSA, and MS(2)/MS(3) methodologies for phosphopeptide identification. J. Proteome Res 8, 887–899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (94).Olsen JV, Blagoev B, Gnad F, Macek B, Kumar C, Mortensen P, and Mann M (2006) Global, in vivo, and site-specific phosphorylation dynamics in signaling networks. Cell 127, 635–648. [DOI] [PubMed] [Google Scholar]
- (95).Fermin D, Walmsley SJ, Gingras A-C, Choi H, and Nesvizhskii AI (2013) LuciPHOr: algorithm for phosphorylation site localization with false localization rate estimation using modified target-decoy approach. Mol. Cell. Proteomics 12, 3409–3419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (96).Shteynberg DD, Deutsch EW, Campbell DS, Hoopmann MR, Kusebauch U, Lee D, Mendoza L, Midha MK, Sun Z, Whetton AD, and Moritz RL (2019) PTMProphet: Fast and Accurate Mass Modification Localization for the Trans-Proteomic Pipeline. J. Proteome Res 18, 4262–4272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (97).An Z, Zhai L, Ying W, Qian X, Gong F, Tan M, and Fu Y (2019) PTMiner: Localization and Quality Control of Protein Modifications Detected in an Open Search and Its Application to Comprehensive Post-translational Modification Characterization in Human Proteome. Mol. Cell. Proteomics 18, 391–405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (98).Yang H, Chi H, Zhou W-J, Zeng W-F, Liu C, Wang R-M, Wang Z-W, Niu X-N, Chen Z-L, and He S-M (2018) pSite: Amino Acid Confidence Evaluation for Quality Control of De Novo Peptide Sequencing and Modification Site Localization. J. Proteome Res 17, 119–128. [DOI] [PubMed] [Google Scholar]
- (99).Searle BC, Dasari S, Turner M, Reddy AP, Choi D, Wilmarth PA, McCormack AL, David LL, and Nagalla SR (2004) High-throughput identification of proteins and unanticipated sequence modifications using a mass-based alignment algorithm for MS/MS de novo sequencing results. Anal. Chem 76, 2220–2230. [DOI] [PubMed] [Google Scholar]
- (100).Han Y, Ma B, and Zhang K (2005) SPIDER: software for protein identification from sequence tags with de novo sequencing error. J. Bioinform. Comput. Biol 3, 697–716. [DOI] [PubMed] [Google Scholar]
- (101).Suni V, Imanishi SY, Maiolica A, Aebersold R, and Corthals GL (2015) Confident site localization using a simulated phosphopeptide spectral library. J. Proteome Res 14, 2348–2359. [DOI] [PubMed] [Google Scholar]
- (102).Hu Y, Li Y, and Lam H (2011) A semi-empirical approach for predicting unobserved peptide MS/MS spectra from spectral libraries. Proteomics 11, 4702–4711. [DOI] [PubMed] [Google Scholar]
- (103).Hu Y, and Lam H (2013) Expanding tandem mass spectral libraries of phosphorylated peptides: advances and applications. J. Proteome Res 12, 5971–5977. [DOI] [PubMed] [Google Scholar]
- (104).Suni V, Suomi T, Tsubosaka T, Imanishi SY, Elo LL, and Corthals GL (2018) SimPhospho: a software tool enabling confident phosphosite assignment. Bioinformatics 34, 2690–2692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (105).Takai A, Tsubosaka T, Hirano Y, Hayakawa N, Tani F, Haapaniemi P, Suni V, and Imanishi SY (2019) Optimization of TripleTOF spectral simulation and library searching for confident localization of phosphorylation sites. PLoS One 14, e0225885. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (106).Lam H, Deutsch EW, Eddes JS, Eng JK, King N, Stein SE, and Aebersold R (2007) Development and validation of a spectral library searching method for peptide identification from MS/MS. Proteomics 7, 655–667. [DOI] [PubMed] [Google Scholar]
- (107).Zong Y, Wang Y, Yang Y, Zhao D, Wang X, Shen C, and Qiao L (2023) DeepFLR facilitates false localization rate control in phosphoproteomics. Nat. Commun 14, 2269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (108).Pino LK, Just SC, MacCoss MJ, and Searle BC (2020) Acquiring and Analyzing Data Independent Acquisition Proteomics Experiments without Spectrum Libraries. Mol. Cell. Proteomics [DOI] [PMC free article] [PubMed] [Google Scholar]
- (109).Peckner R, Myers SA, Jacome ASV, Egertson JD, Abelin JG, MacCoss MJ, Carr SA, and Jaffe JD (2018) Specter: linear deconvolution for targeted analysis of data-independent acquisition mass spectrometry proteomics. Nat. Methods 15, 371–378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (110).Meyer JG, Mukkamalla S, Steen H, Nesvizhskii AI, Gibson BW, and Schilling B (2017) PIQED: automated identification and quantification of protein modifications from DIA-MS data. Nat. Methods 14, 646–647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (111).Eng JK, Jahan TA, and Hoopmann MR (2013) Comet: an open-source MS/MS sequence database search tool. Proteomics 13, 22–24. [DOI] [PubMed] [Google Scholar]
- (112).Deutsch EW, Mendoza L, Shteynberg D, Farrah T, Lam H, Tasman N, Sun Z, Nilsson E, Pratt B, Prazen B, Eng JK, Martin DB, Nesvizhskii AI, and Aebersold R (2010) A guided tour of the Trans-Proteomic Pipeline. Proteomics 10, 1150–1159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (113).Rosenberger G, Liu Y, Röst HL, Ludwig C, Buil A, Bensimon A, Soste M, Spector TD, Dermitzakis ET, Collins BC, Malmström L, and Aebersold R (2017) Inference and quantification of peptidoforms in large sample cohorts by SWATH-MS. Nat. Biotechnol 35, 781–788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (114).Röst HL, Rosenberger G, Navarro P, Gillet L, Miladinović SM, Schubert OT, Wolski W, Collins BC, Malmström J, Malmström L, and Aebersold R (2014) OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nat. Biotechnol 32, 219–223. [DOI] [PubMed] [Google Scholar]
- (115).Searle BC, Pino LK, Egertson JD, Ting YS, Lawrence RT, MacLean BX, Villén J, and MacCoss MJ (2018) Chromatogram libraries improve peptide detection and quantification by data independent acquisition mass spectrometry. Nat. Commun 9, 5128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (116).Bekker-Jensen DB, Bernhardt OM, Hogrebe A, Martinez-Val A, Verbeke L, Gandhi T, Kelstrup CD, Reiter L, and Olsen JV (2020) Rapid and site-specific deep phosphoproteome profiling by data-independent acquisition without the need for spectral libraries. Nat. Commun 11, 787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (117).Martinez-Val A, Bekker-Jensen DB, Hogrebe A, and Olsen JV (2021) Data Processing and Analysis for DIA-Based Phosphoproteomics Using Spectronaut. Methods Mol. Biol 2361, 95–107. [DOI] [PubMed] [Google Scholar]
- (118).Potel CM, Lemeer S, and Heck AJR (2019) Phosphopeptide Fragmentation and Site Localization by Mass Spectrometry: An Update. Anal. Chem 91, 126–141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (119).Elias JE, and Gygi SP (2007) Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat. Methods 4, 207–214. [DOI] [PubMed] [Google Scholar]
- (120).Käll L, Canterbury JD, Weston J, Noble WS, and MacCoss MJ (2007) Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nat. Methods 4, 923–925. [DOI] [PubMed] [Google Scholar]
- (121).Tabb DL (2008) What’s driving false discovery rates? J. Proteome Res 7, 45–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (122).Ramsbottom KA, Prakash A, Riverol YP, Camacho OM, Martin M-J, Vizcaíno JA, Deutsch EW, and Jones AR (2022) Method for Independent Estimation of the False Localization Rate for Phosphoproteomics. J. Proteome Res 21, 1603–1615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (123).Keller A, Nesvizhskii AI, Kolker E, and Aebersold R (2002) Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. Anal. Chem 74, 5383–5392. [DOI] [PubMed] [Google Scholar]
- (124).Locard-Paulet M, Bouyssié D, Froment C, Burlet-Schiltz O, and Jensen LJ (2020) Comparing 22 Popular Phosphoproteomics Pipelines for Peptide Identification and Site Localization. J. Proteome Res 19, 1338–1345. [DOI] [PubMed] [Google Scholar]
- (125).Palumbo AM, and Reid GE (2008) Evaluation of gas-phase rearrangement and competing fragmentation reactions on protein phosphorylation site assignment using collision induced dissociation-MS/MS and MS3. Anal. Chem 80, 9735–9747. [DOI] [PubMed] [Google Scholar]
- (126).Boersema PJ, Mohammed S, and Heck AJR (2009) Phosphopeptide fragmentation and analysis by mass spectrometry. J. Mass Spectrom 44, 861–878. [DOI] [PubMed] [Google Scholar]
- (127).Palumbo AM, Smith SA, Kalcic CL, Dantus M, Stemmer PM, and Reid GE (2011) Tandem mass spectrometry strategies for phosphoproteome analysis. Mass Spectrom. Rev 30, 600–625. [DOI] [PubMed] [Google Scholar]
- (128).Penkert M, Yates LM, Schümann M, Perlman D, Fiedler D, and Krause E (2017) Unambiguous Identification of Serine and Threonine Pyrophosphorylation Using Neutral-Loss-Triggered Electron-Transfer/Higher-Energy Collision Dissociation. Anal. Chem 89, 3672–3680. [DOI] [PubMed] [Google Scholar]
- (129).Everley RA, Huttlin EL, Erickson AR, Beausoleil SA, and Gygi SP (2017) Neutral Loss Is a Very Common Occurrence in Phosphotyrosine-Containing Peptides Labeled with Isobaric Tags. J. Proteome Res 16, 1069–1076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (130).Palumbo AM, Tepe JJ, and Reid GE (2008) Mechanistic insights into the multistage gas-phase fragmentation behavior of phosphoserine- and phosphothreonine-containing peptides. J. Proteome Res 7, 771–779. [DOI] [PubMed] [Google Scholar]
- (131).Houel S, Abernathy R, Renganathan K, Meyer-Arendt K, Ahn NG, and Old WM (2010) Quantifying the impact of chimera MS/MS spectra on peptide identification in large-scale proteomics studies. J. Proteome Res 9, 4152–4160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (132).Aguiar M, Haas W, Beausoleil SA, Rush J, and Gygi SP (2010) Gas-phase rearrangements do not affect site localization reliability in phosphoproteomics data sets. J. Proteome Res 9, 3103–3107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (133).Jones AR, Eisenacher M, Mayer G, Kohlbacher O, Siepen J, Hubbard SJ, Selley JN, Searle BC, Shofstahl J, Seymour SL, Julian R, Binz P-A, Deutsch EW, Hermjakob H, Reisinger F, Griss J, Vizcaíno JA, Chambers M, Pizarro A, and Creasy D (2012) The mzIdentML data standard for mass spectrometry-based proteomics results. Mol. Cell. Proteomics 11, M111.014381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (134).Seymour SL, Farrah T, Binz P-A, Chalkley RJ, Cottrell JS, Searle BC, Tabb DL, Vizcaíno JA, Prieto G, Uszkoreit J, Eisenacher M, Martínez-Bartolomé S, Ghali F, and Jones AR (2014) A standardized framing for reporting protein identifications in mzIdentML 1.2. Proteomics 14, 2389–2399. [DOI] [PMC free article] [PubMed] [Google Scholar]
