Abstract
The search for clinically useful protein biomarkers using advanced mass spectrometry approaches represents a major focus in cancer research. However, the direct analysis of human samples may be challenging due to limited availability, the absence of appropriate control samples, or the large background variability observed in patient material. As an alternative approach, human tumors orthotopically implanted into a different species (xenografts) are clinically relevant models that have proven their utility in pre-clinical research. Patient derived xenografts for glioblastoma have been extensively characterized in our laboratory and have been shown to retain the characteristics of the parental tumor at the phenotypic and genetic level. Such models were also found to adequately mimic the behavior and treatment response of human tumors. The reproducibility of such xenograft models, the possibility to identify their host background and perform tumor-host interaction studies, are major advantages over the direct analysis of human samples. At the proteome level, the analysis of xenograft samples is challenged by the presence of proteins from two different species which, depending on tumor size, type or location, often appear at variable ratios. Any proteomics approach aimed at quantifying proteins within such samples must consider the identification of species specific peptides in order to avoid biases introduced by the host proteome. Here, we present an in-house methodology and tool developed to select peptides used as surrogates for protein candidates from a defined proteome (e.g., human) in a host proteome background (e.g., mouse, rat) suited for a mass spectrometry analysis. The tools presented here are applicable to any species specific proteome, provided a protein database is available. By linking the information from both proteomes, PeptideManager significantly facilitates and expedites the selection of peptides used as surrogates to analyze proteins of interest.
Keywords: mass spectrometry, targeted proteomics, rodent xenografts, human glioblastoma, mixed samples, automated tool, unique peptide selection
Introduction
Mass spectrometry(MS)-based proteomics provides various approaches (i.e., shotgun, supervised and targeted approaches) (Domon and Aebersold, 2010) in the field of cancer research (Smith, 2012; Deracinois et al., 2013; Marx, 2013) and is nowadays widely used in pre-clinical and clinical investigations (Lee et al., 2011), and for biomarker studies (Li et al., 2011; Meng and Veenstra, 2011; Pan et al., 2012; Waldemarson et al., 2012).
Shotgun proteomics approach is the pipeline followed when biomarker discovery is considered; i.e., protein identification (Eng et al., 1994; Nesvizhskii, 2007) and label-free relative quantification (Bantscheff et al., 2007; Asara et al., 2008; Neilson et al., 2011). Regarding the evaluation (and validation) of biomarker candidates, targeted proteomics is considered for the precise and even the absolute quantification of these candidates (Gallien et al., 2011; Whiteaker et al., 2011; Gillette and Carr, 2013; Marx, 2013). Targeted proteomics is also increasingly used to perform supervised discovery of selected biomarker candidates (Gillette and Carr, 2013; Kim et al., 2013; Marx, 2013; Percy et al., 2014).
In cancer research, the direct analysis of human samples are often hampered because of strong inter-patient variability and limited sample availability. The latter is particularly true for samples requiring invasive sample collection procedures (e.g., biopsies) and is even worse for samples from healthy donors (control samples) (Pesch et al., 2014). Moreover, patient samples are normally limited to one time point and they do not offer the opportunity for a controlled interventional study. To circumvent those restrictions, animal models consisting in the orthotopical implantation of human tumors into animals (xenografts) have proven their utility as relevant models in many studies (Whiteaker et al., 2007; Huszthy et al., 2012; Tang et al., 2012; Klink et al., 2013). In the search for more biomarkers and more effective treatment options against brain tumors, our lab has developed patient derived xenografts for human glioblastoma in immunodeficient mice and rats (Wang et al., 2009). Those animal models have been extensively characterized and shown to retain the characteristics of the parental tumor at the phenotypic and genetic level (Niclou et al., 2008; Rajcevic et al., 2009; Golebiewska et al., 2013). Such models were also found to adequately mimic the behavior and treatment response of human tumors (Keunen et al., 2011). Another major advantage of these models over human samples is their reproducibility and their identifiable host background allowing a detailed analysis of tumor-host interactions. The use of xenograft samples also gives an easy and direct access to control samples exhibiting a more controlled experimental setup compared to human samples.
In bottom-up MS-based proteomics, peptides are generally used as the representative of the proteins either for identification or quantification purposes (Aebersold and Mann, 2003; Chait, 2006). Peptides are identified and monitored via their mass-to-charge ratio and their fragmentation pattern produced under collision induced dissociation (CID) (peptide ion-types, mass-to-charges and relative intensities) (Eng et al., 1994; Nesvizhskii, 2007). Therefore, it is desirable to target a few representative peptides for each protein of interest. The choice of these peptides used as protein surrogates is therefore crucial to unequivocally identify and quantify the protein of interest. A peptide is considered as proteotypic (Kuster et al., 2005; Mallick et al., 2007) when it fulfills two selection criteria: it is a unique representative of a single protein within the proteome of interest and it possesses a good MS detectability (Brownridge and Beynon, 2011). Such peptides allow umambiguous analysis of the targeted protein. However, the selection of proteotypic peptides is often tedious and time-consuming. It has to carefully take into account several essential criteria (Gallien et al., 2011) including uniqueness of peptide sequence, protein digestion efficiency, enzymatical and chemical modifications at the protein or peptide level, physicochemical behavior of the peptide.
In the context of xenograft samples, the use of MS-based targeted (or supervised) proteomics approaches for protein biomarker studies needs to consider the presence of various proteomes from different species. This leads to more complex samples and a more constrained and restricted choice of the surrogate peptides (Figure 1). The tool PeptideManager was built with the purpose of collecting and combining information from different public protein databases [UniProt (SwissProt or TrEMBL or both (UniProt)), RefSeq or IPI] and/or species to facilitate and speed up the surrogate peptide selection compared to the manual selection process. The proposed software is specially designed for the selection of surrogate peptides in the cases involving various species proteomes such as in xenografts. Obviously, it can also be used in the general case when a single proteome is implicated.
General consideration on peptide selection
The separation, detection and characterization by liquid chromatography (LC)-MS/MS of a proteotypic peptide of a given protein allows for confident identification given that the peptide identification is reliable (via the m/z of its precursor and of its fragments). However, for quantitative purposes, the choice of a surrogate peptide for a given protein is not trivial since the sequence uniqueness of the peptide is not the only pre-requisite (Gallien et al., 2011). Indeed, several parameters need to be considered to warrant the selection of peptides that reliably represent the targeted protein at the quantitative level (Lange et al., 2008; Gallien et al., 2011). Such peptides are termed proteotypic (Kuster et al., 2005; Mallick et al., 2007) and are characterized by two major conditions; uniqueness of its sequence within the proteome of interest and a good detectability in LC-MS(/MS). The good detectability of peptides in MS is influenced by its physicochemical properties and other experimental-related parameters, which are described hereafter.
Ideally, the enzymatic digestion of the proteins should not introduce any bias. However, since the yield of an enzymatic reaction is rarely complete, the enzymatic digestion itself introduces some variability across samples (Brownridge and Beynon, 2011; Loziuk et al., 2013). This can be limited in two ways. Firstly, a precise control of the experimental conditions will limit the variability across sample replicates. Secondly, since the digestion efficiency is not identical for all the peptides derived from the same protein, a rational choice of the peptides exhibiting the best cleavage propensities should be performed. The yield of the proteolytic cleavage depends on the protease and on the amino acid composition around the cleavage sites. The cleavage efficiencies can be estimated theoretically for some proteases and various web-tools are freely available to predict which proteolytic peptide bonds are likely to be missed by the standard enzymes [e.g., PeptideCutter web-tool on ExPASy (http://web.expasy.org/peptide_cutter/) (Eyers et al., 2011; Artimo et al., 2012; Lawless and Hubbard, 2012)].
The reduction and the alkylation steps of the cysteines in proteins are intended to make cleavage sites accessible that would otherwise be hindered. Nevertheless, as any chemical reaction, this sample preparation procedure can introduce unpredictable biases because of an incomplete reaction. Therefore, peptides containing amino acids that are prone to chemical modification [e.g., cysteines (incomplete reduction/alkylation) and methionine (partially oxidized)] should be avoided (Bischoff and Schluter, 2012). Correspondingly, amino acids with the potential for enzymatic post-translational modifications (PTMs) should be avoided if the analysis of those modifications is not in the scope of the study. Peptides containing sequence uncertainties (e.g., sequence conflicts between public protein databases) or amino acids showing variability due to single-nucleotide polymorphisms (SNPs) should be discarded as well.
Depending on whether all isoforms or a particular isoform of a given protein are of interest, either the selected peptides should be representative of all the isoforms of the protein or the selected peptides should be representative of only one specific protein isoform.
The mass-to-charge ratio of the selected peptide should comply with the mass range restriction of the mass spectrometer(s) that will be used for the subsequent analyses (e.g., m/z between 400 and 1600 on a triple quadrupole platform). Moreover, very short peptides are less likely to have a unique sequence and they are more susceptible to interferences. On the other hand, lengthy peptides are not desirable due to their hydrophobicity and to issues regarding their synthesis and purification. Typically, peptide ranging from 5 to around 22 amino acids are best suited for quantification purpose (Gallien et al., 2011).
Various tools are currently available for the theoretical or semi-empirical estimation of the proteotypic properties of a peptide sequence including the digestion efficacy according to the cleavage site (Kuster et al., 2005; Mallick et al., 2007; Fusaro et al., 2009; Eyers et al., 2011; Lawless and Hubbard, 2012; Mohammed et al., 2014; Qeli et al., 2014). However, these tools do not handle samples containing proteins from multiple species that therefore require a very time-consuming manual selection of surrogate peptide candidates. The proposed tool was therefore intended to automate this selection step of unique peptide candidates by combining all relevant information present in public protein databases to help the filtering of inappropriate peptides (e.g., peptide sequence present in both proteomes).
The tool PeptideManager was designed to: (1) build and store peptide databases from public repositories, (2) link available information (e.g., PTM sites, SNPs, signal peptide) from public databases at the peptide level, (3) allow queries and peptide pre-selection within those databases, and most importantly (4) perform peptide pre-selection within a given proteome of interest while taking into account the presence of another species proteome within the sample (Figure 2).
Materials and methods
Glioblastoma samples were collected at the Neurosurgery Department of the Center Hospitalier in Luxembourg (CHL) from patients having given their informed consent. Collection and use of patient tumor material has been approved by the National Ethics Committee for Research (CNER) of Luxembourg. All animal procedures were approved by the national authorities responsible for animal experiments in Luxembourg.
Peptide database creation
The peptide selection tool PeptideManager was written in C# language and the GUI (graphical user interface) was designed with Visual Studio Express 2010. The databases were built in the SQLite format (http://www.sqlite.org/about.html). PeptideManager can be run on any computer on which the Microsoft.NET framework is installed. PeptideManager and its user guide are freely available at http://peptidemanager-lrno.sourceforge.net.
The peptide databases (in SQLite format) derived from different public protein databases can be built by PeptideManager, namely UniProt (Uniprot, 2009, 2014; Magrane and Consortium, 2011), RefSeq (Pruitt et al., 2007, 2012, 2014) and IPI (Kersey et al., 2004). The different file formats requested to build a peptide database are indicated in Table 1. Protein sequences are digested in silico by the proteolytic enzyme selected (trypsin, Lys-C, Arg-C or Lys-N). The required information present in the public database is extracted and inserted into the PeptideManager database. If present in the public protein data repository, information about PTMs, sequence conflicts or other sequence related modifications are extracted and inserted at the peptide level in the newly generated database. In order to limit the size of the created peptide database, a minimal peptide length (3–5 amino acids) as well as the number of allowed miscleavages (0–2) can be selected. Once created, the database is indexed in PeptideManager and is immediately available for search requests.
Table 1.
FASTA-like file format that only contains protein sequences and essential protein information.
More complete file format than.faa files. They include additional information such as post-translational modifications (PTMs), single nucleotide polymorphisms (SNPs), etc.
Contrary to similar tools like PeptidePicker (Mohammed et al., 2014), CONSeQuence (Eyers et al., 2011) or even Skyline (Maclean et al., 2010), PeptideManager is solely intended to build a peptide database, to allow search queries within these databases and to help the user to perform unique peptide selection for targeted experiments. The advantages of PeptideManager are to allow the user to define a host/background proteome, to import databases from UniProt, RefSeq and IPI, to extract all the available information (protein features) and to suggest default filters for the selection of unique peptide sequences (with or without the presence of a host/background proteome).
Surrogate peptide selection
As shown in Figure 3, search queries can be performed with protein accession numbers (individually or in batch mode), protein name or peptide sequences (individually or in batch mode). The displayed results can be filtered against unsuitable amino acids or lengths for example. The results can be directly saved in csv file format. When only one species proteome is studied, this type of search request provides all the information needed to select surrogate peptide candidates (unique peptide sequences) for the given protein. Moreover, this type of request can be used to define the uniqueness of a list of detected peptides from experimental evidences and confirm the identification of the related proteins.
The major advantage of PeptideManager is to possess a search function specifically dedicated to peptide selection when various species proteomes are involved (Figure 1). In addition to the previous search request, the proteome of interest and the proteome of the host species (background proteome) can be selected (Figure 4). In this case, the information will be gathered by PeptideManager within the peptide databases of both proteomes. The protein accession number is used to extract all peptide information related to the targeted protein in the database of the proteome of interest. Subsequently, the peptide sequences are searched against the peptide database of the host species proteome to determine the number of occurrences (hits) within the background proteome. In order to monitor/quantify a protein of interest without any bias coming from the host/background proteome, only peptides that are unique sequences (i.e., unique representative of the protein of interest) within the proteome of interest and absent within the host/background proteome should be selected. An example of a result file (csv format) is partly displayed in Figure 4.
As mentioned, the software is not only intended for the selection of unique sequences of a given protein but also to perform any kind of search queries within the databases. In order to keep all the information available for the user, filters are available and can be used/or not according to the purpose of the query. However, in the case of the selection of unique peptide sequences (with or without the presence of a host proteome), it is possible to apply “advised default values” of the filters. A comprehensive user guide is provided as Supplementary Material. This user guide explains the different steps to use correctly the software and includes several case studies in order to exemplify different situations that users could meet, e.g., protein isoform differentiation, post-translational modification monitoring, unique peptide selection with or without the presence of a host/background proteome.
PeptideManager was used for the selection of surrogate peptides to develop SRM (selected reaction monitoring) assays on a triple-quadrupole platform for the targeted analysis of protein biomarker candidates. This was done in the context of human GBM xenografts (mice and rats) and PeptideManager greatly facilitated and expedited the selection of unique peptide from mixed samples involving different species proteomes. Compared to the few proteins per day for which unique peptide selection can be done manually, PeptideManager succeeded in performing the selection of peptide candidates for hundreds of proteins a day.
In order to further validate the selected peptides and/or decrease further the number of peptide candidates, additional information, not present in the public protein databases, such as the enzymatic digestion efficiencies and the LC-MS(/MS) behaviors of the peptide candidates can be used. From the selected peptides, predictive computational tools (Artimo et al., 2012; Lawless and Hubbard, 2012) can be used to filter out peptides arising from poorly efficient cleavage sites that would lead to an erroneous values of peptide/protein amount as well as a decrease in sensitivity. MS-based data from the proteomic community are freely available in public data repositories such as PeptideAtlas (Desiere et al., 2006), GPM Proteomics Database (Craig et al., 2004, 2005) or PRIDE (Vizcaino et al., 2013) that permit to estimate the MS-behaviors of peptide sequence candidates.
In these public databases, proteomics data can be retrieved by protein accession number, peptide sequence, species or even sample type (e.g., brain, kidney, liver). For a given protein, all the previously detected peptides identifying this protein are displayed; however they may not be unique and may contain miscleavages. Using the list of pre-selected unique peptide candidates from PeptideManager will allow to rapidly access the pertinent information from the data provided by those public repositories. Additional empirical and theoretical information (available in PeptideAtlas) may help to select the best candidates. For example, how many times a peptide sequence was detected within a given set of experiments can help to estimate the detectability of this peptide by LC-MS (due to digestion efficacy, good ionization, LC or MS behavior, etc.). Theoretical scores based on various parameters (e.g., amino acid composition and hydrophobicity) and algorithms can give theoretical estimations of the LC-MS behavior and detectability of the peptides (Mallick et al., 2007; Fusaro et al., 2009; Eyers et al., 2011; Qeli et al., 2014). All this information can be used to further rationalize the peptide selection and keep those peptides predicted to demonstrate the best LC-MS behavior and leading to the MS measurements with the highest possible sensitivity. If MS/MS spectra are available, they can be used to select the most appropriate charge state of the peptide to be monitored. In the context of the development of an SRM assay, the fragmentation spectra can be used to select the most intense fragments for a given peptide which should lead to an optimal sensitivity for that peptide when monitoring those transitions by SRM. Reference transitions for a large number of peptides are publicly available in the SRMAtlas repository (Picotti et al., 2008, 2013; Huttenhain et al., 2013). However, those transitions need to be validated (e.g., absence of interferences) within the biological samples of interest.
Conclusions and perspectives
PeptideManager was used for the selection of unique peptide candidates as protein surrogates in the context of a supervised evaluation (SRM assays on a triple-quadrupole platform) of protein biomarker candidates in human GBM xenografts (mice and rats). This tool greatly facilitated and expedited the peptide selection for these mixed samples involving different species proteomes. Moreover, it is applicable to complex samples (with or without the presence of a host (background) proteome) of all species for which a protein database is available.
PeptideManager could be extended to include additional information helping to the peptide selection by directly inserting the information from MS-based proteomics data public repositories (e.g., PeptideAtlas) and the theoretical information about the enzymatic cleavage efficiency (e.g., PeptideCutter) within the peptide databases.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
This work was supported by a grant from FNR (Fonds National de la Recherche) of Luxembourg to Simone P. Niclou (CORE Project ESCAPE). The authors would like to thank Anna Golebiewska, Sébastien Bougnaud and Anaïs Oudin for the images of the mouse brain section.
Supplementary material
The Supplementary Material for this article can be found online at: http://www.frontiersin.org/journal/10.3389/fgene.2014.00305/abstract
References
- Aebersold R., Mann M. (2003). Mass spectrometry-based proteomics. Nature 422, 198–207 10.1038/nature01511 [DOI] [PubMed] [Google Scholar]
- Artimo P., Jonnalagedda M., Arnold K., Baratin D., Csardi G., De Castro E., et al. (2012). ExPASy: SIB bioinformatics resource portal. Nucleic Acids Res. 40, W597–W603 10.1093/nar/gks400 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Asara J. M., Christofk H. R., Freimark L. M., Cantley L. C. (2008). A label-free quantification method by MS/MS TIC compared to SILAC and spectral counting in a proteomics screen. Proteomics 8, 994–999 10.1002/pmic.200700426 [DOI] [PubMed] [Google Scholar]
- Bantscheff M., Schirle M., Sweetman G., Rick J., Kuster B. (2007). Quantitative mass spectrometry in proteomics: a critical review. Anal. Bioanal. Chem. 389, 1017–1031 10.1007/s00216-007-1486-6 [DOI] [PubMed] [Google Scholar]
- Bischoff R., Schluter H. (2012). Amino acids: chemistry, functionality and selected non-enzymatic post-translational modifications. J. Proteomics 75, 2275–2296 10.1016/j.jprot.2012.01.041 [DOI] [PubMed] [Google Scholar]
- Brownridge P., Beynon R. J. (2011). The importance of the digest: proteolysis and absolute quantification in proteomics. Methods 54, 351–360 10.1016/j.ymeth.2011.05.005 [DOI] [PubMed] [Google Scholar]
- Chait B. T. (2006). Chemistry. Mass spectrometry: bottom-up or top-down? Science 314, 65–66 10.1126/science.1133987 [DOI] [PubMed] [Google Scholar]
- Craig R., Cortens J. P., Beavis R. C. (2004). Open source system for analyzing, validating, and storing protein identification data. J. Proteome Res. 3, 1234–1242 10.1021/pr049882h [DOI] [PubMed] [Google Scholar]
- Craig R., Cortens J. P., Beavis R. C. (2005). The use of proteotypic peptide libraries for protein identification. Rapid Commun. Mass Spectrom. 19, 1844–1850 10.1002/rcm.1992 [DOI] [PubMed] [Google Scholar]
- Deracinois B., Flahaut C., Duban-Deweer S., Karamanos Y. (2013). Comparative and quantitative global proteomics approaches: an overview. Proteomes 1, 180–218 10.3390/proteomes1030180 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Desiere F., Deutsch E. W., King N. L., Nesvizhskii A. I., Mallick P., Eng J., et al. (2006). The peptideatlas project. Nucleic Acids Res. 34, D655–D658 10.1093/nar/gkj040 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Domon B., Aebersold R. (2010). Options and considerations when selecting a quantitative proteomics strategy. Nat. Biotechnol. 28, 710–721 10.1038/nbt.1661 [DOI] [PubMed] [Google Scholar]
- Eng J. K., McCormack A. L., Yates J. R. (1994). An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom. 5, 976–989 10.1016/1044-0305(94)80016-2 [DOI] [PubMed] [Google Scholar]
- Eyers C. E., Lawless C., Wedge D. C., Lau K. W., Gaskell S. J., Hubbard S. J. (2011). CONSeQuence: prediction of reference peptides for absolute quantitative proteomics using consensus machine learning approaches. Mol. Cell. Proteomics 10:M110.003384 10.1074/mcp.M110.003384 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fusaro V. A., Mani D. R., Mesirov J. P., Carr S. A. (2009). Prediction of high-responding peptides for targeted protein assays by mass spectrometry. Nat. Biotechnol. 27, 190–198 10.1038/nbt.1524 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gallien S., Duriez E., Domon B. (2011). Selected reaction monitoring applied to proteomics. J. Mass Spectrom. 46, 298–312 10.1002/jms.1895 [DOI] [PubMed] [Google Scholar]
- Gillette M. A., Carr S. A. (2013). Quantitative analysis of peptides and proteins in biomedicine by targeted mass spectrometry. Nat. Methods 10, 28–34 10.1038/nmeth.2309 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Golebiewska A., Bougnaud S., Stieber D., Brons N. H., Vallar L., Hertel F., et al. (2013). Side population in human glioblastoma is non-tumorigenic and characterizes brain endothelial cells. Brain 136, 1462–1475 10.1093/brain/awt025 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huszthy P. C., Daphu I., Niclou S. P., Stieber D., Nigro J. M., Sakariassen P. O., et al. (2012). In vivo models of primary brain tumors: pitfalls and perspectives. Neuro. Oncol. 14, 979–993 10.1093/neuonc/nos135 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huttenhain R., Surinova S., Ossola R., Sun Z., Campbell D., Cerciello F., et al. (2013). N-glycoprotein SRMAtlas: a resource of mass spectrometric assays for N-glycosites enabling consistent and multiplexed protein quantification for clinical applications. Mol. Cell. Proteomics 12, 1005–1016 10.1074/mcp.O112.026617 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kersey P. J., Duarte J., Williams A., Karavidopoulou Y., Birney E., Apweiler R. (2004). The international protein index: an integrated database for proteomics experiments. Proteomics 4, 1985–1988 10.1002/pmic.200300721 [DOI] [PubMed] [Google Scholar]
- Keunen O., Johansson M., Oudin A., Sanzey M., Rahim S. A., Fack F., et al. (2011). Anti-VEGF treatment reduces blood supply and increases tumor cell invasion in glioblastoma. Proc. Natl. Acad. Sci. U.S.A. 108, 3749–3754 10.1073/pnas.1014480108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim Y. J., Gallien S., Van Oostrum J., Domon B. (2013). Targeted proteomics strategy applied to biomarker evaluation. Proteomics Clin. Appl. 7, 739–747 10.1002/prca.201300070 [DOI] [PubMed] [Google Scholar]
- Klink B., Miletic H., Stieber D., Huszthy P. C., Valenzuela J. A., Balss J., et al. (2013). A novel, diffusely infiltrative xenograft model of human anaplastic oligodendroglioma with mutations in FUBP1, CIC, and IDH1. PLoS ONE 8:e59773 10.1371/journal.pone.0059773 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuster B., Schirle M., Mallick P., Aebersold R. (2005). Scoring proteomes with proteotypic peptide probes. Nat. Rev. Mol. Cell Biol. 6, 577–583 10.1038/nrm1683 [DOI] [PubMed] [Google Scholar]
- Lange V., Picotti P., Domon B., Aebersold R. (2008). Selected reaction monitoring for quantitative proteomics: a tutorial. Mol. Syst. Biol. 4:222 10.1038/msb.2008.61 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lawless C., Hubbard S. J. (2012). Prediction of missed proteolytic cleavages for the selection of surrogate peptides for quantitative proteomics. OMICS 16, 449–456 10.1089/omi.2011.0156 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee J. M., Han J. J., Altwerger G., Kohn E. C. (2011). Proteomics and biomarkers in clinical trials for drug development. J. Proteomics 74, 2632–2641 10.1016/j.jprot.2011.04.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li J., Kelm K. B., Tezak Z. (2011). Regulatory perspective on translating proteomic biomarkers to clinical diagnostics. J. Proteomics 74, 2682–2690 10.1016/j.jprot.2011.07.028 [DOI] [PubMed] [Google Scholar]
- Loziuk P. L., Wang J., Li Q., Sederoff R. R., Chiang V. L., Muddiman D. C. (2013). Understanding the role of proteolytic digestion on discovery and targeted proteomic measurements using liquid chromatography tandem mass spectrometry and design of experiments. J. Proteome Res. 12, 5820–5829 10.1021/pr4008442 [DOI] [PubMed] [Google Scholar]
- Maclean B., Tomazela D. M., Shulman N., Chambers M., Finney G. L., Frewen B., et al. (2010). Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968 10.1093/bioinformatics/btq054 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Magrane M., Consortium U. (2011). UniProt Knowledgebase: a hub of integrated protein data. Database (Oxford) 2011:bar009 10.1093/database/bar009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mallick P., Schirle M., Chen S. S., Flory M. R., Lee H., Martin D., et al. (2007). Computational prediction of proteotypic peptides for quantitative proteomics. Nat. Biotechnol. 25, 125–131 10.1038/nbt1275 [DOI] [PubMed] [Google Scholar]
- Marx V. (2013). Targeted proteomics. Nat. Methods 10, 19–22 10.1038/nmeth.2285 [DOI] [PubMed] [Google Scholar]
- Meng Z., Veenstra T. D. (2011). Targeted mass spectrometry approaches for protein biomarker verification. J. Proteomics 74, 2650–2659 10.1016/j.jprot.2011.04.011 [DOI] [PubMed] [Google Scholar]
- Mohammed Y., Domanski D., Jackson A. M., Smith D. S., Deelder A. M., Palmblad M., et al. (2014). PeptidePicker: a scientific workflow with web interface for selecting appropriate peptides for targeted proteomics experiments. J. Proteomics 106C, 151–161 10.1016/j.jprot.2014.04.018 [DOI] [PubMed] [Google Scholar]
- Neilson K. A., Ali N. A., Muralidharan S., Mirzaei M., Mariani M., Assadourian G., et al. (2011). Less label, more free: approaches in label-free quantitative mass spectrometry. Proteomics 11, 535–553 10.1002/pmic.201000553 [DOI] [PubMed] [Google Scholar]
- Nesvizhskii A. I. (2007). Protein identification by tandem mass spectrometry and sequence database searching. Methods Mol. Biol. 367, 87–119 10.1385/1-59745-275-0:87 [DOI] [PubMed] [Google Scholar]
- Niclou S. P., Danzeisen C., Eikesdal H. P., Wiig H., Brons N. H., Poli A. M., et al. (2008). A novel eGFP-expressing immunodeficient mouse model to study tumor-host interactions. FASEB J. 22, 3120–3128 10.1096/fj.08-109611 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pan S., Chen R., Brand R. E., Hawley S., Tamura Y., Gafken P. R., et al. (2012). Multiplex targeted proteomic assay for biomarker detection in plasma: a pancreatic cancer biomarker case study. J. Proteome Res. 11, 1937–1948 10.1021/pr201117w [DOI] [PMC free article] [PubMed] [Google Scholar]
- Percy A. J., Chambers A. G., Yang J., Hardie D. B., Borchers C. H. (2014). Advances in multiplexed MRM-based protein biomarker quantitation toward clinical utility. Biochim. Biophys. Acta 1844, 917–926 10.1016/j.bbapap.2013.06.008 [DOI] [PubMed] [Google Scholar]
- Pesch B., Bruning T., Johnen G., Casjens S., Bonberg N., Taeger D., et al. (2014). Biomarker research with prospective study designs for the early detection of cancer. Biochim. Biophys. Acta 1844, 874–883 10.1016/j.bbapap.2013.12.007 [DOI] [PubMed] [Google Scholar]
- Picotti P., Clement-Ziza M., Lam H., Campbell D. S., Schmidt A., Deutsch E. W., et al. (2013). A complete mass-spectrometric map of the yeast proteome applied to quantitative trait analysis. Nature 494, 266–270 10.1038/nature11835 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Picotti P., Lam H., Campbell D., Deutsch E. W., Mirzaei H., Ranish J., et al. (2008). A database of mass spectrometric assays for the yeast proteome. Nat. Methods 5, 913–914 10.1038/nmeth1108-913 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pruitt K. D., Brown G. R., Hiatt S. M., Thibaud-Nissen F., Astashyn A., Ermolaeva O., et al. (2014). RefSeq: an update on mammalian reference sequences. Nucleic Acids Res. 42, D756–D763 10.1093/nar/gkt1114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pruitt K. D., Tatusova T., Brown G. R., Maglott D. R. (2012). NCBI reference sequences (RefSeq): current status, new features and genome annotation policy. Nucleic Acids Res. 40, D130–D135 10.1093/nar/gkr1079 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pruitt K. D., Tatusova T., Maglott D. R. (2007). NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 35, D61–D65 10.1093/nar/gkl842 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qeli E., Omasits U., Goetze S., Stekhoven D. J., Frey J. E., Basler K., et al. (2014). Improved prediction of peptide detectability for targeted proteomics using a rank-based algorithm and organism-specific data. J. Proteomics 108, 269–283 10.1016/j.jprot.2014.05.011 [DOI] [PubMed] [Google Scholar]
- Rajcevic U., Petersen K., Knol J. C., Loos M., Bougnaud S., Klychnikov O., et al. (2009). iTRAQ-based proteomics profiling reveals increased metabolic activity and cellular cross-talk in angiogenic compared with invasive glioblastoma phenotype. Mol. Cell. Proteomics 8, 2595–2612 10.1074/mcp.M900124-MCP200 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith R. D. (2012). Mass spectrometry in biomarker applications: from untargeted discovery to targeted verification, and implications for platform convergence and clinical application. Clin. Chem. 58, 528–530 10.1373/clinchem.2011.180596 [DOI] [PubMed] [Google Scholar]
- Tang H. Y., Beer L. A., Chang-Wong T., Hammond R., Gimotty P., Coukos G., et al. (2012). A xenograft mouse model coupled with in-depth plasma proteome analysis facilitates identification of novel serum biomarkers for human ovarian cancer. J. Proteome Res. 11, 678–691 10.1021/pr200603h [DOI] [PMC free article] [PubMed] [Google Scholar]
- Uniprot C. (2009). The universal protein resource (UniProt) 2009. Nucleic Acids Res. 37, D169–D174 10.1093/nar/gkn664 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Uniprot C. (2014). Activities at the universal protein resource (UniProt). Nucleic Acids Res. 42, D191–D198 10.1093/nar/gkt1140 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vizcaino J. A., Cote R. G., Csordas A., Dianes J. A., Fabregat A., Foster J. M., et al. (2013). The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 41, D1063–D1069 10.1093/nar/gks1262 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waldemarson S., Krogh M., Alaiya A., Kirik U., Schedvins K., Auer G., et al. (2012). Protein expression changes in ovarian cancer during the transition from benign to malignant. J. Proteome Res. 11, 2876–2889 10.1021/pr201258q [DOI] [PubMed] [Google Scholar]
- Wang J., Miletic H., Sakariassen P. O., Huszthy P. C., Jacobsen H., Brekka N., et al. (2009). A reproducible brain tumour model established from human glioblastoma biopsies. BMC Cancer 9:465 10.1186/1471-2407-9-465 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Whiteaker J. R., Lin C., Kennedy J., Hou L., Trute M., Sokal I., et al. (2011). A targeted proteomics-based pipeline for verification of biomarkers in plasma. Nat. Biotechnol. 29, 625–634 10.1038/nbt.1900 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Whiteaker J. R., Zhang H., Zhao L., Wang P., Kelly-Spratt K. S., Ivey R. G., et al. (2007). Integrated pipeline for mass spectrometry-based discovery and confirmation of biomarkers demonstrated in a mouse model of breast cancer. J. Proteome Res. 6, 3962–3975 10.1021/pr070202v [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.