Skip to main content
Protein & Cell logoLink to Protein & Cell
. 2012 Jan 9;3(2):140–147. doi: 10.1007/s13238-011-1129-8

Staged-probability strategy of processing shotgun proteomic data to discover more functionally important proteins

Hong Xu 1, Guijun Ma 1, Qingqiao Tan 1, Qiang Zhou 2, Wen Su 3, Rongxiu Li 1,
PMCID: PMC4875408  PMID: 22228504

Abstract

Biologically important proteins related to membrane receptors, signal transduction, regulation, transcription, and translation are usually low in abundance and identified with low probability in mass spectroscopy (MS)-based analyses. Most valuable proteomics information on them were hitherto discarded due to the application of excessively strict data filtering for accurate identification. In this study, we present a staged-probability strategy for assessing proteomic data for potential functionally important protein clues. MS-based protein identifications from the second (L2) and third (L3) layers of the cascade affinity fractionation using the Trans-Proteomic Pipeline software were classified into three probability stages as 1.00–0.95, 0.95–0.50, and 0.50–0.20 according to their distinctive identification correctness rates (i.e. 100%–95%, 95%–50%, and 50%–20%, respectively). We found large data volumes and more functionally important proteins located at the previously unacceptable lower probability stages of 0.95–0.50 and 0.50–0.20 with acceptable correctness rate. More importantly, low probability proteins in L2 were verified to exist in L3. Together with some MS spectrogram examples, comparisons of protein identifications of L2 and L3 demonstrated that the staged-probability strategy could more adequately present both quantity and quality of proteomic information, especially for researches involving biomarker discovery and novel therapeutic target screening.

Keywords: lower minimum probability threshold, probability grade, TPP, low-abundant protein

References

  1. Abassi Y.A., Vuori K. Tyrosine 221 in Crk regulates adhesion-dependent membrane localization of Crk and Rac and activation of Rac signaling. EMBO J. 2002;21:4571–4582. doi: 10.1093/emboj/cdf446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T., the The Gene Ontology Consortium et al. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Carr S., Aebersold R., Baldwin M., Burlingame A., Clauser K., Nesvizhskii A. The Need for Guidelines in Publication of Peptide and Protein Identification Data. Mol Cell Proteomics. 2004;3:531–533. doi: 10.1074/mcp.T400006-MCP200. [DOI] [PubMed] [Google Scholar]
  4. Charych E.I., Yu W., Miralles C.P., Serwanski D.R., Li X., Rubio M., De Blas A.L. The brefeldin A-inhibited GDP/GTP exchange factor 2, a protein involved in vesicular trafficking, interacts with the beta subunits of the GABA receptors. J Neurochem. 2004;90:173–189. doi: 10.1111/j.1471-4159.2004.02481.x. [DOI] [PubMed] [Google Scholar]
  5. Chin L.S., Raynor M.C., Wei X., Chen H.Q., Li L. Hrs interacts with sorting nexin 1 and regulates degradation of epidermal growth factor receptor. J Biol Chem. 2001;276:7069–7078. doi: 10.1074/jbc.M004129200. [DOI] [PubMed] [Google Scholar]
  6. Davis R.J. Signal transduction by the JNK group of MAP kinases. Cell. 2000;103:239–252. doi: 10.1016/S0092-8674(00)00116-1. [DOI] [PubMed] [Google Scholar]
  7. Deutsch E., Deutsch W., Lam H., Aebersold R. Data analysis and bioinformatics tools for tandem mass spectrometry in proteomics. Physiol Genomics. 2008;33:18–25. doi: 10.1152/physiolgenomics.00298.2007. [DOI] [PubMed] [Google Scholar]
  8. Eng J.K., McCormack A.L., Yates Iii J.R. An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J Am Soc Mass Spectrom. 1994;5:976–989. doi: 10.1016/1044-0305(94)80016-2. [DOI] [PubMed] [Google Scholar]
  9. European Institute for Bioinformatics (EBI). (2005). Available at: http://www.ensembl.org.
  10. Gasteiger E., Gattiker A., Hoogland C., Ivanyi I., Appel R.D., Bairoch A. ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003;31:3784–3788. doi: 10.1093/nar/gkg563. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Kersey P.J., Duarte J., Williams A., Karavidopoulou Y., Birney E., Apweiler R. The International Protein Index: an integrated database for proteomics experiments. Proteomics. 2004;4:1985–1988. doi: 10.1002/pmic.200300721. [DOI] [PubMed] [Google Scholar]
  12. Kitayama H., Sugimoto Y., Matsuzaki T., Ikawa Y., Noda M. A ras-related gene with transformation suppressor activity. Cell. 1989;56:77–84. doi: 10.1016/0092-8674(89)90985-9. [DOI] [PubMed] [Google Scholar]
  13. Nesvizhskii A.I., Aebersold R. Analysis, statistical validation and dissemination of large-scale proteomics datasets generated by tandem MS. Drug Discov Today. 2004;9:173–181. doi: 10.1016/S1359-6446(03)02978-7. [DOI] [PubMed] [Google Scholar]
  14. Okuda A., Sakai M., Muramatsu M. The structure of the rat glutathione S-transferase P gene and related pseudogenes. J Biol Chem. 1987;262:3858–3863. [PubMed] [Google Scholar]
  15. Patterson S.D., Aebersold R.H. Data analysis — the Achilles heel of proteomics. Nat Genet. 2003;33:311–323. doi: 10.1038/ng1106. [DOI] [PubMed] [Google Scholar]
  16. Quon K.C., Marczynski G.T., Shapiro L. Cell cycle control by an essential bacterial two-component signal transduction protein. Cell. 1996;84:83–93. doi: 10.1016/S0092-8674(00)80995-2. [DOI] [PubMed] [Google Scholar]
  17. Rabilloud T. Membrane proteins ride shotgun. Nat Biotechnol. 2003;21:508–510. doi: 10.1038/nbt0503-508. [DOI] [PubMed] [Google Scholar]
  18. Slamon D.J., Clark G.M., Wong S.G., Levin W.J., Ullrich A., McGuire W.L. Human breast cancer: correlation of relapse and survival with amplification of the HER-2/neu oncogene. Science. 1987;235:177–182. doi: 10.1126/science.3798106. [DOI] [PubMed] [Google Scholar]
  19. Tan Q., Dong D., Li R. A novel fractionation method prior to MS-based proteomics analysis using cascade biomimetic affinity chromatography. J Chromatogr B Analyt Technol Biomed Life Sci. 2009;877:3799–3805. doi: 10.1016/j.jchromb.2009.09.024. [DOI] [PubMed] [Google Scholar]
  20. Wang T., Arifoglu P., Ronai Z., Tew K. D. Glutathione Stransferase P1-1 (GSTP1-1) Inhibits c-Jun N-terminal Kinase (JNK1) Signaling through Interaction with the C Terminus. J Biol Chem. 2001;276:20999–21003. doi: 10.1074/jbc.M101355200. [DOI] [PubMed] [Google Scholar]
  21. Yu K., Sabelli A., DeKeukelaere L., Park R., Sindi S., Gatsonis C. A., Salomon A. Integrated platform for manual and high-throughput statistical validation of tandem mass spectra. Proteomics. 2009;9:3115–3125. doi: 10.1002/pmic.200800899. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Protein & Cell are provided here courtesy of Oxford University Press

RESOURCES