Abstract
“Early diagnosis and prevention is a key factor in reducing the mortality and morbidity of cancer. However, currently available screening tools lack enough sensitivity for early diagnosis. It is important to develop noninvasive techniques and methods that can screen and identify asymptomatic patients who have cancer. Biomarkers of cancer status can also serve as powerful tools in monitoring the course of cancer, and in determining the efficacy and safety of novel therapies. Thus, discovery of novel specific biomarkers are needed that may provide informative clues for early diagnosis and treatment of cancer. Recently, remarkable progress has been made in the development of new proteomics technology. The progress that has been made in this field is helpful in identifying biomarkers that can be used for early diagnosis of cancer and improving the understanding of the molecular etiological mechanism of cancer. This article describes the current state of the art in this field.”
Keywords: Biomarker, quantitative proteomics, cancer
Early diagnosis and prevention is crucial for reducing mortality and morbidity of cancer. Current clinical examination such as mammography and invasive needle or surgical evaluation for breast cancer; chest X-ray for lung cancer, etc., is often not sufficiently sensitive for early diagnosis. In order to detect cancer in its early stages, it is necessary to identify biomarkers for those asymptomatic patients who may have cancer. Biomarkers could be proteins, metabolites or electrolytes whose differential expression indicates the presence of disease [1]. Biomarkers of protein origin are powerful both in monitoring the development of cancer and in determining the efficacy and safety of drugs. Novel biomarkers are urgently needed for early diagnosis and treatment of cancer. Nowadays, the remarkable progress in proteomics technology has offered unprecedented opportunity for biomarker discovery.
Traditionally, two-dimensional gel electrophoresis (2-DE) is the standard method for comparing protein expression between normality and disease-perturbed state. However, it is later known that 2-DE lacks resolution and sensitivity. Only abundant proteins can be resolved on a 2-D gel. Biomarkers, which are usually low in abundance, are rarely found with 2-DE method. Gel-free, or mass spectrometry (MS)-based, proteomics techniques are emerging as the choice for quantitatively measuring protein levels due to better sensitivity and reproducibility over 2-DE-based methods [2-3]. However, quantitative proteomic profiling of complex biological samples for the purposes of biomarker discovery remains a challenge in proteomics. Multiple innovative profiling techniques have been introduced with the aims of comprehensively identifying and quantifying the proteins. These can be roughly divided into two categories: isotope-labeled and label-free mass spectrometry.
ISOTOPE-LABELED MASS SPECTROMETRY
Isotope-labeling methods have been developed that introduce stable isotope tags to proteins via chemical reactions using isotope-coded affinity tags (ICAT [4-5] and iTRAQ [6]), enzymatic labeling (e.g., using 18O water for trypsin digestion [7]), or via metabolic labeling [8] (SILAC). The pioneering ICAT technology selectively targets peptides containing a specific amino acid (cysteine) with a stable isotope-coded internal reference or standard [9]. The extracted proteins from treatment and control samples are labeled with either light or heavy ICAT reagents by reacting with cysteinyl thiols on the proteins. Peptides containing the labeled and unlabeled ICAT tags are recovered by avidin affinity chromatography and are then analyzed by LC-MS/MS. Differential protein expression is determined by the isotope peak ratio of the peptide. Enrichment of low-abundance proteins can be performed through cell lysate fractionation [10]. ICAT technology has been widely used for protein identification and quantification in mammalian, liver and breast tumor cells [11]. However, disadvantages of ICAT analyses are obvious: it is only applicable to proteins containing cysteine; it can only identify 300-400 proteins, far fewer than 2-DE method; the peptides contain large labels, which makes database searching more difficult, especially for short peptides [10].
Isobaric tag for relative and absolute quantitation (iTRAQ) is another labeling technique first developed by Ross et al. [12], which use special isobaric tags to label proteins extracted from samples for comparison. The proteins from control and treatment samples have the same mass after reacting with the iTRAQ reagents. These peptides give four ion species of different masses upon collision induced dissociation. These ions allow samples to be quantified in MS/MS mode [12]. The amine specificity of the labeling reagents makes most peptides amenable to this labeling strategy with no loss of information. This is especially important for proteins with post-translational modifications, such as phosphorylation and glycosylation. In addition, the multiplexing capacity of these reagents allows for comparison among multiple cellular states. However, as a chemical labeling method, iTRAQ may generate side products during labeling, and cause some loss of analytic sensitivity because chemical strategies involve a derivatization step that might not be complete.
Absolute Quantification (AQUA), uses synthesized isotopically labeled peptides that mimic native peptides as internal standards [13-14]. The method has the potential for high throughput and multiplexed sample analysis [15]. Stable isotope labeling of amino acids in cell culture (SILAC), which takes a similar strategy but utilizes different media containing light- or heavy forms of particular amino acids, has emerged as a popular label-based quantification techniques [16]. It was first developed by Mann et al. [8] based on metabolic incorporation of ‘light’ or ‘heavy’ form of amino acids into the proteins in living cultured cells. Usually, heavily labeled (13C or 15N) arginine or lysine or both are used in culture medium to ensure complete labeling of every trypsinized peptide fragment. In experiments, one cell population is fed with regular amino acids and the other fed with 13C or 15N labeled amino acids. After several rounds of cell division, heavy amino acids will be incorporated into newly synthesized proteins. In the mass spectrometry spectrum, the light and heavy peptides will show up as two distinct peaks separated by the incremental mass of the labeled amino acids. By comparing the signal intensity, relative quantification can be achieved. Because of its simplicity in principle, SILAC is widely used for biomarker discovery [17], cell signaling dynamics [18], identification of posttranslational modification sites [19-20], protein-protein interaction [21-23], and subcellular proteomics [24].
The dynamics of protein turnover is another key feature to the understanding of regulation of protein expression in cells [25-26]. Recently, we developed a general method for determination of protein synthesis rate using labeling of amino acids with deuterium or 15N at low enrichment [27-28]. This method, “modified SILAC (mSILAC)”, can measure protein synthesis rate quantitatively based on analysis of mass isotopmer distribution (MIDA) [28]. Once precursor enrichment is known, protein synthesis is determined from isotopomer distribution. In experiments with 30-50% enriched 15N amino acids, incorporation of 15N amino acids result in sufficient mass shift in the new protein. The 15N enrichment can be estimated from the mass shift by curve fitting and the expected isotopomer distribution of the new peptide can be generated by the concatenation function. Synthesis rate is then calculated by multiple linear regression analysis of the observed peptide spectrum on the expected new and the old (unlabeled) spectra [28]. This method obviates the need for the use of a 100% newly synthesized protein as a reference as in Vogt’s methods [29-30]. The concatenation function provides an ideal 100% labeled spectrum and multiple regression analysis uses all the information from the mass spectrum. Our mathematical algorithm represents a major improvement in the calculation of protein synthesis rate, permitting the use of isotope labeling of protein through the pathways of amino acid metabolism with low cost isotopes [27-28].
For small peptides, another method, multiple reaction monitoring (MRM), attempts to monitor pre-specified ions (and their daughter product ions after MS/MS) as specific signatures for a particular protein instead of aiming to measure everything in a sample. In this method, initial selective scanning for a particular precursor ion in the first MS is followed by scans of a particular transition. Only one of the ions produced during collision induced dissociation is selected based on prior knowledge of precursors and transition state [31]. Due to limitation in mass spectrometer dynamic range and resolution of chromatography separation, MRM is frequently combined with immunodepletion, size exclusion chromatography [32], or enrichment of stable isotope standards with capture by anti-peptide antibody (SISCAPA, [33]). These hybrid methods are particularly attractive because they can measure precisely the quantity of low abundant proteins, which are otherwise bypassed in other conventional methods. MRM is comparable to ELISA with regard to sensitivity but obviate the necessity of antibody, which is costly to obtain and sometimes unavailable. MRM is usually more accurate than ELISA because it does not have propensity of cross reaction as antibody has.
LABEL-FREE MASS SPECTROMETRY
With the advances of new instrumentation, computing power, and advanced bioinformatics, generic label-free LC-MS shotgun screening methods such as Multidimensional Protein Identification Technology (MudPIT) have been alternatives for relative and absolute protein quantitation in biological samples. Without requiring modification of the analytes, the label-free approaches were based on the observation that MS intensity is linearly proportional to ion concentration even in complex samples like serum [34]. However, it was later observed that not all peptides are equally detectable because of competition between ions, dynamic range limitation, and sensitivity of MS instrumentation [35]. A new technique called spectral counting circumvented these pitfalls by not looking at differences in ion peak areas or intensities, as used in isotope-based methods [35]. Nevertheless, the main bottleneck for label-free LC-MS still lies in the computer softwares and sample preparation. Although many software tools are available, there is huge space for improvement, such as user interface and calculation efficiency of workflow. Moreover, the accuracy of label free quantitation depends largely on experimental set-up like protein extraction and sample stability, which is illustrated by the fact that the quantitative reproducibility of technical replicates is much better than that of experimental replicates [36-37]. With improvements in the efficiency of data analysis workflow, label-free mass spectrometry will be potentially widely used for biomarker discovery and validation.
CONCLUSION
The recent development of mass spectrometry based strategies for absolute protein quantification offers great opportunity for biomarker discovery. While the prospects of this technology are exciting and promising, the current methodology is far from perfect. First, most current methods require complicated sample preparation, such as immunosubtraction, multidimensional LC separation, immunoaffinity and solid phase extraction, in order to enhance the analytical dynamic range and detection sensitivity. To establish high throughput pipeline, we should have ideally one-step preparation. Secondly, useful and validated biomarkers are still rare based on these developed methods because low abundance biomarkers are always immersed in large quantities of routine proteins especially in plasma samples. There is huge space for improvement of sensitivity.
ACKNOWLEDGMENT
This work is jointly funded by the Bone Biology Program of the Cancer and Smoking Related Disease Research Program and the Nebraska Tobacco Settlement Biomedical Research Program (289104-845610 to GX), and partially supported by a grant awarded to WNPL from the UCLA Center of Excellence in Pancreatic disease (P01 AT003960-01) and Harbor-UCLA GCRC Mass Spectrometry Core (M01 RR00425-33).
References
- 1.Rifai N, Gillette MA, Carr SA. Protein biomarker discovery and validation: the long and uncertain path to clinical utility. Nat. Biotechnol. 2006;24:971–983. doi: 10.1038/nbt1235. [DOI] [PubMed] [Google Scholar]
- 2.Aebersold R, Mann M. Mass spectrometry-based proteomics. Nature. 2003;422:198–207. doi: 10.1038/nature01511. [DOI] [PubMed] [Google Scholar]
- 3.Xiao GG, Recker RR, Deng HW. Recent advances in proteomics and cancer biomarker discovery. Clin. Med. Oncol. 2008;2:1–10. doi: 10.4137/cmo.s539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gygi SP, Rist B, Gerber SA, et al. Quantitative analysis of complex protein mixtures using isotope-coded affinity tags. Nat. Biotechnol. 1999;17:994–999. doi: 10.1038/13690. [DOI] [PubMed] [Google Scholar]
- 5.Haqqani AS, Nesic M, Preston E, et al. Characterization of vascular protein expression patterns in cerebral ischemia/reperfusion using laser capture microdissection and ICAT-nanoLC-MS/MS. FASEB J. 2005;19:1809–1821. doi: 10.1096/fj.05-3793com. [DOI] [PubMed] [Google Scholar]
- 6.Shadforth IP, Dunkley TP, Lilley KS, et al. i-Tracker: for quantitative proteomics using iTRAQ. BMC Genomics. 2005;6:145. doi: 10.1186/1471-2164-6-145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Heller M, Mattou H, Menzel C, et al. Trypsin catalyzed 16O-to-18O exchange for comparative proteomics: tandem mass spectrometry comparison using MALDI-TOF, ESI-QTOF, and ESI-ion trap mass spectrometers. J. Am. Soc. Mass Spectrom. 2003;14:704–718. doi: 10.1016/S1044-0305(03)00207-1. [DOI] [PubMed] [Google Scholar]
- 8.Ong SE, Blagoev B, Kratchmarova I, et al. Stable isotope labeling by amino acids in cell culture, SILAC, as a simple and accurate approach to expression proteomics. Mol. Cell Proteomics. 2002;1:376–386. doi: 10.1074/mcp.m200025-mcp200. [DOI] [PubMed] [Google Scholar]
- 9.Shiio Y, Aebersold R. Quantitative proteome analysis using isotope-coded affinity tags and mass spectrometry. Nat. Protoc. 2006;1:139–145. doi: 10.1038/nprot.2006.22. [DOI] [PubMed] [Google Scholar]
- 10.Graves PR, Haystead TA. Molecular biologist’s guide to proteomics. Microbiol. Mol. Biol. Rev. 2002;66:39–63. doi: 10.1128/MMBR.66.1.39-63.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Somiari RI, Somiari S, Russell S, et al. Proteomics of breast carcinoma. J. Chromatogr. B Analyt. Technol. Biomed. Life Sci. 2005;815:215–225. doi: 10.1016/j.jchromb.2004.11.012. [DOI] [PubMed] [Google Scholar]
- 12.Ross PL, Huang YN, Marchese JN, et al. Multiplexed protein quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents. Mol. Cell Proteomics. 2004;3:1154–1169. doi: 10.1074/mcp.M400129-MCP200. [DOI] [PubMed] [Google Scholar]
- 13.Gerber SA, Kettenbach AN, Rush J, et al. The absolute quantification strategy: application to phosphorylation profiling of human separase serine 1126. Methods Mol. Biol. 2007;359:71–86. doi: 10.1007/978-1-59745-255-7_5. [DOI] [PubMed] [Google Scholar]
- 14.Kirkpatrick DS, Gerber SA, Gygi SP. The absolute quantification strategy: a general procedure for the quantification of proteins and post-translational modifications. Methods. 2005;35:265–273. doi: 10.1016/j.ymeth.2004.08.018. [DOI] [PubMed] [Google Scholar]
- 15.Aggarwal K, Choe LH, Lee KH. Shotgun proteomics using the iTRAQ isobaric tags. Brief Funct. Genomic Proteomic. 2006;5:112–120. doi: 10.1093/bfgp/ell018. [DOI] [PubMed] [Google Scholar]
- 16.Mann M. Functional and quantitative proteomics using SILAC. Nat. Rev. Mol. Cell Biol. 2006;7:952–958. doi: 10.1038/nrm2067. [DOI] [PubMed] [Google Scholar]
- 17.Gronborg M, Kristiansen TZ, Iwahori A, et al. Biomarker discovery from pancreatic cancer secretome using a differential proteomic approach. Mol. Cell Proteomics. 2006;5:157–171. doi: 10.1074/mcp.M500178-MCP200. [DOI] [PubMed] [Google Scholar]
- 18.Guerrera IC, Keep NH, Godovac-Zimmermann J. Proteomics study reveals cross-talk between Rho guanidine nucleotide dissociation inhibitor 1 post-translational modifications in epidermal growth factor stimulated fibroblasts. J. Proteome Res. 2007;6:2623–2630. doi: 10.1021/pr070078f. [DOI] [PubMed] [Google Scholar]
- 19.Liang X, Fonnum G, Hajivandi M, et al. Quantitative comparison of IMAC and TiO2 surfaces used in the study of regulated, dynamic protein phosphorylation. J. Am. Soc. Mass Spectrom. 2007;18:1932–1944. doi: 10.1016/j.jasms.2007.08.001. [DOI] [PubMed] [Google Scholar]
- 20.Soufi B, Jers C, Hansen ME, et al. Insights from site-specific phosphoproteomics in bacteria. Biochim. Biophys. Acta. 2008;1784:186–192. doi: 10.1016/j.bbapap.2007.07.018. [DOI] [PubMed] [Google Scholar]
- 21.Selbach M, Mann M. Protein interaction screening by quantitative immunoprecipitation combined with knockdown (QUICK) Nat. Methods. 2006;3:981–983. doi: 10.1038/nmeth972. [DOI] [PubMed] [Google Scholar]
- 22.Bose R, Molina H, Patterson AS, et al. Phosphoproteomic analysis of Her2/neu signaling and inhibition. Proc. Natl. Acad. Sci. U S A. 2006;103:9773–9778. doi: 10.1073/pnas.0603948103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Foster LJ, Rudich A, Talior I, et al. Insulin-dependent interactions of proteins with GLUT4 revealed through stable isotope labeling by amino acids in cell culture (SILAC) J. Proteome Res. 2006;5:64–75. doi: 10.1021/pr0502626. [DOI] [PubMed] [Google Scholar]
- 24.Waanders LF, Hanke S, Mann M. Top-down quantitation and characterization of SILAC-labeled proteins. J. Am. Soc. Mass Spectrom. 2007;18:2058–2064. doi: 10.1016/j.jasms.2007.09.001. [DOI] [PubMed] [Google Scholar]
- 25.Doherty MK, Whitehead C, McCormack H, et al. Proteome dynamics in complex organisms: using stable isotopes to monitor individual protein turnover rates. Proteomics. 2005;5:522–533. doi: 10.1002/pmic.200400959. [DOI] [PubMed] [Google Scholar]
- 26.Pratt JM, Petty J, Riba-Garcia I, et al. Dynamics of protein turnover, a missing dimension in proteomics. Mol. Cell Proteomics. 2002;1:579–591. doi: 10.1074/mcp.m200046-mcp200. [DOI] [PubMed] [Google Scholar]
- 27.Xiao GG, Garg M, Lim S, et al. Determination of Protein Synthesis in vivo Using Labeling from Deuterated Water and Analysis of MALDI-TOF Spectrum. J. Appl. Physiol. 2008;104:828–836. doi: 10.1152/japplphysiol.00976.2007. [DOI] [PubMed] [Google Scholar]
- 28.Zhao Y, Lee WNP, Lim S, et al. Quantitative Proteomics: Measuring Protein Synthesis Using 15N Amino Acids Labeling in Pancreas Cancer Cells. Anal. Chem. 2009;81:764–771. doi: 10.1021/ac801905g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Vogt JA, Schroer K, Holzer K, et al. Protein abundance quantification in embryonic stem cells using incomplete metabolic labelling with 15N amino acids, matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry, and analysis of relative isotopologue abundances of peptides. Rapid Commun. Mass Spectrom. 2003;17:1273–1282. doi: 10.1002/rcm.1045. [DOI] [PubMed] [Google Scholar]
- 30.Vogt JA, Hunzinger C, Schroer K, et al. Determination of fractional synthesis rates of mouse hepatic proteins via metabolic 13C-labeling, MALDI-TOF MS and analysis of relative isotopologue abundances using average masses. Anal. Chem. 2005;77:2034–2042. doi: 10.1021/ac048722m. [DOI] [PubMed] [Google Scholar]
- 31.Ong SE, Mann M. Mass spectrometry-based proteomics turns quantitative. Nat. Chem. Biol. 2005;1:252–262. doi: 10.1038/nchembio736. [DOI] [PubMed] [Google Scholar]
- 32.Liao H, Wu J, Kuhn E, et al. Use of mass spectrometry to identify protein biomarkers of disease severity in the synovial fluid and serum of patients with rheumatoid arthritis. Arthritis Rheum. 2004;50:3792–3803. doi: 10.1002/art.20720. 2004. [DOI] [PubMed] [Google Scholar]
- 33.Anderson NL, Anderson NG, Haines LR, et al. Mass spectrometric quantitation of peptides and proteins using stable isotope standards and capture by anti-peptide antibodies (SISCAPA) J. Proteome Res. 2004;3:235–244. doi: 10.1021/pr034086h. [DOI] [PubMed] [Google Scholar]
- 34.Chelius D, Bondarenko PV. Quantitative profiling of proteins in complex mixtures using liquid chromatography and mass spectrometry. J. Proteome Res. 2002;1:317–323. doi: 10.1021/pr025517j. [DOI] [PubMed] [Google Scholar]
- 35.Zhang B, VerBerkmoes NC, Langston MA, et al. Detecting differential and correlated protein expression in label-free shotgun proteomics. J. Proteome Res. 2006;5:2909–2918. doi: 10.1021/pr0600273. [DOI] [PubMed] [Google Scholar]
- 36.Wang W, Zhou H, Lin H, et al. Quantification of proteins and metabolites by mass spectrometry without isotopic labeling or spiked standards. Anal. Chem. 2003;75:4818–4826. doi: 10.1021/ac026468x. [DOI] [PubMed] [Google Scholar]
- 37.Jaffe JD, Mani DR, Leptos KC, et al. PEPPeR, a platform for experimental proteomic pattern recognition. Mol. Cell Proteomics. 2006;5:1927–1941. doi: 10.1074/mcp.M600222-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]