Skip to main content
BMC Bioinformatics logoLink to BMC Bioinformatics
. 2010 Apr 29;11:219. doi: 10.1186/1471-2105-11-219

WaveletQuant, an improved quantification software based on wavelet signal threshold de-noising for labeled quantitative proteomic analysis

Reviewed by: Fan Mo 1,#, Qun Mo 2,#, Yuanyuan Chen 1, David R Goodlett 3, Leroy Hood 4, Gilbert S Omenn 5, Song Li 2,, Biaoyang Lin 1,6,7,
PMCID: PMC2878310  PMID: 20429928

Abstract

Background

Quantitative proteomics technologies have been developed to comprehensively identify and quantify proteins in two or more complex samples. Quantitative proteomics based on differential stable isotope labeling is one of the proteomics quantification technologies. Mass spectrometric data generated for peptide quantification are often noisy, and peak detection and definition require various smoothing filters to remove noise in order to achieve accurate peptide quantification. Many traditional smoothing filters, such as the moving average filter, Savitzky-Golay filter and Gaussian filter, have been used to reduce noise in MS peaks. However, limitations of these filtering approaches often result in inaccurate peptide quantification. Here we present the WaveletQuant program, based on wavelet theory, for better or alternative MS-based proteomic quantification.

Results

We developed a novel discrete wavelet transform (DWT) and a 'Spatial Adaptive Algorithm' to remove noise and to identify true peaks. We programmed and compiled WaveletQuant using Visual C++ 2005 Express Edition. We then incorporated the WaveletQuant program in the Trans-Proteomic Pipeline (TPP), a commonly used open source proteomics analysis pipeline.

Conclusions

We showed that WaveletQuant was able to quantify more proteins and to quantify them more accurately than the ASAPRatio, a program that performs quantification in the TPP pipeline, first using known mixed ratios of yeast extracts and then using a data set from ovarian cancer cell lysates. The program and its documentation can be downloaded from our website at http://systemsbiozju.org/data/WaveletQuant.

Background

Quantitative proteomics technologies have been developed to comprehensively identify and quantify proteins in two or more complex samples [1-4]. There are three ways to perform quantitative proteomic analysis: a) the spectral counting method that counts the number of fragment ion spectra for a particular peptide [5]; b) differential stable isotope labeling, in which quantified peptides differ by the mass shifts introduced by the stable isotopes used [6]; and c) label-free quantification that quantifies the precursor ion signal intensities across different LC-MS runs [7-9].

Quantification using the differential stable isotope labeling method is one of the methods for quantification of two or more samples within a single experiment. The technique is based on use of stable isotopes to differentially label proteins or peptides, and on use of mass spectrometry to compare the relative abundance of the proteins in different samples. Over the years, many stable isotope tagging approaches have been developed, which include the ICAT [6], ITRAQ [10], and SILAC [11] approaches. In addition, numerous quantification software were developed, including XPRESS [6], ASAPRatio [12], MSQuant http://msquant.sourceforge.net/, ZoomQuant [13], STEM [14], Multi-Q [15], i-tracker [16], Libra [17], maxQuant [18], muxQuant [19], HTAPP (high-throughput autonomous proteomic pipeline) [20], msInspect [21], the APEX Quantitative Proteomics Tool [22], MASIC [23], and Census [24].

In our quantitative proteomics analysis, we found that errors associated with ratios calculated by the ASAPRatio increased proportionally with the relative abundance ratios of the two isotopic partners. Several factors might have contributed to the increase of relative errors. We found one of the factors to be background noise that was not completely removed by the Savitzky-Golay smooth filtering method.

Wavelets are mathematical functions that divide a given function or a continuous-time signal into different frequency components, and then study each component with a resolution matched to its scale [25,26]. They have advantages over traditional Fourier transforms in analyzing data for which signals have discontinuities and sharp peaks, and in deconstructing and reconstructing signals more accurately [27].

Various programs integrating wavelet transforms have been developed for analyzing various types of proteomics data, such as MALDI, SELDI-TOF and LC/MS. Yang et al. compared five smoothing methods used in peak detection algorithms for MALDI mass spectrometry data analysis [28]. They found that the wavelet smoothing performed best among the five smoothing methods: moving average filter, Savitzky-Golay filter, Gaussian filter, Kaiser window, and wavelet based filters [28]. Du et al. showed that a continuous wavelet transform (CWT)-based peak detection algorithm enhances the effective signal-to-noise ratio in SELDI-TOF spectra; it could identify both strong and weak peaks while keeping false positive rates low [29]. Randolph and Yasui applied a translation-invariant wavelet analysis to perform multiscale decomposition, feature extraction and quantification for MALDI-TOF spectra [30]. Alexandrov et al. developed the MALDIDWT program for analyzing serum protein profiles for biomarker discovery [31]. Lange et al. used wavelet techniques to develop a mass spectrometer-independent peak-picking algorithm as an alternative to vendors' peak-picking software bundled with mass spectrometers [32]. Schulz-Trieglaff et al. developed an algorithm that uses a mother wavelet to mimic the distribution of isotopic peak intensities [33]. The latter two algorithms by Lange et al. and Schulz-Trieglaff et al. were further implemented in OpenMS software [34]. Zhang et al. used an undecimated wavelet transform to remove random noise for prOTOF MS data, which does not require a priori knowledge of protein masses[35]. Using metabolomics data as examples, Tautenhahn et al. developed a new feature detection algorithm centWave for high-resolution LC/MS data sets applying continuous wavelet transformation and optional Gauss-fitting in the chromatographic domain[36].

Wavelet theory has also been applied to MS data to reduce data dimension or to reduce computation time. For example, Hussong et al. implemented a feature finding algorithm based on a hand-tailored adaptive wavelet transform that drastically reduces the computation time in mass spectrometry data analysis [37]. Liu et al. used the wavelet detail coefficients to characterize features and reduce the dimensionality of MS data [38].

In this manuscript, we report development of a new wavelet transform algorithm for improved quantitative proteomics analysis. We demonstrate that our approach has an improved ability to smooth isotopic peaks and remove background noise when compared with approaches using other smoothing methods.

Implementation

Technical details of the development of the WaveletQuant program

The wavelet transform is an excellent tool for signal processing because of its de-noising ability; one can obtain multi-resolution decomposition of signals, while retaining their local characteristic details.

The first step in the wavelet transform method is to choose a proper threshold to de-noise signals. The principle of wavelet-based de-noising is to recognize the noise from the high frequency part of wavelet coefficients. Those coefficients that are less than the threshold are set to zero. Other coefficients are preserved. Then we reconstruct the de-noised signals using the new coefficients. As indicated in the Results presented below, setting the wavelet coefficients of noise to zero while at the same time preserving the wavelet coefficients of signals is critical for a successful wavelet transform. Choosing an optimal threshold is the key to retaining maximal true signals while reducing as much noise as possible.

Given a measured signal x(t) with a Gaussian white noise n(t) can be presented by the following formula:

graphic file with name 1471-2105-11-219-i1.gif (1)

The method is composed of three components: (i) the discrete wavelet transform (DWT) of signal x(t); (ii) setting the threshold for the wavelet coefficients on each scale; and (iii) obtaining de-noised signals by inverse wavelet transform based on the threshold wavelet coefficients. A more detailed description of the wavelet transform process is shown in Additional file 1.

We then adopted the universal threshold T proposed by Donoho and Johnstone [39] to remove Gaussian white noise, described as follows:

graphic file with name 1471-2105-11-219-i2.gif (2)

where N is the length of signal x(t), σ is the noise level, and MAD is the median absolute deviation estimated in the first scale. The factor 0.6745 in the denominator rescales the numerator so that is a suitable estimator for the standard deviation for Gaussian white noise. Significant wavelet coefficients could be derived by setting a threshold rule. There are hard and soft threshold rules, each with its advantages and disadvantages.

graphic file with name 1471-2105-11-219-i3.gif (3)

The hard threshold can preserve local characteristics, but the reconstructed signals are not very smooth. The soft threshold can obtain a smoothed curve, but it always distorts the signal. In this paper, we combined the strengths of the two methods, and developed a new rule:

graphic file with name 1471-2105-11-219-i5.gif (5)

where 0 <λ < 1. When λ is zero, it returns to the hard threshold. When λ is one, it returns to the soft threshold. We set λ = [0.1, 0.4]. The procedure of wavelet-based de-noising is given as:

graphic file with name 1471-2105-11-219-i6.gif (6)

Xu et al. [40] proposed a spatially selective noise filtration technique. They declared that the singularity of a signal should have a large peak value in different scales, while noise should have fading energy with increasing scales. Inspired by their work, we developed a 'Spatial Adaptive Algorithm' to identify true peaks (a more detailed description of the method is presented in Additional file 1).

Assuming the largest scale of decomposition is J, Wf(j, n)denotes DWT coefficient of signal f at position n in scale j. We denoted the correlation of bordered scales as follows:

graphic file with name 1471-2105-11-219-i7.gif (7)

where l represents the scale and j <J - l + 1. As the singularity of signals increases along with increasing scales, bordered points affect each other in detailed scales. We chose l = 2 to compute the correlation:

graphic file with name 1471-2105-11-219-i8.gif (8)

where Corr2(j, n) is denoted as correlation coefficient of the position n in scale j.

To make correlation coefficient and wavelet coefficient more comparable, we defined the correlation coefficient uniformly:

graphic file with name 1471-2105-11-219-i9.gif

And:

graphic file with name 1471-2105-11-219-i10.gif (9)

Then we compared NewCorr2(j, n) with Wf(j, n) to obtain the edges of important signals. In summary, by multiplying wavelet coefficients of bordered scales, we computed a correlation coefficient to suppress the noise and to strengthen the signal. Our algorithm improved the identification of real signals and the orientational precision of the identified signals.

Software implementation

We programmed and compiled WaveletQuant using Visual C++ 2005 Express Edition. A program flow chart is shown in Figure 1. We evaluated peptide abundances by reconstructing a raw single-ion chromatogram over a chromatographic elution period. Then the wavelet algorithms we described in the previous sections were applied to obtain an adjusted chromatogram peak area. Signals inside the peak region were decomposed into four levels and correlation coefficients of bordered points in each scale were calculated. Next, we recursively computed new correlation coefficients and modified them by comparing them with Wf(j, n). According to these new values, a noise threshold was re-calculated. If the wavelet coefficients were less than the noise threshold, they were set to zero. The remaining wavelet coefficients were considered signals. Finally, we reconstructed the de-noised signals and used the de-noised peak areas to calculate peptide abundance. Because the Trans-Proteomic Pipeline (TPP) http://tools.proteomecenter.org/software.php is a commonly used open source proteomics analysis pipeline, we decided to build our program into the TPP. We compiled a new TPP package by replacing the ASAPRatio program with the WaveletQuant program. The package and its documentation can be downloaded from our website at http://www.zcni.zju.edu.cn/en/WaveletQuant_for_Quantitative_Proteomics/waveletquant.html or http://systemsbiozju.org/data/WaveletQuant.

Figure 1.

Figure 1

WaveletQuant program flow chart. Equations 10 and 11 are shown in additional file 1.

Result

Application of the WaveletQuant program to data generated using yeast extracts mixed in known ratios

We compared the performance of our WaveletQuant program with the ASAPRatio program using a dataset generated by mixing different ratios of yeast cell extract grown in heavy vs. light isotopic media. Proteins were mixed in the following ratio 1:2, 1:1.5, 1:1, 1.5:1, 2:1, and then analyzed by LTQ-MS.

We found that the WaveletQuant program was able to perform curve fitting for respective chromatogram pairs and to quantify more accurately the difference of mixed yeast extracts than the original ASAPRatio program. Figures 2 and 3 show several examples where the WaveletQuant program performed better than the ASAPRatio program, which uses the Savitzky-Golay filter for denoising. The WaveletQuant program achieved more accurate quantification. An advantage of our program is the ability to separate a high peak from an overlapping low peak. The ASAPRatio program tends to merge the two peaks into one and quantified it. However, the low peak can be noise or a signal from another peptide that eluted immediately following the peptide being analyzed. The most obvious examples are those shown in Figures 2 and 3. In Figure 2A and 3B, the two peaks in the heavy peptide (bottom panel) were regarded as one peak by ASAPRatio program. However, the WaveletQuant program was able to separate the two peaks. For the peptide shown in Figure 2B, ASAPRatio failed to quantify it, but WaveletQuant did find the correct peaks and was able to quantify it. Figure 3A showed a subtle example where the 2nd peak could often be mistaken as from the first peak. In Figure 3A, for the heavy labeled peptide (bottom half of the two panels), our program was able to separate two overlapped peaks, while the ASAPRatio regarded it as one peak. The ASAPRatio therefore over quantified the heavy labeled peptides. Additional examples were shown in Additional file 2, 3 and 4.

Figure 2.

Figure 2

Comparison of the quantification performance of the WaveletQuant and the ASAPRatio for yeast extracts mixed in 1:1 ratio. A: Comparison of the quantification performance of yeast extracts mixed in 1:1 ratio between the WaveletQuant (right) and the ASAPRatio (left). The MS spectra OR20070625_HS_L-H-1-1_12.03299.03299.3 (+3 charge state) are illustrated. LC-MS chromatograms of the isotopically light and heavy peptide partners are shown. Raw chromatograms are plotted in red, smoothed chromatograms in blue, areas used for calculating abundance ratio of the charge state in green, and backgrounds in cyan. On the top is peptide abundance ratio. On the right are start and end scan numbers, background, elution time of the isotopically light and heavy peptide partners, acceptance, abundance ratio, and weight of the charge states. Users may change scan numbers, background levels, and acceptance of the charge state. B: Comparison of the quantification performance of yeast extracts mixed in 1:1 ratio between the WaveletQuant (right) and the ASAPRatio (left). MS spectra OR20070625_HS_L-H-1-1_12.10969.10969.2 (+2 charge state) are illustrated.

Figure 3.

Figure 3

Comparison of the quantification performance of the WaveletQuant and the ASAPRatio for yeast extracts mixed in 2:1 ratio. A: Comparison of the quantification performance of yeast extracts mixed in 2:1 ratio between the WaveletQuant (right) and the ASAPRatio (left). MS spectra OR20070625_HS_L-H-4-1_15.10440.10440.3 (+2 charge state) are illustrated. B: Comparison of the quantification performance of yeast extracts mixed in 1:2 ratio between the WaveletQuant (right) and the ASAPRatio (left). MS spectra OR20070625_HS_L-H-1-2_10.06981.06981.4 (+3 charge state) are illustrated.

The protein abundance ratios for all proteins obtained by the new algorithm and old algorithm of yeast extract mixtures are summarized in Table 1. The ratios were calculated by averaging all unique peptides' ratios after recursively eliminating outliers by calculating the average number A and +- √A's (square root of A) range (except in the first round, they were eliminated by choosing the median m and +- √m's range). We found that relative errors of the ratios obtained by the WaveletQuant method were much less (from 1 to 27% for different known mixed ratios) than those obtained from the ASAPRatio program (from 20 to 52%) (Table 1), suggesting that wavelet-based signal threshold de-noising is more efficient and more precise than the ASAPRatio program.

Table 1.

Comparison of quantification results between the ASAPRatio and the WaveletQuant programs.

Light/Heavy
ratio
Mixed
ratio
ASAPRatio
ratio
ASAPRatio
Relative error (%)
WaveletQuant
ratio
WaveletQuant
Relative error (%)
L/H 0.5 0.73 47 0.39 21
L/H 0.67 0.85 27 0.64 3
L/H 1 1.51 50 1.09 9
L/H 1.5 1.2 20 1.48 1
L/H 2 0.95 52 2.15 7

Application of the WaveletQuant program to data generated by ICAT for ovarian cancer cell lines

We have previously conducted quantitative proteomics studies comparing cisplatin-resistant ovarian cancer cells with cisplatin-sensitive cancer cells [41]. Using the RAW data of the cytosolic fractions, we compared TPP with ASAPRatio and TPP with WaveletQuant implementation. We found that TPP with the ASAPRatio was able to quantify the protein expression of 226 proteins, while TPP with WaveletQuant quantified 222 proteins, and 204 proteins were quantified by both algorithms. The total number of proteins quantified combining both programs is 245, which is about 10% more than using either program alone. We found that the average standard deviation for the ratios of quantification were 0.57 for TPP with ASARatio and 0.47 for TPP with WaveletQuant. Thus WaveletQuant appears to have better accuracy for quantification than the ASAPRatio.

Discussion

We developed a new software for quantitative proteomics using the wavelet transform. Mass spectrometry data are usually noisy. In order to better quantify mass spectrometry data, smoothing filters, such as the moving average filter, Gaussian filter Butterworth low-pass filter, and Savitzky-Golay filter can be used to reduce the noise in MS peaks. The moving average filter was used in the MZmine program [42]. The Gaussian filter was used by the local maximum search (LMS) program, which was developed for SELDI MS data analysis [43]. The smoothing used in the XPRESS program was performed with the Butterworth low-pass filter http://www.qsl.net/kp4md/butrwrth.htm, for which low-frequency excitation signal components down to and including the current ones are transmitted, while high-frequency components, up to and including infinite ones, are blocked. The ASAPRatio program uses the Savitzky-Golay method [12], which performs a least squares fit of a small set of consecutive data points to a polynomial and then takes the central point of the fitted polynomial curve as the output. The Savitzky-Golay smoothing tends to preserve features of the distribution such as relative maxima, minima, and width; this is its main advantage as these features are often 'flattened' by other smoothing methods (e.g. moving averages). However, as we showed in Results, a disadvantage of the Savitzky-Golay filter is that it smoothes signals by increasing window sizes and lowering filter frequencies; thus, the smoothed shape could create poor representations of true signals and generate inaccurate quantification. We found that wavelet smoothing is better than the Savitzky-Golay filter used with ASAPRatio. Yang et al. similarly found that wavelet smoothing performed better than moving average filter, Savitzky-Golay filter, Gaussian filter, and Kaiser window [28].

In addition, we implemented orthogonal wavelets to decompose signals in our WaveletQuant program. The wavelet transform is different from that used by Lange et al. and Schulz-Trieglaff et al. [32,33]. Many wavelets could be chosen to perform wavelet transform, including Daubechies' orthogonal and bi-orthogonal wavelets, Gaussian wavelets and coiflets [25]. Each wavelet has its own advantage depending on wavelet shapes and wavelet widths. The orthogonal wavelet can keep the energy (i.e. sum of squares of coefficients, usually referred to as "energy" in the signal processing field) of a signal unchanged. We have therefore selected the orthogonal wavelet transform for our MS data analysis.

We implemented two methods. First, by combining the advantage of hard threshold and soft threshold, we developed the wavelet-based signal threshold de-noising algorithm to distinguish signals from noise in MS data. Second, we developed the spatial adaptive algorithm, which not only was effective in removing high frequency noise but also was effective for low frequency de-noising. Combining these two algorithms, our WaveletQuant program performs better than the ASAPRatio program on the datasets from yeast that we tested (Figures 2 and 3). Finally, in a test using high throughput proteomics data generated from cell lysates in an ovarian cancer study, we found that the ratios obtained by our program have lower overall standard deviation than that obtained by the ASAPRatio.

Of note, we also mixed proteins in 1:4 and 4:1 ratios and analyzed them by LTQ-MS. However, due to the limited dynamic range of the routine LC/MS that we performed, the average ratios calculated by both the ASAPRatio and the WaveletQuant programs were far-off from the original mixed ratios, with large standard deviation. This is not surprising as the mixed ratios are outside the dynamic range of a routine LC/MS analysis, which Canterbury et al. estimated to be 0.5 to 2.5 in a systematic analysis [44]. This result also suggests that our WaveletQuant program did not improve the dynamic range of the quantification. Another possibility is that the experiment failed due to unknown reasons. Therefore, we have not included the data in this report.

Finally, we have implemented our wavelet transform algorithm and developed the WaveletQuant program. As the TPP pipeline is widely used in proteomic data analysis, we incorporated WaveletQuant software into the TPP pipeline http://tools.proteomecenter.org/TPP.php. Users can employ the WaveletQuant- implemented TPP pipeline as an alternative to the standard TPP pipeline.

Conclusions

We have developed an improved and/or alternative program for quantitative proteomics analysis, which is implemented in the standard TPP pipeline for the convenience of users.

Availability and requirements

• Project name: WaveletQuant

• Project home page: http://www.zcni.zju.edu.cn/en/WaveletQuant_for_Quantitative_Proteomics/waveletquant.html or http://systemsbiozju.org/data/WaveletQuant.

• Operating systems: Windows 2000, Windows XP or higher

• Programming languages: MSVC++ 7.1 or higher

• Other requirements: None

• License: This is a free software. You can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation.

Abbreviations

LC: Liquid Chromatography; MS: Mass Spectrometry; LTQ: Linear ion Trap Quadrupole; MALDI: Matrix Assisted Laser Desorption/Ionization; SELDI: Surface Enhanced Laser Desorption/Ionization; TOF: Time Of Flight; ICAT: Isotope Coded Affinity Tagging; ITRAQ: Isobaric Tag for Relative and Absolute Quantitation; SILAC: Stable Isotope Labeling with Amino Acids in Cell Culture; DWT: Discrete Wavelet Transform; CWT: Continuous Wavelet Transform; LSM: Local Maximum Search.

Authors' contributions

FM, QM, YC and BL conceived the research. FM and YC implemented the software. DRG, LH, GSO and SL contributed resources and provided guidance for the research. BL, FM, QM and YC wrote and revised the manuscript. All authors read and approved the final manuscript.

Supplementary Material

Additional file 1

A detailed description of the wavelet transform process and the spatial Adaptive Algorithm. A detailed description of the wavelet transform process and the spatial Adaptive Algorithm.

Click here for file (170.5KB, DOC)
Additional file 2

Comparisons of the quantification performance of BSA in a 1:2 ratio. Comparisons of the quantification performance of BSA mixed at 1:2 ratio between our program (Panels B) and ASAPRatio program (Panels A). Panel A and B are sepctra: OR20070625_HS_L-H-1-2_10.06981.06981.4; ions with +3 charge state.

Click here for file (1.5MB, TIFF)
Additional file 3

Comparisons of the quantification performance of BSA in a 1:4 ratio. Comparisons of the quantification performance of BSA mixed at 1:4 ratio between our program (Panels B) and ASAPRatio program (Panels A). Panel A and B are spectra:OR20070625_HS_L-H-1-4_09.06730.06730.3; ions with +2 charge state.

Click here for file (1.5MB, TIFF)
Additional file 4

Comparisons of the quantification performance of BSA in a 4:1 ratio. Comparisons of the quantification performance of BSA mixed at 4:1 ratio between our program (Panels B) and ASAPRatio program (Panels A). Panel A and B are spectra: OR20070625_HS_L-H-4-1.15.10440.10440.3; ions with +2 charge state.

Click here for file (1.5MB, TIFF)

Contributor Information

Fan Mo, Email: mofan.hz@gmail.com.

Qun Mo, Email: moqun@zju.edu.cn.

Yuanyuan Chen, Email: cyy024@hotmail.com.

David R Goodlett, Email: goodlett@u.washington.edu.

Leroy Hood, Email: lhood@systemsbiology.org.

Gilbert S Omenn, Email: gomenn@med.umich.edu.

Song Li, Email: songli@zju.edu.cn.

Biaoyang Lin, Email: bylin@u.washington.edu.

Acknowledgements

We thank Dan Martin and David Shteynberg at the Institute for Systems Biology for sharing with us the data from the control yeast mixture proteomics experiments for testing this software. This work is partly supported by NSF of China (10771190 and 10971189), grants from the Ministry of Science and Technology (2006AA02Z4A2, 2006AA02A303, 2006DFA32950 and 2007DFC30360), the Doctoral Program Foundation of Ministry of Education of China (20070335176), and the grants MEDC GR687 (Michigan Proteomics Alliance for Cancer Research) and U54DA 021519 (NCIBI) from NIH, USA.

References

  1. Aebersold R, Mann M. Mass spectrometry-based proteomics. Nature. 2003;422(6928):198–207. doi: 10.1038/nature01511. [DOI] [PubMed] [Google Scholar]
  2. Ong SE, Mann M. Mass spectrometry-based proteomics turns quantitative. Nat Chem Biol. 2005;1(5):252–262. doi: 10.1038/nchembio736. [DOI] [PubMed] [Google Scholar]
  3. Qian WJ, Jacobs JM, Liu T, Camp DG II, Smith RD. Advances and challenges in liquid chromatography-mass spectrometry-based proteomics profiling for clinical applications. Mol Cell Proteomics. 2006;5(10):1727–1744. doi: 10.1074/mcp.M600162-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Mueller LN, Brusniak MY, Mani DR, Aebersold R. An assessment of software solutions for the analysis of mass spectrometry based quantitative proteomics data. J Proteome Res. 2008;7(1):51–61. doi: 10.1021/pr700758r. [DOI] [PubMed] [Google Scholar]
  5. Liu H, Sadygov RG, Yates JR III. A model for random sampling and estimation of relative protein abundance in shotgun proteomics. Anal Chem. 2004;76(14):4193–4201. doi: 10.1021/ac0498563. [DOI] [PubMed] [Google Scholar]
  6. Han DK, Eng J, Zhou H, Aebersold R. Quantitative profiling of differentiation-induced microsomal proteins using isotope-coded affinity tags and mass spectrometry. Nat Biotechnol. 2001;19(10):946–951. doi: 10.1038/nbt1001-946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Mueller LN, Rinner O, Schmidt A, Letarte S, Bodenmiller B, Brusniak MY, Vitek O, Aebersold R, Muller M. SuperHirn - a novel tool for high resolution LC-MS-based peptide/protein profiling. Proteomics. 2007;7(19):3470–3480. doi: 10.1002/pmic.200700057. [DOI] [PubMed] [Google Scholar]
  8. Hwang D, Zhang N, Lee H, Yi E, Zhang H, Lee IY, Hood L, Aebersold R. MS-BID: a Java package for label-free LC-MS-based comparative proteomic analysis. Bioinformatics. 2008;24(22):2641–2642. doi: 10.1093/bioinformatics/btn491. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Andreev VP, Li L, Cao L, Gu Y, Rejtar T, Wu SL, Karger BL. A new algorithm using cross-assignment for label-free quantitation with LC-LTQ-FT MS. J Proteome Res. 2007;6(6):2186–2194. doi: 10.1021/pr0606880. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Aggarwal K, Choe LH, Lee KH. Shotgun proteomics using the iTRAQ isobaric tags. Brief Funct Genomic Proteomic. 2006;5(2):112–120. doi: 10.1093/bfgp/ell018. [DOI] [PubMed] [Google Scholar]
  11. Ong SE, Foster LJ, Mann M. Mass spectrometric-based approaches in quantitative proteomics. Methods. 2003;29(2):124–130. doi: 10.1016/S1046-2023(02)00303-1. [DOI] [PubMed] [Google Scholar]
  12. Li X, Zhang H, Ranish JA, Aebersold R. Automated Statistical Analysis of Protein Abundance Ratios from Data Generated by Stable-Isotope Dilution and Tandem Mass Spectrometry. ANALYTICAL CHEMISTRY-WASHINGTON DC. 2003;75(23):6648–6657. doi: 10.1021/ac034633i. [DOI] [PubMed] [Google Scholar]
  13. Halligan BD, Slyper RY, Twigger SN, Hicks W, Olivier M, Greene AS. ZoomQuant: an application for the quantitation of stable isotope labeled peptides. J Am Soc Mass Spectrom. 2005;16(3):302–306. doi: 10.1016/j.jasms.2004.11.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Shinkawa T, Taoka M, Yamauchi Y, Ichimura T, Kaji H, Takahashi N, Isobe T. STEM: a software tool for large-scale proteomic data analyses. J Proteome Res. 2005;4(5):1826–1831. doi: 10.1021/pr050167x. [DOI] [PubMed] [Google Scholar]
  15. Lin WT, Hung WN, Yian YH, Wu KP, Han CL, Chen YR, Chen YJ, Sung TY, Hsu WL. Multi-Q: a fully automated tool for multiplexed protein quantitation. J Proteome Res. 2006;5(9):2328–2338. doi: 10.1021/pr060132c. [DOI] [PubMed] [Google Scholar]
  16. Shadforth IP, Dunkley TP, Lilley KS, Bessant C. i-Tracker: for quantitative proteomics using iTRAQ. BMC Genomics. 2005;6:145. doi: 10.1186/1471-2164-6-145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Keller A, Eng J, Zhang N, Li XJ, Aebersold R. A uniform proteomics MS/MS analysis platform utilizing open XML file formats. Mol Syst Biol. 2005;1:2005–0017. doi: 10.1038/msb4100024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Cox J, Mann M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol. 2008;26(12):1367–1372. doi: 10.1038/nbt.1511. [DOI] [PubMed] [Google Scholar]
  19. Palmblad M, Mills DJ, Bindschedler LV. Heat-shock response in Arabidopsis thaliana explored by multiplexed quantitative proteomics using differential metabolic labeling. J Proteome Res. 2008;7(2):780–785. doi: 10.1021/pr0705340. [DOI] [PubMed] [Google Scholar]
  20. Yu K, Sabelli A, DeKeukelaere L, Park R, Sindi S, Gatsonis CA, Salomon A. Integrated platform for manual and high-throughput statistical validation of tandem mass spectra. Proteomics. 2009;9(11):3115–3125. doi: 10.1002/pmic.200800899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. May D, Law W, Fitzgibbon M, Fang Q, McIntosh M. Software platform for rapidly creating computational tools for mass spectrometry-based proteomics. J Proteome Res. 2009;8(6):3212–3217. doi: 10.1021/pr900169w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Braisted JC, Kuntumalla S, Vogel C, Marcotte EM, Rodrigues AR, Wang R, Huang ST, Ferlanti ES, Saeed AI, Fleischmann RD. The APEX Quantitative Proteomics Tool: generating protein quantitation estimates from LC-MS/MS proteomics results. BMC Bioinformatics. 2008;9:529. doi: 10.1186/1471-2105-9-529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Monroe ME, Shaw JL, Daly DS, Adkins JN, Smith RD. MASIC: a software program for fast quantitation and flexible visualization of chromatographic profiles from detected LC-MS(/MS) features. Comput Biol Chem. 2008;32(3):215–217. doi: 10.1016/j.compbiolchem.2008.02.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Park SK, Venable JD, Xu T, Yates JR III. A quantitative analysis software tool for mass spectrometry-based proteomics. Nat Methods. 2008;5(4):319–322. doi: 10.1038/nmeth.1195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Meyer Y. Wavelets: Algorithms and Applications. Philadelphia: Society for Industrial and Applied Mathematics; 1993. [Google Scholar]
  26. Mallat SG. A theory for multiresolution signal decomposition: the waveletrepresentation. Pattern Analysis and Machine Intelligence, IEEE Transactions on. 1989;11(7):674–693. doi: 10.1109/34.192463. [DOI] [Google Scholar]
  27. Crandall R. Projects in Scientific Computation. New York: Springer-Verlag; 1994. [Google Scholar]
  28. Yang C, He Z, Yu W. Comparison of public peak detection algorithms for MALDI mass spectrometry data analysis. BMC Bioinformatics. 2009;10:4. doi: 10.1186/1471-2105-10-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Du P, Kibbe WA, Lin SM. Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching. Bioinformatics. 2006;22(17):2059. doi: 10.1093/bioinformatics/btl355. [DOI] [PubMed] [Google Scholar]
  30. Randolph TW, Yasui Y. Multiscale processing of mass spectrometry data. Biometrics. 2006;62(2):589–597. doi: 10.1111/j.1541-0420.2005.00504.x. [DOI] [PubMed] [Google Scholar]
  31. Alexandrov T, Decker J, Mertens B, Deelder AM, Tollenaar RA, Maass P, Thiele H. Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation. Bioinformatics. 2009;25(5):643–649. doi: 10.1093/bioinformatics/btn662. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Lange E, Gropl C, Reinert K, Kohlbacher O, Hildebrandt A. High-accuracy peak picking of proteomics data using wavelet techniques. Pac Symp Biocomput. 2006;11:243–254. full_text. [PubMed] [Google Scholar]
  33. Schulz-Trieglaff O, Hussong R, Gropl C, Leinenbach A, Hildebrandt A, Huber C, Reinert K. Computational quantification of peptides from LC-MS data. J Comput Biol. 2008;15(7):685–704. doi: 10.1089/cmb.2007.0117. [DOI] [PubMed] [Google Scholar]
  34. Sturm M, Bertsch A, Gropl C, Hildebrandt A, Hussong R, Lange E, Pfeifer N, Schulz-Trieglaff O, Zerck A, Reinert K. OpenMS - an open-source software framework for mass spectrometry. BMC Bioinformatics. 2008;9:163. doi: 10.1186/1471-2105-9-163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Zhang S, DeGraba TJ, Wang H, Hoehn GT, Gonzales DA, Suffredini AF, Ching WK, Ng MK, Zhou X, Wong ST. A novel peak detection approach with chemical noise removal using short-time FFT for prOTOF MS data. Proteomics. 2009;9(15):3833–3842. doi: 10.1002/pmic.200800030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Tautenhahn R, Bottcher C, Neumann S. Highly sensitive feature detection for high resolution LC/MS. BMC Bioinformatics. 2008;9:504. doi: 10.1186/1471-2105-9-504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Hussong R, Gregorius B, Tholey A, Hildebrandt A. Highly accelerated feature detection in proteomics data sets using modern graphics processing units. Bioinformatics. 2009;25(15):1937–1943. doi: 10.1093/bioinformatics/btp294. [DOI] [PubMed] [Google Scholar]
  38. Liu Y. Feature extraction and dimensionality reduction for mass spectrometry data. Comput Biol Med. 2009;39(9):818–823. doi: 10.1016/j.compbiomed.2009.06.012. [DOI] [PubMed] [Google Scholar]
  39. Donoho DL, Johnstone JM. Ideal spatial adaptation by wavelet shrinkage. Biometrika. 1994;81:425–455. doi: 10.1093/biomet/81.3.425. [DOI] [Google Scholar]
  40. Xu Y, Weaver JB, Healy DM, Lu J. Wavelet transform domain filters: a spatially selective noise filtration technique. Image Processing, IEEE Transactions on. 1994;3(6):747–758. doi: 10.1109/83.336245. [DOI] [PubMed] [Google Scholar]
  41. Stewart JJ, White JT, Yan X, Collins S, Drescher CW, Urban ND, Hood L, Lin B. Proteins associated with Cisplatin resistance in ovarian cancer cells identified by quantitative proteomic technology and integrated with mRNA expression levels. Mol Cell Proteomics. 2006;5(3):433–443. doi: 10.1074/mcp.M500140-MCP200. [DOI] [PubMed] [Google Scholar]
  42. Katajamaa M, Miettinen J, Oresic M. MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data. Bioinformatics. 2006;22(5):634–636. doi: 10.1093/bioinformatics/btk039. [DOI] [PubMed] [Google Scholar]
  43. Yasui Y, Pepe M, Thompson ML, Adam BL, Wright GL Jr, Qu Y, Potter JD, Winget M, Thornquist M, Feng Z. A data-analytic strategy for protein biomarker discovery: profiling of high-dimensional proteomic data for cancer detection. Biostatistics. 2003;4(3):449–463. doi: 10.1093/biostatistics/4.3.449. [DOI] [PubMed] [Google Scholar]
  44. Canterbury JD, Yi X, Hoopmann MR, MacCoss MJ. Assessing the dynamic range and peak capacity of nanoflow LC-FAIMS-MS on an ion trap mass spectrometer for proteomics. Anal Chem. 2008;80(18):6888–6897. doi: 10.1021/ac8004988. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Additional file 1

A detailed description of the wavelet transform process and the spatial Adaptive Algorithm. A detailed description of the wavelet transform process and the spatial Adaptive Algorithm.

Click here for file (170.5KB, DOC)
Additional file 2

Comparisons of the quantification performance of BSA in a 1:2 ratio. Comparisons of the quantification performance of BSA mixed at 1:2 ratio between our program (Panels B) and ASAPRatio program (Panels A). Panel A and B are sepctra: OR20070625_HS_L-H-1-2_10.06981.06981.4; ions with +3 charge state.

Click here for file (1.5MB, TIFF)
Additional file 3

Comparisons of the quantification performance of BSA in a 1:4 ratio. Comparisons of the quantification performance of BSA mixed at 1:4 ratio between our program (Panels B) and ASAPRatio program (Panels A). Panel A and B are spectra:OR20070625_HS_L-H-1-4_09.06730.06730.3; ions with +2 charge state.

Click here for file (1.5MB, TIFF)
Additional file 4

Comparisons of the quantification performance of BSA in a 4:1 ratio. Comparisons of the quantification performance of BSA mixed at 4:1 ratio between our program (Panels B) and ASAPRatio program (Panels A). Panel A and B are spectra: OR20070625_HS_L-H-4-1.15.10440.10440.3; ions with +2 charge state.

Click here for file (1.5MB, TIFF)

Articles from BMC Bioinformatics are provided here courtesy of BMC

RESOURCES