Skip to main content
Bioinformatics logoLink to Bioinformatics
. 2008 May 3;24(13):1498–1502. doi: 10.1093/bioinformatics/btn216

High-performance hardware implementation of a parallel database search engine for real-time peptide mass fingerprinting

István A Bogdán 1, Jenny Rivers 2, Robert J Beynon 2, Daniel Coca 1,*
PMCID: PMC3133936  PMID: 18453553

Abstract

Motivation: Peptide mass fingerprinting (PMF) is a method for protein identification in which a protein is fragmented by a defined cleavage protocol (usually proteolysis with trypsin), and the masses of these products constitute a ‘fingerprint’ that can be searched against theoretical fingerprints of all known proteins. In the first stage of PMF, the raw mass spectrometric data are processed to generate a peptide mass list. In the second stage this protein fingerprint is used to search a database of known proteins for the best protein match. Although current software solutions can typically deliver a match in a relatively short time, a system that can find a match in real time could change the way in which PMF is deployed and presented. In a paper published earlier we presented a hardware design of a raw mass spectra processor that, when implemented in Field Programmable Gate Array (FPGA) hardware, achieves almost 170-fold speed gain relative to a conventional software implementation running on a dual processor server. In this article we present a complementary hardware realization of a parallel database search engine that, when running on a Xilinx Virtex 2 FPGA at 100 MHz, delivers 1800-fold speed-up compared with an equivalent C software routine, running on a 3.06 GHz Xeon workstation. The inherent scalability of the design means that processing speed can be multiplied by deploying the design on multiple FPGAs. The database search processor and the mass spectra processor, running on a reconfigurable computing platform, provide a complete real-time PMF protein identification solution.

Contact: d.coca@sheffield.ac.uk

REFERENCES

  1. Anish TA, et al. Hardware-accelerated protein identification for mass spectrometry. Rapid Commun. Mass Spectrom. 2005;19:833–837. doi: 10.1002/rcm.1853. [DOI] [PubMed] [Google Scholar]
  2. Bogdan I, et al. Hardware acceleration of processing of mass spectrometric data for proteomics. Bioinform. Gene Express. 2007;23:724–731. doi: 10.1093/bioinformatics/btl656. [DOI] [PubMed] [Google Scholar]
  3. Fagin B, et al. A special-purpose processor for gene sequence analysis. Comput. Appl. BioSci. 1993;9:221–226. doi: 10.1093/bioinformatics/9.2.221. [DOI] [PubMed] [Google Scholar]
  4. Gattiker A, et al. FindPept, a tool to identify unmatched masses in peptide mass fingerprinting protein identification. Proteomix. 2002;2:1435–1444. doi: 10.1002/1615-9861(200210)2:10<1435::AID-PROT1435>3.0.CO;2-9. [DOI] [PubMed] [Google Scholar]
  5. Guccione AS, Keller E. Gene matching using Jbits. Lecture Notes in Computer Science; Proceedings of the Reconfigurable Computing is Going Mainstream, 12th International Conference on Field-Programmable Logic and Applications; London: Springer-Verlag; 2002. pp. 1168–1171. [Google Scholar]
  6. Guerdoux-Jamet P, Lavenier D. SAMBA: hardware accelerator for biological sequence comparison. Comput. Appl. BioSci. 1997;13:609–615. doi: 10.1093/bioinformatics/13.6.609. [DOI] [PubMed] [Google Scholar]
  7. Hughey R. Parallel hardware for sequence comparison and alignment. Comput. Appl. BioSci. 1996;12:473–479. doi: 10.1093/bioinformatics/12.6.473. [DOI] [PubMed] [Google Scholar]
  8. Krogh A, et al. Hidden Markov models in computational biology: applications to protein modeling. J. Mol. Biol. 1994;235:1501–1531. doi: 10.1006/jmbi.1994.1104. [DOI] [PubMed] [Google Scholar]
  9. Lavenier D. Speeding up genome computations with systolic accelerator. SIAM News. 1998;31:1–8. [Google Scholar]
  10. Marongiu A, et al. Designing hardware for protein sequence analysis. Bioinformatics. 2003;19:1739–1740. doi: 10.1093/bioinformatics/btg228. [DOI] [PubMed] [Google Scholar]
  11. Oliver T, et al. Using reconfigurable hardware to accelerate multiple sequence alignment with ClustaIW. Bioinformatics. 2005;21:3431–3432. doi: 10.1093/bioinformatics/bti508. [DOI] [PubMed] [Google Scholar]
  12. Panitz F, et al. SNP mining porcine ESTs with MAVIANT, a novel tool for SNP evaluation and annotation. Bioinformatics. 2007;23:i387–i391. doi: 10.1093/bioinformatics/btm192. [DOI] [PubMed] [Google Scholar]
  13. Pappin DJC, et al. Rapid identification of proteins by peptide-mass fingerprinting. Curr. Biol. 1993;3:327–332. doi: 10.1016/0960-9822(93)90195-t. [DOI] [PubMed] [Google Scholar]
  14. Perkins DN, et al. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis. 1999;20:3551–3567. doi: 10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2. [DOI] [PubMed] [Google Scholar]
  15. Samuelsson J, et al. Modular, scriptable and automated analysis tools for high-throughput peptide mass fingerprinting. Bioinformatics. 2004;20:3628–3635. doi: 10.1093/bioinformatics/bth460. [DOI] [PubMed] [Google Scholar]
  16. Schechter I, Berger A. On the size of the active site in proteases. Biochem. Biophys. Res. Com. 1967;27:157–162. doi: 10.1016/s0006-291x(67)80055-x. [DOI] [PubMed] [Google Scholar]
  17. Simmler H, et al. Real-Time Primer Design for DNA Chips. Intersci. Concurr. Comput. Pract. Exper. 2004;16:855–872. [Google Scholar]
  18. Sun Y, Buhler J. Designing patterns for profile HMM search. Bioinformatics. 2007;23:e36–e43. doi: 10.1093/bioinformatics/btl323. [DOI] [PubMed] [Google Scholar]
  19. Wozniak A. Using video-oriented instructions to speed up sequence comparison. Comput. Appl. BioSci. 1997;13:145–150. doi: 10.1093/bioinformatics/13.2.145. [DOI] [PubMed] [Google Scholar]
  20. Zhang W, Chait BT. ProFound: an expert system for protein identification using mass spectrometric peptide mapping information. Anal. Chem. 2000;72:2482–2489. doi: 10.1021/ac991363o. [DOI] [PubMed] [Google Scholar]

Articles from Bioinformatics are provided here courtesy of Oxford University Press

RESOURCES