Hardware acceleration of processing of mass spectrometric data for proteomics

MOTIVATION High-resolution mass spectrometers generate large data files that are complex, noisy and require extensive processing to extract the optimal data from raw spectra. This processing is readily achieved in software and is often embedded in manufacturers' instrument control and data processing environments. However, the speed of this data processing is such that it is usually performed off-line, post data acquisition. We have been exploring strategies that would allow real-time advanced processing of mass spectrometric data, making use of the reconfigurable computing paradigm, which exploits the flexibility and versatility of Field Programmable Gate Arrays (FPGAs). This approach has emerged as a powerful solution for speeding up time-critical algorithms. We describe here a reconfigurable computing solution for processing raw mass spectrometric data generated by MALDI-ToF instruments. The hardware-implemented algorithms for de-noising, baseline correction, peak identification and deisotoping, running on a Xilinx Virtex 2 FPGA at 180 MHz, generate a mass fingerprint over 100 times faster than an equivalent algorithm written in C, running on a Dual 3 GHz Xeon workstation.

[1]  Fredrik Levander,et al.  Modular, scriptable and automated analysis tools for high-throughput peptide mass fingerprinting , 2004, Bioinform..

[2]  Reinhard Männer,et al.  Real-time primer design for DNA chips , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[3]  Bertil Schmidt,et al.  Using reconfigurable hardware to accelerate multiple sequence alignment with ClustalW , 2005, Bioinform..

[4]  Dominique Lavenier Speeding up genome computations with a systolicacceleratorDominique , 1998 .

[5]  R. Beynon,et al.  Multiplexed absolute quantification for proteomics using concatenated signature peptides encoded by QconCAT genes , 2006, Nature Protocols.

[6]  Steven A. Guccione,et al.  Gene Matching Using JBits , 2002, FPL.

[7]  R. Beynon,et al.  Multiplexed absolute quantification in proteomics using artificial QCAT proteins of concatenated signature peptides , 2005, Nature Methods.

[8]  Michel Dumontier,et al.  Hardware-accelerated protein identification for mass spectrometry. , 2005, Rapid communications in mass spectrometry : RCM.

[9]  Richard Hughey,et al.  Parallel hardware for sequence comparison and alignment , 1996, Comput. Appl. Biosci..

[10]  Edmond J. Breen,et al.  Automatic Poisson peak harvesting for high throughput protein identification , 2000, Electrophoresis.

[11]  Joshua D. Knowles,et al.  Closed-loop, multiobjective optimization of analytical instrumentation: gas chromatography/time-of-flight mass spectrometry of the metabolomes of human serum and of yeast fermentations. , 2005, Analytical chemistry.

[12]  Barry S. Fagin,et al.  A special-purpose processor for gene sequence analysis , 1993, Comput. Appl. Biosci..

[13]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[14]  Andrzej Wozniak,et al.  Using video-oriented instructions to speed up sequence comparison , 1997, Comput. Appl. Biosci..

[15]  Vittorio Rosato,et al.  Designing hardware for protein sequence analysis , 2003, Bioinform..

[16]  Dominique Lavenier,et al.  SAMBA: hardware accelerator for biological sequence comparison , 1997, Comput. Appl. Biosci..