Information management for proteomics: a perspective

Proteomics is a data-rich discipline that makes extensive use of separation tools, mass spectrometry and bioinformatics to analyze and interpret the features and dynamics of the proteome. A major challenge for the field is how proteomics data can be stored and managed, such that data become permanent and can be mined with current and future tools. This article details our experience in the development of a commercial proteomic information management system. We identify the challenges faced in data acquisition, workflow management, data permanence, security, data interpretation and analysis, as well as the solutions implemented to address these issues. We finally provide a perspective on data management in proteomics and the implications for academic and industry-based researchers working in this field.

[1]  Sean Martin,et al.  Globally distributed object identification for biological knowledgebases , 2004, Briefings Bioinform..

[2]  O John Semmes,et al.  Analysis of Human Proteome Organization Plasma Proteome Project (HUPO PPP) reference specimens using surface enhanced laser desorption/ionization‐time of flight (SELDI‐TOF) mass spectrometry: Multi‐institution correlation of spectra and identification of biomarkers , 2005, Proteomics.

[3]  J. Yates,et al.  Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. , 1995, Analytical chemistry.

[4]  Richard M Caprioli,et al.  MALDI mass spectrometry for direct tissue analysis: a new tool for biomarker discovery. , 2005, Journal of proteome research.

[5]  M. Wilkins,et al.  Optimal replication and the importance of experimental design for gel-based quantitative proteomics. , 2005, Journal of proteome research.

[6]  Giulio Superti-Furga,et al.  Protein complexes and proteome organization from yeast to man. , 2003, Current opinion in chemical biology.

[7]  Fuchu He,et al.  Comparison of alternative analytical techniques for the characterisation of the human serum proteome in HUPO Plasma Proteome Project , 2005, Proteomics.

[8]  Ying Wang,et al.  The human plasma proteome: Analysis of Chinese serum using shotgun strategy , 2005, Proteomics.

[9]  Lennart Martens,et al.  6th HUPO Annual World Congress – Proteomics Standards Initiative Workshop 6–10 October 2007, Seoul, Korea , 2008, Proteomics.

[10]  Edmond J. Breen,et al.  Automated Peak Harvesting of MALDI-MS spectra for high throughput proteomics , 2003 .

[11]  W. Dunn,et al.  Measuring the metabolome: current analytical technologies. , 2005, The Analyst.

[12]  Geoffrey J. Barton,et al.  TarO: a target optimisation system for structural biology , 2008, Nucleic Acids Res..

[13]  E. Petricoin,et al.  Proteomic approaches in cancer risk and response assessment. , 2004, Trends in molecular medicine.

[14]  Edmond J. Breen,et al.  Automatic Poisson peak harvesting for high throughput protein identification , 2000, Electrophoresis.

[15]  Matthew E Monroe,et al.  A proteomic study of the HUPO Plasma Proteome Project's pilot samples using an accurate mass and time tag strategy , 2005, Proteomics.

[16]  Lennart Martens,et al.  The minimum information about a proteomics experiment (MIAPE) , 2007, Nature Biotechnology.

[17]  Gilbert S. Omenn,et al.  Advancement of Biomarker Discovery and Validation through the HUPO Plasma Proteome Project , 2004, Disease markers.

[18]  Johann Joets,et al.  The PROTICdb database for 2-DE proteomics. , 2007, Methods in molecular biology.

[19]  J. Zaia Mass spectrometry of oligosaccharides. , 2004, Mass spectrometry reviews.

[20]  Jonas S. Almeida,et al.  AGML Central: web based gel proteomic infrastructure , 2005, Bioinform..

[21]  Chris F. Taylor,et al.  Current status of proteomic standards development , 2004, Expert review of proteomics.

[22]  M. Mann,et al.  Proteomics to study genes and genomes , 2000, Nature.

[23]  Gilbert S Omenn,et al.  Immunoassay and antibody microarray analysis of the HUPO Plasma Proteome Project reference specimens: Systematic variation between sample types and calibration of mass spectrometry data , 2005, Proteomics.

[24]  Ela Hunt,et al.  An object model and database for functional genomics , 2004, Bioinform..