Open source system for analyzing, validating, and storing protein identification data.

This paper describes an open-source system for analyzing, storing, and validating proteomics information derived from tandem mass spectrometry. It is based on a combination of data analysis servers, a user interface, and a relational database. The database was designed to store the minimum amount of information necessary to search and retrieve data obtained from the publicly available data analysis servers. Collectively, this system was referred to as the Global Proteome Machine (GPM). The components of the system have been made available as open source development projects. A publicly available system has been established, comprised of a group of data analysis servers and one main database server.

[1]  John R Yates,et al.  Influence of basic residue content on fragment ion peak intensities in low-energy collision-induced dissociation spectra of peptides. , 2004, Analytical chemistry.

[2]  B. Chait,et al.  Protein indentification using mass spectrometric information , 1998, Electrophoresis.

[3]  V. Wysocki,et al.  Selective gas-phase cleavage at the peptide bond C-terminal to aspartic acid in fixed-charge derivatives of Asp-containing peptides. , 2000, Analytical chemistry.

[4]  Chris L. Tang,et al.  Efficiency of database search for identification of mutated and modified proteins via mass spectrometry. , 2001, Genome research.

[5]  M. Mann,et al.  Use of mass spectrometry-derived data to annotate nucleotide and protein sequence databases. , 2001, Trends in biochemical sciences.

[6]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[7]  David Fenyö,et al.  RADARS, a bioinformatics solution that automates proteome mass spectral analysis, optimises protein identification, and archives data in a relational database , 2002, Proteomics.

[8]  M. Mann,et al.  What does it mean to identify a protein in proteomics? , 2002, Trends in biochemical sciences.

[9]  Robert J. Beynon Enabling Proteomics: The Need for an Extendable ‘Workbench’ for User-Configurable Solutions , 2004, Comparative and functional genomics.

[10]  Rolf Apweiler,et al.  Progress in Establishing Common Standards for Exchanging Proteomics Data: The Second Meeting of the HUPO Proteomics Standards Initiative , 2003, Comparative and functional genomics.

[11]  V. Wysocki,et al.  Mobile and localized protons: a framework for understanding peptide dissociation. , 2000, Journal of mass spectrometry : JMS.

[12]  R. Beavis,et al.  A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes. , 2003, Analytical chemistry.

[13]  Larry Wall,et al.  Programming Perl , 1991 .

[14]  Steven P Gygi,et al.  SILVER helps assign peptides to tandem mass spectra using intensity-based scoring , 2004, Journal of the American Society for Mass Spectrometry.

[15]  Chris F. Taylor,et al.  A systematic approach to modeling, capturing, and disseminating proteomics experimental data , 2003, Nature Biotechnology.

[16]  David Fenyö,et al.  The Biopolymer Markup Language , 1999, Bioinform..

[17]  R. Aebersold,et al.  Mass spectrometry in proteomics. , 2001, Chemical reviews.

[18]  Simon C. Potter,et al.  An overview of Ensembl. , 2004, Genome research.

[19]  Lars Malmström,et al.  Proteome annotations and identifications of the human pulmonary fibroblast. , 2004, Journal of proteome research.

[20]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology , 2003, Nucleic Acids Res..

[21]  Ruedi Aebersold,et al.  Constellations in a cellular universe , 2003, Nature.

[22]  R. Beavis,et al.  A method for reducing the time required to match protein sequences with tandem mass spectra. , 2003, Rapid communications in mass spectrometry : RCM.

[23]  Rolf Apweiler,et al.  Common interchange standards for proteomics data: Public availability of tools and schema. Report on the Proteomic Standards Initiative Workshop, 2nd Annual HUPO Congress, Montreal, Canada, 8–11th October 2003 , 2004, Proteomics.