A system for integrative and post-planned analysis of 2-DE/MS centered proteomics data

Summary Proteomic analysis is intrinsically an iterative, incremental process. Information is usually acquired gradually by researchers, and in different projects. At the same time, there are relatively few examples of biological data management systems which take into account this reality, most of them usually treat the experiment generated data as static and unchangeable: data are never reconsidered, or seldom, whereas technology becomes more powerful or that other researchers have brought information on data correction. And yet, post-planned analysis [21] which involves multiple iterations and subsequent re-investigations of previously prepared data might bring tremendous benefits. Named PARIS (Proteomic Analysis and Resources Indexation System), the system we developed here seeks to address this requirement. Compliant with the majority of 2-DE analysis and MALDI-TOF based protein identification softwares, it automatically takes data from them and stores the raw and processed data in a relational database suitable for advanced exploration. Taking into account the standards proposed by PSI (Proteomics Standard Initiative), the system exports the stored data in XML format for data exchange and knowledge sharing. PARIS also manages information about experiments and their biological contexts, and allows the user to search and analyze a large data collection in a global manner. It provides tools for data integration and advanced, cross multi-experiment, multi-experimenter data exploration, and supports visual verification and correction of the analysis results. Implemented in Java, the system is platform independent, accessible to multiple users through Internet. It is also scalable for use for one or many laboratories, and therefore suitable to inter-institute collaborative work. PARIS can be tested and downloaded at http://genome.jouy.inra.fr/paris

[1]  N. Anderson,et al.  Proteomics: applications in basic and applied biology. , 2000, Current opinion in biotechnology.

[2]  David K Han,et al.  PROTEOME-3D: An Interactive Bioinformatics Tool for Large-Scale Data Exploration and Knowledge Discovery* , 2003, Molecular & Cellular Proteomics.

[3]  P. Kemmeren,et al.  Protein interaction verification and functional annotation by integrated analysis of genome-scale data. , 2002, Molecular cell.

[4]  Jonas S. Almeida,et al.  An XML standard for the dissemination of annotated 2D gel electrophoresis data complemented with mass spectrometry results , 2004, BMC Bioinformatics.

[5]  Charles DeLisi,et al.  Predictome: a database of putative functional links between proteins , 2002, Nucleic Acids Res..

[6]  Mark Gerstein,et al.  SPINE 2: a system for collaborative structural proteomics within a federated database framework. , 2003, Nucleic acids research.

[7]  Andrew Jones,et al.  Proposal for a Standard Representation of Two-Dimensional Gel Electrophoresis Data , 2003, Comparative and functional genomics.

[8]  Rong Wang,et al.  The need for a public proteomics repository , 2004, Nature Biotechnology.

[9]  A. Trubuil,et al.  Proteomic Signature of Lactococcus lactis NCDO763 Cultivated in Milk , 2005, Applied and Environmental Microbiology.

[10]  Robert Gentleman,et al.  A graph-theoretic approach to testing associations between disparate sources of functional genomics data , 2004, Bioinform..

[11]  Olivier Golaz,et al.  Federated two‐dimensional electrophoresis database: A simple means of publishing two‐dimensional electrophoresis data , 1996, Electrophoresis.

[12]  Chris F. Taylor,et al.  A systematic approach to modeling, capturing, and disseminating proteomics experimental data , 2003, Nature Biotechnology.

[13]  Stanimire Tomov,et al.  Clinical Proteomics and Biomarker Discovery , 2004, Annals of the New York Academy of Sciences.

[14]  Mark S. Boguski,et al.  Biomedical informatics for proteomics , 2003, Nature.

[15]  Jung Eun Shim,et al.  An integrated proteome database for two‐dimensional electrophoresis data analysis and laboratory information management system , 2002, Proteomics.

[16]  P. Auger,et al.  Integrative biology: linking levels of organization. , 2003, Comptes rendus biologies.

[17]  Daniel Teichroew,et al.  Mining protein data from two‐dimensional gels: Tools for systematic post‐planned analyses , 1999 .

[18]  Pierre-Alain Binz,et al.  The Make 2D‐DB II package: Conversion of federated two‐dimensional gel electrophoresis databases into a relational format and interconnection of distributed databases , 2003, Proteomics.

[19]  John P Helfrich Raw data to knowledge warehouse in proteomic-based drug discovery: a scientific data management issue. , 2002, BioTechniques.

[20]  Jari Häkkinen,et al.  PROTEIOS: an open source proteomics initiative , 2005, Bioinform..

[21]  Halima Bensmail,et al.  A novel approach for clustering proteomics data using Bayesian fast Fourier transform , 2005, Bioinform..

[22]  P F Lemkin,et al.  The 2DWG meta‐database of two‐dimensional electrophoretic gel images on the Internet , 1997, Electrophoresis.

[23]  B. Honoré,et al.  Functional genomics studied by proteomics. , 2004, BioEssays : news and reviews in molecular, cellular and developmental biology.