PARPs database: A LIMS systems for protein-protein interaction data mining or laboratory information management system

BackgroundIn the "post-genome" era, mass spectrometry (MS) has become an important method for the analysis of proteins and the rapid advancement of this technique, in combination with other proteomics methods, results in an increasing amount of proteome data. This data must be archived and analysed using specialized bioinformatics tools.DescriptionWe herein describe "PARPs database," a data analysis and management pipeline for liquid chromatography tandem mass spectrometry (LC-MS/MS) proteomics. PARPs database is a web-based tool whose features include experiment annotation, protein database searching, protein sequence management, as well as data-mining of the peptides and proteins identified.ConclusionUsing this pipeline, we have successfully identified several interactions of biological significance between PARP-1 and other proteins, namely RFC-1, 2, 3, 4 and 5.

[1]  Susan Smith,et al.  Resolution of Sister Telomere Association Is Required for Progression Through Mitosis , 2004, Science.

[2]  G. Maga,et al.  Human Proliferating Cell Nuclear Antigen, Poly(ADP-ribose) Polymerase-1, and p21waf1/cip1 , 2003, Journal of Biological Chemistry.

[3]  Alexey I Nesvizhskii,et al.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. , 2002, Analytical chemistry.

[4]  Katharina Dittmar,et al.  In silico characterization of the family of PARP-like poly(ADP-ribosyl)transferases (pARTs) , 2005, BMC Genomics.

[5]  Robertson Craig,et al.  TANDEM: matching proteins with tandem mass spectra. , 2004, Bioinformatics.

[6]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[7]  Dipanwita Roy Chowdhury,et al.  Human protein reference database as a discovery resource for proteomics , 2004, Nucleic Acids Res..

[8]  J. Yates,et al.  Direct analysis of protein complexes using mass spectrometry , 1999, Nature Biotechnology.

[9]  D Fenyö,et al.  Identifying the proteome: software tools. , 2000, Current opinion in biotechnology.

[10]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[11]  Donald F Hunt Personal commentary on proteomics. , 2002, Journal of proteome research.

[12]  E. Birney,et al.  The International Protein Index: An integrated database for proteomics experiments , 2004, Proteomics.

[13]  A. Spradling,et al.  Regulation of chromatin structure and gene activity by poly(ADP-ribose) polymerases. , 2003, Current topics in developmental biology.

[14]  Arnaud Droit,et al.  Bioinformatic Standards for Proteomics-Oriented Mass Spectrometry , 2006 .

[15]  Rolf Apweiler,et al.  Common interchange standards for proteomics data: Public availability of tools and schema. Report on the Proteomic Standards Initiative Workshop, 2nd Annual HUPO Congress, Montreal, Canada, 8–11th October 2003 , 2004, Proteomics.

[16]  Ivar Jacobson,et al.  The unified modeling language reference manual , 2010 .

[17]  Patrick Lambrix,et al.  Representations of molecular pathways: an evaluation of SBML, PSI MI and BioPAX , 2005, Bioinform..

[18]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[19]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[20]  William Stafford Noble,et al.  Learning to predict protein-protein interactions from protein sequences , 2003, Bioinform..

[21]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[22]  Ting Chen,et al.  Assessment of the reliability of protein-protein interactions and protein function prediction , 2002, Pacific Symposium on Biocomputing.

[23]  Chris F. Taylor,et al.  A systematic approach to modeling, capturing, and disseminating proteomics experimental data , 2003, Nature Biotechnology.

[24]  Mark Gerstein,et al.  SPINE 2: a system for collaborative structural proteomics within a federated database framework. , 2003, Nucleic acids research.

[25]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[26]  G. Poirier,et al.  Poly(ADP-ribosyl)ation reactions in the regulation of nuclear functions. , 1999, The Biochemical journal.

[27]  E. Wolf,et al.  A computationally directed screen identifying interacting coiled coils from Saccharomyces cerevisiae. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[28]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[29]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[30]  R. Aebersold,et al.  A uniform proteomics MS/MS analysis platform utilizing open XML file formats , 2005, Molecular systems biology.

[31]  C Bozzi,et al.  Measurement of the branching fraction, and bounds on the CP-violating asymmetries, of neutral B decays to D*+/- D-/+. , 2003, Physical review letters.

[32]  B Talbot,et al.  Structural and functional analysis of poly(ADP ribose) polymerase: an immunological study. , 1988, Biochimica et biophysica acta.

[33]  Michèle Rouleau,et al.  Poly(ADP-ribosyl)ated chromatin domains: access granted , 2004, Journal of Cell Science.

[34]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[35]  G. de Murcia,et al.  The PARP superfamily , 2004, BioEssays : news and reviews in molecular, cellular and developmental biology.

[36]  William S. Hancock The challenges ahead. , 2002 .

[37]  T. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2006, Nucleic Acids Res..

[38]  Ian M. Donaldson,et al.  BIND: the Biomolecular Interaction Network Database , 2001, Nucleic Acids Res..

[39]  R. Aebersold,et al.  A statistical model for identifying proteins by tandem mass spectrometry. , 2003, Analytical chemistry.

[40]  Tatiana A. Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[41]  P. Uetz,et al.  Systematic and large-scale two-hybrid screens. , 2000, Current opinion in microbiology.

[42]  Arnaud Droit,et al.  Experimental and bioinformatic approaches for interrogating protein-protein interactions to determine protein function. , 2005, Journal of molecular endocrinology.

[43]  Jung Eun Shim,et al.  An integrated proteome database for two‐dimensional electrophoresis data analysis and laboratory information management system , 2002, Proteomics.

[44]  J. Yates,et al.  Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. , 1995, Analytical chemistry.