Global protein interactome exploration through mining genome-scale data in Arabidopsis thaliana

BackgroundMany essential cellular processes, such as cellular metabolism, transport, cellular metabolism and most regulatory mechanisms, rely on physical interactions between proteins. Genome-wide protein interactome networks of yeast, human and several other animal organisms have already been established, but this kind of network reminds to be established in the field of plant.ResultsWe first predicted the protein protein interaction in Arabidopsis thaliana with methods, including ortholog, SSBP, gene fusion, gene neighbor, phylogenetic profile, coexpression, protein domain, and used Naïve Bayesian approach next to integrate the results of these methods and text mining data to build a genome-wide protein interactome network. Furthermore, we adopted the data of GO enrichment analysis, pathway, published literature to validate our network, the confirmation of our network shows the feasibility of using our network to predict protein function and other usage.ConclusionsOur interactome is a comprehensive genome-wide network in the organism plant Arabidopsis thaliana, and provides a rich resource for researchers in related field to study the protein function, molecular interaction and potential mechanism under different conditions.

[1]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[2]  D. Chandler,et al.  Analysis of protein interaction and function with a 3-dimensional MALDI-MS protein array. , 2005, BioTechniques.

[3]  Chuan Wang,et al.  InPrePPI: an integrated evaluation method based on genomic context for predicting protein-protein interactions in prokaryotic genomes , 2007, BMC Bioinformatics.

[4]  S. Van Huffel,et al.  The Bayesian approach: a natural framework for statistical modeling , 2007, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[5]  R. Chanet,et al.  Protein interaction mapping: a Drosophila case study. , 2005, Genome research.

[6]  Marc R Wilkins,et al.  Interactive three-dimensional visualization and contextual analysis of protein interaction networks. , 2008, Journal of proteome research.

[7]  Christian J Stoeckert,et al.  Computational modeling of the Plasmodium falciparum interactome reveals protein function on a genome-wide scale. , 2006, Genome research.

[8]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Peter D. Karp,et al.  MetaCyc and AraCyc. Metabolic Pathway Databases for Plant Research1[w] , 2005, Plant Physiology.

[10]  Erik L. L. Sonnhammer,et al.  InParanoid 7: new algorithms and tools for eukaryotic orthology analysis , 2009, Nucleic Acids Res..

[11]  M. Kanehisa,et al.  Computation with the KEGG pathway database. , 1998, Bio Systems.

[12]  Ioannis Xenarios,et al.  DIP: The Database of Interacting Proteins: 2001 update , 2001, Nucleic Acids Res..

[13]  Donald R Ort,et al.  Plant Physiology and TAIR Partnership , 2008, Plant Physiology.

[14]  Erik L. L. Sonnhammer,et al.  InParanoid 6: eukaryotic ortholog clusters with inparalogs , 2007, Nucleic Acids Res..

[15]  Y. Zhang,et al.  IntAct—open source resource for molecular interaction data , 2006, Nucleic Acids Res..

[16]  S. Kanaya,et al.  Large-scale identification of protein-protein interaction of Escherichia coli K-12. , 2006, Genome research.

[17]  H. Lehrach,et al.  A Human Protein-Protein Interaction Network: A Resource for Annotating the Proteome , 2005, Cell.

[18]  Dong Xu,et al.  Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae. , 2004, Nucleic acids research.

[19]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[20]  Guang Li,et al.  AtPID: Arabidopsis thaliana protein interactome database—an integrative platform for plant systems biology , 2007, Nucleic Acids Res..

[21]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[22]  P. Zimmermann,et al.  Genome-Scale Proteomics Reveals Arabidopsis thaliana Gene Models and Proteome Dynamics , 2008, Science.

[23]  James R. Knight,et al.  A Protein Interaction Map of Drosophila melanogaster , 2003, Science.

[24]  A. Harvey Millar,et al.  A Predicted Interactome for Arabidopsis1[C][W][OA] , 2007, Plant Physiology.

[25]  Erik L. L. Sonnhammer,et al.  Inparanoid: a comprehensive database of eukaryotic orthologs , 2004, Nucleic Acids Res..

[26]  Mingzhi Lin,et al.  Computational Identification of Potential Molecular Interactions in Arabidopsis1[C][W] , 2009, Plant Physiology.

[27]  Hui Lu,et al.  Correlation between gene expression profiles and protein-protein interactions within and across genomes , 2005, Bioinform..

[28]  Chris Mungall,et al.  AmiGO: online access to ontology and annotation data , 2008, Bioinform..

[29]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.

[30]  Matteo Pellegrini,et al.  Prolinks: a database of protein functional linkages derived from coevolution , 2004, Genome Biology.

[31]  Susumu Goto,et al.  KEGG bioinformatics resource for plant genomics research. , 2007, Methods in molecular biology.

[32]  Ian M. Donaldson,et al.  BIND: the Biomolecular Interaction Network Database , 2001, Nucleic Acids Res..

[33]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[34]  K. N. Chandrika,et al.  Analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets , 2006, Nature Genetics.

[35]  M. Gerstein,et al.  A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data , 2003, Science.

[36]  Alessandro Vespignani,et al.  Global protein function prediction from protein-protein interaction networks , 2003, Nature Biotechnology.

[37]  Adam J. Smith,et al.  The Database of Interacting Proteins: 2004 update , 2004, Nucleic Acids Res..

[38]  S. L. Wong,et al.  A Map of the Interactome Network of the Metazoan C. elegans , 2004, Science.

[39]  Randall C Willis,et al.  Searching, viewing, and visualizing data in the Biomolecular Interaction Network Database (BIND). , 2006, Current protocols in bioinformatics.

[40]  A. Fraser,et al.  A first-draft human protein-interaction map , 2004, Genome Biology.

[41]  T. Barrette,et al.  Probabilistic model of the human protein-protein interaction network , 2005, Nature Biotechnology.

[42]  Eve Syrkin Wurtele,et al.  Articulation of three core metabolic processes in Arabidopsis: Fatty acid biosynthesis, leucine catabolism and starch metabolism , 2008, BMC Plant Biology.

[43]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[44]  David W. Scott The New S Language , 1990 .

[45]  K. Gould,et al.  Proteomics Analysis Reveals Stable Multiprotein Complexes in Both Fission and Budding Yeasts Containing Myb-Related Cdc5p/Cef1p, Novel Pre-mRNA Splicing Factors, and snRNAs , 2002, Molecular and Cellular Biology.

[46]  Rebecca L Poole The TAIR database. , 2007, Methods in molecular biology.