PCD-DPPI: Protein complex detection from dynamic PPI using shuffled frog-leaping algorithm

Abstract In the post-genome period, the identification of a protein complex from large PPI (protein-protein interaction) networks is a challenging task. Protein complexes plays a vital character in numerous molecular processes of the cell. Although, the maximum computational techniques of protein complex detection have aimed at static PPI networks that cannot represent the logical dynamicity of protein interactions. In recent times, the dynamicity of PPI networks has been exploited by building a set of dynamic PPI subnetworks with respect to every time-point in a gene expression matrix. This paper presents a new technique, called Protein Complex Detection based on Dynamic PPI (PCD-DPPI) to accomplish dynamicity in protein complex detection. The initial stage of the proposed technique is based on shuffled frog-leaping algorithm, an Optimization technique that take out a few sets of genes that are co-regulated beneath some conditions from the input gene expression matrix. An individual extracted gene set is defined as a bicluster. In the second stage, depending on the biclusters, few dynamic PPI subnetworks are taken out from the input static PPI network. By employing a complex detection approach on every dynamic PPI subnetworks, protein complexes are identified and the result is aggregated. The proposed and existing algorithms were applied to various datasets such as DIP, Krogan “Extended”, Krogan “Core”, Gavin2, Gavin6, PPI 1, Collins and Gavin + Krogan. Experimental results have been proved that the proposed PCD-DPPI efficiently depicts the dynamicity characteristics in static PPI networks and attains expressively better results when compared to other existing standard methods.

[1]  Yuan Zhang,et al.  Evolutionary analysis of functional modules in dynamic PPI networks , 2012, BCB.

[2]  Ying Xu,et al.  QUBIC: a qualitative biclustering algorithm for analyses of gene expression data , 2009, Nucleic acids research.

[3]  Roded Sharan,et al.  Discovering statistically significant biclusters in gene expression data , 2002, ISMB.

[4]  Johannes Goll,et al.  Protein interaction data curation: the International Molecular Exchange (IMEx) consortium , 2012, Nature Methods.

[5]  Siu-Ming Yiu,et al.  Predicting Protein Complexes from PPI Data: A Core-Attachment Approach , 2009, J. Comput. Biol..

[6]  Saeed Jalili,et al.  PCD-GED: Protein complex detection considering PPI dynamics based on time series gene expression data. , 2015, Journal of theoretical biology.

[7]  Sven Bergmann,et al.  Iterative signature algorithm for the analysis of large-scale gene expression data. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Ioannis Xenarios,et al.  DIP: The Database of Interacting Proteins: 2001 update , 2001, Nucleic Acids Res..

[9]  Zhenjia Wang,et al.  UniBic: Sequential row-based biclustering algorithm for analysis of gene expression data , 2016, Scientific Reports.

[10]  Alan J Tackett,et al.  Analysis of stable and transient protein-protein interactions. , 2012, Methods in molecular biology.

[11]  Saeed Jalili,et al.  BiCAMWI: A Genetic-Based Biclustering Algorithm for Detecting Dynamic Protein Complexes , 2016, PloS one.

[12]  A. Kudlicki,et al.  Logic of the Yeast Metabolic Cycle: Temporal Compartmentalization of Cellular Processes , 2005, Science.

[13]  Sean R. Collins,et al.  Toward a Comprehensive Atlas of the Physical Interactome of Saccharomyces cerevisiae*S , 2007, Molecular & Cellular Proteomics.

[14]  Hon Wai Leong,et al.  A survey of computational methods for protein complex prediction from protein interaction networks , 2012, J. Bioinform. Comput. Biol..

[15]  Jun S Liu,et al.  Bayesian biclustering of gene expression data , 2008, BMC Genomics.

[16]  Adrian E. Raftery,et al.  Integrating external biological knowledge in the construction of regulatory networks from time-series expression data , 2012, BMC Systems Biology.

[17]  Philip S. Yu,et al.  An Improved Biclustering Method for Analyzing Gene Expression Profiles , 2005, Int. J. Artif. Intell. Tools.

[18]  Nazar Zaki,et al.  Protein complex detection using interaction reliability assessment and weighted clustering coefficient , 2013, BMC Bioinformatics.

[19]  Richard M. Karp,et al.  Discovering local structure in gene expression data: the order-preserving submatrix problem. , 2003 .

[20]  George M. Church,et al.  Biclustering of Expression Data , 2000, ISMB.

[21]  Yongjin Park,et al.  How networks change with time , 2012, Bioinform..

[22]  Chen-Ching Lin,et al.  Dynamic protein interaction modules in human hepatocellular carcinoma progression , 2013, BMC Systems Biology.

[23]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[24]  Haiyuan Yu,et al.  Genome-scale analysis of interaction dynamics reveals organization of biological networks , 2012, Bioinform..

[25]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[26]  Xiaoli Li,et al.  Computational approaches for detecting protein complexes from protein interaction networks: a survey , 2010, BMC Genomics.

[27]  Guimei Liu,et al.  Complex discovery from weighted PPI networks , 2009, Bioinform..

[28]  Hon Wai Leong,et al.  MCL-CAw: a refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure , 2010, BMC Bioinformatics.

[29]  Haiyuan Yu,et al.  Detecting overlapping protein complexes in protein-protein interaction networks , 2012, Nature Methods.

[30]  T. M. Murali,et al.  Extracting Conserved Gene Expression Motifs from Gene Expression Data , 2002, Pacific Symposium on Biocomputing.

[31]  Chee Keong Kwoh,et al.  Discovery of Protein Complexes with Core-Attachment Structures from Tandem Affinity Purification (TAP) Data , 2012, J. Comput. Biol..

[32]  Chung-Yen Lin,et al.  A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles , 2010, BMC Bioinformatics.

[33]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..

[34]  S. Pu,et al.  Up-to-date catalogues of yeast protein complexes , 2008, Nucleic acids research.

[35]  Pedro L. Iglesias-Rey,et al.  The efficiency of setting parameters in a modified Shuffled Frog Leaping Algorithm applied to optimizing water distribution networks. , 2016 .

[36]  Desmond J. Higham,et al.  A clustering coefficient for weighted networks, with application to gene expression data , 2007, AI Commun..

[37]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[38]  Muzaffar Eusuff,et al.  Shuffled frog-leaping algorithm: a memetic meta-heuristic for discrete optimization , 2006 .

[39]  Lothar Thiele,et al.  A systematic comparison and evaluation of biclustering methods for gene expression data , 2006, Bioinform..

[40]  Sean R. Collins,et al.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae , 2006, Nature.

[41]  Peng Yang,et al.  Detecting temporal protein complexes from dynamic protein-protein interaction networks , 2014, BMC Bioinformatics.

[42]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[43]  K. Komurov,et al.  Revealing static and dynamic modular architecture of the eukaryotic protein interaction network , 2007, Molecular Systems Biology.

[44]  Joana P Gonçalves,et al.  BiGGEsTS: integrated environment for biclustering analysis of time series gene expression data , 2009, BMC Research Notes.

[45]  Sushmita Mitra,et al.  Multi-objective evolutionary biclustering of gene expression data , 2006, Pattern Recognit..

[46]  Saeed Jalili,et al.  CAMWI: Detecting protein complexes using weighted clustering coefficient and weighted density , 2015, Comput. Biol. Chem..