Protein-protein interaction network prediction using stochastic learning automata induced differential evolution

Display Omitted Formulation of PPI problem as a single objective optimization problem.Two novel approaches are proposed to predict a PPI network by independently maximizing individual objectives.Individual objectives are optimized by differential evolution (DE) and stochastic learning automata (SLA).DE is employed to globally explore the search space and SLA for adaptive tuning of the control parameters of the algorithm.The proposed technique outperforms the existing PPI prediction methods. Protein-protein interactions (PPIs) are of biological interest for their active participation in coordinating a number of cellular processes in living organisms. This paper attempts to formulate PPIs as an optimization problem with an aim to independently maximize (a) the stability of a complex formed by two proteins predicted to be interacting, (b) the difference between their individual accessible solvent area and that of the corresponding protein-protein complex, (c) their functional similarity and d) the occurrence of their interacting domain pairs. The novelty of the paper lies in ranking the set of PPI networks, obtained through independently optimizing individual objectives, using two approaches. The first approach is concerned with identifying the equally good PPI networks based on their fitness-based ranks with respect to individual four objectives. The second approach aims at sorting the PPI networks based on their fuzzy memberships to satisfy individual four objectives. The paper also proposes a novel single objective optimization algorithm to optimize individual objectives, influencing the true prediction of a PPI network. The proposed algorithm is realized by an amalgamation of the differential evolution and the stochastic learning automata, where the former is employed to globally explore the search space and the latter for the adaptive tuning of the control parameter of the algorithm. The proposed technique outperforms the existing methods, including relative specific similarity, domain cohesion coupling, random decision forest, fuzzy support vector machine and evolutionary/swarm algorithm based approaches, with respect to both sensitivity and specificity.

[1]  Sanjay Nilapwar Characterization and exploitation of protein ligand interactions for structure based drug design , 2009 .

[2]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[3]  Giulio Superti-Furga,et al.  A physical and functional map of the human TNF-α/NF-κB signal transduction pathway , 2004, Nature Cell Biology.

[5]  Zhirong Sun,et al.  Inferring functional linkages between proteins from evolutionary scenarios. , 2006, Journal of molecular biology.

[6]  Edith D. Wong,et al.  Saccharomyces Genome Database: the genomics resource of budding yeast , 2011, Nucleic Acids Res..

[7]  Taehoon Kim,et al.  CHARMM‐GUI: A web‐based graphical user interface for CHARMM , 2008, J. Comput. Chem..

[8]  Werner Braun,et al.  Exact and efficient analytical calculation of the accessible surface areas and their gradients for macromolecules , 1998, J. Comput. Chem..

[9]  Pratyusha Rakshit,et al.  Evaluating the designing perspective of Protein-Protein Interaction network using evolutionary algorithm , 2012, 2012 Fourth World Congress on Nature and Biologically Inspired Computing (NaBIC).

[10]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[11]  Pratyusha Rakshit,et al.  An Evolutionary Approach for Analysing the Effect of Interaction Site Structural Features on Protein- Protein Complex Formation , 2013, PReMI.

[12]  F. Cohen,et al.  Co-evolution of proteins with their interaction partners. , 2000, Journal of molecular biology.

[13]  Jung-Hsien Chiang,et al.  In Silico Prediction of Human Protein Interactions Using Fuzzy–SVM Mixture Models and Its Application to Cancer Research , 2008, IEEE Transactions on Fuzzy Systems.

[14]  Matej Crepinsek,et al.  A note on teaching-learning-based optimization algorithm , 2012, Inf. Sci..

[15]  A. Valencia,et al.  Similarity of phylogenetic trees as indicator of protein-protein interaction. , 2001, Protein engineering.

[16]  Ziv Bar-Joseph,et al.  Evaluation of different biological data and computational classification methods for use in protein interaction prediction , 2006, Proteins.

[17]  Peter A. Kollman,et al.  Computational alanine scanning of the 1:1 human growth hormone–receptor complex , 2002, J. Comput. Chem..

[18]  Werner Braun,et al.  Exact and efficient analytical calculation of the accessible surface areas and their gradients for macromolecules , 1998 .

[19]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[20]  Gary D Bader,et al.  Domain‐mediated protein interaction prediction: From genome to network , 2012, FEBS letters.

[21]  Marjan Mernik,et al.  Exploration and exploitation in evolutionary algorithms: A survey , 2013, CSUR.

[22]  B. Snel,et al.  Conservation of gene order: a fingerprint of proteins that physically interact. , 1998, Trends in biochemical sciences.

[23]  Marjan Mernik,et al.  Is a comparison of results meaningful from the inexact replications of computational experiments? , 2016, Soft Comput..

[24]  R. Chanet,et al.  Protein interaction mapping: a Drosophila case study. , 2005, Genome research.

[25]  Xiaomei Wu,et al.  Prediction of yeast protein–protein interaction network: insights from the Gene Ontology and annotations , 2006, Nucleic acids research.

[26]  E. Sprinzak,et al.  Correlated sequence-signatures as markers of protein-protein interaction. , 2001, Journal of molecular biology.

[27]  Pratyusha Rakshit,et al.  Protein Function Prediction Using Adaptive Swarm Based Algorithm , 2013, SEMCCO.

[28]  S. L. Wong,et al.  A Map of the Interactome Network of the Metazoan C. elegans , 2004, Science.

[29]  Sailu Yellaboina,et al.  DOMINE: a comprehensive collection of known and predicted domain-domain interactions , 2010, Nucleic Acids Res..

[30]  David Baker,et al.  Protein–protein docking predictions for the CAPRI experiment , 2003, Proteins.

[31]  Swagatam Das,et al.  Inducing Niching Behavior in Differential Evolution Through Local Information Sharing , 2015, IEEE Transactions on Evolutionary Computation.

[32]  Haruki Nakamura,et al.  Filtering high-throughput protein-protein interaction data using a combination of genomic features , 2005, BMC Bioinformatics.

[33]  Pratyusha Rakshit,et al.  A modified bat algorithm to predict Protein-Protein Interaction network , 2014, 2014 IEEE Congress on Evolutionary Computation (CEC).

[34]  A. D. McLachlan,et al.  Solvation energy in protein folding and binding , 1986, Nature.

[35]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[36]  Marjan Mernik,et al.  Replication and comparison of computational experiments in applied evolutionary computing: Common pitfalls and guidelines to avoid them , 2014, Appl. Soft Comput..

[37]  Kay Chen Tan,et al.  Multimodal Optimization Using a Biobjective Differential Evolution Algorithm Enhanced With Mean Distance-Based Selection , 2013, IEEE Transactions on Evolutionary Computation.

[38]  D. Eisenberg,et al.  Protein function in the post-genomic era , 2000, Nature.

[39]  A. Valencia,et al.  Correlated mutations contain information about protein-protein interaction. , 1997, Journal of molecular biology.

[40]  Dervis Karaboga,et al.  On clarifying misconceptions when comparing variants of the Artificial Bee Colony Algorithm by offering a new implementation , 2015, Inf. Sci..

[41]  Jing J. Liang,et al.  Problem Definitions and Evaluation Criteria for the CEC 2005 Special Session on Real-Parameter Optimization , 2005 .

[42]  William Stafford Noble,et al.  Large-scale identification of yeast integral membrane protein interactions. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[43]  B. Stoddard,et al.  Design, activity, and structure of a highly specific artificial endonuclease. , 2002, Molecular cell.

[44]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[45]  Edward A. Bender,et al.  Mathematical methods in artificial intelligence , 1996 .

[46]  Werner Braun,et al.  Efficient search for all low energy conformations of polypeptides by Monte Carlo methods , 1991 .

[47]  R. Kini,et al.  Prediction of potential protein‐protein interaction sites from amino acid sequence , 1996, FEBS letters.

[48]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[49]  Anton J. Enright,et al.  Protein interaction maps for complete genomes based on gene fusion events , 1999, Nature.

[50]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[51]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[52]  Mei Liu,et al.  Prediction of protein-protein interactions using random decision forest framework , 2005, Bioinform..

[53]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[54]  Mark A. Ragan,et al.  BMC Systems Biology BioMed Central Research article Protein-protein interaction as a predictor of subcellular location , 2008 .

[55]  D. Eisenberg,et al.  Use of Logic Relationships to Decipher Protein Network Organization , 2004, Science.

[56]  Pratyusha Rakshit,et al.  Muti-objective evolutionary approach of ligand design for protein-ligand docking problem , 2013, 2013 IEEE Congress on Evolutionary Computation.

[57]  S. Jones,et al.  Prediction of protein-protein interaction sites using patch analysis. , 1997, Journal of molecular biology.

[58]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[59]  Francisco Herrera,et al.  A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms , 2011, Swarm Evol. Comput..

[60]  S. Lakshmivarahan,et al.  Absolutely Expedient Learning Algorithms For Stochastic Automata , 1973 .

[61]  Dong-Soo Han,et al.  A Computational Model for Predicting Protein Interactions Based on Multidomain Collaboration , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[62]  J. Davies,et al.  Molecular Biology of the Cell , 1983, Bristol Medico-Chirurgical Journal.

[63]  Bruce Tidor,et al.  Barstar is electrostatically optimized for tight binding to barnase , 2001, Nature Structural Biology.

[64]  Jingyu Hou,et al.  Predicting protein functions from PPI networks using functional aggregation. , 2012, Mathematical biosciences.

[65]  H. Scheraga,et al.  Accessible surface areas as a measure of the thermodynamic parameters of hydration of peptides. , 1987, Proceedings of the National Academy of Sciences of the United States of America.