Simplified Swarm Optimization-Based Function Module Detection in Protein–Protein Interaction Networks

Proteomics research has become one of the most important topics in the field of life science and natural science. At present, research on protein–protein interaction networks (PPIN) mainly focuses on detecting protein complexes or function modules. However, existing approaches are either ineffective or incomplete. In this paper, we investigate detection mechanisms of functional modules in PPIN, including open database, existing detection algorithms, and recent solutions. After that, we describe the proposed approach based on the simplified swarm optimization (SSO) algorithm and the knowledge of Gene Ontology (GO). The proposed solution implements the SSO algorithm for clustering proteins with similar function, and imports biological gene ontology knowledge for further identifying function complexes and improving detection accuracy. Furthermore, we use four different categories of species datasets for experiment: fruitfly, mouse, scere, and human. The testing and analysis result show that the proposed solution is feasible, efficient, and could achieve a higher accuracy of prediction than existing approaches.

[1]  Marco Dorigo Ant colony optimization , 2004, Scholarpedia.

[2]  Livia Perfetto,et al.  MINT, the molecular interaction database: 2012 update , 2011, Nucleic Acids Res..

[3]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[4]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[5]  Aidong Zhang,et al.  MAE-FMD: Multi-agent evolutionary method for functional module detection in protein-protein interaction networks , 2014, BMC Bioinformatics.

[6]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[7]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[8]  Austin G. Meyer,et al.  Systematic humanization of yeast genes reveals conserved functions and genetic modularity , 2015, Science.

[9]  David Nelson,et al.  Bioinformatic Analysis Using Complex Networks and Clustering Proteins Linked with Alzheimer's Disease , 2016, UKCI.

[10]  Ivet Bahar,et al.  iGNM 2.0: the Gaussian network model database for biomolecular structural dynamics , 2015, Nucleic Acids Res..

[11]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[12]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[13]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Jijun Tang,et al.  Identification of Protein–Protein Interactions via a Novel Matrix-Based Sequence Representation Model with Amino Acid Contact Information , 2016, International journal of molecular sciences.

[15]  Bin Xu,et al.  From Function to Interaction: A New Paradigm for Accurately Predicting Protein Complexes Based on Protein-to-Protein Interaction Networks , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  Mario Albrecht,et al.  FunSimMat: a comprehensive functional similarity database , 2007, Nucleic Acids Res..

[17]  Ron Shamir,et al.  A clustering algorithm based on graph connectivity , 2000, Inf. Process. Lett..

[18]  Livia Perfetto,et al.  MINT, the molecular interaction database: 2009 update , 2009, Nucleic Acids Res..

[19]  Yuan Zhang,et al.  A generative model of identifying informative proteins from dynamic PPI networks , 2014, Science China Life Sciences.

[20]  Hong Yan,et al.  Fast prediction of protein-protein interaction sites based on Extreme Learning Machines , 2014, Neurocomputing.

[21]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[22]  Xiu-Juan Lei,et al.  The Information Flow Clustering Model and Algorithm Based on the Artificial Bee Colony Mechanism of PPI Network: The Information Flow Clustering Model and Algorithm Based on the Artificial Bee Colony Mechanism of PPI Network , 2012 .

[23]  Thomas Stützle,et al.  Ant Colony Optimization , 2009, EMO.

[24]  Rafael C. Jimenez,et al.  The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases , 2013, Nucleic Acids Res..

[25]  Dmitrij Frishman,et al.  Negatome 2.0: a database of non-interacting proteins derived by literature mining, manual annotation and protein structure analysis , 2013, Nucleic Acids Res..

[26]  J. Dorado,et al.  Plasmod-PPI: A web-server predicting complex biopolymer targets in plasmodium with entropy measures of protein–protein interactions , 2010 .

[27]  Lei Xiu The Information Flow Clustering Model and Algorithm Based on the Artificial Bee Colony Mechanism of PPI Network , 2012 .

[28]  Tatsuya Akutsu,et al.  Prediction of heterotrimeric protein complexes by two-phase learning using neighboring kernels , 2014, BMC Bioinformatics.

[29]  Md. Islam,et al.  Comparative analysis of differential network modularity in tissue specific normal and cancer protein interaction networks , 2013, Journal of Clinical Bioinformatics.

[30]  Roded Sharan,et al.  Revealing modularity and organization in the yeast molecular network by integrated analysis of highly heterogeneous genomewide data. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Aidong Zhang,et al.  Improved Ant Colony Optimization for Detecting Functional Modules in Protein-Protein Interaction Networks , 2012, ICICA.

[32]  D. Karaboga,et al.  On the performance of artificial bee colony (ABC) algorithm , 2008, Appl. Soft Comput..

[33]  Jiangning Song,et al.  Identifying Coevolution Between Amino Acid Residues in Protein Families: Advances in the Improvement and Evaluation of Correlated Mutation Algorithms , 2013 .

[34]  J. Dorado,et al.  Trypano-PPI: a web server for prediction of unique targets in trypanosome proteome by using electrostatic parameters of protein-protein interactions. , 2010, Journal of proteome research.

[35]  Kengo Kinoshita,et al.  SAHG, a comprehensive database of predicted structures of all human proteins , 2010, Nucleic Acids Res..

[36]  Aidong Zhang,et al.  Ant Colony Optimization with Multi-Agent Evolution for Detecting Functional Modules in Protein-Protein Interaction Networks , 2012, ICICA.

[37]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.