Discovering Entities Similarities in Biological Networks Using a Hybrid Immune Algorithm

Disease phenotypes are generally caused by the failure of gene modules which often have similar biological roles. Through the study of biological networks, it is possible to identify the intrinsic structure of molecular interactions in order to identify the so-called “disease modules”. Community detection is an interesting and valuable approach to discovering the structure of the community in a complex network, revealing the internal organization of the nodes, and has become a leading research topic in the analysis of complex networks. This work investigates the link between biological modules and network communities in test-case biological networks that are commonly used as a reference point and which include Protein–Protein Interaction Networks, Metabolic Networks and Transcriptional Regulation Networks. In order to identify small and structurally well-defined communities in the biological context, a hybrid immune metaheuristic algorithm Hybrid-IA is proposed and compared with several metaheuristics, hyper-heuristics, and the well-known greedy algorithm Louvain, with respect to modularity maximization. Considering the limitation of modularity optimization, which can fail to identify smaller communities, the reliability of Hybrid-IA was also analyzed with respect to three well-known sensitivity analysis measures (NMI, ARI and NVI) that assess how similar the detected communities are to real ones. By inspecting all outcomes and the performed comparisons, we will see that on one hand Hybrid-IA finds slightly lower modularity values than Louvain, but outperforms all other metaheuristics, while on the other hand, it can detect communities more similar to the real ones when compared to those detected by Louvain.

[1]  Lun Hu,et al.  A Fast Fuzzy Clustering Algorithm for Complex Networks via a Generalized Momentum Method , 2022, IEEE Transactions on Fuzzy Systems.

[2]  Anderson P. Avila-Santos,et al.  BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria , 2022, Briefings Bioinform..

[3]  Zhuhong You,et al.  Identifying Protein Complexes from Protein–Protein Interaction Networks , 2014 .

[4]  Giuseppe Vizzari,et al.  A Novel Spatial-Temporal Analysis Approach to Pedestrian Groups Detection , 2022, KES.

[5]  Zhu-Hong You,et al.  HiSCF: leveraging higher-order structures for clustering analysis in biological networks , 2020, Bioinform..

[6]  Shengwu Xiong,et al.  A Variational Bayesian Framework for Cluster Analysis in a Complex Network , 2020, IEEE Transactions on Knowledge and Data Engineering.

[7]  Jordi Vitrià,et al.  Corridor Detection from Large GPS Trajectories Datasets , 2020, Applied Sciences.

[8]  Tamer Kahveci,et al.  Stability Analysis of Biological Networks’ Diffusion State , 2020, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[9]  Vincenzo Cutello,et al.  Optimization Algorithms for Detection of Social Interactions , 2020, Algorithms.

[10]  R. A. Scollo,et al.  Where the Local Search Affects Best in an Immune Algorithm , 2020, AI*IA.

[11]  Emiliano Tramontana,et al.  Eliciting cities points of interest from people movements and suggesting effective itineraries , 2020, Intelligenza Artificiale.

[12]  Vincenzo Cutello,et al.  An Immunological Algorithm for Graph Modularity Optimization , 2019, UKCI.

[13]  Plamen Angelov,et al.  Advances in Computational Intelligence Systems - Contributions Presented at the 18th UK Workshop on Computational Intelligence, September 5-7, 2018, Nottingham, UK , 2016, UKCI.

[14]  Mario Pavone,et al.  DENSA: An effective negative selection algorithm with flexible boundaries for self-space and dynamic number of detectors , 2017, Eng. Appl. Artif. Intell..

[15]  Halife Kodaz,et al.  Community detection from biological and social networks: A comparative analysis of metaheuristic algorithms , 2017, Appl. Soft Comput..

[16]  David R. Penas,et al.  Parameter estimation in large-scale systems biology models: a parallel and self-adaptive cooperative strategy , 2017, BMC Bioinformatics.

[17]  Keith C. C. Chan,et al.  Fuzzy Clustering in a Complex Network Based on Content Relevance and Link Structures , 2016, IEEE Transactions on Fuzzy Systems.

[18]  Santo Fortunato,et al.  Detection of gene communities in multi-networks reveals cancer drivers , 2015, Scientific Reports.

[19]  Domingo Giménez,et al.  Hyperheuristics Based on Parametrized Metaheuristic Schemes , 2015, GECCO.

[20]  F. Glover,et al.  Metaheuristics , 2016, Springer International Publishing.

[21]  Hong Cheng,et al.  GBAGC: A General Bayesian Framework for Attributed Graph Clustering , 2014, TKDD.

[22]  Walter Didimo,et al.  Fast layout computation of clustered networks: Algorithmic advances and experimental analysis , 2014, Inf. Sci..

[23]  Francisco Torres-Quiroz,et al.  Integrative avenues for exploring the dynamics and evolution of protein interaction networks. , 2013, Current opinion in biotechnology.

[24]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data , 2012, BMC Bioinformatics.

[25]  Binhua Tang,et al.  Hierarchical Modularity in ERα Transcriptional Network Is Associated with Distinct Functions and Implicates Clinical Outcomes , 2012, Scientific Reports.

[26]  Pinar Civicioglu,et al.  Transforming geocentric cartesian coordinates to geodetic coordinates by using differential search algorithm , 2012, Comput. Geosci..

[27]  Giuseppe Nicosia,et al.  Clonal selection: an immunological algorithm for global optimization over continuous spaces , 2012, J. Glob. Optim..

[28]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[29]  Asher Mullard,et al.  Protein–protein interaction inhibitors get into the groove , 2012, Nature Reviews Drug Discovery.

[30]  Vincenzo Cutello,et al.  A Memetic Immunological Algorithm for Resource Allocation Problem , 2011, ICARIS.

[31]  S. Baserga,et al.  Assembling a Protein-Protein Interaction Map of the SSU Processome from Existing Datasets , 2011, PloS one.

[32]  Joachim Gudmundsson,et al.  Detecting Commuting Patterns by Clustering Subtrajectories , 2008, Int. J. Comput. Geom. Appl..

[33]  Germán Terrazas,et al.  Nature Inspired Cooperative Strategies for Optimization, NICSO 2010, May 12-14, 2010, Granada, Spain , 2012, NISCO.

[34]  Ming Chen,et al.  PRIN: a predicted rice interactome network , 2011, BMC Bioinformatics.

[35]  Vincenzo Cutello,et al.  An Information-Theoretic Approach for Clonal Selection Algorithms , 2010, ICARIS.

[36]  Ariel S. Schwartz,et al.  An Atlas of Combinatorial Transcriptional Regulation in Mouse and Man , 2010, Cell.

[37]  Joshua S Yuan,et al.  Plant Protein-Protein Interaction Network and Interactome , 2010, Current genomics.

[38]  Benjamin H. Good,et al.  Performance of modularity maximization in practical contexts. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[40]  Bernhard O. Palsson,et al.  BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions , 2010, BMC Bioinformatics.

[41]  N. Loberto,et al.  Activity of plasma membrane β‐galactosidase and β‐glucosidase , 2009, FEBS letters.

[42]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[43]  Hossein Nezamabadi-pour,et al.  GSA: A Gravitational Search Algorithm , 2009, Inf. Sci..

[44]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[45]  David Warde-Farley,et al.  Dynamic modularity in protein interaction networks predicts breast cancer outcome , 2009, Nature Biotechnology.

[46]  A. Barabasi,et al.  High-Quality Binary Protein Interaction Map of the Yeast Interactome Network , 2008, Science.

[47]  Krin A. Kay,et al.  The implications of human metabolic network topology for disease comorbidity , 2008, Proceedings of the National Academy of Sciences.

[48]  Natali Gulbahce,et al.  The art of community detection , 2008, BioEssays : news and reviews in molecular, cellular and developmental biology.

[49]  Rachel B. Brem,et al.  Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks , 2008, Nature Genetics.

[50]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[51]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[52]  Ulrik Brandes,et al.  On Modularity Clustering , 2008, IEEE Transactions on Knowledge and Data Engineering.

[53]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[54]  M. Meilă Comparing clusterings---an information based distance , 2007 .

[55]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[56]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[57]  Fred W. Glover,et al.  Principles of scatter search , 2006, Eur. J. Oper. Res..

[58]  Ibrahim Eksin,et al.  A new optimization method: Big Bang-Big Crunch , 2006, Adv. Eng. Softw..

[59]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[60]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[61]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[62]  M. Newman Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[63]  M. Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[64]  Dennis M. Wilkinson,et al.  A method for finding communities of related genes , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[65]  D. Bu,et al.  Topological structure analysis of the protein-protein interaction network in budding yeast. , 2003, Nucleic acids research.

[66]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[67]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[68]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[69]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[70]  J. Wojcik,et al.  The protein–protein interaction map of Helicobacter pylori , 2001, Nature.

[71]  Robert Ross,et al.  Reduction in Obesity and Related Comorbid Conditions after Diet-Induced Weight Loss or Exercise-Induced Weight Loss in Men , 2000, Annals of Internal Medicine.

[72]  A. Barabasi,et al.  Scale-free characteristics of random networks: the topology of the world-wide web , 2000 .

[73]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[74]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[75]  L. Hubert,et al.  Comparing partitions , 1985 .

[76]  F. Glover HEURISTICS FOR INTEGER PROGRAMMING USING SURROGATE CONSTRAINTS , 1977 .

[77]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..