MONET: a toolbox integrating top-performing methods for network modularisation

Abstract Summary We define a disease module as a partition of a molecular network whose components are jointly associated with one or several diseases or risk factors thereof. Identification of such modules, across different types of networks, has great potential for elucidating disease mechanisms and establishing new powerful bio-markers. To this end, we launched the “Disease Module Identification (DMI) DREAM Challenge”, a community effort to build and evaluate unsupervised molecular network modularisation algorithms (Choobdar et al., 2018). Here we present MONET, a toolbox providing easy and unified access to the three top methods from the DMI DREAM Challenge for the bioinformatics community. Availability and Implementation MONET is a command line tool for Linux, based on Docker and Singularity containers; the core algorithms were written in R, Python, Ada and C++. It is freely available for download at https://github.com/BergmannLab/MONET.git Contact mattia.tomasoni@unil.ch (MT); sven.bergmann@unil.ch (SB)

[1]  Donna K. Slonim,et al.  Open Community Challenge Reveals Molecular Network Modules with Key Roles in Diseases , 2018 .

[2]  Alex Arenas,et al.  Analysis of the structure of complex networks at different resolution levels , 2007, physics/0703218.

[3]  Albert Solé-Ribalta,et al.  Navigability of interconnected networks under random failures , 2013, Proceedings of the National Academy of Sciences.

[4]  Arthur Brady,et al.  Fault Tolerance in Protein Interaction Networks: Stable Bipartite Subgraphs and Redundant Pathways , 2009, PloS one.

[5]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Noah M. Daniels,et al.  Going the Distance for Protein Function Prediction: A New Distance Metric for Protein Interaction Networks , 2013, PloS one.

[7]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[8]  Lenore Cowen,et al.  Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data , 2012, BMC Bioinformatics.

[9]  Alexandre Arenas,et al.  Identifying modular flows on multilayer networks reveals highly overlapping organization in social systems , 2014, ArXiv.

[10]  Mason A. Porter,et al.  Author Correction: The physics of spreading processes in multilayer networks , 2016, 1604.02021.

[11]  M. Newman Analysis of weighted networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Vanessa Sochat,et al.  Singularity: Scientific containers for mobility of compute , 2017, PloS one.

[13]  Peter Christen,et al.  Data Pre-Processing , 2012 .

[14]  Srinivasan Parthasarathy,et al.  Markov clustering of protein interaction networks with improved balance and scalability , 2010, BCB '10.

[15]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Dirk Merkel,et al.  Docker: lightweight Linux containers for consistent development and deployment , 2014 .

[17]  Daniel Marbach,et al.  Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics , 2016, PLoS Comput. Biol..

[18]  Sergio Gómez,et al.  Size reduction of complex networks preserving modularity , 2007, ArXiv.

[19]  T. Ideker,et al.  Systematic interpretation of genetic interactions using protein networks , 2005, Nature Biotechnology.

[20]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[22]  Joydeep Ghosh,et al.  Data Clustering Algorithms And Applications , 2013 .

[23]  Ignacio Marín,et al.  Iterative Cluster Analysis of Protein Interaction Data , 2005, Bioinform..

[24]  Sergio Gómez,et al.  Hierarchical Multiresolution Method to Overcome the Resolution Limit in Complex Networks , 2012, Int. J. Bifurc. Chaos.

[25]  Srinivasan Parthasarathy,et al.  Scalable graph clustering using stochastic flows: applications to community discovery , 2009, KDD.

[26]  Lenore Cowen,et al.  New directions for diffusion-based network prediction of protein function: incorporating pathways with confidence , 2014, Bioinform..

[27]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.