Benchmarking network algorithms for contextualizing genes of interest

Computational approaches have shown promise in contextualizing genes of interest with known molecular interactions. In this work, we evaluate seventeen previously published algorithms based on characteristics of their output and their performance in three tasks: cross validation, prediction of drug targets, and behavior with random input. Our work highlights strengths and weaknesses of each algorithm and results in a recommendation of algorithms best suited for performing different tasks.

[1]  Steve Hoberman,et al.  A computational model to define the molecular causes of type 2 diabetes mellitus. , 2005, Diabetes technology & therapeutics.

[2]  Pankaj Agarwal,et al.  Inferring pathways from gene lists using a literature-derived network of biological relationships , 2005, Bioinform..

[3]  H. Lehrach,et al.  A protein interaction network links GIT1, an enhancer of huntingtin aggregation, to Huntington's disease. , 2004, Molecular cell.

[4]  John A. Tallarico,et al.  Functional CRISPR screening identifies the ufmylation pathway as a regulator of SQSTM1/p62 , 2016, eLife.

[5]  Christian von Mering,et al.  STRING: a database of predicted functional associations between proteins , 2003, Nucleic Acids Res..

[6]  Xingming Zhao,et al.  HISP: a hybrid intelligent approach for identifying directed signaling pathways , 2017, Journal of molecular cell biology.

[7]  Edward L. Huttlin,et al.  The BioPlex Network: A Systematic Exploration of the Human Interactome , 2015, Cell.

[8]  Bart De Moor,et al.  Candidate gene prioritization by network analysis of differential expression using machine learning approaches , 2010, BMC Bioinformatics.

[9]  James Bailey,et al.  Discovery and analysis of consistent active sub-networks in cancers , 2013, BMC Bioinformatics.

[10]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[11]  B. Schwikowski,et al.  A network of protein–protein interactions in yeast , 2000, Nature Biotechnology.

[12]  Albert-László Barabási,et al.  A DIseAse MOdule Detection (DIAMOnD) Algorithm Derived from a Systematic Analysis of Connectivity Patterns of Disease Proteins in the Human Interactome , 2015, PLoS Comput. Biol..

[13]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[14]  David Warde-Farley,et al.  GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function , 2008, Genome Biology.

[15]  Abby Shuman Hill Integrated experimental and computational analysis of intercellular communication with application to endometriosis , 2018 .

[16]  Christopher Lawrence,et al.  DEGENERATION , 2020, Side Effects May Include Strangers.

[17]  Benjamin J. Raphael,et al.  Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes , 2014, Nature Genetics.

[18]  Xing-Ming Zhao,et al.  Integrative analysis of mutational and transcriptional profiles reveals driver mutations of metastatic breast cancers , 2016, Cell Discovery.

[19]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[20]  A. Barabasi,et al.  A Protein–Protein Interaction Network for Human Inherited Ataxias and Disorders of Purkinje Cell Degeneration , 2006, Cell.

[21]  E. Marcotte,et al.  Prioritizing candidate disease genes by network-based boosting of genome-wide association data. , 2011, Genome research.

[22]  Julio Saez-Rodriguez,et al.  Identification of drug-specific pathways based on gene expression data: application to drug induced lung injury. , 2015, Integrative biology : quantitative biosciences from nano to macro.

[23]  S. Jaeger,et al.  Causal Network Models for Predicting Compound Targets and Driving Pathways in Cancer , 2014, Journal of biomolecular screening.

[24]  Roded Sharan,et al.  Associating Genes and Protein Complexes with Disease via Network Propagation , 2010, PLoS Comput. Biol..

[25]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[26]  M. Vidal,et al.  Protein interaction mapping in C. elegans using proteins involved in vulval development. , 2000, Science.

[27]  Núria Queralt-Rosinach,et al.  DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes , 2015, Database J. Biol. Databases Curation.

[28]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[29]  Ahmed Enayetallah,et al.  Causal reasoning on biological networks: interpreting transcriptional changes , 2012, Bioinform..

[30]  J. Hirschhorn,et al.  Biological interpretation of genome-wide association studies using predicted gene functions , 2015, Nature Communications.

[31]  E. Lander,et al.  Genetic Screens in Human Cells Using the CRISPR-Cas9 System , 2013, Science.

[32]  Núria Queralt-Rosinach,et al.  DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants , 2016, Nucleic Acids Res..

[33]  Jeremy Miller,et al.  Identifying disease-specific genes based on their topological significance in protein networks , 2009, BMC Syst. Biol..

[34]  Yen‐Hua Huang,et al.  Prioritizing disease candidate genes by a gene interconnectedness-based approach , 2011, BMC Genomics.

[35]  P. Robinson,et al.  Walking the interactome for prioritization of candidate disease genes. , 2008, American journal of human genetics.

[36]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[37]  H. Lehrach,et al.  A Human Protein-Protein Interaction Network: A Resource for Annotating the Proteome , 2005, Cell.

[38]  Jing Chen,et al.  Disease candidate gene identification and prioritization using protein interaction networks , 2009, BMC Bioinformatics.

[39]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..