RIDDLE: reflective diffusion and local extension reveal functional associations for unannotated gene sets via proximity in a gene network

The growing availability of large-scale functional networks has promoted the development of many successful techniques for predicting functions of genes. Here we extend these network-based principles and techniques to functionally characterize whole sets of genes. We present RIDDLE (Reflective Diffusion and Local Extension), which uses well developed guilt-by-association principles upon a human gene network to identify associations of gene sets. RIDDLE is particularly adept at characterizing sets with no annotations, a major challenge where most traditional set analyses fail. Notably, RIDDLE found microRNA-450a to be strongly implicated in ocular diseases and development. A web application is available at http://www.functionalnet.org/RIDDLE.

[1]  Kriston L. McGary,et al.  Open Access Method , 2007 .

[2]  Matthew A. Hibbs,et al.  Discovery of biological networks from diverse functional genomic data , 2005, Genome Biology.

[3]  Kesheng Liu,et al.  Information Flow Analysis of Interactome Networks , 2009, PLoS Comput. Biol..

[4]  Jesse Gillis,et al.  The Impact of Multifunctional Genes on "Guilt by Association" Analysis , 2011, PloS one.

[5]  Qi Liu,et al.  Improving gene set analysis of microarray data by SAM-GS , 2007, BMC Bioinformatics.

[6]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[7]  A. Reymond,et al.  A High-Resolution Anatomical Atlas of the Transcriptome in the Mouse Embryo , 2011, PLoS biology.

[8]  E. Marcotte,et al.  Prioritizing candidate disease genes by network-based boosting of genome-wide association data. , 2011, Genome research.

[9]  Yonina C. Eldar,et al.  eQED: an efficient method for interpreting eQTL associations using protein networks , 2008, Molecular systems biology.

[10]  Korbinian Strimmer,et al.  BMC Bioinformatics BioMed Central Methodology article A general modular framework for gene set enrichment analysis , 2009 .

[11]  S. Kasif,et al.  Network-Based Analysis of Affected Biological Processes in Type 2 Diabetes Models , 2007, PLoS genetics.

[12]  A. Fraser,et al.  A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans , 2008, Nature Genetics.

[13]  Yves Moreau,et al.  Network Analysis of Differential Expression for the Identification of Disease-Causing Genes , 2009, PloS one.

[14]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[15]  Yuanfang Guan,et al.  A Genomewide Functional Network for the Laboratory Mouse , 2008, PLoS Comput. Biol..

[16]  E. Sonnhammer,et al.  Global networks of functional coupling in eukaryotes from comprehensive data integration. , 2009, Genome research.

[17]  Pankaj Agarwal,et al.  A global pathway crosstalk network , 2008, Bioinform..

[18]  Xiang-Sun Zhang,et al.  NOA: a novel Network Ontology Analysis method , 2011, Nucleic acids research.

[19]  David Warde-Farley,et al.  GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function , 2008, Genome Biology.

[20]  Hunter B. Fraser,et al.  Using protein complexes to predict phenotypic effects of gene mutation , 2007, Genome Biology.

[21]  C. Wijmenga,et al.  Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes. , 2006, American journal of human genetics.

[22]  M. Daly,et al.  Guilt by association , 2000, Nature Genetics.

[23]  Matthew A. Hibbs,et al.  Exploring the human genome with functional maps. , 2009, Genome research.

[24]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[25]  P. Khatri,et al.  A systems biology approach for pathway level analysis. , 2007, Genome research.

[26]  Joel Dudley,et al.  Network-Based Elucidation of Human Disease Similarities Reveals Common Functional Modules Enriched for Pluripotent Drug Targets , 2010, PLoS Comput. Biol..

[27]  Christopher D. Lasher,et al.  Discovering Networks of Perturbed Biological Processes in Hepatocyte Cultures , 2011, PloS one.

[28]  C. S. Sullivan,et al.  Detection of viral microRNAs by Northern blot analysis. , 2011, Methods in molecular biology.

[29]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[30]  A. F. Scott,et al.  OMIM: Online Mendelian Inheritance in Man , 2002 .

[31]  Pall I. Olason,et al.  A human phenome-interactome network of protein complexes implicated in genetic disorders , 2007, Nature Biotechnology.

[32]  Xiang Li,et al.  A novel network-based method for measuring the functional relationship between gene sets , 2011, Bioinform..

[33]  Pingzhao Hu,et al.  Computational prediction of cancer-gene function , 2007, Nature Reviews Cancer.

[34]  R. Lavker,et al.  MicroRNAs of the mammalian eye display distinct and overlapping tissue specificity. , 2006, Molecular vision.

[35]  Zhiping Weng,et al.  Identification of functional modules that correlate with phenotypic difference: the influence of network topology , 2010, Genome Biology.

[36]  A. Barabasi,et al.  High-Quality Binary Protein Interaction Map of the Yeast Interactome Network , 2008, Science.

[37]  Cengizhan Ozturk,et al.  Pathway analysis of high-throughput biological data within a Bayesian network framework , 2011, Bioinform..

[38]  William Stafford Noble,et al.  Learning kernels from biological networks by maximizing entropy , 2004, ISMB/ECCB.

[39]  Haifeng Li,et al.  Systematic discovery of functional modules and context-specific functional annotation of human genome , 2007, ISMB/ECCB.

[40]  Q. Cui,et al.  An Analysis of Human MicroRNA and Disease Associations , 2008, PloS one.

[41]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[42]  C. Burge,et al.  Most mammalian mRNAs are conserved targets of microRNAs. , 2008, Genome research.

[43]  E. Snitkin,et al.  Genome-wide prioritization of disease genes and identification of disease-disease associations from an integrated human functional linkage network , 2009, Genome Biology.

[44]  E. Marcotte,et al.  It's the machine that matters: Predicting gene function and phenotype from protein networks. , 2010, Journal of proteomics.

[45]  J. Rashbass Online Mendelian Inheritance in Man. , 1995, Trends in genetics : TIG.

[46]  Christian von Mering,et al.  STRING 7—recent developments in the integration and prediction of protein interactions , 2006, Nucleic Acids Res..

[47]  Thomas Lengauer,et al.  Statistical Applications in Genetics and Molecular Biology Calculating the Statistical Significance of Changes in Pathway Activity From Gene Expression Data , 2011 .

[48]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[49]  KellerAndreas,et al.  A novel algorithm for detecting differentially regulated paths based on gene set enrichment analysis , 2009 .

[50]  Weidong Tian,et al.  Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function , 2008, Genome Biology.

[51]  Jason Weston,et al.  Protein ranking: from local to global structure in the protein similarity network. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[52]  LiHaifeng,et al.  Systematic discovery of functional modules and context-specific functional annotation of human genome , 2007 .

[53]  Christina Backes,et al.  A novel algorithm for detecting differentially regulated paths based on gene set enrichment analysis , 2009, Bioinform..