Efficient key pathway mining: combining networks and OMICS data.

Systems biology has emerged over the last decade. Driven by advances in measurement technology, the research community has generated huge molecular biology data sets. These comprise rather static data on the interplay of biological entities, for instance protein-protein interaction networks, as well as quite dynamic data collected to study how individual cells or tissues behave under changing environmental conditions, for instance from DNA microarrays or RNA sequencing. Here we bring the two data types together in order to gain higher-level knowledge. We introduce a significantly improved version of the KeyPathwayMiner software framework. Given a biological network modelled as a graph and a set of expression studies, KeyPathwayMiner efficiently finds and visualizes connected sub-networks in which most components are expressed in most cases. More precisely, it finds all maximal connected sub-networks in which all but at most k nodes are expressed in all but at most l of the experimental studies. We demonstrate the power of the new approach by comparing it to similar approaches on gene expression data previously used to study Huntington's disease. In addition, we demonstrate KeyPathwayMiner's flexibility and applicability to non-array data by analyzing genome-scale DNA methylation profiles from colorectal cancer patients. KeyPathwayMiner release 2 is available as a Cytoscape plugin and online at http://keypathwayminer.mpi-inf.mpg.de.
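
The (k, l) constraint described above can be illustrated with a minimal Python sketch. This is not KeyPathwayMiner's implementation: the function names, the adjacency-dictionary representation, and the 0/1 expression indicators are assumptions made for illustration only. The sketch checks whether a given node set satisfies the constraint and, for the special case k = 0, enumerates the maximal solutions, which are then simply the connected components of the sub-graph induced by the non-exception nodes. The general case k > 0 requires a search procedure that is not shown here.

```python
from collections import deque

def exception_nodes(adjacency, expressed, l):
    """Nodes that fail to be expressed in more than l studies.
    Assumption: nodes without expression data count as exceptions."""
    bad = set()
    for node in adjacency:
        row = expressed.get(node)
        if row is None or sum(1 for x in row if not x) > l:
            bad.add(node)
    return bad

def is_connected(nodes, adjacency):
    """Breadth-first search restricted to the given node set."""
    nodes = set(nodes)
    if not nodes:
        return False
    start = next(iter(nodes))
    seen, queue = {start}, deque([start])
    while queue:
        u = queue.popleft()
        for v in adjacency.get(u, ()):
            if v in nodes and v not in seen:
                seen.add(v)
                queue.append(v)
    return seen == nodes

def satisfies_kl(nodes, adjacency, expressed, k, l):
    """True if the node set is connected and contains at most k exception nodes."""
    bad = exception_nodes(adjacency, expressed, l)
    return is_connected(nodes, adjacency) and len(set(nodes) & bad) <= k

def maximal_subnetworks_k0(adjacency, expressed, l):
    """Special case k = 0: connected components of the non-exception nodes."""
    good = set(adjacency) - exception_nodes(adjacency, expressed, l)
    components, unvisited = [], set(good)
    while unvisited:
        start = unvisited.pop()
        comp, queue = {start}, deque([start])
        while queue:
            u = queue.popleft()
            for v in adjacency[u]:
                if v in unvisited:
                    unvisited.remove(v)
                    comp.add(v)
                    queue.append(v)
        components.append(sorted(comp))
    return sorted(components)

# Hypothetical toy data: a path A-B-C-D-E and three studies per node
# (1 = expressed in that study, 0 = not expressed).
adjacency = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"],
             "D": ["C", "E"], "E": ["D"]}
expressed = {"A": [1, 1, 1], "B": [1, 0, 1], "C": [0, 0, 1],
             "D": [1, 1, 1], "E": [1, 1, 0]}

print(maximal_subnetworks_k0(adjacency, expressed, l=1))
print(satisfies_kl("ABCDE", adjacency, expressed, k=1, l=1))
```

In the toy data, only node C misses more than one study, so with l = 1 it is the single exception node. Allowing k = 1 admits the whole path as one sub-network, whereas k = 0 splits it at C.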
