Gene Regulatory Networks from Gene Ontology

Gene Ontology (GO) provides a controlled vocabulary and hierarchy of terms to facilitate the annotation of gene functions and molecular attributes. Given a set of genes, a Gene Ontology Network (GON) can be constructed from the corresponding GO annotations and semantic relations among GO terms. Transitive rules can be applied to GO semantic relations to infer transitive regulations among genes. Using information content as a measure of functional specificity, a shortest regulatory path detection algorithm is developed to identify transitive regulations in GON. Since direct regulations may be overlooked during the detection of gene regulations, gene functional similarities deduced from GO terms are used to detect direct gene regulations. Both direct and transitive GO regulations are then used to construct a Gene Regulatory Network (GRN). The proposed approach is evaluated on seven E.coli sub-networks extracted from an existing known GRN. Our approach was able to detect the GRN with 85.77% precision, 55.7% recall, and 66.26% F1-score averaged across all seven networks.

[1]  Thomas Lengauer,et al.  A new measure for functional similarity of gene products based on Gene Ontology , 2006, BMC Bioinformatics.

[2]  Michael A. Siani-Rose,et al.  A Knowledge-Based Clustering Algorithm Driven by Gene Ontology , 2004, Journal of biopharmaceutical statistics.

[3]  Donald B. Johnson,et al.  A Note on Dijkstra's Shortest Path Algorithm , 1973, JACM.

[4]  C. Wijmenga,et al.  Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes. , 2006, American journal of human genetics.

[5]  K. Dolinski,et al.  Use and misuse of the gene ontology annotations , 2008, Nature Reviews Genetics.

[6]  Rachael P. Huntley,et al.  The GOA database in 2009—an integrated Gene Ontology Annotation resource , 2008, Nucleic Acids Res..

[7]  Carole A. Goble,et al.  Investigating Semantic Similarity Measures Across the Gene Ontology: The Relationship Between Sequence and Annotation , 2003, Bioinform..

[8]  Dario Floreano,et al.  GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods , 2011, Bioinform..

[9]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[10]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[11]  R. Kustra,et al.  Data-Fusion in Clustering Microarray Data: Balancing Discovery and Interpretability , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[12]  Joachim Selbig,et al.  Validation and functional annotation of expression-based clusters based on gene ontology , 2005, BMC Bioinformatics.

[13]  W. Wong,et al.  Transitive functional annotation by shortest-path analysis of gene expression data , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[14]  J.C. Rajapakse,et al.  SVM-RFE With MRMR Filter for Gene Selection , 2010, IEEE Transactions on NanoBioscience.

[15]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[16]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[17]  Gil Alterovitz,et al.  GO PaD: the Gene Ontology Partition Database , 2006, Nucleic Acids Res..