A software framework integrating gene expression patterns, binding site analysis and gene ontology to hypothesize gene regulation relationships

One known challenge in analyzing gene expression data is to combine analysis outcomes obtained disparately by applying multiple, independent meta-analysis methods. Here we present an integrative computational system that narrows down biological hypotheses by integrating gene expression patterns, transcription factor (TF) binding site analysis outcomes, and Gene Ontology (GO) enrichment analysis outcomes. This system identifies regulated genes from microarray experiments through statistical processes, categorizes similarly behaving groups of genes and then carries out binding site analysis and gene function enrichment analysis based on some significant clusters. The output is an ordered set of "putative" pair-wise relationships between TFs and their potential target genes. The relationships are ranked based on their closeness to the experimental context. We demonstrate the effectiveness of our framework using two independent microarray data sets.

[1]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[2]  Kozo Nakamura,et al.  [Interactions of chondrocytes and osteoblasts during endochondral bone formation]. , 2006, Clinical calcium.

[3]  Luke E. K. Achenie,et al.  Expression Profile of Osteoblast Lineage at Defined Stages of Differentiation* , 2005, Journal of Biological Chemistry.

[4]  L. Pachter,et al.  rVista for comparative sequence-based discovery of functional transcription factor binding sites. , 2002, Genome research.

[5]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Gordon K Smyth,et al.  Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2004, Statistical applications in genetics and molecular biology.

[7]  T. Speed,et al.  GOstat: find statistically overrepresented Gene Ontologies within a group of genes. , 2004, Bioinformatics.

[8]  Lynn Kuo,et al.  PBC: A Software Framework Facilitating Pattern-Based Clustering for Microarray Data Analysis , 2009, 2009 International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing.

[9]  Markus Krätzig,et al.  A software framework for data analysis , 2007, Comput. Stat. Data Anal..

[10]  T. Osborne,et al.  Genome-wide analysis of FoxO1 binding in hepatic chromatin: Potential involvement of FoxO1 in linking retinoid signaling to hepatic gluconeogenesis , 2012, Nucleic acids research.