Biological evaluation of biclustering algorithms using Gene Ontology and chIP-chip data

In this paper, we propose a new framework for assessing the biological significance of the outputs of any biclustering algorithm. The framework relies on the p-value computed by a Fisher's exact test on a 2x2 contingency table derived from gene ontology (GO) enrichment level and chromatin immunoprecipitation (ChIP) data enrichment level. We illustrate the framework using our published robust biclustering algorithm (RoBA), the Cheng and Church (CC) algorithm, and a well-defined set of yeast cell cycle gene expression data and chip-chip data. Our evaluation also shows that the biclusters identified by RoBA are biologically more homogeneous than the ones identified by the Cheng and Church (CC) algorithm.

[1]  K. Struhl,et al.  The transcription factor Ifh1 is a key regulator of yeast ribosomal protein genes , 2004, Nature.

[2]  Ahmed H. Tewfik,et al.  Parallel identification of gene biclusters with coherent evolutions , 2006, IEEE Transactions on Signal Processing.

[3]  Ahmed H. Tewfik,et al.  DNA Microarray Data Analysis: A Novel Biclustering Algorithm Approach , 2006, EURASIP J. Adv. Signal Process..

[4]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[5]  Lothar Thiele,et al.  A systematic comparison and evaluation of biclustering methods for gene expression data , 2006, Bioinform..

[6]  Richard M. Karp,et al.  Discovering local structure in gene expression data: the order-preserving submatrix problem , 2002, RECOMB '02.

[7]  J. Lieb,et al.  ChIP-chip: considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments. , 2004, Genomics.

[8]  George M. Church,et al.  Biclustering of Expression Data , 2000, ISMB.

[9]  J. Bähler Cell-cycle control of gene expression in budding and fission yeast. , 2005, Annual review of genetics.

[10]  Waseem Ahmad,et al.  cHawk : An Efficient Biclustering Algorithm based on Bipartite Graph Crossing Minimization , 2007 .

[11]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[12]  Arlindo L. Oliveira,et al.  Biclustering algorithms for biological data analysis: a survey , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[13]  D. Botstein,et al.  Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF , 2001, Nature.