Unravelling the Yeast Cell Cycle Using the TriGen Algorithm

Analyzing microarray data represents a computational challenge due to the characteristics of these data. Clustering techniques are widely applied to create groups of genes that exhibit a similar behavior under the conditions tested. Biclustering emerges as an improvement of classical clustering since it relaxes the constraints for grouping allowing genes to be evaluated only under a subset of the conditions and not under all of them. However, this technique is not appropriate for the analysis of temporal microarray data in which the genes are evaluated under certain conditions at several time points. In this paper, we present the results of applying the TriGen algorithm, a genetic algorithm that finds triclusters that take into account the experimental conditions and the time points, to the yeast cell cycle problem, where the goal is to identify all genes whose expression levels are regulated by the cell cycle.

[1]  Patrik D'haeseleer,et al.  Genetic network inference: from co-expression clustering to reverse engineering , 2000, Bioinform..

[2]  Masahiro Okamoto,et al.  Novel technique for preprocessing high dimensional time-course data from DNA microarray: mathematical model-based clustering , 2006, Bioinform..

[3]  Kai Wang,et al.  INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS FOR MOLECULAR BIOLOGY (ISMB) , 2009 .

[4]  George M. Church,et al.  Biclustering of Expression Data , 2000, ISMB.

[5]  Christodoulos A. Floudas,et al.  Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures , 2008, BMC Bioinformatics.

[6]  Ziv Bar-Joseph,et al.  Analyzing time series gene expression data , 2004, Bioinform..

[7]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[8]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Federico Divina,et al.  Improved biclustering on expression data through overlapping control , 2009, Int. J. Intell. Comput. Cybern..

[10]  Roy P. Pargas,et al.  Test‐data generation using genetic algorithms , 1999, Softw. Test. Verification Reliab..

[11]  J. Hartigan Direct Clustering of a Data Matrix , 1972 .

[12]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .