Identifying Synchronous and Asynchronous Co-regulations from Time Series Gene Expression Data

The complexity of a biological system provides a great diversity of correlations among genes/gene clusters, including synchronous and asynchronous co-regulations, each of which can be further divided into two categories: activation and inhibition. Most existing methods can only identify the synchronous activation patterns, such as shifting, scaling and shifting-and-scaling, however, few focuses on capturing both synchronous and asynchronous co-regulations. In this paper, we propose a coding scheme, where two genes with the same code must be co-regulated. Based on the coding scheme, an efficient clustering algorithm is devised to simultaneously capture all known co-regulated relationships (synchronous and asynchronous) among genes/gene clusters. Furthermore, the detailed and complete co-regulation information, which facilitates the study of genetic regulatory networks, can be easily derived from the resulting clusters. Experiments from both real and synthetic microarray datasets prove the effectiveness and efficiency of our method.

[1]  Jian Pei,et al.  Interactive exploration of coherent patterns in time-series gene expression data , 2003, KDD '03.

[2]  George M. Church,et al.  Biclustering of Expression Data , 2000, ISMB.

[3]  Philip S. Yu,et al.  MaPle: a fast algorithm for maximal pattern-based clustering , 2003, Third IEEE International Conference on Data Mining.

[4]  Guangyu Xu,et al.  A Novel Approach to Revealing Positive and Negative Co-Regulated Genes , 2007, Journal of Computer Science and Technology.

[5]  Arlindo L. Oliveira,et al.  Biclustering algorithms for biological data analysis: a survey , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[6]  M. Gerstein,et al.  Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions. , 2001, Journal of molecular biology.

[7]  Ge Yu,et al.  Mining Positive and Negative Co-regulation Patterns from Microarray Data , 2006, Sixth IEEE Symposium on BioInformatics and BioEngineering (BIBE'06).

[8]  Anthony K. H. Tung,et al.  Mining Shifting-and-Scaling Co-Regulation Patterns on Gene Expression Profiles , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[9]  Mohammed J. Zaki,et al.  TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data , 2005, SIGMOD '05.

[10]  M. Gerstein,et al.  Genomic analysis of gene expression relationships in transcriptional regulatory networks. , 2003, Trends in genetics : TIG.

[11]  Philip S. Yu,et al.  Clustering by pattern similarity in large data sets , 2002, SIGMOD '02.