A Novel Local Features-Based Approach for Clustering Microarray Data

DNA Microarray technology makes it possible to moni- tor simultaneously the dynamic expression levels of tens of thousands of genes during some important biological pro- cesses. A first step to comprehend and interpret the result- ing mass of data is via clustering techniques. However, most existing methods are based on clustering genes by compar- ing their expression levels on all experiment conditions al- though genes in a functional cluster more often than not correlate only under a subset of conditions. Besides, most clustering algorithms depend on some critical user parame- ters in determining the number of resulting clusters. Unfor- tunately, correct parameter values are rarely known in real datasets. In this paper, we propose a novel clustering algo- rithm that (1) goes beyond global approaches to discovery gene clusters based on local features, and (2) automatically determines the number of resulting clusters. Furthermore, we introduce the norm-based method to improve it, as is proved reasonable. Extensive experiments are conducted on both synthetic and real data sets. Experiments prove that our method is efficiency and efficient.

[1]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[2]  Alfonso Valencia,et al.  A hierarchical unsupervised growing neural network for clustering gene expression patterns , 2001, Bioinform..

[3]  Michael K. Ng,et al.  A highly-usable projected clustering algorithm for gene expression profiles , 2003, BIOKDD.

[4]  Aidong Zhang,et al.  Interrelated two-way clustering: an unsupervised approach for gene expression data analysis , 2001, Proceedings 2nd Annual IEEE International Symposium on Bioinformatics and Bioengineering (BIBE 2001).

[5]  R. Caponetto,et al.  Cellular neural network simulator for image processing applications , 1998, 1998 Fifth IEEE International Workshop on Cellular Neural Networks and their Applications. Proceedings (Cat. No.98TH8359).

[6]  F. Zou,et al.  Bifurcation and chaos in cellular neural networks , 1993 .

[7]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Josef A. Nossek,et al.  A chaotic attractor with cellular neural networks , 1991 .

[9]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[10]  A. Zhang,et al.  A Maximum Entropy Approach to Classifying Gene Array Data Sets , 2001 .

[11]  Leon O. Chua,et al.  Cellular neural networks: applications , 1988 .

[12]  G. Getz,et al.  Coupled two-way clustering analysis of gene microarray data. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Wei Zhang,et al.  A Digital Image Encryption Scheme Based on the Hybrid of Cellular Neural Network and Logistic Map , 2005, ISNN.

[14]  A. Brazma,et al.  Gene expression data analysis , 2000, FEBS letters.

[15]  Mohamed I. Sobhy,et al.  Methods of attacking chaotic encryption and countermeasures , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[16]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[17]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[18]  Lin-Bao Yang,et al.  Cellular neural networks: theory , 1988 .

[19]  Leon O. Chua,et al.  The CNN paradigm , 1993 .