Investigation of Self-Organizing Oscillator Networks for Use in Clustering Microarray Data

The self-organizing oscillator network (SOON) is a comparatively new clustering algorithm that does not require the knowledge of the number of clusters. The SOON is distance based, and its clustering behavior is different to density-based algorithms in a number of ways. This paper examines the effect of adjusting the control parameters of the SOON with four different datasets; the first is a (communications) modulation dataset representing one modulation scheme under a variety of noise conditions. This allows the assessment of the behavior of the algorithm with data varying between highly separable and nonseparable cases. The main thrust of this paper is to evaluate its efficacy in biological datasets. The second is taken from microarray experiments on the cell cycle of yeast, while the third and the fourth represent two microarray cancer datasets, i.e., the lymphoma and the liver cancer datasets. The paper demonstrates that the SOON is a viable tool to analyze these problems, and can add many useful insights to the biological data that may not always be available using other clustering methods.

[1]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[2]  Haiying Wang,et al.  Improving biomolecular pattern discovery and visualization with hybrid self-adaptive networks. , 2002, IEEE transactions on nanobioscience.

[3]  D. Botstein,et al.  Gene expression patterns in human liver cancers. , 2002, Molecular biology of the cell.

[4]  P. Brown,et al.  DNA arrays for analysis of gene expression. , 1999, Methods in enzymology.

[5]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[6]  Jarkko Venna,et al.  Analysis and visualization of gene expression data using Self-Organizing Maps , 2002, Neural Networks.

[7]  B Fritzke,et al.  A growing neural gas network learns topologies. G. Tesauro, DS Touretzky, and TK Leen, editors , 1995, NIPS 1995.

[8]  Jiebo Luo,et al.  Image segmentation via adaptive K-mean clustering and knowledge-based morphological operations with biomedical applications , 1998, IEEE Trans. Image Process..

[9]  John G. Proakis,et al.  Digital Communications , 1983 .

[10]  Lawrence O. Hall,et al.  Fast fuzzy clustering of infrared images , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[11]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[12]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[14]  Alfonso Valencia,et al.  A hierarchical unsupervised growing neural network for clustering gene expression patterns , 2001, Bioinform..

[15]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[16]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[17]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Asoke K. Nandi,et al.  Novel fuzzy reinforced learning vector quantisation algorithm and its application in image compression , 2003 .

[19]  Murray E. Alexander,et al.  Fast noniterative registration of magnetic resonance images , 2001, SPIE Medical Imaging.

[20]  G. D. Xistris,et al.  SELF-ORGANISING NEURAL NETWORKS FOR AUTOMATED MACHINERY MONITORING SYSTEMS , 1996 .

[21]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Geoffrey J. McLachlan,et al.  A mixture model-based approach to the clustering of microarray expression data , 2002, Bioinform..

[23]  Hichem Frigui,et al.  Self-Organization of Pulse-Coupled Oscillators with Application to Clustering , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Alfred Ultsch,et al.  Data Mining and Knowledge Discovery with Emergent Self-Organizing Feature Maps for Multivariate Time Series , 1999 .

[25]  Ujjwal Maulik,et al.  Performance Evaluation of Some Clustering Algorithms and Validity Indices , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Kimmo Kiviluoto,et al.  Predicting bankruptcies with the self-organizing map , 1998, Neurocomputing.

[27]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[28]  P. Mahalanobis On the generalized distance in statistics , 1936 .

[29]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[30]  David Botstein,et al.  The Stanford Microarray Database , 2001, Nucleic Acids Res..