A New Profile Alignment Method for Clustering Gene Expression Data

We focus on clustering temporal gene expression profiles and propose a novel, simple algorithm for distributing genes over clusters. We also introduce a variant of a clustering index that can be used to decide on the optimal number of clusters for a given dataset. The clustering method is based on a profile-alignment approach that minimizes the mean-square error of the first-order differentials in order to hierarchically cluster microarray time-series data. We tested the algorithm on datasets drawn from standard experiments, and the results show that the approach clusters these datasets effectively on the basis of profile similarity.
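The abstract does not give the algorithm itself, so the following is only a minimal sketch of one way to read it. It assumes that profiles are equal-length lists sampled at the same time points, that the dissimilarity between two profiles is the mean-square error between their first-order differences (which makes it insensitive to a constant vertical offset), and that clusters are merged agglomeratively with average linkage. The names `first_differences`, `differential_mse`, and `cluster_profiles`, and the choice of average linkage, are illustrative assumptions rather than the authors' implementation.

```python
from itertools import combinations


def first_differences(profile):
    """Return the first-order differences x[t+1] - x[t] of a profile."""
    return [b - a for a, b in zip(profile, profile[1:])]


def differential_mse(p, q):
    """Mean-square error between the first-order differences of p and q.

    Assumes p and q have the same length (at least 2 time points).
    Adding a constant to either profile leaves the value unchanged.
    """
    dp, dq = first_differences(p), first_differences(q)
    return sum((a - b) ** 2 for a, b in zip(dp, dq)) / len(dp)


def cluster_profiles(profiles, k):
    """Agglomerative clustering down to k clusters (assumed average linkage).

    Returns a list of clusters, each a sorted list of profile indices.
    """
    clusters = [[i] for i in range(len(profiles))]

    def linkage(c1, c2):
        # Average pairwise dissimilarity between the members of two clusters.
        total = sum(differential_mse(profiles[i], profiles[j])
                    for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > k:
        # Pick the closest pair of clusters; combinations guarantees a < b.
        a, b = min(combinations(range(len(clusters)), 2),
                   key=lambda ab: linkage(clusters[ab[0]], clusters[ab[1]]))
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Toy data (made up for illustration): two rising and two falling profiles
# at different expression levels.
profiles = [[0, 1, 2, 3], [5, 6, 7, 8], [3, 2, 1, 0], [9, 8, 7, 6]]
result = cluster_profiles(profiles, 2)
print(result)
```

On this toy input the rising profiles end up in one cluster and the falling ones in the other, regardless of their absolute levels, which is the behavior a shape-based dissimilarity is meant to give. Choosing the number of clusters with the clustering index mentioned above is not sketched here, because the abstract does not define that index.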
