Exploiting the Geometry of Gene Expression Patterns for Unsupervised Learning

Typical gene expression clustering algorithms are restricted to a specific underlying pattern model and overlook the possibility that other information-carrying patterns may coexist in the data. This may lead to a large bias in the results. In this paper we discuss a new method that can simultaneously cluster various types of patterns. Our method is based on the observation that many of the patterns considered significant for inferring gene function and regulatory mechanisms share the geometry of linear manifolds.
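The linear-manifold observation can be checked numerically on synthetic data. The sketch below is only an illustration, not the method of the paper: it assumes shifting and scaling patterns as examples (the abstract does not enumerate the pattern types), and the helper `manifold_dimension`, the base profile, and all coefficients are hypothetical. It estimates the dimension of the smallest linear manifold containing a set of gene profiles as the rank of the mean-centered data matrix.

```python
import numpy as np

def manifold_dimension(X, tol=1e-8):
    """Dimension of the smallest linear manifold (affine subspace)
    containing the rows of X, taken as the rank of the centered matrix."""
    centered = X - X.mean(axis=0)
    return int(np.linalg.matrix_rank(centered, tol=tol))

rng = np.random.default_rng(0)
n_genes, n_conditions = 20, 6

# Hypothetical base expression profile and per-gene coefficients (assumptions).
base = rng.normal(size=n_conditions)
shifts = rng.normal(size=(n_genes, 1))
scales = rng.uniform(0.5, 2.0, size=(n_genes, 1))

shifting = base + shifts               # x_i = base + a_i * (1, ..., 1)
scaling = scales * base                # x_i = b_i * base
shift_scale = scales * base + shifts   # x_i = b_i * base + a_i * (1, ..., 1)
unstructured = rng.normal(size=(n_genes, n_conditions))  # no shared pattern

for name, data in [("shifting", shifting), ("scaling", scaling),
                   ("shift+scale", shift_scale), ("unstructured", unstructured)]:
    print(name, manifold_dimension(data))
```

Under these assumptions, the shifting and scaling sets each lie on a one-dimensional manifold and the combined set on a two-dimensional one, while the unstructured profiles fill the full six-dimensional space.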
