Ranking inter-relationships between clusters

The evaluation of the relationships between clusters is important to identify vital unknown information in many real-life applications, such as in the fields of crime detection, evolution trees, metallurgical industry and biology engraftment. This article proposes a method called ‘mode pattern + mutual information’ to rank the inter-relationship between clusters. The idea of the mode pattern is used to find outstanding objects from each cluster, and the mutual information criterion measures the close proximity of a pair of clusters. Our approach is different from the conventional algorithms of classifying and clustering, because our focus is not to classify objects into different clusters, but instead, we aim to rank the inter-relationship between clusters when the clusters are given. We conducted experiments on a wide range of real-life datasets, including image data and cancer diagnosis data. The experimental results show that our algorithm is effective and promising.

[1]  Wen-Hsiung Li,et al.  Fundamentals of molecular evolution , 1990 .

[2]  Feng Chen,et al.  Identifying targets for drug discovery using bioinformatics , 2008, Expert opinion on therapeutic targets.

[3]  R. La Brooy A method for optimising the nesting of multiple, highly complex shapes using a modified simulated annealing algorithm , 2009, Int. J. Syst. Sci..

[4]  S. Thompson,et al.  Quantifying heterogeneity in a meta‐analysis , 2002, Statistics in medicine.

[5]  Hans-Peter Kriegel,et al.  Supervised probabilistic principal component analysis , 2006, KDD '06.

[6]  Paul M. B. Vitányi,et al.  Clustering by compression , 2003, IEEE Transactions on Information Theory.

[7]  Man Lung Yiu,et al.  Reverse Nearest Neighbors Search in Ad Hoc Subspaces , 2006, IEEE Transactions on Knowledge and Data Engineering.

[8]  Jiyuan An,et al.  DDR: an index method for large time-series datasets , 2005, Inf. Syst..

[9]  Jiawei Han,et al.  Mining closed relational graphs with connectivity constraints , 2005, 21st International Conference on Data Engineering (ICDE'05).

[10]  Yu Chao,et al.  Effect of Different Scions/Rootstocks on Quality of Cucumber Fruits in Greenhouse , 2006 .

[11]  Yi-Ping Phoebe Chen,et al.  Kernel-based naive bayes classifier for breast cancer prediction , 2007 .

[12]  Huiqing Liu,et al.  Simple rules underlying gene expression profiles of more than six subtypes of acute lymphoblastic leukemia (ALL) patients , 2003, Bioinform..

[13]  Steven J. Leon Linear Algebra With Applications , 1980 .

[14]  Peter Timms,et al.  CIDB: Chlamydia Interactive Database for cross-querying genomics, transcriptomics and proteomics data. , 2007, Biomolecular engineering.

[15]  Robert Tibshirani,et al.  Margin Trees for High-dimensional Classification , 2007, J. Mach. Learn. Res..

[16]  Feng Chen,et al.  Identifying targets for drug discovery using bioinformatics , 2008 .

[17]  Kotagiri Ramamohanarao,et al.  Using emerging patterns to construct weighted decision trees , 2006, IEEE Transactions on Knowledge and Data Engineering.

[18]  V. Raina,et al.  Detection of BCR-ABL transcripts in acute lymphoblastic leukemia in Indian patients. , 1998, Leukemia research.

[19]  Lise Getoor,et al.  Link mining: a new data mining challenge , 2003, SKDD.

[20]  P. Malet,et al.  Molecular detection of a late-appearing BCR-ABL gene in a child with T-cell acute lymphoblastic leukemia , 1998, Annals of Hematology.

[21]  Nai-Kuan Chou,et al.  ECG data compression using truncated singular value decomposition , 2001, IEEE Trans. Inf. Technol. Biomed..

[22]  Sophie Lambert-Lacroix,et al.  Effective dimension reduction methods for tumor classification using gene expression data , 2003, Bioinform..

[23]  Naren Ramakrishnan,et al.  Redescription Mining: Structure Theory and Algorithms , 2005, AAAI.

[24]  Sanjoy Dasgupta,et al.  An Iterative Improvement Procedure for Hierarchical Clustering , 2003, NIPS.

[25]  Naren Ramakrishnan,et al.  Algorithms for Storytelling , 2006, IEEE Transactions on Knowledge and Data Engineering.

[26]  Xindong Wu,et al.  Identifying bridging rules between conceptual clusters , 2006, KDD '06.

[27]  Deept Kumar,et al.  Turning CARTwheels: an alternating algorithm for mining redescriptions , 2003, KDD.

[28]  H. A. Güvenira,et al.  An expert system for the differential diagnosis of erythemato-squamous diseases , 1999 .

[29]  Wendi Heinzelman,et al.  Energy-efficient communication protocol for wireless microsensor networks , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[30]  David Eppstein,et al.  On Nearest-Neighbor Graphs , 1992, ICALP.

[31]  Feng Chen,et al.  Using bioinformatics techniques for gene identification in drug discovery and development. , 2008, Current drug metabolism.

[32]  Taieb Znati,et al.  A mobility-based framework for adaptive clustering in wireless ad hoc networks , 1999, IEEE J. Sel. Areas Commun..

[33]  Jiyuan An,et al.  Finding edging genes from microarray data. , 2008, Journal of biotechnology.

[34]  M P Link,et al.  Simultaneous expression of RBTN-2 and BCR-ABL oncogenes in a T-ALL with a t(11;14)(p13;q11) and a late-appearing Philadelphia chromosome. , 1994, Leukemia.