Paper Information - Detection of Differences between Syntactic and Semantic Similarities

Detection of Differences between Syntactic and Semantic Similarities

One of the most important problems with rule induction methods is that it is very difficult for domain experts to check the millions of rules generated from large datasets. Discovering knowledge from these rules requires deep interpretation based on domain knowledge. Although several solutions have been proposed in studies on data mining and knowledge discovery, these studies do not focus on the similarities between the rules obtained. When one rule r1 has reasonable features and another rule r2 with high similarity to r1 includes unexpected factors, the relation between these rules can trigger the discovery of knowledge. In this paper, we propose a visualization approach that shows the similarity relations between rules based on multidimensional scaling, which assigns a two-dimensional Cartesian coordinate to each data point from information about the similarities between that point and the other data points. We evaluated this method on two medical data sets; the experimental results show that knowledge useful for domain experts could be found.
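The abstract does not say which variant of multidimensional scaling is used, so the following is only a minimal sketch under stated assumptions: it uses classical (Torgerson) MDS, and it converts similarities to dissimilarities as 1 - similarity. The function name `classical_mds` and the four-rule similarity matrix are hypothetical and serve only to illustrate how pairwise similarities can be turned into two-dimensional coordinates.

```python
import numpy as np

def classical_mds(dissim, n_components=2):
    """Embed points in n_components dimensions from a symmetric dissimilarity matrix."""
    d = np.asarray(dissim, dtype=float)
    n = d.shape[0]
    # Double-center the squared dissimilarities to obtain a Gram (inner-product) matrix.
    j = np.eye(n) - np.ones((n, n)) / n
    b = -0.5 * j @ (d ** 2) @ j
    # eigh returns eigenvalues in ascending order; keep the largest ones.
    vals, vecs = np.linalg.eigh(b)
    order = np.argsort(vals)[::-1][:n_components]
    # Non-Euclidean input can yield small negative eigenvalues; clip them to zero.
    vals = np.clip(vals[order], 0.0, None)
    return vecs[:, order] * np.sqrt(vals)

# Hypothetical similarity matrix for four rules (values are made up for illustration).
similarity = np.array([
    [1.0, 0.9, 0.2, 0.1],
    [0.9, 1.0, 0.3, 0.2],
    [0.2, 0.3, 1.0, 0.8],
    [0.1, 0.2, 0.8, 1.0],
])
# Assumption: dissimilarity is taken as 1 - similarity.
coords = classical_mds(1.0 - similarity)
```

Each row of `coords` is the two-dimensional position of one rule. Rules with high mutual similarity land close together, so a rule that sits next to a well-understood rule but contains unexpected factors is easy to spot in a scatter plot of these coordinates.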

Shusaku Tsumoto | Shoji Hirano
