Geometric Approach to Data Mining

In this paper, a new, geometric approach to pattern identification in data mining is presented. It is based on applying string edit distance computation to measuring the similarity between multi-dimensional curves. The string edit distance computation is extended to allow the possibility of using strings, where each element is a vector rather than just a symbol. We discuss an approach for representing 3D-curves using the curvature and the tension as their symbolic representation. This transformation preserves all the information contained in the original 3D-curve. We validate this approach through experiments using synthetic and digitalized data. In particular, the proposed approach is suitable to measure the similarity of 3D-curves invariant under translation, rotation, and scaling. It also can be applied for partial curve matching.

[1]  M. Braae,et al.  Theoretical and linguistic aspects of the fuzzy logic controller , 1979, Autom..

[2]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[3]  Michael J. Fischer,et al.  The String-to-String Correction Problem , 1974, JACM.

[4]  Abraham Kandel,et al.  3D-curve similarity using fuzzy string matching , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[5]  W. Eric L. Grimson On the Recognition of Curved Objects , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Dr. Hans Hellendoorn,et al.  An Introduction to Fuzzy Control , 1996, Springer Berlin Heidelberg.

[7]  W. O'Dell,et al.  Three-dimensional myocardial deformations: calculation with displacement field fitting to tagged MR images. , 1995, Radiology.

[8]  Nicholas Ayache,et al.  Evaluating 3D registration of CT-scan images using crest lines , 1993, Optics & Photonics.

[9]  Luis F. Chaparro,et al.  Signal analysis in fuzzy information space , 1996, Fuzzy Sets Syst..

[10]  Stan Z. Li,et al.  Invariant representation, matching and pose estimation of 3D space curves under similarity transformations , 1997, Pattern Recognit..

[11]  E. Shikin,et al.  Handbook and atlas of curves , 1995 .

[12]  Jayaram K. Udupa,et al.  Registration of 3D objects and surfaces , 1990, IEEE Computer Graphics and Applications.

[13]  Edward J. Delp,et al.  Partial Shape Recognition: A Landmark-Based Approach , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Ernesto Bribiesca,et al.  A Geometric structure for two-dimensional shapes and three-dimensional surfaces , 1992, Pattern Recognit..

[15]  A. Gray Modern Differential Geometry of Curves and Surfaces , 1993 .

[16]  C Sander,et al.  Mapping the Protein Universe , 1996, Science.

[17]  Mandyam D. Srinath,et al.  Partial Shape Classification Using Contour Matching in Distance Transformation , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Horst Bunke,et al.  Applications of approximate string matching to 2D shape recognition , 1993, Pattern Recognit..

[19]  John F. Roddick,et al.  A bibliography of temporal , 1999 .

[20]  Haim J. Wolfson On curve matching , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  G Taubes Computational Molecular Biology: Software Matchmakers Help Make Sense of Sequences , 1996, Science.

[22]  Jiawei Han,et al.  Efficient mining of partial periodic patterns in time series database , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[23]  Wayne Smith Mapping and sequencing the human genome: a beginner's guide to the computational science perspective , 1997, CROS.

[24]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..