A new approach to mining fuzzy databases using nearest neighbor classification by exploiting attribute hierarchies

Data classification is a well-organized operation in the field of data mining. This article presents an application of the k-nearest neighbor classification technique for mining a fuzzy database. We consider a data set in which attribute values have certain similarities in nature and analyze the observations for the domain of each attribute, on the basis of fuzzy similarity relations. The proposed technique is general and the presented case study demonstrates the suitability of using this fuzzy approach for mining fuzzy databases, especially when the database contains various levels of abstraction. © 2004 Wiley Periodicals, Inc. Int J Int Syst 19: 1277–1290, 2004.

[1]  James C. Bezdek,et al.  An Integrated Framework for Generalized Nearest Prototype Classifier Design , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[2]  Isabelle Bloch,et al.  On fuzzy distances and their use in image processing under imprecision , 1999, Pattern Recognit..

[3]  Abraham Kandel,et al.  Knowledge discovery in time series databases , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[4]  Donald H. Kraft,et al.  Rough and Fuzzy Sets for Data Mining of a Controlled Vocabulary for Textual Retrieval , 2000 .

[5]  Adnan Yazici,et al.  Handling complex and uncertain information in the ExIFO and NF2 data models , 1999, IEEE Trans. Fuzzy Syst..

[6]  Ding-An Chiang,et al.  Mining time series data by a fuzzy linguistic summary system , 2000, Fuzzy Sets Syst..

[7]  Didier Dubois,et al.  Fuzzy sets and systems ' . Theory and applications , 2007 .

[8]  Katherine Schipper,et al.  Application of Classification Techniques in Business, Banking and Finance. , 1983 .

[9]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[10]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[11]  Michael K. Ng,et al.  A fuzzy k-modes algorithm for clustering categorical data , 1999, IEEE Trans. Fuzzy Syst..

[12]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  Susan M. Bridges,et al.  Mining fuzzy association rules and fuzzy frequency episodes for intrusion detection , 2000 .

[14]  Sujeet Shenoi,et al.  Proximity relations in the fuzzy relational database model , 1999 .

[15]  Bernadette Bouchon-Meunier,et al.  Towards general measures of comparison of objects , 1996, Fuzzy Sets Syst..

[16]  B. Buckles,et al.  A fuzzy representation of data for relational databases , 1982 .

[17]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[18]  Hongjun Lu,et al.  Effective Data Mining Using Neural Networks , 1996, IEEE Trans. Knowl. Data Eng..

[19]  Bradley P. Carlin,et al.  BAYES AND EMPIRICAL BAYES METHODS FOR DATA ANALYSIS , 1996, Stat. Comput..

[20]  Weixin Xie,et al.  On some properties of distance measures , 2001, Fuzzy Sets Syst..

[21]  Ranjit Biswas,et al.  On extended fuzzy relational database model with proximity relations , 2001, Fuzzy Sets Syst..

[22]  Venu Govindaraju,et al.  Potential improvement of classifier accuracy by using fuzzy measures , 2000, IEEE Trans. Fuzzy Syst..

[23]  Lotfi A. Zadeh,et al.  Similarity relations and fuzzy orderings , 1971, Inf. Sci..

[24]  Rami Zwick,et al.  Measures of similarity among fuzzy concepts: A comparative analysis , 1987, Int. J. Approx. Reason..