Discovering functional dependencies from similarity-based fuzzy relational databases

We present here a data mining technique for discovering all minimal non-trivial coarsest functional dependencies (FD) based on equivalence classes from similarity-based fuzzy relational databases. The similarity-based fuzzy data model has been recognized as most suitable for describing imprecise data that are analogical over discrete domains. Various searching techniques for discovering functional dependencies on crisp relational databases have been proposed recently. However, they have not been fully explored on the similarity-based fuzzy relational data model. In this work, we present a form of functional dependency based on equivalence classes on the similarity-based fuzzy relational database and a method to test the validity of such dependency. In addition, a data mining technique based on top-down level-wise searching is proposed. The time and space complexities of the proposed algorithm are analyzed. Experimental results showing the behaviors of these functional dependencies are discussed. The dependencies discovered contain not only the conventional functional dependencies when similarity relations are reduced to identity relations but also semantic dependencies that describe the conceptual relationships between attributes. The results developed here can be applied to fuzzy database design, query optimization and database reverse engineering.

[1]  Heikki Mannila,et al.  Algorithms for Inferring Functional Dependencies from Relations , 1994, Data Knowl. Eng..

[2]  Sang Jo Lee,et al.  Systematic generation method and efficient representation of proximity relations for fuzzy relational database systems , 1994, Proceedings of Twentieth Euromicro Conference. System Architecture and Integration.

[3]  Arun K. Majumdar,et al.  Fuzzy Functional Dependencies and Lossless Join Decomposition of Fuzzy Relational Database Systems , 1988, ACM Trans. Database Syst..

[4]  T. Liao,et al.  A review of similarity measures for fuzzy systems , 1996, Proceedings of IEEE 5th International Fuzzy Systems.

[5]  Lotfi A. Zadeh,et al.  Similarity relations and fuzzy orderings , 1971, Inf. Sci..

[6]  Peter A. Flach,et al.  Database Dependency Discovery: A Machine Learning Approach , 1999, AI Commun..

[7]  Patrick Bosc,et al.  Functional dependencies revisited under graduality and imprecision , 1997, 1997 Annual Meeting of the North American Fuzzy Information Processing Society - NAFIPS (Cat. No.97TH8297).

[8]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[9]  Reinhold Haux,et al.  Nondeterministic dependencies in relations: An extention of the concept of functional dependency , 1985, Inf. Syst..

[10]  B. Buckles,et al.  A fuzzy representation of data for relational databases , 1982 .

[11]  Hannu Toivonen,et al.  Efficient discovery of functional and approximate dependencies using partitions , 1998, Proceedings 14th International Conference on Data Engineering.

[12]  Terence M. Barron,et al.  Approximate dependencies in database systems , 1995, Decis. Support Syst..

[13]  Guoqing Chen Fuzzy logic in data modeling: semantics, constraints, and database design , 1998 .

[14]  David Maier,et al.  The Theory of Relational Databases , 1983 .

[15]  Siegfried Bell,et al.  Discovery of data dependencies in relational databases , 1999 .

[16]  Sujeet Shenoi,et al.  Analyzing FD Inference in Relational Databases , 1996, Data Knowl. Eng..

[17]  Juan Miguel Medina,et al.  Towards the implementation of a generalized fuzzy relational database model , 1995, Fuzzy Sets Syst..

[18]  T. Sudkamp Similarity, interpolation, and fuzzy rule construction , 1993 .

[19]  C. Pappis,et al.  A comparative assessment of measures of similarity of fuzzy values , 1993 .

[20]  Adnan Yazici,et al.  Fuzzy Database Modeling , 1998, J. Database Manag..

[21]  Patrick Bosc,et al.  Fuzzy querying in conventional databases , 1992 .

[22]  Adnan Yazici,et al.  An integrity constraint for a fuzzy relational database , 1993, [Proceedings 1993] Second IEEE International Conference on Fuzzy Systems.

[23]  Shyi-Ming Chen,et al.  A comparison of similarity measures of fuzzy values , 1995 .

[24]  Gregory Piatetsky-Shapiro,et al.  Knowledge Discovery in Databases: An Overview , 1992, AI Mag..

[25]  Heikki Mannila,et al.  Similarity of Attributes by External Probes , 1998, KDD.

[26]  H. Sachar,et al.  Theoretical aspects of design of and retrieval from similarity-based relational database systems , 1986 .