Processing Imprecise Database Queries by Fuzzy Clustering Algorithms

Nowadays database management systems are one of the most critical resources in every company. Despite ad- vanced possibilities of SQL, relational database management systems do not support flexible query conditions. Main assumptions of this work were two facts. First, that real data not representing random distribution (white noise), but have natural trend to granularity. The second one, that in everyday contacts we do not using strict defined conditions. The second feature lead us to use fuzzy logic which closer represent- ing natural communication. First gives us opportunity to auto- matically construct functions defining membership to discrete groups based only on data distribution. The problem of extending database systems with natural lan- guage expressions is a matter of many research centers. The ba- sic idea of presented research is to extend an existing query lan- guage and make database systems able to satisfy user needs more closely. This paper deals mostly with gaining imprecise in- formation from relational database systems. Presented concept is based on fuzzy sets and automatic clustering techniques that allow to build membership function and fuzzy queries. Thanks to applied solutions, the relational database system is more flex- ible, and similar to natural way of communication. Index Terms—fuzzy logic, fuzzy sets, fuzzy set theory, fuzzy systems, fuzzy clustering, FSQL, fuzzy queries, imprecise queries.

[1]  A. Pelikant,et al.  Implementation of automatically generated membership functions based on grouping algorithms , 2007, EUROCON 2007 - The International Conference on "Computer as a Tool".

[2]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[3]  Guy De Tré,et al.  On advances in soft computing applied to databases and information systems , 2012, Fuzzy Sets Syst..

[4]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Patricia Melin,et al.  A new validation index for fuzzy clustering and its comparisons with other methods , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[6]  Abraham Kandel,et al.  Implementing Imprecision in Information Systems , 1985, Inf. Sci..

[7]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[8]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[9]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[10]  Hui Xiong,et al.  Information-Theoretic Distance Measures for Clustering Validation: Generalization and Normalization , 2009, IEEE Transactions on Knowledge and Data Engineering.

[11]  B. Buckles,et al.  A domain calculus for fuzzy relational databases , 1989 .

[12]  R. Yager,et al.  Approximate Clustering Via the Mountain Method , 1994, IEEE Trans. Syst. Man Cybern. Syst..

[13]  Yoshikane Takahashi Fuzzy Database Query Languages and Their Relational Completeness Theorem , 1993, IEEE Trans. Knowl. Data Eng..

[14]  Algorytm etykietowania analizujący rozmyte zapytania w metajęzyku naturalnym , 2011 .

[15]  Yoshikane Takahashi A fuzzy query language for relational databases , 1991, IEEE Trans. Syst. Man Cybern..

[16]  John F. Roddick,et al.  An Incremental Multi-Centroid, Multi-Run Sampling Scheme for k-medoids-based Algorithms - Extended Report , 2002 .

[17]  Shi-Kuo Chang,et al.  Translation of Fuzzy Queries for Relational Database System , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Patrick Bosc,et al.  SQLf: a relational database language for fuzzy querying , 1995, IEEE Trans. Fuzzy Syst..

[19]  B. Buckles,et al.  A fuzzy representation of data for relational databases , 1982 .