Comparison of Linguistic Summaries and Fuzzy Functional Dependencies Related to Data Mining

Data mining methods based on fuzzy logic have been developed recently and have become an increasingly important research area. In this chapter, the authors examine possibilities for discovering potentially useful knowledge from relational databases by integrating fuzzy functional dependencies and linguistic summaries. Both methods use fuzzy logic tools for data analysis and for acquiring and representing expert knowledge. Fuzzy functional dependencies can detect whether a dependency between two examined attributes exists across the whole database. If a dependency exists only between parts of the examined attributes' domains, fuzzy functional dependencies cannot reveal its character. Linguistic summaries are a convenient method for revealing this kind of dependency. Used in a complementary way, fuzzy functional dependencies and linguistic summaries could mine valuable information from relational databases. Mining the intensities of dependencies between database attributes could support decision making, reduce the number of attributes in databases, and help estimate missing values. The proposed approach is evaluated with case studies using real data from official statistics. Strengths and weaknesses of the described methods are discussed. At the end of the chapter, topics for further research are outlined.

Miroslav Hudec, University of Economics in Bratislava, Slovakia
Miljan Vučetić, University of Belgrade, Serbia
Mirko Vujošević, University of Belgrade, Serbia
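To illustrate how a linguistic summary can expose a dependency that holds only between parts of two attributes' domains, the following minimal sketch computes the truth degree of a summary of the form "most records with low X have high Y". It assumes the common formulation in which the truth degree is the quantifier applied to the ratio of the summed minimum of both memberships to the summed membership in the restriction. The membership function parameters, the shape of the quantifier "most", and the sample records are illustrative assumptions, not values taken from the chapter.

```python
def falling(x, a, b):
    """Membership in a 'low' fuzzy set: 1 up to a, falling linearly to 0 at b."""
    if x <= a:
        return 1.0
    if x >= b:
        return 0.0
    return (b - x) / (b - a)


def rising(x, a, b):
    """Membership in a 'high' fuzzy set: 0 up to a, rising linearly to 1 at b."""
    if x <= a:
        return 0.0
    if x >= b:
        return 1.0
    return (x - a) / (b - a)


def most(p):
    """Assumed relative quantifier 'most': 0 below 0.3, 1 above 0.8, linear between."""
    return min(1.0, max(0.0, 2.0 * p - 0.6))


def summary_truth(records, mu_r, mu_s, quantifier):
    """Truth degree of 'Q records that are R are also S' over (x, y) pairs."""
    # Numerator: how strongly records satisfy both R (on x) and S (on y).
    both = sum(min(mu_r(x), mu_s(y)) for x, y in records)
    # Denominator: how strongly records satisfy the restriction R alone.
    restricted = sum(mu_r(x) for x, _ in records)
    if restricted == 0:
        return 0.0  # no record belongs to R, so the summary has no support
    return quantifier(both / restricted)


# Hypothetical (x, y) pairs standing in for two database attributes.
records = [(1, 9), (2, 8), (3, 2), (9, 1), (8, 5)]

low_x = lambda x: falling(x, 2, 4)
high_y = lambda y: rising(y, 6, 8)
low_y = lambda y: falling(y, 2, 4)

truth_low_high = summary_truth(records, low_x, high_y, most)
truth_low_low = summary_truth(records, low_x, low_y, most)
print(truth_low_high, truth_low_low)
```

A high truth degree for "low X, high Y" together with a low one for "low X, low Y" indicates a dependency confined to one part of the domains, which is the kind of pattern a whole-database fuzzy functional dependency would not characterize.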
