Linguistic Summaries Generation with Hybridization Method Based on Rough and Fuzzy Sets

In this paper authors propose a new algorithm for linguistic data summarization based on hybridization of rough sets and fuzzy sets techniques. The new algorithm applies rough sets theory for feature selection in early stages of linguistic summaries’ generation. The rough sets theory was used to reduce on significant way, the amount on summaries obtained by others algorithms. The algorithm combines lower approximation, k grade dependency and fuzzy sets to get linguistic summaries. The results of proposed algorithm are compared with association rules approach. In order to validate the algorithm proposed, authors apply both qualitative and quantitative methods. Authors used two databases in order to validate the algorithm; theses databases belong to “Repository of Project Management Research”. The first database is associated to personality traits and human performance in software projects. The second database is associated to analysis of revenue assurance in different organization. Considering quantitative approach, the algorithm proposed, obtains better results than the algorithm based on association rules; while regards execution time, the best algorithm was the algorithm based on association rules, because rough sets theory was high time-consuming technique.

[1]  Rafael Bello,et al.  Extensions to Linguistic Summaries Indicators based on Neutrosophic Theory, Applications in Project Management Decisions , 2018 .

[2]  Salvatore Greco,et al.  Fuzzy Similarity Relation as a Basis for Rough Approximations , 1998, Rough Sets and Current Trends in Computing.

[3]  Anne Laurent,et al.  Scalable Fuzzy Algorithms for Data Management and Analysis - Methods and Design , 2009, Scalable Fuzzy Algorithms for Data Management and Analysis.

[4]  D. Vanderpooten Similarity Relation as a Basis for Rough Approximations , 1995 .

[5]  Grégory Smits,et al.  FRELS: Fast and Reliable Estimated Linguistic Summaries , 2019, 2019 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[6]  Slawomir Zadrozny,et al.  Linguistic database summaries and their protoforms: towards natural language based knowledge discovery tools , 2005, Inf. Sci..

[7]  W. Duncan A GUIDE TO THE PROJECT MANAGEMENT BODY OF KNOWLEDGE , 1996 .

[8]  Yao Zhang,et al.  An extension of rough fuzzy set , 2016, J. Intell. Fuzzy Syst..

[9]  Andrzej Skowron,et al.  Rough sets: Some extensions , 2007, Inf. Sci..

[10]  Olgierd Hryniewicz,et al.  Personalized linguistic summaries in smartphone-based monitoring of bipolar disorder patients , 2019, EUSFLAT Conf..

[11]  Hannu Toivonen,et al.  Apriori Algorithm , 2010, Encyclopedia of Machine Learning.

[12]  João Varajão,et al.  ISO 21500: 2012 and PMBoK 5 processes in information systems project management , 2017, Comput. Stand. Interfaces.

[13]  Janusz Kacprzyk,et al.  Linguistic Database Summaries Using Fuzzy Logic: towards a Human-Consistent Data Mining Tool , 2009 .

[14]  Guoyin Wang,et al.  Extension of rough set under incomplete information systems , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[15]  R. Dijkman,et al.  A method for improving the generation of linguistic summaries , 2017, 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[16]  Ali Karaşan,et al.  Proceedings of the 2019 Conference of the International Fuzzy Systems Association and the European Society for Fuzzy Logic and Technology (EUSFLAT 2019) , 2019 .

[17]  Theresa Beaubouef,et al.  Rough Sets , 2019, Lecture Notes in Computer Science.

[18]  Anna Wilbik,et al.  On the generation of useful linguistic summaries of sequences , 2016, 2016 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[19]  Anna Wilbik,et al.  Linguistic summaries of time series : on some extended aggregation techniques , 2010 .

[20]  Janusz Kacprzyk,et al.  Using Ant Colony Optimization and Genetic Algorithms for the Linguistic Summarization of Creep Data , 2014, IEEE Conf. on Intelligent Systems.

[21]  Olivier Pivert,et al.  Efficient Generation of Reliable Estimated Linguistic Summaries , 2018, 2018 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[22]  Andreas Holzinger,et al.  Augmenting Statistical Data Dissemination by Short Quantified Sentences of Natural Language , 2018, Journal of Official Statistics.

[23]  Iliana Perez Pupo,et al.  Repositorio de datos para investigaciones en gestión de proyectos , 2019 .

[24]  Ronald R. Yager,et al.  A new approach to the summarization of data , 1982, Inf. Sci..

[25]  Slawomir Zadrozny,et al.  Linguistic Data Summarization , 2009, Scalable Fuzzy Algorithms for Data Management and Analysis.

[26]  A. Furnham Personality and occupational success : 16PF correlates of cabin crew performance , 1991 .

[27]  Janusz Kacprzyk,et al.  A Novel View of Bipolarity in Linguistic Data Summaries , 2014 .

[28]  Mihail Popescu,et al.  Linguistic Summarization of In-Home Sensor Data. , 2019, Journal of biomedical informatics.

[29]  Rafael Bello,et al.  Rough sets in the Soft Computing environment , 2012, Inf. Sci..

[30]  Alejandro Ramos-Soto,et al.  Enriching linguistic descriptions of data: A framework for composite protoforms , 2019, Fuzzy Sets Syst..

[31]  C. Barbaranelli,et al.  The "big five questionnaire": A new questionnaire to assess the five factor model , 1993 .