Uncertainty and grey data analytics

The purpose of this paper is to propose a framework for data analytics where everything is grey in nature and the associated uncertainty is considered as an essential part in data collection, profiling, imputation, analysis and decision making.,A comparative study is conducted between the available uncertainty models and the feasibility of grey systems is highlighted. Furthermore, a general framework for the integration of grey systems and grey sets into data analytics is proposed.,Grey systems and grey sets are useful not only for small data, but also big data as well. It is complementary to other models and can play a significant role in data analytics.,The proposed framework brings a radical change in data analytics. It may bring a fundamental change in our way to deal with uncertainties.,The proposed model has the potential to avoid the mistake from a misleading data imputation.,The proposed model takes the philosophy of grey systems in recognising the limitation of our knowledge which has significant implications in our way to deal with our social life and relations.,This is the first time that the whole data analytics is considered from the point of view of grey systems.

[1]  Mamello Thinyane,et al.  Small data and sustainable development — Individuals at the center of data-driven societies , 2017, 2017 ITU Kaleidoscope: Challenges for a Data-Driven Society (ITU K).

[2]  Robert Ivor John,et al.  Grey sets and greyness , 2012, Inf. Sci..

[3]  Xizhao Wang,et al.  Learning from Uncertainty for Big Data: Future Analytical Challenges and Strategies , 2016, IEEE Systems, Man, and Cybernetics Magazine.

[4]  Juvencio Mendoza Castelán Introduction To Database , 2011 .

[5]  Statistical Uncertainty Analysis for Small-Sample, High Log-Variance Data: Cautions for Bootstrapping and Bayesian Bootstrapping. , 2019, Journal of chemical theory and computation.

[6]  Fabio Cuzzolin Belief Functions: Theory and Applications , 2014, Lecture Notes in Computer Science.

[7]  Deng Ju-Long,et al.  Control problems of grey systems , 1982 .

[8]  Daniel Morinigo-Sotelo,et al.  Early Fault Detection in Induction Motors Using AdaBoost With Imbalanced Small Data and Optimized Sampling , 2017, IEEE Transactions on Industry Applications.

[9]  K. Shobha,et al.  Imputation of Multivariate Attribute Values in Big Data , 2019 .

[10]  Jeffrey Forrest,et al.  Grey Data Analysis - Methods, Models and Applications , 2017, Computational Risk Management.

[11]  Erik M. Fredericks,et al.  Uncertainty in big data analytics: survey, opportunities, and challenges , 2019, Journal of Big Data.

[12]  Theresa Beaubouef,et al.  Rough Sets , 2019, Lecture Notes in Computer Science.

[13]  Xizhao Wang,et al.  Editorial: Uncertainty in learning from big data , 2015, Fuzzy Sets Syst..

[14]  Jerry M. Mendel,et al.  Type-2 fuzzy sets made simple , 2002, IEEE Trans. Fuzzy Syst..

[15]  Chris J. Hinde,et al.  A new extension of fuzzy sets using rough sets: R-fuzzy sets , 2010, Inf. Sci..

[16]  Robert Ivor John,et al.  Uncertainty Representation of Grey Numbers and Grey Sets , 2014, IEEE Transactions on Cybernetics.

[17]  Sifeng Liu,et al.  Grey Systems, Grey Models and Their Roles in Data Analytics , 2019, International journal of simulation: systems, science & technology.

[18]  Krassimir T. Atanassov,et al.  Intuitionistic fuzzy sets , 1986 .

[19]  Yiyu Yao,et al.  Two views of the theory of rough sets in finite universes , 1996, Int. J. Approx. Reason..

[20]  Patrick Bosc,et al.  Fuzzy databases : principles and applications , 1996 .

[21]  Bruno Salgues,et al.  Society 5.0 , 2018 .

[22]  Carmela Troncoso,et al.  Small Data , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[23]  Witold Pedrycz,et al.  Shadowed sets: representing and processing fuzzy sets , 1998, IEEE Trans. Syst. Man Cybern. Part B.