dynXcube - Categorizing dynamic data analysis

Abstract Data analysis has gained strategic importance for virtually any organization. It covers areas like business analytics, big data, business intelligence, and data mining, among others. The past decades have also witnessed increasing efforts to capture, analyze, and interpret dynamic data instead of just static snapshot data. This is due to the fact that many real-life applications are characterized by changing data structures. Hence, analytic systems need to be able to adapt to changes. In recent years, many models for dynamic data analysis have been proposed and successfully applied in a diverse range of real-life projects. Since the number of respective algorithms has continuously risen, it has become increasingly demanding to keep track in this field. This is not only related to the algorithms that have been proposed so far and their relationships to each other. It also applies to the disclosure of gaps in research that need to be filled by appropriate new algorithms and, therefore, uncover new research opportunities. To contribute to the review of this field, we propose a holistic framework to categorize dynamic data analysis, the dynXcube-Framework. We show that dynXcube is very useful to present the state-of-the-art of dynamic data analysis in a consolidated way. Furthermore, it has the potential to disclose gaps in current research, thus providing a road map for future activities in the field of dynamic data analysis. Therefore, dynXcube is a significant step towards an improved accessibility of dynamic data analysis methods for academics and professionals alike and will help to stimulate future research in this important field.

[1]  P. Laplace A Philosophical Essay On Probabilities , 1902 .

[2]  Zeshui Xu,et al.  Towards felicitous decision making: An overview on challenges and trends of Big Data , 2016, Inf. Sci..

[3]  Hans-Jürgen Zimmermann,et al.  Fuzzy Set Theory - and Its Applications , 1985 .

[4]  Georg Peters,et al.  Rough clustering utilizing the principle of indifference , 2014, Inf. Sci..

[5]  James M. Keller,et al.  A possibilistic approach to clustering , 1993, IEEE Trans. Fuzzy Syst..

[6]  Dimitris J. Bertsimas,et al.  Dynamic Classification of Online Customers , 2003, SDM.

[7]  Jinjun Chen,et al.  A Space-Filling Multidimensional Visualization (SFMDVis) for Exploratory Data Analysis , 2017 .

[8]  Georg Peters,et al.  DCC: a framework for dynamic granular clustering , 2016 .

[9]  Pawel Kasprowski,et al.  Life Sciences Data Analysis , 2017, Inf. Sci..

[10]  R. Weber,et al.  A Rough-Fuzzy approach for Support Vector Clustering , 2016, Inf. Sci..

[11]  James M. Keller,et al.  A possibilistic fuzzy c-means clustering algorithm , 2005, IEEE Transactions on Fuzzy Systems.

[12]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[13]  Richard Weber,et al.  Uncertainty modeling in dynamic clustering — A soft computing perspective , 2010, International Conference on Fuzzy Systems.

[14]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[15]  Robert L. Winkler,et al.  The accuracy of extrapolation (time series) methods: Results of a forecasting competition , 1982 .

[16]  Witold Pedrycz,et al.  Granular Computing: Analysis and Design of Intelligent Systems , 2013 .

[17]  Shuo Tong The Optimization Method of Dynamic Traffic State Identification on Urban Arterial Road , 2015 .

[18]  Sankar K. Pal,et al.  RFCM: A Hybrid Clustering Algorithm Using Rough and Fuzzy Sets , 2007, Fundam. Informaticae.

[19]  Sheng-De Wang,et al.  Fuzzy support vector machines , 2002, IEEE Trans. Neural Networks.

[20]  Witold Pedrycz,et al.  Rough–Fuzzy Collaborative Clustering , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[21]  Frank Klawonn,et al.  Dynamic data assigning assessment clustering of streaming data , 2008, Appl. Soft Comput..

[22]  James Martin Managing the data-base environment , 1983 .

[23]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[24]  Richard Weber,et al.  Dynamic Rough-Fuzzy Support Vector Clustering , 2017, IEEE Transactions on Fuzzy Systems.

[25]  Stephen J. Roberts,et al.  Sequential Dynamic Classification Using Latent Variable Models , 2010, Comput. J..

[26]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[27]  Hilmi Berk Celikoglu,et al.  Dynamic Classification of Traffic Flow Patterns Simulated by a Switching Multimode Discrete Cell Transmission Model , 2014, IEEE Transactions on Intelligent Transportation Systems.

[28]  S. Angel Latha Mary,et al.  A Density Based Dynamic Data Clustering Algorithm based on Incremental Dataset , 2012 .

[29]  Richard Weber,et al.  Soft clustering - Fuzzy and rough approaches and their extensions and derivatives , 2013, Int. J. Approx. Reason..

[30]  Richard Weber,et al.  A model updating strategy for predicting time series with seasonal patterns , 2010, Appl. Soft Comput..

[31]  Eric Bair,et al.  Semi‐supervised clustering methods , 2013, Wiley interdisciplinary reviews. Computational statistics.

[32]  Pawan Lingras,et al.  Advances in Rough and Soft Clustering: Meta-Clustering, Dynamic Clustering, Data-Stream Clustering , 2016, IJCRS.

[33]  Tianrui Li,et al.  Composite rough sets for dynamic data mining , 2014, Inf. Sci..

[34]  Richard Weber,et al.  Dynamic rough clustering and its applications , 2012, Appl. Soft Comput..

[35]  Georg Peters Fuzzy linear regression with fuzzy intervals , 1994 .

[36]  James C. Bezdek,et al.  A mixed c-means clustering model , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[37]  David Madigan,et al.  Dynamic Logistic Regression and Dynamic Model Averaging for Binary Classification , 2012, Biometrics.

[38]  Lotfi A. Zadeh,et al.  Generalized theory of uncertainty (GTU) - principal concepts and ideas , 2006, Comput. Stat. Data Anal..

[39]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[40]  Pawan Lingras,et al.  Temporal analysis of clusters of supermarket customers: conventional versus interval set approach , 2005, Inf. Sci..

[41]  Richard Weber,et al.  Fuzzy-ID3: A class of methods for automatic knowledge acquisition , 1992 .

[42]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[43]  Witold Pedrycz,et al.  Granular computing: an introduction , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[44]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[45]  Ian Davidson,et al.  Constrained Clustering: Advances in Algorithms, Theory, and Applications , 2008 .

[46]  Richard Weber,et al.  A methodology for dynamic data mining based on fuzzy clustering , 2005, Fuzzy Sets Syst..

[47]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[48]  J. Keynes A Treatise on Probability. , 1923 .

[49]  Gert de Cooman,et al.  Imprecise probability trees: Bridging two theories of imprecise probability , 2008, Artif. Intell..

[50]  Hava T. Siegelmann,et al.  Support Vector Clustering , 2002, J. Mach. Learn. Res..

[51]  M. A. Wincek Forecasting With Dynamic Regression Models , 1993 .