On the architecture of a clustering platform for the analysis of big volumes of data

In the last years the volume of data that was generated by the mankind has increased and the complexity of data generated has also increased. Since the computers have evolved and provide more processing power, it is possible to carry out the real-time analysis of big volumes of data. This paper suggests the architecture of a big data processing platform called BigTim, which is able to run clustering algorithms on Windows operating systems. The platform will allow running clustering algorithms in a parallelized manner using the advantage that the entire processing power of a multi-core computer.

[1]  Ivan Zelinka,et al.  Handbook of Optimization - From Classical to Modern Approach , 2012, Handbook of Optimization.

[2]  Héctor Pomares,et al.  Self-evolving parameter-free Rule-based Controller , 2012, 2012 IEEE International Conference on Fuzzy Systems.

[3]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[4]  Simone A. Ludwig MapReduce-based fuzzy c-means clustering algorithm: implementation and scalability , 2015, Int. J. Mach. Learn. Cybern..

[5]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[6]  Hong Yan,et al.  Fuzzy Algorithms: With Applications to Image Processing and Pattern Recognition , 1996, Advances in Fuzzy Systems - Applications and Theory.

[7]  Michael K. Ng,et al.  Agglomerative Fuzzy K-Means Clustering Algorithm with Selection of Number of Clusters , 2008, IEEE Transactions on Knowledge and Data Engineering.

[8]  A. L. Kazakov,et al.  On Mathematical Models for Optimization Problem of Logistics Infrastructure , 2015 .

[9]  Stefan Preitl,et al.  Novel Adaptive Charged System Search algorithm for optimal tuning of fuzzy controllers , 2014, Expert Syst. Appl..

[10]  Yuhui Shi,et al.  Biomimicry of parasitic behavior in a coevolutionary particle swarm optimization algorithm for global optimization , 2015, Appl. Soft Comput..

[11]  Nikita Tomin,et al.  Random Forest Based Model for Preventing Large-Scale Emergencies in Power Systems , 2015 .

[12]  E. Petriu,et al.  Fuzzy logic-based adaptive gravitational search algorithm for optimal tuning of fuzzy-controlled servo systems , 2013 .

[13]  T. Boros,et al.  Six-axis educational robot workcell with integrated vision system , 2012, 2012 4th IEEE International Symposium on Logistics and Industrial Informatics.

[14]  Zahir Tari,et al.  A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis , 2014, IEEE Transactions on Emerging Topics in Computing.

[15]  Imre J. Rudas,et al.  Human Intent Representation in Knowledge Intensive Product Model , 2009, J. Comput..

[16]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .

[17]  Z. Johanyák,et al.  A Hybrid Algorithm for Parameter Tuning in Fuzzy Model Identification , 2012, Acta Polytechnica Hungarica.

[18]  Claudia-Adina Dragos,et al.  Novel Tensor Product Models for Automatic Transmission System Control , 2012, IEEE Systems Journal.

[19]  Emanuel-Florin Iftene,et al.  Efficiency of a Combined Protection Method against Correlation , 2014, Int. J. Comput. Commun. Control.

[20]  Eneko Osaba,et al.  A migration strategy for distributed evolutionary algorithms based on stopping non-promising subpopulations: A case study on routing problems , 2015 .

[21]  Ján Vascák Automatic Design and Optimization of Fuzzy Inference Systems , 2013, Handbook of Optimization.

[22]  Igor Skrjanc,et al.  Fault detection for nonlinear systems with uncertain parameters based on the interval fuzzy model , 2007, Eng. Appl. Artif. Intell..

[23]  Florin G. Filip Decision support and control for large-scale complex systems , 2007, Annu. Rev. Control..