Fast classification in industrial Big Data environments

Many modern industrial applications, e.g. those incorporating hundreds or thousands of electrical sensors and actuators, must be categorised into Big Data environments, in which it is essential to design suitable information processing models. Central data processing in such environments is impossible and must be carried out in a distributed way on resource-limited cyber-physical systems. One of the challenging tasks for machine learning is thus the design of a classifier which is simple, accurate and has an acceptable realisation time. We present ComRef-2D-ConvHull method for linear classification optimisation in lower-dimensional feature space, which is based on ComRef from [1]. Compared to original ComRef, we consider only classification optimisation in 2-dimensional feature spaces in ComRef-2D-ConvHull. Due to the decreased time complexity for calculations in 2-dimensional feature space, we expect many industrial Big Data enviroments to profit from our method. Tests regarding the generalisation ability of ComRef-2D-ConvHull on several reference data sets and on a real-world industrial dataset show promising results.

[1]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[2]  Kristin P. Bennett,et al.  Duality and Geometry in SVM Classifiers , 2000, ICML.

[3]  Kagermann Henning Recommendations for implementing the strategic initiative INDUSTRIE 4.0 , 2013 .

[4]  Jürgen Jasperneite,et al.  A field level architecture for reconfigurable real-time automation systems , 2014, 2014 10th IEEE Workshop on Factory Communication Systems (WFCS 2014).

[5]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[6]  Guodong Guo,et al.  Support Vector Machines Applications , 2014 .

[7]  Uwe Mönks,et al.  Sensorless drive diagnosis using automated feature extraction, significance ranking and reduction , 2013, 2013 IEEE 18th Conference on Emerging Technologies & Factory Automation (ETFA).

[8]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[9]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[10]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[11]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[12]  H. Larsen,et al.  Importance weighted OWA aggregation of multicriteria queries , 1999, 18th International Conference of the North American Fuzzy Information Processing Society - NAFIPS (Cat. No.99TH8397).

[13]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[14]  Uwe Mönks,et al.  Assisting the Design of Sensor and Information Fusion Systems , 2014 .

[15]  Kunle Olukotun,et al.  Map-Reduce for Machine Learning on Multicore , 2006, NIPS.

[16]  Volker Lohweg,et al.  Combinatorial refinement of feature weighting for linear classification , 2014, Proceedings of the 2014 IEEE Emerging Technology and Factory Automation (ETFA).

[17]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[18]  F. P. Preparata,et al.  Convex hulls of finite sets of points in two and three dimensions , 1977, CACM.

[19]  Masoud Nikravesh,et al.  Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing) , 2006 .

[20]  Miroslaw Malek,et al.  A survey of online failure prediction methods , 2010, CSUR.

[21]  Peter J. Haas,et al.  Ricardo: integrating R and Hadoop , 2010, SIGMOD Conference.

[22]  Uwe Mönks,et al.  Machine conditioning by importance controlled information fusion , 2013, 2013 IEEE 18th Conference on Emerging Technologies & Factory Automation (ETFA).

[23]  Arlo Faria,et al.  MapReduce : Distributed Computing for Machine Learning , 2006 .

[24]  Rolf P. Würtz,et al.  Organic Computing , 2004, Informatik-Spektrum.

[25]  Uwe Mönks,et al.  Fast evidence-based information fusion , 2014, 2014 4th International Workshop on Cognitive Information Processing (CIP).

[26]  Uwe Mönks,et al.  An extended perspective on evidential aggregation rules in machine condition monitoring , 2012, 2012 3rd International Workshop on Cognitive Information Processing (CIP).