Scalable Analytics Platform for Machine Learning in Smart Production Systems

Manufacturing industry is facing major challenges to meet customer requirements, which are constantly changing. Therefore, products have to be manufactured with efficient processes, minimal interruptions, and low resource consumptions. To achieve this goal, huge amounts of data generated by industrial equipment needs to be managed and analyzed by modern technologies. Since the big data era in manufacturing industry is still at an early stage, there is a need for a reference architecture that incorporates big data and machine learning technologies and aligns with the Industrie 4.0 standards and requirements. In this paper, requirements for designing a scalable analytics platform for industrial data are derived from Industrie 4.0 standards and literature. Based on these requirements, a reference big data architecture for industrial machine learning applications is proposed and compared to related works. Finally, the proposed architecture has been implemented in the Lab Big Data at the SmartFactoryOWL and its scalability and performance have been evaluated on parallel computation of an industrial PCA model. The results show that the proposed architecture is linearly scalable and adaptable to machine learning use cases and will help to improve the industrial automation processes in production systems.

[1]  Peter Fettke,et al.  Industrie 4.0 , 2018, Handbuch Innovationsforschung.

[2]  Inder Monga,et al.  Lambda architecture for cost-effective batch and speed big data processing , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[3]  Oliver Niggemann,et al.  A self-configurable fault detection system for Industrial Ethernet networks , 2017, Autom..

[4]  Yi Cao,et al.  Nonlinear process fault detection and identification using kernel PCA and kernel density estimation , 2016 .

[5]  Gerald E. Plassman A Survey of Singular Value Decomposition Methods and Performance Comparison of Some Available Serial Codes , 2013 .

[6]  Jing Li,et al.  Scalable Formalization of Publish/Subscribe Messaging Scheme Based on Message Brokers , 2007, WS-FM.

[7]  Taghi M. Khoshgoftaar,et al.  A Multi-dimensional Comparison of Toolkits for Machine Learning with Big Data , 2015, 2015 IEEE International Conference on Information Reuse and Integration.

[8]  Florian Pethig,et al.  A Survey of Internet of Things and Big Data integrated Solutions for Industrie 4.0 , 2018, 2018 IEEE 23rd International Conference on Emerging Technologies and Factory Automation (ETFA).

[9]  Seunghak Lee,et al.  On Model Parallelization and Scheduling Strategies for Distributed Machine Learning , 2014, NIPS.

[10]  Yoji Yamato,et al.  Proposal of Lambda Architecture Adoption for Real Time Predictive Maintenance , 2016, 2016 Fourth International Symposium on Computing and Networking (CANDAR).

[11]  Zhiqiang Ge,et al.  Data Mining and Analytics in the Process Industry: The Role of Machine Learning , 2017, IEEE Access.

[12]  Rainer Drath,et al.  Industrie 4.0: Hit or Hype? [Industry Forum] , 2014, IEEE Industrial Electronics Magazine.

[13]  Gianmarco De Francisci Morales,et al.  SAMOA: scalable advanced massive online analysis , 2015, J. Mach. Learn. Res..

[14]  Kevin Leahy,et al.  An industrial big data pipeline for data-driven analytics maintenance applications in large-scale smart manufacturing facilities , 2015, Journal of Big Data.

[15]  P. Erhan Eren,et al.  Big Data for Industry 4.0: A Conceptual Framework , 2016, 2016 International Conference on Computational Science and Computational Intelligence (CSCI).

[16]  Hazem Nounou,et al.  Kernel PCA-based GLRT for nonlinear fault detection of chemical processes , 2016 .

[17]  Nagayoshi Kasashima,et al.  Cyber-physical system architecture for machining production line , 2018, 2018 IEEE Industrial Cyber-Physical Systems (ICPS).

[18]  Athanasios V. Vasilakos,et al.  A Manufacturing Big Data Solution for Active Preventive Maintenance , 2017, IEEE Transactions on Industrial Informatics.

[19]  Athanasios V. Vasilakos,et al.  Machine learning on big data: Opportunities and challenges , 2017, Neurocomputing.

[20]  Qihui Wu,et al.  A survey of machine learning for big data processing , 2016, EURASIP Journal on Advances in Signal Processing.

[21]  Birgit Vogel-Heuser,et al.  A flexible architecture for data mining from heterogeneous data sources in automated production systems , 2017, 2017 IEEE International Conference on Industrial Technology (ICIT).

[22]  Kagermann Henning Recommendations for implementing the strategic initiative INDUSTRIE 4.0 , 2013 .

[23]  E. F. Vogel,et al.  A plant-wide industrial process control problem , 1993 .