PREMISES, a Scalable Data-Driven Service to Predict Alarms in Slowly-Degrading Multi-Cycle Industrial Processes

In recent years, the number of industry-4.0-enabled manufacturing sites has been continuously growing, and both the quantity and variety of signals and data collected in plants are increasing at an unprecedented rate. At the same time, the demand of Big Data processing platforms and analytical tools tailored to manufacturing environments has become more and more prominent. Manufacturing companies are collecting huge amounts of information during the production process through a plethora of sensors and networks. To extract value and actionable knowledge from such precious repositories, suitable data-driven approaches are required. They are expected to improve the production processes by reducing maintenance costs, reliably predicting equipment failures, and avoiding quality degradation. To this aim, Machine Learning techniques tailored for predictive maintenance analysis have been adopted in PREMISES (PREdictive Maintenance service for Industrial procesSES), an innovative framework providing a scalable Big Data service able to predict alarming conditions in slowly-degrading processes characterized by cyclic procedures. PREMISES has been experimentally tested and validated on a real industrial use case, resulting efficient and effective in predicting alarms. The framework has been designed to address the main Big Data and industrial requirements, by being developed on a solid and scalable processing framework, Apache Spark, and supporting the deployment on modularized containers, specifically upon the Docker technology stack.

[1]  Elena Baralis,et al.  SeLINA: A Self-Learning Insightful Network Analyzer , 2016, IEEE Transactions on Network and Service Management.

[2]  Miriam A. M. Capretz,et al.  Energy Consumption Prediction with Big Data: Balancing Prediction Accuracy and Computational Resources , 2016, 2016 IEEE International Congress on Big Data (BigData Congress).

[3]  Sheldon M. Ross,et al.  Introduction to probability models , 1975 .

[4]  Daniele Apiletti,et al.  iSTEP, an Integrated Self-Tuning Engine for Predictive Maintenance in Industry 4.0 , 2018, 2018 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Ubiquitous Computing & Communications, Big Data & Cloud Computing, Social Computing & Networking, Sustainable Computing & Communications (ISPA/IUCC/BDCloud/SocialCom/SustainCom).

[5]  Miriam A. M. Capretz,et al.  MLaaS: Machine Learning as a Service , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[6]  Jay Lee,et al.  Service Innovation and Smart Analytics for Industry 4.0 and Big Data Environment , 2014 .

[7]  Lin Li,et al.  Industrial Big Data in an Industry 4.0 Environment: Challenges, Schemes, and Applications for Predictive Maintenance , 2017, IEEE Access.

[8]  Gaurav,et al.  Real-time processing of IoT events with historic data using Apache Kafka and Apache Spark with dashing framework , 2017, 2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT).

[9]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[10]  Ameet Talwalkar,et al.  MLlib: Machine Learning in Apache Spark , 2015, J. Mach. Learn. Res..

[11]  Ben Y. Zhao,et al.  Complexity vs. performance: empirical analysis of machine learning as a service , 2017, Internet Measurement Conference.

[12]  Tania Cerquitelli,et al.  Data miners' little helper: data transformation activity cues for cluster analysis on document collections , 2017, WIMS.

[13]  Elena Baralis,et al.  Mining Sensor Data for Predictive Maintenance in the Automotive Industry , 2018, 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA).

[14]  Elena Baralis,et al.  Data mining for better healthcare: A path towards automated data analysis? , 2016, 2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW).

[15]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[16]  Ernesto Damiani,et al.  A Model-Driven Methodology for Big Data Analytics-as-a-Service , 2017, 2017 IEEE International Congress on Big Data (BigData Congress).

[17]  Daqiang Zhang,et al.  Towards smart factory for industry 4.0: a self-organized multi-agent system with big data based feedback and coordination , 2016, Comput. Networks.