Performance Analysis of IoT-Based Sensor, Big Data Processing, and Machine Learning Model for Real-Time Monitoring System in Automotive Manufacturing

With the increase in the amount of data captured during the manufacturing process, monitoring systems are becoming important factors in decision making for management. Current technologies such as Internet of Things (IoT)-based sensors can be considered a solution to provide efficient monitoring of the manufacturing process. In this study, a real-time monitoring system that utilizes IoT-based sensors, big data processing, and a hybrid prediction model is proposed. Firstly, an IoT-based sensor that collects temperature, humidity, accelerometer, and gyroscope data was developed. The characteristics of IoT-generated sensor data from the manufacturing process are: real-time, large amounts, and unstructured type. The proposed big data processing platform utilizes Apache Kafka as a message queue, Apache Storm as a real-time processing engine and MongoDB to store the sensor data from the manufacturing process. Secondly, for the proposed hybrid prediction model, Density-Based Spatial Clustering of Applications with Noise (DBSCAN)-based outlier detection and Random Forest classification were used to remove outlier sensor data and provide fault detection during the manufacturing process, respectively. The proposed model was evaluated and tested at an automotive manufacturing assembly line in Korea. The results showed that IoT-based sensors and the proposed big data processing system are sufficiently efficient to monitor the manufacturing process. Furthermore, the proposed hybrid prediction model has better fault prediction accuracy than other models given the sensor data as input. The proposed system is expected to support management by improving decision-making and will help prevent unexpected losses caused by faults during the manufacturing process.

[1]  Vili Podgorelec,et al.  Improving mining of medical data by outliers prediction , 2005, 18th IEEE Symposium on Computer-Based Medical Systems (CBMS'05).

[2]  Yuan-Shin Lee,et al.  A flexible data schema and system architecture for the virtualization of manufacturing machines (VMM) , 2017 .

[3]  Aurélien Garivier,et al.  On the Complexity of Best-Arm Identification in Multi-Armed Bandit Models , 2014, J. Mach. Learn. Res..

[4]  María Jesús López Boada,et al.  Real-Time Vehicle Roll Angle Estimation Based on Neural Networks in IoT Low-Cost Devices , 2018, Sensors.

[5]  François Laviolette,et al.  Risk bounds for the majority vote: from a PAC-Bayesian analysis to a learning algorithm , 2015, J. Mach. Learn. Res..

[6]  Eduardo Casilari-Pérez,et al.  On the Capability of Smartphones to Perform as Communication Gateways in Medical Wireless Personal Area Networks , 2014, Sensors.

[7]  Li Zhao,et al.  A Cloud-Based Car Parking Middleware for IoT-Based Smart Cities: Design and Implementation , 2014, Sensors.

[8]  Jay Lohokare,et al.  An IoT ecosystem for the implementation of scalable wireless home automation systems at smart city level , 2017, TENCON 2017 - 2017 IEEE Region 10 Conference.

[9]  Eliane Araújo,et al.  Manufacturing and economic development: the actuality of Kaldor's first and second laws , 2016 .

[10]  Abdullah Kadri,et al.  A Modular IoT Platform for Real-Time Indoor Air Quality Monitoring , 2018, Sensors.

[11]  Antonio Puliafito,et al.  AllJoyn Lambda: An architecture for the management of smart environments in IoT , 2014, 2014 International Conference on Smart Computing Workshops.

[12]  Hyoungjoo Lee,et al.  Machine learning-based novelty detection for faulty wafer detection in semiconductor manufacturing , 2012, Expert Syst. Appl..

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  Davide Anguita,et al.  Marine Safety and Data Analytics: Vessel Crash Stop Maneuvering Performance Prediction , 2017, ICANN.

[15]  Di Wu,et al.  Empirical study of the effects of open source adoption on software development economics , 2007, J. Syst. Softw..

[16]  Edison Pignaton de Freitas,et al.  NoSQL real-time database performance comparison , 2018, Int. J. Parallel Emergent Distributed Syst..

[17]  Kwang-Jae Kim,et al.  A data mining approach considering missing values for the optimization of semiconductor-manufacturing processes , 2012, Expert Syst. Appl..

[18]  Ray Y. Zhong,et al.  A big data approach for logistics trajectory discovery from RFID-enabled production data , 2015 .

[19]  Lubna K. Alazzawi,et al.  Performance Evaluation of the WSN Routing Protocols Scalability , 2008, J. Comput. Networks Commun..

[20]  Aamir Nizam Ansari,et al.  An Internet of things approach for motion detection using Raspberry Pi , 2015, Proceedings of 2015 International Conference on Intelligent Computing and Internet of Things.

[21]  Jon Raffe Willmott,et al.  The Development of a Low-Cost, Near Infrared, High-Temperature Thermal Imaging System and Its Application to the Retrieval of Accurate Lava Lake Temperatures at Masaya Volcano, Nicaragua , 2018, Remote. Sens..

[22]  Haoxiang Wang,et al.  Efficient IoT-based sensor BIG Data collection-processing and analysis in smart buildings , 2017, Future Gener. Comput. Syst..

[23]  Mashrur Chowdhury,et al.  A Distributed Message Delivery Infrastructure for Connected Vehicle Technology Applications , 2018, IEEE Transactions on Intelligent Transportation Systems.

[24]  Mohd Amran Mohd Radzi,et al.  Fault Detection of Broken Rotor Bar in LS-PMSM Using Random Forests , 2017, ArXiv.

[25]  Tiago M. Fernández-Caramés,et al.  A Cost-Effective IoT System for Monitoring Indoor Radon Gas Concentration , 2018, Sensors.

[26]  Brendan J. Frey,et al.  Are Random Forests Truly the Best Classifiers? , 2016, J. Mach. Learn. Res..

[27]  Sang Do Noh,et al.  Implementation of Cyber-Physical Production Systems for Quality Prediction and Operation Control in Metal Casting , 2018, Sensors.

[28]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[29]  Matthias Sax,et al.  Apache Kafka , 2019, Encyclopedia of Big Data Technologies.

[30]  Ray Y. Zhong,et al.  Intelligent Manufacturing in the Context of Industry 4.0: A Review , 2017 .

[31]  Jianjun Hu,et al.  ASPIE: A Framework for Active Sensing and Processing of Complex Events in the Internet of Manufacturing Things , 2018 .

[32]  Mei Han,et al.  An outliers detection method of time series data for soft sensor modeling , 2016, 2016 Chinese Control and Decision Conference (CCDC).

[33]  Junjie Li,et al.  Fault Diagnosis Method for a Mine Hoist in the Internet of Things Environment , 2018, Sensors.

[34]  Su-Young Chi,et al.  An implementation of a high throughput data ingestion system for machine logs in manufacturing industry , 2016, 2016 Eighth International Conference on Ubiquitous and Future Networks (ICUFN).

[35]  A. Lavopa,et al.  Manufacturing as an engine of growth: Which is the best fuel? , 2017 .

[36]  Abdennaceur Kachouri,et al.  Outlier detection for wireless sensor networks using density-based clustering approach , 2017, IET Wirel. Sens. Syst..

[37]  Adam Jacobs,et al.  The pathologies of big data , 2009, Commun. ACM.

[38]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[39]  Ying-Jen Chen,et al.  Manufacturing intelligence for reducing false alarm of defect classification by integrating similarity matching approach in CMOS image sensor manufacturing , 2016, Comput. Ind. Eng..

[40]  Jongtae Rhee,et al.  An Open Source-Based Real-Time Data Processing Architecture Framework for Manufacturing Sustainability , 2017 .

[41]  Kris Ven,et al.  The Organizational Adoption of Open Source Server Software by Belgian Organizations , 2006, OSS.

[42]  Lin Li,et al.  Monitoring Citrus Soil Moisture and Nutrients Using an IoT Based System , 2017, Sensors.

[43]  Robert J. Meijer,et al.  Sensor Data Storage Performance: SQL or NoSQL, Physical or Virtual , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[44]  Sandeep K. Sood,et al.  Wearable IoT sensor based healthcare system for identifying and controlling chikungunya virus , 2017, Computers in Industry.

[45]  Antonio J. Tallón-Ballesteros,et al.  Deleting or keeping outliers for classifier training? , 2014, 2014 Sixth World Congress on Nature and Biologically Inspired Computing (NaBIC 2014).

[46]  Yingfeng Zhang,et al.  A big data driven analytical framework for energy-intensive manufacturing industries , 2018, Journal of Cleaner Production.

[47]  László Monostori,et al.  A Step towards Intelligent Manufacturing: Modelling and Monitoring of Manufacturing Processes through Artificial Neural Networks , 1993 .

[48]  Yong-Han Lee,et al.  MongoDB-Based Repository Design for IoT-Generated RFID/Sensor Big Data , 2016, IEEE Sensors Journal.

[49]  Fortunato Dualibe,et al.  A System for Controlling and Monitoring IoT Applications , 2018 .

[50]  Isaías González Pérez,et al.  Integration of Sensor and Actuator Networks and the SCADA System to Promote the Migration of the Legacy Flexible Manufacturing System towards the Industry 4.0 Concept , 2018, J. Sens. Actuator Networks.

[51]  S. Joe Qin,et al.  Process data analytics in the era of big data , 2014 .

[52]  Italo Meroni,et al.  A Low-Cost Environmental Monitoring System: How to Prevent Systematic Errors in the Design Phase through the Combined Use of Additive Manufacturing and Thermographic Techniques , 2016, Sensors.

[53]  Duc Truong Pham,et al.  Machine-learning techniques and their applications in manufacturing , 2005 .

[54]  Benjamin T. Hazen,et al.  Mitigating Supply Chain Risk via Sustainability Using Big Data Analytics: Evidence from the Manufacturing Supply Chain , 2017 .

[55]  Bart Verspagen,et al.  Manufacturing and economic growth in developing countries, 1950–2005 , 2015 .

[56]  Florin Radulescu,et al.  MongoDB vs Oracle -- Database Comparison , 2012, 2012 Third International Conference on Emerging Intelligent Data and Web Technologies.

[57]  Ankit Jain,et al.  Learning Storm , 2014 .

[58]  Mouzhi Ge,et al.  Big Data for Internet of Things: A Survey , 2018, Future Gener. Comput. Syst..

[59]  Nengcheng Chen,et al.  Design and implementation of the real-time GIS data model and Sensor Web service platform for environmental big data management with the Apache Storm , 2015, 2015 Fourth International Conference on Agro-Geoinformatics (Agro-geoinformatics).

[60]  Chaowei Phil Yang,et al.  Evaluating the Open Source Data Containers for Handling Big Geospatial Raster Data , 2018, ISPRS Int. J. Geo Inf..

[61]  Jongtae Rhee,et al.  Real-Time Monitoring System Using Smartphone-Based Sensors and NoSQL Database for Perishable Supply Chain , 2017 .

[62]  Marimuthu Palaniswami,et al.  Fuzzy c-Means Algorithms for Very Large Data , 2012, IEEE Transactions on Fuzzy Systems.

[63]  Jongtae Rhee,et al.  A Personalized Healthcare Monitoring System for Diabetic Patients by Utilizing BLE-Based Sensors and Real-Time Data Processing , 2018, Sensors.

[64]  V. Sugumaran,et al.  Machine learning approach for automated visual inspection of machine components , 2011, Expert Syst. Appl..

[65]  Hyung Rim Choi,et al.  Development of IoT-Based Sensor Tag for Smart Factory , 2017 .

[66]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[67]  Jie Huang,et al.  Benchmarking modern distributed streaming platforms , 2016, 2016 IEEE International Conference on Industrial Technology (ICIT).

[68]  Joonho Kwon,et al.  DISPAQ: Distributed Profitable-Area Query from Big Taxi Trip Data † , 2017, Sensors.

[69]  Germán Terrazas,et al.  Towards a big data platform for managing machine generated data in the cloud , 2017, 2017 IEEE 15th International Conference on Industrial Informatics (INDIN).

[70]  Yu-Cheng Lin,et al.  A Real-Time Construction Safety Monitoring System for Hazardous Gas Integrating Wireless Sensor Network and Building Information Modeling Technologies , 2018, Sensors.

[71]  Nobuya Haraguchi,et al.  The Importance of Manufacturing in Economic Development: Has This Changed? , 2017 .

[72]  Przemysław Oborski,et al.  Developments in integration of advanced monitoring systems , 2014, The International Journal of Advanced Manufacturing Technology.

[73]  Diego Cabrera,et al.  Fault diagnosis in spur gears based on genetic algorithm and random forest , 2016 .

[74]  Christine Morin,et al.  Experimental Study on the Performance and Resource Utilization of Data Streaming Frameworks , 2018, 2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[75]  Nengcheng Chen,et al.  Efficient Streaming Mass Spatio-Temporal Vehicle Data Access in Urban Sensor Networks Based on Apache Storm , 2017, Sensors.

[76]  Julian Szymanski,et al.  An IoT-Based Computational Framework for Healthcare Monitoring in Mobile Environments , 2017, Sensors.

[77]  Tianyi Ma,et al.  Delivering Real-Time Information Services on Public Transit: A Framework , 2017, IEEE Transactions on Intelligent Transportation Systems.

[78]  Yasser Morgan,et al.  Real-time Support Vector Machine based Network Intrusion Detection system using Apache Storm , 2016, 2016 IEEE 7th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON).

[79]  Giuseppe Ricci,et al.  Realtime Gas Emission Monitoring at Hazardous Sites Using a Distributed Point-Source Sensing Infrastructure , 2016, Sensors.

[80]  Jong-Won Park,et al.  Cloud computing platform based real-time processing for stream reasoning , 2017, 2017 Sixth International Conference on Future Generation Communication Technologies (FGCT).

[81]  Weisi Han,et al.  Wearable Sensors Integrated with Internet of Things for Advancing eHealth Care , 2018, Sensors.

[82]  Teng Wang,et al.  Real-time monitoring of high-power disk laser welding based on support vector machine , 2018, Comput. Ind..

[83]  V. K. Giri,et al.  Feature selection and classification of mechanical fault of an induction motor using random forest classifier , 2016 .

[84]  Gaurav,et al.  Real-time processing of IoT events with historic data using Apache Kafka and Apache Spark with dashing framework , 2017, 2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT).

[85]  Gian Antonio Susto,et al.  Machine Learning for Predictive Maintenance: A Multiple Classifier Approach , 2015, IEEE Transactions on Industrial Informatics.

[86]  Enrique Onieva,et al.  Real-time predictive maintenance for wind turbines using Big Data frameworks , 2017, 2017 IEEE International Conference on Prognostics and Health Management (ICPHM).

[87]  Nengcheng Chen,et al.  A Spatio-Temporal Enhanced Metadata Model for Interdisciplinary Instant Point Observations in Smart Cities , 2017, ISPRS Int. J. Geo Inf..

[88]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .