Feature engineering in big data analytics for IoT-enabled smart manufacturing - Comparison between deep learning and statistical learning

Abstract As IoT-enabled manufacturing is still in its infancy, there are several key research gaps that need to be addressed. These gaps include the understanding of the characteristics of the big data generated from industrial IoT sensors, the challenges they present to process data analytics, as well as the specific opportunities that the IoT big data could bring to advance manufacturing. In this paper, we use an inhouse-developed IoT-enabled manufacturing testbed to study the characteristics of the big data generated from the testbed. Since the quality of the data usually has the most impact on process modeling, data veracity is often the most challenging characteristic of big data. To address that, we explore the role of feature engineering in developing effective machine learning models for predicting key process variables. We compare complex deep learning approaches to a simple statistical learning approach, with different level or extent of feature engineering, to explore their pros and cons for potential industrial IoT-enabled manufacturing applications.

[1]  Jin Wang,et al.  Statistical process monitoring as a big data analytics tool for smart manufacturing , 2017, Journal of Process Control.

[2]  Devarshi Shah Big Data Analytics and Its Applications in Soft Sensor for Smart Manufacturing , 2019 .

[3]  Yoram Reich,et al.  Strengthening learning algorithms by feature discovery , 2012, Inf. Sci..

[4]  Jin Wang,et al.  A feature-based soft sensor for spectroscopic data analysis , 2019, Journal of Process Control.

[5]  Jesus Flores-Cerrillo,et al.  Consistency-Enhanced Evolution for Variable Selection Can Identify Key Chemical Information from Spectroscopic Data , 2020 .

[6]  Michael Baldea,et al.  A Geometric Framework for Monitoring and Fault Detection for Periodic Processes , 2017 .

[7]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[8]  Jin Wang,et al.  Statistics pattern analysis: A new process monitoring framework and its application to semiconductor batch processes , 2011 .

[9]  Svante Wold,et al.  Batch Process Modeling and MSPC , 2009 .

[10]  Yoshua Bengio,et al.  Algorithms for Hyper-Parameter Optimization , 2011, NIPS.

[11]  Jin Wang,et al.  Comparison of variable selection methods for PLS-based soft sensor modeling , 2015 .

[12]  A. L. Gama,et al.  Measurement of flow rate using straight pipes and pipe bends with integrated piezoelectric sensors , 2018 .

[13]  R. P. Evans,et al.  Flow Rate Measurements Using Flow-Induced Pipe Vibration , 2004 .

[14]  Charu C. Aggarwal,et al.  Randomized Feature Engineering as a Fast and Accurate Alternative to Kernel Methods , 2017, KDD.

[15]  Flavio Sanson Fogliatto,et al.  Variable selection methods in multivariate statistical process control: A systematic literature review , 2018, Comput. Ind. Eng..

[16]  Tahir Mehmood,et al.  A review of variable selection methods in Partial Least Squares Regression , 2012 .

[17]  V. Venkatasubramanian The promise of artificial intelligence in chemical engineering: Is it here, finally? , 2018, AIChE Journal.

[18]  Hanmin Peng,et al.  A flow sensing method of power spectrum based on piezoelectric effect and vortex-induced vibrations , 2019, Measurement.

[19]  Shaul Markovitch,et al.  Feature Generation Using General Constructor Functions , 2002, Machine Learning.

[20]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[21]  Bo Lu,et al.  Profile-driven Features for Offline Quality Prediction in Batch Processes , 2017 .

[22]  N. Lomb Least-squares frequency analysis of unequally spaced data , 1976 .

[23]  Jin Wang,et al.  Multivariate Statistical Process Monitoring Based on Statistics Pattern Analysis , 2010 .

[24]  Ray Y. Zhong,et al.  Internet of things enabled manufacturing: A review , 2018 .

[25]  Deepak S. Turaga,et al.  Feature Engineering for Predictive Modeling using Reinforcement Learning , 2017, AAAI.

[26]  Devarshi Shah,et al.  Feature-based Virtual Metrology for Semiconductor Manufacturing , 2018 .

[27]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Devarshi Shah,et al.  An Internet-of-things Enabled Smart Manufacturing Testbed , 2019 .