Process-Monitoring-for-Quality — A Model Selection Criterion

Abstract The new big data driven manufacturing quality philosophy, Process Monitoring for Quality (PMQ), proposes Big Data — Big Models, a new modeling paradigm that includes a big data-driven learning process that requires many models to be created to find the final one. Since many candidates are created, one of the main challenges is to select the model that efficiently solves the tradeoff between complexity and prediction. Most mature manufacturing organizations generate only a few Defects Per Million of Opportunities (DPMO); therefore, manufacturing-derived data sets for classification of quality tend to be highly unbalanced. The Penalized Maximum Probability of Correct Decision (PMPCD) is developed to solve the posed tradeoff. According to simulation and experimental results, the model selection criterion induces parsimony by selecting the model with the minimum number of features needed for an effective/efficient defect detection.

[1]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[2]  Qeethara Al-Shayea Artificial Neural Networks in Medical Diagnosis , 2024, International Journal of Research Publication and Reviews.

[3]  K. Gnana Sheela,et al.  Review on Methods to Fix Number of Hidden Neurons in Neural Networks , 2013 .

[4]  G. R. Karpagam,et al.  A survey of machine learning techniques , 2017 .

[5]  Ruchika Malhotra,et al.  A systematic review of machine learning techniques for software fault prediction , 2015, Appl. Soft Comput..

[6]  David R. Anderson,et al.  Model Selection and Multimodel Inference , 2003 .

[7]  Hiroshi Motoda,et al.  Feature Extraction, Construction and Selection: A Data Mining Perspective , 1998 .

[8]  Luis M. Candanedo,et al.  Accurate occupancy detection of an office room from light, temperature, humidity and CO2 measurements using statistical learning models , 2016 .

[9]  Robert F. Murphy,et al.  A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells , 2001, Bioinform..

[10]  Rubén Morales-Menéndez,et al.  Process-Monitoring-for-Quality - A Model Selection Criterion for Genetic Programming , 2019, EMO.

[11]  L. Breiman The Little Bootstrap and other Methods for Dimensionality Selection in Regression: X-Fixed Prediction Error , 1992 .

[12]  Herbert K. H. Lee,et al.  Model Selection for Neural Network Classification , 2001, J. Classif..

[13]  Richard Demo Souza,et al.  A Survey of Machine Learning Techniques Applied to Self-Organizing Cellular Networks , 2017, IEEE Communications Surveys & Tutorials.

[14]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[15]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[16]  Ruben Morales-Menendez,et al.  Process-Monitoring-for-Quality — Big Models , 2018 .

[17]  Martin Fodslette Møller,et al.  A scaled conjugate gradient algorithm for fast supervised learning , 1993, Neural Networks.

[18]  S. Larson The shrinkage of the coefficient of multiple correlation. , 1931 .

[19]  Honglak Lee,et al.  Efficient L1 Regularized Logistic Regression , 2006, AAAI.

[20]  Ping Zhang On the choice of penalty term in generalized FPE criterion , 1994 .

[21]  Peter Auer,et al.  A learning rule for very simple universal approximators consisting of a single layer of perceptrons , 2008, Neural Networks.

[22]  Elena Deza,et al.  Encyclopedia of Distances , 2014 .

[23]  Tae-Hyung Kim,et al.  Feature selection for manufacturing process monitoring using cross-validation , 2013 .

[24]  Gaurang Panchal,et al.  Searching Most Efficient Neural Network Architecture Using Akaike's Information Criterion (AIC) , 2010 .

[25]  Eric R. Ziegel,et al.  Probability and Statistics for Engineering and the Sciences , 2004, Technometrics.

[26]  Christine W. Chan,et al.  Artificial intelligence for monitoring and supervisory control of process systems , 2007, Eng. Appl. Artif. Intell..

[27]  Ashraf Saad,et al.  Evolving an artificial neural network classifier for condition monitoring of rotating mechanical systems , 2007, Appl. Soft Comput..

[28]  Anirban Mukherjee,et al.  Automatic Defect Detection on Hot-Rolled Flat Steel Products , 2013, IEEE Transactions on Instrumentation and Measurement.

[29]  Leszek Plaskota,et al.  Complexity of Predictive Neural Networks , 2006 .

[30]  Rubén Morales-Menéndez,et al.  Machine Learning and Pattern Recognition Techniques for Information Extraction to Improve Production Control and Design Decisions , 2017, ICDM.

[31]  Huijun Gao,et al.  Data-Based Techniques Focused on Modern Industry: An Overview , 2015, IEEE Transactions on Industrial Electronics.

[32]  Ruben Morales-Menendez,et al.  Process-Monitoring-for-Quality — A Model Selection Criterion for l1 - Regularized Logistic Regression , 2019 .

[33]  Martin Fodslette Meiller A Scaled Conjugate Gradient Algorithm for Fast Supervised Learning , 1993 .

[34]  Ruben Morales-Menendez,et al.  Machine learning techniques for quality control in high conformance manufacturing environment , 2018 .

[35]  Carlos A. Escobar,et al.  Big Data-Driven Manufacturing—Process-Monitoring-for-Quality Philosophy , 2017 .

[36]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[37]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[38]  Okyay Kaynak,et al.  Big Data for Modern Industry: Challenges and Trends [Point of View] , 2015, Proc. IEEE.

[39]  Joshua D. Knowles,et al.  Fifty years of pulsar candidate selection: from simple filters to a new principled real-time classification approach , 2016, Monthly Notices of the Royal Astronomical Society.

[40]  Inci Batmaz,et al.  A review of data mining applications for quality improvement in manufacturing industry , 2011, Expert Syst. Appl..

[41]  Ismail Mohamad,et al.  Standardization and Its Effects on K-Means Clustering Algorithm , 2013 .

[42]  David H. Wolpert,et al.  The Lack of A Priori Distinctions Between Learning Algorithms , 1996, Neural Computation.

[43]  J. Suykens,et al.  Robust complexity criteria for nonlinear regression in NARX models , 2003 .

[44]  Murat Hüsnü Sazli,et al.  A brief review of feed-forward neural networks , 2006 .

[45]  Ruben Morales-Menendez,et al.  Process-Monitoring-for-Quality — A Model Selection Criterion for Support Vector Machine , 2019 .

[46]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[47]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .